目錄

1150210 meeting

前情提要

本次實驗改用 MERRA-2 資料集作為實驗樣本,由上次實驗(1150203 meeting)可知,當 diffusion 參數設為

  • T: 500
  • beta_0: 0.0008
  • beta_T: 0.08

時,可以得到較佳的時間序列預測結果,故本次實驗皆使用此參數進行。

時間序列失真問題

本次實驗所調整的參數為 s4_state_dims4_dropout ,用於測試調整此參數是否對於實驗的預測結果有影響。

以下實驗同樣包含對空間進行標準化(SP)的未知地點填補實驗結果。

命名方式

以下各實驗命名方式遵照 XXX-XXXX-XX 的方式進行命名,其中 XXX 指的是是否有在訓練過程中使用 $autoFRK$ , XXXX 指的是訓練的迭代次數, XX 則是是否有對地點做標準化。如 NoFRK-4000-NoSP 指的是在訓練中未使用 $autoFRK$ ,迭代 4,000 次,且在填補時未對地點做標準化。

s4_state_dim128-s4_dropout0.1

FRK-4000-SP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE35.51545364.99668350.542729123.393501225.940579174.7704930.0262411.091020e-040.373824
RMSPE5.9594848.0620527.10934111.10826315.03132013.2200790.1619901.044519e-020.611412
MSPE%0.1330150.2518140.1873750.4621540.8753530.6481360.0000933.927311e-070.001299
RMSPE%0.3647120.5018110.4328690.6798190.9356030.8050690.0096266.266826e-040.036046
MAPE2.7135163.9910643.8305939.35328913.85859412.2158520.0320696.100190e-030.444239
MAPE%0.0100280.0153470.0140670.0345830.0532940.0450540.0001122.198877e-050.001552

FRK-4000-NoSP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE1.287440e+2866.3546972.552651e+291.193546e+28230.4987542.365173e+291.325359e+280.0657512.628364e+29
RMSPE1.134654e+148.1458395.052377e+141.092495e+1415.1821854.863305e+141.151242e+140.2564195.126757e+14
MSPE%4.453999e+250.2569158.911748e+264.220588e+250.8924988.532070e+264.548261e+250.0002379.065080e+26
RMSPE%6.673828e+120.5068682.985255e+136.496605e+120.9447212.920971e+136.744079e+120.0154103.010827e+13
MAPE7.132146e+124.1735701.420333e+147.233808e+1214.0217101.440218e+147.091090e+120.1964361.412302e+14
MAPE%2.469299e+100.0160134.960848e+112.560003e+100.0539045.197848e+112.432669e+100.0007114.865136e+11

NoFRK-4000-SP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE36.30755766.53096251.517622126.034584231.110863177.8915370.0716410.0660020.482003
RMSPE6.0255758.1566517.17757811.22651315.20233113.3375990.2676590.2569090.694264
MSPE%0.1359250.2576370.1909040.4718720.8950060.6594670.0002530.0002380.001676
RMSPE%0.3686790.5075800.4369250.6869300.9460470.8120760.0159170.0154390.040944
MAPE2.8528834.1755833.9369439.48951814.02798312.3806620.1727030.1967300.526980
MAPE%0.0105250.0160220.0144450.0350810.0539340.0456500.0006080.0007120.001843

NoFRK-4000-NoSP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE1.287440e+2866.3546972.552651e+291.193546e+28230.4987542.365173e+291.325359e+280.0657512.628364e+29
RMSPE1.134654e+148.1458395.052377e+141.092495e+1415.1821854.863305e+141.151242e+140.2564195.126757e+14
MSPE%4.453999e+250.2569158.911748e+264.220588e+250.8924988.532070e+264.548261e+250.0002379.065080e+26
RMSPE%6.673828e+120.5068682.985255e+136.496605e+120.9447212.920971e+136.744079e+120.0154103.010827e+13
MAPE7.132146e+124.1735701.420333e+147.233808e+1214.0217101.440218e+147.091090e+120.1964361.412302e+14
MAPE%2.469299e+100.0160134.960848e+112.560003e+100.0539045.197848e+112.432669e+100.0007114.865136e+11

s4_state_dim512-s4_dropout0.1

FRK-4000-SP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE40.59702874.55376857.979254141.057018259.162674200.6165170.0266481.717701e-040.375744
RMSPE6.3715808.6344527.61441111.87674316.09853014.1639160.1632411.310611e-020.612980
MSPE%0.1518130.2881440.2146020.5274981.0016420.7427620.0000946.186296e-070.001306
RMSPE%0.3896320.5367910.4632510.7262911.0008210.8618370.0097037.865301e-040.036143
MAPE2.9237054.2967244.12385310.08441414.91868413.2320860.0318807.085702e-030.445528
MAPE%0.0107980.0165010.0153440.0372580.0572980.0487530.0001122.558032e-050.001557

FRK-4000-NoSP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE8.477766e+2766.7053981.673428e+298.332321e+27231.6879111.644648e+298.536503e+270.0778441.685051e+29
RMSPE9.207478e+138.1673374.090756e+149.128155e+1315.2212984.055425e+149.239320e+130.2790054.104938e+14
MSPE%2.944733e+250.2583115.876121e+262.960311e+250.8972425.970990e+262.938442e+250.0002825.837809e+26
RMSPE%5.426539e+120.5082442.424071e+135.440874e+120.9472292.443561e+135.420740e+120.0167842.416156e+13
MAPE6.644246e+124.1952141.321311e+146.673485e+1214.0208921.327326e+146.632438e+120.2271521.318882e+14
MAPE%2.304933e+100.0160944.629502e+112.366691e+100.0539084.805183e+112.279992e+100.0008224.558554e+11

NoFRK-4000-SP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE36.50471066.81197051.807780126.698688232.056481178.8337280.0802190.0786100.508840
RMSPE6.0419138.1738597.19776211.25605115.23340013.3728730.2832290.2803750.713330
MSPE%0.1366560.2586980.1919430.4743380.8985780.6628390.0002840.0002840.001773
RMSPE%0.3696700.5086230.4381130.6887220.9479330.8141500.0168590.0168650.042107
MAPE2.8695414.1997813.9525329.49841314.03264412.3929170.1924970.2288170.543915
MAPE%0.0105850.0161100.0145010.0351140.0539500.0456930.0006790.0008280.001904

NoFRK-4000-NoSP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE8.477766e+2766.7053981.673428e+298.332321e+27231.6879111.644648e+298.536503e+270.0778441.685051e+29
RMSPE9.207478e+138.1673374.090756e+149.128155e+1315.2212984.055425e+149.239320e+130.2790054.104938e+14
MSPE%2.944733e+250.2583115.876121e+262.960311e+250.8972425.970990e+262.938442e+250.0002825.837809e+26
RMSPE%5.426539e+120.5082442.424071e+135.440874e+120.9472292.443561e+135.420740e+120.0167842.416156e+13
MAPE6.644246e+124.1952141.321311e+146.673485e+1214.0208921.327326e+146.632438e+120.2271521.318882e+14
MAPE%2.304933e+100.0160944.629502e+112.366691e+100.0539084.805183e+112.279992e+100.0008224.558554e+11

s4_state_dim128-s4_dropout0.5

FRK-4000-SP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE34.02728963.75788748.076690118.220545221.634477166.1971800.0261663.396507e-050.374184
RMSPE5.8332917.9848546.93373610.87292714.88739312.8917490.1617605.827956e-030.611706
MSPE%0.1273900.2467350.1781610.4426030.8576980.6161020.0000921.212103e-070.001301
RMSPE%0.3569170.4967240.4220920.6652840.9261200.7849220.0096133.481526e-040.036063
MAPE2.6580603.9540343.7531379.16580313.73643911.9428560.0299333.446824e-030.445751
MAPE%0.0098220.0151950.0137770.0338830.0527920.0440330.0001051.237200e-050.001558

FRK-4000-NoSP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE3.381975e+2766.5259076.658326e+283.156037e+27231.1185706.211643e+283.473219e+270.0557936.838718e+28
RMSPE5.815475e+138.1563422.580373e+145.617861e+1315.2025842.492317e+145.893402e+130.2362052.615094e+14
MSPE%1.178127e+250.2576622.345083e+261.124813e+250.8951882.260235e+261.199657e+250.0002002.379348e+26
RMSPE%3.432385e+120.5076041.531366e+133.353823e+120.9461441.503408e+133.463607e+120.0141411.542514e+13
MAPE3.057809e+124.1584026.071690e+132.958253e+1213.9904485.873241e+133.098014e+120.1877686.151833e+13
MAPE%1.064673e+100.0159582.135997e+111.053138e+100.0537982.134399e+111.069331e+100.0006762.136642e+11

NoFRK-4000-SP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE36.40177166.96506351.791219126.379117232.646364178.8693890.0647650.0553070.471189
RMSPE6.0333888.1832187.19661211.24184715.25274913.3742060.2544900.2351740.686432
MSPE%0.1362720.2593330.1918750.4731430.9010010.6629420.0002280.0001980.001637
RMSPE%0.3691510.5092480.4380360.6878540.9492110.8142130.0151120.0140820.040460
MAPE2.8458114.1777303.9359739.48468414.06176412.3940910.1647270.1861010.520194
MAPE%0.0104990.0160310.0144410.0350630.0540670.0456970.0005790.0006700.001818

NoFRK-4000-NoSP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE3.381975e+2766.5259076.658326e+283.156037e+27231.1185706.211643e+283.473219e+270.0557936.838718e+28
RMSPE5.815475e+138.1563422.580373e+145.617861e+1315.2025842.492317e+145.893402e+130.2362052.615094e+14
MSPE%1.178127e+250.2576622.345083e+261.124813e+250.8951882.260235e+261.199657e+250.0002002.379348e+26
RMSPE%3.432385e+120.5076041.531366e+133.353823e+120.9461441.503408e+133.463607e+120.0141411.542514e+13
MAPE3.057809e+124.1584026.071690e+132.958253e+1213.9904485.873241e+133.098014e+120.1877686.151833e+13
MAPE%1.064673e+100.0159582.135997e+111.053138e+100.0537982.134399e+111.069331e+100.0006762.136642e+11

s4_state_dim512-s4_dropout0.5

FRK-4000-SP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE36.07598665.77270351.514553125.226083228.477354177.8375810.0730620.0650560.499484
RMSPE6.0063298.1100377.17736411.19044615.11546713.3355760.2702990.2550610.706741
MSPE%0.1350840.2548910.1909240.4689380.8854660.6593880.0002580.0002350.001737
RMSPE%0.3675380.5048670.4369490.6847900.9409920.8120270.0160690.0153370.041674
MAPE2.8503244.1530953.9427219.46698913.93887512.3795590.1782090.2011460.535536
MAPE%0.0105160.0159410.0144670.0350010.0536130.0456530.0006270.0007270.001872

FRK-4000-NoSP

ALL Locs & All TimeKnown Locs & All TimeUnknown Locs & All TimeALL Locs & FutureKnown Locs & FutureUnknown Locs & FutureALL Locs & PastKnown Locs & PastUnknown Locs & Past
MSPE1.985254e+2765.9524473.920845e+282.366638e+27229.0960854.672664e+281.831234e+270.0675153.617226e+28
RMSPE4.455619e+138.1211111.980112e+144.864810e+1315.1359202.161635e+144.279292e+130.2598371.901901e+14
MSPE%6.893532e+240.2555691.377161e+268.388031e+240.8878031.696977e+266.289984e+240.0002441.248005e+26
RMSPE%2.625554e+120.5055391.173525e+132.896210e+120.9422331.302681e+132.507984e+120.0156231.117141e+13
MAPE2.244495e+124.1612214.460411e+132.584819e+1213.9589985.136579e+132.107056e+120.2044264.187342e+13
MAPE%7.808101e+090.0159711.568851e+119.177635e+090.0536881.864585e+117.255019e+090.0007391.449420e+11

NoFRK-4000-SP

尚未進行實驗。

NoFRK-4000-NoSP

尚未進行實驗。

結論

本次實驗結果顯示,當 s4_state_dim 是 128 與 s4_dropout 是 0.5 時,可取得全實驗組別中的最佳預測結果。實驗結果顯示較小的狀態維度(128)配合較高的 Dropout 率(0.5) ,能最有效地捕捉時空特徵並抑制過擬合,顯著提升模型對未知時空的泛化能力。

本次實驗也同樣發現,在進行最終的未知地點的填補時,經由空間標準化(SP)可以有效降低填補誤差。

FRK-SP

以表現最穩定的 FRK-SP(有 AutoFRK 且有空間標準化)組別為例,針對「未知地點的未來預測(Unknown Locs & Future)」指標進行對比:

  • s4_state_dim = 128 (Dropout 0.1)

    MSPE 為 174.77 , RMSPE 為 13.22

  • s4_state_dim = 128 (Dropout 0.5)

    MSPE 為 166.19,RMSPE 為 12.89

  • s4_state_dim = 512 (Dropout 0.1)

    MSPE 為 200.62 , RMSPE 為 14.16

  • s4_state_dim = 512 (Dropout 0.5)

    MSPE 為 177.84 , RMSPE 為 13.34

由以上結果可知,在此資料集與任務規模下,128 維的狀態空間已足以捕捉時空特徵。提升至 512 維可能導致模型參數過多,在訓練數據有限的情況下產生了過擬合(Overfitting)現象,反而降低了對未知地點的推論能力。

SP

本次實驗發現,空間標準化對於資料填補是一件很重要的事情,可以防範預測數值面臨災難性的爆炸。

  • 有標準化 (SP)

    所有 SP 組別的 MSPE 均保持在正常範圍(35 ~ 200 之間)。

  • 無標準化 (NoSP)

    幾乎所有 NoSP 組別(無論是 128 或 512 維,無論是否有 FRK),其 Unknown Locs 的 MSPE 均暴增至 $10^{27}$ ~ $10^{29}$ 的量級。

AutoFRK

在填補時使用空間標準化(SP)的前提下,可發現當 s4_state_dim 是 128 時,在訓練過程中接入 FRK 可降低未知地點的預測誤差;而當 s4_state_dim 是 512 時,未在訓練過程中使用 FRK 反而預測效果更佳。

s4_dropout 的影響

對比維度 128 的兩組實驗發現,提高 Dropout 率對於處理 MERRA-2 資料集有顯著幫助:

  • s4_dropout = 0.5 (Best)

    在「Unknown Locs & Future」指標取得 MSPE 166.19

  • s4_dropout = 0.1

    同指標下 MSPE 為 174.77

實驗結果顯示,較高的 Dropout 能顯著降低預測誤差。

參考資料

  • Zhu X, Xiong Y, Wu M, et al. Weather2K: A Multivariate Spatio-Temporal Benchmark Dataset for Meteorological Forecasting Based on Real-Time Observation Data from Ground Weather Stations[C]//International Conference on Artificial Intelligence and Statistics. PMLR, 2023: 2704-2722.
  • Juan Lopez Alcaraz 、 Nils Strodthoff(2022)。Diffusion-based time series imputation and forecasting with structured state space models。Transactions on Machine Learning Research。參考自 https://openreview.net/forum?id=hHiIbk7ApW
  • SSSD(2022)。GitHub。參考自 https://github.com/AI4HealthUOL/SSSD