-
Notifications
You must be signed in to change notification settings - Fork 404
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[enhancement] Enhance positional encoding and introduce dynamic multi-scale patching for improved forecasting #233
Comments
Could u like to suggest some papers and code about these ideas? Such as dynamic/multi-scale patching? |
|
Thank you very much for your quick, detailed reply. These papers look too complicated for me. I need to spend some time reading. Besides, I am trying to modify the attention mechanism(scale dot-product attention) to causal attention(masked self-attention) in Patchtst. However, I find the results worse than Patchtst in ETTh1.csv. Its validation loss decreases by 0.0011 during the training, but its test loss increases from 0.4148351252 to 0.4190029799938202. Could you give some suggestions about this? |
I propose the following concise enhancements:
Enhanced Positional Encoding:
Replace the standard sinusoidal positional encodings with relative or learnable encodings—such as Fourier features or RoPE—to better capture the relationships between tokens. This adjustment will allow the model to more effectively learn cyclic and seasonal patterns, independent of absolute positions.
Dynamic/Multi-Scale Patching:
Instead of using a fixed patch size, implement dynamic patching that adapts to local variability, or design a multi-scale framework that processes both small (fine-grained) and large (coarse-grained) patches in parallel. Merging these representations (via an attention-based fusion layer) will enable the model to capture both detailed local fluctuations and broad long-term trends.
These improvements are aimed at enhancing the model’s ability to model complex temporal dependencies,
The text was updated successfully, but these errors were encountered: