# Request to Include TTM-R1 and TTM-R2 Models in Comparison #244
Replies: 2 comments 2 replies
-
Hey, thank you for the suggestion. We don't update the benchmark in this repo often but you might want to check the FEV leaderboard that we just released (cc @shchur). Currently, that includes Benchmark II from the paper and we might consider adding other models such as TTM into the mix. That said, a direct comparison with TTM may be a little bit unfair (for TTM) since it seems to have been mainly developed for relatively high frequencies (hourly and higher). You might want to check out the GIFT-Eval leaderboard which includes TTM and several other models including Chronos, Chronos-Bolt, Moirai and TimesFM. |
Beta Was this translation helpful? Give feedback.
-
Thanks for the response and for pointing me to the GIFT-Eval leaderboard! I spent some time looking through it, and it’s clear that Chronos-Bolt is an absolute beast, especially in minutely and hourly CRPS/MASE metrics. I can see why it’s such a strong choice for high-frequency time series tasks. That got me thinking - how would TTM (especially TTM-R2) stack up in this setup? Since TTM is specifically designed for high-frequency data, I feel like it could offer some interesting trade-offs compared to Chronos-Bolt, especially given how lightweight it is. GIFT-Eval seems like a perfect framework to explore this, so I wanted to ask: is there any chance you might include TTM in the leaderboard at some point? I think it would add a lot of value to see how it performs in these benchmarks. If it’s not something you’re planning to add, would you have any recommendations for how I could test it myself? Specifically: What dataset(s) would you recommend for a fair comparison? Should I stick with something from the GIFT-Eval benchmarks, or is there a better option for this kind of evaluation? Looking forward to hearing your thoughts! |
Beta Was this translation helpful? Give feedback.
-
Request to Include TTM-R1 and TTM-R2 Models in Comparison
Context
Сurrently evaluate various time series forecasting models as shown in the comparison chart. These models include local, task-specific, and pretrained options. However, the evaluation does not yet include the TinyTimeMixers (TTM-R1 and TTM-R2) models, which are lightweight and pretrained models developed by IBM Research for multivariate time series forecasting. Including these models in the comparison would provide a more comprehensive analysis and benchmark their performance against the current state-of-the-art.
Why Include TTM-R1 and TTM-R2?
Proposed Actions
Resources
Expected Outcome
Inclusion of TTM models in the comparison framework will:
Beta Was this translation helpful? Give feedback.
All reactions