Skip to content

Why does Chronos-Bolt achieve significantly better results and performance compared to Chronos-T5 #231

Answered by lostella
shenxinyuan-hash asked this question in Q&A
Discussion options

You must be logged in to vote

Regarding performance: while both model families rely on the T5 architecture under the hood, chronos-bolt models embed observations in the context in non-overlapping windows of multiple observations. This is the usual "patch-based" embedding that is used by other models, most notably PatchTST. This happens here in the code, and uses a patch length of 16 in the models we released: this effectively "compresses" the context length by a factor 16 in the embedding space, enabling much of the speedup. On the decoding side, instead of autoregressive generation the chronos-bolt models perform direct multi-step prediction for 9 quantiles, which is also faster.

Regarding accuracy, it is hard to say…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@shenxinyuan-hash
Comment options

Answer selected by abdulfatir
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants