You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for your issue! It is just an initial version now, and we're improving our project on at least three aspects:
A stronger base model like Qwen2-VL
A stronger inference time scaling method like MCTS.
You mentioned "a reflective error correction mechanism". Yeah, we are going to support this feature after we find a strong inference time scaling method! We will use a new pair of tag called <REFLECTION></REFLECTION> and we are making progress on this. Stay tuned!
What are the differences? The O1 route includes post-training and a reflective error correction mechanism.
The text was updated successfully, but these errors were encountered: