@gxlvera (Contributor) commented Jan 11, 2026

This PR is related to PR #1141. Three changes are made:

  1. Update the experiment results that use Megatron as the backend in README.md
  2. Change obs_log_probs to 0.0 instead of '-inf' to prevent NaN values in logging
  3. Change dummy_messages to a static variable
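
For context on change 2, here is a minimal sketch of why a '-inf' placeholder can surface as NaN in logged metrics. It assumes the observation-token log-probs are masked out by multiplying with a 0/1 mask before averaging; that is an assumption about the pipeline, not something this PR states. The names `masked_mean`, `model_log_probs`, and `loss_mask` are hypothetical and are not taken from the repository.

```python
import math


def masked_mean(values, mask):
    """Average the entries of `values` where mask == 1, using multiply-by-mask."""
    total = sum(v * m for v, m in zip(values, mask))
    return total / sum(mask)


# Hypothetical per-token log-probs: two model tokens, then two observation tokens.
model_log_probs = [-0.5, -1.5]
loss_mask = [1, 1, 0, 0]

# Placeholder of -inf for observation tokens: -inf * 0 is NaN under IEEE 754,
# so the masked-out positions still poison the sum.
with_neg_inf = masked_mean(model_log_probs + [float("-inf")] * 2, loss_mask)

# Placeholder of 0.0: masked-out positions contribute exactly zero.
with_zero = masked_mean(model_log_probs + [0.0] * 2, loss_mask)

print(math.isnan(with_neg_inf), with_zero)
```

With a 0.0 placeholder the masked positions drop out of the average cleanly, which matches the motivation given in change 2.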

@zhaochenyang20 (Collaborator) left a comment

NB

@zhaochenyang20 zhaochenyang20 merged commit f68dc4a into THUDM:main Jan 11, 2026
@zhaochenyang20 (Collaborator) commented

/gemini review

Beichen-Ma pushed a commit to Beichen-Ma/slime that referenced this pull request Jan 12, 2026