cklxx commented on Dec 22, 2025

Summary

  • refactor OPSM mask computation to be context-parallel aware, so the mask is derived once instead of duplicating the logic inside the loss (see the first sketch below)
  • restore gradient connectivity when policy log-prob tensors are empty, to avoid distributed hangs (second sketch below)
  • validate rollout buffer group sizes to prevent silently misaligned sample counts (third sketch below)

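A note on what the mask refactor looks like in practice. This is a minimal sketch, not the PR's actual code: it assumes even, contiguous context-parallel sharding and uses hypothetical names (`cp_local_slice`, `build_opsm_mask`); slime's real sharding may differ (e.g., interleaved chunks for causal load balancing).

```python
import torch

def cp_local_slice(seq_len: int, cp_rank: int, cp_size: int) -> slice:
    # Hypothetical even sharding: each context-parallel rank owns one
    # contiguous chunk of the full sequence.
    chunk = seq_len // cp_size
    return slice(cp_rank * chunk, (cp_rank + 1) * chunk)

def build_opsm_mask(full_mask: torch.Tensor, cp_rank: int, cp_size: int) -> torch.Tensor:
    # Derive the OPSM mask once, in full-sequence coordinates, then hand
    # each rank only its shard. The loss consumes the shard as-is, so the
    # masking logic is not re-implemented inside the loss.
    local = cp_local_slice(full_mask.shape[-1], cp_rank, cp_size)
    return full_mask[..., local]
```
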
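The empty-tensor fix guards against a classic data-parallel failure mode: a rank with no samples produces a loss with no autograd graph, never participates in the gradient all-reduce, and the other ranks block forever. A hedged sketch of the pattern, with a hypothetical `policy_loss` signature:

```python
import torch

def policy_loss(log_probs: torch.Tensor, advantages: torch.Tensor) -> torch.Tensor:
    # If this rank received no samples, log_probs is empty and a plain
    # mean() would yield NaN with no useful graph. Summing the empty
    # tensor and scaling by zero keeps the autograd graph connected
    # (assuming log_probs still carries a grad_fn from the forward pass),
    # so backward() fires the distributed gradient hooks with zeros
    # instead of hanging the collective.
    if log_probs.numel() == 0:
        return log_probs.sum() * 0.0
    return -(log_probs * advantages).mean()
```
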
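Group-size validation is cheap and catches misalignment where the buffer is filled, rather than letting per-group advantage statistics silently mix samples from different prompts. A sketch with hypothetical names (`validate_group_sizes`; the buffer's sample type is left abstract):

```python
from typing import Sequence

def validate_group_sizes(samples: Sequence, group_size: int) -> None:
    # Every prompt should contribute exactly `group_size` rollouts.
    # A total that is not a multiple of group_size means samples were
    # dropped or duplicated upstream, which would misalign group-wise
    # advantage normalization without any visible error.
    if group_size <= 0:
        raise ValueError(f"group size must be positive, got {group_size}")
    if len(samples) % group_size != 0:
        raise ValueError(
            f"rollout buffer holds {len(samples)} samples, "
            f"which is not a multiple of group size {group_size}"
        )
```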
Testing

  • python -m ruff check slime slime_plugins tests
  • python -m pytest (fails: optional dependencies such as flash_attn are missing, as are some model-specific test modules)

Codex Task

cklxx merged commit d398b45 into codex/optimize-training-time-for-context-parallelism on Dec 22, 2025