-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat(trainer): Enhance SIFT trainer initialization and metrics handling
- Add proper device/dtype handling for Mac M3 compatibility - Initialize optimizer (AdamW) and scheduler (OneCycleLR) - Add robust checkpoint saving with optimizer state - Implement EMA loss tracking and visualization - Add comprehensive metrics computation and tracking - Fix tensor dtype handling for embedding indices - Add adaptive stopping criterion with uncertainty tracking Key changes: - SIFTTrainer initialization (lines 65-158) - Metrics computation and visualization (lines 583-620) - Tensor dtype handling (lines 646-655) - Checkpoint management (lines 552-573) This commit improves training stability and adds better monitoring capabilities while ensuring compatibility across different hardware configurations.
- Loading branch information
1 parent
6389f0d
commit 3411341
Showing
9 changed files
with
573 additions
and
277 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -20,6 +20,7 @@ data/ | |
Llama-3.2-1B-Instruct-Complaint/ | ||
dataset/ | ||
cache/ | ||
checkpoints/ | ||
|
||
|
||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.