🏔️
Gradient ascending
An RLer, NLPer, functional programmer. CS PhD at Stanford.
- Stanford, CA
- anie.me
Pinned Loading
-
microsoft/Trace
microsoft/Trace PublicEnd-to-end Generative Optimization for AI Agents
-
play-to-grade
play-to-grade PublicApplying RL to grade coding games. NeurIPS 2021.
-
microsoft/LLF-Bench
microsoft/LLF-Bench PublicA benchmark for evaluating learning agents based on just language feedback
-
cicl-stanford/moca
cicl-stanford/moca PublicLanguage model evaluation for morality and causality
Python 16
-
DisExtract
DisExtract PublicThe library that uses dependency parsing to preprocess text to train DisSent model
-
Pragmatic-ISIC
Pragmatic-ISIC PublicBayesian Rational Speech Act Model with Context for Issue Sensitive Image Captioning
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.