Stars
[NeurIPS 2024] Can Language Models Learn to Skip Steps?
AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing
RAGChecker: A Fine-grained Framework For Diagnosing RAG
Automatically split your PyTorch models on multiple GPUs for training & inference
[EMNLP 2024 Findings] Code for the paper ''SH2: Self-Highlighted Hesitation Helps You Decode More Truthfully''
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
This repo contains the dataset and code in the EMNLP'23 paper: StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding.
The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>
Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.