Change the repository type filter
All
Repositories list
129 repositories
AbsPyramid
PublicActPlan-1K
PublicNegotiationToM
PublicDataset and Source Code for EMNLP 2024 paper “NegotiationToM: A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding” (https://arxiv.org/abs/2404.13627)EvidenceConflict
PublicGoldCoin
PublicAdv-WSC
PublicConstrained-Chain-of-ToM
PublicPersonaPrompt
PublicCodeGraph
PublicPrivLM-Bench
PublicAbductiveKGR
PublicSessionCQA
PublicNeuralSubgraphCounting
PublicCEQA
PublicPrivateNGDB
PublicMARS
PublicCode and dataset for the paper: MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset (https://arxiv.org/pdf/2406.02106).GEIA
PublicAbsInstruct
Public- Code and dataset for the ACL2024 paper: CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning (https://arxiv.org/pdf/2401.07286).
MIND_Distillation
PublicCode and data for the paper: MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding (https://arxiv.org/pdf/2406.10701).IntentionQA
PublicCode and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models in E-commerce (https://arxiv.org/pdf/2406.10173)EventGround
Public- ASER (Activities, States, Events, and their Relations): a large-scale weighted eventuality knowledge graph.