GitHub - 27yw/cv-arxiv-daily: 🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

Updated on 2025.02.26

Usage instructions: here

Table of Contents

Agent
Large Language Model Agent
Tool learning
Embodied AI

Agent

Publish Date	Title	Authors	PDF	Code
2025-02-24	Event-Based Limit Order Book Simulation under a Neural Hawkes Process: Application in Market-Making	Luca Lalor et.al.	2502.17417	null
2025-02-24	Distributed Coordination for Heterogeneous Non-Terrestrial Networks	Jikang Deng et.al.	2502.17366	null
2025-02-24	Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents	Prafulla Kumar Choubey et.al.	2502.17321	null
2025-02-24	Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach	Jichen Li et.al.	2502.17307	null
2025-02-24	IGDA: Interactive Graph Discovery through Large Language Model Agents	Alex Havrilla et.al.	2502.17189	null
2025-02-24	Teleology-Driven Affective Computing: A Causal Framework for Sustained Well-Being	Bin Yin et.al.	2502.17172	null
2025-02-24	A Novel Multiple Access Scheme for Heterogeneous Wireless Communications using Symmetry-aware Continual Deep Reinforcement Learning	Hamidreza Mazandarani et.al.	2502.17167	null
2025-02-24	Semantic-Aware Dynamic and Distributed Power Allocation: a Multi-UAV Area Coverage Use Case	Hamidreza Mazandarani et.al.	2502.17120	null
2025-02-24	Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration	Junyang Wang et.al.	2502.17110	null
2025-02-24	Generative Models in Decision Making: A Survey	Yinchuan Li et.al.	2502.17100	null
2025-02-24	MA2RL: Masked Autoencoders for Generalizable Multi-Agent Reinforcement Learning	Jinyuan Feng et.al.	2502.17046	null
2025-02-24	A data-driven econo-financial stress-testing framework to estimate the effect of supply chain networks on financial systemic risk	Jan Fialkowski et.al.	2502.17044	null
2025-02-24	Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs	Enea Monzio Compagnoni et.al.	2502.17009	null
2025-02-24	Deep-reinforcement-learning-based separation control in a two-dimensional airfoil	Xavier Garcia et.al.	2502.16993	null
2025-02-24	Engineering and Validating Cyber-Physical Energy Systems: Needs, Status Quo, and Research Trends	Thomas I. Strasser et.al.	2502.16991	null
2025-02-24	A Multi-LLM-Agent-Based Framework for Economic and Public Policy Analysis	Yuzhi Hao et.al.	2502.16879	null
2025-02-24	Graphy'our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data	Longbin Lai et.al.	2502.16868	null
2025-02-24	Toward Agentic AI: Generative Information Retrieval Inspired Intelligent Communications and Networking	Ruichen Zhang et.al.	2502.16866	null
2025-02-24	Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment	Kartik Nagpal et.al.	2502.16863	null
2025-02-24	Grounded Persuasive Language Generation for Automated Marketing	Jibang Wu et.al.	2502.16810	null
2025-02-21	AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind	Zhining Zhang et.al.	2502.15676	null
2025-02-21	Multi-Agent Architecture in Distributed Environment Control Systems: vision, challenges, and opportunities	Natasha Astudillo et.al.	2502.15663	null
2025-02-21	Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network	Vincent Hsiao et.al.	2502.15662	null
2025-02-21	Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?	Yoshua Bengio et.al.	2502.15657	null
2025-02-21	A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications	Jefferson Silveira et.al.	2502.15649	null
2025-02-21	WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents	Xinhang Liu et.al.	2502.15601	null
2025-02-21	SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instrucion Following Evaluation for Social Agents	Wenyuan Zhang et.al.	2502.15538	null
2025-02-21	Contract DesignUnderApproximate Best Responses	Francesco Bacchiocchi et.al.	2502.15523	null
2025-02-21	SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning	Xuyang Li et.al.	2502.15512	null
2025-02-21	Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing	Masaya Kobayashi et.al.	2502.15506	null
2025-02-21	Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations	Lihu Chen et.al.	2502.15429	null
2025-02-21	TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning	Giuseppe Paolo et.al.	2502.15425	null
2025-02-21	Textual-to-Visual Iterative Self-Verification for Slide Generation	Yunqing Xu et.al.	2502.15412	null
2025-02-21	LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models	Hongchen Wei et.al.	2502.15393	null
2025-02-21	Multi-Group Dynamics with Tolerant Switching in the Kolkata Paise Restaurant Problem with Dining Clubs	Akshat Harlalka et.al.	2502.15377	null
2025-02-21	ARS: Automatic Routing Solver with Large Language Models	Kai Li et.al.	2502.15359	null
2025-02-21	Learning with Limited Shared Information in Multi-agent Multi-armed Bandit	Junning Shao et.al.	2502.15338	null
2025-02-21	DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation	Luzhou Ge et.al.	2502.15309	link
2025-02-21	Leader-Follower Formation Tracking Control of Quadrotor UAVs Using Bearing Measurements	S. Doodeman et.al.	2502.15303	null
2025-02-21	Collective behaviors of self-propelled particles with tunable alignment angles	Zichen Qin et.al.	2502.15301	null
2025-02-20	GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks	Jianwen Luo et.al.	2502.14848	null
2025-02-20	Red-Teaming LLM Multi-Agent Systems via Communication Attacks	Pengfei He et.al.	2502.14847	null
2025-02-20	Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation	Yue Yang et.al.	2502.14846	null
2025-02-20	Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models	Vlad Sobal et.al.	2502.14819	null
2025-02-20	Optimizing Model Selection for Compound AI Systems	Lingjiao Chen et.al.	2502.14815	link
2025-02-20	Byzantine Game Theory: Sun Tzus Boxes	Andrei Constantinescu et.al.	2502.14812	null
2025-02-20	Planning, scheduling, and execution on the Moon: the CADRE technology demonstration mission	Gregg Rabideau et.al.	2502.14803	null
2025-02-20	A Multi-Agent Perspective on Modern Information Retrieval	Haya Nachimovsky et.al.	2502.14796	null
2025-02-20	Making Universal Policies Universal	Niklas Höpner et.al.	2502.14777	null
2025-02-20	Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis	Priyanka Kargupta et.al.	2502.14767	link
2025-02-20	Multi-Agent Coordination across Diverse Applications: A Survey	Lijun Sun et.al.	2502.14743	null
2025-02-20	Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse	Michael Doherty et.al.	2502.14741	null
2025-02-20	FLIGHT: Facility Location Integrating Generalized, Holistic Theory of Welfare	Avyukta Manjunatha Vummintala et.al.	2502.14732	null
2025-02-20	Ranking Joint Policies in Dynamic Games using Evolutionary Dynamics	Natalia Koliou et.al.	2502.14724	link
2025-02-20	Building reliable sim driving agents by scaling self-play	Daphne Cornelisse et.al.	2502.14706	null
2025-02-20	I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search	Zujie Liang et.al.	2502.14693	null
2025-02-20	BP-SGCN: Behavioral Pseudo-Label Informed Sparse Graph Convolution Network for Pedestrian and Heterogeneous Trajectory Prediction	Ruochen Li et.al.	2502.14676	link
2025-02-20	InstructAgent: Building User Controllable Recommender via LLM Agent	Wujiang Xu et.al.	2502.14662	link
2025-02-20	Online Envy Minimization and Multicolor Discrepancy: Equivalences and Separations	Daniel Halpern et.al.	2502.14624	null
2025-02-20	Curiosity Driven Multi-agent Reinforcement Learning for 3D Game Testing	Raihana Ferdous et.al.	2502.14606	link
2025-02-19	Autellix: An Efficient Serving Engine for LLM Agents as General Programs	Michael Luo et.al.	2502.13965	null
2025-02-19	LIDDIA: Language-based Intelligent Drug Discovery Agent	Reza Averly et.al.	2502.13959	null
2025-02-19	RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision	Guangzhi Xiong et.al.	2502.13957	null
2025-02-19	Qwen2.5-VL Technical Report	Shuai Bai et.al.	2502.13923	null
2025-02-19	Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health	Xingbo Wang et.al.	2502.13920	null
2025-02-19	DataSciBench: An LLM Agent Benchmark for Data Science	Dan Zhang et.al.	2502.13897	link
2025-02-19	NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants	Yiran Qin et.al.	2502.13894	null
2025-02-19	Enhancing Cross-Domain Recommendations with Memory-Optimized LLM-Based User Agents	Jiahao Liu et.al.	2502.13843	null
2025-02-19	ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities	Chanjin Zheng et.al.	2502.13832	link
2025-02-19	Learning to explore when mistakes are not allowed	Charly Pecqueux-Guézénec et.al.	2502.13801	null
2025-02-19	From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education	Yi-Fan Zhang et.al.	2502.13789	null
2025-02-19	Poster: SpiderSim: Multi-Agent Driven Theoretical Cybersecurity Simulation for Industrial Digitalization	Jiaqi Li et.al.	2502.13778	link
2025-02-19	Quantile agent utility and implications to randomized social choice	Ioannis Caragiannis et.al.	2502.13772	null
2025-02-19	AI Software Engineer: Programming with Trust	Abhik Roychoudhury et.al.	2502.13767	null
2025-02-19	GPA: Grover Policy Agent for Generating Optimal Quantum Sensor Circuits	Ahmad Alomari et.al.	2502.13755	null
2025-02-19	Kinetic modelling of economic markets with individual and collective transactions	Chuandong Lin et.al.	2502.13735	null
2025-02-19	Hierarchical RL-MPC for Demand Response Scheduling	Maximilian Bloor et.al.	2502.13714	null
2025-02-19	Parameterized Complexity of Hedonic Games with Enemy-Oriented Preferences	Martin Durand et.al.	2502.13703	null
2025-02-19	Causes and Strategies in Multiagent Systems	Sylvia S. Kerkhove et.al.	2502.13701	null
2025-02-19	An LLM-based Agent for Reliable Docker Environment Configuration	Ruida Hu et.al.	2502.13681	null
2025-02-18	AIDE: AI-Driven Exploration in the Space of Code	Zhengyao Jiang et.al.	2502.13138	link
2025-02-18	Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions	Taedong Yun et.al.	2502.13135	null
2025-02-18	Magma: A Foundation Model for Multimodal AI Agents	Jianwei Yang et.al.	2502.13130	link
2025-02-18	Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning	Jingyang Lin et.al.	2502.13127	null
2025-02-18	Approximately Efficient Bilateral Trade with Samples	Yuan Deng et.al.	2502.13122	null
2025-02-18	Text2World: Benchmarking Large Language Models for Symbolic World Model Generation	Mengkang Hu et.al.	2502.13092	null
2025-02-18	Interactive Agents to Overcome Ambiguity in Software Engineering	Sanidhya Vijayvargiya et.al.	2502.13069	link
2025-02-18	Improved Fine-Tuning of Large Multimodal Models for Hateful Meme Detection	Jingbiao Mei et.al.	2502.13061	null
2025-02-18	AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks	Yurun Chen et.al.	2502.13053	null
2025-02-18	Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks	Markus J. Buehler et.al.	2502.13025	null
2025-02-18	Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents	Chaoran Chen et.al.	2502.13012	null
2025-02-18	Integrating Reinforcement Learning, Action Model Learning, and Numeric Planning for Tackling Complex Tasks	Yarin Benyamin et.al.	2502.13006	link
2025-02-18	You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations	Frederic Kirstein et.al.	2502.13001	null
2025-02-18	Free Argumentative Exchanges for Explaining Image Classifiers	Avinash Kori et.al.	2502.12995	link
2025-02-18	Generative AI and Information Asymmetry: Impacts on Adverse Selection and Moral Hazard	Yukun Zhang et.al.	2502.12969	null
2025-02-18	AI-Enabled Rent-Seeking: How Generative AI Alters Market Transparency and Efficiency	Yukun Zhang et.al.	2502.12956	null
2025-02-18	Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options	Lakshmi Nair et.al.	2502.12929	link
2025-02-18	SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems	Mike Zhang et.al.	2502.12927	null
2025-02-18	Towards more Contextual Agents: An extractor-Generator Optimization Framework	Mourad Aouini et.al.	2502.12926	null
2025-02-18	Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation	Zheng Yuan et.al.	2502.12911	null
2025-02-17	HARBOR: Exploring Persona Dynamics in Multi-Agent Competition	Kenan Jiang et.al.	2502.12149	null
2025-02-17	Scaling Autonomous Agents via Automatic Reward Modeling And Planning	Zhenfang Chen et.al.	2502.12130	null
2025-02-17	A-MEM: Agentic Memory for LLM Agents	Wujiang Xu et.al.	2502.12110	link
2025-02-17	Relational Norms for Human-AI Cooperation	Brian D. Earp et.al.	2502.12102	null
2025-02-17	A Study on Leveraging Search and Self-Feedback for Agent Reasoning	Karthikeyan K et.al.	2502.12094	null
2025-02-17	Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation	Zhongyi Qiu et.al.	2502.12073	null
2025-02-17	A survey about perceptions of mobility to inform an agent-based simulator of subjective modal choice	Carole Adam et.al.	2502.12058	null
2025-02-17	Multi-agent coordination via communication partitions	Wei-Chen Lee et.al.	2502.12042	null
2025-02-17	Machine Learning Should Maximize Welfare, Not (Only) Accuracy	Nir Rosenfeld et.al.	2502.11981	null
2025-02-17	FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control	Yutong Ye et.al.	2502.11937	null
2025-02-17	CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning	Yanxiao Zhao et.al.	2502.11896	null
2025-02-17	Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration	Shao Zhang et.al.	2502.11882	link
2025-02-17	Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models	Hyunwoo Kim et.al.	2502.11881	null
2025-02-17	Does Knowledge About Perceptual Uncertainty Help an Agent in Automated Driving?	Natalie Grabowsky et.al.	2502.11864	null
2025-02-17	Can LLM Agents Maintain a Persona in Discourse?	Pranav Bhandari et.al.	2502.11843	null
2025-02-17	Assessing the impacts of tradable credit schemes through agent-based simulation	Renming Liu et.al.	2502.11822	null
2025-02-17	Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning	Peiying Yu et.al.	2502.11799	null
2025-02-17	Personality Editing for Language Models through Relevant Knowledge Editing	Seojin Hwang et.al.	2502.11789	null
2025-02-17	Changing the Rules of the Game: Reasoning about Dynamic Phenomena in Multi-Agent Systems	Rustam Galimullin et.al.	2502.11785	null
2025-02-17	Plant in Cupboard, Orange on Table, Book on Shelf. Benchmarking Practical Reasoning and Situation Modelling in a Text-Simulated Situated Environment	Jonathan Jordan et.al.	2502.11733	null
2025-02-14	Representation and Interpretation in Artificial and Natural Computing	Luis A. Pineda et.al.	2502.10383	null
2025-02-14	Agentic Verification for Ambiguous Query Disambiguation	Youngwon Lee et.al.	2502.10352	null
2025-02-14	Process Reward Models for LLM Agents: Practical Framework and Directions	Sanjiban Choudhury et.al.	2502.10325	link
2025-02-14	Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations	Abdelrhman Shaheen et.al.	2502.10303	null
2025-02-14	Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers	Aivin V. Solatorio et.al.	2502.10263	null
2025-02-14	Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding	Laurin Luttmann et.al.	2502.10233	link
2025-02-14	A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation	Redha Taguelmimt et.al.	2502.10226	null
2025-02-14	Do Large Language Models Reason Causally Like Us? Even Better?	Hanna M. Dettki et.al.	2502.10215	null
2025-02-14	Dynamic Reinforcement Learning for Actors	Katsunari Shibata et.al.	2502.10200	null
2025-02-14	Reinforcement Learning based Constrained Optimal Control: an Interpretable Reward Design	Jingjie Ni et.al.	2502.10187	null
2025-02-14	STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning	Mingcong Lei et.al.	2502.10177	null
2025-02-14	Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model	Bo Ni et.al.	2502.10173	null
2025-02-14	Modeling biases in binary decision-making within the generalized nonlinear q-voter model	Maciej Doniec et.al.	2502.10172	null
2025-02-14	Combinatorial Reinforcement Learning with Preference Feedback	Joongkyu Lee et.al.	2502.10158	null
2025-02-14	Cooperative Multi-Agent Planning with Adaptive Skill Synthesis	Zhiyuan Li et.al.	2502.10148	null
2025-02-14	Provably Efficient RL under Episode-Wise Safety in Linear CMDPs	Toshinori Kitamura et.al.	2502.10138	null
2025-02-14	ScamFerret: Detecting Scam Websites Autonomously with Large Language Models	Hiroki Nakano et.al.	2502.10110	link
2025-02-14	Causal Information Prioritization for Efficient Reinforcement Learning	Hongye Cao et.al.	2502.10097	null
2025-02-14	Enhancing Patient Acceptance of Robotic Ultrasound through Conversational Virtual Agent and Immersive Visualizations	Tianyu Song et.al.	2502.10088	null
2025-02-14	Towards Empowerment Gain through Causal Structure Learning in Model-Based RL	Hongye Cao et.al.	2502.10077	null
2025-02-13	Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs	Siyan Zhao et.al.	2502.09597	link
2025-02-13	KIMAs: A Configurable Knowledge Integrated Multi-Agent System	Zitao Li et.al.	2502.09596	null
2025-02-13	Rolling Ahead Diffusion for Traffic Scene Simulation	Yunpeng Liu et.al.	2502.09587	null
2025-02-13	Learning to Coordinate with Experts	Mohamad H. Danesh et.al.	2502.09583	link
2025-02-13	Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks	Qian Wan et.al.	2502.09577	null
2025-02-13	MDCrow: Automating Molecular Dynamics Workflows with Large Language Models	Quintina Campbell et.al.	2502.09565	link
2025-02-13	EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents	Rui Yang et.al.	2502.09560	null
2025-02-13	Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages	Shreyan Biswas et.al.	2502.09532	null
2025-02-13	Exact Leader Estimation: A New Approach for Distributed Differentiation	Rodrigo Aldana-Lopez et.al.	2502.09529	null
2025-02-13	Forward-backward Contention Resolution Schemes for Fair Rationing	Will Ma et.al.	2502.09521	null
2025-02-13	Coupled Rendezvous and Docking Maneuver control of satellite using Reinforcement learning-based Adaptive Fixed-Time Sliding Mode Controller	Rakesh Kumar Sahoo et.al.	2502.09517	null
2025-02-13	Package Bids in Combinatorial Electricity Auctions: Selection, Welfare Losses, and Alternatives	Thomas Hübner et.al.	2502.09420	link
2025-02-13	Dialectics of antimicrobial peptides I: common mechanisms of offensive and protecting roles of the peptides	Marta V. Volovik et.al.	2502.09408	null
2025-02-13	Fair Division via Resource Augmentation	Hannaneh Akrami et.al.	2502.09377	null
2025-02-13	Language Agents as Digital Representatives in Collective Decision-Making	Daniel Jarrett et.al.	2502.09369	null
2025-02-13	Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning	Daniel Koutas et.al.	2502.09298	link
2025-02-13	Reliable Conversational Agents under ASP Control that Understand Natural Language	Yankai Zeng et.al.	2502.09237	null
2025-02-13	Pearce's Characterisation in an Epistemic Domain	Ezgi Iraz Su et.al.	2502.09221	null
2025-02-13	Mind the Gaps: Logical English, Prolog, and Multi-agent Systems for Autonomous Vehicles	Galileo Sartor et.al.	2502.09216	null
2025-02-13	Architecture for Simulating Behavior Mode Changes in Norm-Aware Autonomous Agents	Sean Glaze et.al.	2502.09215	null
2025-02-12	Poly-Autoregressive Prediction for Modeling Interactions	Neerja Thakkar et.al.	2502.08646	null
2025-02-12	Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs	Mantas Mazeika et.al.	2502.08640	null
2025-02-12	SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent	Keyeun Lee et.al.	2502.08599	link
2025-02-12	Learning in Markets with Heterogeneous Agents: Dynamics and Survival of Bayesian vs. No-Regret Learners	David Easley et.al.	2502.08597	null
2025-02-12	Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks	Ang Li et.al.	2502.08586	null
2025-02-12	Statistically validated projection of bipartite signed networks	Anna Gallo et.al.	2502.08567	null
2025-02-12	Human-Centric Foundation Models: Perception, Generation and Agentic Modeling	Shixiang Tang et.al.	2502.08556	link
2025-02-12	Extreme vulnerability to intruder attacks destabilizes network dynamics	Amirhossein Nazerian et.al.	2502.08552	null
2025-02-12	Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation	Mahnaz Koupaee et.al.	2502.08514	link
2025-02-12	Resilient Quantized Consensus in Multi-Hop Relay Networks	Liwei Yuan et.al.	2502.08455	null
2025-02-12	Non-Monetary Mechanism Design without Distributional Information: Using Scarce Audits Wisely	Yan Dai et.al.	2502.08412	null
2025-02-12	Towards Principled Multi-Agent Task Agnostic Exploration	Riccardo Zamboni et.al.	2502.08365	null
2025-02-12	Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems	Yuxin Pan et.al.	2502.08340	link
2025-02-12	Hierarchical Multi-Agent Framework for Carbon-Efficient Liquid-Cooled Data Center Clusters	Soumyendu Sarkar et.al.	2502.08337	null
2025-02-12	Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning	Sun Jingbo et.al.	2502.08336	null
2025-02-12	Decentralised multi-agent coordination for real-time railway traffic management	Leo D'Amato et.al.	2502.08324	null
2025-02-12	Compromising Honesty and Harmlessness in Language Models via Deception Attacks	Laurène Vaugrante et.al.	2502.08301	null
2025-02-12	Higher-order Laplacian dynamics on hypergraphs with cooperative and antagonistic interactions	Shaoxuan Cui et.al.	2502.08276	null
2025-02-12	Principles and Framework for the Operationalisation of Meaningful Human Control over Autonomous Systems	Simeon C. Calvert et.al.	2502.08255	null
2025-02-12	The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks	Alejandro Cuadron et.al.	2502.08235	link
2025-02-11	MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces	Loris Gaven et.al.	2502.07709	link
2025-02-11	Human Decision-making is Susceptible to AI-driven Manipulation	Sahand Sabour et.al.	2502.07663	null
2025-02-11	Robust-Sorting and Applications to Ulam-Median	Ragesh Jaiswal et.al.	2502.07653	null
2025-02-11	Distributed Value Decomposition Networks with Networked Agents	Guilherme S. Varela et.al.	2502.07635	null
2025-02-11	Decision-Making Under Complete Uncertainty: You Will Regret Not Being Greedy	Kristijan Atanasov et.al.	2502.07593	null
2025-02-11	DMWM: Dual-Mind World Model with Long-Term Imagination	Lingyi Wang et.al.	2502.07591	null
2025-02-11	Pure $ε$ -equilibrium in random games	Bary S. R. Pradelski et.al.	2502.07585	null
2025-02-11	Genetic evolution of a multi-generational population in the context of interstellar space travels -- Part II: Phenotypic effects of gene expression	Frédéric Marin et.al.	2502.07559	null
2025-02-11	Unsupervised Translation of Emergent Communication	Ido Levy et.al.	2502.07552	null
2025-02-11	A Near-optimal, Scalable and Corruption-tolerant Framework for Stochastic Bandits: From Single-Agent to Multi-Agent and Beyond	Zicheng Hu et.al.	2502.07514	null
2025-02-11	Exploring Word-Representable Temporal Graphs	Duncan Adamson et.al.	2502.07496	null
2025-02-11	Multi-Agent Collaboration for Multilingual Code Instruction Tuning	Jian Yang et.al.	2502.07487	null
2025-02-11	On Event-Triggered Resilient Consensus Using Auxiliary Layer	Pushkal Purohit et.al.	2502.07470	null
2025-02-11	Approximating Human Strategic Reasoning with LLM-Enhanced Recursive Reasoners Leveraging Multi-agent Hypergames	Vince Trencsenyi et.al.	2502.07443	null
2025-02-11	Coupling Agent-Based Simulations and VR universes: the case of GAMA and Unity	Alexis Drogoul et.al.	2502.07405	null
2025-02-11	FinRL-DeepSeek: LLM-Infused Risk-Sensitive Reinforcement Learning for Trading Agents	Mostapha Benhenda et.al.	2502.07393	link
2025-02-11	EvoFlow: Evolving Diverse Agentic Workflows On The Fly	Guibin Zhang et.al.	2502.07373	null
2025-02-11	KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems	Jusheng Zhang et.al.	2502.07350	null
2025-02-11	The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study	Fengming Zhu et.al.	2502.07332	null
2025-02-11	CreAgent: Towards Long-Term Evaluation of Recommender System under Platform-Creator Information Asymmetry	Xiaopeng Ye et.al.	2502.07307	link
2025-02-10	Visual Agentic AI for Spatial Reasoning with a Dynamic API	Damiano Marsili et.al.	2502.06787	null
2025-02-10	Towards Internet-Scale Training For Agents	Brandon Trabucco et.al.	2502.06776	null
2025-02-10	Distributed Constraint-Coupled Optimization: Harnessing ADMM-consensus for robustness	Mohamed Abdelmouamin Messilem et.al.	2502.06763	null
2025-02-10	Incentivizing Desirable Effort Profiles in Strategic Classification: The Role of Causality and Uncertainty	Valia Efthymiou et.al.	2502.06749	null
2025-02-10	Institutional Preferences in the Laboratory	Qiankun Zhong et.al.	2502.06748	null
2025-02-10	Wandering around: A bioinspired approach to visual attention through object motion sensitivity	Giulia D Angelo et.al.	2502.06747	link
2025-02-10	AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection	Roohan Ahmed Khan et.al.	2502.06725	null
2025-02-10	Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene	Tai-Yu Pan et.al.	2502.06682	null
2025-02-10	Quantile Multi-Armed Bandits with 1-bit Feedback	Ivan Lau et.al.	2502.06678	null
2025-02-10	Unbiased Evaluation of Large Language Models from a Causal Perspective	Meilin Chen et.al.	2502.06655	null
2025-02-10	Enhancing healthcare infrastructure resilience through agent-based simulation methods	David Carramiñana et.al.	2502.06636	null
2025-02-10	Hinderance of cooperation by individual solutions: Evolutionary dynamics of three-strategy games combining the prisoner's dilemma and stag hunt	Hirofumi Takesue et.al.	2502.06624	null
2025-02-10	Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training	Yuchen Zhuang et.al.	2502.06589	null
2025-02-10	Network Creation Games with 2-Neighborhood Maximization	Merlin de la Haye et.al.	2502.06561	null
2025-02-10	Marginal Mechanisms For Balanced Exchange	Vikram Manjunath et.al.	2502.06499	null
2025-02-10	Utilitarian Distortion with Predictions	Aris Filos-Ratsikas et.al.	2502.06489	null
2025-02-10	KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment	Yuxing Lu et.al.	2502.06472	link
2025-02-10	A Quadratic Lower Bound for Stable Roommates Solvability	Will Rosenbaum et.al.	2502.06464	null
2025-02-10	SIGMA: Sheaf-Informed Geometric Multi-Agent Pathfinding	Shuhao Liao et.al.	2502.06440	null
2025-02-10	The AI off-switch problem as a signalling game: bounded rationality and incomparability	Alessio benavoli et.al.	2502.06403	null
2025-02-07	Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray	Yunhang Shen et.al.	2502.05177	link
2025-02-07	MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison	Kaijie Zhu et.al.	2502.05174	null
2025-02-07	From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance	Jiamin Xu et.al.	2502.05145	link
2025-02-07	Maximin Share Guarantees for Few Agents with Subadditive Valuations	George Christodoulou et.al.	2502.05141	null
2025-02-07	Joint TITE-CRM for Dual Agent Dose Finding Studies	Helen Barnett et.al.	2502.05072	null
2025-02-07	Exploring the Generalizability of Geomagnetic Navigation: A Deep Reinforcement Learning approach with Policy Distillation	Wenqi Bai et.al.	2502.05069	null
2025-02-07	nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow	Geliang Ouyang et.al.	2502.05036	link
2025-02-07	Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency	Qixin Zhang et.al.	2502.05028	null
2025-02-07	Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning	Tristan K. Schuler et.al.	2502.05014	null
2025-02-07	The Rising Threat to Emerging AI-Powered Search Engines	Zeren Luo et.al.	2502.04951	null
2025-02-07	$TAR^2$ : Temporal-Agent Reward Redistribution for Optimal Policy Preservation in Multi-Agent Reinforcement Learning	Aditya Kapoor et.al.	2502.04864	null
2025-02-07	Humans Co-exist, So Must Embodied Artificial Agents	Hannah Kuehn et.al.	2502.04809	null
2025-02-07	Unified description of viscous, viscoelastic, or elastic thin active films on substrates	Henning Reinken et.al.	2502.04802	null
2025-02-07	S $^2$ -MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency	Yuting Zeng et.al.	2502.04790	null
2025-02-07	A non-zero-sum game with reinforcement learning under mean-variance framework	Junyi Guo et.al.	2502.04788	null
2025-02-07	SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning	Wanjia Zhao et.al.	2502.04780	link
2025-02-07	An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks	George Papadopoulos et.al.	2502.04773	link
2025-02-07	Shapley Value Approximation Based on k-Additive Games	Guilherme Dean Pelegrina et.al.	2502.04763	null
2025-02-07	Every Software as an Agent: Blueprint and Case Study	Mengwei Xu et.al.	2502.04747	null
2025-02-07	Multi-Agent Coverage Control in Non-Convex Annulus Region with Conformal Mapping	Xun Feng et.al.	2502.04697	null
2025-02-06	ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization	Yinjie Wang et.al.	2502.04306	link
2025-02-06	Mutual Multilinearity of Nonequilibrium Network Currents	Sara Dal Cengio et.al.	2502.04298	null
2025-02-06	DECAF: Learning to be Fair in Multi-agent Resource Allocation	Ashwin Kumar et.al.	2502.04281	null
2025-02-06	Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study	Michael Walters et.al.	2502.04249	null
2025-02-06	Multi-agent Architecture Search via Agentic Supernet	Guibin Zhang et.al.	2502.04180	null
2025-02-06	Dense Fixed-Wing Swarming using Receding-Horizon NMPC	Varun Madabushi et.al.	2502.04174	null
2025-02-06	Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning	Wesley A. Suttle et.al.	2502.04141	null
2025-02-06	Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation	Jiahao Lu et.al.	2502.04139	null
2025-02-06	VTutor: An Open-Source SDK for Generative AI-Powered Animated Pedagogical Agents with Multi-Media Output	Eason Chen et.al.	2502.04103	null
2025-02-06	Strategic Learning with Local Explanations as Feedback	Kiet Q. H. Vo et.al.	2502.04058	null
2025-02-06	Simulating the Emergence of Differential Case Marking with Communicating Neural-Network Agents	Yuchen Lian et.al.	2502.04038	null
2025-02-06	Deep Meta Coordination Graphs for Multi-agent Reinforcement Learning	Nikunj Gupta et.al.	2502.04028	link
2025-02-06	Near-optimal Regret Using Policy Optimization in Online MDPs with Aggregate Bandit Feedback	Tal Lancewicki et.al.	2502.04004	null
2025-02-06	Fairness Aware Reinforcement Learning via Proximal Policy Optimization	Gabriele La Malfa et.al.	2502.03953	null
2025-02-06	Enhancing Online Learning Efficiency Through Heterogeneous Resource Integration with a Multi-Agent RAG System	Devansh Srivastav et.al.	2502.03948	null
2025-02-06	Geometric Stabilization of Virtual Nonlinear Nonholonomic Constraints	Efstratios Stratoglou et.al.	2502.03902	null
2025-02-06	Any theory that admits a Wigner's Friend type multi-agent paradox is logically contextual	Nuriya Nurgalieva et.al.	2502.03874	null
2025-02-06	PAGNet: Pluggable Adaptive Generative Networks for Information Completion in Multi-Agent Communication	Zhuohui Zhang et.al.	2502.03845	null
2025-02-06	PsyPlay: Personality-Infused Role-Playing Conversational Agents	Tao Yang et.al.	2502.03821	null
2025-02-06	Large Language Models for Multi-Robot Systems: A Survey	Peihan Li et.al.	2502.03814	null
2025-02-05	A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs)	Yiye Chen et.al.	2502.03450	null
2025-02-05	Prediction of the Most Fire-Sensitive Point in Building Structures with Differentiable Agents for Thermal Simulators	Yuan Xinjie et.al.	2502.03424	null
2025-02-05	Energy-Efficient Flying LoRa Gateways: A Multi-Agent Reinforcement Learning Approach	Abdullahi Isa Ahmed et.al.	2502.03377	null
2025-02-05	Learning from Active Human Involvement through Proxy Value Propagation	Zhenghao Peng et.al.	2502.03369	null
2025-02-05	PalimpChat: Declarative and Interactive AI analytics	Chunwei Liu et.al.	2502.03368	null
2025-02-05	Inverse Mixed Strategy Games with Generative Trajectory Models	Max Muchen Sun et.al.	2502.03356	null
2025-02-05	Implicit Communication in Human-Robot Collaborative Transport	Elvin Yang et.al.	2502.03346	link
2025-02-05	Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes	Haotian Wu et.al.	2502.03335	null
2025-02-05	SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs	Ben Liu et.al.	2502.03283	null
2025-02-05	Modeling and Optimization of Insulin Injection for Type-1 Diabetes Mellitus Management	Rinrada Jadsadaphongphaibool et.al.	2502.03269	null
2025-02-05	iVISPAR -- An Interactive Visual-Spatial Reasoning Benchmark for VLMs	Julius Mayer et.al.	2502.03214	link
2025-02-05	MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent	Xinyao Liao et.al.	2502.03207	null
2025-02-05	Cooperative Behavior in Pre-State Societies: An Agent-Based Approach of the Axum Civilization	Riccardo Vasellini et.al.	2502.03191	null
2025-02-05	Strategizing with AI: Insights from a Beauty Contest Experiment	Iuliia Alekseenko et.al.	2502.03158	null
2025-02-05	Group Trip Planning Query Problem with Multimodal Journey	Dildar Ali et.al.	2502.03144	null
2025-02-05	Underwater Soft Fin Flapping Motion with Deep Neural Network Based Surrogate Model	Yuya Hamamatsu et.al.	2502.03135	link
2025-02-05	Double Distillation Network for Multi-Agent Reinforcement Learning	Yang Zhou et.al.	2502.03125	null
2025-02-05	Cooperation, satisfaction, and rationality in social games on complex networks with aspiration-driven players	M. Aguilar-Janita et.al.	2502.03109	null
2025-02-05	Learning Efficient Flocking Control based on Gibbs Random Fields	Dengyu Zhang et.al.	2502.02984	null
2025-02-05	FedMobileAgent: Training Mobile Agents Using Decentralized Self-Sourced Data from Diverse Users	Wenhao Wang et.al.	2502.02982	null
2025-02-04	QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search	Zongyu Lin et.al.	2502.02584	link
2025-02-04	Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents	Shayan Kiyani et.al.	2502.02561	null
2025-02-04	AAD-DCE: An Aggregated Multimodal Attention Mechanism for Early and Late Dynamic Contrast Enhanced Prostate MRI Synthesis	Divya Bharti et.al.	2502.02555	link
2025-02-04	Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks	Huiqun Huang et.al.	2502.02537	null
2025-02-04	Adaptive Self-improvement LLM Agentic System for ML Library Development	Genghan Zhang et.al.	2502.02534	link
2025-02-04	Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies	Han Zhou et.al.	2502.02533	null
2025-02-04	Why human-AI relationships need socioaffective alignment	Hannah Rose Kirk et.al.	2502.02528	null
2025-02-04	The Cost Perspective of Liquid Democracy: Feasibility and Control	Shiri Alouf-Heffetz et.al.	2502.02380	null
2025-02-04	Mirai: A Wearable Proactive AI "Inner-Voice" for Contextual Nudging	Cathy Mengying Fang et.al.	2502.02370	null
2025-02-04	MAGNNET: Multi-Agent Graph Neural Network-based Efficient Task Allocation for Autonomous Vehicles with Deep Reinforcement Learning	Lavanya Ratnabala et.al.	2502.02311	null
2025-02-04	Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement Learning Control	Donghe Chen et.al.	2502.02265	null
2025-02-04	An altruistic resource-sharing mechanism for synchronization: The energy-speed-accuracy tradeoff	Dongliang Zhang et.al.	2502.02242	null
2025-02-04	The Induced Matching Distance: A Novel Topological Metric with Applications in Robotics	Javier Perera-Lago et.al.	2502.02112	link
2025-02-04	Sequential Multi-objective Multi-agent Reinforcement Learning Approach for Predictive Maintenance	Yan Chen et.al.	2502.02071	null
2025-02-04	AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement	Shivam Singh et.al.	2502.02067	link
2025-02-04	Anticipate & Act : Integrating LLMs and Classical Planning for Efficient Task Execution in Household Environments	Raghav Arora et.al.	2502.02066	null
2025-02-04	CH-MARL: Constrained Hierarchical Multiagent Reinforcement Learning for Sustainable Maritime Logistics	Saad Alqithami et.al.	2502.02060	null
2025-02-04	RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation	Minwoo Kim et.al.	2502.02054	null
2025-02-04	Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer	Yaodong Yang et.al.	2502.02018	link
2025-02-04	The Wisdom of Intellectually Humble Networks	Mohammad Ratul Mahjabin et.al.	2502.02015	link
2025-01-31	Vintix: Action Model via In-Context Reinforcement Learning	Andrey Polubarov et.al.	2501.19400	link
2025-01-31	Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game	Mustafa O. Karabag et.al.	2501.19398	link
2025-01-31	Learning Contracts in Hierarchical Multi-Agent Systems	Antoine Scheid et.al.	2501.19388	null
2025-01-31	The Physics and Metaphysics of Social Powers: Bridging Cognitive Processing and Social Dynamics, a New Perspective on Power through Active Inference	Mahault Albarracin et.al.	2501.19368	null
2025-01-31	PixelWorld: Towards Perceiving Everything as Pixels	Zhiheng Lyu et.al.	2501.19339	null
2025-01-31	MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems	Anirudh Chari et.al.	2501.19318	null
2025-01-31	Objective Metrics for Human-Subjects Evaluation in Explainable Reinforcement Learning	Balint Gyevnar et.al.	2501.19256	null
2025-02-03	SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments	Hüseyin Aydın et.al.	2501.19245	link
2025-01-31	Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics	Xingyu Wang et.al.	2501.19239	null
2025-01-31	A parallelizable variant of HCA*	Sreenivasan Ganti et.al.	2501.19218	null
2025-01-31	An Empirical Game-Theoretic Analysis of Autonomous Cyber-Defence Agents	Gregory Palmer et.al.	2501.19206	null
2025-01-31	Autonomous Legacy Web Application Upgrades Using a Multi-Agent System	Valtteri Ala-Salmi et.al.	2501.19204	link
2025-01-31	A Comunication Framework for Compositional Generation	Rafael Elberg et.al.	2501.19182	null
2025-01-31	Augmented Intelligence for Multimodal Virtual Biopsy in Breast Cancer Using Generative Artificial Intelligence	Aurora Rofena et.al.	2501.19176	null
2025-01-31	Implications of zero-growth economics analysed with an agent-based model	Dylan C. Terry-Doyle et.al.	2501.19168	null
2025-01-31	Test-Time Training Scaling for Chemical Exploration in Drug Design	Morgan Thomas et.al.	2501.19153	null
2025-01-31	Constant-Factor Distortion Mechanisms for $k$ -Committee Election	Haripriya Pulyassary et.al.	2501.19148	null
2025-01-31	Prediction-Aware Learning in Multi-Agent Systems	Aymeric Capitaine et.al.	2501.19144	null
2025-01-31	Imitation Game for Adversarial Disillusion with Multimodal Generative Chain-of-Thought Role-Play	Ching-Chun Chang et.al.	2501.19143	null
2025-01-31	Shaping Sparse Rewards in Reinforcement Learning: A Semi-supervised Approach	Wenyun Li et.al.	2501.19128	null
2025-01-30	Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method	Peter Baile Chen et.al.	2501.18539	null
2025-01-30	Design and Validation of Learning Aware HMI For Learning-Enabled Increasingly Autonomous Systems	Parth Ganeriwala et.al.	2501.18506	null
2025-01-30	Graph Exploration with Edge Weight Estimates	Matthias Gehnen et.al.	2501.18496	null
2025-01-30	Conversation Games and a Strategic View of the Turing Test	Kaveh Aryan et.al.	2501.18455	null
2025-01-30	Stable Marriage: Loyalty vs. Competition	Amit Ronen et.al.	2501.18442	null
2025-01-30	Gravity-Bench-v1: A Benchmark on Gravitational Physics Discovery for Agents	Nolan Koblischke et.al.	2501.18411	null
2025-01-30	Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach	Tianpeng Pan et.al.	2501.18320	null
2025-01-30	Model-Free RL Agents Demonstrate System 1-Like Intentionality	Hal Ashton et.al.	2501.18299	null
2025-01-30	CueTip: An Interactive and Explainable Physics-aware Pool Assistant	Sean Memery et.al.	2501.18291	null
2025-01-30	Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents	ShuiDe Wen et.al.	2501.18190	null
2025-01-30	Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation	Teddy Lazebnik et.al.	2501.18177	null
2025-01-30	RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing	Jinyao Guo et.al.	2501.18160	null
2025-01-30	Model Checking for Multi-Agent Systems Modeled By Epistemic Process Calculus	Qixian Yu et.al.	2501.18155	null
2025-01-30	Utilizing API Response for Test Refinement	Devika Sondhi et.al.	2501.18145	null
2025-01-30	B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning	Woojun Kim et.al.	2501.18138	null
2025-01-30	DCatalyst: A Unified Accelerated Framework for Decentralized Optimization	Tianyu Cao et.al.	2501.18114	null
2025-01-29	Joint Pricing and Resource Allocation: An Optimal Online-Learning Approach	Jianyu Xu et.al.	2501.18049	null
2025-01-29	A Case Study in Acceleration AI Ethics: The TELUS GenAI Conversational Agent	James Brusseau et.al.	2501.18038	null
2025-01-29	Large Language Models Think Too Fast To Explore Effectively	Lan Pan et.al.	2501.18009	null
2025-01-29	Agentic Workflows for Conversational Human-AI Interaction Design	Arthur Caetano et.al.	2501.18002	null
2025-01-29	From Sparse to Dense: Toddler-inspired Reward Transition in Goal-Oriented Reinforcement Learning	Junseok Park et.al.	2501.17842	null
2025-01-29	A note on the Cucker-Smale model with time delay and communication failures	Elisa Continelli et.al.	2501.17743	null
2025-01-29	RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts	Eujeong Choi et.al.	2501.17715	link
2025-01-29	Inferring Implicit Goals Across Differing Task Models	Silvia Tulli et.al.	2501.17704	null
2025-01-29	CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization	Derui Wang et.al.	2501.17667	link
2025-01-29	Multi-Agent Path Finding Using Conflict-Based Search and Structural-Semantic Topometric Maps	Scott Fredriksson et.al.	2501.17661	null
2025-01-29	Coalitional control: a bottom-up approach	Filiberto Fele et.al.	2501.17614	null
2025-01-29	Coalitional model predictive control of an irrigation canal	Filiberto Fele et.al.	2501.17561	null
2025-01-29	Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant	Gaole He et.al.	2501.17546	link
2025-01-29	Sequential Learning of the Pareto Front for Multi-objective Bandits	Elise Crépon et.al.	2501.17513	link
2025-01-29	Monetary-Fiscal Interaction and the Liquidity of Government Debt	Cristiano Cantore et.al.	2501.17458	null
2025-01-29	Human-Aligned Skill Discovery: Balancing Behaviour Exploration and Alignment	Maxence Hussonnois et.al.	2501.17431	null
2025-01-29	Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models	Yuxuan Li et.al.	2501.17420	null
2025-01-29	General Scene Adaptation for Vision-and-Language Navigation	Haodong Hong et.al.	2501.17403	link
2025-01-29	Optimal Utility Design with Arbitrary Information Networks	Vartika Singh et.al.	2501.17385	null
2025-01-29	A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning	Zhengpeng Xie et.al.	2501.17384	null
2025-01-28	Anomaly Detection in Cooperative Vehicle Perception Systems under Imperfect Communication	Ashish Bastola et.al.	2501.17329	null
2025-01-28	A sketch of an AI control safety case	Tomek Korbak et.al.	2501.17315	null
2025-01-28	Controlling AI Agent Participation in Group Conversations: A Human-Centered Approach	Stephanie Houde et.al.	2501.17258	null
2025-01-28	Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning	Rémy Hosseinkhan Boucher et.al.	2501.17115	null
2025-01-28	CRSet: Non-Interactive Verifiable Credential Revocation with Metadata Privacy for Issuers and Everyone Else	Felix Hoops et.al.	2501.17089	null
2025-01-28	Learning Mean Field Control on Sparse Graphs	Christian Fabian et.al.	2501.17079	null
2025-01-28	Induced Modularity and Community Detection for Functionally Interpretable Reinforcement Learning	Anna Soligo et.al.	2501.17077	null
2025-01-28	Context is Key in Agent Security	Lillian Tsai et.al.	2501.17070	null
2025-01-28	Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework	Longzhong Lin et.al.	2501.17015	null
2025-01-28	Towards Open-Source and Modular Space Systems with ATMOS	Pedro Roque et.al.	2501.16973	null
2025-01-28	Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning	Xi Chen et.al.	2501.16966	null
2025-01-28	ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations	Xinyi Ni et.al.	2501.16945	null
2025-01-28	Beyond Human Intervention: Algorithmic Collusion through Multi-Agent Learning Strategies	Suzie Grondin et.al.	2501.16935	null
2025-01-28	Optimization and Learning in Open Multi-Agent Systems	Diego Deplano et.al.	2501.16847	null
2025-01-28	RG-Attn: Radian Glue Attention for Multi-modality Multi-agent Cooperative Perception	Lantao Li et.al.	2501.16803	null
2025-01-28	A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process	Jack David Carson et.al.	2501.16783	null
2025-01-28	Target-driven Self-Distillation for Partial Observed Trajectories Forecasting	Pengfei Zhu et.al.	2501.16767	null
2025-01-28	Quantum advantage in decentralized control of POMDPs: A control-theoretic view of the Mermin-Peres square	Venkat Anantharam et.al.	2501.16690	null
2025-01-28	MACI: Multi-Agent Collaborative Intelligence for Robust Reasoning and Temporal Planning	Edward Y. Chang et.al.	2501.16689	null
2025-01-28	Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting	Li Yin et.al.	2501.16673	link
2025-01-28	Jupybara: Operationalizing a Design Space for Actionable Data Analysis and Storytelling with LLMs	Huichen Will Wang et.al.	2501.16661	null
2025-01-28	Large Language Model Critics for Execution-Free Evaluation of Code Changes	Aashish Yadavally et.al.	2501.16655	link
2025-01-28	More Efficient Sybil Detection Mechanisms Leveraging Resistance of Users to Attack Requests	Ali Safarpoor Dehkordi et.al.	2501.16624	link
2025-01-27	LUCY: Linguistic Understanding and Control Yielding Early Stage of Her	Heting Gao et.al.	2501.16327	link
2025-01-27	Privacy-aware Nash Equilibrium Synthesis with Partially Ordered LTL $_f$ Objectives	Caleb Probine et.al.	2501.16307	null
2025-01-27	Multi-Agent Geospatial Copilots for Remote Sensing Workflows	Chaehong Lee et.al.	2501.16254	null
2025-01-27	Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma	Richard Willis et.al.	2501.16173	link
2025-01-27	AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants	Pascal J. Sager et.al.	2501.16150	null
2025-01-27	Quantifying the Self-Interest Level of Markov Social Dilemmas	Richard Willis et.al.	2501.16138	null
2025-01-27	Multi-Agent Meta-Offline Reinforcement Learning for Timely UAV Path Planning and Data Collection	Eslam Eldeeb et.al.	2501.16098	null
2025-01-27	Galaxy Era: Agent-based Simulation of Execution Tickets	Pascal Stichler et.al.	2501.16090	link
2025-01-27	Value-oriented forecast reconciliation for renewables in electricity markets	Honglin Wen et.al.	2501.16086	null
2025-01-27	Generating Spatial Synthetic Populations Using Wasserstein Generative Adversarial Network: A Case Study with EU-SILC Data for Helsinki and Thessaloniki	Vanja Falck et.al.	2501.16080	null
2025-01-27	Translating and evaluating single-cell Boolean network interventions in the multiscale setting	John Metzcar et.al.	2501.16052	link
2025-01-27	Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting	Ahmed Ben Yahmed et.al.	2501.16018	null
2025-01-27	Modeling and stability analysis of live systems with time-varying dimension	Andrii Mironchenko et.al.	2501.15991	null
2025-01-27	Online Housing Market	Julien Lesca et.al.	2501.15916	null
2025-01-27	Explaining Facial Expression Recognition	Sanjeev Nahulanthran et.al.	2501.15864	null
2025-01-27	LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models	Yuewen Mei et.al.	2501.15850	null
2025-01-27	The Strong Core of Housing Markets with Partial Order Preferences	Ildikó Schlotter et.al.	2501.15834	null
2025-01-27	MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer	Qi Chen et.al.	2501.15826	null
2025-01-27	Adaptive AI-based Decentralized Resource Management in the Cloud-Edge Continuum	Lanpei Li et.al.	2501.15802	null
2025-01-27	Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs	Yu Li et.al.	2501.15791	link
2025-01-24	An Attentive Graph Agent for Topology-Adaptive Cyber Defence	Ilya Orson Sandoval et.al.	2501.14700	link
2025-01-24	The Division of Surplus and the Burden of Proof	Deniz Kattwinkel et.al.	2501.14686	null
2025-01-24	MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications	Yixing Jiang et.al.	2501.14654	link
2025-01-24	Whisper D-SGD: Correlated Noise Across Agents for Differentially Private Decentralized Learning	Angelo Rodio et.al.	2501.14644	link
2025-01-24	Fair Division Beyond Monotone Valuations	Siddharth Barman et.al.	2501.14609	null
2025-01-24	Hybrid Quantum-Classical Multi-Agent Pathfinding	Thore Gerlach et.al.	2501.14568	null
2025-01-24	Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation	Wenzhang Liu et.al.	2501.14543	link
2025-01-24	Breaking the Pre-Planning Barrier: Real-Time Adaptive Coordination of Mission and Charging UAVs Using Graph Reinforcement Learning	Yuhan Hu et.al.	2501.14488	null
2025-01-24	Avoiding Overfitting in Variable-Order Markov Models: a Cross-Validation Approach	Valeria Secchini et.al.	2501.14476	null
2025-01-24	The Pseudo-Dimension of Contracts	Paul Duetting et.al.	2501.14474	null
2025-01-24	MARL-OT: Multi-Agent Reinforcement Learning Guided Online Fuzzing to Detect Safety Violation in Autonomous Driving Systems	Linfeng Liang et.al.	2501.14451	null
2025-01-24	Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent	Lucía Güitta-López et.al.	2501.14443	null
2025-01-24	DeepFlow: Serverless Large Language Model Serving at Scale	Junhao Hu et.al.	2501.14417	null
2025-01-24	DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing	Xinyu Ma et.al.	2501.14371	link
2025-01-24	Online Inverse Linear Optimization: Improved Regret Bound, Robustness to Suboptimality, and Toward Tight Regret Analysis	Shinsaku Sakaue et.al.	2501.14349	null
2025-01-24	Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts	Clément Desroches et.al.	2501.14334	null
2025-01-24	MASTER: A Multi-Agent System with LLM Specialized MCTS	Bingzheng Gan et.al.	2501.14304	null
2025-01-24	TrajFlow: A Generative Framework for Occupancy Density Estimation Using Normalizing Flows	Mitch Kosieradzki et.al.	2501.14266	link
2025-01-24	Non-selective evaporation mechanism of binary aerosol generating agent on porous atomizer and its experimental verification	Xie Guoyong et.al.	2501.14262	null
2025-01-24	Optimal Investment under Mutual Strategy Influence among Agents	Huisheng Wang et.al.	2501.14259	null
2025-01-23	GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration	Yue Fan et.al.	2501.13896	null
2025-01-23	Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning	Matyáš Lorenc et.al.	2501.13883	link
2025-01-23	Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems	Ethan Wilson et.al.	2501.13878	null
2025-01-23	EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents	Yuhui Yun et.al.	2501.13746	null
2025-01-23	Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System	Haikuo Du et.al.	2501.13727	link
2025-01-23	A Non-Parametric Approach to Heterogeneity Analysis	Avner Seror et.al.	2501.13721	null
2025-01-23	Revisiting Online Learning Approach to Inverse Linear Optimization: A Fenchel--Young Loss Perspective and Gap-Dependent Regret Analysis	Shinsaku Sakaue et.al.	2501.13648	null
2025-01-23	WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control	Claire Bizon Monroc et.al.	2501.13592	link
2025-01-23	Explainable AI-aided Feature Selection and Model Reduction for DRL-based V2X Resource Allocation	Nasir Khan et.al.	2501.13552	null
2025-01-23	Towards a Theory of AI Personhood	Francis Rhys Ward et.al.	2501.13533	null
2025-01-23	Communication-Efficient Stochastic Distributed Learning	Xiaoxing Ren et.al.	2501.13516	null
2025-01-23	A Polynomial-Time Algorithm for EFX Orientations of Chores	Kevin Hsu et.al.	2501.13481	null
2025-01-23	Knowledge-Informed Multi-Agent Trajectory Prediction at Signalized Intersections for Infrastructure-to-Everything	Huilin Yin et.al.	2501.13461	null
2025-01-23	BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch	Yulong Hu et.al.	2501.13448	null
2025-01-23	VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework	He Kong et.al.	2501.13411	link
2025-01-23	Concurrent Learning with Aggregated States via Randomized Least Squares Value Iteration	Yan Chen et.al.	2501.13394	null
2025-01-23	Do as We Do, Not as You Think: the Conformity of Large Language Models	Zhiyuan Weng et.al.	2501.13381	link
2025-01-23	Task Allocation in Customer-led Two-sided Markets with Satellite Constellation Services	Jianglin Qiao et.al.	2501.13364	null
2025-01-23	AgentRec: Agent Recommendation Using Sentence Embeddings Aligned to Human Feedback	Joshua Park et.al.	2501.13333	link
2025-01-23	Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents	Shrinidhi Kumbhar et.al.	2501.13299	null
2025-01-22	Boosting MCTS with Free Energy Minimization	Mawaba Pascal Dao et.al.	2501.13083	null
2025-01-22	Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment	Melissa Kazemi Rad et.al.	2501.13080	null
2025-01-22	Evolution and The Knightian Blindspot of Machine Learning	Joel Lehman et.al.	2501.13075	null
2025-01-22	Optimizing Return Distributions with Distributional Dynamic Programming	Bernardo Ávila Pires et.al.	2501.13028	null
2025-01-22	The regret lower bound for communicating Markov Decision Processes	Victor Boone et.al.	2501.13013	null
2025-01-22	MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking	Sebastian Farquhar et.al.	2501.13011	null
2025-01-22	Constructive characterisations of the must-preorder for asynchrony	Giovanni Bernardi et.al.	2501.13002	null
2025-01-22	An Offline Multi-Agent Reinforcement Learning Framework for Radio Resource Management	Eslam Eldeeb et.al.	2501.12991	null
2025-01-22	Learning-based Distributed Model Predictive Control using Multi-Agent Bayesian Optimization	Hossein Nejatbakhsh Esfahani et.al.	2501.12989	null
2025-01-22	Quantification of Ultrafast Nonlinear Photothermal and Photoacoustic Effects in Molecular Thin Films via Time-Domain Brillouin Scattering	Valentin Cherruault et.al.	2501.12912	null
2025-01-22	FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces	Zhenran Xu et.al.	2501.12909	null
2025-01-22	Mutation-Guided LLM-based Test Generation at Meta	Christopher Foster et.al.	2501.12862	null
2025-01-22	ACEBench: Who Wins the Match Point in Tool Learning?	Chen Chen et.al.	2501.12851	null
2025-01-22	To Measure or Not: A Cost-Sensitive, Selective Measuring Environment for Agricultural Management Decisions with Reinforcement Learning	Hilmy Baja et.al.	2501.12823	link
2025-01-22	PSGSL: A Probabilistic Framework Integrating Semantic Scene Understanding and Gas Sensing for Gas Source Localization	Pepe Ojeda et.al.	2501.12812	null
2025-01-22	Information Design for Adaptive Organizations	Wataru Tamura et.al.	2501.12669	null
2025-01-22	NBDI: A Simple and Efficient Termination Condition for Skill Extraction from Task-Agnostic Demonstrations	Myunsoo Kim et.al.	2501.12668	null
2025-01-22	Optimal Rebate Design: Incentives, Competition and Efficiency in Auction Markets	Thibaut Mastrolia et.al.	2501.12591	null
2025-01-22	Leveraging LLMs to Create a Haptic Devices' Recommendation System	Yang Liu et.al.	2501.12573	null
2025-01-21	Reinforcement Learning Constrained Beam Search for Parameter Optimization of Paper Drying Under Flexible Constraints	Siyuan Chen et.al.	2501.12542	null
2025-01-21	Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists	Thomas F. Eisenmann et.al.	2501.12374	link
2025-01-21	UI-TARS: Pioneering Automated GUI Interaction with Native Agents	Yujia Qin et.al.	2501.12326	link
2025-01-21	Transitions to synchronization in adaptive multilayer networks with higher-order interactions	Richita Ghosh et.al.	2501.12301	null
2025-01-21	mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework	Bingyi Liu et.al.	2501.12263	null
2025-01-21	Multi-Agent Feedback Motion Planning using Probably Approximately Correct Nonlinear Model Predictive Control	Mark Gonzales et.al.	2501.12234	null
2025-01-21	Empower Healthcare through a Self-Sovereign Identity Infrastructure for Secure Electronic Health Data Access	Antonio López Martínez et.al.	2501.12229	null
2025-01-21	Convergence of time-delayed opinion dynamics with complex interaction types	Lingling Yao et.al.	2501.12219	null
2025-01-21	RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression	Uri Gadot et.al.	2501.12216	null
2025-01-21	Experience-replay Innovative Dynamics	Tuo Zhang et.al.	2501.12199	null
2025-01-21	Opinion dynamics in bounded confidence models with manipulative agents: Moving the Overton window	A. Bautista et.al.	2501.12198	null
2025-01-21	BotDetect: A Decentralized Federated Learning Framework for Detecting Financial Bots on the EVM Blockchains	Ahmed Mounsf Rafik Bendada et.al.	2501.12112	null
2025-01-21	Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics	Somnath Hazra et.al.	2501.12061	link
2025-01-21	Growth model with externalities for energetic transition via MFG with common external variable	Pierre Lavigne et.al.	2501.11988	null
2025-01-21	Simultaneously decoding the unknown stationary state and function parameters for mean field games	Hongyu Liu et.al.	2501.11955	null
2025-01-21	GLAM: Global-Local Variation Awareness in Mamba-based World Model	Qian He et.al.	2501.11949	null
2025-01-21	Equilibria under Dynamic Benchmark Consistency in Non-Stationary Multi-Agent Systems	Ludovico Crippa et.al.	2501.11897	null
2025-01-21	Connection-Coordination Rapport (CCR) Scale: A Dual-Factor Scale to Measure Human-Robot Rapport	Ting-Han Lin et.al.	2501.11887	null
2025-01-21	Developing an Agent-Based Mathematical Model for Simulating Post-Irradiation Cellular Response: A Crucial Component of a Digital Twin Framework for Personalized Radiation Treatment	Ruirui Liu et.al.	2501.11875	null
2025-01-21	LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems	Venkata Sai Aswath Duvvuru et.al.	2501.11864	null
2025-01-21	EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents	Zhili Cheng et.al.	2501.11858	link
2025-01-17	Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems	Weibo Gao et.al.	2501.10332	null
2025-01-17	Towards Human-Guided, Data-Centric LLM Co-Pilots	Evgeny Saveliev et.al.	2501.10321	null
2025-01-17	Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling	Suvodip Dey et.al.	2501.10316	link
2025-01-17	Enhancing AI Transparency: XRL-Based Resource Management and RAN Slicing for 6G ORAN Architecture	Suvidha Mhatre et.al.	2501.10292	null
2025-01-17	Evidence for the gravity-driven and magnetically-regularized gas flows feeding the massive protostellar cluster in Cep A	Panigrahy Sandhyarani et.al.	2501.10280	null
2025-01-17	Grey-Box Fuzzing in Constrained Ultra-Large Systems: Lessons for SE Community	Jiazhao Yu et.al.	2501.10269	null
2025-01-17	Deployment of an Aerial Multi-agent System for Automated Task Execution in Large-scale Underground Mining Environments	Niklas Dahlquist et.al.	2501.10262	null
2025-01-17	Logarithmic Regret for Nonlinear Control	James Wang et.al.	2501.10261	null
2025-01-17	Secure Semantic Communication With Homomorphic Encryption	Rui Meng et.al.	2501.10182	null
2025-01-17	PaSa: An LLM Agent for Comprehensive Academic Paper Search	Yichen He et.al.	2501.10120	link
2025-01-17	GAWM: Global-Aware World Model for Multi-Agent Reinforcement Learning	Zifeng Shi et.al.	2501.10116	null
2025-01-17	Infrastructure for AI Agents	Alan Chan et.al.	2501.10114	null
2025-01-17	LLM Reasoner and Automated Planner: A new NPC approach	Israel Puerta-Merino et.al.	2501.10106	null
2025-01-17	Universal Actions for Enhanced Embodied Foundation Models	Jinliang Zheng et.al.	2501.10105	link
2025-01-17	A Survey on LLM Test-Time Compute via Search: Tasks, LLM Profiling, Search Algorithms, and Relevant Frameworks	Xinzhe Li et.al.	2501.10069	null
2025-01-17	Agent-as-Judge for Factual Summarization of Long Narratives	Yeonseok Jeong et.al.	2501.09993	link
2025-01-17	A Survey on Multi-Turn Interaction Capabilities of Large Language Models	Chen Zhang et.al.	2501.09959	null
2025-01-17	ForestProtector: An IoT Architecture Integrating Machine Vision and Deep Reinforcement Learning for Efficient Wildfire Monitoring	Kenneth Bonilla-Ormachea et.al.	2501.09926	null
2025-01-17	Towards A Litmus Test for Common Sense	Hugo Latapie et.al.	2501.09913	null
2025-01-17	Chatbot apologies: Beyond bullshit	P. D. Magnus et.al.	2501.09910	null
2025-01-16	CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education	Tianyu Wang et.al.	2501.09709	link
2025-01-16	The Goofus & Gallant Story Corpus for Practical Value Alignment	Md Sultan Al Nahian et.al.	2501.09707	null
2025-01-16	Authenticated Delegation and Authorized AI Agents	Tobin South et.al.	2501.09674	null
2025-01-16	NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes	Nathaniel S. Keplinger et.al.	2501.09646	link
2025-01-16	Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework	Yushen Lin et.al.	2501.09631	null
2025-01-16	A Multi-agent System for Hybrid Optimization	Eric S. Fraga et.al.	2501.09563	null
2025-01-16	Solving the unsolvable: Translating case law in Hong Kong	King-kui Sin et.al.	2501.09444	null
2025-01-16	ADAGE: A generic two-layer framework for adaptive agent based modelling	Benjamin Patrick Evans et.al.	2501.09429	null
2025-01-16	AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling	Ancheng Xu et.al.	2501.09426	null
2025-01-16	Agent-Based Simulation of a Perpetual Futures Market	Ramshreyas Rao et.al.	2501.09404	null
2025-01-16	The sleeping bacterium: shedding light on the resuscitation mechanism	Eleonora Alfinito et.al.	2501.09366	null
2025-01-16	YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks	Saptarashmi Bandyopadhyay et.al.	2501.09355	null
2025-01-16	ChartInsighter: An Approach for Mitigating Hallucination in Time-series Chart Summary Generation with A Benchmark Dataset	Fen Wang et.al.	2501.09349	link
2025-01-16	Solving Infinite-Player Games with Player-to-Strategy Networks	Carlos Martin et.al.	2501.09330	null
2025-01-16	On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression	Zichang Ge et.al.	2501.09327	link
2025-01-16	SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs	Anbang Ye et.al.	2501.09316	null
2025-01-16	Interoceptive Robots for Convergent Shared Control in Collaborative Construction Work	Xiaoshan Zhou et.al.	2501.09290	link
2025-01-16	Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks	Muhammad Ahmed Mohsin et.al.	2501.09212	link
2025-01-15	Embodied Scene Understanding for Vision Language Models via MetaVQA	Weizhen Wang et.al.	2501.09167	null
2025-01-15	AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning	Assaf Lahiany et.al.	2501.09160	null
2025-01-15	Personality Modeling for Persuasion of Misinformation using AI Agent	Qianmin Lou et.al.	2501.08985	null
2025-01-15	Physical AI Agents: Integrating Cognitive Intelligence with Real-World Action	Fouad Bousetouane et.al.	2501.08944	null
2025-01-15	A Reinforcement Learning Approach to Quiet and Safe UAM Traffic Management	Surya Murthy et.al.	2501.08941	null
2025-01-15	Disentangling Exploration of Large Language Models by Optimal Exploitation	Tim Grams et.al.	2501.08925	null
2025-01-15	Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning	Qinyu Ma et.al.	2501.08897	link
2025-01-15	Silent Abandonment in Text-Based Contact Centers: Identifying, Quantifying, and Mitigating its Operational Impacts	Antonio Castellanos et.al.	2501.08869	null
2025-01-15	The geometry of moral decision making	Roland M. Friedrich et.al.	2501.08865	null
2025-01-15	On the Dominance of Truth-Telling in Gradual Mechanisms	Wenqian Wang et.al.	2501.08802	null
2025-01-15	Networked Agents in the Dark: Team Value Learning under Partial Observability	Guilherme S. Varela et.al.	2501.08778	null
2025-01-15	Leveraging LLM Agents for Translating Network Configurations	Yunze Wei et.al.	2501.08760	null
2025-01-15	Efficient Shape Reconfiguration by Hybrid Programmable Matter	Jonas Friemel et.al.	2501.08663	null
2025-01-15	Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance	Raúl Arranz et.al.	2501.08655	null
2025-01-15	Towards Intelligent Active Particles	Hartmut Löwen et.al.	2501.08632	null
2025-01-15	Neural Risk-sensitive Satisficing in Contextual Bandits	Shogo Ito et.al.	2501.08612	null
2025-01-15	AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL	Tyler Stennett et.al.	2501.08600	null
2025-01-15	Effects of taxes, redistribution actions and fiscal evasion on wealth inequality: an agent-based model approach	Iago Nascimento Barros et.al.	2501.08573	null
2025-01-15	Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation	Jiaxin Guo et.al.	2501.08523	null
2025-01-15	Ensuring Truthfulness in Distributed Aggregative Optimization	Ziqin Chen et.al.	2501.08512	null
2025-01-14	Empathetic Conversational Agents: Utilizing Neural and Physiological Signals for Enhanced Empathetic Interactions	Nastaran Saffaryazdi et.al.	2501.08393	null
2025-01-14	ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations	Ziyuan Huang et.al.	2501.08324	null
2025-01-14	Using Gamified Experiments to Tame Complexity: the case of the Schelling Model of Segregation	Aleix Nicolás Olivé et.al.	2501.08280	null
2025-01-14	Addressing the sustainable AI trilemma: a case study on LLM agents and RAG	Hui Wu et.al.	2501.08262	null
2025-01-14	Engineering LLM Powered Multi-agent Framework for Autonomous CloudOps	Kannan Parthasarathy et.al.	2501.08243	null
2025-01-14	Dynamic Pricing in High-Speed Railways Using Multi-Agent Reinforcement Learning	Enrique Adrian Villarrubia-Martin et.al.	2501.08234	null
2025-01-14	ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems	Mohita Chowdhury et.al.	2501.08208	null
2025-01-14	An Elementary Microscopic Model of Sympatric Speciation	Franco Bagnoli et.al.	2501.08130	null
2025-01-14	Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving	Guizhe Jin et.al.	2501.08096	null
2025-01-14	AgentPose: Progressive Distribution Alignment via Feature Agent for Human Pose Distillation	Feng Zhang et.al.	2501.08088	null
2025-01-14	CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning	Guoliang He et.al.	2501.08071	link
2025-01-14	Hydrodynamics-driven phase-locking and collective motility of sessile active dumbbells	Urvi Mahendra Bora et.al.	2501.08065	null
2025-01-14	Cooperative Patrol Routing: Optimizing Urban Crime Surveillance through Multi-Agent Reinforcement Learning	Juan Palma-Borda et.al.	2501.08020	link
2025-01-14	Decentralized Learning with Approximate Finite-Time Consensus	Aaron Fainman et.al.	2501.07967	null
2025-01-14	Governing AI Agents	Noam Kolt et.al.	2501.07913	null
2025-01-14	Flow: A Modular Approach to Automated Agentic Workflow Generation	Boye Niu et.al.	2501.07834	null
2025-01-14	Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models	Dhruv Dhamani et.al.	2501.07815	null
2025-01-14	Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering	Feijie Wu et.al.	2501.07813	null
2025-01-14	CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation	Ruwei Pan et.al.	2501.07811	null
2025-01-14	Visual Language Models as Operator Agents in the Space Domain	Alejandro Carrasco et.al.	2501.07802	null
2025-01-13	CBS with Continuous-Time Revisit	Andy Li et.al.	2501.07744	null
2025-01-13	WebWalker: Benchmarking LLMs in Web Traversal	Jialong Wu et.al.	2501.07572	link
2025-01-13	SafeSwarm: Decentralized Safe RL for the Swarm of Drones Landing in Dense Crowds	Grik Tadevosyan et.al.	2501.07566	null
2025-01-13	SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing	Varun Biyyala et.al.	2501.07554	link
2025-01-13	Evaluating Agent-based Program Repair at Google	Pat Rondon et.al.	2501.07531	null
2025-01-13	Improving DeFi Accessibility through Efficient Liquidity Provisioning with Deep Reinforcement Learning	Haonan Xu et.al.	2501.07508	null
2025-01-13	How low-cost AI universal approximators reshape market efficiency	Paolo Barucca et.al.	2501.07489	null
2025-01-13	SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM)	Xiang Cheng et.al.	2501.07459	link
2025-01-13	Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI	Rolf Pfister et.al.	2501.07458	null
2025-01-13	Online inductive learning from answer sets for efficient reinforcement learning exploration	Celeste Veronese et.al.	2501.07445	null
2025-01-13	Attention when you need	Lokesh Boominathan et.al.	2501.07440	null
2025-01-13	Lifelong Learning of Large Language Model based Agents: A Roadmap	Junhao Zheng et.al.	2501.07278	link
2025-01-13	Multi-face emotion detection for effective Human-Robot Interaction	Mohamed Ala Yahyaoui et.al.	2501.07213	null
2025-01-13	Combined effect of incentives and coupling in multigames in two-layer networks	Luo-Luo Jiang et.al.	2501.07193	null
2025-01-13	TIMRL: A Novel Meta-Reinforcement Learning Framework for Non-Stationary and Multi-Task Environments	Chenyang Qi et.al.	2501.07146	null
2025-01-13	How GPT learns layer by layer	Jason Du et.al.	2501.07108	link
2025-01-13	PPO-Q: Proximal Policy Optimization with Parametrized Quantum Policies or Values	Yu-Xin Jin et.al.	2501.07085	null
2025-01-13	PoAct: Policy and Action Dual-Control Agent for Generalized Applications	Guozhi Yuan et.al.	2501.07054	null
2025-01-13	Differentially Private Kernelized Contextual Bandits	Nikola Pavlovic et.al.	2501.07046	null
2025-01-12	Learning Implicit Social Navigation Behavior using Deep Inverse Reinforcement Learning	Tribhi Kathuria et.al.	2501.06946	null
2025-01-12	AdaSlicing: Adaptive Online Network Slicing under Continual Network Dynamics in Open Radio Access Networks	Ming Zhao et.al.	2501.06943	null
2025-01-10	PEACE: Empowering Geologic Map Holistic Understanding with MLLMs	Yangyu Huang et.al.	2501.06184	null
2025-01-10	A Mixed-Integer Conic Program for the Multi-Agent Moving-Target Traveling Salesman Problem	Allen George Philip et.al.	2501.06130	null
2025-01-10	Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation	Guojun Xiong et.al.	2501.06103	null
2025-01-10	Learning Flexible Heterogeneous Coordination with Capability-Aware Shared Hypernetworks	Kevin Fu et.al.	2501.06058	link
2025-01-10	Investigating the Impact of Observation Space Design Choices On Training Reinforcement Learning Solutions for Spacecraft Problems	Nathaniel Hamilton et.al.	2501.06016	null
2025-01-10	Enhanced Acoustic Beamforming with Sub-Aperture Angular Multiply and Sum -- in vivo and in Human Demonstration	Matthieu Toulemonde et.al.	2501.05837	null
2025-01-10	CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech	Madhurananda Pahar et.al.	2501.05755	null
2025-01-10	Semantic Mapping in Indoor Embodied AI -- A Comprehensive Survey and Future Directions	Sonia Raychaudhuri et.al.	2501.05750	null
2025-01-10	How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond	Chen Huang et.al.	2501.05714	null
2025-01-10	Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains	Vighnesh Subramaniam et.al.	2501.05707	null
2025-01-10	A Two-timescale Primal-dual Algorithm for Decentralized Optimization with Compression	Haoming Liu et.al.	2501.05701	null
2025-01-10	Scaling Safe Multi-Agent Control for Signal Temporal Logic Specifications	Joe Eappen et.al.	2501.05639	link
2025-01-09	Towards Probabilistic Inference of Human Motor Intentions by Assistive Mobile Robots Controlled via a Brain-Computer Interface	Xiaoshan Zhou et.al.	2501.05610	null
2025-01-09	NSChat: A Chatbot System To Rule Them All	Zenon Lamprou et.al.	2501.05541	null
2025-01-09	OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?	Yifei Li et.al.	2501.05510	link
2025-01-09	Strategy Masking: A Method for Guardrails in Value-based Reinforcement Learning Agents	Jonathan Keane et.al.	2501.05501	null
2025-01-09	Search-o1: Agentic Search-Enhanced Large Reasoning Models	Xiaoxi Li et.al.	2501.05366	link
2025-01-09	Control of Overpopulated Tails in Kinetic Epidemic Models	Mattia Zanella et.al.	2501.05365	null
2025-01-09	A Path Variant of the Explorer Director Game on Graphs	Abigail Raz et.al.	2501.05364	null
2025-01-09	On Corrigibility and Alignment in Multi Agent Games	Edmund Dable-Heath et.al.	2501.05360	null
2025-01-09	A learning agent-based approach to the characterization of open quantum systems	Lorenzo Fioroni et.al.	2501.05350	null
2025-01-09	The Bakers and Millers Game with Restricted Locations	Simon Krogmann et.al.	2501.05334	null
2025-01-09	Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning	Dmytro Kuzmenko et.al.	2501.05329	null
2025-01-09	Contrast-Free Myocardial Scar Segmentation in Cine MRI using Motion and Texture Fusion	Guang Yang et.al.	2501.05241	null
2025-01-09	CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness	Shoucheng Song et.al.	2501.05207	null
2025-01-09	Emergence of human-like polarization among large language model agents	Jinghua Piao et.al.	2501.05171	null
2025-01-09	Constrained Optimization of Charged Particle Tracking with Multi-Agent Reinforcement Learning	Tobias Kortus et.al.	2501.05113	null
2025-01-09	LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models	Zengqi Peng et.al.	2501.05057	null
2025-01-09	ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark	Ronghao Dang et.al.	2501.05031	link
2025-01-09	CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving	Bhargava Uppuluri et.al.	2501.04982	null
2025-01-08	RadGPT: Constructing 3D Image-Text Tumor Datasets	Pedro R. A. S. Bassi et.al.	2501.04678	link
2025-01-08	InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection	Yuhang Liu et.al.	2501.04575	link
2025-01-08	The importance of being discrete -- An agent-based model for active nematics and more	Mathieu Dedenon et.al.	2501.04559	null
2025-01-08	Approximately EFX and PO Allocations for Bivalued Chores	Zehan Lin et.al.	2501.04550	null
2025-01-08	Cyber-Physical Steganography in Robotic Motion Control	Ching-Chun Chang et.al.	2501.04541	null
2025-01-08	Safe Reinforcement Learning with Minimal Supervision	Alexander Quessy et.al.	2501.04481	null
2025-01-08	Hybrid Artificial Intelligence Strategies for Drone Navigation	Rubén San-Segundo et.al.	2501.04472	null
2025-01-08	A Digital Shadow for Modeling, Studying and Preventing Urban Crime	Juan Palma-Borda et.al.	2501.04435	null
2025-01-08	User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation	Krisztian Balog et.al.	2501.04410	null
2025-01-08	Agent Laboratory: Using LLM Agents as Research Assistants	Samuel Schmidgall et.al.	2501.04227	null
2025-01-08	Unattainability of Common Knowledge in Asymmetric Games with Imperfect Information	Fabian Farestam et.al.	2501.04199	null
2025-01-07	HIVEX: A High-Impact Environment Suite for Multi-Agent Research (extended version)	Philipp D. Siedler et.al.	2501.04180	null
2025-01-07	Collaborative Spacecraft Servicing under Partial Feedback using Lyapunov-based Deep Neural Networks	Cristian F. Nino et.al.	2501.04160	null
2025-01-07	Implementing Systemic Thinking for Automatic Schema Matching: An Agent-Based Modeling Approach	Hicham Assoudi et.al.	2501.04136	null
2025-01-07	Kinetic theory of decentralized learning for smart active matter	Gerhard Jung et.al.	2501.03948	null
2025-01-07	Implicit Coordination using Active Epistemic Inference	Lauren Bramblett et.al.	2501.03907	null
2025-01-07	Truthful mechanisms for linear bandit games with private contexts	Yiting Hu et.al.	2501.03865	null
2025-01-07	Rendezfood: A Design Case Study of a Conversational Location-based Approach in Restaurants	Philip Weber et.al.	2501.03862	null
2025-01-07	Run-and-tumble chemotaxis using reinforcement learning	Ramesh Pramanik et.al.	2501.03687	null
2025-01-07	The Textbook of Tomorrow: Rethinking Course Material Interfacing in the Era of GPT	Audrey Olson et.al.	2501.03618	null
2025-01-07	Distributed Observer for Descriptor Linear System: The Luenberger Observer Method	Shuai Liu et.al.	2501.03564	null
2025-01-07	Rethinking Adversarial Attacks in Reinforcement Learning from Policy Distribution Perspective	Tianyang Duan et.al.	2501.03562	null
2025-01-07	FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis	Xiaojiao Xiao et.al.	2501.03526	link
2025-01-07	A Unified Attack Detection Strategy for Multi-Agent Systems over Transient and Steady Stages	Jinming Gao et.al.	2501.03496	null
2025-01-06	Designing Telepresence Robots to Support Place Attachment	Yaxin Hu et.al.	2501.03420	null
2025-01-06	ScaleMAI: Accelerating the Development of Trusted Datasets and AI Models	Wenxuan Li et.al.	2501.03410	link
2025-01-06	Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation	Yuhui Zhang et.al.	2501.03225	link
2025-01-06	Turn-based Multi-Agent Reinforcement Learning Model Checking	Dennis Gross et.al.	2501.03187	null
2025-01-06	Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning	Muyun Li et.al.	2501.03162	null
2025-01-06	Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches	Alhassan Mumuni et.al.	2501.03151	null
2025-01-06	Probably Correct Optimal Stable Matching for Two-Sided Markets Under Uncertainty	Andreas Athanasopoulos et.al.	2501.03018	link
2025-01-06	Approximating N-Player Nash Equilibrium through Gradient Descent	Dongge Wang et.al.	2501.03001	null
2025-01-06	CALM: Curiosity-Driven Auditing for Large Language Models	Xiang Zheng et.al.	2501.02997	link
2025-01-06	CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems	Chuanbo Hua et.al.	2501.02977	link
2025-01-06	Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis Perspective	Chuxiong Sun et.al.	2501.02888	null
2025-01-06	A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation	Toomas Tahves et.al.	2501.02858	null
2025-01-06	Proteomic Learning of Gamma-Aminobutyric Acid (GABA) Receptor-Mediated Anesthesia	Jian Jiang et.al.	2501.02824	link
2025-01-06	Enhancing Lifelong Multi-Agent Path Finding with Cache Mechanism	Yimin Tang et.al.	2501.02803	null
2025-01-06	Gaming on Coincident Peak Shaving: Equilibrium and Strategic Behavior	Liudong Chen et.al.	2501.02792	null
2025-01-06	Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes	Zijian Wang et.al.	2501.02774	null
2025-01-06	Multi-Agent Path Finding under Limited Communication Range Constraint via Dynamic Leading	Hoang-Dung Bui et.al.	2501.02770	null
2025-01-06	Tree-based RAG-Agent Recommendation System: A Case Study in Medical Test Data	Yahe Yang et.al.	2501.02727	null
2025-01-05	A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model	Shivaram Kalyanakrishnan et.al.	2501.02652	null
2025-01-05	Slow modulation of the contraction patterns in Physarum polycephalum	Raphael Saiseau et.al.	2501.02651	null
2025-01-05	LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment	Yifei Liu et.al.	2501.02621	null
2025-01-05	Back to Base: Towards Hands-Off Learning via Safe Resets with Reach-Avoid Safety Filters	Azra Begzadić et.al.	2501.02620	null
2025-01-03	QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture	Shvetank Prakash et.al.	2501.01892	null
2025-01-03	Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification	Xiangxiang Dai et.al.	2501.01849	link
2025-01-03	MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning	Pu Yang et.al.	2501.01834	null
2025-01-03	SDPO: Segment-Level Direct Preference Optimization for Social Agents	Aobo Kong et.al.	2501.01821	link
2025-01-03	Distributed Framework Construction for Affine Formation Control	Huiming Li et.al.	2501.01817	null
2025-01-03	Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery	Baoru Huang et.al.	2501.01752	null
2025-01-03	Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning	Gavin B. Rens et.al.	2501.01727	null
2025-01-03	AgentRefine: Enhancing Agent Generalization through Refinement Tuning	Dayuan Fu et.al.	2501.01702	null
2025-01-03	The (Exact) Price of Cardinality for Indivisible Goods: A Parametric Perspective	Alexander Lam et.al.	2501.01660	null
2025-01-03	PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents	Jingoo Lee et.al.	2501.01594	null
2025-01-03	BLAST: A Stealthy Backdoor Leverage Attack against Cooperative Multi-Agent Deep Reinforcement Learning based Systems	Yinbo Yu et.al.	2501.01593	null
2025-01-02	Reinforcement-learning-based control of turbulent channel flows at high Reynolds numbers	Zisong Zhou et.al.	2501.01573	null
2025-01-02	BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery	Kanishk Gandhi et.al.	2501.01540	link
2025-01-02	In Search of a Lost Metric: Human Empowerment as a Pillar of Socially Conscious Navigation	Vasanth Reddy Baddam et.al.	2501.01539	null
2025-01-02	Optimal Strategy Revision in Population Games: A Mean Field Game Theory Perspective	Julian Barreiro-Gomez et.al.	2501.01389	null
2025-01-02	PIMAEX: Multi-Agent Exploration through Peer Incentivization	Michael Kölle et.al.	2501.01266	null
2025-01-02	Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants	Lixiong Qin et.al.	2501.01243	null
2025-01-02	From Interaction to Attitude: Exploring the Impact of Human-AI Cooperation on Mental Illness Stigma	Tianqi Song et.al.	2501.01220	null
2025-01-02	D-HAT: a Diatom-inspired structure for a Helmet concept Against Trauma	Ludovico Musenich et.al.	2501.01211	null
2025-01-02	Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects	Abdullah Mushtaq et.al.	2501.01205	null
2025-01-02	3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer	Jiajun Deng et.al.	2501.01163	null
2025-01-02	A3: Android Agent Arena for Mobile GUI Agents	Yuxiang Chai et.al.	2501.01149	null
2025-01-02	Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method	Ruichen Zhang et.al.	2501.01141	null
2025-01-02	Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning	Min Whoo Lee et.al.	2501.01140	null
2025-01-02	Symmetries-enhanced Multi-Agent Reinforcement Learning	Nikolaos Bousias et.al.	2501.01136	null
2025-01-02	Regularized Proportional Fairness Mechanism for Resource Allocation Without Money	Sihan Zeng et.al.	2501.01111	null
2025-01-02	MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model	Chengze Zhang et.al.	2501.01014	null
2025-01-02	Cyber-physical Defense for Heterogeneous Multi-agent Systems Against Exponentially Unbounded Attacks on Signed Digraphs	Yichao Wang et.al.	2501.00990	null
2025-01-02	Bootstrapped Reward Shaping	Jacob Adamczyk et.al.	2501.00989	null
2025-01-01	Non-obvious Manipulability in Hedonic Games with Friends Appreciation Preferences	Michele Flammini et.al.	2501.00976	null
2025-01-01	Defense Strategies for Autonomous Multi-agent Systems: Ensuring Safety and Resilience Under Exponentially Unbounded FDI Attacks	Yichao Wang et.al.	2501.00973	null
2025-01-01	Intent-based Radio Scheduler for RAN Slicing: Learning to deal with different network scenarios	Cleverson Nahum et.al.	2501.00950	link
2025-01-01	Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things	Talha Zeeshan et.al.	2501.00906	null
2025-01-01	Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents	Fouad Bousetouane et.al.	2501.00881	null
2024-12-30	Distributed Mixture-of-Agents for Edge Inference with Large Language Models	Purbesh Mitra et.al.	2412.21200	link
2024-12-30	Aviary: training language agents on challenging scientific tasks	Siddharth Narayanan et.al.	2412.21154	null
2024-12-30	Training Software Engineering Agents and Verifiers with SWE-Gym	Jiayi Pan et.al.	2412.21139	link
2024-12-30	Positional information trade-offs in boundary-driven reaction-diffusion systems	Jonas Berx et.al.	2412.21113	null
2024-12-30	Exploring and Controlling Diversity in LLM-Agent Conversation	KuanChao Chu et.al.	2412.21102	null
2024-12-30	Advances in Multi-agent Reinforcement Learning: Persistent Autonomy and Robot Learning Lab Report 2024	Reza Azadeh et.al.	2412.21088	null
2024-12-30	Privacy-Aware Multi-Device Cooperative Edge Inference with Distributed Resource Bidding	Wenhao Zhuang et.al.	2412.21069	null
2024-12-30	Plancraft: an evaluation dataset for planning with LLM agents	Gautier Dagan et.al.	2412.21033	link
2024-12-30	UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI	Fangwei Zhong et.al.	2412.20977	null
2024-12-31	SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity	Pengfei Jing et.al.	2412.20787	null
2024-12-30	Joint Scoring Rules: Zero-Sum Competition Avoids Performative Prediction	Rubi Hudson et.al.	2412.20732	null
2024-12-30	Modeling and Simulating Agent-Based City Migration Using Conway's Game of Life	Bruce Deng et.al.	2412.20691	null
2024-12-30	Blockchain-Empowered Cyber-Secure Federated Learning for Trustworthy Edge Computing	Ervin Moore et.al.	2412.20674	null
2024-12-29	The intrinsic motivation of reinforcement and imitation learning for sequential tasks	Sao Mai Nguyen et.al.	2412.20573	null
2024-12-29	Game Theory and Multi-Agent Reinforcement Learning : From Nash Equilibria to Evolutionary Dynamics	Neil De La Fuente et.al.	2412.20523	null
2024-12-29	Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning	Hang Ni et.al.	2412.20505	null
2024-12-29	Exploiting NOMA Transmissions in Multi-UAV-assisted Wireless Networks: From Aerial-RIS to Mode-switching UAVs	Songhan Zhao et.al.	2412.20484	null
2024-12-29	SatFlow: Scalable Network Planning for LEO Mega-Constellations	Sheng Cen et.al.	2412.20475	null
2024-12-29	Image Augmentation Agent for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.20439	null
2024-12-29	Learning Policies for Dynamic Coalition Formation in Multi-Robot Task Allocation	Lucas C. D. Bezerra et.al.	2412.20397	null
2024-12-27	Bottom-up robust modeling for the foraging behavior of Physarum polycephalum	Damiano Reginato et.al.	2412.19790	null
2024-12-27	Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration	Le Chen et.al.	2412.19770	link
2024-12-27	Can Large Language Models Adapt to Other Agents In-Context?	Matthew Riemer et.al.	2412.19726	null
2024-12-27	OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis	Qiushi Sun et.al.	2412.19723	null
2024-12-27	The Value of Recall in Extensive-Form Games	Ratip Emin Berker et.al.	2412.19659	null
2024-12-27	Xmodel-2 Technical Report	Wang Qun et.al.	2412.19638	null
2024-12-27	Bidding Games on Markov Decision Processes with Quantitative Reachability Objectives	Guy Avni et.al.	2412.19609	null
2024-12-27	Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following	Yuxiao Yang et.al.	2412.19562	null
2024-12-27	Quantiles under ambiguity and risk sharing	Peng Liu et.al.	2412.19546	null
2024-12-27	TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data	Xiang Huang et.al.	2412.19544	link
2024-12-27	Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning	Xuan Zhou et.al.	2412.19538	null
2024-12-27	Casevo: A Cognitive Agents and Social Evolution Simulator	Zexun Jiang et.al.	2412.19498	link
2024-12-27	Knowledge Graph-Based Multi-Agent Path Planning in Dynamic Environments using WAITR	Ted Edward Holmberg et.al.	2412.19469	null
2024-12-27	Online distributed algorithms for mixed equilibrium problems in dynamic environments	Hang Xu et.al.	2412.19399	null
2024-12-26	Preventive Energy Management for Distribution Systems Under Uncertain Events: A Deep Reinforcement Learning Approach	Md Isfakul Anam et.al.	2412.19382	null
2024-12-26	Minimal Batch Adaptive Learning Policy Engine for Real-Time Mid-Price Forecasting in High-Frequency Trading	Adamantios Ntakaris et.al.	2412.19372	null
2024-12-26	xSRL: Safety-Aware Explainable Reinforcement Learning -- Safety as a Product of Explainability	Risal Shahriar Shefin et.al.	2412.19311	link
2024-12-26	Reforming an Unfair Allocation by Exchanging Goods	Sheung Man Yuen et.al.	2412.19264	null
2024-12-26	Swarm Contract: A Multi-Sovereign Agent Consensus Mechanism	Haowei Yang et.al.	2412.19256	null
2024-12-26	VINEVI: A Virtualized Network Vision Architecture for Smart Monitoring of Heterogeneous Applications and Infrastructures	Rodrigo Moreira et.al.	2412.19226	null
2024-12-24	Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems	Fernando Jia et.al.	2412.18601	link
2024-12-24	Automated Code Review In Practice	Umut Cihan et.al.	2412.18531	null
2024-12-24	Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving	Hao Pang et.al.	2412.18511	null
2024-12-24	Calibrating the Subjective	Mark Whitmeyer et.al.	2412.18486	null
2024-12-24	Multi-Agent Norm Perception and Induction in Distributed Healthcare	Chao Li et.al.	2412.18454	null
2024-12-24	3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding	Tatiana Zemskova et.al.	2412.18450	link
2024-12-24	GeAR: Graph-enhanced Agent for Retrieval-augmented Generation	Zhili Shen et.al.	2412.18431	null
2024-12-24	Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent	Farhad Nooralahzadeh et.al.	2412.18428	link
2024-12-24	GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent	Kangjia Zhao et.al.	2412.18426	null
2024-12-24	Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles	Zihan Wang et.al.	2412.18416	null
2024-12-24	Contrastive Representation for Interactive Recommendation	Jingyu Li et.al.	2412.18396	link
2024-12-24	Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents	Kaiwen Ning et.al.	2412.18371	link
2024-12-24	Extracting triples from dialogues for conversational social agents	Piek Vossen et.al.	2412.18364	null
2024-12-24	The Thousand Brains Project: A New Paradigm for Sensorimotor Intelligence	Viviane Clay et.al.	2412.18354	link
2024-12-24	Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering	Zhongjian Hu et.al.	2412.18351	null
2024-12-24	The Constitutional Filter	Simon Kohaut et.al.	2412.18347	link
2024-12-24	Learning to Play Against Unknown Opponents	Eshwar Ram Arunachaleswaran et.al.	2412.18297	null
2024-12-24	MinsStudio: A Streamlined Package for Minecraft AI Agent Development	Shaofei Cai et.al.	2412.18293	link
2024-12-24	Quantum framework for Reinforcement Learning: integrating Markov Decision Process, quantum arithmetic, and trajectory search	Thet Htar Su et.al.	2412.18208	null
2024-12-24	VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks	Shiduo Zhang et.al.	2412.18194	null
2024-12-23	Observation Interference in Partially Observable Assistance Games	Scott Emmons et.al.	2412.17797	null
2024-12-23	ResearchTown: Simulator of Human Research Community	Haofei Yu et.al.	2412.17767	link
2024-12-23	Sensitivity Curve Maximization: Attacking Robust Aggregators in Distributed Learning	Christian A. Schroth et.al.	2412.17740	null
2024-12-23	Robin Hood Reachability Bidding Games	Shaull Almagor et.al.	2412.17718	null
2024-12-23	SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC	Yue Deng et.al.	2412.17707	link
2024-12-23	Large Language Model Safety: A Holistic Survey	Dan Shi et.al.	2412.17686	link
2024-12-23	Shape and Performance of Fastest Paths over Networks with Interacting Selfish Agents	Marco Cogoni et.al.	2412.17665	null
2024-12-23	CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction	Yuanyuan Gao et.al.	2412.17612	null
2024-12-23	Fluid-Derived Lattices for Unbiased Modeling of Bacterial Colony Growth	Bryan Verhoef et.al.	2412.17604	null
2024-12-23	PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World	Yanheng He et.al.	2412.17589	null
2024-12-23	Complete aging in the noisy voter model enhances consensus formation	Jaume Llabrés et.al.	2412.17569	null
2024-12-23	DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought	Jiaan Wang et.al.	2412.17498	link
2024-12-23	A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers	Shuaihang Chen et.al.	2412.17481	link
2024-12-23	Should public health policy exempt cases with low viral load from isolation during an epidemic?: a modelling study	Jiahao Diao et.al.	2412.17428	null
2024-12-23	Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets	Akane Tsuboya et.al.	2412.17344	null
2024-12-23	Multimodal Deep Reinforcement Learning for Portfolio Optimization	Sumit Nawathe et.al.	2412.17293	null
2024-12-23	Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples	Taewoong Kim et.al.	2412.17288	link
2024-12-23	LegalAgentBench: Evaluating LLM Agents in Legal Domain	Haitao Li et.al.	2412.17259	link
2024-12-23	A Coalition Game for On-demand Multi-modal 3D Automated Delivery System	Farzan Moosavi et.al.	2412.17252	null
2024-12-22	A Multi-AI Agent System for Autonomous Optimization of Agentic AI Solutions via Iterative Refinement and LLM-Driven Feedback Loops	Kamer Ali Yuksel et.al.	2412.17149	null
2024-12-20	Offline Reinforcement Learning for LLM Multi-Step Reasoning	Huaijie Wang et.al.	2412.16145	link
2024-12-20	Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information	Dirk Bergemann et.al.	2412.16132	null
2024-12-20	Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG	Hasan Md Tusfiqur Alam et.al.	2412.16086	link
2024-12-20	Active Flow Control for Bluff Body under High Reynolds Number Turbulent Flow Conditions Using Deep Reinforcement Learning	Jingbo Chen et.al.	2412.15975	null
2024-12-20	The multilayer garbage disposal game	Hsin-Lun Li et.al.	2412.15942	null
2024-12-20	Speedup Techniques for Switchable Temporal Plan Graph Optimization	He Jiang et.al.	2412.15908	null
2024-12-20	Exploring the Effects of AI Nonverbal Emotional Cues on Human Decision Certainty in Moral Dilemmas	Chenyi Zhang et.al.	2412.15834	null
2024-12-20	WebLLM: A High-Performance In-Browser LLM Inference Engine	Charlie F. Ruan et.al.	2412.15803	link
2024-12-20	FTISS Adaptive Bearing-Only Formation Tracking Control with Unknown Disturbance Rejection	Hong Liang Cheah et.al.	2412.15757	null
2024-12-20	Online Optimization Algorithms in Repeated Price Competition: Equilibrium Learning and Algorithmic Collusion	Martin Bichler et.al.	2412.15707	null
2024-12-20	Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration	Yijia Shao et.al.	2412.15701	link
2024-12-20	AIR: Unifying Individual and Cooperative Exploration in Collective Multi-Agent Reinforcement Learning	Guangchong Zhou et.al.	2412.15700	link
2024-12-20	Asynchronous Vector Consensus over Matrix-Weighted Networks	P Raghavendra Rao et.al.	2412.15681	null
2024-12-20	Learning Group Interactions and Semantic Intentions for Multi-Object Trajectory Prediction	Mengshi Qi et.al.	2412.15673	link
2024-12-20	Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline	Guancheng Zeng et.al.	2412.15660	null
2024-12-20	Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement Learning	Lunjun Liu et.al.	2412.15639	null
2024-12-20	Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning	Chen Jianming et.al.	2412.15619	null
2024-12-20	Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage	Zhi Gao et.al.	2412.15606	null
2024-12-20	NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization	Danial Kamali et.al.	2412.15588	link
2024-12-20	Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems	Joshua Holder et.al.	2412.15573	link
2024-12-19	AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving	Shuo Xing et.al.	2412.15206	link
2024-12-19	Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration	Junjia Liu et.al.	2412.15166	null
2024-12-19	Operationalising Rawlsian Ethics for Fairness in Norm-Learning Agents	Jessica Woodgate et.al.	2412.15163	null
2024-12-19	Equal Merit Does Not Imply Equality: Discrimination at Equilibrium in a Hiring Market with Symmetric Agents	Serafina Kamp et.al.	2412.15162	null
2024-12-19	Probabilistic Strategy Logic with Degrees of Observability	Chunyan Mu et.al.	2412.15135	null
2024-12-19	From Nonequilibrium to Equilibrium: Insights from a Two-Population Occupation Model	Jerome Garnier-Brun et.al.	2412.14996	null
2024-12-19	Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination	Leonardo Barcellona et.al.	2412.14957	null
2024-12-19	Long Time Behavior and Stabilization for Displacement Monotone Mean Field Games	Marco Cirant et.al.	2412.14903	null
2024-12-19	Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning	Anthony Kobanda et.al.	2412.14865	null
2024-12-19	Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning	Mohammadreza nakhaei et.al.	2412.14834	link
2024-12-19	Fair Division with Social Impact	Michele Flammini et.al.	2412.14818	null
2024-12-19	Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning	Ziang Ye et.al.	2412.14780	null
2024-12-19	Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning	Aditya Kapoor et.al.	2412.14779	null
2024-12-19	Testing linearity of spatial interaction functions à la Ramsey	Abhimanyu Gupta et.al.	2412.14778	null
2024-12-19	PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children	Yiqun Zhang et.al.	2412.14769	link
2024-12-19	Active Inference and Human--Computer Interaction	Roderick Murray-Smith et.al.	2412.14741	null
2024-12-19	On Verbalized Confidence Scores for LLMs	Daniel Yang et.al.	2412.14737	link
2024-12-19	Bel Esprit: Multi-Agent Framework for Building AI Model Pipelines	Yunsu Kim et.al.	2412.14684	null
2024-12-19	A Model-free Biomimetics Algorithm for Deterministic Partially Observable Markov Decision Process	Yide Yu et.al.	2412.14614	null
2024-12-19	Computational Sociology of Humans and Machines; Conflict and Collaboration	Taha Yasseri et.al.	2412.14606	null
2024-12-18	TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks	Frank F. Xu et.al.	2412.14161	link
2024-12-18	Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report	Markus Dablander et.al.	2412.14085	null
2024-12-18	A Computationally Grounded Framework for Cognitive Attitudes (extended version)	Tiago de Lima et.al.	2412.14073	null
2024-12-18	Spatio-Temporal SIR Model of Pandemic Spread During Warfare with Optimal Dual-use Healthcare System Administration using Deep Reinforcement Learning	Adi Shuchami et.al.	2412.14039	link
2024-12-18	Decentralized Convergence to Equilibrium Prices in Trading Networks	Edwin Lock et.al.	2412.13972	null
2024-12-18	Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves	Martin Kurečka et.al.	2412.13962	null
2024-12-18	Harvesting energy from turbulent winds with Reinforcement Learning	Lorenzo Basile et.al.	2412.13961	null
2024-12-18	Towards privacy-preserving cooperative control via encrypted distributed optimization	Philipp Binfet et.al.	2412.13953	null
2024-12-18	Strategyproof Matching of Roommates and Rooms	Hadi Hosseini et.al.	2412.13887	null
2024-12-18	Who Saves us From Risk? Altruists Promote Cooperation in a Public Investment Game	Shen Zhang et.al.	2412.13816	null
2024-12-18	CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers?	Dimitrios Mallis et.al.	2412.13810	null
2024-12-18	Meta-Reflection: A Feedback-Free Reflection Learning Framework	Yaoke Wang et.al.	2412.13781	null
2024-12-18	Heuristic Planner for Communication-Constrained Multi-Agent Multi-Goal Path Planning	Jáchym Herynek et.al.	2412.13719	null
2024-12-18	A2H: A UI Converter from Android to HarmonyOS Platform	Chen Wang et.al.	2412.13693	link
2024-12-18	A hybrid learning agent for episodic learning tasks with unknown target distance	Oliver Sefrin et.al.	2412.13686	null
2024-12-18	ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning	Jie-Jing Shao et.al.	2412.13682	null
2024-12-18	Exploring Multi-Modal Integration with Tool-Augmented LLM Agents for Precise Causal Discovery	ChengAo Shen et.al.	2412.13667	null
2024-12-18	Large Language Model Federated Learning with Blockchain and Unlearning for Cross-Organizational Collaboration	Xuhan Zuo et.al.	2412.13551	null
2024-12-18	EscapeBench: Pushing Language Models to Think Outside the Box	Cheng Qian et.al.	2412.13549	link
2024-12-18	Models for common knowledge logic	Yoshihito Tanaka et.al.	2412.13537	null
2024-12-17	Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents	Yifei Zhou et.al.	2412.13194	null
2024-12-17	GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding	Haoyi Jiang et.al.	2412.13193	link
2024-12-17	SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents	Sheng Yin et.al.	2412.13178	link
2024-12-17	Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs -- A Graph Sequential Embedding Method	Jiate Li et.al.	2412.13134	link
2024-12-17	Contract-based Design and Verification of Multi-Agent Systems with Quantitative Temporal Requirements	Rafael Dewes et.al.	2412.13114	null
2024-12-17	Active Reinforcement Learning Strategies for Offline Policy Improvement	Ambedkar Dukkipati et.al.	2412.13106	null
2024-12-17	AI PERSONA: Towards Life-long Personalization of LLMs	Tiannan Wang et.al.	2412.13103	null
2024-12-17	Reservoir Computing for Fast, Simplified Reinforcement Learning on Memory Tasks	Kevin McKee et.al.	2412.13093	null
2024-12-17	Distributed Normal Map-based Stochastic Proximal Gradient Methods over Networks	Kun Huang et.al.	2412.13054	null
2024-12-17	NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation	Karan Wanchoo et.al.	2412.13026	null
2024-12-17	The Emergence of Strategic Reasoning of Large Language Models	Dongwoo Lee et.al.	2412.13013	null
2024-12-17	Adaptations of AI models for querying the LandMatrix database in natural language	Fatiha Ait Kbir et.al.	2412.12961	link
2024-12-17	4DRGS: 4D Radiative Gaussian Splatting for Efficient 3D Vessel Reconstruction from Sparse-View Dynamic DSA Images	Zhentao Liu et.al.	2412.12919	link
2024-12-17	An Agentic Approach to Automatic Creation of P&ID Diagrams from Natural Language Descriptions	Shreeyash Gowaikar et.al.	2412.12898	null
2024-12-17	Bayesian Persuasion with Externalities: Exploiting Agent Types	Jonathan Shaki et.al.	2412.12859	null
2024-12-17	From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle	Kaustubh Vyas et.al.	2412.12839	null
2024-12-17	GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models	Mukai Li et.al.	2412.12735	link
2024-12-17	Enhancing Naturalness in LLM-Generated Utterances through Disfluency Insertion	Syed Zohaib Hassan et.al.	2412.12710	null
2024-12-17	ParMod: A Parallel and Modular Framework for Learning Non-Markovian Tasks	Ruixuan Miao et.al.	2412.12700	null
2024-12-17	Everyday AR through AI-in-the-Loop	Ryo Suzuki et.al.	2412.12681	null
2024-12-16	Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives	Marius Belly et.al.	2412.12063	link
2024-12-16	Virtual Agent-Based Communication Skills Training to Facilitate Health Persuasion Among Peers	Farnaz Nouraei et.al.	2412.12061	null
2024-12-16	Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps	Linfeng Zhao et.al.	2412.12024	null
2024-12-16	Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm	Rajat Khanda et.al.	2412.12006	null
2024-12-16	CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird's Eye View Perception	Senkang Hu et.al.	2412.12000	null
2024-12-16	AlphaZero Neural Scaling and Zipf's Law: a Tale of Board Games and Power Laws	Oren Neumann et.al.	2412.11979	link
2024-12-16	Learning Human-Aware Robot Policies for Adaptive Assistance	Jason Qin et.al.	2412.11913	null
2024-12-16	Reentrant phase behavior in binary topological flocks with nonreciprocal alignment	Tian Tang et.al.	2412.11871	null
2024-12-16	The Black Ninjas and the Sniper: On Robustness of Population Protocols	Benno Lossin et.al.	2412.11783	null
2024-12-16	Prediction of social dilemmas in networked populations via graph neural networks	Huaiyu Tan et.al.	2412.11775	null
2024-12-16	Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control	Timothée Anne et.al.	2412.11761	null
2024-12-16	Common Ground, Diverse Roots: The Difficulty of Classifying Common Examples in Spanish Varieties	Javier A. Lopetegui et.al.	2412.11750	null
2024-12-16	GHIssuemarket: A Sandbox Environment for SWE-Agents Economic Experimentation	Mohamed A. Fouad et.al.	2412.11722	link
2024-12-16	Learning UAV-based path planning for efficient localization of objects using prior knowledge	Rick van Essen et.al.	2412.11717	link
2024-12-16	LLMs Can Simulate Standardized Patients via Agent Coevolution	Zhuoyun Du et.al.	2412.11716	link
2024-12-16	Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework	Xuanming Zhang et.al.	2412.11713	null
2024-12-16	Loosely Synchronized Rule-Based Planning for Multi-Agent Path Finding with Asynchronous Actions	Shuai Zhou et.al.	2412.11678	link
2024-12-16	VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting	Muhammet Furkan Ilaslan et.al.	2412.11621	link
2024-12-16	VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis	Zhipeng Chen et.al.	2412.11594	link
2024-12-16	Embodied CoT Distillation From LLM To Off-the-shelf Agents	Wonje Choi et.al.	2412.11499	null
2024-12-13	Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining	Zhiqi Ge et.al.	2412.10342	null
2024-12-13	Reciprocity in Interbank Markets	Lutz Honvehlmann et.al.	2412.10329	null
2024-12-13	MeshA: Efficient Path Planing With Motion Primitives*	Marat Agranovskiy et.al.	2412.10320	null
2024-12-13	BrushEdit: All-In-One Image Inpainting and Editing	Yaowei Li et.al.	2412.10316	null
2024-12-13	Cultural Evolution of Cooperation among LLM Agents	Aron Vallinder et.al.	2412.10270	null
2024-12-13	ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL	Yang Qin et.al.	2412.10138	link
2024-12-13	You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects	Islem Bouzenia et.al.	2412.10133	null
2024-12-13	Reward Machine Inference for Robotic Manipulation	Mattijs Baert et.al.	2412.10096	null
2024-12-13	Heterogeneous Multi-Robot Graph Coverage with Proximity and Movement Constraints	Dolev Mutzari et.al.	2412.10083	null
2024-12-13	Large Action Models: From Inception to Implementation	Lu Wang et.al.	2412.10047	link
2024-12-13	Cooperative Target Defense under Communication and Sensing Constraints	Dipankar Maity et.al.	2412.09939	null
2024-12-13	CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models	Dongyu Yao et.al.	2412.09936	link
2024-12-13	ProxyLLM : LLM-Driven Framework for Customer Support Through Text-Style Transfer	Sehyeong Jo et.al.	2412.09916	link
2024-12-13	Optimized Coordination Strategy for Multi-Aerospace Systems in Pick-and-Place Tasks By Deep Neural Network	Ye Zhang et.al.	2412.09877	null
2024-12-13	AutoPatent: A Multi-Agent Framework for Automatic Patent Generation	Qiyao Wang et.al.	2412.09796	link
2024-12-13	Learning Visually Grounded Domain Ontologies via Embodied Conversation and Explanation	Jonghyuk Park et.al.	2412.09770	link
2024-12-12	AiEDA: Agentic AI Design Framework for Digital ASIC System Design	Aditya Patra et.al.	2412.09745	null
2024-12-12	MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction	Xiaohao Xu et.al.	2412.09723	link
2024-12-12	TransferLight: Zero-Shot Traffic Signal Control on any Road-Network	Johann Schmidt et.al.	2412.09719	null
2024-12-12	CUAL: Continual Uncertainty-aware Active Learner	Amanda Rios et.al.	2412.09701	null
2024-12-12	GenEx: Generating an Explorable World	Taiming Lu et.al.	2412.09624	null
2024-12-12	AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials	Yiheng Xu et.al.	2412.09605	null
2024-12-12	DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction	Yu Feng et.al.	2412.09572	null
2024-12-12	Can Modern LLMs Act as Agent Cores in Radiology~Environments?	Qiaoyu Zheng et.al.	2412.09529	link
2024-12-12	Agent-based Video Trimming	Lingfeng Yang et.al.	2412.09513	null
2024-12-12	Solving Multiagent Path Finding on Highly Centralized Networks	Foivos Fioravantes et.al.	2412.09433	null
2024-12-12	From Intention To Implementation: Automating Biomedical Research via LLMs	Yi Luo et.al.	2412.09429	null
2024-12-12	Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer	Adam Labiosa et.al.	2412.09417	null
2024-12-12	Uncommon Belief in Rationality	Qi Shi et.al.	2412.09407	null
2024-12-12	Falcon-UI: Understanding GUI Before Following User Instructions	Huawen Shen et.al.	2412.09362	null
2024-12-12	Does Low Spoilage Under Cold Conditions Foster Cultural Complexity During the Foraging Era? -- A Theoretical and Computational Inquiry	Minhyeok Lee et.al.	2412.09335	null
2024-12-12	Beware of Metacognitive Laziness: Effects of Generative Artificial Intelligence on Learning Motivation, Processes, and Performance	Yizhou Fan et.al.	2412.09315	null
2024-12-12	A Systematic Review of Knowledge Tracing and Large Language Models in Education: Opportunities, Issues, and Future Research	Yongwan Cho et.al.	2412.09248	null
2024-12-12	LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation	Yijun Liu et.al.	2412.09237	null
2024-12-12	Reconfigurable Intelligent Surface for Internet of Robotic Things	Wanli Ni et.al.	2412.09117	null
2024-12-12	Understanding Opportunities and Risks of Synthetic Relationships: Leveraging the Power of Longitudinal Research with Customised AI Tools	Alfio Ventura et.al.	2412.09086	null
2024-12-12	Towards the Structure and Mechanisms of Complex Systems, the Approach of the Quantitative Theory of Meaning	Inga Ivanova et.al.	2412.09007	null
2024-12-12	Dynamics of swarmalators in the presence of a contrarian	Gourab Kumar Sar et.al.	2412.08966	null
2024-12-12	From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning	Pusen Dong et.al.	2412.08920	null
2024-12-12	Neural Interactive Proofs	Lewis Hammond et.al.	2412.08897	null
2024-12-11	GPD-1: Generative Pre-training for Driving	Zixun Xie et.al.	2412.08643	link
2024-12-11	Generative Semantic Communication: Architectures, Technologies, and Applications	Jinke Ren et.al.	2412.08642	null
2024-12-11	RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation	Mingfei Han et.al.	2412.08591	null
2024-12-11	Automated Soap Opera Testing Directed by LLMs and Scenario Knowledge: Feasibility, Challenges, and Road Ahead	Yanqi Su et.al.	2412.08581	null
2024-12-11	GenPlan: Generative sequence models as adaptive planners	Akash Karthikeyan et.al.	2412.08565	link
2024-12-11	An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios	Leandro Parada et.al.	2412.08562	null
2024-12-11	Exact Algorithms for Multiagent Path Finding with Communication Constraints on Tree-Like Structures	Foivos Fioravantes et.al.	2412.08556	null
2024-12-11	Grimm: A Plug-and-Play Perturbation Rectifier for Graph Neural Networks Defending against Poisoning Attacks	Ao Liu et.al.	2412.08555	null
2024-12-11	MaestroMotif: Skill Design from Artificial Intelligence Feedback	Martin Klissarov et.al.	2412.08542	null
2024-12-11	Spatial segregation across travelling fronts in individual-based and continuum models for the growth of heterogeneous cell populations	José A. Carrillo et.al.	2412.08535	null
2024-12-11	Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel	Zun Wang et.al.	2412.08467	link
2024-12-11	IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health	Gauri Jain et.al.	2412.08463	link
2024-12-11	TapeAgents: a Holistic Framework for Agent Development and Optimization	Dzmitry Bahdanau et.al.	2412.08445	null
2024-12-11	From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons	Andrew Szot et.al.	2412.08442	null
2024-12-11	SweetieChat: A Strategy-Enhanced Role-playing Framework for Diverse Scenarios Handling Emotional Support Agent	Jing Ye et.al.	2412.08389	null
2024-12-11	Agency and Morality as part of Text Entry AI Assistant Personas	Andreas Komninos et.al.	2412.08360	null
2024-12-11	Lachesis: Predicting LLM Inference Accuracy using Structural Properties of Reasoning Paths	Naryeong Kim et.al.	2412.08281	null
2024-12-11	Can transformative AI shape a new age for our civilization?: Navigating between speculation and reality	Jesus L. Lobo et.al.	2412.08273	null
2024-12-11	Deep learning assisted SERS detection of prolines and hydroxylated prolines using nitrilotriacetic acid functionalized gold nanopillars	Yuan Zhang et.al.	2412.08239	null
2024-12-11	Learn How to Query from Unlabeled Data Streams in Federated Learning	Yuchang Sun et.al.	2412.08138	link
2024-12-10	Balancing Mobility Behaviors to avoid Global epidemics from Local Outbreaks	Pablo Valgañón et.al.	2412.07656	null
2024-12-10	Searching for Structure: Investigating Emergent Communication with Large Language Models	Tom Kouwenhoven et.al.	2412.07646	null
2024-12-10	Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization	Zongkai Liu et.al.	2412.07639	link
2024-12-10	Swarm Behavior Cloning	Jonas Nüßlein et.al.	2412.07617	null
2024-12-10	Modeling Speculative Trading Patterns in Token Markets: An Agent-Based Analysis with TokenLab	Mengjue Wang et.al.	2412.07512	null
2024-12-10	ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning	Hongshu Guo et.al.	2412.07507	null
2024-12-10	SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World	Jiaqi Zhang et.al.	2412.07472	link
2024-12-10	Event-Triggered Memory Control for Interval Type-2 Fuzzy Heterogeneous Multi-Agent Systems	Sen Kong et.al.	2412.07471	null
2024-12-10	Dynamic Ensemble Reasoning for LLM Experts	Jinwu Hu et.al.	2412.07448	null
2024-12-10	ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving	Rongqing Li et.al.	2412.07369	null
2024-12-10	My Words Imply Your Opinion: Reader Agent-Based Propagation Enhancement for Personalized Implicit Emotion Analysis	Jian Liao et.al.	2412.07367	null
2024-12-10	IntraLayer: A Platform of Digital Finance Platforms	Arman Abgaryan et.al.	2412.07348	null
2024-12-10	CoMA: Compositional Human Motion Generation with Multi-modal Agents	Shanlin Sun et.al.	2412.07320	null
2024-12-10	Superficial Consciousness Hypothesis for Autoregressive Transformers	Yosuke Miyanishi et.al.	2412.07278	link
2024-12-10	Reconciling Human Development and Giant Panda Protection Goals: Cost-efficiency Evaluation of Farmland Reverting and Energy Substitution Programs in Wolong National Reserve	Keyi Liu et.al.	2412.07275	null
2024-12-10	Speaker effects in spoken language comprehension	Hanlin Wu et.al.	2412.07238	null
2024-12-10	Parseval Regularization for Continual Reinforcement Learning	Wesley Chung et.al.	2412.07224	null
2024-12-10	A Distributed Deep Koopman Learning Algorithm for Control	Wenjian Hao et.al.	2412.07212	null
2024-12-10	Epidemiological Model Calibration via Graybox Bayesian Optimization	Puhua Niu et.al.	2412.07193	null
2024-12-10	Effective Reward Specification in Deep Reinforcement Learning	Julien Roy et.al.	2412.07177	null
2024-12-09	Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty	Meera Hahn et.al.	2412.06771	link
2024-12-09	AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark	Lan Li et.al.	2412.06724	link
2024-12-09	Asynchronous Agents with Perfect Recall: Model Reductions, Knowledge-Based Construction, and Model Checking for Coalitional Strategies	Dilian Gurov et.al.	2412.06706	null
2024-12-09	Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework	Tianming Liu et.al.	2412.06681	null
2024-12-09	Self-Interested Agents in Collaborative Learning: An Incentivized Adaptive Data-Centric Framework	Nithia Vijayan et.al.	2412.06597	null
2024-12-09	Argentine ants regulate traffic flow with stopped individuals	Ulrich Dobramysl et.al.	2412.06587	null
2024-12-09	Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation	Egor Cherepanov et.al.	2412.06531	null
2024-12-09	EFX Allocations on Some Multi-graph Classes	Umang Bhaskar et.al.	2412.06513	null
2024-12-09	The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap	Yedi Zhang et.al.	2412.06512	null
2024-12-09	Reasoning about Strategic Abilities in Stochastic Multi-agent Systems	Yedi Zhang et.al.	2412.06509	null
2024-12-09	PPT: Pre-Training with Pseudo-Labeled Trajectories for Motion Forecasting	Yihong Xu et.al.	2412.06491	null
2024-12-09	Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation	Xuesong Zhang et.al.	2412.06465	link
2024-12-09	Simulating Human-like Daily Activities with Desire-driven Autonomy	Yiding Wang et.al.	2412.06435	null
2024-12-09	World-Consistent Data Generation for Vision-and-Language Navigation	Yu Zhong et.al.	2412.06413	null
2024-12-09	StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI Astrophysicist	Cunshi Wang et.al.	2412.06412	null
2024-12-09	Augmenting the action space with conventions to improve multi-agent cooperation in Hanabi	F. Bredell et.al.	2412.06333	link
2024-12-09	Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation Using Privileged Information	Junqiao Wang et.al.	2412.06313	null
2024-12-09	Beyond pip install: Evaluating LLM Agents for the Automated Installation of Python Projects	Louis Milliken et.al.	2412.06294	link
2024-12-09	Enhanced Multi-Object Tracking Using Pose-based Virtual Markers in 3x3 Basketball	Li Yin et.al.	2412.06258	null
2024-12-09	In Silico Pharmacokinetic and Molecular Docking Studies of Natural Plants against Essential Protein KRAS for Treatment of Pancreatic Cancer	Marsha Mariya Kappan et.al.	2412.06237	null
2024-12-06	TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft	Qian Long et.al.	2412.05255	link
2024-12-06	AI's assigned gender affects human-AI cooperation	Sepideh Bazazi et.al.	2412.05214	null
2024-12-06	SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot	Jinlin Wu et.al.	2412.05187	link
2024-12-06	Sense and Sensitivity: Evaluating the simulation of social dynamics via Large Language Models	Da Ju et.al.	2412.05093	null
2024-12-06	Synchronization and desynchronization in ensembles of mobile agents	E. M. Varvarin et.al.	2412.05040	null
2024-12-06	Frontier Models are Capable of In-context Scheming	Alexander Meinke et.al.	2412.04984	null
2024-12-06	Putting the Iterative Training of Decision Trees to the Test on a Real-World Robotic Task	Raphael C. Engelhardt et.al.	2412.04974	null
2024-12-06	Who Speaks Next? Multi-party AI Discussion Leveraging the Systematics of Turn-taking in Murder Mystery Games	Ryota Nonomura et.al.	2412.04937	link
2024-12-06	Probing the contents of semantic representations from text, behavior, and brain data using the psychNorms metabase	Zak Hussain et.al.	2412.04936	link
2024-12-06	PERCY: A Multimodal Dataset and Conversational System for Personalized and Emotionally Aware Human-Robot Interaction	Mohammed Althubyani et.al.	2412.04908	null
2024-12-06	DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling	Minzheng Wang et.al.	2412.04905	link
2024-12-06	Estimating causal effects of customer satisfaction on downstream metrics in a multi-queue contact center	Sebastián Orellana et.al.	2412.04860	null
2024-12-06	Breaking Event Rumor Detection via Stance-Separated Multi-Agent Debate	Mingqing Zhang et.al.	2412.04859	null
2024-12-06	MTSpark: Enabling Multi-Task Learning with Spiking Neural Networks for Generalist Agents	Avaneesh Devkota et.al.	2412.04847	null
2024-12-06	A Temporally Correlated Latent Exploration for Reinforcement Learning	SuMin Oh et.al.	2412.04775	null
2024-12-06	REGENT: A Retrieval-Augmented Generalist Agent That Can Act In-Context in New Environments	Kaustubh Sridhar et.al.	2412.04759	null
2024-12-05	LiveNet: Robust, Minimally Invasive Multi-Robot Control for Safe and Live Navigation in Constrained Environments	Srikar Gouru et.al.	2412.04659	link
2024-12-05	Mutation mitigates finite-size effects in spatial evolutionary games	Chen Shen et.al.	2412.04654	null
2024-12-05	Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction	Yiheng Xu et.al.	2412.04454	null
2024-12-05	GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration	Kaiyi Huang et.al.	2412.04440	null
2024-12-05	Sub-diffraction Imaging of Carrier Dynamics in Halide Perovskite Semiconductors: Effects of Passivation, Morphology, and Ion Motion	Madeleine D. Breshears et.al.	2412.04423	null
2024-12-05	Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation	Xuying Li et.al.	2412.04415	null
2024-12-05	EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding	Yuqi Wu et.al.	2412.04380	link
2024-12-05	Intersection-Aware Assessment of EMS Accessibility in NYC: A Data-Driven Approach	Haoran Su et.al.	2412.04369	null
2024-12-05	Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting	Edoardo Cetin et.al.	2412.04368	null
2024-12-05	Machine Theory of Mind for Autonomous Cyber-Defence	Luke Swaby et.al.	2412.04367	null
2024-12-05	Reinforcement Learning for Freeway Lane-Change Regulation via Connected Vehicles	Ke Sun et.al.	2412.04341	null
2024-12-05	Action Mapping for Reinforcement Learning in Continuous Environments with Constraints	Mirco Theile et.al.	2412.04327	null
2024-12-05	Transient Multi-Agent Path Finding for Lifelong Navigation in Dense Environments	Jonathan Morag et.al.	2412.04256	null
2024-12-05	HyperMARL: Adaptive Hypernetworks for Multi-Agent RL	Kale-ab Abebe Tessera et.al.	2412.04233	null
2024-12-05	A Dynamic Safety Shield for Safe and Efficient Reinforcement Learning of Navigation Tasks	Murad Dawood et.al.	2412.04153	null
2024-12-05	Practical Considerations for Agentic LLM Systems	Chris Sypherd et.al.	2412.04093	null
2024-12-05	LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents	Bingchen Li et.al.	2412.04090	null
2024-12-05	Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning	Shicheng Zhou et.al.	2412.04078	link
2024-12-05	Prompt Engineering Guidance for Conceptual Agent-based Model Extraction using Large Language Models	Siamak Khatami et.al.	2412.04056	null
2024-12-05	Demonstration of Enhanced Qubit Readout via Reinforcement Learning	Aniket Chatterjee et.al.	2412.04053	null
2024-12-05	INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations	Yongming Zhu et.al.	2412.04037	null
2024-12-05	Dynamic Graph Representation with Contrastive Learning for Financial Market Prediction: Integrating Temporal Evolution and Static Relations	Yunhua Pei et.al.	2412.04034	null
2024-12-04	Navigation World Models	Amir Bar et.al.	2412.03572	null
2024-12-04	From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents	Xinyi Mou et.al.	2412.03563	link
2024-12-04	Categorize and randomize: a model of sequential stochastic choice	Ester Sudano et.al.	2412.03554	null
2024-12-04	SPICE: Smart Projection Interface for Cooking Enhancement	Vera Prohaska et.al.	2412.03551	null
2024-12-04	Risk-aware Classification via Uncertainty Quantification	Murat Sensoy et.al.	2412.03391	null
2024-12-04	WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis	Chengwei Hu et.al.	2412.03359	null
2024-12-04	AI-Driven Day-to-Day Route Choice	Leizhen Wang et.al.	2412.03338	link
2024-12-04	Mean-field Concentration of Opinion Dynamics in Random Graphs	Javiera Gutiérrez-Ramírez et.al.	2412.03207	null
2024-12-04	AffordDP: Generalizable Diffusion Policy with Transferable Affordance	Shijie Wu et.al.	2412.03142	null
2024-12-04	ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning	Zhe Xie et.al.	2412.03104	link
2024-12-04	Decentralized Mobile Target Tracking Using Consensus-Based Estimation with Nearly-Constant-Velocity Modeling	Amir Ahmad Ghods et.al.	2412.03095	null
2024-12-04	Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi	Francesc Wilhelmi et.al.	2412.03076	null
2024-12-04	Preference-based opponent shaping in differentiable games	Xinyu Qiao et.al.	2412.03072	null
2024-12-04	Constrained portfolio game with heterogeneous agents	Zongxia Liang et.al.	2412.03070	null
2024-12-04	Impact Of Income And Leisure On Optimal Portfolio, Consumption, Retirement Decisions Under Exponential Utility	Tae Ung Gang et.al.	2412.03001	null
2024-12-04	New HI views of the Galaxy and the Magellanic Clouds	Snezana Stanimirovic et.al.	2412.02981	null
2024-12-03	A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration	Thulio Amorim et.al.	2412.02881	null
2024-12-03	Out-of-Distribution Detection for Neurosymbolic Autonomous Cyber Agents	Ankita Samaddar et.al.	2412.02875	null
2024-12-03	An Information-Theoretic Analysis of Thompson Sampling for Logistic Bandits	Amaury Gouverneur et.al.	2412.02861	null
2024-12-03	Algorithmic idealism: what should you believe to experience next?	Markus P. Mueller et.al.	2412.02826	null
2024-12-03	Leveraging Tactile Sensing to Render both Haptic Feedback and Virtual Reality 3D Object Reconstruction in Robotic Telemanipulation	Gabriele Giudici et.al.	2412.02644	null
2024-12-03	Mobile Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning: A Scalable Framework	Ziheng Liu et.al.	2412.02581	null
2024-12-03	Generating Critical Scenarios for Testing Automated Driving Systems	Trung-Hieu Nguyen et.al.	2412.02574	link
2024-12-03	TAB-Fields: A Maximum Entropy Framework for Mission-Aware Adversarial Planning	Gokul Puthumanaillam et.al.	2412.02570	link
2024-12-03	Defending Against Diverse Attacks in Federated Learning Through Consensus-Based Bi-Level Optimization	Nicolás García Trillos et.al.	2412.02535	link
2024-12-03	General Resetting Theory for Group Avoidance	Juhee Lee et.al.	2412.02524	null
2024-12-03	Resonance: Learning to Predict Social-Aware Pedestrian Trajectories as Co-Vibrations	Conghao Wong et.al.	2412.02447	null
2024-12-03	A Multi-Agent Framework for Extensible Structured Text Generation in PLCs	Donghao Yang et.al.	2412.02410	null
2024-12-03	Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction	Ziqian Zou et.al.	2412.02395	null
2024-12-03	Bio-inspired visual relative localization for large swarms of UAVs	Martin Křížek et.al.	2412.02393	null
2024-12-03	Social patch foraging theory in an egalitarian group	Lisa Blum Moyse et.al.	2412.02381	null
2024-12-03	Reinforcement learning to learn quantum states for Heisenberg scaling accuracy	Jeongwoo Jae et.al.	2412.02334	link
2024-12-03	Optimizing Plastic Waste Collection in Water Bodies Using Heterogeneous Autonomous Surface Vehicles with Deep Reinforcement Learning	Alejandro Mendoza Barrionuevo et.al.	2412.02316	link
2024-12-03	Large Multimodal Agents for Accurate Phishing Detection with Enhanced Token Optimization and Cost Reduction	Fouad Trad et.al.	2412.02301	null
2024-12-03	Conformal Symplectic Optimization for Stable Reinforcement Learning	Yao Lyu et.al.	2412.02291	link
2024-12-03	BOTracle: A framework for Discriminating Bots and Humans	Jan Kadel et.al.	2412.02266	null
2024-12-03	Selective Reviews of Bandit Problems in AI via a Statistical View	Pengjie Zhou et.al.	2412.02251	null
2024-12-03	DataLab: A Unifed Platform for LLM-Powered Business Intelligence	Luoxuan Weng et.al.	2412.02205	null
2024-12-03	Distributed Task Allocation for Multi-Agent Systems: A Submodular Optimization Approach	Jing Liu et.al.	2412.02146	null
2024-12-03	A privacy-preserving distributed credible evidence fusion algorithm for collective decision-making	Chaoxiong Ma et.al.	2412.02130	null
2024-11-29	EF1 Allocations for Identical Trilean and Separable Single-Peaked Valuations	Umang Bhaskar et.al.	2411.19881	null
2024-11-29	Neuroplasticity and Psychedelics: a comprehensive examination of classic and non-classic compounds in pre and clinical models	Claudio Agnorelli et.al.	2411.19840	null
2024-11-29	Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation	Robin D. Pesl et.al.	2411.19804	null
2024-11-29	CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives	Armin Saghafian et.al.	2411.19787	link
2024-11-29	The 2024 Motile Active Matter Roadmap	Gerhard Gompper et.al.	2411.19783	null
2024-11-29	HVAC-DPT: A Decision Pretrained Transformer for HVAC Control	Anaïs Berkes et.al.	2411.19746	null
2024-11-29	Relative Representations of Latent Spaces enable Efficient Semantic Channel Equalization	Tomás Hüttebräucker et.al.	2411.19719	null
2024-11-29	RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents	Shi Zifeng et.al.	2411.19639	null
2024-11-29	Build An Influential Bot In Social Media Simulations With Large Language Models	Bailu Jin et.al.	2411.19635	null
2024-11-29	Solving Rubik's Cube Without Tricky Sampling	Yicheng Lin et.al.	2411.19583	null
2024-11-29	Early Versus Late Traffic Management For Autonomous Agents	Salman Ghori et.al.	2411.19582	null
2024-11-29	The ATTUNE model for Artificial Trust Towards Human Operators	Giannis Petousakis et.al.	2411.19580	null
2024-12-02	Fixed-relative-switch strategies for learning based event-triggered control of nonlinear multiagent systems	Ziming Wang et.al.	2411.19571	null
2024-11-29	Training Agents with Weakly Supervised Feedback from Large Language Models	Dihong Gong et.al.	2411.19547	null
2024-11-29	A Local Information Aggregation based Multi-Agent Reinforcement Learning for Robot Swarm Dynamic Task Allocation	Yang Lv et.al.	2411.19526	null
2024-11-29	RL-MILP Solver: A Reinforcement Learning Approach for Solving Mixed-Integer Linear Programs with Graph Neural Networks	Tae-Hoon Lee et.al.	2411.19517	null
2024-11-29	SANGO: Socially Aware Navigation through Grouped Obstacles	Rahath Malladi et.al.	2411.19497	null
2024-11-29	Two Timescale EXTRA for Smooth Non-convex Distributed Optimization Problems	Zeyu Peng et.al.	2411.19483	null
2024-11-29	Proto Successor Measure: Representing the Space of All Possible Solutions of Reinforcement Learning	Siddhant Agarwal et.al.	2411.19418	null
2024-11-28	Dynamic matching games: stationary equilibria under varying commitments	Nadia Guiñazú et.al.	2411.19372	null
2024-11-28	Integrating Transit Signal Priority into Multi-Agent Reinforcement Learning based Traffic Signal Control	Dickness Kakitahi Kwesiga et.al.	2411.19359	null
2024-11-27	Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective	Zhi Zhang et.al.	2411.18615	null
2024-11-27	Robust Offline Reinforcement Learning with Linearly Structured $f$ -Divergence Regularization	Cheng Tang et.al.	2411.18612	null
2024-11-27	AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans	Dillon Loh et.al.	2411.18539	link
2024-11-27	Biswas-Chatterjee-Sen kinetic exchange opinion model for two connected groups	Krzysztof Suchecki et.al.	2411.18527	null
2024-11-27	NeuroAI for AI Safety	Patrick Mineault et.al.	2411.18526	null
2024-11-27	Collective decision making by embodied neural agents	Nicolas Coucke et.al.	2411.18498	null
2024-11-27	Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator	Frederic Kirstein et.al.	2411.18444	null
2024-11-27	An AI-Assisted Multi-Agent Dual Dialogue System to Support Mental Health Care Providers	Onno P. Kampman et.al.	2411.18429	null
2024-11-27	Application of Soft Actor-Critic Algorithms in Optimizing Wastewater Treatment with Time Delays Integration	Esmaeel Mohammadi et.al.	2411.18305	null
2024-11-27	InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving	Xiyan Jiang et.al.	2411.18302	link
2024-11-27	Large Language Model-Brained GUI Agents: A Survey	Chaoyun Zhang et.al.	2411.18279	link
2024-11-27	Grid-augumented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents	Joongwon Chae et.al.	2411.18270	link
2024-11-27	Wearable intelligent throat enables natural speech in stroke patients with dysarthria	Chenyu Tang et.al.	2411.18266	null
2024-11-27	Exploration of LLM Multi-Agent Application Implementation Based on LangGraph+CrewAI	Zhihua Duan et.al.	2411.18241	null
2024-11-27	Scalable Multi-Objective Reinforcement Learning with Fairness Guarantees using Lorenz Dominance	Dimitris Michailidis et.al.	2411.18195	link
2024-11-27	DMVC-Tracker: Distributed Multi-Agent Trajectory Planning for Target Tracking Using Dynamic Buffered Voronoi and Inter-Visibility Cells	Yunwoo Lee et.al.	2411.18086	null
2024-11-27	RL for Mitigating Cascading Failures: Targeted Exploration via Sensitivity Factors	Anmol Dwivedi et.al.	2411.18050	link
2024-11-27	The Trusted Caregiver: The Influence of Eye and Mouth Design Incorporating the Baby Schema Effect in Virtual Humanoid Agents on Older Adults Users' Perception of Trustworthiness	Jennifer Hu et.al.	2411.18047	null
2024-11-27	Normative Feeling: Socially Patterned Affective Mechanisms	Stavros Anagnou et.al.	2411.18037	null
2024-11-27	AEGIS: An Agent-based Framework for General Bug Reproduction from Issue Descriptions	Xinchen Wang et.al.	2411.18015	null
2024-11-26	SketchAgent: Language-Driven Sequential Sketch Generation	Yael Vinker et.al.	2411.17673	null
2024-11-26	MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation	Harsh Singh et.al.	2411.17636	null
2024-11-26	Making History Readable	Bipasha Banerjee et.al.	2411.17600	null
2024-11-26	Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals	William A. Ingram et.al.	2411.17598	null
2024-11-26	Decision making in stochastic extensive form II: Stochastic extensive forms and games	E. Emanuel Rapsch et.al.	2411.17587	null
2024-11-26	Multi-Objective Reinforcement Learning for Automated Resilient Cyber Defence	Ross O'Driscoll et.al.	2411.17585	null
2024-11-26	Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach	Yaosheng Deng et.al.	2411.17552	null
2024-11-26	ShowUI: One Vision-Language-Action Model for GUI Visual Agent	Kevin Qinghong Lin et.al.	2411.17465	link
2024-11-26	Object-centric proto-symbolic behavioural reasoning from pixels	Ruben van Bergen et.al.	2411.17438	link
2024-11-26	Joint Combinatorial Node Selection and Resource Allocations in the Lightning Network using Attention-based Reinforcement Learning	Mahdi Salahshour et.al.	2411.17353	null
2024-11-26	Towards Intention Recognition for Robotic Assistants Through Online POMDP Planning	Juan Carlos Saborio et.al.	2411.17326	null
2024-11-26	A "Breathing" Mobile Communication Network	Chao Ge et.al.	2411.17290	null
2024-11-26	APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents	Jun Yu Chen et.al.	2411.17255	link
2024-11-26	Short-duration gamma-ray bursts from Kerr-Newman black hole mergers	Shad Ali et.al.	2411.17205	null
2024-11-26	P2DFlow: A Protein Ensemble Generative Model with SE(3) Flow Matching	Yaowei Jin et.al.	2411.17196	link
2024-11-26	Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment	Dongping Chen et.al.	2411.17188	null
2024-11-26	LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble	Yujeong Lee et.al.	2411.17135	null
2024-11-26	Creative Agents: Simulating the Systems Model of Creativity with Generative Agents	Naomi Imasato et.al.	2411.17065	null
2024-11-26	g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks	Zihan Wang et.al.	2411.17030	link
2024-11-26	CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios For Safety Hardening	Amar Kulkarni et.al.	2411.16996	null
2024-11-25	Winning opinion: Following Your Friends' Advice or That of Their Friends?	Francisco J. Muñoz et.al.	2411.16671	null
2024-11-25	Barriers on the EDGE: A scalable CBF architecture over EDGE for safe aerial-ground multi-agent coordination	Viswa Narayanan Sankaranarayanan et.al.	2411.16608	null
2024-11-25	Naive Algorithmic Collusion: When Do Bandit Learners Cooperate and When Do They Compete?	Connor Douglas et.al.	2411.16574	null
2024-11-25	Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation	Muhammad Burhan Hafez et.al.	2411.16532	link
2024-11-25	Reinforcement Learning for Bidding Strategy Optimization in Day-Ahead Energy Market	Luca Di Persio et.al.	2411.16519	null
2024-11-25	Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding	Hongzhi Zang et.al.	2411.16506	link
2024-11-25	Distributed Online Optimization with Stochastic Agent Availability	Juliette Achddou et.al.	2411.16477	null
2024-11-25	Generating social networks with static and dynamic utility-maximization approaches	Aldric Labarthe et.al.	2411.16464	link
2024-11-25	Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction	Haoming Li et.al.	2411.16457	null
2024-11-25	TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation	Linqing Zhong et.al.	2411.16425	null
2024-11-25	A Multi-agent Framework for Materials Laws Discovery	Bo Hu et.al.	2411.16416	null
2024-11-25	Functionality understanding and segmentation in 3D scenes	Jaime Corsetti et.al.	2411.16310	null
2024-11-25	Probing for Consciousness in Machines	Mathis Immertreu et.al.	2411.16262	null
2024-11-25	Open-Vocabulary Octree-Graph for 3D Scene Understanding	Zhigang Wang et.al.	2411.16253	null
2024-11-25	Enhancing Multi-Agent Consensus through Third-Party LLM Integration: Analyzing Uncertainty and Mitigating Hallucinations in Large Language Models	Zhihua Duan et.al.	2411.16189	null
2024-11-25	Stop Playing the Guessing Game! Target-free User Simulation for Evaluating Conversational Recommender Systems	Sunghwan Kim et.al.	2411.16160	null
2024-11-25	Multi-Robot Reliable Navigation in Uncertain Topological Environments with Graph Attention Networks	Zhuoyuan Yu et.al.	2411.16134	link
2024-11-25	Why the Agent Made that Decision: Explaining Deep Reinforcement Learning with Vision Masks	Rui Zuo et.al.	2411.16120	null
2024-11-25	Leverage Task Context for Object Affordance Ranking	Haojie Huang et.al.	2411.16082	null
2024-11-25	SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text	Reshmi Ghosh et.al.	2411.16077	null
2024-11-22	RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts	Hjalmar Wijk et.al.	2411.15114	link
2024-11-22	XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models	Yixin Dong et.al.	2411.15100	null
2024-11-22	On Multi-Agent Inverse Reinforcement Learning	Till Freihaut et.al.	2411.15046	null
2024-11-22	Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium	Zeyang Li et.al.	2411.15036	null
2024-11-22	On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations	Guojun Xiong et.al.	2411.15014	null
2024-11-22	ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data	Junhong Shen et.al.	2411.15004	link
2024-11-22	Free Energy Projective Simulation (FEPS): Active inference with interpretability	Joséphine Pazem et.al.	2411.14991	null
2024-11-22	BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence	Xuewu Lin et.al.	2411.14869	link
2024-11-22	Universal and Context-Independent Triggers for Precise Control of LLM Outputs	Jiashuo Liang et.al.	2411.14738	null
2024-11-22	Enhancing Clinical Trial Patient Matching through Knowledge Augmentation with Multi-Agents	Hanwen Shi et.al.	2411.14637	null
2024-11-21	Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning	Yafei Ou et.al.	2411.14622	null
2024-11-21	A Systematic Study of Multi-Agent Deep Reinforcement Learning for Safe and Robust Autonomous Highway Ramp Entry	Larry Schester et.al.	2411.14593	null
2024-11-21	G-RAG: Knowledge Expansion in Material Science	Radeen Mostafa et.al.	2411.14592	link
2024-11-21	SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world Human-Machine Interactions	Yaqi Wang et.al.	2411.14574	null
2024-11-21	Energy Efficient Automated Driving as a GNEP: Vehicle-in-the-loop Experiments	Viranjan Bhattacharyya et.al.	2411.14567	null
2024-11-21	Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models	Yuhao Dong et.al.	2411.14432	link
2024-11-21	Multi-Agent Environments for Vehicle Routing Problems	Ricardo Gama et.al.	2411.14411	link
2024-11-21	Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs	Ofer Dagan et.al.	2411.14404	null
2024-11-21	SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching	Arjun P S et.al.	2411.14322	link
2024-11-21	Q-CSM: Q-Learning-based Cognitive Service Management in Heterogeneous IoT Networks	Kubra Duran et.al.	2411.14281	null
2024-11-21	Explainable Multi-Agent Reinforcement Learning for Extended Reality Codec Adaptation	Pedro Enrique Iturria-Rivera et.al.	2411.14264	null
2024-11-21	Physics-Informed LLM-Agent for Automated Modulation Design in Power Electronics Systems	Junhua Liu et.al.	2411.14214	null
2024-11-21	SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization	Shuchen Zhu et.al.	2411.14166	null
2024-11-21	Multi-terminal Strong Coordination subject to Secrecy Constraints	Viswanathan Ramachandran et.al.	2411.14123	null
2024-11-21	Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems	Egor E. Nuzhin et.al.	2411.14117	null
2024-11-21	RAG-Thief: Scalable Extraction of Private Data from Retrieval-Augmented Generation Applications with Agent-based Attacks	Changyue Jiang et.al.	2411.14110	null
2024-11-21	Asymmetric Opinion Formation of Emotional Eccitable Agents	Irene Ferri et.al.	2411.14099	null
2024-11-21	Exploration by Running Away from the Past	Paul-Antoine Le Tolguenec et.al.	2411.14085	null
2024-11-21	On PI-control in Capacity-Limited Networks	Felix Agner et.al.	2411.14077	null
2024-11-21	Multi-LLM-Agent Systems: Techniques and Business Perspectives	Yingxuan Yang et.al.	2411.14033	null
2024-11-21	GPT versus Humans: Uncovering Ethical Concerns in Conversational Generative AI-empowered Multi-Robot Systems	Rebekah Rousi et.al.	2411.14009	null
2024-11-21	Approximating One-Sided and Two-Sided Nash Social Welfare With Capacities	Salil Gokhale et.al.	2411.14007	null
2024-11-21	Learning Two-agent Motion Planning Strategies from Generalized Nash Equilibrium for Model Predictive Control	Hansung Kim et.al.	2411.13983	link
2024-11-21	Movable Antenna-Equipped UAV for Data Collection in Backscatter Sensor Networks: A Deep Reinforcement Learning-based Approach	Yu Bai et.al.	2411.13970	null
2024-11-21	Cooperative Grasping and Transportation using Multi-agent Reinforcement Learning with Ternary Force Representation	Ing-Sheng Bernard-Tiong et.al.	2411.13942	null
2024-11-20	BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games	Davide Paglieri et.al.	2411.13543	null
2024-11-20	Metacognition for Unknown Situations and Environments (MUSE)	Rodolfo Valiente et.al.	2411.13537	null
2024-11-20	AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations	Gaurav Verma et.al.	2411.13451	null
2024-11-20	Robust Monocular Visual Odometry using Curriculum Learning	Assaf Lahiany et.al.	2411.13438	null
2024-11-20	A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback	Alireza Rashidi Laleh et.al.	2411.13410	null
2024-11-20	Simulating Liquidity: Agent-Based Modeling of Illiquid Markets for Fractional Ownership	Lars Fluri et.al.	2411.13381	null
2024-11-20	WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving	Siwei Chen et.al.	2411.13340	link
2024-11-20	Revealed Information	Laura Doval et.al.	2411.13293	null
2024-11-20	Transforming the Hybrid Cloud for Emerging AI Workloads	Deming Chen et.al.	2411.13239	null
2024-11-20	Extremum and Nash Equilibrium Seeking with Delays and PDEs: Designs & Applications	Tiago Roux Oliveira et.al.	2411.13234	null
2024-11-20	ViSTa Dataset: Do vision-language models understand sequential tasks?	Evžen Wybitul et.al.	2411.13211	link
2024-11-20	Engagement-Driven Content Generation with Large Language Models	Erica Coppolillo et.al.	2411.13187	null
2024-11-20	Cyborg Insect Factory: Automatic Assembly System to Build up Insect-computer Hybrid Robot Based on Vision-guided Robotic Arm Manipulation of Custom Bipolar Electrodes	Qifeng Lin et.al.	2411.13164	null
2024-11-20	Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning	Zhi Luo et.al.	2411.13116	null
2024-11-20	Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension	Yongdong Luo et.al.	2411.13093	link
2024-11-20	AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents	Kevin Godin-Dubois et.al.	2411.13072	null
2024-11-20	Breaking the Cycle of Recurring Failures: Applying Generative AI to Root Cause Analysis in Legacy Banking Systems	Siyuan Jin et.al.	2411.13017	null
2024-11-20	MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Collaborative Learning	Mircea Lică et.al.	2411.12977	null
2024-11-19	Non-Newtonian corrections to radiative viscosity: Israel-Stewart theory as a viscosity limiter	Lorenzo Gavassino et.al.	2411.12929	null
2024-11-19	Human-In-the-Loop Software Development Agents	Wannita Takerngsaksiri et.al.	2411.12924	null
2024-11-19	Reinforcement Learning, Collusion, and the Folk Theorem	Galit Askenazi-Golan et.al.	2411.12725	null
2024-11-19	UBSoft: A Simulation Platform for Robotic Skill Learning in Unbounded Soft Environments	Chunru Lin et.al.	2411.12711	null
2024-11-19	Weighted Envy Freeness With Limited Subsidies	Noga Klein Elmalem et.al.	2411.12696	null
2024-11-19	Quasi-stability notions in two-sided matching models	Nadia Guiñazú et.al.	2411.12533	null
2024-11-19	Coevolution of relationship-driven cooperation under recommendation protocol on multiplex networks	Hongyu Yue et.al.	2411.12436	null
2024-11-19	Instrumentation of Software Systems with OpenTelemetry for Software Visualization	Malte Hansen et.al.	2411.12380	null
2024-11-19	C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention	Xiaohe Li et.al.	2411.12313	null
2024-11-19	SNN-Based Online Learning of Concepts and Action Laws in an Open World	Christel Grimaud et.al.	2411.12308	null
2024-11-19	Emergence of Implicit World Models from Mortal Agents	Kazuya Horibe et.al.	2411.12304	null
2024-11-19	Could Humans Outshine AI in Visual Data Analysis?	Ratanond Koonchanok et.al.	2411.12299	null
2024-11-19	Efficient Training in Multi-Agent Reinforcement Learning: A Communication-Free Framework for the Box-Pushing Problem	David Ge et.al.	2411.12246	null
2024-11-19	Safe Navigation in Dynamic Environments using Density Functions	Sriram S. K. S Narayanan et.al.	2411.12206	link
2024-11-19	A More Advanced Group Polarization Measurement Approach Based on LLM-Based Agents and Graphs	Zixin Liu et.al.	2411.12196	null
2024-11-19	Action-Attentive Deep Reinforcement Learning for Autonomous Alignment of Beamlines	Siyu Wang et.al.	2411.12183	link
2024-11-19	A Combined Encoder and Transformer Approach for Coherent and High-Quality Text Generation	Jiajing Chen et.al.	2411.12157	null
2024-11-19	Reinforcement Learning with Action Sequence for Data-Efficient Robot Learning	Younggyo Seo et.al.	2411.12155	null
2024-11-19	HEIGHT: Heterogeneous Interaction Graph Transformer for Robot Navigation in Crowded and Constrained Environments	Shuijing Liu et.al.	2411.12150	null
2024-11-19	Hierarchical Trait-State Model for Decoding Dyadic Social Interactions	Qianying Wu et.al.	2411.12145	null
2024-11-19	Adversarial Multi-Agent Reinforcement Learning for Proactive False Data Injection Detection	Kejun Chen et.al.	2411.12130	null
2024-11-18	On-the-Go Path Planning and Repair in Static and Dynamic Scenarios	Daniel Ajeleye et.al.	2411.12014	null
2024-11-18	Generative World Explorer	Taiming Lu et.al.	2411.11844	null
2024-11-18	Reinterpreting Delay and Procrastination	Conrad Kosowsky et.al.	2411.11828	null
2024-11-18	Competing Bandits in Decentralized Large Contextual Matching Markets	Satush Parikh et.al.	2411.11794	null
2024-11-18	LLM-IE: A Python Package for Generative Information Extraction with Large Language Models	Enshuo Hsu et.al.	2411.11779	null
2024-11-18	Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework	Yannick Metz et.al.	2411.11761	null
2024-11-18	The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning	Longju Bai et.al.	2411.11758	link
2024-11-18	Distributed Asynchronous Time-Varying Quadratic Programming with Asynchronous Objective Sampling	Gabriel Behrendt et.al.	2411.11732	null
2024-11-18	Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment	Allison Huang et.al.	2411.11731	link
2024-11-18	TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World	Xianlong Wang et.al.	2411.11683	null
2024-11-18	Artificial Scientific Discovery	Antonio Norelli et.al.	2411.11672	null
2024-11-18	No-regret Exploration in Shuffle Private Reinforcement Learning	Shaojie Bai et.al.	2411.11647	null
2024-11-18	Signaling and Social Learning in Swarms of Robots	Leo Cazenille et.al.	2411.11616	null
2024-11-18	OASIS: Open Agents Social Interaction Simulations on One Million Agents	Ziyi Yang et.al.	2411.11581	link
2024-11-18	A Code Knowledge Graph-Enhanced System for LLM-Based Fuzz Driver Generation	Hanxiang Xu et.al.	2411.11532	link
2024-11-18	Structure learning with Temporal Gaussian Mixture for model-based Reinforcement Learning	Théophile Champion et.al.	2411.11511	null
2024-11-18	Timescale-agnostic characterisation for collective attention events	Tristan J. B. Cann et.al.	2411.11500	null
2024-11-18	Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models	Chenhang Cui et.al.	2411.11496	link
2024-11-18	Quantifying Preferences of Vision-Language Models via Value Decomposition in Social Media Contexts	Jingxuan Li et.al.	2411.11479	null
2024-11-18	Distributed Learning with Partial Information Sharing	P Raghavendra Rao et.al.	2411.11411	null
2024-11-18	IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos	Yunong Liu et.al.	2411.11409	link
2024-11-15	Fair Division via the Cake-Cutting Share	Yannan Bai et.al.	2411.10434	null
2024-11-15	Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash	Parsa Hejabi et.al.	2411.10422	link
2024-11-15	The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use	Siyuan Hu et.al.	2411.10323	link
2024-11-15	Static network structure cannot stabilize cooperation among Large Language Model agents	Jin Han et.al.	2411.10294	null
2024-11-15	Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review	Hossein Hassani et.al.	2411.10268	null
2024-11-15	Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning	Jingru Yang et.al.	2411.10252	null
2024-11-15	An Empirical Study on LLM-based Agents for Automated Bug Fixing	Xiangxin Meng et.al.	2411.10213	null
2024-11-15	Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking	Valeria Jannelli et.al.	2411.10184	null
2024-11-15	Let people fail! Exploring the influence of explainable virtual and robotic agents in learning-by-doing tasks	Marco Matarese et.al.	2411.10176	null
2024-11-15	The Surprising Ineffectiveness of Pre-Trained Visual Representations for Model-Based Reinforcement Learning	Moritz Schneider et.al.	2411.10175	null
2024-11-15	Semantics and Spatiality of Emergent Communication	Rotem Ben Zion et.al.	2411.10173	link
2024-11-15	Multi-UAV Search and Rescue in Wilderness Using Smart Agent-Based Probability Models	Zijian Ge et.al.	2411.10148	null
2024-11-15	Omnichain Web: The Universal Framework for Streamlined Chain Abstraction and Cross-Layer Interaction	Hardik Gajera et.al.	2411.10132	null
2024-11-15	Generative Agent Simulations of 1,000 People	Joon Sung Park et.al.	2411.10109	null
2024-11-15	Neural Port-Hamiltonian Models for Nonlinear Distributed Control: An Unconstrained Parametrization Approach	Muhammad Zakwan et.al.	2411.10096	null
2024-11-15	Enforcing Cooperative Safety for Reinforcement Learning-based Mixed-Autonomy Platoon Control	Jingyuan Zhou et.al.	2411.10031	null
2024-11-15	Orca: Enhancing Role-Playing Abilities of Large Language Models by Integrating Personality Traits	Yuxuan Huang et.al.	2411.10006	null
2024-11-15	Solvated Electrons and Hydroxyl Radicals at the Plasma-Liquid Interface	Seungjun Lee et.al.	2411.09991	null
2024-11-15	Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems	Taaha Kazi et.al.	2411.09972	null
2024-11-15	Sublinear-time Collision Detection with a Polynomial Number of States in Population Protocols	Takumi Araya et.al.	2411.09957	null
2024-11-14	Nash equilibrium seeking for a class of quadratic-bilinear Wasserstein distributionally robust games	Georgios Pantazis et.al.	2411.09636	null
2024-11-14	Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents	Yuyou Gan et.al.	2411.09523	null
2024-11-14	Randomized Truthful Auctions with Learning Agents	Gagan Aggarwal et.al.	2411.09517	null
2024-11-14	Strategic Sacrifice: Self-Organized Robot Swarm Localization for Inspection Productivity	Sneha Ramshanker et.al.	2411.09493	null
2024-11-14	Socio-Economic Consequences of Generative AI: A Review of Methodological Approaches	Carlos J. Costa et.al.	2411.09313	null
2024-11-14	Embedding Space Allocation with Angle-Norm Joint Classifiers for Few-Shot Class-Incremental Learning	Dunwei Tu et.al.	2411.09250	null
2024-11-14	Risk-aware MPPI for Stochastic Hybrid Systems	Hardik Parwana et.al.	2411.09198	link
2024-11-14	Enhancing reinforcement learning for population setpoint tracking in co-cultures	Sebastián Espinel-Ríos et.al.	2411.09177	null
2024-11-14	Artificial Theory of Mind and Self-Guided Social Organisation	Michael S. Harré et.al.	2411.09169	null
2024-11-14	Theory of Mind Enhances Collective Intelligence	Michael S. Harré et.al.	2411.09168	null
2024-11-14	Rationality based Innate-Values-driven Reinforcement Learning	Qin Yang et.al.	2411.09160	null
2024-11-14	The \emph{Optimist}: Towards Fully Automated Graph Theory Research	Randy Davila et.al.	2411.09158	link
2024-11-14	Personalized Help for Optimizing Low-Skilled Users' Strategy	Feng Gu et.al.	2411.09109	null
2024-11-13	Pheromone-Guided Navigation of Potential Mates: A Distinct Exploration Strategy	Nick Dashti et.al.	2411.09092	null
2024-11-13	Microfoundation Inference for Strategic Prediction	Daniele Bracale et.al.	2411.08998	null
2024-11-13	The Impact of Social Value Orientation on Nash Equilibria of Two Player Quadratic Games	Dan Calderone et.al.	2411.08809	null
2024-11-13	FinRobot: AI Agent for Equity Research and Valuation with Large Language Models	Tianyu Zhou et.al.	2411.08804	link
2024-11-13	Evaluating World Models with LLM for Decision Making	Chang Yang et.al.	2411.08794	null
2024-11-13	Towards Fair and Efficient Public Transportation: A Bus Stop Model	Martin Bullinger et.al.	2411.08784	link
2024-11-13	Logic-based Knowledge Awareness for Autonomous Agents in Continuous Spaces	Arabinda Ghosh et.al.	2411.08754	null
2024-11-13	Statistical Operating Characteristics of Current Early Phase Dose Finding Designs with Toxicity and Efficacy in Oncology	Hao Sun et.al.	2411.08698	null
2024-11-13	Inferring Parameter Distributions in Heterogeneous Motile Particle Ensembles: A Likelihood Approach for Second Order Langevin Models	Jan Albrecht et.al.	2411.08692	null
2024-11-13	Robot See, Robot Do: Imitation Reward for Noisy Financial Environments	Sven Goluža et.al.	2411.08637	null
2024-11-13	On the Application of Model Predictive Control to a Weighted Coverage Path Planning Problem	Kilian Schweppe et.al.	2411.08634	null
2024-11-13	NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation	Youzhi Liu et.al.	2411.08579	null
2024-11-13	Grammarization-Based Grasping with Deep Multi-Autoencoder Latent Space Exploration by Reinforcement Learning Agent	Leonidas Askianakis et.al.	2411.08566	null
2024-11-13	TimeLess: A Vision for the Next Generation of Software Development	Zeeshan Rasheed et.al.	2411.08507	null
2024-11-13	Towards Objective and Unbiased Decision Assessments with LLM-Enhanced Hierarchical Attention Networks	Junhua Liu et.al.	2411.08504	link
2024-11-13	AD-DINO: Attention-Dynamic DINO for Distance-Aware Embodied Reference Understanding	Hao Guo et.al.	2411.08451	null
2024-11-13	Towards Evaluating Large Language Models for Graph Query Generation	Siraj Munir et.al.	2411.08449	null
2024-11-13	Learning Dynamic Cognitive Map with Autonomous Navigation	Daria de Tinguy et.al.	2411.08447	link
2024-11-13	Anonymous Distributed Localisation via Spatial Population Protocols	Leszek Gąsieniec et.al.	2411.08434	null
2024-11-13	One STEP at a time: Language Agents are Stepwise Planners	Minh Nguyen et.al.	2411.08432	link
2024-11-13	Enhanced Classroom Dialogue Sequences Analysis with a Hybrid AI Agent: Merging Expert Rule-Base with Large Language Models	Yun Long et.al.	2411.08418	null
2024-11-13	BAMAX: Backtrack Assisted Multi-Agent Exploration using Reinforcement Learning	Geetansh Kalra et.al.	2411.08400	null
2024-11-12	LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models	Anoop Cherian et.al.	2411.08027	null
2024-11-12	Incentive Design with Spillovers	Krishna Dasaratha et.al.	2411.08026	null
2024-11-12	From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents	Chuyi Kong et.al.	2411.07965	null
2024-11-12	Learning Memory Mechanisms for Decision Making through Demonstrations	William Yue et.al.	2411.07954	link
2024-11-12	RedCode: Risky Code Execution and Generation Benchmark for Code Agents	Chengquan Guo et.al.	2411.07781	link
2024-11-12	Efficiency of energy-consuming random walkers: Variability in energy helps	Mohsen Ghasemi Nezhadhaghighi et.al.	2411.07771	null
2024-11-12	Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows	Fangyu Lei et.al.	2411.07763	null
2024-11-12	Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning	Stefan Pranger et.al.	2411.07700	null
2024-11-12	World Models: The Safety Perspective	Zifan Zeng et.al.	2411.07690	null
2024-11-12	Safe Exploitative Play with Untrusted Type Beliefs	Tongxin Li et.al.	2411.07679	null
2024-11-12	The relationship between general equilibrium models with infinite-lived agents and overlapping generations models, and some applications	Ngoc-Sang Pham et.al.	2411.07674	null
2024-11-12	Mitigating Bias in Queer Representation within Large Language Models: A Collaborative Agent Approach	Tianyi Huang et.al.	2411.07656	link
2024-11-12	Exploring Multi-Agent Reinforcement Learning for Unrelated Parallel Machine Scheduling	Maria Zampella et.al.	2411.07634	null
2024-11-12	A Simple Multi-agent Joint Prediction Method for Autonomous Driving	Mingyi Wang et.al.	2411.07612	null
2024-11-12	Multiple Non-cooperative Targets Encirclement by Relative Distance based Positioning and Neural Anti-Synchronization Control	Fen Liu et.al.	2411.07590	null
2024-11-12	Reinforcement Learning Framework for Quantitative Trading	Alhassan S. Yasin et.al.	2411.07585	null
2024-11-12	Stability for a stochastic fractional differential variational inequality with Lévy jump	Yue Zeng et.al.	2411.07557	null
2024-11-12	Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective	Raed Al Kontar et.al.	2411.07523	null
2024-11-12	Two-Layer Attention Optimization for Bimanual Coordination	Justin Ting et.al.	2411.07470	null
2024-11-12	BudgetMLAgent: A Cost-Effective LLM Multi-Agent system for Automating Machine Learning Tasks	Shubham Gandhi et.al.	2411.07464	null
2024-11-11	Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving	Botao Yu et.al.	2411.07228	null
2024-11-11	Grounding Video Models to Actions through Goal Conditioned Exploration	Yunhao Luo et.al.	2411.07223	null
2024-11-11	'Explaining RL Decisions with Trajectories': A Reproducibility Study	Karim Abdel Sadek et.al.	2411.07200	link
2024-11-11	Gradual Fine-Tuning with Graph Routing for Multi-Source Unsupervised Domain Adaptation	Yao Ma et.al.	2411.07185	null
2024-11-11	RoundTable: Investigating Group Decision-Making Mechanism in Multi-Agent Collaboration	Young-Min Cho et.al.	2411.07161	null
2024-11-11	Azurin-Based Peptide p28 Arrests the p53-HDM2 Interactions: A Novel Anti-Cancer Pathway	Albin Joy et.al.	2411.07124	null
2024-11-11	Learning Multi-Agent Collaborative Manipulation for Long-Horizon Quadrupedal Pushing	Chuye Hong et.al.	2411.07104	null
2024-11-11	Bounded Rationality Equilibrium Learning in Mean Field Games	Yannick Eich et.al.	2411.07099	link
2024-11-11	A Multi-Agent Approach for REST API Testing with Semantic Graphs and LLM-Driven Inputs	Myeongsoo Kim et.al.	2411.07098	null
2024-11-11	Differentially-Private Collaborative Online Personalized Mean Estimation	Yauhen Yakimenka et.al.	2411.07094	null
2024-11-11	To Train or Not to Train: Balancing Efficiency and Training Cost in Deep Reinforcement Learning for Mobile Edge Computing	Maddalena Boscaro et.al.	2411.07086	null
2024-11-11	Learning Collective Dynamics of Multi-Agent Systems using Event-based Vision	Minah Lee et.al.	2411.07039	null
2024-11-11	Designing Reliable Experiments with Generative Agent-Based Modeling: A Comprehensive Guide Using Concordia by Google DeepMind	Alejandro Leonardo García Navarro et.al.	2411.07038	null
2024-11-11	Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching	Arnav Kumar Jain et.al.	2411.07007	link
2024-11-11	Enhancing Robot Assistive Behaviour with Reinforcement Learning and Theory of Mind	Antonio Andriella et.al.	2411.07003	link
2024-11-11	Maximizing Nash Social Welfare in 2-Value Instances: A Simpler Proof for the Half-Integer Case	Kurt Mehlhorn et.al.	2411.06924	null
2024-11-11	Scalable Distributed Least Squares Algorithm for Linear Algebraic Equations via Scheduling	Shenyu Liu et.al.	2411.06883	null
2024-11-11	Distributed Graph Augmentation Protocols to Achieve Strong Connectivity in Multi-Agent Networks	Guilherme Ramos et.al.	2411.06880	link
2024-11-11	Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC	Aditya Soni et.al.	2411.06815	null
2024-11-11	Generative midtended cognition and Artificial Intelligence. Thinging with thinging things	Xabier E. Barandiaran et.al.	2411.06812	null
2024-11-08	Topology-aware Reinforcement Feature Space Reconstruction for Graph Data	Wangyang Ying et.al.	2411.05742	null
2024-11-08	A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics	Puze Liu et.al.	2411.05718	null
2024-11-08	Settling the Complexity of Popularity in Additively Separable and Fractional Hedonic Games	Martin Bullinger et.al.	2411.05713	null
2024-11-08	Data-Driven Distributed Common Operational Picture from Heterogeneous Platforms using Multi-Agent Reinforcement Learning	Indranil Sur et.al.	2411.05683	null
2024-11-08	The influence of persona and conversational task on social interactions with a LLM-controlled embodied conversational agent	Leon O. H. Kroczek et.al.	2411.05653	null
2024-11-08	LightVA: Lightweight Visual Analytics with LLM Agent-Based Task Planning and Execution	Yuheng Zhao et.al.	2411.05651	null
2024-11-08	Expectation vs. Reality: Towards Verification of Psychological Games	Marta Kwiatkowska et.al.	2411.05599	null
2024-11-08	Smart navigation through a rotating barrier: Deep reinforcement learning with application to size-based separation of active microagents	Mohammad Hossein Masoudi et.al.	2411.05587	null
2024-11-08	Tangled Program Graphs as an alternative to DRL-based control algorithms for UAVs	Hubert Szolc et.al.	2411.05586	link
2024-11-08	Parameterized Voter Relevance in Facility Location Games with Tree-Shaped Invitation Graphs	Ryoto Ando et.al.	2411.05574	null
2024-11-08	Time-to-reach Bounds for Verification of Dynamical Systems Using the Koopman Spectrum	Jianqiang Ding et.al.	2411.05554	null
2024-11-08	Evolution of cooperation in a three-strategy game combining snowdrift and stag hunt games	Hirofumi Takesue et.al.	2411.05543	null
2024-11-08	Generating surrogate temporal networks from mesoscale building blocks	Giulia Cencetti et.al.	2411.05477	link
2024-11-08	Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction	Émiland Garrabé et.al.	2411.05474	null
2024-11-08	Emergent Cooperative Strategies for Multi-Agent Shepherding via Reinforcement Learning	Italo Napolitano et.al.	2411.05454	null
2024-11-08	WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models	Shengda Fan et.al.	2411.05451	link
2024-11-08	VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM	Jeongwoo Lee et.al.	2411.05423	null
2024-11-08	Towards Low-Resource Harmful Meme Detection with LMM Agents	Jianzhao Huang et.al.	2411.05383	link
2024-11-08	Enhancing Cluster Resilience: LLM-agent Based Autonomous Intelligent Cluster Diagnosis System and Evaluation Framework	Honghao Shi et.al.	2411.05349	null
2024-11-08	LLM-PySC2: Starcraft II learning environment for Large Language Models	Zongyuan Li et.al.	2411.05348	link
2024-11-07	Few-Shot Task Learning through Inverse Generative Modeling	Aviv Netanyahu et.al.	2411.04987	null
2024-11-07	Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games	Usman Anwar et.al.	2411.04976	link
2024-11-07	StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration	Panwen Hu et.al.	2411.04925	null
2024-11-07	OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models	Siming Huang et.al.	2411.04905	null
2024-11-07	Achieving superconductivity in infinite-layer nickelate thin films by aluminum sputtering deposition	Dongxin Zhang et.al.	2411.04896	null
2024-11-07	GUI Agents with Foundation Models: A Comprehensive Survey	Shuai Wang et.al.	2411.04890	null
2024-11-07	Think Smart, Act SMARL! Analyzing Probabilistic Logic Driven Safety in Multi-Agent Reinforcement Learning	Satchit Chatterji et.al.	2411.04867	link
2024-11-07	Robust Regulation of Labour Contracts	Théo Durandard et.al.	2411.04841	null
2024-11-07	Plasticity Loss in Deep Reinforcement Learning: A Survey	Timo Klein et.al.	2411.04832	null
2024-11-07	MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation	Sayan Paul et.al.	2411.04796	null
2024-11-07	A Continuification-Based Control Solution for Large-Scale Shepherding	Beniamino Di Lorenzo et.al.	2411.04791	null
2024-11-07	Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research	Xuewen Han et.al.	2411.04788	link
2024-11-07	Navigating Trade-offs: Policy Summarization for Multi-Objective Reinforcement Learning	Zuzanna Osika et.al.	2411.04784	link
2024-11-07	Learning from Demonstration with Hierarchical Policy Abstractions Toward High-Performance and Courteous Autonomous Racing	Chanyoung Chung et.al.	2411.04735	null
2024-11-07	A dynamical model of platform choice and online segregation	Sven Banisch et.al.	2411.04681	null
2024-11-07	CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation	Jie Liu et.al.	2411.04679	null
2024-11-07	Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning	Zhiyu Shao et.al.	2411.04672	link
2024-11-07	CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR	Kadir Burak Buldu et.al.	2411.04671	null
2024-11-07	IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving	Clémence Grislain et.al.	2411.04653	link
2024-11-07	Mint: Cost-Efficient Tracing with All Requests Collection via Commonality and Variability Analysis	Haiyu Huang et.al.	2411.04605	null
2024-11-06	Predicting and Publishing Accurate Imbalance Prices Using Monte Carlo Tree Search	Fabio Pavirani et.al.	2411.04011	null
2024-11-06	Temporal Network Creation Games: The Impact of Non-Locality and Terminals	Davide Bilò et.al.	2411.03973	null
2024-11-06	Almost Time-Optimal Loosely-Stabilizing Leader Election on Arbitrary Graphs Without Identifiers in Population Protocols	Haruki Kanaya et.al.	2411.03902	null
2024-11-06	AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making	Yizhe Huang et.al.	2411.03865	link
2024-11-06	Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC	Tyler Clark et.al.	2411.03820	null
2024-11-06	From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning	Zhirui Deng et.al.	2411.03817	null
2024-11-06	MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue	Fengxiang Wang et.al.	2411.03814	null
2024-11-06	Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data	Chengrui Qu et.al.	2411.03810	link
2024-11-06	Multi-Modal Intelligent Channel Modeling: A New Modeling Paradigm via Synesthesia of Machines	Lu Bai et.al.	2411.03711	null
2024-11-06	Learn to Slice, Slice to Learn: Unveiling Online Optimization and Reinforcement Learning for Slicing AI Services	Amr Abo-eleneen et.al.	2411.03686	null
2024-11-06	Imagined Potential Games: A Framework for Simulating, Learning and Evaluating Interactive Behaviors	Lingfeng Sun et.al.	2411.03669	null
2024-11-06	Privacy-Preserving Resilient Vector Consensus	Bing Liu et.al.	2411.03633	null
2024-11-06	CPEG: Leveraging Consistency Policy with Consensus Guidance for Multi-agent Exploration	Yuqian Fu et.al.	2411.03603	null
2024-11-05	Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level	Antoine Grosnit et.al.	2411.03562	null
2024-11-05	VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation	Haochen Zhang et.al.	2411.03540	link
2024-11-05	AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution	Zhiqiang Xie et.al.	2411.03519	null
2024-11-05	An Open-source Sim2Real Approach for Sensor-independent Robot Navigation in a Grid	Murad Mehrab Abrar et.al.	2411.03494	link
2024-11-05	Watson: A Cognitive Observability Framework for the Reasoning of Foundation Model-Powered Agents	Benjamin Rombaut et.al.	2411.03455	null
2024-11-05	SAUCE: Synchronous and Asynchronous User-Customizable Environment for Multi-Agent LLM Interaction	Shlomo Neuberger et.al.	2411.03397	link
2024-11-05	SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents	Dawei Li et.al.	2411.03284	link
2024-11-05	Causal Responsibility Attribution for Human-AI Collaboration	Yahang Qi et.al.	2411.03275	link
2024-11-05	Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities	Ryosuke Takata et.al.	2411.03252	null
2024-11-05	Troll Farms	Philipp Denter et.al.	2411.03241	null
2024-11-05	A resolved Lyman-Alpha profile with doubly peaked emission at z~7	C. Moya-Sierralta et.al.	2411.03222	null
2024-11-05	GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis	Temitope Akinboyewa et.al.	2411.03205	link
2024-11-05	Online Data Collection for Efficient Semiparametric Inference	Shantanu Gupta et.al.	2411.03195	link
2024-11-05	Hierarchical Orchestra of Policies	Thomas P Cannon et.al.	2411.03008	null
2024-11-05	Accelerating Task Generalisation with Multi-Level Hierarchical Options	Thomas P Cannon et.al.	2411.02998	null
2024-11-05	Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation	Francisco Giral et.al.	2411.02975	null
2024-11-05	Embedding Safety into RL: A New Take on Trust Region Methods	Nikola Milosevic et.al.	2411.02957	null
2024-11-05	Constant Approximation for Weighted Nash Social Welfare with Submodular Valuations	Yuda Feng et.al.	2411.02942	null
2024-11-05	Multi-Modal 3D Scene Graph Updater for Shared and Dynamic Environments	Emilio Olivastri et.al.	2411.02938	null
2024-11-05	Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent	Yangning Li et.al.	2411.02937	link
2024-11-05	Polyhedral study of a temporal rural postman problem: application in inspection of railway track without disturbing train schedules	Somnath Buriuly et.al.	2411.02822	null
2024-11-05	DroidSpeak: Enhancing Cross-LLM Communication	Yuhan Liu et.al.	2411.02820	null
2024-11-04	Fair and Welfare-Efficient Constrained Multi-matchings under Uncertainty	Elita Lobo et.al.	2411.02654	link
2024-11-04	Fine Grained Insider Risk Detection	Birkett Huber et.al.	2411.02645	null
2024-11-04	Learning to Assist Humans without Inferring Rewards	Vivek Myers et.al.	2411.02623	link
2024-11-04	Multi-Agent Decision Transformers for Dynamic Dispatching in Material Handling Systems Leveraging Enterprise Big Data	Xian Yeow Lee et.al.	2411.02584	null
2024-11-04	Attacking Vision-Language Computer Agents via Pop-ups	Yanzhe Zhang et.al.	2411.02391	link
2024-11-04	Two-Sided Learning in Decentralized Matching Markets	Vade Shah et.al.	2411.02377	null
2024-11-04	Social-RAG: Retrieving from Group Interactions to Socially Ground Proactive AI Generation to Group Preferences	Ruotong Wang et.al.	2411.02353	null
2024-11-04	WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning	Zehan Qi et.al.	2411.02337	link
2024-11-04	CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments	Kung-Hsiang Huang et.al.	2411.02305	link
2024-11-04	Kinetic exchange opinion dynamics for the battleground-states in the 2024 US presidential elections	Soumyajyoti Biswas et.al.	2411.02240	null
2024-11-04	Positive Experience Reflection for Agents in Interactive Text Environments	Philip Lippmann et.al.	2411.02223	null
2024-11-04	CryptoEL: A Novel Experiential Learning Tool for Enhancing K-12 Cryptography Education	Pranathi Rayavaram et.al.	2411.02143	null
2024-11-04	Foundations and Recent Trends in Multimodal Mobile Agents: A Survey	Biao Wu et.al.	2411.02006	link
2024-11-04	Deep memetic models for combinatorial optimization problems: application to the tool switching problem	Jhon Edgar Amaya et.al.	2411.01922	null
2024-11-04	Efficient Active Imitation Learning with Random Network Distillation	Emilien Biré et.al.	2411.01894	null
2024-11-04	ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation	Hengkai Tan et.al.	2411.01850	null
2024-11-04	IRS-Enhanced Secure Semantic Communication Networks: Cross-Layer and Context-Awared Resource Allocation	Lingyi Wang et.al.	2411.01821	null
2024-11-04	A Polynomial-Time Algorithm for Fair and Efficient Allocation with a Fixed Number of Agents	Ryoga Mahara et.al.	2411.01810	null
2024-11-04	Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge	Weihua Du et.al.	2411.01796	link
2024-11-04	Revisiting Game-Theoretic Control in Socio-Technical Networks: Emerging Design Frameworks and Contemporary Applications	Quanyan Zhu et.al.	2411.01794	null
2024-11-04	Lyapunov-guided Multi-Agent Reinforcement Learning for Delay-Sensitive Wireless Scheduling	Cheng Zhang et.al.	2411.01766	null
2024-11-04	Show, Don't Tell: Learning Reward Machines from Demonstrations for Reinforcement Learning-Based Cardiac Pacemaker Synthesis	John Komp et.al.	2411.01750	null
2024-11-04	DynaSaur: Large Language Agents Beyond Predefined Actions	Dang Nguyen et.al.	2411.01747	null
2024-11-04	Taking AI Welfare Seriously	Robert Long et.al.	2411.00986	null

(back to top)

Large Language Model Agent

Publish Date	Title	Authors	PDF	Code
2025-02-24	IGDA: Interactive Graph Discovery through Large Language Model Agents	Alex Havrilla et.al.	2502.17189	null
2025-02-24	Grounded Persuasive Language Generation for Automated Marketing	Jibang Wu et.al.	2502.16810	null
2025-02-24	Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances	Yaozu Wu et.al.	2502.16804	null
2025-02-23	Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System	Saikat Barua et.al.	2502.16750	null
2025-02-23	RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents	Sho Nakatani et.al.	2502.16730	null
2025-02-20	Vending-Bench: A Benchmark for Long-Term Coherence of Autonomous Agents	Axel Backlund et.al.	2502.15840	null
2025-02-18	LLM Trading: Analysis of LLM Agent Behavior in Experimental Asset Markets	Thomas Henning et.al.	2502.15800	null
2025-02-21	Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing	Masaya Kobayashi et.al.	2502.15506	null
2025-02-21	Textual-to-Visual Iterative Self-Verification for Slide Generation	Yunqing Xu et.al.	2502.15412	null
2025-02-21	I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search	Zujie Liang et.al.	2502.14693	null
2025-02-20	Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization	Zhitao He et.al.	2502.14496	null
2025-02-20	FlowAgent: Achieving Compliance and Flexibility for Workflow Agents	Yuchen Shi et.al.	2502.14345	link
2025-02-19	Investigating Non-Transitivity in LLM-as-a-Judge	Yi Xu et.al.	2502.14074	null
2025-02-19	An LLM-based Agent for Reliable Docker Environment Configuration	Ruida Hu et.al.	2502.13681	null
2025-02-16	Understanding Dynamic Diffusion Process of LLM-based Agents under Information Asymmetry	Yiwen Zhang et.al.	2502.13160	null
2025-02-18	SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems	Mike Zhang et.al.	2502.12927	link
2025-02-18	Towards more Contextual Agents: An extractor-Generator Optimization Framework	Mourad Aouini et.al.	2502.12926	null
2025-02-18	DemonAgent: Dynamically Encrypted Multi-Backdoor Implantation Attack on LLM-based Agent	Pengyu Zhu et.al.	2502.12575	link
2025-02-18	Investigating and Extending Homans' Social Exchange Theory with Large Language Model based Agents	Lei Wang et.al.	2502.12450	link
2025-02-17	Connecting Large Language Model Agent to High Performance Computing Resource	Heng Ma et.al.	2502.12280	null
2025-02-17	Scaling Autonomous Agents via Automatic Reward Modeling And Planning	Zhenfang Chen et.al.	2502.12130	null
2025-02-17	TimeCAP: Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents	Geon Lee et.al.	2502.11418	null
2025-02-16	A Survey of LLM-based Agents in Medicine: How far are we from Baymax?	Wenxuan Wang et.al.	2502.11211	null
2025-02-16	SCALE: Towards Collaborative Content Analysis in Social Science with Large Language Model Agents and Human Intervention	Chengshuai Zhao et.al.	2502.10937	null
2025-02-14	Can Large Language Model Agents Balance Energy Systems?	Xinxing Ren et.al.	2502.10557	null
2025-02-13	MDCrow: Automating Molecular Dynamics Workflows with Large Language Models	Quintina Campbell et.al.	2502.09565	link
2025-02-12	SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent	Keyeun Lee et.al.	2502.08599	link
2025-02-13	Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation	Mahnaz Koupaee et.al.	2502.08514	link
2025-02-07	Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization	Zelai Xu et.al.	2502.04686	null
2025-02-06	Multi-Agent Reinforcement Learning with Focal Diversity Optimization	Selim Furkan Tekin et.al.	2502.04492	link
2025-02-04	Position: Scaling LLM Agents Requires Asymptotic Analysis with LLM Primitives	Elliot Meyerson et.al.	2502.04358	null
2025-02-03	Simulating Rumor Spreading in Social Networks using LLM Agents	Tianrui Hu et.al.	2502.01450	link
2025-02-03	PlotGen: Multi-Agent LLM-based Scientific Data Visualization via Multimodal Feedback	Kanika Goswami et.al.	2502.00988	null
2025-02-02	RTBAgent: A LLM-based Agent System for Real-Time Bidding	Leng Cai et.al.	2502.00792	link
2025-02-02	Meta-Prompt Optimization for LLM-Based Sequential Decision Making	Mingze Kong et.al.	2502.00728	null
2025-02-02	PhiP-G: Physics-Guided Text-to-3D Compositional Scene Generation	Qixuan Li et.al.	2502.00708	null
2025-01-31	Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game	Mustafa O. Karabag et.al.	2501.19398	link
2025-01-28	Large Language Model Critics for Execution-Free Evaluation of Code Changes	Aashish Yadavally et.al.	2501.16655	link
2024-12-30	DropMicroFluidAgents (DMFAs): Autonomous Droplet Microfluidic Research Framework Through Large Language Model Agents	Dinh-Nguyen Nguyen et.al.	2501.14772	link
2025-01-24	AI Chatbots as Professional Service Agents: Developing a Professional Identity	Wenwen Li et.al.	2501.14179	null
2025-02-08	Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents	Shrinidhi Kumbhar et.al.	2501.13299	null
2025-01-20	Towards Advancing Code Generation with Large Language Models: A Research Roadmap	Haolin Jin et.al.	2501.11354	null
2025-02-13	Large Language Model Agents for Radio Map Generation and Wireless Network Planning	Hongye Quan et.al.	2501.11283	null
2024-12-18	Autonomous Microscopy Experiments through Large Language Model Agents	Indrajeet Mandal et.al.	2501.10385	null
2025-01-13	Lifelong Learning of Large Language Model based Agents: A Roadmap	Junhao Zheng et.al.	2501.07278	link
2025-01-10	Multi-Agent Collaboration Mechanisms: A Survey of LLMs	Khanh-Tung Tran et.al.	2501.06322	null
2025-01-09	Emergence of human-like polarization among large language model agents	Jinghua Piao et.al.	2501.05171	null
2025-01-27	MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning	Pu Yang et.al.	2501.01834	null
2025-01-03	SDPO: Segment-Level Direct Preference Optimization for Social Agents	Aobo Kong et.al.	2501.01821	link
2025-01-03	AgentRefine: Enhancing Agent Generalization through Refinement Tuning	Dayuan Fu et.al.	2501.01702	null
2025-01-02	BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery	Kanishk Gandhi et.al.	2501.01540	link
2024-12-31	Enabling New HDLs with Agents	Mark Zakharov et.al.	2501.00642	null
2025-01-09	Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding	Yue Fan et.al.	2501.00358	null
2024-12-30	AI Agent for Education: von Neumann Multi-Agent System Framework	Yuan-Hao Jiang et.al.	2501.00083	null
2024-12-17	AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models	Haoyi Zhang et.al.	2412.19824	null
2024-12-24	Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent	Farhad Nooralahzadeh et.al.	2412.18428	link
2024-12-24	Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering	Zhongjian Hu et.al.	2412.18351	null
2024-12-24	INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent	Haohang Li et.al.	2412.18174	null
2024-12-24	Molly: Making Large Language Model Agents Solve Python Problem More Logically	Rui Xiao et.al.	2412.18093	null
2024-12-17	On the Structural Memory of LLM Agents	Ruihong Zeng et.al.	2412.15266	link
2024-12-18	Tree-of-Code: A Hybrid Approach for Robust Complex Task Planning and Execution	Ziyi Ni et.al.	2412.14212	null
2024-12-17	RareAgents: Autonomous Multi-disciplinary Team for Rare Disease Diagnosis and Treatment	Xuanzhong Chen et.al.	2412.12475	null
2024-12-14	Towards Action Hijacking of Large Language Model-based Agent	Yuyang Zhang et.al.	2412.10807	null
2025-01-09	Active Inference for Self-Organizing Multi-LLM Systems: A Bayesian Thermodynamic Approach to Adaptation	Rithvik Prakki et.al.	2412.10425	link
2024-12-19	Can Modern LLMs Act as Agent Cores in Radiology Environments?	Qiaoyu Zheng et.al.	2412.09529	link
2024-12-09	Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework	Tianming Liu et.al.	2412.06681	null
2024-12-09	Simulating Human-like Daily Activities with Desire-driven Autonomy	Yiding Wang et.al.	2412.06435	null
2024-12-09	StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI Astrophysicist	Cunshi Wang et.al.	2412.06412	null
2024-12-09	Beyond pip install: Evaluating LLM Agents for the Automated Installation of Python Projects	Louis Milliken et.al.	2412.06294	link
2024-12-08	Cooperative SQL Generation for Segmented Databases By Using Multi-functional LLM Agents	Zhiguang Wu et.al.	2412.05850	null
2024-12-04	DataLab: A Unified Platform for LLM-Powered Business Intelligence	Luoxuan Weng et.al.	2412.02205	null
2024-12-02	HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing	Lajos Muzsai et.al.	2412.01778	link
2024-12-02	SAUP: Situation Awareness Uncertainty Propagation on LLM Agent	Qiwei Zhao et.al.	2412.01033	null
2024-12-03	Multi-Agent System for Cosmological Parameter Analysis	Andrew Laverick et.al.	2412.00431	link
2024-11-28	SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments	Yue Cao et.al.	2412.00114	null
2024-11-29	Training Agents with Weakly Supervised Feedback from Large Language Models	Dihong Gong et.al.	2411.19547	null
2024-11-26	LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble	Yujeong Lee et.al.	2411.17135	null
2024-11-21	Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning	Song Jiang et.al.	2411.13904	null
2024-11-19	Human-In-the-Loop Software Development Agents	Wannita Takerngsaksiri et.al.	2411.12924	null
2024-12-16	A More Advanced Group Polarization Measurement Approach Based on LLM-Based Agents and Graphs	Zixin Liu et.al.	2411.12196	null
2024-11-15	Static network structure cannot stabilize cooperation among Large Language Model agents	Jin Han et.al.	2411.10294	null
2024-11-15	An Empirical Study on LLM-based Agents for Automated Bug Fixing	Xiangxin Meng et.al.	2411.10213	null
2024-11-14	Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents	Yuyou Gan et.al.	2411.09523	null
2024-10-29	FinVision: A Multi-Agent Framework for Stock Market Prediction	Sorouralsadat Fatemi et.al.	2411.08899	null
2024-11-11	Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving	Botao Yu et.al.	2411.07228	null
2024-11-05	Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities	Ryosuke Takata et.al.	2411.03252	null
2024-11-02	Interacting Large Language Model Agents. Interpretable Models and Social Learning	Adit Jain et.al.	2411.01271	null
2024-11-02	AutoPT: How Far Are We from the End2End Automated Web Penetration Testing?	Benlong Wu et.al.	2411.01236	link
2024-11-02	A Large-scale Time-aware Agents Simulation for Influencer Selection in Digital Advertising Campaigns	Xiaoqing Zhang et.al.	2411.01143	null
2024-11-01	Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement	Yingwei Ma et.al.	2411.00622	link
2024-10-31	From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents	Nalin Tiwary et.al.	2410.23555	null
2024-10-30	Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration	Yanchu Guan et.al.	2410.22916	null
2024-10-29	SceneGenAgent: Precise Industrial Scene Generation with Coding Agent	Xiao Xia et.al.	2410.21909	link
2024-10-28	Guide-LLM: An Embodied LLM Agent and Text-Based Topological Map for Robotic Guidance of People with Visual Impairments	Sangmim Song et.al.	2410.20666	null
2024-10-29	Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting	Mohamed Salim Aissi et.al.	2410.19920	null
2024-11-07	GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration	Xin Li et.al.	2410.18032	link
2024-10-25	MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting	Sungil Seok et.al.	2410.18012	null
2024-10-22	SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning	Yizhou Chi et.al.	2410.17238	link
2024-10-22	Adsorb-Agent: Autonomous Identification of Stable Adsorption Configurations via Large Language Model Agent	Janghoon Ock et.al.	2410.16658	link
2024-10-21	NetSafe: Exploring the Topological Safety of Multi-agent Networks	Miao Yu et.al.	2410.15686	null
2024-10-20	When Machine Unlearning Meets Retrieval-Augmented Generation (RAG): Keep Secret or Forget Knowledge?	Shang Wang et.al.	2410.15267	null
2024-10-19	SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation	Jingxuan Chen et.al.	2410.15164	link
2024-10-18	Agents4PLC: Automating Closed-loop PLC Code Generation and Verification in Industrial Control Systems using LLM-based Agents	Zihan Liu et.al.	2410.14209	link
2024-10-18	SRAP-Agent: Simulating and Optimizing Scarce Resource Allocation Policy with LLM-based Agent	Jiarui Ji et.al.	2410.14152	link
2024-10-17	AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents	Ke Yang et.al.	2410.13825	null
2024-10-17	Rapid and Automated Alloy Design with Graph Neural Network-Powered LLM-Driven Multi-Agent Systems	Alireza Ghafarollahi et.al.	2410.13768	null

(back to top)

Tool learning

Publish Date	Title	Authors	PDF	Code
2025-02-17	ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models	Hanxing Ding et.al.	2502.11404	link
2025-02-17	Mimicking the Familiar: Dynamic Command Generation for Information Theft Attacks in LLM Tool-Learning System	Ziyou Jiang et.al.	2502.11358	null
2025-02-14	RTBAS: Defending LLM Agents Against Prompt Injection and Privacy Leakage	Peter Yong Zhong et.al.	2502.08966	null
2025-02-03	Tool Unlearning for Tool-Augmented LLMs	Jiali Cheng et.al.	2502.01083	null
2025-01-30	ACEBench: Who Wins the Match Point in Tool Learning?	Chen Chen et.al.	2501.12851	null
2025-01-21	Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation	Dongsheng Zhu et.al.	2501.12432	null
2024-12-11	GraphTool-Instruction: Revolutionizing Graph Reasoning in LLMs through Decomposed Subtask Instruction	Rongzheng Wang et.al.	2412.12152	null
2024-12-11	Federated In-Context LLM Agent Learning	Panlong Wu et.al.	2412.08054	null
2024-12-08	TOOL-ED: Enhancing Empathetic Response Generation with the Tool Calling Capability of LLM	Huiying Cao et.al.	2412.03096	link
2024-10-15	Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option	Konstantin Yakovlev et.al.	2410.12004	null
2025-01-07	NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models	Han Han et.al.	2410.11805	link
2024-10-10	From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions	Changle Qu et.al.	2410.08197	link
2025-02-18	StepTool: Enhancing Multi-Step Tool Usage in LLMs through Step-Grained Reinforcement Learning	Yuanqing Yu et.al.	2410.07745	link
2025-02-24	Learning Evolving Tools for Large Language Models	Guoxin Chen et.al.	2410.06617	link
2024-10-08	ToolGen: Unified Tool Retrieval and Calling via Generation	Renxi Wang et.al.	2410.03439	link
2024-09-23	CITI: Enhancing Tool Utilizing Ability in Large Language Models without Sacrificing General Performance	Yupu Hao et.al.	2409.13202	link
2024-09-02	ToolACE: Winning the Points of LLM Function Calling	Weiwen Liu et.al.	2409.00920	null
2025-02-16	Learning to Ask: When LLM Agents Meet Unclear Instruction	Wenxuan Wang et.al.	2409.00557	null
2024-10-08	MetaTool: Facilitating Large Language Models to Master Tools with Meta-task Augmentation	Xiaohan Wang et.al.	2407.12871	null
2024-07-02	WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models	Kangyun Ning et.al.	2407.12823	null
2024-07-03	What Affects the Stability of Tool Learning? An Empirical Study on the Robustness of Tool Learning Frameworks	Chengrui Huang et.al.	2407.03007	null
2024-06-28	Simulating Financial Market via Large Language Model based Agents	Shen Gao et.al.	2406.19966	null
2024-09-29	Enhancing Tool Retrieval with Iterative Feedback from Large Language Models	Qiancheng Xu et.al.	2406.17465	link
2024-09-30	Query Routing for Homogeneous Tools: An Instantiation in the RAG Scenario	Feiteng Mu et.al.	2406.12429	null
2024-10-02	Tool-Planner: Task Planning with Clusters across Multiple Tools	Yanming Liu et.al.	2406.03807	link
2024-06-03	A Survey of Useful LLM Evaluation	Ji-Lun Peng et.al.	2406.00936	null
2024-11-04	Tool Learning with Large Language Models: A Survey	Changle Qu et.al.	2405.17935	link
2024-05-24	Let Me Do It For You: Towards LLM Empowered Recommendation via Tool Learning	Yuyue Zhao et.al.	2405.15114	null
2024-05-14	Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark	Mengsong Wu et.al.	2405.08355	link

(back to top)

Embodied AI

Publish Date	Title	Authors	PDF	Code
2025-02-20	CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space	Yong Zhao et.al.	2502.12532	link
2025-02-16	NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM	Zihan Wang et.al.	2502.11142	link
2025-02-14	STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning	Mingcong Lei et.al.	2502.10177	null
2025-02-11	Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning	Yuhang Dong et.al.	2502.09649	null
2025-02-23	EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents	Rui Yang et.al.	2502.09560	null
2025-02-10	Visual Agentic AI for Spatial Reasoning with a Dynamic API	Damiano Marsili et.al.	2502.06787	null
2025-02-09	EvoAgent: Agent Autonomous Evolution with Continual World Model for Long-Horizon Tasks	Tongtong Feng et.al.	2502.05907	null
2025-02-10	Humans Co-exist, So Must Embodied Artificial Agents	Hannah Kuehn et.al.	2502.04809	null
2025-02-04	AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement	Shivam Singh et.al.	2502.02067	link
2025-02-03	Provable Ordering and Continuity in Vision-Language Pretraining for Generalizable Embodied Agents	Zhizhen Zhang et.al.	2502.01218	link
2025-01-31	MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems	Anirudh Chari et.al.	2501.19318	null
2025-01-31	GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling	Pinxin Liu et.al.	2501.18898	link
2025-02-03	UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent	Jianke Zhang et.al.	2501.18867	null
2025-01-29	PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding	Wei Chow et.al.	2501.16411	null
2025-02-13	What if Eye...? Computationally Recreating Vision Evolution	Kushagra Tiwary et.al.	2501.15001	link
2025-01-21	EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents	Zhili Cheng et.al.	2501.11858	link
2025-01-17	Universal Actions for Enhanced Embodied Foundation Models	Jinliang Zheng et.al.	2501.10105	link
2025-01-15	Embodied Scene Understanding for Vision Language Models via MetaVQA	Weizhen Wang et.al.	2501.09167	null
2025-01-10	Semantic Mapping in Indoor Embodied AI -- A Comprehensive Survey and Future Directions	Sonia Raychaudhuri et.al.	2501.05750	null
2025-01-09	ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark	Ronghao Dang et.al.	2501.05031	link
2025-01-29	Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey	Zongxia Li et.al.	2501.02189	link
2025-01-02	Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method	Ruichen Zhang et.al.	2501.01141	null
2024-12-30	UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI	Fangwei Zhong et.al.	2412.20977	null
2024-12-28	FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration	Jia Liu et.al.	2412.20297	null
2024-12-30	Embodied Image Quality Assessment for Robotic Intelligence	Jianbo Zhang et.al.	2412.18774	link
2024-12-24	Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems	Fernando Jia et.al.	2412.18601	link
2024-12-24	VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks	Shiduo Zhang et.al.	2412.18194	null
2024-12-23	Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples	Taewoong Kim et.al.	2412.17288	link
2024-12-25	Offline Reinforcement Learning for LLM Multi-Step Reasoning	Huaijie Wang et.al.	2412.16145	link
2024-12-17	GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding	Haoyi Jiang et.al.	2412.13193	link
2024-12-18	SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents	Sheng Yin et.al.	2412.13178	link
2024-12-16	Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents	Wonje Choi et.al.	2412.11484	null
2024-12-05	TANGO: Training-free Embodied AI Agents for Open-world Tasks	Filippo Ziliotto et.al.	2412.10402	null
2024-12-11	From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons	Andrew Szot et.al.	2412.08442	null
2024-12-23	SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World	Jiaqi Zhang et.al.	2412.07472	link
2024-12-08	InfiniteWorld: A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction	Pengzhen Ren et.al.	2412.05789	link
2024-12-06	TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft	Qian Long et.al.	2412.05255	link
2024-12-06	EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding	Yuqi Wu et.al.	2412.04380	link
2024-12-03	Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental Attacks	Zijiao Yang et.al.	2412.02795	null
2024-12-25	Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation	Yiyuan Pan et.al.	2412.01857	null
2024-12-02	The Bare Necessities: Designing Simple, Effective Open-Vocabulary Scene Graphs	Christina Kassab et.al.	2412.01539	null
2024-12-02	Generating Freeform Endoskeletal Robots	Muhan Li et.al.	2412.01036	null
2024-12-01	STEVE-Audio: Expanding the Goal Conditioning Modalities of Embodied Agents in Minecraft	Nicholas Lenzen et.al.	2412.00949	null
2024-11-30	Benchmark Real-time Adaptation and Communication Capabilities of Embodied Agent in Collaborative Scenarios	Shipeng Liu et.al.	2412.00435	null
2024-11-28	CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos	Xinhao Liu et.al.	2411.17820	link
2024-12-15	3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning	Yuncong Yang et.al.	2411.17735	null
2024-11-26	LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble	Yujeong Lee et.al.	2411.17135	null
2024-11-23	Two Heads Are Better Than One: Collaborative LLM Embodied Agents for Human-Robot Interaction	Mitchell Rosser et.al.	2411.16723	null
2024-11-25	TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation	Linqing Zhong et.al.	2411.16425	null
2024-12-04	Functionality understanding and segmentation in 3D scenes	Jaime Corsetti et.al.	2411.16310	null
2024-11-25	Open-Vocabulary Octree-Graph for 3D Scene Understanding	Zhigang Wang et.al.	2411.16253	null
2024-11-27	XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models	Yixin Dong et.al.	2411.15100	null
2024-11-20	AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents	Kevin Godin-Dubois et.al.	2411.13072	null
2024-11-25	MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Collaborative Learning	Mircea Lică et.al.	2411.12977	null
2024-11-15	Voxel-Aggergated Feature Synthesis: Efficient Dense Mapping for Simulated 3D Reasoning	Owen Burns et.al.	2411.10616	null
2024-11-13	NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation	Youzhi Liu et.al.	2411.08579	null
2024-11-08	Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction	Émiland Garrabé et.al.	2411.05474	null
2024-11-07	MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation	Sayan Paul et.al.	2411.04796	null
2024-11-07	CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation	Jie Liu et.al.	2411.04679	null
2024-11-07	Scaling Laws for Pre-training Agents and World Models	Tim Pearce et.al.	2411.04434	null
2024-11-05	VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation	Haochen Zhang et.al.	2411.03540	link
2024-11-04	ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation	Hengkai Tan et.al.	2411.01850	null
2024-11-05	Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge	Weihua Du et.al.	2411.01796	link
2024-10-31	PARTNR: A Benchmark for Planning and Reasoning in Embodied Multi-agent Tasks	Matthew Chang et.al.	2411.00081	link
2024-10-31	Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use	Jiajun Xi et.al.	2410.24218	link
2024-10-31	Simulating User Agents for Embodied Conversational-AI	Daniel Philipov et.al.	2410.23535	null
2024-10-30	A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment	Matteo G. Mecattaf et.al.	2410.23242	link
2024-10-29	ADAM: An Embodied Causal Agent in Open-World Environments	Shu Yu et.al.	2410.22194	null
2024-10-23	Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments	Luca Barsellotti et.al.	2410.18195	link
2024-10-21	Agent-Based Emulation for Deploying Robot Swarm Behaviors	Ricardo Vega et.al.	2410.16444	null
2024-10-18	Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents	Sabit Hassan et.al.	2410.14141	null
2024-10-17	Goal Inference from Open-Ended Dialog	Rachel Ma et.al.	2410.13957	null
2024-10-15	M2Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes	Sixu Yan et.al.	2410.11402	null
2024-10-14	Embodied Active Learning of Generative Sensor-Object Models	Allison Pinosky et.al.	2410.11130	null
2024-10-16	PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation	Kaidong Zhang et.al.	2410.10394	null
2024-10-12	EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment	Chen Gao et.al.	2410.09604	null
2024-10-05	Semantic Environment Atlas for Object-Goal Navigation	Nuri Kim et.al.	2410.09081	null
2024-11-01	Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making	Manling Li et.al.	2410.07166	link
2024-10-15	M3Bench: Benchmarking Whole-body Motion Generation for Mobile Manipulation in 3D Scenes	Zeyu Zhang et.al.	2410.06678	null
2024-10-08	Entering Real Social World! Benchmarking the Theory of Mind and Socialization Capabilities of LLMs from a First-person Perspective	Guiyang Hou et.al.	2410.06195	link
2024-10-07	How do we Observe Relational Observables?	Emily Adlam et.al.	2410.05508	null

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 2,349 Commits
.github		.github
assets		assets
docs		docs
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
daily_arxiv.py		daily_arxiv.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Updated on 2025.02.26

Agent

Large Language Model Agent

Tool learning

Embodied AI

About

Releases

Packages

Languages

License

27yw/cv-arxiv-daily

Folders and files

Latest commit

History

Repository files navigation

Updated on 2025.02.26

Agent

Large Language Model Agent

Tool learning

Embodied AI

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages