Skip to content

🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

License

Notifications You must be signed in to change notification settings

27yw/cv-arxiv-daily

 
 

Repository files navigation

Contributors Forks Stargazers Issues

Updated on 2025.02.26

Usage instructions: here

Table of Contents
  1. Agent
  2. Large Language Model Agent
  3. Tool learning
  4. Embodied AI

Agent

Publish Date Title Authors PDF Code
2025-02-24 Event-Based Limit Order Book Simulation under a Neural Hawkes Process: Application in Market-Making Luca Lalor et.al. 2502.17417 null
2025-02-24 Distributed Coordination for Heterogeneous Non-Terrestrial Networks Jikang Deng et.al. 2502.17366 null
2025-02-24 Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents Prafulla Kumar Choubey et.al. 2502.17321 null
2025-02-24 Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach Jichen Li et.al. 2502.17307 null
2025-02-24 IGDA: Interactive Graph Discovery through Large Language Model Agents Alex Havrilla et.al. 2502.17189 null
2025-02-24 Teleology-Driven Affective Computing: A Causal Framework for Sustained Well-Being Bin Yin et.al. 2502.17172 null
2025-02-24 A Novel Multiple Access Scheme for Heterogeneous Wireless Communications using Symmetry-aware Continual Deep Reinforcement Learning Hamidreza Mazandarani et.al. 2502.17167 null
2025-02-24 Semantic-Aware Dynamic and Distributed Power Allocation: a Multi-UAV Area Coverage Use Case Hamidreza Mazandarani et.al. 2502.17120 null
2025-02-24 Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration Junyang Wang et.al. 2502.17110 null
2025-02-24 Generative Models in Decision Making: A Survey Yinchuan Li et.al. 2502.17100 null
2025-02-24 MA2RL: Masked Autoencoders for Generalizable Multi-Agent Reinforcement Learning Jinyuan Feng et.al. 2502.17046 null
2025-02-24 A data-driven econo-financial stress-testing framework to estimate the effect of supply chain networks on financial systemic risk Jan Fialkowski et.al. 2502.17044 null
2025-02-24 Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs Enea Monzio Compagnoni et.al. 2502.17009 null
2025-02-24 Deep-reinforcement-learning-based separation control in a two-dimensional airfoil Xavier Garcia et.al. 2502.16993 null
2025-02-24 Engineering and Validating Cyber-Physical Energy Systems: Needs, Status Quo, and Research Trends Thomas I. Strasser et.al. 2502.16991 null
2025-02-24 A Multi-LLM-Agent-Based Framework for Economic and Public Policy Analysis Yuzhi Hao et.al. 2502.16879 null
2025-02-24 Graphy'our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data Longbin Lai et.al. 2502.16868 null
2025-02-24 Toward Agentic AI: Generative Information Retrieval Inspired Intelligent Communications and Networking Ruichen Zhang et.al. 2502.16866 null
2025-02-24 Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment Kartik Nagpal et.al. 2502.16863 null
2025-02-24 Grounded Persuasive Language Generation for Automated Marketing Jibang Wu et.al. 2502.16810 null
2025-02-21 AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind Zhining Zhang et.al. 2502.15676 null
2025-02-21 Multi-Agent Architecture in Distributed Environment Control Systems: vision, challenges, and opportunities Natasha Astudillo et.al. 2502.15663 null
2025-02-21 Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network Vincent Hsiao et.al. 2502.15662 null
2025-02-21 Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path? Yoshua Bengio et.al. 2502.15657 null
2025-02-21 A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications Jefferson Silveira et.al. 2502.15649 null
2025-02-21 WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents Xinhang Liu et.al. 2502.15601 null
2025-02-21 SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instrucion Following Evaluation for Social Agents Wenyuan Zhang et.al. 2502.15538 null
2025-02-21 Contract DesignUnderApproximate Best Responses Francesco Bacchiocchi et.al. 2502.15523 null
2025-02-21 SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning Xuyang Li et.al. 2502.15512 null
2025-02-21 Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing Masaya Kobayashi et.al. 2502.15506 null
2025-02-21 Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations Lihu Chen et.al. 2502.15429 null
2025-02-21 TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning Giuseppe Paolo et.al. 2502.15425 null
2025-02-21 Textual-to-Visual Iterative Self-Verification for Slide Generation Yunqing Xu et.al. 2502.15412 null
2025-02-21 LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models Hongchen Wei et.al. 2502.15393 null
2025-02-21 Multi-Group Dynamics with Tolerant Switching in the Kolkata Paise Restaurant Problem with Dining Clubs Akshat Harlalka et.al. 2502.15377 null
2025-02-21 ARS: Automatic Routing Solver with Large Language Models Kai Li et.al. 2502.15359 null
2025-02-21 Learning with Limited Shared Information in Multi-agent Multi-armed Bandit Junning Shao et.al. 2502.15338 null
2025-02-21 DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation Luzhou Ge et.al. 2502.15309 link
2025-02-21 Leader-Follower Formation Tracking Control of Quadrotor UAVs Using Bearing Measurements S. Doodeman et.al. 2502.15303 null
2025-02-21 Collective behaviors of self-propelled particles with tunable alignment angles Zichen Qin et.al. 2502.15301 null
2025-02-20 GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks Jianwen Luo et.al. 2502.14848 null
2025-02-20 Red-Teaming LLM Multi-Agent Systems via Communication Attacks Pengfei He et.al. 2502.14847 null
2025-02-20 Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Yue Yang et.al. 2502.14846 null
2025-02-20 Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models Vlad Sobal et.al. 2502.14819 null
2025-02-20 Optimizing Model Selection for Compound AI Systems Lingjiao Chen et.al. 2502.14815 link
2025-02-20 Byzantine Game Theory: Sun Tzus Boxes Andrei Constantinescu et.al. 2502.14812 null
2025-02-20 Planning, scheduling, and execution on the Moon: the CADRE technology demonstration mission Gregg Rabideau et.al. 2502.14803 null
2025-02-20 A Multi-Agent Perspective on Modern Information Retrieval Haya Nachimovsky et.al. 2502.14796 null
2025-02-20 Making Universal Policies Universal Niklas Höpner et.al. 2502.14777 null
2025-02-20 Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis Priyanka Kargupta et.al. 2502.14767 link
2025-02-20 Multi-Agent Coordination across Diverse Applications: A Survey Lijun Sun et.al. 2502.14743 null
2025-02-20 Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse Michael Doherty et.al. 2502.14741 null
2025-02-20 FLIGHT: Facility Location Integrating Generalized, Holistic Theory of Welfare Avyukta Manjunatha Vummintala et.al. 2502.14732 null
2025-02-20 Ranking Joint Policies in Dynamic Games using Evolutionary Dynamics Natalia Koliou et.al. 2502.14724 link
2025-02-20 Building reliable sim driving agents by scaling self-play Daphne Cornelisse et.al. 2502.14706 null
2025-02-20 I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search Zujie Liang et.al. 2502.14693 null
2025-02-20 BP-SGCN: Behavioral Pseudo-Label Informed Sparse Graph Convolution Network for Pedestrian and Heterogeneous Trajectory Prediction Ruochen Li et.al. 2502.14676 link
2025-02-20 InstructAgent: Building User Controllable Recommender via LLM Agent Wujiang Xu et.al. 2502.14662 link
2025-02-20 Online Envy Minimization and Multicolor Discrepancy: Equivalences and Separations Daniel Halpern et.al. 2502.14624 null
2025-02-20 Curiosity Driven Multi-agent Reinforcement Learning for 3D Game Testing Raihana Ferdous et.al. 2502.14606 link
2025-02-19 Autellix: An Efficient Serving Engine for LLM Agents as General Programs Michael Luo et.al. 2502.13965 null
2025-02-19 LIDDIA: Language-based Intelligent Drug Discovery Agent Reza Averly et.al. 2502.13959 null
2025-02-19 RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision Guangzhi Xiong et.al. 2502.13957 null
2025-02-19 Qwen2.5-VL Technical Report Shuai Bai et.al. 2502.13923 null
2025-02-19 Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health Xingbo Wang et.al. 2502.13920 null
2025-02-19 DataSciBench: An LLM Agent Benchmark for Data Science Dan Zhang et.al. 2502.13897 link
2025-02-19 NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants Yiran Qin et.al. 2502.13894 null
2025-02-19 Enhancing Cross-Domain Recommendations with Memory-Optimized LLM-Based User Agents Jiahao Liu et.al. 2502.13843 null
2025-02-19 ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities Chanjin Zheng et.al. 2502.13832 link
2025-02-19 Learning to explore when mistakes are not allowed Charly Pecqueux-Guézénec et.al. 2502.13801 null
2025-02-19 From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education Yi-Fan Zhang et.al. 2502.13789 null
2025-02-19 Poster: SpiderSim: Multi-Agent Driven Theoretical Cybersecurity Simulation for Industrial Digitalization Jiaqi Li et.al. 2502.13778 link
2025-02-19 Quantile agent utility and implications to randomized social choice Ioannis Caragiannis et.al. 2502.13772 null
2025-02-19 AI Software Engineer: Programming with Trust Abhik Roychoudhury et.al. 2502.13767 null
2025-02-19 GPA: Grover Policy Agent for Generating Optimal Quantum Sensor Circuits Ahmad Alomari et.al. 2502.13755 null
2025-02-19 Kinetic modelling of economic markets with individual and collective transactions Chuandong Lin et.al. 2502.13735 null
2025-02-19 Hierarchical RL-MPC for Demand Response Scheduling Maximilian Bloor et.al. 2502.13714 null
2025-02-19 Parameterized Complexity of Hedonic Games with Enemy-Oriented Preferences Martin Durand et.al. 2502.13703 null
2025-02-19 Causes and Strategies in Multiagent Systems Sylvia S. Kerkhove et.al. 2502.13701 null
2025-02-19 An LLM-based Agent for Reliable Docker Environment Configuration Ruida Hu et.al. 2502.13681 null
2025-02-18 AIDE: AI-Driven Exploration in the Space of Code Zhengyao Jiang et.al. 2502.13138 link
2025-02-18 Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions Taedong Yun et.al. 2502.13135 null
2025-02-18 Magma: A Foundation Model for Multimodal AI Agents Jianwei Yang et.al. 2502.13130 link
2025-02-18 Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning Jingyang Lin et.al. 2502.13127 null
2025-02-18 Approximately Efficient Bilateral Trade with Samples Yuan Deng et.al. 2502.13122 null
2025-02-18 Text2World: Benchmarking Large Language Models for Symbolic World Model Generation Mengkang Hu et.al. 2502.13092 null
2025-02-18 Interactive Agents to Overcome Ambiguity in Software Engineering Sanidhya Vijayvargiya et.al. 2502.13069 link
2025-02-18 Improved Fine-Tuning of Large Multimodal Models for Hateful Meme Detection Jingbiao Mei et.al. 2502.13061 null
2025-02-18 AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks Yurun Chen et.al. 2502.13053 null
2025-02-18 Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks Markus J. Buehler et.al. 2502.13025 null
2025-02-18 Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents Chaoran Chen et.al. 2502.13012 null
2025-02-18 Integrating Reinforcement Learning, Action Model Learning, and Numeric Planning for Tackling Complex Tasks Yarin Benyamin et.al. 2502.13006 link
2025-02-18 You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations Frederic Kirstein et.al. 2502.13001 null
2025-02-18 Free Argumentative Exchanges for Explaining Image Classifiers Avinash Kori et.al. 2502.12995 link
2025-02-18 Generative AI and Information Asymmetry: Impacts on Adverse Selection and Moral Hazard Yukun Zhang et.al. 2502.12969 null
2025-02-18 AI-Enabled Rent-Seeking: How Generative AI Alters Market Transparency and Efficiency Yukun Zhang et.al. 2502.12956 null
2025-02-18 Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options Lakshmi Nair et.al. 2502.12929 link
2025-02-18 SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems Mike Zhang et.al. 2502.12927 null
2025-02-18 Towards more Contextual Agents: An extractor-Generator Optimization Framework Mourad Aouini et.al. 2502.12926 null
2025-02-18 Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation Zheng Yuan et.al. 2502.12911 null
2025-02-17 HARBOR: Exploring Persona Dynamics in Multi-Agent Competition Kenan Jiang et.al. 2502.12149 null
2025-02-17 Scaling Autonomous Agents via Automatic Reward Modeling And Planning Zhenfang Chen et.al. 2502.12130 null
2025-02-17 A-MEM: Agentic Memory for LLM Agents Wujiang Xu et.al. 2502.12110 link
2025-02-17 Relational Norms for Human-AI Cooperation Brian D. Earp et.al. 2502.12102 null
2025-02-17 A Study on Leveraging Search and Self-Feedback for Agent Reasoning Karthikeyan K et.al. 2502.12094 null
2025-02-17 Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation Zhongyi Qiu et.al. 2502.12073 null
2025-02-17 A survey about perceptions of mobility to inform an agent-based simulator of subjective modal choice Carole Adam et.al. 2502.12058 null
2025-02-17 Multi-agent coordination via communication partitions Wei-Chen Lee et.al. 2502.12042 null
2025-02-17 Machine Learning Should Maximize Welfare, Not (Only) Accuracy Nir Rosenfeld et.al. 2502.11981 null
2025-02-17 FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control Yutong Ye et.al. 2502.11937 null
2025-02-17 CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning Yanxiao Zhao et.al. 2502.11896 null
2025-02-17 Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration Shao Zhang et.al. 2502.11882 link
2025-02-17 Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models Hyunwoo Kim et.al. 2502.11881 null
2025-02-17 Does Knowledge About Perceptual Uncertainty Help an Agent in Automated Driving? Natalie Grabowsky et.al. 2502.11864 null
2025-02-17 Can LLM Agents Maintain a Persona in Discourse? Pranav Bhandari et.al. 2502.11843 null
2025-02-17 Assessing the impacts of tradable credit schemes through agent-based simulation Renming Liu et.al. 2502.11822 null
2025-02-17 Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning Peiying Yu et.al. 2502.11799 null
2025-02-17 Personality Editing for Language Models through Relevant Knowledge Editing Seojin Hwang et.al. 2502.11789 null
2025-02-17 Changing the Rules of the Game: Reasoning about Dynamic Phenomena in Multi-Agent Systems Rustam Galimullin et.al. 2502.11785 null
2025-02-17 Plant in Cupboard, Orange on Table, Book on Shelf. Benchmarking Practical Reasoning and Situation Modelling in a Text-Simulated Situated Environment Jonathan Jordan et.al. 2502.11733 null
2025-02-14 Representation and Interpretation in Artificial and Natural Computing Luis A. Pineda et.al. 2502.10383 null
2025-02-14 Agentic Verification for Ambiguous Query Disambiguation Youngwon Lee et.al. 2502.10352 null
2025-02-14 Process Reward Models for LLM Agents: Practical Framework and Directions Sanjiban Choudhury et.al. 2502.10325 link
2025-02-14 Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations Abdelrhman Shaheen et.al. 2502.10303 null
2025-02-14 Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers Aivin V. Solatorio et.al. 2502.10263 null
2025-02-14 Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding Laurin Luttmann et.al. 2502.10233 link
2025-02-14 A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation Redha Taguelmimt et.al. 2502.10226 null
2025-02-14 Do Large Language Models Reason Causally Like Us? Even Better? Hanna M. Dettki et.al. 2502.10215 null
2025-02-14 Dynamic Reinforcement Learning for Actors Katsunari Shibata et.al. 2502.10200 null
2025-02-14 Reinforcement Learning based Constrained Optimal Control: an Interpretable Reward Design Jingjie Ni et.al. 2502.10187 null
2025-02-14 STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning Mingcong Lei et.al. 2502.10177 null
2025-02-14 Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model Bo Ni et.al. 2502.10173 null
2025-02-14 Modeling biases in binary decision-making within the generalized nonlinear q-voter model Maciej Doniec et.al. 2502.10172 null
2025-02-14 Combinatorial Reinforcement Learning with Preference Feedback Joongkyu Lee et.al. 2502.10158 null
2025-02-14 Cooperative Multi-Agent Planning with Adaptive Skill Synthesis Zhiyuan Li et.al. 2502.10148 null
2025-02-14 Provably Efficient RL under Episode-Wise Safety in Linear CMDPs Toshinori Kitamura et.al. 2502.10138 null
2025-02-14 ScamFerret: Detecting Scam Websites Autonomously with Large Language Models Hiroki Nakano et.al. 2502.10110 link
2025-02-14 Causal Information Prioritization for Efficient Reinforcement Learning Hongye Cao et.al. 2502.10097 null
2025-02-14 Enhancing Patient Acceptance of Robotic Ultrasound through Conversational Virtual Agent and Immersive Visualizations Tianyu Song et.al. 2502.10088 null
2025-02-14 Towards Empowerment Gain through Causal Structure Learning in Model-Based RL Hongye Cao et.al. 2502.10077 null
2025-02-13 Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs Siyan Zhao et.al. 2502.09597 link
2025-02-13 KIMAs: A Configurable Knowledge Integrated Multi-Agent System Zitao Li et.al. 2502.09596 null
2025-02-13 Rolling Ahead Diffusion for Traffic Scene Simulation Yunpeng Liu et.al. 2502.09587 null
2025-02-13 Learning to Coordinate with Experts Mohamad H. Danesh et.al. 2502.09583 link
2025-02-13 Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks Qian Wan et.al. 2502.09577 null
2025-02-13 MDCrow: Automating Molecular Dynamics Workflows with Large Language Models Quintina Campbell et.al. 2502.09565 link
2025-02-13 EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Rui Yang et.al. 2502.09560 null
2025-02-13 Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages Shreyan Biswas et.al. 2502.09532 null
2025-02-13 Exact Leader Estimation: A New Approach for Distributed Differentiation Rodrigo Aldana-Lopez et.al. 2502.09529 null
2025-02-13 Forward-backward Contention Resolution Schemes for Fair Rationing Will Ma et.al. 2502.09521 null
2025-02-13 Coupled Rendezvous and Docking Maneuver control of satellite using Reinforcement learning-based Adaptive Fixed-Time Sliding Mode Controller Rakesh Kumar Sahoo et.al. 2502.09517 null
2025-02-13 Package Bids in Combinatorial Electricity Auctions: Selection, Welfare Losses, and Alternatives Thomas Hübner et.al. 2502.09420 link
2025-02-13 Dialectics of antimicrobial peptides I: common mechanisms of offensive and protecting roles of the peptides Marta V. Volovik et.al. 2502.09408 null
2025-02-13 Fair Division via Resource Augmentation Hannaneh Akrami et.al. 2502.09377 null
2025-02-13 Language Agents as Digital Representatives in Collective Decision-Making Daniel Jarrett et.al. 2502.09369 null
2025-02-13 Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning Daniel Koutas et.al. 2502.09298 link
2025-02-13 Reliable Conversational Agents under ASP Control that Understand Natural Language Yankai Zeng et.al. 2502.09237 null
2025-02-13 Pearce's Characterisation in an Epistemic Domain Ezgi Iraz Su et.al. 2502.09221 null
2025-02-13 Mind the Gaps: Logical English, Prolog, and Multi-agent Systems for Autonomous Vehicles Galileo Sartor et.al. 2502.09216 null
2025-02-13 Architecture for Simulating Behavior Mode Changes in Norm-Aware Autonomous Agents Sean Glaze et.al. 2502.09215 null
2025-02-12 Poly-Autoregressive Prediction for Modeling Interactions Neerja Thakkar et.al. 2502.08646 null
2025-02-12 Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs Mantas Mazeika et.al. 2502.08640 null
2025-02-12 SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent Keyeun Lee et.al. 2502.08599 link
2025-02-12 Learning in Markets with Heterogeneous Agents: Dynamics and Survival of Bayesian vs. No-Regret Learners David Easley et.al. 2502.08597 null
2025-02-12 Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks Ang Li et.al. 2502.08586 null
2025-02-12 Statistically validated projection of bipartite signed networks Anna Gallo et.al. 2502.08567 null
2025-02-12 Human-Centric Foundation Models: Perception, Generation and Agentic Modeling Shixiang Tang et.al. 2502.08556 link
2025-02-12 Extreme vulnerability to intruder attacks destabilizes network dynamics Amirhossein Nazerian et.al. 2502.08552 null
2025-02-12 Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation Mahnaz Koupaee et.al. 2502.08514 link
2025-02-12 Resilient Quantized Consensus in Multi-Hop Relay Networks Liwei Yuan et.al. 2502.08455 null
2025-02-12 Non-Monetary Mechanism Design without Distributional Information: Using Scarce Audits Wisely Yan Dai et.al. 2502.08412 null
2025-02-12 Towards Principled Multi-Agent Task Agnostic Exploration Riccardo Zamboni et.al. 2502.08365 null
2025-02-12 Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems Yuxin Pan et.al. 2502.08340 link
2025-02-12 Hierarchical Multi-Agent Framework for Carbon-Efficient Liquid-Cooled Data Center Clusters Soumyendu Sarkar et.al. 2502.08337 null
2025-02-12 Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning Sun Jingbo et.al. 2502.08336 null
2025-02-12 Decentralised multi-agent coordination for real-time railway traffic management Leo D'Amato et.al. 2502.08324 null
2025-02-12 Compromising Honesty and Harmlessness in Language Models via Deception Attacks Laurène Vaugrante et.al. 2502.08301 null
2025-02-12 Higher-order Laplacian dynamics on hypergraphs with cooperative and antagonistic interactions Shaoxuan Cui et.al. 2502.08276 null
2025-02-12 Principles and Framework for the Operationalisation of Meaningful Human Control over Autonomous Systems Simeon C. Calvert et.al. 2502.08255 null
2025-02-12 The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks Alejandro Cuadron et.al. 2502.08235 link
2025-02-11 MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces Loris Gaven et.al. 2502.07709 link
2025-02-11 Human Decision-making is Susceptible to AI-driven Manipulation Sahand Sabour et.al. 2502.07663 null
2025-02-11 Robust-Sorting and Applications to Ulam-Median Ragesh Jaiswal et.al. 2502.07653 null
2025-02-11 Distributed Value Decomposition Networks with Networked Agents Guilherme S. Varela et.al. 2502.07635 null
2025-02-11 Decision-Making Under Complete Uncertainty: You Will Regret Not Being Greedy Kristijan Atanasov et.al. 2502.07593 null
2025-02-11 DMWM: Dual-Mind World Model with Long-Term Imagination Lingyi Wang et.al. 2502.07591 null
2025-02-11 Pure $ε$ -equilibrium in random games Bary S. R. Pradelski et.al. 2502.07585 null
2025-02-11 Genetic evolution of a multi-generational population in the context of interstellar space travels -- Part II: Phenotypic effects of gene expression Frédéric Marin et.al. 2502.07559 null
2025-02-11 Unsupervised Translation of Emergent Communication Ido Levy et.al. 2502.07552 null
2025-02-11 A Near-optimal, Scalable and Corruption-tolerant Framework for Stochastic Bandits: From Single-Agent to Multi-Agent and Beyond Zicheng Hu et.al. 2502.07514 null
2025-02-11 Exploring Word-Representable Temporal Graphs Duncan Adamson et.al. 2502.07496 null
2025-02-11 Multi-Agent Collaboration for Multilingual Code Instruction Tuning Jian Yang et.al. 2502.07487 null
2025-02-11 On Event-Triggered Resilient Consensus Using Auxiliary Layer Pushkal Purohit et.al. 2502.07470 null
2025-02-11 Approximating Human Strategic Reasoning with LLM-Enhanced Recursive Reasoners Leveraging Multi-agent Hypergames Vince Trencsenyi et.al. 2502.07443 null
2025-02-11 Coupling Agent-Based Simulations and VR universes: the case of GAMA and Unity Alexis Drogoul et.al. 2502.07405 null
2025-02-11 FinRL-DeepSeek: LLM-Infused Risk-Sensitive Reinforcement Learning for Trading Agents Mostapha Benhenda et.al. 2502.07393 link
2025-02-11 EvoFlow: Evolving Diverse Agentic Workflows On The Fly Guibin Zhang et.al. 2502.07373 null
2025-02-11 KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems Jusheng Zhang et.al. 2502.07350 null
2025-02-11 The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study Fengming Zhu et.al. 2502.07332 null
2025-02-11 CreAgent: Towards Long-Term Evaluation of Recommender System under Platform-Creator Information Asymmetry Xiaopeng Ye et.al. 2502.07307 link
2025-02-10 Visual Agentic AI for Spatial Reasoning with a Dynamic API Damiano Marsili et.al. 2502.06787 null
2025-02-10 Towards Internet-Scale Training For Agents Brandon Trabucco et.al. 2502.06776 null
2025-02-10 Distributed Constraint-Coupled Optimization: Harnessing ADMM-consensus for robustness Mohamed Abdelmouamin Messilem et.al. 2502.06763 null
2025-02-10 Incentivizing Desirable Effort Profiles in Strategic Classification: The Role of Causality and Uncertainty Valia Efthymiou et.al. 2502.06749 null
2025-02-10 Institutional Preferences in the Laboratory Qiankun Zhong et.al. 2502.06748 null
2025-02-10 Wandering around: A bioinspired approach to visual attention through object motion sensitivity Giulia D Angelo et.al. 2502.06747 link
2025-02-10 AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection Roohan Ahmed Khan et.al. 2502.06725 null
2025-02-10 Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene Tai-Yu Pan et.al. 2502.06682 null
2025-02-10 Quantile Multi-Armed Bandits with 1-bit Feedback Ivan Lau et.al. 2502.06678 null
2025-02-10 Unbiased Evaluation of Large Language Models from a Causal Perspective Meilin Chen et.al. 2502.06655 null
2025-02-10 Enhancing healthcare infrastructure resilience through agent-based simulation methods David Carramiñana et.al. 2502.06636 null
2025-02-10 Hinderance of cooperation by individual solutions: Evolutionary dynamics of three-strategy games combining the prisoner's dilemma and stag hunt Hirofumi Takesue et.al. 2502.06624 null
2025-02-10 Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training Yuchen Zhuang et.al. 2502.06589 null
2025-02-10 Network Creation Games with 2-Neighborhood Maximization Merlin de la Haye et.al. 2502.06561 null
2025-02-10 Marginal Mechanisms For Balanced Exchange Vikram Manjunath et.al. 2502.06499 null
2025-02-10 Utilitarian Distortion with Predictions Aris Filos-Ratsikas et.al. 2502.06489 null
2025-02-10 KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment Yuxing Lu et.al. 2502.06472 link
2025-02-10 A Quadratic Lower Bound for Stable Roommates Solvability Will Rosenbaum et.al. 2502.06464 null
2025-02-10 SIGMA: Sheaf-Informed Geometric Multi-Agent Pathfinding Shuhao Liao et.al. 2502.06440 null
2025-02-10 The AI off-switch problem as a signalling game: bounded rationality and incomparability Alessio benavoli et.al. 2502.06403 null
2025-02-07 Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray Yunhang Shen et.al. 2502.05177 link
2025-02-07 MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison Kaijie Zhu et.al. 2502.05174 null
2025-02-07 From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance Jiamin Xu et.al. 2502.05145 link
2025-02-07 Maximin Share Guarantees for Few Agents with Subadditive Valuations George Christodoulou et.al. 2502.05141 null
2025-02-07 Joint TITE-CRM for Dual Agent Dose Finding Studies Helen Barnett et.al. 2502.05072 null
2025-02-07 Exploring the Generalizability of Geomagnetic Navigation: A Deep Reinforcement Learning approach with Policy Distillation Wenqi Bai et.al. 2502.05069 null
2025-02-07 nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow Geliang Ouyang et.al. 2502.05036 link
2025-02-07 Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency Qixin Zhang et.al. 2502.05028 null
2025-02-07 Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning Tristan K. Schuler et.al. 2502.05014 null
2025-02-07 The Rising Threat to Emerging AI-Powered Search Engines Zeren Luo et.al. 2502.04951 null
2025-02-07 $TAR^2$ : Temporal-Agent Reward Redistribution for Optimal Policy Preservation in Multi-Agent Reinforcement Learning Aditya Kapoor et.al. 2502.04864 null
2025-02-07 Humans Co-exist, So Must Embodied Artificial Agents Hannah Kuehn et.al. 2502.04809 null
2025-02-07 Unified description of viscous, viscoelastic, or elastic thin active films on substrates Henning Reinken et.al. 2502.04802 null
2025-02-07 S $^2$ -MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency Yuting Zeng et.al. 2502.04790 null
2025-02-07 A non-zero-sum game with reinforcement learning under mean-variance framework Junyi Guo et.al. 2502.04788 null
2025-02-07 SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning Wanjia Zhao et.al. 2502.04780 link
2025-02-07 An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks George Papadopoulos et.al. 2502.04773 link
2025-02-07 Shapley Value Approximation Based on k-Additive Games Guilherme Dean Pelegrina et.al. 2502.04763 null
2025-02-07 Every Software as an Agent: Blueprint and Case Study Mengwei Xu et.al. 2502.04747 null
2025-02-07 Multi-Agent Coverage Control in Non-Convex Annulus Region with Conformal Mapping Xun Feng et.al. 2502.04697 null
2025-02-06 ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization Yinjie Wang et.al. 2502.04306 link
2025-02-06 Mutual Multilinearity of Nonequilibrium Network Currents Sara Dal Cengio et.al. 2502.04298 null
2025-02-06 DECAF: Learning to be Fair in Multi-agent Resource Allocation Ashwin Kumar et.al. 2502.04281 null
2025-02-06 Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study Michael Walters et.al. 2502.04249 null
2025-02-06 Multi-agent Architecture Search via Agentic Supernet Guibin Zhang et.al. 2502.04180 null
2025-02-06 Dense Fixed-Wing Swarming using Receding-Horizon NMPC Varun Madabushi et.al. 2502.04174 null
2025-02-06 Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning Wesley A. Suttle et.al. 2502.04141 null
2025-02-06 Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation Jiahao Lu et.al. 2502.04139 null
2025-02-06 VTutor: An Open-Source SDK for Generative AI-Powered Animated Pedagogical Agents with Multi-Media Output Eason Chen et.al. 2502.04103 null
2025-02-06 Strategic Learning with Local Explanations as Feedback Kiet Q. H. Vo et.al. 2502.04058 null
2025-02-06 Simulating the Emergence of Differential Case Marking with Communicating Neural-Network Agents Yuchen Lian et.al. 2502.04038 null
2025-02-06 Deep Meta Coordination Graphs for Multi-agent Reinforcement Learning Nikunj Gupta et.al. 2502.04028 link
2025-02-06 Near-optimal Regret Using Policy Optimization in Online MDPs with Aggregate Bandit Feedback Tal Lancewicki et.al. 2502.04004 null
2025-02-06 Fairness Aware Reinforcement Learning via Proximal Policy Optimization Gabriele La Malfa et.al. 2502.03953 null
2025-02-06 Enhancing Online Learning Efficiency Through Heterogeneous Resource Integration with a Multi-Agent RAG System Devansh Srivastav et.al. 2502.03948 null
2025-02-06 Geometric Stabilization of Virtual Nonlinear Nonholonomic Constraints Efstratios Stratoglou et.al. 2502.03902 null
2025-02-06 Any theory that admits a Wigner's Friend type multi-agent paradox is logically contextual Nuriya Nurgalieva et.al. 2502.03874 null
2025-02-06 PAGNet: Pluggable Adaptive Generative Networks for Information Completion in Multi-Agent Communication Zhuohui Zhang et.al. 2502.03845 null
2025-02-06 PsyPlay: Personality-Infused Role-Playing Conversational Agents Tao Yang et.al. 2502.03821 null
2025-02-06 Large Language Models for Multi-Robot Systems: A Survey Peihan Li et.al. 2502.03814 null
2025-02-05 A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) Yiye Chen et.al. 2502.03450 null
2025-02-05 Prediction of the Most Fire-Sensitive Point in Building Structures with Differentiable Agents for Thermal Simulators Yuan Xinjie et.al. 2502.03424 null
2025-02-05 Energy-Efficient Flying LoRa Gateways: A Multi-Agent Reinforcement Learning Approach Abdullahi Isa Ahmed et.al. 2502.03377 null
2025-02-05 Learning from Active Human Involvement through Proxy Value Propagation Zhenghao Peng et.al. 2502.03369 null
2025-02-05 PalimpChat: Declarative and Interactive AI analytics Chunwei Liu et.al. 2502.03368 null
2025-02-05 Inverse Mixed Strategy Games with Generative Trajectory Models Max Muchen Sun et.al. 2502.03356 null
2025-02-05 Implicit Communication in Human-Robot Collaborative Transport Elvin Yang et.al. 2502.03346 link
2025-02-05 Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes Haotian Wu et.al. 2502.03335 null
2025-02-05 SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs Ben Liu et.al. 2502.03283 null
2025-02-05 Modeling and Optimization of Insulin Injection for Type-1 Diabetes Mellitus Management Rinrada Jadsadaphongphaibool et.al. 2502.03269 null
2025-02-05 iVISPAR -- An Interactive Visual-Spatial Reasoning Benchmark for VLMs Julius Mayer et.al. 2502.03214 link
2025-02-05 MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent Xinyao Liao et.al. 2502.03207 null
2025-02-05 Cooperative Behavior in Pre-State Societies: An Agent-Based Approach of the Axum Civilization Riccardo Vasellini et.al. 2502.03191 null
2025-02-05 Strategizing with AI: Insights from a Beauty Contest Experiment Iuliia Alekseenko et.al. 2502.03158 null
2025-02-05 Group Trip Planning Query Problem with Multimodal Journey Dildar Ali et.al. 2502.03144 null
2025-02-05 Underwater Soft Fin Flapping Motion with Deep Neural Network Based Surrogate Model Yuya Hamamatsu et.al. 2502.03135 link
2025-02-05 Double Distillation Network for Multi-Agent Reinforcement Learning Yang Zhou et.al. 2502.03125 null
2025-02-05 Cooperation, satisfaction, and rationality in social games on complex networks with aspiration-driven players M. Aguilar-Janita et.al. 2502.03109 null
2025-02-05 Learning Efficient Flocking Control based on Gibbs Random Fields Dengyu Zhang et.al. 2502.02984 null
2025-02-05 FedMobileAgent: Training Mobile Agents Using Decentralized Self-Sourced Data from Diverse Users Wenhao Wang et.al. 2502.02982 null
2025-02-04 QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search Zongyu Lin et.al. 2502.02584 link
2025-02-04 Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents Shayan Kiyani et.al. 2502.02561 null
2025-02-04 AAD-DCE: An Aggregated Multimodal Attention Mechanism for Early and Late Dynamic Contrast Enhanced Prostate MRI Synthesis Divya Bharti et.al. 2502.02555 link
2025-02-04 Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks Huiqun Huang et.al. 2502.02537 null
2025-02-04 Adaptive Self-improvement LLM Agentic System for ML Library Development Genghan Zhang et.al. 2502.02534 link
2025-02-04 Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies Han Zhou et.al. 2502.02533 null
2025-02-04 Why human-AI relationships need socioaffective alignment Hannah Rose Kirk et.al. 2502.02528 null
2025-02-04 The Cost Perspective of Liquid Democracy: Feasibility and Control Shiri Alouf-Heffetz et.al. 2502.02380 null
2025-02-04 Mirai: A Wearable Proactive AI "Inner-Voice" for Contextual Nudging Cathy Mengying Fang et.al. 2502.02370 null
2025-02-04 MAGNNET: Multi-Agent Graph Neural Network-based Efficient Task Allocation for Autonomous Vehicles with Deep Reinforcement Learning Lavanya Ratnabala et.al. 2502.02311 null
2025-02-04 Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement Learning Control Donghe Chen et.al. 2502.02265 null
2025-02-04 An altruistic resource-sharing mechanism for synchronization: The energy-speed-accuracy tradeoff Dongliang Zhang et.al. 2502.02242 null
2025-02-04 The Induced Matching Distance: A Novel Topological Metric with Applications in Robotics Javier Perera-Lago et.al. 2502.02112 link
2025-02-04 Sequential Multi-objective Multi-agent Reinforcement Learning Approach for Predictive Maintenance Yan Chen et.al. 2502.02071 null
2025-02-04 AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement Shivam Singh et.al. 2502.02067 link
2025-02-04 Anticipate & Act : Integrating LLMs and Classical Planning for Efficient Task Execution in Household Environments Raghav Arora et.al. 2502.02066 null
2025-02-04 CH-MARL: Constrained Hierarchical Multiagent Reinforcement Learning for Sustainable Maritime Logistics Saad Alqithami et.al. 2502.02060 null
2025-02-04 RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation Minwoo Kim et.al. 2502.02054 null
2025-02-04 Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer Yaodong Yang et.al. 2502.02018 link
2025-02-04 The Wisdom of Intellectually Humble Networks Mohammad Ratul Mahjabin et.al. 2502.02015 link
2025-01-31 Vintix: Action Model via In-Context Reinforcement Learning Andrey Polubarov et.al. 2501.19400 link
2025-01-31 Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game Mustafa O. Karabag et.al. 2501.19398 link
2025-01-31 Learning Contracts in Hierarchical Multi-Agent Systems Antoine Scheid et.al. 2501.19388 null
2025-01-31 The Physics and Metaphysics of Social Powers: Bridging Cognitive Processing and Social Dynamics, a New Perspective on Power through Active Inference Mahault Albarracin et.al. 2501.19368 null
2025-01-31 PixelWorld: Towards Perceiving Everything as Pixels Zhiheng Lyu et.al. 2501.19339 null
2025-01-31 MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems Anirudh Chari et.al. 2501.19318 null
2025-01-31 Objective Metrics for Human-Subjects Evaluation in Explainable Reinforcement Learning Balint Gyevnar et.al. 2501.19256 null
2025-02-03 SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments Hüseyin Aydın et.al. 2501.19245 link
2025-01-31 Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics Xingyu Wang et.al. 2501.19239 null
2025-01-31 A parallelizable variant of HCA* Sreenivasan Ganti et.al. 2501.19218 null
2025-01-31 An Empirical Game-Theoretic Analysis of Autonomous Cyber-Defence Agents Gregory Palmer et.al. 2501.19206 null
2025-01-31 Autonomous Legacy Web Application Upgrades Using a Multi-Agent System Valtteri Ala-Salmi et.al. 2501.19204 link
2025-01-31 A Comunication Framework for Compositional Generation Rafael Elberg et.al. 2501.19182 null
2025-01-31 Augmented Intelligence for Multimodal Virtual Biopsy in Breast Cancer Using Generative Artificial Intelligence Aurora Rofena et.al. 2501.19176 null
2025-01-31 Implications of zero-growth economics analysed with an agent-based model Dylan C. Terry-Doyle et.al. 2501.19168 null
2025-01-31 Test-Time Training Scaling for Chemical Exploration in Drug Design Morgan Thomas et.al. 2501.19153 null
2025-01-31 Constant-Factor Distortion Mechanisms for $k$ -Committee Election Haripriya Pulyassary et.al. 2501.19148 null
2025-01-31 Prediction-Aware Learning in Multi-Agent Systems Aymeric Capitaine et.al. 2501.19144 null
2025-01-31 Imitation Game for Adversarial Disillusion with Multimodal Generative Chain-of-Thought Role-Play Ching-Chun Chang et.al. 2501.19143 null
2025-01-31 Shaping Sparse Rewards in Reinforcement Learning: A Semi-supervised Approach Wenyun Li et.al. 2501.19128 null
2025-01-30 Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method Peter Baile Chen et.al. 2501.18539 null
2025-01-30 Design and Validation of Learning Aware HMI For Learning-Enabled Increasingly Autonomous Systems Parth Ganeriwala et.al. 2501.18506 null
2025-01-30 Graph Exploration with Edge Weight Estimates Matthias Gehnen et.al. 2501.18496 null
2025-01-30 Conversation Games and a Strategic View of the Turing Test Kaveh Aryan et.al. 2501.18455 null
2025-01-30 Stable Marriage: Loyalty vs. Competition Amit Ronen et.al. 2501.18442 null
2025-01-30 Gravity-Bench-v1: A Benchmark on Gravitational Physics Discovery for Agents Nolan Koblischke et.al. 2501.18411 null
2025-01-30 Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach Tianpeng Pan et.al. 2501.18320 null
2025-01-30 Model-Free RL Agents Demonstrate System 1-Like Intentionality Hal Ashton et.al. 2501.18299 null
2025-01-30 CueTip: An Interactive and Explainable Physics-aware Pool Assistant Sean Memery et.al. 2501.18291 null
2025-01-30 Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents ShuiDe Wen et.al. 2501.18190 null
2025-01-30 Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation Teddy Lazebnik et.al. 2501.18177 null
2025-01-30 RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing Jinyao Guo et.al. 2501.18160 null
2025-01-30 Model Checking for Multi-Agent Systems Modeled By Epistemic Process Calculus Qixian Yu et.al. 2501.18155 null
2025-01-30 Utilizing API Response for Test Refinement Devika Sondhi et.al. 2501.18145 null
2025-01-30 B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning Woojun Kim et.al. 2501.18138 null
2025-01-30 DCatalyst: A Unified Accelerated Framework for Decentralized Optimization Tianyu Cao et.al. 2501.18114 null
2025-01-29 Joint Pricing and Resource Allocation: An Optimal Online-Learning Approach Jianyu Xu et.al. 2501.18049 null
2025-01-29 A Case Study in Acceleration AI Ethics: The TELUS GenAI Conversational Agent James Brusseau et.al. 2501.18038 null
2025-01-29 Large Language Models Think Too Fast To Explore Effectively Lan Pan et.al. 2501.18009 null
2025-01-29 Agentic Workflows for Conversational Human-AI Interaction Design Arthur Caetano et.al. 2501.18002 null
2025-01-29 From Sparse to Dense: Toddler-inspired Reward Transition in Goal-Oriented Reinforcement Learning Junseok Park et.al. 2501.17842 null
2025-01-29 A note on the Cucker-Smale model with time delay and communication failures Elisa Continelli et.al. 2501.17743 null
2025-01-29 RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts Eujeong Choi et.al. 2501.17715 link
2025-01-29 Inferring Implicit Goals Across Differing Task Models Silvia Tulli et.al. 2501.17704 null
2025-01-29 CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization Derui Wang et.al. 2501.17667 link
2025-01-29 Multi-Agent Path Finding Using Conflict-Based Search and Structural-Semantic Topometric Maps Scott Fredriksson et.al. 2501.17661 null
2025-01-29 Coalitional control: a bottom-up approach Filiberto Fele et.al. 2501.17614 null
2025-01-29 Coalitional model predictive control of an irrigation canal Filiberto Fele et.al. 2501.17561 null
2025-01-29 Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant Gaole He et.al. 2501.17546 link
2025-01-29 Sequential Learning of the Pareto Front for Multi-objective Bandits Elise Crépon et.al. 2501.17513 link
2025-01-29 Monetary-Fiscal Interaction and the Liquidity of Government Debt Cristiano Cantore et.al. 2501.17458 null
2025-01-29 Human-Aligned Skill Discovery: Balancing Behaviour Exploration and Alignment Maxence Hussonnois et.al. 2501.17431 null
2025-01-29 Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models Yuxuan Li et.al. 2501.17420 null
2025-01-29 General Scene Adaptation for Vision-and-Language Navigation Haodong Hong et.al. 2501.17403 link
2025-01-29 Optimal Utility Design with Arbitrary Information Networks Vartika Singh et.al. 2501.17385 null
2025-01-29 A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning Zhengpeng Xie et.al. 2501.17384 null
2025-01-28 Anomaly Detection in Cooperative Vehicle Perception Systems under Imperfect Communication Ashish Bastola et.al. 2501.17329 null
2025-01-28 A sketch of an AI control safety case Tomek Korbak et.al. 2501.17315 null
2025-01-28 Controlling AI Agent Participation in Group Conversations: A Human-Centered Approach Stephanie Houde et.al. 2501.17258 null
2025-01-28 Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning Rémy Hosseinkhan Boucher et.al. 2501.17115 null
2025-01-28 CRSet: Non-Interactive Verifiable Credential Revocation with Metadata Privacy for Issuers and Everyone Else Felix Hoops et.al. 2501.17089 null
2025-01-28 Learning Mean Field Control on Sparse Graphs Christian Fabian et.al. 2501.17079 null
2025-01-28 Induced Modularity and Community Detection for Functionally Interpretable Reinforcement Learning Anna Soligo et.al. 2501.17077 null
2025-01-28 Context is Key in Agent Security Lillian Tsai et.al. 2501.17070 null
2025-01-28 Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework Longzhong Lin et.al. 2501.17015 null
2025-01-28 Towards Open-Source and Modular Space Systems with ATMOS Pedro Roque et.al. 2501.16973 null
2025-01-28 Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning Xi Chen et.al. 2501.16966 null
2025-01-28 ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations Xinyi Ni et.al. 2501.16945 null
2025-01-28 Beyond Human Intervention: Algorithmic Collusion through Multi-Agent Learning Strategies Suzie Grondin et.al. 2501.16935 null
2025-01-28 Optimization and Learning in Open Multi-Agent Systems Diego Deplano et.al. 2501.16847 null
2025-01-28 RG-Attn: Radian Glue Attention for Multi-modality Multi-agent Cooperative Perception Lantao Li et.al. 2501.16803 null
2025-01-28 A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process Jack David Carson et.al. 2501.16783 null
2025-01-28 Target-driven Self-Distillation for Partial Observed Trajectories Forecasting Pengfei Zhu et.al. 2501.16767 null
2025-01-28 Quantum advantage in decentralized control of POMDPs: A control-theoretic view of the Mermin-Peres square Venkat Anantharam et.al. 2501.16690 null
2025-01-28 MACI: Multi-Agent Collaborative Intelligence for Robust Reasoning and Temporal Planning Edward Y. Chang et.al. 2501.16689 null
2025-01-28 Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting Li Yin et.al. 2501.16673 link
2025-01-28 Jupybara: Operationalizing a Design Space for Actionable Data Analysis and Storytelling with LLMs Huichen Will Wang et.al. 2501.16661 null
2025-01-28 Large Language Model Critics for Execution-Free Evaluation of Code Changes Aashish Yadavally et.al. 2501.16655 link
2025-01-28 More Efficient Sybil Detection Mechanisms Leveraging Resistance of Users to Attack Requests Ali Safarpoor Dehkordi et.al. 2501.16624 link
2025-01-27 LUCY: Linguistic Understanding and Control Yielding Early Stage of Her Heting Gao et.al. 2501.16327 link
2025-01-27 Privacy-aware Nash Equilibrium Synthesis with Partially Ordered LTL $_f$ Objectives Caleb Probine et.al. 2501.16307 null
2025-01-27 Multi-Agent Geospatial Copilots for Remote Sensing Workflows Chaehong Lee et.al. 2501.16254 null
2025-01-27 Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma Richard Willis et.al. 2501.16173 link
2025-01-27 AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants Pascal J. Sager et.al. 2501.16150 null
2025-01-27 Quantifying the Self-Interest Level of Markov Social Dilemmas Richard Willis et.al. 2501.16138 null
2025-01-27 Multi-Agent Meta-Offline Reinforcement Learning for Timely UAV Path Planning and Data Collection Eslam Eldeeb et.al. 2501.16098 null
2025-01-27 Galaxy Era: Agent-based Simulation of Execution Tickets Pascal Stichler et.al. 2501.16090 link
2025-01-27 Value-oriented forecast reconciliation for renewables in electricity markets Honglin Wen et.al. 2501.16086 null
2025-01-27 Generating Spatial Synthetic Populations Using Wasserstein Generative Adversarial Network: A Case Study with EU-SILC Data for Helsinki and Thessaloniki Vanja Falck et.al. 2501.16080 null
2025-01-27 Translating and evaluating single-cell Boolean network interventions in the multiscale setting John Metzcar et.al. 2501.16052 link
2025-01-27 Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting Ahmed Ben Yahmed et.al. 2501.16018 null
2025-01-27 Modeling and stability analysis of live systems with time-varying dimension Andrii Mironchenko et.al. 2501.15991 null
2025-01-27 Online Housing Market Julien Lesca et.al. 2501.15916 null
2025-01-27 Explaining Facial Expression Recognition Sanjeev Nahulanthran et.al. 2501.15864 null
2025-01-27 LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models Yuewen Mei et.al. 2501.15850 null
2025-01-27 The Strong Core of Housing Markets with Partial Order Preferences Ildikó Schlotter et.al. 2501.15834 null
2025-01-27 MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer Qi Chen et.al. 2501.15826 null
2025-01-27 Adaptive AI-based Decentralized Resource Management in the Cloud-Edge Continuum Lanpei Li et.al. 2501.15802 null
2025-01-27 Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs Yu Li et.al. 2501.15791 link
2025-01-24 An Attentive Graph Agent for Topology-Adaptive Cyber Defence Ilya Orson Sandoval et.al. 2501.14700 link
2025-01-24 The Division of Surplus and the Burden of Proof Deniz Kattwinkel et.al. 2501.14686 null
2025-01-24 MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications Yixing Jiang et.al. 2501.14654 link
2025-01-24 Whisper D-SGD: Correlated Noise Across Agents for Differentially Private Decentralized Learning Angelo Rodio et.al. 2501.14644 link
2025-01-24 Fair Division Beyond Monotone Valuations Siddharth Barman et.al. 2501.14609 null
2025-01-24 Hybrid Quantum-Classical Multi-Agent Pathfinding Thore Gerlach et.al. 2501.14568 null
2025-01-24 Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation Wenzhang Liu et.al. 2501.14543 link
2025-01-24 Breaking the Pre-Planning Barrier: Real-Time Adaptive Coordination of Mission and Charging UAVs Using Graph Reinforcement Learning Yuhan Hu et.al. 2501.14488 null
2025-01-24 Avoiding Overfitting in Variable-Order Markov Models: a Cross-Validation Approach Valeria Secchini et.al. 2501.14476 null
2025-01-24 The Pseudo-Dimension of Contracts Paul Duetting et.al. 2501.14474 null
2025-01-24 MARL-OT: Multi-Agent Reinforcement Learning Guided Online Fuzzing to Detect Safety Violation in Autonomous Driving Systems Linfeng Liang et.al. 2501.14451 null
2025-01-24 Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent Lucía Güitta-López et.al. 2501.14443 null
2025-01-24 DeepFlow: Serverless Large Language Model Serving at Scale Junhao Hu et.al. 2501.14417 null
2025-01-24 DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing Xinyu Ma et.al. 2501.14371 link
2025-01-24 Online Inverse Linear Optimization: Improved Regret Bound, Robustness to Suboptimality, and Toward Tight Regret Analysis Shinsaku Sakaue et.al. 2501.14349 null
2025-01-24 Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts Clément Desroches et.al. 2501.14334 null
2025-01-24 MASTER: A Multi-Agent System with LLM Specialized MCTS Bingzheng Gan et.al. 2501.14304 null
2025-01-24 TrajFlow: A Generative Framework for Occupancy Density Estimation Using Normalizing Flows Mitch Kosieradzki et.al. 2501.14266 link
2025-01-24 Non-selective evaporation mechanism of binary aerosol generating agent on porous atomizer and its experimental verification Xie Guoyong et.al. 2501.14262 null
2025-01-24 Optimal Investment under Mutual Strategy Influence among Agents Huisheng Wang et.al. 2501.14259 null
2025-01-23 GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration Yue Fan et.al. 2501.13896 null
2025-01-23 Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning Matyáš Lorenc et.al. 2501.13883 link
2025-01-23 Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems Ethan Wilson et.al. 2501.13878 null
2025-01-23 EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents Yuhui Yun et.al. 2501.13746 null
2025-01-23 Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System Haikuo Du et.al. 2501.13727 link
2025-01-23 A Non-Parametric Approach to Heterogeneity Analysis Avner Seror et.al. 2501.13721 null
2025-01-23 Revisiting Online Learning Approach to Inverse Linear Optimization: A Fenchel--Young Loss Perspective and Gap-Dependent Regret Analysis Shinsaku Sakaue et.al. 2501.13648 null
2025-01-23 WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control Claire Bizon Monroc et.al. 2501.13592 link
2025-01-23 Explainable AI-aided Feature Selection and Model Reduction for DRL-based V2X Resource Allocation Nasir Khan et.al. 2501.13552 null
2025-01-23 Towards a Theory of AI Personhood Francis Rhys Ward et.al. 2501.13533 null
2025-01-23 Communication-Efficient Stochastic Distributed Learning Xiaoxing Ren et.al. 2501.13516 null
2025-01-23 A Polynomial-Time Algorithm for EFX Orientations of Chores Kevin Hsu et.al. 2501.13481 null
2025-01-23 Knowledge-Informed Multi-Agent Trajectory Prediction at Signalized Intersections for Infrastructure-to-Everything Huilin Yin et.al. 2501.13461 null
2025-01-23 BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch Yulong Hu et.al. 2501.13448 null
2025-01-23 VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework He Kong et.al. 2501.13411 link
2025-01-23 Concurrent Learning with Aggregated States via Randomized Least Squares Value Iteration Yan Chen et.al. 2501.13394 null
2025-01-23 Do as We Do, Not as You Think: the Conformity of Large Language Models Zhiyuan Weng et.al. 2501.13381 link
2025-01-23 Task Allocation in Customer-led Two-sided Markets with Satellite Constellation Services Jianglin Qiao et.al. 2501.13364 null
2025-01-23 AgentRec: Agent Recommendation Using Sentence Embeddings Aligned to Human Feedback Joshua Park et.al. 2501.13333 link
2025-01-23 Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents Shrinidhi Kumbhar et.al. 2501.13299 null
2025-01-22 Boosting MCTS with Free Energy Minimization Mawaba Pascal Dao et.al. 2501.13083 null
2025-01-22 Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment Melissa Kazemi Rad et.al. 2501.13080 null
2025-01-22 Evolution and The Knightian Blindspot of Machine Learning Joel Lehman et.al. 2501.13075 null
2025-01-22 Optimizing Return Distributions with Distributional Dynamic Programming Bernardo Ávila Pires et.al. 2501.13028 null
2025-01-22 The regret lower bound for communicating Markov Decision Processes Victor Boone et.al. 2501.13013 null
2025-01-22 MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking Sebastian Farquhar et.al. 2501.13011 null
2025-01-22 Constructive characterisations of the must-preorder for asynchrony Giovanni Bernardi et.al. 2501.13002 null
2025-01-22 An Offline Multi-Agent Reinforcement Learning Framework for Radio Resource Management Eslam Eldeeb et.al. 2501.12991 null
2025-01-22 Learning-based Distributed Model Predictive Control using Multi-Agent Bayesian Optimization Hossein Nejatbakhsh Esfahani et.al. 2501.12989 null
2025-01-22 Quantification of Ultrafast Nonlinear Photothermal and Photoacoustic Effects in Molecular Thin Films via Time-Domain Brillouin Scattering Valentin Cherruault et.al. 2501.12912 null
2025-01-22 FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Zhenran Xu et.al. 2501.12909 null
2025-01-22 Mutation-Guided LLM-based Test Generation at Meta Christopher Foster et.al. 2501.12862 null
2025-01-22 ACEBench: Who Wins the Match Point in Tool Learning? Chen Chen et.al. 2501.12851 null
2025-01-22 To Measure or Not: A Cost-Sensitive, Selective Measuring Environment for Agricultural Management Decisions with Reinforcement Learning Hilmy Baja et.al. 2501.12823 link
2025-01-22 PSGSL: A Probabilistic Framework Integrating Semantic Scene Understanding and Gas Sensing for Gas Source Localization Pepe Ojeda et.al. 2501.12812 null
2025-01-22 Information Design for Adaptive Organizations Wataru Tamura et.al. 2501.12669 null
2025-01-22 NBDI: A Simple and Efficient Termination Condition for Skill Extraction from Task-Agnostic Demonstrations Myunsoo Kim et.al. 2501.12668 null
2025-01-22 Optimal Rebate Design: Incentives, Competition and Efficiency in Auction Markets Thibaut Mastrolia et.al. 2501.12591 null
2025-01-22 Leveraging LLMs to Create a Haptic Devices' Recommendation System Yang Liu et.al. 2501.12573 null
2025-01-21 Reinforcement Learning Constrained Beam Search for Parameter Optimization of Paper Drying Under Flexible Constraints Siyuan Chen et.al. 2501.12542 null
2025-01-21 Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists Thomas F. Eisenmann et.al. 2501.12374 link
2025-01-21 UI-TARS: Pioneering Automated GUI Interaction with Native Agents Yujia Qin et.al. 2501.12326 link
2025-01-21 Transitions to synchronization in adaptive multilayer networks with higher-order interactions Richita Ghosh et.al. 2501.12301 null
2025-01-21 mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework Bingyi Liu et.al. 2501.12263 null
2025-01-21 Multi-Agent Feedback Motion Planning using Probably Approximately Correct Nonlinear Model Predictive Control Mark Gonzales et.al. 2501.12234 null
2025-01-21 Empower Healthcare through a Self-Sovereign Identity Infrastructure for Secure Electronic Health Data Access Antonio López Martínez et.al. 2501.12229 null
2025-01-21 Convergence of time-delayed opinion dynamics with complex interaction types Lingling Yao et.al. 2501.12219 null
2025-01-21 RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression Uri Gadot et.al. 2501.12216 null
2025-01-21 Experience-replay Innovative Dynamics Tuo Zhang et.al. 2501.12199 null
2025-01-21 Opinion dynamics in bounded confidence models with manipulative agents: Moving the Overton window A. Bautista et.al. 2501.12198 null
2025-01-21 BotDetect: A Decentralized Federated Learning Framework for Detecting Financial Bots on the EVM Blockchains Ahmed Mounsf Rafik Bendada et.al. 2501.12112 null
2025-01-21 Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics Somnath Hazra et.al. 2501.12061 link
2025-01-21 Growth model with externalities for energetic transition via MFG with common external variable Pierre Lavigne et.al. 2501.11988 null
2025-01-21 Simultaneously decoding the unknown stationary state and function parameters for mean field games Hongyu Liu et.al. 2501.11955 null
2025-01-21 GLAM: Global-Local Variation Awareness in Mamba-based World Model Qian He et.al. 2501.11949 null
2025-01-21 Equilibria under Dynamic Benchmark Consistency in Non-Stationary Multi-Agent Systems Ludovico Crippa et.al. 2501.11897 null
2025-01-21 Connection-Coordination Rapport (CCR) Scale: A Dual-Factor Scale to Measure Human-Robot Rapport Ting-Han Lin et.al. 2501.11887 null
2025-01-21 Developing an Agent-Based Mathematical Model for Simulating Post-Irradiation Cellular Response: A Crucial Component of a Digital Twin Framework for Personalized Radiation Treatment Ruirui Liu et.al. 2501.11875 null
2025-01-21 LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems Venkata Sai Aswath Duvvuru et.al. 2501.11864 null
2025-01-21 EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Zhili Cheng et.al. 2501.11858 link
2025-01-17 Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems Weibo Gao et.al. 2501.10332 null
2025-01-17 Towards Human-Guided, Data-Centric LLM Co-Pilots Evgeny Saveliev et.al. 2501.10321 null
2025-01-17 Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling Suvodip Dey et.al. 2501.10316 link
2025-01-17 Enhancing AI Transparency: XRL-Based Resource Management and RAN Slicing for 6G ORAN Architecture Suvidha Mhatre et.al. 2501.10292 null
2025-01-17 Evidence for the gravity-driven and magnetically-regularized gas flows feeding the massive protostellar cluster in Cep A Panigrahy Sandhyarani et.al. 2501.10280 null
2025-01-17 Grey-Box Fuzzing in Constrained Ultra-Large Systems: Lessons for SE Community Jiazhao Yu et.al. 2501.10269 null
2025-01-17 Deployment of an Aerial Multi-agent System for Automated Task Execution in Large-scale Underground Mining Environments Niklas Dahlquist et.al. 2501.10262 null
2025-01-17 Logarithmic Regret for Nonlinear Control James Wang et.al. 2501.10261 null
2025-01-17 Secure Semantic Communication With Homomorphic Encryption Rui Meng et.al. 2501.10182 null
2025-01-17 PaSa: An LLM Agent for Comprehensive Academic Paper Search Yichen He et.al. 2501.10120 link
2025-01-17 GAWM: Global-Aware World Model for Multi-Agent Reinforcement Learning Zifeng Shi et.al. 2501.10116 null
2025-01-17 Infrastructure for AI Agents Alan Chan et.al. 2501.10114 null
2025-01-17 LLM Reasoner and Automated Planner: A new NPC approach Israel Puerta-Merino et.al. 2501.10106 null
2025-01-17 Universal Actions for Enhanced Embodied Foundation Models Jinliang Zheng et.al. 2501.10105 link
2025-01-17 A Survey on LLM Test-Time Compute via Search: Tasks, LLM Profiling, Search Algorithms, and Relevant Frameworks Xinzhe Li et.al. 2501.10069 null
2025-01-17 Agent-as-Judge for Factual Summarization of Long Narratives Yeonseok Jeong et.al. 2501.09993 link
2025-01-17 A Survey on Multi-Turn Interaction Capabilities of Large Language Models Chen Zhang et.al. 2501.09959 null
2025-01-17 ForestProtector: An IoT Architecture Integrating Machine Vision and Deep Reinforcement Learning for Efficient Wildfire Monitoring Kenneth Bonilla-Ormachea et.al. 2501.09926 null
2025-01-17 Towards A Litmus Test for Common Sense Hugo Latapie et.al. 2501.09913 null
2025-01-17 Chatbot apologies: Beyond bullshit P. D. Magnus et.al. 2501.09910 null
2025-01-16 CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education Tianyu Wang et.al. 2501.09709 link
2025-01-16 The Goofus & Gallant Story Corpus for Practical Value Alignment Md Sultan Al Nahian et.al. 2501.09707 null
2025-01-16 Authenticated Delegation and Authorized AI Agents Tobin South et.al. 2501.09674 null
2025-01-16 NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes Nathaniel S. Keplinger et.al. 2501.09646 link
2025-01-16 Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework Yushen Lin et.al. 2501.09631 null
2025-01-16 A Multi-agent System for Hybrid Optimization Eric S. Fraga et.al. 2501.09563 null
2025-01-16 Solving the unsolvable: Translating case law in Hong Kong King-kui Sin et.al. 2501.09444 null
2025-01-16 ADAGE: A generic two-layer framework for adaptive agent based modelling Benjamin Patrick Evans et.al. 2501.09429 null
2025-01-16 AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling Ancheng Xu et.al. 2501.09426 null
2025-01-16 Agent-Based Simulation of a Perpetual Futures Market Ramshreyas Rao et.al. 2501.09404 null
2025-01-16 The sleeping bacterium: shedding light on the resuscitation mechanism Eleonora Alfinito et.al. 2501.09366 null
2025-01-16 YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks Saptarashmi Bandyopadhyay et.al. 2501.09355 null
2025-01-16 ChartInsighter: An Approach for Mitigating Hallucination in Time-series Chart Summary Generation with A Benchmark Dataset Fen Wang et.al. 2501.09349 link
2025-01-16 Solving Infinite-Player Games with Player-to-Strategy Networks Carlos Martin et.al. 2501.09330 null
2025-01-16 On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression Zichang Ge et.al. 2501.09327 link
2025-01-16 SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs Anbang Ye et.al. 2501.09316 null
2025-01-16 Interoceptive Robots for Convergent Shared Control in Collaborative Construction Work Xiaoshan Zhou et.al. 2501.09290 link
2025-01-16 Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks Muhammad Ahmed Mohsin et.al. 2501.09212 link
2025-01-15 Embodied Scene Understanding for Vision Language Models via MetaVQA Weizhen Wang et.al. 2501.09167 null
2025-01-15 AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning Assaf Lahiany et.al. 2501.09160 null
2025-01-15 Personality Modeling for Persuasion of Misinformation using AI Agent Qianmin Lou et.al. 2501.08985 null
2025-01-15 Physical AI Agents: Integrating Cognitive Intelligence with Real-World Action Fouad Bousetouane et.al. 2501.08944 null
2025-01-15 A Reinforcement Learning Approach to Quiet and Safe UAM Traffic Management Surya Murthy et.al. 2501.08941 null
2025-01-15 Disentangling Exploration of Large Language Models by Optimal Exploitation Tim Grams et.al. 2501.08925 null
2025-01-15 Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning Qinyu Ma et.al. 2501.08897 link
2025-01-15 Silent Abandonment in Text-Based Contact Centers: Identifying, Quantifying, and Mitigating its Operational Impacts Antonio Castellanos et.al. 2501.08869 null
2025-01-15 The geometry of moral decision making Roland M. Friedrich et.al. 2501.08865 null
2025-01-15 On the Dominance of Truth-Telling in Gradual Mechanisms Wenqian Wang et.al. 2501.08802 null
2025-01-15 Networked Agents in the Dark: Team Value Learning under Partial Observability Guilherme S. Varela et.al. 2501.08778 null
2025-01-15 Leveraging LLM Agents for Translating Network Configurations Yunze Wei et.al. 2501.08760 null
2025-01-15 Efficient Shape Reconfiguration by Hybrid Programmable Matter Jonas Friemel et.al. 2501.08663 null
2025-01-15 Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance Raúl Arranz et.al. 2501.08655 null
2025-01-15 Towards Intelligent Active Particles Hartmut Löwen et.al. 2501.08632 null
2025-01-15 Neural Risk-sensitive Satisficing in Contextual Bandits Shogo Ito et.al. 2501.08612 null
2025-01-15 AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL Tyler Stennett et.al. 2501.08600 null
2025-01-15 Effects of taxes, redistribution actions and fiscal evasion on wealth inequality: an agent-based model approach Iago Nascimento Barros et.al. 2501.08573 null
2025-01-15 Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation Jiaxin Guo et.al. 2501.08523 null
2025-01-15 Ensuring Truthfulness in Distributed Aggregative Optimization Ziqin Chen et.al. 2501.08512 null
2025-01-14 Empathetic Conversational Agents: Utilizing Neural and Physiological Signals for Enhanced Empathetic Interactions Nastaran Saffaryazdi et.al. 2501.08393 null
2025-01-14 ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations Ziyuan Huang et.al. 2501.08324 null
2025-01-14 Using Gamified Experiments to Tame Complexity: the case of the Schelling Model of Segregation Aleix Nicolás Olivé et.al. 2501.08280 null
2025-01-14 Addressing the sustainable AI trilemma: a case study on LLM agents and RAG Hui Wu et.al. 2501.08262 null
2025-01-14 Engineering LLM Powered Multi-agent Framework for Autonomous CloudOps Kannan Parthasarathy et.al. 2501.08243 null
2025-01-14 Dynamic Pricing in High-Speed Railways Using Multi-Agent Reinforcement Learning Enrique Adrian Villarrubia-Martin et.al. 2501.08234 null
2025-01-14 ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems Mohita Chowdhury et.al. 2501.08208 null
2025-01-14 An Elementary Microscopic Model of Sympatric Speciation Franco Bagnoli et.al. 2501.08130 null
2025-01-14 Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving Guizhe Jin et.al. 2501.08096 null
2025-01-14 AgentPose: Progressive Distribution Alignment via Feature Agent for Human Pose Distillation Feng Zhang et.al. 2501.08088 null
2025-01-14 CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning Guoliang He et.al. 2501.08071 link
2025-01-14 Hydrodynamics-driven phase-locking and collective motility of sessile active dumbbells Urvi Mahendra Bora et.al. 2501.08065 null
2025-01-14 Cooperative Patrol Routing: Optimizing Urban Crime Surveillance through Multi-Agent Reinforcement Learning Juan Palma-Borda et.al. 2501.08020 link
2025-01-14 Decentralized Learning with Approximate Finite-Time Consensus Aaron Fainman et.al. 2501.07967 null
2025-01-14 Governing AI Agents Noam Kolt et.al. 2501.07913 null
2025-01-14 Flow: A Modular Approach to Automated Agentic Workflow Generation Boye Niu et.al. 2501.07834 null
2025-01-14 Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models Dhruv Dhamani et.al. 2501.07815 null
2025-01-14 Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering Feijie Wu et.al. 2501.07813 null
2025-01-14 CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation Ruwei Pan et.al. 2501.07811 null
2025-01-14 Visual Language Models as Operator Agents in the Space Domain Alejandro Carrasco et.al. 2501.07802 null
2025-01-13 CBS with Continuous-Time Revisit Andy Li et.al. 2501.07744 null
2025-01-13 WebWalker: Benchmarking LLMs in Web Traversal Jialong Wu et.al. 2501.07572 link
2025-01-13 SafeSwarm: Decentralized Safe RL for the Swarm of Drones Landing in Dense Crowds Grik Tadevosyan et.al. 2501.07566 null
2025-01-13 SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing Varun Biyyala et.al. 2501.07554 link
2025-01-13 Evaluating Agent-based Program Repair at Google Pat Rondon et.al. 2501.07531 null
2025-01-13 Improving DeFi Accessibility through Efficient Liquidity Provisioning with Deep Reinforcement Learning Haonan Xu et.al. 2501.07508 null
2025-01-13 How low-cost AI universal approximators reshape market efficiency Paolo Barucca et.al. 2501.07489 null
2025-01-13 SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM) Xiang Cheng et.al. 2501.07459 link
2025-01-13 Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI Rolf Pfister et.al. 2501.07458 null
2025-01-13 Online inductive learning from answer sets for efficient reinforcement learning exploration Celeste Veronese et.al. 2501.07445 null
2025-01-13 Attention when you need Lokesh Boominathan et.al. 2501.07440 null
2025-01-13 Lifelong Learning of Large Language Model based Agents: A Roadmap Junhao Zheng et.al. 2501.07278 link
2025-01-13 Multi-face emotion detection for effective Human-Robot Interaction Mohamed Ala Yahyaoui et.al. 2501.07213 null
2025-01-13 Combined effect of incentives and coupling in multigames in two-layer networks Luo-Luo Jiang et.al. 2501.07193 null
2025-01-13 TIMRL: A Novel Meta-Reinforcement Learning Framework for Non-Stationary and Multi-Task Environments Chenyang Qi et.al. 2501.07146 null
2025-01-13 How GPT learns layer by layer Jason Du et.al. 2501.07108 link
2025-01-13 PPO-Q: Proximal Policy Optimization with Parametrized Quantum Policies or Values Yu-Xin Jin et.al. 2501.07085 null
2025-01-13 PoAct: Policy and Action Dual-Control Agent for Generalized Applications Guozhi Yuan et.al. 2501.07054 null
2025-01-13 Differentially Private Kernelized Contextual Bandits Nikola Pavlovic et.al. 2501.07046 null
2025-01-12 Learning Implicit Social Navigation Behavior using Deep Inverse Reinforcement Learning Tribhi Kathuria et.al. 2501.06946 null
2025-01-12 AdaSlicing: Adaptive Online Network Slicing under Continual Network Dynamics in Open Radio Access Networks Ming Zhao et.al. 2501.06943 null
2025-01-10 PEACE: Empowering Geologic Map Holistic Understanding with MLLMs Yangyu Huang et.al. 2501.06184 null
2025-01-10 A Mixed-Integer Conic Program for the Multi-Agent Moving-Target Traveling Salesman Problem Allen George Philip et.al. 2501.06130 null
2025-01-10 Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation Guojun Xiong et.al. 2501.06103 null
2025-01-10 Learning Flexible Heterogeneous Coordination with Capability-Aware Shared Hypernetworks Kevin Fu et.al. 2501.06058 link
2025-01-10 Investigating the Impact of Observation Space Design Choices On Training Reinforcement Learning Solutions for Spacecraft Problems Nathaniel Hamilton et.al. 2501.06016 null
2025-01-10 Enhanced Acoustic Beamforming with Sub-Aperture Angular Multiply and Sum -- in vivo and in Human Demonstration Matthieu Toulemonde et.al. 2501.05837 null
2025-01-10 CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech Madhurananda Pahar et.al. 2501.05755 null
2025-01-10 Semantic Mapping in Indoor Embodied AI -- A Comprehensive Survey and Future Directions Sonia Raychaudhuri et.al. 2501.05750 null
2025-01-10 How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond Chen Huang et.al. 2501.05714 null
2025-01-10 Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Vighnesh Subramaniam et.al. 2501.05707 null
2025-01-10 A Two-timescale Primal-dual Algorithm for Decentralized Optimization with Compression Haoming Liu et.al. 2501.05701 null
2025-01-10 Scaling Safe Multi-Agent Control for Signal Temporal Logic Specifications Joe Eappen et.al. 2501.05639 link
2025-01-09 Towards Probabilistic Inference of Human Motor Intentions by Assistive Mobile Robots Controlled via a Brain-Computer Interface Xiaoshan Zhou et.al. 2501.05610 null
2025-01-09 NSChat: A Chatbot System To Rule Them All Zenon Lamprou et.al. 2501.05541 null
2025-01-09 OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? Yifei Li et.al. 2501.05510 link
2025-01-09 Strategy Masking: A Method for Guardrails in Value-based Reinforcement Learning Agents Jonathan Keane et.al. 2501.05501 null
2025-01-09 Search-o1: Agentic Search-Enhanced Large Reasoning Models Xiaoxi Li et.al. 2501.05366 link
2025-01-09 Control of Overpopulated Tails in Kinetic Epidemic Models Mattia Zanella et.al. 2501.05365 null
2025-01-09 A Path Variant of the Explorer Director Game on Graphs Abigail Raz et.al. 2501.05364 null
2025-01-09 On Corrigibility and Alignment in Multi Agent Games Edmund Dable-Heath et.al. 2501.05360 null
2025-01-09 A learning agent-based approach to the characterization of open quantum systems Lorenzo Fioroni et.al. 2501.05350 null
2025-01-09 The Bakers and Millers Game with Restricted Locations Simon Krogmann et.al. 2501.05334 null
2025-01-09 Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning Dmytro Kuzmenko et.al. 2501.05329 null
2025-01-09 Contrast-Free Myocardial Scar Segmentation in Cine MRI using Motion and Texture Fusion Guang Yang et.al. 2501.05241 null
2025-01-09 CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness Shoucheng Song et.al. 2501.05207 null
2025-01-09 Emergence of human-like polarization among large language model agents Jinghua Piao et.al. 2501.05171 null
2025-01-09 Constrained Optimization of Charged Particle Tracking with Multi-Agent Reinforcement Learning Tobias Kortus et.al. 2501.05113 null
2025-01-09 LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models Zengqi Peng et.al. 2501.05057 null
2025-01-09 ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark Ronghao Dang et.al. 2501.05031 link
2025-01-09 CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving Bhargava Uppuluri et.al. 2501.04982 null
2025-01-08 RadGPT: Constructing 3D Image-Text Tumor Datasets Pedro R. A. S. Bassi et.al. 2501.04678 link
2025-01-08 InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Yuhang Liu et.al. 2501.04575 link
2025-01-08 The importance of being discrete -- An agent-based model for active nematics and more Mathieu Dedenon et.al. 2501.04559 null
2025-01-08 Approximately EFX and PO Allocations for Bivalued Chores Zehan Lin et.al. 2501.04550 null
2025-01-08 Cyber-Physical Steganography in Robotic Motion Control Ching-Chun Chang et.al. 2501.04541 null
2025-01-08 Safe Reinforcement Learning with Minimal Supervision Alexander Quessy et.al. 2501.04481 null
2025-01-08 Hybrid Artificial Intelligence Strategies for Drone Navigation Rubén San-Segundo et.al. 2501.04472 null
2025-01-08 A Digital Shadow for Modeling, Studying and Preventing Urban Crime Juan Palma-Borda et.al. 2501.04435 null
2025-01-08 User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation Krisztian Balog et.al. 2501.04410 null
2025-01-08 Agent Laboratory: Using LLM Agents as Research Assistants Samuel Schmidgall et.al. 2501.04227 null
2025-01-08 Unattainability of Common Knowledge in Asymmetric Games with Imperfect Information Fabian Farestam et.al. 2501.04199 null
2025-01-07 HIVEX: A High-Impact Environment Suite for Multi-Agent Research (extended version) Philipp D. Siedler et.al. 2501.04180 null
2025-01-07 Collaborative Spacecraft Servicing under Partial Feedback using Lyapunov-based Deep Neural Networks Cristian F. Nino et.al. 2501.04160 null
2025-01-07 Implementing Systemic Thinking for Automatic Schema Matching: An Agent-Based Modeling Approach Hicham Assoudi et.al. 2501.04136 null
2025-01-07 Kinetic theory of decentralized learning for smart active matter Gerhard Jung et.al. 2501.03948 null
2025-01-07 Implicit Coordination using Active Epistemic Inference Lauren Bramblett et.al. 2501.03907 null
2025-01-07 Truthful mechanisms for linear bandit games with private contexts Yiting Hu et.al. 2501.03865 null
2025-01-07 Rendezfood: A Design Case Study of a Conversational Location-based Approach in Restaurants Philip Weber et.al. 2501.03862 null
2025-01-07 Run-and-tumble chemotaxis using reinforcement learning Ramesh Pramanik et.al. 2501.03687 null
2025-01-07 The Textbook of Tomorrow: Rethinking Course Material Interfacing in the Era of GPT Audrey Olson et.al. 2501.03618 null
2025-01-07 Distributed Observer for Descriptor Linear System: The Luenberger Observer Method Shuai Liu et.al. 2501.03564 null
2025-01-07 Rethinking Adversarial Attacks in Reinforcement Learning from Policy Distribution Perspective Tianyang Duan et.al. 2501.03562 null
2025-01-07 FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis Xiaojiao Xiao et.al. 2501.03526 link
2025-01-07 A Unified Attack Detection Strategy for Multi-Agent Systems over Transient and Steady Stages Jinming Gao et.al. 2501.03496 null
2025-01-06 Designing Telepresence Robots to Support Place Attachment Yaxin Hu et.al. 2501.03420 null
2025-01-06 ScaleMAI: Accelerating the Development of Trusted Datasets and AI Models Wenxuan Li et.al. 2501.03410 link
2025-01-06 Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation Yuhui Zhang et.al. 2501.03225 link
2025-01-06 Turn-based Multi-Agent Reinforcement Learning Model Checking Dennis Gross et.al. 2501.03187 null
2025-01-06 Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning Muyun Li et.al. 2501.03162 null
2025-01-06 Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches Alhassan Mumuni et.al. 2501.03151 null
2025-01-06 Probably Correct Optimal Stable Matching for Two-Sided Markets Under Uncertainty Andreas Athanasopoulos et.al. 2501.03018 link
2025-01-06 Approximating N-Player Nash Equilibrium through Gradient Descent Dongge Wang et.al. 2501.03001 null
2025-01-06 CALM: Curiosity-Driven Auditing for Large Language Models Xiang Zheng et.al. 2501.02997 link
2025-01-06 CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems Chuanbo Hua et.al. 2501.02977 link
2025-01-06 Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis Perspective Chuxiong Sun et.al. 2501.02888 null
2025-01-06 A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation Toomas Tahves et.al. 2501.02858 null
2025-01-06 Proteomic Learning of Gamma-Aminobutyric Acid (GABA) Receptor-Mediated Anesthesia Jian Jiang et.al. 2501.02824 link
2025-01-06 Enhancing Lifelong Multi-Agent Path Finding with Cache Mechanism Yimin Tang et.al. 2501.02803 null
2025-01-06 Gaming on Coincident Peak Shaving: Equilibrium and Strategic Behavior Liudong Chen et.al. 2501.02792 null
2025-01-06 Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes Zijian Wang et.al. 2501.02774 null
2025-01-06 Multi-Agent Path Finding under Limited Communication Range Constraint via Dynamic Leading Hoang-Dung Bui et.al. 2501.02770 null
2025-01-06 Tree-based RAG-Agent Recommendation System: A Case Study in Medical Test Data Yahe Yang et.al. 2501.02727 null
2025-01-05 A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model Shivaram Kalyanakrishnan et.al. 2501.02652 null
2025-01-05 Slow modulation of the contraction patterns in Physarum polycephalum Raphael Saiseau et.al. 2501.02651 null
2025-01-05 LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment Yifei Liu et.al. 2501.02621 null
2025-01-05 Back to Base: Towards Hands-Off Learning via Safe Resets with Reach-Avoid Safety Filters Azra Begzadić et.al. 2501.02620 null
2025-01-03 QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture Shvetank Prakash et.al. 2501.01892 null
2025-01-03 Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification Xiangxiang Dai et.al. 2501.01849 link
2025-01-03 MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning Pu Yang et.al. 2501.01834 null
2025-01-03 SDPO: Segment-Level Direct Preference Optimization for Social Agents Aobo Kong et.al. 2501.01821 link
2025-01-03 Distributed Framework Construction for Affine Formation Control Huiming Li et.al. 2501.01817 null
2025-01-03 Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery Baoru Huang et.al. 2501.01752 null
2025-01-03 Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning Gavin B. Rens et.al. 2501.01727 null
2025-01-03 AgentRefine: Enhancing Agent Generalization through Refinement Tuning Dayuan Fu et.al. 2501.01702 null
2025-01-03 The (Exact) Price of Cardinality for Indivisible Goods: A Parametric Perspective Alexander Lam et.al. 2501.01660 null
2025-01-03 PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents Jingoo Lee et.al. 2501.01594 null
2025-01-03 BLAST: A Stealthy Backdoor Leverage Attack against Cooperative Multi-Agent Deep Reinforcement Learning based Systems Yinbo Yu et.al. 2501.01593 null
2025-01-02 Reinforcement-learning-based control of turbulent channel flows at high Reynolds numbers Zisong Zhou et.al. 2501.01573 null
2025-01-02 BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery Kanishk Gandhi et.al. 2501.01540 link
2025-01-02 In Search of a Lost Metric: Human Empowerment as a Pillar of Socially Conscious Navigation Vasanth Reddy Baddam et.al. 2501.01539 null
2025-01-02 Optimal Strategy Revision in Population Games: A Mean Field Game Theory Perspective Julian Barreiro-Gomez et.al. 2501.01389 null
2025-01-02 PIMAEX: Multi-Agent Exploration through Peer Incentivization Michael Kölle et.al. 2501.01266 null
2025-01-02 Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants Lixiong Qin et.al. 2501.01243 null
2025-01-02 From Interaction to Attitude: Exploring the Impact of Human-AI Cooperation on Mental Illness Stigma Tianqi Song et.al. 2501.01220 null
2025-01-02 D-HAT: a Diatom-inspired structure for a Helmet concept Against Trauma Ludovico Musenich et.al. 2501.01211 null
2025-01-02 Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects Abdullah Mushtaq et.al. 2501.01205 null
2025-01-02 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer Jiajun Deng et.al. 2501.01163 null
2025-01-02 A3: Android Agent Arena for Mobile GUI Agents Yuxiang Chai et.al. 2501.01149 null
2025-01-02 Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method Ruichen Zhang et.al. 2501.01141 null
2025-01-02 Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning Min Whoo Lee et.al. 2501.01140 null
2025-01-02 Symmetries-enhanced Multi-Agent Reinforcement Learning Nikolaos Bousias et.al. 2501.01136 null
2025-01-02 Regularized Proportional Fairness Mechanism for Resource Allocation Without Money Sihan Zeng et.al. 2501.01111 null
2025-01-02 MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model Chengze Zhang et.al. 2501.01014 null
2025-01-02 Cyber-physical Defense for Heterogeneous Multi-agent Systems Against Exponentially Unbounded Attacks on Signed Digraphs Yichao Wang et.al. 2501.00990 null
2025-01-02 Bootstrapped Reward Shaping Jacob Adamczyk et.al. 2501.00989 null
2025-01-01 Non-obvious Manipulability in Hedonic Games with Friends Appreciation Preferences Michele Flammini et.al. 2501.00976 null
2025-01-01 Defense Strategies for Autonomous Multi-agent Systems: Ensuring Safety and Resilience Under Exponentially Unbounded FDI Attacks Yichao Wang et.al. 2501.00973 null
2025-01-01 Intent-based Radio Scheduler for RAN Slicing: Learning to deal with different network scenarios Cleverson Nahum et.al. 2501.00950 link
2025-01-01 Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things Talha Zeeshan et.al. 2501.00906 null
2025-01-01 Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents Fouad Bousetouane et.al. 2501.00881 null
2024-12-30 Distributed Mixture-of-Agents for Edge Inference with Large Language Models Purbesh Mitra et.al. 2412.21200 link
2024-12-30 Aviary: training language agents on challenging scientific tasks Siddharth Narayanan et.al. 2412.21154 null
2024-12-30 Training Software Engineering Agents and Verifiers with SWE-Gym Jiayi Pan et.al. 2412.21139 link
2024-12-30 Positional information trade-offs in boundary-driven reaction-diffusion systems Jonas Berx et.al. 2412.21113 null
2024-12-30 Exploring and Controlling Diversity in LLM-Agent Conversation KuanChao Chu et.al. 2412.21102 null
2024-12-30 Advances in Multi-agent Reinforcement Learning: Persistent Autonomy and Robot Learning Lab Report 2024 Reza Azadeh et.al. 2412.21088 null
2024-12-30 Privacy-Aware Multi-Device Cooperative Edge Inference with Distributed Resource Bidding Wenhao Zhuang et.al. 2412.21069 null
2024-12-30 Plancraft: an evaluation dataset for planning with LLM agents Gautier Dagan et.al. 2412.21033 link
2024-12-30 UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI Fangwei Zhong et.al. 2412.20977 null
2024-12-31 SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity Pengfei Jing et.al. 2412.20787 null
2024-12-30 Joint Scoring Rules: Zero-Sum Competition Avoids Performative Prediction Rubi Hudson et.al. 2412.20732 null
2024-12-30 Modeling and Simulating Agent-Based City Migration Using Conway's Game of Life Bruce Deng et.al. 2412.20691 null
2024-12-30 Blockchain-Empowered Cyber-Secure Federated Learning for Trustworthy Edge Computing Ervin Moore et.al. 2412.20674 null
2024-12-29 The intrinsic motivation of reinforcement and imitation learning for sequential tasks Sao Mai Nguyen et.al. 2412.20573 null
2024-12-29 Game Theory and Multi-Agent Reinforcement Learning : From Nash Equilibria to Evolutionary Dynamics Neil De La Fuente et.al. 2412.20523 null
2024-12-29 Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning Hang Ni et.al. 2412.20505 null
2024-12-29 Exploiting NOMA Transmissions in Multi-UAV-assisted Wireless Networks: From Aerial-RIS to Mode-switching UAVs Songhan Zhao et.al. 2412.20484 null
2024-12-29 SatFlow: Scalable Network Planning for LEO Mega-Constellations Sheng Cen et.al. 2412.20475 null
2024-12-29 Image Augmentation Agent for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.20439 null
2024-12-29 Learning Policies for Dynamic Coalition Formation in Multi-Robot Task Allocation Lucas C. D. Bezerra et.al. 2412.20397 null
2024-12-27 Bottom-up robust modeling for the foraging behavior of Physarum polycephalum Damiano Reginato et.al. 2412.19790 null
2024-12-27 Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration Le Chen et.al. 2412.19770 link
2024-12-27 Can Large Language Models Adapt to Other Agents In-Context? Matthew Riemer et.al. 2412.19726 null
2024-12-27 OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Qiushi Sun et.al. 2412.19723 null
2024-12-27 The Value of Recall in Extensive-Form Games Ratip Emin Berker et.al. 2412.19659 null
2024-12-27 Xmodel-2 Technical Report Wang Qun et.al. 2412.19638 null
2024-12-27 Bidding Games on Markov Decision Processes with Quantitative Reachability Objectives Guy Avni et.al. 2412.19609 null
2024-12-27 Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following Yuxiao Yang et.al. 2412.19562 null
2024-12-27 Quantiles under ambiguity and risk sharing Peng Liu et.al. 2412.19546 null
2024-12-27 TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data Xiang Huang et.al. 2412.19544 link
2024-12-27 Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning Xuan Zhou et.al. 2412.19538 null
2024-12-27 Casevo: A Cognitive Agents and Social Evolution Simulator Zexun Jiang et.al. 2412.19498 link
2024-12-27 Knowledge Graph-Based Multi-Agent Path Planning in Dynamic Environments using WAITR Ted Edward Holmberg et.al. 2412.19469 null
2024-12-27 Online distributed algorithms for mixed equilibrium problems in dynamic environments Hang Xu et.al. 2412.19399 null
2024-12-26 Preventive Energy Management for Distribution Systems Under Uncertain Events: A Deep Reinforcement Learning Approach Md Isfakul Anam et.al. 2412.19382 null
2024-12-26 Minimal Batch Adaptive Learning Policy Engine for Real-Time Mid-Price Forecasting in High-Frequency Trading Adamantios Ntakaris et.al. 2412.19372 null
2024-12-26 xSRL: Safety-Aware Explainable Reinforcement Learning -- Safety as a Product of Explainability Risal Shahriar Shefin et.al. 2412.19311 link
2024-12-26 Reforming an Unfair Allocation by Exchanging Goods Sheung Man Yuen et.al. 2412.19264 null
2024-12-26 Swarm Contract: A Multi-Sovereign Agent Consensus Mechanism Haowei Yang et.al. 2412.19256 null
2024-12-26 VINEVI: A Virtualized Network Vision Architecture for Smart Monitoring of Heterogeneous Applications and Infrastructures Rodrigo Moreira et.al. 2412.19226 null
2024-12-24 Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems Fernando Jia et.al. 2412.18601 link
2024-12-24 Automated Code Review In Practice Umut Cihan et.al. 2412.18531 null
2024-12-24 Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving Hao Pang et.al. 2412.18511 null
2024-12-24 Calibrating the Subjective Mark Whitmeyer et.al. 2412.18486 null
2024-12-24 Multi-Agent Norm Perception and Induction in Distributed Healthcare Chao Li et.al. 2412.18454 null
2024-12-24 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Tatiana Zemskova et.al. 2412.18450 link
2024-12-24 GeAR: Graph-enhanced Agent for Retrieval-augmented Generation Zhili Shen et.al. 2412.18431 null
2024-12-24 Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent Farhad Nooralahzadeh et.al. 2412.18428 link
2024-12-24 GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent Kangjia Zhao et.al. 2412.18426 null
2024-12-24 Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles Zihan Wang et.al. 2412.18416 null
2024-12-24 Contrastive Representation for Interactive Recommendation Jingyu Li et.al. 2412.18396 link
2024-12-24 Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents Kaiwen Ning et.al. 2412.18371 link
2024-12-24 Extracting triples from dialogues for conversational social agents Piek Vossen et.al. 2412.18364 null
2024-12-24 The Thousand Brains Project: A New Paradigm for Sensorimotor Intelligence Viviane Clay et.al. 2412.18354 link
2024-12-24 Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering Zhongjian Hu et.al. 2412.18351 null
2024-12-24 The Constitutional Filter Simon Kohaut et.al. 2412.18347 link
2024-12-24 Learning to Play Against Unknown Opponents Eshwar Ram Arunachaleswaran et.al. 2412.18297 null
2024-12-24 MinsStudio: A Streamlined Package for Minecraft AI Agent Development Shaofei Cai et.al. 2412.18293 link
2024-12-24 Quantum framework for Reinforcement Learning: integrating Markov Decision Process, quantum arithmetic, and trajectory search Thet Htar Su et.al. 2412.18208 null
2024-12-24 VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks Shiduo Zhang et.al. 2412.18194 null
2024-12-23 Observation Interference in Partially Observable Assistance Games Scott Emmons et.al. 2412.17797 null
2024-12-23 ResearchTown: Simulator of Human Research Community Haofei Yu et.al. 2412.17767 link
2024-12-23 Sensitivity Curve Maximization: Attacking Robust Aggregators in Distributed Learning Christian A. Schroth et.al. 2412.17740 null
2024-12-23 Robin Hood Reachability Bidding Games Shaull Almagor et.al. 2412.17718 null
2024-12-23 SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC Yue Deng et.al. 2412.17707 link
2024-12-23 Large Language Model Safety: A Holistic Survey Dan Shi et.al. 2412.17686 link
2024-12-23 Shape and Performance of Fastest Paths over Networks with Interacting Selfish Agents Marco Cogoni et.al. 2412.17665 null
2024-12-23 CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction Yuanyuan Gao et.al. 2412.17612 null
2024-12-23 Fluid-Derived Lattices for Unbiased Modeling of Bacterial Colony Growth Bryan Verhoef et.al. 2412.17604 null
2024-12-23 PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World Yanheng He et.al. 2412.17589 null
2024-12-23 Complete aging in the noisy voter model enhances consensus formation Jaume Llabrés et.al. 2412.17569 null
2024-12-23 DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Jiaan Wang et.al. 2412.17498 link
2024-12-23 A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers Shuaihang Chen et.al. 2412.17481 link
2024-12-23 Should public health policy exempt cases with low viral load from isolation during an epidemic?: a modelling study Jiahao Diao et.al. 2412.17428 null
2024-12-23 Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets Akane Tsuboya et.al. 2412.17344 null
2024-12-23 Multimodal Deep Reinforcement Learning for Portfolio Optimization Sumit Nawathe et.al. 2412.17293 null
2024-12-23 Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples Taewoong Kim et.al. 2412.17288 link
2024-12-23 LegalAgentBench: Evaluating LLM Agents in Legal Domain Haitao Li et.al. 2412.17259 link
2024-12-23 A Coalition Game for On-demand Multi-modal 3D Automated Delivery System Farzan Moosavi et.al. 2412.17252 null
2024-12-22 A Multi-AI Agent System for Autonomous Optimization of Agentic AI Solutions via Iterative Refinement and LLM-Driven Feedback Loops Kamer Ali Yuksel et.al. 2412.17149 null
2024-12-20 Offline Reinforcement Learning for LLM Multi-Step Reasoning Huaijie Wang et.al. 2412.16145 link
2024-12-20 Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information Dirk Bergemann et.al. 2412.16132 null
2024-12-20 Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG Hasan Md Tusfiqur Alam et.al. 2412.16086 link
2024-12-20 Active Flow Control for Bluff Body under High Reynolds Number Turbulent Flow Conditions Using Deep Reinforcement Learning Jingbo Chen et.al. 2412.15975 null
2024-12-20 The multilayer garbage disposal game Hsin-Lun Li et.al. 2412.15942 null
2024-12-20 Speedup Techniques for Switchable Temporal Plan Graph Optimization He Jiang et.al. 2412.15908 null
2024-12-20 Exploring the Effects of AI Nonverbal Emotional Cues on Human Decision Certainty in Moral Dilemmas Chenyi Zhang et.al. 2412.15834 null
2024-12-20 WebLLM: A High-Performance In-Browser LLM Inference Engine Charlie F. Ruan et.al. 2412.15803 link
2024-12-20 FTISS Adaptive Bearing-Only Formation Tracking Control with Unknown Disturbance Rejection Hong Liang Cheah et.al. 2412.15757 null
2024-12-20 Online Optimization Algorithms in Repeated Price Competition: Equilibrium Learning and Algorithmic Collusion Martin Bichler et.al. 2412.15707 null
2024-12-20 Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration Yijia Shao et.al. 2412.15701 link
2024-12-20 AIR: Unifying Individual and Cooperative Exploration in Collective Multi-Agent Reinforcement Learning Guangchong Zhou et.al. 2412.15700 link
2024-12-20 Asynchronous Vector Consensus over Matrix-Weighted Networks P Raghavendra Rao et.al. 2412.15681 null
2024-12-20 Learning Group Interactions and Semantic Intentions for Multi-Object Trajectory Prediction Mengshi Qi et.al. 2412.15673 link
2024-12-20 Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline Guancheng Zeng et.al. 2412.15660 null
2024-12-20 Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement Learning Lunjun Liu et.al. 2412.15639 null
2024-12-20 Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning Chen Jianming et.al. 2412.15619 null
2024-12-20 Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage Zhi Gao et.al. 2412.15606 null
2024-12-20 NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization Danial Kamali et.al. 2412.15588 link
2024-12-20 Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems Joshua Holder et.al. 2412.15573 link
2024-12-19 AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving Shuo Xing et.al. 2412.15206 link
2024-12-19 Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration Junjia Liu et.al. 2412.15166 null
2024-12-19 Operationalising Rawlsian Ethics for Fairness in Norm-Learning Agents Jessica Woodgate et.al. 2412.15163 null
2024-12-19 Equal Merit Does Not Imply Equality: Discrimination at Equilibrium in a Hiring Market with Symmetric Agents Serafina Kamp et.al. 2412.15162 null
2024-12-19 Probabilistic Strategy Logic with Degrees of Observability Chunyan Mu et.al. 2412.15135 null
2024-12-19 From Nonequilibrium to Equilibrium: Insights from a Two-Population Occupation Model Jerome Garnier-Brun et.al. 2412.14996 null
2024-12-19 Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination Leonardo Barcellona et.al. 2412.14957 null
2024-12-19 Long Time Behavior and Stabilization for Displacement Monotone Mean Field Games Marco Cirant et.al. 2412.14903 null
2024-12-19 Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning Anthony Kobanda et.al. 2412.14865 null
2024-12-19 Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning Mohammadreza nakhaei et.al. 2412.14834 link
2024-12-19 Fair Division with Social Impact Michele Flammini et.al. 2412.14818 null
2024-12-19 Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning Ziang Ye et.al. 2412.14780 null
2024-12-19 Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning Aditya Kapoor et.al. 2412.14779 null
2024-12-19 Testing linearity of spatial interaction functions à la Ramsey Abhimanyu Gupta et.al. 2412.14778 null
2024-12-19 PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children Yiqun Zhang et.al. 2412.14769 link
2024-12-19 Active Inference and Human--Computer Interaction Roderick Murray-Smith et.al. 2412.14741 null
2024-12-19 On Verbalized Confidence Scores for LLMs Daniel Yang et.al. 2412.14737 link
2024-12-19 Bel Esprit: Multi-Agent Framework for Building AI Model Pipelines Yunsu Kim et.al. 2412.14684 null
2024-12-19 A Model-free Biomimetics Algorithm for Deterministic Partially Observable Markov Decision Process Yide Yu et.al. 2412.14614 null
2024-12-19 Computational Sociology of Humans and Machines; Conflict and Collaboration Taha Yasseri et.al. 2412.14606 null
2024-12-18 TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Frank F. Xu et.al. 2412.14161 link
2024-12-18 Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report Markus Dablander et.al. 2412.14085 null
2024-12-18 A Computationally Grounded Framework for Cognitive Attitudes (extended version) Tiago de Lima et.al. 2412.14073 null
2024-12-18 Spatio-Temporal SIR Model of Pandemic Spread During Warfare with Optimal Dual-use Healthcare System Administration using Deep Reinforcement Learning Adi Shuchami et.al. 2412.14039 link
2024-12-18 Decentralized Convergence to Equilibrium Prices in Trading Networks Edwin Lock et.al. 2412.13972 null
2024-12-18 Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves Martin Kurečka et.al. 2412.13962 null
2024-12-18 Harvesting energy from turbulent winds with Reinforcement Learning Lorenzo Basile et.al. 2412.13961 null
2024-12-18 Towards privacy-preserving cooperative control via encrypted distributed optimization Philipp Binfet et.al. 2412.13953 null
2024-12-18 Strategyproof Matching of Roommates and Rooms Hadi Hosseini et.al. 2412.13887 null
2024-12-18 Who Saves us From Risk? Altruists Promote Cooperation in a Public Investment Game Shen Zhang et.al. 2412.13816 null
2024-12-18 CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers? Dimitrios Mallis et.al. 2412.13810 null
2024-12-18 Meta-Reflection: A Feedback-Free Reflection Learning Framework Yaoke Wang et.al. 2412.13781 null
2024-12-18 Heuristic Planner for Communication-Constrained Multi-Agent Multi-Goal Path Planning Jáchym Herynek et.al. 2412.13719 null
2024-12-18 A2H: A UI Converter from Android to HarmonyOS Platform Chen Wang et.al. 2412.13693 link
2024-12-18 A hybrid learning agent for episodic learning tasks with unknown target distance Oliver Sefrin et.al. 2412.13686 null
2024-12-18 ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning Jie-Jing Shao et.al. 2412.13682 null
2024-12-18 Exploring Multi-Modal Integration with Tool-Augmented LLM Agents for Precise Causal Discovery ChengAo Shen et.al. 2412.13667 null
2024-12-18 Large Language Model Federated Learning with Blockchain and Unlearning for Cross-Organizational Collaboration Xuhan Zuo et.al. 2412.13551 null
2024-12-18 EscapeBench: Pushing Language Models to Think Outside the Box Cheng Qian et.al. 2412.13549 link
2024-12-18 Models for common knowledge logic Yoshihito Tanaka et.al. 2412.13537 null
2024-12-17 Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents Yifei Zhou et.al. 2412.13194 null
2024-12-17 GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding Haoyi Jiang et.al. 2412.13193 link
2024-12-17 SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents Sheng Yin et.al. 2412.13178 link
2024-12-17 Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs -- A Graph Sequential Embedding Method Jiate Li et.al. 2412.13134 link
2024-12-17 Contract-based Design and Verification of Multi-Agent Systems with Quantitative Temporal Requirements Rafael Dewes et.al. 2412.13114 null
2024-12-17 Active Reinforcement Learning Strategies for Offline Policy Improvement Ambedkar Dukkipati et.al. 2412.13106 null
2024-12-17 AI PERSONA: Towards Life-long Personalization of LLMs Tiannan Wang et.al. 2412.13103 null
2024-12-17 Reservoir Computing for Fast, Simplified Reinforcement Learning on Memory Tasks Kevin McKee et.al. 2412.13093 null
2024-12-17 Distributed Normal Map-based Stochastic Proximal Gradient Methods over Networks Kun Huang et.al. 2412.13054 null
2024-12-17 NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation Karan Wanchoo et.al. 2412.13026 null
2024-12-17 The Emergence of Strategic Reasoning of Large Language Models Dongwoo Lee et.al. 2412.13013 null
2024-12-17 Adaptations of AI models for querying the LandMatrix database in natural language Fatiha Ait Kbir et.al. 2412.12961 link
2024-12-17 4DRGS: 4D Radiative Gaussian Splatting for Efficient 3D Vessel Reconstruction from Sparse-View Dynamic DSA Images Zhentao Liu et.al. 2412.12919 link
2024-12-17 An Agentic Approach to Automatic Creation of P&ID Diagrams from Natural Language Descriptions Shreeyash Gowaikar et.al. 2412.12898 null
2024-12-17 Bayesian Persuasion with Externalities: Exploiting Agent Types Jonathan Shaki et.al. 2412.12859 null
2024-12-17 From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle Kaustubh Vyas et.al. 2412.12839 null
2024-12-17 GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models Mukai Li et.al. 2412.12735 link
2024-12-17 Enhancing Naturalness in LLM-Generated Utterances through Disfluency Insertion Syed Zohaib Hassan et.al. 2412.12710 null
2024-12-17 ParMod: A Parallel and Modular Framework for Learning Non-Markovian Tasks Ruixuan Miao et.al. 2412.12700 null
2024-12-17 Everyday AR through AI-in-the-Loop Ryo Suzuki et.al. 2412.12681 null
2024-12-16 Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives Marius Belly et.al. 2412.12063 link
2024-12-16 Virtual Agent-Based Communication Skills Training to Facilitate Health Persuasion Among Peers Farnaz Nouraei et.al. 2412.12061 null
2024-12-16 Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps Linfeng Zhao et.al. 2412.12024 null
2024-12-16 Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm Rajat Khanda et.al. 2412.12006 null
2024-12-16 CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird's Eye View Perception Senkang Hu et.al. 2412.12000 null
2024-12-16 AlphaZero Neural Scaling and Zipf's Law: a Tale of Board Games and Power Laws Oren Neumann et.al. 2412.11979 link
2024-12-16 Learning Human-Aware Robot Policies for Adaptive Assistance Jason Qin et.al. 2412.11913 null
2024-12-16 Reentrant phase behavior in binary topological flocks with nonreciprocal alignment Tian Tang et.al. 2412.11871 null
2024-12-16 The Black Ninjas and the Sniper: On Robustness of Population Protocols Benno Lossin et.al. 2412.11783 null
2024-12-16 Prediction of social dilemmas in networked populations via graph neural networks Huaiyu Tan et.al. 2412.11775 null
2024-12-16 Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control Timothée Anne et.al. 2412.11761 null
2024-12-16 Common Ground, Diverse Roots: The Difficulty of Classifying Common Examples in Spanish Varieties Javier A. Lopetegui et.al. 2412.11750 null
2024-12-16 GHIssuemarket: A Sandbox Environment for SWE-Agents Economic Experimentation Mohamed A. Fouad et.al. 2412.11722 link
2024-12-16 Learning UAV-based path planning for efficient localization of objects using prior knowledge Rick van Essen et.al. 2412.11717 link
2024-12-16 LLMs Can Simulate Standardized Patients via Agent Coevolution Zhuoyun Du et.al. 2412.11716 link
2024-12-16 Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework Xuanming Zhang et.al. 2412.11713 null
2024-12-16 Loosely Synchronized Rule-Based Planning for Multi-Agent Path Finding with Asynchronous Actions Shuai Zhou et.al. 2412.11678 link
2024-12-16 VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting Muhammet Furkan Ilaslan et.al. 2412.11621 link
2024-12-16 VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis Zhipeng Chen et.al. 2412.11594 link
2024-12-16 Embodied CoT Distillation From LLM To Off-the-shelf Agents Wonje Choi et.al. 2412.11499 null
2024-12-13 Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining Zhiqi Ge et.al. 2412.10342 null
2024-12-13 Reciprocity in Interbank Markets Lutz Honvehlmann et.al. 2412.10329 null
2024-12-13 MeshA: Efficient Path Planing With Motion Primitives* Marat Agranovskiy et.al. 2412.10320 null
2024-12-13 BrushEdit: All-In-One Image Inpainting and Editing Yaowei Li et.al. 2412.10316 null
2024-12-13 Cultural Evolution of Cooperation among LLM Agents Aron Vallinder et.al. 2412.10270 null
2024-12-13 ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL Yang Qin et.al. 2412.10138 link
2024-12-13 You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects Islem Bouzenia et.al. 2412.10133 null
2024-12-13 Reward Machine Inference for Robotic Manipulation Mattijs Baert et.al. 2412.10096 null
2024-12-13 Heterogeneous Multi-Robot Graph Coverage with Proximity and Movement Constraints Dolev Mutzari et.al. 2412.10083 null
2024-12-13 Large Action Models: From Inception to Implementation Lu Wang et.al. 2412.10047 link
2024-12-13 Cooperative Target Defense under Communication and Sensing Constraints Dipankar Maity et.al. 2412.09939 null
2024-12-13 CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models Dongyu Yao et.al. 2412.09936 link
2024-12-13 ProxyLLM : LLM-Driven Framework for Customer Support Through Text-Style Transfer Sehyeong Jo et.al. 2412.09916 link
2024-12-13 Optimized Coordination Strategy for Multi-Aerospace Systems in Pick-and-Place Tasks By Deep Neural Network Ye Zhang et.al. 2412.09877 null
2024-12-13 AutoPatent: A Multi-Agent Framework for Automatic Patent Generation Qiyao Wang et.al. 2412.09796 link
2024-12-13 Learning Visually Grounded Domain Ontologies via Embodied Conversation and Explanation Jonghyuk Park et.al. 2412.09770 link
2024-12-12 AiEDA: Agentic AI Design Framework for Digital ASIC System Design Aditya Patra et.al. 2412.09745 null
2024-12-12 MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction Xiaohao Xu et.al. 2412.09723 link
2024-12-12 TransferLight: Zero-Shot Traffic Signal Control on any Road-Network Johann Schmidt et.al. 2412.09719 null
2024-12-12 CUAL: Continual Uncertainty-aware Active Learner Amanda Rios et.al. 2412.09701 null
2024-12-12 GenEx: Generating an Explorable World Taiming Lu et.al. 2412.09624 null
2024-12-12 AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials Yiheng Xu et.al. 2412.09605 null
2024-12-12 DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction Yu Feng et.al. 2412.09572 null
2024-12-12 Can Modern LLMs Act as Agent Cores in Radiology~Environments? Qiaoyu Zheng et.al. 2412.09529 link
2024-12-12 Agent-based Video Trimming Lingfeng Yang et.al. 2412.09513 null
2024-12-12 Solving Multiagent Path Finding on Highly Centralized Networks Foivos Fioravantes et.al. 2412.09433 null
2024-12-12 From Intention To Implementation: Automating Biomedical Research via LLMs Yi Luo et.al. 2412.09429 null
2024-12-12 Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer Adam Labiosa et.al. 2412.09417 null
2024-12-12 Uncommon Belief in Rationality Qi Shi et.al. 2412.09407 null
2024-12-12 Falcon-UI: Understanding GUI Before Following User Instructions Huawen Shen et.al. 2412.09362 null
2024-12-12 Does Low Spoilage Under Cold Conditions Foster Cultural Complexity During the Foraging Era? -- A Theoretical and Computational Inquiry Minhyeok Lee et.al. 2412.09335 null
2024-12-12 Beware of Metacognitive Laziness: Effects of Generative Artificial Intelligence on Learning Motivation, Processes, and Performance Yizhou Fan et.al. 2412.09315 null
2024-12-12 A Systematic Review of Knowledge Tracing and Large Language Models in Education: Opportunities, Issues, and Future Research Yongwan Cho et.al. 2412.09248 null
2024-12-12 LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation Yijun Liu et.al. 2412.09237 null
2024-12-12 Reconfigurable Intelligent Surface for Internet of Robotic Things Wanli Ni et.al. 2412.09117 null
2024-12-12 Understanding Opportunities and Risks of Synthetic Relationships: Leveraging the Power of Longitudinal Research with Customised AI Tools Alfio Ventura et.al. 2412.09086 null
2024-12-12 Towards the Structure and Mechanisms of Complex Systems, the Approach of the Quantitative Theory of Meaning Inga Ivanova et.al. 2412.09007 null
2024-12-12 Dynamics of swarmalators in the presence of a contrarian Gourab Kumar Sar et.al. 2412.08966 null
2024-12-12 From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning Pusen Dong et.al. 2412.08920 null
2024-12-12 Neural Interactive Proofs Lewis Hammond et.al. 2412.08897 null
2024-12-11 GPD-1: Generative Pre-training for Driving Zixun Xie et.al. 2412.08643 link
2024-12-11 Generative Semantic Communication: Architectures, Technologies, and Applications Jinke Ren et.al. 2412.08642 null
2024-12-11 RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation Mingfei Han et.al. 2412.08591 null
2024-12-11 Automated Soap Opera Testing Directed by LLMs and Scenario Knowledge: Feasibility, Challenges, and Road Ahead Yanqi Su et.al. 2412.08581 null
2024-12-11 GenPlan: Generative sequence models as adaptive planners Akash Karthikeyan et.al. 2412.08565 link
2024-12-11 An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios Leandro Parada et.al. 2412.08562 null
2024-12-11 Exact Algorithms for Multiagent Path Finding with Communication Constraints on Tree-Like Structures Foivos Fioravantes et.al. 2412.08556 null
2024-12-11 Grimm: A Plug-and-Play Perturbation Rectifier for Graph Neural Networks Defending against Poisoning Attacks Ao Liu et.al. 2412.08555 null
2024-12-11 MaestroMotif: Skill Design from Artificial Intelligence Feedback Martin Klissarov et.al. 2412.08542 null
2024-12-11 Spatial segregation across travelling fronts in individual-based and continuum models for the growth of heterogeneous cell populations José A. Carrillo et.al. 2412.08535 null
2024-12-11 Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel Zun Wang et.al. 2412.08467 link
2024-12-11 IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health Gauri Jain et.al. 2412.08463 link
2024-12-11 TapeAgents: a Holistic Framework for Agent Development and Optimization Dzmitry Bahdanau et.al. 2412.08445 null
2024-12-11 From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons Andrew Szot et.al. 2412.08442 null
2024-12-11 SweetieChat: A Strategy-Enhanced Role-playing Framework for Diverse Scenarios Handling Emotional Support Agent Jing Ye et.al. 2412.08389 null
2024-12-11 Agency and Morality as part of Text Entry AI Assistant Personas Andreas Komninos et.al. 2412.08360 null
2024-12-11 Lachesis: Predicting LLM Inference Accuracy using Structural Properties of Reasoning Paths Naryeong Kim et.al. 2412.08281 null
2024-12-11 Can transformative AI shape a new age for our civilization?: Navigating between speculation and reality Jesus L. Lobo et.al. 2412.08273 null
2024-12-11 Deep learning assisted SERS detection of prolines and hydroxylated prolines using nitrilotriacetic acid functionalized gold nanopillars Yuan Zhang et.al. 2412.08239 null
2024-12-11 Learn How to Query from Unlabeled Data Streams in Federated Learning Yuchang Sun et.al. 2412.08138 link
2024-12-10 Balancing Mobility Behaviors to avoid Global epidemics from Local Outbreaks Pablo Valgañón et.al. 2412.07656 null
2024-12-10 Searching for Structure: Investigating Emergent Communication with Large Language Models Tom Kouwenhoven et.al. 2412.07646 null
2024-12-10 Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization Zongkai Liu et.al. 2412.07639 link
2024-12-10 Swarm Behavior Cloning Jonas Nüßlein et.al. 2412.07617 null
2024-12-10 Modeling Speculative Trading Patterns in Token Markets: An Agent-Based Analysis with TokenLab Mengjue Wang et.al. 2412.07512 null
2024-12-10 ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning Hongshu Guo et.al. 2412.07507 null
2024-12-10 SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World Jiaqi Zhang et.al. 2412.07472 link
2024-12-10 Event-Triggered Memory Control for Interval Type-2 Fuzzy Heterogeneous Multi-Agent Systems Sen Kong et.al. 2412.07471 null
2024-12-10 Dynamic Ensemble Reasoning for LLM Experts Jinwu Hu et.al. 2412.07448 null
2024-12-10 ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving Rongqing Li et.al. 2412.07369 null
2024-12-10 My Words Imply Your Opinion: Reader Agent-Based Propagation Enhancement for Personalized Implicit Emotion Analysis Jian Liao et.al. 2412.07367 null
2024-12-10 IntraLayer: A Platform of Digital Finance Platforms Arman Abgaryan et.al. 2412.07348 null
2024-12-10 CoMA: Compositional Human Motion Generation with Multi-modal Agents Shanlin Sun et.al. 2412.07320 null
2024-12-10 Superficial Consciousness Hypothesis for Autoregressive Transformers Yosuke Miyanishi et.al. 2412.07278 link
2024-12-10 Reconciling Human Development and Giant Panda Protection Goals: Cost-efficiency Evaluation of Farmland Reverting and Energy Substitution Programs in Wolong National Reserve Keyi Liu et.al. 2412.07275 null
2024-12-10 Speaker effects in spoken language comprehension Hanlin Wu et.al. 2412.07238 null
2024-12-10 Parseval Regularization for Continual Reinforcement Learning Wesley Chung et.al. 2412.07224 null
2024-12-10 A Distributed Deep Koopman Learning Algorithm for Control Wenjian Hao et.al. 2412.07212 null
2024-12-10 Epidemiological Model Calibration via Graybox Bayesian Optimization Puhua Niu et.al. 2412.07193 null
2024-12-10 Effective Reward Specification in Deep Reinforcement Learning Julien Roy et.al. 2412.07177 null
2024-12-09 Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty Meera Hahn et.al. 2412.06771 link
2024-12-09 AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark Lan Li et.al. 2412.06724 link
2024-12-09 Asynchronous Agents with Perfect Recall: Model Reductions, Knowledge-Based Construction, and Model Checking for Coalitional Strategies Dilian Gurov et.al. 2412.06706 null
2024-12-09 Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework Tianming Liu et.al. 2412.06681 null
2024-12-09 Self-Interested Agents in Collaborative Learning: An Incentivized Adaptive Data-Centric Framework Nithia Vijayan et.al. 2412.06597 null
2024-12-09 Argentine ants regulate traffic flow with stopped individuals Ulrich Dobramysl et.al. 2412.06587 null
2024-12-09 Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Egor Cherepanov et.al. 2412.06531 null
2024-12-09 EFX Allocations on Some Multi-graph Classes Umang Bhaskar et.al. 2412.06513 null
2024-12-09 The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap Yedi Zhang et.al. 2412.06512 null
2024-12-09 Reasoning about Strategic Abilities in Stochastic Multi-agent Systems Yedi Zhang et.al. 2412.06509 null
2024-12-09 PPT: Pre-Training with Pseudo-Labeled Trajectories for Motion Forecasting Yihong Xu et.al. 2412.06491 null
2024-12-09 Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation Xuesong Zhang et.al. 2412.06465 link
2024-12-09 Simulating Human-like Daily Activities with Desire-driven Autonomy Yiding Wang et.al. 2412.06435 null
2024-12-09 World-Consistent Data Generation for Vision-and-Language Navigation Yu Zhong et.al. 2412.06413 null
2024-12-09 StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI Astrophysicist Cunshi Wang et.al. 2412.06412 null
2024-12-09 Augmenting the action space with conventions to improve multi-agent cooperation in Hanabi F. Bredell et.al. 2412.06333 link
2024-12-09 Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation Using Privileged Information Junqiao Wang et.al. 2412.06313 null
2024-12-09 Beyond pip install: Evaluating LLM Agents for the Automated Installation of Python Projects Louis Milliken et.al. 2412.06294 link
2024-12-09 Enhanced Multi-Object Tracking Using Pose-based Virtual Markers in 3x3 Basketball Li Yin et.al. 2412.06258 null
2024-12-09 In Silico Pharmacokinetic and Molecular Docking Studies of Natural Plants against Essential Protein KRAS for Treatment of Pancreatic Cancer Marsha Mariya Kappan et.al. 2412.06237 null
2024-12-06 TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft Qian Long et.al. 2412.05255 link
2024-12-06 AI's assigned gender affects human-AI cooperation Sepideh Bazazi et.al. 2412.05214 null
2024-12-06 SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot Jinlin Wu et.al. 2412.05187 link
2024-12-06 Sense and Sensitivity: Evaluating the simulation of social dynamics via Large Language Models Da Ju et.al. 2412.05093 null
2024-12-06 Synchronization and desynchronization in ensembles of mobile agents E. M. Varvarin et.al. 2412.05040 null
2024-12-06 Frontier Models are Capable of In-context Scheming Alexander Meinke et.al. 2412.04984 null
2024-12-06 Putting the Iterative Training of Decision Trees to the Test on a Real-World Robotic Task Raphael C. Engelhardt et.al. 2412.04974 null
2024-12-06 Who Speaks Next? Multi-party AI Discussion Leveraging the Systematics of Turn-taking in Murder Mystery Games Ryota Nonomura et.al. 2412.04937 link
2024-12-06 Probing the contents of semantic representations from text, behavior, and brain data using the psychNorms metabase Zak Hussain et.al. 2412.04936 link
2024-12-06 PERCY: A Multimodal Dataset and Conversational System for Personalized and Emotionally Aware Human-Robot Interaction Mohammed Althubyani et.al. 2412.04908 null
2024-12-06 DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling Minzheng Wang et.al. 2412.04905 link
2024-12-06 Estimating causal effects of customer satisfaction on downstream metrics in a multi-queue contact center Sebastián Orellana et.al. 2412.04860 null
2024-12-06 Breaking Event Rumor Detection via Stance-Separated Multi-Agent Debate Mingqing Zhang et.al. 2412.04859 null
2024-12-06 MTSpark: Enabling Multi-Task Learning with Spiking Neural Networks for Generalist Agents Avaneesh Devkota et.al. 2412.04847 null
2024-12-06 A Temporally Correlated Latent Exploration for Reinforcement Learning SuMin Oh et.al. 2412.04775 null
2024-12-06 REGENT: A Retrieval-Augmented Generalist Agent That Can Act In-Context in New Environments Kaustubh Sridhar et.al. 2412.04759 null
2024-12-05 LiveNet: Robust, Minimally Invasive Multi-Robot Control for Safe and Live Navigation in Constrained Environments Srikar Gouru et.al. 2412.04659 link
2024-12-05 Mutation mitigates finite-size effects in spatial evolutionary games Chen Shen et.al. 2412.04654 null
2024-12-05 Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Yiheng Xu et.al. 2412.04454 null
2024-12-05 GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration Kaiyi Huang et.al. 2412.04440 null
2024-12-05 Sub-diffraction Imaging of Carrier Dynamics in Halide Perovskite Semiconductors: Effects of Passivation, Morphology, and Ion Motion Madeleine D. Breshears et.al. 2412.04423 null
2024-12-05 Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation Xuying Li et.al. 2412.04415 null
2024-12-05 EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding Yuqi Wu et.al. 2412.04380 link
2024-12-05 Intersection-Aware Assessment of EMS Accessibility in NYC: A Data-Driven Approach Haoran Su et.al. 2412.04369 null
2024-12-05 Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting Edoardo Cetin et.al. 2412.04368 null
2024-12-05 Machine Theory of Mind for Autonomous Cyber-Defence Luke Swaby et.al. 2412.04367 null
2024-12-05 Reinforcement Learning for Freeway Lane-Change Regulation via Connected Vehicles Ke Sun et.al. 2412.04341 null
2024-12-05 Action Mapping for Reinforcement Learning in Continuous Environments with Constraints Mirco Theile et.al. 2412.04327 null
2024-12-05 Transient Multi-Agent Path Finding for Lifelong Navigation in Dense Environments Jonathan Morag et.al. 2412.04256 null
2024-12-05 HyperMARL: Adaptive Hypernetworks for Multi-Agent RL Kale-ab Abebe Tessera et.al. 2412.04233 null
2024-12-05 A Dynamic Safety Shield for Safe and Efficient Reinforcement Learning of Navigation Tasks Murad Dawood et.al. 2412.04153 null
2024-12-05 Practical Considerations for Agentic LLM Systems Chris Sypherd et.al. 2412.04093 null
2024-12-05 LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents Bingchen Li et.al. 2412.04090 null
2024-12-05 Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning Shicheng Zhou et.al. 2412.04078 link
2024-12-05 Prompt Engineering Guidance for Conceptual Agent-based Model Extraction using Large Language Models Siamak Khatami et.al. 2412.04056 null
2024-12-05 Demonstration of Enhanced Qubit Readout via Reinforcement Learning Aniket Chatterjee et.al. 2412.04053 null
2024-12-05 INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations Yongming Zhu et.al. 2412.04037 null
2024-12-05 Dynamic Graph Representation with Contrastive Learning for Financial Market Prediction: Integrating Temporal Evolution and Static Relations Yunhua Pei et.al. 2412.04034 null
2024-12-04 Navigation World Models Amir Bar et.al. 2412.03572 null
2024-12-04 From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents Xinyi Mou et.al. 2412.03563 link
2024-12-04 Categorize and randomize: a model of sequential stochastic choice Ester Sudano et.al. 2412.03554 null
2024-12-04 SPICE: Smart Projection Interface for Cooking Enhancement Vera Prohaska et.al. 2412.03551 null
2024-12-04 Risk-aware Classification via Uncertainty Quantification Murat Sensoy et.al. 2412.03391 null
2024-12-04 WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis Chengwei Hu et.al. 2412.03359 null
2024-12-04 AI-Driven Day-to-Day Route Choice Leizhen Wang et.al. 2412.03338 link
2024-12-04 Mean-field Concentration of Opinion Dynamics in Random Graphs Javiera Gutiérrez-Ramírez et.al. 2412.03207 null
2024-12-04 AffordDP: Generalizable Diffusion Policy with Transferable Affordance Shijie Wu et.al. 2412.03142 null
2024-12-04 ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning Zhe Xie et.al. 2412.03104 link
2024-12-04 Decentralized Mobile Target Tracking Using Consensus-Based Estimation with Nearly-Constant-Velocity Modeling Amir Ahmad Ghods et.al. 2412.03095 null
2024-12-04 Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi Francesc Wilhelmi et.al. 2412.03076 null
2024-12-04 Preference-based opponent shaping in differentiable games Xinyu Qiao et.al. 2412.03072 null
2024-12-04 Constrained portfolio game with heterogeneous agents Zongxia Liang et.al. 2412.03070 null
2024-12-04 Impact Of Income And Leisure On Optimal Portfolio, Consumption, Retirement Decisions Under Exponential Utility Tae Ung Gang et.al. 2412.03001 null
2024-12-04 New HI views of the Galaxy and the Magellanic Clouds Snezana Stanimirovic et.al. 2412.02981 null
2024-12-03 A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration Thulio Amorim et.al. 2412.02881 null
2024-12-03 Out-of-Distribution Detection for Neurosymbolic Autonomous Cyber Agents Ankita Samaddar et.al. 2412.02875 null
2024-12-03 An Information-Theoretic Analysis of Thompson Sampling for Logistic Bandits Amaury Gouverneur et.al. 2412.02861 null
2024-12-03 Algorithmic idealism: what should you believe to experience next? Markus P. Mueller et.al. 2412.02826 null
2024-12-03 Leveraging Tactile Sensing to Render both Haptic Feedback and Virtual Reality 3D Object Reconstruction in Robotic Telemanipulation Gabriele Giudici et.al. 2412.02644 null
2024-12-03 Mobile Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning: A Scalable Framework Ziheng Liu et.al. 2412.02581 null
2024-12-03 Generating Critical Scenarios for Testing Automated Driving Systems Trung-Hieu Nguyen et.al. 2412.02574 link
2024-12-03 TAB-Fields: A Maximum Entropy Framework for Mission-Aware Adversarial Planning Gokul Puthumanaillam et.al. 2412.02570 link
2024-12-03 Defending Against Diverse Attacks in Federated Learning Through Consensus-Based Bi-Level Optimization Nicolás García Trillos et.al. 2412.02535 link
2024-12-03 General Resetting Theory for Group Avoidance Juhee Lee et.al. 2412.02524 null
2024-12-03 Resonance: Learning to Predict Social-Aware Pedestrian Trajectories as Co-Vibrations Conghao Wong et.al. 2412.02447 null
2024-12-03 A Multi-Agent Framework for Extensible Structured Text Generation in PLCs Donghao Yang et.al. 2412.02410 null
2024-12-03 Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction Ziqian Zou et.al. 2412.02395 null
2024-12-03 Bio-inspired visual relative localization for large swarms of UAVs Martin Křížek et.al. 2412.02393 null
2024-12-03 Social patch foraging theory in an egalitarian group Lisa Blum Moyse et.al. 2412.02381 null
2024-12-03 Reinforcement learning to learn quantum states for Heisenberg scaling accuracy Jeongwoo Jae et.al. 2412.02334 link
2024-12-03 Optimizing Plastic Waste Collection in Water Bodies Using Heterogeneous Autonomous Surface Vehicles with Deep Reinforcement Learning Alejandro Mendoza Barrionuevo et.al. 2412.02316 link
2024-12-03 Large Multimodal Agents for Accurate Phishing Detection with Enhanced Token Optimization and Cost Reduction Fouad Trad et.al. 2412.02301 null
2024-12-03 Conformal Symplectic Optimization for Stable Reinforcement Learning Yao Lyu et.al. 2412.02291 link
2024-12-03 BOTracle: A framework for Discriminating Bots and Humans Jan Kadel et.al. 2412.02266 null
2024-12-03 Selective Reviews of Bandit Problems in AI via a Statistical View Pengjie Zhou et.al. 2412.02251 null
2024-12-03 DataLab: A Unifed Platform for LLM-Powered Business Intelligence Luoxuan Weng et.al. 2412.02205 null
2024-12-03 Distributed Task Allocation for Multi-Agent Systems: A Submodular Optimization Approach Jing Liu et.al. 2412.02146 null
2024-12-03 A privacy-preserving distributed credible evidence fusion algorithm for collective decision-making Chaoxiong Ma et.al. 2412.02130 null
2024-11-29 EF1 Allocations for Identical Trilean and Separable Single-Peaked Valuations Umang Bhaskar et.al. 2411.19881 null
2024-11-29 Neuroplasticity and Psychedelics: a comprehensive examination of classic and non-classic compounds in pre and clinical models Claudio Agnorelli et.al. 2411.19840 null
2024-11-29 Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation Robin D. Pesl et.al. 2411.19804 null
2024-11-29 CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives Armin Saghafian et.al. 2411.19787 link
2024-11-29 The 2024 Motile Active Matter Roadmap Gerhard Gompper et.al. 2411.19783 null
2024-11-29 HVAC-DPT: A Decision Pretrained Transformer for HVAC Control Anaïs Berkes et.al. 2411.19746 null
2024-11-29 Relative Representations of Latent Spaces enable Efficient Semantic Channel Equalization Tomás Hüttebräucker et.al. 2411.19719 null
2024-11-29 RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents Shi Zifeng et.al. 2411.19639 null
2024-11-29 Build An Influential Bot In Social Media Simulations With Large Language Models Bailu Jin et.al. 2411.19635 null
2024-11-29 Solving Rubik's Cube Without Tricky Sampling Yicheng Lin et.al. 2411.19583 null
2024-11-29 Early Versus Late Traffic Management For Autonomous Agents Salman Ghori et.al. 2411.19582 null
2024-11-29 The ATTUNE model for Artificial Trust Towards Human Operators Giannis Petousakis et.al. 2411.19580 null
2024-12-02 Fixed-relative-switch strategies for learning based event-triggered control of nonlinear multiagent systems Ziming Wang et.al. 2411.19571 null
2024-11-29 Training Agents with Weakly Supervised Feedback from Large Language Models Dihong Gong et.al. 2411.19547 null
2024-11-29 A Local Information Aggregation based Multi-Agent Reinforcement Learning for Robot Swarm Dynamic Task Allocation Yang Lv et.al. 2411.19526 null
2024-11-29 RL-MILP Solver: A Reinforcement Learning Approach for Solving Mixed-Integer Linear Programs with Graph Neural Networks Tae-Hoon Lee et.al. 2411.19517 null
2024-11-29 SANGO: Socially Aware Navigation through Grouped Obstacles Rahath Malladi et.al. 2411.19497 null
2024-11-29 Two Timescale EXTRA for Smooth Non-convex Distributed Optimization Problems Zeyu Peng et.al. 2411.19483 null
2024-11-29 Proto Successor Measure: Representing the Space of All Possible Solutions of Reinforcement Learning Siddhant Agarwal et.al. 2411.19418 null
2024-11-28 Dynamic matching games: stationary equilibria under varying commitments Nadia Guiñazú et.al. 2411.19372 null
2024-11-28 Integrating Transit Signal Priority into Multi-Agent Reinforcement Learning based Traffic Signal Control Dickness Kakitahi Kwesiga et.al. 2411.19359 null
2024-11-27 Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective Zhi Zhang et.al. 2411.18615 null
2024-11-27 Robust Offline Reinforcement Learning with Linearly Structured $f$ -Divergence Regularization Cheng Tang et.al. 2411.18612 null
2024-11-27 AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans Dillon Loh et.al. 2411.18539 link
2024-11-27 Biswas-Chatterjee-Sen kinetic exchange opinion model for two connected groups Krzysztof Suchecki et.al. 2411.18527 null
2024-11-27 NeuroAI for AI Safety Patrick Mineault et.al. 2411.18526 null
2024-11-27 Collective decision making by embodied neural agents Nicolas Coucke et.al. 2411.18498 null
2024-11-27 Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator Frederic Kirstein et.al. 2411.18444 null
2024-11-27 An AI-Assisted Multi-Agent Dual Dialogue System to Support Mental Health Care Providers Onno P. Kampman et.al. 2411.18429 null
2024-11-27 Application of Soft Actor-Critic Algorithms in Optimizing Wastewater Treatment with Time Delays Integration Esmaeel Mohammadi et.al. 2411.18305 null
2024-11-27 InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving Xiyan Jiang et.al. 2411.18302 link
2024-11-27 Large Language Model-Brained GUI Agents: A Survey Chaoyun Zhang et.al. 2411.18279 link
2024-11-27 Grid-augumented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents Joongwon Chae et.al. 2411.18270 link
2024-11-27 Wearable intelligent throat enables natural speech in stroke patients with dysarthria Chenyu Tang et.al. 2411.18266 null
2024-11-27 Exploration of LLM Multi-Agent Application Implementation Based on LangGraph+CrewAI Zhihua Duan et.al. 2411.18241 null
2024-11-27 Scalable Multi-Objective Reinforcement Learning with Fairness Guarantees using Lorenz Dominance Dimitris Michailidis et.al. 2411.18195 link
2024-11-27 DMVC-Tracker: Distributed Multi-Agent Trajectory Planning for Target Tracking Using Dynamic Buffered Voronoi and Inter-Visibility Cells Yunwoo Lee et.al. 2411.18086 null
2024-11-27 RL for Mitigating Cascading Failures: Targeted Exploration via Sensitivity Factors Anmol Dwivedi et.al. 2411.18050 link
2024-11-27 The Trusted Caregiver: The Influence of Eye and Mouth Design Incorporating the Baby Schema Effect in Virtual Humanoid Agents on Older Adults Users' Perception of Trustworthiness Jennifer Hu et.al. 2411.18047 null
2024-11-27 Normative Feeling: Socially Patterned Affective Mechanisms Stavros Anagnou et.al. 2411.18037 null
2024-11-27 AEGIS: An Agent-based Framework for General Bug Reproduction from Issue Descriptions Xinchen Wang et.al. 2411.18015 null
2024-11-26 SketchAgent: Language-Driven Sequential Sketch Generation Yael Vinker et.al. 2411.17673 null
2024-11-26 MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation Harsh Singh et.al. 2411.17636 null
2024-11-26 Making History Readable Bipasha Banerjee et.al. 2411.17600 null
2024-11-26 Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals William A. Ingram et.al. 2411.17598 null
2024-11-26 Decision making in stochastic extensive form II: Stochastic extensive forms and games E. Emanuel Rapsch et.al. 2411.17587 null
2024-11-26 Multi-Objective Reinforcement Learning for Automated Resilient Cyber Defence Ross O'Driscoll et.al. 2411.17585 null
2024-11-26 Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach Yaosheng Deng et.al. 2411.17552 null
2024-11-26 ShowUI: One Vision-Language-Action Model for GUI Visual Agent Kevin Qinghong Lin et.al. 2411.17465 link
2024-11-26 Object-centric proto-symbolic behavioural reasoning from pixels Ruben van Bergen et.al. 2411.17438 link
2024-11-26 Joint Combinatorial Node Selection and Resource Allocations in the Lightning Network using Attention-based Reinforcement Learning Mahdi Salahshour et.al. 2411.17353 null
2024-11-26 Towards Intention Recognition for Robotic Assistants Through Online POMDP Planning Juan Carlos Saborio et.al. 2411.17326 null
2024-11-26 A "Breathing" Mobile Communication Network Chao Ge et.al. 2411.17290 null
2024-11-26 APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents Jun Yu Chen et.al. 2411.17255 link
2024-11-26 Short-duration gamma-ray bursts from Kerr-Newman black hole mergers Shad Ali et.al. 2411.17205 null
2024-11-26 P2DFlow: A Protein Ensemble Generative Model with SE(3) Flow Matching Yaowei Jin et.al. 2411.17196 link
2024-11-26 Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment Dongping Chen et.al. 2411.17188 null
2024-11-26 LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble Yujeong Lee et.al. 2411.17135 null
2024-11-26 Creative Agents: Simulating the Systems Model of Creativity with Generative Agents Naomi Imasato et.al. 2411.17065 null
2024-11-26 g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks Zihan Wang et.al. 2411.17030 link
2024-11-26 CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios For Safety Hardening Amar Kulkarni et.al. 2411.16996 null
2024-11-25 Winning opinion: Following Your Friends' Advice or That of Their Friends? Francisco J. Muñoz et.al. 2411.16671 null
2024-11-25 Barriers on the EDGE: A scalable CBF architecture over EDGE for safe aerial-ground multi-agent coordination Viswa Narayanan Sankaranarayanan et.al. 2411.16608 null
2024-11-25 Naive Algorithmic Collusion: When Do Bandit Learners Cooperate and When Do They Compete? Connor Douglas et.al. 2411.16574 null
2024-11-25 Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation Muhammad Burhan Hafez et.al. 2411.16532 link
2024-11-25 Reinforcement Learning for Bidding Strategy Optimization in Day-Ahead Energy Market Luca Di Persio et.al. 2411.16519 null
2024-11-25 Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding Hongzhi Zang et.al. 2411.16506 link
2024-11-25 Distributed Online Optimization with Stochastic Agent Availability Juliette Achddou et.al. 2411.16477 null
2024-11-25 Generating social networks with static and dynamic utility-maximization approaches Aldric Labarthe et.al. 2411.16464 link
2024-11-25 Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction Haoming Li et.al. 2411.16457 null
2024-11-25 TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation Linqing Zhong et.al. 2411.16425 null
2024-11-25 A Multi-agent Framework for Materials Laws Discovery Bo Hu et.al. 2411.16416 null
2024-11-25 Functionality understanding and segmentation in 3D scenes Jaime Corsetti et.al. 2411.16310 null
2024-11-25 Probing for Consciousness in Machines Mathis Immertreu et.al. 2411.16262 null
2024-11-25 Open-Vocabulary Octree-Graph for 3D Scene Understanding Zhigang Wang et.al. 2411.16253 null
2024-11-25 Enhancing Multi-Agent Consensus through Third-Party LLM Integration: Analyzing Uncertainty and Mitigating Hallucinations in Large Language Models Zhihua Duan et.al. 2411.16189 null
2024-11-25 Stop Playing the Guessing Game! Target-free User Simulation for Evaluating Conversational Recommender Systems Sunghwan Kim et.al. 2411.16160 null
2024-11-25 Multi-Robot Reliable Navigation in Uncertain Topological Environments with Graph Attention Networks Zhuoyuan Yu et.al. 2411.16134 link
2024-11-25 Why the Agent Made that Decision: Explaining Deep Reinforcement Learning with Vision Masks Rui Zuo et.al. 2411.16120 null
2024-11-25 Leverage Task Context for Object Affordance Ranking Haojie Huang et.al. 2411.16082 null
2024-11-25 SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text Reshmi Ghosh et.al. 2411.16077 null
2024-11-22 RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts Hjalmar Wijk et.al. 2411.15114 link
2024-11-22 XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models Yixin Dong et.al. 2411.15100 null
2024-11-22 On Multi-Agent Inverse Reinforcement Learning Till Freihaut et.al. 2411.15046 null
2024-11-22 Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium Zeyang Li et.al. 2411.15036 null
2024-11-22 On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations Guojun Xiong et.al. 2411.15014 null
2024-11-22 ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data Junhong Shen et.al. 2411.15004 link
2024-11-22 Free Energy Projective Simulation (FEPS): Active inference with interpretability Joséphine Pazem et.al. 2411.14991 null
2024-11-22 BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence Xuewu Lin et.al. 2411.14869 link
2024-11-22 Universal and Context-Independent Triggers for Precise Control of LLM Outputs Jiashuo Liang et.al. 2411.14738 null
2024-11-22 Enhancing Clinical Trial Patient Matching through Knowledge Augmentation with Multi-Agents Hanwen Shi et.al. 2411.14637 null
2024-11-21 Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning Yafei Ou et.al. 2411.14622 null
2024-11-21 A Systematic Study of Multi-Agent Deep Reinforcement Learning for Safe and Robust Autonomous Highway Ramp Entry Larry Schester et.al. 2411.14593 null
2024-11-21 G-RAG: Knowledge Expansion in Material Science Radeen Mostafa et.al. 2411.14592 link
2024-11-21 SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world Human-Machine Interactions Yaqi Wang et.al. 2411.14574 null
2024-11-21 Energy Efficient Automated Driving as a GNEP: Vehicle-in-the-loop Experiments Viranjan Bhattacharyya et.al. 2411.14567 null
2024-11-21 Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Yuhao Dong et.al. 2411.14432 link
2024-11-21 Multi-Agent Environments for Vehicle Routing Problems Ricardo Gama et.al. 2411.14411 link
2024-11-21 Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs Ofer Dagan et.al. 2411.14404 null
2024-11-21 SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching Arjun P S et.al. 2411.14322 link
2024-11-21 Q-CSM: Q-Learning-based Cognitive Service Management in Heterogeneous IoT Networks Kubra Duran et.al. 2411.14281 null
2024-11-21 Explainable Multi-Agent Reinforcement Learning for Extended Reality Codec Adaptation Pedro Enrique Iturria-Rivera et.al. 2411.14264 null
2024-11-21 Physics-Informed LLM-Agent for Automated Modulation Design in Power Electronics Systems Junhua Liu et.al. 2411.14214 null
2024-11-21 SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization Shuchen Zhu et.al. 2411.14166 null
2024-11-21 Multi-terminal Strong Coordination subject to Secrecy Constraints Viswanathan Ramachandran et.al. 2411.14123 null
2024-11-21 Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems Egor E. Nuzhin et.al. 2411.14117 null
2024-11-21 RAG-Thief: Scalable Extraction of Private Data from Retrieval-Augmented Generation Applications with Agent-based Attacks Changyue Jiang et.al. 2411.14110 null
2024-11-21 Asymmetric Opinion Formation of Emotional Eccitable Agents Irene Ferri et.al. 2411.14099 null
2024-11-21 Exploration by Running Away from the Past Paul-Antoine Le Tolguenec et.al. 2411.14085 null
2024-11-21 On PI-control in Capacity-Limited Networks Felix Agner et.al. 2411.14077 null
2024-11-21 Multi-LLM-Agent Systems: Techniques and Business Perspectives Yingxuan Yang et.al. 2411.14033 null
2024-11-21 GPT versus Humans: Uncovering Ethical Concerns in Conversational Generative AI-empowered Multi-Robot Systems Rebekah Rousi et.al. 2411.14009 null
2024-11-21 Approximating One-Sided and Two-Sided Nash Social Welfare With Capacities Salil Gokhale et.al. 2411.14007 null
2024-11-21 Learning Two-agent Motion Planning Strategies from Generalized Nash Equilibrium for Model Predictive Control Hansung Kim et.al. 2411.13983 link
2024-11-21 Movable Antenna-Equipped UAV for Data Collection in Backscatter Sensor Networks: A Deep Reinforcement Learning-based Approach Yu Bai et.al. 2411.13970 null
2024-11-21 Cooperative Grasping and Transportation using Multi-agent Reinforcement Learning with Ternary Force Representation Ing-Sheng Bernard-Tiong et.al. 2411.13942 null
2024-11-20 BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games Davide Paglieri et.al. 2411.13543 null
2024-11-20 Metacognition for Unknown Situations and Environments (MUSE) Rodolfo Valiente et.al. 2411.13537 null
2024-11-20 AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations Gaurav Verma et.al. 2411.13451 null
2024-11-20 Robust Monocular Visual Odometry using Curriculum Learning Assaf Lahiany et.al. 2411.13438 null
2024-11-20 A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback Alireza Rashidi Laleh et.al. 2411.13410 null
2024-11-20 Simulating Liquidity: Agent-Based Modeling of Illiquid Markets for Fractional Ownership Lars Fluri et.al. 2411.13381 null
2024-11-20 WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving Siwei Chen et.al. 2411.13340 link
2024-11-20 Revealed Information Laura Doval et.al. 2411.13293 null
2024-11-20 Transforming the Hybrid Cloud for Emerging AI Workloads Deming Chen et.al. 2411.13239 null
2024-11-20 Extremum and Nash Equilibrium Seeking with Delays and PDEs: Designs & Applications Tiago Roux Oliveira et.al. 2411.13234 null
2024-11-20 ViSTa Dataset: Do vision-language models understand sequential tasks? Evžen Wybitul et.al. 2411.13211 link
2024-11-20 Engagement-Driven Content Generation with Large Language Models Erica Coppolillo et.al. 2411.13187 null
2024-11-20 Cyborg Insect Factory: Automatic Assembly System to Build up Insect-computer Hybrid Robot Based on Vision-guided Robotic Arm Manipulation of Custom Bipolar Electrodes Qifeng Lin et.al. 2411.13164 null
2024-11-20 Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning Zhi Luo et.al. 2411.13116 null
2024-11-20 Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension Yongdong Luo et.al. 2411.13093 link
2024-11-20 AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents Kevin Godin-Dubois et.al. 2411.13072 null
2024-11-20 Breaking the Cycle of Recurring Failures: Applying Generative AI to Root Cause Analysis in Legacy Banking Systems Siyuan Jin et.al. 2411.13017 null
2024-11-20 MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Collaborative Learning Mircea Lică et.al. 2411.12977 null
2024-11-19 Non-Newtonian corrections to radiative viscosity: Israel-Stewart theory as a viscosity limiter Lorenzo Gavassino et.al. 2411.12929 null
2024-11-19 Human-In-the-Loop Software Development Agents Wannita Takerngsaksiri et.al. 2411.12924 null
2024-11-19 Reinforcement Learning, Collusion, and the Folk Theorem Galit Askenazi-Golan et.al. 2411.12725 null
2024-11-19 UBSoft: A Simulation Platform for Robotic Skill Learning in Unbounded Soft Environments Chunru Lin et.al. 2411.12711 null
2024-11-19 Weighted Envy Freeness With Limited Subsidies Noga Klein Elmalem et.al. 2411.12696 null
2024-11-19 Quasi-stability notions in two-sided matching models Nadia Guiñazú et.al. 2411.12533 null
2024-11-19 Coevolution of relationship-driven cooperation under recommendation protocol on multiplex networks Hongyu Yue et.al. 2411.12436 null
2024-11-19 Instrumentation of Software Systems with OpenTelemetry for Software Visualization Malte Hansen et.al. 2411.12380 null
2024-11-19 C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention Xiaohe Li et.al. 2411.12313 null
2024-11-19 SNN-Based Online Learning of Concepts and Action Laws in an Open World Christel Grimaud et.al. 2411.12308 null
2024-11-19 Emergence of Implicit World Models from Mortal Agents Kazuya Horibe et.al. 2411.12304 null
2024-11-19 Could Humans Outshine AI in Visual Data Analysis? Ratanond Koonchanok et.al. 2411.12299 null
2024-11-19 Efficient Training in Multi-Agent Reinforcement Learning: A Communication-Free Framework for the Box-Pushing Problem David Ge et.al. 2411.12246 null
2024-11-19 Safe Navigation in Dynamic Environments using Density Functions Sriram S. K. S Narayanan et.al. 2411.12206 link
2024-11-19 A More Advanced Group Polarization Measurement Approach Based on LLM-Based Agents and Graphs Zixin Liu et.al. 2411.12196 null
2024-11-19 Action-Attentive Deep Reinforcement Learning for Autonomous Alignment of Beamlines Siyu Wang et.al. 2411.12183 link
2024-11-19 A Combined Encoder and Transformer Approach for Coherent and High-Quality Text Generation Jiajing Chen et.al. 2411.12157 null
2024-11-19 Reinforcement Learning with Action Sequence for Data-Efficient Robot Learning Younggyo Seo et.al. 2411.12155 null
2024-11-19 HEIGHT: Heterogeneous Interaction Graph Transformer for Robot Navigation in Crowded and Constrained Environments Shuijing Liu et.al. 2411.12150 null
2024-11-19 Hierarchical Trait-State Model for Decoding Dyadic Social Interactions Qianying Wu et.al. 2411.12145 null
2024-11-19 Adversarial Multi-Agent Reinforcement Learning for Proactive False Data Injection Detection Kejun Chen et.al. 2411.12130 null
2024-11-18 On-the-Go Path Planning and Repair in Static and Dynamic Scenarios Daniel Ajeleye et.al. 2411.12014 null
2024-11-18 Generative World Explorer Taiming Lu et.al. 2411.11844 null
2024-11-18 Reinterpreting Delay and Procrastination Conrad Kosowsky et.al. 2411.11828 null
2024-11-18 Competing Bandits in Decentralized Large Contextual Matching Markets Satush Parikh et.al. 2411.11794 null
2024-11-18 LLM-IE: A Python Package for Generative Information Extraction with Large Language Models Enshuo Hsu et.al. 2411.11779 null
2024-11-18 Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework Yannick Metz et.al. 2411.11761 null
2024-11-18 The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning Longju Bai et.al. 2411.11758 link
2024-11-18 Distributed Asynchronous Time-Varying Quadratic Programming with Asynchronous Objective Sampling Gabriel Behrendt et.al. 2411.11732 null
2024-11-18 Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment Allison Huang et.al. 2411.11731 link
2024-11-18 TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World Xianlong Wang et.al. 2411.11683 null
2024-11-18 Artificial Scientific Discovery Antonio Norelli et.al. 2411.11672 null
2024-11-18 No-regret Exploration in Shuffle Private Reinforcement Learning Shaojie Bai et.al. 2411.11647 null
2024-11-18 Signaling and Social Learning in Swarms of Robots Leo Cazenille et.al. 2411.11616 null
2024-11-18 OASIS: Open Agents Social Interaction Simulations on One Million Agents Ziyi Yang et.al. 2411.11581 link
2024-11-18 A Code Knowledge Graph-Enhanced System for LLM-Based Fuzz Driver Generation Hanxiang Xu et.al. 2411.11532 link
2024-11-18 Structure learning with Temporal Gaussian Mixture for model-based Reinforcement Learning Théophile Champion et.al. 2411.11511 null
2024-11-18 Timescale-agnostic characterisation for collective attention events Tristan J. B. Cann et.al. 2411.11500 null
2024-11-18 Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models Chenhang Cui et.al. 2411.11496 link
2024-11-18 Quantifying Preferences of Vision-Language Models via Value Decomposition in Social Media Contexts Jingxuan Li et.al. 2411.11479 null
2024-11-18 Distributed Learning with Partial Information Sharing P Raghavendra Rao et.al. 2411.11411 null
2024-11-18 IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos Yunong Liu et.al. 2411.11409 link
2024-11-15 Fair Division via the Cake-Cutting Share Yannan Bai et.al. 2411.10434 null
2024-11-15 Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash Parsa Hejabi et.al. 2411.10422 link
2024-11-15 The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use Siyuan Hu et.al. 2411.10323 link
2024-11-15 Static network structure cannot stabilize cooperation among Large Language Model agents Jin Han et.al. 2411.10294 null
2024-11-15 Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review Hossein Hassani et.al. 2411.10268 null
2024-11-15 Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning Jingru Yang et.al. 2411.10252 null
2024-11-15 An Empirical Study on LLM-based Agents for Automated Bug Fixing Xiangxin Meng et.al. 2411.10213 null
2024-11-15 Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking Valeria Jannelli et.al. 2411.10184 null
2024-11-15 Let people fail! Exploring the influence of explainable virtual and robotic agents in learning-by-doing tasks Marco Matarese et.al. 2411.10176 null
2024-11-15 The Surprising Ineffectiveness of Pre-Trained Visual Representations for Model-Based Reinforcement Learning Moritz Schneider et.al. 2411.10175 null
2024-11-15 Semantics and Spatiality of Emergent Communication Rotem Ben Zion et.al. 2411.10173 link
2024-11-15 Multi-UAV Search and Rescue in Wilderness Using Smart Agent-Based Probability Models Zijian Ge et.al. 2411.10148 null
2024-11-15 Omnichain Web: The Universal Framework for Streamlined Chain Abstraction and Cross-Layer Interaction Hardik Gajera et.al. 2411.10132 null
2024-11-15 Generative Agent Simulations of 1,000 People Joon Sung Park et.al. 2411.10109 null
2024-11-15 Neural Port-Hamiltonian Models for Nonlinear Distributed Control: An Unconstrained Parametrization Approach Muhammad Zakwan et.al. 2411.10096 null
2024-11-15 Enforcing Cooperative Safety for Reinforcement Learning-based Mixed-Autonomy Platoon Control Jingyuan Zhou et.al. 2411.10031 null
2024-11-15 Orca: Enhancing Role-Playing Abilities of Large Language Models by Integrating Personality Traits Yuxuan Huang et.al. 2411.10006 null
2024-11-15 Solvated Electrons and Hydroxyl Radicals at the Plasma-Liquid Interface Seungjun Lee et.al. 2411.09991 null
2024-11-15 Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems Taaha Kazi et.al. 2411.09972 null
2024-11-15 Sublinear-time Collision Detection with a Polynomial Number of States in Population Protocols Takumi Araya et.al. 2411.09957 null
2024-11-14 Nash equilibrium seeking for a class of quadratic-bilinear Wasserstein distributionally robust games Georgios Pantazis et.al. 2411.09636 null
2024-11-14 Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents Yuyou Gan et.al. 2411.09523 null
2024-11-14 Randomized Truthful Auctions with Learning Agents Gagan Aggarwal et.al. 2411.09517 null
2024-11-14 Strategic Sacrifice: Self-Organized Robot Swarm Localization for Inspection Productivity Sneha Ramshanker et.al. 2411.09493 null
2024-11-14 Socio-Economic Consequences of Generative AI: A Review of Methodological Approaches Carlos J. Costa et.al. 2411.09313 null
2024-11-14 Embedding Space Allocation with Angle-Norm Joint Classifiers for Few-Shot Class-Incremental Learning Dunwei Tu et.al. 2411.09250 null
2024-11-14 Risk-aware MPPI for Stochastic Hybrid Systems Hardik Parwana et.al. 2411.09198 link
2024-11-14 Enhancing reinforcement learning for population setpoint tracking in co-cultures Sebastián Espinel-Ríos et.al. 2411.09177 null
2024-11-14 Artificial Theory of Mind and Self-Guided Social Organisation Michael S. Harré et.al. 2411.09169 null
2024-11-14 Theory of Mind Enhances Collective Intelligence Michael S. Harré et.al. 2411.09168 null
2024-11-14 Rationality based Innate-Values-driven Reinforcement Learning Qin Yang et.al. 2411.09160 null
2024-11-14 The \emph{Optimist}: Towards Fully Automated Graph Theory Research Randy Davila et.al. 2411.09158 link
2024-11-14 Personalized Help for Optimizing Low-Skilled Users' Strategy Feng Gu et.al. 2411.09109 null
2024-11-13 Pheromone-Guided Navigation of Potential Mates: A Distinct Exploration Strategy Nick Dashti et.al. 2411.09092 null
2024-11-13 Microfoundation Inference for Strategic Prediction Daniele Bracale et.al. 2411.08998 null
2024-11-13 The Impact of Social Value Orientation on Nash Equilibria of Two Player Quadratic Games Dan Calderone et.al. 2411.08809 null
2024-11-13 FinRobot: AI Agent for Equity Research and Valuation with Large Language Models Tianyu Zhou et.al. 2411.08804 link
2024-11-13 Evaluating World Models with LLM for Decision Making Chang Yang et.al. 2411.08794 null
2024-11-13 Towards Fair and Efficient Public Transportation: A Bus Stop Model Martin Bullinger et.al. 2411.08784 link
2024-11-13 Logic-based Knowledge Awareness for Autonomous Agents in Continuous Spaces Arabinda Ghosh et.al. 2411.08754 null
2024-11-13 Statistical Operating Characteristics of Current Early Phase Dose Finding Designs with Toxicity and Efficacy in Oncology Hao Sun et.al. 2411.08698 null
2024-11-13 Inferring Parameter Distributions in Heterogeneous Motile Particle Ensembles: A Likelihood Approach for Second Order Langevin Models Jan Albrecht et.al. 2411.08692 null
2024-11-13 Robot See, Robot Do: Imitation Reward for Noisy Financial Environments Sven Goluža et.al. 2411.08637 null
2024-11-13 On the Application of Model Predictive Control to a Weighted Coverage Path Planning Problem Kilian Schweppe et.al. 2411.08634 null
2024-11-13 NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation Youzhi Liu et.al. 2411.08579 null
2024-11-13 Grammarization-Based Grasping with Deep Multi-Autoencoder Latent Space Exploration by Reinforcement Learning Agent Leonidas Askianakis et.al. 2411.08566 null
2024-11-13 TimeLess: A Vision for the Next Generation of Software Development Zeeshan Rasheed et.al. 2411.08507 null
2024-11-13 Towards Objective and Unbiased Decision Assessments with LLM-Enhanced Hierarchical Attention Networks Junhua Liu et.al. 2411.08504 link
2024-11-13 AD-DINO: Attention-Dynamic DINO for Distance-Aware Embodied Reference Understanding Hao Guo et.al. 2411.08451 null
2024-11-13 Towards Evaluating Large Language Models for Graph Query Generation Siraj Munir et.al. 2411.08449 null
2024-11-13 Learning Dynamic Cognitive Map with Autonomous Navigation Daria de Tinguy et.al. 2411.08447 link
2024-11-13 Anonymous Distributed Localisation via Spatial Population Protocols Leszek Gąsieniec et.al. 2411.08434 null
2024-11-13 One STEP at a time: Language Agents are Stepwise Planners Minh Nguyen et.al. 2411.08432 link
2024-11-13 Enhanced Classroom Dialogue Sequences Analysis with a Hybrid AI Agent: Merging Expert Rule-Base with Large Language Models Yun Long et.al. 2411.08418 null
2024-11-13 BAMAX: Backtrack Assisted Multi-Agent Exploration using Reinforcement Learning Geetansh Kalra et.al. 2411.08400 null
2024-11-12 LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models Anoop Cherian et.al. 2411.08027 null
2024-11-12 Incentive Design with Spillovers Krishna Dasaratha et.al. 2411.08026 null
2024-11-12 From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents Chuyi Kong et.al. 2411.07965 null
2024-11-12 Learning Memory Mechanisms for Decision Making through Demonstrations William Yue et.al. 2411.07954 link
2024-11-12 RedCode: Risky Code Execution and Generation Benchmark for Code Agents Chengquan Guo et.al. 2411.07781 link
2024-11-12 Efficiency of energy-consuming random walkers: Variability in energy helps Mohsen Ghasemi Nezhadhaghighi et.al. 2411.07771 null
2024-11-12 Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows Fangyu Lei et.al. 2411.07763 null
2024-11-12 Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning Stefan Pranger et.al. 2411.07700 null
2024-11-12 World Models: The Safety Perspective Zifan Zeng et.al. 2411.07690 null
2024-11-12 Safe Exploitative Play with Untrusted Type Beliefs Tongxin Li et.al. 2411.07679 null
2024-11-12 The relationship between general equilibrium models with infinite-lived agents and overlapping generations models, and some applications Ngoc-Sang Pham et.al. 2411.07674 null
2024-11-12 Mitigating Bias in Queer Representation within Large Language Models: A Collaborative Agent Approach Tianyi Huang et.al. 2411.07656 link
2024-11-12 Exploring Multi-Agent Reinforcement Learning for Unrelated Parallel Machine Scheduling Maria Zampella et.al. 2411.07634 null
2024-11-12 A Simple Multi-agent Joint Prediction Method for Autonomous Driving Mingyi Wang et.al. 2411.07612 null
2024-11-12 Multiple Non-cooperative Targets Encirclement by Relative Distance based Positioning and Neural Anti-Synchronization Control Fen Liu et.al. 2411.07590 null
2024-11-12 Reinforcement Learning Framework for Quantitative Trading Alhassan S. Yasin et.al. 2411.07585 null
2024-11-12 Stability for a stochastic fractional differential variational inequality with Lévy jump Yue Zeng et.al. 2411.07557 null
2024-11-12 Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective Raed Al Kontar et.al. 2411.07523 null
2024-11-12 Two-Layer Attention Optimization for Bimanual Coordination Justin Ting et.al. 2411.07470 null
2024-11-12 BudgetMLAgent: A Cost-Effective LLM Multi-Agent system for Automating Machine Learning Tasks Shubham Gandhi et.al. 2411.07464 null
2024-11-11 Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving Botao Yu et.al. 2411.07228 null
2024-11-11 Grounding Video Models to Actions through Goal Conditioned Exploration Yunhao Luo et.al. 2411.07223 null
2024-11-11 'Explaining RL Decisions with Trajectories': A Reproducibility Study Karim Abdel Sadek et.al. 2411.07200 link
2024-11-11 Gradual Fine-Tuning with Graph Routing for Multi-Source Unsupervised Domain Adaptation Yao Ma et.al. 2411.07185 null
2024-11-11 RoundTable: Investigating Group Decision-Making Mechanism in Multi-Agent Collaboration Young-Min Cho et.al. 2411.07161 null
2024-11-11 Azurin-Based Peptide p28 Arrests the p53-HDM2 Interactions: A Novel Anti-Cancer Pathway Albin Joy et.al. 2411.07124 null
2024-11-11 Learning Multi-Agent Collaborative Manipulation for Long-Horizon Quadrupedal Pushing Chuye Hong et.al. 2411.07104 null
2024-11-11 Bounded Rationality Equilibrium Learning in Mean Field Games Yannick Eich et.al. 2411.07099 link
2024-11-11 A Multi-Agent Approach for REST API Testing with Semantic Graphs and LLM-Driven Inputs Myeongsoo Kim et.al. 2411.07098 null
2024-11-11 Differentially-Private Collaborative Online Personalized Mean Estimation Yauhen Yakimenka et.al. 2411.07094 null
2024-11-11 To Train or Not to Train: Balancing Efficiency and Training Cost in Deep Reinforcement Learning for Mobile Edge Computing Maddalena Boscaro et.al. 2411.07086 null
2024-11-11 Learning Collective Dynamics of Multi-Agent Systems using Event-based Vision Minah Lee et.al. 2411.07039 null
2024-11-11 Designing Reliable Experiments with Generative Agent-Based Modeling: A Comprehensive Guide Using Concordia by Google DeepMind Alejandro Leonardo García Navarro et.al. 2411.07038 null
2024-11-11 Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching Arnav Kumar Jain et.al. 2411.07007 link
2024-11-11 Enhancing Robot Assistive Behaviour with Reinforcement Learning and Theory of Mind Antonio Andriella et.al. 2411.07003 link
2024-11-11 Maximizing Nash Social Welfare in 2-Value Instances: A Simpler Proof for the Half-Integer Case Kurt Mehlhorn et.al. 2411.06924 null
2024-11-11 Scalable Distributed Least Squares Algorithm for Linear Algebraic Equations via Scheduling Shenyu Liu et.al. 2411.06883 null
2024-11-11 Distributed Graph Augmentation Protocols to Achieve Strong Connectivity in Multi-Agent Networks Guilherme Ramos et.al. 2411.06880 link
2024-11-11 Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC Aditya Soni et.al. 2411.06815 null
2024-11-11 Generative midtended cognition and Artificial Intelligence. Thinging with thinging things Xabier E. Barandiaran et.al. 2411.06812 null
2024-11-08 Topology-aware Reinforcement Feature Space Reconstruction for Graph Data Wangyang Ying et.al. 2411.05742 null
2024-11-08 A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics Puze Liu et.al. 2411.05718 null
2024-11-08 Settling the Complexity of Popularity in Additively Separable and Fractional Hedonic Games Martin Bullinger et.al. 2411.05713 null
2024-11-08 Data-Driven Distributed Common Operational Picture from Heterogeneous Platforms using Multi-Agent Reinforcement Learning Indranil Sur et.al. 2411.05683 null
2024-11-08 The influence of persona and conversational task on social interactions with a LLM-controlled embodied conversational agent Leon O. H. Kroczek et.al. 2411.05653 null
2024-11-08 LightVA: Lightweight Visual Analytics with LLM Agent-Based Task Planning and Execution Yuheng Zhao et.al. 2411.05651 null
2024-11-08 Expectation vs. Reality: Towards Verification of Psychological Games Marta Kwiatkowska et.al. 2411.05599 null
2024-11-08 Smart navigation through a rotating barrier: Deep reinforcement learning with application to size-based separation of active microagents Mohammad Hossein Masoudi et.al. 2411.05587 null
2024-11-08 Tangled Program Graphs as an alternative to DRL-based control algorithms for UAVs Hubert Szolc et.al. 2411.05586 link
2024-11-08 Parameterized Voter Relevance in Facility Location Games with Tree-Shaped Invitation Graphs Ryoto Ando et.al. 2411.05574 null
2024-11-08 Time-to-reach Bounds for Verification of Dynamical Systems Using the Koopman Spectrum Jianqiang Ding et.al. 2411.05554 null
2024-11-08 Evolution of cooperation in a three-strategy game combining snowdrift and stag hunt games Hirofumi Takesue et.al. 2411.05543 null
2024-11-08 Generating surrogate temporal networks from mesoscale building blocks Giulia Cencetti et.al. 2411.05477 link
2024-11-08 Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction Émiland Garrabé et.al. 2411.05474 null
2024-11-08 Emergent Cooperative Strategies for Multi-Agent Shepherding via Reinforcement Learning Italo Napolitano et.al. 2411.05454 null
2024-11-08 WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models Shengda Fan et.al. 2411.05451 link
2024-11-08 VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM Jeongwoo Lee et.al. 2411.05423 null
2024-11-08 Towards Low-Resource Harmful Meme Detection with LMM Agents Jianzhao Huang et.al. 2411.05383 link
2024-11-08 Enhancing Cluster Resilience: LLM-agent Based Autonomous Intelligent Cluster Diagnosis System and Evaluation Framework Honghao Shi et.al. 2411.05349 null
2024-11-08 LLM-PySC2: Starcraft II learning environment for Large Language Models Zongyuan Li et.al. 2411.05348 link
2024-11-07 Few-Shot Task Learning through Inverse Generative Modeling Aviv Netanyahu et.al. 2411.04987 null
2024-11-07 Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games Usman Anwar et.al. 2411.04976 link
2024-11-07 StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration Panwen Hu et.al. 2411.04925 null
2024-11-07 OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Siming Huang et.al. 2411.04905 null
2024-11-07 Achieving superconductivity in infinite-layer nickelate thin films by aluminum sputtering deposition Dongxin Zhang et.al. 2411.04896 null
2024-11-07 GUI Agents with Foundation Models: A Comprehensive Survey Shuai Wang et.al. 2411.04890 null
2024-11-07 Think Smart, Act SMARL! Analyzing Probabilistic Logic Driven Safety in Multi-Agent Reinforcement Learning Satchit Chatterji et.al. 2411.04867 link
2024-11-07 Robust Regulation of Labour Contracts Théo Durandard et.al. 2411.04841 null
2024-11-07 Plasticity Loss in Deep Reinforcement Learning: A Survey Timo Klein et.al. 2411.04832 null
2024-11-07 MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation Sayan Paul et.al. 2411.04796 null
2024-11-07 A Continuification-Based Control Solution for Large-Scale Shepherding Beniamino Di Lorenzo et.al. 2411.04791 null
2024-11-07 Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research Xuewen Han et.al. 2411.04788 link
2024-11-07 Navigating Trade-offs: Policy Summarization for Multi-Objective Reinforcement Learning Zuzanna Osika et.al. 2411.04784 link
2024-11-07 Learning from Demonstration with Hierarchical Policy Abstractions Toward High-Performance and Courteous Autonomous Racing Chanyoung Chung et.al. 2411.04735 null
2024-11-07 A dynamical model of platform choice and online segregation Sven Banisch et.al. 2411.04681 null
2024-11-07 CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation Jie Liu et.al. 2411.04679 null
2024-11-07 Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning Zhiyu Shao et.al. 2411.04672 link
2024-11-07 CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR Kadir Burak Buldu et.al. 2411.04671 null
2024-11-07 IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving Clémence Grislain et.al. 2411.04653 link
2024-11-07 Mint: Cost-Efficient Tracing with All Requests Collection via Commonality and Variability Analysis Haiyu Huang et.al. 2411.04605 null
2024-11-06 Predicting and Publishing Accurate Imbalance Prices Using Monte Carlo Tree Search Fabio Pavirani et.al. 2411.04011 null
2024-11-06 Temporal Network Creation Games: The Impact of Non-Locality and Terminals Davide Bilò et.al. 2411.03973 null
2024-11-06 Almost Time-Optimal Loosely-Stabilizing Leader Election on Arbitrary Graphs Without Identifiers in Population Protocols Haruki Kanaya et.al. 2411.03902 null
2024-11-06 AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making Yizhe Huang et.al. 2411.03865 link
2024-11-06 Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC Tyler Clark et.al. 2411.03820 null
2024-11-06 From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning Zhirui Deng et.al. 2411.03817 null
2024-11-06 MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue Fengxiang Wang et.al. 2411.03814 null
2024-11-06 Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data Chengrui Qu et.al. 2411.03810 link
2024-11-06 Multi-Modal Intelligent Channel Modeling: A New Modeling Paradigm via Synesthesia of Machines Lu Bai et.al. 2411.03711 null
2024-11-06 Learn to Slice, Slice to Learn: Unveiling Online Optimization and Reinforcement Learning for Slicing AI Services Amr Abo-eleneen et.al. 2411.03686 null
2024-11-06 Imagined Potential Games: A Framework for Simulating, Learning and Evaluating Interactive Behaviors Lingfeng Sun et.al. 2411.03669 null
2024-11-06 Privacy-Preserving Resilient Vector Consensus Bing Liu et.al. 2411.03633 null
2024-11-06 CPEG: Leveraging Consistency Policy with Consensus Guidance for Multi-agent Exploration Yuqian Fu et.al. 2411.03603 null
2024-11-05 Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Antoine Grosnit et.al. 2411.03562 null
2024-11-05 VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation Haochen Zhang et.al. 2411.03540 link
2024-11-05 AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution Zhiqiang Xie et.al. 2411.03519 null
2024-11-05 An Open-source Sim2Real Approach for Sensor-independent Robot Navigation in a Grid Murad Mehrab Abrar et.al. 2411.03494 link
2024-11-05 Watson: A Cognitive Observability Framework for the Reasoning of Foundation Model-Powered Agents Benjamin Rombaut et.al. 2411.03455 null
2024-11-05 SAUCE: Synchronous and Asynchronous User-Customizable Environment for Multi-Agent LLM Interaction Shlomo Neuberger et.al. 2411.03397 link
2024-11-05 SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents Dawei Li et.al. 2411.03284 link
2024-11-05 Causal Responsibility Attribution for Human-AI Collaboration Yahang Qi et.al. 2411.03275 link
2024-11-05 Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities Ryosuke Takata et.al. 2411.03252 null
2024-11-05 Troll Farms Philipp Denter et.al. 2411.03241 null
2024-11-05 A resolved Lyman-Alpha profile with doubly peaked emission at z~7 C. Moya-Sierralta et.al. 2411.03222 null
2024-11-05 GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis Temitope Akinboyewa et.al. 2411.03205 link
2024-11-05 Online Data Collection for Efficient Semiparametric Inference Shantanu Gupta et.al. 2411.03195 link
2024-11-05 Hierarchical Orchestra of Policies Thomas P Cannon et.al. 2411.03008 null
2024-11-05 Accelerating Task Generalisation with Multi-Level Hierarchical Options Thomas P Cannon et.al. 2411.02998 null
2024-11-05 Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation Francisco Giral et.al. 2411.02975 null
2024-11-05 Embedding Safety into RL: A New Take on Trust Region Methods Nikola Milosevic et.al. 2411.02957 null
2024-11-05 Constant Approximation for Weighted Nash Social Welfare with Submodular Valuations Yuda Feng et.al. 2411.02942 null
2024-11-05 Multi-Modal 3D Scene Graph Updater for Shared and Dynamic Environments Emilio Olivastri et.al. 2411.02938 null
2024-11-05 Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent Yangning Li et.al. 2411.02937 link
2024-11-05 Polyhedral study of a temporal rural postman problem: application in inspection of railway track without disturbing train schedules Somnath Buriuly et.al. 2411.02822 null
2024-11-05 DroidSpeak: Enhancing Cross-LLM Communication Yuhan Liu et.al. 2411.02820 null
2024-11-04 Fair and Welfare-Efficient Constrained Multi-matchings under Uncertainty Elita Lobo et.al. 2411.02654 link
2024-11-04 Fine Grained Insider Risk Detection Birkett Huber et.al. 2411.02645 null
2024-11-04 Learning to Assist Humans without Inferring Rewards Vivek Myers et.al. 2411.02623 link
2024-11-04 Multi-Agent Decision Transformers for Dynamic Dispatching in Material Handling Systems Leveraging Enterprise Big Data Xian Yeow Lee et.al. 2411.02584 null
2024-11-04 Attacking Vision-Language Computer Agents via Pop-ups Yanzhe Zhang et.al. 2411.02391 link
2024-11-04 Two-Sided Learning in Decentralized Matching Markets Vade Shah et.al. 2411.02377 null
2024-11-04 Social-RAG: Retrieving from Group Interactions to Socially Ground Proactive AI Generation to Group Preferences Ruotong Wang et.al. 2411.02353 null
2024-11-04 WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning Zehan Qi et.al. 2411.02337 link
2024-11-04 CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments Kung-Hsiang Huang et.al. 2411.02305 link
2024-11-04 Kinetic exchange opinion dynamics for the battleground-states in the 2024 US presidential elections Soumyajyoti Biswas et.al. 2411.02240 null
2024-11-04 Positive Experience Reflection for Agents in Interactive Text Environments Philip Lippmann et.al. 2411.02223 null
2024-11-04 CryptoEL: A Novel Experiential Learning Tool for Enhancing K-12 Cryptography Education Pranathi Rayavaram et.al. 2411.02143 null
2024-11-04 Foundations and Recent Trends in Multimodal Mobile Agents: A Survey Biao Wu et.al. 2411.02006 link
2024-11-04 Deep memetic models for combinatorial optimization problems: application to the tool switching problem Jhon Edgar Amaya et.al. 2411.01922 null
2024-11-04 Efficient Active Imitation Learning with Random Network Distillation Emilien Biré et.al. 2411.01894 null
2024-11-04 ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation Hengkai Tan et.al. 2411.01850 null
2024-11-04 IRS-Enhanced Secure Semantic Communication Networks: Cross-Layer and Context-Awared Resource Allocation Lingyi Wang et.al. 2411.01821 null
2024-11-04 A Polynomial-Time Algorithm for Fair and Efficient Allocation with a Fixed Number of Agents Ryoga Mahara et.al. 2411.01810 null
2024-11-04 Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge Weihua Du et.al. 2411.01796 link
2024-11-04 Revisiting Game-Theoretic Control in Socio-Technical Networks: Emerging Design Frameworks and Contemporary Applications Quanyan Zhu et.al. 2411.01794 null
2024-11-04 Lyapunov-guided Multi-Agent Reinforcement Learning for Delay-Sensitive Wireless Scheduling Cheng Zhang et.al. 2411.01766 null
2024-11-04 Show, Don't Tell: Learning Reward Machines from Demonstrations for Reinforcement Learning-Based Cardiac Pacemaker Synthesis John Komp et.al. 2411.01750 null
2024-11-04 DynaSaur: Large Language Agents Beyond Predefined Actions Dang Nguyen et.al. 2411.01747 null
2024-11-04 Taking AI Welfare Seriously Robert Long et.al. 2411.00986 null

(back to top)

Large Language Model Agent

Publish Date Title Authors PDF Code
2025-02-24 IGDA: Interactive Graph Discovery through Large Language Model Agents Alex Havrilla et.al. 2502.17189 null
2025-02-24 Grounded Persuasive Language Generation for Automated Marketing Jibang Wu et.al. 2502.16810 null
2025-02-24 Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances Yaozu Wu et.al. 2502.16804 null
2025-02-23 Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System Saikat Barua et.al. 2502.16750 null
2025-02-23 RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents Sho Nakatani et.al. 2502.16730 null
2025-02-20 Vending-Bench: A Benchmark for Long-Term Coherence of Autonomous Agents Axel Backlund et.al. 2502.15840 null
2025-02-18 LLM Trading: Analysis of LLM Agent Behavior in Experimental Asset Markets Thomas Henning et.al. 2502.15800 null
2025-02-21 Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing Masaya Kobayashi et.al. 2502.15506 null
2025-02-21 Textual-to-Visual Iterative Self-Verification for Slide Generation Yunqing Xu et.al. 2502.15412 null
2025-02-21 I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search Zujie Liang et.al. 2502.14693 null
2025-02-20 Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization Zhitao He et.al. 2502.14496 null
2025-02-20 FlowAgent: Achieving Compliance and Flexibility for Workflow Agents Yuchen Shi et.al. 2502.14345 link
2025-02-19 Investigating Non-Transitivity in LLM-as-a-Judge Yi Xu et.al. 2502.14074 null
2025-02-19 An LLM-based Agent for Reliable Docker Environment Configuration Ruida Hu et.al. 2502.13681 null
2025-02-16 Understanding Dynamic Diffusion Process of LLM-based Agents under Information Asymmetry Yiwen Zhang et.al. 2502.13160 null
2025-02-18 SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems Mike Zhang et.al. 2502.12927 link
2025-02-18 Towards more Contextual Agents: An extractor-Generator Optimization Framework Mourad Aouini et.al. 2502.12926 null
2025-02-18 DemonAgent: Dynamically Encrypted Multi-Backdoor Implantation Attack on LLM-based Agent Pengyu Zhu et.al. 2502.12575 link
2025-02-18 Investigating and Extending Homans' Social Exchange Theory with Large Language Model based Agents Lei Wang et.al. 2502.12450 link
2025-02-17 Connecting Large Language Model Agent to High Performance Computing Resource Heng Ma et.al. 2502.12280 null
2025-02-17 Scaling Autonomous Agents via Automatic Reward Modeling And Planning Zhenfang Chen et.al. 2502.12130 null
2025-02-17 TimeCAP: Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents Geon Lee et.al. 2502.11418 null
2025-02-16 A Survey of LLM-based Agents in Medicine: How far are we from Baymax? Wenxuan Wang et.al. 2502.11211 null
2025-02-16 SCALE: Towards Collaborative Content Analysis in Social Science with Large Language Model Agents and Human Intervention Chengshuai Zhao et.al. 2502.10937 null
2025-02-14 Can Large Language Model Agents Balance Energy Systems? Xinxing Ren et.al. 2502.10557 null
2025-02-13 MDCrow: Automating Molecular Dynamics Workflows with Large Language Models Quintina Campbell et.al. 2502.09565 link
2025-02-12 SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent Keyeun Lee et.al. 2502.08599 link
2025-02-13 Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation Mahnaz Koupaee et.al. 2502.08514 link
2025-02-07 Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization Zelai Xu et.al. 2502.04686 null
2025-02-06 Multi-Agent Reinforcement Learning with Focal Diversity Optimization Selim Furkan Tekin et.al. 2502.04492 link
2025-02-04 Position: Scaling LLM Agents Requires Asymptotic Analysis with LLM Primitives Elliot Meyerson et.al. 2502.04358 null
2025-02-03 Simulating Rumor Spreading in Social Networks using LLM Agents Tianrui Hu et.al. 2502.01450 link
2025-02-03 PlotGen: Multi-Agent LLM-based Scientific Data Visualization via Multimodal Feedback Kanika Goswami et.al. 2502.00988 null
2025-02-02 RTBAgent: A LLM-based Agent System for Real-Time Bidding Leng Cai et.al. 2502.00792 link
2025-02-02 Meta-Prompt Optimization for LLM-Based Sequential Decision Making Mingze Kong et.al. 2502.00728 null
2025-02-02 PhiP-G: Physics-Guided Text-to-3D Compositional Scene Generation Qixuan Li et.al. 2502.00708 null
2025-01-31 Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game Mustafa O. Karabag et.al. 2501.19398 link
2025-01-28 Large Language Model Critics for Execution-Free Evaluation of Code Changes Aashish Yadavally et.al. 2501.16655 link
2024-12-30 DropMicroFluidAgents (DMFAs): Autonomous Droplet Microfluidic Research Framework Through Large Language Model Agents Dinh-Nguyen Nguyen et.al. 2501.14772 link
2025-01-24 AI Chatbots as Professional Service Agents: Developing a Professional Identity Wenwen Li et.al. 2501.14179 null
2025-02-08 Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents Shrinidhi Kumbhar et.al. 2501.13299 null
2025-01-20 Towards Advancing Code Generation with Large Language Models: A Research Roadmap Haolin Jin et.al. 2501.11354 null
2025-02-13 Large Language Model Agents for Radio Map Generation and Wireless Network Planning Hongye Quan et.al. 2501.11283 null
2024-12-18 Autonomous Microscopy Experiments through Large Language Model Agents Indrajeet Mandal et.al. 2501.10385 null
2025-01-13 Lifelong Learning of Large Language Model based Agents: A Roadmap Junhao Zheng et.al. 2501.07278 link
2025-01-10 Multi-Agent Collaboration Mechanisms: A Survey of LLMs Khanh-Tung Tran et.al. 2501.06322 null
2025-01-09 Emergence of human-like polarization among large language model agents Jinghua Piao et.al. 2501.05171 null
2025-01-27 MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning Pu Yang et.al. 2501.01834 null
2025-01-03 SDPO: Segment-Level Direct Preference Optimization for Social Agents Aobo Kong et.al. 2501.01821 link
2025-01-03 AgentRefine: Enhancing Agent Generalization through Refinement Tuning Dayuan Fu et.al. 2501.01702 null
2025-01-02 BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery Kanishk Gandhi et.al. 2501.01540 link
2024-12-31 Enabling New HDLs with Agents Mark Zakharov et.al. 2501.00642 null
2025-01-09 Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding Yue Fan et.al. 2501.00358 null
2024-12-30 AI Agent for Education: von Neumann Multi-Agent System Framework Yuan-Hao Jiang et.al. 2501.00083 null
2024-12-17 AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models Haoyi Zhang et.al. 2412.19824 null
2024-12-24 Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent Farhad Nooralahzadeh et.al. 2412.18428 link
2024-12-24 Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering Zhongjian Hu et.al. 2412.18351 null
2024-12-24 INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent Haohang Li et.al. 2412.18174 null
2024-12-24 Molly: Making Large Language Model Agents Solve Python Problem More Logically Rui Xiao et.al. 2412.18093 null
2024-12-17 On the Structural Memory of LLM Agents Ruihong Zeng et.al. 2412.15266 link
2024-12-18 Tree-of-Code: A Hybrid Approach for Robust Complex Task Planning and Execution Ziyi Ni et.al. 2412.14212 null
2024-12-17 RareAgents: Autonomous Multi-disciplinary Team for Rare Disease Diagnosis and Treatment Xuanzhong Chen et.al. 2412.12475 null
2024-12-14 Towards Action Hijacking of Large Language Model-based Agent Yuyang Zhang et.al. 2412.10807 null
2025-01-09 Active Inference for Self-Organizing Multi-LLM Systems: A Bayesian Thermodynamic Approach to Adaptation Rithvik Prakki et.al. 2412.10425 link
2024-12-19 Can Modern LLMs Act as Agent Cores in Radiology Environments? Qiaoyu Zheng et.al. 2412.09529 link
2024-12-09 Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework Tianming Liu et.al. 2412.06681 null
2024-12-09 Simulating Human-like Daily Activities with Desire-driven Autonomy Yiding Wang et.al. 2412.06435 null
2024-12-09 StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI Astrophysicist Cunshi Wang et.al. 2412.06412 null
2024-12-09 Beyond pip install: Evaluating LLM Agents for the Automated Installation of Python Projects Louis Milliken et.al. 2412.06294 link
2024-12-08 Cooperative SQL Generation for Segmented Databases By Using Multi-functional LLM Agents Zhiguang Wu et.al. 2412.05850 null
2024-12-04 DataLab: A Unified Platform for LLM-Powered Business Intelligence Luoxuan Weng et.al. 2412.02205 null
2024-12-02 HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing Lajos Muzsai et.al. 2412.01778 link
2024-12-02 SAUP: Situation Awareness Uncertainty Propagation on LLM Agent Qiwei Zhao et.al. 2412.01033 null
2024-12-03 Multi-Agent System for Cosmological Parameter Analysis Andrew Laverick et.al. 2412.00431 link
2024-11-28 SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments Yue Cao et.al. 2412.00114 null
2024-11-29 Training Agents with Weakly Supervised Feedback from Large Language Models Dihong Gong et.al. 2411.19547 null
2024-11-26 LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble Yujeong Lee et.al. 2411.17135 null
2024-11-21 Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning Song Jiang et.al. 2411.13904 null
2024-11-19 Human-In-the-Loop Software Development Agents Wannita Takerngsaksiri et.al. 2411.12924 null
2024-12-16 A More Advanced Group Polarization Measurement Approach Based on LLM-Based Agents and Graphs Zixin Liu et.al. 2411.12196 null
2024-11-15 Static network structure cannot stabilize cooperation among Large Language Model agents Jin Han et.al. 2411.10294 null
2024-11-15 An Empirical Study on LLM-based Agents for Automated Bug Fixing Xiangxin Meng et.al. 2411.10213 null
2024-11-14 Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents Yuyou Gan et.al. 2411.09523 null
2024-10-29 FinVision: A Multi-Agent Framework for Stock Market Prediction Sorouralsadat Fatemi et.al. 2411.08899 null
2024-11-11 Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving Botao Yu et.al. 2411.07228 null
2024-11-05 Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities Ryosuke Takata et.al. 2411.03252 null
2024-11-02 Interacting Large Language Model Agents. Interpretable Models and Social Learning Adit Jain et.al. 2411.01271 null
2024-11-02 AutoPT: How Far Are We from the End2End Automated Web Penetration Testing? Benlong Wu et.al. 2411.01236 link
2024-11-02 A Large-scale Time-aware Agents Simulation for Influencer Selection in Digital Advertising Campaigns Xiaoqing Zhang et.al. 2411.01143 null
2024-11-01 Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement Yingwei Ma et.al. 2411.00622 link
2024-10-31 From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents Nalin Tiwary et.al. 2410.23555 null
2024-10-30 Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration Yanchu Guan et.al. 2410.22916 null
2024-10-29 SceneGenAgent: Precise Industrial Scene Generation with Coding Agent Xiao Xia et.al. 2410.21909 link
2024-10-28 Guide-LLM: An Embodied LLM Agent and Text-Based Topological Map for Robotic Guidance of People with Visual Impairments Sangmim Song et.al. 2410.20666 null
2024-10-29 Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting Mohamed Salim Aissi et.al. 2410.19920 null
2024-11-07 GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration Xin Li et.al. 2410.18032 link
2024-10-25 MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting Sungil Seok et.al. 2410.18012 null
2024-10-22 SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning Yizhou Chi et.al. 2410.17238 link
2024-10-22 Adsorb-Agent: Autonomous Identification of Stable Adsorption Configurations via Large Language Model Agent Janghoon Ock et.al. 2410.16658 link
2024-10-21 NetSafe: Exploring the Topological Safety of Multi-agent Networks Miao Yu et.al. 2410.15686 null
2024-10-20 When Machine Unlearning Meets Retrieval-Augmented Generation (RAG): Keep Secret or Forget Knowledge? Shang Wang et.al. 2410.15267 null
2024-10-19 SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation Jingxuan Chen et.al. 2410.15164 link
2024-10-18 Agents4PLC: Automating Closed-loop PLC Code Generation and Verification in Industrial Control Systems using LLM-based Agents Zihan Liu et.al. 2410.14209 link
2024-10-18 SRAP-Agent: Simulating and Optimizing Scarce Resource Allocation Policy with LLM-based Agent Jiarui Ji et.al. 2410.14152 link
2024-10-17 AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents Ke Yang et.al. 2410.13825 null
2024-10-17 Rapid and Automated Alloy Design with Graph Neural Network-Powered LLM-Driven Multi-Agent Systems Alireza Ghafarollahi et.al. 2410.13768 null

(back to top)

Tool learning

Publish Date Title Authors PDF Code
2025-02-17 ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models Hanxing Ding et.al. 2502.11404 link
2025-02-17 Mimicking the Familiar: Dynamic Command Generation for Information Theft Attacks in LLM Tool-Learning System Ziyou Jiang et.al. 2502.11358 null
2025-02-14 RTBAS: Defending LLM Agents Against Prompt Injection and Privacy Leakage Peter Yong Zhong et.al. 2502.08966 null
2025-02-03 Tool Unlearning for Tool-Augmented LLMs Jiali Cheng et.al. 2502.01083 null
2025-01-30 ACEBench: Who Wins the Match Point in Tool Learning? Chen Chen et.al. 2501.12851 null
2025-01-21 Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation Dongsheng Zhu et.al. 2501.12432 null
2024-12-11 GraphTool-Instruction: Revolutionizing Graph Reasoning in LLMs through Decomposed Subtask Instruction Rongzheng Wang et.al. 2412.12152 null
2024-12-11 Federated In-Context LLM Agent Learning Panlong Wu et.al. 2412.08054 null
2024-12-08 TOOL-ED: Enhancing Empathetic Response Generation with the Tool Calling Capability of LLM Huiying Cao et.al. 2412.03096 link
2024-10-15 Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option Konstantin Yakovlev et.al. 2410.12004 null
2025-01-07 NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models Han Han et.al. 2410.11805 link
2024-10-10 From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions Changle Qu et.al. 2410.08197 link
2025-02-18 StepTool: Enhancing Multi-Step Tool Usage in LLMs through Step-Grained Reinforcement Learning Yuanqing Yu et.al. 2410.07745 link
2025-02-24 Learning Evolving Tools for Large Language Models Guoxin Chen et.al. 2410.06617 link
2024-10-08 ToolGen: Unified Tool Retrieval and Calling via Generation Renxi Wang et.al. 2410.03439 link
2024-09-23 CITI: Enhancing Tool Utilizing Ability in Large Language Models without Sacrificing General Performance Yupu Hao et.al. 2409.13202 link
2024-09-02 ToolACE: Winning the Points of LLM Function Calling Weiwen Liu et.al. 2409.00920 null
2025-02-16 Learning to Ask: When LLM Agents Meet Unclear Instruction Wenxuan Wang et.al. 2409.00557 null
2024-10-08 MetaTool: Facilitating Large Language Models to Master Tools with Meta-task Augmentation Xiaohan Wang et.al. 2407.12871 null
2024-07-02 WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models Kangyun Ning et.al. 2407.12823 null
2024-07-03 What Affects the Stability of Tool Learning? An Empirical Study on the Robustness of Tool Learning Frameworks Chengrui Huang et.al. 2407.03007 null
2024-06-28 Simulating Financial Market via Large Language Model based Agents Shen Gao et.al. 2406.19966 null
2024-09-29 Enhancing Tool Retrieval with Iterative Feedback from Large Language Models Qiancheng Xu et.al. 2406.17465 link
2024-09-30 Query Routing for Homogeneous Tools: An Instantiation in the RAG Scenario Feiteng Mu et.al. 2406.12429 null
2024-10-02 Tool-Planner: Task Planning with Clusters across Multiple Tools Yanming Liu et.al. 2406.03807 link
2024-06-03 A Survey of Useful LLM Evaluation Ji-Lun Peng et.al. 2406.00936 null
2024-11-04 Tool Learning with Large Language Models: A Survey Changle Qu et.al. 2405.17935 link
2024-05-24 Let Me Do It For You: Towards LLM Empowered Recommendation via Tool Learning Yuyue Zhao et.al. 2405.15114 null
2024-05-14 Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark Mengsong Wu et.al. 2405.08355 link

(back to top)

Embodied AI

Publish Date Title Authors PDF Code
2025-02-20 CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space Yong Zhao et.al. 2502.12532 link
2025-02-16 NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM Zihan Wang et.al. 2502.11142 link
2025-02-14 STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning Mingcong Lei et.al. 2502.10177 null
2025-02-11 Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning Yuhang Dong et.al. 2502.09649 null
2025-02-23 EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Rui Yang et.al. 2502.09560 null
2025-02-10 Visual Agentic AI for Spatial Reasoning with a Dynamic API Damiano Marsili et.al. 2502.06787 null
2025-02-09 EvoAgent: Agent Autonomous Evolution with Continual World Model for Long-Horizon Tasks Tongtong Feng et.al. 2502.05907 null
2025-02-10 Humans Co-exist, So Must Embodied Artificial Agents Hannah Kuehn et.al. 2502.04809 null
2025-02-04 AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement Shivam Singh et.al. 2502.02067 link
2025-02-03 Provable Ordering and Continuity in Vision-Language Pretraining for Generalizable Embodied Agents Zhizhen Zhang et.al. 2502.01218 link
2025-01-31 MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems Anirudh Chari et.al. 2501.19318 null
2025-01-31 GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling Pinxin Liu et.al. 2501.18898 link
2025-02-03 UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent Jianke Zhang et.al. 2501.18867 null
2025-01-29 PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding Wei Chow et.al. 2501.16411 null
2025-02-13 What if Eye...? Computationally Recreating Vision Evolution Kushagra Tiwary et.al. 2501.15001 link
2025-01-21 EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Zhili Cheng et.al. 2501.11858 link
2025-01-17 Universal Actions for Enhanced Embodied Foundation Models Jinliang Zheng et.al. 2501.10105 link
2025-01-15 Embodied Scene Understanding for Vision Language Models via MetaVQA Weizhen Wang et.al. 2501.09167 null
2025-01-10 Semantic Mapping in Indoor Embodied AI -- A Comprehensive Survey and Future Directions Sonia Raychaudhuri et.al. 2501.05750 null
2025-01-09 ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark Ronghao Dang et.al. 2501.05031 link
2025-01-29 Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey Zongxia Li et.al. 2501.02189 link
2025-01-02 Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method Ruichen Zhang et.al. 2501.01141 null
2024-12-30 UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI Fangwei Zhong et.al. 2412.20977 null
2024-12-28 FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration Jia Liu et.al. 2412.20297 null
2024-12-30 Embodied Image Quality Assessment for Robotic Intelligence Jianbo Zhang et.al. 2412.18774 link
2024-12-24 Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems Fernando Jia et.al. 2412.18601 link
2024-12-24 VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks Shiduo Zhang et.al. 2412.18194 null
2024-12-23 Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples Taewoong Kim et.al. 2412.17288 link
2024-12-25 Offline Reinforcement Learning for LLM Multi-Step Reasoning Huaijie Wang et.al. 2412.16145 link
2024-12-17 GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding Haoyi Jiang et.al. 2412.13193 link
2024-12-18 SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents Sheng Yin et.al. 2412.13178 link
2024-12-16 Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents Wonje Choi et.al. 2412.11484 null
2024-12-05 TANGO: Training-free Embodied AI Agents for Open-world Tasks Filippo Ziliotto et.al. 2412.10402 null
2024-12-11 From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons Andrew Szot et.al. 2412.08442 null
2024-12-23 SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World Jiaqi Zhang et.al. 2412.07472 link
2024-12-08 InfiniteWorld: A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction Pengzhen Ren et.al. 2412.05789 link
2024-12-06 TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft Qian Long et.al. 2412.05255 link
2024-12-06 EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding Yuqi Wu et.al. 2412.04380 link
2024-12-03 Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental Attacks Zijiao Yang et.al. 2412.02795 null
2024-12-25 Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation Yiyuan Pan et.al. 2412.01857 null
2024-12-02 The Bare Necessities: Designing Simple, Effective Open-Vocabulary Scene Graphs Christina Kassab et.al. 2412.01539 null
2024-12-02 Generating Freeform Endoskeletal Robots Muhan Li et.al. 2412.01036 null
2024-12-01 STEVE-Audio: Expanding the Goal Conditioning Modalities of Embodied Agents in Minecraft Nicholas Lenzen et.al. 2412.00949 null
2024-11-30 Benchmark Real-time Adaptation and Communication Capabilities of Embodied Agent in Collaborative Scenarios Shipeng Liu et.al. 2412.00435 null
2024-11-28 CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos Xinhao Liu et.al. 2411.17820 link
2024-12-15 3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning Yuncong Yang et.al. 2411.17735 null
2024-11-26 LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble Yujeong Lee et.al. 2411.17135 null
2024-11-23 Two Heads Are Better Than One: Collaborative LLM Embodied Agents for Human-Robot Interaction Mitchell Rosser et.al. 2411.16723 null
2024-11-25 TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation Linqing Zhong et.al. 2411.16425 null
2024-12-04 Functionality understanding and segmentation in 3D scenes Jaime Corsetti et.al. 2411.16310 null
2024-11-25 Open-Vocabulary Octree-Graph for 3D Scene Understanding Zhigang Wang et.al. 2411.16253 null
2024-11-27 XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models Yixin Dong et.al. 2411.15100 null
2024-11-20 AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents Kevin Godin-Dubois et.al. 2411.13072 null
2024-11-25 MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Collaborative Learning Mircea Lică et.al. 2411.12977 null
2024-11-15 Voxel-Aggergated Feature Synthesis: Efficient Dense Mapping for Simulated 3D Reasoning Owen Burns et.al. 2411.10616 null
2024-11-13 NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation Youzhi Liu et.al. 2411.08579 null
2024-11-08 Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction Émiland Garrabé et.al. 2411.05474 null
2024-11-07 MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation Sayan Paul et.al. 2411.04796 null
2024-11-07 CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation Jie Liu et.al. 2411.04679 null
2024-11-07 Scaling Laws for Pre-training Agents and World Models Tim Pearce et.al. 2411.04434 null
2024-11-05 VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation Haochen Zhang et.al. 2411.03540 link
2024-11-04 ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation Hengkai Tan et.al. 2411.01850 null
2024-11-05 Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge Weihua Du et.al. 2411.01796 link
2024-10-31 PARTNR: A Benchmark for Planning and Reasoning in Embodied Multi-agent Tasks Matthew Chang et.al. 2411.00081 link
2024-10-31 Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use Jiajun Xi et.al. 2410.24218 link
2024-10-31 Simulating User Agents for Embodied Conversational-AI Daniel Philipov et.al. 2410.23535 null
2024-10-30 A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment Matteo G. Mecattaf et.al. 2410.23242 link
2024-10-29 ADAM: An Embodied Causal Agent in Open-World Environments Shu Yu et.al. 2410.22194 null
2024-10-23 Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments Luca Barsellotti et.al. 2410.18195 link
2024-10-21 Agent-Based Emulation for Deploying Robot Swarm Behaviors Ricardo Vega et.al. 2410.16444 null
2024-10-18 Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents Sabit Hassan et.al. 2410.14141 null
2024-10-17 Goal Inference from Open-Ended Dialog Rachel Ma et.al. 2410.13957 null
2024-10-15 M2Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes Sixu Yan et.al. 2410.11402 null
2024-10-14 Embodied Active Learning of Generative Sensor-Object Models Allison Pinosky et.al. 2410.11130 null
2024-10-16 PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation Kaidong Zhang et.al. 2410.10394 null
2024-10-12 EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment Chen Gao et.al. 2410.09604 null
2024-10-05 Semantic Environment Atlas for Object-Goal Navigation Nuri Kim et.al. 2410.09081 null
2024-11-01 Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making Manling Li et.al. 2410.07166 link
2024-10-15 M3Bench: Benchmarking Whole-body Motion Generation for Mobile Manipulation in 3D Scenes Zeyu Zhang et.al. 2410.06678 null
2024-10-08 Entering Real Social World! Benchmarking the Theory of Mind and Socialization Capabilities of LLMs from a First-person Perspective Guiyang Hou et.al. 2410.06195 link
2024-10-07 How do we Observe Relational Observables? Emily Adlam et.al. 2410.05508 null

(back to top)

About

🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%