Usage instructions: here
Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-24 | Event-Based Limit Order Book Simulation under a Neural Hawkes Process: Application in Market-Making | Luca Lalor et.al. | 2502.17417 | null |
2025-02-24 | Distributed Coordination for Heterogeneous Non-Terrestrial Networks | Jikang Deng et.al. | 2502.17366 | null |
2025-02-24 | Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents | Prafulla Kumar Choubey et.al. | 2502.17321 | null |
2025-02-24 | Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach | Jichen Li et.al. | 2502.17307 | null |
2025-02-24 | IGDA: Interactive Graph Discovery through Large Language Model Agents | Alex Havrilla et.al. | 2502.17189 | null |
2025-02-24 | Teleology-Driven Affective Computing: A Causal Framework for Sustained Well-Being | Bin Yin et.al. | 2502.17172 | null |
2025-02-24 | A Novel Multiple Access Scheme for Heterogeneous Wireless Communications using Symmetry-aware Continual Deep Reinforcement Learning | Hamidreza Mazandarani et.al. | 2502.17167 | null |
2025-02-24 | Semantic-Aware Dynamic and Distributed Power Allocation: a Multi-UAV Area Coverage Use Case | Hamidreza Mazandarani et.al. | 2502.17120 | null |
2025-02-24 | Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration | Junyang Wang et.al. | 2502.17110 | null |
2025-02-24 | Generative Models in Decision Making: A Survey | Yinchuan Li et.al. | 2502.17100 | null |
2025-02-24 | MA2RL: Masked Autoencoders for Generalizable Multi-Agent Reinforcement Learning | Jinyuan Feng et.al. | 2502.17046 | null |
2025-02-24 | A data-driven econo-financial stress-testing framework to estimate the effect of supply chain networks on financial systemic risk | Jan Fialkowski et.al. | 2502.17044 | null |
2025-02-24 | Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs | Enea Monzio Compagnoni et.al. | 2502.17009 | null |
2025-02-24 | Deep-reinforcement-learning-based separation control in a two-dimensional airfoil | Xavier Garcia et.al. | 2502.16993 | null |
2025-02-24 | Engineering and Validating Cyber-Physical Energy Systems: Needs, Status Quo, and Research Trends | Thomas I. Strasser et.al. | 2502.16991 | null |
2025-02-24 | A Multi-LLM-Agent-Based Framework for Economic and Public Policy Analysis | Yuzhi Hao et.al. | 2502.16879 | null |
2025-02-24 | Graphy'our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data | Longbin Lai et.al. | 2502.16868 | null |
2025-02-24 | Toward Agentic AI: Generative Information Retrieval Inspired Intelligent Communications and Networking | Ruichen Zhang et.al. | 2502.16866 | null |
2025-02-24 | Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment | Kartik Nagpal et.al. | 2502.16863 | null |
2025-02-24 | Grounded Persuasive Language Generation for Automated Marketing | Jibang Wu et.al. | 2502.16810 | null |
2025-02-21 | AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind | Zhining Zhang et.al. | 2502.15676 | null |
2025-02-21 | Multi-Agent Architecture in Distributed Environment Control Systems: vision, challenges, and opportunities | Natasha Astudillo et.al. | 2502.15663 | null |
2025-02-21 | Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network | Vincent Hsiao et.al. | 2502.15662 | null |
2025-02-21 | Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path? | Yoshua Bengio et.al. | 2502.15657 | null |
2025-02-21 | A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications | Jefferson Silveira et.al. | 2502.15649 | null |
2025-02-21 | WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents | Xinhang Liu et.al. | 2502.15601 | null |
2025-02-21 | SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instrucion Following Evaluation for Social Agents | Wenyuan Zhang et.al. | 2502.15538 | null |
2025-02-21 | Contract DesignUnderApproximate Best Responses | Francesco Bacchiocchi et.al. | 2502.15523 | null |
2025-02-21 | SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning | Xuyang Li et.al. | 2502.15512 | null |
2025-02-21 | Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing | Masaya Kobayashi et.al. | 2502.15506 | null |
2025-02-21 | Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations | Lihu Chen et.al. | 2502.15429 | null |
2025-02-21 | TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning | Giuseppe Paolo et.al. | 2502.15425 | null |
2025-02-21 | Textual-to-Visual Iterative Self-Verification for Slide Generation | Yunqing Xu et.al. | 2502.15412 | null |
2025-02-21 | LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models | Hongchen Wei et.al. | 2502.15393 | null |
2025-02-21 | Multi-Group Dynamics with Tolerant Switching in the Kolkata Paise Restaurant Problem with Dining Clubs | Akshat Harlalka et.al. | 2502.15377 | null |
2025-02-21 | ARS: Automatic Routing Solver with Large Language Models | Kai Li et.al. | 2502.15359 | null |
2025-02-21 | Learning with Limited Shared Information in Multi-agent Multi-armed Bandit | Junning Shao et.al. | 2502.15338 | null |
2025-02-21 | DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation | Luzhou Ge et.al. | 2502.15309 | link |
2025-02-21 | Leader-Follower Formation Tracking Control of Quadrotor UAVs Using Bearing Measurements | S. Doodeman et.al. | 2502.15303 | null |
2025-02-21 | Collective behaviors of self-propelled particles with tunable alignment angles | Zichen Qin et.al. | 2502.15301 | null |
2025-02-20 | GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks | Jianwen Luo et.al. | 2502.14848 | null |
2025-02-20 | Red-Teaming LLM Multi-Agent Systems via Communication Attacks | Pengfei He et.al. | 2502.14847 | null |
2025-02-20 | Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation | Yue Yang et.al. | 2502.14846 | null |
2025-02-20 | Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models | Vlad Sobal et.al. | 2502.14819 | null |
2025-02-20 | Optimizing Model Selection for Compound AI Systems | Lingjiao Chen et.al. | 2502.14815 | link |
2025-02-20 | Byzantine Game Theory: Sun Tzus Boxes | Andrei Constantinescu et.al. | 2502.14812 | null |
2025-02-20 | Planning, scheduling, and execution on the Moon: the CADRE technology demonstration mission | Gregg Rabideau et.al. | 2502.14803 | null |
2025-02-20 | A Multi-Agent Perspective on Modern Information Retrieval | Haya Nachimovsky et.al. | 2502.14796 | null |
2025-02-20 | Making Universal Policies Universal | Niklas Höpner et.al. | 2502.14777 | null |
2025-02-20 | Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis | Priyanka Kargupta et.al. | 2502.14767 | link |
2025-02-20 | Multi-Agent Coordination across Diverse Applications: A Survey | Lijun Sun et.al. | 2502.14743 | null |
2025-02-20 | Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse | Michael Doherty et.al. | 2502.14741 | null |
2025-02-20 | FLIGHT: Facility Location Integrating Generalized, Holistic Theory of Welfare | Avyukta Manjunatha Vummintala et.al. | 2502.14732 | null |
2025-02-20 | Ranking Joint Policies in Dynamic Games using Evolutionary Dynamics | Natalia Koliou et.al. | 2502.14724 | link |
2025-02-20 | Building reliable sim driving agents by scaling self-play | Daphne Cornelisse et.al. | 2502.14706 | null |
2025-02-20 | I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search | Zujie Liang et.al. | 2502.14693 | null |
2025-02-20 | BP-SGCN: Behavioral Pseudo-Label Informed Sparse Graph Convolution Network for Pedestrian and Heterogeneous Trajectory Prediction | Ruochen Li et.al. | 2502.14676 | link |
2025-02-20 | InstructAgent: Building User Controllable Recommender via LLM Agent | Wujiang Xu et.al. | 2502.14662 | link |
2025-02-20 | Online Envy Minimization and Multicolor Discrepancy: Equivalences and Separations | Daniel Halpern et.al. | 2502.14624 | null |
2025-02-20 | Curiosity Driven Multi-agent Reinforcement Learning for 3D Game Testing | Raihana Ferdous et.al. | 2502.14606 | link |
2025-02-19 | Autellix: An Efficient Serving Engine for LLM Agents as General Programs | Michael Luo et.al. | 2502.13965 | null |
2025-02-19 | LIDDIA: Language-based Intelligent Drug Discovery Agent | Reza Averly et.al. | 2502.13959 | null |
2025-02-19 | RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision | Guangzhi Xiong et.al. | 2502.13957 | null |
2025-02-19 | Qwen2.5-VL Technical Report | Shuai Bai et.al. | 2502.13923 | null |
2025-02-19 | Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health | Xingbo Wang et.al. | 2502.13920 | null |
2025-02-19 | DataSciBench: An LLM Agent Benchmark for Data Science | Dan Zhang et.al. | 2502.13897 | link |
2025-02-19 | NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants | Yiran Qin et.al. | 2502.13894 | null |
2025-02-19 | Enhancing Cross-Domain Recommendations with Memory-Optimized LLM-Based User Agents | Jiahao Liu et.al. | 2502.13843 | null |
2025-02-19 | ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities | Chanjin Zheng et.al. | 2502.13832 | link |
2025-02-19 | Learning to explore when mistakes are not allowed | Charly Pecqueux-Guézénec et.al. | 2502.13801 | null |
2025-02-19 | From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education | Yi-Fan Zhang et.al. | 2502.13789 | null |
2025-02-19 | Poster: SpiderSim: Multi-Agent Driven Theoretical Cybersecurity Simulation for Industrial Digitalization | Jiaqi Li et.al. | 2502.13778 | link |
2025-02-19 | Quantile agent utility and implications to randomized social choice | Ioannis Caragiannis et.al. | 2502.13772 | null |
2025-02-19 | AI Software Engineer: Programming with Trust | Abhik Roychoudhury et.al. | 2502.13767 | null |
2025-02-19 | GPA: Grover Policy Agent for Generating Optimal Quantum Sensor Circuits | Ahmad Alomari et.al. | 2502.13755 | null |
2025-02-19 | Kinetic modelling of economic markets with individual and collective transactions | Chuandong Lin et.al. | 2502.13735 | null |
2025-02-19 | Hierarchical RL-MPC for Demand Response Scheduling | Maximilian Bloor et.al. | 2502.13714 | null |
2025-02-19 | Parameterized Complexity of Hedonic Games with Enemy-Oriented Preferences | Martin Durand et.al. | 2502.13703 | null |
2025-02-19 | Causes and Strategies in Multiagent Systems | Sylvia S. Kerkhove et.al. | 2502.13701 | null |
2025-02-19 | An LLM-based Agent for Reliable Docker Environment Configuration | Ruida Hu et.al. | 2502.13681 | null |
2025-02-18 | AIDE: AI-Driven Exploration in the Space of Code | Zhengyao Jiang et.al. | 2502.13138 | link |
2025-02-18 | Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions | Taedong Yun et.al. | 2502.13135 | null |
2025-02-18 | Magma: A Foundation Model for Multimodal AI Agents | Jianwei Yang et.al. | 2502.13130 | link |
2025-02-18 | Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning | Jingyang Lin et.al. | 2502.13127 | null |
2025-02-18 | Approximately Efficient Bilateral Trade with Samples | Yuan Deng et.al. | 2502.13122 | null |
2025-02-18 | Text2World: Benchmarking Large Language Models for Symbolic World Model Generation | Mengkang Hu et.al. | 2502.13092 | null |
2025-02-18 | Interactive Agents to Overcome Ambiguity in Software Engineering | Sanidhya Vijayvargiya et.al. | 2502.13069 | link |
2025-02-18 | Improved Fine-Tuning of Large Multimodal Models for Hateful Meme Detection | Jingbiao Mei et.al. | 2502.13061 | null |
2025-02-18 | AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks | Yurun Chen et.al. | 2502.13053 | null |
2025-02-18 | Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks | Markus J. Buehler et.al. | 2502.13025 | null |
2025-02-18 | Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents | Chaoran Chen et.al. | 2502.13012 | null |
2025-02-18 | Integrating Reinforcement Learning, Action Model Learning, and Numeric Planning for Tackling Complex Tasks | Yarin Benyamin et.al. | 2502.13006 | link |
2025-02-18 | You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations | Frederic Kirstein et.al. | 2502.13001 | null |
2025-02-18 | Free Argumentative Exchanges for Explaining Image Classifiers | Avinash Kori et.al. | 2502.12995 | link |
2025-02-18 | Generative AI and Information Asymmetry: Impacts on Adverse Selection and Moral Hazard | Yukun Zhang et.al. | 2502.12969 | null |
2025-02-18 | AI-Enabled Rent-Seeking: How Generative AI Alters Market Transparency and Efficiency | Yukun Zhang et.al. | 2502.12956 | null |
2025-02-18 | Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options | Lakshmi Nair et.al. | 2502.12929 | link |
2025-02-18 | SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems | Mike Zhang et.al. | 2502.12927 | null |
2025-02-18 | Towards more Contextual Agents: An extractor-Generator Optimization Framework | Mourad Aouini et.al. | 2502.12926 | null |
2025-02-18 | Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation | Zheng Yuan et.al. | 2502.12911 | null |
2025-02-17 | HARBOR: Exploring Persona Dynamics in Multi-Agent Competition | Kenan Jiang et.al. | 2502.12149 | null |
2025-02-17 | Scaling Autonomous Agents via Automatic Reward Modeling And Planning | Zhenfang Chen et.al. | 2502.12130 | null |
2025-02-17 | A-MEM: Agentic Memory for LLM Agents | Wujiang Xu et.al. | 2502.12110 | link |
2025-02-17 | Relational Norms for Human-AI Cooperation | Brian D. Earp et.al. | 2502.12102 | null |
2025-02-17 | A Study on Leveraging Search and Self-Feedback for Agent Reasoning | Karthikeyan K et.al. | 2502.12094 | null |
2025-02-17 | Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation | Zhongyi Qiu et.al. | 2502.12073 | null |
2025-02-17 | A survey about perceptions of mobility to inform an agent-based simulator of subjective modal choice | Carole Adam et.al. | 2502.12058 | null |
2025-02-17 | Multi-agent coordination via communication partitions | Wei-Chen Lee et.al. | 2502.12042 | null |
2025-02-17 | Machine Learning Should Maximize Welfare, Not (Only) Accuracy | Nir Rosenfeld et.al. | 2502.11981 | null |
2025-02-17 | FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control | Yutong Ye et.al. | 2502.11937 | null |
2025-02-17 | CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning | Yanxiao Zhao et.al. | 2502.11896 | null |
2025-02-17 | Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration | Shao Zhang et.al. | 2502.11882 | link |
2025-02-17 | Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models | Hyunwoo Kim et.al. | 2502.11881 | null |
2025-02-17 | Does Knowledge About Perceptual Uncertainty Help an Agent in Automated Driving? | Natalie Grabowsky et.al. | 2502.11864 | null |
2025-02-17 | Can LLM Agents Maintain a Persona in Discourse? | Pranav Bhandari et.al. | 2502.11843 | null |
2025-02-17 | Assessing the impacts of tradable credit schemes through agent-based simulation | Renming Liu et.al. | 2502.11822 | null |
2025-02-17 | Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning | Peiying Yu et.al. | 2502.11799 | null |
2025-02-17 | Personality Editing for Language Models through Relevant Knowledge Editing | Seojin Hwang et.al. | 2502.11789 | null |
2025-02-17 | Changing the Rules of the Game: Reasoning about Dynamic Phenomena in Multi-Agent Systems | Rustam Galimullin et.al. | 2502.11785 | null |
2025-02-17 | Plant in Cupboard, Orange on Table, Book on Shelf. Benchmarking Practical Reasoning and Situation Modelling in a Text-Simulated Situated Environment | Jonathan Jordan et.al. | 2502.11733 | null |
2025-02-14 | Representation and Interpretation in Artificial and Natural Computing | Luis A. Pineda et.al. | 2502.10383 | null |
2025-02-14 | Agentic Verification for Ambiguous Query Disambiguation | Youngwon Lee et.al. | 2502.10352 | null |
2025-02-14 | Process Reward Models for LLM Agents: Practical Framework and Directions | Sanjiban Choudhury et.al. | 2502.10325 | link |
2025-02-14 | Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations | Abdelrhman Shaheen et.al. | 2502.10303 | null |
2025-02-14 | Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers | Aivin V. Solatorio et.al. | 2502.10263 | null |
2025-02-14 | Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Laurin Luttmann et.al. | 2502.10233 | link |
2025-02-14 | A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation | Redha Taguelmimt et.al. | 2502.10226 | null |
2025-02-14 | Do Large Language Models Reason Causally Like Us? Even Better? | Hanna M. Dettki et.al. | 2502.10215 | null |
2025-02-14 | Dynamic Reinforcement Learning for Actors | Katsunari Shibata et.al. | 2502.10200 | null |
2025-02-14 | Reinforcement Learning based Constrained Optimal Control: an Interpretable Reward Design | Jingjie Ni et.al. | 2502.10187 | null |
2025-02-14 | STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning | Mingcong Lei et.al. | 2502.10177 | null |
2025-02-14 | Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model | Bo Ni et.al. | 2502.10173 | null |
2025-02-14 | Modeling biases in binary decision-making within the generalized nonlinear q-voter model | Maciej Doniec et.al. | 2502.10172 | null |
2025-02-14 | Combinatorial Reinforcement Learning with Preference Feedback | Joongkyu Lee et.al. | 2502.10158 | null |
2025-02-14 | Cooperative Multi-Agent Planning with Adaptive Skill Synthesis | Zhiyuan Li et.al. | 2502.10148 | null |
2025-02-14 | Provably Efficient RL under Episode-Wise Safety in Linear CMDPs | Toshinori Kitamura et.al. | 2502.10138 | null |
2025-02-14 | ScamFerret: Detecting Scam Websites Autonomously with Large Language Models | Hiroki Nakano et.al. | 2502.10110 | link |
2025-02-14 | Causal Information Prioritization for Efficient Reinforcement Learning | Hongye Cao et.al. | 2502.10097 | null |
2025-02-14 | Enhancing Patient Acceptance of Robotic Ultrasound through Conversational Virtual Agent and Immersive Visualizations | Tianyu Song et.al. | 2502.10088 | null |
2025-02-14 | Towards Empowerment Gain through Causal Structure Learning in Model-Based RL | Hongye Cao et.al. | 2502.10077 | null |
2025-02-13 | Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs | Siyan Zhao et.al. | 2502.09597 | link |
2025-02-13 | KIMAs: A Configurable Knowledge Integrated Multi-Agent System | Zitao Li et.al. | 2502.09596 | null |
2025-02-13 | Rolling Ahead Diffusion for Traffic Scene Simulation | Yunpeng Liu et.al. | 2502.09587 | null |
2025-02-13 | Learning to Coordinate with Experts | Mohamad H. Danesh et.al. | 2502.09583 | link |
2025-02-13 | Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks | Qian Wan et.al. | 2502.09577 | null |
2025-02-13 | MDCrow: Automating Molecular Dynamics Workflows with Large Language Models | Quintina Campbell et.al. | 2502.09565 | link |
2025-02-13 | EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents | Rui Yang et.al. | 2502.09560 | null |
2025-02-13 | Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages | Shreyan Biswas et.al. | 2502.09532 | null |
2025-02-13 | Exact Leader Estimation: A New Approach for Distributed Differentiation | Rodrigo Aldana-Lopez et.al. | 2502.09529 | null |
2025-02-13 | Forward-backward Contention Resolution Schemes for Fair Rationing | Will Ma et.al. | 2502.09521 | null |
2025-02-13 | Coupled Rendezvous and Docking Maneuver control of satellite using Reinforcement learning-based Adaptive Fixed-Time Sliding Mode Controller | Rakesh Kumar Sahoo et.al. | 2502.09517 | null |
2025-02-13 | Package Bids in Combinatorial Electricity Auctions: Selection, Welfare Losses, and Alternatives | Thomas Hübner et.al. | 2502.09420 | link |
2025-02-13 | Dialectics of antimicrobial peptides I: common mechanisms of offensive and protecting roles of the peptides | Marta V. Volovik et.al. | 2502.09408 | null |
2025-02-13 | Fair Division via Resource Augmentation | Hannaneh Akrami et.al. | 2502.09377 | null |
2025-02-13 | Language Agents as Digital Representatives in Collective Decision-Making | Daniel Jarrett et.al. | 2502.09369 | null |
2025-02-13 | Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning | Daniel Koutas et.al. | 2502.09298 | link |
2025-02-13 | Reliable Conversational Agents under ASP Control that Understand Natural Language | Yankai Zeng et.al. | 2502.09237 | null |
2025-02-13 | Pearce's Characterisation in an Epistemic Domain | Ezgi Iraz Su et.al. | 2502.09221 | null |
2025-02-13 | Mind the Gaps: Logical English, Prolog, and Multi-agent Systems for Autonomous Vehicles | Galileo Sartor et.al. | 2502.09216 | null |
2025-02-13 | Architecture for Simulating Behavior Mode Changes in Norm-Aware Autonomous Agents | Sean Glaze et.al. | 2502.09215 | null |
2025-02-12 | Poly-Autoregressive Prediction for Modeling Interactions | Neerja Thakkar et.al. | 2502.08646 | null |
2025-02-12 | Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs | Mantas Mazeika et.al. | 2502.08640 | null |
2025-02-12 | SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent | Keyeun Lee et.al. | 2502.08599 | link |
2025-02-12 | Learning in Markets with Heterogeneous Agents: Dynamics and Survival of Bayesian vs. No-Regret Learners | David Easley et.al. | 2502.08597 | null |
2025-02-12 | Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks | Ang Li et.al. | 2502.08586 | null |
2025-02-12 | Statistically validated projection of bipartite signed networks | Anna Gallo et.al. | 2502.08567 | null |
2025-02-12 | Human-Centric Foundation Models: Perception, Generation and Agentic Modeling | Shixiang Tang et.al. | 2502.08556 | link |
2025-02-12 | Extreme vulnerability to intruder attacks destabilizes network dynamics | Amirhossein Nazerian et.al. | 2502.08552 | null |
2025-02-12 | Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation | Mahnaz Koupaee et.al. | 2502.08514 | link |
2025-02-12 | Resilient Quantized Consensus in Multi-Hop Relay Networks | Liwei Yuan et.al. | 2502.08455 | null |
2025-02-12 | Non-Monetary Mechanism Design without Distributional Information: Using Scarce Audits Wisely | Yan Dai et.al. | 2502.08412 | null |
2025-02-12 | Towards Principled Multi-Agent Task Agnostic Exploration | Riccardo Zamboni et.al. | 2502.08365 | null |
2025-02-12 | Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems | Yuxin Pan et.al. | 2502.08340 | link |
2025-02-12 | Hierarchical Multi-Agent Framework for Carbon-Efficient Liquid-Cooled Data Center Clusters | Soumyendu Sarkar et.al. | 2502.08337 | null |
2025-02-12 | Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning | Sun Jingbo et.al. | 2502.08336 | null |
2025-02-12 | Decentralised multi-agent coordination for real-time railway traffic management | Leo D'Amato et.al. | 2502.08324 | null |
2025-02-12 | Compromising Honesty and Harmlessness in Language Models via Deception Attacks | Laurène Vaugrante et.al. | 2502.08301 | null |
2025-02-12 | Higher-order Laplacian dynamics on hypergraphs with cooperative and antagonistic interactions | Shaoxuan Cui et.al. | 2502.08276 | null |
2025-02-12 | Principles and Framework for the Operationalisation of Meaningful Human Control over Autonomous Systems | Simeon C. Calvert et.al. | 2502.08255 | null |
2025-02-12 | The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks | Alejandro Cuadron et.al. | 2502.08235 | link |
2025-02-11 | MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces | Loris Gaven et.al. | 2502.07709 | link |
2025-02-11 | Human Decision-making is Susceptible to AI-driven Manipulation | Sahand Sabour et.al. | 2502.07663 | null |
2025-02-11 | Robust-Sorting and Applications to Ulam-Median | Ragesh Jaiswal et.al. | 2502.07653 | null |
2025-02-11 | Distributed Value Decomposition Networks with Networked Agents | Guilherme S. Varela et.al. | 2502.07635 | null |
2025-02-11 | Decision-Making Under Complete Uncertainty: You Will Regret Not Being Greedy | Kristijan Atanasov et.al. | 2502.07593 | null |
2025-02-11 | DMWM: Dual-Mind World Model with Long-Term Imagination | Lingyi Wang et.al. | 2502.07591 | null |
2025-02-11 | Pure |
Bary S. R. Pradelski et.al. | 2502.07585 | null |
2025-02-11 | Genetic evolution of a multi-generational population in the context of interstellar space travels -- Part II: Phenotypic effects of gene expression | Frédéric Marin et.al. | 2502.07559 | null |
2025-02-11 | Unsupervised Translation of Emergent Communication | Ido Levy et.al. | 2502.07552 | null |
2025-02-11 | A Near-optimal, Scalable and Corruption-tolerant Framework for Stochastic Bandits: From Single-Agent to Multi-Agent and Beyond | Zicheng Hu et.al. | 2502.07514 | null |
2025-02-11 | Exploring Word-Representable Temporal Graphs | Duncan Adamson et.al. | 2502.07496 | null |
2025-02-11 | Multi-Agent Collaboration for Multilingual Code Instruction Tuning | Jian Yang et.al. | 2502.07487 | null |
2025-02-11 | On Event-Triggered Resilient Consensus Using Auxiliary Layer | Pushkal Purohit et.al. | 2502.07470 | null |
2025-02-11 | Approximating Human Strategic Reasoning with LLM-Enhanced Recursive Reasoners Leveraging Multi-agent Hypergames | Vince Trencsenyi et.al. | 2502.07443 | null |
2025-02-11 | Coupling Agent-Based Simulations and VR universes: the case of GAMA and Unity | Alexis Drogoul et.al. | 2502.07405 | null |
2025-02-11 | FinRL-DeepSeek: LLM-Infused Risk-Sensitive Reinforcement Learning for Trading Agents | Mostapha Benhenda et.al. | 2502.07393 | link |
2025-02-11 | EvoFlow: Evolving Diverse Agentic Workflows On The Fly | Guibin Zhang et.al. | 2502.07373 | null |
2025-02-11 | KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems | Jusheng Zhang et.al. | 2502.07350 | null |
2025-02-11 | The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study | Fengming Zhu et.al. | 2502.07332 | null |
2025-02-11 | CreAgent: Towards Long-Term Evaluation of Recommender System under Platform-Creator Information Asymmetry | Xiaopeng Ye et.al. | 2502.07307 | link |
2025-02-10 | Visual Agentic AI for Spatial Reasoning with a Dynamic API | Damiano Marsili et.al. | 2502.06787 | null |
2025-02-10 | Towards Internet-Scale Training For Agents | Brandon Trabucco et.al. | 2502.06776 | null |
2025-02-10 | Distributed Constraint-Coupled Optimization: Harnessing ADMM-consensus for robustness | Mohamed Abdelmouamin Messilem et.al. | 2502.06763 | null |
2025-02-10 | Incentivizing Desirable Effort Profiles in Strategic Classification: The Role of Causality and Uncertainty | Valia Efthymiou et.al. | 2502.06749 | null |
2025-02-10 | Institutional Preferences in the Laboratory | Qiankun Zhong et.al. | 2502.06748 | null |
2025-02-10 | Wandering around: A bioinspired approach to visual attention through object motion sensitivity | Giulia D Angelo et.al. | 2502.06747 | link |
2025-02-10 | AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection | Roohan Ahmed Khan et.al. | 2502.06725 | null |
2025-02-10 | Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene | Tai-Yu Pan et.al. | 2502.06682 | null |
2025-02-10 | Quantile Multi-Armed Bandits with 1-bit Feedback | Ivan Lau et.al. | 2502.06678 | null |
2025-02-10 | Unbiased Evaluation of Large Language Models from a Causal Perspective | Meilin Chen et.al. | 2502.06655 | null |
2025-02-10 | Enhancing healthcare infrastructure resilience through agent-based simulation methods | David Carramiñana et.al. | 2502.06636 | null |
2025-02-10 | Hinderance of cooperation by individual solutions: Evolutionary dynamics of three-strategy games combining the prisoner's dilemma and stag hunt | Hirofumi Takesue et.al. | 2502.06624 | null |
2025-02-10 | Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training | Yuchen Zhuang et.al. | 2502.06589 | null |
2025-02-10 | Network Creation Games with 2-Neighborhood Maximization | Merlin de la Haye et.al. | 2502.06561 | null |
2025-02-10 | Marginal Mechanisms For Balanced Exchange | Vikram Manjunath et.al. | 2502.06499 | null |
2025-02-10 | Utilitarian Distortion with Predictions | Aris Filos-Ratsikas et.al. | 2502.06489 | null |
2025-02-10 | KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment | Yuxing Lu et.al. | 2502.06472 | link |
2025-02-10 | A Quadratic Lower Bound for Stable Roommates Solvability | Will Rosenbaum et.al. | 2502.06464 | null |
2025-02-10 | SIGMA: Sheaf-Informed Geometric Multi-Agent Pathfinding | Shuhao Liao et.al. | 2502.06440 | null |
2025-02-10 | The AI off-switch problem as a signalling game: bounded rationality and incomparability | Alessio benavoli et.al. | 2502.06403 | null |
2025-02-07 | Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray | Yunhang Shen et.al. | 2502.05177 | link |
2025-02-07 | MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison | Kaijie Zhu et.al. | 2502.05174 | null |
2025-02-07 | From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance | Jiamin Xu et.al. | 2502.05145 | link |
2025-02-07 | Maximin Share Guarantees for Few Agents with Subadditive Valuations | George Christodoulou et.al. | 2502.05141 | null |
2025-02-07 | Joint TITE-CRM for Dual Agent Dose Finding Studies | Helen Barnett et.al. | 2502.05072 | null |
2025-02-07 | Exploring the Generalizability of Geomagnetic Navigation: A Deep Reinforcement Learning approach with Policy Distillation | Wenqi Bai et.al. | 2502.05069 | null |
2025-02-07 | nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow | Geliang Ouyang et.al. | 2502.05036 | link |
2025-02-07 | Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency | Qixin Zhang et.al. | 2502.05028 | null |
2025-02-07 | Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning | Tristan K. Schuler et.al. | 2502.05014 | null |
2025-02-07 | The Rising Threat to Emerging AI-Powered Search Engines | Zeren Luo et.al. | 2502.04951 | null |
2025-02-07 | Aditya Kapoor et.al. | 2502.04864 | null | |
2025-02-07 | Humans Co-exist, So Must Embodied Artificial Agents | Hannah Kuehn et.al. | 2502.04809 | null |
2025-02-07 | Unified description of viscous, viscoelastic, or elastic thin active films on substrates | Henning Reinken et.al. | 2502.04802 | null |
2025-02-07 | S |
Yuting Zeng et.al. | 2502.04790 | null |
2025-02-07 | A non-zero-sum game with reinforcement learning under mean-variance framework | Junyi Guo et.al. | 2502.04788 | null |
2025-02-07 | SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning | Wanjia Zhao et.al. | 2502.04780 | link |
2025-02-07 | An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks | George Papadopoulos et.al. | 2502.04773 | link |
2025-02-07 | Shapley Value Approximation Based on k-Additive Games | Guilherme Dean Pelegrina et.al. | 2502.04763 | null |
2025-02-07 | Every Software as an Agent: Blueprint and Case Study | Mengwei Xu et.al. | 2502.04747 | null |
2025-02-07 | Multi-Agent Coverage Control in Non-Convex Annulus Region with Conformal Mapping | Xun Feng et.al. | 2502.04697 | null |
2025-02-06 | ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization | Yinjie Wang et.al. | 2502.04306 | link |
2025-02-06 | Mutual Multilinearity of Nonequilibrium Network Currents | Sara Dal Cengio et.al. | 2502.04298 | null |
2025-02-06 | DECAF: Learning to be Fair in Multi-agent Resource Allocation | Ashwin Kumar et.al. | 2502.04281 | null |
2025-02-06 | Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study | Michael Walters et.al. | 2502.04249 | null |
2025-02-06 | Multi-agent Architecture Search via Agentic Supernet | Guibin Zhang et.al. | 2502.04180 | null |
2025-02-06 | Dense Fixed-Wing Swarming using Receding-Horizon NMPC | Varun Madabushi et.al. | 2502.04174 | null |
2025-02-06 | Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | Wesley A. Suttle et.al. | 2502.04141 | null |
2025-02-06 | Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation | Jiahao Lu et.al. | 2502.04139 | null |
2025-02-06 | VTutor: An Open-Source SDK for Generative AI-Powered Animated Pedagogical Agents with Multi-Media Output | Eason Chen et.al. | 2502.04103 | null |
2025-02-06 | Strategic Learning with Local Explanations as Feedback | Kiet Q. H. Vo et.al. | 2502.04058 | null |
2025-02-06 | Simulating the Emergence of Differential Case Marking with Communicating Neural-Network Agents | Yuchen Lian et.al. | 2502.04038 | null |
2025-02-06 | Deep Meta Coordination Graphs for Multi-agent Reinforcement Learning | Nikunj Gupta et.al. | 2502.04028 | link |
2025-02-06 | Near-optimal Regret Using Policy Optimization in Online MDPs with Aggregate Bandit Feedback | Tal Lancewicki et.al. | 2502.04004 | null |
2025-02-06 | Fairness Aware Reinforcement Learning via Proximal Policy Optimization | Gabriele La Malfa et.al. | 2502.03953 | null |
2025-02-06 | Enhancing Online Learning Efficiency Through Heterogeneous Resource Integration with a Multi-Agent RAG System | Devansh Srivastav et.al. | 2502.03948 | null |
2025-02-06 | Geometric Stabilization of Virtual Nonlinear Nonholonomic Constraints | Efstratios Stratoglou et.al. | 2502.03902 | null |
2025-02-06 | Any theory that admits a Wigner's Friend type multi-agent paradox is logically contextual | Nuriya Nurgalieva et.al. | 2502.03874 | null |
2025-02-06 | PAGNet: Pluggable Adaptive Generative Networks for Information Completion in Multi-Agent Communication | Zhuohui Zhang et.al. | 2502.03845 | null |
2025-02-06 | PsyPlay: Personality-Infused Role-Playing Conversational Agents | Tao Yang et.al. | 2502.03821 | null |
2025-02-06 | Large Language Models for Multi-Robot Systems: A Survey | Peihan Li et.al. | 2502.03814 | null |
2025-02-05 | A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) | Yiye Chen et.al. | 2502.03450 | null |
2025-02-05 | Prediction of the Most Fire-Sensitive Point in Building Structures with Differentiable Agents for Thermal Simulators | Yuan Xinjie et.al. | 2502.03424 | null |
2025-02-05 | Energy-Efficient Flying LoRa Gateways: A Multi-Agent Reinforcement Learning Approach | Abdullahi Isa Ahmed et.al. | 2502.03377 | null |
2025-02-05 | Learning from Active Human Involvement through Proxy Value Propagation | Zhenghao Peng et.al. | 2502.03369 | null |
2025-02-05 | PalimpChat: Declarative and Interactive AI analytics | Chunwei Liu et.al. | 2502.03368 | null |
2025-02-05 | Inverse Mixed Strategy Games with Generative Trajectory Models | Max Muchen Sun et.al. | 2502.03356 | null |
2025-02-05 | Implicit Communication in Human-Robot Collaborative Transport | Elvin Yang et.al. | 2502.03346 | link |
2025-02-05 | Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes | Haotian Wu et.al. | 2502.03335 | null |
2025-02-05 | SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs | Ben Liu et.al. | 2502.03283 | null |
2025-02-05 | Modeling and Optimization of Insulin Injection for Type-1 Diabetes Mellitus Management | Rinrada Jadsadaphongphaibool et.al. | 2502.03269 | null |
2025-02-05 | iVISPAR -- An Interactive Visual-Spatial Reasoning Benchmark for VLMs | Julius Mayer et.al. | 2502.03214 | link |
2025-02-05 | MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent | Xinyao Liao et.al. | 2502.03207 | null |
2025-02-05 | Cooperative Behavior in Pre-State Societies: An Agent-Based Approach of the Axum Civilization | Riccardo Vasellini et.al. | 2502.03191 | null |
2025-02-05 | Strategizing with AI: Insights from a Beauty Contest Experiment | Iuliia Alekseenko et.al. | 2502.03158 | null |
2025-02-05 | Group Trip Planning Query Problem with Multimodal Journey | Dildar Ali et.al. | 2502.03144 | null |
2025-02-05 | Underwater Soft Fin Flapping Motion with Deep Neural Network Based Surrogate Model | Yuya Hamamatsu et.al. | 2502.03135 | link |
2025-02-05 | Double Distillation Network for Multi-Agent Reinforcement Learning | Yang Zhou et.al. | 2502.03125 | null |
2025-02-05 | Cooperation, satisfaction, and rationality in social games on complex networks with aspiration-driven players | M. Aguilar-Janita et.al. | 2502.03109 | null |
2025-02-05 | Learning Efficient Flocking Control based on Gibbs Random Fields | Dengyu Zhang et.al. | 2502.02984 | null |
2025-02-05 | FedMobileAgent: Training Mobile Agents Using Decentralized Self-Sourced Data from Diverse Users | Wenhao Wang et.al. | 2502.02982 | null |
2025-02-04 | QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search | Zongyu Lin et.al. | 2502.02584 | link |
2025-02-04 | Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents | Shayan Kiyani et.al. | 2502.02561 | null |
2025-02-04 | AAD-DCE: An Aggregated Multimodal Attention Mechanism for Early and Late Dynamic Contrast Enhanced Prostate MRI Synthesis | Divya Bharti et.al. | 2502.02555 | link |
2025-02-04 | Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks | Huiqun Huang et.al. | 2502.02537 | null |
2025-02-04 | Adaptive Self-improvement LLM Agentic System for ML Library Development | Genghan Zhang et.al. | 2502.02534 | link |
2025-02-04 | Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies | Han Zhou et.al. | 2502.02533 | null |
2025-02-04 | Why human-AI relationships need socioaffective alignment | Hannah Rose Kirk et.al. | 2502.02528 | null |
2025-02-04 | The Cost Perspective of Liquid Democracy: Feasibility and Control | Shiri Alouf-Heffetz et.al. | 2502.02380 | null |
2025-02-04 | Mirai: A Wearable Proactive AI "Inner-Voice" for Contextual Nudging | Cathy Mengying Fang et.al. | 2502.02370 | null |
2025-02-04 | MAGNNET: Multi-Agent Graph Neural Network-based Efficient Task Allocation for Autonomous Vehicles with Deep Reinforcement Learning | Lavanya Ratnabala et.al. | 2502.02311 | null |
2025-02-04 | Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement Learning Control | Donghe Chen et.al. | 2502.02265 | null |
2025-02-04 | An altruistic resource-sharing mechanism for synchronization: The energy-speed-accuracy tradeoff | Dongliang Zhang et.al. | 2502.02242 | null |
2025-02-04 | The Induced Matching Distance: A Novel Topological Metric with Applications in Robotics | Javier Perera-Lago et.al. | 2502.02112 | link |
2025-02-04 | Sequential Multi-objective Multi-agent Reinforcement Learning Approach for Predictive Maintenance | Yan Chen et.al. | 2502.02071 | null |
2025-02-04 | AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement | Shivam Singh et.al. | 2502.02067 | link |
2025-02-04 | Anticipate & Act : Integrating LLMs and Classical Planning for Efficient Task Execution in Household Environments | Raghav Arora et.al. | 2502.02066 | null |
2025-02-04 | CH-MARL: Constrained Hierarchical Multiagent Reinforcement Learning for Sustainable Maritime Logistics | Saad Alqithami et.al. | 2502.02060 | null |
2025-02-04 | RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation | Minwoo Kim et.al. | 2502.02054 | null |
2025-02-04 | Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer | Yaodong Yang et.al. | 2502.02018 | link |
2025-02-04 | The Wisdom of Intellectually Humble Networks | Mohammad Ratul Mahjabin et.al. | 2502.02015 | link |
2025-01-31 | Vintix: Action Model via In-Context Reinforcement Learning | Andrey Polubarov et.al. | 2501.19400 | link |
2025-01-31 | Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game | Mustafa O. Karabag et.al. | 2501.19398 | link |
2025-01-31 | Learning Contracts in Hierarchical Multi-Agent Systems | Antoine Scheid et.al. | 2501.19388 | null |
2025-01-31 | The Physics and Metaphysics of Social Powers: Bridging Cognitive Processing and Social Dynamics, a New Perspective on Power through Active Inference | Mahault Albarracin et.al. | 2501.19368 | null |
2025-01-31 | PixelWorld: Towards Perceiving Everything as Pixels | Zhiheng Lyu et.al. | 2501.19339 | null |
2025-01-31 | MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems | Anirudh Chari et.al. | 2501.19318 | null |
2025-01-31 | Objective Metrics for Human-Subjects Evaluation in Explainable Reinforcement Learning | Balint Gyevnar et.al. | 2501.19256 | null |
2025-02-03 | SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments | Hüseyin Aydın et.al. | 2501.19245 | link |
2025-01-31 | Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics | Xingyu Wang et.al. | 2501.19239 | null |
2025-01-31 | A parallelizable variant of HCA* | Sreenivasan Ganti et.al. | 2501.19218 | null |
2025-01-31 | An Empirical Game-Theoretic Analysis of Autonomous Cyber-Defence Agents | Gregory Palmer et.al. | 2501.19206 | null |
2025-01-31 | Autonomous Legacy Web Application Upgrades Using a Multi-Agent System | Valtteri Ala-Salmi et.al. | 2501.19204 | link |
2025-01-31 | A Comunication Framework for Compositional Generation | Rafael Elberg et.al. | 2501.19182 | null |
2025-01-31 | Augmented Intelligence for Multimodal Virtual Biopsy in Breast Cancer Using Generative Artificial Intelligence | Aurora Rofena et.al. | 2501.19176 | null |
2025-01-31 | Implications of zero-growth economics analysed with an agent-based model | Dylan C. Terry-Doyle et.al. | 2501.19168 | null |
2025-01-31 | Test-Time Training Scaling for Chemical Exploration in Drug Design | Morgan Thomas et.al. | 2501.19153 | null |
2025-01-31 | Constant-Factor Distortion Mechanisms for |
Haripriya Pulyassary et.al. | 2501.19148 | null |
2025-01-31 | Prediction-Aware Learning in Multi-Agent Systems | Aymeric Capitaine et.al. | 2501.19144 | null |
2025-01-31 | Imitation Game for Adversarial Disillusion with Multimodal Generative Chain-of-Thought Role-Play | Ching-Chun Chang et.al. | 2501.19143 | null |
2025-01-31 | Shaping Sparse Rewards in Reinforcement Learning: A Semi-supervised Approach | Wenyun Li et.al. | 2501.19128 | null |
2025-01-30 | Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method | Peter Baile Chen et.al. | 2501.18539 | null |
2025-01-30 | Design and Validation of Learning Aware HMI For Learning-Enabled Increasingly Autonomous Systems | Parth Ganeriwala et.al. | 2501.18506 | null |
2025-01-30 | Graph Exploration with Edge Weight Estimates | Matthias Gehnen et.al. | 2501.18496 | null |
2025-01-30 | Conversation Games and a Strategic View of the Turing Test | Kaveh Aryan et.al. | 2501.18455 | null |
2025-01-30 | Stable Marriage: Loyalty vs. Competition | Amit Ronen et.al. | 2501.18442 | null |
2025-01-30 | Gravity-Bench-v1: A Benchmark on Gravitational Physics Discovery for Agents | Nolan Koblischke et.al. | 2501.18411 | null |
2025-01-30 | Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach | Tianpeng Pan et.al. | 2501.18320 | null |
2025-01-30 | Model-Free RL Agents Demonstrate System 1-Like Intentionality | Hal Ashton et.al. | 2501.18299 | null |
2025-01-30 | CueTip: An Interactive and Explainable Physics-aware Pool Assistant | Sean Memery et.al. | 2501.18291 | null |
2025-01-30 | Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents | ShuiDe Wen et.al. | 2501.18190 | null |
2025-01-30 | Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation | Teddy Lazebnik et.al. | 2501.18177 | null |
2025-01-30 | RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing | Jinyao Guo et.al. | 2501.18160 | null |
2025-01-30 | Model Checking for Multi-Agent Systems Modeled By Epistemic Process Calculus | Qixian Yu et.al. | 2501.18155 | null |
2025-01-30 | Utilizing API Response for Test Refinement | Devika Sondhi et.al. | 2501.18145 | null |
2025-01-30 | B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning | Woojun Kim et.al. | 2501.18138 | null |
2025-01-30 | DCatalyst: A Unified Accelerated Framework for Decentralized Optimization | Tianyu Cao et.al. | 2501.18114 | null |
2025-01-29 | Joint Pricing and Resource Allocation: An Optimal Online-Learning Approach | Jianyu Xu et.al. | 2501.18049 | null |
2025-01-29 | A Case Study in Acceleration AI Ethics: The TELUS GenAI Conversational Agent | James Brusseau et.al. | 2501.18038 | null |
2025-01-29 | Large Language Models Think Too Fast To Explore Effectively | Lan Pan et.al. | 2501.18009 | null |
2025-01-29 | Agentic Workflows for Conversational Human-AI Interaction Design | Arthur Caetano et.al. | 2501.18002 | null |
2025-01-29 | From Sparse to Dense: Toddler-inspired Reward Transition in Goal-Oriented Reinforcement Learning | Junseok Park et.al. | 2501.17842 | null |
2025-01-29 | A note on the Cucker-Smale model with time delay and communication failures | Elisa Continelli et.al. | 2501.17743 | null |
2025-01-29 | RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts | Eujeong Choi et.al. | 2501.17715 | link |
2025-01-29 | Inferring Implicit Goals Across Differing Task Models | Silvia Tulli et.al. | 2501.17704 | null |
2025-01-29 | CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization | Derui Wang et.al. | 2501.17667 | link |
2025-01-29 | Multi-Agent Path Finding Using Conflict-Based Search and Structural-Semantic Topometric Maps | Scott Fredriksson et.al. | 2501.17661 | null |
2025-01-29 | Coalitional control: a bottom-up approach | Filiberto Fele et.al. | 2501.17614 | null |
2025-01-29 | Coalitional model predictive control of an irrigation canal | Filiberto Fele et.al. | 2501.17561 | null |
2025-01-29 | Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant | Gaole He et.al. | 2501.17546 | link |
2025-01-29 | Sequential Learning of the Pareto Front for Multi-objective Bandits | Elise Crépon et.al. | 2501.17513 | link |
2025-01-29 | Monetary-Fiscal Interaction and the Liquidity of Government Debt | Cristiano Cantore et.al. | 2501.17458 | null |
2025-01-29 | Human-Aligned Skill Discovery: Balancing Behaviour Exploration and Alignment | Maxence Hussonnois et.al. | 2501.17431 | null |
2025-01-29 | Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models | Yuxuan Li et.al. | 2501.17420 | null |
2025-01-29 | General Scene Adaptation for Vision-and-Language Navigation | Haodong Hong et.al. | 2501.17403 | link |
2025-01-29 | Optimal Utility Design with Arbitrary Information Networks | Vartika Singh et.al. | 2501.17385 | null |
2025-01-29 | A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning | Zhengpeng Xie et.al. | 2501.17384 | null |
2025-01-28 | Anomaly Detection in Cooperative Vehicle Perception Systems under Imperfect Communication | Ashish Bastola et.al. | 2501.17329 | null |
2025-01-28 | A sketch of an AI control safety case | Tomek Korbak et.al. | 2501.17315 | null |
2025-01-28 | Controlling AI Agent Participation in Group Conversations: A Human-Centered Approach | Stephanie Houde et.al. | 2501.17258 | null |
2025-01-28 | Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning | Rémy Hosseinkhan Boucher et.al. | 2501.17115 | null |
2025-01-28 | CRSet: Non-Interactive Verifiable Credential Revocation with Metadata Privacy for Issuers and Everyone Else | Felix Hoops et.al. | 2501.17089 | null |
2025-01-28 | Learning Mean Field Control on Sparse Graphs | Christian Fabian et.al. | 2501.17079 | null |
2025-01-28 | Induced Modularity and Community Detection for Functionally Interpretable Reinforcement Learning | Anna Soligo et.al. | 2501.17077 | null |
2025-01-28 | Context is Key in Agent Security | Lillian Tsai et.al. | 2501.17070 | null |
2025-01-28 | Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework | Longzhong Lin et.al. | 2501.17015 | null |
2025-01-28 | Towards Open-Source and Modular Space Systems with ATMOS | Pedro Roque et.al. | 2501.16973 | null |
2025-01-28 | Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning | Xi Chen et.al. | 2501.16966 | null |
2025-01-28 | ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations | Xinyi Ni et.al. | 2501.16945 | null |
2025-01-28 | Beyond Human Intervention: Algorithmic Collusion through Multi-Agent Learning Strategies | Suzie Grondin et.al. | 2501.16935 | null |
2025-01-28 | Optimization and Learning in Open Multi-Agent Systems | Diego Deplano et.al. | 2501.16847 | null |
2025-01-28 | RG-Attn: Radian Glue Attention for Multi-modality Multi-agent Cooperative Perception | Lantao Li et.al. | 2501.16803 | null |
2025-01-28 | A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process | Jack David Carson et.al. | 2501.16783 | null |
2025-01-28 | Target-driven Self-Distillation for Partial Observed Trajectories Forecasting | Pengfei Zhu et.al. | 2501.16767 | null |
2025-01-28 | Quantum advantage in decentralized control of POMDPs: A control-theoretic view of the Mermin-Peres square | Venkat Anantharam et.al. | 2501.16690 | null |
2025-01-28 | MACI: Multi-Agent Collaborative Intelligence for Robust Reasoning and Temporal Planning | Edward Y. Chang et.al. | 2501.16689 | null |
2025-01-28 | Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting | Li Yin et.al. | 2501.16673 | link |
2025-01-28 | Jupybara: Operationalizing a Design Space for Actionable Data Analysis and Storytelling with LLMs | Huichen Will Wang et.al. | 2501.16661 | null |
2025-01-28 | Large Language Model Critics for Execution-Free Evaluation of Code Changes | Aashish Yadavally et.al. | 2501.16655 | link |
2025-01-28 | More Efficient Sybil Detection Mechanisms Leveraging Resistance of Users to Attack Requests | Ali Safarpoor Dehkordi et.al. | 2501.16624 | link |
2025-01-27 | LUCY: Linguistic Understanding and Control Yielding Early Stage of Her | Heting Gao et.al. | 2501.16327 | link |
2025-01-27 | Privacy-aware Nash Equilibrium Synthesis with Partially Ordered LTL |
Caleb Probine et.al. | 2501.16307 | null |
2025-01-27 | Multi-Agent Geospatial Copilots for Remote Sensing Workflows | Chaehong Lee et.al. | 2501.16254 | null |
2025-01-27 | Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma | Richard Willis et.al. | 2501.16173 | link |
2025-01-27 | AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants | Pascal J. Sager et.al. | 2501.16150 | null |
2025-01-27 | Quantifying the Self-Interest Level of Markov Social Dilemmas | Richard Willis et.al. | 2501.16138 | null |
2025-01-27 | Multi-Agent Meta-Offline Reinforcement Learning for Timely UAV Path Planning and Data Collection | Eslam Eldeeb et.al. | 2501.16098 | null |
2025-01-27 | Galaxy Era: Agent-based Simulation of Execution Tickets | Pascal Stichler et.al. | 2501.16090 | link |
2025-01-27 | Value-oriented forecast reconciliation for renewables in electricity markets | Honglin Wen et.al. | 2501.16086 | null |
2025-01-27 | Generating Spatial Synthetic Populations Using Wasserstein Generative Adversarial Network: A Case Study with EU-SILC Data for Helsinki and Thessaloniki | Vanja Falck et.al. | 2501.16080 | null |
2025-01-27 | Translating and evaluating single-cell Boolean network interventions in the multiscale setting | John Metzcar et.al. | 2501.16052 | link |
2025-01-27 | Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting | Ahmed Ben Yahmed et.al. | 2501.16018 | null |
2025-01-27 | Modeling and stability analysis of live systems with time-varying dimension | Andrii Mironchenko et.al. | 2501.15991 | null |
2025-01-27 | Online Housing Market | Julien Lesca et.al. | 2501.15916 | null |
2025-01-27 | Explaining Facial Expression Recognition | Sanjeev Nahulanthran et.al. | 2501.15864 | null |
2025-01-27 | LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models | Yuewen Mei et.al. | 2501.15850 | null |
2025-01-27 | The Strong Core of Housing Markets with Partial Order Preferences | Ildikó Schlotter et.al. | 2501.15834 | null |
2025-01-27 | MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer | Qi Chen et.al. | 2501.15826 | null |
2025-01-27 | Adaptive AI-based Decentralized Resource Management in the Cloud-Edge Continuum | Lanpei Li et.al. | 2501.15802 | null |
2025-01-27 | Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs | Yu Li et.al. | 2501.15791 | link |
2025-01-24 | An Attentive Graph Agent for Topology-Adaptive Cyber Defence | Ilya Orson Sandoval et.al. | 2501.14700 | link |
2025-01-24 | The Division of Surplus and the Burden of Proof | Deniz Kattwinkel et.al. | 2501.14686 | null |
2025-01-24 | MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications | Yixing Jiang et.al. | 2501.14654 | link |
2025-01-24 | Whisper D-SGD: Correlated Noise Across Agents for Differentially Private Decentralized Learning | Angelo Rodio et.al. | 2501.14644 | link |
2025-01-24 | Fair Division Beyond Monotone Valuations | Siddharth Barman et.al. | 2501.14609 | null |
2025-01-24 | Hybrid Quantum-Classical Multi-Agent Pathfinding | Thore Gerlach et.al. | 2501.14568 | null |
2025-01-24 | Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation | Wenzhang Liu et.al. | 2501.14543 | link |
2025-01-24 | Breaking the Pre-Planning Barrier: Real-Time Adaptive Coordination of Mission and Charging UAVs Using Graph Reinforcement Learning | Yuhan Hu et.al. | 2501.14488 | null |
2025-01-24 | Avoiding Overfitting in Variable-Order Markov Models: a Cross-Validation Approach | Valeria Secchini et.al. | 2501.14476 | null |
2025-01-24 | The Pseudo-Dimension of Contracts | Paul Duetting et.al. | 2501.14474 | null |
2025-01-24 | MARL-OT: Multi-Agent Reinforcement Learning Guided Online Fuzzing to Detect Safety Violation in Autonomous Driving Systems | Linfeng Liang et.al. | 2501.14451 | null |
2025-01-24 | Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent | Lucía Güitta-López et.al. | 2501.14443 | null |
2025-01-24 | DeepFlow: Serverless Large Language Model Serving at Scale | Junhao Hu et.al. | 2501.14417 | null |
2025-01-24 | DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing | Xinyu Ma et.al. | 2501.14371 | link |
2025-01-24 | Online Inverse Linear Optimization: Improved Regret Bound, Robustness to Suboptimality, and Toward Tight Regret Analysis | Shinsaku Sakaue et.al. | 2501.14349 | null |
2025-01-24 | Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts | Clément Desroches et.al. | 2501.14334 | null |
2025-01-24 | MASTER: A Multi-Agent System with LLM Specialized MCTS | Bingzheng Gan et.al. | 2501.14304 | null |
2025-01-24 | TrajFlow: A Generative Framework for Occupancy Density Estimation Using Normalizing Flows | Mitch Kosieradzki et.al. | 2501.14266 | link |
2025-01-24 | Non-selective evaporation mechanism of binary aerosol generating agent on porous atomizer and its experimental verification | Xie Guoyong et.al. | 2501.14262 | null |
2025-01-24 | Optimal Investment under Mutual Strategy Influence among Agents | Huisheng Wang et.al. | 2501.14259 | null |
2025-01-23 | GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration | Yue Fan et.al. | 2501.13896 | null |
2025-01-23 | Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning | Matyáš Lorenc et.al. | 2501.13883 | link |
2025-01-23 | Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems | Ethan Wilson et.al. | 2501.13878 | null |
2025-01-23 | EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents | Yuhui Yun et.al. | 2501.13746 | null |
2025-01-23 | Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System | Haikuo Du et.al. | 2501.13727 | link |
2025-01-23 | A Non-Parametric Approach to Heterogeneity Analysis | Avner Seror et.al. | 2501.13721 | null |
2025-01-23 | Revisiting Online Learning Approach to Inverse Linear Optimization: A Fenchel--Young Loss Perspective and Gap-Dependent Regret Analysis | Shinsaku Sakaue et.al. | 2501.13648 | null |
2025-01-23 | WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control | Claire Bizon Monroc et.al. | 2501.13592 | link |
2025-01-23 | Explainable AI-aided Feature Selection and Model Reduction for DRL-based V2X Resource Allocation | Nasir Khan et.al. | 2501.13552 | null |
2025-01-23 | Towards a Theory of AI Personhood | Francis Rhys Ward et.al. | 2501.13533 | null |
2025-01-23 | Communication-Efficient Stochastic Distributed Learning | Xiaoxing Ren et.al. | 2501.13516 | null |
2025-01-23 | A Polynomial-Time Algorithm for EFX Orientations of Chores | Kevin Hsu et.al. | 2501.13481 | null |
2025-01-23 | Knowledge-Informed Multi-Agent Trajectory Prediction at Signalized Intersections for Infrastructure-to-Everything | Huilin Yin et.al. | 2501.13461 | null |
2025-01-23 | BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch | Yulong Hu et.al. | 2501.13448 | null |
2025-01-23 | VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework | He Kong et.al. | 2501.13411 | link |
2025-01-23 | Concurrent Learning with Aggregated States via Randomized Least Squares Value Iteration | Yan Chen et.al. | 2501.13394 | null |
2025-01-23 | Do as We Do, Not as You Think: the Conformity of Large Language Models | Zhiyuan Weng et.al. | 2501.13381 | link |
2025-01-23 | Task Allocation in Customer-led Two-sided Markets with Satellite Constellation Services | Jianglin Qiao et.al. | 2501.13364 | null |
2025-01-23 | AgentRec: Agent Recommendation Using Sentence Embeddings Aligned to Human Feedback | Joshua Park et.al. | 2501.13333 | link |
2025-01-23 | Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents | Shrinidhi Kumbhar et.al. | 2501.13299 | null |
2025-01-22 | Boosting MCTS with Free Energy Minimization | Mawaba Pascal Dao et.al. | 2501.13083 | null |
2025-01-22 | Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment | Melissa Kazemi Rad et.al. | 2501.13080 | null |
2025-01-22 | Evolution and The Knightian Blindspot of Machine Learning | Joel Lehman et.al. | 2501.13075 | null |
2025-01-22 | Optimizing Return Distributions with Distributional Dynamic Programming | Bernardo Ávila Pires et.al. | 2501.13028 | null |
2025-01-22 | The regret lower bound for communicating Markov Decision Processes | Victor Boone et.al. | 2501.13013 | null |
2025-01-22 | MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking | Sebastian Farquhar et.al. | 2501.13011 | null |
2025-01-22 | Constructive characterisations of the must-preorder for asynchrony | Giovanni Bernardi et.al. | 2501.13002 | null |
2025-01-22 | An Offline Multi-Agent Reinforcement Learning Framework for Radio Resource Management | Eslam Eldeeb et.al. | 2501.12991 | null |
2025-01-22 | Learning-based Distributed Model Predictive Control using Multi-Agent Bayesian Optimization | Hossein Nejatbakhsh Esfahani et.al. | 2501.12989 | null |
2025-01-22 | Quantification of Ultrafast Nonlinear Photothermal and Photoacoustic Effects in Molecular Thin Films via Time-Domain Brillouin Scattering | Valentin Cherruault et.al. | 2501.12912 | null |
2025-01-22 | FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces | Zhenran Xu et.al. | 2501.12909 | null |
2025-01-22 | Mutation-Guided LLM-based Test Generation at Meta | Christopher Foster et.al. | 2501.12862 | null |
2025-01-22 | ACEBench: Who Wins the Match Point in Tool Learning? | Chen Chen et.al. | 2501.12851 | null |
2025-01-22 | To Measure or Not: A Cost-Sensitive, Selective Measuring Environment for Agricultural Management Decisions with Reinforcement Learning | Hilmy Baja et.al. | 2501.12823 | link |
2025-01-22 | PSGSL: A Probabilistic Framework Integrating Semantic Scene Understanding and Gas Sensing for Gas Source Localization | Pepe Ojeda et.al. | 2501.12812 | null |
2025-01-22 | Information Design for Adaptive Organizations | Wataru Tamura et.al. | 2501.12669 | null |
2025-01-22 | NBDI: A Simple and Efficient Termination Condition for Skill Extraction from Task-Agnostic Demonstrations | Myunsoo Kim et.al. | 2501.12668 | null |
2025-01-22 | Optimal Rebate Design: Incentives, Competition and Efficiency in Auction Markets | Thibaut Mastrolia et.al. | 2501.12591 | null |
2025-01-22 | Leveraging LLMs to Create a Haptic Devices' Recommendation System | Yang Liu et.al. | 2501.12573 | null |
2025-01-21 | Reinforcement Learning Constrained Beam Search for Parameter Optimization of Paper Drying Under Flexible Constraints | Siyuan Chen et.al. | 2501.12542 | null |
2025-01-21 | Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists | Thomas F. Eisenmann et.al. | 2501.12374 | link |
2025-01-21 | UI-TARS: Pioneering Automated GUI Interaction with Native Agents | Yujia Qin et.al. | 2501.12326 | link |
2025-01-21 | Transitions to synchronization in adaptive multilayer networks with higher-order interactions | Richita Ghosh et.al. | 2501.12301 | null |
2025-01-21 | mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework | Bingyi Liu et.al. | 2501.12263 | null |
2025-01-21 | Multi-Agent Feedback Motion Planning using Probably Approximately Correct Nonlinear Model Predictive Control | Mark Gonzales et.al. | 2501.12234 | null |
2025-01-21 | Empower Healthcare through a Self-Sovereign Identity Infrastructure for Secure Electronic Health Data Access | Antonio López Martínez et.al. | 2501.12229 | null |
2025-01-21 | Convergence of time-delayed opinion dynamics with complex interaction types | Lingling Yao et.al. | 2501.12219 | null |
2025-01-21 | RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression | Uri Gadot et.al. | 2501.12216 | null |
2025-01-21 | Experience-replay Innovative Dynamics | Tuo Zhang et.al. | 2501.12199 | null |
2025-01-21 | Opinion dynamics in bounded confidence models with manipulative agents: Moving the Overton window | A. Bautista et.al. | 2501.12198 | null |
2025-01-21 | BotDetect: A Decentralized Federated Learning Framework for Detecting Financial Bots on the EVM Blockchains | Ahmed Mounsf Rafik Bendada et.al. | 2501.12112 | null |
2025-01-21 | Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics | Somnath Hazra et.al. | 2501.12061 | link |
2025-01-21 | Growth model with externalities for energetic transition via MFG with common external variable | Pierre Lavigne et.al. | 2501.11988 | null |
2025-01-21 | Simultaneously decoding the unknown stationary state and function parameters for mean field games | Hongyu Liu et.al. | 2501.11955 | null |
2025-01-21 | GLAM: Global-Local Variation Awareness in Mamba-based World Model | Qian He et.al. | 2501.11949 | null |
2025-01-21 | Equilibria under Dynamic Benchmark Consistency in Non-Stationary Multi-Agent Systems | Ludovico Crippa et.al. | 2501.11897 | null |
2025-01-21 | Connection-Coordination Rapport (CCR) Scale: A Dual-Factor Scale to Measure Human-Robot Rapport | Ting-Han Lin et.al. | 2501.11887 | null |
2025-01-21 | Developing an Agent-Based Mathematical Model for Simulating Post-Irradiation Cellular Response: A Crucial Component of a Digital Twin Framework for Personalized Radiation Treatment | Ruirui Liu et.al. | 2501.11875 | null |
2025-01-21 | LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems | Venkata Sai Aswath Duvvuru et.al. | 2501.11864 | null |
2025-01-21 | EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents | Zhili Cheng et.al. | 2501.11858 | link |
2025-01-17 | Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems | Weibo Gao et.al. | 2501.10332 | null |
2025-01-17 | Towards Human-Guided, Data-Centric LLM Co-Pilots | Evgeny Saveliev et.al. | 2501.10321 | null |
2025-01-17 | Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling | Suvodip Dey et.al. | 2501.10316 | link |
2025-01-17 | Enhancing AI Transparency: XRL-Based Resource Management and RAN Slicing for 6G ORAN Architecture | Suvidha Mhatre et.al. | 2501.10292 | null |
2025-01-17 | Evidence for the gravity-driven and magnetically-regularized gas flows feeding the massive protostellar cluster in Cep A | Panigrahy Sandhyarani et.al. | 2501.10280 | null |
2025-01-17 | Grey-Box Fuzzing in Constrained Ultra-Large Systems: Lessons for SE Community | Jiazhao Yu et.al. | 2501.10269 | null |
2025-01-17 | Deployment of an Aerial Multi-agent System for Automated Task Execution in Large-scale Underground Mining Environments | Niklas Dahlquist et.al. | 2501.10262 | null |
2025-01-17 | Logarithmic Regret for Nonlinear Control | James Wang et.al. | 2501.10261 | null |
2025-01-17 | Secure Semantic Communication With Homomorphic Encryption | Rui Meng et.al. | 2501.10182 | null |
2025-01-17 | PaSa: An LLM Agent for Comprehensive Academic Paper Search | Yichen He et.al. | 2501.10120 | link |
2025-01-17 | GAWM: Global-Aware World Model for Multi-Agent Reinforcement Learning | Zifeng Shi et.al. | 2501.10116 | null |
2025-01-17 | Infrastructure for AI Agents | Alan Chan et.al. | 2501.10114 | null |
2025-01-17 | LLM Reasoner and Automated Planner: A new NPC approach | Israel Puerta-Merino et.al. | 2501.10106 | null |
2025-01-17 | Universal Actions for Enhanced Embodied Foundation Models | Jinliang Zheng et.al. | 2501.10105 | link |
2025-01-17 | A Survey on LLM Test-Time Compute via Search: Tasks, LLM Profiling, Search Algorithms, and Relevant Frameworks | Xinzhe Li et.al. | 2501.10069 | null |
2025-01-17 | Agent-as-Judge for Factual Summarization of Long Narratives | Yeonseok Jeong et.al. | 2501.09993 | link |
2025-01-17 | A Survey on Multi-Turn Interaction Capabilities of Large Language Models | Chen Zhang et.al. | 2501.09959 | null |
2025-01-17 | ForestProtector: An IoT Architecture Integrating Machine Vision and Deep Reinforcement Learning for Efficient Wildfire Monitoring | Kenneth Bonilla-Ormachea et.al. | 2501.09926 | null |
2025-01-17 | Towards A Litmus Test for Common Sense | Hugo Latapie et.al. | 2501.09913 | null |
2025-01-17 | Chatbot apologies: Beyond bullshit | P. D. Magnus et.al. | 2501.09910 | null |
2025-01-16 | CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education | Tianyu Wang et.al. | 2501.09709 | link |
2025-01-16 | The Goofus & Gallant Story Corpus for Practical Value Alignment | Md Sultan Al Nahian et.al. | 2501.09707 | null |
2025-01-16 | Authenticated Delegation and Authorized AI Agents | Tobin South et.al. | 2501.09674 | null |
2025-01-16 | NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes | Nathaniel S. Keplinger et.al. | 2501.09646 | link |
2025-01-16 | Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework | Yushen Lin et.al. | 2501.09631 | null |
2025-01-16 | A Multi-agent System for Hybrid Optimization | Eric S. Fraga et.al. | 2501.09563 | null |
2025-01-16 | Solving the unsolvable: Translating case law in Hong Kong | King-kui Sin et.al. | 2501.09444 | null |
2025-01-16 | ADAGE: A generic two-layer framework for adaptive agent based modelling | Benjamin Patrick Evans et.al. | 2501.09429 | null |
2025-01-16 | AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling | Ancheng Xu et.al. | 2501.09426 | null |
2025-01-16 | Agent-Based Simulation of a Perpetual Futures Market | Ramshreyas Rao et.al. | 2501.09404 | null |
2025-01-16 | The sleeping bacterium: shedding light on the resuscitation mechanism | Eleonora Alfinito et.al. | 2501.09366 | null |
2025-01-16 | YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks | Saptarashmi Bandyopadhyay et.al. | 2501.09355 | null |
2025-01-16 | ChartInsighter: An Approach for Mitigating Hallucination in Time-series Chart Summary Generation with A Benchmark Dataset | Fen Wang et.al. | 2501.09349 | link |
2025-01-16 | Solving Infinite-Player Games with Player-to-Strategy Networks | Carlos Martin et.al. | 2501.09330 | null |
2025-01-16 | On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Zichang Ge et.al. | 2501.09327 | link |
2025-01-16 | SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs | Anbang Ye et.al. | 2501.09316 | null |
2025-01-16 | Interoceptive Robots for Convergent Shared Control in Collaborative Construction Work | Xiaoshan Zhou et.al. | 2501.09290 | link |
2025-01-16 | Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks | Muhammad Ahmed Mohsin et.al. | 2501.09212 | link |
2025-01-15 | Embodied Scene Understanding for Vision Language Models via MetaVQA | Weizhen Wang et.al. | 2501.09167 | null |
2025-01-15 | AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning | Assaf Lahiany et.al. | 2501.09160 | null |
2025-01-15 | Personality Modeling for Persuasion of Misinformation using AI Agent | Qianmin Lou et.al. | 2501.08985 | null |
2025-01-15 | Physical AI Agents: Integrating Cognitive Intelligence with Real-World Action | Fouad Bousetouane et.al. | 2501.08944 | null |
2025-01-15 | A Reinforcement Learning Approach to Quiet and Safe UAM Traffic Management | Surya Murthy et.al. | 2501.08941 | null |
2025-01-15 | Disentangling Exploration of Large Language Models by Optimal Exploitation | Tim Grams et.al. | 2501.08925 | null |
2025-01-15 | Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning | Qinyu Ma et.al. | 2501.08897 | link |
2025-01-15 | Silent Abandonment in Text-Based Contact Centers: Identifying, Quantifying, and Mitigating its Operational Impacts | Antonio Castellanos et.al. | 2501.08869 | null |
2025-01-15 | The geometry of moral decision making | Roland M. Friedrich et.al. | 2501.08865 | null |
2025-01-15 | On the Dominance of Truth-Telling in Gradual Mechanisms | Wenqian Wang et.al. | 2501.08802 | null |
2025-01-15 | Networked Agents in the Dark: Team Value Learning under Partial Observability | Guilherme S. Varela et.al. | 2501.08778 | null |
2025-01-15 | Leveraging LLM Agents for Translating Network Configurations | Yunze Wei et.al. | 2501.08760 | null |
2025-01-15 | Efficient Shape Reconfiguration by Hybrid Programmable Matter | Jonas Friemel et.al. | 2501.08663 | null |
2025-01-15 | Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance | Raúl Arranz et.al. | 2501.08655 | null |
2025-01-15 | Towards Intelligent Active Particles | Hartmut Löwen et.al. | 2501.08632 | null |
2025-01-15 | Neural Risk-sensitive Satisficing in Contextual Bandits | Shogo Ito et.al. | 2501.08612 | null |
2025-01-15 | AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL | Tyler Stennett et.al. | 2501.08600 | null |
2025-01-15 | Effects of taxes, redistribution actions and fiscal evasion on wealth inequality: an agent-based model approach | Iago Nascimento Barros et.al. | 2501.08573 | null |
2025-01-15 | Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation | Jiaxin Guo et.al. | 2501.08523 | null |
2025-01-15 | Ensuring Truthfulness in Distributed Aggregative Optimization | Ziqin Chen et.al. | 2501.08512 | null |
2025-01-14 | Empathetic Conversational Agents: Utilizing Neural and Physiological Signals for Enhanced Empathetic Interactions | Nastaran Saffaryazdi et.al. | 2501.08393 | null |
2025-01-14 | ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations | Ziyuan Huang et.al. | 2501.08324 | null |
2025-01-14 | Using Gamified Experiments to Tame Complexity: the case of the Schelling Model of Segregation | Aleix Nicolás Olivé et.al. | 2501.08280 | null |
2025-01-14 | Addressing the sustainable AI trilemma: a case study on LLM agents and RAG | Hui Wu et.al. | 2501.08262 | null |
2025-01-14 | Engineering LLM Powered Multi-agent Framework for Autonomous CloudOps | Kannan Parthasarathy et.al. | 2501.08243 | null |
2025-01-14 | Dynamic Pricing in High-Speed Railways Using Multi-Agent Reinforcement Learning | Enrique Adrian Villarrubia-Martin et.al. | 2501.08234 | null |
2025-01-14 | ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems | Mohita Chowdhury et.al. | 2501.08208 | null |
2025-01-14 | An Elementary Microscopic Model of Sympatric Speciation | Franco Bagnoli et.al. | 2501.08130 | null |
2025-01-14 | Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving | Guizhe Jin et.al. | 2501.08096 | null |
2025-01-14 | AgentPose: Progressive Distribution Alignment via Feature Agent for Human Pose Distillation | Feng Zhang et.al. | 2501.08088 | null |
2025-01-14 | CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning | Guoliang He et.al. | 2501.08071 | link |
2025-01-14 | Hydrodynamics-driven phase-locking and collective motility of sessile active dumbbells | Urvi Mahendra Bora et.al. | 2501.08065 | null |
2025-01-14 | Cooperative Patrol Routing: Optimizing Urban Crime Surveillance through Multi-Agent Reinforcement Learning | Juan Palma-Borda et.al. | 2501.08020 | link |
2025-01-14 | Decentralized Learning with Approximate Finite-Time Consensus | Aaron Fainman et.al. | 2501.07967 | null |
2025-01-14 | Governing AI Agents | Noam Kolt et.al. | 2501.07913 | null |
2025-01-14 | Flow: A Modular Approach to Automated Agentic Workflow Generation | Boye Niu et.al. | 2501.07834 | null |
2025-01-14 | Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models | Dhruv Dhamani et.al. | 2501.07815 | null |
2025-01-14 | Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering | Feijie Wu et.al. | 2501.07813 | null |
2025-01-14 | CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation | Ruwei Pan et.al. | 2501.07811 | null |
2025-01-14 | Visual Language Models as Operator Agents in the Space Domain | Alejandro Carrasco et.al. | 2501.07802 | null |
2025-01-13 | CBS with Continuous-Time Revisit | Andy Li et.al. | 2501.07744 | null |
2025-01-13 | WebWalker: Benchmarking LLMs in Web Traversal | Jialong Wu et.al. | 2501.07572 | link |
2025-01-13 | SafeSwarm: Decentralized Safe RL for the Swarm of Drones Landing in Dense Crowds | Grik Tadevosyan et.al. | 2501.07566 | null |
2025-01-13 | SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Varun Biyyala et.al. | 2501.07554 | link |
2025-01-13 | Evaluating Agent-based Program Repair at Google | Pat Rondon et.al. | 2501.07531 | null |
2025-01-13 | Improving DeFi Accessibility through Efficient Liquidity Provisioning with Deep Reinforcement Learning | Haonan Xu et.al. | 2501.07508 | null |
2025-01-13 | How low-cost AI universal approximators reshape market efficiency | Paolo Barucca et.al. | 2501.07489 | null |
2025-01-13 | SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM) | Xiang Cheng et.al. | 2501.07459 | link |
2025-01-13 | Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI | Rolf Pfister et.al. | 2501.07458 | null |
2025-01-13 | Online inductive learning from answer sets for efficient reinforcement learning exploration | Celeste Veronese et.al. | 2501.07445 | null |
2025-01-13 | Attention when you need | Lokesh Boominathan et.al. | 2501.07440 | null |
2025-01-13 | Lifelong Learning of Large Language Model based Agents: A Roadmap | Junhao Zheng et.al. | 2501.07278 | link |
2025-01-13 | Multi-face emotion detection for effective Human-Robot Interaction | Mohamed Ala Yahyaoui et.al. | 2501.07213 | null |
2025-01-13 | Combined effect of incentives and coupling in multigames in two-layer networks | Luo-Luo Jiang et.al. | 2501.07193 | null |
2025-01-13 | TIMRL: A Novel Meta-Reinforcement Learning Framework for Non-Stationary and Multi-Task Environments | Chenyang Qi et.al. | 2501.07146 | null |
2025-01-13 | How GPT learns layer by layer | Jason Du et.al. | 2501.07108 | link |
2025-01-13 | PPO-Q: Proximal Policy Optimization with Parametrized Quantum Policies or Values | Yu-Xin Jin et.al. | 2501.07085 | null |
2025-01-13 | PoAct: Policy and Action Dual-Control Agent for Generalized Applications | Guozhi Yuan et.al. | 2501.07054 | null |
2025-01-13 | Differentially Private Kernelized Contextual Bandits | Nikola Pavlovic et.al. | 2501.07046 | null |
2025-01-12 | Learning Implicit Social Navigation Behavior using Deep Inverse Reinforcement Learning | Tribhi Kathuria et.al. | 2501.06946 | null |
2025-01-12 | AdaSlicing: Adaptive Online Network Slicing under Continual Network Dynamics in Open Radio Access Networks | Ming Zhao et.al. | 2501.06943 | null |
2025-01-10 | PEACE: Empowering Geologic Map Holistic Understanding with MLLMs | Yangyu Huang et.al. | 2501.06184 | null |
2025-01-10 | A Mixed-Integer Conic Program for the Multi-Agent Moving-Target Traveling Salesman Problem | Allen George Philip et.al. | 2501.06130 | null |
2025-01-10 | Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation | Guojun Xiong et.al. | 2501.06103 | null |
2025-01-10 | Learning Flexible Heterogeneous Coordination with Capability-Aware Shared Hypernetworks | Kevin Fu et.al. | 2501.06058 | link |
2025-01-10 | Investigating the Impact of Observation Space Design Choices On Training Reinforcement Learning Solutions for Spacecraft Problems | Nathaniel Hamilton et.al. | 2501.06016 | null |
2025-01-10 | Enhanced Acoustic Beamforming with Sub-Aperture Angular Multiply and Sum -- in vivo and in Human Demonstration | Matthieu Toulemonde et.al. | 2501.05837 | null |
2025-01-10 | CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech | Madhurananda Pahar et.al. | 2501.05755 | null |
2025-01-10 | Semantic Mapping in Indoor Embodied AI -- A Comprehensive Survey and Future Directions | Sonia Raychaudhuri et.al. | 2501.05750 | null |
2025-01-10 | How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond | Chen Huang et.al. | 2501.05714 | null |
2025-01-10 | Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains | Vighnesh Subramaniam et.al. | 2501.05707 | null |
2025-01-10 | A Two-timescale Primal-dual Algorithm for Decentralized Optimization with Compression | Haoming Liu et.al. | 2501.05701 | null |
2025-01-10 | Scaling Safe Multi-Agent Control for Signal Temporal Logic Specifications | Joe Eappen et.al. | 2501.05639 | link |
2025-01-09 | Towards Probabilistic Inference of Human Motor Intentions by Assistive Mobile Robots Controlled via a Brain-Computer Interface | Xiaoshan Zhou et.al. | 2501.05610 | null |
2025-01-09 | NSChat: A Chatbot System To Rule Them All | Zenon Lamprou et.al. | 2501.05541 | null |
2025-01-09 | OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? | Yifei Li et.al. | 2501.05510 | link |
2025-01-09 | Strategy Masking: A Method for Guardrails in Value-based Reinforcement Learning Agents | Jonathan Keane et.al. | 2501.05501 | null |
2025-01-09 | Search-o1: Agentic Search-Enhanced Large Reasoning Models | Xiaoxi Li et.al. | 2501.05366 | link |
2025-01-09 | Control of Overpopulated Tails in Kinetic Epidemic Models | Mattia Zanella et.al. | 2501.05365 | null |
2025-01-09 | A Path Variant of the Explorer Director Game on Graphs | Abigail Raz et.al. | 2501.05364 | null |
2025-01-09 | On Corrigibility and Alignment in Multi Agent Games | Edmund Dable-Heath et.al. | 2501.05360 | null |
2025-01-09 | A learning agent-based approach to the characterization of open quantum systems | Lorenzo Fioroni et.al. | 2501.05350 | null |
2025-01-09 | The Bakers and Millers Game with Restricted Locations | Simon Krogmann et.al. | 2501.05334 | null |
2025-01-09 | Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning | Dmytro Kuzmenko et.al. | 2501.05329 | null |
2025-01-09 | Contrast-Free Myocardial Scar Segmentation in Cine MRI using Motion and Texture Fusion | Guang Yang et.al. | 2501.05241 | null |
2025-01-09 | CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness | Shoucheng Song et.al. | 2501.05207 | null |
2025-01-09 | Emergence of human-like polarization among large language model agents | Jinghua Piao et.al. | 2501.05171 | null |
2025-01-09 | Constrained Optimization of Charged Particle Tracking with Multi-Agent Reinforcement Learning | Tobias Kortus et.al. | 2501.05113 | null |
2025-01-09 | LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models | Zengqi Peng et.al. | 2501.05057 | null |
2025-01-09 | ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark | Ronghao Dang et.al. | 2501.05031 | link |
2025-01-09 | CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving | Bhargava Uppuluri et.al. | 2501.04982 | null |
2025-01-08 | RadGPT: Constructing 3D Image-Text Tumor Datasets | Pedro R. A. S. Bassi et.al. | 2501.04678 | link |
2025-01-08 | InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection | Yuhang Liu et.al. | 2501.04575 | link |
2025-01-08 | The importance of being discrete -- An agent-based model for active nematics and more | Mathieu Dedenon et.al. | 2501.04559 | null |
2025-01-08 | Approximately EFX and PO Allocations for Bivalued Chores | Zehan Lin et.al. | 2501.04550 | null |
2025-01-08 | Cyber-Physical Steganography in Robotic Motion Control | Ching-Chun Chang et.al. | 2501.04541 | null |
2025-01-08 | Safe Reinforcement Learning with Minimal Supervision | Alexander Quessy et.al. | 2501.04481 | null |
2025-01-08 | Hybrid Artificial Intelligence Strategies for Drone Navigation | Rubén San-Segundo et.al. | 2501.04472 | null |
2025-01-08 | A Digital Shadow for Modeling, Studying and Preventing Urban Crime | Juan Palma-Borda et.al. | 2501.04435 | null |
2025-01-08 | User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation | Krisztian Balog et.al. | 2501.04410 | null |
2025-01-08 | Agent Laboratory: Using LLM Agents as Research Assistants | Samuel Schmidgall et.al. | 2501.04227 | null |
2025-01-08 | Unattainability of Common Knowledge in Asymmetric Games with Imperfect Information | Fabian Farestam et.al. | 2501.04199 | null |
2025-01-07 | HIVEX: A High-Impact Environment Suite for Multi-Agent Research (extended version) | Philipp D. Siedler et.al. | 2501.04180 | null |
2025-01-07 | Collaborative Spacecraft Servicing under Partial Feedback using Lyapunov-based Deep Neural Networks | Cristian F. Nino et.al. | 2501.04160 | null |
2025-01-07 | Implementing Systemic Thinking for Automatic Schema Matching: An Agent-Based Modeling Approach | Hicham Assoudi et.al. | 2501.04136 | null |
2025-01-07 | Kinetic theory of decentralized learning for smart active matter | Gerhard Jung et.al. | 2501.03948 | null |
2025-01-07 | Implicit Coordination using Active Epistemic Inference | Lauren Bramblett et.al. | 2501.03907 | null |
2025-01-07 | Truthful mechanisms for linear bandit games with private contexts | Yiting Hu et.al. | 2501.03865 | null |
2025-01-07 | Rendezfood: A Design Case Study of a Conversational Location-based Approach in Restaurants | Philip Weber et.al. | 2501.03862 | null |
2025-01-07 | Run-and-tumble chemotaxis using reinforcement learning | Ramesh Pramanik et.al. | 2501.03687 | null |
2025-01-07 | The Textbook of Tomorrow: Rethinking Course Material Interfacing in the Era of GPT | Audrey Olson et.al. | 2501.03618 | null |
2025-01-07 | Distributed Observer for Descriptor Linear System: The Luenberger Observer Method | Shuai Liu et.al. | 2501.03564 | null |
2025-01-07 | Rethinking Adversarial Attacks in Reinforcement Learning from Policy Distribution Perspective | Tianyang Duan et.al. | 2501.03562 | null |
2025-01-07 | FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis | Xiaojiao Xiao et.al. | 2501.03526 | link |
2025-01-07 | A Unified Attack Detection Strategy for Multi-Agent Systems over Transient and Steady Stages | Jinming Gao et.al. | 2501.03496 | null |
2025-01-06 | Designing Telepresence Robots to Support Place Attachment | Yaxin Hu et.al. | 2501.03420 | null |
2025-01-06 | ScaleMAI: Accelerating the Development of Trusted Datasets and AI Models | Wenxuan Li et.al. | 2501.03410 | link |
2025-01-06 | Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | Yuhui Zhang et.al. | 2501.03225 | link |
2025-01-06 | Turn-based Multi-Agent Reinforcement Learning Model Checking | Dennis Gross et.al. | 2501.03187 | null |
2025-01-06 | Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning | Muyun Li et.al. | 2501.03162 | null |
2025-01-06 | Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches | Alhassan Mumuni et.al. | 2501.03151 | null |
2025-01-06 | Probably Correct Optimal Stable Matching for Two-Sided Markets Under Uncertainty | Andreas Athanasopoulos et.al. | 2501.03018 | link |
2025-01-06 | Approximating N-Player Nash Equilibrium through Gradient Descent | Dongge Wang et.al. | 2501.03001 | null |
2025-01-06 | CALM: Curiosity-Driven Auditing for Large Language Models | Xiang Zheng et.al. | 2501.02997 | link |
2025-01-06 | CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems | Chuanbo Hua et.al. | 2501.02977 | link |
2025-01-06 | Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis Perspective | Chuxiong Sun et.al. | 2501.02888 | null |
2025-01-06 | A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation | Toomas Tahves et.al. | 2501.02858 | null |
2025-01-06 | Proteomic Learning of Gamma-Aminobutyric Acid (GABA) Receptor-Mediated Anesthesia | Jian Jiang et.al. | 2501.02824 | link |
2025-01-06 | Enhancing Lifelong Multi-Agent Path Finding with Cache Mechanism | Yimin Tang et.al. | 2501.02803 | null |
2025-01-06 | Gaming on Coincident Peak Shaving: Equilibrium and Strategic Behavior | Liudong Chen et.al. | 2501.02792 | null |
2025-01-06 | Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes | Zijian Wang et.al. | 2501.02774 | null |
2025-01-06 | Multi-Agent Path Finding under Limited Communication Range Constraint via Dynamic Leading | Hoang-Dung Bui et.al. | 2501.02770 | null |
2025-01-06 | Tree-based RAG-Agent Recommendation System: A Case Study in Medical Test Data | Yahe Yang et.al. | 2501.02727 | null |
2025-01-05 | A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model | Shivaram Kalyanakrishnan et.al. | 2501.02652 | null |
2025-01-05 | Slow modulation of the contraction patterns in Physarum polycephalum | Raphael Saiseau et.al. | 2501.02651 | null |
2025-01-05 | LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment | Yifei Liu et.al. | 2501.02621 | null |
2025-01-05 | Back to Base: Towards Hands-Off Learning via Safe Resets with Reach-Avoid Safety Filters | Azra Begzadić et.al. | 2501.02620 | null |
2025-01-03 | QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture | Shvetank Prakash et.al. | 2501.01892 | null |
2025-01-03 | Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification | Xiangxiang Dai et.al. | 2501.01849 | link |
2025-01-03 | MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning | Pu Yang et.al. | 2501.01834 | null |
2025-01-03 | SDPO: Segment-Level Direct Preference Optimization for Social Agents | Aobo Kong et.al. | 2501.01821 | link |
2025-01-03 | Distributed Framework Construction for Affine Formation Control | Huiming Li et.al. | 2501.01817 | null |
2025-01-03 | Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery | Baoru Huang et.al. | 2501.01752 | null |
2025-01-03 | Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning | Gavin B. Rens et.al. | 2501.01727 | null |
2025-01-03 | AgentRefine: Enhancing Agent Generalization through Refinement Tuning | Dayuan Fu et.al. | 2501.01702 | null |
2025-01-03 | The (Exact) Price of Cardinality for Indivisible Goods: A Parametric Perspective | Alexander Lam et.al. | 2501.01660 | null |
2025-01-03 | PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents | Jingoo Lee et.al. | 2501.01594 | null |
2025-01-03 | BLAST: A Stealthy Backdoor Leverage Attack against Cooperative Multi-Agent Deep Reinforcement Learning based Systems | Yinbo Yu et.al. | 2501.01593 | null |
2025-01-02 | Reinforcement-learning-based control of turbulent channel flows at high Reynolds numbers | Zisong Zhou et.al. | 2501.01573 | null |
2025-01-02 | BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery | Kanishk Gandhi et.al. | 2501.01540 | link |
2025-01-02 | In Search of a Lost Metric: Human Empowerment as a Pillar of Socially Conscious Navigation | Vasanth Reddy Baddam et.al. | 2501.01539 | null |
2025-01-02 | Optimal Strategy Revision in Population Games: A Mean Field Game Theory Perspective | Julian Barreiro-Gomez et.al. | 2501.01389 | null |
2025-01-02 | PIMAEX: Multi-Agent Exploration through Peer Incentivization | Michael Kölle et.al. | 2501.01266 | null |
2025-01-02 | Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants | Lixiong Qin et.al. | 2501.01243 | null |
2025-01-02 | From Interaction to Attitude: Exploring the Impact of Human-AI Cooperation on Mental Illness Stigma | Tianqi Song et.al. | 2501.01220 | null |
2025-01-02 | D-HAT: a Diatom-inspired structure for a Helmet concept Against Trauma | Ludovico Musenich et.al. | 2501.01211 | null |
2025-01-02 | Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects | Abdullah Mushtaq et.al. | 2501.01205 | null |
2025-01-02 | 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer | Jiajun Deng et.al. | 2501.01163 | null |
2025-01-02 | A3: Android Agent Arena for Mobile GUI Agents | Yuxiang Chai et.al. | 2501.01149 | null |
2025-01-02 | Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method | Ruichen Zhang et.al. | 2501.01141 | null |
2025-01-02 | Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning | Min Whoo Lee et.al. | 2501.01140 | null |
2025-01-02 | Symmetries-enhanced Multi-Agent Reinforcement Learning | Nikolaos Bousias et.al. | 2501.01136 | null |
2025-01-02 | Regularized Proportional Fairness Mechanism for Resource Allocation Without Money | Sihan Zeng et.al. | 2501.01111 | null |
2025-01-02 | MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model | Chengze Zhang et.al. | 2501.01014 | null |
2025-01-02 | Cyber-physical Defense for Heterogeneous Multi-agent Systems Against Exponentially Unbounded Attacks on Signed Digraphs | Yichao Wang et.al. | 2501.00990 | null |
2025-01-02 | Bootstrapped Reward Shaping | Jacob Adamczyk et.al. | 2501.00989 | null |
2025-01-01 | Non-obvious Manipulability in Hedonic Games with Friends Appreciation Preferences | Michele Flammini et.al. | 2501.00976 | null |
2025-01-01 | Defense Strategies for Autonomous Multi-agent Systems: Ensuring Safety and Resilience Under Exponentially Unbounded FDI Attacks | Yichao Wang et.al. | 2501.00973 | null |
2025-01-01 | Intent-based Radio Scheduler for RAN Slicing: Learning to deal with different network scenarios | Cleverson Nahum et.al. | 2501.00950 | link |
2025-01-01 | Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things | Talha Zeeshan et.al. | 2501.00906 | null |
2025-01-01 | Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents | Fouad Bousetouane et.al. | 2501.00881 | null |
2024-12-30 | Distributed Mixture-of-Agents for Edge Inference with Large Language Models | Purbesh Mitra et.al. | 2412.21200 | link |
2024-12-30 | Aviary: training language agents on challenging scientific tasks | Siddharth Narayanan et.al. | 2412.21154 | null |
2024-12-30 | Training Software Engineering Agents and Verifiers with SWE-Gym | Jiayi Pan et.al. | 2412.21139 | link |
2024-12-30 | Positional information trade-offs in boundary-driven reaction-diffusion systems | Jonas Berx et.al. | 2412.21113 | null |
2024-12-30 | Exploring and Controlling Diversity in LLM-Agent Conversation | KuanChao Chu et.al. | 2412.21102 | null |
2024-12-30 | Advances in Multi-agent Reinforcement Learning: Persistent Autonomy and Robot Learning Lab Report 2024 | Reza Azadeh et.al. | 2412.21088 | null |
2024-12-30 | Privacy-Aware Multi-Device Cooperative Edge Inference with Distributed Resource Bidding | Wenhao Zhuang et.al. | 2412.21069 | null |
2024-12-30 | Plancraft: an evaluation dataset for planning with LLM agents | Gautier Dagan et.al. | 2412.21033 | link |
2024-12-30 | UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI | Fangwei Zhong et.al. | 2412.20977 | null |
2024-12-31 | SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity | Pengfei Jing et.al. | 2412.20787 | null |
2024-12-30 | Joint Scoring Rules: Zero-Sum Competition Avoids Performative Prediction | Rubi Hudson et.al. | 2412.20732 | null |
2024-12-30 | Modeling and Simulating Agent-Based City Migration Using Conway's Game of Life | Bruce Deng et.al. | 2412.20691 | null |
2024-12-30 | Blockchain-Empowered Cyber-Secure Federated Learning for Trustworthy Edge Computing | Ervin Moore et.al. | 2412.20674 | null |
2024-12-29 | The intrinsic motivation of reinforcement and imitation learning for sequential tasks | Sao Mai Nguyen et.al. | 2412.20573 | null |
2024-12-29 | Game Theory and Multi-Agent Reinforcement Learning : From Nash Equilibria to Evolutionary Dynamics | Neil De La Fuente et.al. | 2412.20523 | null |
2024-12-29 | Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning | Hang Ni et.al. | 2412.20505 | null |
2024-12-29 | Exploiting NOMA Transmissions in Multi-UAV-assisted Wireless Networks: From Aerial-RIS to Mode-switching UAVs | Songhan Zhao et.al. | 2412.20484 | null |
2024-12-29 | SatFlow: Scalable Network Planning for LEO Mega-Constellations | Sheng Cen et.al. | 2412.20475 | null |
2024-12-29 | Image Augmentation Agent for Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2412.20439 | null |
2024-12-29 | Learning Policies for Dynamic Coalition Formation in Multi-Robot Task Allocation | Lucas C. D. Bezerra et.al. | 2412.20397 | null |
2024-12-27 | Bottom-up robust modeling for the foraging behavior of Physarum polycephalum | Damiano Reginato et.al. | 2412.19790 | null |
2024-12-27 | Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration | Le Chen et.al. | 2412.19770 | link |
2024-12-27 | Can Large Language Models Adapt to Other Agents In-Context? | Matthew Riemer et.al. | 2412.19726 | null |
2024-12-27 | OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis | Qiushi Sun et.al. | 2412.19723 | null |
2024-12-27 | The Value of Recall in Extensive-Form Games | Ratip Emin Berker et.al. | 2412.19659 | null |
2024-12-27 | Xmodel-2 Technical Report | Wang Qun et.al. | 2412.19638 | null |
2024-12-27 | Bidding Games on Markov Decision Processes with Quantitative Reachability Objectives | Guy Avni et.al. | 2412.19609 | null |
2024-12-27 | Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following | Yuxiao Yang et.al. | 2412.19562 | null |
2024-12-27 | Quantiles under ambiguity and risk sharing | Peng Liu et.al. | 2412.19546 | null |
2024-12-27 | TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data | Xiang Huang et.al. | 2412.19544 | link |
2024-12-27 | Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning | Xuan Zhou et.al. | 2412.19538 | null |
2024-12-27 | Casevo: A Cognitive Agents and Social Evolution Simulator | Zexun Jiang et.al. | 2412.19498 | link |
2024-12-27 | Knowledge Graph-Based Multi-Agent Path Planning in Dynamic Environments using WAITR | Ted Edward Holmberg et.al. | 2412.19469 | null |
2024-12-27 | Online distributed algorithms for mixed equilibrium problems in dynamic environments | Hang Xu et.al. | 2412.19399 | null |
2024-12-26 | Preventive Energy Management for Distribution Systems Under Uncertain Events: A Deep Reinforcement Learning Approach | Md Isfakul Anam et.al. | 2412.19382 | null |
2024-12-26 | Minimal Batch Adaptive Learning Policy Engine for Real-Time Mid-Price Forecasting in High-Frequency Trading | Adamantios Ntakaris et.al. | 2412.19372 | null |
2024-12-26 | xSRL: Safety-Aware Explainable Reinforcement Learning -- Safety as a Product of Explainability | Risal Shahriar Shefin et.al. | 2412.19311 | link |
2024-12-26 | Reforming an Unfair Allocation by Exchanging Goods | Sheung Man Yuen et.al. | 2412.19264 | null |
2024-12-26 | Swarm Contract: A Multi-Sovereign Agent Consensus Mechanism | Haowei Yang et.al. | 2412.19256 | null |
2024-12-26 | VINEVI: A Virtualized Network Vision Architecture for Smart Monitoring of Heterogeneous Applications and Infrastructures | Rodrigo Moreira et.al. | 2412.19226 | null |
2024-12-24 | Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems | Fernando Jia et.al. | 2412.18601 | link |
2024-12-24 | Automated Code Review In Practice | Umut Cihan et.al. | 2412.18531 | null |
2024-12-24 | Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving | Hao Pang et.al. | 2412.18511 | null |
2024-12-24 | Calibrating the Subjective | Mark Whitmeyer et.al. | 2412.18486 | null |
2024-12-24 | Multi-Agent Norm Perception and Induction in Distributed Healthcare | Chao Li et.al. | 2412.18454 | null |
2024-12-24 | 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding | Tatiana Zemskova et.al. | 2412.18450 | link |
2024-12-24 | GeAR: Graph-enhanced Agent for Retrieval-augmented Generation | Zhili Shen et.al. | 2412.18431 | null |
2024-12-24 | Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent | Farhad Nooralahzadeh et.al. | 2412.18428 | link |
2024-12-24 | GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent | Kangjia Zhao et.al. | 2412.18426 | null |
2024-12-24 | Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles | Zihan Wang et.al. | 2412.18416 | null |
2024-12-24 | Contrastive Representation for Interactive Recommendation | Jingyu Li et.al. | 2412.18396 | link |
2024-12-24 | Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents | Kaiwen Ning et.al. | 2412.18371 | link |
2024-12-24 | Extracting triples from dialogues for conversational social agents | Piek Vossen et.al. | 2412.18364 | null |
2024-12-24 | The Thousand Brains Project: A New Paradigm for Sensorimotor Intelligence | Viviane Clay et.al. | 2412.18354 | link |
2024-12-24 | Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering | Zhongjian Hu et.al. | 2412.18351 | null |
2024-12-24 | The Constitutional Filter | Simon Kohaut et.al. | 2412.18347 | link |
2024-12-24 | Learning to Play Against Unknown Opponents | Eshwar Ram Arunachaleswaran et.al. | 2412.18297 | null |
2024-12-24 | MinsStudio: A Streamlined Package for Minecraft AI Agent Development | Shaofei Cai et.al. | 2412.18293 | link |
2024-12-24 | Quantum framework for Reinforcement Learning: integrating Markov Decision Process, quantum arithmetic, and trajectory search | Thet Htar Su et.al. | 2412.18208 | null |
2024-12-24 | VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks | Shiduo Zhang et.al. | 2412.18194 | null |
2024-12-23 | Observation Interference in Partially Observable Assistance Games | Scott Emmons et.al. | 2412.17797 | null |
2024-12-23 | ResearchTown: Simulator of Human Research Community | Haofei Yu et.al. | 2412.17767 | link |
2024-12-23 | Sensitivity Curve Maximization: Attacking Robust Aggregators in Distributed Learning | Christian A. Schroth et.al. | 2412.17740 | null |
2024-12-23 | Robin Hood Reachability Bidding Games | Shaull Almagor et.al. | 2412.17718 | null |
2024-12-23 | SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC | Yue Deng et.al. | 2412.17707 | link |
2024-12-23 | Large Language Model Safety: A Holistic Survey | Dan Shi et.al. | 2412.17686 | link |
2024-12-23 | Shape and Performance of Fastest Paths over Networks with Interacting Selfish Agents | Marco Cogoni et.al. | 2412.17665 | null |
2024-12-23 | CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction | Yuanyuan Gao et.al. | 2412.17612 | null |
2024-12-23 | Fluid-Derived Lattices for Unbiased Modeling of Bacterial Colony Growth | Bryan Verhoef et.al. | 2412.17604 | null |
2024-12-23 | PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World | Yanheng He et.al. | 2412.17589 | null |
2024-12-23 | Complete aging in the noisy voter model enhances consensus formation | Jaume Llabrés et.al. | 2412.17569 | null |
2024-12-23 | DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought | Jiaan Wang et.al. | 2412.17498 | link |
2024-12-23 | A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers | Shuaihang Chen et.al. | 2412.17481 | link |
2024-12-23 | Should public health policy exempt cases with low viral load from isolation during an epidemic?: a modelling study | Jiahao Diao et.al. | 2412.17428 | null |
2024-12-23 | Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets | Akane Tsuboya et.al. | 2412.17344 | null |
2024-12-23 | Multimodal Deep Reinforcement Learning for Portfolio Optimization | Sumit Nawathe et.al. | 2412.17293 | null |
2024-12-23 | Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples | Taewoong Kim et.al. | 2412.17288 | link |
2024-12-23 | LegalAgentBench: Evaluating LLM Agents in Legal Domain | Haitao Li et.al. | 2412.17259 | link |
2024-12-23 | A Coalition Game for On-demand Multi-modal 3D Automated Delivery System | Farzan Moosavi et.al. | 2412.17252 | null |
2024-12-22 | A Multi-AI Agent System for Autonomous Optimization of Agentic AI Solutions via Iterative Refinement and LLM-Driven Feedback Loops | Kamer Ali Yuksel et.al. | 2412.17149 | null |
2024-12-20 | Offline Reinforcement Learning for LLM Multi-Step Reasoning | Huaijie Wang et.al. | 2412.16145 | link |
2024-12-20 | Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information | Dirk Bergemann et.al. | 2412.16132 | null |
2024-12-20 | Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG | Hasan Md Tusfiqur Alam et.al. | 2412.16086 | link |
2024-12-20 | Active Flow Control for Bluff Body under High Reynolds Number Turbulent Flow Conditions Using Deep Reinforcement Learning | Jingbo Chen et.al. | 2412.15975 | null |
2024-12-20 | The multilayer garbage disposal game | Hsin-Lun Li et.al. | 2412.15942 | null |
2024-12-20 | Speedup Techniques for Switchable Temporal Plan Graph Optimization | He Jiang et.al. | 2412.15908 | null |
2024-12-20 | Exploring the Effects of AI Nonverbal Emotional Cues on Human Decision Certainty in Moral Dilemmas | Chenyi Zhang et.al. | 2412.15834 | null |
2024-12-20 | WebLLM: A High-Performance In-Browser LLM Inference Engine | Charlie F. Ruan et.al. | 2412.15803 | link |
2024-12-20 | FTISS Adaptive Bearing-Only Formation Tracking Control with Unknown Disturbance Rejection | Hong Liang Cheah et.al. | 2412.15757 | null |
2024-12-20 | Online Optimization Algorithms in Repeated Price Competition: Equilibrium Learning and Algorithmic Collusion | Martin Bichler et.al. | 2412.15707 | null |
2024-12-20 | Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration | Yijia Shao et.al. | 2412.15701 | link |
2024-12-20 | AIR: Unifying Individual and Cooperative Exploration in Collective Multi-Agent Reinforcement Learning | Guangchong Zhou et.al. | 2412.15700 | link |
2024-12-20 | Asynchronous Vector Consensus over Matrix-Weighted Networks | P Raghavendra Rao et.al. | 2412.15681 | null |
2024-12-20 | Learning Group Interactions and Semantic Intentions for Multi-Object Trajectory Prediction | Mengshi Qi et.al. | 2412.15673 | link |
2024-12-20 | Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline | Guancheng Zeng et.al. | 2412.15660 | null |
2024-12-20 | Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement Learning | Lunjun Liu et.al. | 2412.15639 | null |
2024-12-20 | Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning | Chen Jianming et.al. | 2412.15619 | null |
2024-12-20 | Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage | Zhi Gao et.al. | 2412.15606 | null |
2024-12-20 | NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization | Danial Kamali et.al. | 2412.15588 | link |
2024-12-20 | Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems | Joshua Holder et.al. | 2412.15573 | link |
2024-12-19 | AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving | Shuo Xing et.al. | 2412.15206 | link |
2024-12-19 | Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration | Junjia Liu et.al. | 2412.15166 | null |
2024-12-19 | Operationalising Rawlsian Ethics for Fairness in Norm-Learning Agents | Jessica Woodgate et.al. | 2412.15163 | null |
2024-12-19 | Equal Merit Does Not Imply Equality: Discrimination at Equilibrium in a Hiring Market with Symmetric Agents | Serafina Kamp et.al. | 2412.15162 | null |
2024-12-19 | Probabilistic Strategy Logic with Degrees of Observability | Chunyan Mu et.al. | 2412.15135 | null |
2024-12-19 | From Nonequilibrium to Equilibrium: Insights from a Two-Population Occupation Model | Jerome Garnier-Brun et.al. | 2412.14996 | null |
2024-12-19 | Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination | Leonardo Barcellona et.al. | 2412.14957 | null |
2024-12-19 | Long Time Behavior and Stabilization for Displacement Monotone Mean Field Games | Marco Cirant et.al. | 2412.14903 | null |
2024-12-19 | Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning | Anthony Kobanda et.al. | 2412.14865 | null |
2024-12-19 | Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning | Mohammadreza nakhaei et.al. | 2412.14834 | link |
2024-12-19 | Fair Division with Social Impact | Michele Flammini et.al. | 2412.14818 | null |
2024-12-19 | Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning | Ziang Ye et.al. | 2412.14780 | null |
2024-12-19 | Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning | Aditya Kapoor et.al. | 2412.14779 | null |
2024-12-19 | Testing linearity of spatial interaction functions à la Ramsey | Abhimanyu Gupta et.al. | 2412.14778 | null |
2024-12-19 | PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children | Yiqun Zhang et.al. | 2412.14769 | link |
2024-12-19 | Active Inference and Human--Computer Interaction | Roderick Murray-Smith et.al. | 2412.14741 | null |
2024-12-19 | On Verbalized Confidence Scores for LLMs | Daniel Yang et.al. | 2412.14737 | link |
2024-12-19 | Bel Esprit: Multi-Agent Framework for Building AI Model Pipelines | Yunsu Kim et.al. | 2412.14684 | null |
2024-12-19 | A Model-free Biomimetics Algorithm for Deterministic Partially Observable Markov Decision Process | Yide Yu et.al. | 2412.14614 | null |
2024-12-19 | Computational Sociology of Humans and Machines; Conflict and Collaboration | Taha Yasseri et.al. | 2412.14606 | null |
2024-12-18 | TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks | Frank F. Xu et.al. | 2412.14161 | link |
2024-12-18 | Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report | Markus Dablander et.al. | 2412.14085 | null |
2024-12-18 | A Computationally Grounded Framework for Cognitive Attitudes (extended version) | Tiago de Lima et.al. | 2412.14073 | null |
2024-12-18 | Spatio-Temporal SIR Model of Pandemic Spread During Warfare with Optimal Dual-use Healthcare System Administration using Deep Reinforcement Learning | Adi Shuchami et.al. | 2412.14039 | link |
2024-12-18 | Decentralized Convergence to Equilibrium Prices in Trading Networks | Edwin Lock et.al. | 2412.13972 | null |
2024-12-18 | Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves | Martin Kurečka et.al. | 2412.13962 | null |
2024-12-18 | Harvesting energy from turbulent winds with Reinforcement Learning | Lorenzo Basile et.al. | 2412.13961 | null |
2024-12-18 | Towards privacy-preserving cooperative control via encrypted distributed optimization | Philipp Binfet et.al. | 2412.13953 | null |
2024-12-18 | Strategyproof Matching of Roommates and Rooms | Hadi Hosseini et.al. | 2412.13887 | null |
2024-12-18 | Who Saves us From Risk? Altruists Promote Cooperation in a Public Investment Game | Shen Zhang et.al. | 2412.13816 | null |
2024-12-18 | CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers? | Dimitrios Mallis et.al. | 2412.13810 | null |
2024-12-18 | Meta-Reflection: A Feedback-Free Reflection Learning Framework | Yaoke Wang et.al. | 2412.13781 | null |
2024-12-18 | Heuristic Planner for Communication-Constrained Multi-Agent Multi-Goal Path Planning | Jáchym Herynek et.al. | 2412.13719 | null |
2024-12-18 | A2H: A UI Converter from Android to HarmonyOS Platform | Chen Wang et.al. | 2412.13693 | link |
2024-12-18 | A hybrid learning agent for episodic learning tasks with unknown target distance | Oliver Sefrin et.al. | 2412.13686 | null |
2024-12-18 | ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning | Jie-Jing Shao et.al. | 2412.13682 | null |
2024-12-18 | Exploring Multi-Modal Integration with Tool-Augmented LLM Agents for Precise Causal Discovery | ChengAo Shen et.al. | 2412.13667 | null |
2024-12-18 | Large Language Model Federated Learning with Blockchain and Unlearning for Cross-Organizational Collaboration | Xuhan Zuo et.al. | 2412.13551 | null |
2024-12-18 | EscapeBench: Pushing Language Models to Think Outside the Box | Cheng Qian et.al. | 2412.13549 | link |
2024-12-18 | Models for common knowledge logic | Yoshihito Tanaka et.al. | 2412.13537 | null |
2024-12-17 | Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents | Yifei Zhou et.al. | 2412.13194 | null |
2024-12-17 | GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Haoyi Jiang et.al. | 2412.13193 | link |
2024-12-17 | SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents | Sheng Yin et.al. | 2412.13178 | link |
2024-12-17 | Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs -- A Graph Sequential Embedding Method | Jiate Li et.al. | 2412.13134 | link |
2024-12-17 | Contract-based Design and Verification of Multi-Agent Systems with Quantitative Temporal Requirements | Rafael Dewes et.al. | 2412.13114 | null |
2024-12-17 | Active Reinforcement Learning Strategies for Offline Policy Improvement | Ambedkar Dukkipati et.al. | 2412.13106 | null |
2024-12-17 | AI PERSONA: Towards Life-long Personalization of LLMs | Tiannan Wang et.al. | 2412.13103 | null |
2024-12-17 | Reservoir Computing for Fast, Simplified Reinforcement Learning on Memory Tasks | Kevin McKee et.al. | 2412.13093 | null |
2024-12-17 | Distributed Normal Map-based Stochastic Proximal Gradient Methods over Networks | Kun Huang et.al. | 2412.13054 | null |
2024-12-17 | NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation | Karan Wanchoo et.al. | 2412.13026 | null |
2024-12-17 | The Emergence of Strategic Reasoning of Large Language Models | Dongwoo Lee et.al. | 2412.13013 | null |
2024-12-17 | Adaptations of AI models for querying the LandMatrix database in natural language | Fatiha Ait Kbir et.al. | 2412.12961 | link |
2024-12-17 | 4DRGS: 4D Radiative Gaussian Splatting for Efficient 3D Vessel Reconstruction from Sparse-View Dynamic DSA Images | Zhentao Liu et.al. | 2412.12919 | link |
2024-12-17 | An Agentic Approach to Automatic Creation of P&ID Diagrams from Natural Language Descriptions | Shreeyash Gowaikar et.al. | 2412.12898 | null |
2024-12-17 | Bayesian Persuasion with Externalities: Exploiting Agent Types | Jonathan Shaki et.al. | 2412.12859 | null |
2024-12-17 | From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle | Kaustubh Vyas et.al. | 2412.12839 | null |
2024-12-17 | GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models | Mukai Li et.al. | 2412.12735 | link |
2024-12-17 | Enhancing Naturalness in LLM-Generated Utterances through Disfluency Insertion | Syed Zohaib Hassan et.al. | 2412.12710 | null |
2024-12-17 | ParMod: A Parallel and Modular Framework for Learning Non-Markovian Tasks | Ruixuan Miao et.al. | 2412.12700 | null |
2024-12-17 | Everyday AR through AI-in-the-Loop | Ryo Suzuki et.al. | 2412.12681 | null |
2024-12-16 | Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives | Marius Belly et.al. | 2412.12063 | link |
2024-12-16 | Virtual Agent-Based Communication Skills Training to Facilitate Health Persuasion Among Peers | Farnaz Nouraei et.al. | 2412.12061 | null |
2024-12-16 | Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps | Linfeng Zhao et.al. | 2412.12024 | null |
2024-12-16 | Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm | Rajat Khanda et.al. | 2412.12006 | null |
2024-12-16 | CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird's Eye View Perception | Senkang Hu et.al. | 2412.12000 | null |
2024-12-16 | AlphaZero Neural Scaling and Zipf's Law: a Tale of Board Games and Power Laws | Oren Neumann et.al. | 2412.11979 | link |
2024-12-16 | Learning Human-Aware Robot Policies for Adaptive Assistance | Jason Qin et.al. | 2412.11913 | null |
2024-12-16 | Reentrant phase behavior in binary topological flocks with nonreciprocal alignment | Tian Tang et.al. | 2412.11871 | null |
2024-12-16 | The Black Ninjas and the Sniper: On Robustness of Population Protocols | Benno Lossin et.al. | 2412.11783 | null |
2024-12-16 | Prediction of social dilemmas in networked populations via graph neural networks | Huaiyu Tan et.al. | 2412.11775 | null |
2024-12-16 | Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control | Timothée Anne et.al. | 2412.11761 | null |
2024-12-16 | Common Ground, Diverse Roots: The Difficulty of Classifying Common Examples in Spanish Varieties | Javier A. Lopetegui et.al. | 2412.11750 | null |
2024-12-16 | GHIssuemarket: A Sandbox Environment for SWE-Agents Economic Experimentation | Mohamed A. Fouad et.al. | 2412.11722 | link |
2024-12-16 | Learning UAV-based path planning for efficient localization of objects using prior knowledge | Rick van Essen et.al. | 2412.11717 | link |
2024-12-16 | LLMs Can Simulate Standardized Patients via Agent Coevolution | Zhuoyun Du et.al. | 2412.11716 | link |
2024-12-16 | Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework | Xuanming Zhang et.al. | 2412.11713 | null |
2024-12-16 | Loosely Synchronized Rule-Based Planning for Multi-Agent Path Finding with Asynchronous Actions | Shuai Zhou et.al. | 2412.11678 | link |
2024-12-16 | VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting | Muhammet Furkan Ilaslan et.al. | 2412.11621 | link |
2024-12-16 | VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis | Zhipeng Chen et.al. | 2412.11594 | link |
2024-12-16 | Embodied CoT Distillation From LLM To Off-the-shelf Agents | Wonje Choi et.al. | 2412.11499 | null |
2024-12-13 | Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining | Zhiqi Ge et.al. | 2412.10342 | null |
2024-12-13 | Reciprocity in Interbank Markets | Lutz Honvehlmann et.al. | 2412.10329 | null |
2024-12-13 | MeshA: Efficient Path Planing With Motion Primitives* | Marat Agranovskiy et.al. | 2412.10320 | null |
2024-12-13 | BrushEdit: All-In-One Image Inpainting and Editing | Yaowei Li et.al. | 2412.10316 | null |
2024-12-13 | Cultural Evolution of Cooperation among LLM Agents | Aron Vallinder et.al. | 2412.10270 | null |
2024-12-13 | ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL | Yang Qin et.al. | 2412.10138 | link |
2024-12-13 | You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects | Islem Bouzenia et.al. | 2412.10133 | null |
2024-12-13 | Reward Machine Inference for Robotic Manipulation | Mattijs Baert et.al. | 2412.10096 | null |
2024-12-13 | Heterogeneous Multi-Robot Graph Coverage with Proximity and Movement Constraints | Dolev Mutzari et.al. | 2412.10083 | null |
2024-12-13 | Large Action Models: From Inception to Implementation | Lu Wang et.al. | 2412.10047 | link |
2024-12-13 | Cooperative Target Defense under Communication and Sensing Constraints | Dipankar Maity et.al. | 2412.09939 | null |
2024-12-13 | CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models | Dongyu Yao et.al. | 2412.09936 | link |
2024-12-13 | ProxyLLM : LLM-Driven Framework for Customer Support Through Text-Style Transfer | Sehyeong Jo et.al. | 2412.09916 | link |
2024-12-13 | Optimized Coordination Strategy for Multi-Aerospace Systems in Pick-and-Place Tasks By Deep Neural Network | Ye Zhang et.al. | 2412.09877 | null |
2024-12-13 | AutoPatent: A Multi-Agent Framework for Automatic Patent Generation | Qiyao Wang et.al. | 2412.09796 | link |
2024-12-13 | Learning Visually Grounded Domain Ontologies via Embodied Conversation and Explanation | Jonghyuk Park et.al. | 2412.09770 | link |
2024-12-12 | AiEDA: Agentic AI Design Framework for Digital ASIC System Design | Aditya Patra et.al. | 2412.09745 | null |
2024-12-12 | MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction | Xiaohao Xu et.al. | 2412.09723 | link |
2024-12-12 | TransferLight: Zero-Shot Traffic Signal Control on any Road-Network | Johann Schmidt et.al. | 2412.09719 | null |
2024-12-12 | CUAL: Continual Uncertainty-aware Active Learner | Amanda Rios et.al. | 2412.09701 | null |
2024-12-12 | GenEx: Generating an Explorable World | Taiming Lu et.al. | 2412.09624 | null |
2024-12-12 | AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials | Yiheng Xu et.al. | 2412.09605 | null |
2024-12-12 | DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction | Yu Feng et.al. | 2412.09572 | null |
2024-12-12 | Can Modern LLMs Act as Agent Cores in Radiology~Environments? | Qiaoyu Zheng et.al. | 2412.09529 | link |
2024-12-12 | Agent-based Video Trimming | Lingfeng Yang et.al. | 2412.09513 | null |
2024-12-12 | Solving Multiagent Path Finding on Highly Centralized Networks | Foivos Fioravantes et.al. | 2412.09433 | null |
2024-12-12 | From Intention To Implementation: Automating Biomedical Research via LLMs | Yi Luo et.al. | 2412.09429 | null |
2024-12-12 | Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer | Adam Labiosa et.al. | 2412.09417 | null |
2024-12-12 | Uncommon Belief in Rationality | Qi Shi et.al. | 2412.09407 | null |
2024-12-12 | Falcon-UI: Understanding GUI Before Following User Instructions | Huawen Shen et.al. | 2412.09362 | null |
2024-12-12 | Does Low Spoilage Under Cold Conditions Foster Cultural Complexity During the Foraging Era? -- A Theoretical and Computational Inquiry | Minhyeok Lee et.al. | 2412.09335 | null |
2024-12-12 | Beware of Metacognitive Laziness: Effects of Generative Artificial Intelligence on Learning Motivation, Processes, and Performance | Yizhou Fan et.al. | 2412.09315 | null |
2024-12-12 | A Systematic Review of Knowledge Tracing and Large Language Models in Education: Opportunities, Issues, and Future Research | Yongwan Cho et.al. | 2412.09248 | null |
2024-12-12 | LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation | Yijun Liu et.al. | 2412.09237 | null |
2024-12-12 | Reconfigurable Intelligent Surface for Internet of Robotic Things | Wanli Ni et.al. | 2412.09117 | null |
2024-12-12 | Understanding Opportunities and Risks of Synthetic Relationships: Leveraging the Power of Longitudinal Research with Customised AI Tools | Alfio Ventura et.al. | 2412.09086 | null |
2024-12-12 | Towards the Structure and Mechanisms of Complex Systems, the Approach of the Quantitative Theory of Meaning | Inga Ivanova et.al. | 2412.09007 | null |
2024-12-12 | Dynamics of swarmalators in the presence of a contrarian | Gourab Kumar Sar et.al. | 2412.08966 | null |
2024-12-12 | From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning | Pusen Dong et.al. | 2412.08920 | null |
2024-12-12 | Neural Interactive Proofs | Lewis Hammond et.al. | 2412.08897 | null |
2024-12-11 | GPD-1: Generative Pre-training for Driving | Zixun Xie et.al. | 2412.08643 | link |
2024-12-11 | Generative Semantic Communication: Architectures, Technologies, and Applications | Jinke Ren et.al. | 2412.08642 | null |
2024-12-11 | RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation | Mingfei Han et.al. | 2412.08591 | null |
2024-12-11 | Automated Soap Opera Testing Directed by LLMs and Scenario Knowledge: Feasibility, Challenges, and Road Ahead | Yanqi Su et.al. | 2412.08581 | null |
2024-12-11 | GenPlan: Generative sequence models as adaptive planners | Akash Karthikeyan et.al. | 2412.08565 | link |
2024-12-11 | An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios | Leandro Parada et.al. | 2412.08562 | null |
2024-12-11 | Exact Algorithms for Multiagent Path Finding with Communication Constraints on Tree-Like Structures | Foivos Fioravantes et.al. | 2412.08556 | null |
2024-12-11 | Grimm: A Plug-and-Play Perturbation Rectifier for Graph Neural Networks Defending against Poisoning Attacks | Ao Liu et.al. | 2412.08555 | null |
2024-12-11 | MaestroMotif: Skill Design from Artificial Intelligence Feedback | Martin Klissarov et.al. | 2412.08542 | null |
2024-12-11 | Spatial segregation across travelling fronts in individual-based and continuum models for the growth of heterogeneous cell populations | José A. Carrillo et.al. | 2412.08535 | null |
2024-12-11 | Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel | Zun Wang et.al. | 2412.08467 | link |
2024-12-11 | IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health | Gauri Jain et.al. | 2412.08463 | link |
2024-12-11 | TapeAgents: a Holistic Framework for Agent Development and Optimization | Dzmitry Bahdanau et.al. | 2412.08445 | null |
2024-12-11 | From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons | Andrew Szot et.al. | 2412.08442 | null |
2024-12-11 | SweetieChat: A Strategy-Enhanced Role-playing Framework for Diverse Scenarios Handling Emotional Support Agent | Jing Ye et.al. | 2412.08389 | null |
2024-12-11 | Agency and Morality as part of Text Entry AI Assistant Personas | Andreas Komninos et.al. | 2412.08360 | null |
2024-12-11 | Lachesis: Predicting LLM Inference Accuracy using Structural Properties of Reasoning Paths | Naryeong Kim et.al. | 2412.08281 | null |
2024-12-11 | Can transformative AI shape a new age for our civilization?: Navigating between speculation and reality | Jesus L. Lobo et.al. | 2412.08273 | null |
2024-12-11 | Deep learning assisted SERS detection of prolines and hydroxylated prolines using nitrilotriacetic acid functionalized gold nanopillars | Yuan Zhang et.al. | 2412.08239 | null |
2024-12-11 | Learn How to Query from Unlabeled Data Streams in Federated Learning | Yuchang Sun et.al. | 2412.08138 | link |
2024-12-10 | Balancing Mobility Behaviors to avoid Global epidemics from Local Outbreaks | Pablo Valgañón et.al. | 2412.07656 | null |
2024-12-10 | Searching for Structure: Investigating Emergent Communication with Large Language Models | Tom Kouwenhoven et.al. | 2412.07646 | null |
2024-12-10 | Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization | Zongkai Liu et.al. | 2412.07639 | link |
2024-12-10 | Swarm Behavior Cloning | Jonas Nüßlein et.al. | 2412.07617 | null |
2024-12-10 | Modeling Speculative Trading Patterns in Token Markets: An Agent-Based Analysis with TokenLab | Mengjue Wang et.al. | 2412.07512 | null |
2024-12-10 | ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning | Hongshu Guo et.al. | 2412.07507 | null |
2024-12-10 | SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World | Jiaqi Zhang et.al. | 2412.07472 | link |
2024-12-10 | Event-Triggered Memory Control for Interval Type-2 Fuzzy Heterogeneous Multi-Agent Systems | Sen Kong et.al. | 2412.07471 | null |
2024-12-10 | Dynamic Ensemble Reasoning for LLM Experts | Jinwu Hu et.al. | 2412.07448 | null |
2024-12-10 | ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving | Rongqing Li et.al. | 2412.07369 | null |
2024-12-10 | My Words Imply Your Opinion: Reader Agent-Based Propagation Enhancement for Personalized Implicit Emotion Analysis | Jian Liao et.al. | 2412.07367 | null |
2024-12-10 | IntraLayer: A Platform of Digital Finance Platforms | Arman Abgaryan et.al. | 2412.07348 | null |
2024-12-10 | CoMA: Compositional Human Motion Generation with Multi-modal Agents | Shanlin Sun et.al. | 2412.07320 | null |
2024-12-10 | Superficial Consciousness Hypothesis for Autoregressive Transformers | Yosuke Miyanishi et.al. | 2412.07278 | link |
2024-12-10 | Reconciling Human Development and Giant Panda Protection Goals: Cost-efficiency Evaluation of Farmland Reverting and Energy Substitution Programs in Wolong National Reserve | Keyi Liu et.al. | 2412.07275 | null |
2024-12-10 | Speaker effects in spoken language comprehension | Hanlin Wu et.al. | 2412.07238 | null |
2024-12-10 | Parseval Regularization for Continual Reinforcement Learning | Wesley Chung et.al. | 2412.07224 | null |
2024-12-10 | A Distributed Deep Koopman Learning Algorithm for Control | Wenjian Hao et.al. | 2412.07212 | null |
2024-12-10 | Epidemiological Model Calibration via Graybox Bayesian Optimization | Puhua Niu et.al. | 2412.07193 | null |
2024-12-10 | Effective Reward Specification in Deep Reinforcement Learning | Julien Roy et.al. | 2412.07177 | null |
2024-12-09 | Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty | Meera Hahn et.al. | 2412.06771 | link |
2024-12-09 | AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark | Lan Li et.al. | 2412.06724 | link |
2024-12-09 | Asynchronous Agents with Perfect Recall: Model Reductions, Knowledge-Based Construction, and Model Checking for Coalitional Strategies | Dilian Gurov et.al. | 2412.06706 | null |
2024-12-09 | Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework | Tianming Liu et.al. | 2412.06681 | null |
2024-12-09 | Self-Interested Agents in Collaborative Learning: An Incentivized Adaptive Data-Centric Framework | Nithia Vijayan et.al. | 2412.06597 | null |
2024-12-09 | Argentine ants regulate traffic flow with stopped individuals | Ulrich Dobramysl et.al. | 2412.06587 | null |
2024-12-09 | Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation | Egor Cherepanov et.al. | 2412.06531 | null |
2024-12-09 | EFX Allocations on Some Multi-graph Classes | Umang Bhaskar et.al. | 2412.06513 | null |
2024-12-09 | The Fusion of Large Language Models and Formal Methods for Trustworthy AI Agents: A Roadmap | Yedi Zhang et.al. | 2412.06512 | null |
2024-12-09 | Reasoning about Strategic Abilities in Stochastic Multi-agent Systems | Yedi Zhang et.al. | 2412.06509 | null |
2024-12-09 | PPT: Pre-Training with Pseudo-Labeled Trajectories for Motion Forecasting | Yihong Xu et.al. | 2412.06491 | null |
2024-12-09 | Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation | Xuesong Zhang et.al. | 2412.06465 | link |
2024-12-09 | Simulating Human-like Daily Activities with Desire-driven Autonomy | Yiding Wang et.al. | 2412.06435 | null |
2024-12-09 | World-Consistent Data Generation for Vision-and-Language Navigation | Yu Zhong et.al. | 2412.06413 | null |
2024-12-09 | StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI Astrophysicist | Cunshi Wang et.al. | 2412.06412 | null |
2024-12-09 | Augmenting the action space with conventions to improve multi-agent cooperation in Hanabi | F. Bredell et.al. | 2412.06333 | link |
2024-12-09 | Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation Using Privileged Information | Junqiao Wang et.al. | 2412.06313 | null |
2024-12-09 | Beyond pip install: Evaluating LLM Agents for the Automated Installation of Python Projects | Louis Milliken et.al. | 2412.06294 | link |
2024-12-09 | Enhanced Multi-Object Tracking Using Pose-based Virtual Markers in 3x3 Basketball | Li Yin et.al. | 2412.06258 | null |
2024-12-09 | In Silico Pharmacokinetic and Molecular Docking Studies of Natural Plants against Essential Protein KRAS for Treatment of Pancreatic Cancer | Marsha Mariya Kappan et.al. | 2412.06237 | null |
2024-12-06 | TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft | Qian Long et.al. | 2412.05255 | link |
2024-12-06 | AI's assigned gender affects human-AI cooperation | Sepideh Bazazi et.al. | 2412.05214 | null |
2024-12-06 | SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot | Jinlin Wu et.al. | 2412.05187 | link |
2024-12-06 | Sense and Sensitivity: Evaluating the simulation of social dynamics via Large Language Models | Da Ju et.al. | 2412.05093 | null |
2024-12-06 | Synchronization and desynchronization in ensembles of mobile agents | E. M. Varvarin et.al. | 2412.05040 | null |
2024-12-06 | Frontier Models are Capable of In-context Scheming | Alexander Meinke et.al. | 2412.04984 | null |
2024-12-06 | Putting the Iterative Training of Decision Trees to the Test on a Real-World Robotic Task | Raphael C. Engelhardt et.al. | 2412.04974 | null |
2024-12-06 | Who Speaks Next? Multi-party AI Discussion Leveraging the Systematics of Turn-taking in Murder Mystery Games | Ryota Nonomura et.al. | 2412.04937 | link |
2024-12-06 | Probing the contents of semantic representations from text, behavior, and brain data using the psychNorms metabase | Zak Hussain et.al. | 2412.04936 | link |
2024-12-06 | PERCY: A Multimodal Dataset and Conversational System for Personalized and Emotionally Aware Human-Robot Interaction | Mohammed Althubyani et.al. | 2412.04908 | null |
2024-12-06 | DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling | Minzheng Wang et.al. | 2412.04905 | link |
2024-12-06 | Estimating causal effects of customer satisfaction on downstream metrics in a multi-queue contact center | Sebastián Orellana et.al. | 2412.04860 | null |
2024-12-06 | Breaking Event Rumor Detection via Stance-Separated Multi-Agent Debate | Mingqing Zhang et.al. | 2412.04859 | null |
2024-12-06 | MTSpark: Enabling Multi-Task Learning with Spiking Neural Networks for Generalist Agents | Avaneesh Devkota et.al. | 2412.04847 | null |
2024-12-06 | A Temporally Correlated Latent Exploration for Reinforcement Learning | SuMin Oh et.al. | 2412.04775 | null |
2024-12-06 | REGENT: A Retrieval-Augmented Generalist Agent That Can Act In-Context in New Environments | Kaustubh Sridhar et.al. | 2412.04759 | null |
2024-12-05 | LiveNet: Robust, Minimally Invasive Multi-Robot Control for Safe and Live Navigation in Constrained Environments | Srikar Gouru et.al. | 2412.04659 | link |
2024-12-05 | Mutation mitigates finite-size effects in spatial evolutionary games | Chen Shen et.al. | 2412.04654 | null |
2024-12-05 | Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction | Yiheng Xu et.al. | 2412.04454 | null |
2024-12-05 | GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration | Kaiyi Huang et.al. | 2412.04440 | null |
2024-12-05 | Sub-diffraction Imaging of Carrier Dynamics in Halide Perovskite Semiconductors: Effects of Passivation, Morphology, and Ion Motion | Madeleine D. Breshears et.al. | 2412.04423 | null |
2024-12-05 | Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation | Xuying Li et.al. | 2412.04415 | null |
2024-12-05 | EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding | Yuqi Wu et.al. | 2412.04380 | link |
2024-12-05 | Intersection-Aware Assessment of EMS Accessibility in NYC: A Data-Driven Approach | Haoran Su et.al. | 2412.04369 | null |
2024-12-05 | Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Edoardo Cetin et.al. | 2412.04368 | null |
2024-12-05 | Machine Theory of Mind for Autonomous Cyber-Defence | Luke Swaby et.al. | 2412.04367 | null |
2024-12-05 | Reinforcement Learning for Freeway Lane-Change Regulation via Connected Vehicles | Ke Sun et.al. | 2412.04341 | null |
2024-12-05 | Action Mapping for Reinforcement Learning in Continuous Environments with Constraints | Mirco Theile et.al. | 2412.04327 | null |
2024-12-05 | Transient Multi-Agent Path Finding for Lifelong Navigation in Dense Environments | Jonathan Morag et.al. | 2412.04256 | null |
2024-12-05 | HyperMARL: Adaptive Hypernetworks for Multi-Agent RL | Kale-ab Abebe Tessera et.al. | 2412.04233 | null |
2024-12-05 | A Dynamic Safety Shield for Safe and Efficient Reinforcement Learning of Navigation Tasks | Murad Dawood et.al. | 2412.04153 | null |
2024-12-05 | Practical Considerations for Agentic LLM Systems | Chris Sypherd et.al. | 2412.04093 | null |
2024-12-05 | LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents | Bingchen Li et.al. | 2412.04090 | null |
2024-12-05 | Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning | Shicheng Zhou et.al. | 2412.04078 | link |
2024-12-05 | Prompt Engineering Guidance for Conceptual Agent-based Model Extraction using Large Language Models | Siamak Khatami et.al. | 2412.04056 | null |
2024-12-05 | Demonstration of Enhanced Qubit Readout via Reinforcement Learning | Aniket Chatterjee et.al. | 2412.04053 | null |
2024-12-05 | INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations | Yongming Zhu et.al. | 2412.04037 | null |
2024-12-05 | Dynamic Graph Representation with Contrastive Learning for Financial Market Prediction: Integrating Temporal Evolution and Static Relations | Yunhua Pei et.al. | 2412.04034 | null |
2024-12-04 | Navigation World Models | Amir Bar et.al. | 2412.03572 | null |
2024-12-04 | From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents | Xinyi Mou et.al. | 2412.03563 | link |
2024-12-04 | Categorize and randomize: a model of sequential stochastic choice | Ester Sudano et.al. | 2412.03554 | null |
2024-12-04 | SPICE: Smart Projection Interface for Cooking Enhancement | Vera Prohaska et.al. | 2412.03551 | null |
2024-12-04 | Risk-aware Classification via Uncertainty Quantification | Murat Sensoy et.al. | 2412.03391 | null |
2024-12-04 | WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis | Chengwei Hu et.al. | 2412.03359 | null |
2024-12-04 | AI-Driven Day-to-Day Route Choice | Leizhen Wang et.al. | 2412.03338 | link |
2024-12-04 | Mean-field Concentration of Opinion Dynamics in Random Graphs | Javiera Gutiérrez-Ramírez et.al. | 2412.03207 | null |
2024-12-04 | AffordDP: Generalizable Diffusion Policy with Transferable Affordance | Shijie Wu et.al. | 2412.03142 | null |
2024-12-04 | ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning | Zhe Xie et.al. | 2412.03104 | link |
2024-12-04 | Decentralized Mobile Target Tracking Using Consensus-Based Estimation with Nearly-Constant-Velocity Modeling | Amir Ahmad Ghods et.al. | 2412.03095 | null |
2024-12-04 | Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi | Francesc Wilhelmi et.al. | 2412.03076 | null |
2024-12-04 | Preference-based opponent shaping in differentiable games | Xinyu Qiao et.al. | 2412.03072 | null |
2024-12-04 | Constrained portfolio game with heterogeneous agents | Zongxia Liang et.al. | 2412.03070 | null |
2024-12-04 | Impact Of Income And Leisure On Optimal Portfolio, Consumption, Retirement Decisions Under Exponential Utility | Tae Ung Gang et.al. | 2412.03001 | null |
2024-12-04 | New HI views of the Galaxy and the Magellanic Clouds | Snezana Stanimirovic et.al. | 2412.02981 | null |
2024-12-03 | A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration | Thulio Amorim et.al. | 2412.02881 | null |
2024-12-03 | Out-of-Distribution Detection for Neurosymbolic Autonomous Cyber Agents | Ankita Samaddar et.al. | 2412.02875 | null |
2024-12-03 | An Information-Theoretic Analysis of Thompson Sampling for Logistic Bandits | Amaury Gouverneur et.al. | 2412.02861 | null |
2024-12-03 | Algorithmic idealism: what should you believe to experience next? | Markus P. Mueller et.al. | 2412.02826 | null |
2024-12-03 | Leveraging Tactile Sensing to Render both Haptic Feedback and Virtual Reality 3D Object Reconstruction in Robotic Telemanipulation | Gabriele Giudici et.al. | 2412.02644 | null |
2024-12-03 | Mobile Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning: A Scalable Framework | Ziheng Liu et.al. | 2412.02581 | null |
2024-12-03 | Generating Critical Scenarios for Testing Automated Driving Systems | Trung-Hieu Nguyen et.al. | 2412.02574 | link |
2024-12-03 | TAB-Fields: A Maximum Entropy Framework for Mission-Aware Adversarial Planning | Gokul Puthumanaillam et.al. | 2412.02570 | link |
2024-12-03 | Defending Against Diverse Attacks in Federated Learning Through Consensus-Based Bi-Level Optimization | Nicolás García Trillos et.al. | 2412.02535 | link |
2024-12-03 | General Resetting Theory for Group Avoidance | Juhee Lee et.al. | 2412.02524 | null |
2024-12-03 | Resonance: Learning to Predict Social-Aware Pedestrian Trajectories as Co-Vibrations | Conghao Wong et.al. | 2412.02447 | null |
2024-12-03 | A Multi-Agent Framework for Extensible Structured Text Generation in PLCs | Donghao Yang et.al. | 2412.02410 | null |
2024-12-03 | Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction | Ziqian Zou et.al. | 2412.02395 | null |
2024-12-03 | Bio-inspired visual relative localization for large swarms of UAVs | Martin Křížek et.al. | 2412.02393 | null |
2024-12-03 | Social patch foraging theory in an egalitarian group | Lisa Blum Moyse et.al. | 2412.02381 | null |
2024-12-03 | Reinforcement learning to learn quantum states for Heisenberg scaling accuracy | Jeongwoo Jae et.al. | 2412.02334 | link |
2024-12-03 | Optimizing Plastic Waste Collection in Water Bodies Using Heterogeneous Autonomous Surface Vehicles with Deep Reinforcement Learning | Alejandro Mendoza Barrionuevo et.al. | 2412.02316 | link |
2024-12-03 | Large Multimodal Agents for Accurate Phishing Detection with Enhanced Token Optimization and Cost Reduction | Fouad Trad et.al. | 2412.02301 | null |
2024-12-03 | Conformal Symplectic Optimization for Stable Reinforcement Learning | Yao Lyu et.al. | 2412.02291 | link |
2024-12-03 | BOTracle: A framework for Discriminating Bots and Humans | Jan Kadel et.al. | 2412.02266 | null |
2024-12-03 | Selective Reviews of Bandit Problems in AI via a Statistical View | Pengjie Zhou et.al. | 2412.02251 | null |
2024-12-03 | DataLab: A Unifed Platform for LLM-Powered Business Intelligence | Luoxuan Weng et.al. | 2412.02205 | null |
2024-12-03 | Distributed Task Allocation for Multi-Agent Systems: A Submodular Optimization Approach | Jing Liu et.al. | 2412.02146 | null |
2024-12-03 | A privacy-preserving distributed credible evidence fusion algorithm for collective decision-making | Chaoxiong Ma et.al. | 2412.02130 | null |
2024-11-29 | EF1 Allocations for Identical Trilean and Separable Single-Peaked Valuations | Umang Bhaskar et.al. | 2411.19881 | null |
2024-11-29 | Neuroplasticity and Psychedelics: a comprehensive examination of classic and non-classic compounds in pre and clinical models | Claudio Agnorelli et.al. | 2411.19840 | null |
2024-11-29 | Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation | Robin D. Pesl et.al. | 2411.19804 | null |
2024-11-29 | CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives | Armin Saghafian et.al. | 2411.19787 | link |
2024-11-29 | The 2024 Motile Active Matter Roadmap | Gerhard Gompper et.al. | 2411.19783 | null |
2024-11-29 | HVAC-DPT: A Decision Pretrained Transformer for HVAC Control | Anaïs Berkes et.al. | 2411.19746 | null |
2024-11-29 | Relative Representations of Latent Spaces enable Efficient Semantic Channel Equalization | Tomás Hüttebräucker et.al. | 2411.19719 | null |
2024-11-29 | RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents | Shi Zifeng et.al. | 2411.19639 | null |
2024-11-29 | Build An Influential Bot In Social Media Simulations With Large Language Models | Bailu Jin et.al. | 2411.19635 | null |
2024-11-29 | Solving Rubik's Cube Without Tricky Sampling | Yicheng Lin et.al. | 2411.19583 | null |
2024-11-29 | Early Versus Late Traffic Management For Autonomous Agents | Salman Ghori et.al. | 2411.19582 | null |
2024-11-29 | The ATTUNE model for Artificial Trust Towards Human Operators | Giannis Petousakis et.al. | 2411.19580 | null |
2024-12-02 | Fixed-relative-switch strategies for learning based event-triggered control of nonlinear multiagent systems | Ziming Wang et.al. | 2411.19571 | null |
2024-11-29 | Training Agents with Weakly Supervised Feedback from Large Language Models | Dihong Gong et.al. | 2411.19547 | null |
2024-11-29 | A Local Information Aggregation based Multi-Agent Reinforcement Learning for Robot Swarm Dynamic Task Allocation | Yang Lv et.al. | 2411.19526 | null |
2024-11-29 | RL-MILP Solver: A Reinforcement Learning Approach for Solving Mixed-Integer Linear Programs with Graph Neural Networks | Tae-Hoon Lee et.al. | 2411.19517 | null |
2024-11-29 | SANGO: Socially Aware Navigation through Grouped Obstacles | Rahath Malladi et.al. | 2411.19497 | null |
2024-11-29 | Two Timescale EXTRA for Smooth Non-convex Distributed Optimization Problems | Zeyu Peng et.al. | 2411.19483 | null |
2024-11-29 | Proto Successor Measure: Representing the Space of All Possible Solutions of Reinforcement Learning | Siddhant Agarwal et.al. | 2411.19418 | null |
2024-11-28 | Dynamic matching games: stationary equilibria under varying commitments | Nadia Guiñazú et.al. | 2411.19372 | null |
2024-11-28 | Integrating Transit Signal Priority into Multi-Agent Reinforcement Learning based Traffic Signal Control | Dickness Kakitahi Kwesiga et.al. | 2411.19359 | null |
2024-11-27 | Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective | Zhi Zhang et.al. | 2411.18615 | null |
2024-11-27 | Robust Offline Reinforcement Learning with Linearly Structured |
Cheng Tang et.al. | 2411.18612 | null |
2024-11-27 | AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans | Dillon Loh et.al. | 2411.18539 | link |
2024-11-27 | Biswas-Chatterjee-Sen kinetic exchange opinion model for two connected groups | Krzysztof Suchecki et.al. | 2411.18527 | null |
2024-11-27 | NeuroAI for AI Safety | Patrick Mineault et.al. | 2411.18526 | null |
2024-11-27 | Collective decision making by embodied neural agents | Nicolas Coucke et.al. | 2411.18498 | null |
2024-11-27 | Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator | Frederic Kirstein et.al. | 2411.18444 | null |
2024-11-27 | An AI-Assisted Multi-Agent Dual Dialogue System to Support Mental Health Care Providers | Onno P. Kampman et.al. | 2411.18429 | null |
2024-11-27 | Application of Soft Actor-Critic Algorithms in Optimizing Wastewater Treatment with Time Delays Integration | Esmaeel Mohammadi et.al. | 2411.18305 | null |
2024-11-27 | InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving | Xiyan Jiang et.al. | 2411.18302 | link |
2024-11-27 | Large Language Model-Brained GUI Agents: A Survey | Chaoyun Zhang et.al. | 2411.18279 | link |
2024-11-27 | Grid-augumented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents | Joongwon Chae et.al. | 2411.18270 | link |
2024-11-27 | Wearable intelligent throat enables natural speech in stroke patients with dysarthria | Chenyu Tang et.al. | 2411.18266 | null |
2024-11-27 | Exploration of LLM Multi-Agent Application Implementation Based on LangGraph+CrewAI | Zhihua Duan et.al. | 2411.18241 | null |
2024-11-27 | Scalable Multi-Objective Reinforcement Learning with Fairness Guarantees using Lorenz Dominance | Dimitris Michailidis et.al. | 2411.18195 | link |
2024-11-27 | DMVC-Tracker: Distributed Multi-Agent Trajectory Planning for Target Tracking Using Dynamic Buffered Voronoi and Inter-Visibility Cells | Yunwoo Lee et.al. | 2411.18086 | null |
2024-11-27 | RL for Mitigating Cascading Failures: Targeted Exploration via Sensitivity Factors | Anmol Dwivedi et.al. | 2411.18050 | link |
2024-11-27 | The Trusted Caregiver: The Influence of Eye and Mouth Design Incorporating the Baby Schema Effect in Virtual Humanoid Agents on Older Adults Users' Perception of Trustworthiness | Jennifer Hu et.al. | 2411.18047 | null |
2024-11-27 | Normative Feeling: Socially Patterned Affective Mechanisms | Stavros Anagnou et.al. | 2411.18037 | null |
2024-11-27 | AEGIS: An Agent-based Framework for General Bug Reproduction from Issue Descriptions | Xinchen Wang et.al. | 2411.18015 | null |
2024-11-26 | SketchAgent: Language-Driven Sequential Sketch Generation | Yael Vinker et.al. | 2411.17673 | null |
2024-11-26 | MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation | Harsh Singh et.al. | 2411.17636 | null |
2024-11-26 | Making History Readable | Bipasha Banerjee et.al. | 2411.17600 | null |
2024-11-26 | Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals | William A. Ingram et.al. | 2411.17598 | null |
2024-11-26 | Decision making in stochastic extensive form II: Stochastic extensive forms and games | E. Emanuel Rapsch et.al. | 2411.17587 | null |
2024-11-26 | Multi-Objective Reinforcement Learning for Automated Resilient Cyber Defence | Ross O'Driscoll et.al. | 2411.17585 | null |
2024-11-26 | Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach | Yaosheng Deng et.al. | 2411.17552 | null |
2024-11-26 | ShowUI: One Vision-Language-Action Model for GUI Visual Agent | Kevin Qinghong Lin et.al. | 2411.17465 | link |
2024-11-26 | Object-centric proto-symbolic behavioural reasoning from pixels | Ruben van Bergen et.al. | 2411.17438 | link |
2024-11-26 | Joint Combinatorial Node Selection and Resource Allocations in the Lightning Network using Attention-based Reinforcement Learning | Mahdi Salahshour et.al. | 2411.17353 | null |
2024-11-26 | Towards Intention Recognition for Robotic Assistants Through Online POMDP Planning | Juan Carlos Saborio et.al. | 2411.17326 | null |
2024-11-26 | A "Breathing" Mobile Communication Network | Chao Ge et.al. | 2411.17290 | null |
2024-11-26 | APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents | Jun Yu Chen et.al. | 2411.17255 | link |
2024-11-26 | Short-duration gamma-ray bursts from Kerr-Newman black hole mergers | Shad Ali et.al. | 2411.17205 | null |
2024-11-26 | P2DFlow: A Protein Ensemble Generative Model with SE(3) Flow Matching | Yaowei Jin et.al. | 2411.17196 | link |
2024-11-26 | Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment | Dongping Chen et.al. | 2411.17188 | null |
2024-11-26 | LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble | Yujeong Lee et.al. | 2411.17135 | null |
2024-11-26 | Creative Agents: Simulating the Systems Model of Creativity with Generative Agents | Naomi Imasato et.al. | 2411.17065 | null |
2024-11-26 | g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks | Zihan Wang et.al. | 2411.17030 | link |
2024-11-26 | CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios For Safety Hardening | Amar Kulkarni et.al. | 2411.16996 | null |
2024-11-25 | Winning opinion: Following Your Friends' Advice or That of Their Friends? | Francisco J. Muñoz et.al. | 2411.16671 | null |
2024-11-25 | Barriers on the EDGE: A scalable CBF architecture over EDGE for safe aerial-ground multi-agent coordination | Viswa Narayanan Sankaranarayanan et.al. | 2411.16608 | null |
2024-11-25 | Naive Algorithmic Collusion: When Do Bandit Learners Cooperate and When Do They Compete? | Connor Douglas et.al. | 2411.16574 | null |
2024-11-25 | Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation | Muhammad Burhan Hafez et.al. | 2411.16532 | link |
2024-11-25 | Reinforcement Learning for Bidding Strategy Optimization in Day-Ahead Energy Market | Luca Di Persio et.al. | 2411.16519 | null |
2024-11-25 | Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding | Hongzhi Zang et.al. | 2411.16506 | link |
2024-11-25 | Distributed Online Optimization with Stochastic Agent Availability | Juliette Achddou et.al. | 2411.16477 | null |
2024-11-25 | Generating social networks with static and dynamic utility-maximization approaches | Aldric Labarthe et.al. | 2411.16464 | link |
2024-11-25 | Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction | Haoming Li et.al. | 2411.16457 | null |
2024-11-25 | TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation | Linqing Zhong et.al. | 2411.16425 | null |
2024-11-25 | A Multi-agent Framework for Materials Laws Discovery | Bo Hu et.al. | 2411.16416 | null |
2024-11-25 | Functionality understanding and segmentation in 3D scenes | Jaime Corsetti et.al. | 2411.16310 | null |
2024-11-25 | Probing for Consciousness in Machines | Mathis Immertreu et.al. | 2411.16262 | null |
2024-11-25 | Open-Vocabulary Octree-Graph for 3D Scene Understanding | Zhigang Wang et.al. | 2411.16253 | null |
2024-11-25 | Enhancing Multi-Agent Consensus through Third-Party LLM Integration: Analyzing Uncertainty and Mitigating Hallucinations in Large Language Models | Zhihua Duan et.al. | 2411.16189 | null |
2024-11-25 | Stop Playing the Guessing Game! Target-free User Simulation for Evaluating Conversational Recommender Systems | Sunghwan Kim et.al. | 2411.16160 | null |
2024-11-25 | Multi-Robot Reliable Navigation in Uncertain Topological Environments with Graph Attention Networks | Zhuoyuan Yu et.al. | 2411.16134 | link |
2024-11-25 | Why the Agent Made that Decision: Explaining Deep Reinforcement Learning with Vision Masks | Rui Zuo et.al. | 2411.16120 | null |
2024-11-25 | Leverage Task Context for Object Affordance Ranking | Haojie Huang et.al. | 2411.16082 | null |
2024-11-25 | SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text | Reshmi Ghosh et.al. | 2411.16077 | null |
2024-11-22 | RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts | Hjalmar Wijk et.al. | 2411.15114 | link |
2024-11-22 | XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models | Yixin Dong et.al. | 2411.15100 | null |
2024-11-22 | On Multi-Agent Inverse Reinforcement Learning | Till Freihaut et.al. | 2411.15046 | null |
2024-11-22 | Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium | Zeyang Li et.al. | 2411.15036 | null |
2024-11-22 | On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations | Guojun Xiong et.al. | 2411.15014 | null |
2024-11-22 | ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data | Junhong Shen et.al. | 2411.15004 | link |
2024-11-22 | Free Energy Projective Simulation (FEPS): Active inference with interpretability | Joséphine Pazem et.al. | 2411.14991 | null |
2024-11-22 | BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence | Xuewu Lin et.al. | 2411.14869 | link |
2024-11-22 | Universal and Context-Independent Triggers for Precise Control of LLM Outputs | Jiashuo Liang et.al. | 2411.14738 | null |
2024-11-22 | Enhancing Clinical Trial Patient Matching through Knowledge Augmentation with Multi-Agents | Hanwen Shi et.al. | 2411.14637 | null |
2024-11-21 | Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning | Yafei Ou et.al. | 2411.14622 | null |
2024-11-21 | A Systematic Study of Multi-Agent Deep Reinforcement Learning for Safe and Robust Autonomous Highway Ramp Entry | Larry Schester et.al. | 2411.14593 | null |
2024-11-21 | G-RAG: Knowledge Expansion in Material Science | Radeen Mostafa et.al. | 2411.14592 | link |
2024-11-21 | SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world Human-Machine Interactions | Yaqi Wang et.al. | 2411.14574 | null |
2024-11-21 | Energy Efficient Automated Driving as a GNEP: Vehicle-in-the-loop Experiments | Viranjan Bhattacharyya et.al. | 2411.14567 | null |
2024-11-21 | Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models | Yuhao Dong et.al. | 2411.14432 | link |
2024-11-21 | Multi-Agent Environments for Vehicle Routing Problems | Ricardo Gama et.al. | 2411.14411 | link |
2024-11-21 | Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs | Ofer Dagan et.al. | 2411.14404 | null |
2024-11-21 | SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching | Arjun P S et.al. | 2411.14322 | link |
2024-11-21 | Q-CSM: Q-Learning-based Cognitive Service Management in Heterogeneous IoT Networks | Kubra Duran et.al. | 2411.14281 | null |
2024-11-21 | Explainable Multi-Agent Reinforcement Learning for Extended Reality Codec Adaptation | Pedro Enrique Iturria-Rivera et.al. | 2411.14264 | null |
2024-11-21 | Physics-Informed LLM-Agent for Automated Modulation Design in Power Electronics Systems | Junhua Liu et.al. | 2411.14214 | null |
2024-11-21 | SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization | Shuchen Zhu et.al. | 2411.14166 | null |
2024-11-21 | Multi-terminal Strong Coordination subject to Secrecy Constraints | Viswanathan Ramachandran et.al. | 2411.14123 | null |
2024-11-21 | Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems | Egor E. Nuzhin et.al. | 2411.14117 | null |
2024-11-21 | RAG-Thief: Scalable Extraction of Private Data from Retrieval-Augmented Generation Applications with Agent-based Attacks | Changyue Jiang et.al. | 2411.14110 | null |
2024-11-21 | Asymmetric Opinion Formation of Emotional Eccitable Agents | Irene Ferri et.al. | 2411.14099 | null |
2024-11-21 | Exploration by Running Away from the Past | Paul-Antoine Le Tolguenec et.al. | 2411.14085 | null |
2024-11-21 | On PI-control in Capacity-Limited Networks | Felix Agner et.al. | 2411.14077 | null |
2024-11-21 | Multi-LLM-Agent Systems: Techniques and Business Perspectives | Yingxuan Yang et.al. | 2411.14033 | null |
2024-11-21 | GPT versus Humans: Uncovering Ethical Concerns in Conversational Generative AI-empowered Multi-Robot Systems | Rebekah Rousi et.al. | 2411.14009 | null |
2024-11-21 | Approximating One-Sided and Two-Sided Nash Social Welfare With Capacities | Salil Gokhale et.al. | 2411.14007 | null |
2024-11-21 | Learning Two-agent Motion Planning Strategies from Generalized Nash Equilibrium for Model Predictive Control | Hansung Kim et.al. | 2411.13983 | link |
2024-11-21 | Movable Antenna-Equipped UAV for Data Collection in Backscatter Sensor Networks: A Deep Reinforcement Learning-based Approach | Yu Bai et.al. | 2411.13970 | null |
2024-11-21 | Cooperative Grasping and Transportation using Multi-agent Reinforcement Learning with Ternary Force Representation | Ing-Sheng Bernard-Tiong et.al. | 2411.13942 | null |
2024-11-20 | BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games | Davide Paglieri et.al. | 2411.13543 | null |
2024-11-20 | Metacognition for Unknown Situations and Environments (MUSE) | Rodolfo Valiente et.al. | 2411.13537 | null |
2024-11-20 | AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations | Gaurav Verma et.al. | 2411.13451 | null |
2024-11-20 | Robust Monocular Visual Odometry using Curriculum Learning | Assaf Lahiany et.al. | 2411.13438 | null |
2024-11-20 | A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback | Alireza Rashidi Laleh et.al. | 2411.13410 | null |
2024-11-20 | Simulating Liquidity: Agent-Based Modeling of Illiquid Markets for Fractional Ownership | Lars Fluri et.al. | 2411.13381 | null |
2024-11-20 | WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving | Siwei Chen et.al. | 2411.13340 | link |
2024-11-20 | Revealed Information | Laura Doval et.al. | 2411.13293 | null |
2024-11-20 | Transforming the Hybrid Cloud for Emerging AI Workloads | Deming Chen et.al. | 2411.13239 | null |
2024-11-20 | Extremum and Nash Equilibrium Seeking with Delays and PDEs: Designs & Applications | Tiago Roux Oliveira et.al. | 2411.13234 | null |
2024-11-20 | ViSTa Dataset: Do vision-language models understand sequential tasks? | Evžen Wybitul et.al. | 2411.13211 | link |
2024-11-20 | Engagement-Driven Content Generation with Large Language Models | Erica Coppolillo et.al. | 2411.13187 | null |
2024-11-20 | Cyborg Insect Factory: Automatic Assembly System to Build up Insect-computer Hybrid Robot Based on Vision-guided Robotic Arm Manipulation of Custom Bipolar Electrodes | Qifeng Lin et.al. | 2411.13164 | null |
2024-11-20 | Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning | Zhi Luo et.al. | 2411.13116 | null |
2024-11-20 | Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension | Yongdong Luo et.al. | 2411.13093 | link |
2024-11-20 | AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents | Kevin Godin-Dubois et.al. | 2411.13072 | null |
2024-11-20 | Breaking the Cycle of Recurring Failures: Applying Generative AI to Root Cause Analysis in Legacy Banking Systems | Siyuan Jin et.al. | 2411.13017 | null |
2024-11-20 | MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Collaborative Learning | Mircea Lică et.al. | 2411.12977 | null |
2024-11-19 | Non-Newtonian corrections to radiative viscosity: Israel-Stewart theory as a viscosity limiter | Lorenzo Gavassino et.al. | 2411.12929 | null |
2024-11-19 | Human-In-the-Loop Software Development Agents | Wannita Takerngsaksiri et.al. | 2411.12924 | null |
2024-11-19 | Reinforcement Learning, Collusion, and the Folk Theorem | Galit Askenazi-Golan et.al. | 2411.12725 | null |
2024-11-19 | UBSoft: A Simulation Platform for Robotic Skill Learning in Unbounded Soft Environments | Chunru Lin et.al. | 2411.12711 | null |
2024-11-19 | Weighted Envy Freeness With Limited Subsidies | Noga Klein Elmalem et.al. | 2411.12696 | null |
2024-11-19 | Quasi-stability notions in two-sided matching models | Nadia Guiñazú et.al. | 2411.12533 | null |
2024-11-19 | Coevolution of relationship-driven cooperation under recommendation protocol on multiplex networks | Hongyu Yue et.al. | 2411.12436 | null |
2024-11-19 | Instrumentation of Software Systems with OpenTelemetry for Software Visualization | Malte Hansen et.al. | 2411.12380 | null |
2024-11-19 | C |
Xiaohe Li et.al. | 2411.12313 | null |
2024-11-19 | SNN-Based Online Learning of Concepts and Action Laws in an Open World | Christel Grimaud et.al. | 2411.12308 | null |
2024-11-19 | Emergence of Implicit World Models from Mortal Agents | Kazuya Horibe et.al. | 2411.12304 | null |
2024-11-19 | Could Humans Outshine AI in Visual Data Analysis? | Ratanond Koonchanok et.al. | 2411.12299 | null |
2024-11-19 | Efficient Training in Multi-Agent Reinforcement Learning: A Communication-Free Framework for the Box-Pushing Problem | David Ge et.al. | 2411.12246 | null |
2024-11-19 | Safe Navigation in Dynamic Environments using Density Functions | Sriram S. K. S Narayanan et.al. | 2411.12206 | link |
2024-11-19 | A More Advanced Group Polarization Measurement Approach Based on LLM-Based Agents and Graphs | Zixin Liu et.al. | 2411.12196 | null |
2024-11-19 | Action-Attentive Deep Reinforcement Learning for Autonomous Alignment of Beamlines | Siyu Wang et.al. | 2411.12183 | link |
2024-11-19 | A Combined Encoder and Transformer Approach for Coherent and High-Quality Text Generation | Jiajing Chen et.al. | 2411.12157 | null |
2024-11-19 | Reinforcement Learning with Action Sequence for Data-Efficient Robot Learning | Younggyo Seo et.al. | 2411.12155 | null |
2024-11-19 | HEIGHT: Heterogeneous Interaction Graph Transformer for Robot Navigation in Crowded and Constrained Environments | Shuijing Liu et.al. | 2411.12150 | null |
2024-11-19 | Hierarchical Trait-State Model for Decoding Dyadic Social Interactions | Qianying Wu et.al. | 2411.12145 | null |
2024-11-19 | Adversarial Multi-Agent Reinforcement Learning for Proactive False Data Injection Detection | Kejun Chen et.al. | 2411.12130 | null |
2024-11-18 | On-the-Go Path Planning and Repair in Static and Dynamic Scenarios | Daniel Ajeleye et.al. | 2411.12014 | null |
2024-11-18 | Generative World Explorer | Taiming Lu et.al. | 2411.11844 | null |
2024-11-18 | Reinterpreting Delay and Procrastination | Conrad Kosowsky et.al. | 2411.11828 | null |
2024-11-18 | Competing Bandits in Decentralized Large Contextual Matching Markets | Satush Parikh et.al. | 2411.11794 | null |
2024-11-18 | LLM-IE: A Python Package for Generative Information Extraction with Large Language Models | Enshuo Hsu et.al. | 2411.11779 | null |
2024-11-18 | Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework | Yannick Metz et.al. | 2411.11761 | null |
2024-11-18 | The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning | Longju Bai et.al. | 2411.11758 | link |
2024-11-18 | Distributed Asynchronous Time-Varying Quadratic Programming with Asynchronous Objective Sampling | Gabriel Behrendt et.al. | 2411.11732 | null |
2024-11-18 | Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment | Allison Huang et.al. | 2411.11731 | link |
2024-11-18 | TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World | Xianlong Wang et.al. | 2411.11683 | null |
2024-11-18 | Artificial Scientific Discovery | Antonio Norelli et.al. | 2411.11672 | null |
2024-11-18 | No-regret Exploration in Shuffle Private Reinforcement Learning | Shaojie Bai et.al. | 2411.11647 | null |
2024-11-18 | Signaling and Social Learning in Swarms of Robots | Leo Cazenille et.al. | 2411.11616 | null |
2024-11-18 | OASIS: Open Agents Social Interaction Simulations on One Million Agents | Ziyi Yang et.al. | 2411.11581 | link |
2024-11-18 | A Code Knowledge Graph-Enhanced System for LLM-Based Fuzz Driver Generation | Hanxiang Xu et.al. | 2411.11532 | link |
2024-11-18 | Structure learning with Temporal Gaussian Mixture for model-based Reinforcement Learning | Théophile Champion et.al. | 2411.11511 | null |
2024-11-18 | Timescale-agnostic characterisation for collective attention events | Tristan J. B. Cann et.al. | 2411.11500 | null |
2024-11-18 | Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models | Chenhang Cui et.al. | 2411.11496 | link |
2024-11-18 | Quantifying Preferences of Vision-Language Models via Value Decomposition in Social Media Contexts | Jingxuan Li et.al. | 2411.11479 | null |
2024-11-18 | Distributed Learning with Partial Information Sharing | P Raghavendra Rao et.al. | 2411.11411 | null |
2024-11-18 | IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos | Yunong Liu et.al. | 2411.11409 | link |
2024-11-15 | Fair Division via the Cake-Cutting Share | Yannan Bai et.al. | 2411.10434 | null |
2024-11-15 | Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash | Parsa Hejabi et.al. | 2411.10422 | link |
2024-11-15 | The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use | Siyuan Hu et.al. | 2411.10323 | link |
2024-11-15 | Static network structure cannot stabilize cooperation among Large Language Model agents | Jin Han et.al. | 2411.10294 | null |
2024-11-15 | Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review | Hossein Hassani et.al. | 2411.10268 | null |
2024-11-15 | Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning | Jingru Yang et.al. | 2411.10252 | null |
2024-11-15 | An Empirical Study on LLM-based Agents for Automated Bug Fixing | Xiangxin Meng et.al. | 2411.10213 | null |
2024-11-15 | Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking | Valeria Jannelli et.al. | 2411.10184 | null |
2024-11-15 | Let people fail! Exploring the influence of explainable virtual and robotic agents in learning-by-doing tasks | Marco Matarese et.al. | 2411.10176 | null |
2024-11-15 | The Surprising Ineffectiveness of Pre-Trained Visual Representations for Model-Based Reinforcement Learning | Moritz Schneider et.al. | 2411.10175 | null |
2024-11-15 | Semantics and Spatiality of Emergent Communication | Rotem Ben Zion et.al. | 2411.10173 | link |
2024-11-15 | Multi-UAV Search and Rescue in Wilderness Using Smart Agent-Based Probability Models | Zijian Ge et.al. | 2411.10148 | null |
2024-11-15 | Omnichain Web: The Universal Framework for Streamlined Chain Abstraction and Cross-Layer Interaction | Hardik Gajera et.al. | 2411.10132 | null |
2024-11-15 | Generative Agent Simulations of 1,000 People | Joon Sung Park et.al. | 2411.10109 | null |
2024-11-15 | Neural Port-Hamiltonian Models for Nonlinear Distributed Control: An Unconstrained Parametrization Approach | Muhammad Zakwan et.al. | 2411.10096 | null |
2024-11-15 | Enforcing Cooperative Safety for Reinforcement Learning-based Mixed-Autonomy Platoon Control | Jingyuan Zhou et.al. | 2411.10031 | null |
2024-11-15 | Orca: Enhancing Role-Playing Abilities of Large Language Models by Integrating Personality Traits | Yuxuan Huang et.al. | 2411.10006 | null |
2024-11-15 | Solvated Electrons and Hydroxyl Radicals at the Plasma-Liquid Interface | Seungjun Lee et.al. | 2411.09991 | null |
2024-11-15 | Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems | Taaha Kazi et.al. | 2411.09972 | null |
2024-11-15 | Sublinear-time Collision Detection with a Polynomial Number of States in Population Protocols | Takumi Araya et.al. | 2411.09957 | null |
2024-11-14 | Nash equilibrium seeking for a class of quadratic-bilinear Wasserstein distributionally robust games | Georgios Pantazis et.al. | 2411.09636 | null |
2024-11-14 | Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents | Yuyou Gan et.al. | 2411.09523 | null |
2024-11-14 | Randomized Truthful Auctions with Learning Agents | Gagan Aggarwal et.al. | 2411.09517 | null |
2024-11-14 | Strategic Sacrifice: Self-Organized Robot Swarm Localization for Inspection Productivity | Sneha Ramshanker et.al. | 2411.09493 | null |
2024-11-14 | Socio-Economic Consequences of Generative AI: A Review of Methodological Approaches | Carlos J. Costa et.al. | 2411.09313 | null |
2024-11-14 | Embedding Space Allocation with Angle-Norm Joint Classifiers for Few-Shot Class-Incremental Learning | Dunwei Tu et.al. | 2411.09250 | null |
2024-11-14 | Risk-aware MPPI for Stochastic Hybrid Systems | Hardik Parwana et.al. | 2411.09198 | link |
2024-11-14 | Enhancing reinforcement learning for population setpoint tracking in co-cultures | Sebastián Espinel-Ríos et.al. | 2411.09177 | null |
2024-11-14 | Artificial Theory of Mind and Self-Guided Social Organisation | Michael S. Harré et.al. | 2411.09169 | null |
2024-11-14 | Theory of Mind Enhances Collective Intelligence | Michael S. Harré et.al. | 2411.09168 | null |
2024-11-14 | Rationality based Innate-Values-driven Reinforcement Learning | Qin Yang et.al. | 2411.09160 | null |
2024-11-14 | The \emph{Optimist}: Towards Fully Automated Graph Theory Research | Randy Davila et.al. | 2411.09158 | link |
2024-11-14 | Personalized Help for Optimizing Low-Skilled Users' Strategy | Feng Gu et.al. | 2411.09109 | null |
2024-11-13 | Pheromone-Guided Navigation of Potential Mates: A Distinct Exploration Strategy | Nick Dashti et.al. | 2411.09092 | null |
2024-11-13 | Microfoundation Inference for Strategic Prediction | Daniele Bracale et.al. | 2411.08998 | null |
2024-11-13 | The Impact of Social Value Orientation on Nash Equilibria of Two Player Quadratic Games | Dan Calderone et.al. | 2411.08809 | null |
2024-11-13 | FinRobot: AI Agent for Equity Research and Valuation with Large Language Models | Tianyu Zhou et.al. | 2411.08804 | link |
2024-11-13 | Evaluating World Models with LLM for Decision Making | Chang Yang et.al. | 2411.08794 | null |
2024-11-13 | Towards Fair and Efficient Public Transportation: A Bus Stop Model | Martin Bullinger et.al. | 2411.08784 | link |
2024-11-13 | Logic-based Knowledge Awareness for Autonomous Agents in Continuous Spaces | Arabinda Ghosh et.al. | 2411.08754 | null |
2024-11-13 | Statistical Operating Characteristics of Current Early Phase Dose Finding Designs with Toxicity and Efficacy in Oncology | Hao Sun et.al. | 2411.08698 | null |
2024-11-13 | Inferring Parameter Distributions in Heterogeneous Motile Particle Ensembles: A Likelihood Approach for Second Order Langevin Models | Jan Albrecht et.al. | 2411.08692 | null |
2024-11-13 | Robot See, Robot Do: Imitation Reward for Noisy Financial Environments | Sven Goluža et.al. | 2411.08637 | null |
2024-11-13 | On the Application of Model Predictive Control to a Weighted Coverage Path Planning Problem | Kilian Schweppe et.al. | 2411.08634 | null |
2024-11-13 | NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation | Youzhi Liu et.al. | 2411.08579 | null |
2024-11-13 | Grammarization-Based Grasping with Deep Multi-Autoencoder Latent Space Exploration by Reinforcement Learning Agent | Leonidas Askianakis et.al. | 2411.08566 | null |
2024-11-13 | TimeLess: A Vision for the Next Generation of Software Development | Zeeshan Rasheed et.al. | 2411.08507 | null |
2024-11-13 | Towards Objective and Unbiased Decision Assessments with LLM-Enhanced Hierarchical Attention Networks | Junhua Liu et.al. | 2411.08504 | link |
2024-11-13 | AD-DINO: Attention-Dynamic DINO for Distance-Aware Embodied Reference Understanding | Hao Guo et.al. | 2411.08451 | null |
2024-11-13 | Towards Evaluating Large Language Models for Graph Query Generation | Siraj Munir et.al. | 2411.08449 | null |
2024-11-13 | Learning Dynamic Cognitive Map with Autonomous Navigation | Daria de Tinguy et.al. | 2411.08447 | link |
2024-11-13 | Anonymous Distributed Localisation via Spatial Population Protocols | Leszek Gąsieniec et.al. | 2411.08434 | null |
2024-11-13 | One STEP at a time: Language Agents are Stepwise Planners | Minh Nguyen et.al. | 2411.08432 | link |
2024-11-13 | Enhanced Classroom Dialogue Sequences Analysis with a Hybrid AI Agent: Merging Expert Rule-Base with Large Language Models | Yun Long et.al. | 2411.08418 | null |
2024-11-13 | BAMAX: Backtrack Assisted Multi-Agent Exploration using Reinforcement Learning | Geetansh Kalra et.al. | 2411.08400 | null |
2024-11-12 | LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models | Anoop Cherian et.al. | 2411.08027 | null |
2024-11-12 | Incentive Design with Spillovers | Krishna Dasaratha et.al. | 2411.08026 | null |
2024-11-12 | From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents | Chuyi Kong et.al. | 2411.07965 | null |
2024-11-12 | Learning Memory Mechanisms for Decision Making through Demonstrations | William Yue et.al. | 2411.07954 | link |
2024-11-12 | RedCode: Risky Code Execution and Generation Benchmark for Code Agents | Chengquan Guo et.al. | 2411.07781 | link |
2024-11-12 | Efficiency of energy-consuming random walkers: Variability in energy helps | Mohsen Ghasemi Nezhadhaghighi et.al. | 2411.07771 | null |
2024-11-12 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | Fangyu Lei et.al. | 2411.07763 | null |
2024-11-12 | Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning | Stefan Pranger et.al. | 2411.07700 | null |
2024-11-12 | World Models: The Safety Perspective | Zifan Zeng et.al. | 2411.07690 | null |
2024-11-12 | Safe Exploitative Play with Untrusted Type Beliefs | Tongxin Li et.al. | 2411.07679 | null |
2024-11-12 | The relationship between general equilibrium models with infinite-lived agents and overlapping generations models, and some applications | Ngoc-Sang Pham et.al. | 2411.07674 | null |
2024-11-12 | Mitigating Bias in Queer Representation within Large Language Models: A Collaborative Agent Approach | Tianyi Huang et.al. | 2411.07656 | link |
2024-11-12 | Exploring Multi-Agent Reinforcement Learning for Unrelated Parallel Machine Scheduling | Maria Zampella et.al. | 2411.07634 | null |
2024-11-12 | A Simple Multi-agent Joint Prediction Method for Autonomous Driving | Mingyi Wang et.al. | 2411.07612 | null |
2024-11-12 | Multiple Non-cooperative Targets Encirclement by Relative Distance based Positioning and Neural Anti-Synchronization Control | Fen Liu et.al. | 2411.07590 | null |
2024-11-12 | Reinforcement Learning Framework for Quantitative Trading | Alhassan S. Yasin et.al. | 2411.07585 | null |
2024-11-12 | Stability for a stochastic fractional differential variational inequality with Lévy jump | Yue Zeng et.al. | 2411.07557 | null |
2024-11-12 | Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective | Raed Al Kontar et.al. | 2411.07523 | null |
2024-11-12 | Two-Layer Attention Optimization for Bimanual Coordination | Justin Ting et.al. | 2411.07470 | null |
2024-11-12 | BudgetMLAgent: A Cost-Effective LLM Multi-Agent system for Automating Machine Learning Tasks | Shubham Gandhi et.al. | 2411.07464 | null |
2024-11-11 | Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving | Botao Yu et.al. | 2411.07228 | null |
2024-11-11 | Grounding Video Models to Actions through Goal Conditioned Exploration | Yunhao Luo et.al. | 2411.07223 | null |
2024-11-11 | 'Explaining RL Decisions with Trajectories': A Reproducibility Study | Karim Abdel Sadek et.al. | 2411.07200 | link |
2024-11-11 | Gradual Fine-Tuning with Graph Routing for Multi-Source Unsupervised Domain Adaptation | Yao Ma et.al. | 2411.07185 | null |
2024-11-11 | RoundTable: Investigating Group Decision-Making Mechanism in Multi-Agent Collaboration | Young-Min Cho et.al. | 2411.07161 | null |
2024-11-11 | Azurin-Based Peptide p28 Arrests the p53-HDM2 Interactions: A Novel Anti-Cancer Pathway | Albin Joy et.al. | 2411.07124 | null |
2024-11-11 | Learning Multi-Agent Collaborative Manipulation for Long-Horizon Quadrupedal Pushing | Chuye Hong et.al. | 2411.07104 | null |
2024-11-11 | Bounded Rationality Equilibrium Learning in Mean Field Games | Yannick Eich et.al. | 2411.07099 | link |
2024-11-11 | A Multi-Agent Approach for REST API Testing with Semantic Graphs and LLM-Driven Inputs | Myeongsoo Kim et.al. | 2411.07098 | null |
2024-11-11 | Differentially-Private Collaborative Online Personalized Mean Estimation | Yauhen Yakimenka et.al. | 2411.07094 | null |
2024-11-11 | To Train or Not to Train: Balancing Efficiency and Training Cost in Deep Reinforcement Learning for Mobile Edge Computing | Maddalena Boscaro et.al. | 2411.07086 | null |
2024-11-11 | Learning Collective Dynamics of Multi-Agent Systems using Event-based Vision | Minah Lee et.al. | 2411.07039 | null |
2024-11-11 | Designing Reliable Experiments with Generative Agent-Based Modeling: A Comprehensive Guide Using Concordia by Google DeepMind | Alejandro Leonardo García Navarro et.al. | 2411.07038 | null |
2024-11-11 | Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching | Arnav Kumar Jain et.al. | 2411.07007 | link |
2024-11-11 | Enhancing Robot Assistive Behaviour with Reinforcement Learning and Theory of Mind | Antonio Andriella et.al. | 2411.07003 | link |
2024-11-11 | Maximizing Nash Social Welfare in 2-Value Instances: A Simpler Proof for the Half-Integer Case | Kurt Mehlhorn et.al. | 2411.06924 | null |
2024-11-11 | Scalable Distributed Least Squares Algorithm for Linear Algebraic Equations via Scheduling | Shenyu Liu et.al. | 2411.06883 | null |
2024-11-11 | Distributed Graph Augmentation Protocols to Achieve Strong Connectivity in Multi-Agent Networks | Guilherme Ramos et.al. | 2411.06880 | link |
2024-11-11 | Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC | Aditya Soni et.al. | 2411.06815 | null |
2024-11-11 | Generative midtended cognition and Artificial Intelligence. Thinging with thinging things | Xabier E. Barandiaran et.al. | 2411.06812 | null |
2024-11-08 | Topology-aware Reinforcement Feature Space Reconstruction for Graph Data | Wangyang Ying et.al. | 2411.05742 | null |
2024-11-08 | A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics | Puze Liu et.al. | 2411.05718 | null |
2024-11-08 | Settling the Complexity of Popularity in Additively Separable and Fractional Hedonic Games | Martin Bullinger et.al. | 2411.05713 | null |
2024-11-08 | Data-Driven Distributed Common Operational Picture from Heterogeneous Platforms using Multi-Agent Reinforcement Learning | Indranil Sur et.al. | 2411.05683 | null |
2024-11-08 | The influence of persona and conversational task on social interactions with a LLM-controlled embodied conversational agent | Leon O. H. Kroczek et.al. | 2411.05653 | null |
2024-11-08 | LightVA: Lightweight Visual Analytics with LLM Agent-Based Task Planning and Execution | Yuheng Zhao et.al. | 2411.05651 | null |
2024-11-08 | Expectation vs. Reality: Towards Verification of Psychological Games | Marta Kwiatkowska et.al. | 2411.05599 | null |
2024-11-08 | Smart navigation through a rotating barrier: Deep reinforcement learning with application to size-based separation of active microagents | Mohammad Hossein Masoudi et.al. | 2411.05587 | null |
2024-11-08 | Tangled Program Graphs as an alternative to DRL-based control algorithms for UAVs | Hubert Szolc et.al. | 2411.05586 | link |
2024-11-08 | Parameterized Voter Relevance in Facility Location Games with Tree-Shaped Invitation Graphs | Ryoto Ando et.al. | 2411.05574 | null |
2024-11-08 | Time-to-reach Bounds for Verification of Dynamical Systems Using the Koopman Spectrum | Jianqiang Ding et.al. | 2411.05554 | null |
2024-11-08 | Evolution of cooperation in a three-strategy game combining snowdrift and stag hunt games | Hirofumi Takesue et.al. | 2411.05543 | null |
2024-11-08 | Generating surrogate temporal networks from mesoscale building blocks | Giulia Cencetti et.al. | 2411.05477 | link |
2024-11-08 | Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction | Émiland Garrabé et.al. | 2411.05474 | null |
2024-11-08 | Emergent Cooperative Strategies for Multi-Agent Shepherding via Reinforcement Learning | Italo Napolitano et.al. | 2411.05454 | null |
2024-11-08 | WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models | Shengda Fan et.al. | 2411.05451 | link |
2024-11-08 | VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM | Jeongwoo Lee et.al. | 2411.05423 | null |
2024-11-08 | Towards Low-Resource Harmful Meme Detection with LMM Agents | Jianzhao Huang et.al. | 2411.05383 | link |
2024-11-08 | Enhancing Cluster Resilience: LLM-agent Based Autonomous Intelligent Cluster Diagnosis System and Evaluation Framework | Honghao Shi et.al. | 2411.05349 | null |
2024-11-08 | LLM-PySC2: Starcraft II learning environment for Large Language Models | Zongyuan Li et.al. | 2411.05348 | link |
2024-11-07 | Few-Shot Task Learning through Inverse Generative Modeling | Aviv Netanyahu et.al. | 2411.04987 | null |
2024-11-07 | Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games | Usman Anwar et.al. | 2411.04976 | link |
2024-11-07 | StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration | Panwen Hu et.al. | 2411.04925 | null |
2024-11-07 | OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models | Siming Huang et.al. | 2411.04905 | null |
2024-11-07 | Achieving superconductivity in infinite-layer nickelate thin films by aluminum sputtering deposition | Dongxin Zhang et.al. | 2411.04896 | null |
2024-11-07 | GUI Agents with Foundation Models: A Comprehensive Survey | Shuai Wang et.al. | 2411.04890 | null |
2024-11-07 | Think Smart, Act SMARL! Analyzing Probabilistic Logic Driven Safety in Multi-Agent Reinforcement Learning | Satchit Chatterji et.al. | 2411.04867 | link |
2024-11-07 | Robust Regulation of Labour Contracts | Théo Durandard et.al. | 2411.04841 | null |
2024-11-07 | Plasticity Loss in Deep Reinforcement Learning: A Survey | Timo Klein et.al. | 2411.04832 | null |
2024-11-07 | MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation | Sayan Paul et.al. | 2411.04796 | null |
2024-11-07 | A Continuification-Based Control Solution for Large-Scale Shepherding | Beniamino Di Lorenzo et.al. | 2411.04791 | null |
2024-11-07 | Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research | Xuewen Han et.al. | 2411.04788 | link |
2024-11-07 | Navigating Trade-offs: Policy Summarization for Multi-Objective Reinforcement Learning | Zuzanna Osika et.al. | 2411.04784 | link |
2024-11-07 | Learning from Demonstration with Hierarchical Policy Abstractions Toward High-Performance and Courteous Autonomous Racing | Chanyoung Chung et.al. | 2411.04735 | null |
2024-11-07 | A dynamical model of platform choice and online segregation | Sven Banisch et.al. | 2411.04681 | null |
2024-11-07 | CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation | Jie Liu et.al. | 2411.04679 | null |
2024-11-07 | Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning | Zhiyu Shao et.al. | 2411.04672 | link |
2024-11-07 | CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR | Kadir Burak Buldu et.al. | 2411.04671 | null |
2024-11-07 | IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving | Clémence Grislain et.al. | 2411.04653 | link |
2024-11-07 | Mint: Cost-Efficient Tracing with All Requests Collection via Commonality and Variability Analysis | Haiyu Huang et.al. | 2411.04605 | null |
2024-11-06 | Predicting and Publishing Accurate Imbalance Prices Using Monte Carlo Tree Search | Fabio Pavirani et.al. | 2411.04011 | null |
2024-11-06 | Temporal Network Creation Games: The Impact of Non-Locality and Terminals | Davide Bilò et.al. | 2411.03973 | null |
2024-11-06 | Almost Time-Optimal Loosely-Stabilizing Leader Election on Arbitrary Graphs Without Identifiers in Population Protocols | Haruki Kanaya et.al. | 2411.03902 | null |
2024-11-06 | AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making | Yizhe Huang et.al. | 2411.03865 | link |
2024-11-06 | Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC | Tyler Clark et.al. | 2411.03820 | null |
2024-11-06 | From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning | Zhirui Deng et.al. | 2411.03817 | null |
2024-11-06 | MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue | Fengxiang Wang et.al. | 2411.03814 | null |
2024-11-06 | Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data | Chengrui Qu et.al. | 2411.03810 | link |
2024-11-06 | Multi-Modal Intelligent Channel Modeling: A New Modeling Paradigm via Synesthesia of Machines | Lu Bai et.al. | 2411.03711 | null |
2024-11-06 | Learn to Slice, Slice to Learn: Unveiling Online Optimization and Reinforcement Learning for Slicing AI Services | Amr Abo-eleneen et.al. | 2411.03686 | null |
2024-11-06 | Imagined Potential Games: A Framework for Simulating, Learning and Evaluating Interactive Behaviors | Lingfeng Sun et.al. | 2411.03669 | null |
2024-11-06 | Privacy-Preserving Resilient Vector Consensus | Bing Liu et.al. | 2411.03633 | null |
2024-11-06 | CPEG: Leveraging Consistency Policy with Consensus Guidance for Multi-agent Exploration | Yuqian Fu et.al. | 2411.03603 | null |
2024-11-05 | Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level | Antoine Grosnit et.al. | 2411.03562 | null |
2024-11-05 | VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation | Haochen Zhang et.al. | 2411.03540 | link |
2024-11-05 | AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution | Zhiqiang Xie et.al. | 2411.03519 | null |
2024-11-05 | An Open-source Sim2Real Approach for Sensor-independent Robot Navigation in a Grid | Murad Mehrab Abrar et.al. | 2411.03494 | link |
2024-11-05 | Watson: A Cognitive Observability Framework for the Reasoning of Foundation Model-Powered Agents | Benjamin Rombaut et.al. | 2411.03455 | null |
2024-11-05 | SAUCE: Synchronous and Asynchronous User-Customizable Environment for Multi-Agent LLM Interaction | Shlomo Neuberger et.al. | 2411.03397 | link |
2024-11-05 | SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents | Dawei Li et.al. | 2411.03284 | link |
2024-11-05 | Causal Responsibility Attribution for Human-AI Collaboration | Yahang Qi et.al. | 2411.03275 | link |
2024-11-05 | Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities | Ryosuke Takata et.al. | 2411.03252 | null |
2024-11-05 | Troll Farms | Philipp Denter et.al. | 2411.03241 | null |
2024-11-05 | A resolved Lyman-Alpha profile with doubly peaked emission at z~7 | C. Moya-Sierralta et.al. | 2411.03222 | null |
2024-11-05 | GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis | Temitope Akinboyewa et.al. | 2411.03205 | link |
2024-11-05 | Online Data Collection for Efficient Semiparametric Inference | Shantanu Gupta et.al. | 2411.03195 | link |
2024-11-05 | Hierarchical Orchestra of Policies | Thomas P Cannon et.al. | 2411.03008 | null |
2024-11-05 | Accelerating Task Generalisation with Multi-Level Hierarchical Options | Thomas P Cannon et.al. | 2411.02998 | null |
2024-11-05 | Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation | Francisco Giral et.al. | 2411.02975 | null |
2024-11-05 | Embedding Safety into RL: A New Take on Trust Region Methods | Nikola Milosevic et.al. | 2411.02957 | null |
2024-11-05 | Constant Approximation for Weighted Nash Social Welfare with Submodular Valuations | Yuda Feng et.al. | 2411.02942 | null |
2024-11-05 | Multi-Modal 3D Scene Graph Updater for Shared and Dynamic Environments | Emilio Olivastri et.al. | 2411.02938 | null |
2024-11-05 | Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent | Yangning Li et.al. | 2411.02937 | link |
2024-11-05 | Polyhedral study of a temporal rural postman problem: application in inspection of railway track without disturbing train schedules | Somnath Buriuly et.al. | 2411.02822 | null |
2024-11-05 | DroidSpeak: Enhancing Cross-LLM Communication | Yuhan Liu et.al. | 2411.02820 | null |
2024-11-04 | Fair and Welfare-Efficient Constrained Multi-matchings under Uncertainty | Elita Lobo et.al. | 2411.02654 | link |
2024-11-04 | Fine Grained Insider Risk Detection | Birkett Huber et.al. | 2411.02645 | null |
2024-11-04 | Learning to Assist Humans without Inferring Rewards | Vivek Myers et.al. | 2411.02623 | link |
2024-11-04 | Multi-Agent Decision Transformers for Dynamic Dispatching in Material Handling Systems Leveraging Enterprise Big Data | Xian Yeow Lee et.al. | 2411.02584 | null |
2024-11-04 | Attacking Vision-Language Computer Agents via Pop-ups | Yanzhe Zhang et.al. | 2411.02391 | link |
2024-11-04 | Two-Sided Learning in Decentralized Matching Markets | Vade Shah et.al. | 2411.02377 | null |
2024-11-04 | Social-RAG: Retrieving from Group Interactions to Socially Ground Proactive AI Generation to Group Preferences | Ruotong Wang et.al. | 2411.02353 | null |
2024-11-04 | WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning | Zehan Qi et.al. | 2411.02337 | link |
2024-11-04 | CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments | Kung-Hsiang Huang et.al. | 2411.02305 | link |
2024-11-04 | Kinetic exchange opinion dynamics for the battleground-states in the 2024 US presidential elections | Soumyajyoti Biswas et.al. | 2411.02240 | null |
2024-11-04 | Positive Experience Reflection for Agents in Interactive Text Environments | Philip Lippmann et.al. | 2411.02223 | null |
2024-11-04 | CryptoEL: A Novel Experiential Learning Tool for Enhancing K-12 Cryptography Education | Pranathi Rayavaram et.al. | 2411.02143 | null |
2024-11-04 | Foundations and Recent Trends in Multimodal Mobile Agents: A Survey | Biao Wu et.al. | 2411.02006 | link |
2024-11-04 | Deep memetic models for combinatorial optimization problems: application to the tool switching problem | Jhon Edgar Amaya et.al. | 2411.01922 | null |
2024-11-04 | Efficient Active Imitation Learning with Random Network Distillation | Emilien Biré et.al. | 2411.01894 | null |
2024-11-04 | ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation | Hengkai Tan et.al. | 2411.01850 | null |
2024-11-04 | IRS-Enhanced Secure Semantic Communication Networks: Cross-Layer and Context-Awared Resource Allocation | Lingyi Wang et.al. | 2411.01821 | null |
2024-11-04 | A Polynomial-Time Algorithm for Fair and Efficient Allocation with a Fixed Number of Agents | Ryoga Mahara et.al. | 2411.01810 | null |
2024-11-04 | Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge | Weihua Du et.al. | 2411.01796 | link |
2024-11-04 | Revisiting Game-Theoretic Control in Socio-Technical Networks: Emerging Design Frameworks and Contemporary Applications | Quanyan Zhu et.al. | 2411.01794 | null |
2024-11-04 | Lyapunov-guided Multi-Agent Reinforcement Learning for Delay-Sensitive Wireless Scheduling | Cheng Zhang et.al. | 2411.01766 | null |
2024-11-04 | Show, Don't Tell: Learning Reward Machines from Demonstrations for Reinforcement Learning-Based Cardiac Pacemaker Synthesis | John Komp et.al. | 2411.01750 | null |
2024-11-04 | DynaSaur: Large Language Agents Beyond Predefined Actions | Dang Nguyen et.al. | 2411.01747 | null |
2024-11-04 | Taking AI Welfare Seriously | Robert Long et.al. | 2411.00986 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-24 | IGDA: Interactive Graph Discovery through Large Language Model Agents | Alex Havrilla et.al. | 2502.17189 | null |
2025-02-24 | Grounded Persuasive Language Generation for Automated Marketing | Jibang Wu et.al. | 2502.16810 | null |
2025-02-24 | Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances | Yaozu Wu et.al. | 2502.16804 | null |
2025-02-23 | Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System | Saikat Barua et.al. | 2502.16750 | null |
2025-02-23 | RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents | Sho Nakatani et.al. | 2502.16730 | null |
2025-02-20 | Vending-Bench: A Benchmark for Long-Term Coherence of Autonomous Agents | Axel Backlund et.al. | 2502.15840 | null |
2025-02-18 | LLM Trading: Analysis of LLM Agent Behavior in Experimental Asset Markets | Thomas Henning et.al. | 2502.15800 | null |
2025-02-21 | Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing | Masaya Kobayashi et.al. | 2502.15506 | null |
2025-02-21 | Textual-to-Visual Iterative Self-Verification for Slide Generation | Yunqing Xu et.al. | 2502.15412 | null |
2025-02-21 | I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search | Zujie Liang et.al. | 2502.14693 | null |
2025-02-20 | Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization | Zhitao He et.al. | 2502.14496 | null |
2025-02-20 | FlowAgent: Achieving Compliance and Flexibility for Workflow Agents | Yuchen Shi et.al. | 2502.14345 | link |
2025-02-19 | Investigating Non-Transitivity in LLM-as-a-Judge | Yi Xu et.al. | 2502.14074 | null |
2025-02-19 | An LLM-based Agent for Reliable Docker Environment Configuration | Ruida Hu et.al. | 2502.13681 | null |
2025-02-16 | Understanding Dynamic Diffusion Process of LLM-based Agents under Information Asymmetry | Yiwen Zhang et.al. | 2502.13160 | null |
2025-02-18 | SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems | Mike Zhang et.al. | 2502.12927 | link |
2025-02-18 | Towards more Contextual Agents: An extractor-Generator Optimization Framework | Mourad Aouini et.al. | 2502.12926 | null |
2025-02-18 | DemonAgent: Dynamically Encrypted Multi-Backdoor Implantation Attack on LLM-based Agent | Pengyu Zhu et.al. | 2502.12575 | link |
2025-02-18 | Investigating and Extending Homans' Social Exchange Theory with Large Language Model based Agents | Lei Wang et.al. | 2502.12450 | link |
2025-02-17 | Connecting Large Language Model Agent to High Performance Computing Resource | Heng Ma et.al. | 2502.12280 | null |
2025-02-17 | Scaling Autonomous Agents via Automatic Reward Modeling And Planning | Zhenfang Chen et.al. | 2502.12130 | null |
2025-02-17 | TimeCAP: Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents | Geon Lee et.al. | 2502.11418 | null |
2025-02-16 | A Survey of LLM-based Agents in Medicine: How far are we from Baymax? | Wenxuan Wang et.al. | 2502.11211 | null |
2025-02-16 | SCALE: Towards Collaborative Content Analysis in Social Science with Large Language Model Agents and Human Intervention | Chengshuai Zhao et.al. | 2502.10937 | null |
2025-02-14 | Can Large Language Model Agents Balance Energy Systems? | Xinxing Ren et.al. | 2502.10557 | null |
2025-02-13 | MDCrow: Automating Molecular Dynamics Workflows with Large Language Models | Quintina Campbell et.al. | 2502.09565 | link |
2025-02-12 | SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent | Keyeun Lee et.al. | 2502.08599 | link |
2025-02-13 | Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation | Mahnaz Koupaee et.al. | 2502.08514 | link |
2025-02-07 | Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization | Zelai Xu et.al. | 2502.04686 | null |
2025-02-06 | Multi-Agent Reinforcement Learning with Focal Diversity Optimization | Selim Furkan Tekin et.al. | 2502.04492 | link |
2025-02-04 | Position: Scaling LLM Agents Requires Asymptotic Analysis with LLM Primitives | Elliot Meyerson et.al. | 2502.04358 | null |
2025-02-03 | Simulating Rumor Spreading in Social Networks using LLM Agents | Tianrui Hu et.al. | 2502.01450 | link |
2025-02-03 | PlotGen: Multi-Agent LLM-based Scientific Data Visualization via Multimodal Feedback | Kanika Goswami et.al. | 2502.00988 | null |
2025-02-02 | RTBAgent: A LLM-based Agent System for Real-Time Bidding | Leng Cai et.al. | 2502.00792 | link |
2025-02-02 | Meta-Prompt Optimization for LLM-Based Sequential Decision Making | Mingze Kong et.al. | 2502.00728 | null |
2025-02-02 | PhiP-G: Physics-Guided Text-to-3D Compositional Scene Generation | Qixuan Li et.al. | 2502.00708 | null |
2025-01-31 | Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game | Mustafa O. Karabag et.al. | 2501.19398 | link |
2025-01-28 | Large Language Model Critics for Execution-Free Evaluation of Code Changes | Aashish Yadavally et.al. | 2501.16655 | link |
2024-12-30 | DropMicroFluidAgents (DMFAs): Autonomous Droplet Microfluidic Research Framework Through Large Language Model Agents | Dinh-Nguyen Nguyen et.al. | 2501.14772 | link |
2025-01-24 | AI Chatbots as Professional Service Agents: Developing a Professional Identity | Wenwen Li et.al. | 2501.14179 | null |
2025-02-08 | Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents | Shrinidhi Kumbhar et.al. | 2501.13299 | null |
2025-01-20 | Towards Advancing Code Generation with Large Language Models: A Research Roadmap | Haolin Jin et.al. | 2501.11354 | null |
2025-02-13 | Large Language Model Agents for Radio Map Generation and Wireless Network Planning | Hongye Quan et.al. | 2501.11283 | null |
2024-12-18 | Autonomous Microscopy Experiments through Large Language Model Agents | Indrajeet Mandal et.al. | 2501.10385 | null |
2025-01-13 | Lifelong Learning of Large Language Model based Agents: A Roadmap | Junhao Zheng et.al. | 2501.07278 | link |
2025-01-10 | Multi-Agent Collaboration Mechanisms: A Survey of LLMs | Khanh-Tung Tran et.al. | 2501.06322 | null |
2025-01-09 | Emergence of human-like polarization among large language model agents | Jinghua Piao et.al. | 2501.05171 | null |
2025-01-27 | MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning | Pu Yang et.al. | 2501.01834 | null |
2025-01-03 | SDPO: Segment-Level Direct Preference Optimization for Social Agents | Aobo Kong et.al. | 2501.01821 | link |
2025-01-03 | AgentRefine: Enhancing Agent Generalization through Refinement Tuning | Dayuan Fu et.al. | 2501.01702 | null |
2025-01-02 | BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery | Kanishk Gandhi et.al. | 2501.01540 | link |
2024-12-31 | Enabling New HDLs with Agents | Mark Zakharov et.al. | 2501.00642 | null |
2025-01-09 | Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding | Yue Fan et.al. | 2501.00358 | null |
2024-12-30 | AI Agent for Education: von Neumann Multi-Agent System Framework | Yuan-Hao Jiang et.al. | 2501.00083 | null |
2024-12-17 | AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models | Haoyi Zhang et.al. | 2412.19824 | null |
2024-12-24 | Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent | Farhad Nooralahzadeh et.al. | 2412.18428 | link |
2024-12-24 | Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering | Zhongjian Hu et.al. | 2412.18351 | null |
2024-12-24 | INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent | Haohang Li et.al. | 2412.18174 | null |
2024-12-24 | Molly: Making Large Language Model Agents Solve Python Problem More Logically | Rui Xiao et.al. | 2412.18093 | null |
2024-12-17 | On the Structural Memory of LLM Agents | Ruihong Zeng et.al. | 2412.15266 | link |
2024-12-18 | Tree-of-Code: A Hybrid Approach for Robust Complex Task Planning and Execution | Ziyi Ni et.al. | 2412.14212 | null |
2024-12-17 | RareAgents: Autonomous Multi-disciplinary Team for Rare Disease Diagnosis and Treatment | Xuanzhong Chen et.al. | 2412.12475 | null |
2024-12-14 | Towards Action Hijacking of Large Language Model-based Agent | Yuyang Zhang et.al. | 2412.10807 | null |
2025-01-09 | Active Inference for Self-Organizing Multi-LLM Systems: A Bayesian Thermodynamic Approach to Adaptation | Rithvik Prakki et.al. | 2412.10425 | link |
2024-12-19 | Can Modern LLMs Act as Agent Cores in Radiology Environments? | Qiaoyu Zheng et.al. | 2412.09529 | link |
2024-12-09 | Toward LLM-Agent-Based Modeling of Transportation Systems: A Conceptual Framework | Tianming Liu et.al. | 2412.06681 | null |
2024-12-09 | Simulating Human-like Daily Activities with Desire-driven Autonomy | Yiding Wang et.al. | 2412.06435 | null |
2024-12-09 | StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI Astrophysicist | Cunshi Wang et.al. | 2412.06412 | null |
2024-12-09 | Beyond pip install: Evaluating LLM Agents for the Automated Installation of Python Projects | Louis Milliken et.al. | 2412.06294 | link |
2024-12-08 | Cooperative SQL Generation for Segmented Databases By Using Multi-functional LLM Agents | Zhiguang Wu et.al. | 2412.05850 | null |
2024-12-04 | DataLab: A Unified Platform for LLM-Powered Business Intelligence | Luoxuan Weng et.al. | 2412.02205 | null |
2024-12-02 | HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing | Lajos Muzsai et.al. | 2412.01778 | link |
2024-12-02 | SAUP: Situation Awareness Uncertainty Propagation on LLM Agent | Qiwei Zhao et.al. | 2412.01033 | null |
2024-12-03 | Multi-Agent System for Cosmological Parameter Analysis | Andrew Laverick et.al. | 2412.00431 | link |
2024-11-28 | SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments | Yue Cao et.al. | 2412.00114 | null |
2024-11-29 | Training Agents with Weakly Supervised Feedback from Large Language Models | Dihong Gong et.al. | 2411.19547 | null |
2024-11-26 | LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble | Yujeong Lee et.al. | 2411.17135 | null |
2024-11-21 | Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning | Song Jiang et.al. | 2411.13904 | null |
2024-11-19 | Human-In-the-Loop Software Development Agents | Wannita Takerngsaksiri et.al. | 2411.12924 | null |
2024-12-16 | A More Advanced Group Polarization Measurement Approach Based on LLM-Based Agents and Graphs | Zixin Liu et.al. | 2411.12196 | null |
2024-11-15 | Static network structure cannot stabilize cooperation among Large Language Model agents | Jin Han et.al. | 2411.10294 | null |
2024-11-15 | An Empirical Study on LLM-based Agents for Automated Bug Fixing | Xiangxin Meng et.al. | 2411.10213 | null |
2024-11-14 | Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents | Yuyou Gan et.al. | 2411.09523 | null |
2024-10-29 | FinVision: A Multi-Agent Framework for Stock Market Prediction | Sorouralsadat Fatemi et.al. | 2411.08899 | null |
2024-11-11 | Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving | Botao Yu et.al. | 2411.07228 | null |
2024-11-05 | Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities | Ryosuke Takata et.al. | 2411.03252 | null |
2024-11-02 | Interacting Large Language Model Agents. Interpretable Models and Social Learning | Adit Jain et.al. | 2411.01271 | null |
2024-11-02 | AutoPT: How Far Are We from the End2End Automated Web Penetration Testing? | Benlong Wu et.al. | 2411.01236 | link |
2024-11-02 | A Large-scale Time-aware Agents Simulation for Influencer Selection in Digital Advertising Campaigns | Xiaoqing Zhang et.al. | 2411.01143 | null |
2024-11-01 | Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement | Yingwei Ma et.al. | 2411.00622 | link |
2024-10-31 | From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents | Nalin Tiwary et.al. | 2410.23555 | null |
2024-10-30 | Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration | Yanchu Guan et.al. | 2410.22916 | null |
2024-10-29 | SceneGenAgent: Precise Industrial Scene Generation with Coding Agent | Xiao Xia et.al. | 2410.21909 | link |
2024-10-28 | Guide-LLM: An Embodied LLM Agent and Text-Based Topological Map for Robotic Guidance of People with Visual Impairments | Sangmim Song et.al. | 2410.20666 | null |
2024-10-29 | Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting | Mohamed Salim Aissi et.al. | 2410.19920 | null |
2024-11-07 | GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Xin Li et.al. | 2410.18032 | link |
2024-10-25 | MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting | Sungil Seok et.al. | 2410.18012 | null |
2024-10-22 | SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning | Yizhou Chi et.al. | 2410.17238 | link |
2024-10-22 | Adsorb-Agent: Autonomous Identification of Stable Adsorption Configurations via Large Language Model Agent | Janghoon Ock et.al. | 2410.16658 | link |
2024-10-21 | NetSafe: Exploring the Topological Safety of Multi-agent Networks | Miao Yu et.al. | 2410.15686 | null |
2024-10-20 | When Machine Unlearning Meets Retrieval-Augmented Generation (RAG): Keep Secret or Forget Knowledge? | Shang Wang et.al. | 2410.15267 | null |
2024-10-19 | SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation | Jingxuan Chen et.al. | 2410.15164 | link |
2024-10-18 | Agents4PLC: Automating Closed-loop PLC Code Generation and Verification in Industrial Control Systems using LLM-based Agents | Zihan Liu et.al. | 2410.14209 | link |
2024-10-18 | SRAP-Agent: Simulating and Optimizing Scarce Resource Allocation Policy with LLM-based Agent | Jiarui Ji et.al. | 2410.14152 | link |
2024-10-17 | AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents | Ke Yang et.al. | 2410.13825 | null |
2024-10-17 | Rapid and Automated Alloy Design with Graph Neural Network-Powered LLM-Driven Multi-Agent Systems | Alireza Ghafarollahi et.al. | 2410.13768 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-17 | ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models | Hanxing Ding et.al. | 2502.11404 | link |
2025-02-17 | Mimicking the Familiar: Dynamic Command Generation for Information Theft Attacks in LLM Tool-Learning System | Ziyou Jiang et.al. | 2502.11358 | null |
2025-02-14 | RTBAS: Defending LLM Agents Against Prompt Injection and Privacy Leakage | Peter Yong Zhong et.al. | 2502.08966 | null |
2025-02-03 | Tool Unlearning for Tool-Augmented LLMs | Jiali Cheng et.al. | 2502.01083 | null |
2025-01-30 | ACEBench: Who Wins the Match Point in Tool Learning? | Chen Chen et.al. | 2501.12851 | null |
2025-01-21 | Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation | Dongsheng Zhu et.al. | 2501.12432 | null |
2024-12-11 | GraphTool-Instruction: Revolutionizing Graph Reasoning in LLMs through Decomposed Subtask Instruction | Rongzheng Wang et.al. | 2412.12152 | null |
2024-12-11 | Federated In-Context LLM Agent Learning | Panlong Wu et.al. | 2412.08054 | null |
2024-12-08 | TOOL-ED: Enhancing Empathetic Response Generation with the Tool Calling Capability of LLM | Huiying Cao et.al. | 2412.03096 | link |
2024-10-15 | Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option | Konstantin Yakovlev et.al. | 2410.12004 | null |
2025-01-07 | NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models | Han Han et.al. | 2410.11805 | link |
2024-10-10 | From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions | Changle Qu et.al. | 2410.08197 | link |
2025-02-18 | StepTool: Enhancing Multi-Step Tool Usage in LLMs through Step-Grained Reinforcement Learning | Yuanqing Yu et.al. | 2410.07745 | link |
2025-02-24 | Learning Evolving Tools for Large Language Models | Guoxin Chen et.al. | 2410.06617 | link |
2024-10-08 | ToolGen: Unified Tool Retrieval and Calling via Generation | Renxi Wang et.al. | 2410.03439 | link |
2024-09-23 | CITI: Enhancing Tool Utilizing Ability in Large Language Models without Sacrificing General Performance | Yupu Hao et.al. | 2409.13202 | link |
2024-09-02 | ToolACE: Winning the Points of LLM Function Calling | Weiwen Liu et.al. | 2409.00920 | null |
2025-02-16 | Learning to Ask: When LLM Agents Meet Unclear Instruction | Wenxuan Wang et.al. | 2409.00557 | null |
2024-10-08 | MetaTool: Facilitating Large Language Models to Master Tools with Meta-task Augmentation | Xiaohan Wang et.al. | 2407.12871 | null |
2024-07-02 | WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models | Kangyun Ning et.al. | 2407.12823 | null |
2024-07-03 | What Affects the Stability of Tool Learning? An Empirical Study on the Robustness of Tool Learning Frameworks | Chengrui Huang et.al. | 2407.03007 | null |
2024-06-28 | Simulating Financial Market via Large Language Model based Agents | Shen Gao et.al. | 2406.19966 | null |
2024-09-29 | Enhancing Tool Retrieval with Iterative Feedback from Large Language Models | Qiancheng Xu et.al. | 2406.17465 | link |
2024-09-30 | Query Routing for Homogeneous Tools: An Instantiation in the RAG Scenario | Feiteng Mu et.al. | 2406.12429 | null |
2024-10-02 | Tool-Planner: Task Planning with Clusters across Multiple Tools | Yanming Liu et.al. | 2406.03807 | link |
2024-06-03 | A Survey of Useful LLM Evaluation | Ji-Lun Peng et.al. | 2406.00936 | null |
2024-11-04 | Tool Learning with Large Language Models: A Survey | Changle Qu et.al. | 2405.17935 | link |
2024-05-24 | Let Me Do It For You: Towards LLM Empowered Recommendation via Tool Learning | Yuyue Zhao et.al. | 2405.15114 | null |
2024-05-14 | Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark | Mengsong Wu et.al. | 2405.08355 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-20 | CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space | Yong Zhao et.al. | 2502.12532 | link |
2025-02-16 | NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM | Zihan Wang et.al. | 2502.11142 | link |
2025-02-14 | STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning | Mingcong Lei et.al. | 2502.10177 | null |
2025-02-11 | Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning | Yuhang Dong et.al. | 2502.09649 | null |
2025-02-23 | EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents | Rui Yang et.al. | 2502.09560 | null |
2025-02-10 | Visual Agentic AI for Spatial Reasoning with a Dynamic API | Damiano Marsili et.al. | 2502.06787 | null |
2025-02-09 | EvoAgent: Agent Autonomous Evolution with Continual World Model for Long-Horizon Tasks | Tongtong Feng et.al. | 2502.05907 | null |
2025-02-10 | Humans Co-exist, So Must Embodied Artificial Agents | Hannah Kuehn et.al. | 2502.04809 | null |
2025-02-04 | AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement | Shivam Singh et.al. | 2502.02067 | link |
2025-02-03 | Provable Ordering and Continuity in Vision-Language Pretraining for Generalizable Embodied Agents | Zhizhen Zhang et.al. | 2502.01218 | link |
2025-01-31 | MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems | Anirudh Chari et.al. | 2501.19318 | null |
2025-01-31 | GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling | Pinxin Liu et.al. | 2501.18898 | link |
2025-02-03 | UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent | Jianke Zhang et.al. | 2501.18867 | null |
2025-01-29 | PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding | Wei Chow et.al. | 2501.16411 | null |
2025-02-13 | What if Eye...? Computationally Recreating Vision Evolution | Kushagra Tiwary et.al. | 2501.15001 | link |
2025-01-21 | EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents | Zhili Cheng et.al. | 2501.11858 | link |
2025-01-17 | Universal Actions for Enhanced Embodied Foundation Models | Jinliang Zheng et.al. | 2501.10105 | link |
2025-01-15 | Embodied Scene Understanding for Vision Language Models via MetaVQA | Weizhen Wang et.al. | 2501.09167 | null |
2025-01-10 | Semantic Mapping in Indoor Embodied AI -- A Comprehensive Survey and Future Directions | Sonia Raychaudhuri et.al. | 2501.05750 | null |
2025-01-09 | ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark | Ronghao Dang et.al. | 2501.05031 | link |
2025-01-29 | Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey | Zongxia Li et.al. | 2501.02189 | link |
2025-01-02 | Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method | Ruichen Zhang et.al. | 2501.01141 | null |
2024-12-30 | UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI | Fangwei Zhong et.al. | 2412.20977 | null |
2024-12-28 | FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration | Jia Liu et.al. | 2412.20297 | null |
2024-12-30 | Embodied Image Quality Assessment for Robotic Intelligence | Jianbo Zhang et.al. | 2412.18774 | link |
2024-12-24 | Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems | Fernando Jia et.al. | 2412.18601 | link |
2024-12-24 | VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks | Shiduo Zhang et.al. | 2412.18194 | null |
2024-12-23 | Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples | Taewoong Kim et.al. | 2412.17288 | link |
2024-12-25 | Offline Reinforcement Learning for LLM Multi-Step Reasoning | Huaijie Wang et.al. | 2412.16145 | link |
2024-12-17 | GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Haoyi Jiang et.al. | 2412.13193 | link |
2024-12-18 | SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents | Sheng Yin et.al. | 2412.13178 | link |
2024-12-16 | Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents | Wonje Choi et.al. | 2412.11484 | null |
2024-12-05 | TANGO: Training-free Embodied AI Agents for Open-world Tasks | Filippo Ziliotto et.al. | 2412.10402 | null |
2024-12-11 | From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons | Andrew Szot et.al. | 2412.08442 | null |
2024-12-23 | SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World | Jiaqi Zhang et.al. | 2412.07472 | link |
2024-12-08 | InfiniteWorld: A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction | Pengzhen Ren et.al. | 2412.05789 | link |
2024-12-06 | TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft | Qian Long et.al. | 2412.05255 | link |
2024-12-06 | EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding | Yuqi Wu et.al. | 2412.04380 | link |
2024-12-03 | Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental Attacks | Zijiao Yang et.al. | 2412.02795 | null |
2024-12-25 | Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation | Yiyuan Pan et.al. | 2412.01857 | null |
2024-12-02 | The Bare Necessities: Designing Simple, Effective Open-Vocabulary Scene Graphs | Christina Kassab et.al. | 2412.01539 | null |
2024-12-02 | Generating Freeform Endoskeletal Robots | Muhan Li et.al. | 2412.01036 | null |
2024-12-01 | STEVE-Audio: Expanding the Goal Conditioning Modalities of Embodied Agents in Minecraft | Nicholas Lenzen et.al. | 2412.00949 | null |
2024-11-30 | Benchmark Real-time Adaptation and Communication Capabilities of Embodied Agent in Collaborative Scenarios | Shipeng Liu et.al. | 2412.00435 | null |
2024-11-28 | CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos | Xinhao Liu et.al. | 2411.17820 | link |
2024-12-15 | 3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning | Yuncong Yang et.al. | 2411.17735 | null |
2024-11-26 | LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble | Yujeong Lee et.al. | 2411.17135 | null |
2024-11-23 | Two Heads Are Better Than One: Collaborative LLM Embodied Agents for Human-Robot Interaction | Mitchell Rosser et.al. | 2411.16723 | null |
2024-11-25 | TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation | Linqing Zhong et.al. | 2411.16425 | null |
2024-12-04 | Functionality understanding and segmentation in 3D scenes | Jaime Corsetti et.al. | 2411.16310 | null |
2024-11-25 | Open-Vocabulary Octree-Graph for 3D Scene Understanding | Zhigang Wang et.al. | 2411.16253 | null |
2024-11-27 | XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models | Yixin Dong et.al. | 2411.15100 | null |
2024-11-20 | AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents | Kevin Godin-Dubois et.al. | 2411.13072 | null |
2024-11-25 | MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Collaborative Learning | Mircea Lică et.al. | 2411.12977 | null |
2024-11-15 | Voxel-Aggergated Feature Synthesis: Efficient Dense Mapping for Simulated 3D Reasoning | Owen Burns et.al. | 2411.10616 | null |
2024-11-13 | NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation | Youzhi Liu et.al. | 2411.08579 | null |
2024-11-08 | Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction | Émiland Garrabé et.al. | 2411.05474 | null |
2024-11-07 | MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation | Sayan Paul et.al. | 2411.04796 | null |
2024-11-07 | CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation | Jie Liu et.al. | 2411.04679 | null |
2024-11-07 | Scaling Laws for Pre-training Agents and World Models | Tim Pearce et.al. | 2411.04434 | null |
2024-11-05 | VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation | Haochen Zhang et.al. | 2411.03540 | link |
2024-11-04 | ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation | Hengkai Tan et.al. | 2411.01850 | null |
2024-11-05 | Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge | Weihua Du et.al. | 2411.01796 | link |
2024-10-31 | PARTNR: A Benchmark for Planning and Reasoning in Embodied Multi-agent Tasks | Matthew Chang et.al. | 2411.00081 | link |
2024-10-31 | Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use | Jiajun Xi et.al. | 2410.24218 | link |
2024-10-31 | Simulating User Agents for Embodied Conversational-AI | Daniel Philipov et.al. | 2410.23535 | null |
2024-10-30 | A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment | Matteo G. Mecattaf et.al. | 2410.23242 | link |
2024-10-29 | ADAM: An Embodied Causal Agent in Open-World Environments | Shu Yu et.al. | 2410.22194 | null |
2024-10-23 | Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments | Luca Barsellotti et.al. | 2410.18195 | link |
2024-10-21 | Agent-Based Emulation for Deploying Robot Swarm Behaviors | Ricardo Vega et.al. | 2410.16444 | null |
2024-10-18 | Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents | Sabit Hassan et.al. | 2410.14141 | null |
2024-10-17 | Goal Inference from Open-Ended Dialog | Rachel Ma et.al. | 2410.13957 | null |
2024-10-15 | M2Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes | Sixu Yan et.al. | 2410.11402 | null |
2024-10-14 | Embodied Active Learning of Generative Sensor-Object Models | Allison Pinosky et.al. | 2410.11130 | null |
2024-10-16 | PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation | Kaidong Zhang et.al. | 2410.10394 | null |
2024-10-12 | EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment | Chen Gao et.al. | 2410.09604 | null |
2024-10-05 | Semantic Environment Atlas for Object-Goal Navigation | Nuri Kim et.al. | 2410.09081 | null |
2024-11-01 | Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Manling Li et.al. | 2410.07166 | link |
2024-10-15 | M3Bench: Benchmarking Whole-body Motion Generation for Mobile Manipulation in 3D Scenes | Zeyu Zhang et.al. | 2410.06678 | null |
2024-10-08 | Entering Real Social World! Benchmarking the Theory of Mind and Socialization Capabilities of LLMs from a First-person Perspective | Guiyang Hou et.al. | 2410.06195 | link |
2024-10-07 | How do we Observe Relational Observables? | Emily Adlam et.al. | 2410.05508 | null |