Skip to content

observer4599/explainable-reinforcement-learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 

Repository files navigation

Explainable Reinforcement Learning (XRL) Resources

This repository aims to keep an up-to-date list of research on explainable reinforcement learning (XRL). If you find this helpful, you can give this repository a star and share it.

Missing resources or papers, problems, or questions? Open an issue here or email: contact (at) xrl (dot) ai.

Resources

  • Awesome Explainable Reinforcement Learning. Link

Newly Added Papers

#/Link Title Venue/Journal Year
1 CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems AAAI 2024
2 Causal Explanations for Sequential Decision-Making in Multi-Agent Systems AAMAS 2024
3 ASQ-IT: Interactive explanations for reinforcement-learning agents Artif. Intell. 2024
4 Abstracted Trajectory Visualization for Explainability in Reinforcement Learning CAI 2024
5 Demystifying Reinforcement Learning in Production Scheduling via Explainable AI CoRR 2024
6 Explaining an Agent's Future Beliefs through Temporally Decomposing Future Reward Estimators ECAI 2024
7 End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations ICML 2024
8 Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning IEEE Trans. Syst. Man Cybern. Syst. 2024
9 XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques KDD 2024
10 Semifactual Explanations for Reinforcement Learning Proc. of HAI 2024
11 Information based explanation methods for deep learning agents--with applications on large open-source chess models Scientific Reports 2024
12 Learning of Generalizable and Interpretable Knowledge in Grid-Based Reinforcement Learning Environments AIIDE 2023
13 Bridging the Human-AI Knowledge Gap: Concept Discovery and Transfer in AlphaZero CoRR 2023
14 Explaining Deep Reinforcement Learning-Based Methods for Control of Building HVAC Systems xAI 2023

Survey Papers

#/Link Title Venue/Journal Year
1 Explainable Reinforcement Learning: A Survey and Comparative Review ACM Comput. Surv. 2024
2 Redefining Counterfactual Explanations for Reinforcement Learning: Overview, Challenges and Opportunities ACM Comput. Surv. 2024
3 A Survey of Global Explanations in Reinforcement Learning Explainable Agency in Artificial Intelligence 2024
4 A survey on interpretable reinforcement learning Mach. Learn. 2024
5 Explainable reinforcement learning (XRL): a systematic literature review and taxonomy Mach. Learn. 2024
6 Explainable and Interpretable Reinforcement Learning for Robotics SLAIML 2024
7 Explainability in Deep Reinforcement Learning, a Review into Current Methods and Applications ACM Comput. Surv. 2023
8 A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges CoRR 2023
9 Advances in Explainable Reinforcement Learning: An Intelligent Transportation Systems Perspective Explainable Artificial Intelligence for Intelligent Transportation Systems 2023
10 Explainable reinforcement learning for broad-XAI: a conceptual framework and survey Neural Comput. Appl. 2023
11 Explainable Deep Reinforcement Learning: State of the Art and Challenges ACM Comput. Surv. 2022
12 Explainability in reinforcement learning: perspective and position CoRR 2022
13 Explainable AI and Reinforcement Learning - A Systematic Review of Current Approaches and Trends Frontiers Artif. Intell. 2021
14 Explainability in deep reinforcement learning Knowl. Based Syst. 2021
15 Explainable Reinforcement Learning: A Survey CD-MAKE 2020
16 Reinforcement Learning Interpretation Methods: A Survey IEEE Access 2020

Papers

#/Link Title Venue/Journal Year
1 Explaining Reinforcement Learning Agents Through Counterfactual Action Outcomes AAAI 2024
2 ASAP: Attention-Based State Space Abstraction for Policy Summarization ACML 2024
3 Causal State Distillation for Explainable Reinforcement Learning CLeaR 2024
4 Discovering Behavioral Modes in Deep Reinforcement Learning Policies Using Trajectory Clustering in Latent Space CoRR 2024
5 Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents CoRR 2024
6 Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning CoRR 2024
7 Detection of Important States through an Iterative Q-value Algorithm for Explainable Reinforcement Learning HICSS 2024
8 ''You just can't go around killing people'' Explaining Agent Behavior to a Human Terminator ICML Workshop MHFAIA 2024
9 Deep Reinforcement Learning Behavioral Mode Switching Using Optimal Control Based on a Latent Space Objective MED 2024
10 Explaining Deep Reinforcement Learning Policies with SHAP, Decision Trees, and Prototypes MED 2024
11 Local Explanations for Reinforcement Learning AAAI 2023
12 Explainable Reinforcement Learning Based on Q-Value Decomposition by Expected State Transitions AAAI-MAKE 2023
13 GANterfactual-RL: Understanding Reinforcement Learning Agents' Strategies through Visual Counterfactual Explanations AAMAS 2023
14 Interpreting a deep reinforcement learning model with conceptual embedding and performance analysis Appl. Intell. 2023
15 Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach ECML PKDD 2023
16 Inherently Interpretable Deep Reinforcement Learning Through Online Mimicking EXTRAAMAS 2023
17 A Closer Look at Reward Decomposition for High-Level Robotic Explanations ICDL 2023
18 Towards Interpretable Deep Reinforcement Learning with Human-Friendly Prototypes ICLR 2023
19 Explaining Reinforcement Learning with Shapley Values ICML 2023
20 Counterfactual Explanation Policies in RL ICML Workshop on Counterfactuals in Minds and Machines 2023
21 Explaining Black Box Reinforcement Learning Agents Through Counterfactual Policies IDA 2023
22 Extracting Decision Tree From Trained Deep Reinforcement Learning in Traffic Signal Control IEEE Trans. Comput. Soc. Syst. 2023
23 Real-Time Counterfactual Explanations For Robotic Systems With Multiple Continuous Outputs IFAC-PapersOnLine 2023
24 Explainable Multi-Agent Reinforcement Learning for Temporal Queries IJCAI 2023
25 Explainable Reinforcement Learning via a Causal World Model IJCAI 2023
26 Unveiling Concepts Learned by a World-Class Chess-Playing Agent IJCAI 2023
27 Extracting tactics learned from self-play in general games Inf. Sci. 2023
28 Learning state importance for preference-based reinforcement learning Mach. Learn. 2023
29 Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction NeurIPS 2023
30 State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding NeurIPS 2023
31 StateMask: Explaining Deep Reinforcement Learning through State Mask NeurIPS 2023
32 Comparing explanations in RL Neural Comput. Appl. 2023
33 Explainable robotic systems: understanding goal-driven actions in a reinforcement learning scenario Neural Comput. Appl. 2023
34 Hierarchical goals contextualize local reward decomposition explanations Neural Comput. Appl. 2023
35 Achieving efficient interpretability of reinforcement learning via policy distillation and selective input gradient regularization Neural Networks 2023
36 Model tree methods for explaining deep reinforcement learning agents in real-time robotic applications Neurocomputing 2023
37 Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents PAAMS 2023
38 Contrastive Visual Explanations for Reinforcement Learning via Counterfactual Rewards xAI 2023
39 IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit Based on Analyses of Interestingness xAI 2023
40 "I Don't Think So": Summarizing Policy Disagreements for Agent Comparison AAAI 2022
41 CAPS: Comprehensible Abstract Policy Summaries for Explaining Reinforcement Learning Agents AAMAS 2022
42 Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions AAMAS 2022
43 Lazy-MDPs: Towards Interpretable RL by Learning When to Act AAMAS 2022
44 Explaining Online Reinforcement Learning Decisions of Self-Adaptive Systems ACSOS 2022
45 Analysis of Explainable Goal-Driven Reinforcement Learning in a Continuous Simulated Environment Algorithms 2022
46 BEERL: Both Ends Explanations for Reinforcement Learning Applied Sciences 2022
47 Energy-Efficient Driving for Adaptive Traffic Signal Control Environment via Explainable Reinforcement Learning Applied Sciences 2022
48 Concept Learning for Interpretable Multi-Agent Reinforcement Learning CoRL 2022
49 Comparing Strategies for Visualizing the High-Dimensional Exploration Behavior of CPS Design Agents DESTION 2022
50 InAction: Interpretable Action Decision Making for Autonomous Driving ECCV 2022
51 Enhanced Oblique Decision Tree Enabled Policy Extraction for Deep Reinforcement Learning in Power System Emergency Control Electric Power Systems Research 2022
52 Attributation Analysis of Reinforcement Learning-Based Highway Driver Electronics 2022
53 Multi-objective Genetic Programming for Explainable Reinforcement Learning EuroGP 2022
54 Deep-Learning-based Fuzzy Symbolic Processing with Agents Capable of Knowledge Communication ICAART 2022
55 Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations ICLR 2022
56 POETREE: Interpretable Policy Learning with Adaptive Decision Trees ICLR 2022
57 Programmatic Reinforcement Learning without Oracles ICLR 2022
58 Explaining Reinforcement Learning Policies through Counterfactual Trajectories ICML Workshop on HILL 2022
59 Mean-variance Based Risk-sensitive Reinforcement Learning with Interpretable Attention ICMVA 2022
60 Towards Interpretable Deep Reinforcement Learning Models via Inverse Reinforcement Learning ICPR 2022
61 Explaining Intelligent Agent's Future Motion on Basis of Vocabulary Learning With Human Goal Inference IEEE Access 2022
62 Interpretable Autonomous Flight Via Compact Visualizable Neural Circuit Policies IEEE Robotics Autom. Lett. 2022
63 Explainable AI in Deep Reinforcement Learning Models for Power System Emergency Control IEEE Trans. Comput. Soc. Syst. 2022
64 Hierarchical Program-Triggered Reinforcement Learning Agents for Automated Driving IEEE Trans. Intell. Transp. Syst. 2022
65 Interpretable End-to-End Urban Autonomous Driving With Latent Deep Reinforcement Learning IEEE Trans. Intell. Transp. Syst. 2022
66 Continuous Action Reinforcement Learning From a Mixture of Interpretable Experts IEEE Trans. Pattern Anal. Mach. Intell. 2022
67 Self-Supervised Discovering of Interpretable Features for Reinforcement Learning IEEE Trans. Pattern Anal. Mach. Intell. 2022
68 Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning IEEE Trans. Pattern Anal. Mach. Intell. 2022
69 Visual Analytics for RNN-Based Deep Reinforcement Learning IEEE Trans. Vis. Comput. Graph. 2022
70 Toward Interpretable-AI Policies Using Evolutionary Nonlinear Decision Trees for Discrete-Action Systems IEEE Transactions on Cybernetics 2022
71 Understanding via Exploration: Discovery of Interpretable Features With Deep Reinforcement Learning IEEE Transactions on Neural Networks and Learning Systems 2022
72 Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction IJCAI Workshop on XAI 2022
73 ACMViz: a visual analytics approach to understand DRL-based autonomous control model J. Vis. 2022
74 Incorporating Explanations to Balance the Exploration and Exploitation of Deep Reinforcement Learning KSEM 2022
75 Towards Explainable Reinforcement Learning Using Scoring Mechanism Augmented Agents KSEM 2022
76 Explainable Reinforcement Learning via Model Transforms NeurIPS 2022
77 GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis NeurIPS 2022
78 Inherently Explainable Reinforcement Learning in Natural Language NeurIPS 2022
79 Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning NeurIPS 2022
80 ProtoX: Explaining a Reinforcement Learning Agent via Prototyping NeurIPS 2022
81 (When) Are Contrastive Explanations of Reinforcement Learning Helpful? NeurIPS workshop on HiLL 2022
82 Mo"ET: Mixture of Expert Trees and its application to verifiable reinforcement learning Neural Networks 2022
83 Analysing deep reinforcement learning agents trained with domain randomisation Neurocomputing 2022
84 Why? Why not? When? Visual Explanations of Agent Behaviour in Reinforcement Learning PacificVis 2022
85 Driving behavior explanation with multi-level fusion Pattern Recognit. 2022
86 Acquisition of chess knowledge in AlphaZero Proc. Natl. Acad. Sci. U.S.A. 2022
87 Learning Interpretable, High-Performing Policies for Autonomous Driving Robotics: Science and Systems 2022
88 Event-driven temporal models for explanations - ETeMoX: explaining reinforcement learning Softw. Syst. Model. 2022
89 Toward a Psychology of Deep Reinforcement Learning Agents Using a Cognitive Architecture Top. Cogn. Sci. 2022
90 DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning AAAI 2021
91 Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods AAAI 2021
92 TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments AAAI 2021
93 Explaining Deep Reinforcement Learning Agents in the Atari Domain through a Surrogate Model AIIDE 2021
94 A framework of explanation generation toward reliable autonomous robots Adv. Robotics 2021
95 Explainable Deep Reinforcement Learning for UAV autonomous path planning Aerospace Science and Technology 2021
96 Explaining robot policies Applied AI Letters 2021
97 Counterfactual state explanations for reinforcement learning agents via generative deep learning Artif. Intell. 2021
98 Local and global explanations of agent behavior: Integrating strategy summaries with saliency maps Artif. Intell. 2021
99 XPM: An Explainable Deep Reinforcement Learning Framework for Portfolio Management CIKM 2021
100 Interactive Explanations: Diagnosis and Repair of Reinforcement Learning Based Agent Behaviors CoG 2021
101 CDT: Cascading Decision Trees for Explainable Reinforcement Learning CoRR 2021
102 Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents CoRR 2021
103 Approximating a deep reinforcement learning docking agent using linear model trees ECC 2021
104 Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations ECC 2021
105 Off-Policy Differentiable Logic Reinforcement Learning ECML PKDD 2021
106 Neuro-Symbolic Reinforcement Learning with First-Order Logic EMNLP 2021
107 Explainable Reinforcement Learning for Longitudinal Control ICAART 2021
108 Explainable deep reinforcement learning for portfolio management: an empirical approach ICAIF 2021
109 Explainable Reinforcement Learning for Human-Robot Collaboration ICAR 2021
110 DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation ICCV 2021
111 Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions ICLR 2021
112 Explaining by Imitating: Understanding Decisions by Interpretable Policy Learning ICLR 2021
113 Learning "What-if" Explanations for Sequential Decision-Making ICLR 2021
114 Discovering symbolic policies with deep reinforcement learning ICML 2021
115 Re-understanding Finite-State Representations of Recurrent Policy Networks ICML 2021
116 Explainable Reinforcement Learning with the Tsetlin Machine IEA/AIE 2021
117 A Blood Glucose Control Framework Based on Reinforcement Learning With Safety and Interpretability: In Silico Validation IEEE Access 2021
118 Symbolic Regression Methods for Reinforcement Learning IEEE Access 2021
119 Efficient Robotic Object Search Via HIEM: Hierarchical Policy Learning With Intrinsic-Extrinsic Modeling IEEE Robotics Autom. Lett. 2021
120 Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning IEEE Robotics Autom. Lett. 2021
121 Explaining Deep Learning Models Through Rule-Based Approximation and Visualization IEEE Trans. Fuzzy Syst. 2021
122 Interpretable Decision-Making for Autonomous Vehicles at Highway On-Ramps With Latent Space Reinforcement Learning IEEE Trans. Veh. Technol. 2021
123 Explainable AI methods on a deep reinforcement learning agent for automatic docking IFAC-PapersOnLine 2021
124 Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement Learning IJCNN 2021
125 Programmatic Policy Extraction by Iterative Local Search ILP 2021
126 Explaining the Decisions of Deep Policy Networks for Robotic Manipulations IROS 2021
127 XAI-N: Sensor-based Robot Navigation using Expert Policies and Decision Trees IROS 2021
128 Mixed Autonomous Supervision in Traffic Signal Control ITSC 2021
129 Can You Trust Your Autonomous Car? Interpretable and Verifiably Safe Reinforcement Learning IV 2021
130 Explaining a Deep Reinforcement Learning Docking Agent Using Linear Model Trees with User Adapted Visualization Journal of Marine Science and Engineering 2021
131 Visual Analysis of Deep Q-network KSII Trans. Internet Inf. Syst. 2021
132 Automatic discovery of interpretable planning strategies Mach. Learn. 2021
133 EDGE: Explaining Deep Reinforcement Learning Policies NeurIPS 2021
134 Learning Tree Interpretation from Object Representation for Deep Reinforcement Learning NeurIPS 2021
135 Learning to Synthesize Programs as Interpretable and Generalizable Policies NeurIPS 2021
136 Machine versus Human Attention in Deep Reinforcement Learning Tasks NeurIPS 2021
137 Explainable Artificial Intelligence (XAI) for Increasing User Trust in Deep Reinforcement Learning Driven Autonomous Systems NeurIPS Workshop on Deep RL 2021
138 Identifying Decision Points for Safe and Interpretable Reinforcement Learning in Hypotension Treatment NeurIPS Workshop on Machine Learning for Health 2021
139 Feature-Based Interpretable Reinforcement Learning based on State-Transition Models SMC 2021
140 A co-evolutionary approach to interpretable reinforcement learning in environments with continuous action spaces SSCI 2021
141 Interpretable AI Agent Through Nonlinear Decision Trees for Lane Change Problem SSCI 2021
142 Learning Sparse Evidence- Driven Interpretation to Understand Deep Reinforcement Learning Agents SSCI 2021
143 Explainable Reinforcement Learning through a Causal Lens AAAI 2020
144 Attribution-based Salience Method towards Interpretable Reinforcement Learning AAAI-MAKE 2020
145 Learning an Interpretable Traffic Signal Control Policy AAMAS 2020
146 Optimization Methods for Interpretable Differentiable Decision Trees Applied to Reinforcement Learning AISTATS 2020
147 Interestingness elements for explainable reinforcement learning: Understanding agents' capabilities and limitations Artif. Intell. 2020
148 Model primitives for hierarchical lifelong reinforcement learning Auton. Agents Multi Agent Syst. 2020
149 Understanding the Behavior of Reinforcement Learning Agents BIOMA 2020
150 Methodology for Interpretable Reinforcement Learning Model for HVAC Energy Control Big Data 2020
151 Explaining Autonomous Driving by Learning End-to-End Visual Attention CVPRW 2020
152 Understanding Learned Reward Functions CoRR 2020
153 Interpretable policy derivation for reinforcement learning based on evolutionary feature synthesis Complex & Intelligent Systems 2020
154 DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning Comput. Graph. Forum 2020
155 Understanding RL Vision Distill 2020
156 Interpretable policies for reinforcement learning by empirical fuzzy sets Eng. Appl. Artif. Intell. 2020
157 Neuroevolution of self-interpretable agents GECCO 2020
158 Topological Visualization Method for Understanding the Landscape of Value Functions and Structure of the State Space in Reinforcement Learning ICAART 2020
159 Identifying Critical States by the Action-Based Variance of Expected Return ICANN 2020
160 TLdR: Policy Summarization for Factored SSP Problems Using Temporal Abstractions ICAPS 2020
161 Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution ICLR 2020
162 Exploratory Not Explanatory: Counterfactual Analysis of Saliency Maps for Deep Reinforcement Learning ICLR 2020
163 Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents ICLR 2020
164 Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions ICML 2020
165 Deep Reinforcement Learning for Safe Local Planning of a Ground Vehicle in Unknown Rough Terrain IEEE Robotics Autom. Lett. 2020
166 Towards Interpretable Reinforcement Learning with State Abstraction Driven by External Knowledge IEICE Trans. Inf. Syst. 2020
167 Improved Policy Extraction via Online Q-Value Distillation IJCNN 2020
168 Visualization of topographical internal representation of learning robots IJCNN 2020
169 Explainable navigation system using fuzzy reinforcement learning IJIDeM 2020
170 Explainability of Intelligent Transportation Systems using Knowledge Compilation: a Traffic Light Controller Case ITSC 2020
171 xGAIL: Explainable Generative Adversarial Imitation Learning for Explainable Human Decision Analysis KDD 2020
172 What Did You Think Would Happen? Explaining Agent Behaviour through Intended Outcomes NeurIPS 2020
173 Explaining Conditions for Reinforcement Learning Behaviors from Real and Imagined Data NeurIPS Workshop on Challenges of Real-World RL 2020
174 DynamicsExplorer: Visual Analytics for Robot Control Tasks involving Dynamics and LSTM-based Control Policies PacificVis 2020
175 Combining reinforcement learning with rule-based controllers for transparent and general decision-making in autonomous driving Robotics Auton. Syst. 2020
176 Modelling Agent Policies with Interpretable Imitation Learning TAILOR 2020
177 Interpretable, Verifiable, and Robust Reinforcement Learning via Program Synthesis xxAI - Beyond Explainable AI 2020
178 Generation of Policy-Level Explanations for Reinforcement Learning AAAI 2019
179 SDRL: Interpretable and Data-Efficient Deep Reinforcement Learning Leveraging Symbolic Planning AAAI 2019
180 Towards Better Interpretability in Deep Q-Networks AAAI 2019
181 Toward Robust Policy Summarization AAMAS 2019
182 Towards Governing Agent's Efficacy: Action-Conditional \textdollar(\beta)\textdollar-VAE for Deep Transparent Reinforcement Learning ACML 2019
183 Memory-Based Explainable Reinforcement Learning AI 2019
184 Summarizing agent strategies Auton. Agents Multi Agent Syst. 2019
185 Enabling robots to communicate their objectives Auton. Robots 2019
186 Visualization of Deep Reinforcement Learning using Grad-CAM: How AI Plays Atari Games? CoG 2019
187 Explaining Reward Functions in Markov Decision Processes FLAIRS 2019
188 Explanation-Based Reward Coaching to Improve Human Performance via Reinforcement Learning HRI 2019
189 Free-Lunch Saliency via Attention in Atari Agents ICCVW 2019
190 Deep reinforcement learning with relational inductive biases ICLR 2019
191 Learning Finite State Representations of Recurrent Policy Networks ICLR 2019
192 Neural Logic Reinforcement Learning ICML 2019
193 Interpretable Approximation of a Deep Reinforcement Learning Agent as a Set of If-Then Rules ICMLA 2019
194 Semantic Predictive Control for Explainable and Efficient Policy Learning ICRA 2019
195 DQNViz: A Visual Analytics Approach to Understand Deep Q-Networks IEEE Trans. Vis. Comput. Graph. 2019
196 Visualizing Deep Q-Learning to Understanding Behavior of Swarm Robotic System IES 2019
197 Exploring Computational User Models for Agent Policy Summarization IJCA 2019
198 Explaining Reinforcement Learning to Mere Mortals: An Empirical Study IJCAI 2019
199 Counterfactual States for Atari Agents via Generative Deep Learning IJCAI Workshop on XAI 2019
200 Distilling Deep Reinforcement Learning Policies in Soft Decision Trees IJCAI Workshop on XAI 2019
201 Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation IROS 2019
202 Reinforcement Learning with Explainability for Traffic Signal Control ITSC 2019
203 Interestingness Elements for Explainable Reinforcement Learning through Introspection IUI Workshops 2019
204 Explainable Reinforcement Learning via Reward Decomposition JCAI Workshop on XAI 2019
205 Enhancing Explainability of Deep Reinforcement Learning Through Selective Layer-Wise Relevance Propagation KI 2019
206 Imitation-Projected Programmatic Reinforcement Learning NeurIPS 2019
207 Towards Interpretable Reinforcement Learning Using Attention Augmented Agents NeurIPS 2019
208 Verbal Explanations for Deep Reinforcement Learning Neural Networks with Attention on Extracted Features RO-MAN 2019
209 A formal methods approach to interpretable reinforcement learning for robotic planning Sci. Robotics 2019
210 HIGHLIGHTS: Summarizing Agent Behavior to People AAMAS 2018
211 Rationalization: A Neural Machine Translation Approach to Generating Natural Language Explanations AIES 2018
212 Transparency and Explanation in Deep Reinforcement Learning Neural Networks AIES 2018
213 Visual Rationalizations in Deep Reinforcement Learning for Atari Games BNAIC 2018
214 Textual Explanations for Self-Driving Vehicles ECCV 2018
215 Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees ECML PKDD 2018
216 Interpretable policies for reinforcement learning by genetic programming Eng. Appl. Artif. Intell. 2018
217 Generating interpretable fuzzy controllers using particle swarm optimization and genetic programming GECCO 2018
218 Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning ICLR 2018
219 Programmatically Interpretable Reinforcement Learning ICML 2018
220 Visualizing and Understanding Atari Agents ICML 2018
221 Deep Reinforcement Learning Monitor for Snapshot Recording ICMLA 2018
222 Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences IJCAI Workshop on XAI 2018
223 Explaining Deep Adaptive Programs via Reward Decomposition IJCAI/ECAI Workshop XAI 2018
224 Establishing Appropriate Trust via Critical States IROS 2018
225 Unsupervised Video Object Segmentation for Deep Reinforcement Learning NeurIPS 2018
226 Verifiable Reinforcement Learning via Policy Extraction NeurIPS 2018
227 Visual Sparse Bayesian Reinforcement Learning: A Framework for Interpreting What an Agent Has Learned SSCI 2018
228 Particle swarm optimization for generating interpretable fuzzy reinforcement learning policies Eng. Appl. Artif. Intell. 2017
229 Autonomous Self-Explanation of Behavior for Interactive Reinforcement Learning Agents HAI 2017
230 Improving Robot Controller Transparency Through Autonomous Policy Explanation HRI 2017
231 Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention ICCV 2017
232 Application of Instruction-Based Behavior Explanation to a Reinforcement Learning Agent with Changing Policy ICONIP 2017
233 Graying the black box: Understanding DQNs ICML 2016