<!DOCTYPE html>
<!-- adapted from http://bayesiandeeplearning.org/ -->
<html class="mel_workshop">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<title>Embodied Multimodal Learning Workshop | ICLR 2021</title>
<meta http-equiv="Cache-Control" content="no-cache, no-store, must-revalidate">
<meta http-equiv="Pragma" content="no-cache">
<meta http-equiv="Expires" content="0">
<meta name="description" content="Embodied Multimodal Learning Workshop | ICLR 2021">
<meta name="keywords" content="Embodied,Multimodal,Learning,Workshop,ICLR,2021">
<meta name="viewport" content="width=device-width, initial-scale=1">
<link rel="stylesheet" href="./EML_files/main.css">
<meta property="og:title" content="Embodied Multimodal Learning (EML)| ICLR 2021">
<meta property="og:type" content="website">
<meta property="og:url" content="http://eml-workshop.org">
<meta property="og:description" content="Embodied Multimodal Learning Workshop at ICLR 2021 (Virtual) — Friday, May 7th, 2021">
</head>
<body>
<!-- Wrapper -->
<div id="wrapper">
<!-- Header -->
<header id="header" class="alt">
<h1>Embodied Multimodal Learning (EML)</h1>
<h2><b>ICLR 2021 Workshop (Virtual)</b></h2>
<h2>Friday, May 7th, 2021</h2>
<h2><a href="https://iclr.cc/virtual/2021/workshop/2134" target="_blank"><font color="red">Link to ICLR Workshop Virtual Site (Join Zoom)</font></a></h2>
</header>
<!-- Nav -->
<nav id="nav" class="">
<ul>
<li><a href="https://eml-workshop.github.io/#abstract" class="active">Abstract</a></li>
<li><a href="https://eml-workshop.github.io/#speakers" class="">Invited Speakers</a></li>
<li><a href="https://eml-workshop.github.io/#cfp" class="">Call for Papers</a></li>
<li><a href="https://eml-workshop.github.io/#schedule" class="">Schedule</a></li>
<li><a href="https://eml-workshop.github.io/#organizers" class="">Organizers</a></li>
</ul>
</nav>
<!-- Main -->
<div id="main">
<!-- Introduction -->
<section id="abstract" class="main">
<div>
<div class="content">
<header class="major">
<center>
<h2>Abstract</h2>
</center>
</header>
<h3> Despite encouraging progress in embodied learning over the past two decades, there is still a large gap between the perception of embodied agents and that of humans. Humans have a remarkable ability to combine all of their multisensory inputs. To close this gap, embodied agents should likewise be able to see, hear, touch, and interact with their surroundings in order to select appropriate actions. However, today's learning algorithms primarily operate on a single modality. For artificial intelligence to make progress in understanding the world around us, it needs to be able to interpret such multimodal signals jointly. The goal of this workshop is to share recent progress and discuss current challenges in embodied learning with multiple modalities.<br><br>
The EML workshop will bring together researchers from different subareas of embodied multimodal learning, including computer vision, robotics, machine learning, natural language processing, and cognitive science, to examine the challenges and opportunities emerging from the design of embodied agents that unify their multisensory inputs. We will review the current state of the field and identify the research infrastructure needed to enable stronger collaboration among researchers working on different modalities.
</h3>
</div>
<!-- <span class="image"></span> -->
</div>
</section>
<section id="speakers" class="main">
<div>
<div class="content">
<header class="major">
<center>
<h2>Invited Speakers</h2>
</center>
</header>
<div class="row uniform" align="center">
<div class="3u 12u$(small)">
<a href="https://ai.stanford.edu/~cdarpino/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/claudia.jpg" alt="">
</span>
<h2>Claudia Pérez D'Arpino<br> (Stanford) </h2>
</a>
</div>
<div class="3u 12u$(small)">
<a href="http://www.cs.cmu.edu/~abhinavg/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/abhinav.jpg" alt="">
</span>
<h2>Abhinav Gupta<br> (CMU & FAIR) </h2>
</a>
</div>
<div class="3u 12u$(small)">
<a href="https://fh295.github.io/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/felix.jpg" alt="">
</span>
<h2>Felix Hill<br> (DeepMind) </h2>
</a>
</div>
<div class="3u 12u$(small)">
<a href="http://www.csc.kth.se/~danik/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/danica.jpg" alt="">
</span>
<h2>Danica Kragic<br> (KTH) </h2>
</a>
</div>
</div>
<div class="row uniform" align="center">
<div class="3u 12u$(small)">
<a href="https://www.is.mpg.de/~kjk" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/katherine.jpg" alt="">
</span>
<h2>Katherine Kuchenbecker<br> (MPI-IS) </h2>
</a>
</div>
<div class="3u 12u$(small)">
<a href="https://people.eecs.berkeley.edu/~svlevine/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/levine.jpg" alt="">
</span>
<h2>Sergey Levine<br> (UC Berkeley & Google) </h2>
</a>
</div>
<div class="3u 12u$(small)">
<a href="https://people.eecs.berkeley.edu/~malik/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/malik.jpg" alt="">
</span>
<h2>Jitendra Malik<br> (UC Berkeley & FAIR) </h2>
</a>
</div>
<div class="3u 12u$(small)">
<a href="https://psych.indiana.edu/directory/faculty/smith-linda.html" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/smith.jpg" alt="">
</span>
<h2>Linda Smith<br> (Indiana University) </h2>
</a>
</div>
</div>
</div>
</section>
<section id="cfp" class="main">
<div>
<div class="content">
<header class="major">
<center>
<h2>Call for Papers</h2>
</center>
</header>
<h3>We invite submissions of 2-4 page extended abstracts on topics related to (but not limited to):
<ul>
<li>audio-visual embodied learning</li>
<li>touch sensing and embodied learning</li>
<li>language and embodied learning</li>
<li>speech and embodied learning</li>
<li>self-supervised/semi-supervised learning with multiple modalities</li>
<li>multimodal reinforcement learning</li>
<li>meta-learning with multiple modalities</li>
<li>novel multimodal datasets/simulators/tasks for embodied agents</li>
<li>combining multisensory inputs for robot perception</li>
<li>bio-inspired approaches for multimodal perception</li>
</ul>
A submission should take the form of an extended abstract (2-4 pages, excluding references) in PDF format using the <a href="https://eml-workshop.github.io/ICLR2021_EML_Workshop.zip" target="_blank">ICLR style</a>. We will accept submissions of (1) papers that have not been previously published or accepted for publication in substantially similar form; (2) papers that have been published or accepted for publication at recent venues, including journals, conferences, workshops, and arXiv; and (3) research proposals for future work with a focus on well-defined concepts and ideas. All submissions will be reviewed under a single-blind policy. Accepted extended abstracts will not appear in the ICLR proceedings and hence will not affect future publication of the work. We will publish all accepted extended abstracts on the workshop webpage.
</h3>
<br>
<h2>CMT submission website: <a href="https://cmt3.research.microsoft.com/EML2021">https://cmt3.research.microsoft.com/EML2021</a></h2>
<br>
<h2>Key Dates:</h2>
<h3>
<ul>
<li>Extended abstract submission deadline: <del>March 5th, 2021 (11:59 PM PST)</del></li>
<li>Late submission deadline: <del>March 22nd, 2021 (11:59 PM PST)</del></li>
<li>Notification to authors: <del>March 26th, 2021</del></li>
<li>Workshop date: May 7th, 2021</li>
</ul>
</h3>
<h2>Program Committee:</h2>
<h3>
Unnat Jain (UIUC), Michelle Lee (Stanford), Paul Pu Liang (CMU), Senthil Purushwalkam (CMU), Santhosh Kumar Ramakrishnan (UT Austin), Mohit Shridhar (UW), Tianmin Shu (MIT), Shaoxiong Wang (MIT)
</h3>
</div>
</div>
</section>
<section id="schedule" class="main">
<div>
<div class="content">
<header class="major">
<center>
<h2>Schedule</h2>
</center>
</header>
<div class="table-wrapper">
<table class="alt">
<colgroup>
<col width="20%">
<col width="20%">
<col width="25%">
</colgroup>
<tbody>
<tr>
<td>07:55 am - 08:00 am (PDT)</td>
<td>Introduction and Opening Remarks</td>
<td></td>
<td></td>
</tr>
<tr>
<td>08:00 am - 08:30 am (PDT)</td>
<td>Invited Talk</td>
<td>Katherine Kuchenbecker<br><font size="2">(MPI-IS)</font></td>
<td></td>
</tr>
<tr>
<td>08:30 am - 09:00 am (PDT) </td>
<td>Invited Talk</td>
<td>Danica Kragic<br><font size="2">(KTH)</font></td>
<td></td>
</tr>
<tr>
<td>09:00 am - 09:30 am (PDT) </td>
<td><b>Paper Session A</b></td>
<td>A1 - A5</td>
<td></td>
</tr>
<tr>
<td>09:30 am - 09:40 am (PDT) </td>
<td><b>Paper Session A Q&A</b></td>
<td></td>
<td></td>
</tr>
<tr>
<td>09:40 am - 10:00 am (PDT)</td>
<td>Break</td>
<td></td>
<td></td>
</tr>
<tr>
<td>10:00 am - 10:30 am (PDT)</td>
<td>Invited Talk</td>
<td>Linda Smith<br><font size="2">(Indiana University)</font></td>
<td></td>
</tr>
<tr>
<td>10:30 am - 11:00 am (PDT)</td>
<td>Invited Talk</td>
<td>Felix Hill<br><font size="2">(DeepMind)</font></td>
<td></td>
</tr>
<tr>
<td>11:00 am - 12:00 pm (PDT)</td>
<td><b>Panel Discussion</b></td>
<td>Kristen Grauman, Felix Hill, Katherine Kuchenbecker, Sergey Levine, Jitendra Malik, Linda Smith</td>
<td>Have a question for the panelists? Ask <a href="https://app.sli.do/event/0bmttghf/live/questions">here</a>!</td>
</tr>
<tr>
<td>12:00 pm - 12:30 pm (PDT)</td>
<td>Break</td>
<td></td>
<td></td>
</tr>
<tr>
<td>12:30 pm - 01:00 pm (PDT)</td>
<td>Invited Talk</td>
<td>Abhinav Gupta<br><font size="2">(CMU & FAIR)</font></td>
<td></td>
</tr>
<tr>
<td>01:00 pm - 01:30 pm (PDT)</td>
<td>Invited Talk</td>
<td>Sergey Levine<br><font size="2">(UC Berkeley & Google)</font></td>
<td></td>
</tr>
<tr>
<td>01:30 pm - 02:00 pm (PDT)</td>
<td><b>Paper Session B</b></td>
<td>B1 - B4</td>
<td></td>
</tr>
<tr>
<td>02:00 pm - 02:10 pm (PDT) </td>
<td><b>Paper Session B Q&A</b></td>
<td></td>
<td></td>
</tr>
<tr>
<td>02:10 pm - 02:30 pm (PDT) </td>
<td>Break</td>
<td></td>
<td></td>
</tr>
<tr>
<td>02:30 pm - 03:00 pm (PDT)</td>
<td>Invited Talk</td>
<td>Jitendra Malik<br><font size="2">(UC Berkeley & FAIR)</font></td>
<td></td>
</tr>
<tr>
<td>03:00 pm - 03:30 pm (PDT)</td>
<td>Invited Talk</td>
<td>Claudia Pérez D'Arpino<br><font size="2">(Stanford University)</font></td>
<td></td>
</tr>
<tr>
<td>03:30 pm - 03:35 pm (PDT)</td>
<td>Closing Remarks</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>
</div>
<center>
<h2>Accepted Papers</h2>
</center>
<table>
<colgroup>
<col width="40%">
<col width="60%">
</colgroup>
<tbody>
<tr><td><b>
Title
</b></td><td><b>
Authors
</b></td><td><b>
Paper Session
</b></td></tr>
<tr><td><b><a href="./Papers/A1.pdf">ABC Problem: An Investigation of Offline RL for Vision-Based Dynamic Manipulation</a></b></td><td>Kamyar Ghasemipour, Igor Mordatch, Shixiang Shane Gu</td><td><b>A1</b></td></tr>
<tr><td><b><a href="./Papers/A2.pdf">Language Acquisition is Embodied, Interactive, Emotive: a Research Proposal</a></b></td><td>Casey Kennington</td><td><b>A2</b></td></tr>
<tr><td><b><a href="./Papers/A3.pdf">Ask & Explore: Grounded Question Answering for Curiosity-Driven Exploration</a></b></td><td>Jivat Neet Kaur, Yiding Jiang, Paul Pu Liang</td><td><b>A3</b></td></tr>
<tr><td><b><a href="./Papers/A4.pdf">Towards Teaching Machines with Language: Interactive Learning From Only Language Descriptions of Activities</a></b></td><td>Khanh Nguyen, Dipendra Misra, Robert Schapire, Miroslav Dudik, Patrick Shafto</td><td><b>A4</b></td></tr>
<tr><td><b><a href="./Papers/A5.pdf">YouRefIt: Embodied Reference Understanding with Language and Gesture</a></b></td><td>Yixin Chen, Qing Li, Deqian Kong, Yik Lun Kei, Tao Gao, Yixin Zhu, Song-Chun Zhu, Siyuan Huang</td><td><b>A5</b></td></tr>
<tr><td><b><a href="./Papers/B1.pdf">Learning to Set Waypoints for Audio-Visual Navigation</a></b></td><td>Changan Chen, Sagnik Majumder, Ziad Al-Halah, Ruohan Gao, Santhosh K. Ramakrishnan, Kristen Grauman</td><td><b>B1</b></td></tr>
<tr><td><b><a href="./Papers/B2.pdf">Semantic Audio-Visual Navigation</a></b></td><td>Changan Chen, Ziad Al-Halah, Kristen Grauman</td><td><b>B2</b></td></tr>
<tr><td><b>Attentive Feature Reuse for Multi Task Meta learning</b></td><td>Kiran Lekkala, Laurent Itti</td><td><b>B3</b></td></tr>
<tr><td><b><a href="./Papers/B4.pdf">SeLaVi: self-labelling videos without any annotations from scratch</a></b></td><td>Yuki Asano, Mandela Patrick, Christian Rupprecht, Andrea Vedaldi</td><td><b>B4</b></td></tr>
</tbody>
</table>
</div>
</div>
</section>
<section id="organizers" class="main special">
<div>
<div class="content">
<header class="major">
<h2>Organizers</h2>
</header>
<div class="row uniform">
<div class="2u 12u$(small)">
<a href="https://ai.stanford.edu/~rhgao/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/ruohan.jpg" alt="">
</span>
<h3>Ruohan Gao <br> (Stanford)</h3>
</a>
</div>
<div class="2u 12u$(small)">
<a href="http://andrewowens.com/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/andrew.jpg" alt="">
</span>
<h3>Andrew Owens <br> (UMich)</h3>
</a>
</div>
<div class="2u 12u$(small)">
<a href="https://www.seas.upenn.edu/~dineshj/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/dinesh.jpg" alt="">
</span>
<h3>Dinesh Jayaraman<br> (UPenn)</h3>
</a>
</div>
<div class="2u 12u$(small)">
<a href="https://www.cs.utexas.edu/~yukez/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/yuke.jpg" alt="">
</span>
<h3>Yuke Zhu<br> (UT Austin & Nvidia)</h3>
</a>
</div>
<div class="2u 12u$(small)">
<a href="https://jiajunwu.com/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/jiajun.jpg" alt="">
</span>
<h3>Jiajun Wu <br> (Stanford)</h3>
</a>
</div>
<div class="2u 12u$(small)">
<a href="http://www.cs.utexas.edu/users/grauman/" target="_blank" class="image">
<span class="image fit">
<img src="./EML_files/grauman.jpg" alt="">
</span>
<h3>Kristen Grauman <br> (UT Austin & FAIR)</h3>
</a>
</div>
</div>
<br>
</div>
</div>
</section>
</div>
<!-- Footer -->
<footer id="footer">
<p class="copyright">Website design adapted from <a href="http://bayesiandeeplearning.org/">Yarin Gal </a> and based on <a href="https://html5up.net/">HTML5 UP</a>.</p>
</footer>
</div>
<!-- Scripts -->
<script src="./EML_files/jquery.min.js"></script>
<script src="./EML_files/jquery.scrollex.min.js"></script>
<script src="./EML_files/jquery.scrolly.min.js"></script>
<script src="./EML_files/skel.min.js"></script>
<script src="./EML_files/util.js"></script>
<!--[if lte IE 8]><script src="assets/js/ie/respond.min.js"></script><![endif]-->
<script src="./EML_files/main.js"></script>
<!-- Global site tag (gtag.js) - Google Analytics -->
<script async src="https://www.googletagmanager.com/gtag/js?id=G-JC1QVS8GW2"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag(){dataLayer.push(arguments);}
gtag('js', new Date());
gtag('config', 'G-JC1QVS8GW2');
</script>
</body></html>