Github Action Automatic Update CV Arxiv Papers

DWCTOD · Dec 5, 2024 · 589bab4
1 parent 5ae500b

Showing 6 changed files with 63 additions and 3 deletions.
diff --git a/README.md b/README.md
@@ -4,6 +4,16 @@
 
 |Publish Date|Title|Authors|PDF|Code|
 |---|---|---|---|---|
+|**2024-12-04**|**Navigation World Models**|Amir Bar et.al.|[2412.03572v1](http://arxiv.org/abs/2412.03572v1)|null|
+|**2024-12-04**|**The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control**|Ruili Feng et.al.|[2412.03568v1](http://arxiv.org/abs/2412.03568v1)|null|
+|**2024-12-04**|**Streaming Detection of Queried Event Start**|Cristobal Eyzaguirre et.al.|[2412.03567v1](http://arxiv.org/abs/2412.03567v1)|null|
+|**2024-12-04**|**Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning**|Wujian Peng et.al.|[2412.03565v1](http://arxiv.org/abs/2412.03565v1)|null|
+|**2024-12-04**|**From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents**|Xinyi Mou et.al.|[2412.03563v1](http://arxiv.org/abs/2412.03563v1)|null|
+|**2024-12-04**|**Imagine360: Immersive 360 Video Generation from Perspective Anchor**|Jing Tan et.al.|[2412.03552v1](http://arxiv.org/abs/2412.03552v1)|null|
+|**2024-12-04**|**Kibble-Zurek Dynamics & Statistics of Topological Defects in Chiral Superfluid $^3$He Films**|Noble Gluscevich et.al.|[2412.03544v1](http://arxiv.org/abs/2412.03544v1)|null|
+|**2024-12-04**|**Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos**|Hanxue Liang et.al.|[2412.03526v1](http://arxiv.org/abs/2412.03526v1)|null|
+|**2024-12-04**|**Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention**|Hannan Lu et.al.|[2412.03520v1](http://arxiv.org/abs/2412.03520v1)|null|
+|**2024-12-04**|**Distillation of Diffusion Features for Semantic Correspondence**|Frank Fundel et.al.|[2412.03512v1](http://arxiv.org/abs/2412.03512v1)|null|
 |**2024-12-03**|**Motion Prompting: Controlling Video Generation with Motion Trajectories**|Daniel Geng et.al.|[2412.02700v1](http://arxiv.org/abs/2412.02700v1)|null|
 |**2024-12-03**|**An ADHD Diagnostic Interface Based on EEG Spectrograms and Deep Learning Techniques**|Medha Pappula et.al.|[2412.02695v1](http://arxiv.org/abs/2412.02695v1)|null|
 |**2024-12-03**|**FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation**|Kefan Chen et.al.|[2412.02690v1](http://arxiv.org/abs/2412.02690v1)|null|
@@ -6177,6 +6187,16 @@
 
 |Publish Date|Title|Authors|PDF|Code|
 |---|---|---|---|---|
+|**2024-12-04**|**Data Fusion of Semantic and Depth Information in the Context of Object Detection**|Md Abu Yusuf et.al.|[2412.03490v1](http://arxiv.org/abs/2412.03490v1)|null|
+|**2024-12-04**|**Multi-Momentum Observer Contact Estimation for Bipedal Robots**|J. Joe Payne et.al.|[2412.03462v1](http://arxiv.org/abs/2412.03462v1)|null|
+|**2024-12-04**|**Distortions in Charged-Particle Images of Laser Direct-Drive Inertial Confinement Fusion Implosions**|P. V. Heuer et.al.|[2412.03362v1](http://arxiv.org/abs/2412.03362v1)|null|
+|**2024-12-04**|**Gaussian Processes for Probabilistic Estimates of Earthquake Ground Shaking: A 1-D Proof-of-Concept**|Sam A. Scivier et.al.|[2412.03299v1](http://arxiv.org/abs/2412.03299v1)|**[link](https://github.com/sscivier/gp-prob-earthquake-shaking)**|
+|**2024-12-04**|**Task-driven Image Fusion with Learnable Fusion Loss**|Haowen Bai et.al.|[2412.03240v1](http://arxiv.org/abs/2412.03240v1)|null|
+|**2024-12-04**|**Fab-ME: A Vision State-Space and Attention-Enhanced Framework for Fabric Defect Detection**|Shuai Wang et.al.|[2412.03200v1](http://arxiv.org/abs/2412.03200v1)|null|
+|**2024-12-04**|**Weighted-Reward Preference Optimization for Implicit Model Fusion**|Ziyi Yang et.al.|[2412.03187v1](http://arxiv.org/abs/2412.03187v1)|**[link](https://github.com/SLIT-AI/WRPO)**|
+|**2024-12-04**|**IRisPath: Enhancing Off-Road Navigation with Robust IR-RGB Fusion for Improved Day and Night Traversability**|Saksham Sharma et.al.|[2412.03173v1](http://arxiv.org/abs/2412.03173v1)|null|
+|**2024-12-04**|**Asynchronous Event-Inertial Odometry using a Unified Gaussian Process Regression Framework**|Xudong Li et.al.|[2412.03136v1](http://arxiv.org/abs/2412.03136v1)|null|
+|**2024-12-04**|**CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning**|Runjian Chen et.al.|[2412.03059v1](http://arxiv.org/abs/2412.03059v1)|null|
 |**2024-12-03**|**Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks**|Jinjin Cai et.al.|[2412.02531v1](http://arxiv.org/abs/2412.02531v1)|null|
 |**2024-12-03**|**Trajectory-based Road Autolabeling with Lidar-Camera Fusion in Winter Conditions**|Eerik Alamikkotervo et.al.|[2412.02370v1](http://arxiv.org/abs/2412.02370v1)|**[link](https://github.com/eerik98/lidar-camera-road-autolabeling)**|
 |**2024-12-03**|**Initial Study On Improving Segmentation By Combining Preoperative CT And Intraoperative CBCT Using Synthetic Data**|Maximilian E. Tschuchnig et.al.|[2412.02294v1](http://arxiv.org/abs/2412.02294v1)|null|

diff --git a/cv-arxiv-daily.json b/cv-arxiv-daily.json
diff --git a/docs/cv-arxiv-daily-web.json b/docs/cv-arxiv-daily-web.json
diff --git a/docs/cv-arxiv-daily-wechat.json b/docs/cv-arxiv-daily-wechat.json
diff --git a/docs/index.md b/docs/index.md
@@ -8,6 +8,16 @@ layout: default
 
 | Publish Date | Title | Authors | PDF | Code |
 |:---------|:-----------------------|:---------|:------|:------|
+|**2024-12-04**|**Navigation World Models**|Amir Bar et.al.|[2412.03572v1](http://arxiv.org/abs/2412.03572v1)|null|
+|**2024-12-04**|**The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control**|Ruili Feng et.al.|[2412.03568v1](http://arxiv.org/abs/2412.03568v1)|null|
+|**2024-12-04**|**Streaming Detection of Queried Event Start**|Cristobal Eyzaguirre et.al.|[2412.03567v1](http://arxiv.org/abs/2412.03567v1)|null|
+|**2024-12-04**|**Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning**|Wujian Peng et.al.|[2412.03565v1](http://arxiv.org/abs/2412.03565v1)|null|
+|**2024-12-04**|**From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents**|Xinyi Mou et.al.|[2412.03563v1](http://arxiv.org/abs/2412.03563v1)|null|
+|**2024-12-04**|**Imagine360: Immersive 360 Video Generation from Perspective Anchor**|Jing Tan et.al.|[2412.03552v1](http://arxiv.org/abs/2412.03552v1)|null|
+|**2024-12-04**|**Kibble-Zurek Dynamics & Statistics of Topological Defects in Chiral Superfluid $^3$He Films**|Noble Gluscevich et.al.|[2412.03544v1](http://arxiv.org/abs/2412.03544v1)|null|
+|**2024-12-04**|**Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos**|Hanxue Liang et.al.|[2412.03526v1](http://arxiv.org/abs/2412.03526v1)|null|
+|**2024-12-04**|**Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention**|Hannan Lu et.al.|[2412.03520v1](http://arxiv.org/abs/2412.03520v1)|null|
+|**2024-12-04**|**Distillation of Diffusion Features for Semantic Correspondence**|Frank Fundel et.al.|[2412.03512v1](http://arxiv.org/abs/2412.03512v1)|null|
 |**2024-12-03**|**Motion Prompting: Controlling Video Generation with Motion Trajectories**|Daniel Geng et.al.|[2412.02700v1](http://arxiv.org/abs/2412.02700v1)|null|
 |**2024-12-03**|**An ADHD Diagnostic Interface Based on EEG Spectrograms and Deep Learning Techniques**|Medha Pappula et.al.|[2412.02695v1](http://arxiv.org/abs/2412.02695v1)|null|
 |**2024-12-03**|**FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation**|Kefan Chen et.al.|[2412.02690v1](http://arxiv.org/abs/2412.02690v1)|null|
@@ -6181,6 +6191,16 @@ layout: default
 
 | Publish Date | Title | Authors | PDF | Code |
 |:---------|:-----------------------|:---------|:------|:------|
+|**2024-12-04**|**Data Fusion of Semantic and Depth Information in the Context of Object Detection**|Md Abu Yusuf et.al.|[2412.03490v1](http://arxiv.org/abs/2412.03490v1)|null|
+|**2024-12-04**|**Multi-Momentum Observer Contact Estimation for Bipedal Robots**|J. Joe Payne et.al.|[2412.03462v1](http://arxiv.org/abs/2412.03462v1)|null|
+|**2024-12-04**|**Distortions in Charged-Particle Images of Laser Direct-Drive Inertial Confinement Fusion Implosions**|P. V. Heuer et.al.|[2412.03362v1](http://arxiv.org/abs/2412.03362v1)|null|
+|**2024-12-04**|**Gaussian Processes for Probabilistic Estimates of Earthquake Ground Shaking: A 1-D Proof-of-Concept**|Sam A. Scivier et.al.|[2412.03299v1](http://arxiv.org/abs/2412.03299v1)|**[link](https://github.com/sscivier/gp-prob-earthquake-shaking)**|
+|**2024-12-04**|**Task-driven Image Fusion with Learnable Fusion Loss**|Haowen Bai et.al.|[2412.03240v1](http://arxiv.org/abs/2412.03240v1)|null|
+|**2024-12-04**|**Fab-ME: A Vision State-Space and Attention-Enhanced Framework for Fabric Defect Detection**|Shuai Wang et.al.|[2412.03200v1](http://arxiv.org/abs/2412.03200v1)|null|
+|**2024-12-04**|**Weighted-Reward Preference Optimization for Implicit Model Fusion**|Ziyi Yang et.al.|[2412.03187v1](http://arxiv.org/abs/2412.03187v1)|**[link](https://github.com/SLIT-AI/WRPO)**|
+|**2024-12-04**|**IRisPath: Enhancing Off-Road Navigation with Robust IR-RGB Fusion for Improved Day and Night Traversability**|Saksham Sharma et.al.|[2412.03173v1](http://arxiv.org/abs/2412.03173v1)|null|
+|**2024-12-04**|**Asynchronous Event-Inertial Odometry using a Unified Gaussian Process Regression Framework**|Xudong Li et.al.|[2412.03136v1](http://arxiv.org/abs/2412.03136v1)|null|
+|**2024-12-04**|**CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning**|Runjian Chen et.al.|[2412.03059v1](http://arxiv.org/abs/2412.03059v1)|null|
 |**2024-12-03**|**Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks**|Jinjin Cai et.al.|[2412.02531v1](http://arxiv.org/abs/2412.02531v1)|null|
 |**2024-12-03**|**Trajectory-based Road Autolabeling with Lidar-Camera Fusion in Winter Conditions**|Eerik Alamikkotervo et.al.|[2412.02370v1](http://arxiv.org/abs/2412.02370v1)|**[link](https://github.com/eerik98/lidar-camera-road-autolabeling)**|
 |**2024-12-03**|**Initial Study On Improving Segmentation By Combining Preoperative CT And Intraoperative CBCT Using Synthetic Data**|Maximilian E. Tschuchnig et.al.|[2412.02294v1](http://arxiv.org/abs/2412.02294v1)|null|

diff --git a/docs/wechat.md b/docs/wechat.md
@@ -2,6 +2,16 @@
 
 ## Video_Classification
 
+- 2024-12-04, **Navigation World Models**, Amir Bar et.al., Paper: [http://arxiv.org/abs/2412.03572v1](http://arxiv.org/abs/2412.03572v1)
+- 2024-12-04, **The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control**, Ruili Feng et.al., Paper: [http://arxiv.org/abs/2412.03568v1](http://arxiv.org/abs/2412.03568v1)
+- 2024-12-04, **Streaming Detection of Queried Event Start**, Cristobal Eyzaguirre et.al., Paper: [http://arxiv.org/abs/2412.03567v1](http://arxiv.org/abs/2412.03567v1)
+- 2024-12-04, **Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning**, Wujian Peng et.al., Paper: [http://arxiv.org/abs/2412.03565v1](http://arxiv.org/abs/2412.03565v1)
+- 2024-12-04, **From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents**, Xinyi Mou et.al., Paper: [http://arxiv.org/abs/2412.03563v1](http://arxiv.org/abs/2412.03563v1)
+- 2024-12-04, **Imagine360: Immersive 360 Video Generation from Perspective Anchor**, Jing Tan et.al., Paper: [http://arxiv.org/abs/2412.03552v1](http://arxiv.org/abs/2412.03552v1)
+- 2024-12-04, **Kibble-Zurek Dynamics & Statistics of Topological Defects in Chiral Superfluid $^3$He Films**, Noble Gluscevich et.al., Paper: [http://arxiv.org/abs/2412.03544v1](http://arxiv.org/abs/2412.03544v1)
+- 2024-12-04, **Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos**, Hanxue Liang et.al., Paper: [http://arxiv.org/abs/2412.03526v1](http://arxiv.org/abs/2412.03526v1)
+- 2024-12-04, **Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention**, Hannan Lu et.al., Paper: [http://arxiv.org/abs/2412.03520v1](http://arxiv.org/abs/2412.03520v1)
+- 2024-12-04, **Distillation of Diffusion Features for Semantic Correspondence**, Frank Fundel et.al., Paper: [http://arxiv.org/abs/2412.03512v1](http://arxiv.org/abs/2412.03512v1)
 - 2024-12-03, **Motion Prompting: Controlling Video Generation with Motion Trajectories**, Daniel Geng et.al., Paper: [http://arxiv.org/abs/2412.02700v1](http://arxiv.org/abs/2412.02700v1)
 - 2024-12-03, **An ADHD Diagnostic Interface Based on EEG Spectrograms and Deep Learning Techniques**, Medha Pappula et.al., Paper: [http://arxiv.org/abs/2412.02695v1](http://arxiv.org/abs/2412.02695v1)
 - 2024-12-03, **FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation**, Kefan Chen et.al., Paper: [http://arxiv.org/abs/2412.02690v1](http://arxiv.org/abs/2412.02690v1)
@@ -6173,6 +6183,16 @@
 
 ## MultiModal
 
+- 2024-12-04, **Data Fusion of Semantic and Depth Information in the Context of Object Detection**, Md Abu Yusuf et.al., Paper: [http://arxiv.org/abs/2412.03490v1](http://arxiv.org/abs/2412.03490v1)
+- 2024-12-04, **Multi-Momentum Observer Contact Estimation for Bipedal Robots**, J. Joe Payne et.al., Paper: [http://arxiv.org/abs/2412.03462v1](http://arxiv.org/abs/2412.03462v1)
+- 2024-12-04, **Distortions in Charged-Particle Images of Laser Direct-Drive Inertial Confinement Fusion Implosions**, P. V. Heuer et.al., Paper: [http://arxiv.org/abs/2412.03362v1](http://arxiv.org/abs/2412.03362v1)
+- 2024-12-04, **Gaussian Processes for Probabilistic Estimates of Earthquake Ground Shaking: A 1-D Proof-of-Concept**, Sam A. Scivier et.al., Paper: [http://arxiv.org/abs/2412.03299v1](http://arxiv.org/abs/2412.03299v1), Code: **[https://github.com/sscivier/gp-prob-earthquake-shaking](https://github.com/sscivier/gp-prob-earthquake-shaking)**
+- 2024-12-04, **Task-driven Image Fusion with Learnable Fusion Loss**, Haowen Bai et.al., Paper: [http://arxiv.org/abs/2412.03240v1](http://arxiv.org/abs/2412.03240v1)
+- 2024-12-04, **Fab-ME: A Vision State-Space and Attention-Enhanced Framework for Fabric Defect Detection**, Shuai Wang et.al., Paper: [http://arxiv.org/abs/2412.03200v1](http://arxiv.org/abs/2412.03200v1)
+- 2024-12-04, **Weighted-Reward Preference Optimization for Implicit Model Fusion**, Ziyi Yang et.al., Paper: [http://arxiv.org/abs/2412.03187v1](http://arxiv.org/abs/2412.03187v1), Code: **[https://github.com/SLIT-AI/WRPO](https://github.com/SLIT-AI/WRPO)**
+- 2024-12-04, **IRisPath: Enhancing Off-Road Navigation with Robust IR-RGB Fusion for Improved Day and Night Traversability**, Saksham Sharma et.al., Paper: [http://arxiv.org/abs/2412.03173v1](http://arxiv.org/abs/2412.03173v1)
+- 2024-12-04, **Asynchronous Event-Inertial Odometry using a Unified Gaussian Process Regression Framework**, Xudong Li et.al., Paper: [http://arxiv.org/abs/2412.03136v1](http://arxiv.org/abs/2412.03136v1)
+- 2024-12-04, **CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning**, Runjian Chen et.al., Paper: [http://arxiv.org/abs/2412.03059v1](http://arxiv.org/abs/2412.03059v1)
 - 2024-12-03, **Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks**, Jinjin Cai et.al., Paper: [http://arxiv.org/abs/2412.02531v1](http://arxiv.org/abs/2412.02531v1)
 - 2024-12-03, **Trajectory-based Road Autolabeling with Lidar-Camera Fusion in Winter Conditions**, Eerik Alamikkotervo et.al., Paper: [http://arxiv.org/abs/2412.02370v1](http://arxiv.org/abs/2412.02370v1), Code: **[https://github.com/eerik98/lidar-camera-road-autolabeling](https://github.com/eerik98/lidar-camera-road-autolabeling)**
 - 2024-12-03, **Initial Study On Improving Segmentation By Combining Preoperative CT And Intraoperative CBCT Using Synthetic Data**, Maximilian E. Tschuchnig et.al., Paper: [http://arxiv.org/abs/2412.02294v1](http://arxiv.org/abs/2412.02294v1)