Skip to content

Commit

Permalink
Github Action Automatic Update CV Arxiv Papers
Browse files Browse the repository at this point in the history
  • Loading branch information
DWCTOD committed Nov 27, 2024
1 parent 9fa2025 commit 8a08bf9
Show file tree
Hide file tree
Showing 6 changed files with 66 additions and 6 deletions.
22 changes: 21 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,9 +1,19 @@
## Updated on 2024.11.26
## Updated on 2024.11.27

## Video_Classification

|Publish Date|Title|Authors|PDF|Code|
|---|---|---|---|---|
|**2024-11-26**|**Video-Guided Foley Sound Generation with Multimodal Controls**|Ziyang Chen et.al.|[2411.17698v1](http://arxiv.org/abs/2411.17698v1)|null|
|**2024-11-26**|**StableAnimator: High-Quality Identity-Preserving Human Image Animation**|Shuyuan Tu et.al.|[2411.17697v1](http://arxiv.org/abs/2411.17697v1)|**[link](https://github.com/Francis-Rings/StableAnimator)**|
|**2024-11-26**|**Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis**|Akshita Gupta et.al.|[2411.17690v1](http://arxiv.org/abs/2411.17690v1)|null|
|**2024-11-26**|**BERT or FastText? A Comparative Analysis of Contextual as well as Non-Contextual Embeddings**|Abhay Shanbhag et.al.|[2411.17661v1](http://arxiv.org/abs/2411.17661v1)|null|
|**2024-11-26**|**DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting**|Christian Homeyer et.al.|[2411.17660v1](http://arxiv.org/abs/2411.17660v1)|**[link](https://github.com/chenhoy/droid-splat)**|
|**2024-11-26**|**SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation**|Claudia Cuttano et.al.|[2411.17646v1](http://arxiv.org/abs/2411.17646v1)|**[link](https://github.com/claudiacuttano/samwise)**|
|**2024-11-26**|**A robust image encryption scheme based on new 4-D hyperchaotic system and elliptic curve**|Yehia Lalili et.al.|[2411.17643v1](http://arxiv.org/abs/2411.17643v1)|null|
|**2024-11-26**|**On Limitations of LLM as Annotator for Low Resource Languages**|Suramya Jadhav et.al.|[2411.17637v1](http://arxiv.org/abs/2411.17637v1)|null|
|**2024-11-26**|**An Ensemble Approach for Brain Tumor Segmentation and Synthesis**|Juampablo E. Heras Rivera et.al.|[2411.17617v1](http://arxiv.org/abs/2411.17617v1)|null|
|**2024-11-26**|**Accelerating Vision Diffusion Transformers with Skip Branches**|Guanjie Chen et.al.|[2411.17616v1](http://arxiv.org/abs/2411.17616v1)|**[link](https://github.com/opensparsellms/skip-dit)**|
|**2024-11-25**|**Generative Omnimatte: Learning to Decompose Video into Layers**|Yao-Chih Lee et.al.|[2411.16683v1](http://arxiv.org/abs/2411.16683v1)|null|
|**2024-11-25**|**Quark: Real-time, High-resolution, and General Neural View Synthesis**|John Flynn et.al.|[2411.16680v1](http://arxiv.org/abs/2411.16680v1)|null|
|**2024-11-25**|**A Supervised Machine Learning Approach for Assessing Grant Peer Review Reports**|Gabriel Okasa et.al.|[2411.16662v1](http://arxiv.org/abs/2411.16662v1)|null|
Expand Down Expand Up @@ -6137,6 +6147,16 @@

|Publish Date|Title|Authors|PDF|Code|
|---|---|---|---|---|
|**2024-11-26**|**What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics**|Jordan J. Bird et.al.|[2411.17593v1](http://arxiv.org/abs/2411.17593v1)|null|
|**2024-11-26**|**E-CONDOR: Efficient Contour-Based Detection Of Random Spatial Signals From UAV Observations Using Dual Stochastic Gradient**|Maryam Zahra et.al.|[2411.17449v1](http://arxiv.org/abs/2411.17449v1)|null|
|**2024-11-26**|**Multimodal Outer Arithmetic Block Dual Fusion of Whole Slide Images and Omics Data for Precision Oncology**|Omnia Alwazzan et.al.|[2411.17418v1](http://arxiv.org/abs/2411.17418v1)|null|
|**2024-11-26**|**MFF-FTNet: Multi-scale Feature Fusion across Frequency and Temporal Domains for Time Series Forecasting**|Yangyang Shi et.al.|[2411.17382v1](http://arxiv.org/abs/2411.17382v1)|null|
|**2024-11-26**|**Graphical calculus for quantum vertex operators, II: q-KZB and coordinate Macdonald-Ruijsenaars equations**|Hadewijch De Clercq et.al.|[2411.17306v1](http://arxiv.org/abs/2411.17306v1)|null|
|**2024-11-26**|**in-Car Biometrics (iCarB) Datasets for Driver Recognition: Face, Fingerprint, and Voice**|Vedrana Krivokuca Hahn et.al.|[2411.17305v1](http://arxiv.org/abs/2411.17305v1)|null|
|**2024-11-26**|**Resurfaced CsPbBr3 Nanocrystals Enable Free Radical Thermal Polymerization of Efficient Ultrafast Polyvinyl Styrene Nanocomposite Scintillators**|Francesco Carulli et.al.|[2411.17259v1](http://arxiv.org/abs/2411.17259v1)|null|
|**2024-11-26**|**DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model**|JiHwan Moon et.al.|[2411.17248v1](http://arxiv.org/abs/2411.17248v1)|null|
|**2024-11-26**|**Depth-PC: A Visual Servo Framework Integrated with Cross-Modality Fusion for Sim2Real Transfer**|Haoyu Zhang et.al.|[2411.17195v1](http://arxiv.org/abs/2411.17195v1)|null|
|**2024-11-26**|**Enhancing Lane Segment Perception and Topology Reasoning with Crowdsourcing Trajectory Priors**|Peijin Jia et.al.|[2411.17161v1](http://arxiv.org/abs/2411.17161v1)|null|
|**2024-11-25**|**The JWST Weather Report from the Isolated Exoplanet Analog SIMP 0136+0933: Pressure-Dependent Variability Driven by Multiple Mechanisms**|Allison M. McCarthy et.al.|[2411.16577v1](http://arxiv.org/abs/2411.16577v1)|null|
|**2024-11-25**|**Deformable Mamba for Wide Field of View Segmentation**|Jie Hu et.al.|[2411.16481v1](http://arxiv.org/abs/2411.16481v1)|**[link](https://github.com/jiehu1996/deformablemamba)**|
|**2024-11-25**|**Luminosity predictions for the first three ionisation stages of W, Pt and Au to probe potential sources of emission in kilonova**|M. McCann et.al.|[2411.16476v1](http://arxiv.org/abs/2411.16476v1)|null|
Expand Down
2 changes: 1 addition & 1 deletion cv-arxiv-daily.json

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion docs/cv-arxiv-daily-web.json

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion docs/cv-arxiv-daily-wechat.json

Large diffs are not rendered by default.

22 changes: 21 additions & 1 deletion docs/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,12 +2,22 @@
layout: default
---

## Updated on 2024.11.26
## Updated on 2024.11.27

## Video_Classification

| Publish Date | Title | Authors | PDF | Code |
|:---------|:-----------------------|:---------|:------|:------|
|**2024-11-26**|**Video-Guided Foley Sound Generation with Multimodal Controls**|Ziyang Chen et.al.|[2411.17698v1](http://arxiv.org/abs/2411.17698v1)|null|
|**2024-11-26**|**StableAnimator: High-Quality Identity-Preserving Human Image Animation**|Shuyuan Tu et.al.|[2411.17697v1](http://arxiv.org/abs/2411.17697v1)|**[link](https://github.com/Francis-Rings/StableAnimator)**|
|**2024-11-26**|**Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis**|Akshita Gupta et.al.|[2411.17690v1](http://arxiv.org/abs/2411.17690v1)|null|
|**2024-11-26**|**BERT or FastText? A Comparative Analysis of Contextual as well as Non-Contextual Embeddings**|Abhay Shanbhag et.al.|[2411.17661v1](http://arxiv.org/abs/2411.17661v1)|null|
|**2024-11-26**|**DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting**|Christian Homeyer et.al.|[2411.17660v1](http://arxiv.org/abs/2411.17660v1)|**[link](https://github.com/chenhoy/droid-splat)**|
|**2024-11-26**|**SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation**|Claudia Cuttano et.al.|[2411.17646v1](http://arxiv.org/abs/2411.17646v1)|**[link](https://github.com/claudiacuttano/samwise)**|
|**2024-11-26**|**A robust image encryption scheme based on new 4-D hyperchaotic system and elliptic curve**|Yehia Lalili et.al.|[2411.17643v1](http://arxiv.org/abs/2411.17643v1)|null|
|**2024-11-26**|**On Limitations of LLM as Annotator for Low Resource Languages**|Suramya Jadhav et.al.|[2411.17637v1](http://arxiv.org/abs/2411.17637v1)|null|
|**2024-11-26**|**An Ensemble Approach for Brain Tumor Segmentation and Synthesis**|Juampablo E. Heras Rivera et.al.|[2411.17617v1](http://arxiv.org/abs/2411.17617v1)|null|
|**2024-11-26**|**Accelerating Vision Diffusion Transformers with Skip Branches**|Guanjie Chen et.al.|[2411.17616v1](http://arxiv.org/abs/2411.17616v1)|**[link](https://github.com/opensparsellms/skip-dit)**|
|**2024-11-25**|**Generative Omnimatte: Learning to Decompose Video into Layers**|Yao-Chih Lee et.al.|[2411.16683v1](http://arxiv.org/abs/2411.16683v1)|null|
|**2024-11-25**|**Quark: Real-time, High-resolution, and General Neural View Synthesis**|John Flynn et.al.|[2411.16680v1](http://arxiv.org/abs/2411.16680v1)|null|
|**2024-11-25**|**A Supervised Machine Learning Approach for Assessing Grant Peer Review Reports**|Gabriel Okasa et.al.|[2411.16662v1](http://arxiv.org/abs/2411.16662v1)|null|
Expand Down Expand Up @@ -6141,6 +6151,16 @@ layout: default

| Publish Date | Title | Authors | PDF | Code |
|:---------|:-----------------------|:---------|:------|:------|
|**2024-11-26**|**What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics**|Jordan J. Bird et.al.|[2411.17593v1](http://arxiv.org/abs/2411.17593v1)|null|
|**2024-11-26**|**E-CONDOR: Efficient Contour-Based Detection Of Random Spatial Signals From UAV Observations Using Dual Stochastic Gradient**|Maryam Zahra et.al.|[2411.17449v1](http://arxiv.org/abs/2411.17449v1)|null|
|**2024-11-26**|**Multimodal Outer Arithmetic Block Dual Fusion of Whole Slide Images and Omics Data for Precision Oncology**|Omnia Alwazzan et.al.|[2411.17418v1](http://arxiv.org/abs/2411.17418v1)|null|
|**2024-11-26**|**MFF-FTNet: Multi-scale Feature Fusion across Frequency and Temporal Domains for Time Series Forecasting**|Yangyang Shi et.al.|[2411.17382v1](http://arxiv.org/abs/2411.17382v1)|null|
|**2024-11-26**|**Graphical calculus for quantum vertex operators, II: q-KZB and coordinate Macdonald-Ruijsenaars equations**|Hadewijch De Clercq et.al.|[2411.17306v1](http://arxiv.org/abs/2411.17306v1)|null|
|**2024-11-26**|**in-Car Biometrics (iCarB) Datasets for Driver Recognition: Face, Fingerprint, and Voice**|Vedrana Krivokuca Hahn et.al.|[2411.17305v1](http://arxiv.org/abs/2411.17305v1)|null|
|**2024-11-26**|**Resurfaced CsPbBr3 Nanocrystals Enable Free Radical Thermal Polymerization of Efficient Ultrafast Polyvinyl Styrene Nanocomposite Scintillators**|Francesco Carulli et.al.|[2411.17259v1](http://arxiv.org/abs/2411.17259v1)|null|
|**2024-11-26**|**DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model**|JiHwan Moon et.al.|[2411.17248v1](http://arxiv.org/abs/2411.17248v1)|null|
|**2024-11-26**|**Depth-PC: A Visual Servo Framework Integrated with Cross-Modality Fusion for Sim2Real Transfer**|Haoyu Zhang et.al.|[2411.17195v1](http://arxiv.org/abs/2411.17195v1)|null|
|**2024-11-26**|**Enhancing Lane Segment Perception and Topology Reasoning with Crowdsourcing Trajectory Priors**|Peijin Jia et.al.|[2411.17161v1](http://arxiv.org/abs/2411.17161v1)|null|
|**2024-11-25**|**The JWST Weather Report from the Isolated Exoplanet Analog SIMP 0136+0933: Pressure-Dependent Variability Driven by Multiple Mechanisms**|Allison M. McCarthy et.al.|[2411.16577v1](http://arxiv.org/abs/2411.16577v1)|null|
|**2024-11-25**|**Deformable Mamba for Wide Field of View Segmentation**|Jie Hu et.al.|[2411.16481v1](http://arxiv.org/abs/2411.16481v1)|**[link](https://github.com/jiehu1996/deformablemamba)**|
|**2024-11-25**|**Luminosity predictions for the first three ionisation stages of W, Pt and Au to probe potential sources of emission in kilonova**|M. McCann et.al.|[2411.16476v1](http://arxiv.org/abs/2411.16476v1)|null|
Expand Down
22 changes: 21 additions & 1 deletion docs/wechat.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,17 @@
> Updated on 2024.11.26
> Updated on 2024.11.27

## Video_Classification

- 2024-11-26, **Video-Guided Foley Sound Generation with Multimodal Controls**, Ziyang Chen et.al., Paper: [http://arxiv.org/abs/2411.17698v1](http://arxiv.org/abs/2411.17698v1)
- 2024-11-26, **StableAnimator: High-Quality Identity-Preserving Human Image Animation**, Shuyuan Tu et.al., Paper: [http://arxiv.org/abs/2411.17697v1](http://arxiv.org/abs/2411.17697v1), Code: **[https://github.com/Francis-Rings/StableAnimator](https://github.com/Francis-Rings/StableAnimator)**
- 2024-11-26, **Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis**, Akshita Gupta et.al., Paper: [http://arxiv.org/abs/2411.17690v1](http://arxiv.org/abs/2411.17690v1)
- 2024-11-26, **BERT or FastText? A Comparative Analysis of Contextual as well as Non-Contextual Embeddings**, Abhay Shanbhag et.al., Paper: [http://arxiv.org/abs/2411.17661v1](http://arxiv.org/abs/2411.17661v1)
- 2024-11-26, **DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting**, Christian Homeyer et.al., Paper: [http://arxiv.org/abs/2411.17660v1](http://arxiv.org/abs/2411.17660v1), Code: **[https://github.com/chenhoy/droid-splat](https://github.com/chenhoy/droid-splat)**
- 2024-11-26, **SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation**, Claudia Cuttano et.al., Paper: [http://arxiv.org/abs/2411.17646v1](http://arxiv.org/abs/2411.17646v1), Code: **[https://github.com/claudiacuttano/samwise](https://github.com/claudiacuttano/samwise)**
- 2024-11-26, **A robust image encryption scheme based on new 4-D hyperchaotic system and elliptic curve**, Yehia Lalili et.al., Paper: [http://arxiv.org/abs/2411.17643v1](http://arxiv.org/abs/2411.17643v1)
- 2024-11-26, **On Limitations of LLM as Annotator for Low Resource Languages**, Suramya Jadhav et.al., Paper: [http://arxiv.org/abs/2411.17637v1](http://arxiv.org/abs/2411.17637v1)
- 2024-11-26, **An Ensemble Approach for Brain Tumor Segmentation and Synthesis**, Juampablo E. Heras Rivera et.al., Paper: [http://arxiv.org/abs/2411.17617v1](http://arxiv.org/abs/2411.17617v1)
- 2024-11-26, **Accelerating Vision Diffusion Transformers with Skip Branches**, Guanjie Chen et.al., Paper: [http://arxiv.org/abs/2411.17616v1](http://arxiv.org/abs/2411.17616v1), Code: **[https://github.com/opensparsellms/skip-dit](https://github.com/opensparsellms/skip-dit)**
- 2024-11-25, **Generative Omnimatte: Learning to Decompose Video into Layers**, Yao-Chih Lee et.al., Paper: [http://arxiv.org/abs/2411.16683v1](http://arxiv.org/abs/2411.16683v1)
- 2024-11-25, **Quark: Real-time, High-resolution, and General Neural View Synthesis**, John Flynn et.al., Paper: [http://arxiv.org/abs/2411.16680v1](http://arxiv.org/abs/2411.16680v1)
- 2024-11-25, **A Supervised Machine Learning Approach for Assessing Grant Peer Review Reports**, Gabriel Okasa et.al., Paper: [http://arxiv.org/abs/2411.16662v1](http://arxiv.org/abs/2411.16662v1)
Expand Down Expand Up @@ -6133,6 +6143,16 @@

## MultiModal

- 2024-11-26, **What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics**, Jordan J. Bird et.al., Paper: [http://arxiv.org/abs/2411.17593v1](http://arxiv.org/abs/2411.17593v1)
- 2024-11-26, **E-CONDOR: Efficient Contour-Based Detection Of Random Spatial Signals From UAV Observations Using Dual Stochastic Gradient**, Maryam Zahra et.al., Paper: [http://arxiv.org/abs/2411.17449v1](http://arxiv.org/abs/2411.17449v1)
- 2024-11-26, **Multimodal Outer Arithmetic Block Dual Fusion of Whole Slide Images and Omics Data for Precision Oncology**, Omnia Alwazzan et.al., Paper: [http://arxiv.org/abs/2411.17418v1](http://arxiv.org/abs/2411.17418v1)
- 2024-11-26, **MFF-FTNet: Multi-scale Feature Fusion across Frequency and Temporal Domains for Time Series Forecasting**, Yangyang Shi et.al., Paper: [http://arxiv.org/abs/2411.17382v1](http://arxiv.org/abs/2411.17382v1)
- 2024-11-26, **Graphical calculus for quantum vertex operators, II: q-KZB and coordinate Macdonald-Ruijsenaars equations**, Hadewijch De Clercq et.al., Paper: [http://arxiv.org/abs/2411.17306v1](http://arxiv.org/abs/2411.17306v1)
- 2024-11-26, **in-Car Biometrics (iCarB) Datasets for Driver Recognition: Face, Fingerprint, and Voice**, Vedrana Krivokuca Hahn et.al., Paper: [http://arxiv.org/abs/2411.17305v1](http://arxiv.org/abs/2411.17305v1)
- 2024-11-26, **Resurfaced CsPbBr3 Nanocrystals Enable Free Radical Thermal Polymerization of Efficient Ultrafast Polyvinyl Styrene Nanocomposite Scintillators**, Francesco Carulli et.al., Paper: [http://arxiv.org/abs/2411.17259v1](http://arxiv.org/abs/2411.17259v1)
- 2024-11-26, **DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model**, JiHwan Moon et.al., Paper: [http://arxiv.org/abs/2411.17248v1](http://arxiv.org/abs/2411.17248v1)
- 2024-11-26, **Depth-PC: A Visual Servo Framework Integrated with Cross-Modality Fusion for Sim2Real Transfer**, Haoyu Zhang et.al., Paper: [http://arxiv.org/abs/2411.17195v1](http://arxiv.org/abs/2411.17195v1)
- 2024-11-26, **Enhancing Lane Segment Perception and Topology Reasoning with Crowdsourcing Trajectory Priors**, Peijin Jia et.al., Paper: [http://arxiv.org/abs/2411.17161v1](http://arxiv.org/abs/2411.17161v1)
- 2024-11-25, **The JWST Weather Report from the Isolated Exoplanet Analog SIMP 0136+0933: Pressure-Dependent Variability Driven by Multiple Mechanisms**, Allison M. McCarthy et.al., Paper: [http://arxiv.org/abs/2411.16577v1](http://arxiv.org/abs/2411.16577v1)
- 2024-11-25, **Deformable Mamba for Wide Field of View Segmentation**, Jie Hu et.al., Paper: [http://arxiv.org/abs/2411.16481v1](http://arxiv.org/abs/2411.16481v1), Code: **[https://github.com/jiehu1996/deformablemamba](https://github.com/jiehu1996/deformablemamba)**
- 2024-11-25, **Luminosity predictions for the first three ionisation stages of W, Pt and Au to probe potential sources of emission in kilonova**, M. McCann et.al., Paper: [http://arxiv.org/abs/2411.16476v1](http://arxiv.org/abs/2411.16476v1)
Expand Down

0 comments on commit 8a08bf9

Please sign in to comment.