Skip to content

Latest commit

 

History

History
executable file
·
111 lines (90 loc) · 17.1 KB

2024-03-16.md

File metadata and controls

executable file
·
111 lines (90 loc) · 17.1 KB

[UPDATED!] 2024-03-16 (Publish Time)

生成模型

Publish Date Title Title_CN Authors PDF Code
2024-03-16 Reward Guided Latent Consistency Distillation 奖励引导的潜在一致性蒸馏 Jiachen Li, Weixi Feng, Wenhu Chen, William Yang Wang http://arxiv.org/pdf/2403.11027v1 null
2024-03-16 OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models OMG:扩散模型中遮挡友好的个性化多概念生成 Zhe Kong, Yong Zhang, Tianyu Yang, Tao Wang, Kaihao Zhang, Bizhu Wu, Guanying Chen, Wei Liu, Wenhan Luo http://arxiv.org/pdf/2403.10983v1 null
2024-03-16 Exploiting Topological Prior for Boosting Point Cloud Generation 利用拓扑先验促进点云生成 Baiyuan Chen http://arxiv.org/pdf/2403.10962v1 null
2024-03-16 Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription Ctrl123:通过闭环转录实现一致的小说视图合成 Hongxiang Zhao, Xili Dai, Jianan Wang, Shengbang Tong, Jingyuan Zhang, Weida Wang, Lei Zhang, Yi Ma http://arxiv.org/pdf/2403.10953v1 null
2024-03-16 Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation 用于测试时适应的高效扩散驱动的损坏编辑器 Yeongtak Oh, Jonghyun Lee, Jooyoung Choi, Dahuin Jung, Uiwon Hwang, Sungroh Yoon http://arxiv.org/pdf/2403.10911v1 null
2024-03-16 Urban Sound Propagation: a Benchmark for 1-Step Generative Modeling of Complex Physical Systems 城市声音传播:复杂物理系统一步生成建模的基准 Martin Spitznagel, Janis Keuper http://arxiv.org/pdf/2403.10904v1 null
2024-03-16 Could We Generate Cytology Images from Histopathology Images? An Empirical Study 我们可以从组织病理学图像生成细胞学图像吗?实证研究 Soumyajyoti Dey, Sukanta Chakraborty, Utso Guha Roy, Nibaran Das http://arxiv.org/pdf/2403.10885v1 null
2024-03-16 MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections MicroDiffusion:隐式表示引导扩散,用于有限 2D 显微投影的 3D 重建 Mude Hui, Zihao Wei, Hongru Zhu, Fei Xia, Yuyin Zhou http://arxiv.org/pdf/2403.10815v1 null
2024-03-16 Speech-driven Personalized Gesture Synthetics: Harnessing Automatic Fuzzy Feature Inference 语音驱动的个性化手势合成:利用自动模糊特征推理 Fan Zhang, Zhaohan Wang, Xin Lyu, Siyuan Zhao, Mengjian Li, Weidong Geng, Naye Ji, Hui Du, Fuxing Gao, Hao Wu, et.al. http://arxiv.org/pdf/2403.10805v1 null
2024-03-16 ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models ContourDiff:使用轮廓引导扩散模型的不成对图像转换 Yuwen Chen, Nicholas Konz, Hanxue Gu, Haoyu Dong, Yaqian Chen, Lin Li, Jisoo Lee, Maciej A. Mazurowski http://arxiv.org/pdf/2403.10786v1 null

多模态

Publish Date Title Title_CN Authors PDF Code
2024-03-16 Improving Adversarial Transferability of Visual-Language Pre-training Models through Collaborative Multimodal Interaction 通过协作多模态交互提高视觉语言预训练模型的对抗性可迁移性 Jiyuan Fu, Zhaoyu Chen, Kaixun Jiang, Haijing Guo, Jiafeng Wang, Shuyong Gao, Wenqiang Zhang http://arxiv.org/pdf/2403.10883v1 null
2024-03-16 A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment 图像质量评估多模态大语言模型的综合研究 Tianhe Wu, Kede Ma, Jie Liang, Yujiu Yang, Lei Zhang http://arxiv.org/pdf/2403.10854v1 null
2024-03-16 Affective Behaviour Analysis via Integrating Multi-Modal Knowledge 整合多模态知识进行情感行为分析 Wei Zhang, Feng Qiu, Chen Liu, Lincheng Li, Heming Du, Tiancheng Guo, Xin Yu http://arxiv.org/pdf/2403.10825v1 null

Nerf

Publish Date Title Title_CN Authors PDF Code
2024-03-16 Fast Sparse View Guided NeRF Update for Object Reconfigurations 用于对象重新配置的快速稀疏视图引导 NeRF 更新 Ziqi Lu, Jianbo Ye, Xiaohan Fei, Xiaolong Li, Jiawei Mo, Ashwin Swaminathan, Stefano Soatto http://arxiv.org/pdf/2403.11024v1 null
2024-03-16 HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering HourglassNeRF:将沙漏投射为一束光线以进行少镜头神经渲染 Seunghyeon Seo, Yeonjin Chang, Jayeon Yoo, Seungwoo Lee, Hojun Lee, Nojun Kwak http://arxiv.org/pdf/2403.10906v1 null
2024-03-16 MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field MSI-NeRF:通过多球图像辅助广义神经辐射场将全深度与视图合成联系起来 Dongyu Yan, Guanyu Huang, Fengyu Quan, Haoyao Chen http://arxiv.org/pdf/2403.10840v1 null

模型压缩/优化

Publish Date Title Title_CN Authors PDF Code
2024-03-16 N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields N2F2:具有嵌套神经特征字段的分层场景理解 Yash Bhalgat, Iro Laina, João F. Henriques, Andrew Zisserman, Andrea Vedaldi http://arxiv.org/pdf/2403.10997v1 null

分类/检测/识别/分割/...

Publish Date Title Title_CN Authors PDF Code
2024-03-16 Texture Edge detection by Patch consensus (TEP) 通过补丁一致性(TEP)进行纹理边缘检测 Guangyu Cui, Sung Ha Kang http://arxiv.org/pdf/2403.11038v1 null
2024-03-16 FH-TabNet: Multi-Class Familial Hypercholesterolemia Detection via a Multi-Stage Tabular Deep Learning FH-TabNet:通过多阶段表格深度学习进行多类家族性高胆固醇血症检测 Sadaf Khademi, Zohreh Hajiakhondi, Golnaz Vaseghi, Nizal Sarrafzadegan, Arash Mohammadi http://arxiv.org/pdf/2403.11032v1 null
2024-03-16 MASSM: An End-to-End Deep Learning Framework for Multi-Anatomy Statistical Shape Modeling Directly From Images MASSM:直接从图像进行多解剖统计形状建模的端到端深度学习框架 Janmesh Ukey, Tushar Kataria, Shireen Y. Elhabian http://arxiv.org/pdf/2403.11008v1 null
2024-03-16 Topologically faithful multi-class segmentation in medical images 医学图像中拓扑忠实的多类分割 Alexander H. Berger, Nico Stucki, Laurin Lux, Vincent Buergin, Suprosanna Shit, Anna Banaszak, Daniel Rueckert, Ulrich Bauer, Johannes C. Paetzold http://arxiv.org/pdf/2403.11001v1 null
2024-03-16 Automatic Spatial Calibration of Near-Field MIMO Radar With Respect to Optical Sensors 近场 MIMO 雷达相对于光学传感器的自动空间校准 Vanessa Wirth, Johanna Bräunig, Danti Khouri, Florian Gutsche, Martin Vossiek, Tim Weyrich, Marc Stamminger http://arxiv.org/pdf/2403.10981v1 null
2024-03-16 Task-Aware Low-Rank Adaptation of Segment Anything Model 分段任意模型的任务感知低阶自适应 Xuehao Wang, Feiyang Ye, Yu Zhang http://arxiv.org/pdf/2403.10971v1 null
2024-03-16 Understanding Robustness of Visual State Space Models for Image Classification 了解图像分类视觉状态空间模型的鲁棒性 Chengbin Du, Yanxi Li, Chang Xu http://arxiv.org/pdf/2403.10935v1 null
2024-03-16 Uncertainty-Aware Adapter: Adapting Segment Anything Model (SAM) for Ambiguous Medical Image Segmentation 不确定性感知适配器:采用分段任意模型 (SAM) 进行模糊医学图像分割 Mingzhou Jiang, Jiaying Zhou, Junde Wu, Tianyang Wang, Yueming Jin, Min Xu http://arxiv.org/pdf/2403.10931v1 null
2024-03-16 FishNet: Deep Neural Networks for Low-Cost Fish Stock Estimation FishNet:用于低成本鱼类种群估计的深度神经网络 Moseli Mots'oehli, Anton Nikolaev, Wawan B. IGede, John Lynham, Peter J. Mous, Peter Sadowski http://arxiv.org/pdf/2403.10916v1 null
2024-03-16 Automatic location detection based on deep learning 基于深度学习的自动位置检测 Anjali Karangiya, Anirudh Sharma, Divax Shah, Kartavya Badgujar, Dr. Chintan Thacker, Dainik Dave http://arxiv.org/pdf/2403.10912v1 null
2024-03-16 LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival LuoJiaHOG:用于遥感图像文本检索的面向层次结构的地理感知图像描述数据集 Yuanxin Zhao, Mi Zhang, Bingnan Yang, Zhan Zhang, Jiaju Kang, Jianya Gong http://arxiv.org/pdf/2403.10887v1 null
2024-03-16 Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation 基于模糊排序的细胞学图像分割后期融合技术 Soumyajyoti Dey, Sukanta Chakraborty, Utso Guha Roy, Nibaran Das http://arxiv.org/pdf/2403.10884v1 null
2024-03-16 COVID-CT-H-UNet: a novel COVID-19 CT segmentation network based on attention mechanism and Bi-category Hybrid loss COVID-CT-H-UNet:一种基于注意力机制和双类别混合损失的新型 COVID-19 CT 分割网络 Anay Panja, Somenath Kuiry, Alaka Das, Mita Nasipuri, Nibaran Das http://arxiv.org/pdf/2403.10880v1 null
2024-03-16 RetMIL: Retentive Multiple Instance Learning for Histopathological Whole Slide Image Classification RetMIL:用于组织病理学全幻灯片图像分类的保留性多实例学习 Hongbo Chu, Qiehe Sun, Jiawen Li, Yuxuan Chen, Lizhong Zhang, Tian Guan, Anjia Han, Yonghong He http://arxiv.org/pdf/2403.10858v1 null
2024-03-16 View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV 移动无人机中以视图为中心的多目标跟踪与单应匹配 Deyi Ji, Siqi Gao, Lanyun Zhu, Yiru Zhao, Peng Xu, Hongtao Lu, Feng Zhao http://arxiv.org/pdf/2403.10830v1 null
2024-03-16 Exploring Learning-based Motion Models in Multi-Object Tracking 探索多目标跟踪中基于学习的运动模型 Hsiang-Wei Huang, Cheng-Yen Yang, Wenhao Chai, Zhongyu Jiang, Jenq-Neng Hwang http://arxiv.org/pdf/2403.10826v1 null
2024-03-16 Active Label Correction for Semantic Segmentation with Foundation Models 使用基础模型进行语义分割的主动标签校正 Hoyoung Kim, Sehyun Hwang, Suha Kwak, Jungseul Ok http://arxiv.org/pdf/2403.10820v1 null
2024-03-16 Enhancing Out-of-Distribution Detection with Multitesting-based Layer-wise Feature Fusion 通过基于多重测试的分层特征融合增强分布外检测 Jiawei Li, Sitong Li, Shanshan Wang, Yicheng Zeng, Falong Tan, Chuanlong Xie http://arxiv.org/pdf/2403.10803v1 null
2024-03-16 Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object Retrieval 用于一般对象检索的混合规模组的无监督协作度量学习 Shichao Kan, Yuhai Deng, Yixiong Liang, Lihui Cen, Zhe Qu, Yigang Cen, Zhihai He http://arxiv.org/pdf/2403.10798v1 null
2024-03-16 Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation 分割任意对象模型(SAOM):多类多实例分割的实仿真微调策略 Mariia Khan, Yue Qiu, Yuren Cong, Jumana Abu-Khalaf, David Suter, Bodo Rosenhahn http://arxiv.org/pdf/2403.10780v1 null
2024-03-16 HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection HCF-Net:用于红外小物体检测的分层上下文融合网络 Shibiao Xu, ShuChen Zheng, Wenhao Xu, Rongtao Xu, Changwei Wang, Jiguang Zhang, Xiaoqiang Teng, Ao Li, Li Guo http://arxiv.org/pdf/2403.10778v1 null

图像理解

Publish Date Title Title_CN Authors PDF Code
2024-03-16 Rethinking Multi-view Representation Learning via Distilled Disentangling 通过蒸馏解开重新思考多视图表示学习 Guanzhou Ke, Bo Wang, Xiaoli Wang, Shengfeng He http://arxiv.org/pdf/2403.10897v1 null
2024-03-16 Efficient Domain Adaptation for Endoscopic Visual Odometry 内窥镜视觉里程计的有效域适应 Junyang Wu, Yun Gu, Guang-Zhong Yang http://arxiv.org/pdf/2403.10860v1 null
2024-03-16 DarkGS: Learning Neural Illumination and 3D Gaussians Relighting for Robotic Exploration in the Dark DarkGS:学习神经照明和 3D 高斯重新照明以实现黑暗中的机器人探索 Tianyi Zhang, Kaining Huang, Weiming Zhi, Matthew Johnson-Roberson http://arxiv.org/pdf/2403.10814v1 null
2024-03-16 Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples 针对对抗性示例安全地微调预训练编码器 Ziqi Zhou, Minghui Li, Wei Liu, Shengshan Hu, Yechao Zhang, Wei Wan, Lulu Xue, Leo Yu Zhang, Dezhong Yang, Hai Jin http://arxiv.org/pdf/2403.10801v1 null

Transformer

Publish Date Title Title_CN Authors PDF Code
2024-03-16 EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration EfficientMorph:用于 3D 图像配准的基于参数高效 Transformer 的架构 Abu Zahid Bin Aziz, Mokshagna Sai Teja Karanam, Tushar Kataria, Shireen Y. Elhabian http://arxiv.org/pdf/2403.11026v1 null
2024-03-16 StableGarment: Garment-Centric Generation via Stable Diffusion StableGarment:通过稳定扩散以服装为中心的生成 Rui Wang, Hailong Guo, Jiaming Liu, Huaxia Li, Haibo Zhao, Xu Tang, Yao Hu, Hao Tang, Peipei Li http://arxiv.org/pdf/2403.10783v1 null

3D/CG

Publish Date Title Title_CN Authors PDF Code
2024-03-16 Multiplane Quantitative Phase Imaging Using a Wavelength-Multiplexed Diffractive Optical Processor 使用波长复用衍射光学处理器的多平面定量相位成像 Che-Yung Shen, Jingxi Li, Tianyi Gan, Yuhang Li, Langxing Bai, Mona Jarrahi, Aydogan Ozcan http://arxiv.org/pdf/2403.11035v1 null
2024-03-16 ScanTalk: 3D Talking Heads from Unregistered Scans ScanTalk:来自未配准扫描的 3D 会说话的头像 Federico Nocentini, Thomas Besnier, Claudio Ferrari, Sylvain Arguillere, Stefano Berretti, Mohamed Daoudi http://arxiv.org/pdf/2403.10942v1 null
2024-03-16 SF(DA)$^2$: Source-free Domain Adaptation Through the Lens of Data Augmentation SF(DA)$^2$:通过数据增强的视角进行无源域适应 Uiwon Hwang, Jonghyun Lee, Juhyeon Shin, Sungroh Yoon http://arxiv.org/pdf/2403.10834v1 null
2024-03-16 DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D Imputation DUE:通过 3D 插补进行动态不确定性感知解释监督 Qilong Zhao, Yifei Zhang, Mengdan Zhu, Siyi Gu, Yuyang Gao, Xiaofeng Yang, Liang Zhao http://arxiv.org/pdf/2403.10831v1 null
2024-03-16 DPPE: Dense Pose Estimation in a Plenoxels Environment using Gradient Approximation DPPE:使用梯​​度近似的 Plenoxels 环境中的密集姿态估计 Christopher Kolios, Yeganeh Bahoo, Sajad Saeedi http://arxiv.org/pdf/2403.10773v1 null

各类学习方式

Publish Date Title Title_CN Authors PDF Code
2024-03-16 Just Say the Name: Online Continual Learning with Category Names Only via Data Generation 只需说出名称:仅通过数据生成使用类别名称进行在线持续学习 Minhyuk Seo, Diganta Misra, Seongwon Cho, Minjae Lee, Jonghyun Choi http://arxiv.org/pdf/2403.10853v1 null
2024-03-16 VisionCLIP: An Med-AIGC based Ethical Language-Image Foundation Model for Generalizable Retina Image Analysis VisionCLIP:基于 Med-AIGC 的伦理语言图像基础模型,用于广义视网膜图像分析 Hao Wei, Bowen Liu, Minqing Zhang, Peilun Shi, Wu Yuan http://arxiv.org/pdf/2403.10823v1 null

其他

Publish Date Title Title_CN Authors PDF Code
2024-03-16 Neuro-Symbolic Video Search 神经符号视频搜索 Minkyu Choi, Harsh Goel, Mohammad Omama, Yunhao Yang, Sahil Shah, Sandeep Chinchali http://arxiv.org/pdf/2403.11021v1 null
2024-03-16 Boosting Flow-based Generative Super-Resolution Models via Learned Prior 通过学习先验增强基于流的生成超分辨率模型 Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang, Hao-Wei Chen, Roy Tseng, Chien Feng, Chun-Yi Lee http://arxiv.org/pdf/2403.10988v1 null
2024-03-16 Channel-wise Feature Decorrelation for Enhanced Learned Image Compression 用于增强学习图像压缩的通道特征解相关 Farhad Pakdaman, Moncef Gabbouj http://arxiv.org/pdf/2403.10936v1 null
2024-03-16 Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution 学习现实世界尺度任意超分辨率的双层可变形隐式表示 Zhiheng Li, Muheng Li, Jixuan Fan, Lei Chen, Yansong Tang, Jie Zhou, Jiwen Lu http://arxiv.org/pdf/2403.10925v1 null
2024-03-16 Regularizing CNNs using Confusion Penalty Based Label Smoothing for Histopathology Images 使用基于混淆惩罚的标签平滑对组织病理学图像进行正则化 CNN Somenath Kuiry, Alaka Das, Mita Nasipuri, Nibaran Das http://arxiv.org/pdf/2403.10881v1 null
2024-03-16 Bidirectional Multi-Step Domain Generalization for Visible-Infrared Person Re-Identification 可见光-红外行人重识别的双向多步域泛化 Mahdi Alehdaghi, Pourya Shamsolmoali, Rafael M. O. Cruz, Eric Granger http://arxiv.org/pdf/2403.10782v1 null
2024-03-16 Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching 匹配立体视频:双向对齐以实现一致的动态立体匹配 Junpeng Jing, Ye Mao, Krystian Mikolajczyk http://arxiv.org/pdf/2403.10755v1 null
2024-03-16 Vector search with small radiuses 小半径矢量搜索 Gergely Szilvasy, Pierre-Emmanuel Mazaré, Matthijs Douze http://arxiv.org/pdf/2403.10746v1 null