Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-03-16 | Reward Guided Latent Consistency Distillation | 奖励引导的潜在一致性蒸馏 | Jiachen Li, Weixi Feng, Wenhu Chen, William Yang Wang | http://arxiv.org/pdf/2403.11027v1 | null |
2024-03-16 | OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models | OMG:扩散模型中遮挡友好的个性化多概念生成 | Zhe Kong, Yong Zhang, Tianyu Yang, Tao Wang, Kaihao Zhang, Bizhu Wu, Guanying Chen, Wei Liu, Wenhan Luo | http://arxiv.org/pdf/2403.10983v1 | null |
2024-03-16 | Exploiting Topological Prior for Boosting Point Cloud Generation | 利用拓扑先验促进点云生成 | Baiyuan Chen | http://arxiv.org/pdf/2403.10962v1 | null |
2024-03-16 | Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription | Ctrl123:通过闭环转录实现一致的小说视图合成 | Hongxiang Zhao, Xili Dai, Jianan Wang, Shengbang Tong, Jingyuan Zhang, Weida Wang, Lei Zhang, Yi Ma | http://arxiv.org/pdf/2403.10953v1 | null |
2024-03-16 | Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation | 用于测试时适应的高效扩散驱动的损坏编辑器 | Yeongtak Oh, Jonghyun Lee, Jooyoung Choi, Dahuin Jung, Uiwon Hwang, Sungroh Yoon | http://arxiv.org/pdf/2403.10911v1 | null |
2024-03-16 | Urban Sound Propagation: a Benchmark for 1-Step Generative Modeling of Complex Physical Systems | 城市声音传播:复杂物理系统一步生成建模的基准 | Martin Spitznagel, Janis Keuper | http://arxiv.org/pdf/2403.10904v1 | null |
2024-03-16 | Could We Generate Cytology Images from Histopathology Images? An Empirical Study | 我们可以从组织病理学图像生成细胞学图像吗?实证研究 | Soumyajyoti Dey, Sukanta Chakraborty, Utso Guha Roy, Nibaran Das | http://arxiv.org/pdf/2403.10885v1 | null |
2024-03-16 | MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections | MicroDiffusion:隐式表示引导扩散,用于有限 2D 显微投影的 3D 重建 | Mude Hui, Zihao Wei, Hongru Zhu, Fei Xia, Yuyin Zhou | http://arxiv.org/pdf/2403.10815v1 | null |
2024-03-16 | Speech-driven Personalized Gesture Synthetics: Harnessing Automatic Fuzzy Feature Inference | 语音驱动的个性化手势合成:利用自动模糊特征推理 | Fan Zhang, Zhaohan Wang, Xin Lyu, Siyuan Zhao, Mengjian Li, Weidong Geng, Naye Ji, Hui Du, Fuxing Gao, Hao Wu, et.al. | http://arxiv.org/pdf/2403.10805v1 | null |
2024-03-16 | ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models | ContourDiff:使用轮廓引导扩散模型的不成对图像转换 | Yuwen Chen, Nicholas Konz, Hanxue Gu, Haoyu Dong, Yaqian Chen, Lin Li, Jisoo Lee, Maciej A. Mazurowski | http://arxiv.org/pdf/2403.10786v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-03-16 | Improving Adversarial Transferability of Visual-Language Pre-training Models through Collaborative Multimodal Interaction | 通过协作多模态交互提高视觉语言预训练模型的对抗性可迁移性 | Jiyuan Fu, Zhaoyu Chen, Kaixun Jiang, Haijing Guo, Jiafeng Wang, Shuyong Gao, Wenqiang Zhang | http://arxiv.org/pdf/2403.10883v1 | null |
2024-03-16 | A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment | 图像质量评估多模态大语言模型的综合研究 | Tianhe Wu, Kede Ma, Jie Liang, Yujiu Yang, Lei Zhang | http://arxiv.org/pdf/2403.10854v1 | null |
2024-03-16 | Affective Behaviour Analysis via Integrating Multi-Modal Knowledge | 整合多模态知识进行情感行为分析 | Wei Zhang, Feng Qiu, Chen Liu, Lincheng Li, Heming Du, Tiancheng Guo, Xin Yu | http://arxiv.org/pdf/2403.10825v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-03-16 | Fast Sparse View Guided NeRF Update for Object Reconfigurations | 用于对象重新配置的快速稀疏视图引导 NeRF 更新 | Ziqi Lu, Jianbo Ye, Xiaohan Fei, Xiaolong Li, Jiawei Mo, Ashwin Swaminathan, Stefano Soatto | http://arxiv.org/pdf/2403.11024v1 | null |
2024-03-16 | HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering | HourglassNeRF:将沙漏投射为一束光线以进行少镜头神经渲染 | Seunghyeon Seo, Yeonjin Chang, Jayeon Yoo, Seungwoo Lee, Hojun Lee, Nojun Kwak | http://arxiv.org/pdf/2403.10906v1 | null |
2024-03-16 | MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field | MSI-NeRF:通过多球图像辅助广义神经辐射场将全深度与视图合成联系起来 | Dongyu Yan, Guanyu Huang, Fengyu Quan, Haoyao Chen | http://arxiv.org/pdf/2403.10840v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-03-16 | N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields | N2F2:具有嵌套神经特征字段的分层场景理解 | Yash Bhalgat, Iro Laina, João F. Henriques, Andrew Zisserman, Andrea Vedaldi | http://arxiv.org/pdf/2403.10997v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-03-16 | Texture Edge detection by Patch consensus (TEP) | 通过补丁一致性(TEP)进行纹理边缘检测 | Guangyu Cui, Sung Ha Kang | http://arxiv.org/pdf/2403.11038v1 | null |
2024-03-16 | FH-TabNet: Multi-Class Familial Hypercholesterolemia Detection via a Multi-Stage Tabular Deep Learning | FH-TabNet:通过多阶段表格深度学习进行多类家族性高胆固醇血症检测 | Sadaf Khademi, Zohreh Hajiakhondi, Golnaz Vaseghi, Nizal Sarrafzadegan, Arash Mohammadi | http://arxiv.org/pdf/2403.11032v1 | null |
2024-03-16 | MASSM: An End-to-End Deep Learning Framework for Multi-Anatomy Statistical Shape Modeling Directly From Images | MASSM:直接从图像进行多解剖统计形状建模的端到端深度学习框架 | Janmesh Ukey, Tushar Kataria, Shireen Y. Elhabian | http://arxiv.org/pdf/2403.11008v1 | null |
2024-03-16 | Topologically faithful multi-class segmentation in medical images | 医学图像中拓扑忠实的多类分割 | Alexander H. Berger, Nico Stucki, Laurin Lux, Vincent Buergin, Suprosanna Shit, Anna Banaszak, Daniel Rueckert, Ulrich Bauer, Johannes C. Paetzold | http://arxiv.org/pdf/2403.11001v1 | null |
2024-03-16 | Automatic Spatial Calibration of Near-Field MIMO Radar With Respect to Optical Sensors | 近场 MIMO 雷达相对于光学传感器的自动空间校准 | Vanessa Wirth, Johanna Bräunig, Danti Khouri, Florian Gutsche, Martin Vossiek, Tim Weyrich, Marc Stamminger | http://arxiv.org/pdf/2403.10981v1 | null |
2024-03-16 | Task-Aware Low-Rank Adaptation of Segment Anything Model | 分段任意模型的任务感知低阶自适应 | Xuehao Wang, Feiyang Ye, Yu Zhang | http://arxiv.org/pdf/2403.10971v1 | null |
2024-03-16 | Understanding Robustness of Visual State Space Models for Image Classification | 了解图像分类视觉状态空间模型的鲁棒性 | Chengbin Du, Yanxi Li, Chang Xu | http://arxiv.org/pdf/2403.10935v1 | null |
2024-03-16 | Uncertainty-Aware Adapter: Adapting Segment Anything Model (SAM) for Ambiguous Medical Image Segmentation | 不确定性感知适配器:采用分段任意模型 (SAM) 进行模糊医学图像分割 | Mingzhou Jiang, Jiaying Zhou, Junde Wu, Tianyang Wang, Yueming Jin, Min Xu | http://arxiv.org/pdf/2403.10931v1 | null |
2024-03-16 | FishNet: Deep Neural Networks for Low-Cost Fish Stock Estimation | FishNet:用于低成本鱼类种群估计的深度神经网络 | Moseli Mots'oehli, Anton Nikolaev, Wawan B. IGede, John Lynham, Peter J. Mous, Peter Sadowski | http://arxiv.org/pdf/2403.10916v1 | null |
2024-03-16 | Automatic location detection based on deep learning | 基于深度学习的自动位置检测 | Anjali Karangiya, Anirudh Sharma, Divax Shah, Kartavya Badgujar, Dr. Chintan Thacker, Dainik Dave | http://arxiv.org/pdf/2403.10912v1 | null |
2024-03-16 | LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival | LuoJiaHOG:用于遥感图像文本检索的面向层次结构的地理感知图像描述数据集 | Yuanxin Zhao, Mi Zhang, Bingnan Yang, Zhan Zhang, Jiaju Kang, Jianya Gong | http://arxiv.org/pdf/2403.10887v1 | null |
2024-03-16 | Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation | 基于模糊排序的细胞学图像分割后期融合技术 | Soumyajyoti Dey, Sukanta Chakraborty, Utso Guha Roy, Nibaran Das | http://arxiv.org/pdf/2403.10884v1 | null |
2024-03-16 | COVID-CT-H-UNet: a novel COVID-19 CT segmentation network based on attention mechanism and Bi-category Hybrid loss | COVID-CT-H-UNet:一种基于注意力机制和双类别混合损失的新型 COVID-19 CT 分割网络 | Anay Panja, Somenath Kuiry, Alaka Das, Mita Nasipuri, Nibaran Das | http://arxiv.org/pdf/2403.10880v1 | null |
2024-03-16 | RetMIL: Retentive Multiple Instance Learning for Histopathological Whole Slide Image Classification | RetMIL:用于组织病理学全幻灯片图像分类的保留性多实例学习 | Hongbo Chu, Qiehe Sun, Jiawen Li, Yuxuan Chen, Lizhong Zhang, Tian Guan, Anjia Han, Yonghong He | http://arxiv.org/pdf/2403.10858v1 | null |
2024-03-16 | View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV | 移动无人机中以视图为中心的多目标跟踪与单应匹配 | Deyi Ji, Siqi Gao, Lanyun Zhu, Yiru Zhao, Peng Xu, Hongtao Lu, Feng Zhao | http://arxiv.org/pdf/2403.10830v1 | null |
2024-03-16 | Exploring Learning-based Motion Models in Multi-Object Tracking | 探索多目标跟踪中基于学习的运动模型 | Hsiang-Wei Huang, Cheng-Yen Yang, Wenhao Chai, Zhongyu Jiang, Jenq-Neng Hwang | http://arxiv.org/pdf/2403.10826v1 | null |
2024-03-16 | Active Label Correction for Semantic Segmentation with Foundation Models | 使用基础模型进行语义分割的主动标签校正 | Hoyoung Kim, Sehyun Hwang, Suha Kwak, Jungseul Ok | http://arxiv.org/pdf/2403.10820v1 | null |
2024-03-16 | Enhancing Out-of-Distribution Detection with Multitesting-based Layer-wise Feature Fusion | 通过基于多重测试的分层特征融合增强分布外检测 | Jiawei Li, Sitong Li, Shanshan Wang, Yicheng Zeng, Falong Tan, Chuanlong Xie | http://arxiv.org/pdf/2403.10803v1 | null |
2024-03-16 | Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object Retrieval | 用于一般对象检索的混合规模组的无监督协作度量学习 | Shichao Kan, Yuhai Deng, Yixiong Liang, Lihui Cen, Zhe Qu, Yigang Cen, Zhihai He | http://arxiv.org/pdf/2403.10798v1 | null |
2024-03-16 | Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation | 分割任意对象模型(SAOM):多类多实例分割的实仿真微调策略 | Mariia Khan, Yue Qiu, Yuren Cong, Jumana Abu-Khalaf, David Suter, Bodo Rosenhahn | http://arxiv.org/pdf/2403.10780v1 | null |
2024-03-16 | HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection | HCF-Net:用于红外小物体检测的分层上下文融合网络 | Shibiao Xu, ShuChen Zheng, Wenhao Xu, Rongtao Xu, Changwei Wang, Jiguang Zhang, Xiaoqiang Teng, Ao Li, Li Guo | http://arxiv.org/pdf/2403.10778v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-03-16 | Rethinking Multi-view Representation Learning via Distilled Disentangling | 通过蒸馏解开重新思考多视图表示学习 | Guanzhou Ke, Bo Wang, Xiaoli Wang, Shengfeng He | http://arxiv.org/pdf/2403.10897v1 | null |
2024-03-16 | Efficient Domain Adaptation for Endoscopic Visual Odometry | 内窥镜视觉里程计的有效域适应 | Junyang Wu, Yun Gu, Guang-Zhong Yang | http://arxiv.org/pdf/2403.10860v1 | null |
2024-03-16 | DarkGS: Learning Neural Illumination and 3D Gaussians Relighting for Robotic Exploration in the Dark | DarkGS:学习神经照明和 3D 高斯重新照明以实现黑暗中的机器人探索 | Tianyi Zhang, Kaining Huang, Weiming Zhi, Matthew Johnson-Roberson | http://arxiv.org/pdf/2403.10814v1 | null |
2024-03-16 | Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples | 针对对抗性示例安全地微调预训练编码器 | Ziqi Zhou, Minghui Li, Wei Liu, Shengshan Hu, Yechao Zhang, Wei Wan, Lulu Xue, Leo Yu Zhang, Dezhong Yang, Hai Jin | http://arxiv.org/pdf/2403.10801v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-03-16 | EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration | EfficientMorph:用于 3D 图像配准的基于参数高效 Transformer 的架构 | Abu Zahid Bin Aziz, Mokshagna Sai Teja Karanam, Tushar Kataria, Shireen Y. Elhabian | http://arxiv.org/pdf/2403.11026v1 | null |
2024-03-16 | StableGarment: Garment-Centric Generation via Stable Diffusion | StableGarment:通过稳定扩散以服装为中心的生成 | Rui Wang, Hailong Guo, Jiaming Liu, Huaxia Li, Haibo Zhao, Xu Tang, Yao Hu, Hao Tang, Peipei Li | http://arxiv.org/pdf/2403.10783v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-03-16 | Multiplane Quantitative Phase Imaging Using a Wavelength-Multiplexed Diffractive Optical Processor | 使用波长复用衍射光学处理器的多平面定量相位成像 | Che-Yung Shen, Jingxi Li, Tianyi Gan, Yuhang Li, Langxing Bai, Mona Jarrahi, Aydogan Ozcan | http://arxiv.org/pdf/2403.11035v1 | null |
2024-03-16 | ScanTalk: 3D Talking Heads from Unregistered Scans | ScanTalk:来自未配准扫描的 3D 会说话的头像 | Federico Nocentini, Thomas Besnier, Claudio Ferrari, Sylvain Arguillere, Stefano Berretti, Mohamed Daoudi | http://arxiv.org/pdf/2403.10942v1 | null |
2024-03-16 | SF(DA)$^2$: Source-free Domain Adaptation Through the Lens of Data Augmentation | SF(DA)$^2$:通过数据增强的视角进行无源域适应 | Uiwon Hwang, Jonghyun Lee, Juhyeon Shin, Sungroh Yoon | http://arxiv.org/pdf/2403.10834v1 | null |
2024-03-16 | DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D Imputation | DUE:通过 3D 插补进行动态不确定性感知解释监督 | Qilong Zhao, Yifei Zhang, Mengdan Zhu, Siyi Gu, Yuyang Gao, Xiaofeng Yang, Liang Zhao | http://arxiv.org/pdf/2403.10831v1 | null |
2024-03-16 | DPPE: Dense Pose Estimation in a Plenoxels Environment using Gradient Approximation | DPPE:使用梯度近似的 Plenoxels 环境中的密集姿态估计 | Christopher Kolios, Yeganeh Bahoo, Sajad Saeedi | http://arxiv.org/pdf/2403.10773v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-03-16 | Just Say the Name: Online Continual Learning with Category Names Only via Data Generation | 只需说出名称:仅通过数据生成使用类别名称进行在线持续学习 | Minhyuk Seo, Diganta Misra, Seongwon Cho, Minjae Lee, Jonghyun Choi | http://arxiv.org/pdf/2403.10853v1 | null |
2024-03-16 | VisionCLIP: An Med-AIGC based Ethical Language-Image Foundation Model for Generalizable Retina Image Analysis | VisionCLIP:基于 Med-AIGC 的伦理语言图像基础模型,用于广义视网膜图像分析 | Hao Wei, Bowen Liu, Minqing Zhang, Peilun Shi, Wu Yuan | http://arxiv.org/pdf/2403.10823v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-03-16 | Neuro-Symbolic Video Search | 神经符号视频搜索 | Minkyu Choi, Harsh Goel, Mohammad Omama, Yunhao Yang, Sahil Shah, Sandeep Chinchali | http://arxiv.org/pdf/2403.11021v1 | null |
2024-03-16 | Boosting Flow-based Generative Super-Resolution Models via Learned Prior | 通过学习先验增强基于流的生成超分辨率模型 | Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang, Hao-Wei Chen, Roy Tseng, Chien Feng, Chun-Yi Lee | http://arxiv.org/pdf/2403.10988v1 | null |
2024-03-16 | Channel-wise Feature Decorrelation for Enhanced Learned Image Compression | 用于增强学习图像压缩的通道特征解相关 | Farhad Pakdaman, Moncef Gabbouj | http://arxiv.org/pdf/2403.10936v1 | null |
2024-03-16 | Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution | 学习现实世界尺度任意超分辨率的双层可变形隐式表示 | Zhiheng Li, Muheng Li, Jixuan Fan, Lei Chen, Yansong Tang, Jie Zhou, Jiwen Lu | http://arxiv.org/pdf/2403.10925v1 | null |
2024-03-16 | Regularizing CNNs using Confusion Penalty Based Label Smoothing for Histopathology Images | 使用基于混淆惩罚的标签平滑对组织病理学图像进行正则化 CNN | Somenath Kuiry, Alaka Das, Mita Nasipuri, Nibaran Das | http://arxiv.org/pdf/2403.10881v1 | null |
2024-03-16 | Bidirectional Multi-Step Domain Generalization for Visible-Infrared Person Re-Identification | 可见光-红外行人重识别的双向多步域泛化 | Mahdi Alehdaghi, Pourya Shamsolmoali, Rafael M. O. Cruz, Eric Granger | http://arxiv.org/pdf/2403.10782v1 | null |
2024-03-16 | Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching | 匹配立体视频:双向对齐以实现一致的动态立体匹配 | Junpeng Jing, Ye Mao, Krystian Mikolajczyk | http://arxiv.org/pdf/2403.10755v1 | null |
2024-03-16 | Vector search with small radiuses | 小半径矢量搜索 | Gergely Szilvasy, Pierre-Emmanuel Mazaré, Matthijs Douze | http://arxiv.org/pdf/2403.10746v1 | null |