2024-01-06.md
!UPDATED -- 2024-01-06

Learning Paradigms

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-06 | Exploiting Data Hierarchy as a New Modality for Contrastive Learning | 利用数据层次结构作为对比学习的新模式 | Arjun Bhalla, Daniel Levenson, Jan Bernhard, Anton Abilov | http://arxiv.org/pdf/2401.03312v1 | null |
| 2024-01-06 | Large Language Models as Visual Cross-Domain Learners | 作为视觉跨领域学习者的大型语言模型 | Shuhao Chen, Yulong Zhang, Weisen Jiang, Jiangang Lu, Yu Zhang | http://arxiv.org/pdf/2401.03253v1 | null |
| 2024-01-06 | MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond | MirrorDiffusion:通过提示重新描述及其他方式稳定零样本图像翻译中的扩散过程 | Yupei Lin, Xiaoyu Xian, Yukai Shi, Liang Lin | http://arxiv.org/pdf/2401.03221v1 | null |
| 2024-01-06 | DistFormer: Enhancing Local and Global Features for Monocular Per-Object Distance Estimation | DistFormer:增强单目每个对象距离估计的局部和全局特征 | Aniello Panariello, Gianluca Mancusi, Fedy Haj Ali, Angelo Porrello, Simone Calderara, Rita Cucchiara | http://arxiv.org/pdf/2401.03191v1 | null |
| 2024-01-06 | Preserving Silent Features for Domain Generalization | 保留静默特征以进行领域泛化 | Chujie Zhao, Tianren Zhang, Feng Chen | http://arxiv.org/pdf/2401.03170v1 | null |
| 2024-01-06 | Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection | 用于 3D 工业异常检测的自监督特征适应 | Yuanpeng Tu, Boshen Zhang, Liang Liu, Yuxi Li, Chenhai Xu, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cai Rong Zhao | http://arxiv.org/pdf/2401.03145v1 | null |

Classification/Detection/Recognition/Segmentation

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-06 | Spatiotemporally adaptive compression for scientific dataset with feature preservation -- a case study on simulation data with extreme climate events analysis | 具有特征保留的科学数据集时空自适应压缩:极端气候事件分析模拟数据案例研究 | Qian Gong, Chengzhu Zhang, Xin Liang, Viktor Reshniak, Jieyang Chen, Anand Rangarajan, Sanjay Ranka, Nicolas Vidal, Lipeng Wan, Paul Ullrich, et al. | http://arxiv.org/pdf/2401.03317v1 | null |
| 2024-01-06 | Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT | 现实主义行动:使用 YOLOv8 和 DeiT 根据医学图像对脑肿瘤进行异常感知诊断 | Seyed Mohammad Hossein Hashemi, Leila Safari, Amirhossein Dadashzade Taromi | http://arxiv.org/pdf/2401.03302v1 | null |
| 2024-01-06 | Multi-View 3D Instance Segmentation of Structural Anomalies for Enhanced Structural Inspection of Concrete Bridges | 结构异常的多视图 3D 实例分割,用于增强混凝土桥梁的结构检查 | Christian Benz, Volker Rodehorst | http://arxiv.org/pdf/2401.03298v1 | null |
| 2024-01-06 | Real Time Human Detection by Unmanned Aerial Vehicles | 无人机实时人体检测 | Walid Guettala, Ali Sayah, Laid Kahloul, Ahmed Tibermacine | http://arxiv.org/pdf/2401.03275v1 | null |
| 2024-01-06 | Group Activity Recognition using Unreliable Tracked Pose | 使用不可靠的跟踪姿势进行群体活动识别 | Haritha Thilakarathne, Aiden Nibali, Zhen He, Stuart Morgan | http://arxiv.org/pdf/2401.03262v1 | null |
| 2024-01-06 | 3DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding | 3DMIT:用于场景理解的 3D 多模态指令调整 | Zeju Li, Chao Zhang, Xiaoyan Wang, Ruilong Ren, Yifan Xu, Ruifei Ma, Xiangde Liu | http://arxiv.org/pdf/2401.03201v1 | link |
| 2024-01-06 | Distribution-aware Interactive Attention Network and Large-scale Cloud Recognition Benchmark on FY-4A Satellite Image | FY-4A卫星图像上的分布感知交互式注意力网络和大规模云识别基准 | Jiaqing Zhang, Jie Lei, Weiying Xie, Kai Jiang, Mingxiang Cao, Yunsong Li | http://arxiv.org/pdf/2401.03182v1 | null |
| 2024-01-06 | Multimodal Informative ViT: Information Aggregation and Distribution for Hyperspectral and LiDAR Classification | 多模态信息 ViT:高光谱和 LiDAR 分类的信息聚合和分发 | Jiaqing Zhang, Jie Lei, Weiying Xie, Geng Yang, Daixun Li, Yunsong Li, Karim Seghouane | http://arxiv.org/pdf/2401.03179v1 | null |
| 2024-01-06 | Text-Video Retrieval via Variational Multi-Modal Hypergraph Networks | 通过变分多模态超图网络进行文本视频检索 | Qian Li, Lixin Su, Jiashu Zhao, Long Xia, Hengyi Cai, Suqi Cheng, Hengzhu Tang, Junfeng Wang, Dawei Yin | http://arxiv.org/pdf/2401.03177v1 | null |
| 2024-01-06 | UGGNet: Bridging U-Net and VGG for Advanced Breast Cancer Diagnosis | UGGNet:桥接 U-Net 和 VGG 进行高级乳腺癌诊断 | Tran Cao Minh, Nguyen Kim Quoc, Phan Cong Vinh, Dang Nhu Phu, Vuong Xuan Chi, Ha Minh Tan | http://arxiv.org/pdf/2401.03173v1 | null |
| 2024-01-06 | An Event-Oriented Diffusion-Refinement Method for Sparse Events Completion | 一种面向事件的稀疏事件完成的扩散细化方法 | Bo Zhang, Yuqi Han, Jinli Suo, Qionghai Dai | http://arxiv.org/pdf/2401.03153v1 | null |
| 2024-01-06 | Controllable Image Synthesis of Industrial Data Using Stable Diffusion | 使用稳定扩散的工业数据的可控图像合成 | Gabriele Valvano, Antonino Agostino, Giovanni De Magistris, Antonino Graziano, Giacomo Veneri | http://arxiv.org/pdf/2401.03152v1 | null |
| 2024-01-06 | Explicit Visual Prompts for Visual Object Tracking | 视觉对象跟踪的显式视觉提示 | Liangtao Shi, Bineng Zhong, Qihua Liang, Ning Li, Shengping Zhang, Xianxian Li | http://arxiv.org/pdf/2401.03142v1 | null |
| 2024-01-06 | Vision Transformers and Bi-LSTM for Alzheimer's Disease Diagnosis from 3D MRI | 视觉 Transformers 和 Bi-LSTM 用于通过 3D MRI 诊断阿尔茨海默病 | Taymaz Akan, Sait Alp, Mohammad A. N Bhuiyanb | http://arxiv.org/pdf/2401.03132v1 | null |
| 2024-01-06 | Transferable Learned Image Compression-Resistant Adversarial Perturbations | 可迁移的学习图像抗压缩对抗性扰动 | Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen | http://arxiv.org/pdf/2401.03115v1 | null |

OCR

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-06 | ImageLab: Simplifying Image Processing Exploration for Novices and Experts Alike | ImageLab:为新手和专家简化图像处理探索 | Sahan Dissanayaka, Oshan Mudanayaka, Thilina Halloluwa, Chameera De Silva | http://arxiv.org/pdf/2401.03157v1 | null |

Model Compression/Optimization

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-06 | A Physics-guided Generative AI Toolkit for Geophysical Monitoring | 用于地球物理监测的物理引导生成人工智能工具包 | Junhuan Yang, Hanchen Wang, Yi Sheng, Youzuo Lin, Lei Yang | http://arxiv.org/pdf/2401.03131v1 | null |

NeRF

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-06 | RustNeRF: Robust Neural Radiance Field with Low-Quality Images | RustNeRF:具有低质量图像的鲁棒神经辐射场 | Mengfei Li, Ming Lu, Xiaofang Li, Shanghang Zhang | http://arxiv.org/pdf/2401.03257v1 | null |
| 2024-01-06 | Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping | Hi-Map:用于高保真单目密集映射的分层分解辐射场 | Tongyan Hua, Haotian Bai, Zidong Cao, Ming Liu, Dacheng Tao, Lin Wang | http://arxiv.org/pdf/2401.03203v1 | null |

Generative Models

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-06 | Short-Time Fourier Transform for deblurring Variational Autoencoders | 用于去模糊变分自动编码器的短时傅立叶变换 | Vibhu Dalal | http://arxiv.org/pdf/2401.03166v1 | link |
| 2024-01-06 | SAR Despeckling via Regional Denoising Diffusion Probabilistic Model | 通过区域去噪扩散概率模型进行 SAR 去斑 | Xuran Hu, Ziqiang Xu, Zhihan Chen, Zhengpeng Feng, Mingzhe Zhu, LJubisa Stankovic | http://arxiv.org/pdf/2401.03122v1 | null |

Multimodal

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-06 | CaMML: Context-Aware Multimodal Learner for Large Models | CaMML:大型模型的上下文感知多模态学习器 | Yixin Chen, Shuai Zhang, Boran Han, Tong He, Bo Li | http://arxiv.org/pdf/2401.03149v1 | null |
| 2024-01-06 | Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models | 结合视觉专家解决多模态大语言模型中的信息丢失问题 | Xin He, Longhui Wei, Lingxi Xie, Qi Tian | http://arxiv.org/pdf/2401.03105v1 | null |

Transformer

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-06 | MetaISP -- Exploiting Global Scene Structure for Accurate Multi-Device Color Rendition | MetaISP:利用全局场景结构实现准确的多设备色彩再现 | Matheus Souza, Wolfgang Heidrich | http://arxiv.org/pdf/2401.03220v1 | link |
| 2024-01-06 | PosDiffNet: Positional Neural Diffusion for Point Cloud Registration in a Large Field of View with Perturbations | PosDiffNet:带扰动的大视场点云配准的位置神经扩散 | Rui She, Sijie Wang, Qiyu Kang, Kai Zhao, Yang Song, Wee Peng Tay, Tianyu Geng, Xingchao Jian | http://arxiv.org/pdf/2401.03167v1 | null |

3D/CG

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-06 | Dress-Me-Up: A Dataset & Method for Self-Supervised 3D Garment Retargeting | Dress-Me-Up:用于自监督 3D 服装重定向的数据集和方法 | Shanthika Naik, Kunwar Singh, Astitva Srivastava, Dhawal Sirikonda, Amit Raj, Varun Jampani, Avinash Sharma | http://arxiv.org/pdf/2401.03108v1 | null |

Other

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-06 | Analysis and Validation of Image Search Engines in Histopathology | 组织病理学图像搜索引擎的分析和验证 | Isaiah Lahr, Saghir Alfasly, Peyman Nejat, Jibran Khan, Luke Kottom, Vaishnavi Kumbhar, Areej Alsaafin, Abubakr Shafique, Sobhan Hemati, Ghazal Alabtah, et al. | http://arxiv.org/pdf/2401.03271v1 | null |
| 2024-01-06 | Autonomous Navigation in Complex Environments | 复杂环境下的自主导航 | Andrew Gerstenslager, Jomol Lewis, Liam McKenna, Poorva Patel | http://arxiv.org/pdf/2401.03267v1 | null |
| 2024-01-06 | Interpersonal Relationship Analysis with Dyadic EEG Signals via Learning Spatial-Temporal Patterns | 通过学习时空模式进行二元脑电信号的人际关系分析 | Wenqi Ji, Fang liu, Xinxin Du, Niqi Liu, Chao Zhou, Mingjin Yu, Guozhen Zhao, Yong-Jin Liu | http://arxiv.org/pdf/2401.03250v1 | null |
| 2024-01-06 | Efficient Bitrate Ladder Construction using Transfer Learning and Spatio-Temporal Features | 使用迁移学习和时空特征构建高效的比特率阶梯 | Ali Falahati, Mohammad Karim Safavi, Ardavan Elahi, Farhad Pakdaman, Moncef Gabbouj | http://arxiv.org/pdf/2401.03195v1 | null |
| 2024-01-06 | MPN: Leveraging Multilingual Patch Neuron for Cross-lingual Model Editing | MPN:利用多语言补丁神经元进行跨语言模型编辑 | Nianwen Si, Hao Zhang, Weiqiang Zhang | http://arxiv.org/pdf/2401.03190v1 | null |