Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-01-06 | Exploiting Data Hierarchy as a New Modality for Contrastive Learning | 利用数据层次结构作为对比学习的新模式 | Arjun Bhalla, Daniel Levenson, Jan Bernhard, Anton Abilov | http://arxiv.org/pdf/2401.03312v1 | null |
2024-01-06 | Large Language Models as Visual Cross-Domain Learners | 作为视觉跨领域学习者的大型语言模型 | Shuhao Chen, Yulong Zhang, Weisen Jiang, Jiangang Lu, Yu Zhang | http://arxiv.org/pdf/2401.03253v1 | null |
2024-01-06 | MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond | MirrorDiffusion:通过提示重新描述及其他方式稳定零样本图像翻译中的扩散过程 | Yupei Lin, Xiaoyu Xian, Yukai Shi, Liang Lin | http://arxiv.org/pdf/2401.03221v1 | null |
2024-01-06 | DistFormer: Enhancing Local and Global Features for Monocular Per-Object Distance Estimation | DistFormer:增强单目每个对象距离估计的局部和全局特征 | Aniello Panariello, Gianluca Mancusi, Fedy Haj Ali, Angelo Porrello, Simone Calderara, Rita Cucchiara | http://arxiv.org/pdf/2401.03191v1 | null |
2024-01-06 | Preserving Silent Features for Domain Generalization | 保留静默特征以进行领域泛化 | Chujie Zhao, Tianren Zhang, Feng Chen | http://arxiv.org/pdf/2401.03170v1 | null |
2024-01-06 | Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection | 用于 3D 工业异常检测的自监督特征适应 | Yuanpeng Tu, Boshen Zhang, Liang Liu, Yuxi Li, Chenhai Xu, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cai Rong Zhao | http://arxiv.org/pdf/2401.03145v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-01-06 | Spatiotemporally adaptive compression for scientific dataset with feature preservation -- a case study on simulation data with extreme climate events analysis | 具有特征保留的科学数据集时空自适应压缩——极端气候事件分析模拟数据案例研究 | Qian Gong, Chengzhu Zhang, Xin Liang, Viktor Reshniak, Jieyang Chen, Anand Rangarajan, Sanjay Ranka, Nicolas Vidal, Lipeng Wan, Paul Ullrich, et.al. | http://arxiv.org/pdf/2401.03317v1 | null |
2024-01-06 | Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT | 现实主义行动:使用 YOLOv8 和 DeiT 根据医学图像对脑肿瘤进行异常感知诊断 | Seyed Mohammad Hossein Hashemi, Leila Safari, Amirhossein Dadashzade Taromi | http://arxiv.org/pdf/2401.03302v1 | null |
2024-01-06 | Multi-View 3D Instance Segmentation of Structural Anomalies for Enhanced Structural Inspection of Concrete Bridges | 结构异常的多视图 3D 实例分割,用于增强混凝土桥梁的结构检查 | Christian Benz, Volker Rodehorst | http://arxiv.org/pdf/2401.03298v1 | null |
2024-01-06 | Real Time Human Detection by Unmanned Aerial Vehicles | 无人机实时人体检测 | Walid Guettala, Ali Sayah, Laid Kahloul, Ahmed Tibermacine | http://arxiv.org/pdf/2401.03275v1 | null |
2024-01-06 | Group Activity Recognition using Unreliable Tracked Pose | 使用不可靠的跟踪姿势进行群体活动识别 | Haritha Thilakarathne, Aiden Nibali, Zhen He, Stuart Morgan | http://arxiv.org/pdf/2401.03262v1 | null |
2024-01-06 | 3DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding | 3DMIT:用于场景理解的 3D 多模态指令调整 | Zeju Li, Chao Zhang, Xiaoyan Wang, Ruilong Ren, Yifan Xu, Ruifei Ma, Xiangde Liu | http://arxiv.org/pdf/2401.03201v1 | link |
2024-01-06 | Distribution-aware Interactive Attention Network and Large-scale Cloud Recognition Benchmark on FY-4A Satellite Image | FY-4A卫星图像上的分布感知交互式注意力网络和大规模云识别基准 | Jiaqing Zhang, Jie Lei, Weiying Xie, Kai Jiang, Mingxiang Cao, Yunsong Li | http://arxiv.org/pdf/2401.03182v1 | null |
2024-01-06 | Multimodal Informative ViT: Information Aggregation and Distribution for Hyperspectral and LiDAR Classification | 多模态信息 ViT:高光谱和 LiDAR 分类的信息聚合和分发 | Jiaqing Zhang, Jie Lei, Weiying Xie, Geng Yang, Daixun Li, Yunsong Li, Karim Seghouane | http://arxiv.org/pdf/2401.03179v1 | null |
2024-01-06 | Text-Video Retrieval via Variational Multi-Modal Hypergraph Networks | 通过变分多模态超图网络进行文本视频检索 | Qian Li, Lixin Su, Jiashu Zhao, Long Xia, Hengyi Cai, Suqi Cheng, Hengzhu Tang, Junfeng Wang, Dawei Yin | http://arxiv.org/pdf/2401.03177v1 | null |
2024-01-06 | UGGNet: Bridging U-Net and VGG for Advanced Breast Cancer Diagnosis | UGGNet:桥接 U-Net 和 VGG 进行高级乳腺癌诊断 | Tran Cao Minh, Nguyen Kim Quoc, Phan Cong Vinh, Dang Nhu Phu, Vuong Xuan Chi, Ha Minh Tan | http://arxiv.org/pdf/2401.03173v1 | null |
2024-01-06 | An Event-Oriented Diffusion-Refinement Method for Sparse Events Completion | 一种面向事件的稀疏事件完成的扩散细化方法 | Bo Zhang, Yuqi Han, Jinli Suo, Qionghai Dai | http://arxiv.org/pdf/2401.03153v1 | null |
2024-01-06 | Controllable Image Synthesis of Industrial Data Using Stable Diffusion | 使用稳定扩散的工业数据的可控图像合成 | Gabriele Valvano, Antonino Agostino, Giovanni De Magistris, Antonino Graziano, Giacomo Veneri | http://arxiv.org/pdf/2401.03152v1 | null |
2024-01-06 | Explicit Visual Prompts for Visual Object Tracking | 视觉对象跟踪的显式视觉提示 | Liangtao Shi, Bineng Zhong, Qihua Liang, Ning Li, Shengping Zhang, Xianxian Li | http://arxiv.org/pdf/2401.03142v1 | null |
2024-01-06 | Vision Transformers and Bi-LSTM for Alzheimer's Disease Diagnosis from 3D MRI | 视觉 Transformers 和 Bi-LSTM 用于通过 3D MRI 诊断阿尔茨海默病 | Taymaz Akan, Sait Alp, Mohammad A. N Bhuiyanb | http://arxiv.org/pdf/2401.03132v1 | null |
2024-01-06 | Transferable Learned Image Compression-Resistant Adversarial Perturbations | 可迁移的学习图像抗压缩对抗性扰动 | Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen | http://arxiv.org/pdf/2401.03115v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-01-06 | ImageLab: Simplifying Image Processing Exploration for Novices and Experts Alike | ImageLab:为新手和专家简化图像处理探索 | Sahan Dissanayaka, Oshan Mudanayaka, Thilina Halloluwa, Chameera De Silva | http://arxiv.org/pdf/2401.03157v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-01-06 | A Physics-guided Generative AI Toolkit for Geophysical Monitoring | 用于地球物理监测的物理引导生成人工智能工具包 | Junhuan Yang, Hanchen Wang, Yi Sheng, Youzuo Lin, Lei Yang | http://arxiv.org/pdf/2401.03131v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-01-06 | RustNeRF: Robust Neural Radiance Field with Low-Quality Images | RustNeRF:具有低质量图像的鲁棒神经辐射场 | Mengfei Li, Ming Lu, Xiaofang Li, Shanghang Zhang | http://arxiv.org/pdf/2401.03257v1 | null |
2024-01-06 | Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping | Hi-Map:用于高保真单目密集映射的分层分解辐射场 | Tongyan Hua, Haotian Bai, Zidong Cao, Ming Liu, Dacheng Tao, Lin Wang | http://arxiv.org/pdf/2401.03203v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-01-06 | Short-Time Fourier Transform for deblurring Variational Autoencoders | 用于去模糊变分自动编码器的短时傅立叶变换 | Vibhu Dalal | http://arxiv.org/pdf/2401.03166v1 | link |
2024-01-06 | SAR Despeckling via Regional Denoising Diffusion Probabilistic Model | 通过区域去噪扩散概率模型进行 SAR 去斑 | Xuran Hu, Ziqiang Xu, Zhihan Chen, Zhengpeng Feng, Mingzhe Zhu, LJubisa Stankovic | http://arxiv.org/pdf/2401.03122v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-01-06 | CaMML: Context-Aware Multimodal Learner for Large Models | CaMML:大型模型的上下文感知多模态学习器 | Yixin Chen, Shuai Zhang, Boran Han, Tong He, Bo Li | http://arxiv.org/pdf/2401.03149v1 | null |
2024-01-06 | Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models | 结合视觉专家解决多模态大语言模型中的信息丢失问题 | Xin He, Longhui Wei, Lingxi Xie, Qi Tian | http://arxiv.org/pdf/2401.03105v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-01-06 | MetaISP -- Exploiting Global Scene Structure for Accurate Multi-Device Color Rendition | MetaISP——利用全局场景结构实现准确的多设备色彩再现 | Matheus Souza, Wolfgang Heidrich | http://arxiv.org/pdf/2401.03220v1 | link |
2024-01-06 | PosDiffNet: Positional Neural Diffusion for Point Cloud Registration in a Large Field of View with Perturbations | PosDiffNet:带扰动的大视场点云配准的位置神经扩散 | Rui She, Sijie Wang, Qiyu Kang, Kai Zhao, Yang Song, Wee Peng Tay, Tianyu Geng, Xingchao Jian | http://arxiv.org/pdf/2401.03167v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-01-06 | Dress-Me-Up: A Dataset & Method for Self-Supervised 3D Garment Retargeting | Dress-Me-Up:用于自监督 3D 服装重定向的数据集和方法 | Shanthika Naik, Kunwar Singh, Astitva Srivastava, Dhawal Sirikonda, Amit Raj, Varun Jampani, Avinash Sharma | http://arxiv.org/pdf/2401.03108v1 | null |
Publish Date | Title | Title_CN | Authors | Code | |
---|---|---|---|---|---|
2024-01-06 | Analysis and Validation of Image Search Engines in Histopathology | 组织病理学图像搜索引擎的分析和验证 | Isaiah Lahr, Saghir Alfasly, Peyman Nejat, Jibran Khan, Luke Kottom, Vaishnavi Kumbhar, Areej Alsaafin, Abubakr Shafique, Sobhan Hemati, Ghazal Alabtah, et.al. | http://arxiv.org/pdf/2401.03271v1 | null |
2024-01-06 | Autonomous Navigation in Complex Environments | 复杂环境下的自主导航 | Andrew Gerstenslager, Jomol Lewis, Liam McKenna, Poorva Patel | http://arxiv.org/pdf/2401.03267v1 | null |
2024-01-06 | Interpersonal Relationship Analysis with Dyadic EEG Signals via Learning Spatial-Temporal Patterns | 通过学习时空模式进行二元脑电信号的人际关系分析 | Wenqi Ji, Fang liu, Xinxin Du, Niqi Liu, Chao Zhou, Mingjin Yu, Guozhen Zhao, Yong-Jin Liu | http://arxiv.org/pdf/2401.03250v1 | null |
2024-01-06 | Efficient Bitrate Ladder Construction using Transfer Learning and Spatio-Temporal Features | 使用迁移学习和时空特征构建高效的比特率阶梯 | Ali Falahati, Mohammad Karim Safavi, Ardavan Elahi, Farhad Pakdaman, Moncef Gabbouj | http://arxiv.org/pdf/2401.03195v1 | null |
2024-01-06 | MPN: Leveraging Multilingual Patch Neuron for Cross-lingual Model Editing | MPN:利用多语言补丁神经元进行跨语言模型编辑 | Nianwen Si, Hao Zhang, Weiqiang Zhang | http://arxiv.org/pdf/2401.03190v1 | null |