Arxiv 2025-01-06 Papers
| 标题 | 作者 | PDF链接 | 代码仓库 | Title |
|---|---|---|---|---|
| 高斯掩码自编码器 | Jathushan Rajasegaran | N/A | Gaussian Masked Autoencoders | |
| LightGNN:用于推荐的简单图神经网络 | Guoxuan Chen | N/A | LightGNN: Simple Graph Neural Network for Recommendation | |
| BoostStep:通过改进单步推理提升大型语言模型的数学能力 | Beichen Zhang | N/A | BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning | |
| 自动化生成具有挑战性的多选题以评估视觉语言模型 | Yuhui Zhang | N/A | Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | |
| Rate-My-LoRA:用于心脏MRI分割的高效自适应联邦模型调优 | Xiaoxiao He | N/A | Rate-My-LoRA: Efficient and Adaptive Federated Model Tuning for Cardiac MRI Segmentation | |
| 以下是这段英文的中文翻译: |
描述分布式随机凸优化中的准确性-通信-隐私权衡
这段文字涉及分布式随机凸优化中的一个关键问题,即如何在模型准确性、通信效率和隐私保护之间找到平衡。具体来说,它探讨了在分布式计算环境中,如何通过优化算法设计来权衡这三个因素,以实现最佳的系统性能。 | Sudeep Salgia | PDF | N/A | Characterizing the Accuracy-Communication-Privacy Trade-off in Distributed Stochastic Convex Optimization | | RW-Net:基于小波变换投影网络的少样本点云分类增强方法 | Haosheng Zhang | PDF | N/A | RW-Net: Enhancing Few-Shot Point Cloud Classification with a Wavelet Transform Projection-based Network | | ProTracker:用于鲁棒且精确点跟踪的概率积分方法 | Tingyang Zhang | PDF | N/A | ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking | | Dispider:通过解耦感知、决策和反应,实现视频大语言模型的主动实时交互 | Rui Qian | PDF | N/A | Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction | | 利用可解释的人工智能进行LLM文本归属:区分人类撰写与多个LLM生成的文本 | Ayat Najjar | PDF | N/A | Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text | | 检测教育内容中的AI生成文本:利用机器学习和可解释AI维护学术诚信 | Ayat A. Najjar | PDF | N/A | Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity | | FACTS接地排行榜:评估LLMs在长文本输入中接地回应的能力 | Alon Jacovi | PDF | N/A | The FACTS Grounding Leaderboard: Benchmarking LLMs' Ability to Ground Responses to Long-Form Input | | CLIX:习语表达的跨语言解释 | Aaron Gluck | PDF | N/A | CLIX: Cross-Lingual Explanations of Idiomatic Expressions | | 多模态机器学习可以预测视频会议的流畅性和愉悦度 | Andrew Chang | PDF | N/A | Multimodal Machine Learning Can Predict Videoconference Fluidity and Enjoyment | | 回合制多智能体强化学习模型检验 | Dennis Gross | PDF | N/A | Turn-based Multi-Agent Reinforcement Learning Model Checking | | 通过自监督预训练实现抗噪目标说话人语音活动检测 | Holger Severin Bovbjerg | PDF | N/A | Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining | | 可扩展的前向-前向算法 | Andrii Krutsylo | PDF | N/A | Scalable Forward-Forward Algorithm | | MObI:使用扩散模型进行多模态物体修复 | Alexandru Buburuzan | PDF | N/A | MObI: Multimodal Object Inpainting Using Diffusion Models | | GLiREL —— 零样本关系抽取的通用模型 | Jack Boylan | PDF | N/A | GLiREL -- Generalist Model for Zero-Shot Relation Extraction | | 语义描述:SQL2Text的基准数据集和图感知的少样本上下文学习 | Ali Al-Lawati | PDF | N/A | Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text | | 基于深度相对信任的去中心化深度学习扩散 | Muyun Li | PDF | N/A | Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning | | 液相透射电子显微镜中的零样本单粒子追踪分割模型 | Risha Goel | PDF | N/A | Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy | | 基于互信息上界的LoRA缩放定律 | Jing Zhang | PDF | N/A | The Scaling Law for LoRA Base on Mutual Information Upper Bound | | 大型语言模型在人工通用智能(AGI)中的应用:基础原则与方法综述 | Alhassan Mumuni | PDF | N/A | Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches | | 相机拍摄文档图像的几何恢复与去扭曲 | Valery Istomin | PDF | N/A | Geometry Restoration and Dewarping of Camera-Captured Document Images | | 安全验证与可解释深度强化学习策略的协同激活图分析 | Dennis Gross | PDF | N/A | Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | | VicSim:通过情感与语言真实性提升受害者模拟效果 | Yerong Li | PDF | N/A | VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity | | 分布式专家问题的通信界限 | Zhihao Jia | PDF | N/A | Communication Bounds for the Distributed Experts Problem | | 从时间序列数据中学习有向无环图(DAGs)和根本原因 | Panagiotis Misiakos | PDF | N/A | Learning DAGs and Root Causes from Time-Series Data | | PRMBench:一个细粒度且具有挑战性的过程级奖励模型基准 | Mingyang Song | PDF | N/A | PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models | | 将“Normalizing Batch Normalization for Long-Tailed Recognition”翻译成中文是:
“归一化批量归一化用于长尾识别”
或者更自然的表达可以是:
“长尾识别中的批量归一化归一化”
具体翻译可以根据上下文语境调整。 | Yuxiang Bao | PDF | N/A | Normalizing Batch Normalization for Long-Tailed Recognition | | CAT: 内容自适应图像标记化 | Junhong Shen | PDF | N/A | CAT: Content-Adaptive Image Tokenization | | 从模型到网络拓扑:去中心化联邦学习中的拓扑推断攻击 | Chao Feng | PDF | N/A | From Models to Network Topologies: A Topology Inference Attack in Decentralized Federated Learning | | 平衡效率与表达力:基于行走中心性的子图图神经网络 | Joshua Southern | PDF | N/A | Balancing Efficiency and Expressiveness: Subgraph GNNs with Walk-Based Centrality | | LangFair:一个用于评估大型语言模型用例中偏见与公平性的Python包 | Dylan Bouchard | PDF | N/A | LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases | | MVP:基于视频和生理信号的多模态情感识别 | Valeriya Strizhkova | PDF | N/A | MVP: Multimodal Emotion Recognition based on Video and Physiological Signals | | 一种新颖的结构无关多目标方法,用于深度神经网络中的权重共享压缩 | Rasa Khosrowshahli | PDF | N/A | A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks | | 情感引导的常识感知响应生成在心理健康咨询中的应用 | Aseem Srivastava | PDF | N/A | Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling | | 个性化时尚推荐与图像属性及美学评估 | Chongxian Chen | PDF | N/A | Personalized Fashion Recommendation with Image Attributes and Aesthetics Assessment | | Qinco2:使用改进的隐式神经码本进行向量压缩与搜索 | Théophane Vallaeys | PDF | N/A | Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks | | AIF-SFDA:基于自主信息过滤的无源域自适应医学图像分割方法
在这段翻译中,“AIF-SFDA”是原文的缩写,直接保留。其余部分翻译如下: - “Autonomous Information Filter-driven” 翻译为“基于自主信息过滤的” - “Source-Free Domain Adaptation” 翻译为“无源域自适应” - “for Medical Image Segmentation” 翻译为“医学图像分割方法”
整体翻译保持了原文的技术性和专业性,同时确保了中文表达的流畅性。 | Haojin Li | PDF | N/A | AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation | | 基于Slim多尺度卷积自编码器的降阶模型用于复杂动力系统的可解释特征提取 | Philipp Teutsch | PDF | N/A | Slim multi-scale convolutional autoencoder-based reduced-order models for interpretable features of a complex dynamical system | | 以下是这段文字的中文翻译:
咨询对话中的信任建模:一项基准研究
这个标题指的是一项关于在咨询对话中建立信任模型的研究,该研究旨在为这一领域提供一个基准或参考标准。研究可能涉及如何通过对话分析、行为模式识别或其他技术来量化和理解咨询过程中信任的建立与维持。 | Aseem Srivastava | PDF | N/A | Trust Modeling in Counseling Conversations: A Benchmark Study | | 《透过面具:基于面具的运动轨迹用于图像到视频生成》
这个标题指的是一种技术或方法,通过使用“面具”(mask)来生成从静态图像到动态视频的运动轨迹。具体来说,这种方法可能涉及使用图像分割或遮罩技术来识别和跟踪图像中的特定区域或对象,然后根据这些区域或对象的运动轨迹生成视频。这种方法可以用于各种应用,如动画制作、视频编辑和增强现实等。 | Guy Yariv | PDF | N/A | Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation | | 生存分析再探:在跌倒风险分析中理解与统一泊松、指数和Cox模型 | Tianhua Chen | PDF | N/A | Survival Analysis Revisited: Understanding and Unifying Poisson, Exponential, and Cox Models in Fall Risk Analysis | | 分析与规范拥堵游戏中的人类参与学习 | Hongbo Li | PDF | N/A | To Analyze and Regulate Human-in-the-loop Learning for Congestion Games | | Dr. Tongue: 面向舌象的多标签检测用于远程舌诊 | Yiliang Chen | PDF | N/A | Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis | | 基于单通道距离的移动GPU在室外和室内环境中的源分离 | Hanbin Bae | PDF | N/A | Single-Channel Distance-Based Source Separation for Mobile GPU in Outdoor and Indoor Environments | | 群体Shapley值及其在债券回收率预测中的应用——基于稳健显著性检验
这个翻译将原标题进行了适当的扩展和调整,以更清晰地表达研究内容:
-
"Group Shapley" 翻译为 "群体Shapley值",明确了这是关于Shapley值方法的研究
-
增加了连接词"及其",使标题各部分关系更清晰
-
"Robust Significance Testing" 翻译为"基于稳健显著性检验",采用倒装结构突出方法论特征
-
"Application to" 翻译为"及其在...中的应用",更符合中文表达习惯
-
"Bond Recovery Rate Prediction" 翻译为"债券回收率预测",准确传达了应用领域
这样的翻译既保持了原文的专业性和准确性,又使其更符合中文的阅读习惯和学术论文标题的表达规范。 | Jingyi Wang | PDF | N/A | Group Shapley with Robust Significance Testing and Its Application to Bond Recovery Rate Prediction | | ChronoSense:通过事件时间间隔探索大型语言模型中的时间理解 | Duygu Sezen Islakoglu | PDF | N/A | ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events | | 钢琴转录通过分层语言建模与基于乐谱的预训练编码器实现 | Dichucheng Li | PDF | N/A | Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders | | 量化遇上推理:探索LLM低比特量化对数学推理的退化影响 | Zhen Li | PDF | N/A | Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning | | DDRM-PR:使用去噪扩散恢复模型进行傅里叶相位恢复 | Mehmet Onurcan Kaya | PDF | N/A | DDRM-PR: Fourier Phase Retrieval using Denoising Diffusion Restoration Models | | 从机器学习的视角解读普特南的批判性与解释性倾向 | Sheldon Z. Soudin | PDF | N/A | Putnam's Critical and Explanatory Tendencies Interpreted from a Machine Learning Perspective | | 一种基于信任引导的带有辅助信息的磁共振图像重建方法 | Arda Atalık | PDF | N/A | A Trust-Guided Approach to MR Image Reconstruction with Side Information | | 不确定性下双边市场的可能正确最优稳定匹配 | Andreas Athanasopoulos | PDF | N/A | Probably Correct Optimal Stable Matching for Two-Sided Markets Under Uncertainty | | ReLU神经网络中的凸性:超越ICNNs? | Anne Gagneux | PDF | N/A | Convexity in ReLU Neural Networks: beyond ICNNs? | | 分析用于多模态大语言模型(LLMs)微调的表示偏移以实现对齐 | Pegah Khayatan | PDF | N/A | Analyzing Fine-tuning Representation Shift for Multimodal LLMs Steering alignment | | 基于质量评估的反馈训练用于改进代词翻译 | Harshit Dhankhar | PDF | N/A | Quality Estimation based Feedback Training for Improving Pronoun Translation | | TransPixar:通过透明度推进文本到视频生成 | Luozhou Wang | PDF | N/A | TransPixar: Advancing Text-to-Video Generation with Transparency | | PiLaMIM: 通过整合像素和潜在掩码图像建模实现更丰富的视觉表示 | Junmyeong Lee | PDF | N/A | PiLaMIM: Toward Richer Visual Representations by Integrating Pixel and Latent Masked Image Modeling | | CALM:面向大型语言模型的好奇心驱动审计 | Xiang Zheng | PDF | N/A | CALM: Curiosity-Driven Auditing for Large Language Models | | NeuroPMD:基于神经场的产品流形密度估计 | William Consagra | PDF | N/A | NeuroPMD: Neural Fields for Density Estimation on Product Manifolds | | GLFC:基于Mamba增强UNet的统一全局-局部特征与对比学习,用于从CBCT生成合成CT | Xianhao Zhou | PDF | N/A | GLFC: Unified Global-Local Feature and Contrast Learning with Mamba-Enhanced UNet for Synthetic CT Generation from CBCT | | SurgRIPE挑战:手术机器人器械姿态估计基准测试 | Haozheng Xu | PDF | N/A | SurgRIPE challenge: Benchmark of Surgical Robot Instrument Pose Estimation | | 分类器加权混合模型 | Elouan Argouarc'h | PDF | N/A | Classifier Weighted Mixture models | | 生物启发的碰撞感知神经元研究范式推动神经机器人融合:以LGMD为例 | Ziyan Qin | PDF | N/A | A Bio-Inspired Research Paradigm of Collision Perception Neurons Enabling Neuro-Robotic Integration: The LGMD Case | | CONTINUUM:通过时空图神经网络检测APT攻击
翻译: CONTINUUM 是一种通过时空图神经网络(Spatial-Temporal Graph Neural Networks)来检测高级持续性威胁(APT)攻击的系统。 | Atmane Ayoub Mansour Bahara | PDF | N/A | CONTINUUM: Detecting APT Attacks through Spatial-Temporal Graph Neural Networks | | 在多语言神经机器翻译中将源语言标记注册到目标语言空间 | Zhi Qu | PDF | N/A | Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation | | CAMP:基于配置文件的协作注意力模型用于车辆路径问题 | Chuanbo Hua | PDF | N/A | CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems | | STAR:利用文本到视频模型进行时空增强以实现现实世界视频超分辨率 | Rui Xie | PDF | N/A | STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution | | 基于模糊粒度的多尺度粒度球密度离群点检测 | Can Gao | PDF | N/A | Fuzzy Granule Density-Based Outlier Detection with Multi-Scale Granular Balls | | HaWoR: 从第一人称视角视频重建世界空间中的手部运动 | Jinglei Zhang | PDF | N/A | HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos | | 这段文字的中文翻译是:
数据证明:一种用于协作智能的共识协议
其中,“Proof-of-Data”指的是“数据证明”,“A Consensus Protocol”意为“一种共识协议”,“Collaborative Intelligence”则翻译为“协作智能”。 | Huiwen Liu | PDF | N/A | Proof-of-Data: A Consensus Protocol for Collaborative Intelligence | | LOHA:低通与高通视图之间的直接图谱对比学习 | Ziyun Zou | PDF | N/A | LOHA: Direct Graph Spectral Contrastive Learning Between Low-pass and High-pass Views | | 人类凝视增强以对象为中心的表示学习 | Timothy Schaumlöffel | PDF | N/A | Human Gaze Boosts Object-Centered Representation Learning | | 苏格拉底式提问法:学会在自然环境中自我引导多模态推理 | Wanpeng Hu | PDF | N/A | Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild | | 以下是将这段英文翻译成中文的结果:
一个用于优化向用户重复交付个性化行动的点过程模型
翻译说明: - "A Point Process Model" 翻译为 "点过程模型"。 - "for Optimizing" 翻译为 "用于优化"。 - "Repeated Personalized Action Delivery" 翻译为 "重复交付个性化行动"。 - "to Users" 翻译为 "向用户"。
希望这个翻译对你有帮助! | Alexander Merkov | PDF | N/A | A Point Process Model for Optimizing Repeated Personalized Action Delivery to Users | | SceneVTG++:可控的多语言视觉文本生成技术 | Jiawei Liu | PDF | N/A | SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | | MotionBench:为视觉语言模型进行细粒度视频运动理解的基准测试与改进 | Wenyi Hong | PDF | N/A | MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models | | 大脑中的键值记忆 | Samuel J. Gershman | PDF | N/A | Key-value memory in the brain | | MSA-CNN: 一种轻量级多尺度卷积神经网络,带有注意力机制的睡眠阶段分类模型 | Stephan Goerttler | PDF | N/A | MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification | | 表格基础模型TabPFN基于简单特征超越了专门的时间序列预测模型 | Shi Bin Hoo | PDF | N/A | The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features | | 改进利用半定优化解决低秩问题的近似算法 | Ryan Cory-Wright | PDF | N/A | Improved Approximation Algorithms for Low-Rank Problems Using Semidefinite Optimization | | 4D-CS:利用集群先验进行4D时空LiDAR语义分割 | Jiexi Zhong | PDF | N/A | 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation | | 从数据中发现时间延迟微分方程的贝叶斯方法 | Debangshu Chowdhury | PDF | N/A | A Bayesian Approach for Discovering Time- Delayed Differential Equation from Data | | 预测化学组成对带隙的影响:一种针对具有非典型统计特性的材料特性的简单学习模型 | Andrew Ma | PDF | N/A | Predicting band gap from chemical composition: A simple learned model for a material property with atypical statistics | | 自注意力作为一种参数化自函子:Transformer架构的范畴论框架 | Charles O'Neill | PDF | N/A | Self-Attention as a Parametric Endofunctor: A Categorical Framework for Transformer Architectures | | 离线到在线超参数迁移用于随机赌博机问题 | Dravyansh Sharma | PDF | N/A | Offline-to-online hyperparameter transfer for stochastic bandits | | 基于无标签概念的多实例学习用于千兆像素病理学 | Susu Sun | PDF | N/A | Label-free Concept Based Multiple Instance Learning for Gigapixel Histopathology | | 使用高光谱成像和变分自编码器进行无监督的番茄裂果异常检测 | Mahmoud Abdulsalam | PDF | N/A | Unsupervised Tomato Split Anomaly Detection using Hyperspectral Imaging and Variational Autoencoders | | 基于单目事件脉冲的6D姿态估计在空间应用中的应用 | Jonathan Courtois | PDF | N/A | Spiking monocular event based 6D pose estimation for space application | | 点图条件扩散用于一致的新视角合成 | Thang-Anh-Quan Nguyen | PDF | N/A | Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis | | 为肿瘤微环境分析提供全面的病理图像分割:通过教师聚合实现 | Daisuke Komura | PDF | N/A | Comprehensive Pathological Image Segmentation via Teacher Aggregation for Tumor Microenvironment Analysis | | 领域无关的通用并行算法组合协同进化 | Zhiyuan Wang | PDF | N/A | Domain-Agnostic Co-Evolution of Generalizable Parallel Algorithm Portfolios | | 以下是这段文字的中文翻译:
“利用集成深度学习框架进行高分辨率集合降水预测”
翻译说明: - Skillful:译为“熟练的”或“高效的”,这里可以理解为“高效的”或“精准的”。 - High-Resolution:译为“高分辨率”。 - Ensemble Precipitation Forecasting:译为“集合降水预测”,集合预测是一种通过结合多个模型或预测结果来提高预测准确性的方法。 - Integrated Deep Learning Framework:译为“集成深度学习框架”,指结合多种深度学习技术的综合框架。
整句话的意思是:通过一个集成的深度学习框架,实现高效的高分辨率集合降水预测。 | Shuangshuang He | PDF | N/A | Skillful High-Resolution Ensemble Precipitation Forecasting with an Integrated Deep Learning Framework | | 以下是将这段英文翻译成中文的结果:
基于强化学习的移动机器人仿真到现实迁移:从NVIDIA Isaac Sim到Gazebo和真实的ROS 2机器人
翻译解释: - Sim-to-Real Transfer:仿真到现实迁移,指将仿真环境中训练的结果应用到现实世界中的技术。 - Mobile Robots:移动机器人,指能够在环境中自主移动的机器人。 - Reinforcement Learning:强化学习,一种机器学习方法,通过试错和奖励机制来训练智能体。 - NVIDIA Isaac Sim:NVIDIA开发的机器人仿真平台。 - Gazebo:一个开源的机器人仿真工具。 - ROS 2 Robots:基于ROS 2(机器人操作系统2)的机器人。
希望这段翻译对你有帮助! | Sahar Salimpour | PDF | N/A | Sim-to-Real Transfer for Mobile Robots with Reinforcement Learning: from NVIDIA Isaac Sim to Gazebo and Real ROS 2 Robots | | 基于感兴趣区域的医学图像压缩 | Utkarsh Prakash Srivastava | PDF | N/A | Region of Interest based Medical Image Compression | | FoundPAD: 重新加载基础模型用于人脸呈现攻击检测 | Guray Ozgur | PDF | N/A | FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection | | 解释幽默风格分类:一种理解计算幽默分析的可解释人工智能方法 | Mary Ogbuka Kenneth | PDF | N/A | Explaining Humour Style Classifications: An XAI Approach to Understanding Computational Humour Analysis | | 从维度分析的角度重新审视多智能体强化学习中的通信效率 | Chuxiong Sun | PDF | N/A | Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis Perspective | | MDP3:一种无需训练的列表式视频帧选择方法,适用于视频-LLMs | Hui Sun | PDF | N/A | MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs | | PARF-Net:将像素级自适应感受野融入混合Transformer-CNN网络用于医学图像分割 | Xu Ma | PDF | N/A | PARF-Net: integrating pixel-wise adaptive receptive fields into hybrid Transformer-CNN network for medical image segmentation | | 基于条件互信息的扩散后验采样用于求解逆问题 | Shayan Mohajer Hamidi | PDF | N/A | Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems | | 二维未知视角层析成像中的未知角度分布问题 | Kaishva Chintan Shah | PDF | N/A | Two-Dimensional Unknown View Tomography from Unknown Angle Distributions | | IIMedGPT:通过高效的人类偏好对齐提升大型语言模型在医疗任务中的能力 | Yiming Zhang | PDF | N/A | IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment | | Diff-Lung:基于扩散的纹理合成技术用于增强肺部CT扫描中的病理组织分割 | Rezkellah Noureddine Khiati | PDF | N/A | Diff-Lung: Diffusion-Based Texture Synthesis for Enhanced Pathological Tissue Segmentation in Lung CT Scans | | 在自监督表示学习中看到部分的整体 | Arthur Aubret | PDF | N/A | Seeing the Whole in the Parts in Self-Supervised Representation Learning | | 一种基于相机-激光雷达融合的新型视觉Transformer用于交通对象分割 | Toomas Tahves | PDF | N/A | A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation | | ParetoLens:一个用于探索多目标进化算法解集的视觉分析框架 | Yuxin Ma | PDF | N/A | ParetoLens: A Visual Analytics Framework for Exploring Solution Sets of Multi-objective Evolutionary Algorithms | | 合成真菌数据集:一种时间对齐的方法 | A. Rani | PDF | N/A | Synthetic Fungi Datasets: A Time-Aligned Approach | | 用于视频监控应用的大型语言模型 | Ulindu De Silva | PDF | N/A | Large Language Models for Video Surveillance Applications | | HOGSA:基于3D高斯溅射数据增强的双手机-物体交互理解 | Wentian Qu | PDF | N/A | HOGSA: Bimanual Hand-Object Interaction Understanding with 3D Gaussian Splatting Based Data Augmentation | | 基于图的检索增强生成用于动态少样本文本分类 | Yubo Wang | PDF | N/A | Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification | | RAHN:一种基于声誉的沙漏网络用于Web服务QoS预测 | Xia Chen | PDF | N/A | RAHN: A Reputation Based Hourglass Network for Web Service QoS Prediction | | GenIR的基础 | Qingyao Ai | PDF | N/A | Foundations of GenIR | | 通过高效聚合局部特征增强屋顶太阳能电池板的检测 | Kuldeep Kurte | PDF | N/A | Enhanced Rooftop Solar Panel Detection by Efficiently Aggregating Local Features | | 《向前一步,全面优化:面向高效云-端协同设备端推荐的结构化参数化适配》 | Kairui Fu | PDF | N/A | Forward Once for All: Structural Parameterized Adaptation for Efficient Cloud-coordinated On-device Recommendation | | Samba-asr 利用结构化状态空间模型实现的最先进语音识别 | Syed Abdul Gaffar Shakhadri | PDF | N/A | Samba-asr state-of-the-art speech recognition leveraging structured state-space models | | 通用特征引导的零样本类别级物体姿态估计 | Wentian Qu | PDF | N/A | Universal Features Guided Zero-Shot Category-Level Object Pose Estimation | | 随机抽样的语言推理问题揭示了大型语言模型的局限性 | Kavi Gupta | PDF | N/A | Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs | | γ-氨基丁酸(GABA)受体介导的麻醉的蛋白质组学研究 | Jian Jiang | PDF | N/A | Proteomic Learning of Gamma-Aminobutyric Acid (GABA) Receptor-Mediated Anesthesia | | RDD4D:基于4D注意力引导的道路损坏检测与分类 | Asma Alkalbani | PDF | N/A | RDD4D: 4D Attention-Guided Road Damage Detection And Classification | | InpDiffusion: 基于条件扩散模型的图像修复定位 | Kai Wang | PDF | N/A | InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models | | 基于自编码器特征提取的日降水量预测类比预报系统:在香港的应用 | Yee Chun Tsoi | PDF | N/A | Analogue Forecast System for Daily Precipitation Prediction Using Autoencoder Feature Extraction: Application in Hong Kong | | 街道景观店铺招牌识别竞赛第一名解决方案 | Bin Wang | PDF | N/A | First-place Solution for Streetscape Shop Sign Recognition Competition | | 暗黑先知:通过隐藏风格增强和稀疏噪声缓解的归纳时空克里金法 | Zhuoxuan Liang | PDF | N/A | DarkFarseer: Inductive Spatio-temporal Kriging via Hidden Style Enhancement and Sparsity-Noise Mitigation | | AE-NeRF:增强基于事件的神经辐射场以应对非理想条件和更大场景 | Chaoran Feng | PDF | N/A | AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scene | | 利用缓存机制增强终身多智能体路径规划 | Yimin Tang | PDF | N/A | Enhancing Lifelong Multi-Agent Path Finding with Cache Mechanism | | COph100:一个来自“RIDIRP”数据库的婴儿眼底图像配准综合数据集 | Yan Hu | PDF | N/A | COph100: A comprehensive fundus image registration dataset from infants constituting the "RIDIRP" database | | GraphDART:用于高效高级持续性威胁检测的图蒸馏技术 | Saba Fathi Rabooki | PDF | N/A | GraphDART: Graph Distillation for Efficient Advanced Persistent Threat Detection | | InfiFusion:一个通过LLM融合增强跨模型推理的统一框架 | Zhaoyi Yan | PDF | N/A | InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion | | 公平通过匹配 | Kunwoong Kim | PDF | N/A | Fairness Through Matching | | 使用浅层神经网络的线性算子学习的正交贪婪算法 | Ye Lin | PDF | N/A | Orthogonal greedy algorithm for linear operator learning with shallow neural network | | 将文本分段并学习其奖励以改进语言模型中的RLHF | Yueqin Yin | PDF | N/A | Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model | | GLoG-CSUnet:通过可适应的放射组学特征增强视觉Transformer,用于医学图像分割 | Niloufar Eghbali | PDF | N/A | GLoG-CSUnet: Enhancing Vision Transformers with Adaptable Radiomic Features for Medical Image Segmentation | | CCStereo:用于双耳音频生成的视听上下文与对比学习 | Yuanhong Chen | PDF | N/A | CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation | | 基于迁移学习的混合深度卷积模型用于肺癌检测 | Sugandha Saxena | PDF | N/A | Hybrid deep convolution model for lung cancer detection with transfer learning | | 从密集到稀疏:事件响应在提升住宅负荷预测中的应用 | Xin Cao | PDF | N/A | From Dense to Sparse: Event Response for Enhanced Residential Load Forecasting | | ICFNet:用于生存预测的集成跨模态融合网络 | Binyu Zhang | PDF | N/A | ICFNet: Integrated Cross-modal Fusion Network for Survival Prediction | | 学习一种用于参数化动作马尔可夫决策过程的灵活探索模型 | Zijian Wang | PDF | N/A | Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes | | 无监督领域自适应用于抗遮挡人体姿态估计 | Arindam Dutta | PDF | N/A | Unsupervised Domain Adaptation for Occlusion Resilient Human Pose Estimation | | GeAR: 生成增强检索 | Haoyu Liu | PDF | N/A | GeAR: Generation Augmented Retrieval | | WorldPose: 一个用于全球3D人体姿态估计的世界杯数据集 | Tianjian Jiang | PDF | N/A | WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation | | 在有限通信范围约束下的多智能体路径规划:动态引导方法 | Hoang-Dung Bui | PDF | N/A | Multi-Agent Path Finding under Limited Communication Range Constraint via Dynamic Leading | | 提升图神经网络可信度的基于排序的保形训练方法 | Ting Wang | PDF | N/A | Enhancing Trustworthiness of Graph Neural Networks with Rank-Based Conformal Training | | GNNs在多模态故障诊断中是否有效用于微服务系统? | Fei Gao | PDF | N/A | Are GNNs Effective for Multimodal Fault Diagnosis in Microservice Systems? | | 视觉大语言模型在广义和专门应用中的应用 | Yifan Li | PDF | N/A | Visual Large Language Models for Generalized and Specialized Applications | | LDMapNet-U:一个面向城市级车道级地图更新的端到端系统 | Deguo Xia | PDF | N/A | LDMapNet-U: An End-to-End System for City-Scale Lane-Level Map Updating | | 超越 $\mathcal{O}(\sqrt{T})$ 遗憾:在线线性规划中的学习与决策解耦 | Wenzhi Gao | PDF | N/A | Beyond $\mathcal{O}(\sqrt{T})$ Regret: Decoupling Learning and Decision-making in Online Linear Programming | | CHAT:超越对比图变换器用于异质网络中的链路预测 | Shengming Zhang | PDF | N/A | CHAT: Beyond Contrastive Graph Transformer for Link Prediction in Heterogeneous Networks | | MBTSAD:基于令牌分割和注意力蒸馏的语言模型后门缓解方法 | Yidong Ding | PDF | N/A | MBTSAD: Mitigating Backdoors in Language Models Based on Token Splitting and Attention Distillation | | Ultrasound-QBench:大型语言模型能否辅助超声成像的质量评估? | Hongyi Miao | PDF | N/A | Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging? | | 在智能物流中通过集成Transformer和图神经网络(GNN)提升机器人路径优化 | Hao Luo | PDF | N/A | Enhancing Robot Route Optimization in Smart Logistics with Transformer and GNN Integration | | 砖块扩散:通过砖块到墙面的去噪生成长视频 | Yunlong Yuan | PDF | N/A | Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising | | 基于深度卷积随机配置网络的熔镁炉工况可解释性识别 | Li Weitao | PDF | N/A | Interpretable Recognition of Fused Magnesium Furnace Working Conditions with Deep Convolutional Stochastic Configuration Networks | | TARDiS:用于优化多样性与可分离性的文本增强技术 | Kyungmin Kim | PDF | N/A | TARDiS : Text Augmentation for Refining Diversity and Separability | | 整体语义表示用于导航轨迹生成 | Ji Cao | PDF | N/A | Holistic Semantic Representation for Navigational Trajectory Generation | | 序列补充器:通过可学习序列增强变压器在时间序列预测中的应用 | Xiwen Chen | PDF | N/A | Sequence Complementor: Complementing Transformers For Time Series Forecasting with Learnable Sequences | | AFed:算法公平的联邦学习 | Huiqiang Chen | PDF | N/A | AFed: Algorithmic Fair Federated Learning | | OpenGU: 图遗忘综合基准 | Bowen Fan | PDF | N/A | OpenGU: A Comprehensive Benchmark for Graph Unlearning | | 基于树的RAG-Agent推荐系统:医学测试数据案例研究 | Yahe Yang | PDF | N/A | Tree-based RAG-Agent Recommendation System: A Case Study in Medical Test Data | | 创意产业中的人工智能:2025年前的进展 | Nantheera Anantrasirichai | PDF | N/A | Artificial Intelligence in Creative Industries: Advances Prior to 2025 | | 学习具有嵌入潜在转移算子的随机非线性动力学 | Naichang Ke | PDF | N/A | Learning Stochastic Nonlinear Dynamics with Embedded Latent Transfer Operators | | 改进新兴计算范式的数据编码:从随机计算到超维计算 | Mehran Shoushtari Moghadam | PDF | N/A | Improved Data Encoding for Emerging Computing Paradigms: From Stochastic to Hyperdimensional Computing | | KG-CF:在大语言模型指导下的知识图谱补全与上下文过滤 | Zaiyi Zheng | PDF | N/A | KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models | | 强化学习中的视野泛化 | Vivek Myers | PDF | N/A | Horizon Generalization in Reinforcement Learning | | 多级语义感知模型用于AI生成视频质量评估 | Jiaze Li | PDF | N/A | Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment | | 知识蒸馏与自适应权重 | Sirong Wu | PDF | N/A | Knowledge Distillation with Adapted Weight | | 基于后门的水印在神经网络中的持久性:一项全面评估 | Anh Tu Ngo | PDF | N/A | Persistence of Backdoor-based Watermarks for Neural Networks: A Comprehensive Evaluation | | QuIM-RAG:通过逆向问题匹配提升检索增强生成以增强问答性能 | Binita Saha | PDF | N/A | QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance | | 通过先验引导的混合感知方法和水下图像修复的广泛基准分析 | Xiaojiao Guo | PDF | N/A | Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive Benchmark Analysis | | EAGLE:增强视觉基础减少教学多模态模型中的幻觉 | Andrés Villa | PDF | N/A | EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models |