跳转至

Arxiv 2024-09-30 Papers

标题 作者 PDF链接 代码仓库 Title
持续改进移动操作与自主现实世界强化学习 Russell Mendonca PDF N/A Continuously Improving Mobile Manipulation with Autonomous Real-World RL
MM1.5:多模态大语言模型微调中的方法、分析与洞察 Haotian Zhang PDF N/A MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
排名优于评分:迈向可靠且稳健的LLM生成医学解释性论证自动化评估 Iker De la Iglesia PDF N/A Ranking Over Scoring: Towards Reliable and Robust Automated Evaluation of LLM-Generated Medical Explanatory Arguments
DressRecon:从单目视频中自由形式重建4D人体 Jeff Tan PDF N/A DressRecon: Freeform 4D Human Reconstruction from Monocular Video
SpaceMesh:一种用于学习流形表面网格的连续表示 Tianchang Shen PDF N/A SpaceMesh: A Continuous Representation for Learning Manifold Surface Meshes
LaMMA-P:基于语言模型驱动的PDDL规划器实现的多智能体长时任务分配与规划的通用性方法 Xiaopan Zhang PDF N/A LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner
监督多模态裂变学习 Lingchao Mao PDF N/A Supervised Multi-Modal Fission Learning
Uni$^2$Det:用于提示引导的多数据集3D检测的统一通用框架 Yubin Wang PDF N/A Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection
提议、评估、搜索:利用大型语言模型在教学视频中实现目标导向的规划 Md Mohaiminul Islam PDF N/A Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos
逆向绘画:重构绘画过程 Bowei Chen PDF N/A Inverse Painting: Reconstructing The Painting Process
Maia-2:国际象棋中人机对齐的统一模型 Zhenwei Tang PDF N/A Maia-2: A Unified Model for Human-AI Alignment in Chess
实际代码生成中的大型语言模型幻觉:现象、机制与缓解 Ziyao Zhang PDF N/A LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation
罗比·巴特勒:与家用机器人助手的远程多模态互动 Anxing Xiao PDF N/A Robi Butler: Remote Multimodal Interactions with Household Robot Assistant
退火流生成模型:实现高维和多模态分布的采样 Dongze Wu PDF N/A Annealing Flow Generative Model Towards Sampling High-Dimensional and Multi-Modal Distributions
扩展本体感受-视觉学习与异构预训练变压器 Lirui Wang PDF N/A Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers
负责任机器学习在信用评分中的最佳实践 Giovani Valdrighi PDF N/A Best Practices for Responsible Machine Learning in Credit Scoring
端到端保形校准用于不确定性下的优化 Christopher Yeh PDF N/A End-to-End Conformal Calibration for Optimization Under Uncertainty
双编码器生成对抗网络反演用于从单张图像进行高保真3D头部重建 Bahri Batuhan Bilecen PDF N/A Dual Encoder GAN Inversion for High-Fidelity 3D Head Reconstruction from Single Images
形式化验证的物理信息神经控制李雅普诺夫函数 Jun Liu PDF N/A Formally Verified Physics-Informed Neural Control Lyapunov Functions
母语西班牙语中的词义消歧:一个全面的词汇评估资源 Pablo Ortega PDF N/A Word Sense Disambiguation in Native Spanish: A Comprehensive Lexical Evaluation Resource
分布稳健的非动态强化学习的上下界 Zhishuai Liu PDF N/A Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning
加速非极大值抑制:图论视角 King-Siong Si PDF N/A Accelerating Non-Maximum Suppression: A Graph Theory Perspective
SMLE:通过嵌入超近似实现的安全机器学习 Matteo Francobaldi PDF N/A SMLE: Safe Machine Learning via Embedded Overapproximation
基于激光全场测量的主控方程数据驱动发现的综合WSINDy方法 Abigail C. Schmid PDF N/A Ensemble WSINDy for Data Driven Discovery of Governing Equations from Laser-based Full-field Measurements
营养视野:智能医疗中的自动饮食管理系统 Madhumita Veeramreddy PDF N/A NUTRIVISION: A System for Automatic Diet Management in Smart Healthcare
基于日志的异常检测需要哪些信息?可配置Transformer方法的见解 Xingfang Wu PDF N/A What Information Contributes to Log-based Anomaly Detection? Insights from a Configurable Transformer-Based Approach
拼贴画:利用分层潜在扩散和语言模型生成协作式人机交互 Divyanshu Daiya PDF N/A COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models
FreeMask: 重新思考注意力掩码在零样本视频编辑中的重要性 Lingling Cai PDF N/A FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing
通过知识蒸馏、多任务学习和数据增强提升罗马尼亚语攻击性语言检测 Vlad-Cristian Matei PDF N/A Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation
预算约束下的在线决策延迟 Mirabel Reid PDF N/A Online Decision Deferral under Budget Constraints
使用拉普拉斯神经流形的“什么”乘以“何时”工作记忆表征 Aakash Sarkar PDF N/A "What" x "When" working memory representations using Laplace Neural Manifolds
RecSys Challenge 2024:在新闻推荐中平衡准确性与编辑价值观 Johannes Kruse PDF N/A RecSys Challenge 2024: Balancing Accuracy and Editorial Values in News Recommendations
IRFusionFormer:通过RGB-T融合和基于拓扑的损失增强路面裂缝分割 Ruiqiang Xiao PDF N/A IRFusionFormer: Enhancing Pavement Crack Segmentation with RGB-T Fusion and Topological-Based Loss
持续人体姿态估计用于增量集成关键点和姿态变化 Muhammad Saif Ullah Khan PDF N/A Continual Human Pose Estimation for Incremental Integration of Keypoints and Pose Variations
一个针对越南社交媒体机器词汇规范化的弱监督数据标注框架 Dung Ha Nguyen PDF N/A A Weakly Supervised Data Labeling Framework for Machine Lexical Normalization in Vietnamese Social Media
跨领域自动文本简化的西班牙语语言资源 Antonio Moreno-Sandoval PDF N/A Language Resources in Spanish for Automatic Text Simplification across Domains
教师嵌入的线性投影用于少类蒸馏 Noel Loo PDF N/A Linear Projections of Teacher Embeddings for Few-Class Distillation
POMONAG:帕累托最优多目标神经架构生成器 Eugenio Lomurno PDF N/A POMONAG: Pareto-Optimal Many-Objective Neural Architecture Generator
实例自适应的零样本思维链提示 Xiaosong Yuan PDF N/A Instance-adaptive Zero-shot Chain-of-Thought Prompting
面对模糊性的乐观原则在多臂老虎机问题中的应用 Mengmeng Li PDF N/A Optimism in the Face of Ambiguity Principle for Multi-Armed Bandits
QA编码器:面向问答系统中的对齐表示学习 Zhengren Wang PDF N/A QAEncoder: Towards Aligned Representation Learning in Question Answering System
多层Picard逼近与具有ReLU、leaky ReLU和softplus激活的深度神经网络在$L^p$意义下克服了维度诅咒,当逼近半线性抛物型偏微分方程时。 Ariel Neufeld PDF N/A Multilevel Picard approximations and deep neural networks with ReLU, leaky ReLU, and softplus activation overcome the curse of dimensionality when approximating semilinear parabolic partial differential equations in $L^p$-sense
HELPD:通过分层反馈学习与视觉增强惩罚解码来减轻大型视觉语言模型的幻觉 Fan Yuan PDF N/A HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding
从fMRI解码视觉回声:过去语义信息的记忆解构 Runze Xia PDF N/A Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information
充分必要解释(及其间的区别) Beepul Bharti PDF N/A Sufficient and Necessary Explanations (and What Lies in Between)
导航威胁:自动驾驶车辆中激光雷达感知系统的物理对抗攻击调查 Amira Guesmi PDF N/A Navigating Threats: A Survey of Physical Adversarial Attacks on LiDAR Perception Systems in Autonomous Vehicles
世界到代码:通过自我指导的组合字幕和过滤实现多模态数据生成 Jiacong Wang PDF N/A World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering
从贝叶斯决策理论的角度来看的流级流量匹配 Ganchao Wei PDF N/A Stream-level flow matching from a Bayesian decision theoretic perspective
基于人工智能的全自动分析儿童高度近视视网膜血管形态 Yinzheng Zhao PDF N/A AI-Based Fully Automatic Analysis of Retinal Vascular Morphology in Pediatric High Myopia
KANDU-Net:一种结合KAN的双通道U-Net用于医学图像分割 Chenglin Fang PDF N/A KANDU-Net:A Dual-Channel U-Net with KAN for Medical Image Segmentation
LHC中的新型机器学习应用 Javier M. Duarte PDF N/A Novel machine learning applications at the LHC
连续治疗剂量反应模型的共形预测 Jarne Verhaeghe PDF N/A Conformal Prediction for Dose-Response Models with Continuous Treatments
物理正则化的多模态图像同化用于脑肿瘤定位 Michal Balcerak PDF N/A Physics-Regularized Multi-Modal Image Assimilation for Brain Tumor Localization
开源眼周分割数据集,适用于眼科应用 George R. Nahass PDF N/A Open-Source Periorbital Segmentation Dataset for Ophthalmic Applications
加速边缘设备上的PoT量化 Rappy Saha PDF N/A Accelerating PoT Quantization on Edge Devices
AUCSeg:面向AUC的像素级长尾语义分割 Boyu Han PDF N/A AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation
反刻板印象的预测文本建议并不能可靠地产生反刻板印象的写作 Connor Baumler PDF N/A Anti-stereotypical Predictive Text Suggestions Do Not Reliably Yield Anti-stereotypical Writing
等等,但泰诺就是对乙酰氨基酚... 探究并提升语言模型对错误信息请求的抵抗力 Shan Chen PDF N/A Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation
FireLite:利用迁移学习在资源受限环境下实现高效火灾检测 Mahamudul Hasan PDF N/A FireLite: Leveraging Transfer Learning for Efficient Fire Detection in Resource-Constrained Environments
超越PINNs的衍生病理学:变量分裂策略与收敛性分析 Yesom Park PDF N/A Beyond Derivative Pathology of PINNs: Variable Splitting Strategy with Convergence Analysis
跨语言TTS系统的逐词语调模型 Tomilov A. A. PDF N/A Word-wise intonation model for cross-language TTS systems
非平稳时间序列预测的频率自适应归一化 Weiwei Ye PDF N/A Frequency Adaptive Normalization For Non-stationary Time Series Forecasting
完美融合:重新定义RLHF与评委组合 Tengyu Xu PDF N/A The Perfect Blend: Redefining RLHF with Mixture of Judges
通过任务驱动的表示解开新加坡英语话语粒子 Linus Tze En Foo PDF N/A Disentangling Singlish Discourse Particles with Task-Driven Representation
VideoINSTA:通过与LLMs进行信息丰富的时空推理实现零样本长视频理解 Ruotong Liao PDF N/A VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs
使用大型语言模型在边缘设备上进行高效驾驶行为叙述与推理 Yizhou Huang PDF N/A Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models
旋转运行时平滑:无需训练的激活平滑器,用于精确的INT4推理 Ke Yi PDF N/A Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference
CableInspect-AD:一个专家标注的异常检测数据集 Akshatha Arodi PDF N/A CableInspect-AD: An Expert-Annotated Anomaly Detection Dataset
国家癌症研究所影像数据中心中乳腺癌、脑癌、肝癌、肺癌和前列腺癌数据集的AI生成注释 Gowtham Krishnan Murugesan PDF N/A AI generated annotations for Breast, Brain, Liver, Lungs and Prostate cancer collections in National Cancer Institute Imaging Data Commons
基于对比学习的GAN多阶段渐进微调SNN与基于RL的外部优化增强 Osama Mustafa PDF N/A Enhancing GANs with Contrastive Learning-Based Multistage Progressive Finetuning SNN and RL-Based External Optimization
魔鬼在细节中:面向局部的3D腹部CT体积生成用于自监督器官分割 Yuran Wang PDF N/A Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation
在联邦学习中微调个性化以缓解对抗性客户端 Youssef Allouah PDF N/A Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients
MARLadona -- 利用多智能体强化学习实现协作团队游戏 Zichong Li PDF N/A MARLadona -- Towards Cooperative Team Play Using Multi-Agent Reinforcement Learning
旧优化器,新规范:文集 Jeremy Bernstein PDF N/A Old Optimizer, New Norm: An Anthology
提示:头戴式以自我为中心的数据集,用于盲人辅助系统中的轨迹预测 Yasaman Haghighi PDF N/A HEADS-UP: Head-Mounted Egocentric Dataset for Trajectory Prediction in Blind Assistance Systems
通过内部声学模型训练和双重空白阈值提升基于混合自回归转换器的自动语音识别 Takafumi Moriya PDF N/A Boosting Hybrid Autoregressive Transducer-based ASR with Internal Acoustic Model Training and Dual Blank Thresholding
SSM 是从多元时间序列聚合而成的 Haixiang Wu PDF N/A A SSM is Polymerized from Multivariate Time Series
在评估语言模型中的行为时,是否存在迫在眉睫的复制危机?证据与解决方案 Laurène Vaugrante PDF N/A A Looming Replication Crisis in Evaluating Behavior in Language Models? Evidence and Solutions
OM4OV:利用本体匹配进行本体版本管理 Zhangcheng Qiang PDF N/A OM4OV: Leveraging Ontology Matching for Ontology Versioning
无对齐训练用于基于转换器模型的多说话人自动语音识别 Takafumi Moriya PDF N/A Alignment-Free Training for Transducer-based Multi-Talker ASR
个人化大型语言模型(PersonalLLM):根据个人偏好定制大型语言模型 Thomas P. Zollo PDF N/A PersonalLLM: Tailoring LLMs to Individual Preferences
通过弱少样本监督学习提示来自动化MedSAM Mélanie Gaillochet PDF N/A Automating MedSAM by Learning Prompts with Weak Few-Shot Supervision
分布式神经辐射场学习用于协作多机器人感知 Hongrui Zhao PDF N/A Distributed NeRF Learning for Collaborative Multi-Robot Perception
LexEval:一个全面的中文法律基准,用于评估大型语言模型 Haitao Li PDF N/A LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
利用CAM算法解释医学语义分割 Tillmann Rheude PDF N/A Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation
通过双向对齐匹配立体视频 Junpeng Jing PDF N/A Match Stereo Videos via Bidirectional Alignment
2024年OOD-CV研讨会SSB挑战赛(开放集识别赛道)解决方案 Mingxu Feng PDF N/A Solution for OOD-CV Workshop SSB Challenge 2024 (Open-Set Recognition Track)
大规模主动神经映射 Zijia Kuang PDF N/A Active Neural Mapping at Scale
带离散和连续随机变量的概率答案集编程 Damiano Azzolini PDF N/A Probabilistic Answer Set Programming with Discrete and Continuous Random Variables
真实世界治疗场景中的松散社交互动识别 Abid Ali PDF N/A Loose Social-Interaction Recognition in Real-world Therapy Scenarios
一阶系统最小二乘神经网络 Joost A. A. Opschoor PDF N/A First Order System Least Squares Neural Networks
计算机辅助中风康复治疗:系统综述与元分析 Stanley Mugisha. Mirko Job. Matteo Zoppi PDF N/A Computer-mediated therapies for stroke rehabilitation: a systematic review and meta-Analysis
学习将存在量化的目标具体化 Martin Funkquist PDF N/A Learning to Ground Existentially Quantified Goals
从多目标强化学习中的演示推断偏好 Junlin Lu PDF N/A Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learning
PerCo(SD):开放感知压缩 Nikolai Körber PDF N/A PerCo (SD): Open Perceptual Compression
使用SAM生成的标注进行医学图像分割 Iira Häkkinen PDF N/A Medical Image Segmentation with SAM-generated Annotations
大型语言模型在天文学研究演进中扮演何种角色? Morgan Fouesneau PDF N/A What is the Role of Large Language Models in the Evolution of Astronomy Research?
通过端到端学习控制7T下3D FSE的锐度、信噪比和比吸收率 Peter Dawood PDF N/A Controlling sharpness, SNR and SAR for 3D FSE at 7T by end-to-end learning
随机特征优于线性模型:尖峰协方差数据中强输入-标签相关性的影响 Samet Demir PDF N/A Random Features Outperform Linear Models: Effect of Strong Input-Label Correlation in Spiked Covariance Data
移动边缘计算中稳定大型语言模型训练的资源分配 Chang Liu PDF N/A Resource Allocation for Stable LLM Training in Mobile Edge Computing
分析零样本可读性控制的句子简化 Abdullah Barayan PDF N/A Analysing Zero-Shot Readability-Controlled Sentence Simplification
PsyGUARD:心理咨询中用于自杀检测和风险评估的自动化系统 Huachuan Qiu PDF N/A PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling
课堂启发式多导师知识蒸馏与自适应学习策略 Shalini Sarode PDF N/A Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies
铝硅酸盐熔体粘度的一般机器学习模型及其在干燥熔岩行星表面性质中的应用 Charles Le Losq PDF N/A A general machine learning model of aluminosilicate melt viscosity and its application to the surface properties of dry lava planets
评估预测的蛋白质-配体构象的相互作用恢复情况 David Errington PDF N/A Assessing interaction recovery of predicted protein-ligand poses
GTransPDM:一种用于行人穿越意图预测的图嵌入变换器,具有位置解耦功能 Chen Xie PDF N/A GTransPDM: A Graph-embedded Transformer with Positional Decoupling for Pedestrian Crossing Intention Prediction
超越提示:大型语言模型的动态对话基准测试 David Castillo-Bolado PDF N/A Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models
注意GAP:基于一瞥的主动感知提升了视觉推理的泛化能力和样本效率 Oleh Kolner PDF N/A Mind the GAP: Glimpse-based Active Perception improves generalization and sample efficiency of visual reasoning
利用无异常区域约束异常检测 Maximilian Toller PDF N/A Constraining Anomaly Detection with Anomaly-Free Regions
SetPINNs:基于集合的物理信息神经网络 Mayank Nagda PDF N/A SetPINNs: Set-based Physics-informed Neural Networks
学科划分?基于半自动化方法的网络性别歧视和厌女症量化系统文献综述 Aditi Dutta PDF N/A Divided by discipline? A systematic literature review on the quantification of online sexism and misogyny using a semi-automated approach
AfriHuBERT:一种针对非洲语言的自监督语音表示模型 Jesujoba O. Alabi PDF N/A AfriHuBERT: A self-supervised speech representation model for African languages
UIR-LoRA:通过多重低秩适应实现通用图像修复 Cheng Zhang PDF N/A UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation
旋律是你生成音乐所需的一切 Shaopeng Wei PDF N/A Melody Is All You Need For Music Generation
利用纵向视网膜OCT中的平行超平面预测疾病进展 Arunava Chakravarty PDF N/A Forecasting Disease Progression with Parallel Hyperplanes in Longitudinal Retinal OCT
工厂操作员对认知助手用于知识共享的看法:挑战、风险及对工作的影响 Samuel Kernan Freire PDF N/A Factory Operators' Perspectives on Cognitive Assistants for Knowledge Sharing: Challenges, Risks, and Impact on Work
任务复杂性:一个用于任务复杂性分类的数据集,包含上下文学习、FLAN-T5 和 GPT-4 基准测试 Areeg Fahad Rasheed PDF N/A TaskComplexity: A Dataset for Task Complexity Classification with In-Context Learning, FLAN-T5 and GPT-4o Benchmarks
在缺乏真实情况下的基于马尔可夫和最小边数选择DAG模型 Joseph D. Ramsey PDF N/A Choosing DAG Models Using Markov and Minimal Edge Count in the Absence of Ground Truth
参考可信解码:一种无需训练的增强大语言模型范式 Luohe Shi PDF N/A Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
通过多模态表示学习预测肺癌生存率 Aiman Farooq PDF N/A Survival Prediction in Lung Cancer through Multi-Modal Representation Learning
集合卡尔曼扩散引导:一种无导数的逆问题求解方法 Hongkai Zheng PDF N/A Ensemble Kalman Diffusion Guidance: A Derivative-free Method for Inverse Problems
使用GPT-2建模自然阅读的认知过程 Bruno Bianchi PDF N/A Modelando procesos cognitivos de la lectura natural con GPT-2
ILeSiA:从摄像头输入中进行情境意识的交互式学习 Petr Vanc PDF N/A ILeSiA: Interactive Learning of Situational Awareness from Camera Input
利用高度差图像的无注释路缘检测 Fulong Ma PDF N/A Annotation-Free Curb Detection Leveraging Altitude Difference Image
利用大型多模态模型从多媒体问题信息中提取知识追踪的知识组件 Hyeongdon Moon PDF N/A Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information
面向任务的预训练用于可行驶区域检测 Fulong Ma PDF N/A Task-Oriented Pre-Training for Drivable Area Detection
德国的事实与欺骗有多纠缠不清? Aswathy Velutharambath PDF N/A How Entangled is Factuality and Deception in German?
擦除,然后重绘:一种使用扩散模型进行自由空间检测的新型数据增强方法 Fulong Ma PDF N/A Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model
MemSim:一种用于评估基于LLM的个人助手记忆能力的贝叶斯模拟器 Zeyu Zhang PDF N/A MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants
ASTRA:基于精确且可扩展的近似最近邻搜索的极端分类器训练方法 Sonu Mehta PDF N/A ASTRA: Accurate and Scalable ANNS-based Training of Extreme Classifiers
1万亿代币(1TT)平台:一种用于大型语言模型中高效数据共享和补偿的创新框架 Chanjun Park PDF N/A 1 Trillion Token (1TT) Platform: A Novel Framework for Efficient Data Sharing and Compensation in Large Language Models
非英语语言环境下小规模不平衡数据集的放射学文本分类 Vincent Beliveau PDF N/A Classification of Radiological Text in Small and Imbalanced Datasets in a Non-English Language
VMAD:用于零样本异常检测的视觉增强多模态大语言模型 Huilin Deng PDF N/A VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection
RISE-SDF:一种用于光泽物体逆向渲染的可重新照明的信息共享符号距离场 Deheng Zhang PDF N/A RISE-SDF: a Relightable Information-Shared Signed Distance Field for Glossy Object Inverse Rendering
通过自然输入梯度表征模型鲁棒性 Adrián Rodríguez-Muñoz PDF N/A Characterizing Model Robustness via Natural Input Gradients
神经网络的约束引导模型量化 Quinten Van Baelen PDF N/A Constraint Guided Model Quantization of Neural Networks
使用计算机视觉模型分割木材腐烂 Roland Kammerbauer PDF N/A Segmenting Wood Rot using Computer Vision Models
使用领域覆盖增强对LLMs进行联邦指令微调 Zezhou Wang PDF N/A Federated Instruction Tuning of LLMs with Domain Coverage Augmentation
机器学习在玻璃瓶印刷工业质量控制中的应用 Maximilian Bundscherer PDF N/A Machine Learning in Industrial Quality Control of Glass Bottle Prints
重新评估归纳链接预测 Simon Ott PDF N/A Reevaluation of Inductive Link Prediction
PuzzleBoard:一种带有位置编码的新型相机标定图案 Peer Stelldinger PDF N/A PuzzleBoard: A New Camera Calibration Pattern with Position Encoding
DCAST:多样化的类感知自训练减轻选择偏差,促进更公平的学习 Yasin I. Tepeli PDF N/A DCAST: Diverse Class-Aware Self-Training Mitigates Selection Bias for Fairer Learning
为商业面包店训练计算机视觉模型,主要使用合成图像 Thomas H. Schmitt PDF N/A Training a Computer Vision Model for Commercial Bakeries with Primarily Synthetic Images
ACE:高效通信的抽象 Jonathan D. Thomas PDF N/A ACE: Abstractions for Communicating Efficiently
用于天气预报的掩码自回归模型 Doyi Kim PDF N/A Masked Autoregressive Model for Weather Forecasting
REST-HANDS:利用智能眼镜进行以自我为中心的视觉康复,用于中风后手部治疗 Wiktor Mucha PDF N/A REST-HANDS: Rehabilitation with Egocentric Vision Using Smartglasses for Treatment of Hands after Surviving Stroke
CBAM-SwinT-BL:基于带块级CBAM增强的Swin Transformer的小型轨道表面检测方法 Jiayi Zhao PDF N/A CBAM-SwinT-BL: Small Rail Surface Detect Detection Method Based on Swin Transformer with Block Level CBAM Enhancement
学习发现普遍的面部表情 Tingzhang Luo PDF N/A Learning to Discover Generalized Facial Expressions
对极大型语言模型进行激进的训练后压缩 Zining Zhang PDF N/A Aggressive Post-Training Compression on Extremely Large Language Models
不规则时间序列预测的连续时间线性位置嵌入 Byunghyun Kim PDF N/A Continuous-Time Linear Positional Embedding for Irregular Time Series Forecasting
通过拒绝功能对抗训练实现鲁棒的大型语言模型保护 Lei Yu PDF N/A Robust LLM safeguarding via refusal feature adversarial training
从对流许可模拟的垂直剖面推断雷暴发生:物理深度学习模型的物理洞察 Kianusch Vahid Yousefnia PDF N/A Inferring Thunderstorm Occurrence from Vertical Profiles of Convection-Permitting Simulations: Physical Insights from a Physical Deep Learning Model
SurgPETL:用于手术阶段识别的参数高效图像到手术视频迁移学习 Shu Yang PDF N/A SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition
ProFD:遮挡行人重识别的提示引导特征解耦 Can Cui PDF N/A ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification
BSharedRAG:电子商务领域中骨干共享的检索增强生成 Kaisi Guan PDF N/A BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain
全图表示学习用于符号网络分类 Noé Cecillon PDF N/A Whole-Graph Representation Learning For the Classification of Signed Networks
我们能否打破鲁棒多智能体强化学习中的多机构诅咒? Laixi Shi PDF N/A Can We Break the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning?
利用无监督认知进行知识发现 Alfredo Ibias PDF N/A Knowledge Discovery using Unsupervised Cognition
Q-Bench-视频:评估大型多模态模型对视频质量的理解能力 Zicheng Zhang PDF N/A Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs
用于脑瘫检测的轻量级神经架构搜索 Felix Tempel PDF N/A Lightweight Neural Architecture Search for Cerebral Palsy Detection
偏好对齐是否总是提升基于大语言模型翻译的最佳选择?一项实证分析 Hippolyte Gisserot-Boukhlef PDF N/A Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis
推荐系统中的神经点击模型 Mikhail Shirokikh PDF N/A Neural Click Models for Recommender Systems
评估和解释零样本跨语言新闻情感分析的训练策略 Luka Andrenšek PDF N/A Evaluating and explaining training strategies for zero-shot cross-lingual news sentiment analysis
《高达:将大型语言模型与图理解相结合》 Sheng Ouyang PDF N/A GUNDAM: Aligning Large Language Models with Graph Understanding
减轻大型语言模型在推荐系统中的倾向性偏差 Guixian Zhang PDF N/A Mitigating Propensity Bias of Large Language Models for Recommender Systems
使用基于Transformer的模型和辅助特征进行社交媒体帖子中的抑郁检测 Marios Kerasiotis PDF N/A Depression detection in social media posts using transformer-based models and auxiliary features
OPONeRF:用于鲁棒神经渲染的One-Point-One NeRF Yu Zheng PDF N/A OPONeRF: One-Point-One NeRF for Robust Neural Rendering
超越分数:基于模块化RAG的自动简答题评分与反馈系统 Menna Fateen PDF N/A Beyond Scores: A Modular RAG-Based System for Automatic Short Answer Scoring with Feedback
使用准直仪系统进行相机标定 Shunkun Liang PDF N/A Camera Calibration using a Collimator System
电动交通时代的燃油税损失:拥堵收费的机遇之窗 Thi Ngoc Nguyen PDF N/A Fuel tax loss in a world of electric mobility: A window of opportunity for congestion pricing
视觉上下文窗口扩展:长视频理解的新视角 Hongchen Wei PDF N/A Visual Context Window Extension: A New Perspective for Long Video Understanding
通过动态策略融合实现个性化 Ajsal Shereef Palattuparambil PDF N/A Personalisation via Dynamic Policy Fusion
利用物理驱动的神经网络在数字全息显微镜中实现生物细胞三维形态的单次重建 Jihwan Kim PDF N/A Single-shot reconstruction of three-dimensional morphology of biological cells in digital holographic microscopy using a physics-driven neural network
面向不完整数据的多模态情感分析的鲁棒性研究 Haoyu Zhang PDF N/A Towards Robust Multimodal Sentiment Analysis with Incomplete Data
使用大型语言模型进行定制化信息与领域中心知识图谱构建 Frank Wawrzik PDF N/A Customized Information and Domain-centric Knowledge Graph Construction with Large Language Models
开发无需语音指令调优数据的指令遵循语音语言模型 Ke-Han Lu PDF N/A Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data
基于形状特征距离度量的模型选择方法在时间序列分类中的多源迁移学习 Jiseok Lee PDF N/A Model Selection with a Shapelet-based Distance Measure for Multi-source Transfer Learning in Time Series Classification
数值鲁棒的无状态增强定点平滑 Nicholas Krämer PDF N/A Numerically Robust Fixed-Point Smoothing Without State Augmentation
使用单张人脸图像的多模态生物识别技术 Koichi Ito PDF N/A Multibiometrics Using a Single Face Image
影响力函数在大语言模型上有效吗? Zhe Li PDF N/A Do Influence Functions Work on Large Language Models?
缓解大型语言模型中的后门威胁:进展与挑战 Qin Liu PDF N/A Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges
大规模指纹质量与人口统计学操作研究 Javier Galbally PDF N/A A large-scale operational study of fingerprint quality and demographics
鲁棒多视角共表达网络推断 Teodora Pandeva PDF N/A Robust Multi-view Co-expression Network Inference
预测性语音识别与话语结束检测:面向口语对话系统 Oswald Zink PDF N/A Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems
RoCoTex:一种基于扩散模型的稳健一致性纹理合成方法 Jangyeong Kim PDF N/A RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models
OccRWKV:重新思考具有线性复杂度的3D语义占用预测的高效性 Junming Wang PDF N/A OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity
GearTrack:自动化6D姿态估计 Yu Deng PDF N/A GearTrack: Automating 6D Pose Estimation
竞赛:一种用于语言模型中跨度概率一致性测试的框架 Eitan Wagner PDF N/A CONTESTS: a Framework for Consistency Testing of Span Probabilities in Language Models
TS检测器:用于结肠镜视频检测的时间-空间自校正协同学习 Kaini Wang PDF N/A TSdetector: Temporal-Spatial Self-correction Collaborative Learning for Colonoscopy Video Detection
增强基于LLM的推荐模型中的高阶交互感知 Xinfeng Wang PDF N/A Enhancing High-order Interaction Awareness in LLM-based Recommender Model
Violina:线性时不变非马尔可夫动力学的多轨迹识别 Ryoji Anzaki PDF N/A Violina: Various-of-trajectories Identification of Linear Time-invariant Non-Markovian Dynamics
通过归一化流进行知识图谱嵌入 Changyi Xiao PDF N/A Knowledge Graph Embedding by Normalizing Flows
学习带有深度并行神经算子的偏微分方程 Qinglong Ma PDF N/A Learning Partial Differential Equations with Deep Parallel Neural Operators
通过奖励样本的转移在多臂老虎机任务中利用相邻相似性 NR Rahul PDF N/A Exploiting Adjacent Similarity in Multi-Armed Bandit Tasks via Transfer of Reward Samples
DAOcc:3D物体检测辅助的多传感器融合用于3D占用预测 Zhen Yang PDF N/A DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction
磁力:我们从未了解过文本到图像扩散模型的工作原理,直到我们掌握了视觉语言模型的运作机制。 Chenyi Zhuang PDF N/A Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function
基于变分自编码器的交互式动态影响图解决方案 Yinghui Pan PDF N/A Variational Auto-encoder Based Solutions to Interactive Dynamic Influence Diagrams
对《对抗投毒攻击的隐私增强联邦学习》的评论 Thomas Schneider PDF N/A Comments on "Privacy-Enhanced Federated Learning Against Poisoning Adversaries"
用于牛乳头图像健康状况分类的自注意力残差卷积神经网络 Minghao Wang PDF N/A A Self-attention Residual Convolutional Neural Network for Health Condition Classification of Cow Teat Images
多模态大语言模型增强的跨语言跨模态检索 Yabing Wang PDF N/A Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval