
arXiv 2024-12-16 Papers

Chinese Title Author PDF Link Code Repository English Title
MaxInfoRL:通过信息增益最大化提升强化学习中的探索能力 Bhavya Sukhija PDF N/A MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization
PanSplat:使用前馈高斯溅射的4K全景合成 Cheng Zhang PDF N/A PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
因果扩散变换器用于生成建模 Chaorui Deng PDF N/A Causal Diffusion Transformers for Generative Modeling
SepLLM:通过将一段压缩为一个分隔符来加速大型语言模型 Guoxuan Chen PDF N/A SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
CAP4D:利用可变形的多视角扩散模型创建可动画化的4D肖像化身 Felix Taubner PDF N/A CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models
无需再调参:基于拉格朗日微分乘子法的优先级多任务学习 Zhengxing Cheng PDF N/A No More Tuning: Prioritized Multi-Task Learning with Lagrangian Differential Multiplier Methods
Wonderland:从单张图像导航3D场景 Hanwen Liang PDF N/A Wonderland: Navigating 3D Scenes from a Single Image
在可微分多物理模拟中稳定强化学习 Eliot Xing PDF N/A Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
通过观察事物如何移动来进行基于指令的图像处理 Mingdeng Cao PDF N/A Instruction-based Image Manipulation by Watching How Things Move
IDArb:面向任意数量输入视图和光照的本征分解 Zhibing Li PDF N/A IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
UniLoc:迈向使用任意单一模态的通用地点识别 Yan Xia PDF N/A UniLoc: Towards Universal Place Recognition Using Any Single Modality
CPath-Omni:一种用于计算病理学中图像块和全切片图像分析的统一多模态基础模型 Yuxuan Sun PDF N/A CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology
CG-Bench:面向长视频理解的线索引导问答基准测试 Guo Chen PDF N/A CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
使用自回归Transformer外推喷注辐射 Anja Butter PDF N/A Extrapolating Jet Radiation with Autoregressive Transformers
让FETCH!发生:通过常见栖息地发现新兴的“狗哨” Kuleen Sasse PDF N/A Making FETCH! Happen: Finding Emergent Dog Whistles Through Common Habitats
SPADE:使用解析式、无需数据的增强框架进行光谱光声去噪 Fangzhou Lin PDF N/A SPADE: Spectroscopic Photoacoustic Denoising using an Analytical and Data-free Enhancement Framework
启示录:具有Omega-正则目标的可判定POMDP类 Marius Belly PDF N/A Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives
半自动化的音频录课分析:以教师激励性信息为例 Samuel Falcon PDF N/A Semi-automated analysis of audio-recorded lessons: The case of teachers' engaging messages
基于虚拟代理的沟通技能培训以促进同伴间的健康说服 Farnaz Nouraei PDF N/A Virtual Agent-Based Communication Skills Training to Facilitate Health Persuasion Among Peers
探索领域泛化语义分割中的语义一致性与风格多样性 Hongwei Niu PDF N/A Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation
双层学习与不精确随机梯度 Mohammad Sadegh Salehi PDF N/A Bilevel Learning with Inexact Stochastic Gradients
一张LoRA抵得上千张图片 Chenxi Liu PDF N/A A LoRA is Worth a Thousand Pictures
交通系统中的人工智能 Ritwik Raj Saxena PDF N/A Artificial Intelligence in Traffic Systems
人工智能辅助对放射学报告的影响:使用模拟AI草稿报告的初步研究 Julián N. Acosta PDF N/A The Impact of AI Assistance on Radiology Reporting: A Pilot Study Using Simulated AI Draft Reports
语言模型在抽象摘要中的隐私性如何? Anthony Hughes PDF N/A How Private are Language Models in Abstractive Summarization?
大语言模型提示能否作为漏洞检测中静态分析的代理 Ira Ceka PDF N/A Can LLM Prompting Serve as a Proxy for Static Analysis in Vulnerability Detection
用于冷启动切割平面分离器配置的大型语言模型 Connor Lawless PDF N/A LLMs for Cold-Start Cutting Plane Separator Configuration
LeARN:系统辨识中非线性动力学的可学习与自适应表示 Arunabh Singh PDF N/A LeARN: Learnable and Adaptive Representations for Nonlinear Dynamics in System Identification
融合热力学信息的图神经网络用于人体数字孪生的实时仿真 Lucas Tesán PDF N/A Thermodynamics-informed graph neural networks for real-time simulation of digital human twins
FSFM:通过自监督面部表示学习实现的可泛化人脸安全基础模型 Gaojian Wang PDF N/A FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning
RepFace:通过渐进式标签校正优化闭集噪声以提升人脸识别 Jie Zhang PDF N/A RepFace: Refining Closed-Set Noise with Progressive Label Correction for Face Recognition
具有保证收敛性的内存减少元学习 Honglin Yang PDF N/A Memory-Reduced Meta-Learning with Guaranteed Convergence
学习在具有新颖布局的迷宫中导航,利用抽象俯视地图 Linfeng Zhao PDF N/A Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
基于深度学习的上肢轨迹个体运动特征识别及其在疾病阶段评估中的应用 Tim Sziburis PDF N/A Deep-learning-based identification of individual motion characteristics from upper-limb trajectories towards disorder stage evaluation
深度对比表示学习的泛化分析 Nong Minh Hieu PDF N/A Generalization Analysis for Deep Contrastive Representation Learning
SpeechPrune:面向语音信息检索的上下文感知令牌剪枝 Yueqian Lin PDF N/A SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval
面向企业系统的智能体式AI驱动技术故障排查:一种新颖的加权检索增强生成范式 Rajat Khanda PDF N/A Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm
大型语言模型(LLMs)中的开源优势 Jiya Manchanda PDF N/A The Open Source Advantage in Large Language Models (LLMs)
LLM-RG4:在多样输入情境下灵活且基于事实的放射报告生成 Zhuhao Wang PDF N/A LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts
CP-Guard:协作鸟瞰感知中的恶意代理检测与防御 Senkang Hu PDF N/A CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird's Eye View Perception
SAMIC:通过上下文空间提示工程进行任意分割 Savinay Nagendra PDF N/A SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
将大型语言模型与辅导系统智能相结合:一项关于护理人员家庭作业支持的案例研究 Devika Venugopalan PDF N/A Combining Large Language Models with Tutoring System Intelligence: A Case Study in Caregiver Homework Support
公平防护盾:防范偏见决策者 Filip Cano PDF N/A Fairness Shields: Safeguarding against Biased Decision Makers
ExecRepoBench:多层次可执行代码补全评估 Jian Yang PDF N/A ExecRepoBench: Multi-level Executable Code Completion Evaluation
SciFaultyQA:使用受GAN启发的合成数据集生成方法,在错误科学问题检测方面对大型语言模型(LLMs)进行基准测试 Debarshi Kundu PDF N/A SciFaultyQA: Benchmarking LLMs on Faulty Science Question Detection with a GAN-Inspired Approach to Synthetic Dataset Generation
Speak & Improve Corpus 2025:一个用于语言评估和反馈的第二语言英语语音语料库 Kate Knill PDF N/A Speak & Improve Corpus 2025: an L2 English Speech Corpus for Language Assessment and Feedback
Speak & Improve Challenge 2025:任务与基线系统 Mengjie Qian PDF N/A Speak & Improve Challenge 2025: Tasks and Baseline Systems
使用大型语言模型进行成本效益高的无标签节点分类 Taiyan Zhang PDF N/A Cost-Effective Label-free Node Classification with LLMs
用于研究电荷密度波粗粒化动力学的回声状态网络 Clement Dinh PDF N/A Echo State network for coarsening dynamics of charge density waves
使用机器学习进行工业规模的水泥熟料相预测 Sheikh Junaid Fayaz PDF N/A Industrial-scale Prediction of Cement Clinker Phases using Machine Learning
AlphaZero神经网络扩展与齐夫定律:棋盘游戏与幂律的故事 Oren Neumann PDF N/A AlphaZero Neural Scaling and Zipf's Law: a Tale of Board Games and Power Laws
语音基础模型与众包结合,实现高效、高质量的数据收集 Beomseok Lee PDF N/A Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection
Emma-X:一种具有接地思维链和前瞻性空间推理的具身多模态动作模型 Qi Sun PDF N/A Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
经过优化以预测基于卫星的降水观测结果的神经大气环流模型 Janni Yuval PDF N/A Neural general circulation models optimized to predict satellite-based precipitation observations
利用基于合成数据的单步扩散模型实现可控阴影生成 Onur Tasar PDF N/A Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data
DARWIN 1.5:将大型语言模型作为材料科学的适应性学习者 Tong Xie PDF N/A DARWIN 1.5: Large Language Models as Materials Science Adapted Learners
柴油发动机的数字孪生:结合迁移学习的融合算子物理信息神经网络用于发动机健康监测 Kamaljyoti Nath PDF N/A A Digital twin for Diesel Engines: Operator-infused PINNs with Transfer Learning for Engine Health Monitoring
从参数推断注意力头的功能 Amit Elhelo PDF N/A Inferring Functionality of Attention Heads from their Parameters
BetaExplainer:一种用于解释图神经网络的概率方法 Whitney Sloneker PDF N/A BetaExplainer: A Probabilistic Method to Explain Graph Neural Networks
格拉姆(Gramian)多模态表示学习与对齐 Giordano Cicchetti PDF N/A Gramian Multimodal Representation Learning and Alignment
基于不确定性感知的贝叶斯深度学习通过乳腺X线摄影进行可靠的乳腺癌分子亚型预测 Mohaddeseh Chegini PDF N/A Reliable Breast Cancer Molecular Subtype Prediction based on uncertainty-aware Bayesian Deep Learning by Mammography
通过多尺度文本引导的自监督学习提升全面美学洞察力 Yuti Liu PDF N/A Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning
泛化技术对图像分类中隐私、效用和公平性之间相互作用的影响 Ahmad Hassanpour PDF N/A The Impact of Generalization Techniques on the Interplay Among Privacy, Utility, and Fairness in Image Classification
异步分布式高斯过程回归用于在线学习与动态系统:补充文档 Zewen Yang PDF N/A Asynchronous Distributed Gaussian Process Regression for Online Learning and Dynamical Systems: Complementary Document
使用深度目标检测和合成训练数据对无人机图像中的椰子树进行计数 Tobias Rohe PDF N/A Coconut Palm Tree Counting on Drone Images with Deep Object Detection and Synthetic Training Data
OpenReviewer:一种专为生成批判性科学论文评审而设计的专用大型语言模型 Maximilian Idahl PDF N/A OpenReviewer: A Specialized Large Language Model for Generating Critical Scientific Paper Reviews
autrainer:一个用于计算机听觉任务的模块化、可扩展深度学习工具包 Simon Rampp PDF N/A autrainer: A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks
令牌粒度对语言模型意外度预测能力的影响 Byung-Doh Oh PDF N/A The Impact of Token Granularity on the Predictive Power of Language Model Surprisal
SEAGraph:揭示论文评审意见的全貌 Jianxiang Yu PDF N/A SEAGraph: Unveiling the Whole Story of Paper Review Comments
病理学基础模型的潜在表示是否对旋转不变? Matouš Elphick PDF N/A Are the Latent Representations of Foundation Models for Pathology Invariant to Rotation?
大型语言模型中的精确长度控制 Bradley Butcher PDF N/A Precise Length Control in Large Language Models
多模态大语言模型时代的数学推理综述:基准、方法与挑战 Yibo Yan PDF N/A A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges
针对大型语言模型的逐步推理错误干扰攻击 Jingyu Peng PDF N/A Stepwise Reasoning Error Disruption Attack of LLMs
使用简单的平局打破规则加速NSGA-II Benjamin Doerr PDF N/A Speeding Up the NSGA-II With a Simple Tie-Breaking Rule
通过自动宏动作发现实现的分层元强化学习 Minjae Cho PDF N/A Hierarchical Meta-Reinforcement Learning via Automated Macro-Action Discovery
可解释的操作流程错误检测 Shane Storks PDF N/A Explainable Procedural Mistake Detection
PICLe:低资源命名实体检测中的上下文学习伪注释 Sepideh Mamooler PDF N/A PICLe: Pseudo-Annotations for In-Context Learning in Low-Resource Named Entity Detection
RetroLLM:赋能大型语言模型在生成过程中检索细粒度证据 Xiaoxi Li PDF N/A RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation
视觉语言模型分类是否受益于大型语言模型描述的语义? Pingchuan Ma PDF N/A Does VLM Classification Benefit from LLM Description Semantics?
CharacterBench:评估大型语言模型的角色定制能力 Jinfeng Zhou PDF N/A CharacterBench: Benchmarking Character Customization of Large Language Models
语言模型能否媲美数学专业学生?通过文本操作和人类实验评估数学推理能力 Andrii Nikolaiev PDF N/A Can Language Models Rival Mathematics Students? Evaluating Mathematical Reasoning through Textual Manipulation and Human Experiments
PunchBench:在多模态笑点理解中对多模态大语言模型进行基准测试 Kun Ouyang PDF N/A PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
多语言音频的自发和脚本语音分类 Shahar Elisha PDF N/A Classification of Spontaneous and Scripted Speech for Multilingual Audio
从2D CAD图纸到3D参数化模型:一种视觉语言方法 Xilin Wang PDF N/A From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
SegMAN:使用状态空间模型和局部注意力进行语义分割的全尺度上下文建模 Yunxiang Fu PDF N/A SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
将图神经网络应用于自我网络以进行好友推荐 Evgeny Zamyatin PDF N/A GNN Applied to Ego-nets for Friend Suggestions
迈向基于物理的天空建模 Ian J. Maquignaz PDF N/A Towards Physically-Based Sky-Modeling
使用指令微调的大型语言模型识别警方事件叙述中的脆弱性指标 Sam Relins PDF N/A Using Instruction-Tuned Large Language Models to Identify Indicators of Vulnerability in Police Incident Narratives
多数据源上的贝叶斯代理训练:一种混合建模策略 Philipp Reiser PDF N/A Bayesian Surrogate Training on Multiple Data Sources: A Hybrid Modeling Strategy
一个以变量出现为中心的不一致处理框架(扩展版) Yakoub Salhi PDF N/A A Variable Occurrence-Centric Framework for Inconsistency Handling (Extended Version)
Transformer在迷宫求解任务中使用因果世界模型 Alex F. Spies PDF N/A Transformers Use Causal World Models in Maze-Solving Tasks
基于多时间粒度融合的事件驱动运动去模糊 Xiaopeng Lin PDF N/A Event-based Motion Deblurring via Multi-Temporal Granularity Fusion
研究密集检索中的专家混合模型 Effrosyni Sokli PDF N/A Investigating Mixture of Experts in Dense Retrieval
GeoX:通过统一的形式化视觉-语言预训练解决几何问题 Renqiu Xia PDF N/A GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training
关于知识表示形式体系的理论 Heng Zhang PDF N/A A Theory of Formalisms for Representing Knowledge
一个关于大型语言模型在音乐实体检测中的上下文学习基准和鲁棒性研究 Simon Hachmeier PDF N/A A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity Detection
通过高效优化非凸目标实现因果不变性学习 Zhenyu Wang PDF N/A Causal Invariance Learning via Efficient Optimization of a Nonconvex Objective
集成学习和3D Pix2Pix在多模态MRI中全面脑肿瘤分析的应用 Ramy A. Zeineldin PDF N/A Ensemble Learning and 3D Pix2Pix for Comprehensive Brain Tumor Analysis in Multimodal MRI
SPGL:通过单正样本图学习提升基于会话的推荐 Tiantian Liang PDF N/A SPGL: Enhancing Session-based Recommendation with Single Positive Graph Learning
水下机器人中基于声纳的深度学习:概述、鲁棒性与挑战 Martin Aubard PDF N/A Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges
利用决策树分析评估心电向量图和心电图参数在高效分配三级心脏病学护理中的有效性 Lucas José da Costa PDF N/A Evaluating the Efficacy of Vectocardiographic and ECG Parameters for Efficient Tertiary Cardiology Care Allocation Using Decision Tree Analysis
通过人工智能研究食双星系统。第二部分:PHOEBE前向模型中对速度的需求 Marcin Wrona PDF N/A The Eclipsing Binaries via Artificial Intelligence. II. Need for Speed in PHOEBE Forward Models
UnMA-CapSumT:统一与多头部注意力驱动的标题摘要Transformer Dhruv Sharma PDF N/A UnMA-CapSumT: Unified and Multi-Head Attention-driven Caption Summarization Transformer
改进的媒体偏见检测与子分类模型 Tim Menzner PDF N/A Improved Models for Media Bias Detection and Subcategorization
奇妙的矩阵:结合以构建更高效、更强大的基础模型架构 Jingze Shi PDF N/A Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture
你有疑问吗?哦,那可能就有点难了!探索模型不确定性在问题难度估计中的应用 Leonidas Zotos PDF N/A Are You Doubtful? Oh, It Might Be Difficult Then! Exploring the Use of Model Uncertainty for Question Difficulty Estimation
孟加拉语问答模型的发展与挑战:全面综述 Md Iftekhar Islam Tashik PDF N/A Advancements and Challenges in Bangla Question Answering Models: A Comprehensive Review
时空盲点网络与校准流对齐用于自监督视频去噪 Zikang Chen PDF N/A Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising
HiGDA:利用节点层次图学习从局部到全局的拓扑结构,用于半监督领域适应 Ba Hung Ngo PDF N/A HiGDA: Hierarchical Graph of Nodes to Learn Local-to-Global Topology for Semi-Supervised Domain Adaptation
ColorFlow:检索增强型图像序列着色 Junhao Zhuang PDF N/A ColorFlow: Retrieval-Augmented Image Sequence Colorization
EventSum:一个大规模以事件为中心的中文多新闻文档摘要数据集 Mengna Zhu PDF N/A EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
设计基于骨架识别的图卷积网络的半结构化剪枝 Hichem Sahbi PDF N/A Designing Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-based Recognition
CLDA-YOLO:基于视觉对比学习的领域自适应YOLO检测器 Tianheng Qiu PDF N/A CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector
使用片外存储器的稀疏和循环架构的最佳梯度检查点 Wadjih Bencheikh PDF N/A Optimal Gradient Checkpointing for Sparse and Recurrent Architectures using Off-Chip Memory
PhysAug:一种面向单领域泛化目标检测的物理引导与频率基础数据增强方法 Xiaoran Xu PDF N/A PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection
UAlign:利用不确定性估计实现大型语言模型的事实性对齐 Boyang Xue PDF N/A UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
AMI-Net:一种用于工业异常检测与定位的自适应掩码修复网络 Wei Luo PDF N/A AMI-Net: Adaptive Mask Inpainting Network for Industrial Anomaly Detection and Localization
可扩展的大系统中时间异常因果关系发现:利用二进制异常标志数据实现计算效率 Mulugeta Weldezgina Asres PDF N/A Scalable Temporal Anomaly Causality Discovery in Large Systems: Achieving Computational Efficiency with Binary Anomaly Flag Data
在淘汰赛中的联盟适应性操控 Juhi Chaudhary PDF N/A Adaptive Manipulation for Coalitions in Knockout Tournaments
ProsodyFM:用于清晰语音合成的无监督短语和语调控制 Xiangheng He PDF N/A ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis
神经崩溃启发的知识蒸馏 Shuoxi Zhang PDF N/A Neural Collapse Inspired Knowledge Distillation
一种利用案例增强提及图检测韩国刑法条文竞争的方法 Seonho An PDF N/A A Method for Detecting Legal Article Competition for Korean Criminal Law Using a Case-augmented Mention Graph
InterDyn:基于视频扩散模型的可控交互动力学 Rick Akkerman PDF N/A InterDyn: Controllable Interactive Dynamics with Video Diffusion Models
面部对齐对人脸图像质量的影响 Eren Onaran PDF N/A Impact of Face Alignment on Face Image Quality
用于二值神经网络优化的快速和慢速梯度近似 Xinquan Chen PDF N/A Fast and Slow Gradient Approximation for Binary Neural Network Optimization
点云辅助的神经图像压缩 Ziqun Li PDF N/A Point Cloud-Assisted Neural Image Compression
它是否发出“chug”声?迈向对吉他音色描述的数据驱动理解 Pratik Sutar PDF N/A Does it Chug? Towards a Data-Driven Understanding of Guitar Tone Description
不再需要Adam:初始化时的学习率缩放就是你所需的一切 Minghao Xu PDF N/A No More Adam: Learning Rate Scaling at Initialization is All You Need
IDEA-Bench:生成式模型与专业设计之间的差距有多大? Chen Liang PDF N/A IDEA-Bench: How Far are Generative Models from Professional Designing?
零样本仿真到真实强化学习策略在四旋翼控制中的关键因素是什么?一项全面研究 Jiayu Chen PDF N/A What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study
QUENCH:衡量LLMs在印度语与非印度语语境下通用推理能力的差距 Mohammad Aflah Khan PDF N/A QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs
GS-ProCams:基于高斯溅射的投影仪-相机系统 Qingyue Deng PDF N/A GS-ProCams: Gaussian Splatting-based Projector-Camera Systems
利用语言进行协调:一个基于大语言模型驱动的多智能体控制框架与基准测试 Timothée Anne PDF N/A Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control
SCITAT:一个涵盖多种推理类型、针对科学表格和文本的问答基准 Xuanliang Zhang PDF N/A SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types
通过逐帧条件驱动的视频生成实现生成式中间帧补全 Tianyi Zhu PDF N/A Generative Inbetweening through Frame-wise Conditions-Driven Video Generation
DriveGazen:使用传统摄像头进行基于事件的驾驶状态识别 Xiaoyin Yang PDF N/A DriveGazen: Event-Based Driving Status Recognition using Conventional Camera
可变形径向核溅射 Yi-Hua Huang PDF N/A Deformable Radial Kernel Splatting
共同点,多样根源:分类西班牙语变体中常见例子的困难 Javier A. Lopetegui PDF N/A Common Ground, Diverse Roots: The Difficulty of Classifying Common Examples in Spanish Varieties
超越数据集创建:对在线激进内容检测数据集的标注差异与偏见探查的批判性审视 Arij Riabi PDF N/A Beyond Dataset Creation: Critical View of Annotation Variation and Bias Probing of a Dataset for Online Radical Content Detection
基于条件扩散模型的条件独立性检验 Yanfeng Yang PDF N/A Conditional Diffusion Models Based Conditional Independence Testing
广义贝叶斯深度强化学习 Shreya Sinha Roy PDF N/A Generalized Bayesian deep reinforcement learning
CSR:通过稀疏表示实现1比特键值缓存 Hongxuan Zhang PDF N/A CSR:Achieving 1 Bit Key-Value Cache via Sparse Representation
面向谱图神经网络的非对称学习 Fangbing Liu PDF N/A Asymmetric Learning for Spectral Graph Neural Networks
高效实现安全模型训练和安全聚合,以确保联邦学习中的双向隐私保护 Xue Yang PDF N/A Efficiently Achieving Secure Model Training and Secure Aggregation to Ensure Bidirectional Privacy-Preservation in Federated Learning
个性化大型语言模型,用于为来自不同用户的相同查询生成定制化响应 Hang Zeng PDF N/A Personalized LLM for Generating Customized Responses to the Same Query from Different Users
可转移的对抗性人脸攻击,通过文本控制属性 Wenyun Li PDF N/A Transferable Adversarial Face Attack with Text Controlled Attribute
WMT 2024 关于话语层次文学翻译的共享任务研究成果 Longyue Wang PDF N/A Findings of the WMT 2024 Shared Task on Discourse-Level Literary Translation
大型语言模型(LLMs)能够通过代理协同进化模拟标准化病人 Zhuoyun Du PDF N/A LLMs Can Simulate Standardized Patients via Agent Coevolution
用于增强视听零样本学习的差异感知注意力网络 RunLin Yu PDF N/A Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning
Seeker:通过中间语言智能体框架迈向异常安全的代码生成 Xuanming Zhang PDF N/A Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework
MiMoTable:一个带有元操作的多尺度电子表格基准,用于表格推理 Zheng Li PDF N/A MiMoTable: A Multi-scale Spreadsheet Benchmark with Meta Operations for Table Reasoning
重注意力可控视频扩散编辑 Yuanzhi Wang PDF N/A Re-Attentional Controllable Video Diffusion Editing
在问答系统中使用奖励模型进行上下文过滤 Sangryul Kim PDF N/A Context Filtering with Reward Modeling in Question Answering
AsymRnR:利用非对称减少与恢复加速视频扩散变换器 Wenhao Sun PDF N/A AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration
使用未标记的目标语言数据扩展聊天模型的词汇量 Atsuki Yamaguchi PDF N/A Vocabulary Expansion of Chat Models with Unlabeled Target Language Data
Flex-PE:面向AI工作负载的灵活且支持SIMD的多精度处理单元 Mukul Lokhande PDF N/A Flex-PE: Flexible and SIMD Multi-Precision Processing Element for AI Workloads
CoinMath:利用代码指令的力量提升数学大型语言模型 Chengwei Wei PDF N/A CoinMath: Harnessing the Power of Coding Instruction for Math LLMs
在关键任务型IT治理中的大型语言模型:我们准备好了吗? Matteo Esposito PDF N/A On Large Language Models in Mission-Critical IT Governance: Are We Ready Yet?
CiTrus:从低数据生物信号迁移学习中榨取额外性能 Eloy Geenjaar PDF N/A CiTrus: Squeezing Extra Performance out of Low-data Bio-signal Transfer Learning
从特定多模态大语言模型到全向多模态大语言模型:关于与多模态对齐的大语言模型综述 Shixin Jiang PDF N/A From Specific-MLLM to Omni-MLLM: A Survey about the MLLMs alligned with Multi-Modality
使用平行语料库进行多语言和可解释的文本去毒化 Daryna Dementieva PDF N/A Multilingual and Explainable Text Detoxification with Parallel Corpora
在纵向联邦学习中,只需简单的转换即可实现数据保护 Andrei Semenov PDF N/A Just a Simple Transformation is Enough for Data Protection in Vertical Federated Learning
双无迹卡尔曼滤波器架构在水网络泄漏定位中的传感器融合应用 Luis Romero-Ben PDF N/A Dual Unscented Kalman Filter Architecture for Sensor Fusion in Water Networks Leak Localization
通过无限像素学习实现的超高清动态多曝光图像融合 Xingchi Chen PDF N/A Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning
无界整数空间中多目标进化算法的运行时分析 Benjamin Doerr PDF N/A Runtime Analysis for Multi-Objective Evolutionary Algorithms in Unbounded Integer Spaces
用于智能交通系统的多模态大型语言模型 Dexter Le PDF N/A Multimodal LLM for Intelligent Transportation Systems
NEST:一种用于自动驾驶的神经调节小世界超图轨迹预测模型 Chengyue Wang PDF N/A NEST: A Neuromodulated Small-world Hypergraph Trajectory Prediction Model for Autonomous Driving
快速分阶段的CNN模型用于精确的肺部疾病和肺癌检测 Abdelbaki Souid PDF N/A Fast-staged CNN Model for Accurate pulmonary diseases and Lung cancer detection
EGP3D:面向RGB-D相机的边缘引导几何保持三维点云超分辨率技术 Zheng Fang PDF N/A EGP3D: Edge-guided Geometric Preserving 3D Point Cloud Super-resolution for RGB-D camera
偏见向量:通过任务算术方法减轻语言模型中的偏见 Daiki Shirafuji PDF N/A Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach
面向带异步动作的多智能体路径规划的松散同步规则式规划 Shuai Zhou PDF N/A Loosely Synchronized Rule-Based Planning for Multi-Agent Path Finding with Asynchronous Actions
UA-PDFL:一种去中心化的联邦学习个性化方法 Hangyu Zhu PDF N/A UA-PDFL: A Personalized Approach for Decentralized Federated Learning
DINO-Foresight:用DINO展望未来 Efstathios Karypidis PDF N/A DINO-Foresight: Looking into the Future with DINO
LLM-DaaS:基于文本用户请求的LLM驱动无人机即服务运营 Lillian Wassim PDF N/A LLM-DaaS: LLM-driven Drone-as-a-Service Operations from Text User Requests
BioBridge:在语码转换的电子病历中通过桥接模态实现统一生物嵌入 Jangyeong Jeon PDF N/A BioBridge: Unified Bio-Embedding with Bridging Modality in Code-Switched EMR
基于中文手写短语的在线书写者检索:一种协同的时间-频率表示学习方法 Peirong Zhang PDF N/A Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach
C3oT:在不牺牲有效性的前提下生成更短的思维链 Yu Kang PDF N/A C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness
LMM-正则化的CLIP嵌入用于图像分类 Maria Tzelepi PDF N/A LMM-Regularized CLIP Embeddings for Image Classification
联邦学习中的非凸优化:通过方差缩减和自适应学习 Dipanwita Thakur PDF N/A Non-Convex Optimization in Federated Learning via Variance Reduction and Adaptive Learning
CNNtention:卷积神经网络(CNN)能否在加入注意力机制后表现得更好? Julian Glattki PDF N/A CNNtention: Can CNNs do better with Attention?
私密却社交:大型语言模型聊天机器人如何支持并挑战饮食障碍康复 Ryuhaerang Choi PDF N/A Private Yet Social: How LLM Chatbots Support and Challenge Eating Disorder Recovery
平滑度确实重要:一种简单却有效的无监督图域适应方法 Wei Chen PDF N/A Smoothness Really Matters: A Simple yet Effective Approach for Unsupervised Graph Domain Adaptation
自适应释义与偏好学习以提升声明可验证性 Amelie Wührl PDF N/A Self-Adaptive Paraphrasing and Preference Learning for Improved Claim Verifiability
SE-GCL:一种基于事件的简单且有效的图对比学习方法,用于文本表示 Tao Meng PDF N/A SE-GCL: An Event-Based Simple and Effective Graph Contrastive Learning for Text Representation
图像梯度辅助的光度立体网络 Kaixuan Wang PDF N/A Image Gradient-Aided Photometric Stereo Network
BA-BFL:贝叶斯联邦学习的重心聚合方法 Nour Jamoussi PDF N/A BA-BFL: Barycentric Aggregation for Bayesian Federated Learning
全面的GeoAI综述:进展、挑战与展望 Anasse Boutayeb PDF N/A A comprehensive GeoAI review: Progress, Challenges and Outlooks
人工智能规划简介 Marco Aiello PDF N/A Introduction to AI Planning
基于脉冲稳定性定理的高速高质量脉冲相机视觉重建 Wei Zhang PDF N/A High-speed and High-quality Vision Reconstruction of Spike Camera with Spike Stability Theorem
IDProtector:一种对抗性噪声编码器,用于防止保留身份的图像生成 Yiren Song PDF N/A IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation
论面向话语关系标注的众包任务设计 Frances Yung PDF N/A On Crowdsourcing Task Design for Discourse Relation Annotation
预测受损历史文献的原始外观 Zhenhua Yang PDF N/A Predicting the Original Appearance of Damaged Historical Documents
多尺度增量建模在人机协作中增强人体运动预测 Juncheng Zou PDF N/A Multi-Scale Incremental Modeling for Enhanced Human Motion Prediction in Human-Robot Collaboration
一种具有隐式区间的Mapper算法及其优化 Yuyang Tao PDF N/A A Mapper Algorithm with implicit intervals and its optimization
QPruner:在大语言模型中进行结构化剪枝的概率决策量化 Changhai Zhou PDF N/A QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models
愚弄我吧,愚弄我吧:用户对大型语言模型虚假陈述的态度 Diana Bar-Or Nirman PDF N/A Fool Me, Fool Me: User Attitudes Toward LLM Falsehoods
VG-TVP:通过视觉基础的文本-视频提示进行多模态程序规划 Muhammet Furkan Ilaslan PDF N/A VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting
在有标签噪声的情况下进行学习时,对抗语义污染 Wenxiao Fan PDF N/A Combating Semantic Contamination in Learning with Label Noise
EvoLlama:通过多模态结构和序列表示增强大语言模型对蛋白质的理解 Nuowei Liu PDF N/A EvoLlama: Enhancing LLMs' Understanding of Proteins via Multimodal Structure and Sequence Representations
MT-LENS:一款全方位工具包,助力更优机器翻译评估 Javier García Gilabert PDF N/A MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation