Arxiv 2024-10-14 Papers

标题	作者	PDF链接	代码仓库	Title
Tex4D: 利用视频扩散模型实现零样本4D场景纹理化	Jingzhi Bao	PDF	N/A	Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models
感知对齐何时有益于视觉表征？	Shobhita Sundaram	PDF	N/A	When Does Perceptual Alignment Benefit Vision Representations?
TemporalBench：多模态视频模型细粒度时间理解基准测试	Mu Cai	PDF	N/A	TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
DuoAttention：利用检索和流式处理头高效处理长上下文LLM推理	Guangxuan Xiao	PDF	N/A	DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
LVD-2M：一个带有时间密集字幕的长镜头视频数据集	Tianwei Xiong	PDF	N/A	LVD-2M: A Long-take Video Dataset with Temporally Dense Captions
使用可扩展的合成数据实现任意视频深度估计	Honghui Yang	PDF	N/A	Depth Any Video with Scalable Synthetic Data
LongMemEval：在长期互动记忆上评估聊天助手	Di Wu	PDF	N/A	LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory
你的混合专家大型语言模型实际上是一个免费的嵌入模型	Ziyue Li	PDF	N/A	Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
HART：高效视觉生成的混合自回归Transformer	Haotian Tang	PDF	N/A	HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
深度线性探针生成器用于权重空间学习	Jonathan Kahana	PDF	N/A	Deep Linear Probe Generators for Weight Space Learning
文本生成中的局部解码和全局解码	Daniel Gareev	PDF	N/A	Local and Global Decoding in Text Generation
具有普遍逼近保证的硬约束神经网络	Youngjae Min	PDF	N/A	Hard-Constrained Neural Networks with Universal Approximation Guarantees
TL-PCA：主成分分析的迁移学习	Sharon Hendy	PDF	N/A	TL-PCA: Transfer Learning of Principal Component Analysis
TrajDiffuse：一种用于环境感知轨迹预测的条件扩散模型	Qingze	PDF	N/A	TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction
具有改进3D扩散策略的通用类人操作	Yanjie Ze	PDF	N/A	Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies
增强视频扩散变换器的相机运动控制	Soon Yau Cheong	PDF	N/A	Boosting Camera Motion Control for Video Diffusion Transformers
混合数据还是合并模型？为多样化的多任务学习进行优化	Aakanksha	PDF	N/A	Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
面向3D视觉的基础模型：我们离目标还有多远？	Yiming Zuo	PDF	N/A	Towards Foundation Models for 3D Vision: How Close Are We?
MMAR：迈向无损多模态自回归概率建模	Jian Yang	PDF	N/A	MMAR: Towards Lossless Multi-Modal Auto-Regressive Prababilistic Modeling
上下文参数逆向：为什么指令微调可能实际上不会提高上下文依赖性	Sachin Goyal	PDF	N/A	Context-Parametric Inversion: Why Instruction Finetuning May Not Actually Improve Context Reliance
使用校正随机微分方程进行语义图像反演与编辑	Litu Rout	PDF	N/A	Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations
条件感知的多模态融合用于驾驶场景的鲁棒语义感知	Tim Broedermann	PDF	N/A	Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes
情景喜剧创作者：一种基于情节驱动的三维场景中人体运动生成系统	Jianqi Chen	PDF	N/A	Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
关于预测不确定性的信息论度量	Kajetan Schweighofer	PDF	N/A	On Information-Theoretic Measures of Predictive Uncertainty
LiveXiv -- 一个基于Arxiv论文内容的多模态实时基准	Nimrod Shabtay	PDF	N/A	LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
3DArticCyclists：生成用于人-物体交互（HOI）和自动驾驶应用的模拟动态3D骑车者	Eduardo R. Corral-Soto	PDF	N/A	3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications
当注意力下沉现象在语言模型中出现：一个实证视角	Xiangming Gu	PDF	N/A	When Attention Sink Emerges in Language Models: An Empirical View
ControlMM：可控的掩码运动生成	Ekkasit Pinyoanuntapong	PDF	N/A	ControlMM: Controllable Masked Motion Generation
聚焦式ReAct：通过反复迭代和早期停止改进ReAct	Shuoqiu Li	PDF	N/A	Focused ReAct: Improving ReAct through Reiterate and Early Stop
UniMatch V2：推动半监督语义分割的极限	Lihe Yang	PDF	N/A	UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation
Cavia：一种利用视图集成注意力机制的摄像机可控多视角视频扩散模型	Dejia Xu	PDF	N/A	Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention
增强JEPAs与空间条件：鲁棒且高效的表征学习	Etai Littwin	PDF	N/A	Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning
自适应扩散地形生成器，用于自主不平地形导航	Youwei Yu	PDF	N/A	Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation
AFlow：自动化代理工作流程生成	Jiayi Zhang	PDF	N/A	AFlow: Automating Agentic Workflow Generation
针对大型语言模型的拒绝服务中毒攻击	Kuofeng Gao	PDF	N/A	Denial-of-Service Poisoning Attacks against Large Language Models
SplitLLM：用于模型放置和吞吐量优化的协同推理	Akrit Mudvari	PDF	N/A	SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization
基于相关矩阵的图神经网络心律失常分类	Seungwoo Han	PDF	N/A	Arrhythmia Classification Using Graph Neural Networks Based on Correlation Matrix
目前使用随机选择：基于大语言模型的文本增强分类中的少样本选择策略研究	Jan Cegin	PDF	N/A	Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation for Classification
DragEntity：利用实体和位置关系进行轨迹引导的视频生成	Zhang Wan	PDF	N/A	DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships
FlexGen：从文本和图像输入生成灵活的多视图内容	Xinli Xu	PDF	N/A	FlexGen: Flexible Multi-View Generation from Text and Image Inputs
使用李雅普诺夫稳定嵌入进行对抗鲁棒的分布外检测	Hossein Mirzaei	PDF	N/A	Adversarially Robust Out-of-Distribution Detection Using Lyapunov-Stabilized Embeddings
NT-LLM：一种新颖的节点标记器，用于将图结构整合到大语言模型中	Yanbiao Ji	PDF	N/A	NT-LLM: A Novel Node Tokenizer for Integrating Graph Structure into Large Language Models
SensorBench: 基于编码的传感器处理中的大语言模型基准测试	Pengrui Quan	PDF	N/A	SensorBench: Benchmarking LLMs in Coding-Based Sensor Processing
平衡连续预训练与指令微调：优化大型语言模型中的指令遵循	Ishan Jindal	PDF	N/A	Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs
DrivingDojo数据集：推动交互式与知识丰富的驾驶世界模型的发展	Yuqi Wang	PDF	N/A	DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model
在线统计推断用于时变样本平均Q-学习	Saunak Kumar Panda	PDF	N/A	Online Statistical Inference for Time-varying Sample-averaged Q-learning
面向对抗鲁棒拒绝选项分类的校准损失	Vrund Shah	PDF	N/A	Towards Calibrated Losses for Adversarial Robust Reject Option Classification
将自我纠错作为大型语言模型的一种内在能力嵌入，以增强数学推理能力	Kuofeng Gao	PDF	N/A	Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning
高效高分辨率扩散模型的深度压缩自编码器	Junyu Chen	PDF	N/A	Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
面向大语言模型引导的高效且可解释的多线性张量网络秩选择	Giorgos Iacovides	PDF	N/A	Towards LLM-guided Efficient and Interpretable Multi-linear Tensor Network Rank Selection
图像配准中的一个反例	Serap A. Savari	PDF	N/A	A Counterexample in Image Registration
大型语言模型在自然语言生成评估中充当积极批评者	Shuying Xu	PDF	N/A	Large Language Models Are Active Critics in NLG Evaluation
4-LEGS：4D语言嵌入高斯光栅化	Gal Fiebelman	PDF	N/A	4-LEGS: 4D Language Embedded Gaussian Splatting
SeedLM：将大型语言模型的权重压缩成伪随机生成器的种子	Rasoul Shafipour	PDF	N/A	SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
受益于量子？Q-Seg、量子启发技术和U-Net在裂缝分割中的比较研究	Akshaya Srinivasan	PDF	N/A	Benefiting from Quantum? A Comparative Study of Q-Seg, Quantum-Inspired Techniques, and U-Net for Crack Segmentation
结合ConvNeXt V2和MaxViT的模型用于长尾分布的CXR分类，并通过基于视角的聚合方法进行优化	Yosuke Yamagishi	PDF	N/A	Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation
利用YOLOv8和YOLOv11深度学习模型进行急性淋巴细胞白血病的早期诊断	Alaa Awad	PDF	N/A	Early Diagnoses of Acute Lymphoblastic Leukemia Using YOLOv8 and YOLOv11 Deep Learning Models
脱轨：通过自我发现的线索进行多轮LLM越狱攻击	Qibing Ren	PDF	N/A	Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues
TALK-Act：通过扩散模型增强2D说话头像重演的纹理感知能力	Jiazhi Guan	PDF	N/A	TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model
使用植入式微电极阵列分离多功能神经移植至肌肉的神经驱动	Laura Ferrante	PDF	N/A	Separation of Neural Drives to Muscles from Transferred Polyfunctional Nerves using Implanted Micro-electrode Arrays
动态损失函数塑造了地形景观并改进了人工神经网络的学习过程。	Eduardo Lavin	PDF	N/A	Dynamical loss functions shape landscape topography and improve learning in artificial neural networks
构建受自然语言处理（NLP）启发的多元时间序列基准数据集	Mohammad Asif Ibna Mustafa	PDF	N/A	Building a Multivariate Time Series Benchmarking Datasets Inspired by Natural Language Processing (NLP)
SAMPa：锐度感知最小化并行化	Wanyun Xie	PDF	N/A	SAMPa: Sharpness-aware Minimization Parallelized
组合多臂老虎机：通过分组测试进行臂选择	Arpan Mukherjee	PDF	N/A	Combinatorial Multi-armed Bandits: Arm Selection via Group Testing
双耳聆听：迈向语言驱动的空间音频生成	Peiwen Sun	PDF	N/A	Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
增强深度强化学习中的鲁棒性：一种李雅普诺夫指数方法	Rory Young	PDF	N/A	Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
通过矩阵核范数进行大型语言模型评估	Yahan Li	PDF	N/A	Large Language Model Evaluation via Matrix Nuclear-Norm
双重风险与大型语言模型在气候影响中的应用：社会经济差异及对非英语使用者效用的降低	Aivin V. Solatorio	PDF	N/A	Double Jeopardy and Climate Impact in the Use of Large Language Models: Socio-economic Disparities and Reduced Utility for Non-English Speakers
跨模态少样本学习：一种生成式迁移学习框架	Zhengwei Yang	PDF	N/A	Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
游戏玩法转型：强化学习中DCQN与DTQN架构的比较研究	William A. Stigall	PDF	N/A	Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning
PCF-Lift：通过概率对比融合实现全景提升	Runsong Zhu	PDF	N/A	PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion
AutoTurb：利用大型语言模型实现湍流闭合模型的自动代数模型发现	Yu Zhang	PDF	N/A	AutoTurb: Using Large Language Models for Automatic Algebraic Model Discovery of Turbulence Closure
不确定性下的导航：基于切换动力系统的轨迹预测与遮挡推理	Ran Wei	PDF	N/A	Navigation under uncertainty: Trajectory prediction and occlusion reasoning with switching dynamical systems
任务：通过对比子图嵌入在空间转录组数据上查询功能性和结构性生态位	Mo Chen	PDF	N/A	QueST: Querying Functional and Structural Niches on Spatial Transcriptomics Data via Contrastive Subgraph Embedding
生成式人工智能及其对个性化智能辅导系统的影响	Subhankar Maity	PDF	N/A	Generative AI and Its Impact on Personalized Intelligent Tutoring Systems
使用自回归表格变换器预测事件的简单基线	Alex Stein	PDF	N/A	A Simple Baseline for Predicting Events with Auto-Regressive Tabular Transformers
DR-MPC：用于现实世界社交导航的深度残差模型预测控制	James R. Han	PDF	N/A	DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation
时空区域级数据的回声状态网络	Zhenhua Wang	PDF	N/A	Echo State Networks for Spatio-Temporal Area-Level Data
使用时间得分匹配法进行指数族中的高维微分参数推断	Daniel J. Williams	PDF	N/A	High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching
Adapt-$\infty$：通过动态数据选择实现可扩展的终身多模态指令调优	Adyasha Maharana	PDF	N/A	Adapt-$\infty$: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection
思考型大语言模型：通过思维生成实现通用指令跟随	Tianhao Wu	PDF	N/A	Thinking LLMs: General Instruction Following with Thought Generation
SANA：利用线性扩散变换器实现高效的高分辨率图像合成	Enze Xie	PDF	N/A	SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
通过语言家族专家的混合方法，高效地将医学大型语言模型（LLMs）民主化，适用于50种语言	Guorui Zheng	PDF	N/A	Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
SensorLLM：将大型语言模型与运动传感器结合用于人体活动识别	Zechen Li	PDF	N/A	SensorLLM: Aligning Large Language Models with Motion Sensors for Human Activity Recognition
鲁棒的相位检索梯度下降法	Alex Buna	PDF	N/A	Robust Gradient Descent for Phase Retrieval
建模新闻互动与影响以进行金融市场预测	Mengyu Wang	PDF	N/A	Modeling News Interactions and Influence for Financial Market Prediction
智能勘探者v2.0：在认知模型不确定性下的勘探钻井规划	John Mern	PDF	N/A	Intelligent prospector v2.0: exploration drill planning under epistemic model uncertainty
Lambda-跳跃连接：防止秩崩溃的结构组件	Federico Arangath Joseph	PDF	N/A	Lambda-Skip Connections: the architectural component that prevents Rank Collapse
BrainMVP：利用多参数MRI进行脑图像分析的多模态视觉预训练	Shaohao Rui	PDF	N/A	BrainMVP: Multi-modal Vision Pre-training for Brain Image Analysis using Multi-parametric MRI
通过实践克服经典挑战的神经网络	Kazuki Irie	PDF	N/A	Neural networks that overcome classic challenges through practice
VisRAG：基于视觉的多模态文档检索增强生成	Shi Yu	PDF	N/A	VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
认知雷达的在线波形选择	Thulasi Tholeti	PDF	N/A	Online waveform selection for cognitive radar
MoTE：协调视觉-语言到视频知识转移中的泛化与专业化	Minghao Zhu	PDF	N/A	MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer
TRESTLE：结构化领域中的概念形成模型	Christopher J. MacLellan	PDF	N/A	TRESTLE: A Model of Concept Formation in Structured Domains
TopoFR：深入探讨拓扑对齐在人脸识别中的应用	Jun Dan	PDF	N/A	TopoFR: A Closer Look at Topology Alignment on Face Recognition
Tübingen-CL 在 SemEval-2024 任务 1 中：基于集成学习的语义相关性估计	Leixin Zhang	PDF	N/A	Tübingen-CL at SemEval-2024 Task 1:Ensemble Learning for Semantic Relatedness Estimation
STACKFEED：结合反馈的结构化文本型行动者-评论家知识库编辑	Naman Gupta	PDF	N/A	STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBack
多语言控制生成与黄金标准无关的代码混合句子评估	Ayushman Gupta	PDF	N/A	Multilingual Controlled Generation And Gold-Standard-Agnostic Evaluation of Code-Mixed Sentences
燃烧的红色：在平均奖励马尔可夫决策过程中解锁子任务驱动的强化学习和风险意识	Juan Sebastian Rojas	PDF	N/A	Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes
零样本词性标注的配方：在现实场景中是否有用？	Zeno Vandenbulcke	PDF	N/A	Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios?
可查询原型多实例学习与视觉语言模型用于增量全切片图像分类	Jiaxiang Gou	PDF	N/A	Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification
正则化鲁棒可靠学习器与实例目标攻击	Avrim Blum	PDF	N/A	Regularized Robustly Reliable Learners and Instance Targeted Attacks
当先例发生冲突时	Cecilia Di Florio	PDF	N/A	When Precedents Clash
MEGA-Bench：将多模态评估扩展到超过500个现实世界任务	Jiacheng Chen	PDF	N/A	MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
结构依赖性是否为高效沟通而塑造？：一项关于协调的案例研究	Kohei Kajikawa	PDF	N/A	Is Structure Dependence Shaped for Efficient Communication?: A Case Study on Coordination
ROSAR：一种用于鲁棒侧扫声呐目标检测的对抗性再训练框架	Martin Aubard	PDF	N/A	ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection
SLaNC：静态层归一化校准	Mahsa Salmani	PDF	N/A	SLaNC: Static LayerNorm Calibration
保护心脏完整性：一种融入拓扑学的全心脏分割方法	Chenyu Zhang	PDF	N/A	Preserving Cardiac Integrity: A Topology-Infused Approach to Whole Heart Segmentation
RICASSO：通过类感知自监督异常值暴露增强的不平衡学习	Xuan Zhang	PDF	N/A	RICASSO: Reinforced Imbalance Learning with Class-Aware Self-Supervised Outliers Exposure
混合Transformer用于早期阿尔茨海默病检测：结合基于手写体的2D图像和1D信号特征	Changqing Gong	PDF	N/A	Hybrid Transformer for Early Alzheimer's Detection: Integration of Handwriting-Based 2D Images and 1D Signal Features
通过Hodgelet谱特征进行图分类的高斯过程	Mathieu Alain	PDF	N/A	Graph Classification Gaussian Processes via Hodgelet Spectral Features
在大语言模型时代重新思考现实场景中的法律判决预测	Shubham Kumar Nigam	PDF	N/A	Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models
基于数据的方法用于建模目标行为	Isabel Schlangen	PDF	N/A	Data-Driven Approaches for Modelling Target Behaviour
基于可重复机器学习的语音病理检测：引入音高差异特征	Jan Vrba	PDF	N/A	Reproducible Machine Learning-based Voice Pathology Detection: Introducing the Pitch Difference Feature
多元时间序列的透明网络	Minkyu Kim	PDF	N/A	Transparent Networks for Multivariate Time Series
在数据驱动的监督深度学习中，无法收敛到全局最小值：Adam和随机梯度下降优化在训练具有ReLU激活的深度神经网络时，证明无法收敛到全局最小值。	Sonja Hannibal	PDF	N/A	Non-convergence to global minimizers in data driven supervised deep learning: Adam and stochastic gradient descent optimization provably fail to converge to global minimizers in the training of deep neural networks with ReLU activation
无需自适应内存需求的自适应概率ODE求解器	Nicholas Krämer	PDF	N/A	Adaptive Probabilistic ODE Solvers Without Adaptive Memory Requirements
在复杂且非平面场景中进行运动引导的小型微型飞行器检测	Hanqing Guo	PDF	N/A	Motion-guided small MAV detection in complex and non-planar scenes
摆脱任务隔离：一种连续多任务时空学习框架	Zhongchao Yi	PDF	N/A	Get Rid of Task Isolation: A Continuous Multi-task Spatio-Temporal Learning Framework
逆问题与数据同化：一种机器学习方法	Eviatar Bach	PDF	N/A	Inverse Problems and Data Assimilation: A Machine Learning Approach
持续深度强化学习以防止干扰缓解中的灾难性遗忘	Kemal Davaslioglu	PDF	N/A	Continual Deep Reinforcement Learning to Prevent Catastrophic Forgetting in Jamming Mitigation
基于人工智能的闪烁纤维成像传感器粒子轨迹识别	Noemi Bührer	PDF	N/A	AI-based particle track identification in scintillating fibres read out with imaging sensors
UniGEM：一种统一分子生成与性质预测的方法	Shikun Feng	PDF	N/A	UniGEM: A Unified Approach to Generation and Property Prediction for Molecules
我们是否需要更复杂的结构表示？对音乐变压器的音符时长表示进行比较	Gabriel Souza	PDF	N/A	Do we need more complex representations for structure? A comparison of note duration representation for Music Transformers
自定义您的视觉自回归配方，使用集合自回归建模	Wenze Liu	PDF	N/A	Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
利用局部特征和范围图像进行小数据实时点云语义分割	Daniel Fusaro	PDF	N/A	Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation
基于人工智能的皮肤黑色素细胞病变分级	Ruben T. Lucassen	PDF	N/A	Artificial Intelligence-Based Triaging of Cutaneous Melanocytic Lesions
印度次大陆的日常用语	Utkarsh Pathak	PDF	N/A	Everyday Speech in the Indian Subcontinent
深度学习与传统方法在疾病发作预测中的比较	Luis H. John	PDF	N/A	Comparison of deep learning and conventional methods for disease onset prediction
一种可内核化的多线性奇异值分解的原对偶公式	Frederiek Wesel	PDF	N/A	A Kernelizable Primal-Dual Formulation of the Multilinear Singular Value Decomposition
一种对时间上的因果推断的实用方法	Martina Cinquini	PDF	N/A	A Practical Approach to Causal Inference over Time
持续学习提升零样本动作识别	Shreyank N Gowda	PDF	N/A	Continual Learning Improves Zero-Shot Action Recognition
基于提示的图像编辑的视觉引导和掩码增强自适应去噪	Kejie Wang	PDF	N/A	Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
学习无遗忘的视觉语言模型基础	Aritra Bhowmik	PDF	N/A	Learning to Ground VLMs without Forgetting
大型语言模型中的文化保真度：在线语言资源作为价值表示模型性能驱动力的评估	Sharif Kazemi	PDF	N/A	Cultural Fidelity in Large-Language Models: An Evaluation of Online Language Resources as a Driver of Model Performance in Value Representation
一种用于评估卫星影像清晰度的新型无参考图像质量指标	Lucas Gonzalo Antonel	PDF	N/A	A Novel No-Reference Image Quality Metric For Assessing Sharpness In Satellite Imagery
推进新生儿护理：利用AI驱动的自适应归一化热成像进行精确的出生时间检测	Jorge García-Torres	PDF	N/A	Advancing Newborn Care: Precise Birth Time Detection Using AI-Driven Thermal Imaging with Adaptive Normalization
基于模型的差分隐私知识迁移用于大型语言模型	Zhaomin Wu	PDF	N/A	Model-Based Differentially Private Knowledge Transfer for Large Language Models
TMGBench：一个系统的游戏基准，用于评估LLMs的战略推理能力	Haochuan Wang	PDF	N/A	TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs
大型语言模型（LLMs）会取代仅编码器模型在时间关系分类中的地位吗？	Gabriel Roccabruna	PDF	N/A	Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification?
结构化状态空间模型中的隐性偏见可以通过干净标签被毒害	Yonatan Slutzky	PDF	N/A	The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
ReLayout：通过布局增强预训练实现现实世界文档理解	Zhouqiang Jiang	PDF	N/A	ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training
Moirai-MoE：通过稀疏专家混合赋能时间序列基础模型	Xu Liu	PDF	N/A	Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
深度图网络中的信息传播动力学	Alessio Gravina	PDF	N/A	Information propagation dynamics in Deep Graph Networks
TABCF：使用基于Transformer的VAE为表格数据生成反事实解释	Emmanouil Panagiotou	PDF	N/A	TABCF: Counterfactual Explanations for Tabular Data Using a Transformer-Based VAE
多智能体系统的组合屏蔽与强化学习	Asger Horn Brorholt	PDF	N/A	Compositional Shielding and Reinforcement Learning for Multi-Agent Systems
Ada-K 路由：提升基于 MoE 的大型语言模型效率	Tongtian Yue	PDF	N/A	Ada-K Routing: Boosting the Efficiency of MoE-based LLMs
通过增强型表示相似性融合推进学术知识检索	Wei Dai	PDF	N/A	Advancing Academic Knowledge Retrieval via LLM-enhanced Representation Similarity Fusion
通过改进元学习方法，利用从任务中获取的所有可用信息，提升少样本文本分类的性能。	Xinyue Liu	PDF	N/A	Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks
自我评估生成：真实世界中光流和立体匹配的可信标签生成	Han Ling	PDF	N/A	Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world
基于原则的贝叶斯优化与人类专家协作	Wenjie Xu	PDF	N/A	Principled Bayesian Optimisation in Collaboration with Human Experts
移动性感知的联邦学习：基于多臂赌博机的车辆网络选择	Haoyu Tu	PDF	N/A	Mobility-Aware Federated Learning: Multi-Armed Bandit Based Selection in Vehicular Network
KBLaM：知识库增强的语言模型	Xi Wang	PDF	N/A	KBLaM: Knowledge Base augmented Language Model
QUITE：在贝叶斯推理场景中量化自然语言文本中的不确定性	Timo Pierre Schrader	PDF	N/A	QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios
用于完全测试时适应的域条件变换器	Yushun Tang	PDF	N/A	Domain-Conditioned Transformer for Fully Test-time Adaptation
自由视频-大语言模型：提示引导的视觉感知，实现高效的无训练视频大语言模型	Kai Han	PDF	N/A	Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs
向个性化文本到图像扩散模型中未经授权数据使用的可靠验证迈进	Boheng Li	PDF	N/A	Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models
LKASeg：利用大核注意力与全尺度跳跃连接进行遥感图像语义分割	Xuezhi Xiang	PDF	N/A	LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections
多样性感知的强化学习用于从头药物设计	Hampus Gummesson Svensson	PDF	N/A	Diversity-Aware Reinforcement Learning for de novo Drug Design
DOME：将扩散模型驯化为高保真可控的占用世界模型	Songen Gu	PDF	N/A	DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model
一种用于超参数优化和元学习的双层优化的随机方法	Minyoung Kim	PDF	N/A	A Stochastic Approach to Bi-Level Optimization for Hyperparameter Optimization and Meta Learning
耦合自回归主动推理代理用于多关节动力系统的控制	Tim N. Nisslbeck	PDF	N/A	Coupled autoregressive active inference agents for control of multi-joint dynamical systems
基于大语言模型的防护模型校准以实现可靠内容审核	Hongfu Liu	PDF	N/A	On Calibration of LLM-based Guard Models for Reliable Content Moderation
4DStyleGaussian：基于高斯光栅化的零样本4D风格迁移	Wanlin Liang	PDF	N/A	4DStyleGaussian: Zero-shot 4D Style Transfer with Gaussian Splatting
Medico：基于多源证据融合的幻觉检测与校正	Xinping Zhao	PDF	N/A	Medico: Towards Hallucination Detection and Correction with Multi-source Evidence Fusion
MMCFND：面向低资源印度语言的多模态多语言描述感知假新闻检测	Shubhi Bansal	PDF	N/A	MMCFND: Multimodal Multilingual Caption-aware Fake News Detection for Low-resource Indic Languages
确定性苹果品尝	Zachary Chase	PDF	N/A	Deterministic Apple Tasting
使用可微模板参数化结构以进行三维形状生成	Changfeng Ma	PDF	N/A	Parameterize Structure with Differentiable Template for 3D Shape Generation
FairMindSim: 伦理困境中人类与LLM代理的行为、情感与信念的协调	Yu Lei	PDF	N/A	FairMindSim: Alignment of Behavior, Emotion, and Belief in Humans and LLM Agents Amid Ethical Dilemmas
更严格的专家混合风险界限	Wissam Akretche	PDF	N/A	Tighter Risk Bounds for Mixtures of Experts
贝叶斯神经网络的深度估计改进	Bart van Erp	PDF	N/A	Improved Depth Estimation of Bayesian Neural Networks
PIVOT-R：用于机器人操作的原始驱动航点感知世界模型	Kaidong Zhang	PDF	N/A	PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation
GIFT-Eval：一个通用时间序列预测模型评估基准	Taha Aksu	PDF	N/A	GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation
优化指令合成：利用树搜索有效探索进化空间	Chenglin Li	PDF	N/A	Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search
斯坦变分进化策略	Cornelius V. Braun	PDF	N/A	Stein Variational Evolution Strategies
逆向精细化网络用于高分辨率卫星影像中狭窄乡村道路的检测	Ningjing Wang	PDF	N/A	Reverse Refinement Network for Narrow Rural Road Detection in High-Resolution Satellite Imagery
具有未知超参数的贝叶斯优化：对数更接近最优的遗憾界限	Juliusz Ziomek	PDF	N/A	Bayesian Optimisation with Unknown Hyperparameters: Regret Bounds Logarithmically Closer to Optimal
V2M：用于图像表示学习的视觉二维Mamba	Chengkun Wang	PDF	N/A	V2M: Visual 2-Dimensional Mamba for Image Representation Learning
基于非负/二值矩阵分解的协同过滤	Yukino Terui	PDF	N/A	Collaborative filtering based on nonnegative/binary matrix factorization
学习计算机网络中的亚秒级路由优化需要了解数据包级别的动态特性。	Andreas Boltres	PDF	N/A	Learning Sub-Second Routing Optimization in Computer Networks requires Packet-Level Dynamics
阿尔茨海默病诊断与早期检测的类别平衡多样性多模态集成方法	Arianna Francesconi	PDF	N/A	Class Balancing Diversity Multimodal Ensemble for Alzheimer's Disease Diagnosis and Early Detection
锐度感知最小化在训练后期有效地选择更平坦的最小值	Zhanpeng Zhou	PDF	N/A	Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
书虫：角色描述与分析数据集	Argyrios Papoudakis	PDF	N/A	BookWorm: A Dataset for Character Description and Analysis
格罗宁根：利用选定的增强井和地震数据进行岩石气体饱和度的空间预测，采用分类器集成方法	Dmitry Ivlev	PDF	N/A	Groningen: Spatial Prediction of Rock Gas Saturation by Leveraging Selected and Augmented Well and Seismic Data with Classifier Ensembles
创新思维，无限幽默：通过结构化思维跳跃对大型语言模型进行幽默研究	Han Wang	PDF	N/A	Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps
计算稀疏图上一般随机游走图核的最优时间复杂度算法	Krzysztof Choromanski	PDF	N/A	Optimal Time Complexity Algorithms for Computing General Random Walk Graph Kernels on Sparse Graphs
亲和图引导的收缩学习用于极少标注的无前置任务医学图像分割	Zehua Cheng	PDF	N/A	Affinity-Graph-Guided Contractive Learning for Pretext-Free Medical Image Segmentation with Minimal Annotation
SpeGCL：无正样本的自监督图谱对比学习	Yuntao Shou	PDF	N/A	SpeGCL: Self-supervised Graph Spectrum Contrastive Learning without Positive Samples
育儿：通过参数解耦和定制调优优化检索增强语言模型的知识选择	Yongxin Xu	PDF	N/A	Parenting: Optimizing Knowledge Selection of Retrieval-Augmented Language Models with Parameter Decoupling and Tailored Tuning
FasterDiT：在不修改架构的情况下实现更快的扩散Transformer训练	Jingfeng Yao	PDF	N/A	FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification
使用BiFormer注意力机制和多路径扩张卷积的耻骨联合-胎头分割网络	Pengzhou Cai	PDF	N/A	Pubic Symphysis-Fetal Head Segmentation Network Using BiFormer Attention Mechanism and Multipath Dilated Convolution
在深度学习背景下表示三维旋转	Viktória Pravdová	PDF	N/A	On Representation of 3D Rotation in the Context of Deep Learning
基于大型语言模型的代码转换文本生成用于语法错误纠正	Tom Potter	PDF	N/A	LLM-based Code-Switched Text Generation for Grammatical Error Correction
通过自动数据标注和优化增强大型语言模型中的上下文学习	Joseph Shtok	PDF	N/A	Augmenting In-Context-Learning in LLMs via Automatic Data Labeling and Refinement
一种针对大型语言模型的统一路由与级联方法	Jasper Dekoninck	PDF	N/A	A Unified Approach to Routing and Cascading for LLMs
锁定微调后的大型语言模型（LLMs）的安全性	Minjun Zhu	PDF	N/A	Locking Down the Finetuned LLMs Safety
重放与遗忘自由的图类增量学习：一种任务分析与提示方法	Chaoxi Niu	PDF	N/A	Replay-and-Forget-Free Graph Class-Incremental Learning: A Task Profiling and Prompting Approach
CoMAT：数学注释思维链提升数学推理能力	Joshua Ong Jun Leang	PDF	N/A	CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning
解构针对不同目标身份的仇恨	Yiping Jin	PDF	N/A	Disentangling Hate Across Target Identities
解剖特征优先损失以增强MR到CT的转换	Arthur Longuefosse	PDF	N/A	Anatomical feature-prioritized loss for enhanced MR to CT translation