arXiv 2025-02-17 Papers

Chinese Title	Author	PDF Link	Code Repository	English Title
无需无分类器引导的扩散模型	Zhicong Tang	PDF	N/A	Diffusion Models without Classifier-free Guidance
为现实世界人形机器人学习起床策略	Xialin He	PDF	N/A	Learning Getting-Up Policies for Real-World Humanoid Robots
VoLUT：通过基于查找表（LUT）的超分辨率技术增强的高效体积流传输	Chendong Wang	PDF	N/A	VoLUT: Efficient Volumetric streaming enhanced by LUT-based super-resolution
大型语言模型中的独特性	Mingjie Sun	PDF	N/A	Idiosyncrasies in Large Language Models
HARBOR：探索多智能体竞争中角色动态	Kenan Jiang	PDF	N/A	HARBOR: Exploring Persona Dynamics in Multi-Agent Competition
HermesFlow：无缝弥合多模态理解与生成的鸿沟	Ling Yang	PDF	N/A	HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation
学习用于物理性质预测的平滑且富有表现力的原子间势能	Xiang Fu	PDF	N/A	Learning Smooth and Expressive Interatomic Potentials for Physical Property Prediction
扩散锐化：通过去噪轨迹锐化微调扩散模型	Ye Tian	PDF	N/A	Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
快还是好？在检索增强生成中平衡准确性与成本，并提供灵活的用户控制	Jinyan Su	PDF	N/A	Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control
小型模型难以从强大的推理者中学习	Yuetai Li	PDF	N/A	Small Models Struggle to Learn from Strong Reasoners
FLARE：从未校准的稀疏视图中进行前馈几何、外观和相机估计	Shangzhan Zhang	PDF	N/A	FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views
REVERSUM：一种多阶段的检索增强生成方法，通过个人叙事增强维基百科尾部传记	Sayantan Adak	PDF	N/A	REVERSUM: A Multi-staged Retrieval-Augmented Generation Method to Enhance Wikipedia Tail Biographies through Personal Narratives
MagicArticulate：让您的3D模型准备好进行关节连接	Chaoyue Song	PDF	N/A	MagicArticulate: Make Your 3D Models Articulation-Ready
SoftCoT：用于大型语言模型高效推理的软性思维链	Yige Xu	PDF	N/A	SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs
Transformer动力学：一种神经科学视角下的大型语言模型可解释性研究	Jesseba Fernando	PDF	N/A	Transformer Dynamics: A neuroscientific approach to interpretability of large language models
通过自动奖励建模与规划扩展自主代理的规模	Zhenfang Chen	PDF	N/A	Scaling Autonomous Agents via Automatic Reward Modeling And Planning
LaM-SLidE：通过链接实体进行空间动力学系统的潜在空间建模	Florian Sestak	PDF	N/A	LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities
超类偏差：通过类层次结构的视角揭示深度分类器训练动态	Roman Malashin	PDF	N/A	Hypernym Bias: Unraveling Deep Classifier Training Dynamics through the Lens of Class Hierarchy
RA-MTR：一种基于检索增强多任务阅读器的方法，用于从长文档中提取励志语录	Sayantan Adak	PDF	N/A	RA-MTR: A Retrieval Augmented Multi-Task Reader based Approach for Inspirational Quote Extraction from Long Documents
关于验证器辅助语言生成的查询复杂性	Edoardo Botta	PDF	N/A	On the Query Complexity of Verifier-Assisted Language Generation
最小化参数，最大化信心：LoRA的高效不确定性量化	Patryk Marszałek	PDF	N/A	Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA
LLMs在一条线上：数据决定损失到损失的缩放规律	Prasanna Mayilvahanan	PDF	N/A	LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws
PRISM：一种用于免训练多模态数据选择的自剪枝内在选择方法	Jinhe Bi	PDF	N/A	PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection
在不进行验证或强化学习的情况下扩展测试时计算是次优的	Amrith Setlur	PDF	N/A	Scaling Test-Time Compute Without Verification or RL is Suboptimal
SWE-Lancer: 前沿的大型语言模型能否从现实世界的自由职业软件工程中赚取100万美元？	Samuel Miserendino	PDF	N/A	SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?
单目事件相机运动捕捉系统	Leonard Bauersfeld	PDF	N/A	A Monocular Event-Camera Motion Capture System
A-MEM：面向LLM智能体的代理记忆	Wujiang Xu	PDF	N/A	A-MEM: Agentic Memory for LLM Agents
人格研究中的大型语言模型模拟用结构化访谈	Pengda Wang	PDF	N/A	Personality Structured Interview for Large Language Model Simulation in Personality Research
使用最小阻力路径来解释深度网络	Sina Salek	PDF	N/A	Using the Path of Least Resistance to Explain Deep Networks
人类与AI合作的关系规范	Brian D. Earp	PDF	N/A	Relational Norms for Human-AI Cooperation
Token通信：跨模态上下文感知语义通信的统一框架	Li Qiao	PDF	N/A	Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications
视觉语言模型中的区分性-生成性自定义标记	Pramuditha Perera	PDF	N/A	Descriminative-Generative Custom Tokens for Vision-Language Models
一项关于利用搜索与自我反馈提升智能体推理能力的研究	Karthikeyan K	PDF	N/A	A Study on Leveraging Search and Self-Feedback for Agent Reasoning
随着扩散模型的训练，组合泛化能力和创造力是如何提升的	Alessandro Favero	PDF	N/A	How compositional generalization and creativity improve as diffusion models are trained
元统计学习：统计推断的监督学习	Maxime Peyrard	PDF	N/A	Meta-Statistical Learning: Supervised Learning of Statistical Inference
统一动态系统中的可解释异常检测与根本原因分析	Yue Sun	PDF	N/A	Unifying Explainable Anomaly Detection and Root Cause Analysis in Dynamical Systems
APB：通过跨GPU传递压缩上下文块来加速分布式长上下文推理	Yuxiang Huang	PDF	N/A	APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs
VLM$^2$-Bench：深入探讨视觉语言模型如何隐式链接显式匹配的视觉线索	Jianshu Zhang	PDF	N/A	VLM$^2$-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
AdaSplash：自适应稀疏Flash Attention	Nuno Gonçalves	PDF	N/A	AdaSplash: Adaptive Sparse Flash Attention
不可破解的时间奖励机制，用于可扩展的视频多模态大语言模型	En Yu	PDF	N/A	Unhackable Temporal Rewarding for Scalable Video MLLMs
HumanGif: 基于生成先验的单视角人体扩散模型	Shoukang Hu	PDF	N/A	HumanGif: Single-View Human Diffusion with Generative Prior
大型语言模型能否模拟社交媒体互动？一项关于行动导向响应生成的研究	Zhongyi Qiu	PDF	N/A	Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation
TokenSkip: 大语言模型中的可控思维链压缩	Heming Xia	PDF	N/A	TokenSkip: Controllable Chain-of-Thought Compression in LLMs
CONSTRUCTA：利用大型语言模型自动化制造设施中的商业建筑进度安排	Yifan Zhang	PDF	N/A	CONSTRUCTA: Automating Commercial Construction Schedules in Fabrication Facilities with Large Language Models
使用大型语言模型形式化复杂数学陈述：关于数学定义的研究	Lan Zhang	PDF	N/A	Formalizing Complex Mathematical Statements with LLMs: A Study on Mathematical Definitions
基于GLTR方法的AI生成文本检测	Lucía Yan Wu	PDF	N/A	AI-generated Text Detection with a GLTR-based Approach
低秩细化	Annabelle Michael Carrell	PDF	N/A	Low-Rank Thinning
一项关于出行感知的调查，旨在为基于代理的主观出行方式选择模拟器提供信息	Carole Adam	PDF	N/A	A survey about perceptions of mobility to inform an agent-based simulator of subjective modal choice
文化不是琐事：面向文化自然语言处理的社会文化理论	Naitian Zhou	PDF	N/A	Culture is Not Trivia: Sociocultural Theory for Cultural NLP
设计角色向量以改进LLM推理行为	Daniele Potertì	PDF	N/A	Designing Role Vectors to Improve LLM Inference Behaviour
PhysReason: 一个面向物理推理的综合基准	Xinyu Zhang	PDF	N/A	PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning
双视角NLG元评估框架：自动基准与更高解释性	Xinyu Hu	PDF	N/A	A Dual-Perspective NLG Meta-Evaluation Framework with Automatic Benchmark and Better Interpretability
如何利用缩放法则提升神经网络性能？一份调查与实践指南	Ayan Sengupta	PDF	N/A	How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines
SpeechT: 首届语音翻译导师项目成果	Yasmin Moslem	PDF	N/A	SpeechT: Findings of the First Mentorship in Speech Translation
使用可解释的机器学习对病毒样颗粒的化学计量进行分类	Jiayang Zhang	PDF	N/A	Classifying the Stoichiometry of Virus-like Particles with Interpretable Machine Learning
关于桥接脑电图信号与生成式人工智能的调查：从图像和文本到更广阔的领域	Shreya Shukla	PDF	N/A	A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond
BERT的几何结构	Matteo Bonino	PDF	N/A	The geometry of BERT
自监督音频表示学习中的掩码潜在预测与分类	Aurian Quelennec	PDF	N/A	Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning
KnowPath：通过基于知识图谱的LLM生成推理路径实现知识增强的推理	Qi Zhao	PDF	N/A	KnowPath: Knowledge-enhanced Reasoning via LLM-generated Inference Paths over Knowledge Graphs
提升透明物体姿态估计：GDR-Net与边缘检测的融合	Tessa Pulli	PDF	N/A	Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection
SafeChain：具备长链思维推理能力的语言模型的安全性	Fengqing Jiang	PDF	N/A	SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
因材施教：数学问题解决中的自适应推理	Xin Xu	PDF	N/A	Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving
在多场相干伊辛机中学习	Daan de Bos	PDF	N/A	Learning in a Multifield Coherent Ising Machine
马尔可夫大语言模型测试时扩展的思维原子				Atom of Thoughts for Markov LLM Test-Time Scaling
精炼的离线赌博机问题的PAC-Bayes边界				Refined PAC-Bayes Bounds for Offline Bandits
假设驱动的大语言模型心智理论推理				Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models

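In recent years, large language models (LLMs) have achieved remarkable success across a wide range of natural language processing tasks. However, these models usually need to be fine-tuned for specific tasks to reach their full potential. Although fine-tuning is widely used in practice, its internal mechanisms are still not fully understood.

This paper aims to understand the mechanisms of LLM fine-tuning through the lens of circuit analysis. We treat fine-tuning as a modification of the model's internal "circuits" and examine how these modifications affect the model's behavior. Specifically, we will focus on the following aspects:

Identifying key circuit components: We will use techniques such as activation analysis and gradient analysis to identify the model components that change significantly during fine-tuning. These components may correspond to specific neurons, attention heads, or network layers.
Analyzing the effects of circuit modifications: We will study how modifications to these key components affect the model's output. For example, we will examine how modifying a specific attention head changes which parts of the input text the model attends to.
Building a theoretical framework for fine-tuning mechanisms: Based on our analysis, we will attempt to build a theoretical framework that explains the mechanisms of LLM fine-tuning. This framework should help us better understand the fine-tuning process and guide the design of more effective fine-tuning strategies.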
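
The component-identification step described above can be illustrated with a small sketch. It is not taken from the paper: it assumes that per-component activations from the base and fine-tuned models have already been recorded on the same inputs, and it uses mean absolute activation change as a simple stand-in for "activation analysis". The component names, the numbers, and the helper functions `mean_abs_change` and `rank_changed_components` are all hypothetical.

```python
from typing import Dict, List, Sequence, Tuple


def mean_abs_change(before: Sequence[float], after: Sequence[float]) -> float:
    """Mean absolute difference between two equal-length activation vectors."""
    if len(before) != len(after):
        raise ValueError("activation vectors must have the same length")
    if not before:
        raise ValueError("activation vectors must be non-empty")
    return sum(abs(a - b) for a, b in zip(after, before)) / len(before)


def rank_changed_components(
    base: Dict[str, Sequence[float]],
    tuned: Dict[str, Sequence[float]],
    top_k: int = 3,
) -> List[Tuple[str, float]]:
    """Rank components present in both models by activation change, largest first."""
    shared = sorted(set(base) & set(tuned))
    scores = [(name, mean_abs_change(base[name], tuned[name])) for name in shared]
    scores.sort(key=lambda item: item[1], reverse=True)
    return scores[:top_k]


# Hypothetical recorded activations (assumption: same inputs for both models).
base_acts = {
    "layer0.head1": [0.10, 0.20, 0.30],
    "layer0.head2": [0.50, 0.50, 0.50],
    "layer1.head0": [1.00, 0.00, 1.00],
}
tuned_acts = {
    "layer0.head1": [0.10, 0.25, 0.30],
    "layer0.head2": [0.90, 0.10, 0.70],
    "layer1.head0": [1.00, 0.00, 1.00],
}

ranking = rank_changed_components(base_acts, tuned_acts)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In this toy data the head whose activations moved the most is ranked first and the unchanged head last. A real analysis would likely combine such a score with gradient-based or causal measures rather than rely on raw activation differences alone.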

充分利用LLM内部状态以增强知识边界感知				Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception