| Tex4D: 利用视频扩散模型实现零样本4D场景纹理化 |
Jingzhi Bao |
PDF |
N/A |
Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models |
| 感知对齐何时有益于视觉表征? |
Shobhita Sundaram |
PDF |
N/A |
When Does Perceptual Alignment Benefit Vision Representations? |
| TemporalBench:多模态视频模型细粒度时间理解基准测试 |
Mu Cai |
PDF |
N/A |
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models |
| DuoAttention:利用检索和流式处理头高效处理长上下文LLM推理 |
Guangxuan Xiao |
PDF |
N/A |
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads |
| LVD-2M:一个带有时间密集字幕的长镜头视频数据集 |
Tianwei Xiong |
PDF |
N/A |
LVD-2M: A Long-take Video Dataset with Temporally Dense Captions |
| 使用可扩展的合成数据实现任意视频深度估计 |
Honghui Yang |
PDF |
N/A |
Depth Any Video with Scalable Synthetic Data |
| LongMemEval:在长期互动记忆上评估聊天助手 |
Di Wu |
PDF |
N/A |
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory |
| 你的混合专家大型语言模型实际上是一个免费的嵌入模型 |
Ziyue Li |
PDF |
N/A |
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free |
| HART:高效视觉生成的混合自回归Transformer |
Haotian Tang |
PDF |
N/A |
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer |
| 深度线性探针生成器用于权重空间学习 |
Jonathan Kahana |
PDF |
N/A |
Deep Linear Probe Generators for Weight Space Learning |
| 文本生成中的局部解码和全局解码 |
Daniel Gareev |
PDF |
N/A |
Local and Global Decoding in Text Generation |
| 具有普遍逼近保证的硬约束神经网络 |
Youngjae Min |
PDF |
N/A |
Hard-Constrained Neural Networks with Universal Approximation Guarantees |
| TL-PCA:主成分分析的迁移学习 |
Sharon Hendy |
PDF |
N/A |
TL-PCA: Transfer Learning of Principal Component Analysis |
| TrajDiffuse:一种用于环境感知轨迹预测的条件扩散模型 |
Qingze |
PDF |
N/A |
TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction |
| 具有改进3D扩散策略的通用类人操作 |
Yanjie Ze |
PDF |
N/A |
Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies |
| 增强视频扩散变换器的相机运动控制 |
Soon Yau Cheong |
PDF |
N/A |
Boosting Camera Motion Control for Video Diffusion Transformers |
| 混合数据还是合并模型?为多样化的多任务学习进行优化 |
Aakanksha |
PDF |
N/A |
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning |
| 面向3D视觉的基础模型:我们离目标还有多远? |
Yiming Zuo |
PDF |
N/A |
Towards Foundation Models for 3D Vision: How Close Are We? |
| MMAR:迈向无损多模态自回归概率建模 |
Jian Yang |
PDF |
N/A |
MMAR: Towards Lossless Multi-Modal Auto-Regressive Prababilistic Modeling |
| 上下文参数逆向:为什么指令微调可能实际上不会提高上下文依赖性 |
Sachin Goyal |
PDF |
N/A |
Context-Parametric Inversion: Why Instruction Finetuning May Not Actually Improve Context Reliance |
| 使用校正随机微分方程进行语义图像反演与编辑 |
Litu Rout |
PDF |
N/A |
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations |
| 条件感知的多模态融合用于驾驶场景的鲁棒语义感知 |
Tim Broedermann |
PDF |
N/A |
Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes |
| 情景喜剧创作者:一种基于情节驱动的三维场景中人体运动生成系统 |
Jianqi Chen |
PDF |
N/A |
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes |
| 关于预测不确定性的信息论度量 |
Kajetan Schweighofer |
PDF |
N/A |
On Information-Theoretic Measures of Predictive Uncertainty |
| LiveXiv -- 一个基于Arxiv论文内容的多模态实时基准 |
Nimrod Shabtay |
PDF |
N/A |
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content |
| 3DArticCyclists:生成用于人-物体交互(HOI)和自动驾驶应用的模拟动态3D骑车者 |
Eduardo R. Corral-Soto |
PDF |
N/A |
3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications |
| 当注意力下沉现象在语言模型中出现:一个实证视角 |
Xiangming Gu |
PDF |
N/A |
When Attention Sink Emerges in Language Models: An Empirical View |
| ControlMM:可控的掩码运动生成 |
Ekkasit Pinyoanuntapong |
PDF |
N/A |
ControlMM: Controllable Masked Motion Generation |
| 聚焦式ReAct:通过反复迭代和早期停止改进ReAct |
Shuoqiu Li |
PDF |
N/A |
Focused ReAct: Improving ReAct through Reiterate and Early Stop |
| UniMatch V2:推动半监督语义分割的极限 |
Lihe Yang |
PDF |
N/A |
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation |
| Cavia:一种利用视图集成注意力机制的摄像机可控多视角视频扩散模型 |
Dejia Xu |
PDF |
N/A |
Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention |
| 增强JEPAs与空间条件:鲁棒且高效的表征学习 |
Etai Littwin |
PDF |
N/A |
Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning |
| 自适应扩散地形生成器,用于自主不平地形导航 |
Youwei Yu |
PDF |
N/A |
Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation |
| AFlow:自动化代理工作流程生成 |
Jiayi Zhang |
PDF |
N/A |
AFlow: Automating Agentic Workflow Generation |
| 针对大型语言模型的拒绝服务中毒攻击 |
Kuofeng Gao |
PDF |
N/A |
Denial-of-Service Poisoning Attacks against Large Language Models |
| SplitLLM:用于模型放置和吞吐量优化的协同推理 |
Akrit Mudvari |
PDF |
N/A |
SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization |
| 基于相关矩阵的图神经网络心律失常分类 |
Seungwoo Han |
PDF |
N/A |
Arrhythmia Classification Using Graph Neural Networks Based on Correlation Matrix |
| 目前使用随机选择:基于大语言模型的文本增强分类中的少样本选择策略研究 |
Jan Cegin |
PDF |
N/A |
Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation for Classification |
| DragEntity:利用实体和位置关系进行轨迹引导的视频生成 |
Zhang Wan |
PDF |
N/A |
DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships |
| FlexGen:从文本和图像输入生成灵活的多视图内容 |
Xinli Xu |
PDF |
N/A |
FlexGen: Flexible Multi-View Generation from Text and Image Inputs |
| 使用李雅普诺夫稳定嵌入进行对抗鲁棒的分布外检测 |
Hossein Mirzaei |
PDF |
N/A |
Adversarially Robust Out-of-Distribution Detection Using Lyapunov-Stabilized Embeddings |
| NT-LLM:一种新颖的节点标记器,用于将图结构整合到大语言模型中 |
Yanbiao Ji |
PDF |
N/A |
NT-LLM: A Novel Node Tokenizer for Integrating Graph Structure into Large Language Models |
| SensorBench: 基于编码的传感器处理中的大语言模型基准测试 |
Pengrui Quan |
PDF |
N/A |
SensorBench: Benchmarking LLMs in Coding-Based Sensor Processing |
| 平衡连续预训练与指令微调:优化大型语言模型中的指令遵循 |
Ishan Jindal |
PDF |
N/A |
Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs |
| DrivingDojo数据集:推动交互式与知识丰富的驾驶世界模型的发展 |
Yuqi Wang |
PDF |
N/A |
DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model |
| 在线统计推断用于时变样本平均Q-学习 |
Saunak Kumar Panda |
PDF |
N/A |
Online Statistical Inference for Time-varying Sample-averaged Q-learning |
| 面向对抗鲁棒拒绝选项分类的校准损失 |
Vrund Shah |
PDF |
N/A |
Towards Calibrated Losses for Adversarial Robust Reject Option Classification |
| 将自我纠错作为大型语言模型的一种内在能力嵌入,以增强数学推理能力 |
Kuofeng Gao |
PDF |
N/A |
Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning |
| 高效高分辨率扩散模型的深度压缩自编码器 |
Junyu Chen |
PDF |
N/A |
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models |
| 面向大语言模型引导的高效且可解释的多线性张量网络秩选择 |
Giorgos Iacovides |
PDF |
N/A |
Towards LLM-guided Efficient and Interpretable Multi-linear Tensor Network Rank Selection |
| 图像配准中的一个反例 |
Serap A. Savari |
PDF |
N/A |
A Counterexample in Image Registration |
| 大型语言模型在自然语言生成评估中充当积极批评者 |
Shuying Xu |
PDF |
N/A |
Large Language Models Are Active Critics in NLG Evaluation |
| 4-LEGS:4D语言嵌入高斯光栅化 |
Gal Fiebelman |
PDF |
N/A |
4-LEGS: 4D Language Embedded Gaussian Splatting |
| SeedLM:将大型语言模型的权重压缩成伪随机生成器的种子 |
Rasoul Shafipour |
PDF |
N/A |
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators |
| 受益于量子?Q-Seg、量子启发技术和U-Net在裂缝分割中的比较研究 |
Akshaya Srinivasan |
PDF |
N/A |
Benefiting from Quantum? A Comparative Study of Q-Seg, Quantum-Inspired Techniques, and U-Net for Crack Segmentation |
| 结合ConvNeXt V2和MaxViT的模型用于长尾分布的CXR分类,并通过基于视角的聚合方法进行优化 |
Yosuke Yamagishi |
PDF |
N/A |
Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation |
| 利用YOLOv8和YOLOv11深度学习模型进行急性淋巴细胞白血病的早期诊断 |
Alaa Awad |
PDF |
N/A |
Early Diagnoses of Acute Lymphoblastic Leukemia Using YOLOv8 and YOLOv11 Deep Learning Models |
| 脱轨:通过自我发现的线索进行多轮LLM越狱攻击 |
Qibing Ren |
PDF |
N/A |
Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues |
| TALK-Act:通过扩散模型增强2D说话头像重演的纹理感知能力 |
Jiazhi Guan |
PDF |
N/A |
TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model |
| 使用植入式微电极阵列分离多功能神经移植至肌肉的神经驱动 |
Laura Ferrante |
PDF |
N/A |
Separation of Neural Drives to Muscles from Transferred Polyfunctional Nerves using Implanted Micro-electrode Arrays |
| 动态损失函数塑造了地形景观并改进了人工神经网络的学习过程。 |
Eduardo Lavin |
PDF |
N/A |
Dynamical loss functions shape landscape topography and improve learning in artificial neural networks |
| 构建受自然语言处理(NLP)启发的多元时间序列基准数据集 |
Mohammad Asif Ibna Mustafa |
PDF |
N/A |
Building a Multivariate Time Series Benchmarking Datasets Inspired by Natural Language Processing (NLP) |
| SAMPa:锐度感知最小化并行化 |
Wanyun Xie |
PDF |
N/A |
SAMPa: Sharpness-aware Minimization Parallelized |
| 组合多臂老虎机:通过分组测试进行臂选择 |
Arpan Mukherjee |
PDF |
N/A |
Combinatorial Multi-armed Bandits: Arm Selection via Group Testing |
| 双耳聆听:迈向语言驱动的空间音频生成 |
Peiwen Sun |
PDF |
N/A |
Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation |
| 增强深度强化学习中的鲁棒性:一种李雅普诺夫指数方法 |
Rory Young |
PDF |
N/A |
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach |
| 通过矩阵核范数进行大型语言模型评估 |
Yahan Li |
PDF |
N/A |
Large Language Model Evaluation via Matrix Nuclear-Norm |
| 双重风险与大型语言模型在气候影响中的应用:社会经济差异及对非英语使用者效用的降低 |
Aivin V. Solatorio |
PDF |
N/A |
Double Jeopardy and Climate Impact in the Use of Large Language Models: Socio-economic Disparities and Reduced Utility for Non-English Speakers |
| 跨模态少样本学习:一种生成式迁移学习框架 |
Zhengwei Yang |
PDF |
N/A |
Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework |
| 游戏玩法转型:强化学习中DCQN与DTQN架构的比较研究 |
William A. Stigall |
PDF |
N/A |
Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning |
| PCF-Lift:通过概率对比融合实现全景提升 |
Runsong Zhu |
PDF |
N/A |
PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion |
| AutoTurb:利用大型语言模型实现湍流闭合模型的自动代数模型发现 |
Yu Zhang |
PDF |
N/A |
AutoTurb: Using Large Language Models for Automatic Algebraic Model Discovery of Turbulence Closure |
| 不确定性下的导航:基于切换动力系统的轨迹预测与遮挡推理 |
Ran Wei |
PDF |
N/A |
Navigation under uncertainty: Trajectory prediction and occlusion reasoning with switching dynamical systems |
| 任务:通过对比子图嵌入在空间转录组数据上查询功能性和结构性生态位 |
Mo Chen |
PDF |
N/A |
QueST: Querying Functional and Structural Niches on Spatial Transcriptomics Data via Contrastive Subgraph Embedding |
| 生成式人工智能及其对个性化智能辅导系统的影响 |
Subhankar Maity |
PDF |
N/A |
Generative AI and Its Impact on Personalized Intelligent Tutoring Systems |
| 使用自回归表格变换器预测事件的简单基线 |
Alex Stein |
PDF |
N/A |
A Simple Baseline for Predicting Events with Auto-Regressive Tabular Transformers |
| DR-MPC:用于现实世界社交导航的深度残差模型预测控制 |
James R. Han |
PDF |
N/A |
DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation |
| 时空区域级数据的回声状态网络 |
Zhenhua Wang |
PDF |
N/A |
Echo State Networks for Spatio-Temporal Area-Level Data |
| 使用时间得分匹配法进行指数族中的高维微分参数推断 |
Daniel J. Williams |
PDF |
N/A |
High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching |
| Adapt-$\infty$:通过动态数据选择实现可扩展的终身多模态指令调优 |
Adyasha Maharana |
PDF |
N/A |
Adapt-$\infty$: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection |
| 思考型大语言模型:通过思维生成实现通用指令跟随 |
Tianhao Wu |
PDF |
N/A |
Thinking LLMs: General Instruction Following with Thought Generation |
| SANA:利用线性扩散变换器实现高效的高分辨率图像合成 |
Enze Xie |
PDF |
N/A |
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers |
| 通过语言家族专家的混合方法,高效地将医学大型语言模型(LLMs)民主化,适用于50种语言 |
Guorui Zheng |
PDF |
N/A |
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts |
| SensorLLM:将大型语言模型与运动传感器结合用于人体活动识别 |
Zechen Li |
PDF |
N/A |
SensorLLM: Aligning Large Language Models with Motion Sensors for Human Activity Recognition |
| 鲁棒的相位检索梯度下降法 |
Alex Buna |
PDF |
N/A |
Robust Gradient Descent for Phase Retrieval |
| 建模新闻互动与影响以进行金融市场预测 |
Mengyu Wang |
PDF |
N/A |
Modeling News Interactions and Influence for Financial Market Prediction |
| 智能勘探者v2.0:在认知模型不确定性下的勘探钻井规划 |
John Mern |
PDF |
N/A |
Intelligent prospector v2.0: exploration drill planning under epistemic model uncertainty |
| Lambda-跳跃连接:防止秩崩溃的结构组件 |
Federico Arangath Joseph |
PDF |
N/A |
Lambda-Skip Connections: the architectural component that prevents Rank Collapse |
| BrainMVP:利用多参数MRI进行脑图像分析的多模态视觉预训练 |
Shaohao Rui |
PDF |
N/A |
BrainMVP: Multi-modal Vision Pre-training for Brain Image Analysis using Multi-parametric MRI |
| 通过实践克服经典挑战的神经网络 |
Kazuki Irie |
PDF |
N/A |
Neural networks that overcome classic challenges through practice |
| VisRAG:基于视觉的多模态文档检索增强生成 |
Shi Yu |
PDF |
N/A |
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents |
| 认知雷达的在线波形选择 |
Thulasi Tholeti |
PDF |
N/A |
Online waveform selection for cognitive radar |
| MoTE:协调视觉-语言到视频知识转移中的泛化与专业化 |
Minghao Zhu |
PDF |
N/A |
MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer |
| TRESTLE:结构化领域中的概念形成模型 |
Christopher J. MacLellan |
PDF |
N/A |
TRESTLE: A Model of Concept Formation in Structured Domains |
| TopoFR:深入探讨拓扑对齐在人脸识别中的应用 |
Jun Dan |
PDF |
N/A |
TopoFR: A Closer Look at Topology Alignment on Face Recognition |
| Tübingen-CL 在 SemEval-2024 任务 1 中:基于集成学习的语义相关性估计 |
Leixin Zhang |
PDF |
N/A |
Tübingen-CL at SemEval-2024 Task 1:Ensemble Learning for Semantic Relatedness Estimation |
| STACKFEED:结合反馈的结构化文本型行动者-评论家知识库编辑 |
Naman Gupta |
PDF |
N/A |
STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBack |
| 多语言控制生成与黄金标准无关的代码混合句子评估 |
Ayushman Gupta |
PDF |
N/A |
Multilingual Controlled Generation And Gold-Standard-Agnostic Evaluation of Code-Mixed Sentences |
| 燃烧的红色:在平均奖励马尔可夫决策过程中解锁子任务驱动的强化学习和风险意识 |
Juan Sebastian Rojas |
PDF |
N/A |
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes |
| 零样本词性标注的配方:在现实场景中是否有用? |
Zeno Vandenbulcke |
PDF |
N/A |
Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios? |
| 可查询原型多实例学习与视觉语言模型用于增量全切片图像分类 |
Jiaxiang Gou |
PDF |
N/A |
Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification |
| 正则化鲁棒可靠学习器与实例目标攻击 |
Avrim Blum |
PDF |
N/A |
Regularized Robustly Reliable Learners and Instance Targeted Attacks |
| 当先例发生冲突时 |
Cecilia Di Florio |
PDF |
N/A |
When Precedents Clash |
| MEGA-Bench:将多模态评估扩展到超过500个现实世界任务 |
Jiacheng Chen |
PDF |
N/A |
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks |
| 结构依赖性是否为高效沟通而塑造?:一项关于协调的案例研究 |
Kohei Kajikawa |
PDF |
N/A |
Is Structure Dependence Shaped for Efficient Communication?: A Case Study on Coordination |
| ROSAR:一种用于鲁棒侧扫声呐目标检测的对抗性再训练框架 |
Martin Aubard |
PDF |
N/A |
ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection |
| SLaNC:静态层归一化校准 |
Mahsa Salmani |
PDF |
N/A |
SLaNC: Static LayerNorm Calibration |
| 保护心脏完整性:一种融入拓扑学的全心脏分割方法 |
Chenyu Zhang |
PDF |
N/A |
Preserving Cardiac Integrity: A Topology-Infused Approach to Whole Heart Segmentation |
| RICASSO:通过类感知自监督异常值暴露增强的不平衡学习 |
Xuan Zhang |
PDF |
N/A |
RICASSO: Reinforced Imbalance Learning with Class-Aware Self-Supervised Outliers Exposure |
| 混合Transformer用于早期阿尔茨海默病检测:结合基于手写体的2D图像和1D信号特征 |
Changqing Gong |
PDF |
N/A |
Hybrid Transformer for Early Alzheimer's Detection: Integration of Handwriting-Based 2D Images and 1D Signal Features |
| 通过Hodgelet谱特征进行图分类的高斯过程 |
Mathieu Alain |
PDF |
N/A |
Graph Classification Gaussian Processes via Hodgelet Spectral Features |
| 在大语言模型时代重新思考现实场景中的法律判决预测 |
Shubham Kumar Nigam |
PDF |
N/A |
Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models |
| 基于数据的方法用于建模目标行为 |
Isabel Schlangen |
PDF |
N/A |
Data-Driven Approaches for Modelling Target Behaviour |
| 基于可重复机器学习的语音病理检测:引入音高差异特征 |
Jan Vrba |
PDF |
N/A |
Reproducible Machine Learning-based Voice Pathology Detection: Introducing the Pitch Difference Feature |
| 多元时间序列的透明网络 |
Minkyu Kim |
PDF |
N/A |
Transparent Networks for Multivariate Time Series |
| 在数据驱动的监督深度学习中,无法收敛到全局最小值:Adam和随机梯度下降优化在训练具有ReLU激活的深度神经网络时,证明无法收敛到全局最小值。 |
Sonja Hannibal |
PDF |
N/A |
Non-convergence to global minimizers in data driven supervised deep learning: Adam and stochastic gradient descent optimization provably fail to converge to global minimizers in the training of deep neural networks with ReLU activation |
| 无需自适应内存需求的自适应概率ODE求解器 |
Nicholas Krämer |
PDF |
N/A |
Adaptive Probabilistic ODE Solvers Without Adaptive Memory Requirements |
| 在复杂且非平面场景中进行运动引导的小型微型飞行器检测 |
Hanqing Guo |
PDF |
N/A |
Motion-guided small MAV detection in complex and non-planar scenes |
| 摆脱任务隔离:一种连续多任务时空学习框架 |
Zhongchao Yi |
PDF |
N/A |
Get Rid of Task Isolation: A Continuous Multi-task Spatio-Temporal Learning Framework |
| 逆问题与数据同化:一种机器学习方法 |
Eviatar Bach |
PDF |
N/A |
Inverse Problems and Data Assimilation: A Machine Learning Approach |
| 持续深度强化学习以防止干扰缓解中的灾难性遗忘 |
Kemal Davaslioglu |
PDF |
N/A |
Continual Deep Reinforcement Learning to Prevent Catastrophic Forgetting in Jamming Mitigation |
| 基于人工智能的闪烁纤维成像传感器粒子轨迹识别 |
Noemi Bührer |
PDF |
N/A |
AI-based particle track identification in scintillating fibres read out with imaging sensors |
| UniGEM:一种统一分子生成与性质预测的方法 |
Shikun Feng |
PDF |
N/A |
UniGEM: A Unified Approach to Generation and Property Prediction for Molecules |
| 我们是否需要更复杂的结构表示?对音乐变压器的音符时长表示进行比较 |
Gabriel Souza |
PDF |
N/A |
Do we need more complex representations for structure? A comparison of note duration representation for Music Transformers |
| 自定义您的视觉自回归配方,使用集合自回归建模 |
Wenze Liu |
PDF |
N/A |
Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling |
| 利用局部特征和范围图像进行小数据实时点云语义分割 |
Daniel Fusaro |
PDF |
N/A |
Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation |
| 基于人工智能的皮肤黑色素细胞病变分级 |
Ruben T. Lucassen |
PDF |
N/A |
Artificial Intelligence-Based Triaging of Cutaneous Melanocytic Lesions |
| 印度次大陆的日常用语 |
Utkarsh Pathak |
PDF |
N/A |
Everyday Speech in the Indian Subcontinent |
| 深度学习与传统方法在疾病发作预测中的比较 |
Luis H. John |
PDF |
N/A |
Comparison of deep learning and conventional methods for disease onset prediction |
| 一种可内核化的多线性奇异值分解的原对偶公式 |
Frederiek Wesel |
PDF |
N/A |
A Kernelizable Primal-Dual Formulation of the Multilinear Singular Value Decomposition |
| 一种对时间上的因果推断的实用方法 |
Martina Cinquini |
PDF |
N/A |
A Practical Approach to Causal Inference over Time |
| 持续学习提升零样本动作识别 |
Shreyank N Gowda |
PDF |
N/A |
Continual Learning Improves Zero-Shot Action Recognition |
| 基于提示的图像编辑的视觉引导和掩码增强自适应去噪 |
Kejie Wang |
PDF |
N/A |
Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing |
| 学习无遗忘的视觉语言模型基础 |
Aritra Bhowmik |
PDF |
N/A |
Learning to Ground VLMs without Forgetting |
| 大型语言模型中的文化保真度:在线语言资源作为价值表示模型性能驱动力的评估 |
Sharif Kazemi |
PDF |
N/A |
Cultural Fidelity in Large-Language Models: An Evaluation of Online Language Resources as a Driver of Model Performance in Value Representation |
| 一种用于评估卫星影像清晰度的新型无参考图像质量指标 |
Lucas Gonzalo Antonel |
PDF |
N/A |
A Novel No-Reference Image Quality Metric For Assessing Sharpness In Satellite Imagery |
| 推进新生儿护理:利用AI驱动的自适应归一化热成像进行精确的出生时间检测 |
Jorge García-Torres |
PDF |
N/A |
Advancing Newborn Care: Precise Birth Time Detection Using AI-Driven Thermal Imaging with Adaptive Normalization |
| 基于模型的差分隐私知识迁移用于大型语言模型 |
Zhaomin Wu |
PDF |
N/A |
Model-Based Differentially Private Knowledge Transfer for Large Language Models |
| TMGBench:一个系统的游戏基准,用于评估LLMs的战略推理能力 |
Haochuan Wang |
PDF |
N/A |
TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs |
| 大型语言模型(LLMs)会取代仅编码器模型在时间关系分类中的地位吗? |
Gabriel Roccabruna |
PDF |
N/A |
Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification? |
| 结构化状态空间模型中的隐性偏见可以通过干净标签被毒害 |
Yonatan Slutzky |
PDF |
N/A |
The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels |
| ReLayout:通过布局增强预训练实现现实世界文档理解 |
Zhouqiang Jiang |
PDF |
N/A |
ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training |
| Moirai-MoE:通过稀疏专家混合赋能时间序列基础模型 |
Xu Liu |
PDF |
N/A |
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts |
| 深度图网络中的信息传播动力学 |
Alessio Gravina |
PDF |
N/A |
Information propagation dynamics in Deep Graph Networks |
| TABCF:使用基于Transformer的VAE为表格数据生成反事实解释 |
Emmanouil Panagiotou |
PDF |
N/A |
TABCF: Counterfactual Explanations for Tabular Data Using a Transformer-Based VAE |
| 多智能体系统的组合屏蔽与强化学习 |
Asger Horn Brorholt |
PDF |
N/A |
Compositional Shielding and Reinforcement Learning for Multi-Agent Systems |
| Ada-K 路由:提升基于 MoE 的大型语言模型效率 |
Tongtian Yue |
PDF |
N/A |
Ada-K Routing: Boosting the Efficiency of MoE-based LLMs |
| 通过增强型表示相似性融合推进学术知识检索 |
Wei Dai |
PDF |
N/A |
Advancing Academic Knowledge Retrieval via LLM-enhanced Representation Similarity Fusion |
| 通过改进元学习方法,利用从任务中获取的所有可用信息,提升少样本文本分类的性能。 |
Xinyue Liu |
PDF |
N/A |
Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks |
| 自我评估生成:真实世界中光流和立体匹配的可信标签生成 |
Han Ling |
PDF |
N/A |
Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world |
| 基于原则的贝叶斯优化与人类专家协作 |
Wenjie Xu |
PDF |
N/A |
Principled Bayesian Optimisation in Collaboration with Human Experts |
| 移动性感知的联邦学习:基于多臂赌博机的车辆网络选择 |
Haoyu Tu |
PDF |
N/A |
Mobility-Aware Federated Learning: Multi-Armed Bandit Based Selection in Vehicular Network |
| KBLaM:知识库增强的语言模型 |
Xi Wang |
PDF |
N/A |
KBLaM: Knowledge Base augmented Language Model |
| QUITE:在贝叶斯推理场景中量化自然语言文本中的不确定性 |
Timo Pierre Schrader |
PDF |
N/A |
QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios |
| 用于完全测试时适应的域条件变换器 |
Yushun Tang |
PDF |
N/A |
Domain-Conditioned Transformer for Fully Test-time Adaptation |
| 自由视频-大语言模型:提示引导的视觉感知,实现高效的无训练视频大语言模型 |
Kai Han |
PDF |
N/A |
Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs |
| 向个性化文本到图像扩散模型中未经授权数据使用的可靠验证迈进 |
Boheng Li |
PDF |
N/A |
Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models |
| LKASeg:利用大核注意力与全尺度跳跃连接进行遥感图像语义分割 |
Xuezhi Xiang |
PDF |
N/A |
LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections |
| 多样性感知的强化学习用于从头药物设计 |
Hampus Gummesson Svensson |
PDF |
N/A |
Diversity-Aware Reinforcement Learning for de novo Drug Design |
| DOME:将扩散模型驯化为高保真可控的占用世界模型 |
Songen Gu |
PDF |
N/A |
DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model |
| 一种用于超参数优化和元学习的双层优化的随机方法 |
Minyoung Kim |
PDF |
N/A |
A Stochastic Approach to Bi-Level Optimization for Hyperparameter Optimization and Meta Learning |
| 耦合自回归主动推理代理用于多关节动力系统的控制 |
Tim N. Nisslbeck |
PDF |
N/A |
Coupled autoregressive active inference agents for control of multi-joint dynamical systems |
| 基于大语言模型的防护模型校准以实现可靠内容审核 |
Hongfu Liu |
PDF |
N/A |
On Calibration of LLM-based Guard Models for Reliable Content Moderation |
| 4DStyleGaussian:基于高斯光栅化的零样本4D风格迁移 |
Wanlin Liang |
PDF |
N/A |
4DStyleGaussian: Zero-shot 4D Style Transfer with Gaussian Splatting |
| Medico:基于多源证据融合的幻觉检测与校正 |
Xinping Zhao |
PDF |
N/A |
Medico: Towards Hallucination Detection and Correction with Multi-source Evidence Fusion |
| MMCFND:面向低资源印度语言的多模态多语言描述感知假新闻检测 |
Shubhi Bansal |
PDF |
N/A |
MMCFND: Multimodal Multilingual Caption-aware Fake News Detection for Low-resource Indic Languages |
| 确定性苹果品尝 |
Zachary Chase |
PDF |
N/A |
Deterministic Apple Tasting |
| 使用可微模板参数化结构以进行三维形状生成 |
Changfeng Ma |
PDF |
N/A |
Parameterize Structure with Differentiable Template for 3D Shape Generation |
| FairMindSim: 伦理困境中人类与LLM代理的行为、情感与信念的协调 |
Yu Lei |
PDF |
N/A |
FairMindSim: Alignment of Behavior, Emotion, and Belief in Humans and LLM Agents Amid Ethical Dilemmas |
| 更严格的专家混合风险界限 |
Wissam Akretche |
PDF |
N/A |
Tighter Risk Bounds for Mixtures of Experts |
| 贝叶斯神经网络的深度估计改进 |
Bart van Erp |
PDF |
N/A |
Improved Depth Estimation of Bayesian Neural Networks |
| PIVOT-R:用于机器人操作的原始驱动航点感知世界模型 |
Kaidong Zhang |
PDF |
N/A |
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation |
| GIFT-Eval:一个通用时间序列预测模型评估基准 |
Taha Aksu |
PDF |
N/A |
GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation |
| 优化指令合成:利用树搜索有效探索进化空间 |
Chenglin Li |
PDF |
N/A |
Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search |
| 斯坦变分进化策略 |
Cornelius V. Braun |
PDF |
N/A |
Stein Variational Evolution Strategies |
| 逆向精细化网络用于高分辨率卫星影像中狭窄乡村道路的检测 |
Ningjing Wang |
PDF |
N/A |
Reverse Refinement Network for Narrow Rural Road Detection in High-Resolution Satellite Imagery |
| 具有未知超参数的贝叶斯优化:对数更接近最优的遗憾界限 |
Juliusz Ziomek |
PDF |
N/A |
Bayesian Optimisation with Unknown Hyperparameters: Regret Bounds Logarithmically Closer to Optimal |
| V2M:用于图像表示学习的视觉二维Mamba |
Chengkun Wang |
PDF |
N/A |
V2M: Visual 2-Dimensional Mamba for Image Representation Learning |
| 基于非负/二值矩阵分解的协同过滤 |
Yukino Terui |
PDF |
N/A |
Collaborative filtering based on nonnegative/binary matrix factorization |
| 学习计算机网络中的亚秒级路由优化需要了解数据包级别的动态特性。 |
Andreas Boltres |
PDF |
N/A |
Learning Sub-Second Routing Optimization in Computer Networks requires Packet-Level Dynamics |
| 阿尔茨海默病诊断与早期检测的类别平衡多样性多模态集成方法 |
Arianna Francesconi |
PDF |
N/A |
Class Balancing Diversity Multimodal Ensemble for Alzheimer's Disease Diagnosis and Early Detection |
| 锐度感知最小化在训练后期有效地选择更平坦的最小值 |
Zhanpeng Zhou |
PDF |
N/A |
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training |
| 书虫:角色描述与分析数据集 |
Argyrios Papoudakis |
PDF |
N/A |
BookWorm: A Dataset for Character Description and Analysis |
| 格罗宁根:利用选定的增强井和地震数据进行岩石气体饱和度的空间预测,采用分类器集成方法 |
Dmitry Ivlev |
PDF |
N/A |
Groningen: Spatial Prediction of Rock Gas Saturation by Leveraging Selected and Augmented Well and Seismic Data with Classifier Ensembles |
| 创新思维,无限幽默:通过结构化思维跳跃对大型语言模型进行幽默研究 |
Han Wang |
PDF |
N/A |
Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps |
| 计算稀疏图上一般随机游走图核的最优时间复杂度算法 |
Krzysztof Choromanski |
PDF |
N/A |
Optimal Time Complexity Algorithms for Computing General Random Walk Graph Kernels on Sparse Graphs |
| 亲和图引导的收缩学习用于极少标注的无前置任务医学图像分割 |
Zehua Cheng |
PDF |
N/A |
Affinity-Graph-Guided Contractive Learning for Pretext-Free Medical Image Segmentation with Minimal Annotation |
| SpeGCL:无正样本的自监督图谱对比学习 |
Yuntao Shou |
PDF |
N/A |
SpeGCL: Self-supervised Graph Spectrum Contrastive Learning without Positive Samples |
| 育儿:通过参数解耦和定制调优优化检索增强语言模型的知识选择 |
Yongxin Xu |
PDF |
N/A |
Parenting: Optimizing Knowledge Selection of Retrieval-Augmented Language Models with Parameter Decoupling and Tailored Tuning |
| FasterDiT:在不修改架构的情况下实现更快的扩散Transformer训练 |
Jingfeng Yao |
PDF |
N/A |
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification |
| 使用BiFormer注意力机制和多路径扩张卷积的耻骨联合-胎头分割网络 |
Pengzhou Cai |
PDF |
N/A |
Pubic Symphysis-Fetal Head Segmentation Network Using BiFormer Attention Mechanism and Multipath Dilated Convolution |
| 在深度学习背景下表示三维旋转 |
Viktória Pravdová |
PDF |
N/A |
On Representation of 3D Rotation in the Context of Deep Learning |
| 基于大型语言模型的代码转换文本生成用于语法错误纠正 |
Tom Potter |
PDF |
N/A |
LLM-based Code-Switched Text Generation for Grammatical Error Correction |
| 通过自动数据标注和优化增强大型语言模型中的上下文学习 |
Joseph Shtok |
PDF |
N/A |
Augmenting In-Context-Learning in LLMs via Automatic Data Labeling and Refinement |
| 一种针对大型语言模型的统一路由与级联方法 |
Jasper Dekoninck |
PDF |
N/A |
A Unified Approach to Routing and Cascading for LLMs |
| 锁定微调后的大型语言模型(LLMs)的安全性 |
Minjun Zhu |
PDF |
N/A |
Locking Down the Finetuned LLMs Safety |
| 重放与遗忘自由的图类增量学习:一种任务分析与提示方法 |
Chaoxi Niu |
PDF |
N/A |
Replay-and-Forget-Free Graph Class-Incremental Learning: A Task Profiling and Prompting Approach |
| CoMAT:数学注释思维链提升数学推理能力 |
Joshua Ong Jun Leang |
PDF |
N/A |
CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning |
| 解构针对不同目标身份的仇恨 |
Yiping Jin |
PDF |
N/A |
Disentangling Hate Across Target Identities |
| 解剖特征优先损失以增强MR到CT的转换 |
Arthur Longuefosse |
PDF |
N/A |
Anatomical feature-prioritized loss for enhanced MR to CT translation |