Arxiv 2024-10-30 Papers

| Title | Author | PDF Link | Code Repository |
| --- | --- | --- | --- |
| Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards | Irmak Guzey | PDF | N/A |
| ReferEverything: Towards Segmenting Everything We Can Speak of in Videos | Anurag Bagchi | PDF | N/A |
| Provable acceleration for diffusion models under minimal assumptions | Gen Li | PDF | N/A |
| RelationBooth: Towards Relation-Aware Customized Object Generation | Qingyu Shi | PDF | N/A |
| A Neural Transformer Framework for Simultaneous Tasks of Segmentation, Classification, and Caller Identification of Marmoset Vocalization | Bin Wu | PDF | N/A |
| OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction | Hongbo Zhao | PDF | N/A |
| SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation | Yining Hong | PDF | N/A |
| Conditional Forecasting of Margin Calls using Dynamic Graph Neural Networks | Matteo Citterio | PDF | N/A |
| Multi-student Diffusion Distillation for Better One-step Generators | Yanke Song | PDF | N/A |
| Proportional Fairness in Non-Centroid Clustering | Ioannis Caragiannis | PDF | N/A |
| A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction | Qidong Yang | PDF | N/A |
| TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models | Ziyao Shangguan | PDF | N/A |
| EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang | PDF | N/A |
| $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources | Apoorv Khandelwal | PDF | N/A |
| Keypoint Abstraction using Large Models for Object-Relative Imitation Learning | Xiaolin Fang | PDF | N/A |
| Evaluating Cultural and Social Awareness of LLM Web Agents | Haoyi Qiu | PDF | N/A |
| bit2bit: 1-bit quanta video reconstruction via self-supervised photon prediction | Yehe Liu | PDF | N/A |
| PointRecon: Online Point-based 3D Reconstruction via Ray-based 2D-3D Matching | Chen Ziwen | PDF | N/A |
| Very fast Bayesian Additive Regression Trees on GPU | Giacomo Petrillo | PDF | N/A |
| A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment | Matteo G. Mecattaf | PDF | N/A |
| Full-waveform earthquake source inversion using simulation-based inference | A. A. Saoulis | PDF | N/A |
| EMOTION: Expressive Motion Sequence Generation for Humanoid Robots with In-Context Learning | Peide Huang | PDF | N/A |
| Attribute-to-Delete: Machine Unlearning via Datamodel Matching | Kristian Georgiev | PDF | N/A |
| LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM | Yucheng Huang | PDF | N/A |
| Aligning Audio-Visual Joint Representations with an Agentic Workflow | Shentong Mo | PDF | N/A |
| Emergence of meta-stable clustering in mean-field transformer models | Giuseppe Bruno | PDF | N/A |
| (FL)$^2$: Overcoming Few Labels in Federated Semi-Supervised Learning | Seungjoo Lee | PDF | N/A |
| COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences | Yixin Liu | PDF | N/A |
| Partial Channel Dependence with Channel Masks for Time Series Foundation Models | Seunghan Lee | PDF | N/A |
| DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET | Yitong Li | PDF | N/A |
| OS-ATLAS: A Foundation Action Model for Generalist GUI Agents | Zhiyong Wu | PDF | N/A |
| Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval | Sheryl Hsu | PDF | N/A |
| ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting | Muhammad Salman Ali | PDF | N/A |
| Improved convergence rate of kNN graph Laplacians | Yixuan Tan | PDF | N/A |
| Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Michael Matthews | PDF | N/A |
| HEX: Hierarchical Emergence Exploitation in Self-Supervised Algorithms | Kiran Kokilepersaud | PDF | N/A |
| Continuous Spatio-Temporal Memory Networks for 4D Cardiac Cine MRI Segmentation | Meng Ye | PDF | N/A |
| Reliability of Topic Modeling | Kayla Schroeder | PDF | N/A |
| ProTransformer: Robustify Transformers via Plug-and-Play Paradigm | Zhichao Hou | PDF | N/A |
| ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning | Millennium Bismay | PDF | N/A |
| Does equivariance matter at scale? | Johann Brehmer | PDF | N/A |
| Uncertainty quantification for fast reconstruction methods using augmented equivariant bootstrap: Application to radio interferometry | Mostafa Cherif | PDF | N/A |
| Functional Gradient Flows for Constrained Sampling | Shiyue Zhang | PDF | N/A |
| The Persistence of Neural Collapse Despite Low-Rank Bias: An Analytic Perspective Through Unconstrained Features | Connall Garrod | PDF | N/A |
| TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters | Haiyang Wang | PDF | N/A |
| SciPIP: An LLM-based Scientific Paper Idea Proposer | Wenxiao Wang | PDF | N/A |
| FlexTSF: A Universal Forecasting Model for Time Series with Variable Regularities | Jingge Xiao | PDF | N/A |
| Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting | Chiu-Wai Yan | PDF | N/A |
| Directional anomaly detection | Oliver Urs Lenz | PDF | N/A |
| VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning | Yichao Liang | PDF | N/A |
| QWO: Speeding Up Permutation-Based Causal Discovery in LiGAMs | Mohammad Shahverdikondori | PDF | N/A |
| Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe | Songyu Xu | PDF | N/A |
| When can classical neural networks represent quantum states? | Tai-Hsuan Yang | PDF | N/A |
| HiBO: Hierarchical Bayesian Optimization via Adaptive Search Space Partitioning | Wenxuan Li | PDF | N/A |
| FoLDTree: A ULDA-Based Decision Tree Framework for Efficient Oblique Splits and Feature Selection | Siyu Wang | PDF | N/A |
| Statistical Mechanics of Multiplectoneme Phases in DNA | Midas Segers | PDF | N/A |
| Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms | Jordan Meyer | PDF | N/A |
| The Good, the Bad, and the Ugly: The Role of AI Quality Disclosure in Lie Detection | Haimanti Bhattacharya | PDF | N/A |
| FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training | Tejaswini Medi | PDF | N/A |
| Fair Division with Market Values | Siddharth Barman | PDF | N/A |
| Crowdsourcing Lexical Diversity | Hadi Khalilia | PDF | N/A |
| Revisiting MAE pre-training for 3D medical image segmentation | Tassilo Wald | PDF | N/A |
| Compositional Segmentation of Cardiac Images Leveraging Metadata | Abbas Khan | PDF | N/A |
| Federated Learning under Periodic Client Participation and Heterogeneous Data: A New Communication-Efficient Algorithm and Analysis | Michael Crawshaw | PDF | N/A |
| Why Fine-grained Labels in Pretraining Benefit Generalization? | Guan Zhe Hong | PDF | N/A |
| Provably Optimal Memory Capacity for Modern Hopfield Models: Transformer-Compatible Dense Associative Memories as Spherical Codes | Jerry Yao-Chieh Hu | PDF | N/A |
| On Memorization of Large Language Models in Logical Reasoning | Chulin Xie | PDF | N/A |
| Teaching a Language Model to Distinguish Between Similar Details using a Small Adversarial Training Set | Chris Achard | PDF | N/A |
| Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models | Junjie Wu | PDF | N/A |
| Why Gradient Subspace? Identifying and Mitigating LoRA's Bottlenecks in Federated Fine-Tuning of Large Language Models | Navyansh Mahla | PDF | N/A |
| NASM: Neural Anisotropic Surface Meshing | Hongbo Li | PDF | N/A |
| Controllable Game Level Generation: Assessing the Effect of Negative Examples in GAN Models | Mahsa Bazzaz | PDF | N/A |
| Decoupling Semantic Similarity from Spatial Alignment for Neural Networks | Tassilo Wald | PDF | N/A |
| Automated Image-Based Identification and Consistent Classification of Fire Patterns with Quantitative Shape Analysis and Spatial Location Identification | Pengkun Liu | PDF | N/A |
| Guided Game Level Repair via Explainable AI | Mahsa Bazzaz | PDF | N/A |
| Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning | Dong Shu | PDF | N/A |
| First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Atomic Activity Recognition 2024 | Ruyang Li | PDF | N/A |
| CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense | Mingkun Zhang | PDF | N/A |
| CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation | Yiruo Cheng | PDF | N/A |
| PIP-MM: Pre-Integrating Prompt Information into Visual Encoding via Existing MLLM Structures | Tianxiang Wu | PDF | N/A |
| Statistical-Computational Trade-offs for Density Estimation | Anders Aamand | PDF | N/A |
| From Hype to Reality: The Road Ahead of Deploying DRL in 6G Networks | Haiyuan Li | PDF | N/A |
| S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving | Maciej K. Wozniak | PDF | N/A |
| AI-assisted prostate cancer detection and localisation on biparametric MR by classifying radiologist-positives | Xiangcen Wu | PDF | N/A |
| An Event-Based Digital Compute-In-Memory Accelerator with Flexible Operand Resolution and Layer-Wise Weight/Output Stationarity | Nicolas Chauvaux | PDF | N/A |
| BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference | Junqi Zhao | PDF | N/A |
| First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024 | Tengfei Zhang | PDF | N/A |
| Multi-Programming Language Sandbox for LLMs | Shihan Dou | PDF | N/A |
| RSNet: A Light Framework for The Detection of Multi-scale Remote Sensing Targets | Hongyu Chen | PDF | N/A |
| CNN Explainability with Multivector Tucker Saliency Maps for Self-Supervised Models | Aymene Mohammed Bouayed | PDF | N/A |
| LLMs Integration in Software Engineering Team Projects: Roles, Impact, and a Pedagogical Design Space for AI Tools in Computing Education | Ahmed Kharrufa | PDF | N/A |
| Don't Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification | Debjyoti Saharoy | PDF | N/A |
| Controlling Language and Diffusion Models by Transporting Activations | Pau Rodriguez | PDF | N/A |
| Legitimate ground-truth-free metrics for deep uncertainty classification scoring | Arthur Pignet | PDF | N/A |
| Toward Understanding In-context vs. In-weight Learning | Bryan Chan | PDF | N/A |
| Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval | Le Huang | PDF | N/A |
| Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping | Qianxu Wang | PDF | N/A |
| Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation | Samuele Peri | PDF | N/A |
| Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem | Nima Akbarzadeh | PDF | N/A |
| Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback | Qinqing Zheng | PDF | N/A |
| DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes | Jialiang Zhang | PDF | N/A |
| Scoring Rules and Calibration for Imprecise Probabilities | Christian Fröhlich | PDF | N/A |
| Long$^2$RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall | Zehan Qi | PDF | N/A |
| A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics | Jonas Bode | PDF | N/A |
| Semantic Enrichment of the Quantum Cascade Laser Properties in Text- A Knowledge Graph Generation Approach | Deperias Kerre | PDF | N/A |
| VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning | Jingkun Ma | PDF | N/A |
| Dynamic Matching with Post-allocation Service and its Application to Refugee Resettlement | Kirk Bansak | PDF | N/A |
| V2X-Assisted Distributed Computing and Control Framework for Connected and Automated Vehicles under Ramp Merging Scenario | Qiong Wu | PDF | N/A |
| Higher-order Cross-structural Embedding Model for Time Series Analysis | Guancen Lin | PDF | N/A |
| Dual-Optimized Adaptive Graph Reconstruction for Multi-View Graph Clustering | Zichen Wen | PDF | N/A |
| PDSR: Efficient UAV Deployment for Swift and Accurate Post-Disaster Search and Rescue | Alaa Awad Abdellatif | PDF | N/A |
| DisenTS: Disentangled Channel Evolving Pattern Modeling for Multivariate Time Series Forecasting | Zhiding Liu | PDF | N/A |
| LumiSculpt: A Consistency Lighting Control Network for Video Generation | Yuxin Zhang | PDF | N/A |
| Graph Integration for Diffusion-Based Manifold Alignment | Jake S. Rhodes | PDF | N/A |
| Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and Resolution | Shikha Bordia | PDF | N/A |
| Private Synthetic Text Generation with Diffusion Models | Sebastian Ochs | PDF | N/A |
| Dynamic Threshold-based Two-layer Online Unsupervised Anomaly Detector | Yachao Yuan | PDF | N/A |
| Scalable Sampling for High Utility Patterns | Lamine Diop | PDF | N/A |
| A Study of Secure Algorithms for Vertical Federated Learning: Take Secure Logistic Regression as an Example | Huan-Chih Wang | PDF | N/A |
| EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models | Shangquan Sun | PDF | N/A |
| Retrieval-Augmented Generation with Estimation of Source Reliability | Jeongyeon Hwang | PDF | N/A |
| Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation | Wei Dong | PDF | N/A |
| SpiroActive: Active Learning for Efficient Data Acquisition for Spirometry | Ankita Kumari Jain | PDF | N/A |
| MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering | Yizhen Luo | PDF | N/A |
| ELBOing Stein: Variational Bayes with Stein Mixture Inference | Ola Rønning | PDF | N/A |
| KALAM: toolKit for Automating high-Level synthesis of Analog computing systeMs | Ankita Nandi | PDF | N/A |
| Focus On This, Not That! Steering LLMs With Adaptive Feature Specification | Tom A. Lamb | PDF | N/A |
| AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection | Yujin Wang | PDF | N/A |
| DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data | Hanyang Chen | PDF | N/A |
| Thoughtful Adoption of NLP for Civic Participation: Understanding Differences Among Policymakers | Jose A. Guridi | PDF | N/A |
| Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder | Antoine Schnepf | PDF | N/A |
| Multi-Agent Large Language Models for Conversational Task-Solving | Jonas Becker | PDF | N/A |
| An Individual Identity-Driven Framework for Animal Re-Identification | Yihao Wu | PDF | N/A |
| BIS: NL2SQL Service Evaluation Benchmark for Business Intelligence Scenarios | Bora Caglayan | PDF | N/A |
| High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer | Mingxian Li | PDF | N/A |
| Simulation-Free Training of Neural ODEs on Paired Data | Semin Kim | PDF | N/A |
| Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration | Yanchu Guan | PDF | N/A |
| Self-optimization in distributed manufacturing systems using Modular State-based Stackelberg Games | Steve Yuwono | PDF | N/A |
| CopRA: A Progressive LoRA Training Strategy | Zhan Zhuang | PDF | N/A |
| UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration | Geng Li | PDF | N/A |
| Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents | Safwan Labbi | PDF | N/A |
| From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes | Zébulon Goriely | PDF | N/A |
| HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models | Shengkai Zhang | PDF | N/A |
| Wormhole Loss for Partial Shape Matching | Amit Bracha | PDF | N/A |
| YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems | Mujadded Al Rabbani Alif | PDF | N/A |
| Combining psychoanalysis and computer science: an empirical study of the relationship between emotions and the Lacanian discourses | Minas Gadalla | PDF | N/A |
| VPO: Leveraging the Number of Votes in Preference Optimization | Jae Hyeon Cho | PDF | N/A |
| Effective and Efficient Adversarial Detection for Vision-Language Models via A Single Vector | Youcheng Huang | PDF | N/A |
| Generalization Bounds via Conditional $f$-Information | Ziqiao Wang | PDF | N/A |
| Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies | Suchir Salhan | PDF | N/A |
| Stealing User Prompts from Mixture of Experts | Itay Yona | PDF | N/A |
| Adaptive Paradigm Synergy: Can a Cross-Paradigm Objective Enhance Long-Tailed Learning? | Haowen Xiao | PDF | N/A |
| SFA-UNet: More Attention to Multi-Scale Contrast and Contextual Information in Infrared Small Object Segmentation | Imad Ali Shah | PDF | N/A |
| Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations | Leonardo Ranaldi | PDF | N/A |
| Data subsampling for Poisson regression with pth-root-link | Han Cheng Lie | PDF | N/A |
| Conditioned quantum-assisted deep generative surrogate for particle-calorimeter interactions | J. Quetzalcoatl Toledo-Marin | PDF | N/A |
| Towards Population Scale Testis Volume Segmentation in DIXON MRI | Jan Ernsting | PDF | N/A |
| Prune and Repaint: Content-Aware Image Retargeting for any Ratio | Feihong Shen | PDF | N/A |
| AtGCN: A Graph Convolutional Network For Ataxic Gait Detection | Karan Bania | PDF | N/A |
| DAVINCI: A Single-Stage Architecture for Constrained CAD Sketch Inference | Ahmet Serdar Karadeniz | PDF | N/A |
| Hyperparameter Optimization in Machine Learning | Luca Franceschi | PDF | N/A |
| Dataset of polarimetric images of mechanically generated water surface waves coupled with surface elevation records by wave gauges linear array | Noam Ginio | PDF | N/A |
| Danoliteracy of Generative, Large Language Models | Søren Vejlgaard Holm | PDF | N/A |
| SFDFusion: An Efficient Spatial-Frequency Domain Fusion Network for Infrared and Visible Image Fusion | Kun Hu | PDF | N/A |
| HijackRAG: Hijacking Attacks against Retrieval-Augmented Large Language Models | Yucheng Zhang | PDF | N/A |
| Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images | Hanlin Wu | PDF | N/A |
| Situational Scene Graph for Structured Human-centric Situation Understanding | Chinthani Sugandhika | PDF | N/A |
| How Well Do Large Language Models Disambiguate Swedish Words? | Richard Johansson | PDF | N/A |
| EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations | Jia Li | PDF | N/A |
| An invariance principle based concentration result for large-scale stochastic pairwise interaction network systems | Giacomo Como | PDF | N/A |
| Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis | Zhiyuan Min | PDF | N/A |
| Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients | Jabin Koo | PDF | N/A |
| Universality of the $π^2/6$ Pathway in Avoiding Model Collapse | Apratim Dey | PDF | N/A |
| Adaptive Multi Scale Document Binarisation Using Vision Mamba | Mohd. Azfar | PDF | N/A |
| Causality-Enhanced Behavior Sequence Modeling in LLMs for Personalized Recommendation | Yang Zhang | PDF | N/A |
| MILP-StuDio: MILP Instance Generation via Block Structure Decomposition | Haoyang Liu | PDF | N/A |
| Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising | Yoto Fujita | PDF | N/A |
| DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection | Yoto Fujita | PDF | N/A |
| Wavelet Burst Accumulation for turbulence mitigation | Jerome Gilles | PDF | N/A |
| Machine Learning Nonadiabatic Dynamics: Eliminating Phase Freedom of Nonadiabatic Couplings with the State-Interaction State-Averaged Spin-Restricted Ensemble-Referenced Kohn-Sham Approach | Sung Wook Moon | PDF | N/A |
| Solving Differential Equations with Constrained Learning | Viggo Moro | PDF | N/A |
| Open Turbulent Image Set (OTIS) | Nicholas B. Ferrante | PDF | N/A |
| Dual Contrastive Transformer for Hierarchical Preference Modeling in Sequential Recommendation | Chengkai Huang | PDF | N/A |
| Theoretical Investigations and Practical Enhancements on Tail Task Risk Minimization in Meta Learning | Yiqin Lv | PDF | N/A |
| Contrastive Learning and Adversarial Disentanglement for Privacy-Preserving Task-Oriented Semantic Communications | Omar Erak | PDF | N/A |
| MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning | Xujia Wang | PDF | N/A |
| Bregman implementation of Meyer's $G-$norm for cartoon + textures decomposition | Jerome Gilles | PDF | N/A |
| Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models | Arash Marioriyad | PDF | N/A |
| Unfolding Target Detection with State Space Model | Luca Jiang-Tao Yu | PDF | N/A |
| Reliability Assessment of Information Sources Based on Random Permutation Set | Juntao Xu | PDF | N/A |
| FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference Images | Zheng Yu | PDF | N/A |
| InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models | Hao Li | PDF | N/A |
| Beyond Ontology in Dialogue State Tracking for Goal-Oriented Chatbot | Sejin Lee | PDF | N/A |
| Self-Driving Car Racing: Application of Deep Reinforcement Learning | Florentiana Yuwono | PDF | N/A |
| An Overview of Causal Inference using Kernel Embeddings | Dino Sejdinovic | PDF | N/A |
| SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving | Minh Tri Huynh | PDF | N/A |
| Understanding Aggregations of Proper Learners in Multiclass Classification | Julian Asilis | PDF | N/A |
| Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets | Andoni Cortés | PDF | N/A |
| Designing AI Personalities: Enhancing Human-Agent Interaction Through Thoughtful Persona Design | Nima Zargham | PDF | N/A |
| Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model | Keito Sasagawa | PDF | N/A |