| 通过面向对象的奖励弥合人机灵巧性差距 |
Irmak Guzey |
PDF |
N/A |
Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards |
| ReferEverything:迈向视频中我们能谈论的一切事物的分割 |
Anurag Bagchi |
PDF |
N/A |
ReferEverything: Towards Segmenting Everything We Can Speak of in Videos |
| 在最小假设下,扩散模型的可证明加速 |
Gen Li |
PDF |
N/A |
Provable acceleration for diffusion models under minimal assumptions |
| RelationBooth:面向关系感知的定制化对象生成 |
Qingyu Shi |
PDF |
N/A |
RelationBooth: Towards Relation-Aware Customized Object Generation |
| 一种用于同时进行分割、分类和呼叫者识别任务的神经网络转换器框架,针对狨猴发声 |
Bin Wu |
PDF |
N/A |
A Neural Transformer Framework for Simultaneous Tasks of Segmentation, Classification, and Caller Identification of Marmoset Vocalization |
| OpenSatMap:一种用于大规模地图构建的细粒度高分辨率卫星数据集 |
Hongbo Zhao |
PDF |
N/A |
OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction |
| SlowFast-VGen:动作驱动的长视频生成的慢速-快速学习 |
Yining Hong |
PDF |
N/A |
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation |
| 使用动态图神经网络的条件保证金追缴预测 |
Matteo Citterio |
PDF |
N/A |
Conditional Forecasting of Margin Calls using Dynamic Graph Neural Networks |
| 多学生扩散蒸馏用于更优的一步生成器 |
Yanke Song |
PDF |
N/A |
Multi-student Diffusion Distillation for Better One-step Generators |
| 非质心聚类中的比例公平性 |
Ioannis Caragiannis |
PDF |
N/A |
Proportional Fairness in Non-Centroid Clustering |
| 一个用于序列预测中校准不确定性估计的蒙特卡罗框架 |
Qidong Yang |
PDF |
N/A |
A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction |
| TOMATO:评估多模态基础模型中的视觉时间推理能力 |
Ziyao Shangguan |
PDF |
N/A |
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models |
| EMMA:端到端多模态自动驾驶模型 |
Jyh-Jing Hwang |
PDF |
N/A |
EMMA: End-to-End Multimodal Model for Autonomous Driving |
| 10万美元或100天:使用学术资源进行预训练时的权衡 |
Apoorv Khandelwal |
PDF |
N/A |
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources |
| 使用大型模型进行物体相对模仿学习的要点抽象 |
Xiaolin Fang |
PDF |
N/A |
Keypoint Abstraction using Large Models for Object-Relative Imitation Learning |
| 评估大型语言模型网络代理的文化和社会意识 |
Haoyi Qiu |
PDF |
N/A |
Evaluating Cultural and Social Awareness of LLM Web Agents |
| bit2bit:通过自监督光子预测实现1位量子视频重建 |
Yehe Liu |
PDF |
N/A |
bit2bit: 1-bit quanta video reconstruction via self-supervised photon prediction |
| PointRecon:通过基于射线的2D-3D匹配实现在线基于点的3D重建 |
Chen Ziwen |
PDF |
N/A |
PointRecon: Online Point-based 3D Reconstruction via Ray-based 2D-3D Matching |
| GPU上非常快速的贝叶斯加性回归树 |
Giacomo Petrillo |
PDF |
N/A |
Very fast Bayesian Additive Regression Trees on GPU |
| 请少一些空谈,多一些实际行动:在3D具身体验环境中探究大型语言模型的物理常识 |
Matteo G. Mecattaf |
PDF |
N/A |
A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment |
| 使用基于模拟的推理进行全波形地震震源反演 |
A. A. Saoulis |
PDF |
N/A |
Full-waveform earthquake source inversion using simulation-based inference |
| 情感:基于情境学习的类人机器人表达性动作序列生成 |
Peide Huang |
PDF |
N/A |
EMOTION: Expressive Motion Sequence Generation for Humanoid Robots with In-Context Learning |
| 要删除的属性:通过数据模型匹配实现机器遗忘 |
Kristian Georgiev |
PDF |
N/A |
Attribute-to-Delete: Machine Unlearning via Datamodel Matching |
| LGU-SLAM:基于可变形相关采样的可学习高斯不确定性匹配的深度视觉SLAM |
Yucheng Huang |
PDF |
N/A |
LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM |
| 对齐音频-视觉联合表示与一个代理工作流程 |
Shentong Mo |
PDF |
N/A |
Aligning Audio-Visual Joint Representations with an Agentic Workflow |
| 平均场变压器模型中亚稳态聚类的出现 |
Giuseppe Bruno |
PDF |
N/A |
Emergence of meta-stable clustering in mean-field transformer models |
| (FL)$^2$:克服联邦半监督学习中的少量标签 |
Seungjoo Lee |
PDF |
N/A |
(FL)$^2$: Overcoming Few Labels in Federated Semi-Supervised Learning |
| COMAL:一种用于将大型语言模型与通用偏好对齐的收敛元算法 |
Yixin Liu |
PDF |
N/A |
COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences |
| 时间序列基础模型的部分通道依赖与通道掩码 |
Seunghan Lee |
PDF |
N/A |
Partial Channel Dependence with Channel Masks for Time Series Foundation Models |
| DiaMond:利用多模态视觉变换器通过MRI和PET进行痴呆症诊断 |
Yitong Li |
PDF |
N/A |
DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET |
| OS-ATLAS:一种面向通用图形用户界面代理的基础行动模型 |
Zhiyong Wu |
PDF |
N/A |
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents |
| 通过尝试进行接地:结合强化学习增强检索的大型语言模型 |
Sheryl Hsu |
PDF |
N/A |
Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval |
| ELMGS:通过压缩技术提升3D高斯喷洒的内存与计算可扩展性 |
Muhammad Salman Ali |
PDF |
N/A |
ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting |
| kNN图拉普拉斯算子的收敛速度提升 |
Yixuan Tan |
PDF |
N/A |
Improved convergence rate of kNN graph Laplacians |
| Kinetix:通过开放式的基于物理的控制任务来研究通用代理的训练 |
Michael Matthews |
PDF |
N/A |
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks |
| HEX:自监督算法中的分层涌现利用 |
Kiran Kokilepersaud |
PDF |
N/A |
HEX: Hierarchical Emergence Exploitation in Self-Supervised Algorithms |
| 用于4D心脏电影MRI分割的连续时空记忆网络 |
Meng Ye |
PDF |
N/A |
Continuous Spatio-Temporal Memory Networks for 4D Cardiac Cine MRI Segmentation |
| 主题建模的可靠性 |
Kayla Schroeder |
PDF |
N/A |
Reliability of Topic Modeling |
| ProTransformer:通过即插即用范式增强Transformer的鲁棒性 |
Zhichao Hou |
PDF |
N/A |
ProTransformer: Robustify Transformers via Plug-and-Play Paradigm |
| ReasoningRec:通过LLM推理连接个性化推荐与人类可理解的解释 |
Millennium Bismay |
PDF |
N/A |
ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning |
| 等变性在大规模情况下是否重要? |
Johann Brehmer |
PDF |
N/A |
Does equivariance matter at scale? |
| 使用增强等变自举法的快速重建方法的不确定性量化:应用于射电干涉测量 |
Mostafa Cherif |
PDF |
N/A |
Uncertainty quantification for fast reconstruction methods using augmented equivariant bootstrap: Application to radio interferometry |
| 用于约束采样的功能梯度流 |
Shiyue Zhang |
PDF |
N/A |
Functional Gradient Flows for Constrained Sampling |
| 尽管存在低秩偏差,神经崩溃的持久性:通过无约束特征的分析视角 |
Connall Garrod |
PDF |
N/A |
The Persistence of Neural Collapse Despite Low-Rank Bias: An Analytic Perspective Through Unconstrained Features |
| TokenFormer: 重新思考使用标记化模型参数的Transformer扩展 |
Haiyang Wang |
PDF |
N/A |
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters |
| SciPIP:基于大型语言模型的科学论文创意提案工具 |
Wenxiao Wang |
PDF |
N/A |
SciPIP: An LLM-based Scientific Paper Idea Proposer |
| FlexTSF:一种适用于具有可变规律性的时间序列的通用预测模型 |
Jingge Xiao |
PDF |
N/A |
FlexTSF: A Universal Forecasting Model for Time Series with Variable Regularities |
| 傅里叶振幅与相关性损失:超越使用L2损失进行精准降水预报 |
Chiu-Wai Yan |
PDF |
N/A |
Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting |
| 方向异常检测 |
Oliver Urs Lenz |
PDF |
N/A |
Directional anomaly detection |
| 视觉预测器:利用神经符号谓词学习抽象世界模型以进行机器人规划 |
Yichao Liang |
PDF |
N/A |
VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning |
| QWO:加速基于排列的LiGAMs因果发现 |
Mohammad Shahverdikondori |
PDF |
N/A |
QWO: Speeding Up Permutation-Based Causal Discovery in LiGAMs |
| 嵌套残差网络:一种基于视觉的用于检测插入式伽马探头探测区域的方法 |
Songyu Xu |
PDF |
N/A |
Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe |
| 经典神经网络何时能表示量子态? |
Tai-Hsuan Yang |
PDF |
N/A |
When can classical neural networks represent quantum states? |
| HiBO:通过自适应搜索空间划分实现的分层贝叶斯优化 |
Wenxuan Li |
PDF |
N/A |
HiBO: Hierarchical Bayesian Optimization via Adaptive Search Space Partitioning |
| FoLDTree:一种基于ULDA的决策树框架,用于高效斜分和特征选择 |
Siyu Wang |
PDF |
N/A |
FoLDTree: A ULDA-Based Decision Tree Framework for Efficient Oblique Splits and Feature Selection |
| DNA中多重子态相的统计力学 |
Midas Segers |
PDF |
N/A |
Statistical Mechanics of Multiplectoneme Phases in DNA |
| 公共领域12M:具有新颖治理机制的高度美学图文数据集 |
Jordan Meyer |
PDF |
N/A |
Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms |
| 好、坏与丑:AI质量披露在谎言检测中的作用 |
Haimanti Bhattacharya |
PDF |
N/A |
The Good, the Bad, and the Ugly: The Role of AI Quality Disclosure in Lie Detection |
| FAIR-TAT:利用目标对抗训练提升模型公平性 |
Tejaswini Medi |
PDF |
N/A |
FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training |
| 公平分配与市场价值 |
Siddharth Barman |
PDF |
N/A |
Fair Division with Market Values |
| 众包词汇多样性 |
Hadi Khalilia |
PDF |
N/A |
Crowdsourcing Lexical Diversity |
| 回顾MAE预训练在三维医学图像分割中的应用 |
Tassilo Wald |
PDF |
N/A |
Revisiting MAE pre-training for 3D medical image segmentation |
| 利用元数据对心脏图像进行组合分割 |
Abbas Khan |
PDF |
N/A |
Compositional Segmentation of Cardiac Images Leveraging Metadata |
| 周期性客户端参与和异质数据下的联邦学习:一种新的通信高效算法及分析 |
Michael Crawshaw |
PDF |
N/A |
Federated Learning under Periodic Client Participation and Heterogeneous Data: A New Communication-Efficient Algorithm and Analysis |
| 为什么预训练中的细粒度标签有助于泛化? |
Guan Zhe Hong |
PDF |
N/A |
Why Fine-grained Labels in Pretraining Benefit Generalization? |
| 现代Hopfield模型的可证明最优记忆容量:作为球面编码的Transformer兼容密集联想记忆 |
Jerry Yao-Chieh Hu |
PDF |
N/A |
Provably Optimal Memory Capacity for Modern Hopfield Models: Transformer-Compatible Dense Associative Memories as Spherical Codes |
| 关于大型语言模型在逻辑推理中的记忆能力 |
Chulin Xie |
PDF |
N/A |
On Memorization of Large Language Models in Logical Reasoning |
| 训练语言模型区分相似细节:使用小型对抗训练集 |
Chris Achard |
PDF |
N/A |
Teaching a Language Model to Distinguish Between Similar Details using a Small Adversarial Training Set |
| 统一的三元组级幻觉评估方法用于大规模视觉语言模型 |
Junjie Wu |
PDF |
N/A |
Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models |
| 为什么是梯度子空间?识别并缓解联邦微调大型语言模型中LoRA的瓶颈 |
Navyansh Mahla |
PDF |
N/A |
Why Gradient Subspace? Identifying and Mitigating LoRA's Bottlenecks in Federated Fine-Tuning of Large Language Models |
| NASM:神经各向异性表面网格化 |
Hongbo Li |
PDF |
N/A |
NASM: Neural Anisotropic Surface Meshing |
| 可控游戏关卡生成:评估负样本在GAN模型中的影响 |
Mahsa Bazzaz |
PDF |
N/A |
Controllable Game Level Generation: Assessing the Effect of Negative Examples in GAN Models |
| 将语义相似性与空间对齐解耦用于神经网络 |
Tassilo Wald |
PDF |
N/A |
Decoupling Semantic Similarity from Spatial Alignment for Neural Networks |
| 基于图像的自动识别与一致性分类:通过量化形状分析和空间位置识别实现火灾模式的识别 |
Pengkun Liu |
PDF |
N/A |
Automated Image-Based Identification and Consistent Classification of Fire Patterns with Quantitative Shape Analysis and Spatial Location Identification |
| 通过可解释人工智能进行游戏关卡修复 |
Mahsa Bazzaz |
PDF |
N/A |
Guided Game Level Repair via Explainable AI |
| 大语言模型上下文学习中演示选择算法的比较分析 |
Dong Shu |
PDF |
N/A |
Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning |
| ECCV 2024 ROAD++挑战赛@ROAD++原子活动识别2024的首名解决方案 |
Ruyang Li |
PDF |
N/A |
First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Atomic Activity Recognition 2024 |
| CausalDiff:通过扩散模型实现对抗防御的因果启发式解耦 |
Mingkun Zhang |
PDF |
N/A |
CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense |
| CORAL:多轮对话检索增强生成的基准测试 |
Yiruo Cheng |
PDF |
N/A |
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation |
| PIP-MM:通过现有多模态大语言模型结构预先整合提示信息到视觉编码中 |
Tianxiang Wu |
PDF |
N/A |
PIP-MM: Pre-Integrating Prompt Information into Visual Encoding via Existing MLLM Structures |
| 密度估计的统计-计算权衡 |
Anders Aamand |
PDF |
N/A |
Statistical-Computational Trade-offs for Density Estimation |
| 从炒作到现实:在6G网络中部署深度强化学习的未来之路 |
Haiyuan Li |
PDF |
N/A |
From Hype to Reality: The Road Ahead of Deploying DRL in 6G Networks |
| S3PT:场景语义和结构引导的聚类,以提升自动驾驶的自监督预训练 |
Maciej K. Wozniak |
PDF |
N/A |
S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving |
| 通过分类放射科医生确诊的病例,进行双参数磁共振成像的AI辅助前列腺癌检测和定位 |
Xiangcen Wu |
PDF |
N/A |
AI-assisted prostate cancer detection and localisation on biparametric MR by classifying radiologist-positives |
| 基于事件的数字存内计算加速器,具备灵活的操作数分辨率和逐层的权重/输出平稳性 |
Nicolas Chauvaux |
PDF |
N/A |
An Event-Based Digital Compute-In-Memory Accelerator with Flexible Operand Resolution and Layer-Wise Weight/Output Stationarity |
| BUZZ:采用蜂巢结构的分段重击者稀疏KV缓存,用于高效LLM推理 |
Junqi Zhao |
PDF |
N/A |
BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference |
| ECCV 2024 ROAD++挑战赛@ROAD++时空代理检测2024的首名解决方案 |
Tengfei Zhang |
PDF |
N/A |
First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024 |
| 多编程语言沙盒,适用于大型语言模型 |
Shihan Dou |
PDF |
N/A |
Multi-Programming Language Sandbox for LLMs |
| RSNet:一种用于多尺度遥感目标检测的轻量级框架 |
Hongyu Chen |
PDF |
N/A |
RSNet: A Light Framework for The Detection of Multi-scale Remote Sensing Targets |
| CNN可解释性:针对自监督模型的多向量塔克显著性图 |
Aymene Mohammed Bouayed |
PDF |
N/A |
CNN Explainability with Multivector Tucker Saliency Maps for Self-Supervised Models |
| 大型语言模型在软件工程团队项目中的整合:角色、影响及计算教育中人工智能工具的教学设计空间 |
Ahmed Kharrufa |
PDF |
N/A |
LLMs Integration in Software Engineering Team Projects: Roles, Impact, and a Pedagogical Design Space for AI Tools in Computing Education |
| 不仅仅是关注,而是“种植”它:在极端多标签文本分类中转移L2R模型以微调注意力 |
Debjyoti Saharoy |
PDF |
N/A |
Don't Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification |
| 通过传输激活来控制语言和扩散模型 |
Pau Rodriguez |
PDF |
N/A |
Controlling Language and Diffusion Models by Transporting Activations |
| 合法的无真实标签指标用于深度不确定性分类评分 |
Arthur Pignet |
PDF |
N/A |
Legitimate ground-truth-free metrics for deep uncertainty classification scoring |
| 理解上下文学习与权重学习的区别 |
Bryan Chan |
PDF |
N/A |
Toward Understanding In-context vs. In-weight Learning |
| 情感RAG:通过情感检索增强角色扮演代理 |
Le Huang |
PDF |
N/A |
Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval |
| 神经注意力场:三维场景中新兴的点相关性用于一次性灵巧抓取 |
Qianxu Wang |
PDF |
N/A |
Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping |
| 离线强化学习和序列建模在下行链路自适应中的应用 |
Samuele Peri |
PDF |
N/A |
Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation |
| 风险感知的非平稳多臂赌博机问题的规划与学习 |
Nima Akbarzadeh |
PDF |
N/A |
Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem |
| 大型语言模型反馈驱动的决策代理在线内在奖励机制 |
Qinqing Zheng |
PDF |
N/A |
Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback |
| DexGraspNet 2.0:在大规模合成杂乱场景中学习生成灵巧抓握 |
Jialiang Zhang |
PDF |
N/A |
DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes |
| 不精确概率的评分规则与校准 |
Christian Fröhlich |
PDF |
N/A |
Scoring Rules and Calibration for Imprecise Probabilities |
| \textsc{Long$^2$RAG}:评估长上下文与长表单检索增强生成,重点关注关键点回忆 |
Zehan Qi |
PDF |
N/A |
\textsc{Long$^2$RAG}: Evaluating Long-Context \& Long-Form Retrieval-Augmented Generation with Key Point Recall |
| 服务机器人任务规划与执行中提示工程技术的比较 |
Jonas Bode |
PDF |
N/A |
A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics |
| 文本中量子级联激光器特性的语义丰富——一种知识图谱生成方法 |
Deperias Kerre |
PDF |
N/A |
Semantic Enrichment of the Quantum Cascade Laser Properties in Text- A Knowledge Graph Generation Approach |
| VisAidMath:视觉辅助数学推理基准测试 |
Jingkun Ma |
PDF |
N/A |
VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning |
| 具有后分配服务的动态匹配及其在难民安置中的应用 |
Kirk Bansak |
PDF |
N/A |
Dynamic Matching with Post-allocation Service and its Application to Refugee Resettlement |
| V2X辅助的分布式计算与控制框架,适用于匝道汇流场景下的网联与自动驾驶车辆 |
Qiong Wu |
PDF |
N/A |
V2X-Assisted Distributed Computing and Control Framework for Connected and Automated Vehicles under Ramp Merging Scenario |
| 用于时间序列分析的高阶跨结构嵌入模型 |
Guancen Lin |
PDF |
N/A |
Higher-order Cross-structural Embedding Model for Time Series Analysis |
| 双优化自适应图重构用于多视图图聚类 |
Zichen Wen |
PDF |
N/A |
Dual-Optimized Adaptive Graph Reconstruction for Multi-View Graph Clustering |
| PDSR:高效无人机部署,实现快速精准的灾后搜救 |
Alaa Awad Abdellatif |
PDF |
N/A |
PDSR: Efficient UAV Deployment for Swift and Accurate Post-Disaster Search and Rescue |
| DisenTS:多变量时间序列预测中的解耦通道演化模式建模 |
Zhiding Liu |
PDF |
N/A |
DisenTS: Disentangled Channel Evolving Pattern Modeling for Multivariate Time Series Forecasting |
| LumiSculpt:一种用于视频生成的连续照明控制网络 |
Yuxin Zhang |
PDF |
N/A |
LumiSculpt: A Consistency Lighting Control Network for Video Generation |
| 基于扩散的流形对齐的图集成 |
Jake S. Rhodes |
PDF |
N/A |
Graph Integration for Diffusion-Based Manifold Alignment |
| Bonafide 在 LegalLens 2024 共享任务中:使用轻量级 DeBERTa 基础编码器进行法律违规检测与解决 |
Shikha Bordia |
PDF |
N/A |
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and Resolution |
| 使用扩散模型进行私密合成文本生成 |
Sebastian Ochs |
PDF |
N/A |
Private Synthetic Text Generation with Diffusion Models |
| 基于动态阈值的两层在线无监督异常检测器 |
Yachao Yuan |
PDF |
N/A |
Dynamic Threshold-based Two-layer Online Unsupervised Anomaly Detector |
| 可扩展的高效用模式采样 |
Lamine Diop |
PDF |
N/A |
Scalable Sampling for High Utility Patterns |
| 纵向联邦学习安全算法研究:以安全逻辑回归为例 |
Huan-Chih Wang |
PDF |
N/A |
A Study of Secure Algorithms for Vertical Federated Learning: Take Secure Logistic Regression as an Example |
| EnsIR:一种通过高斯混合模型实现图像恢复的集成算法 |
Shangquan Sun |
PDF |
N/A |
EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models |
| 基于源可靠性估计的检索增强生成 |
Jeongyeon Hwang |
PDF |
N/A |
Retrieval-Augmented Generation with Estimation of Source Reliability |
| 通过Householder变换实现预训练视觉Transformer的高效适应 |
Wei Dong |
PDF |
N/A |
Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation |
| SpiroActive:用于肺功能测量的高效数据采集的主动学习 |
Ankita Kumari Jain |
PDF |
N/A |
SpiroActive: Active Learning for Efficient Data Acquisition for Spirometry |
| MutaPLM:用于突变解释与工程的蛋白质语言建模 |
Yizhen Luo |
PDF |
N/A |
MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering |
| ELBOing Stein:使用Stein混合推断的变分贝叶斯 |
Ola Rønning |
PDF |
N/A |
ELBOing Stein: Variational Bayes with Stein Mixture Inference |
| KALAM:用于自动化模拟计算系统高层合成的工具包 |
Ankita Nandi |
PDF |
N/A |
KALAM: toolKit for Automating high-Level synthesis of Analog computing systeMs |
| 专注于此,而非彼!通过自适应特征规范引导大型语言模型 |
Tom A. Lamb |
PDF |
N/A |
Focus On This, Not That! Steering LLMs With Adaptive Feature Specification |
| AdaptiveISP:学习用于目标检测的自适应图像信号处理器 |
Yujin Wang |
PDF |
N/A |
AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection |
| DiffLight:一种基于部分奖励条件的扩散模型,用于处理缺失数据的交通信号控制 |
Hanyang Chen |
PDF |
N/A |
DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data |
| 审慎采用自然语言处理技术促进公民参与:理解政策制定者之间的差异 |
Jose A. Guridi |
PDF |
N/A |
Thoughtful Adoption of NLP for Civic Participation: Understanding Differences Among Policymakers |
| 将NeRFs引入潜在空间:逆向图形自编码器 |
Antoine Schnepf |
PDF |
N/A |
Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder |
| 多智能体大型语言模型用于对话任务解决 |
Jonas Becker |
PDF |
N/A |
Multi-Agent Large Language Models for Conversational Task-Solving |
| 一种基于个体身份驱动的动物重识别框架 |
Yihao Wu |
PDF |
N/A |
An Individual Identity-Driven Framework for Animal Re-Identification |
| BIS:面向商业智能场景的NL2SQL服务评估基准 |
Bora Caglayan |
PDF |
N/A |
BIS: NL2SQL Service Evaluation Benchmark for Business Intelligence Scenarios |
| 通过大规模真实世界数据集与记忆增强型Transformer实现的高保真文档污渍去除 |
Mingxian Li |
PDF |
N/A |
High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer |
| 无模拟训练:在配对数据上训练神经ODE |
Semin Kim |
PDF |
N/A |
Simulation-Free Training of Neural ODEs on Paired Data |
| 可解释行为克隆:通过示范学习教授大型语言模型代理 |
Yanchu Guan |
PDF |
N/A |
Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration |
| 基于模块化状态的斯塔克尔伯格博弈在分布式制造系统中的自我优化 |
Steve Yuwono |
PDF |
N/A |
Self-optimization in distributed manufacturing systems using Modular State-based Stackelberg Games |
| CopRA:一种渐进式LoRA训练策略 |
Zhan Zhuang |
PDF |
N/A |
CopRA: A Progressive LoRA Training Strategy |
| UniRiT:迈向少样本非刚性点云配准 |
Geng Li |
PDF |
N/A |
UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration |
| 联邦UCBVI:异构代理间的通信高效联邦后悔最小化 |
Safwan Labbi |
PDF |
N/A |
Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents |
| 从咿呀学语到词汇:在连续的音素流上预训练语言模型 |
Zébulon Goriely |
PDF |
N/A |
From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes |
| HelloMeme:将空间编织注意力整合到扩散模型中,以嵌入高层次和保真度丰富的条件 |
Shengkai Zhang |
PDF |
N/A |
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models |
| 部分形状匹配的虫洞损失 |
Amit Bracha |
PDF |
N/A |
Wormhole Loss for Partial Shape Matching |
| YOLOv11 用于车辆检测:在智能交通系统中的进展、性能与应用 |
Mujadded Al Rabbani Alif |
PDF |
N/A |
YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems |
| 结合精神分析与计算机科学:一项关于情绪与拉康话语之间关系的实证研究 |
Minas Gadalla |
PDF |
N/A |
Combining psychoanalysis and computer science: an empirical study of the relationship between emotions and the Lacanian discourses |
| VPO:利用偏好优化中的票数 |
Jae Hyeon Cho |
PDF |
N/A |
VPO: Leveraging the Number of Votes in Preference Optimization |
| 通过单一向量实现视觉-语言模型的有效且高效的对抗检测 |
Youcheng Huang |
PDF |
N/A |
Effective and Efficient Adversarial Detection for Vision-Language Models via A Single Vector |
| 通过条件$f$-信息进行泛化界限分析 |
Ziqiao Wang |
PDF |
N/A |
Generalization Bounds via Conditional $f$-Information |
| 少即是多:采用认知上合理的课程学习策略预训练跨语言小规模语言模型 |
Suchir Salhan |
PDF |
N/A |
Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies |
| 从专家混合模型中窃取用户提示 |
Itay Yona |
PDF |
N/A |
Stealing User Prompts from Mixture of Experts |
| 自适应范式协同:跨范式目标能否提升长尾学习效果? |
Haowen Xiao |
PDF |
N/A |
Adaptive Paradigm Synergy: Can a Cross-Paradigm Objective Enhance Long-Tailed Learning? |
| SFA-UNet:更多关注红外小目标分割中的多尺度对比与上下文信息 |
Imad Ali Shah |
PDF |
N/A |
SFA-UNet: More Attention to Multi-Scale Contrast and Contextual Information in Infrared Small Object Segmentation |
| 通过对比解释在检索增强型语言模型中引发批判性推理 |
Leonardo Ranaldi |
PDF |
N/A |
Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations |
| 泊松回归中p次方根链接的数据子采样 |
Han Cheng Lie |
PDF |
N/A |
Data subsampling for Poisson regression with pth-root-link |
| 粒子-量热计相互作用的量子辅助深度生成代理模型 |
J. Quetzalcoatl Toledo-Marin |
PDF |
N/A |
Conditioned quantum-assisted deep generative surrogate for particle-calorimeter interactions |
| 面向人口规模的DIXON MRI睾丸体积分割 |
Jan Ernsting |
PDF |
N/A |
Towards Population Scale Testis Volume Segmentation in DIXON MRI |
| 修剪与重绘:适用于任意比例的内容感知图像重定位 |
Feihong Shen |
PDF |
N/A |
Prune and Repaint: Content-Aware Image Retargeting for any Ratio |
| AtGCN:一种用于共济失调步态检测的图卷积网络 |
Karan Bania |
PDF |
N/A |
AtGCN: A Graph Convolutional Network For Ataxic Gait Detection |
| 达芬奇:一种用于约束CAD草图推理的单阶段架构 |
Ahmet Serdar Karadeniz |
PDF |
N/A |
DAVINCI: A Single-Stage Architecture for Constrained CAD Sketch Inference |
| 机器学习中的超参数优化 |
Luca Franceschi |
PDF |
N/A |
Hyperparameter Optimization in Machine Learning |
| 极化图像数据集,包含机械生成的水面波与由波浪计线性阵列记录的表面高程记录耦合 |
Noam Ginio |
PDF |
N/A |
Dataset of polarimetric images of mechanically generated water surface waves coupled with surface elevation records by wave gauges linear array |
| 生成式大型语言模型的数据无能 |
Søren Vejlgaard Holm |
PDF |
N/A |
Danoliteracy of Generative, Large Language Models |
| SFDFusion:一种用于红外与可见光图像融合的高效空间-频率域融合网络 |
Kun Hu |
PDF |
N/A |
SFDFusion: An Efficient Spatial-Frequency Domain Fusion Network for Infrared and Visible Image Fusion |
| 劫持RAG:针对检索增强型大型语言模型的劫持攻击 |
Yucheng Zhang |
PDF |
N/A |
HijackRAG: Hijacking Attacks against Retrieval-Augmented Large Language Models |
| 潜在扩散,隐式放大:高效的连续尺度遥感图像超分辨率 |
Hanlin Wu |
PDF |
N/A |
Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images |
| 情境场景图:结构化以人为中心情境理解 |
Chinthani Sugandhika |
PDF |
N/A |
Situational Scene Graph for Structured Human-centric Situation Understanding |
| 大型语言模型在瑞典语词义消歧方面表现如何? |
Richard Johansson |
PDF |
N/A |
How Well Do Large Language Models Disambiguate Swedish Words? |
| EvoCodeBench:一个不断发展的代码生成基准,具有特定领域的评估 |
Jia Li |
PDF |
N/A |
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations |
| 大规模随机配对交互网络系统的不变性原理基础上的集中性结果 |
Giacomo Como |
PDF |
N/A |
An invariance principle based concentration result for large-scale stochastic pairwise interaction network systems |
| 无极线约束的三维高斯溅射技术在通用的新视角合成中的应用 |
Zhiyuan Min |
PDF |
N/A |
Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis |
| 面向具有异质性客户端的鲁棒且高效的联邦低秩适应 |
Jabin Koo |
PDF |
N/A |
Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients |
| $π^2/6$ 路径在避免模型崩溃中的普遍性 |
Apratim Dey |
PDF |
N/A |
Universality of the $π^2/6$ Pathway in Avoiding Model Collapse |
| 使用Vision Mamba的自适应多尺度文档二值化 |
Mohd. Azfar |
PDF |
N/A |
Adaptive Multi Scale Document Binarisation Using Vision Mamba |
| 在大型语言模型(LLMs)中增强因果关系的行为序列建模,以实现个性化推荐 |
Yang Zhang |
PDF |
N/A |
Causality-Enhanced Behavior Sequence Modeling in LLMs for Personalized Recommendation |
| MILP-StuDio:通过块结构分解生成MILP实例 |
Haoyang Liu |
PDF |
N/A |
MILP-StuDio: MILP Instance Generation via Block Structure Decomposition |
| 神经波束形成在鲁棒语音去混响和降噪中的运行时适应 |
Yoto Fujita |
PDF |
N/A |
Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising |
| DOA-Aware视听自监督学习用于声音事件定位与检测 |
Yoto Fujita |
PDF |
N/A |
DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection |
| 小波脉冲累积用于湍流缓解 |
Jerome Gilles |
PDF |
N/A |
Wavelet Burst Accumulation for turbulence mitigation |
| 机器学习非绝热动力学:利用态相互作用态平均自旋限制系综参考Kohn-Sham方法消除非绝热耦合的相位自由度 |
Sung Wook Moon |
PDF |
N/A |
Machine Learning Nonadiabatic Dynamics: Eliminating Phase Freedom of Nonadiabatic Couplings with the State-Intraction State-Averaged Spin-Restricted Ensemble-Referenced Kohn-Sham Approach |
| 使用约束学习求解微分方程 |
Viggo Moro |
PDF |
N/A |
Solving Differential Equations with Constrained Learning |
| 开放湍流图像集(OTIS) |
Nicholas B. Ferrante |
PDF |
N/A |
Open Turbulent Image Set (OTIS) |
| 用于序列推荐中层次偏好建模的双重对比变换器 |
Chengkai Huang |
PDF |
N/A |
Dual Contrastive Transformer for Hierarchical Preference Modeling in Sequential Recommendation |
| 元学习中尾部任务风险最小化的理论研究与实践改进 |
Yiqin Lv |
PDF |
N/A |
Theoretical Investigations and Practical Enhancements on Tail Task Risk Minimization in Meta Learning |
| 对比学习与对抗性解耦合在面向任务的语义通信中的隐私保护 |
Omar Erak |
PDF |
N/A |
Contrastive Learning and Adversarial Disentanglement for Privacy-Preserving Task-Oriented Semantic Communications |
| MALoRA:用于增强多任务学习的非对称低秩适应混合方法 |
Xujia Wang |
PDF |
N/A |
MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning |
| Bregman算法实现Meyer的$G-$范数用于卡通+纹理分解 |
Jerome Gilles |
PDF |
N/A |
Bregman implementation of Meyer's $G-$norm for cartoon + textures decomposition |
| 扩散模型胜过自回归模型:文本到图像模型中组合生成的评估 |
Arash Marioriyad |
PDF |
N/A |
Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models |
| 展开目标检测与状态空间模型 |
Luca Jiang-Tao Yu |
PDF |
N/A |
Unfolding Target Detection with State Space Model |
| 基于随机排列集的信息源可靠性评估 |
Juntao Xu |
PDF |
N/A |
Reliability Assessment of Information Sources Based on Random Permutation Set |
| FuseAnyPart:通过多张参考图像实现扩散驱动的面部部位交换 |
Zheng Yu |
PDF |
N/A |
FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference Images |
| InjecGuard:评估并缓解提示注入防护模型中的过度防御 |
Hao Li |
PDF |
N/A |
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models |
| 面向目标的聊天机器人对话状态跟踪中的本体论之外 |
Sejin Lee |
PDF |
N/A |
Beyond Ontology in Dialogue State Tracking for Goal-Oriented Chatbot |
| 自动驾驶赛车:深度强化学习的应用 |
Florentiana Yuwono |
PDF |
N/A |
Self-Driving Car Racing: Application of Deep Reinforcement Learning |
| 使用核嵌入进行因果推断的概述 |
Dino Sejdinovic |
PDF |
N/A |
An Overview of Causal Inference using Kernel Embeddings |
| SoftCTRL:用于自动驾驶的Transformer强化学习的软保守KL控制 |
Minh Tri Huynh |
PDF |
N/A |
SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving |
| 理解多类分类中适当学习者的聚合 |
Julian Asilis |
PDF |
N/A |
Understanding Aggregations of Proper Learners in Multiclass Classification |
| 跨域数据集上分类器训练的合成数据分析 |
Andoni Cortés |
PDF |
N/A |
Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets |
| 设计人工智能个性:通过深思熟虑的角色设计增强人机交互 |
Nima Zargham |
PDF |
N/A |
Designing AI Personalities: Enhancing Human-Agent Interaction Through Thoughtful Persona Design |
| 从零开始构建多模态数据集,以实现日本视觉语言模型的快速开发 |
Keito Sasagawa |
PDF |
N/A |
Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model |