| 通过相关性追踪实现鲁棒高斯过程 |
Sebastian Ament |
PDF |
N/A |
Robust Gaussian Processes via Relevance Pursuit |
| URAvatar:通用可重新照明的高斯编解码化身 |
Junxuan Li |
PDF |
N/A |
URAvatar: Universal Relightable Gaussian Codec Avatars |
| 自我模仿:通过以自我为中心的视频扩展模仿学习 |
Simar Kareer |
PDF |
N/A |
EgoMimic: Scaling Imitation Learning via Egocentric Video |
| 通过分解编码和条件化增强文本到视频生成中的运动效果 |
Penghui Ruan |
PDF |
N/A |
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning |
| 通过几何扩散桥连接几何状态 |
Shengjie Luo |
PDF |
N/A |
Bridging Geometric States via Geometric Diffusion Bridge |
| 教授具身强化学习代理:语言使用的信息性和多样性 |
Jiajun Xi |
PDF |
N/A |
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use |
| CaAdam:使用感知连接方法改进Adam优化器 |
Remi Genet |
PDF |
N/A |
CaAdam: Improving Adam optimizer using connection aware methods |
| ARQ:一种用于精确且可验证鲁棒深度神经网络的混合精度量化框架 |
Yuchen Yang |
PDF |
N/A |
ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs |
| 无需自然视频即可学习视频表示 |
Xueyang Yu |
PDF |
N/A |
Learning Video Representations without Natural Videos |
| DELTA:适用于任何视频的密集高效长程3D追踪 |
Tuan Duc Ngo |
PDF |
N/A |
DELTA: Dense Efficient Long-range 3D Tracking for any video |
| TabM:通过参数高效集成推进表格深度学习 |
Yury Gorishniy |
PDF |
N/A |
TabM: Advancing Tabular Deep Learning with Parameter-Efficient Ensembling |
| 无姿态,无问题:令人惊讶的简单3D高斯斑点从稀疏无姿态图像中生成 |
Botao Ye |
PDF |
N/A |
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images |
| 理解深度学习中的优化与中心流 |
Jeremy M. Cohen |
PDF |
N/A |
Understanding Optimization in Deep Learning with Central Flows |
| 区域RL-RRT:集成RL-RRT路径规划与碰撞概率和区域连通性 |
AmirMohammad Tahmasbi |
PDF |
N/A |
Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone Connectivity |
| GeoSplatting:面向基于几何引导的高斯散射技术,实现基于物理的逆向渲染 |
Kai Ye |
PDF |
N/A |
GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering |
| DiffPano:利用球面对极感知扩散的可扩展且一致的文本到全景生成 |
Weicai Ye |
PDF |
N/A |
DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion |
| P-Masking:幂律掩码提升多属性控制生成 |
Mohamed Elgaar |
PDF |
N/A |
P-Masking: Power Law Masking Improves Multi-attribute Controlled Generation |
| 长度诱导的基于Transformer模型的嵌入崩溃 |
Yuqi Zhou |
PDF |
N/A |
Length-Induced Embedding Collapse in Transformer-based Models |
| 多属性语言调整用于受控释义生成 |
Mohamed Elgaar |
PDF |
N/A |
Multi-Attribute Linguistic Tuning for Controlled Paraphrase Generation |
| SelfCodeAlign:代码生成的自我对齐 |
Yuxiang Wei |
PDF |
N/A |
SelfCodeAlign: Self-Alignment for Code Generation |
| 隐藏的说客:大型语言模型的政治倾向及其对选民的影响 |
Yujin Potter |
PDF |
N/A |
Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters |
| 在过参数化和欠参数化之间追求更好的深度图像先验 |
Qiming Wu |
PDF |
N/A |
Chasing Better Deep Image Priors between Over- and Under-parameterization |
| DexMimicGen:通过模仿学习实现双手灵巧操作的自动化数据生成 |
Zhenyu Jiang |
PDF |
N/A |
DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning |
| 用于对称性机制分析的群交叉编码器 |
Liv Gorton |
PDF |
N/A |
Group Crosscoders for Mechanistic Analysis of Symmetry |
| 基于线性样条的扩展目标跟踪与分类 |
Matteo Tesori |
PDF |
N/A |
Extended Object Tracking and Classification based on Linear Splines |
| 联邦黑盒适应语义分割 |
Jay N. Paranjape |
PDF |
N/A |
Federated Black-Box Adaptation for Semantic Segmentation |
| AR-Pro:基于正式属性的异常修复反事实解释 |
Xiayan Ji |
PDF |
N/A |
AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties |
| DC-Spin:一种用于口语语言模型的说话人不变语音分词器 |
Heng-Jui Chang |
PDF |
N/A |
DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models |
| 约束反向翻译提升大型语言模型复杂指令遵循能力 |
Yunjia Qi |
PDF |
N/A |
Constraint Back-translation Improves Complex Instruction Following of Large Language Models |
| 基于微服务的分布式旅行数据集成与服务提供新型架构 |
Biman Barua |
PDF |
N/A |
Novel Architecture for Distributed Travel Data Integration and Service Provision Using Microservices |
| 可扩展性的重要性:提高神经网络原子间势能函数在化学领域中的速度和准确性 |
Eric Qu |
PDF |
N/A |
The Importance of Being Scalable: Improving the Speed and Accuracy of Neural Network Interatomic Potentials Across Chemical Domains |
| 通过被动雷达进行人体活动识别的方法 |
Christian Bresciani |
PDF |
N/A |
Approaches to human activity recognition via passive radar |
| $π_0$:一种用于通用机器人控制的视觉-语言-动作流模型 |
Kevin Black |
PDF |
N/A |
$π_0$: A Vision-Language-Action Flow Model for General Robot Control |
| 使用预训练和微调的注意力驱动神经算子进行故障后电压轨迹的保形预测 |
Amirhossein Mollaali |
PDF |
N/A |
Conformalized Prediction of Post-Fault Voltage Trajectories Using Pre-trained and Finetuned Attention-Driven Neural Operators |
| 重新定义词典中的<创意>:迈向对创意生成的增强语义理解 |
Fu Feng |
PDF |
N/A |
Redefining in Dictionary: Towards a Enhanced Semantic Understanding of Creative Generation |
| GPT还是BERT:为何不两者兼得? |
Lucas Georges Gabriel Charpentier |
PDF |
N/A |
GPT or BERT: why not both? |
| 思维空间探索者:导航与扩展思维空间以实现大型语言模型的推理 |
Jinghan Zhang |
PDF |
N/A |
Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning |
| 通过随机特征视角理解密集关联记忆 |
Benjamin Hoover |
PDF |
N/A |
Dense Associative Memory Through the Lens of Random Features |
| 缩放概念与文本引导的扩散模型 |
Chao Huang |
PDF |
N/A |
Scaling Concept With Text-Guided Diffusion Models |
| 探索用于面部属性识别的视觉语言模型:情感、种族、性别和年龄 |
Nouar AlDahoul |
PDF |
N/A |
Exploring Vision Language Models for Facial Attribute Recognition: Emotion, Race, Gender, and Age |
| 圆形数据的共形预测 |
Paulo C. Marques F. |
PDF |
N/A |
Conformal prediction of circular data |
| HoloChrome:用于减少全息近眼显示器中散斑的多色照明 |
Florian Schiffers |
PDF |
N/A |
HoloChrome: Polychromatic Illumination for Speckle Reduction in Holographic Near-Eye Displays |
| 别碰我的变音符号 |
Kyle Gorman |
PDF |
N/A |
Don't Touch My Diacritics |
| COSNet:一种在杂乱场景中使用增强边界的语义分割新网络 |
Muhammad Ali |
PDF |
N/A |
COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes |
| 分位数MDP的Q-学习:分解、性能与收敛性分析 |
Jia Lin Hau |
PDF |
N/A |
Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis |
| 多环境主题模型 |
Dominic Sobhani |
PDF |
N/A |
Multi-environment Topic Models |
| 利用大型语言模型进行代码翻译和科学计算中的软件开发 |
Akash Dhruv |
PDF |
N/A |
Leveraging Large Language Models for Code Translation and Software Development in Scientific Computing |
| 仓库级组合代码翻译与验证 |
Ali Reza Ibrahimzada |
PDF |
N/A |
Repository-Level Compositional Code Translation and Validation |
| AIDOVECL:用于眼平分类和定位的AI生成车辆外延数据集 |
Amir Kazemi |
PDF |
N/A |
AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization |
| 最近邻归一化提升了多模态检索的效果 |
Neil Chowdhury |
PDF |
N/A |
Nearest Neighbor Normalization Improves Multimodal Retrieval |
| 强化学习梯度作为在线微调决策变压器的维生素 |
Kai Yan |
PDF |
N/A |
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers |
| 光谱模型分片中的采样策略 |
Denis Korzhenkov |
PDF |
N/A |
On Sampling Strategies for Spectral Model Sharding |
| 媒人:用于模式匹配的自改进大型语言模型程序 |
Nabeel Seedat |
PDF |
N/A |
Matchmaker: Self-Improving Large Language Model Programs for Schema Matching |
| 聚类以最小化集群感知范数目标 |
Martin G. Herold |
PDF |
N/A |
Clustering to Minimize Cluster-Aware Norm Objectives |
| 基准数据存储库,助力更优基准测试 |
Rachel Longjohn |
PDF |
N/A |
Benchmark Data Repositories for Better Benchmarking |
| 在医学图像质量评估中使用HaarPSI时的参数选择 |
Clemens Karner |
PDF |
N/A |
Parameter choices in HaarPSI for IQA with medical images |
| 强化学习的渐进式安全保障措施:确保安全且与模型无关 |
Nabil Omi |
PDF |
N/A |
Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning |
| 3D-ViTac:利用视觉触觉感知学习细粒度操作 |
Binghao Huang |
PDF |
N/A |
3D-ViTac: Learning Fine-Grained Manipulation with Visuo-Tactile Sensing |
| 揭秘线性MDP与新型动态聚合框架 |
Joongkyu Lee |
PDF |
N/A |
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework |
| 时间序列基础模型的上下文微调 |
Abhimanyu Das |
PDF |
N/A |
In-Context Fine-Tuning for Time-Series Foundation Models |
| 一种高效的动态资源分配框架,用于进化双层优化 |
Dejun Xu |
PDF |
N/A |
An Efficient Dynamic Resource Allocation Framework for Evolutionary Bilevel Optimization |
| 数值规划的图学习 |
Dillon Z. Chen |
PDF |
N/A |
Graph Learning for Numeric Planning |
| 边缘化线性混合效应模型的哈密尔顿蒙特卡洛推断 |
Jinlin Lai |
PDF |
N/A |
Hamiltonian Monte Carlo Inference of Marginalized Linear Mixed-Effects Models |
| 识别极端事件的时空驱动因素 |
Mohamad Hakam Shams Eddin |
PDF |
N/A |
Identifying Spatio-Temporal Drivers of Extreme Events |
| 局部线性化:连续MDP中无悔强化学习的关键 |
Davide Maran |
PDF |
N/A |
Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs |
| 动力学相似性分析独特地捕捉了计算在递归神经网络(RNNs)中如何发展的过程。 |
Quentin Guilhot |
PDF |
N/A |
Dynamical similarity analysis uniquely captures how computations develop in RNNs |
| 理解扩散模型的泛化性需要重新思考隐藏的高斯结构 |
Xiang Li |
PDF |
N/A |
Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure |
| 识别线性因果表示中的通用机制转变 |
Tianyu Chen |
PDF |
N/A |
Identifying General Mechanism Shifts in Linear Causal Representations |
| 自然梯度和量子玻尔兹曼机的参数估计 |
Dhrumil Patel |
PDF |
N/A |
Natural gradient and parameter estimation for quantum Boltzmann machines |
| 基于深度学习模型的超声波增材制造先进预测质量评估 |
Lokendra Poudel |
PDF |
N/A |
Advanced Predictive Quality Assessment for Ultrasonic Additive Manufacturing with Deep Learning Model |
| EigenVI:基于分数的变分推断与正交函数展开 |
Diana Cai |
PDF |
N/A |
EigenVI: score-based variational inference with orthogonal function expansions |
| 只需关注即可优化风电场运行和维护 |
Iman Kazemian |
PDF |
N/A |
Attention is All You Need to Optimize Wind Farm Operations and Maintenance |
| 神经网络训练动态的可视化案例研究 |
Ambroise Odonnat |
PDF |
N/A |
A Visual Case Study of the Training Dynamics in Neural Networks |
| 沙漠骆驼与石油酋长:以阿拉伯为中心的前沿大型语言模型红队测试 |
Muhammed Saeed |
PDF |
N/A |
Desert Camels and Oil Sheikhs: Arab-Centric Red Teaming of Frontier LLMs |
| 使用HM-VGG进行深度学习:多模态图像分析的AI策略 |
Junliang Du |
PDF |
N/A |
Deep Learning with HM-VGG: AI Strategies for Multi-modal Image Analysis |
| TPC:基于扩散的人体图像动画的测试时普鲁克校准 |
Sunjae Yoon |
PDF |
N/A |
TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation |
| 通过不确定性感知的模仿学习实现状态和上下文相关的机器人操作和抓取 |
Tim R. Winter |
PDF |
N/A |
State- and context-dependent robotic manipulation and grasping via uncertainty-aware imitation learning |
| 多模态大型语言模型在历史文献手写识别中的应用 |
Lucian Li |
PDF |
N/A |
Handwriting Recognition in Historical Documents with Multimodal LLM |
| 探索未知:基于聊天的个性化探索任务协作界面 |
Yingzhe Peng |
PDF |
N/A |
Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks |
| 使用视差图在非校准系统中进行人脸反欺骗的多模态方法 |
Ariel Larey |
PDF |
N/A |
A Multi-Modal Approach for Face Anti-Spoofing in Non-Calibrated Systems using Disparity Maps |
| 选择性预测的联合训练 |
Zhaohui Li |
PDF |
N/A |
Joint Training for Selective Prediction |
| AdaFlow:利用广义亲和性控制进行异步移动数据的机遇性推理 |
Fenmin Wu |
PDF |
N/A |
AdaFlow: Opportunistic Inference on Asynchronous Mobile Data with Generalized Affinity Control |
| AndroidLab:Android自主代理的训练与系统性基准测试 |
Yifan Xu |
PDF |
N/A |
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents |
| 基于MLP的近似注意力:一种用于多元时间序列预测中基于注意力的模型的剪枝策略 |
Suhan Guo |
PDF |
N/A |
Approximate attention with MLP: a pruning strategy for attention-based model in multivariate time series forecasting |
| SFM-蛋白质:用于高级蛋白质序列表示的综合协同进化预训练 |
Liang He |
PDF |
N/A |
SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation |
| 使用知识图谱嵌入检测文本层面的智力影响 |
Lucian Li |
PDF |
N/A |
Detecting text level intellectual influence with knowledge graph embeddings |
| 言语不止于词汇:语音转文本翻译系统是否利用了韵律? |
Ioannis Tsiamas |
PDF |
N/A |
Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody? |
| 贝叶斯引导的标签映射用于视觉重编程 |
Chengyi Cai |
PDF |
N/A |
Bayesian-guided Label Mapping for Visual Reprogramming |
| 评估打包对基于机器学习的恶意软件检测和分类系统的影响 |
Daniel Gibert |
PDF |
N/A |
Assessing the Impact of Packing on Machine Learning-Based Malware Detection and Classification Systems |
| 最大熵事后经验回放 |
Douglas C. Crowder |
PDF |
N/A |
Maximum Entropy Hindsight Experience Replay |
| 揭秘合成面孔:合成数据集如何暴露真实身份 |
Hatef Otroshi Shahreza |
PDF |
N/A |
Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities |
| 带有循环引导的条件图生成扩散分支 |
Giangiacomo Mercatali |
PDF |
N/A |
Diffusion Twigs with Loop Guidance for Conditional Graph Generation |
| 重构过去:RePAIR数据集与基准测试,用于现实世界中的2D和3D拼图解决 |
Theodore Tsesmelis |
PDF |
N/A |
Re-assembling the past: The RePAIR dataset and benchmark for real world 2D and 3D puzzle solving |
| DiffPAD:基于去噪扩散的对抗性补丁净化 |
Jia Fu |
PDF |
N/A |
DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination |
| 上下文感知测试:一种基于大型语言模型的模型测试新范式 |
Paulius Rauba |
PDF |
N/A |
Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models |
| 评估经典与深度神经影像生物标志物在早期阿尔茨海默病诊断中的效能 |
Milla E. Nielsen |
PDF |
N/A |
Assessing the Efficacy of Classical and Deep Neuroimaging Biomarkers in Early Alzheimer's Disease Diagnosis |
| ImOV3D:仅从2D图像学习开放词汇点云3D物体检测 |
Timing Yang |
PDF |
N/A |
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images |
| 多模态数据受控解耦的信息准则 |
Chenyu Wang |
PDF |
N/A |
An Information Criterion for Controlled Disentanglement of Multimodal Data |
| 打破决定论:利用离散状态空间扩散模型进行序列推荐的模糊建模 |
Wenjia Xie |
PDF |
N/A |
Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model |
| Ada-MSHyper:用于时间序列预测的自适应多尺度超图Transformer |
Zongjiang Shang |
PDF |
N/A |
Ada-MSHyper: Adaptive Multi-Scale Hypergraph Transformer for Time Series Forecasting |
| 本地化、平衡与亲和力:一种更强大的多方面协作显著目标检测器,用于遥感图像 |
Yakun Xie |
PDF |
N/A |
Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images |
| JEMA:一种用于可扩展多模态对齐联合学习的联合嵌入框架 |
Joao Sousa |
PDF |
N/A |
JEMA: A Joint Embedding Framework for Scalable Co-Learning with Multimodal Alignment |
| 总结因果图中的平均控制微直接效应和平均自然微直接效应 |
Simon Ferreira |
PDF |
N/A |
Average Controlled and Average Natural Micro Direct Effects in Summary Causal Graphs |
| TrAct:使第一层的预激活可训练 |
Felix Petersen |
PDF |
N/A |
TrAct: Making First-layer Pre-Activations Trainable |
| 用于验证(量子)学习与测试的交互式证明 |
Matthias C. Caro |
PDF |
N/A |
Interactive proofs for verifying (quantum) learning and testing |
| 手术场景分割的类感知语义扩散模型图像合成 |
Yihang Zhou |
PDF |
N/A |
Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation |
| 使用单一源语言机器翻译的大规模语料库进行多语言预训练 |
Jiayi Wang |
PDF |
N/A |
Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language |
| 多分辨率语音自监督学习的实证分析 |
Theo Clark |
PDF |
N/A |
An Empirical Analysis of Speech Self-Supervised Learning at Multiple Resolutions |
| 代表性社会选择:从学习理论到人工智能对齐 |
Tianyi Qiu |
PDF |
N/A |
Representative Social Choice: From Learning Theory to AI Alignment |
| 可扩展核逆优化 |
Youyuan Long |
PDF |
N/A |
Scalable Kernel Inverse Optimization |
| 认知无线电网络的深度学习框架:综述与开放研究挑战 |
Senthil Kumar Jagatheesaperumal |
PDF |
N/A |
Deep Learning Frameworks for Cognitive Radio Networks: Review and Open Research Challenges |
| 变压器预测符号积分例程的适用性 |
Rashid Barket |
PDF |
N/A |
Transformers to Predict the Applicability of Symbolic Integration Routines |
| MV-CC:用于遥感变化描述的掩码增强视频模型 |
Ruixun Liu |
PDF |
N/A |
MV-CC: Mask Enhanced Video Model for Remote Sensing Change Caption |
| 量子深度平衡模型 |
Philipp Schleich |
PDF |
N/A |
Quantum Deep Equilibrium Models |
| 从部分微观观测中学习宏观动力学 |
Mengyi Chen |
PDF |
N/A |
Learning Macroscopic Dynamics from Partial Microscopic Observations |
| 具有非各向同性设计的鲁棒稀疏回归 |
Chih-Hung Liu |
PDF |
N/A |
Robust Sparse Regression with Non-Isotropic Designs |
| 基于层次模型的偏好一致性问题快速算法研究 |
Anne-Marie George |
PDF |
N/A |
Towards Fast Algorithms for the Preference Consistency Problem Based on Hierarchical Models |
| 语言模型能够自我扩展以生成长文本 |
Shanghaoran Quan |
PDF |
N/A |
Language Models can Self-Lengthen to Generate Long Texts |
| 通过潜在空间编辑操控车辆三维形状 |
JiangDong Miao |
PDF |
N/A |
Manipulating Vehicle 3D Shapes through Latent Space Editing |
| 分析并减少GPT训练中对学习率预热的需求 |
Atli Kosson |
PDF |
N/A |
Analyzing & Reducing the Need for Learning Rate Warmup in GPT Training |
| BitStack:在可变内存环境中对压缩大型语言模型进行细粒度大小控制 |
Xinghao Wang |
PDF |
N/A |
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments |
| 基于Transformer的模型预测控制:通过序列建模进行轨迹优化 |
Davide Celestini |
PDF |
N/A |
Transformer-based Model Predictive Control: Trajectory Optimization via Sequence Modeling |
| 基于字典模型的偏好语言最优替代方案的高效推理与计算 |
Nic Wilson |
PDF |
N/A |
Efficient Inference and Computation of Optimal Alternatives for Preference Languages Based On Lexicographic Models |
| RL-STaR:自教推理强化学习框架的理论分析 |
Fu-Chieh Chang |
PDF |
N/A |
RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner |
| 通过证据学习进行三维物体检测的不确定性估计 |
Nikita Durasov |
PDF |
N/A |
Uncertainty Estimation for 3D Object Detection via Evidential Learning |
| 从网络数据到实际领域:农业机器人的低成本无监督领域适应 |
Vasileios Tzouras |
PDF |
N/A |
From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots |
| Text-DiFuse:基于文本调制扩散模型的交互式多模态图像融合框架 |
Hao Zhang |
PDF |
N/A |
Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model |
| EZ-HOI:通过引导提示学习实现零样本HOI检测的VLM适应 |
Qinqian Lei |
PDF |
N/A |
EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection |
| 使用PyRAT进行神经网络验证 |
Augustin Lemesle |
PDF |
N/A |
Neural Network Verification with PyRAT |
| 负责任地从文档中检索增强生成以支持气候决策 |
Matyas Juhasz |
PDF |
N/A |
Responsible Retrieval Augmented Generation for Climate Decision Making from Documents |
| 变质恶意软件进化:大型语言模型的潜力与危险 |
Pooria Madani |
PDF |
N/A |
Metamorphic Malware Evolution: The Potential and Peril of Large Language Models |
| DiffBatt:一种用于电池退化预测与合成的扩散模型 |
Hamidreza Eivazi |
PDF |
N/A |
DiffBatt: A Diffusion Model for Battery Degradation Prediction and Synthesis |
| AllClear:一个用于卫星图像去云的综合数据集和基准测试 |
Hangyu Zhou |
PDF |
N/A |
AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery |
| 利用大型语言模型(LLMs)进行危机情境下的机器翻译:低资源语言的蓝图 |
Séamus Lankford |
PDF |
N/A |
Leveraging LLMs for MT in Crisis Scenarios: a blueprint for low-resource languages |
| GEPS:通过自适应调节提升参数化偏微分方程神经求解器的泛化能力 |
Armand Kassaï Koupaï |
PDF |
N/A |
GEPS: Boosting Generalization in Parametric PDE Neural Solvers through Adaptive Conditioning |
| 大型语言模型在叙事因果推理中的失败模式 |
Khurram Yamin |
PDF |
N/A |
Failure Modes of LLMs for Causal Reasoning on Narratives |
| “不”重要:多模态长对话中的分布外检测 |
Rena Gao |
PDF |
N/A |
'No' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue |
| DynaSplit:一种面向边缘设备能效推理的硬件-软件协同设计框架 |
Daniel May |
PDF |
N/A |
DynaSplit: A Hardware-Software Co-Design Framework for Energy-Aware Inference on Edge |
| 直接优化解释以实现所需属性 |
Hiwot Belay Tadesse |
PDF |
N/A |
Directly Optimizing Explanations for Desired Properties |
| Plan-on-Graph:知识图谱上大型语言模型的自校正自适应规划 |
Liyi Chen |
PDF |
N/A |
Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs |
| 噪声作为双刃剑:强化学习利用神经网络中的随机防御 |
Steve Bakos |
PDF |
N/A |
Noise as a Double-Edged Sword: Reinforcement Learning Exploits Randomized Defenses in Neural Networks |
| QuACK:一种适用于合作$k$臂老虎机的多功能排队算法 |
Benjamin Howson |
PDF |
N/A |
QuACK: A Multipurpose Queuing Algorithm for Cooperative $k$-Armed Bandits |
| $ψ$DAG:有向无环图结构学习的投影随机逼近迭代法 |
Klea Ziu |
PDF |
N/A |
$ψ$DAG: Projected Stochastic Approximation Iteration for DAG Structure Learning |
| 音频是阿喀琉斯之踵:对音频大型多模态模型进行红队测试 |
Hao Yang |
PDF |
N/A |
Audio Is the Achilles' Heel: Red Teaming Audio Large Multimodal Models |
| 神经网络矩阵乘积算符:一种多维度可积的机器学习潜力 |
Kentaro Hino |
PDF |
N/A |
Neural Network Matrix Product Operator: A Multi-Dimensionally Integrable Machine Learning Potential |
| 语言模型在带有噪声理由的思维链提示中能否进行稳健推理? |
Zhanke Zhou |
PDF |
N/A |
Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales? |
| RAGraph:一种通用的检索增强图学习框架 |
Xinke Jiang |
PDF |
N/A |
RAGraph: A General Retrieval-Augmented Graph Learning Framework |
| 气道标记与临床应用的结合:通过可学习的注意力机制反映拓扑一致性和异常值 |
Chenyu Li |
PDF |
N/A |
Airway Labeling Meets Clinical Applications: Reflecting Topology Consistency and Outliers via Learnable Attentions |
| 文本声明自动验证(AVeriTeC)共享任务 |
Michael Schlichtkrull |
PDF |
N/A |
The Automated Verification of Textual Claims (AVeriTeC) Shared Task |
| 基于时间序列数据的案例ID检测——挖掘用例 |
Edyta Brzychczy |
PDF |
N/A |
Case ID detection based on time series data -- the mining use case |
| 基于大语言模型中的自由文本常识知识编辑 |
Xiusheng Huang |
PDF |
N/A |
Commonsense Knowledge Editing Based on Free-Text in LLMs |
| 编辑后模型性能下降的原因及解决方案 |
Xiusheng Huang |
PDF |
N/A |
Reasons and Solutions for the Decline in Model Performance after Editing |
| 审计谷歌的搜索算法:衡量巴西、英国和美国的新闻多样性 |
Raphael Hernandes |
PDF |
N/A |
Auditing Google's Search Algorithm: Measuring News Diversity Across Brazil, the UK, and the US |
| 通过稳定贝尔曼误差最大化实现确定性探索 |
Sebastian Griesbach |
PDF |
N/A |
Deterministic Exploration via Stationary Bellman Error Maximization |
| 立体声说话者:基于音频驱动的3D人体合成与先验引导的混合专家模型 |
Xiang Deng |
PDF |
N/A |
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts |
| 使用条件去噪扩散生成模型进行反事实MRI数据增强 |
Pedro Morão |
PDF |
N/A |
Counterfactual MRI Data Augmentation using Conditional Denoising Diffusion Generative Models |
| 用于医学图像异常定位的去噪扩散模型 |
Cosmin I. Bercea |
PDF |
N/A |
Denoising Diffusion Models for Anomaly Localization in Medical Images |
| FRoundation:基础模型是否已准备好应对人脸识别? |
Tahar Chettaoui |
PDF |
N/A |
FRoundation: Are Foundation Models Ready for Face Recognition? |
| 通过在图神经网络中采用有信息量的权重初始化来减少过平滑问题 |
Dimitrios Kelesis |
PDF |
N/A |
Reducing Oversmoothing through Informed Weight Initialization in Graph Neural Networks |
| 展示变化的内容和位置?远程传感变化检测的问答与定位 |
Ke Li |
PDF |
N/A |
Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection |
| GlotCC:一个面向少数语言的开源广泛覆盖CommonCrawl语料库及处理流程 |
Amir Hossein Kargaran |
PDF |
N/A |
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages |
| 用于异构物联网网络中鲁棒联邦学习的生成式人工智能插件 |
Youngjoon Lee |
PDF |
N/A |
Generative AI-Powered Plugin for Robust Federated Learning in Heterogeneous IoT Networks |
| 用于医学视觉定位的参数高效微调医学多模态大型语言模型 |
Jinlong He |
PDF |
N/A |
Parameter-Efficient Fine-Tuning Medical Multimodal Large Language Models for Medical Visual Grounding |
| 解开纠缠表示:通过扩散模型实现更优的潜在单元 |
Youngjun Jun |
PDF |
N/A |
Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models |
| 权重衰减引入了低秩注意力层 |
Seijin Kobayashi |
PDF |
N/A |
Weight decay induces low-rank attention layers |
| ISCSLP 2024 激励与说服性音频生成挑战赛的NPU-HWC系统 |
Dake Guo |
PDF |
N/A |
The NPU-HWC System for the ISCSLP 2024 Inspirational and Convincing Audio Generation Challenge |
| 图神经网络揭示了基于强化学习的运动学习中的几何神经表征 |
Federico Nardi |
PDF |
N/A |
Graph Neural Networks Uncover Geometric Neural Representations in Reinforcement-Based Motor Learning |
| CALE:连续街机学习环境 |
Jesse Farebrother |
PDF |
N/A |
CALE: Continuous Arcade Learning Environment |
| 一例多用:同时高效逼近所有概率值 |
Weida Li |
PDF |
N/A |
One Sample Fits All: Approximating All Probabilistic Values Simultaneously and Efficiently |
| 基于骨架的量子时空相对变换网络用于人体动作识别(HAR):ST-RTR |
Faisal Mehmood |
PDF |
N/A |
Human Action Recognition (HAR) Using Skeleton-based Quantum Spatial Temporal Relative Transformer Network: ST-RTR |
| 用于可访问和包容性扩展现实的生成式人工智能 |
Jens Grubert |
PDF |
N/A |
Generative AI for Accessible and Inclusive Extended Reality |
| SOAR:从野外单个视频中恢复自遮挡的虚拟形象 |
Zhuoyang Pan |
PDF |
N/A |
SOAR: Self-Occluded Avatar Recovery from a Single Video In the Wild |
| 通过谐波/打击乐源分离和卷积神经网络在有限数据集下改进打鼾检测 |
F. D. Gonzalez-Martinez |
PDF |
N/A |
Improving snore detection under limited dataset through harmonic/percussive source separation and convolutional neural networks |
| 神经模型检测 |
Mirco Giacobbe |
PDF |
N/A |
Neural Model Checking |
| EDT:一种受人类素描启发的有效扩散Transformer框架 |
Xinwang Chen |
PDF |
N/A |
EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching |
| 长视频理解中的视频令牌合并 |
Seon-Ho Lee |
PDF |
N/A |
Video Token Merging for Long-form Video Understanding |
| 遵守规则驾驶:将交通标志法规融入矢量化高清地图的基准 |
Xinyuan Chang |
PDF |
N/A |
Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map |
| Neurobench:DCASE 2020 声学场景分类基准测试在 XyloAudio 上的应用 |
Weijie Ke |
PDF |
N/A |
Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2 |
| 用于扩散变换器的上下文低秩适应(In-Context LoRA for Diffusion Transformers) |
Lianghua Huang |
PDF |
N/A |
In-Context LoRA for Diffusion Transformers |
| 向凸性迈进:一种具有唯一最优解的新型SSLM公式 |
Hongying Liu |
PDF |
N/A |
Towards Convexity in Anomaly Detection: A New Formulation of SSLM with Unique Optimal Solutions |
| 朝向生成射线路径采样以加速点对点光线追踪 |
Jérome Eertmans |
PDF |
N/A |
Towards Generative Ray Path Sampling for Faster Point-to-Point Ray Tracing |
| 在特征归因中解耦交互与依赖关系 |
Gunnar König |
PDF |
N/A |
Disentangling Interactions and Dependencies in Feature Attribution |
| 长上下文语言建模中困惑度的问题是什么? |
Lizhe Fang |
PDF |
N/A |
What is Wrong with Perplexity for Long-context Language Modeling? |
| LLMs在医学教育中的潜力:为资格考试生成问题和答案 |
Yunqi Zhu |
PDF |
N/A |
The Potential of LLMs in Medical Education: Generating Questions and Answers for Qualification Exams |
| 在LiDAR数据中进行Open-Set 3D物体检测作为分布外问题 |
Louis Soum-Fontez |
PDF |
N/A |
Open-Set 3D object detection in LiDAR data as an Out-of-Distribution problem |
| 基于CCS的进程演算中多方交互的抽象续延语义 |
Eneia Nicolae Todoran |
PDF |
N/A |
Abstract Continuation Semantics for Multiparty Interactions in Process Calculi based on CCS |
| 反姿态统计星图识别方法 |
Shunmei Dong |
PDF |
N/A |
Reverse Attitude Statistics Based Star Map Identification Method |
| 增强国际象棋强化学习的图表示 |
Tomas Rigaux |
PDF |
N/A |
Enhancing Chess Reinforcement Learning with Graph Representation |
| EXACFS -- 一种缓解灾难性遗忘的CIL方法 |
S Balasubramanian |
PDF |
N/A |
EXACFS -- A CIL Method to mitigate Catastrophic Forgetting |
| LSEAttention:时间序列预测中你所需的一切 |
Dizhen Liang |
PDF |
N/A |
LSEAttention is All You Need for Time Series Forecasting |
| 探索图表示的一致性:从图核到图神经网络 |
Xuyuan Liu |
PDF |
N/A |
Exploring Consistency in Graph Representations:from Graph Kernels to Graph Neural Networks |
| DetectRL:在现实场景中对LLM生成文本检测进行基准测试 |
Junchao Wu |
PDF |
N/A |
DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios |
| Syno:神经算子的结构化合成 |
Yongqi Zhuo |
PDF |
N/A |
Syno: Structured Synthesis for Neural Operators |
| EchoNarrator:生成射血分数预测的自然文本解释 |
Sarina Thomas |
PDF |
N/A |
EchoNarrator: Generating natural text explanations for ejection fraction predictions |
| 大型语言模型在训练过程中,快速思考与慢速思考时各层发生了什么:从梯度视角的分析 |
Ming Li |
PDF |
N/A |
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective |
| 尺度逆图形:高效学习大量三维场景 |
Karim Kassab |
PDF |
N/A |
Scaled Inverse Graphics: Efficiently Learning Large Sets of 3D Scenes |
| MLLA-UNet:一种高效的U形模型,结合了类似Mamba的线性注意力机制,用于医学图像分割 |
Yufeng Jiang |
PDF |
N/A |
MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation |
| 一种非单体化的离线到在线强化学习策略方法 |
JaeYoon Kim |
PDF |
N/A |
A Non-Monolithic Policy Approach of Offline-to-Online Reinforcement Learning |
| MoTaDual:用于增强零样本组合图像检索的模态-任务双重对齐 |
Haiwen Li |
PDF |
N/A |
MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval |
| GPT-4V在时尚美学评估中的表现实证分析 |
Yuki Hirakawa |
PDF |
N/A |
An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation |