| RoboTwin:配备生成式数字孪生的双臂机器人基准测试(早期版本) |
Yao Mu |
PDF |
N/A |
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version) |
| HiPrompt:通过分层MLLM提示实现无调优的高分辨率生成 |
Xinyu Liu |
PDF |
N/A |
HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts |
| UC-NeRF:从内窥镜稀疏视图中实现不确定性感知的条件神经辐射场 |
Jiaxin Guo |
PDF |
N/A |
UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views |
| 大型视觉语言模型能否获得驾照?面向自动驾驶可靠通用人工智能的基准测试 |
Yuhang Lu |
PDF |
N/A |
Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving |
| SITAR:用于动作识别的半监督图像变换器 |
Owais Iqbal |
PDF |
N/A |
SITAR: Semi-supervised Image Transformer for Action Recognition |
| 掩码扩散模型实际上是时间无关的掩码模型,并利用了不准确的分类采样 |
Kaiwen Zheng |
PDF |
N/A |
Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling |
| 拓扑方法在机器学习中的应用:面向实践者的教程 |
Baris Coskunuzer |
PDF |
N/A |
Topological Methods in Machine Learning: A Tutorial for Practitioners |
| LongCite:使大型语言模型能够在长上下文问答中生成细粒度的引用 |
Jiajie Zhang |
PDF |
N/A |
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA |
| 区域数据驱动的全球拉伸网格天气模拟 |
Thomas Nils Nipen |
PDF |
N/A |
Regional data-driven weather modeling with a global stretched-grid |
| LongLLaVA:通过混合架构高效扩展多模态大语言模型至1000张图像 |
Xidong Wang |
PDF |
N/A |
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture |
| CanvOI,一个肿瘤学智能基础模型:以不同的方式扩展FLOPS |
Jonathan Zalach |
PDF |
N/A |
CanvOI, an Oncology Intelligence Foundation Model: Scaling FLOPS Differently |
| 多流深度学习框架,用于通过雷伊复杂图形测试预测轻度认知障碍 |
Junyoung Park |
PDF |
N/A |
Multi-stream deep learning framework to predict mild cognitive impairment with Rey Complex Figure Test |
| 基准测试少样本图像分类器中的虚假偏差 |
Guangtao Zheng |
PDF |
N/A |
Benchmarking Spurious Bias in Few-Shot Image Classifiers |
| 可配置的基础模型:从模块化角度构建大型语言模型 |
Chaojun Xiao |
PDF |
N/A |
Configurable Foundation Models: Building LLMs from a Modular Perspective |
| 城市驾驶混合模仿学习运动规划器 |
Cristian Gariboldi |
PDF |
N/A |
Hybrid Imitation-Learning Motion Planner for Urban Driving |
| 深入了解用于时间序列分类的LITE深度学习方法 |
Ali Ismail-Fawaz |
PDF |
N/A |
Look Into the LITE in Deep Learning for Time Series Classification |
| 平衡真实数据与合成数据对人脸识别中准确性与公平性的影响 |
Andrea Atzori |
PDF |
N/A |
The Impact of Balancing Real and Synthetic Data on Accuracy and Fairness in Face Recognition |
| 混合分割器:一种用于土木基础设施中自动细粒度裂缝分割的混合方法 |
June Moh Goo |
PDF |
N/A |
Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure |
| 生物信息学检索增强数据(BRAD)数字助手 |
Joshua Pickard |
PDF |
N/A |
Bioinformatics Retrieval Augmentation Data (BRAD) Digital Assistant |
| CONClave -- 利用认证共识和信任评分实现CAV的安全稳健协同感知 |
Edward Andert |
PDF |
N/A |
CONClave -- Secure and Robust Cooperative Perception for CAVs Using Authenticated Consensus and Trust Scoring |
| 构建一个可扩展、高效且可控的搜索与排序平台 |
Marjan Celikik |
PDF |
N/A |
Building a Scalable, Effective, and Steerable Search and Ranking Platform |
| Human-VDM:从视频扩散模型中学习单张图像的三维人体高斯喷射 |
Zhibin Liu |
PDF |
N/A |
Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models |
| 哎呀,我又采样了一次:重新解读少样本学习中的置信区间 |
Raphael Lafargue |
PDF |
N/A |
Oops, I Sampled it Again: Reinterpreting Confidence Intervals in Few-Shot Learning |
| MaDis-Stereo:通过蒸馏掩码图像建模增强的立体匹配 |
Jihye Ahn |
PDF |
N/A |
MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling |
| SNNAX -- 在 JAX 中的脉冲神经网络 |
Jamie Lohoff |
PDF |
N/A |
SNNAX -- Spiking Neural Networks in JAX |
| 使用类型和基于标记的语言建模进行历史德语文本规范化 |
Anton Ehrmanntraut |
PDF |
N/A |
Historical German Text Normalization Using Type- and Token-Based Language Modeling |
| R2GQA:检索器-阅读器-生成器问答系统,旨在帮助学生理解高等教育中的法律规章 |
Phuc-Tinh Pham Do |
PDF |
N/A |
R2GQA: Retriever-Reader-Generator Question Answering System to Support Students Understanding Legal Regulations in Higher Education |
| iConFormer:通过输入条件适应实现动态参数高效调整 |
Hayeon Jo |
PDF |
N/A |
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation |
| 通过大型语言模型进行少样本学习,探索加密货币讨论中的情感动态和预测行为 |
Moein Shahiki Tash |
PDF |
N/A |
Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models |
| CMM-Math:一个中文多模态数学数据集,用于评估和提升大型多模态模型的数学推理能力 |
Wentao Liu |
PDF |
N/A |
CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models |
| ExpLLM:面向面部表情识别的思维链方法 |
Xing Lan |
PDF |
N/A |
ExpLLM: Towards Chain of Thought for Facial Expression Recognition |
| 三维胎儿超声图像的自动面部轴标准化 |
Antonia Alomar |
PDF |
N/A |
Automatic facial axes standardization of 3D fetal ultrasound images |
| 深度学习与卫星图像的结合——手工特征与基于学习的特征在多日期卫星立体图像上的评估 |
Shuang Song |
PDF |
N/A |
Deep Learning Meets Satellite Images -- An Evaluation on Handcrafted and Learning-based Features for Multi-date Satellite Stereo Images |
| Obsidian:安全机器学习加速器上高效推理的协作状态空间探索 |
Sarbartha Banerjee |
PDF |
N/A |
Obsidian: Cooperative State-Space Exploration for Performant Inference on Secure ML Accelerators |
| MMMU-Pro:一个更强大的多学科多模态理解基准 |
Xiang Yue |
PDF |
N/A |
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark |
| 一种用于时间相关偏微分方程的混合有限元-物理信息神经网络方法 |
Xiaodong Feng |
PDF |
N/A |
A hybrid FEM-PINN method for time-dependent partial differential equations |
| 面向智能交通系统的边缘数据湖架构 |
Danilo Fernandes |
PDF |
N/A |
Towards Edge-Based Data Lake Architecture for Intelligent Transportation System |
| 提升时间序列分类证书鲁棒性的高效自集成方法 |
Chang Dong |
PDF |
N/A |
Boosting Certificate Robustness for Time Series Classification with Efficient Self-Ensemble |
| 迈向大语言模型偏好学习的统一视角:一项综述 |
Bofei Gao |
PDF |
N/A |
Towards a Unified View of Preference Learning for Large Language Models: A Survey |
| 从经验中“反学习”以避免虚假关联 |
Jeff Mitchell |
PDF |
N/A |
UnLearning from Experience to Avoid Spurious Correlations |
| 管理两用技术:国际安全协议案例研究及对人工智能治理的启示 |
Akash R. Wasil |
PDF |
N/A |
Governing dual-use technologies: Case studies of international security agreements and lessons for AI governance |
| 带有领域适应的正则化多输出高斯卷积过程 |
Wang Xinming |
PDF |
N/A |
Regularized Multi-output Gaussian Convolution Process with Domain Adaptation |
| 将因果表征学习与不变性原理统一起来 |
Dingling Yao |
PDF |
N/A |
Unifying Causal Representation Learning with the Invariance Principle |
| 髋至膝临床CT图像中骨与肌肉评估的不确定性估计肌肉骨骼分割模型验证 |
Mazen Soufi |
PDF |
N/A |
Validation of musculoskeletal segmentation model with uncertainty estimation for bone and muscle assessment in hip-to-knee clinical CT images |
| 一种基于增量偏好诱导的方法,用于学习多准则排序中可能的非单调偏好 |
Zhuolin Li |
PDF |
N/A |
An incremental preference elicitation-based approach to learning potentially non-monotonic preferences in multi-criteria sorting |
| 预训练与自训练的比较研究 |
Yiheng Wang |
PDF |
N/A |
A Comparative Study of Pre-training and Self-training |
| 可处理的正则决策过程离线学习 |
Ahana Deb |
PDF |
N/A |
Tractable Offline Learning of Regular Decision Processes |
| 卷积神经网络用于自动细胞自动机分类 |
Michiel Rollier |
PDF |
N/A |
Convolutional Neural Networks for Automated Cellular Automaton Classification |
| 完整且高效的3D点配置协变量及其在分子量子性质学习中的应用 |
Hartmut Maennel |
PDF |
N/A |
Complete and Efficient Covariants for 3D Point Configurations with Application to Learning Molecular Quantum Properties |
| 面向图数据的任务导向通信:一种图信息瓶颈方法 |
Shujing Li |
PDF |
N/A |
Task-Oriented Communication for Graph Data: A Graph Information Bottleneck Approach |
| 池化和注意力:基于大型语言模型(LLM)的嵌入模型中,哪些设计是有效的? |
Yixuan Tang |
PDF |
N/A |
Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models? |
| 使用期刊影响指标进行生物医学领域适应的预训练数据选择 |
Mathieu Laï-king |
PDF |
N/A |
Pre-training data selection for biomedical domain adaptation using journal impact metrics |
| 针对大型语言模型的对齐感知模型提取攻击 |
Zi Liang |
PDF |
N/A |
Alignment-Aware Model Extraction Attacks on Large Language Models |
| 一种利用跨语言句子表示增强低资源机器翻译的数据选择方法 |
Nidhi Kowtal |
PDF |
N/A |
A Data Selection Approach for Enhancing Low Resource Machine Translation Using Cross-Lingual Sentence Representations |
| 为PostNL创建基于生成式AI的追踪与追溯助手MVP(SuperTracy) |
Mohammad Reshadati |
PDF |
N/A |
Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL |
| 少样本多任务学习线性不变特征的元子空间追踪 |
Chaozhi Zhang |
PDF |
N/A |
Few-shot Multi-Task Learning of Linear Invariant Features with Meta Subspace Pursuit |
| 结合志同道合的同伴克服基于会话的社交推荐中的好友数据稀疏性 |
Chunyan An |
PDF |
N/A |
Incorporating Like-Minded Peers to Overcome Friend Data Sparsity in Session-Based Social Recommendations |
| CLDA:增强无监督域适应的协作学习 |
Minhee Cho |
PDF |
N/A |
CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation |
| 化学网络中二阶反应的精确首次通过时间分布 |
Changqian Rao |
PDF |
N/A |
Exact first passage time distribution for second-order reactions in chemical networks |
| 用于增强作业车间调度问题中神经局部搜索的Decision Transformer |
Constantin Waubert de Puiseau |
PDF |
N/A |
Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem |
| 人工智能和机器学习在软件测试中的作用 |
Ahmed Ramadan |
PDF |
N/A |
The Role of Artificial Intelligence and Machine Learning in Software Testing |
| 大语言模型辅助的视觉分析:机遇与挑战 |
Maeve Hutchinson |
PDF |
N/A |
LLM-Assisted Visual Analytics: Opportunities and Challenges |
| 检测多模态内容中的行动号召:对2021年德国联邦选举在Instagram上的竞选活动分析 |
Michael Achmann-Denkler |
PDF |
N/A |
Detecting Calls to Action in Multimodal Content: Analysis of the 2021 German Federal Election Campaign on Instagram |
| 去混淆因果感知参数高效微调,以提升大语言模型的问题解决能力 |
Ruoyu Wang |
PDF |
N/A |
Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs |
| RouterRetriever:探索在多个专家嵌入模型上进行路由的优势 |
Hyunji Lee |
PDF |
N/A |
RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models |
| 从计算角度看神经时间尺度 |
Roxana Zeraati |
PDF |
N/A |
Neural timescales from a computational perspective |
| 重新思考HTG评估:弥合生成与识别之间的鸿沟 |
Konstantina Nikolaidou |
PDF |
N/A |
Rethinking HTG Evaluation: Bridging Generation and Recognition |
| 在亚马逊地区活跃火灾建模中使用LSTM和GRU的神经网络 |
Ramon Tavares |
PDF |
N/A |
Neural Networks with LSTM and GRU in Modeling Active Fires in the Amazon |
| 基于超声传感器和速率编码的低成本实时脉冲障碍物检测系统 |
Alvaro Ayuso-Martinez |
PDF |
N/A |
A Low-Cost Real-Time Spiking System for Obstacle Detection based on Ultrasonic Sensors and Rate Coding |
| 使用多摄像头训练改进单摄像头BEV感知 |
Daniel Busch |
PDF |
N/A |
Improved Single Camera BEV Perception Using Multi-Camera Training |
| 基于模型的多头部注意力残差展开网络的泛锐化方法 |
Ivan Pereira-Sánchez |
PDF |
N/A |
Multi-Head Attention Residual Unfolded Network for Model-Based Pansharpening |
| 从认识论角度看独立约束的解耦表示学习 |
Ruoyu Wang |
PDF |
N/A |
Independence Constrained Disentangled Representation Learning from Epistemological Perspective |
| 因果感知变换器网络用于机器人导航 |
Ruoyu Wang |
PDF |
N/A |
Causality-Aware Transformer Networks for Robotic Navigation |
| 机器学习简介 |
Laurent Younes |
PDF |
N/A |
Introduction to Machine Learning |
| 为机器翻译微调创建领域特定翻译记忆库:TRENCARD双语心脏病学语料库 |
Gokhan Dogru |
PDF |
N/A |
Creating Domain-Specific Translation Memories for Machine Translation Fine-tuning: The TRENCARD Bilingual Cardiology Corpus |
| 站在巨人的肩膀上:重新编程视觉-语言模型用于通用深度伪造检测 |
Kaiqing Lin |
PDF |
N/A |
Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection |
| PoseTalk:基于文本和音频的姿态控制与运动优化,用于一次性说话头生成 |
Jun Ling |
PDF |
N/A |
PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation |
| 跳跃与播放:深度驱动的姿态保持图像生成,适用于任意物体 |
Kyungmin Jo |
PDF |
N/A |
Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects |
| OpenFact在CheckThat! 2024:结合多种攻击方法实现有效的对抗性文本生成 |
Włodzimierz Lewoniewski |
PDF |
N/A |
OpenFact at CheckThat! 2024: Combining Multiple Attack Methods for Effective Adversarial Text Generation |
| 创建具有丰富材料信息的多相合金设计微观结构潜在空间 |
Xudong Ma |
PDF |
N/A |
Creating a Microstructure Latent Space with Rich Material Information for Multiphase Alloy Design |
| 基于学习的先进车辆仪表集群渲染错误检测系统 |
Cornelius Bürkle |
PDF |
N/A |
Learning-Based Error Detection System for Advanced Vehicle Instrument Cluster Rendering |
| 关于新兴语言的调查 |
Jannik Peters |
PDF |
N/A |
A Survey on Emergent Language |
| 动态生物系统中的共形预测 |
Alberto Portela |
PDF |
N/A |
Conformal Prediction in Dynamic Biological Systems |
| MADiff:面向以自我为中心视频的手轨迹预测的动觉感知Mamba扩散模型 |
Junyi Ma |
PDF |
N/A |
MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos |
| Loopy:利用长期运动依赖驯服音频驱动的肖像化身 |
Jianwen Jiang |
PDF |
N/A |
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency |
| 使用探索性代理评估环境 |
Bobby Khaleque |
PDF |
N/A |
Evaluating Environments Using Exploratory Agents |
| AdvSecureNet:一个用于对抗机器学习的Python工具包 |
Melih Catal |
PDF |
N/A |
AdvSecureNet: A Python Toolkit for Adversarial Machine Learning |
| (隐式)集成中的集成:大型模型中的认知不确定性崩溃 |
Andreas Kirsch |
PDF |
N/A |
(Implicit) Ensembles of Ensembles: Epistemic Uncertainty Collapse in Large Models |
| PUB:用于评估大型语言模型合成视觉数据解读能力的图表理解基准与数据集 |
Aneta Pawelec |
PDF |
N/A |
PUB: Plot Understanding Benchmark and Dataset for Evaluating Large Language Models on Synthetic Visual Data Interpretation |
| GoT-CQA:基于图思维引导的组合推理图表问答系统 |
Lingling Zhang |
PDF |
N/A |
GoT-CQA: Graph-of-Thought Guided Compositional Reasoning for Chart Question Answering |
| 用于小儿肺炎的医学多模态大型语言模型 |
Weiwei Tian |
PDF |
N/A |
A Medical Multimodal Large Language Model for Pediatric Pneumonia |
| 利用大型语言模型(LLMs)假设缺失的因果变量 |
Ivaxi Sheth |
PDF |
N/A |
Hypothesizing Missing Causal Variables with LLMs |
| 一种双曲空间中的时尚单品推荐模型 |
Ryotaro Shimizu |
PDF |
N/A |
A Fashion Item Recommendation Model in Hyperbolic Space |
| SurgTrack:无CAD的现实手术器械3D追踪 |
Wenwu Guo |
PDF |
N/A |
SurgTrack: CAD-Free 3D Tracking of Real-world Surgical Instruments |
| 基于BEST-RQ的线性复杂度注意力替代方案分析 |
Ryan Whetten |
PDF |
N/A |
An Analysis of Linear Complexity Attention Substitutes with BEST-RQ |
| 用于预测DNA结合蛋白的多视角随机向量功能链接网络 |
A. Quadir |
PDF |
N/A |
Multiview Random Vector Functional Link Network for Predicting DNA-Binding Proteins |
| 使用卷积神经网络从手写英文字符预测BMI |
N. T. Diba |
PDF |
N/A |
BMI Prediction from Handwritten English Characters Using a Convolutional Neural Network |
| 从稀疏视角进行单目6D姿态估计的对象高斯方法 |
Luqing Luo |
PDF |
N/A |
Object Gaussian for Monocular 6D Pose Estimation from Sparse Views |
| AlignGroup:学习群体共识并使其与成员偏好对齐,以实现群体推荐 |
Jinfeng Xu |
PDF |
N/A |
AlignGroup: Learning and Aligning Group Consensus with Member Preferences for Group Recommendation |
| 使用图像扩散模型解决视频逆问题 |
Taesung Kwon |
PDF |
N/A |
Solving Video Inverse Problems Using Image Diffusion Models |
| 通过基于规则的人工智能和大型语言模型推进网络事件时间线分析 |
Fatma Yasmine Loumachi |
PDF |
N/A |
Advancing Cyber Incident Timeline Analysis Through Rule Based AI and Large Language Models |
| 多多益善:大型语言模型中的加法偏见 |
Luca Santagata |
PDF |
N/A |
More is More: Addition Bias in Large Language Models |
| 关于SAM 2在类别无关实例级分割中的评估研究 |
Tiantian Zhang |
PDF |
N/A |
Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation |
| 你如何看待我的面容?通过建模心理表征来识别多模态情境中的面部表情 |
Florian Blume |
PDF |
N/A |
How Do You Perceive My Face? Recognizing Facial Expressions in Multi-Modal Context by Modeling Mental Representations |
| 基于交互多模型的联合单应矩阵与多目标状态估计 |
Paul Johannes Claasen |
PDF |
N/A |
Interacting Multiple Model-based Joint Homography Matrix and Multiple Object State Estimation |
| 视觉-语言导航与持续学习 |
Zhiyuan Li |
PDF |
N/A |
Vision-Language Navigation with Continual Learning |
| 低分辨率物体识别中的跨分辨率关系对比蒸馏 |
Kangkai Zhang |
PDF |
N/A |
Low-Resolution Object Recognition with Cross-Resolution Relational Contrastive Distillation |
| 一种用于周界识别的顺序决策模型 |
Ayal Taitler |
PDF |
N/A |
A Sequential Decision-Making Model for Perimeter Identification |
| 实时动态尺度感知融合检测网络:以道路损伤检测为例 |
Weichao Pan |
PDF |
N/A |
Real-Time Dynamic Scale-Aware Fusion Detection Network: Take Road Damage Detection as an example |
| UniTT-Stereo:统一训练Transformer以增强立体匹配 |
Soomin Kim |
PDF |
N/A |
UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching |
| StyleTokenizer:通过单个实例定义图像风格以控制扩散模型 |
Wen Li |
PDF |
N/A |
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models |
| 通过大型多模态模型理解eGFR轨迹和肾功能下降 |
Chih-Yuan Li |
PDF |
N/A |
Understanding eGFR Trajectories and Kidney Function Decline via Large Multimodal Models |
| 对无法压缩的内容进行采样 |
Vighnesh Birodkar |
PDF |
N/A |
Sample what you can't compress |
| 基于重整化群方法的昼夜节律中温度补偿和同步的波形畸变:一种方法 |
Shingo Gibo |
PDF |
N/A |
Waveform distortion for temperature compensation and synchronization in circadian rhythms: An approach based on the renormalization group method |
| Cog-GA:一种基于大型语言模型的生成式智能体,用于连续环境中的视觉语言导航 |
Zhiyuan Li |
PDF |
N/A |
Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments |
| 语言过度分析时令人恐惧:运用论证理论驱动的提示解析隐含的厌女逻辑 |
Arianna Muti |
PDF |
N/A |
Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts |
| 使用基于特征平滑的增强方法训练通用声码器以构建高质量的TTS系统 |
Jeongmin Liu |
PDF |
N/A |
Training Universal Vocoders with Feature Smoothing-Based Augmentation Methods for High-Quality TTS Systems |
| SG-MIM:结构化知识引导的高效预训练方法,适用于密集预测任务 |
Sumin Son |
PDF |
N/A |
SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction |
| 持续扩散器(CoD):通过经验复现掌握持续离线强化学习 |
Jifeng Hu |
PDF |
N/A |
Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal |
| TLD:车辆尾灯信号数据集与基准测试 |
Jinhao Chai |
PDF |
N/A |
TLD: A Vehicle Tail Light signal Dataset and Benchmark |
| 可学习的RAW重建色彩校正矩阵 |
Anqi Liu |
PDF |
N/A |
A Learnable Color Correction Matrix for RAW Reconstruction |
| CoAst:基于跨轮估值的无验证联邦学习贡献评估 |
Hao Wu |
PDF |
N/A |
CoAst: Validation-Free Contribution Assessment for Federated Learning based on Cross-Round Valuation |
| Plane2Depth:用于单目深度估计的分层自适应平面引导 |
Li Liu |
PDF |
N/A |
Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation |
| 可靠的深度扩散张量估计:重新思考数据驱动优化程序的力量 |
Jialong Li |
PDF |
N/A |
Reliable Deep Diffusion Tensor Estimation: Rethinking the Power of Data-Driven Optimization Routine |
| TP-GMOT:通过运动-外观成本(MAC)SORT,利用文本提示实现对通用多目标的跟踪 |
Duy Le Dinh Anh |
PDF |
N/A |
TP-GMOT: Tracking Generic Multiple Object by Textual Prompt with Motion-Appearance Cost (MAC) SORT |
| NeuroSpex:基于跨模态注意力的神经引导说话人提取 |
Dashanka De Silva |
PDF |
N/A |
NeuroSpex: Neuro-Guided Speaker Extraction with Cross-Modal Attention |
| 通过元初始化提升零样本跨数据集单图像室内深度的泛化能力 |
Cho-Ying Wu |
PDF |
N/A |
Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization |
| 对抗性攻击对基于机器学习的可视化的影响 |
Takanori Fujiwara |
PDF |
N/A |
Adversarial Attacks on Machine Learning-Aided Visualizations |
| TASAR:骨架动作识别的可转移攻击 |
Yunfeng Diao |
PDF |
N/A |
TASAR: Transferable Attack on Skeletal Action Recognition |
| 体积表面:用多个网格表示模糊几何体 |
Stefano Esposito |
PDF |
N/A |
Volumetric Surfaces: Representing Fuzzy Geometries with Multiple Meshes |
| 图卷积网络中的词与短语特征在自动问题分类中的应用 |
Junyoung Lee |
PDF |
N/A |
Word and Phrase Features in Graph Convolutional Network for Automatic Question Classification |
| 大型语言模型在日志解析中的比较研究 |
Merve Astekin |
PDF |
N/A |
A Comparative Study on Large Language Models for Log Parsing |
| 在无意识框架下的回归和分类中的人口统计学平价 |
Vincent Divol |
PDF |
N/A |
Demographic parity in regression and classification within the unawareness framework |
| 侦探QA:评估长篇推理小说中的长上下文推理能力 |
Zhe Xu |
PDF |
N/A |
DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels |
| FrameCorr:资源和时序受限网络环境下基于自适应、自编码器的视频重建神经压缩技术 |
John Li |
PDF |
N/A |
FrameCorr: Adaptive, Autoencoder-based Neural Compression for Video Reconstruction in Resource and Timing Constrained Network Settings |
| 使用可微分数字信号处理实现快速、高质量和参数高效的发音合成 |
Yisi Liu |
PDF |
N/A |
Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP |
| 标准化中遗失了什么?探究多语言自动语音识别模型评估中的陷阱 |
Kavya Manohar |
PDF |
N/A |
What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations |
| 使用分层模型通过图像检测韩国食品 |
Hoang Khanh Lam |
PDF |
N/A |
Detecting Korean Food Using Image using Hierarchical Model |
| ForeCal:基于随机森林的深度神经网络校准方法 |
Dhruv Nigam |
PDF |
N/A |
ForeCal: Random Forest-based Calibration for DNNs |
| 非目标分歧假设:迈向理解跨模态知识蒸馏中的领域差异 |
Yilong Chen |
PDF |
N/A |
Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation |
| 基于上下文感知的智能长途运输系统代理模型 |
Muhammad Raees |
PDF |
N/A |
Context-Aware Agent-based Model for Smart Long Distance Transport System |
| 对抗性学习用于稀疏数据下的神经偏微分方程求解器 |
Yunpeng Gong |
PDF |
N/A |
Adversarial Learning for Neural PDE Solvers with Sparse Data |
| 基于迁移的对抗性投毒攻击在线(多输入多输出-)深度接收器 |
Kunze Wu |
PDF |
N/A |
Transfer-based Adversarial Poisoning Attacks for Online (MIMO-)Deep Receivers |
| 无训练色彩风格解耦用于受限文本到图像合成 |
Aishwarya Agarwal |
PDF |
N/A |
Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis |
| 大型语言模型作为自定义环境多目标强化学习的有效奖励函数搜索器 |
Guanwen Xie |
PDF |
N/A |
Large Language Models as Efficient Reward Function Searchers for Custom-Environment Multi-Objective Reinforcement Learning |
| 扩散模型通过子空间聚类学习低维分布 |
Peng Wang |
PDF |
N/A |
Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering |
| 深度自适应兴趣网络:基于上下文感知学习的个性化推荐 |
Shuaishuai Huang |
PDF |
N/A |
Deep Adaptive Interest Network: Personalized Recommendation with Context-Aware Learning |
| 通过混合GPU压缩加速大型语言模型训练 |
Lang Xu |
PDF |
N/A |
Accelerating Large Language Model Training with Hybrid GPU-based Compression |
| MOSMOS:借助医学报告监督的多器官分割 |
Weiwei Tian |
PDF |
N/A |
MOSMOS: Multi-organ segmentation facilitated by medical report supervision |
| 相对平移不变的沃瑟斯坦距离 |
Binshuai Wang |
PDF |
N/A |
Relative-Translation Invariant Wasserstein Distance |
| 基于SD地图的局部地图构建方法:一项新颖的调查 |
Jiaqi Li |
PDF |
N/A |
Local map Construction Methods with SD map: A Novel Survey |
| 抽象文本摘要:现状、挑战与改进 |
Hassan Shakil |
PDF |
N/A |
Abstractive Text Summarization: State of the Art, Challenges, and Improvements |
| 自适应类涌现训练:通过渐进目标进化提升神经网络的稳定性和泛化能力 |
Jaouad Dabounou |
PDF |
N/A |
Adaptive Class Emergence Training: Enhancing Neural Network Stability and Generalization through Progressive Target Evolution |
| 哈达玛逐行生成算法 |
Brayan Monroy |
PDF |
N/A |
Hadamard Row-Wise Generation Algorithm |
| 通过判别-生成蒸馏学习隐私保护的学生网络 |
Shiming Ge |
PDF |
N/A |
Learning Privacy-Preserving Student Networks via Discriminative-Generative Distillation |
| 使用深度学习确定语言家族 |
Peter B. Lerner |
PDF |
N/A |
Determination of language families using deep learning |
| 构建具有多轮迭代偏好学习的数学代理 |
Wei Xiong |
PDF |
N/A |
Building Math Agents with Multi-Turn Iterative Preference Learning |
| 经济生产力规模法则:LLM辅助翻译的实验证据 |
Ali Merali |
PDF |
N/A |
Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Translation |
| 视觉决策的神经动力学模型:从人类专家中学习 |
Jie Su |
PDF |
N/A |
Neural Dynamics Model of Visual Decision-Making: Learning from Human Experts |
| 三维场景中的多模态情境推理 |
Xiongkun Linghu |
PDF |
N/A |
Multi-modal Situated Reasoning in 3D Scenes |
| 高斯速率-失真-感知编码与熵约束标量量化 |
Li Xie |
PDF |
N/A |
Gaussian Rate-Distortion-Perception Coding and Entropy-Constrained Scalar Quantization |
| 大型语言模型与认知科学:相似性、差异性与挑战的综合评述 |
Qian Niu |
PDF |
N/A |
Large Language Models and Cognitive Science: A Comprehensive Review of Similarities, Differences, and Challenges |
| 统一框架,确保多模态间的一致性,用于人体活动识别 |
Tuyen Tran |
PDF |
N/A |
Unified Framework with Consistency across Modalities for Human Activity Recognition |
| STAB:语音分词评估基准 |
Shikhar Vashishth |
PDF |
N/A |
STAB: Speech Tokenizer Assessment Benchmark |
| GGS:自动驾驶中车道切换的通用高斯喷溅技术 |
Huasong Han |
PDF |
N/A |
GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving |
| 从单张图像生成珊瑚模型以用于虚拟现实应用 |
Jie Fu |
PDF |
N/A |
Coral Model Generation from Single Images for Virtual Reality Applications |
| 大型语言模型在隐私保护方面表现如何?合规与隐私技术审查案例研究 |
Xichou Zhu |
PDF |
N/A |
How Privacy-Savvy Are Large Language Models? A Case Study on Compliance and Privacy Technical Review |
| 探索扩散模型中的低维子空间以实现可控图像编辑 |
Siyi Chen |
PDF |
N/A |
Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing |
| 通过泰勒展开揭示视频动态 |
Siyi Chen |
PDF |
N/A |
Unfolding Videos Dynamics via Taylor Expansion |
| 大型语言模型是否具备情感敏感性? |
Yang Liu |
PDF |
N/A |
Do Large Language Models Possess Sensitive to Sentiment? |
| 多元显著目标检测 |
Xuelu Feng |
PDF |
N/A |
Pluralistic Salient Object Detection |
| 最优高维连续函数神经网络逼近 |
Ayan Maiti |
PDF |
N/A |
Optimal Neural Network Approximation for High-Dimensional Continuous Functions |
| 多样化-验证-适应:高效且鲁棒的检索增强型模糊问答 |
Yeonjun In |
PDF |
N/A |
Diversify-verify-adapt: Efficient and Robust Retrieval-Augmented Ambiguous Question Answering |
| 机器学习在计算等离子体物理与降阶等离子体建模中的应用:展望 |
Farbod Faraji |
PDF |
N/A |
Machine Learning Applications to Computational Plasma Physics and Reduced-Order Plasma Modeling: A Perspective |
| 理解功能多样性在基于成分选择和多维尺度分析的权重集成中的作用 |
Alex Rojas |
PDF |
N/A |
Understanding the Role of Functional Diversity in Weight-Ensembling with Ingredient Selection and Multidimensional Scaling |
| 通过交替最小化LoRA实现基础模型的鲁棒联邦微调 |
Shuangyi Chen |
PDF |
N/A |
Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA |
| NUDGE:用于检索的嵌入轻量级非参数微调 |
Sepanta Zeighami |
PDF |
N/A |
NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval |
| 最小二乘逼近的最优采样 |
Ben Adcock |
PDF |
N/A |
Optimal sampling for least-squares approximation |
| 通过深度神经网络学习,在修正的含两个势能的GP方程中,数据驱动的二维静态量子液滴和波传播 |
Jin Song |
PDF |
N/A |
Data-driven 2D stationary quantum droplets and wave propagations in the amended GP equation with two potentials via deep neural networks learning |