Arxiv 2024-09-04 Papers

| Title | Author | PDF link | Code repository |
| --- | --- | --- | --- |
| RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version) | Yao Mu | PDF | N/A |
| HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts | Xinyu Liu | PDF | N/A |
| UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views | Jiaxin Guo | PDF | N/A |
| Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving | Yuhang Lu | PDF | N/A |
| SITAR: Semi-supervised Image Transformer for Action Recognition | Owais Iqbal | PDF | N/A |
| Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling | Kaiwen Zheng | PDF | N/A |
| Topological Methods in Machine Learning: A Tutorial for Practitioners | Baris Coskunuzer | PDF | N/A |
| LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | Jiajie Zhang | PDF | N/A |
| Regional data-driven weather modeling with a global stretched-grid | Thomas Nils Nipen | PDF | N/A |
| LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture | Xidong Wang | PDF | N/A |
| CanvOI, an Oncology Intelligence Foundation Model: Scaling FLOPS Differently | Jonathan Zalach | PDF | N/A |
| Multi-stream deep learning framework to predict mild cognitive impairment with Rey Complex Figure Test | Junyoung Park | PDF | N/A |
| Benchmarking Spurious Bias in Few-Shot Image Classifiers | Guangtao Zheng | PDF | N/A |
| Configurable Foundation Models: Building LLMs from a Modular Perspective | Chaojun Xiao | PDF | N/A |
| Hybrid Imitation-Learning Motion Planner for Urban Driving | Cristian Gariboldi | PDF | N/A |
| Look Into the LITE in Deep Learning for Time Series Classification | Ali Ismail-Fawaz | PDF | N/A |
| The Impact of Balancing Real and Synthetic Data on Accuracy and Fairness in Face Recognition | Andrea Atzori | PDF | N/A |
| Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure | June Moh Goo | PDF | N/A |
| Bioinformatics Retrieval Augmentation Data (BRAD) Digital Assistant | Joshua Pickard | PDF | N/A |
| CONClave -- Secure and Robust Cooperative Perception for CAVs Using Authenticated Consensus and Trust Scoring | Edward Andert | PDF | N/A |
| Building a Scalable, Effective, and Steerable Search and Ranking Platform | Marjan Celikik | PDF | N/A |
| Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Zhibin Liu | PDF | N/A |
| Oops, I Sampled it Again: Reinterpreting Confidence Intervals in Few-Shot Learning | Raphael Lafargue | PDF | N/A |
| MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling | Jihye Ahn | PDF | N/A |
| SNNAX -- Spiking Neural Networks in JAX | Jamie Lohoff | PDF | N/A |
| Historical German Text Normalization Using Type- and Token-Based Language Modeling | Anton Ehrmanntraut | PDF | N/A |
| R2GQA: Retriever-Reader-Generator Question Answering System to Support Students Understanding Legal Regulations in Higher Education | Phuc-Tinh Pham Do | PDF | N/A |
| iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation | Hayeon Jo | PDF | N/A |
| Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models | Moein Shahiki Tash | PDF | N/A |
| CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models | Wentao Liu | PDF | N/A |
| ExpLLM: Towards Chain of Thought for Facial Expression Recognition | Xing Lan | PDF | N/A |
| Automatic facial axes standardization of 3D fetal ultrasound images | Antonia Alomar | PDF | N/A |
| Deep Learning Meets Satellite Images -- An Evaluation on Handcrafted and Learning-based Features for Multi-date Satellite Stereo Images | Shuang Song | PDF | N/A |
| Obsidian: Cooperative State-Space Exploration for Performant Inference on Secure ML Accelerators | Sarbartha Banerjee | PDF | N/A |
| MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark | Xiang Yue | PDF | N/A |
| A hybrid FEM-PINN method for time-dependent partial differential equations | Xiaodong Feng | PDF | N/A |
| Towards Edge-Based Data Lake Architecture for Intelligent Transportation System | Danilo Fernandes | PDF | N/A |
| Boosting Certificate Robustness for Time Series Classification with Efficient Self-Ensemble | Chang Dong | PDF | N/A |
| Towards a Unified View of Preference Learning for Large Language Models: A Survey | Bofei Gao | PDF | N/A |
| UnLearning from Experience to Avoid Spurious Correlations | Jeff Mitchell | PDF | N/A |
| Governing dual-use technologies: Case studies of international security agreements and lessons for AI governance | Akash R. Wasil | PDF | N/A |
| Regularized Multi-output Gaussian Convolution Process with Domain Adaptation | Wang Xinming | PDF | N/A |
| Unifying Causal Representation Learning with the Invariance Principle | Dingling Yao | PDF | N/A |
| Validation of musculoskeletal segmentation model with uncertainty estimation for bone and muscle assessment in hip-to-knee clinical CT images | Mazen Soufi | PDF | N/A |
| An incremental preference elicitation-based approach to learning potentially non-monotonic preferences in multi-criteria sorting | Zhuolin Li | PDF | N/A |
| A Comparative Study of Pre-training and Self-training | Yiheng Wang | PDF | N/A |
| Tractable Offline Learning of Regular Decision Processes | Ahana Deb | PDF | N/A |
| Convolutional Neural Networks for Automated Cellular Automaton Classification | Michiel Rollier | PDF | N/A |
| Complete and Efficient Covariants for 3D Point Configurations with Application to Learning Molecular Quantum Properties | Hartmut Maennel | PDF | N/A |
| Task-Oriented Communication for Graph Data: A Graph Information Bottleneck Approach | Shujing Li | PDF | N/A |
| Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models? | Yixuan Tang | PDF | N/A |
| Pre-training data selection for biomedical domain adaptation using journal impact metrics | Mathieu Laï-king | PDF | N/A |
| Alignment-Aware Model Extraction Attacks on Large Language Models | Zi Liang | PDF | N/A |
| A Data Selection Approach for Enhancing Low Resource Machine Translation Using Cross-Lingual Sentence Representations | Nidhi Kowtal | PDF | N/A |
| Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL | Mohammad Reshadati | PDF | N/A |
| Few-shot Multi-Task Learning of Linear Invariant Features with Meta Subspace Pursuit | Chaozhi Zhang | PDF | N/A |
| Incorporating Like-Minded Peers to Overcome Friend Data Sparsity in Session-Based Social Recommendations | Chunyan An | PDF | N/A |
| CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation | Minhee Cho | PDF | N/A |
| Exact first passage time distribution for second-order reactions in chemical networks | Changqian Rao | PDF | N/A |
| Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem | Constantin Waubert de Puiseau | PDF | N/A |
| The Role of Artificial Intelligence and Machine Learning in Software Testing | Ahmed Ramadan | PDF | N/A |
| LLM-Assisted Visual Analytics: Opportunities and Challenges | Maeve Hutchinson | PDF | N/A |
| Detecting Calls to Action in Multimodal Content: Analysis of the 2021 German Federal Election Campaign on Instagram | Michael Achmann-Denkler | PDF | N/A |
| Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs | Ruoyu Wang | PDF | N/A |
| RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models | Hyunji Lee | PDF | N/A |
| Neural timescales from a computational perspective | Roxana Zeraati | PDF | N/A |
| Rethinking HTG Evaluation: Bridging Generation and Recognition | Konstantina Nikolaidou | PDF | N/A |
| Neural Networks with LSTM and GRU in Modeling Active Fires in the Amazon | Ramon Tavares | PDF | N/A |
| A Low-Cost Real-Time Spiking System for Obstacle Detection based on Ultrasonic Sensors and Rate Coding | Alvaro Ayuso-Martinez | PDF | N/A |
| Improved Single Camera BEV Perception Using Multi-Camera Training | Daniel Busch | PDF | N/A |
| Multi-Head Attention Residual Unfolded Network for Model-Based Pansharpening | Ivan Pereira-Sánchez | PDF | N/A |
| Independence Constrained Disentangled Representation Learning from Epistemological Perspective | Ruoyu Wang | PDF | N/A |
| Causality-Aware Transformer Networks for Robotic Navigation | Ruoyu Wang | PDF | N/A |
| Introduction to Machine Learning | Laurent Younes | PDF | N/A |
| Creating Domain-Specific Translation Memories for Machine Translation Fine-tuning: The TRENCARD Bilingual Cardiology Corpus | Gokhan Dogru | PDF | N/A |
| Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection | Kaiqing Lin | PDF | N/A |
| PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation | Jun Ling | PDF | N/A |
| Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects | Kyungmin Jo | PDF | N/A |
| OpenFact at CheckThat! 2024: Combining Multiple Attack Methods for Effective Adversarial Text Generation | Włodzimierz Lewoniewski | PDF | N/A |
| Creating a Microstructure Latent Space with Rich Material Information for Multiphase Alloy Design | Xudong Ma | PDF | N/A |
| Learning-Based Error Detection System for Advanced Vehicle Instrument Cluster Rendering | Cornelius Bürkle | PDF | N/A |
| A Survey on Emergent Language | Jannik Peters | PDF | N/A |
| Conformal Prediction in Dynamic Biological Systems | Alberto Portela | PDF | N/A |
| MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos | Junyi Ma | PDF | N/A |
| Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency | Jianwen Jiang | PDF | N/A |
| Evaluating Environments Using Exploratory Agents | Bobby Khaleque | PDF | N/A |
| AdvSecureNet: A Python Toolkit for Adversarial Machine Learning | Melih Catal | PDF | N/A |
| (Implicit) Ensembles of Ensembles: Epistemic Uncertainty Collapse in Large Models | Andreas Kirsch | PDF | N/A |
| PUB: Plot Understanding Benchmark and Dataset for Evaluating Large Language Models on Synthetic Visual Data Interpretation | Aneta Pawelec | PDF | N/A |
| GoT-CQA: Graph-of-Thought Guided Compositional Reasoning for Chart Question Answering | Lingling Zhang | PDF | N/A |
| A Medical Multimodal Large Language Model for Pediatric Pneumonia | Weiwei Tian | PDF | N/A |
| Hypothesizing Missing Causal Variables with LLMs | Ivaxi Sheth | PDF | N/A |
| A Fashion Item Recommendation Model in Hyperbolic Space | Ryotaro Shimizu | PDF | N/A |
| SurgTrack: CAD-Free 3D Tracking of Real-world Surgical Instruments | Wenwu Guo | PDF | N/A |
| An Analysis of Linear Complexity Attention Substitutes with BEST-RQ | Ryan Whetten | PDF | N/A |
| Multiview Random Vector Functional Link Network for Predicting DNA-Binding Proteins | A. Quadir | PDF | N/A |
| BMI Prediction from Handwritten English Characters Using a Convolutional Neural Network | N. T. Diba | PDF | N/A |
| Object Gaussian for Monocular 6D Pose Estimation from Sparse Views | Luqing Luo | PDF | N/A |
| AlignGroup: Learning and Aligning Group Consensus with Member Preferences for Group Recommendation | Jinfeng Xu | PDF | N/A |
| Solving Video Inverse Problems Using Image Diffusion Models | Taesung Kwon | PDF | N/A |
| Advancing Cyber Incident Timeline Analysis Through Rule Based AI and Large Language Models | Fatma Yasmine Loumachi | PDF | N/A |
| More is More: Addition Bias in Large Language Models | Luca Santagata | PDF | N/A |
| Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation | Tiantian Zhang | PDF | N/A |
| How Do You Perceive My Face? Recognizing Facial Expressions in Multi-Modal Context by Modeling Mental Representations | Florian Blume | PDF | N/A |
| Interacting Multiple Model-based Joint Homography Matrix and Multiple Object State Estimation | Paul Johannes Claasen | PDF | N/A |
| Vision-Language Navigation with Continual Learning | Zhiyuan Li | PDF | N/A |
| Low-Resolution Object Recognition with Cross-Resolution Relational Contrastive Distillation | Kangkai Zhang | PDF | N/A |
| A Sequential Decision-Making Model for Perimeter Identification | Ayal Taitler | PDF | N/A |
| Real-Time Dynamic Scale-Aware Fusion Detection Network: Take Road Damage Detection as an example | Weichao Pan | PDF | N/A |
| UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching | Soomin Kim | PDF | N/A |
| StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models | Wen Li | PDF | N/A |
| Understanding eGFR Trajectories and Kidney Function Decline via Large Multimodal Models | Chih-Yuan Li | PDF | N/A |
| Sample what you cant compress | Vighnesh Birodkar | PDF | N/A |
| Waveform distortion for temperature compensation and synchronization in circadian rhythms: An approach based on the renormalization group method | Shingo Gibo | PDF | N/A |
| Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments | Zhiyuan Li | PDF | N/A |
| Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts | Arianna Muti | PDF | N/A |
| Training Universal Vocoders with Feature Smoothing-Based Augmentation Methods for High-Quality TTS Systems | Jeongmin Liu | PDF | N/A |
| SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction | Sumin Son | PDF | N/A |
| Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal | Jifeng Hu | PDF | N/A |
| TLD: A Vehicle Tail Light signal Dataset and Benchmark | Jinhao Chai | PDF | N/A |
| A Learnable Color Correction Matrix for RAW Reconstruction | Anqi Liu | PDF | N/A |
| CoAst: Validation-Free Contribution Assessment for Federated Learning based on Cross-Round Valuation | Hao Wu | PDF | N/A |
| Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation | Li Liu | PDF | N/A |
| Reliable Deep Diffusion Tensor Estimation: Rethinking the Power of Data-Driven Optimization Routine | Jialong Li | PDF | N/A |
| TP-GMOT: Tracking Generic Multiple Object by Textual Prompt with Motion-Appearance Cost (MAC) SORT | Duy Le Dinh Anh | PDF | N/A |
| NeuroSpex: Neuro-Guided Speaker Extraction with Cross-Modal Attention | Dashanka De Silva | PDF | N/A |
| Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization | Cho-Ying Wu | PDF | N/A |
| Adversarial Attacks on Machine Learning-Aided Visualizations | Takanori Fujiwara | PDF | N/A |
| TASAR: Transferable Attack on Skeletal Action Recognition | Yunfeng Diao | PDF | N/A |
| Volumetric Surfaces: Representing Fuzzy Geometries with Multiple Meshes | Stefano Esposito | PDF | N/A |
| Word and Phrase Features in Graph Convolutional Network for Automatic Question Classification | Junyoung Lee | PDF | N/A |
| A Comparative Study on Large Language Models for Log Parsing | Merve Astekin | PDF | N/A |
| Demographic parity in regression and classification within the unawareness framework | Vincent Divol | PDF | N/A |
| DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels | Zhe Xu | PDF | N/A |
| FrameCorr: Adaptive, Autoencoder-based Neural Compression for Video Reconstruction in Resource and Timing Constrained Network Settings | John Li | PDF | N/A |
| Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP | Yisi Liu | PDF | N/A |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | Kavya Manohar | PDF | N/A |
| Detecting Korean Food Using Image using Hierarchical Model | Hoang Khanh Lam | PDF | N/A |
| ForeCal: Random Forest-based Calibration for DNNs | Dhruv Nigam | PDF | N/A |
| Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation | Yilong Chen | PDF | N/A |
| Context-Aware Agent-based Model for Smart Long Distance Transport System | Muhammad Raees | PDF | N/A |
| Adversarial Learning for Neural PDE Solvers with Sparse Data | Yunpeng Gong | PDF | N/A |
| Transfer-based Adversarial Poisoning Attacks for Online (MIMO-)Deep Receviers | Kunze Wu | PDF | N/A |
| Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis | Aishwarya Agarwal | PDF | N/A |
| Large Language Models as Efficient Reward Function Searchers for Custom-Environment Multi-Objective Reinforcement Learning | Guanwen Xie | PDF | N/A |
| Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering | Peng Wang | PDF | N/A |
| Deep Adaptive Interest Network: Personalized Recommendation with Context-Aware Learning | Shuaishuai Huang | PDF | N/A |
| Accelerating Large Language Model Training with Hybrid GPU-based Compression | Lang Xu | PDF | N/A |
| MOSMOS: Multi-organ segmentation facilitated by medical report supervision | Weiwei Tian | PDF | N/A |
| Relative-Translation Invariant Wasserstein Distance | Binshuai Wang | PDF | N/A |
| Local map Construction Methods with SD map: A Novel Survey | Jiaqi Li | PDF | N/A |
| Abstractive Text Summarization: State of the Art, Challenges, and Improvements | Hassan Shakil | PDF | N/A |
| Adaptive Class Emergence Training: Enhancing Neural Network Stability and Generalization through Progressive Target Evolution | Jaouad Dabounou | PDF | N/A |
| Hadamard Row-Wise Generation Algorithm | Brayan Monroy | PDF | N/A |
| Learning Privacy-Preserving Student Networks via Discriminative-Generative Distillation | Shiming Ge | PDF | N/A |
| Determination of language families using deep learning | Peter B. Lerner | PDF | N/A |
| Building Math Agents with Multi-Turn Iterative Preference Learning | Wei Xiong | PDF | N/A |
| Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Translation | Ali Merali | PDF | N/A |
| Neural Dynamics Model of Visual Decision-Making: Learning from Human Experts | Jie Su | PDF | N/A |
| Multi-modal Situated Reasoning in 3D Scenes | Xiongkun Linghu | PDF | N/A |
| Gaussian Rate-Distortion-Perception Coding and Entropy-Constrained Scalar Quantization | Li Xie | PDF | N/A |
| Large Language Models and Cognitive Science: A Comprehensive Review of Similarities, Differences, and Challenges | Qian Niu | PDF | N/A |
| Unified Framework with Consistency across Modalities for Human Activity Recognition | Tuyen Tran | PDF | N/A |
| STAB: Speech Tokenizer Assessment Benchmark | Shikhar Vashishth | PDF | N/A |
| GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving | Huasong Han | PDF | N/A |
| Coral Model Generation from Single Images for Virtual Reality Applications | Jie Fu | PDF | N/A |
| How Privacy-Savvy Are Large Language Models? A Case Study on Compliance and Privacy Technical Review | Xichou Zhu | PDF | N/A |
| Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing | Siyi Chen | PDF | N/A |
| Unfolding Videos Dynamics via Taylor Expansion | Siyi Chen | PDF | N/A |
| Do Large Language Models Possess Sensitive to Sentiment? | Yang Liu | PDF | N/A |
| Pluralistic Salient Object Detection | Xuelu Feng | PDF | N/A |
| Optimal Neural Network Approximation for High-Dimensional Continuous Functions | Ayan Maiti | PDF | N/A |
| Diversify-verify-adapt: Efficient and Robust Retrieval-Augmented Ambiguous Question Answering | Yeonjun In | PDF | N/A |
| Machine Learning Applications to Computational Plasma Physics and Reduced-Order Plasma Modeling: A Perspective | Farbod Faraji | PDF | N/A |
| Understanding the Role of Functional Diversity in Weight-Ensembling with Ingredient Selection and Multidimensional Scaling | Alex Rojas | PDF | N/A |
| Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA | Shuangyi Chen | PDF | N/A |
| NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval | Sepanta Zeighami | PDF | N/A |
| Optimal sampling for least-squares approximation | Ben Adcock | PDF | N/A |
| Data-driven 2D stationary quantum droplets and wave propagations in the amended GP equation with two potentials via deep neural networks learning | Jin Song | PDF | N/A |