arXiv 2024-11-18 Papers

| Title | Author | PDF | Code Repository |
| --- | --- | --- | --- |
| UniHands: Unifying Various Wild-Collected Keypoints for Personalized Hand Reconstruction | Menghe Zhang | PDF | N/A |
| Generative World Explorer | Taiming Lu | PDF | N/A |
| Bi-Mamba: Towards Accurate 1-Bit State Space Models | Shengkun Tang | PDF | N/A |
| RoboGSim: A Real2Sim2Real Robotic Gaussian Splatting Simulator | Xinhai Li | PDF | N/A |
| Pairwise Markov Chains for Volatility Forecasting | Elie Azeraf | PDF | N/A |
| Tackling prediction tasks in relational databases with LLMs | Marek Wydmuch | PDF | N/A |
| LightFFDNets: Lightweight Convolutional Neural Networks for Rapid Facial Forgery Detection | Günel Jabbarlı | PDF | N/A |
| Equivariant spatio-hemispherical networks for diffusion MRI deconvolution | Axel Elaldi | PDF | N/A |
| KAN/MultKAN with Physics-Informed Spline fitting (KAN-PISF) for ordinary/partial differential equation discovery of nonlinear dynamic systems | Ashish Pal | PDF | N/A |
| Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image Fusion | Meng Zhou | PDF | N/A |
| Exploring adversarial robustness of JPEG AI: methodology, comparison and new methods | Egor Kovalev | PDF | N/A |
| Competing Bandits in Decentralized Large Contextual Matching Markets | Satush Parikh | PDF | N/A |
| A Potential Game Perspective in Federated Learning | Kang Liu | PDF | N/A |
| Parallelly Tempered Generative Adversarial Networks | Jinwon Sohn | PDF | N/A |
| LLM-IE: A Python Package for Generative Information Extraction with Large Language Models | Enshuo Hsu | PDF | N/A |
| Exploring the Requirements of Clinicians for Explainable AI Decision Support Systems in Intensive Care | Jeffrey N. Clark | PDF | N/A |
| CNMBert: A Model For Hanyu Pinyin Abbreviation to Character Conversion Task | Zishuo Feng | PDF | N/A |
| AdaptLIL: A Gaze-Adaptive Visualization for Ontology Mapping | Nicholas Chow | PDF | N/A |
| Drowning in Documents: Consequences of Scaling Reranker Inference | Mathew Jacob | PDF | N/A |
| Freezing of Gait Detection Using Gramian Angular Fields and Federated Learning from Wearable Sensors | Shovito Barua Soumma | PDF | N/A |
| Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework | Yannick Metz | PDF | N/A |
| The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning | Longju Bai | PDF | N/A |
| Debiased Regression for Root-N-Consistent Conditional Mean Estimation | Masahiro Kato | PDF | N/A |
| BitMoD: Bit-serial Mixture-of-Datatype LLM Acceleration | Yuzong Chen | PDF | N/A |
| Revitalizing Electoral Trust: Enhancing Transparency and Efficiency through Automated Voter Counting with Machine Learning | Mir Faris | PDF | N/A |
| QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou | Xinchen Luo | PDF | N/A |
| WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images | Lars Nieradzik | PDF | N/A |
| Advacheck at GenAI Detection Task 1: AI Detection Powered by Domain-Aware Multi-Tasking | German Gritsai | PDF | N/A |
| Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment | Allison Huang | PDF | N/A |
| Lifted Model Construction without Normalisation: A Vectorised Approach to Exploit Symmetries in Factor Graphs | Malte Luttermann | PDF | N/A |
| Aligning Few-Step Diffusion Models with Dense Reward Difference Learning | Ziyi Zhang | PDF | N/A |
| RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model | Hongjun Chen | PDF | N/A |
| Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation | Mingchao Qi | PDF | N/A |
| FLMarket: Enabling Privacy-preserved Pre-training Data Pricing for Federated Learning | Zhenyu Wen | PDF | N/A |
| FedCoLLM: A Parameter-Efficient Federated Co-tuning Framework for Large and Small Language Models | Tao Fan | PDF | N/A |
| MC-LLaVA: Multi-Concept Personalized Vision-Language Model | Ruichuan An | PDF | N/A |
| Robust Reinforcement Learning under Diffusion Models for Data with Jumps | Chenyang Jiang | PDF | N/A |
| Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search | Jinhao Jiang | PDF | N/A |
| From Spectra to Geography: Intelligent Mapping of RRUFF Mineral Data | Francesco Pappone | PDF | N/A |
| Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park | PDF | N/A |
| Conceptwm: A Diffusion Model Watermark for Concept Protection | Liangqi Lei | PDF | N/A |
| TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World | Xianlong Wang | PDF | N/A |
| Learning Differentiable Surrogate Losses for Structured Prediction | Junjie Yang | PDF | N/A |
| PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment | Jiawei Li | PDF | N/A |
| Analysis of Hardware Synthesis Strategies for Machine Learning in Collider Trigger and Data Acquisition | Haoyi Jia | PDF | N/A |
| Few-shot Model Extraction Attacks against Sequential Recommender Systems | Hui Zhang | PDF | N/A |
| Artificial Scientific Discovery | Antonio Norelli | PDF | N/A |
| Efficient and Robust Continual Graph Learning for Graph Classification in Biology | Ding Zhang | PDF | N/A |
| Dissecting Misalignment of Multimodal Large Language Models via Influence Function | Lijie Hu | PDF | N/A |
| No-regret Exploration in Shuffle Private Reinforcement Learning | Shaojie Bai | PDF | N/A |
| TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly Detection | Mengxuan Li | PDF | N/A |
| SP$^3$: Superpixel-propagated pseudo-label learning for weakly semi-supervised medical image segmentation | Shiman Li | PDF | N/A |
| Chapter 7 Review of Data-Driven Generative AI Models for Knowledge Extraction from Scientific Literature in Healthcare | Leon Kopitar | PDF | N/A |
| Federated Incremental Named Entity Recognition | Duzhen Zhang | PDF | N/A |
| ST-Tree with Interpretability for Multivariate Time Series Classification | Mingsen Du | PDF | N/A |
| FERT: Real-Time Facial Expression Recognition with Short-Range FMCW Radar | Sabri Mustafa Kahya | PDF | N/A |
| Signaling and Social Learning in Swarms of Robots | Leo Cazenille | PDF | N/A |
| On the physics of nested Markov models: a generalized probabilistic theory perspective | Xingjian Zhang | PDF | N/A |
| Leveraging Computational Pathology AI for Noninvasive Optical Imaging Analysis Without Retraining | Danny Barash | PDF | N/A |
| Feature Selection for Network Intrusion Detection | Charles Westphal | PDF | N/A |
| Generative Spatio-temporal GraphNet for Transonic Wing Pressure Distribution Forecasting | Gabriele Immordino | PDF | N/A |
| Robust Causal Analysis of Linear Cyclic Systems With Hidden Confounders | Boris Lorbeer | PDF | N/A |
| OASIS: Open Agents Social Interaction Simulations on One Million Agents | Ziyi Yang | PDF | N/A |
| Hybrid Data-Driven SSM for Interpretable and Label-Free mmWave Channel Prediction | Yiyong Sun | PDF | N/A |
| Analysis of Generalized Hebbian Learning Algorithm for Neuromorphic Hardware Using Spinnaker | Shivani Sharma | PDF | N/A |
| GNN-Based Code Annotation Logic for Establishing Security Boundaries in C Code | Varun Gadey | PDF | N/A |
| MSSIDD: A Benchmark for Multi-Sensor Denoising | Shibin Mei | PDF | N/A |
| Topology-aware Preemptive Scheduling for Co-located LLM Workloads | Ping Zhang | PDF | N/A |
| Data-driven model reconstruction for nonlinear wave dynamics | Ekaterina Smolina | PDF | N/A |
| Real-Time Fitness Exercise Classification and Counting from Video Frames | Riccardo Riccio | PDF | N/A |
| gpuPairHMM: High-speed Pair-HMM Forward Algorithm for DNA Variant Calling on GPUs | Bertil Schmidt | PDF | N/A |
| An Efficient Multicast Addressing Encoding Scheme for Multi-Core Neuromorphic Processors | Zhe Su | PDF | N/A |
| Enhancing Vision-Language Model Safety through Progressive Concept-Bottleneck-Driven Alignment | Zhendong Liu | PDF | N/A |
| Hierarchical-Graph-Structured Edge Partition Models for Learning Evolving Community Structure | Xincan Yu | PDF | N/A |
| Addressing Hallucinations in Language Models with Knowledge Graph Embeddings as an Additional Modality | Viktoriia Chekalina | PDF | N/A |
| SeqProFT: Applying LoRA Finetuning for Sequence-only Protein Property Predictions | Shuo Zhang | PDF | N/A |
| Reliable Poisoned Sample Detection against Backdoor Attacks Enhanced by Sharpness Aware Minimization | Mingda Zhang | PDF | N/A |
| Preempting Text Sanitization Utility in Resource-Constrained Privacy-Preserving LLM Interactions | Robin Carpentier | PDF | N/A |
| A Pre-Trained Graph-Based Model for Adaptive Sequencing of Educational Documents | Jean Vassoyan | PDF | N/A |
| Efficient Sample-optimal Learning of Gaussian Tree Models via Sample-optimal Testing of Gaussian Mutual Information | Sutanu Gayen | PDF | N/A |
| Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation | Rüveyda Yilmaz | PDF | N/A |
| Learning a Neural Association Network for Self-supervised Multi-Object Tracking | Shuai Li | PDF | N/A |
| A Modular Open Source Framework for Genomic Variant Calling | Ankita Vaishnobi Bisoi | PDF | N/A |
| Structure learning with Temporal Gaussian Mixture for model-based Reinforcement Learning | Théophile Champion | PDF | N/A |
| Closed-loop multi-step planning with innate physics knowledge | Giulia Lafratta | PDF | N/A |
| SignEye: Traffic Sign Interpretation from Vehicle First-Person View | Chuang Yang | PDF | N/A |
| LaVin-DiT: Large Vision Diffusion Transformer | Zhaoqing Wang | PDF | N/A |
| Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering | Xinyan Guan | PDF | N/A |
| Physics Encoded Blocks in Residual Neural Network Architectures for Digital Twin Models | Muhammad Saad Zia | PDF | N/A |
| Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models | Chenhang Cui | PDF | N/A |
| Alien Recombination: Exploring Concept Blends Beyond Human Cognitive Availability in Visual Art | Alejandro Hernandez | PDF | N/A |
| Look a Group at Once: Multi-Slide Modeling for Survival Prediction | Xinyang Li | PDF | N/A |
| Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos | PDF | N/A |
| Quantifying Preferences of Vision-Language Models via Value Decomposition in Social Media Contexts | Jingxuan Li | PDF | N/A |
| SL-YOLO: A Stronger and Lighter Drone Target Detection Model | Defan Chen | PDF | N/A |
| MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion | Dongseok Shim | PDF | N/A |
| Graph Artificial Intelligence for Quantifying Compatibility Mechanisms in Traditional Chinese Medicine | Jingqi Zeng | PDF | N/A |
| Generalizable Person Re-identification via Balancing Alignment and Uniformity | Yoonki Cho | PDF | N/A |
| Physics meets Topology: Physics-informed topological neural networks for learning rigid body dynamics | Amaury Wei | PDF | N/A |
| MGNiceNet: Unified Monocular Geometric Scene Understanding | Markus Schön | PDF | N/A |
| Re-examining learning linear functions in context | Omar Naim | PDF | N/A |
| PALMS: Parallel Adaptive Lasso with Multi-directional Signals for Latent Networks Reconstruction | Zhaoyu Xing | PDF | N/A |
| HistoEncoder: a digital pathology foundation model for prostate cancer | Joona Pohjonen | PDF | N/A |
| Upside-Down Reinforcement Learning for More Interpretable Optimal Control | Juan Cardenas-Cartagena | PDF | N/A |
| The ADUULM-360 Dataset -- A Multi-Modal Dataset for Depth Estimation in Adverse Weather | Markus Schön | PDF | N/A |
| Relevance-guided Audio Visual Fusion for Video Saliency Prediction | Li Yu | PDF | N/A |
| Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet | Marnix Suilen | PDF | N/A |
| Unveiling the Inflexibility of Adaptive Embedding in Traffic Forecasting | Hongjun Wang | PDF | N/A |
| Causal Effect of Group Diversity on Redundancy and Coverage in Peer-Reviewing | Navita Goyal | PDF | N/A |
| Implicit Regularization for Multi-label Feature Selection | Dou El Kefel Mansouri | PDF | N/A |
| GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts | Junwen He | PDF | N/A |
| Membership Inference Attack against Long-Context Large Language Models | Zixiong Wang | PDF | N/A |
| Towards fast DBSCAN via Spectrum-Preserving Data Compression | Yongyu Wang | PDF | N/A |
| Temporal and Spatial Reservoir Ensembling Techniques for Liquid State Machines | Anmol Biswas | PDF | N/A |
| IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos | Yunong Liu | PDF | N/A |
| The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models | Xikang Yang | PDF | N/A |
| Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms | Haizhou Ge | PDF | N/A |
| Extended Neural Contractive Dynamical Systems: On Multiple Tasks and Riemannian Safety Regions | Hadi Beik Mohammadi | PDF | N/A |
| Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection | Jikang Cheng | PDF | N/A |
| The GECo algorithm for Graph Neural Networks Explanation | Salvatore Calderaro | PDF | N/A |
| Lung Disease Detection with Vision Transformers: A Comparative Study of Machine Learning Methods | Baljinnyam Dayan | PDF | N/A |
| Graph Neural Networks on Graph Databases | Dmytro Lopushanskyy | PDF | N/A |
| LeC$^2$O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes | Zhenxing Mi | PDF | N/A |
| Rethinking Thinking Tokens: Understanding Why They Underperform in Practice | Sreeram Vennam | PDF | N/A |
| TL-CLIP: A Power-specific Multimodal Pre-trained Visual Foundation Model for Transmission Line Defect Recognition | Ke Zhang | PDF | N/A |
| SCOP: A Sequence-Structure Contrast-Aware Framework for Protein Function Prediction | Runze Ma | PDF | N/A |
| Continual Task Learning through Adaptive Policy Self-Composition | Shengchao Hu | PDF | N/A |
| GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views | Boyao Zhou | PDF | N/A |
| MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models | Harshita Sharma | PDF | N/A |
| Scalable Autoregressive Monocular Depth Estimation | Jinhong Wang | PDF | N/A |
| CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware Integration and a Foundational Dataset | Zhiming Wang | PDF | N/A |
| Text-guided Zero-Shot Object Localization | Jingjing Wang | PDF | N/A |
| Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data | Jiayi Li | PDF | N/A |
| A comprehensive survey of oracle character recognition: challenges, benchmarks, and beyond | Jing Li | PDF | N/A |
| Visual-Semantic Graph Matching Net for Zero-Shot Learning | Bowen Duan | PDF | N/A |
| Zero-Shot Load Forecasting with Large Language Models | Wenlong Liao | PDF | N/A |
| Modeling Multivariable High-resolution 3D Urban Microclimate Using Localized Fourier Neural Operator | Shaoxiang Qin | PDF | N/A |
| Mitigating Knowledge Conflicts in Language Model-Driven Question Answering | Han Cao | PDF | N/A |
| Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge | Qinglong Cao | PDF | N/A |
| A Hybrid Loss Framework for Decomposition-based Time Series Forecasting Methods: Balancing Global and Component Errors | Ronghui Han | PDF | N/A |
| Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition | Hanyu Guo | PDF | N/A |
| Color-Oriented Redundancy Reduction in Dataset Distillation | Bowen Yuan | PDF | N/A |
| Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation | Zhihong Liu | PDF | N/A |
| Cuvis.Ai: An Open-Source, Low-Code Software Ecosystem for Hyperspectral Processing and Classification | Nathaniel Hanson | PDF | N/A |
| Syllabus: Portable Curricula for Reinforcement Learning Agents | Ryan Sullivan | PDF | N/A |
| A Review on Machine Unlearning | Haibo Zhang | PDF | N/A |
| Study of the Performance of CEEMDAN in Underdetermined Speech Separation | Rawad Melhem | PDF | N/A |
| TP-UNet: Temporal Prompt Guided UNet for Medical Image Segmentation | Ranmin Wang | PDF | N/A |
| Toward Personalized Federated Node Classification in One-shot Communication | Guochen Yan | PDF | N/A |
| Recurrent Stochastic Configuration Networks with Incremental Blocks | Gang Dang | PDF | N/A |
| Towards Personalized Brain-Computer Interface Application Based on Endogenous EEG Paradigms | Heon-Gyu Kwak | PDF | N/A |
| Accelerating spherical K-means clustering for large-scale sparse document data | Kazuo Aoyama | PDF | N/A |
| Steering Language Model Refusal with Sparse Autoencoders | Kyle O'Brien | PDF | N/A |
| Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation | Peng Shu | PDF | N/A |
| SADDE: Semi-supervised Anomaly Detection with Dependable Explanations | Yachao Yuan | PDF | N/A |
| Performance Evaluation of Geospatial Images based on Zarr and Tiff | Jaheer Khan | PDF | N/A |
| LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models | Yungi Kim | PDF | N/A |
| Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition | Yang Chen | PDF | N/A |
| Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications | Scarlett Raine | PDF | N/A |
| Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development | Ranjan Sapkota | PDF | N/A |
| Dual-Frequency Filtering Self-aware Graph Neural Networks for Homophilic and Heterophilic Graphs | Yachao Yang | PDF | N/A |
| Multi-Hyperbolic Space-based Heterogeneous Graph Attention Network | Jongmin Park | PDF | N/A |
| Continuous K-space Recovery Network with Image Guidance for Fast MRI Reconstruction | Yucong Meng | PDF | N/A |
| Towards Open-Vocabulary Audio-Visual Event Localization | Jinxing Zhou | PDF | N/A |
| Coupled Integral PINN for conservation law | Yeping Wang | PDF | N/A |
| Effective Predictive Modeling for Emergency Department Visits and Evaluating Exogenous Variables Impact: Using Explainable Meta-learning Gradient Boosting | Mehdi Neshat | PDF | N/A |
| ACE2: Accurately learning subseasonal to decadal atmospheric variability and forced responses | Oliver Watt-Meyer | PDF | N/A |
| VersaTune: Fine-Tuning Multi-Ability LLMs Efficiently | Keer Lu | PDF | N/A |
| GROOT: Effective Design of Biological Sequences with Limited Experimental Data | Thanh V. T. Tran | PDF | N/A |
| Cross-Patient Pseudo Bags Generation and Curriculum Contrastive Learning for Imbalanced Multiclassification of Whole Slide Image | Yonghuang Wu | PDF | N/A |
| Large corpora and large language models: a replicable method for automating grammatical annotation | Cameron Morin | PDF | N/A |
| Graph Retention Networks for Dynamic Graphs | Qian Chang | PDF | N/A |
| Progressive Generalization Risk Reduction for Data-Efficient Causal Effect Estimation | Hechuan Wen | PDF | N/A |
| Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection | Xingming Long | PDF | N/A |
| DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Tianyi Yan | PDF | N/A |
| EXCON: Extreme Instance-based Contrastive Representation Learning of Severely Imbalanced Multivariate Time Series for Solar Flare Prediction | Onur Vural | PDF | N/A |
| ZeFaV: Boosting Large Language Models for Zero-shot Fact Verification | Son T. Luu | PDF | N/A |
| Mirror Descent on Reproducing Kernel Banach Spaces | Akash Kumar | PDF | N/A |
| Reliable Learning of Halfspaces under Gaussian Marginals | Ilias Diakonikolas | PDF | N/A |
| MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis | Yingjie Zhou | PDF | N/A |
| Noise Filtering Benchmark for Neuromorphic Satellites Observations | Sami Arja | PDF | N/A |
| BeautyBank: Encoding Facial Makeup in Latent Space | Qianwen Lu | PDF | N/A |
| Don't Be So Positive: Negative Step Sizes in Second-Order Methods | Betty Shea | PDF | N/A |
| Efficient Transfer Learning for Video-language Foundation Models | Haoxing Chen | PDF | N/A |
| The Sound of Water: Inferring Physical Properties from Pouring Liquids | Piyush Bagad | PDF | N/A |
| Data Driven Automatic Electrical Machine Preliminary Design with Artificial Intelligence Expert Guidance | Yiwei Wang | PDF | N/A |
| Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition | Tiancheng Lin | PDF | N/A |
| MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs | Shiyi Cao | PDF | N/A |
| DeforHMR: Vision Transformer with Deformable Cross-Attention for 3D Human Mesh Recovery | Jaewoo Heo | PDF | N/A |
| Making Sigmoid-MSE Great Again: Output Reset Challenges Softmax Cross-Entropy in Neural Network Classification | Kanishka Tyagi | PDF | N/A |