Arxiv 2024-11-18 Papers

标题	作者	PDF链接	代码仓库	Title
UniHands：统一各种野外采集的关键点，用于个性化手部重建	Menghe Zhang	PDF	N/A	UniHands: Unifying Various Wild-Collected Keypoints for Personalized Hand Reconstruction
生成世界探索者	Taiming Lu	PDF	N/A	Generative World Explorer
Bi-Mamba：迈向精确的1位状态空间模型	Shengkun Tang	PDF	N/A	Bi-Mamba: Towards Accurate 1-Bit State Space Models
RoboGSim：一个用于机器人仿真的Real2Sim2Real高斯样条模拟器	Xinhai Li	PDF	N/A	RoboGSim: A Real2Sim2Real Robotic Gaussian Splatting Simulator
用于波动率预测的成对马尔可夫链	Elie Azeraf	PDF	N/A	Pairwise Markov Chains for Volatility Forecasting
利用大型语言模型处理关系数据库中的预测任务	Marek Wydmuch	PDF	N/A	Tackling prediction tasks in relational databases with LLMs
LightFFDNets：轻量级卷积神经网络用于快速面部伪造检测	Günel Jabbarlı	PDF	N/A	LightFFDNets: Lightweight Convolutional Neural Networks for Rapid Facial Forgery Detection
用于扩散磁共振成像去卷积的等变空间-半球网络	Axel Elaldi	PDF	N/A	Equivariant spatio-hemispherical networks for diffusion MRI deconvolution
用于非线性动力系统常/偏微分方程发现的KAN/MultKAN结合物理信息样条拟合（KAN-PISF）方法	Ashish Pal	PDF	N/A	KAN/MultKAN with Physics-Informed Spline fitting (KAN-PISF) for ordinary/partial differential equation discovery of nonlinear dynamic systems
边缘增强的多模态医学图像融合的膨胀残差注意力网络	Meng Zhou	PDF	N/A	Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image Fusion
探索JPEG AI的对抗鲁棒性：方法、比较与新方法	Egor Kovalev	PDF	N/A	Exploring adversarial robustness of JPEG AI: methodology, comparison and new methods
分散式大型上下文匹配市场中的竞争性强盗	Satush Parikh	PDF	N/A	Competing Bandits in Decentralized Large Contextual Matching Markets
联邦学习中的潜在博弈视角	Kang Liu	PDF	N/A	A Potential Game Perspective in Federated Learning
并行温度调节生成对抗网络	Jinwon Sohn	PDF	N/A	Parallelly Tempered Generative Adversarial Networks
LLM-IE：一个用于大型语言模型生成信息提取的Python包	Enshuo Hsu	PDF	N/A	LLM-IE: A Python Package for Generative Information Extraction with Large Language Models
探索临床医生对重症监护中可解释人工智能决策支持系统的需求	Jeffrey N. Clark	PDF	N/A	Exploring the Requirements of Clinicians for Explainable AI Decision Support Systems in Intensive Care
CNMBert：一种用于汉语拼音缩写到汉字转换任务的模型	Zishuo Feng	PDF	N/A	CNMBert: A Model For Hanyu Pinyin Abbreviation to Character Conversion Task
AdaptLIL：一种用于本体映射的注视自适应可视化方法	Nicholas Chow	PDF	N/A	AdaptLIL: A Gaze-Adaptive Visualization for Ontology Mapping
文档之海：扩展重排序器推理的后果	Mathew Jacob	PDF	N/A	Drowning in Documents: Consequences of Scaling Reranker Inference
使用格拉姆角场和可穿戴传感器联邦学习进行步态冻结检测	Shovito Barua Soumma	PDF	N/A	Freezing of Gait Detection Using Gramian Angular Fields and Federated Learning from Wearable Sensors
绘制人类反馈在强化学习中的空间：一个概念框架	Yannick Metz	PDF	N/A	Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework
多智能体多模态模型在文化图像描述中的力量	Longju Bai	PDF	N/A	The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning
无偏回归用于根号N一致的条件均值估计	Masahiro Kato	PDF	N/A	Debiased Regression for Root-N-Consistent Conditional Mean Estimation
BitMoD：比特串行混合数据类型LLM加速	Yuzong Chen	PDF	N/A	BitMoD: Bit-serial Mixture-of-Datatype LLM Acceleration
重振选举信任：通过机器学习自动化计票提升透明度与效率	Mir Faris	PDF	N/A	Revitalizing Electoral Trust: Enhancing Transparency and Efficiency through Automated Voter Counting with Machine Learning
QARM：快手上的定量对齐多模态推荐	Xinchen Luo	PDF	N/A	QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
WoodYOLO：一种用于显微图像中木材种类检测的新型目标检测器	Lars Nieradzik	PDF	N/A	WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images
Advacheck在GenAI检测任务1中：基于领域感知多任务的AI检测	German Gritsai	PDF	N/A	Advacheck at GenAI Detection Task 1: AI Detection Powered by Domain-Aware Multi-Tasking
大型语言模型中的道德说服：评估易感性与伦理一致性	Allison Huang	PDF	N/A	Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment
无需归一化的提升模型构建：利用因子图对称性的向量化方法	Malte Luttermann	PDF	N/A	Lifted Model Construction without Normalisation: A Vectorised Approach to Exploit Symmetries in Factor Graphs
将少量步扩散模型与密集奖励差异学习对齐	Ziyi Zhang	PDF	N/A	Aligning Few-Step Diffusion Models with Dense Reward Difference Learning
RAWMamba：统一sRGB到RAW的去渲染与状态空间模型	Hongjun Chen	PDF	N/A	RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model
语义-几何-物理驱动的机器人操作技能转移：通过技能库和触觉表示实现	Mingchao Qi	PDF	N/A	Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation
FLMarket：为联邦学习实现隐私保护的预训练数据定价	Zhenyu Wen	PDF	N/A	FLMarket: Enabling Privacy-preserved Pre-training Data Pricing for Federated Learning
FedCoLLM：一种参数高效的联邦协同微调框架，适用于大型和小型语言模型	Tao Fan	PDF	N/A	FedCoLLM: A Parameter-Efficient Federated Co-tuning Framework for Large and Small Language Models
MC-LLaVA：多概念个性化视觉-语言模型	Ruichuan An	PDF	N/A	MC-LLaVA: Multi-Concept Personalized Vision-Language Model
基于扩散模型对含跳跃数据进行鲁棒强化学习	Chenyang Jiang	PDF	N/A	Robust Reinforcement Learning under Diffusion Models for Data with Jumps
技术报告：利用奖励引导的树搜索增强大语言模型推理能力	Jinhao Jiang	PDF	N/A	Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search
从光谱到地理：RRUFF矿物数据的智能制图	Francesco Pappone	PDF	N/A	From Spectra to Geography: Intelligent Mapping of RRUFF Mineral Data
面向可泛化神经辐射场中的抗退化重建	Chan Ho Park	PDF	N/A	Towards Degradation-Robust Reconstruction in Generalizable NeRF
Conceptwm：一种用于概念保护的扩散模型水印	Liangqi Lei	PDF	N/A	Conceptwm: A Diffusion Model Watermark for Concept Protection
特洛伊机器人：针对物理世界中机器人操作的远程操控攻击	Xianlong Wang	PDF	N/A	TrojanRobot: Backdoor Attacks Against Robotic Manipulation in the Physical World
学习可微分的结构化预测替代损失	Junjie Yang	PDF	N/A	Learning Differentiable Surrogate Losses for Structured Prediction
PSPO*：一种有效的过程监督策略优化方法，用于推理对齐	Jiawei Li	PDF	N/A	PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment
用于机器学习在碰撞触发和数据获取中的硬件综合策略分析	Haoyi Jia	PDF	N/A	Analysis of Hardware Synthesis Strategies for Machine Learning in Collider Trigger and Data Acquisition
针对序列推荐系统的少样本模型提取攻击	Hui Zhang	PDF	N/A	Few-shot Model Extraction Attacks against Sequential Recommender Systems
人工智能科学发现	Antonio Norelli	PDF	N/A	Artificial Scientific Discovery
高效且鲁棒的持续图学习用于生物学中的图分类	Ding Zhang	PDF	N/A	Efficient and Robust Continual Graph Learning for Graph Classification in Biology
通过影响函数剖析多模态大型语言模型的错位问题	Lijie Hu	PDF	N/A	Dissecting Misalignment of Multimodal Large Language Models via Influence Function
洗牌私有强化学习中的无悔探索	Shaojie Bai	PDF	N/A	No-regret Exploration in Shuffle Private Reinforcement Learning
TSINR：通过隐式神经表示捕捉时间连续性以进行时间序列异常检测	Mengxuan Li	PDF	N/A	TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly Detection
SP${}^3$：用于弱半监督医学图像分割的超像素传播伪标签学习	Shiman Li	PDF	N/A	SP${ }^3$ : Superpixel-propagated pseudo-label learning for weakly semi-supervised medical image segmentation
第七章基于数据的生成式人工智能模型在从医疗科学文献中提取知识方面的回顾	Leon Kopitar	PDF	N/A	Chapter 7 Review of Data-Driven Generative AI Models for Knowledge Extraction from Scientific Literature in Healthcare
联合增量命名实体识别	Duzhen Zhang	PDF	N/A	Federated Incremental Named Entity Recognition
具有可解释性的多元时间序列分类ST-Tree	Mingsen Du	PDF	N/A	ST-Tree with Interpretability for Multivariate Time Series Classification
FERT：利用短距离调频连续波雷达进行实时面部表情识别	Sabri Mustafa Kahya	PDF	N/A	FERT: Real-Time Facial Expression Recognition with Short-Range FMCW Radar
机器人集群中的信号传递与社会学习	Leo Cazenille	PDF	N/A	Signaling and Social Learning in Swarms of Robots
嵌套马尔可夫模型的物理学：广义概率论视角	Xingjian Zhang	PDF	N/A	On the physics of nested Markov models: a generalized probabilistic theory perspective
利用计算病理学人工智能进行无创光学成像分析，无需重新训练	Danny Barash	PDF	N/A	Leveraging Computational Pathology AI for Noninvasive Optical Imaging Analysis Without Retraining
网络入侵检测的特征选择	Charles Westphal	PDF	N/A	Feature Selection for Network Intrusion Detection
用于跨音速机翼压力分布预测的生成时空图网络	Gabriele Immordino	PDF	N/A	Generative Spatio-temporal GraphNet for Transonic Wing Pressure Distribution Forecasting
具有隐藏混杂因素的线性循环系统的鲁棒因果分析	Boris Lorbeer	PDF	N/A	Robust Causal Analysis of Linear Cyclic Systems With Hidden Confounders
绿洲：百万代理社交互动模拟	Ziyi Yang	PDF	N/A	OASIS: Open Agents Social Interaction Simulations on One Million Agents
混合数据驱动状态空间模型用于可解释和无标签的毫米波信道预测	Yiyong Sun	PDF	N/A	Hybrid Data-Driven SSM for Interpretable and Label-Free mmWave Channel Prediction
使用Spinnaker的神经形态硬件广义Hebbian学习算法分析	Shivani Sharma	PDF	N/A	Analysis of Generalized Hebbian Learning Algorithm for Neuromorphic Hardware Using Spinnaker
基于图神经网络的C代码安全边界建立代码注释逻辑	Varun Gadey	PDF	N/A	GNN-Based Code Annotation Logic for Establishing Security Boundaries in C Code
MSSIDD：多传感器去噪基准	Shibin Mei	PDF	N/A	MSSIDD: A Benchmark for Multi-Sensor Denoising
拓扑感知优先调度用于共存的大型语言模型工作负载	Ping Zhang	PDF	N/A	Topology-aware Preemptive Scheduling for Co-located LLM Workloads
非线性波动力学的数据驱动模型重建	Ekaterina Smolina	PDF	N/A	Data-driven model reconstruction for nonlinear wave dynamics
实时健身运动分类与视频帧计数	Riccardo Riccio	PDF	N/A	Real-Time Fitness Exercise Classification and Counting from Video Frames
gpuPairHMM：基于GPU的高速Pair-HMM前向算法用于DNA变异检测	Bertil Schmidt	PDF	N/A	gpuPairHMM: High-speed Pair-HMM Forward Algorithm for DNA Variant Calling on GPUs
一种用于多核神经形态处理器的高效多播寻址编码方案	Zhe Su	PDF	N/A	An Efficient Multicast Addressing Encoding Scheme for Multi-Core Neuromorphic Processors
通过渐进式概念瓶颈驱动的对齐增强视觉语言模型安全性	Zhendong Liu	PDF	N/A	Enhancing Vision-Language Model Safety through Progressive Concept-Bottleneck-Driven Alignment
分层图结构边缘划分模型用于学习演变的社区结构	Xincan Yu	PDF	N/A	Hierarchical-Graph-Structured Edge Partition Models for Learning Evolving Community Structure
使用知识图谱嵌入作为附加模态来解决语言模型中的幻觉问题	Viktoriia Chekalina	PDF	N/A	Addressing Hallucinations in Language Models with Knowledge Graph Embeddings as an Additional Modality
SeqProFT：应用LoRA微调进行仅序列蛋白质性质预测	Shuo Zhang	PDF	N/A	SeqProFT: Applying LoRA Finetuning for Sequence-only Protein Property Predictions
利用锐度感知最小化增强的针对后门攻击的可靠中毒样本检测	Mingda Zhang	PDF	N/A	Reliable Poisoned Sample Detection against Backdoor Attacks Enhanced by Sharpness Aware Minimization
在资源受限的隐私保护型大型语言模型交互中，预先防范文本净化工具	Robin Carpentier	PDF	N/A	Preempting Text Sanitization Utility in Resource-Constrained Privacy-Preserving LLM Interactions
一种基于图的预训练模型，用于教育文档的自适应排序	Jean Vassoyan	PDF	N/A	A Pre-Trained Graph-Based Model for Adaptive Sequencing of Educational Documents
通过高斯互信息的样本最优测试，高效地学习高斯树模型	Sutanu Gayen	PDF	N/A	Efficient Sample-optimal Learning of Gaussian Tree Models via Sample-optimal Testing of Gaussian Mutual Information
级联扩散模型用于二维和三维显微图像合成以增强细胞分割	Rüveyda Yilmaz	PDF	N/A	Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation
学习一种自监督多目标跟踪的神经关联网络	Shuai Li	PDF	N/A	Learning a Neural Association Network for Self-supervised Multi-Object Tracking
一个用于基因组变异检测的模块化开源框架	Ankita Vaishnobi Bisoi	PDF	N/A	A Modular Open Source Framework for Genomic Variant Calling
基于模型的强化学习中的时间高斯混合结构学习	Théophile Champion	PDF	N/A	Structure learning with Temporal Gaussian Mixture for model-based Reinforcement Learning
具有先天物理知识的闭环多步规划	Giulia Lafratta	PDF	N/A	Closed-loop multi-step planning with innate physics knowledge
SignEye：从车辆第一人称视角解读交通标志	Chuang Yang	PDF	N/A	SignEye: Traffic Sign Interpretation from Vehicle First-Person View
LaVin-DiT：大型视觉扩散变换器	Zhaoqing Wang	PDF	N/A	LaVin-DiT: Large Vision Diffusion Transformer
搜索、验证与反馈：通过验证器工程实现下一代基础模型的后训练范式	Xinyan Guan	PDF	N/A	Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
残差神经网络架构中用于数字孪生模型的物理编码块	Muhammad Saad Zia	PDF	N/A	Physics Encoded Blocks in Residual Neural Network Architectures for Digital Twin Models
安全 + 安全 = 不安全？探究如何利用安全图像来破解大型视觉语言模型	Chenhang Cui	PDF	N/A	Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models
外星重组：探索视觉艺术中超越人类认知能力的概念融合	Alejandro Hernandez	PDF	N/A	Alien Recombination: Exploring Concept Blends Beyond Human Cognitive Availability in Visual Art
一次看一组：多滑动建模用于生存预测	Xinyang Li	PDF	N/A	Look a Group at Once: Multi-Slide Modeling for Survival Prediction
探索视觉场景识别中的新兴趋势与研究机遇	Antonios Gasteratos	PDF	N/A	Exploring Emerging Trends and Research Opportunities in Visual Place Recognition
量化社交媒体情境下视觉语言模型的偏好：通过价值分解方法	Jingxuan Li	PDF	N/A	Quantifying Preferences of Vision-Language Models via Value Decomposition in Social Media Contexts
SL-YOLO：一种更强大且更轻量的无人机目标检测模型	Defan Chen	PDF	N/A	SL-YOLO: A Stronger and Lighter Drone Target Detection Model
MVLight：通过光照条件化的多视角扩散实现可重照明文本到3D生成	Dongseok Shim	PDF	N/A	MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion
图神经网络用于量化中药配伍机制	Jingqi Zeng	PDF	N/A	Graph Artificial Intelligence for Quantifying Compatibility Mechanisms in Traditional Chinese Medicine
通用行人重识别通过平衡对齐性和均匀性实现	Yoonki Cho	PDF	N/A	Generalizable Person Re-identification via Balancing Alignment and Uniformity
物理学与拓扑学相遇：用于学习刚体动力学的物理信息拓扑神经网络	Amaury Wei	PDF	N/A	Physics meets Topology: Physics-informed topological neural networks for learning rigid body dynamics
MGNiceNet：统一单目几何场景理解	Markus Schön	PDF	N/A	MGNiceNet: Unified Monocular Geometric Scene Understanding
重新审视在情境中学习线性函数	Omar Naim	PDF	N/A	Re-examining learning linear functions in context
PALMS：用于潜在网络重构的并行自适应Lasso与多方向信号	Zhaoyu Xing	PDF	N/A	PALMS: Parallel Adaptive Lasso with Multi-directional Signals for Latent Networks Reconstruction
HistoEncoder：一种用于前列腺癌的数字病理基础模型	Joona Pohjonen	PDF	N/A	HistoEncoder: a digital pathology foundation model for prostate cancer
倒置强化学习，实现更易解释的最优控制	Juan Cardenas-Cartagena	PDF	N/A	Upside-Down Reinforcement Learning for More Interpretable Optimal Control
ADUULM-360数据集——一个用于恶劣天气下深度估计的多模态数据集	Markus Schön	PDF	N/A	The ADUULM-360 Dataset -- A Multi-Modal Dataset for Depth Estimation in Adverse Weather
相关性引导的视听融合用于视频显著性预测	Li Yu	PDF	N/A	Relevance-guided Audio Visual Fusion for Video Saliency Prediction
鲁棒马尔可夫决策过程：AI与形式化方法的交汇点	Marnix Suilen	PDF	N/A	Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet
揭示交通预测中自适应嵌入的僵化性	Hongjun Wang	PDF	N/A	Unveiling the Inflexibility of Adaptive Embedding in Traffic Forecasting
同行评审中群体多样性对冗余性和覆盖范围的因果效应	Navita Goyal	PDF	N/A	Causal Effect of Group Diversity on Redundancy and Coverage in Peer-Reviewing
多标签特征选择的隐式正则化	Dou El Kefel Mansouri	PDF	N/A	Implicit Regularization for Multi-label Feature Selection
GLDesigner：利用多模态大型语言模型作为设计师，以增强美学文本字形布局	Junwen He	PDF	N/A	GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts
针对长上下文大语言模型的成员推断攻击	Zixiong Wang	PDF	N/A	Membership Inference Attack against Long-Context Large Language Models
通过光谱保持数据压缩实现快速DBSCAN	Yongyu Wang	PDF	N/A	Towards fast DBSCAN via Spectrum-Preserving Data Compression
时空储层集成技术用于液态状态机	Anmol Biswas	PDF	N/A	Temporal and Spatial Reservoir Ensembling Techniques for Liquid State Machines
宜家工作手册：在互联网视频上进行4D组装说明的接地	Yunong Liu	PDF	N/A	IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos
信任的阴暗面：基于权威引用的越狱攻击对大型语言模型的影响	Xikang Yang	PDF	N/A	The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
弥合资源差距：将先进的模仿学习模型部署到经济实惠的嵌入式平台上	Haizhou Ge	PDF	N/A	Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms
扩展的神经收缩动力系统：关于多任务和黎曼安全区域	Hadi Beik Mohammadi	PDF	N/A	Extended Neural Contractive Dynamical Systems: On Multiple Tasks and Riemannian Safety Regions
逐层堆砌：对齐特征隔离在增量人脸伪造检测中的应用	Jikang Cheng	PDF	N/A	Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection
GECo算法用于图神经网络解释	Salvatore Calderaro	PDF	N/A	The GECo algorithm for Graph Neural Networks Explanation
基于视觉变换器的肺病检测：机器学习方法的比较研究	Baljinnyam Dayan	PDF	N/A	Lung Disease Detection with Vision Transformers: A Comparative Study of Machine Learning Methods
图数据库上的图神经网络	Dmytro Lopushanskyy	PDF	N/A	Graph Neural Networks on Graph Databases
LeC$^2$O-NeRF：学习城市场景中的大规模连续紧凑占用	Zhenxing Mi	PDF	N/A	LeC$^2$O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes
重新思考思考代币：理解它们在实践中表现不佳的原因	Sreeram Vennam	PDF	N/A	Rethinking Thinking Tokens: Understanding Why They Underperform in Practice
TL-CLIP：一种专为输电线路缺陷识别设计的特定领域多模态预训练视觉基础模型	Ke Zhang	PDF	N/A	TL-CLIP: A Power-specific Multimodal Pre-trained Visual Foundation Model for Transmission Line Defect Recognition
SCOP：一种用于蛋白质功能预测的序列-结构对比感知框架	Runze Ma	PDF	N/A	SCOP: A Sequence-Structure Contrast-Aware Framework for Protein Function Prediction
通过自适应策略自我组合实现持续任务学习	Shengchao Hu	PDF	N/A	Continual Task Learning through Adaptive Policy Self-Composition
GPS-Gaussian+：可泛化的逐像素3D高斯喷射技术，用于从稀疏视角实现实时人景渲染	Boyao Zhou	PDF	N/A	GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views
MAIRA-Seg：利用分割感知的多模态大型语言模型增强放射报告生成	Harshita Sharma	PDF	N/A	MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models
可扩展的自回归单目深度估计	Jinhong Wang	PDF	N/A	Scalable Autoregressive Monocular Depth Estimation
CCExpert：通过差异感知集成和基础数据集提升多模态语言模型在遥感变化描述中的能力	Zhiming Wang	PDF	N/A	CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware Integration and a Foundational Dataset
文本引导的零样本目标定位	Jingjing Wang	PDF	N/A	Text-guided Zero-Shot Object Localization
超像素引导的隐式神经表示用于多维数据	Jiayi Li	PDF	N/A	Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data
甲骨文识别综合调查：挑战、基准测试及未来展望	Jing Li	PDF	N/A	A comprehensive survey of oracle character recognition: challenges, benchmarks, and beyond
视觉-语义图匹配网络用于零样本学习	Bowen Duan	PDF	N/A	Visual-Semantic Graph Matching Net for Zero-Shot Learning
使用大型语言模型进行零样本负荷预测	Wenlong Liao	PDF	N/A	Zero-Shot Load Forecasting with Large Language Models
使用局部傅里叶神经算子建模多变量高分辨率三维城市微气候	Shaoxiang Qin	PDF	N/A	Modeling Multivariable High-resolution 3D Urban Microclimate Using Localized Fourier Neural Operator
减轻语言模型驱动问答中的知识冲突	Han Cao	PDF	N/A	Mitigating Knowledge Conflicts in Language Model-Driven Question Answering
教授视频扩散模型与潜在物理现象知识	Qinglong Cao	PDF	N/A	Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge
基于分解的时间序列预测方法的混合损失框架：平衡全局和组件误差	Ronghui Han	PDF	N/A	A Hybrid Loss Framework for Decomposition-based Time Series Forecasting Methods: Balancing Global and Component Errors
通过运动引导注意力实现视频到任务学习，用于少样本动作识别	Hanyu Guo	PDF	N/A	Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition
面向颜色的数据集蒸馏冗余减少	Bowen Yuan	PDF	N/A	Color-Oriented Redundancy Reduction in Dataset Distillation
使用基于扩散的轨迹分支生成增强决策Transformer	Zhihong Liu	PDF	N/A	Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation
Cuvis.Ai：一个用于高光谱处理和分类的开源、低代码软件生态系统	Nathaniel Hanson	PDF	N/A	Cuvis.Ai: An Open-Source, Low-Code Software Ecosystem for Hyperspectral Processing and Classification
教学大纲：强化学习代理的可移植课程	Ryan Sullivan	PDF	N/A	Syllabus: Portable Curricula for Reinforcement Learning Agents
机器遗忘技术综述	Haibo Zhang	PDF	N/A	A Review on Machine Unlearning
CEEMDAN在欠定语音分离中的性能研究	Rawad Melhem	PDF	N/A	Study of the Performance of CEEMDAN in Underdetermined Speech Separation
TP-UNet：用于医学图像分割的时间提示引导的UNet	Ranmin Wang	PDF	N/A	TP-UNet: Temporal Prompt Guided UNet for Medical Image Segmentation
面向个性化联邦节点分类的一次性通信	Guochen Yan	PDF	N/A	Toward Personalized Federated Node Classification in One-shot Communication
具有增量块的递归随机配置网络	Gang Dang	PDF	N/A	Recurrent Stochastic Configuration Networks with Incremental Blocks
基于内源性脑电范式的个性化脑机接口应用研究	Heon-Gyu Kwak	PDF	N/A	Towards Personalized Brain-Computer Interface Application Based on Endogenous EEG Paradigms
加速大规模稀疏文档数据的球形K-均值聚类	Kazuo Aoyama	PDF	N/A	Accelerating spherical K-means clustering for large-scale sparse document data
使用稀疏自编码器引导语言模型的拒绝行为	Kyle O'Brien	PDF	N/A	Steering Language Model Refusal with Sparse Autoencoders
超越语言界限：利用大型语言模型进行低资源语言翻译	Peng Shu	PDF	N/A	Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation
SADDE：基于可靠解释的半监督异常检测	Yachao Yuan	PDF	N/A	SADDE: Semi-supervised Anomaly Detection with Dependable Explanations
基于Zarr和Tiff的地理空间图像性能评估	Jaheer Khan	PDF	N/A	Performance Evaluation of Geospatial Images based on Zarr and Tiff
LP数据管道：轻量级、目标驱动的数据管道，适用于大型语言模型	Yungi Kim	PDF	N/A	LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models
神经元：为零样本骨架动作识别学习上下文感知的演化表示	Yang Chen	PDF	N/A	Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition
减少标签依赖：水下场景理解的数据集、技术与应用综述	Scarlett Raine	PDF	N/A	Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications
使用LLM生成数据集进行零样本自动标注与实例分割：消除深度学习模型开发中的现场成像与人工标注	Ranjan Sapkota	PDF	N/A	Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development
双频滤波自适应图神经网络用于同质图和异质图	Yachao Yang	PDF	N/A	Dual-Frequency Filtering Self-aware Graph Neural Networks for Homophilic and Heterophilic Graphs
基于多双曲空间的异构图注意力网络	Jongmin Park	PDF	N/A	Multi-Hyperbolic Space-based Heterogeneous Graph Attention Network
基于图像引导的连续K空间恢复网络用于快速MRI重建	Yucong Meng	PDF	N/A	Continuous K-space Recovery Network with Image Guidance for Fast MRI Reconstruction
面向开放词汇的视听事件定位	Jinxing Zhou	PDF	N/A	Towards Open-Vocabulary Audio-Visual Event Localization
守恒律的耦合积分PINN	Yeping Wang	PDF	N/A	Coupled Integral PINN for conservation law
急诊科就诊的有效预测建模及评估外生变量影响：运用可解释的元学习梯度提升方法	Mehdi Neshat	PDF	N/A	Effective Predictive Modeling for Emergency Department Visits and Evaluating Exogenous Variables Impact: Using Explainable Meta-learning Gradient Boosting
ACE2：精确学习次季节至年代际大气变异及强迫响应	Oliver Watt-Meyer	PDF	N/A	ACE2: Accurately learning subseasonal to decadal atmospheric variability and forced responses
VersaTune：高效微调多能力大型语言模型	Keer Lu	PDF	N/A	VersaTune: Fine-Tuning Multi-Ability LLMs Efficiently
GROOT：利用有限实验数据进行生物序列的有效设计	Thanh V. T. Tran	PDF	N/A	GROOT: Effective Design of Biological Sequences with Limited Experimental Data
跨患者伪包生成与课程对比学习用于全切片图像的不平衡多分类	Yonghuang Wu	PDF	N/A	Cross-Patient Pseudo Bags Generation and Curriculum Contrastive Learning for Imbalanced Multiclassification of Whole Slide Image
大型语料库与大型语言模型：一种可复制的自动化语法标注方法	Cameron Morin	PDF	N/A	Large corpora and large language models: a replicable method for automating grammatical annotation
用于动态图的图保留网络	Qian Chang	PDF	N/A	Graph Retention Networks for Dynamic Graphs
数据高效因果效应估计的渐进泛化风险降低	Hechuan Wen	PDF	N/A	Progressive Generalization Risk Reduction for Data-Efficient Causal Effect Estimation
语义还是协变量？一项关于分布外检测难题的研究	Xingming Long	PDF	N/A	Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection
DrivingSphere：构建高保真4D世界用于闭环仿真	Tianyi Yan	PDF	N/A	DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation
EXCON：基于极端实例的对比表示学习，用于太阳耀斑预测的严重不平衡多元时间序列	Onur Vural	PDF	N/A	EXCON: Extreme Instance-based Contrastive Representation Learning of Severely Imbalanced Multivariate Time Series for Solar Flare Prediction
ZeFaV：提升大型语言模型在零样本事实验证中的表现	Son T. Luu	PDF	N/A	ZeFaV: Boosting Large Language Models for Zero-shot Fact Verification
再生核巴纳赫空间上的镜像下降法	Akash Kumar	PDF	N/A	Mirror Descent on Reproducing Kernel Banach Spaces
在高斯边缘分布下对半空间进行可靠学习	Ilias Diakonikolas	PDF	N/A	Reliable Learning of Halfspaces under Gaussian Marginals
MEMO-Bench：用于文本到图像和多模态大语言模型的人类情感分析的多重基准	Yingjie Zhou	PDF	N/A	MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis
神经形态卫星观测噪声过滤基准	Sami Arja	PDF	N/A	Noise Filtering Benchmark for Neuromorphic Satellites Observations
BeautyBank：在潜在空间中编码面部化妆	Qianwen Lu	PDF	N/A	BeautyBank: Encoding Facial Makeup in Latent Space
不要过于乐观：二阶方法中的负步长	Betty Shea	PDF	N/A	Don't Be So Positive: Negative Step Sizes in Second-Order Methods
高效的视频-语言基础模型迁移学习	Haoxing Chen	PDF	N/A	Efficient Transfer Learning for Video-language Foundation Models
水声：通过倾倒液体推断物理特性	Piyush Bagad	PDF	N/A	The Sound of Water: Inferring Physical Properties from Pouring Liquids
基于人工智能专家指导的数据驱动自动电机初步设计	Yiwei Wang	PDF	N/A	Data Driven Automatic Electrical Machine Preliminary Design with Artificial Intelligence Expert Guidance
场景文本识别的关系对比学习和掩码图像建模	Tiancheng Lin	PDF	N/A	Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition
MoE-Lightning：在内存受限的GPU上实现高吞吐量的MoE推理	Shiyi Cao	PDF	N/A	MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs
DeforHMR：使用可变形交叉注意力机制的视觉变换器用于3D人体网格恢复	Jaewoo Heo	PDF	N/A	DeforHMR: Vision Transformer with Deformable Cross-Attention for 3D Human Mesh Recovery
让Sigmoid-MSE再次伟大：输出重置挑战神经网络分类中的Softmax交叉熵	Kanishka Tyagi	PDF	N/A	Making Sigmoid-MSE Great Again: Output Reset Challenges Softmax Cross-Entropy in Neural Network Classification