Arxiv 2024-09-12 Papers

标题	作者	PDF链接	代码仓库	Title
DreamHOI：基于主题驱动的3D人-物交互生成，采用扩散先验	Thomas Hanwen Zhu	PDF	N/A	DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors
按需深度：从低帧率主动传感器流式传输密集深度	Andrea Conti	PDF	N/A	Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor
AnySkin：即插即用的机器人触觉感知技术	Raunaq Bhirangi	PDF	N/A	AnySkin: Plug-and-play Skin Sensing for Robotic Touch
手部-物体交互视频预训练	Himanshu Gaurav Singh	PDF	N/A	Hand-Object Interaction Pretraining from Videos
Click2Mask：基于动态掩码生成的局部编辑	Omer Regev	PDF	N/A	Click2Mask: Local Editing with Dynamic Mask Generation
梦兽：通过部分感知知识迁移提炼3D奇幻动物	Runjia Li	PDF	N/A	DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer
FlashSplat：二维到三维高斯溅射分割问题已最优解决	Qiuhong Shen	PDF	N/A	FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally
Windows Agent Arena：大规模评估多模态操作系统代理	Rogerio Bonatti	PDF	N/A	Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale
学习不完全因子分解预条件器用于GMRES	Paul Häusner	PDF	N/A	Learning incomplete factorization preconditioners for GMRES
改进文本引导的对象修复与语义预修复	Yifu Chen	PDF	N/A	Improving Text-guided Object Inpainting with Semantic Pre-inpainting
通过专注于服装的扩散模型改进虚拟试穿	Siqi Wan	PDF	N/A	Improving Virtual Try-On with Garment-focused Diffusion Models
LoRID: 低秩迭代扩散用于对抗性净化	Geigh Zollicoffer	PDF	N/A	LoRID: Low-Rank Iterative Diffusion for Adversarial Purification
半自主网络物理系统中信息性接管请求的设计：在无人机控制器设置中结合口语和视觉图标	Ashwini Gundappa	PDF	N/A	The Design of Informative Take-Over Requests for Semi-Autonomous Cyber-Physical Systems: Combining Spoken Language and Visual Icons in a Drone-Controller Setting
冻结的文本到图像扩散模型的动态提示用于全景叙事定位	Hongyu Li	PDF	N/A	Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
OmniQuery：通过上下文增强捕获的多模态记忆，以实现个性化问答	Jiahao Nick Li	PDF	N/A	OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering
TextBoost：通过微调文本编码器实现文本到图像模型的单次个性化定制	NaHyeon Park	PDF	N/A	TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder
基于风格的视觉艺术作品聚类	Abhishek Dangeti	PDF	N/A	Style Based Clustering of Visual Artworks
IFAdapter：基于实例特征控制的接地文本到图像生成	Yinwei Wu	PDF	N/A	IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation
Source2Synth：基于真实数据源的合成数据生成与管理	Alisia Lupidi	PDF	N/A	Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
基于多模型的联邦学习对抗模型投毒攻击：一种基于深度学习的MEC系统模型选择方法	Somayeh Kianpisheh	PDF	N/A	Multi-Model based Federated Learning Against Model Poisoning Attack: A Deep Learning Based Model Selection for MEC Systems
LLM蜜罐：利用大型语言模型作为高级交互式蜜罐系统	Hakan T. Otal	PDF	N/A	LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems
磁共振成像中脑肿瘤分割的模型集成	Daniel Capellán-Martín	PDF	N/A	Model Ensemble for Brain Tumor Segmentation in Magnetic Resonance Imaging
通过深度强化学习对核聚变反应堆进行设计优化	Jinsu Kim	PDF	N/A	Design Optimization of Nuclear Fusion Reactor through Deep Reinforcement Learning
光子量子计算机	M. AbuGhanem	PDF	N/A	Photonic Quantum Computers
CliquePH：通过团图上的持久同调为图神经网络提供高阶信息	Davide Buffelli	PDF	N/A	CliquePH: Higher-Order Information for Graph Neural Networks through Persistent Homology on Clique Graphs
LT3SD：用于三维场景扩散的潜在树	Quan Meng	PDF	N/A	LT3SD: Latent Trees for 3D Scene Diffusion
自适应语言引导的对比解释抽象化	Andi Peng	PDF	N/A	Adaptive Language-Guided Abstraction from Contrastive Explanations
基于图拉普拉斯矩阵的贝叶斯多保真度建模	Orazio Pinti	PDF	N/A	Graph Laplacian-based Bayesian Multi-fidelity Modeling
VI3DRM：通过逼真的新视角合成实现从稀疏视角到精细三维重建的迈进	Hao Chen	PDF	N/A	VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis
ComAlign：视觉-语言模型中的组合对齐	Ali Abdollah	PDF	N/A	ComAlign: Compositional Alignment in Vision-Language Models
是什么让迷宫看起来像迷宫？	Joy Hsu	PDF	N/A	What Makes a Maze Look Like a Maze?
右删失数据下两样本检验的机器学习：一项模拟研究	Petr Philonenko	PDF	N/A	Machine Learning for Two-Sample Testing under Right-Censored Data: A Simulation Study
AudioBERT：音频知识增强的语言模型	Hyunjong Ok	PDF	N/A	AudioBERT: Audio Knowledge Augmented Language Model
高斯服装：从多视角视频中重建具有逼真外观的仿真就绪服装	Boxiang Rong	PDF	N/A	Gaussian Garments: Reconstructing Simulation-Ready Clothing with Photorealistic Appearance from Multi-View Video
微调大型语言模型用于实体匹配	Aaron Steiner	PDF	N/A	Fine-tuning Large Language Models for Entity Matching
增强犬类肌肉骨骼诊断：利用合成图像数据对视觉文档进行AI模型预训练	Martin Thißen	PDF	N/A	Enhancing Canine Musculoskeletal Diagnoses: Leveraging Synthetic Image Data for Pre-Training AI-Models on Visual Documentations
基于头部运动学识别头部撞击位置、速度和力	Xianghao Zhan	PDF	N/A	Identification of head impact locations, speeds, and force based on head kinematics
使用基于深度学习的分割方法进行低成本树木冠层枯死估算	M. J. Allen	PDF	N/A	Low-Cost Tree Crown Dieback Estimation Using Deep Learning-Based Segmentation
AD-Lite Net：一种用于从MRI图像中检测阿尔茨海默病的轻量级且级联的CNN模型	Santanu Roy	PDF	N/A	AD-Lite Net: A Lightweight and Concatenated CNN Model for Alzheimer's Detection from MRI Images
学习在术前磁共振成像和术中超声图像之间匹配二维关键点	Hassan Rasheed	PDF	N/A	Learning to Match 2D Keypoints Across Preoperative MR and Intraoperative Ultrasound
高频反梦工坊：针对图像合成的鲁棒防御	Takuto Onikubo	PDF	N/A	High-Frequency Anti-DreamBooth: Robust Defense Against Image Synthesis
自动细胞分割的开源基础设施	Aaron Rock Menezes	PDF	N/A	Open Source Infrastructure for Automatic Cell Segmentation
基于交叉注意力的手语和非手语分析影响模型	Lipisha Chaudhary	PDF	N/A	Cross-Attention Based Influence Model for Manual and Nonmanual Sign Language Analysis
上下文在阅读时间预测中的作用	Andreas Opedal	PDF	N/A	On the Role of Context in Reading Time Prediction
SDformer：高效的端到端Transformer用于深度补全	Jian Qian	PDF	N/A	SDformer: Efficient End-to-End Transformer for Depth Completion
魔法风格：基于参考图像的肖像风格化	Zhaoli Deng	PDF	N/A	MagicStyle: Portrait Stylization Based on Reference Image
LLM-POTUS评分：利用大型语言模型分析总统辩论的框架	Zhengliang Liu	PDF	N/A	LLM-POTUS Score: A Framework of Analyzing Presidential Debates with Large Language Models
惯性协调博弈	Andrew Koh	PDF	N/A	Inertial Coordination Games
利用简单方法对治疗后胶质瘤进行有效分割：人工序列生成与集成模型	Heejong Kim	PDF	N/A	Effective Segmentation of Post-Treatment Gliomas Using Simple Approaches: Artificial Sequence Generation and Ensemble Models
JPEG Pleno基于学习的点云编码标准：服务于人类与机器	André F. R. Guarda	PDF	N/A	The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine
GAZEploit：通过VR/MR设备中化身视角的注视估计进行远程按键推理攻击	Hanqiu Wang	PDF	N/A	GAZEploit: Remote Keystroke Inference Attack by Gaze Estimation from Avatar Views in VR/MR Devices
面向基于图的网络流量分析基础模型	Louis Van Langendonck	PDF	N/A	Towards a graph-based foundation model for network traffic analysis
WhisperNER：统一开放命名实体与语音识别	Gil Ayache	PDF	N/A	WhisperNER: Unified Open Named Entity and Speech Recognition
DEMAU：分解、探索、建模和分析不确定性	Arthur Hoarau	PDF	N/A	DEMAU: Decompose, Explore, Model and Analyse Uncertainties
Faetar基准测试：在资源极度匮乏的语言中的语音识别	Michael Ong	PDF	N/A	The Faetar Benchmark: Speech Recognition in a Very Under-Resourced Language
贝叶斯自训练用于半监督三维分割	Ozan Unal	PDF	N/A	Bayesian Self-Training for Semi-Supervised 3D Segmentation
CLC-UKET数据集：英国就业法庭案件结果预测的基准测试	Huiyuan Xie	PDF	N/A	The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal
优化基于学习的控制系统中的反例生成：一种多保真贝叶斯方法	Zahra Shahrooei	PDF	N/A	Optimizing Falsification for Learning-Based Control Systems: A Multi-Fidelity Bayesian Approach
EZIGen：通过精确的主体编码和解耦引导增强零样本主体驱动图像生成	Zicheng Duan	PDF	N/A	EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance
SimMAT：探索从视觉基础模型到任意图像模态的可迁移性	Chenyang Lei	PDF	N/A	SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality
通过提示插值进行噪声校正的基于扩散的图像到图像翻译	Junsung Lee	PDF	N/A	Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation
旅行代理：个性化旅行规划的人工智能助手	Aili Chen	PDF	N/A	TravelAgent: An AI Assistant for Personalized Travel Planning
AutoPET挑战赛：用于数据增强的肿瘤合成	Lap Yan Lennon Chan	PDF	N/A	AutoPET Challenge: Tumour Synthesis for Data Augmentation
用于约束优化的迭代求解器的自监督学习	Lukas Lüken	PDF	N/A	Self-Supervised Learning of Iterative Solvers for Constrained Optimization
AI加速发现高临界温度超导体	Xiao-Qi Han	PDF	N/A	AI-accelerated discovery of high critical temperature superconductors
Q值正则化决策卷积变压器用于离线强化学习	Teng Yan	PDF	N/A	Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning
空间适应层：可解释的领域适应方法在生物信号传感器阵列应用中的应用	Joao Pereira	PDF	N/A	Spatial Adaptation Layer: Interpretable Domain Adaptation For Biosignal Sensor Array Applications
神经辐射场的大规模监督	Weixiang Zhang	PDF	N/A	Expansive Supervision for Neural Radiance Field
使用机器学习特征化预测和加速纳米材料合成	Christopher C. Price	PDF	N/A	Predicting and Accelerating Nanomaterials Synthesis Using Machine Learning Featurization
释放蠕虫并提取数据：通过越狱手段加剧针对基于RAG的推理攻击的规模和严重性	Stav Cohen	PDF	N/A	Unleashing Worms and Extracting Data: Escalating the Outcome of Attacks against RAG-based Inference in Scale and Severity Using Jailbreaking
Thermal3D-GS: 基于物理的3D高斯方法用于热红外新视角合成	Qian Chen	PDF	N/A	Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis
异构束神经网络	Luke Braithwaite	PDF	N/A	Heterogeneous Sheaf Neural Networks
LED: 夜间增强深度估计的光源	Simon de Moreau	PDF	N/A	LED: Light Enhanced Depth Estimation at Night
从解释到行动：一种零样本、理论驱动的学生表现反馈大型语言模型框架	Vinitra Swamy	PDF	N/A	From Explanations to Action: A Zero-Shot, Theory-Driven LLM Framework for Student Performance Feedback
草图引导的扩散模型用于无训练的文本到图像生成	Seonho Lee	PDF	N/A	Scribble-Guided Diffusion for Training-free Text-to-Image Generation
边缘引导图指令神经网络	Francesco Della Santa	PDF	N/A	Edge-Wise Graph-Instructed Neural Networks
从头设计高亲和力蛋白质结合剂的AlphaProteo	Vinicius Zambaldi	PDF	N/A	De novo design of high-affinity protein binders with AlphaProteo
通过多视角特征融合进行网络异常流量检测	Song Hao	PDF	N/A	Network Anomaly Traffic Detection via Multi-view Feature Fusion
从多样化的示范中学习因果不变的奖励函数	Ivan Ovinnikov	PDF	N/A	Learning Causally Invariant Reward Functions from Diverse Demonstrations
多路复用图对比学习与软负样本	Zhenhao Zhao	PDF	N/A	Multiplex Graph Contrastive Learning with Soft Negatives
OCTAMamba：一种用于精确OCTA血管分割的状态空间模型方法	Shun Zou	PDF	N/A	OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation
基于多中心调查数据的隐私保护联合疼痛强度变化预测	Supratim Das	PDF	N/A	Privacy-preserving federated prediction of pain intensity change based on multi-center survey data
深度至关重要：探索交通场景中语义分割的RGB-D深度交互	Siyu Chen	PDF	N/A	Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes
通过可学习的多尺度嵌入和注意力机制提升少样本图像分类	Fatemeh Askari	PDF	N/A	Enhancing Few-Shot Image Classification through Learnable Multi-Scale Embedding and Attention Mechanisms
AI控制游戏：AI部署协议安全性评估模型	Charlie Griffin	PDF	N/A	Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols
SPARK：自监督个性化实时单目人脸捕捉	Kelian Baert	PDF	N/A	SPARK: Self-supervised Personalized Real-time Monocular Face Capture
Sparse R-CNN OBB：基于定向稀疏建议的SAR图像船舶目标检测	Kamirul Kamirul	PDF	N/A	Sparse R-CNN OBB: Ship Target Detection in SAR Images Based on Oriented Sparse Proposals
基于视觉的精确三维占用预测的深度高度解耦	Yuan Wu	PDF	N/A	Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction
本地化的薛定谔桥梁采样器	Georg A. Gottwald	PDF	N/A	Localized Schrödinger Bridge Sampler
局部感知跨模态对应学习用于密集视听事件定位	Ling Xing	PDF	N/A	Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
ProbTalk3D：基于VQ-VAE的非确定性情感可控语音驱动3D面部动画合成	Sichun Wu	PDF	N/A	ProbTalk3D: Non-Deterministic Emotion Controllable Speech-Driven 3D Facial Animation Synthesis Using VQ-VAE
端到端可微分仿真中的自动驾驶车辆控制器	Asen Nachkov	PDF	N/A	Autonomous Vehicle Controllers From End-to-End Differentiable Simulation
无线代理：智能无线网络的大型语言模型代理	Jingwen Tong	PDF	N/A	WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks
通过条件去噪扩散模型从数字台风卫星图像中估算大气变量	Zhangyue Ling	PDF	N/A	Estimating atmospheric variables from Digital Typhoon Satellite Images via Conditional Denoising Diffusion Models
视觉基础模型是否能提升医学图像分割中的领域泛化能力？	Kerem Cekmeceli	PDF	N/A	Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation?
增强型在线诱导检测：利用上下文确定和消息级分析	Jake Street	PDF	N/A	Enhanced Online Grooming Detection Employing Context Determination and Message-Level Analysis
利用机器学习快速估计极端质量比旋进系统的参数	Bo Liang	PDF	N/A	Rapid Parameter Estimation for Extreme Mass Ratio Inspirals Using Machine Learning
张量分解与电路之间的关系是什么（以及我们如何利用它）？	Lorenzo Loconte	PDF	N/A	What is the Relationship between Tensor Factorizations and Circuits (and How Can We Exploit it)?
泰勒-感知网络：拥抱噪声，启迪科学数据的未知	Guangxuan Song	PDF	N/A	Taylor-Sensus Network: Embracing Noise to Enlighten Uncertainty for Scientific Data
Control+Shift: 生成可控的分布偏移	Roy Friedman	PDF	N/A	Control+Shift: Generating Controllable Distribution Shifts
通过序数原型分析建模人类反应	Anna Emilie J. Wedenborg	PDF	N/A	Modeling Human Responses by Ordinal Archetypal Analysis
强化学习发现高效的分散式图路径搜索策略	Alexei Pisacane	PDF	N/A	Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies
任务增强的跨视图插补网络用于部分多视图不完整多标签分类	Xiaohuan Lu	PDF	N/A	Task-Augmented Cross-View Imputation Network for Partial Multi-View Incomplete Multi-Label Classification
一种用于分离地震数据的卷积神经网络方法	Jing Sun	PDF	N/A	A convolutional neural network approach to deblending seismic data
用于评估神经网络架构训练效率的框架	Eduardo Cueto-Mendoza	PDF	N/A	A framework for measuring the training efficiency of a neural architecture
Tidal MerzA：通过强化学习结合情感建模与自主代码生成	Elizabeth Wilson	PDF	N/A	Tidal MerzA: Combining affective modelling and autonomous code generation through Reinforcement Learning
InterACT：基于分层注意力变压器的双臂操作动作分块，具备感知相互依赖性	Andrew Lee	PDF	N/A	InterACT: Inter-dependency Aware Action Chunking with Hierarchical Attention Transformers for Bimanual Manipulation
UGAD：利用频率指纹的通用生成式人工智能检测器	Inzamamul Alam	PDF	N/A	UGAD: Universal Generative AI Detector utilizing Frequency Fingerprints
Tera-SpaceCom：基于图神经网络的深度强化学习在太赫兹频段空间网络中的联合资源分配与任务卸载	Zhifeng Hu	PDF	N/A	Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks
从COCO到COCO-FP：深入探讨COCO检测器中的背景误报问题	Longfei Liu	PDF	N/A	From COCO to COCO-FP: A Deep Dive into Background False Positives for COCO Detectors
事实：适用于多目标跟踪的特征自适应持续学习跟踪器	Rongzihan Song	PDF	N/A	FACT: Feature Adaptive Continual-learning Tracker for Multiple Object Tracking
在可靠性和通信约束下的传感器网络中的共形分布式远程推理	Meiyi Zhu	PDF	N/A	Conformal Distributed Remote Inference in Sensor Networks Under Reliability and Communication Constraints
微观曼巴：仅用4M参数揭示微观图像的秘密	Shun Zou	PDF	N/A	Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters
基于语料库的台湾普通话会话中单音节词语调轮廓研究	Xiaoyun Jin	PDF	N/A	A corpus-based investigation of pitch contours of monosyllabic words in conversational Taiwan Mandarin
BLens：使用集成嵌入对比二进制函数的标注	Tristan Benoit	PDF	N/A	BLens: Contrastive Captioning of Binary Functions using Ensemble Embedding
单元：通过时间进行无监督在线实例分割	Corentin Sautier	PDF	N/A	UNIT: Unsupervised Online Instance Segmentation through Time
用于帕金森病检测的图神经网络	Shakeel A. Sheikh	PDF	N/A	Graph Neural Networks for Parkinsons Disease Detection
非负加权有向无环图结构学习	Samuel Rey	PDF	N/A	Non-negative Weighted DAG Structure Learning
随机样条树用于函数数据分类：理论与环境时间序列应用	Donato Riccio	PDF	N/A	Randomized Spline Trees for Functional Data Classification: Theory and Application to Environmental Time Series
从语言模型引导的知识图谱中学习规则	Zihang Peng	PDF	N/A	Learning Rules from KGs Guided by Language Models
基于上下文感知的最优传输学习用于视网膜眼底图像增强	Vamsi Krishna Vasa	PDF	N/A	Context-Aware Optimal Transport Learning for Retinal Fundus Image Enhancement
音频解码通过逆问题求解实现	Pedro J. Villasana T.	PDF	N/A	Audio Decoding by Inverse Problem Solving
使用Nvidia GPU和混合精度训练改进分类算法的机器学习碳足迹	Andrew Antonopoulos	PDF	N/A	Improve Machine Learning carbon footprint using Nvidia GPU and Mixed Precision training for classification algorithms
利用图同构网络增强跨市场推荐系统：一种个性化用户体验的新方法	Sümeyye Öztürk	PDF	N/A	Enhancing Cross-Market Recommendation System with Graph Isomorphism Networks: A Novel Approach to Personalized User Experience
实时多视角全方位深度估计系统，适用于机器人及真实场景下的自动驾驶	Ming Li	PDF	N/A	Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes
TSELM：利用离散标记和语言模型进行目标说话人提取	Beilong Tang	PDF	N/A	TSELM: Target Speaker Extraction using Discrete Tokens and Language Models
FPMT：增强型半监督模型用于交通事件检测	Xinying Lu	PDF	N/A	FPMT: Enhanced Semi-Supervised Model for Traffic Incident Detection
结构化剪枝在高效视觉场景识别中的应用	Oliver Grainge	PDF	N/A	Structured Pruning for Efficient Visual Place Recognition
使用CoLaNET脉冲神经网络对图像进行分类 -- MNIST示例	Mikhail Kiselev	PDF	N/A	Classifying Images with CoLaNET Spiking Neural Network -- the MNIST Example
使用NAND-Flash的非对称编码实现高效可靠的向量相似度搜索，适用于多类别少样本学习	Hao-Wei Chiang	PDF	N/A	Efficient and Reliable Vector Similarity Search Using Asymmetric Encoding with NAND-Flash for Many-Class Few-Shot Learning
ReGentS：实现稳定生成真实世界安全关键驾驶场景	Yuan Yin	PDF	N/A	ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable
绘画与音乐的桥梁——探索基于绘画的情感音乐生成	Tanisha Hisariya	PDF	N/A	Bridging Paintings and Music -- Exploring Emotion based Music Generation through Paintings
关于深度多模态学习中缺失模态的综合调查	Renjie Wu	PDF	N/A	A Comprehensive Survey on Deep Multimodal Learning with Missing Modality
在线与离线：社交聊天机器人第一方与第三方评估的比较研究	Ekaterina Svikhnushina	PDF	N/A	Online vs Offline: A Comparative Study of First-Party and Third-Party Evaluations of Social Chatbots
通过加权聚合实现空中联邦学习	Seyed Mohammad Azimi-Abarghouyi	PDF	N/A	Over-the-Air Federated Learning via Weighted Aggregation
销售联合广告：从遗憾最小化角度出发	Gagan Aggarwal	PDF	N/A	Selling Joint Ads: A Regret Minimization Perspective
YOLOv9是什么：下一代目标检测器内部特性的深入探究	Muhammad Yaseen	PDF	N/A	What is YOLOv9: An In-Depth Exploration of the Internal Features of the Next-Generation Object Detector
高度保守的序列特异性双链DNA结合网络，对人类和黑猩猩大脑发育的基因组进化产生不同影响。	Gennadi Glinsky	PDF	N/A	Highly conserved sequence-specific double-stranded DNA binding networks contributing to divergent genomic evolution of human and chimpanzee brain development
可控的合成临床笔记生成与隐私保障	Tal Baumel	PDF	N/A	Controllable Synthetic Clinical Note Generation with Privacy Guarantees
FedHide：通过隐藏在邻居中实现联邦学习	Hyunsin Park	PDF	N/A	FedHide: Federated Learning by Hiding in the Neighbors
SURGIVID：注释高效的手术视频对象发现	Çağhan Köksal	PDF	N/A	SURGIVID: Annotation-Efficient Surgical Video Object Discovery
GateAttentionPose：通过代理注意力和改进的门控卷积增强姿态估计	Liang Feng	PDF	N/A	GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions
四元数核范数减去Frobenius范数最小化用于彩色图像重建	Yu Guo	PDF	N/A	Quaternion Nuclear Norm minus Frobenius Norm Minimization for color image reconstruction
在物联网启用的相机陷阱中对野生动物模型进行就地微调，以实现高效适应	Mohammad Mehdi Rastikerdar	PDF	N/A	In-Situ Fine-Tuning of Wildlife Models in IoT-Enabled Camera Traps for Efficient Adaptation
通过迭代线性规划实现平衡有符号图的高效学习	Haruki Yokota	PDF	N/A	Efficient Learning of Balanced Signed Graphs via Iterative Linear Programming
拉格朗日对偶与复合多注意力变压器用于半监督医学图像分割	Fuchen Zheng	PDF	N/A	Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation
基于大型语言模型的中文语音识别全文纠错	Zhiyuan Tang	PDF	N/A	Full-text Error Correction for Chinese Speech Recognition with Large Language Model
通过减少嵌入变异性进行稳定语言模型预训练	Woojin Chung	PDF	N/A	Stable Language Model Pre-training by Reducing Embedding Variability
XMOL：可解释的多属性分子优化	Aye Phyu Phyu Aung	PDF	N/A	XMOL: Explainable Multi-property Optimization of Molecules
支持在线讨论：将人工智能整合到adhocracy+参与平台以增强审议	Maike Behrendt	PDF	N/A	Supporting Online Discussions: Integrating AI Into the adhocracy+ Participation Platform To Enhance Deliberation
ASSNet：用于微小肿瘤和多器官分割的自适应语义分割网络	Fuchen Zheng	PDF	N/A	ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation
通过增强直接反馈对齐训练脉冲神经网络	Yongbo Zhang	PDF	N/A	Training Spiking Neural Networks via Augmented Direct Feedback Alignment
针对合作多智能体深度强化学习的时空隐蔽后门攻击	Yinbo Yu	PDF	N/A	A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning
ROCAS：通过网络-物理协同变异进行自动驾驶事故的根本原因分析	Shiwei Feng	PDF	N/A	ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation
与偏好优化对齐是实现大语言模型安全性的全部所需	Reda Alami	PDF	N/A	Alignment with Preference Optimization Is All You Need for LLM Safety
预训练模型多层特征的通用池化方法用于说话人验证	Jin Sob Kim	PDF	N/A	Universal Pooling Method of Multi-layer Features from Pretrained Models for Speaker Verification
基于网格的流体流动多尺度图神经网络超分辨率	Shivam Barwey	PDF	N/A	Mesh-based Super-Resolution of Fluid Flows with Multiscale Graph Neural Networks
重新构想线性探测：转移学习中的科尔莫戈罗夫-阿诺德网络	Sheng Shen	PDF	N/A	Reimagining Linear Probing: Kolmogorov-Arnold Networks in Transfer Learning
探索用于真实图像清晰度评估的柯尔莫哥洛夫-阿诺德网络	Shaode Yu	PDF	N/A	Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment
SwinGS：用于任意长度体积视频流的滑动窗口高斯散射技术	Bangya Liu	PDF	N/A	SwinGS: Sliding Window Gaussian Splatting for Volumetric Video Streaming with Arbitrary Length
从不确定性到清晰：通过语义扩展实现有限生物医学样本的类增量学习	Yifei Yao	PDF	N/A	From Uncertainty to Clarity: Uncertainty-Guided Class-Incremental Learning for Limited Biomedical Samples via Semantic Expansion
DiTAS：通过增强激活平滑量化扩散变压器	Zhenyuan Dong	PDF	N/A	DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing
人机协作的相关性	Xiaotong Zhang	PDF	N/A	Relevance for Human Robot Collaboration
GatedUniPose：一种结合UniRepLKNet和门控卷积的新型姿态估计方法	Liang Feng	PDF	N/A	GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution
使用同态加密进行高效隐私保护的KAN推理	Zhizheng Lai	PDF	N/A	Efficient Privacy-Preserving KAN Inference Using Homomorphic Encryption
自上而下的活动表示学习用于视频问答	Yanan Wang	PDF	N/A	Top-down Activity Representation Learning for Video Question Answering
多对象事件图表示学习用于视频问答	Yanan Wang	PDF	N/A	Multi-object event graph representation learning for Video Question Answering
通过可解释的状态空间模型学习三维高分辨率磁共振图像中的脑肿瘤表示	Qingqiao Hu	PDF	N/A	Learning Brain Tumor Representation in 3D High-Resolution MR Images via Interpretable State Space Models
琉璃：日文通用文本嵌入	Hayato Tsukagoshi	PDF	N/A	Ruri: Japanese General Text Embeddings
应用于计算机视觉问题的迁移学习：当前进展、局限性与机遇的综述	Aaryan Panda	PDF	N/A	Transfer Learning Applied to Computer Vision Problems: Survey on Current Progress, Limitations, and Opportunities
DFDG：无数据双生成器对抗蒸馏用于一次性联邦学习	Kangyang Luo	PDF	N/A	DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning
大型语言模型是模式匹配器：使用ChatGPT编辑半结构化和结构化文档	Irene Weber	PDF	N/A	Large Language Models are Pattern Matchers: Editing Semi-Structured and Structured Documents with ChatGPT
长尾音乐自动标签：一种少样本方法	T. Aleksandra Ma	PDF	N/A	Music auto-tagging in the long tail: A few-shot approach
GRE^2-MDCL：通过多维对比学习增强的图表示嵌入	Kaizhe Fan	PDF	N/A	GRE^2-MDCL: Graph Representation Embedding Enhanced via Multidimensional Contrastive Learning
推进深度任意模型以实现内窥镜下无监督单目深度估计	Bojian Li	PDF	N/A	Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy
FIReStereo：用于UAS在视觉退化环境中深度感知的森林红外立体数据集	Devansh Dhrafani	PDF	N/A	FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments
CollaMamba：基于跨代理时空状态空间模型的有效协同感知	Yang Li	PDF	N/A	CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model
实验法律人工智能解决方案：以获取司法公正的问答为例	Jonathan Li	PDF	N/A	Experimenting with Legal AI Solutions: The Case of Question-Answering for Access to Justice
稀疏标注图中的节点分类虚拟节点生成	Hang Cui	PDF	N/A	Virtual Node Generation for Node Classification in Sparsely-Labeled Graphs
无数据集限制玻尔兹曼机上的权重初始化	Muneki Yasuda	PDF	N/A	Dataset-Free Weight-Initialization on Restricted Boltzmann Machine
通过模块级噪声攻击端到端自动驾驶	Lu Wang	PDF	N/A	Attack End-to-End Autonomous Driving through Module-Wise Noise
超级单调对齐搜索	Junhyeok Lee	PDF	N/A	Super Monotonic Alignment Search
DSBench：数据科学代理距离成为数据科学专家还有多远？	Liqiang Jing	PDF	N/A	DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?
TMFNet：用于彩色图像操作链检测的双流多通道融合网络	Yakun Niu	PDF	N/A	TMFNet: Two-Stream Multi-Channels Fusion Networks for Color Image Operation Chain Detection
临界阻尼三阶朗之万动力学	Benjamin Sterling	PDF	N/A	Critically Damped Third-Order Langevin Dynamics
从平衡中学习：纠正长尾场景中的知识转移	Xinlei Huang	PDF	N/A	Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios
利用排序模型提升问答文本检索：基准测试、微调与部署RAG的重排器	Gabriel de Souza P. Moreira	PDF	N/A	Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG
在俄乌战争期间，对Telegram上的信息叙事检测与演变的建模	Patrick Gerard	PDF	N/A	Modeling Information Narrative Detection and Evolution on Telegram during the Russia-Ukraine War
开放词汇远程感知图像语义分割	Qinglong Cao	PDF	N/A	Open-Vocabulary Remote Sensing Image Semantic Segmentation
利用受限玻尔兹曼机中的目标能量进行比率散度学习：超越库尔贝克-莱布勒散度学习	Yuichi Ishida	PDF	N/A	Ratio Divergence Learning Using Target Energy in Restricted Boltzmann Machines: Beyond Kullback--Leibler Divergence Learning
基于话语重写的无监督对话主题分割模型	Xia Hou	PDF	N/A	An Unsupervised Dialogue Topic Segmentation Model Based on Utterance Rewriting
变换物理信息神经网络用于对流-扩散方程	Jiajing Guan	PDF	N/A	Transformed Physics-Informed Neural Networks for The Convection-Diffusion Equation