Arxiv 2024-11-07 Papers

标题	作者	PDF链接	代码仓库	Title
SVDQuant：通过低秩成分吸收异常值，用于4比特扩散模型	Muyang Li	PDF	N/A	SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
ProEdit：高质量3D场景编辑只需简单的渐进式操作	Jun-Kun Chen	PDF	N/A	ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing
Diff-2-in-1：利用扩散模型弥合生成与密集感知之间的差距	Shuhong Zheng	PDF	N/A	Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models
ReCapture：使用掩码视频微调技术，为用户提供的视频生成视频摄像机控制	David Junhao Zhang	PDF	N/A	ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning
分析视觉符号的语言	David M. Chan	PDF	N/A	Analyzing The Language of Visual Tokens
DynaMem：面向开放世界移动操作的在线动态时空语义记忆	Peiqi Liu	PDF	N/A	DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation
针线穿引：大型语言模型能否在近百万规模的干草堆中找到线索？	Jonathan Roberts	PDF	N/A	Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?
LLM2CLIP：强大的语言模型解锁更丰富的视觉表示	Weiquan Huang	PDF	N/A	LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
HourVideo：1小时视频语言理解	Keshigeyan Chandrasegaran	PDF	N/A	HourVideo: 1-Hour Video-Language Understanding
混合变压器：一种用于多模态基础模型的稀疏可扩展架构	Weixin Liang	PDF	N/A	Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
LoFi：利用隐式神经表示实现可扩展的局部图像重建	AmirEhsan Khorashadizadeh	PDF	N/A	LoFi: Scalable Local Image Reconstruction with Implicit Neural Representation
负责任的人工智能公共采购？了解美国城市的实践、挑战与需求	Nari Johnson	PDF	N/A	Public Procurement for Responsible AI? Understanding U.S. Cities' Practices, Challenges, and Needs
哪些部分去了哪里？基于信息瓶颈的过去与未来传递熵分解	Kieran A. Murphy	PDF	N/A	Which bits went where? Past and future transfer entropy decomposition with the information bottleneck
重新思考基于偏好的奖励建模中的布拉德利-特里模型：基础、理论与替代方案	Hao Sun	PDF	N/A	Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives
因果注意力掩码中的聚类	Nikita Karagodin	PDF	N/A	Clustering in Causal Attention Masking
SG-I2V：图像到视频生成中的自引导轨迹控制	Koichi Namekata	PDF	N/A	SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
少样本任务学习通过逆生成建模实现	Aviv Netanyahu	PDF	N/A	Few-Shot Task Learning through Inverse Generative Modeling
语义中心假设：语言模型在不同语言和模态之间共享语义表示	Zhaofeng Wu	PDF	N/A	The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities
平面反射感知神经辐射场	Chen Gao	PDF	N/A	Planar Reflection-Aware Neural Radiance Fields
DINO-WM：在预训练视觉特征上的世界模型实现零样本规划	Gaoyue Zhou	PDF	N/A	DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning
增强逆向工程：研究与基准测试用于反编译二进制文件漏洞分析的大型语言模型	Dylan Manuel	PDF	N/A	Enhancing Reverse Engineering: Investigating and Benchmarking Large Language Models for Vulnerability Analysis in Decompiled Binaries
噪声零样本协调：打破零样本协调游戏中的共同知识假设	Usman Anwar	PDF	N/A	Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games
后缀解码：一种加速大型语言模型推理的无模型方法	Gabriele Oliaro	PDF	N/A	SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference
AsCAN：用于高效识别和生成的非对称卷积-注意力网络	Anil Kag	PDF	N/A	AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
BitNet a4.8：适用于1-bit LLMs的4-bit激活功能	Hongyu Wang	PDF	N/A	BitNet a4.8: 4-bit Activations for 1-bit LLMs
VAIR：室内场景中低成本、多模态透明表面重建的视觉-声学隐式表示	Advaith V. Sethuraman	PDF	N/A	VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes
关于大型语言模型诊断不确定性估计的立场文件：下一个词的概率并非预测试概率	Yanjun Gao	PDF	N/A	Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability
利用重识别技术揭示视频扩散模型中的隐藏子空间	Mischa Dombrowski	PDF	N/A	Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification
CAD-MLLM：通过MLLM实现多模态条件下的CAD生成统一	Jingwei Xu	PDF	N/A	CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM
M3DocRAG：多模态检索是实现多页多文档理解的关键	Jaemin Cho	PDF	N/A	M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding
估计文本分类中顺序相关文学属性的影响：一种以数据为中心的假设检验方法	Gideon Yoffe	PDF	N/A	Estimating the Influence of Sequentially Correlated Literary Properties in Textual Classification: A Data-Centric Hypothesis-Testing Approach
SPGD：最陡扰动梯度下降优化	Amir M. Vahedi	PDF	N/A	SPGD: Steepest Perturbed Gradient Descent Optimization
基于强化学习的自动视频编辑方法，利用预训练的视觉-语言模型	Panwen Hu	PDF	N/A	A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model
帕累托集识别与后验采样	Cyrille Kone	PDF	N/A	Pareto Set Identification With Posterior Sampling
Fed-LDR：基于节点的模型优化与联邦局部数据注入图创建	Jiechao Gao	PDF	N/A	Fed-LDR: Federated Local Data-infused Graph Creation with Node-centric Model Refinement
SaSR-Net：源感知语义表示网络，用于增强视听问答	ianyu Yang	PDF	N/A	SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering
DimensionX：通过可控视频扩散从单一图像创建任意3D和4D场景	Wenqiang Sun	PDF	N/A	DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
StoryAgent：通过多智能体协作实现定制化讲故事视频生成	Panwen Hu	PDF	N/A	StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
MVSplat360：从稀疏视角进行前馈360场景合成	Yuedong Chen	PDF	N/A	MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
VideoGLaMM：一种用于视频中像素级视觉定位的大型多模态模型	Shehan Munasinghe	PDF	N/A	VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
GPTKB：从语言模型构建超大规模知识库	Yujia Hu	PDF	N/A	GPTKB: Building Very Large Knowledge Bases from Language Models
Stem-OB：通过扩散反演实现类似干细胞的收敛性观察，从而实现可泛化的视觉模仿学习	Kaizhe Hu	PDF	N/A	Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion
评估用于自主航运的强化学习算法的鲁棒性	Bavo Lesy	PDF	N/A	Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping
GASE：生成性增强的句子编码	Manuel Frank	PDF	N/A	GASE: Generatively Augmented Sentence Encoding
结构至关重要：动态政策梯度	Sara Klein	PDF	N/A	Structure Matters: Dynamic Policy Gradient
鲁棒虹膜中心定位用于辅助眼动追踪	Nipun Sandamal Ranasekara Pathiranage	PDF	N/A	Robust Iris Centre Localisation for Assistive Eye-Gaze Tracking
通过结合二部图和完全有向图来增强缺失数据插补	Zhaoyang Zhang	PDF	N/A	Enhancing Missing Data Imputation through Combined Bipartite Graph and Complete Directed Graph
OpenCoder：顶级代码大型语言模型的开放食谱	Siming Huang	PDF	N/A	OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
采样引导的异质图神经网络结合时间平滑性用于可扩展的纵向数据插补	Zhaoyang Zhang	PDF	N/A	Sampling-guided Heterogeneous Graph Neural Network with Temporal Smoothing for Scalable Longitudinal Data Imputation
在视觉-语言模型提示学习的时代	Ankit Jha	PDF	N/A	In the Era of Prompt Learning with Vision-Language Models
具有基础模型的图形用户界面代理：综合调查	Shuai Wang	PDF	N/A	GUI Agents with Foundation Models: A Comprehensive Survey
用于社交网络嵌入的非欧几里得混合模型	Roshni G. Iyer	PDF	N/A	Non-Euclidean Mixture Model for Social Network Embedding
FrontierMath：评估人工智能高级数学推理能力的基准	Elliot Glazer	PDF	N/A	FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
思考智能，行动SMARL！分析多智能体强化学习中的概率逻辑驱动安全	Satchit Chatterji	PDF	N/A	Think Smart, Act SMARL! Analyzing Probabilistic Logic Driven Safety in Multi-Agent Reinforcement Learning
ZAHA: 介绍立面泛化等级与大规模点云立面语义分割基准数据集	Olaf Wysocki	PDF	N/A	ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset
OneProt：迈向多模态蛋白质基础模型	Klemens Flöge	PDF	N/A	OneProt: Towards Multi-Modal Protein Foundation Models
使用预训练语言模型对西班牙政党推文进行情感分析	Chuqiao Song	PDF	N/A	Sentiment Analysis of Spanish Political Party Tweets Using Pre-trained Language Models
基于远程教育讲座语义的多功能自动编辑系统	Panwen Hu	PDF	N/A	A multi-purpose automatic editing system based on lecture semantics for remote education
临床医生之声：医疗领域中可解释人工智能的基本考量	T. E. Röber	PDF	N/A	Clinicians' Voice: Fundamental Considerations for XAI in Healthcare
用于分类的带有模糊基本事实的保形化信度区域	Michele Caprio	PDF	N/A	Conformalized Credal Regions for Classification with Ambiguous Ground Truth
提示引导的内部状态用于大型语言模型幻觉检测	Fujie Zhang	PDF	N/A	Prompt-Guided Internal States for Hallucination Detection of Large Language Models
广义随机Halpern方案的渐近正则性及其应用	Nicholas Pischke	PDF	N/A	Asymptotic regularity of a generalised stochastic Halpern scheme with applications
用于不完整CT重建的可微高斯表示	Shaokai Wu	PDF	N/A	Differentiable Gaussian Representation for Incomplete CT Reconstruction
在具有间距目标的预算拍卖中学习	Giannis Fikioris	PDF	N/A	Learning in Budgeted Auctions with Spacing Objectives
基于机器学习和优化的统计物理学对偶性方法	Andrea E. V. Ferrari	PDF	N/A	Machine learning and optimization-based approaches to duality in statistical physics
深度强化学习中的可塑性丧失：一项调查	Timo Klein	PDF	N/A	Plasticity Loss in Deep Reinforcement Learning: A Survey
D$^3$epth：动态场景中使用动态掩码的自监督深度估计	Siyu Chen	PDF	N/A	D$^3$epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes
VTechAGP：一个面向学术到大众读者的文本释义数据集及基准模型	Ming Cheng	PDF	N/A	VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models
文言文何时有助？量化汉字和汉文中的跨语言迁移	Seyoung Song	PDF	N/A	When Does Classical Chinese Help? Quantifying Cross-Lingual Transfer in Hanja and Kanbun
基于端到端Inception-Unet的生成对抗网络用于雪和雨的去除	Ibrahim Kajo	PDF	N/A	End-to-end Inception-Unet based Generative Adversarial Networks for Snow and Rain Removals
利用基于梯度的模拟方法进行粒子加速器中的多目标优化	Kishansingh Rajput	PDF	N/A	Harnessing the Power of Gradient-Based Simulations for Multi-Objective Optimization in Particle Accelerators
一种用于优化人工神经网络在非易失性存储器交叉阵列上映射的简单打包算法	W. Haensch	PDF	N/A	A Simple Packing Algorithm for Optimized Mapping of Artificial Neural Networks onto Non-Volatile Memory Cross-Bar Arrays
LuxBank：首个卢森堡语通用依存树库	Alistair Plum	PDF	N/A	LuxBank: The First Universal Dependency Treebank for Luxembourgish
软霍夫丁树：一种数据流上的透明且可微分的模型	Kirsten Köbschall	PDF	N/A	Soft Hoeffding Tree: A Transparent and Differentiable Model on Data Streams
防御深度回归模型免受后门攻击	Lingyu Du	PDF	N/A	Defending Deep Regression Models against Backdoor Attacks
GANESH：用于无镜头成像的通用性神经辐射场	Rakesh Raj Madavan	PDF	N/A	GANESH: Generalizable NeRF for Lensless Imaging
Kwai-STaR：将大型语言模型转化为状态转换推理器	Xingyu Lu	PDF	N/A	Kwai-STaR: Transform LLMs into State-Transition Reasoners
MPVO：基于运动先验的视觉里程计用于点目标导航	Sayan Paul	PDF	N/A	MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation
AlignXIE：通过跨语言对齐提升多语言信息抽取	Yuxin Zuo	PDF	N/A	AlignXIE: Improving Multilingual Information Extraction by Cross-Lingual Alignment
提升投资分析：优化金融研究中的人工智能代理协作	Xuewen Han	PDF	N/A	Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research
权衡之道：多目标强化学习的政策总结	Zuzanna Osika	PDF	N/A	Navigating Trade-offs: Policy Summarization for Multi-Objective Reinforcement Learning
一个用于全切片图像肾小球分割的高效流程	Quan Huu Cap	PDF	N/A	An Effective Pipeline for Whole-Slide Image Glomerulus Segmentation
学习快速解决车辆路径问题：一种针对有限车队时间约束车辆路径问题的神经优化方法	Elija Deineko	PDF	N/A	Learn to Solve Vehicle Routing Problems ASAP: A Neural Optimization Approach for Time-Constrained Vehicle Routing Problems with Finite Vehicle Fleet
从数据中学习动态系统：基于梯度的字典优化	Mohammad Tabish	PDF	N/A	Learning dynamical systems from data: Gradient-based dictionary optimization
注意力掩码帮助对抗性攻击绕过安全检测器	Yunfan Shi	PDF	N/A	Attention Masks Help Adversarial Attacks to Bypass Safety Detectors
《米诺里亚的挖掘：未知的、代表性不足的和表现不佳的少数群体》	Mohsen Dehghankar	PDF	N/A	Mining the Minoria: Unknown, Under-represented, and Under-performing Minority Groups
脉冲神经网络的零样本时间分辨率域自适应	Sanja Karilanova	PDF	N/A	Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks
通过语义和统计特征评估越南语文本可读性的研究	Hung Tuan Le	PDF	N/A	A study of Vietnamese readability assessing through semantic and statistical features
RetrieveGPT：融合提示与数学模型以提升代码混合信息检索效果	Aniket Deroy	PDF	N/A	RetrieveGPT: Merging Prompts and Mathematical Models for Enhanced Code-Mixed Information Retrieval
使用结构基序的等变图注意力网络用于预测细胞系特异性协同药物组合	Zachary Schwehr	PDF	N/A	Equivariant Graph Attention Networks with Structural Motifs for Predicting Cell Line-Specific Synergistic Drug Combinations
驯服整流流以实现反演与编辑	Jiangshan Wang	PDF	N/A	Taming Rectified Flow for Inversion and Editing
尊重极限：具有最优值界限的贝叶斯优化	Hanyang Wang	PDF	N/A	Respecting the limit:Bayesian optimization with a bound on the optimal value
卷积可微逻辑门网络	Felix Petersen	PDF	N/A	Convolutional Differentiable Logic Gate Networks
神经形态无线分裂计算与多级尖峰	Dengyu Wu	PDF	N/A	Neuromorphic Wireless Split Computing with Multi-Level Spikes
通过领域适应控制文本到图像扩散模型中的人体形状和姿态	Benito Buchheim	PDF	N/A	Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation
子空间约束二次矩阵分解：算法及应用	Zheng Zhai	PDF	N/A	Subspace-Constrained Quadratic Matrix Factorization: Algorithm and Applications
NeuroFly：一种用于全脑单神经元重构的框架	Rubin Zhao	PDF	N/A	NeuroFly: A framework for whole-brain single neuron reconstruction
利用模拟数据进行半监督域适应SAR目标识别的渐进多层次对齐	Xinzheng Zhang	PDF	N/A	Progressive Multi-Level Alignments for Semi-Supervised Domain Adaptation SAR Target Recognition Using Simulated Data
差分隐私概述及基本技术	Ferdinando Fioretto	PDF	N/A	Differential Privacy Overview and Fundamental Techniques
探索多模态大型语言模型中的层次分子图表示	Chengxin Hu	PDF	N/A	Exploring Hierarchical Molecular Graph Representation in Multimodal LLMs
从CNN到ConvRNN：为时间序列异常检测调整可视化技术	Fabien Poirier	PDF	N/A	From CNN to ConvRNN: Adapting Visualization Techniques for Time-Series Anomaly Detection
ESC-MISR：增强遥感多图像超分辨率的空间相关性	Zhihui Zhang	PDF	N/A	ESC-MISR: Enhancing Spatial Correlations for Multi-Image Super-Resolution in Remote Sensing
用于行星漫游车导航的力矩传感器现场评估	Levin Gerdes	PDF	N/A	Field Assessment of Force Torque Sensors for Planetary Rover Navigation
BhasaAnuvaad：一个包含14种印度语言的语音翻译数据集	Sparsh Jain	PDF	N/A	BhasaAnuvaad: A Speech Translation Dataset for 14 Indian Languages
动态亮度自适应用于鲁棒的多模态图像融合	Yiming Sun	PDF	N/A	Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion
机器学习中虚假性的多重维度	Samuel J. Bell	PDF	N/A	The Multiple Dimensions of Spuriousness in Machine Learning
网络碎片化是一种有用的复杂性度量吗？	Coenraad Mouton	PDF	N/A	Is network fragmentation a useful complexity measure?
具有大电磁核的互点学习网络用于SAR开放集识别	Xiayang Xiao	PDF	N/A	Reciprocal Point Learning Network with Large Electromagnetic Kernel for SAR Open-Set Recognition
个性化联邦学习用于跨视角地理定位	Christos Anagnostopoulos	PDF	N/A	Personalized Federated Learning for Cross-view Geo-localization
AWARE叙述者和利用大型语言模型从智能手机感知数据中提取行为洞察	Tianyi Zhang	PDF	N/A	AWARE Narrator and the Utilization of Large Language Models to Extract Behavioral Insights from Smartphone Sensing Data
使用网络流模型解决细胞制造系统中的广义分组问题	Md. Kutub Uddin	PDF	N/A	Solving Generalized Grouping Problems in Cellular Manufacturing Systems Using a Network Flow Model
基于深度神经网络的三维云层检索：适用于可变太阳照明和多视角星载成像	Tamar Klein	PDF	N/A	DNN-based 3D Cloud Retrieval for Variable Solar Illumination and Multiview Spaceborne Imaging
使用预训练模型的差分隐私持续学习	Marlon Tobaben	PDF	N/A	Differentially Private Continual Learning using Pre-Trained Models
CaPo：高效具身多智能体合作的协同计划优化	Jie Liu	PDF	N/A	CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation
基于社会意识意见的导航与椭圆极限环	Giulia d'Addato	PDF	N/A	Socially-Aware Opinion-Based Navigation with Oval Limit Cycles
通过多智能体强化学习的语义感知资源管理用于C-V2X车队	Zhiyu Shao	PDF	N/A	Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning
CUIfy XR：一个开源包，用于在XR中嵌入由LLM驱动的对话代理	Kadir Burak Buldu	PDF	N/A	CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR
EffiCANet：利用卷积注意力实现高效的时间序列预测	Xinxing Zhou	PDF	N/A	EffiCANet: Efficient Time Series Forecasting with Convolutional Attention
利用多模态大型语言模型解释和发现视觉文化遗产收藏	Taylor Arnold	PDF	N/A	Explainable Search and Discovery of Visual Cultural Heritage Collections with Multimodal Large Language Models
通过多种磁共振成像模式增强临床显著性前列腺癌预测的信任度	Benjamin Ng	PDF	N/A	Enhancing Trust in Clinically Significant Prostate Cancer Prediction with Multiple Magnetic Resonance Imaging Modalities
历史摄影收藏的自动图像色彩映射	Taylor Arnold	PDF	N/A	Automated Image Color Mapping for a Historic Photographic Collection
使用遗传算法寻找强彩票网络	Philipp Altmann	PDF	N/A	Finding Strong Lottery Ticket Networks with Genetic Algorithms
ICH-SCNet：基于CLIP引导的SAM机制的脑内出血分割与预后分类网络	Xinlei Yu	PDF	N/A	ICH-SCNet: Intracerebral Hemorrhage Segmentation and Prognosis Classification Network Using CLIP-guided SAM mechanism
中心性图移位算子用于图神经网络	Yassine Abbahaddou	PDF	N/A	Centrality Graph Shift Operators for Graph Neural Networks
IGDrivSim：一个用于评估自动驾驶中模仿差距的基准	Clémence Grislain	PDF	N/A	IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving
DISCO：发现文本分类模型中的过拟合现象作为因果规则	Zijian Zhang	PDF	N/A	DISCO: DISCovering Overfittings as Causal Rules for Text Classification Models
DanceFusion：一种用于音频驱动舞蹈动作重构的时空骨架扩散变换器	Li Zhao	PDF	N/A	DanceFusion: A Spatio-Temporal Skeleton Diffusion Transformer for Audio-Driven Dance Motion Reconstruction
wav2sleep：一种从生理信号进行睡眠阶段分类的统一多模态方法	Jonathan F. Carter	PDF	N/A	wav2sleep: A Unified Multi-Modal Approach to Sleep Stage Classification from Physiological Signals
TAP-VL：文本布局感知预训练，用于增强视觉-语言模型	Jonathan Fhima	PDF	N/A	TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
实践教程：使用LLM和人在回路中的标注方法	Ekaterina Artemova	PDF	N/A	Hands-On Tutorial: Labeling with LLM and Human-in-the-Loop
通过地理加权学习进行网络犯罪预测	Muhammad Al-Zafar Khan	PDF	N/A	Cybercrime Prediction via Geographically Weighted Learning
通过合成数据增强改进的多任务脑肿瘤分割	André Ferreira	PDF	N/A	Improved Multi-Task Brain Tumour Segmentation with Synthetic Data Augmentation
使用3D WDM进行脑肿瘤切除与缺失模态生成	André Ferreira	PDF	N/A	Brain Tumour Removing and Missing Modality Generation using 3D WDM
KL正则化上下文老虎机和RLHF的锐利分析	Heyang Zhao	PDF	N/A	Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
使用深度学习方法对混凝土结构进行多时相裂缝分割	Said Harb	PDF	N/A	Multi-temporal crack segmentation in concrete structure using deep learning approaches
利用3D城市建模和Carto2S数据集进行人口估算——一个案例研究	Jai G Singla	PDF	N/A	Population estimation using 3D city modelling and Carto2S datasets -- A case study
利用高分辨率卫星影像和数字高程模型对印度城市进行太阳能潜力分析	Jai Singla	PDF	N/A	Solar potential analysis over Indian cities using high-resolution satellite imagery and DEM
跨图像和图像内原型学习用于多标签疾病诊断和解释	Chong Wang	PDF	N/A	Cross- and Intra-image Prototypical Learning for Multi-label Disease Diagnosis and Interpretation
FASSILA：用于阿尔及利亚方言假新闻检测和情感分析的语料库	Amin Abdedaiem	PDF	N/A	FASSILA: A Corpus for Algerian Dialect Fake News Detection and Sentiment Analysis
自校准的列表式重排序与大型语言模型	Ruiyang Ren	PDF	N/A	Self-Calibrated Listwise Reranking with Large Language Models
社交自我网格估计	Luca Scofano	PDF	N/A	Social EgoMesh Estimation
半监督学习对线段检测的影响	Johanna Engman	PDF	N/A	The Impact of Semi-Supervised Learning on Line Segment Detection
TexLiverNet：利用医学知识和空间-频率感知实现增强的肝脏肿瘤分割	Xiaoyan Jiang	PDF	N/A	TexLiverNet: Leveraging Medical Knowledge and Spatial-Frequency Perception for Enhanced Liver Tumor Segmentation
通过参数化核验证神经网络对抗卷积扰动的验证	Benedikt Brückner	PDF	N/A	Verification of Neural Networks against Convolutional Perturbations via Parameterised Kernels
Tibyan语料库：利用ChatGPT进行阿拉伯语语法错误校正的平衡且全面的错误覆盖语料库	Ahlam Alrehili	PDF	N/A	Tibyan Corpus: Balanced and Comprehensive Error Coverage Corpus Using ChatGPT for Arabic Grammatical Error Correction
一阶段目标检测在面对分布外数据时的固有鲁棒性	Aitor Martinez-Seras	PDF	N/A	On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data
摘要数据集的状态与命运	Noam Dahan	PDF	N/A	The State and Fate of Summarization Datasets
对皮肤科的热情：借助撒哈拉以南非洲色素性皮肤图像弥合多样性差距	Philippe Gottfrois	PDF	N/A	PASSION for Dermatology: Bridging the Diversity Gap with Pigmented Skin Images from Sub-Saharan Africa
解释MuZero规划中的学习模型	Hung Guei	PDF	N/A	Interpreting the Learned Model in MuZero Planning
通过差分隐私测量统计异质性实现鲁棒的联邦分析	Mary Scott	PDF	N/A	Towards Robust Federated Analytics via Differentially Private Measurements of Statistical Heterogeneity
多智能体即社会群体：探究人类-智能体互动中多智能体的社会影响	Tianqi Song	PDF	N/A	Multi-Agents are Social Groups: Investigating Social Influence of Multiple Agents in Human-Agent Interactions
低资源语言自动语音识别的多阶段微调策略	Leena G Pillai	PDF	N/A	Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages
DomainGallery：通过以属性为中心的微调实现少样本领域驱动图像生成	Yuxuan Duan	PDF	N/A	DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning
高阶GNN与效率的结合：稀疏Sobolev图神经网络	Jhony H. Giraldo	PDF	N/A	Higher-Order GNNs Meet Efficiency: Sparse Sobolev Graph Neural Networks
标签噪声对学习复杂特征的影响	Rahul Vashisht	PDF	N/A	Impact of Label Noise on Learning Complex Features
选民模型的推广：有影响力的节点及其收敛性质	Abhiram Manohara	PDF	N/A	A Generalisation of Voter Model: Influential Nodes and Convergence Properties
基于模型的离线强化学习中的受限潜在动作策略	Marvin Alles	PDF	N/A	Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
在词级实现高效可解释性的文字修剪	Rohan Kumar Yadav	PDF	N/A	Pruning Literals for Highly Efficient Explainability at Word Level
不确定性预测神经网络（UpNet）：将人工神经网络嵌入贝叶斯反演框架以量化遥感反演的不确定性	Dasheng Fan	PDF	N/A	Uncertainty Prediction Neural Network (UpNet): Embedding Artificial Neural Network in Bayesian Inversion Framework to Quantify the Uncertainty of Remote Sensing Retrieval
加权结构化论证中前提解码评价的公理化研究	Jonathan Ben-Naim	PDF	N/A	An Axiomatic Study of the Evaluation of Enthymeme Decoding in Weighted Structured Argumentation
Peri-midFormer：用于时间序列分析的周期性金字塔Transformer	Qiang Wu	PDF	N/A	Peri-midFormer: Periodic Pyramid Transformer for Time Series Analysis
使用Transformer的按测量插值	Borjan Geshkovski	PDF	N/A	Measure-to-measure interpolation using Transformers
视觉语言模型是情境价值学习者	Yecheng Jason Ma	PDF	N/A	Vision Language Models are In-Context Value Learners
交互式进化多目标优化中相关目标的动态检测与偏好漂移的适应	Seyed Mahdi Shavarani	PDF	N/A	Dynamic Detection of Relevant Objectives and Adaptation to Preference Drifts in Interactive Evolutionary Multi-Objective Optimization
将大型语言模型蒸馏为BERT以用于网页搜索排名的最佳实践	Dezhi Ye	PDF	N/A	Best Practices for Distilling Large Language Models into BERT for Web Search Ranking
元推理提升了大型语言模型中的工具使用能力	Lisa Alazraki	PDF	N/A	Meta-Reasoning Improves Tool Use in Large Language Models
超立方体策略正则化框架用于离线强化学习	Yi Shen	PDF	N/A	Hypercube Policy Regularization Framework for Offline Reinforcement Learning
神经指纹用于对抗攻击检测	Haim Fisher	PDF	N/A	Neural Fingerprints for Adversarial Attack Detection
利用大数据技术实时检测社交网络帖子中的压力	Hai-Yen Phan Nguyen	PDF	N/A	Real-time stress detection on social network posts using big data technology
番茄，番茄，番茄：衡量多语言语言模型中子词间共享语义的作用	Xinyu Zhang	PDF	N/A	Tomato, Tomahto, Tomate: Measuring the Role of Shared Semantics among Subwords in Multilingual Language Models
GenJoin：一种条件生成式计划到计划查询优化器，能够从子计划提示中学习	Pavel Sulimov	PDF	N/A	GenJoin: Conditional Generative Plan-to-Plan Query Optimizer that Learns from Subplan Hints
基于L0正则化稀疏编码的可解释网络用于多模态图像融合	Gargi Panda	PDF	N/A	l0-Regularized Sparse Coding-based Interpretable Network for Multi-Modal Image Fusion
使用深度学习与MediaPipe Holistic的连续手语识别系统	Sharvani Srivastava	PDF	N/A	Continuous Sign Language Recognition System using Deep Learning with MediaPipe Holistic
归一化空间对齐：一种多用途的表示分析度量	Danish Ebadulla	PDF	N/A	Normalized Space Alignment: A Versatile Metric for Representation Analysis
使用线性特征解耦方法提高深度学习对非线性薛定谔方程的拟合精度	Yunfan Zhang	PDF	N/A	Improve the Fitting Accuracy of Deep Learning for the Nonlinear Schrödinger Equation Using Linear Feature Decoupling Method
FedDP：基于联邦学习的组织病理学图像分割隐私保护方法	Liangrui Pan	PDF	N/A	FedDP: Privacy-preserving method based on federated learning for histopathology image segmentation
Pose2Trajectory：利用Transformer模型基于人体姿态预测网球运动员的运动轨迹	Ali K. AlShami	PDF	N/A	Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory
灭霸：通过融入心智技能的大型语言模型提升对话代理	Young-Jun Lee	PDF	N/A	Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model
协同引导的伪标签区域监督在半监督医学图像分割中的应用	Tao Wang	PDF	N/A	Synergy-Guided Regional Supervision of Pseudo Labels for Semi-Supervised Medical Image Segmentation
序列到序列扩散桥模型	Hao Yang	PDF	N/A	Series-to-Series Diffusion Bridge Model
CFPNet：通过跨区域特征传播改进轻量级ToF深度补全	Laiyan Ding	PDF	N/A	CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature Propagation
LLM-R：一种结合分层代理和RAG的领域自适应维护方案生成框架	Laifa Tao	PDF	N/A	LLM-R: A Framework for Domain-Adaptive Maintenance Scheme Generation Combining Hierarchical Agents and RAG
无人机辅助桥梁检测的深度学习模型：YOLO基准分析	Trong-Nhan Phan	PDF	N/A	Deep Learning Models for UAV-Assisted Bridge Inspection: A YOLO Benchmark Analysis
ML-Promise：一个用于企业承诺验证的多语言数据集	Yohei Seki	PDF	N/A	ML-Promise: A Multilingual Dataset for Corporate Promise Verification
FreeCap：开放环境中无需校准的混合动作捕捉	Aoru Xue	PDF	N/A	FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments
Magentic-One：一种用于解决复杂任务的通用多智能体系统	Adam Fourney	PDF	N/A	Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks
通过目标多样性在开放式模拟器中实现自适应代理训练	Robby Costales	PDF	N/A	Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity
CDT能否通过修改人类学来合理化事前最优政策？	Emery Cooper	PDF	N/A	Can CDT rationalise the ex ante optimal policy via modified anthropics?
GPT引导的蒙特卡洛树搜索用于金融欺诈检测中的符号回归	Prashank Kadam	PDF	N/A	GPT-Guided Monte Carlo Tree Search for Symbolic Regression in Financial Fraud Detection
高效的单幅图像非均匀性校正算法	Yohann Tendero	PDF	N/A	Efficient single image non-uniformity correction algorithm
BV-G结构+纹理分解模型的性质。应用于卫星图像中的道路检测	Jerome Gilles	PDF	N/A	Properties of BV-G structures + textures decomposition models. Application to road detection in satellite images
比较生成性移动模型的公平性	Daniel Wang	PDF	N/A	Comparing Fairness of Generative Mobility Models
梯度局部化提升了语言模型的终身预训练效果	Jared Fernandez	PDF	N/A	Gradient Localization Improves Lifelong Pretraining of Language Models
ACCIO：通过聚合对比学习增强的表格理解	Whanhee Cho	PDF	N/A	ACCIO: Table Understanding Enhanced via Contrastive Learning with Aggregations
预训练智能体和世界模型的缩放法则	Tim Pearce	PDF	N/A	Scaling Laws for Pre-training Agents and World Models
统一解释性和可控性：通过干预进行评估	Usha Bhalla	PDF	N/A	Towards Unifying Interpretability and Control: Evaluation via Intervention
一条鱼，两条鱼，但不是整片海：对齐性降低了语言模型的概念多样性	Sonia K. Murthy	PDF	N/A	One fish, two fish, but not the whole sea: Alignment reduces language models' conceptual diversity
DELIFT：数据高效的语言模型指令微调	Ishika Agarwal	PDF	N/A	DELIFT: Data Efficient Language model Instruction Fine Tuning
贝叶斯校准的胜率估计与LLM评估器	Yicheng Gao	PDF	N/A	Bayesian Calibration of Win Rate Estimation with LLM Evaluators
基于低频GPS的长途客车无监督异常停车检测	Jiaxin Deng	PDF	N/A	Unsupervised Abnormal Stop Detection for Long Distance Coaches with Low-Frequency GPS