arXiv 2024-11-05 Papers

Title | Author | PDF Link | Code Repository
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning | Ziliang Gan | PDF | N/A
Classification Done Right for Vision-Language Pre-Training | Huang Zilong | PDF | N/A
Inference Optimal VLMs Need Only One Visual Token but Larger Models | Kevin Y. Li | PDF | N/A
LLMs for Domain Generation Algorithm Detection | Reynier Leyva La O | PDF | N/A
VERITAS: A Unified Approach to Reliability Evaluation | Rajkumar Ramamurthy | PDF | N/A
Out-of-Distribution Recovery with Object-Centric Keypoint Inverse Policy For Visuomotor Imitation Learning | George Jiayuan Gao | PDF | N/A
Interaction2Code: How Far Are We From Automatic Interactive Webpage Generation? | Jingyu Xiao | PDF | N/A
The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare | Souren Pashangpour | PDF | N/A
DiT4Edit: Diffusion Transformer for Image Editing | Kunyu Feng | PDF | N/A
SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents | Dawei Li | PDF | N/A
Oblivious Defense in ML Models: Backdoor Removal without Detection | Shafi Goldwasser | PDF | N/A
Causal Responsibility Attribution for Human-AI Collaboration | Yahang Qi | PDF | N/A
Graph-Based Semi-Supervised Segregated Lipschitz Learning | Farid Bozorgnia | PDF | N/A
Stable Matching with Ties: Approximation Ratios and Learning | Shiyun Lin | PDF | N/A
Proxy-informed Bayesian transfer learning with unknown sources | Sabina J. Sloman | PDF | N/A
ShadowMamba: State-Space Model with Boundary-Region Selective Scan for Shadow Removal | Xiujin Zhu | PDF | N/A
Discovering Data Structures: Nearest Neighbor Search and Beyond | Omar Salemohamed | PDF | N/A
Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities | Ryosuke Takata | PDF | N/A
DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models | Ying Zhou | PDF | N/A
Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution | Huan Zheng | PDF | N/A
On the Detection of Non-Cooperative RISs: Scan B-Testing via Deep Support Vector Data Description | George Stamatelis | PDF | N/A
Enhancing Transformer Training Efficiency with Dynamic Dropout | Hanrui Yan | PDF | N/A
Formal Logic-guided Robust Federated Learning against Poisoning Attacks | Dung Thuy Nguyen | PDF | N/A
Topograph: An efficient Graph-Based Framework for Strictly Topology Preserving Image Segmentation | Laurin Lux | PDF | N/A
Kernel Orthogonality does not necessarily imply a Decrease in Feature Map Redundancy in CNNs: Convolutional Similarity Minimization | Zakariae Belmekki | PDF | N/A
Knowledge Graphs of Driving Scenes to Empower the Emerging Capabilities of Neurosymbolic AI | Ruwan Wickramarachchi | PDF | N/A
Interpretable Predictive Models for Healthcare via Rational Logistic Regression | Thiti Suttaket | PDF | N/A
Beyond Grid Data: Exploring Graph Neural Networks for Earth Observation | Shan Zhao | PDF | N/A
A Personal data Value at Risk Approach | Luis Enriquez | PDF | N/A
GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis | Temitope Akinboyewa | PDF | N/A
Online Data Collection for Efficient Semiparametric Inference | Shantanu Gupta | PDF | N/A
Insights into Lunar Mineralogy: An Unsupervised Approach for Clustering of the Moon Mineral Mapper (M3) spectral data | Freja Thoresen | PDF | N/A
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models | Tariq Berrada Ifriqi | PDF | N/A
Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features | Hanyu Meng | PDF | N/A
Navigating Extremes: Dynamic Sparsity in Large Output Space | Nasib Ullah | PDF | N/A
Pre-trained Visual Dynamics Representations for Efficient Policy Learning | Hao Luo | PDF | N/A
Efficient Hamiltonian, structure and trace distance learning of Gaussian states | Marco Fanizza | PDF | N/A
A Machine Learning Approach for the Efficient Estimation of Ground-Level Air Temperature in Urban Areas | Iñigo Delgado-Enales | PDF | N/A
Unleashing the power of novel conditional generative approaches for new materials discovery | Lev Novitskiy | PDF | N/A
MA^2: A Self-Supervised and Motion Augmenting Autoencoder for Gait-Based Automatic Disease Detection | Yiqun Liu | PDF | N/A
User Centric Semantic Communications | Xunze Liu | PDF | N/A
Investigating the Applicability of a Snapshot Computed Tomography Imaging Spectrometer for the Prediction of Brix and pH of Grapes | Mads Svanborg Peters | PDF | N/A
Multiscale differential geometry learning for protein flexibility analysis | Hongsong Feng | PDF | N/A
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs | Long-Fei Li | PDF | N/A
Evaluating Machine Learning Models against Clinical Protocols for Enhanced Interpretability and Continuity of Care | Christel Sirocchi | PDF | N/A
Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting | Adrian B. Chłopowiec | PDF | N/A
Correlating Variational Autoencoders Natively For Multi-View Imputation | Ella S. C. Orme | PDF | N/A
HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features | Arnab Dey | PDF | N/A
Speech Separation with Pretrained Frontend to Minimize Domain Mismatch | Wupeng Wang | PDF | N/A
Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data | Irum Mehboob | PDF | N/A
Hotter isn't faster for a melting RNA hairpin | Huaping Li | PDF | N/A
Alpha and Prejudice: Improving $α$-sized Worst-case Fairness via Intrinsic Reweighting | Jing Li | PDF | N/A
Exploiting the Segment Anything Model (SAM) for Lung Segmentation in Chest X-ray Images | Gabriel Bellon de Carvalho | PDF | N/A
Enhancing DP-SGD through Non-monotonous Adaptive Scaling Gradient Weight | Tao Huang | PDF | N/A
ATM: Improving Model Merging by Alternating Tuning and Merging | Luca Zhou | PDF | N/A
Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising | Tao Huang | PDF | N/A
GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details | Zhongjin Luo | PDF | N/A
Evaluation of handwriting kinematics and pressure for differential diagnosis of Parkinson's disease | Peter Drotár | PDF | N/A
Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning | Bei Li | PDF | N/A
Judge Like a Real Doctor: Dual Teacher Sample Consistency Framework for Semi-supervised Medical Image Classification | Zhang Qixiang | PDF | N/A
Self-Compositional Data Augmentation for Scientific Keyphrase Generation | Mael Houbre | PDF | N/A
Can Transformers Smell Like Humans? | Farzaneh Taleb | PDF | N/A
Blending Ensemble for Classification with Genetic-algorithm generated Alpha factors and Sentiments (GAS) | Quechen Yang | PDF | N/A
HumanVLM: Foundation for Human-Scene Vision-Language Model | Dawei Dai | PDF | N/A
Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need | Qishuai Wen | PDF | N/A
Graph Agnostic Causal Bayesian Optimisation | Sumantrak Mukherjee | PDF | N/A
Adaptive Genetic Selection based Pinning Control with Asymmetric Coupling for Multi-Network Heterogeneous Vehicular Systems | Weian Guo | PDF | N/A
DA-MoE: Addressing Depth-Sensitivity in Graph-Level Analysis through Mixture of Experts | Zelin Yao | PDF | N/A
Flashy Backdoor: Real-world Environment Backdoor Attack on SNNs with DVS Cameras | Roberto Riaño | PDF | N/A
Testing Generalizability in Causal Inference | Daniel de Vassimon Manela | PDF | N/A
FEDLAD: Federated Evaluation of Deep Leakage Attacks and Defenses | Isaac Baglin | PDF | N/A
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection | Jisong Kim | PDF | N/A
Leveraging Large Language Models in Code Question Answering: Baselines and Issues | Georgy Andryushchenko | PDF | N/A
Hierarchical Orchestra of Policies | Thomas P Cannon | PDF | N/A
Data Quality Awareness: A Journey from Traditional Data Management to Data Science Systems | Sijie Dong | PDF | N/A
Neural Networks and (Virtual) Extended Formulations | Christoph Hertrich | PDF | N/A
Controlling for Unobserved Confounding with Large Language Model Classification of Patient Smoking Status | Samuel Lee | PDF | N/A
Precise Drive with VLM: First Prize Solution for PRCV 2024 Drive LM challenge | Bin Huang | PDF | N/A
Accelerating Task Generalisation with Multi-Level Hierarchical Options | Thomas P Cannon | PDF | N/A
PV-faultNet: Optimized CNN Architecture to detect defects resulting efficient PV production | Eiffat E Zaman | PDF | N/A
SUDS: A Strategy for Unsupervised Drift Sampling | Christofer Fellicious | PDF | N/A
Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation | Junchen Fu | PDF | N/A
Growing a Tail: Increasing Output Diversity in Large Language Models | Michal Shur-Ofry | PDF | N/A
Confidence Calibration of Classifiers with Many Classes | Adrien Le Coz | PDF | N/A
Sparse Reconstruction of Wavefronts using an Over-Complete Phase Dictionary | S. Howard | PDF | N/A
Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning | Yang Zhao | PDF | N/A
CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval | Xin Wen | PDF | N/A
Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation | Francisco Giral | PDF | N/A
Region-Guided Attack on the Segment Anything Model (SAM) | Xiaoliang Liu | PDF | N/A
[Vision Paper] PRObot: Enhancing Patient-Reported Outcome Measures for Diabetic Retinopathy using Chatbots and Generative AI | Maren Pielka | PDF | N/A
Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery | Liv Kåreborn | PDF | N/A
Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda | PDF | N/A
Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT | Pourya Jafarzadeh | PDF | N/A
Embedding Safety into RL: A New Take on Trust Region Methods | Nikola Milosevic | PDF | N/A
IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems | Heiko Oppel | PDF | N/A
LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior | Xingjian Tang | PDF | N/A
A scalable generative model for dynamical system reconstruction from neuroimaging data | Eric Volkmann | PDF | N/A
Grounding Natural Language to SQL Translation with Data-Based Self-Explanations | Yuankai Fan | PDF | N/A
Time-Causal VAE: Robust Financial Time Series Generator | Beatrice Acciaio | PDF | N/A
Capturing research literature attitude towards Sustainable Development Goals: an LLM-based topic modeling approach | Francesco Invernici | PDF | N/A
A Mamba Foundation Model for Time Series Forecasting | Haoyu Ma | PDF | N/A
A Post-Training Enhanced Optimization Approach for Small Language Models | Keke Zhai | PDF | N/A
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent | Yangning Li | PDF | N/A
Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery | Mohammad Kakooei | PDF | N/A
P-MOSS: Learned Scheduling For Indexes Over NUMA Servers Using Low-Level Hardware Statistics | Yeasir Rayhan | PDF | N/A
Textual Aesthetics in Large Language Models | Lingjie Jiang | PDF | N/A
Privacy-Preserving Graph-Based Machine Learning with Fully Homomorphic Encryption for Collaborative Anti-Money Laundering | Fabrianne Effendi | PDF | N/A
Theoretically Guaranteed Distribution Adaptable Learning | Chao Xu | PDF | N/A
Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization | Pengkun Jiao | PDF | N/A
Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey | Ao Fu | PDF | N/A
Photon: Federated LLM Pre-Training | Lorenzo Sani | PDF | N/A
Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression: A Distribution-Free Analysis | Yingzhen Yang | PDF | N/A
Membership Inference Attacks against Large Vision-Language Models | Zhan Li | PDF | N/A
Fried deconvolution | Jerome Gilles | PDF | N/A
Turbulence stabilization | Yu Mao | PDF | N/A
A Symmetric Dynamic Learning Framework for Diffeomorphic Medical Image Registration | Jinqiu Deng | PDF | N/A
The Translation of Circumlocution in Arabic Short Stories into English | Dalal Waadallah Shehab | PDF | N/A
TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection | Wei Wu | PDF | N/A
Enhancing Adversarial Robustness via Uncertainty-Aware Distributional Adversarial Training | Junhao Dong | PDF | N/A
AtlasSeg: Atlas Prior Guided Dual-U-Net for Cortical Segmentation in Fetal Brain MRI | Haoan Xu | PDF | N/A
Graph-DPEP: Decomposed Plug and Ensemble Play for Few-Shot Document Relation Extraction with Graph-of-Thoughts Reasoning | Tao Zhang | PDF | N/A
The Unreasonable Effectiveness of LLMs for Query Optimization | Peter Akioyamen | PDF | N/A
Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery | Bowei Du | PDF | N/A
Continual Audio-Visual Sound Separation | Weiguo Pian | PDF | N/A
OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing | Pranav Gupta | PDF | N/A
Analyzing Poverty through Intra-Annual Time-Series: A Wavelet Transform Approach | Mohammad Kakooei | PDF | N/A
SpiDR: A Reconfigurable Digital Compute-in-Memory Spiking Neural Network Accelerator for Event-based Perception | Deepika Sharma | PDF | N/A
ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate | Shohei Taniguchi | PDF | N/A
Learning to Unify Audio, Visual and Text for Audio-Enhanced Multilingual Visual Answer Localization | Zhibin Wen | PDF | N/A
WASHtsApp -- A RAG-powered WhatsApp Chatbot for supporting rural African clean water access, sanitation and hygiene | Simon Kloker | PDF | N/A
Adversarial multi-task underwater acoustic target recognition: towards robustness against various influential factors | Yuan Xie | PDF | N/A
Dissecting the Failure of Invariant Learning on Graphs | Qixun Wang | PDF | N/A
Correlation of Object Detection Performance with Visual Saliency and Depth Estimation | Matthias Bartolo | PDF | N/A
Advances in Photoacoustic Imaging Reconstruction and Quantitative Analysis for Biomedical Applications | Lei Wang | PDF | N/A
Metaheuristics for the Template Design Problem: Encoding, Symmetry and Hybridisation | David Rodríguez Rueda | PDF | N/A
Test-Time Dynamic Image Fusion | Bing Cao | PDF | N/A
On the Comparison between Multi-modal and Single-modal Contrastive Learning | Wei Huang | PDF | N/A
Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition | Sayanta Adhikari | PDF | N/A
PersianRAG: A Retrieval-Augmented Generation System for Persian Language | Hossein Hosseini | PDF | N/A
Mixtures of In-Context Learners | Giwon Hong | PDF | N/A
CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration | Hongpeng Jin | PDF | N/A
Layer-Adaptive State Pruning for Deep State Space Models | Minseon Gwak | PDF | N/A
DroidSpeak: Enhancing Cross-LLM Communication | Yuhan Liu | PDF | N/A
LiVOS: Light Video Object Segmentation with Gated Linear Matching | Qin Liu | PDF | N/A
Conditional Vendi Score: An Information-Theoretic Approach to Diversity Evaluation of Prompt-based Generative Models | Mohammad Jalali | PDF | N/A
ChatGPT in Research and Education: Exploring Benefits and Threats | Abu Saleh Musa Miah | PDF | N/A
Artificial Intelligence-Enhanced Couinaud Segmentation for Precision Liver Cancer Therapy | Liang Qiu | PDF | N/A
Sparse Orthogonal Parameters Tuning for Continual Learning | Kun-Peng Ning | PDF | N/A
NEOviz: Uncertainty-Driven Visual Analysis of Asteroid Trajectories | Fangfei Lan | PDF | N/A
Query-Efficient Adversarial Attack Against Vertical Federated Graph Learning | Jinyin Chen | PDF | N/A
ERUP-YOLO: Enhancing Object Detection Robustness for Adverse Weather Condition by Unified Image-Adaptive Processing | Yuka Ogino | PDF | N/A
DeepContext: A Context-aware, Cross-platform, and Cross-framework Tool for Performance Profiling and Analysis of Deep Learning Workloads | Qidong Zhao | PDF | N/A
Specialized Foundation Models Struggle to Beat Supervised Baselines | Zongzhe Xu | PDF | N/A
The Evolution of RWKV: Advancements in Efficient Language Modeling | Akul Datta | PDF | N/A
Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes | Xu Han | PDF | N/A
Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning | Mingcheng Li | PDF | N/A
Language Models and Cycle Consistency for Self-Reflective Machine Translation | Jianqiao Wangni | PDF | N/A
Memory Augmented Cross-encoders for Controllable Personalized Search | Sheshera Mysore | PDF | N/A
When to Localize? A Risk-Constrained Reinforcement Learning Approach | Chak Lam Shek | PDF | N/A
Advancing Robust Underwater Acoustic Target Recognition through Multi-task Learning and Multi-Gate Mixture-of-Experts | Yuan Xie | PDF | N/A
Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment | Jason Vega | PDF | N/A
Generalization and Risk Bounds for Recurrent Neural Networks | Xuewei Cheng | PDF | N/A
BrainBits: How Much of the Brain are Generative Reconstruction Methods Using? | David Mayo | PDF | N/A
How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion | Giannis Daras | PDF | N/A
Advancing Recycling Efficiency: A Comparative Analysis of Deep Learning Models in Waste Classification | Zhanshan Qiao | PDF | N/A
Deep learning-based modularized loading protocol for parameter estimation of Bouc-Wen class models | Sebin Oh | PDF | N/A
FedBlock: A Blockchain Approach to Federated Learning against Backdoor Attacks | Duong H. Nguyen | PDF | N/A
New random projections for isotropic kernels using stable spectral distributions | Nicolas Langrené | PDF | N/A
One-Stage-TFS: Thai One-Stage Fingerspelling Dataset for Fingerspelling Recognition Frameworks | Siriwiwat Lata | PDF | N/A
A Convex Relaxation Approach to Generalization Analysis for Parallel Positively Homogeneous Networks | Uday Kiran Reddy Tadipatri | PDF | N/A
Fast, robust approximate message passing | Misha Ivkov | PDF | N/A
EcoCropsAID: Economic Crops Aerial Image Dataset for Land Use Classification | Sangdaow Noppitak | PDF | N/A
DEMONet: Underwater Acoustic Target Recognition based on Multi-Expert Network and Cross-Temporal Variational Autoencoder | Yuan Xie | PDF | N/A
Label Critic: Design Data Before Models | Pedro R. A. S. Bassi | PDF | N/A
Expressivity of deterministic quantum computation with one qubit | Yujin Kim | PDF | N/A
Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection | Yifan Wang | PDF | N/A
A Bayesian explanation of machine learning models based on modes and functional ANOVA | Quan Long | PDF | N/A
Foundation AI Model for Medical Image Segmentation | Rina Bao | PDF | N/A
An information-matching approach to optimal experimental design and active learning | Yonatan Kurniawan | PDF | N/A
Novelty-focused R&D landscaping using transformer and local outlier factor | Jaewoong Choi | PDF | N/A
DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark | Haodong Li | PDF | N/A
A Natural Language Processing Approach to Support Biomedical Data Harmonization: Leveraging Large Language Models | Zexu Li | PDF | N/A
Compositional simulation-based inference for time series | Manuel Gloeckler | PDF | N/A
Elliptical Wishart distributions: information geometry, maximum likelihood estimator, performance analysis and statistical learning | Imen Ayadi | PDF | N/A
TransUNext: towards a more advanced U-shaped framework for automatic vessel segmentation in the fundus image | Xiang Li | PDF | N/A
Multimodal Commonsense Knowledge Distillation for Visual Question Answering | Shuo Yang | PDF | N/A
CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation | Jinchao Ge | PDF | N/A
Game Plot Design with an LLM-powered Assistant: An Empirical Study with Game Designers | Seyed Hossein Alavi | PDF | N/A
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization | Yuxi Xie | PDF | N/A
Full Field Digital Mammography Dataset from a Population Screening Program | Edward Kendall | PDF | N/A
Carbon price fluctuation prediction using blockchain information A new hybrid machine learning approach | H. Wang | PDF | N/A
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios | Yunkai Dang | PDF | N/A
RT-Affordance: Affordances are Versatile Intermediate Representations for Robot Manipulation | Soroush Nasiriany | PDF | N/A
Transferable polychromatic optical encoder for neural networks | Minho Choi | PDF | N/A
JEL: Applying End-to-End Neural Entity Linking in JPMorgan Chase | Wanying Ding | PDF | N/A
Point processes with event time uncertainty | Xiuyuan Cheng | PDF | N/A
JPEC: A Novel Graph Neural Network for Competitor Retrieval in Financial Knowledge Graphs | Wanying Ding | PDF | N/A
On the loss of context-awareness in general instruction fine-tuning | Yihan Wang | PDF | N/A