| MME-Finance:一个面向专家级理解和推理的多模态金融基准 |
Ziliang Gan |
PDF |
N/A |
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning |
| 视觉-语言预训练的正确分类 |
Huang Zilong |
PDF |
N/A |
Classification Done Right for Vision-Language Pre-Training |
| 推断最优的视觉语言模型只需要一个视觉标记,但更大的模型 |
Kevin Y. Li |
PDF |
N/A |
Inference Optimal VLMs Need Only One Visual Token but Larger Models |
| 用于域生成算法检测的大型语言模型 |
Reynier Leyva La O |
PDF |
N/A |
LLMs for Domain Generation Algorithm Detection |
| VERITAS:一种统一的可靠性评估方法 |
Rajkumar Ramamurthy |
PDF |
N/A |
VERITAS: A Unified Approach to Reliability Evaluation |
| 视觉运动模仿学习中的分布外恢复与以物体为中心的关键点逆策略 |
George Jiayuan Gao |
PDF |
N/A |
Out-of-Distribution Recovery with Object-Centric Keypoint Inverse Policy For Visuomotor Imitation Learning |
| 交互生成代码:我们离自动生成网页交互还有多远? |
Jingyu Xiao |
PDF |
N/A |
Interaction2Code: How Far Are We From Automatic Interactive Webpage Generation? |
| 智能医疗的未来:基于大语言模型的机器人集成与影响系统分析与讨论 |
Souren Pashangpour |
PDF |
N/A |
The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare |
| DiT4Edit:用于图像编辑的扩散变换器 |
Kunyu Feng |
PDF |
N/A |
DiT4Edit: Diffusion Transformer for Image Editing |
| SMoA:通过稀疏代理混合提升多智能体大型语言模型 |
Dawei Li |
PDF |
N/A |
SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents |
| 机器学习模型中的无察觉防御:无检测的木马移除 |
Shafi Goldwasser |
PDF |
N/A |
Oblivious Defense in ML Models: Backdoor Removal without Detection |
| 因果责任归属在人机协作中的应用 |
Yahang Qi |
PDF |
N/A |
Causal Responsibility Attribution for Human-AI Collaboration |
| 基于图的半监督分离Lipschitz学习 |
Farid Bozorgnia |
PDF |
N/A |
Graph-Based Semi-Supervised Segregated Lipschitz Learning |
| 稳定匹配与平局:近似比率和学习 |
Shiyun Lin |
PDF |
N/A |
Stable Matching with Ties: Approximation Ratios and Learning |
| 代理信息引导的贝叶斯迁移学习与未知源 |
Sabina J. Sloman |
PDF |
N/A |
Proxy-informed Bayesian transfer learning with unknown sources |
| ShadowMamba:基于边界区域选择性扫描的阴影去除状态空间模型 |
Xiujin Zhu |
PDF |
N/A |
ShadowMamba: State-Space Model with Boundary-Region Selective Scan for Shadow Removal |
| 探索数据结构:最近邻搜索及其扩展 |
Omar Salemohamed |
PDF |
N/A |
Discovering Data Structures: Nearest Neighbor Search and Beyond |
| 基于大型语言模型社区中通过社会互动自发产生的个体性 |
Ryosuke Takata |
PDF |
N/A |
Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities |
| DiffLM:通过扩散语言模型实现可控的合成数据生成 |
Ying Zhou |
PDF |
N/A |
DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models |
| 将精细细节与全局几何结构解耦,用于压缩深度图的超分辨率 |
Huan Zheng |
PDF |
N/A |
Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution |
| 非合作可重构智能表面检测:通过深度支持向量数据描述进行扫描B测试 |
George Stamatelis |
PDF |
N/A |
On the Detection of Non-Cooperative RISs: Scan B-Testing via Deep Support Vector Data Description |
| 使用动态Dropout提高Transformer训练效率 |
Hanrui Yan |
PDF |
N/A |
Enhancing Transformer Training Efficiency with Dynamic Dropout |
| 形式逻辑引导的鲁棒联邦学习对抗投毒攻击 |
Dung Thuy Nguyen |
PDF |
N/A |
Formal Logic-guided Robust Federated Learning against Poisoning Attacks |
| Topograph:一种基于图的高效框架,用于严格保持拓扑结构的图像分割 |
Laurin Lux |
PDF |
N/A |
Topograph: An efficient Graph-Based Framework for Strictly Topology Preserving Image Segmentation |
| 在卷积神经网络(CNNs)中,核正交性并不必然意味着特征图冗余的减少:卷积相似性最小化 |
Zakariae Belmekki |
PDF |
N/A |
Kernel Orthogonality does not necessarily imply a Decrease in Feature Map Redundancy in CNNs: Convolutional Similarity Minimization |
| 驾驶场景的知识图谱:赋能神经符号人工智能的新兴能力 |
Ruwan Wickramarachchi |
PDF |
N/A |
Knowledge Graphs of Driving Scenes to Empower the Emerging Capabilities of Neurosymbolic AI |
| 通过合理逻辑回归实现医疗领域的可解释预测模型 |
Thiti Suttaket |
PDF |
N/A |
Interpretable Predictive Models for Healthcare via Rational Logistic Regression |
| 超越网格数据:探索用于地球观测的图神经网络 |
Shan Zhao |
PDF |
N/A |
Beyond Grid Data: Exploring Graph Neural Networks for Earth Observation |
| 一种个人数据风险价值评估方法 |
Luis Enriquez |
PDF |
N/A |
A Personal data Value at Risk Approach |
| GIS Copilot:迈向空间分析的自主GIS代理 |
Temitope Akinboyewa |
PDF |
N/A |
GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis |
| 在线数据收集用于高效半参数推断 |
Shantanu Gupta |
PDF |
N/A |
Online Data Collection for Efficient Semiparametric Inference |
| 月球矿物学洞察:一种无监督的月球矿物绘图仪(M3)光谱数据聚类方法 |
Freja Thoresen |
PDF |
N/A |
Insights into Lunar Mineralogy: An Unsupervised Approach for Clustering of the Moon Mineral Mapper (M3) spectral data |
| 关于扩散模型的改进调节机制和预训练策略 |
Tariq Berrada Ifriqi |
PDF |
N/A |
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models |
| 利用频谱-空间协方差特征从Ambisonics录音中进行子带声学参数的盲估计 |
Hanyu Meng |
PDF |
N/A |
Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features |
| 探索极端:大规模输出空间中的动态稀疏性 |
Nasib Ullah |
PDF |
N/A |
Navigating Extremes: Dynamic Sparsity in Large Output Space |
| 用于高效策略学习的预训练视觉动力学表示 |
Hao Luo |
PDF |
N/A |
Pre-trained Visual Dynamics Representations for Efficient Policy Learning |
| 高效的高斯态哈密顿量、结构与迹距离学习 |
Marco Fanizza |
PDF |
N/A |
Efficient Hamiltonian, structure and trace distance learning of Gaussian states |
| 一种用于城市地区地面空气温度高效估算的机器学习方法 |
Iñigo Delgado-Enales |
PDF |
N/A |
A Machine Learning Approach for the Efficient Estimation of Ground-Level Air Temperature in Urban Areas |
| 释放新型条件生成方法在新材料发现中的力量 |
Lev Novitskiy |
PDF |
N/A |
Unleashing the power of novel conditional generative approaches for new materials discovery |
| MA^2:一种基于自监督和运动增强的自编码器,用于基于步态的自动疾病检测 |
Yiqun Liu |
PDF |
N/A |
MA^2: A Self-Supervised and Motion Augmenting Autoencoder for Gait-Based Automatic Disease Detection |
| 以用户为中心的语义通信 |
Xunze Liu |
PDF |
N/A |
User Centric Semantic Communications |
| 研究快照计算机断层扫描成像光谱仪在预测葡萄糖度和pH值方面的适用性 |
Mads Svanborg Peters |
PDF |
N/A |
Investigating the Applicability of a Snapshot Computed Tomography Imaging Spectrometer for the Prediction of Brix and pH of Grapes |
| 多尺度微分几何学习在蛋白质柔性分析中的应用 |
Hongsong Feng |
PDF |
N/A |
Multiscale differential geometry learning for protein flexibility analysis |
| 对抗性线性混合MDP的近似最优动态遗憾 |
Long-Fei Li |
PDF |
N/A |
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs |
| 评估机器学习模型与临床协议的一致性,以提升解释性和护理连续性 |
Christel Sirocchi |
PDF |
N/A |
Evaluating Machine Learning Models against Clinical Protocols for Enhanced Interpretability and Continuity of Care |
| 局部病变生成在有限数据情况下的胶囊内窥镜图像数据增强中是有效的 |
Adrian B. Chłopowiec |
PDF |
N/A |
Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting |
| 原生关联变分自编码器用于多视图插补 |
Ella S. C. Orme |
PDF |
N/A |
Correlating Variational Autoencoders Natively For Multi-View Imputation |
| HFGaussian:学习具有集成人体特征的通用高斯人体 |
Arnab Dey |
PDF |
N/A |
HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features |
| 使用预训练前端进行语音分离以最小化领域不匹配 |
Wupeng Wang |
PDF |
N/A |
Speech Separation with Pretrained Frontend to Minimize Domain Mismatch |
| 自监督跨模态学习在缺乏预标注训练数据的应用中实现不确定性感知的物体检测与识别 |
Irum Mehboob |
PDF |
N/A |
Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data |
| 对于一个正在融化的RNA发夹来说,更热并不意味着更快。 |
Huaping Li |
PDF |
N/A |
Hotter isn't faster for a melting RNA hairpin |
| 《阿尔法与偏见:通过内在重加权提升α规模的最坏情况公平性》 |
Jing Li |
PDF |
N/A |
Alpha and Prejudice: Improving $α$-sized Worst-case Fairness via Intrinsic Reweighting |
| 利用分割任何模型(SAM)进行胸部X光图像中的肺部分割 |
Gabriel Bellon de Carvalho |
PDF |
N/A |
Exploiting the Segment Anything Model (SAM) for Lung Segmentation in Chest X-ray Images |
| 通过非单调自适应缩放梯度权重增强DP-SGD |
Tao Huang |
PDF |
N/A |
Enhancing DP-SGD through Non-monotonous Adaptive Scaling Gradient Weight |
| ATM:通过交替调优和合并改进模型合并 |
Luca Zhou |
PDF |
N/A |
ATM: Improving Model Merging by Alternating Tuning and Merging |
| 梯度引导的条件扩散模型用于私有图像重建:分析差分隐私和去噪的对抗性影响 |
Tao Huang |
PDF |
N/A |
Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising |
| GarVerseLOD:利用包含细节层次的数据集,从单张野外图像中实现高保真3D服装重建 |
Zhongjin Luo |
PDF |
N/A |
GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details |
| 帕金森病手写运动学和压力评估的鉴别诊断 |
Peter Drotár |
PDF |
N/A |
Evaluation of handwriting kinematics and pressure for differential diagnosis of Parkinson's disease |
| 预测校正增强型变压器与指数移动平均系数学习 |
Bei Li |
PDF |
N/A |
Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning |
| 像真正的医生一样判断:用于半监督医学图像分类的双教师样本一致性框架 |
Zhang Qixiang |
PDF |
N/A |
Judge Like a Real Doctor: Dual Teacher Sample Consistency Framework for Semi-supervised Medical Image Classification |
| 科学关键词生成的自我组合数据增强 |
Mael Houbre |
PDF |
N/A |
Self-Compositional Data Augmentation for Scientific Keyphrase Generation |
| 变压器能像人类一样闻到气味吗? |
Farzaneh Taleb |
PDF |
N/A |
Can Transformers Smell Like Humans? |
| 用于分类的遗传算法生成Alpha因子与情感(GAS)混合集成模型 |
Quechen Yang |
PDF |
N/A |
Blending Ensemble for Classification with Genetic-algorithm generated Alpha factors and Sentiments (GAS) |
| HumanVLM:人类场景视觉语言模型的基础 |
Dawei Dai |
PDF |
N/A |
HumanVLM: Foundation for Human-Scene Vision-Language Model |
| 重新思考基于Transformer的语义分割解码器:压缩即所需 |
Qishuai Wen |
PDF |
N/A |
Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need |
| 图不可知因果贝叶斯优化 |
Sumantrak Mukherjee |
PDF |
N/A |
Graph Agnostic Causal Bayesian Optimisation |
| 基于自适应遗传选择的异构车辆系统多网络非对称耦合钉扎控制 |
Weian Guo |
PDF |
N/A |
Adaptive Genetic Selection based Pinning Control with Asymmetric Coupling for Multi-Network Heterogeneous Vehicular Systems |
| DA-MoE:通过专家混合解决图级分析中的深度敏感性问题 |
Zelin Yao |
PDF |
N/A |
DA-MoE: Addressing Depth-Sensitivity in Graph-Level Analysis through Mixture of Experts |
| 闪烁后门:基于DVS摄像头的SNN现实环境后门攻击 |
Roberto Riaño |
PDF |
N/A |
Flashy Backdoor: Real-world Environment Backdoor Attack on SNNs with DVS Cameras |
| 因果推断中的测试泛化性 |
Daniel de Vassimon Manela |
PDF |
N/A |
Testing Generalizability in Causal Inference |
| FEDLAD:深度泄露攻击与防御的联邦评估 |
Isaac Baglin |
PDF |
N/A |
FEDLAD: Federated Evaluation of Deep Leakage Attacks and Defenses |
| CRT-Fusion:利用运动信息进行3D目标检测的相机、雷达、时间融合技术 |
Jisong Kim |
PDF |
N/A |
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection |
| 在代码问答中利用大型语言模型:基线方法与问题 |
Georgy Andryushchenko |
PDF |
N/A |
Leveraging Large Language Models in Code Question Answering: Baselines and Issues |
| 政策层级体系 |
Thomas P Cannon |
PDF |
N/A |
Hierarchical Orchestra of Policies |
| 数据质量意识:从传统数据管理到数据科学系统的旅程 |
Sijie Dong |
PDF |
N/A |
Data Quality Awareness: A Journey from Traditional Data Management to Data Science Systems |
| 神经网络与(虚拟)扩展公式 |
Christoph Hertrich |
PDF |
N/A |
Neural Networks and (Virtual) Extended Formulations |
| 利用大型语言模型对患者吸烟状况进行分类以控制未观测到的混杂因素 |
Samuel Lee |
PDF |
N/A |
Controlling for Unobserved Confounding with Large Language Model Classification of Patient Smoking Status |
| 精准驾驶与VLM:PRCV 2024驾驶语言模型挑战赛一等奖解决方案 |
Bin Huang |
PDF |
N/A |
Precise Drive with VLM: First Prize Solution for PRCV 2024 Drive LM challenge |
| 加速任务泛化与多层次分层选项 |
Thomas P Cannon |
PDF |
N/A |
Accelerating Task Generalisation with Multi-Level Hierarchical Options |
| PV-faultNet:优化的卷积神经网络架构,用于检测缺陷,从而实现高效的太阳能电池板生产 |
Eiffat E Zaman |
PDF |
N/A |
PV-faultNet: Optimized CNN Architecture to detect defects resulting efficient PV production |
| SUDS:一种无监督漂移采样策略 |
Christofer Fellicious |
PDF |
N/A |
SUDS: A Strategy for Unsupervised Drift Sampling |
| 高效且有效的多模态基础模型在序列推荐中的适应性 |
Junchen Fu |
PDF |
N/A |
Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation |
| 长出尾巴:提升大型语言模型输出多样性 |
Michal Shur-Ofry |
PDF |
N/A |
Growing a Tail: Increasing Output Diversity in Large Language Models |
| 多类别分类器的置信度校准 |
Adrien Le Coz |
PDF |
N/A |
Confidence Calibration of Classifiers with Many Classes |
| 使用过完备相位字典对波前进行稀疏重构 |
S. Howard |
PDF |
N/A |
Sparse Reconstruction of Wavefronts using an Over-Complete Phase Dictionary |
| 无人机协同追逃游戏的强化学习自主决策 |
Yang Zhao |
PDF |
N/A |
Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning |
| CAD-NeRF:通过CAD模型检索从未校准的少量视图图像中学习NeRF |
Xin Wen |
PDF |
N/A |
CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval |
| 基于Transformer的固定翼无人机容错控制:利用知识蒸馏与情境内适应 |
Francisco Giral |
PDF |
N/A |
Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation |
| 区域引导攻击分割任何模型(SAM) |
Xiaoliang Liu |
PDF |
N/A |
Region-Guided Attack on the Segment Anything Model (SAM) |
| [愿景文件] PRObot:利用聊天机器人和生成式人工智能提升糖尿病视网膜病变的患者报告结果测量 |
Maren Pielka |
PDF |
N/A |
[Vision Paper] PRObot: Enhancing Patient-Reported Outcome Measures for Diabetic Retinopathy using Chatbots and Generative AI |
| 探索在卫星影像三维重建中神经辐射场背景下的季节性变化 |
Liv Kåreborn |
PDF |
N/A |
Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery |
| 多模态神经辐射场自监督用于激光雷达语义分割 |
Xavier Timoneda |
PDF |
N/A |
Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation |
| 说话人情感识别:利用自监督模型进行特征提取——基于Wav2Vec2和HuBERT |
Pourya Jafarzadeh |
PDF |
N/A |
Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT |
| 将安全性嵌入强化学习:信任区域方法的新视角 |
Nikola Milosevic |
PDF |
N/A |
Embedding Safety into RL: A New Take on Trust Region Methods |
| IMUDiffusion:一种用于惯性运动捕捉系统多元时间序列合成的扩散模型 |
Heiko Oppel |
PDF |
N/A |
IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems |
| LDPM:利用MR-VAE和潜在扩散先验实现欠采样MRI重建 |
Xingjian Tang |
PDF |
N/A |
LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior |
| 一种可扩展的生成模型,用于从神经影像数据中重建动力系统 |
Eric Volkmann |
PDF |
N/A |
A scalable generative model for dynamical system reconstruction from neuroimaging data |
| 将自然语言与SQL翻译相结合,通过基于数据的自解释实现 |
Yuankai Fan |
PDF |
N/A |
Grounding Natural Language to SQL Translation with Data-Based Self-Explanations |
| 时间因果变分自编码器:稳健的金融时间序列生成器 |
Beatrice Acciaio |
PDF |
N/A |
Time-Causal VAE: Robust Financial Time Series Generator |
| 捕捉研究文献对可持续发展目标的态度:基于大语言模型的主题建模方法 |
Francesco Invernici |
PDF |
N/A |
Capturing research literature attitude towards Sustainable Development Goals: an LLM-based topic modeling approach |
| 用于时间序列预测的Mamba基础模型 |
Haoyu Ma |
PDF |
N/A |
A Mamba Foundation Model for Time Series Forecasting |
| 一种针对小型语言模型的后训练增强优化方法 |
Keke Zhai |
PDF |
N/A |
A Post-Training Enhanced Optimization Approach for Small Language Models |
| 基准测试多模态检索增强生成与动态VQA数据集和自适应规划代理 |
Yangning Li |
PDF |
N/A |
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent |
| 非洲定居点地图绘制:深度学习与卫星影像生成的高分辨率城市与乡村地图 |
Mohammad Kakooei |
PDF |
N/A |
Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery |
| P-MOSS:利用底层硬件统计信息在NUMA服务器上为索引进行学习型调度 |
Yeasir Rayhan |
PDF |
N/A |
P-MOSS: Learned Scheduling For Indexes Over NUMA Servers Using Low-Level Hardware Statistics |
| 大型语言模型中的文本美学 |
Lingjie Jiang |
PDF |
N/A |
Textual Aesthetics in Large Language Models |
| 基于隐私保护的图机器学习与全同态加密在协作反洗钱中的应用 |
Fabrianne Effendi |
PDF |
N/A |
Privacy-Preserving Graph-Based Machine Learning with Fully Homomorphic Encryption for Collaborative Anti-Money Laundering |
| 理论上保证的分布自适应学习 |
Chao Xu |
PDF |
N/A |
Theoretically Guaranteed Distribution Adaptable Learning |
| 开放集单源域泛化的域扩展与边界增长 |
Pengkun Jiao |
PDF |
N/A |
Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization |
| 探索自动驾驶中视频生成与世界模型之间的相互作用:一项综述 |
Ao Fu |
PDF |
N/A |
Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey |
| Photon:联邦式大语言模型预训练 |
Lorenzo Sani |
PDF |
N/A |
Photon: Federated LLM Pre-Training |
| 梯度下降法在非参数回归中找到具有锐利泛化能力的过参数化神经网络:一种无分布分析 |
Yingzhen Yang |
PDF |
N/A |
Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression: A Distribution-Free Analysis |
| 针对大型视觉语言模型的成员推理攻击 |
Zhan Li |
PDF |
N/A |
Membership Inference Attacks against Large Vision-Language Models |
| 油炸去卷积 |
Jerome Gilles |
PDF |
N/A |
Fried deconvolution |
| 湍流稳定化 |
Yu Mao |
PDF |
N/A |
Turbulence stabilization |
| 一种针对微分同胚医学图像配准的对称动态学习框架 |
Jinqiu Deng |
PDF |
N/A |
A Symmetric Dynamic Learning Framework for Diffeomorphic Medical Image Registration |
| 阿拉伯短篇小说中迂回表达的英译 |
Dalal Waadallah Shehab |
PDF |
N/A |
The Translation of Circumlocution in Arabic Short Stories into English |
| TokenSelect:通过动态令牌级KV缓存选择实现LLMs的高效长上下文推理和长度外推 |
Wei Wu |
PDF |
N/A |
TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection |
| 通过不确定性感知分布式对抗训练增强对抗鲁棒性 |
Junhao Dong |
PDF |
N/A |
Enhancing Adversarial Robustness via Uncertainty-Aware Distributional Adversarial Training |
| AtlasSeg:基于图谱先验引导的双U-Net用于胎儿脑部MRI中的皮层分割 |
Haoan Xu |
PDF |
N/A |
AtlasSeg: Atlas Prior Guided Dual-U-Net for Cortical Segmentation in Fetal Brain MRI |
| Graph-DPEP:基于思维图推理的少样本文档关系抽取分解式即插即用集成方法 |
Tao Zhang |
PDF |
N/A |
Graph-DPEP: Decomposed Plug and Ensemble Play for Few-Shot Document Relation Extraction with Graph-of-Thoughts Reasoning |
| 大语言模型在查询优化中的非理性有效性 |
Peter Akioyamen |
PDF |
N/A |
The Unreasonable Effectiveness of LLMs for Query Optimization |
| 基于中心性的实例感知知识蒸馏与任务互提升在无人机影像目标检测中的应用 |
Bowei Du |
PDF |
N/A |
Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery |
| 持续音频-视觉声音分离 |
Weiguo Pian |
PDF |
N/A |
Continual Audio-Visual Sound Separation |
| OLAF:增强型多对象多部件场景解析的即插即用框架 |
Pranav Gupta |
PDF |
N/A |
OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing |
| 通过年内时间序列分析贫困:小波变换方法 |
Mohammad Kakooei |
PDF |
N/A |
Analyzing Poverty through Intra-Annual Time-Series: A Wavelet Transform Approach |
| SpiDR:一种可重构的基于事件感知的数字存内计算脉冲神经网络加速器 |
Deepika Sharma |
PDF |
N/A |
SpiDR: A Reconfigurable Digital Compute-in-Memory Spiking Neural Network Accelerator for Event-based Perception |
| ADOPT:改进的Adam在任何$β_2$下都能以最优速率收敛 |
Shohei Taniguchi |
PDF |
N/A |
ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate |
| 学习统一音频、视觉和文本,以实现音频增强的多语言视觉答案定位 |
Zhibin Wen |
PDF |
N/A |
Learning to Unify Audio, Visual and Text for Audio-Enhanced Multilingual Visual Answer Localization |
| WASHtsApp -- 一个基于RAG技术的WhatsApp聊天机器人,旨在支持非洲农村地区的清洁水资源获取、卫生设施和卫生习惯的推广。 |
Simon Kloker |
PDF |
N/A |
WASHtsApp -- A RAG-powered WhatsApp Chatbot for supporting rural African clean water access, sanitation and hygiene |
| 对抗性多任务水下声学目标识别:针对各种影响因素的鲁棒性研究 |
Yuan Xie |
PDF |
N/A |
Adversarial multi-task underwater acoustic target recognition: towards robustness against various influential factors |
| 剖析图上不变学习的失败之处 |
Qixun Wang |
PDF |
N/A |
Dissecting the Failure of Invariant Learning on Graphs |
| 目标检测性能与视觉显著性和深度估计的相关性 |
Matthias Bartolo |
PDF |
N/A |
Correlation of Object Detection Performance with Visual Saliency and Depth Estimation |
| 光声成像重建与定量分析在生物医学应用中的进展 |
Lei Wang |
PDF |
N/A |
Advances in Photoacoustic Imaging Reconstruction and Quantitative Analysis for Biomedical Applications |
| 元启发式算法在模板设计问题中的应用:编码、对称性与混合化 |
David Rodríguez Rueda |
PDF |
N/A |
Metaheuristics for the Template Design Problem: Encoding, Symmetry and Hybridisation |
| 测试时动态图像融合 |
Bing Cao |
PDF |
N/A |
Test-Time Dynamic Image Fusion |
| 多模态与单模态对比学习的比较 |
Wei Huang |
PDF |
N/A |
On the Comparison between Multi-modal and Single-modal Contrastive Learning |
| 迷失在上下文中:上下文对目标识别特征归因方法的影响 |
Sayanta Adhikari |
PDF |
N/A |
Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition |
| PersianRAG:一个针对波斯语的检索增强生成系统 |
Hossein Hosseini |
PDF |
N/A |
PersianRAG: A Retrieval-Augmented Generation System for Persian Language |
| 上下文学习者的混合体 |
Giwon Hong |
PDF |
N/A |
Mixtures of In-Context Learners |
| CE-CoLLM:通过云边协同实现高效且自适应的大语言模型 |
Hongpeng Jin |
PDF |
N/A |
CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration |
| 深度状态空间模型的层级自适应状态剪枝 |
Minseon Gwak |
PDF |
N/A |
Layer-Adaptive State Pruning for Deep State Space Models |
| DroidSpeak:增强跨大型语言模型通信 |
Yuhan Liu |
PDF |
N/A |
DroidSpeak: Enhancing Cross-LLM Communication |
| LiVOS:基于门控线性匹配的轻量级视频目标分割 |
Qin Liu |
PDF |
N/A |
LiVOS: Light Video Object Segmentation with Gated Linear Matching |
| 条件Vendi得分:一种基于信息论的生成模型提示多样性评估方法 |
Mohammad Jalali |
PDF |
N/A |
Conditional Vendi Score: An Information-Theoretic Approach to Diversity Evaluation of Prompt-based Generative Models |
| ChatGPT在研究和教育中的应用:探索其利与弊 |
Abu Saleh Musa Miah |
PDF |
N/A |
ChatGPT in Research and Education: Exploring Benefits and Threats |
| 人工智能增强的Couinaud分段用于精准肝癌治疗 |
Liang Qiu |
PDF |
N/A |
Artificial Intelligence-Enhanced Couinaud Segmentation for Precision Liver Cancer Therapy |
| 用于持续学习的稀疏正交参数调优 |
Kun-Peng Ning |
PDF |
N/A |
Sparse Orthogonal Parameters Tuning for Continual Learning |
| NEOviz:不确定性驱动的近地小行星轨迹可视化分析 |
Fangfei Lan |
PDF |
N/A |
NEOviz: Uncertainty-Driven Visual Analysis of Asteroid Trajectories |
| 查询效率高的对抗攻击垂直联邦图学习 |
Jinyin Chen |
PDF |
N/A |
Query-Efficient Adversarial Attack Against Vertical Federated Graph Learning |
| ERUP-YOLO:通过统一图像自适应处理增强恶劣天气条件下的目标检测鲁棒性 |
Yuka Ogino |
PDF |
N/A |
ERUP-YOLO: Enhancing Object Detection Robustness for Adverse Weather Condition by Unified Image-Adaptive Processing |
| DeepContext:一个面向深度学习工作负载的性能剖析与分析工具,具备上下文感知、跨平台和跨框架的特性。 |
Qidong Zhao |
PDF |
N/A |
DeepContext: A Context-aware, Cross-platform, and Cross-framework Tool for Performance Profiling and Analysis of Deep Learning Workloads |
| 专门化的基础模型难以超越有监督的基线模型 |
Zongzhe Xu |
PDF |
N/A |
Specialized Foundation Models Struggle to Beat Supervised Baselines |
| RWKV的演变:高效语言建模的进步 |
Akul Datta |
PDF |
N/A |
The Evolution of RWKV: Advancements in Efficient Language Modeling |
| 实时文本检测与交通、工业及自然场景中的相似掩码 |
Xu Han |
PDF |
N/A |
Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes |
| 面向鲁棒的不完全多模态情感分析的分层表示学习 |
Mingcheng Li |
PDF |
N/A |
Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning |
| 语言模型与循环一致性在自反式机器翻译中的应用 |
Jianqiao Wangni |
PDF |
N/A |
Language Models and Cycle Consistency for Self-Reflective Machine Translation |
| 用于可控个性化搜索的记忆增强交叉编码器 |
Sheshera Mysore |
PDF |
N/A |
Memory Augmented Cross-encoders for Controllable Personalized Search |
| 何时进行本地化?一种基于风险约束的强化学习方法 |
Chak Lam Shek |
PDF |
N/A |
When to Localize? A Risk-Constrained Reinforcement Learning Approach |
| 通过多任务学习和多门混合专家系统推进水下声学目标识别的稳健性 |
Yuan Xie |
PDF |
N/A |
Advancing Robust Underwater Acoustic Target Recognition through Multi-task Learning and Multi-Gate Mixture-of-Experts |
| 随机猴子玩耍:廉价随机增强破坏大型语言模型安全性对齐 |
Jason Vega |
PDF |
N/A |
Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment |
| 循环神经网络的泛化与风险界定 |
Xuewei Cheng |
PDF |
N/A |
Generalization and Risk Bounds for Recurrent Neural Networks |
| 脑波:生成重建方法使用了大脑的多少部分? |
David Mayo |
PDF |
N/A |
BrainBits: How Much of the Brain are Generative Reconstruction Methods Using? |
| 嘈杂图像的价值是多少?环境扩散的数据缩放法则 |
Giannis Daras |
PDF |
N/A |
How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion |
| 提高回收效率:深度学习模型在废物分类中的比较分析 |
Zhanshan Qiao |
PDF |
N/A |
Advancing Recycling Efficiency: A Comparative Analysis of Deep Learning Models in Waste Classification |
| 基于深度学习的模块化加载协议用于Bouc-Wen类模型参数估计 |
Sebin Oh |
PDF |
N/A |
Deep learning-based modularized loading protocol for parameter estimation of Bouc-Wen class models |
| FedBlock:一种针对后门攻击的联邦学习区块链方法 |
Duong H. Nguyen |
PDF |
N/A |
FedBlock: A Blockchain Approach to Federated Learning against Backdoor Attacks |
| 各向同性核的新随机投影使用稳定谱分布 |
Nicolas Langrené |
PDF |
N/A |
New random projections for isotropic kernels using stable spectral distributions |
| One-Stage-TFS:用于手指拼写识别框架的泰语单阶段手指拼写数据集 |
Siriwiwat Lata |
PDF |
N/A |
One-Stage-TFS: Thai One-Stage Fingerspelling Dataset for Fingerspelling Recognition Frameworks |
| 一种用于平行正齐次网络泛化分析的凸松弛方法 |
Uday Kiran Reddy Tadipatri |
PDF |
N/A |
A Convex Relaxation Approach to Generalization Analysis for Parallel Positively Homogeneous Networks |
| 快速、鲁棒的近似消息传递 |
Misha Ivkov |
PDF |
N/A |
Fast, robust approximate message passing |
| EcoCropsAID:用于土地利用分类的经济作物航空图像数据集 |
Sangdaow Noppitak |
PDF |
N/A |
EcoCropsAID: Economic Crops Aerial Image Dataset for Land Use Classification |
| DEMONet:基于多专家网络和跨时变分自编码器的水下声学目标识别 |
Yuan Xie |
PDF |
N/A |
DEMONet: Underwater Acoustic Target Recognition based on Multi-Expert Network and Cross-Temporal Variational Autoencoder |
| 标签评论家:在模型之前设计数据 |
Pedro R. A. S. Bassi |
PDF |
N/A |
Label Critic: Design Data Before Models |
| 单量子比特确定性量子计算的表达能力 |
Yujin Kim |
PDF |
N/A |
Expressivity of deterministic quantum computation with one qubit |
| 高效特征聚合与尺度感知回归在单目三维物体检测中的应用 |
Yifan Wang |
PDF |
N/A |
Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection |
| 基于模式和函数方差分析的机器学习模型的贝叶斯解释 |
Quan Long |
PDF |
N/A |
A Bayesian explanation of machine learning models based on modes and functional ANOVA |
| 医学图像分割的基础AI模型 |
Rina Bao |
PDF |
N/A |
Foundation AI Model for Medical Image Segmentation |
| 一种基于信息匹配的最优实验设计和主动学习方法 |
Yonatan Kurniawan |
PDF |
N/A |
An information-matching approach to optimal experimental design and active learning |
| 基于新颖性聚焦的研发景观分析:结合Transformer与局部异常因子 |
Jaewoong Choi |
PDF |
N/A |
Novelty-focused R&D landscaping using transformer and local outlier factor |
| DDFAV:遥感大视觉语言模型数据集与评估基准 |
Haodong Li |
PDF |
N/A |
DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark |
| 一种支持生物医学数据协调的自然语言处理方法:利用大型语言模型 |
Zexu Li |
PDF |
N/A |
A Natural Language Processing Approach to Support Biomedical Data Harmonization: Leveraging Large Language Models |
| 基于组合模拟的时间序列推理 |
Manuel Gloeckler |
PDF |
N/A |
Compositional simulation-based inference for time series |
| 椭圆Wishart分布:信息几何、极大似然估计、性能分析与统计学习 |
Imen Ayadi |
PDF |
N/A |
Elliptical Wishart distributions: information geometry, maximum likelihood estimator, performance analysis and statistical learning |
| TransUNext:迈向更先进的U形框架,用于眼底图像中的自动血管分割 |
Xiang Li |
PDF |
N/A |
TransUNext: towards a more advanced U-shaped framework for automatic vessel segmentation in the fundus image |
| 用于视觉问答的多模态常识知识蒸馏 |
Shuo Yang |
PDF |
N/A |
Multimodal Commonsense Knowledge Distillation for Visual Question Answering |
| CIT:重新思考类增量语义分割与类独立变换 |
Jinchao Ge |
PDF |
N/A |
CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation |
| 基于大型语言模型辅助的游戏剧情设计:与游戏设计师的实证研究 |
Seyed Hossein Alavi |
PDF |
N/A |
Game Plot Design with an LLM-powered Assistant: An Empirical Study with Game Designers |
| V-DPO:通过视觉引导的直接偏好优化来减轻大型视觉语言模型中的幻觉现象 |
Yuxi Xie |
PDF |
N/A |
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization |
| 全视野数字乳腺摄影数据集来自一项人群筛查计划 |
Edward Kendall |
PDF |
N/A |
Full Field Digital Mammography Dataset from a Population Screening Program |
| 利用区块链信息进行碳价波动预测:一种新的混合机器学习方法 |
H. Wang |
PDF |
N/A |
Carbon price fluctuation prediction using blockchain information A new hybrid machine learning approach |
| 探索多语言大语言模型中的响应不确定性:在误导场景下的实证评估 |
Yunkai Dang |
PDF |
N/A |
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios |
| RT-Affordance:Affordances 是机器人操作的多功能中间表示 |
Soroush Nasiriany |
PDF |
N/A |
RT-Affordance: Affordances are Versatile Intermediate Representations for Robot Manipulation |
| 可转移的多色光学编码器用于神经网络 |
Minho Choi |
PDF |
N/A |
Transferable polychromatic optical encoder for neural networks |
| JEL:在摩根大通应用端到端神经实体链接 |
Wanying Ding |
PDF |
N/A |
JEL: Applying End-to-End Neural Entity Linking in JPMorgan Chase |
| 具有事件时间不确定性的点过程 |
Xiuyuan Cheng |
PDF |
N/A |
Point processes with event time uncertainty |
| JPEC:一种用于金融知识图谱中竞争对手检索的新型图神经网络 |
Wanying Ding |
PDF |
N/A |
JPEC: A Novel Graph Neural Network for Competitor Retrieval in Financial Knowledge Graphs |
| 在通用指令微调中失去上下文感知能力 |
Yihan Wang |
PDF |
N/A |
On the loss of context-awareness in general instruction fine-tuning |