跳转至

Arxiv 2024-10-21 Papers

标题 作者 PDF链接 代码仓库 Title
FrugalNeRF:无需学习先验知识,快速收敛的小样本新视角合成方法 Chin-Yang Lin PDF N/A FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors
MvDrag3D:基于拖拽的多视角生成-重构先验的创意3D编辑 Honghua Chen PDF N/A MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors
反思-长凳:通过反思探究人工智能的智能 Lingyu Li PDF N/A Reflection-Bench: probing AI intelligence with reflection
SAM2Long:利用无训练记忆树增强SAM 2的长视频分割能力 Shuangrui Ding PDF N/A SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
xGen-MM-Vid (BLIP-3-Video): 即使在视觉语言模型中,你只需要32个标记就能表示一段视频 Michael S. Ryoo PDF N/A xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs
3DGS-Enhancer:通过视图一致的2D扩散先验增强无界3D高斯喷洒 Xi Liu PDF N/A 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors
Mini-InternVL:一个灵活迁移的口袋多模态模型,参数减少5%,性能保持90% Zhangwei Gao PDF N/A Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
代理对模拟:从随意的纵向视频中学习交互行为模型 Gengshan Yang PDF N/A Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos
阐明用于图像生成的语言模型的设计空间 Xuantong Liu PDF N/A Elucidating the design space of language models for image generation
指南针评判者-1:一体化评判模型助力模型评估与进化 Maosong Cao PDF N/A CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution
重新审视深度特征重构在逻辑和结构工业异常检测中的应用 Sukanya Patra PDF N/A Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection
具有有效输出的分布学习超越最坏情况 Nick Rittler PDF N/A Distribution Learning with Valid Outputs Beyond the Worst-Case
知识编辑真的能纠正幻觉吗? Baixiang Huang PDF N/A Can Knowledge Editing Really Correct Hallucinations?
通过梯度下降实现管状张量分解的隐式正则化 Santhosh Karnik PDF N/A Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent
分析基于大型语言模型的机器翻译中的上下文贡献 Emmanouil Zaranis PDF N/A Analyzing Context Contributions in LLM-based Machine Translation
MoRE:在X光片、心电图和诊断报告上使用多模态对比预训练的Transformer模型 Samrajya Thapa PDF N/A MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report
多中心MRI临床显著性前列腺癌深度放射组学检测:初步对比PI-RADS评估 G. A. Nketiah PDF N/A Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment
IBGP:通信多智能体系统中零样本鲁棒性的不完全拜占庭将军问题 Yihuan Mao PDF N/A IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems
LLaVA-KD:一种多模态大语言模型蒸馏框架 Yuxuan Cai PDF N/A LLaVA-KD: A Framework of Distilling Multimodal Large Language Models
ToW:词语思考提升大型语言模型的推理能力 Zhikun Xu PDF N/A ToW: Thoughts of Words Improve Reasoning in Large Language Models
Sketch2Code:评估视觉语言模型在交互式网页设计原型制作中的应用 Ryan Li PDF N/A Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping
通过检索增强语言模型构建编码助手 Xinze Li PDF N/A Building A Coding Assistant via the Retrieval-Augmented Language Model
管理带宽:云辅助自动驾驶的关键 Alexander Krentsel PDF N/A Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving
大型语言模型越狱的现实威胁模型 Valentyn Boreiko PDF N/A A Realistic Threat Model for Large Language Model Jailbreaks
在医疗领域创建英泰代码转换机器翻译 Parinthapat Pengpun PDF N/A On Creating an English-Thai Code-switched Machine Translation in Medical Domain
大型语言模型预训练蒸馏:设计空间探索 Hao Peng PDF N/A Pre-training Distillation for Large Language Models: A Design Space Exploration
全面基准测试大型语言模型用于RNA二级结构预测 L. I. Zablocki PDF N/A Comprehensive benchmarking of large language models for RNA secondary structure prediction
计算约束的数据选择 Junjie Oscar Yin PDF N/A Compute-Constrained Data Selection
CoT-TL:利用思维链推理进行低资源规划指令的时间知识表示 Kumar Manas PDF N/A CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning
系统综述:用于社交媒体心理健康检测的机器学习与深度学习中的文本处理算法 Yuchen Cao PDF N/A Systematic Review: Text Processing Algorithms in Machine Learning and Deep Learning for Mental Health Detection on Social Media
在过度参数化时代中集成方法的理论局限性 Niclas Dern PDF N/A Theoretical Limitations of Ensembles in the Age of Overparameterization
改进视觉语言模型链式思维推理 Ruohong Zhang PDF N/A Improve Vision Language Model Chain-of-thought Reasoning
LASER:自主代理执行脚本以实现按需交通模拟 Hao Gao PDF N/A LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation
对话生成信息:利用知识图谱的建议 Alex Clay PDF N/A Information for Conversation Generation: Proposals Utilising Knowledge Graphs
一种用于图形化Stein变分推理的信赖域方法 Liam Pavlovic PDF N/A A Trust-Region Method for Graphical Stein Variational Inference
利用人类视觉显著性训练更好的深度学习模型 Aidan Boyd PDF N/A Training Better Deep Learning Models Using Human Saliency
多语言基准测试的污染报告 Sanchit Ahuja PDF N/A Contamination Report for Multilingual Benchmarks
RM-Bench:以微妙和风格为语言模型的奖励模型进行基准测试 Yantao Liu PDF N/A RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
魔法猪:LSH采样,用于高效的大型语言模型生成 Zhuoming Chen PDF N/A MagicPIG: LSH Sampling for Efficient LLM Generation
一个利用合成图像协变量和纵向数据评估预测模型的框架 Simon Deltadahl PDF N/A A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data
脉冲神经网络作为涌现群体代理的控制器 Kevin Zhu PDF N/A Spiking Neural Networks as a Controller for Emergent Swarm Agents
学习如何按原则投票:神经网络集体决策的公理性洞察 Levin Hornischer PDF N/A Learning How to Vote With Principles: Axiomatic Insights Into the Collective Decisions of Neural Networks
身体活动、蛋白质摄入与睡眠质量在肌肉蛋白质合成中的相互作用 Ayush Devkota PDF N/A The Interplay Between Physical Activity, Protein Consumption, and Sleep Quality in Muscle Protein Synthesis
探索通过主动遗忘来改进解码器语言模型的跨语言迁移的预训练方法 Divyanshu Aggarwal PDF N/A Exploring Pretraining via Active Forgetting for Improving Cross Lingual Transfer for Decoder Language Models
超越过滤:面向多模态大语言模型预训练的自适应图文质量增强 Han Huang PDF N/A Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining
从标记到材料:利用语言模型助力科学发现 Yuwei Wan PDF N/A From Tokens to Materials: Leveraging Language Models for Scientific Discovery
生成式人工智能辅助医学培训 Stefan Fritsch PDF N/A GenAI Assisting Medical Training
Griffon-G:通过大型多模态模型连接视觉-语言与视觉中心任务 Yufei Zhan PDF N/A Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models
Sparkle:掌握视觉语言模型中的基本空间能力可激发对复合空间推理的泛化能力 Yihong Tang PDF N/A Sparkle: Mastering Basic Spatial Capabilities in Vision Language Models Elicits Generalization to Composite Spatial Reasoning
DMM:使用打包秘密共享的差分隐私联邦学习分布式矩阵机制 Alexander Bienstock PDF N/A DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing
度量作为变换:探索超越仿射变换的可解释神经网络 Suman Sapkota PDF N/A Metric as Transform: Exploring beyond Affine Transform for Interpretable Neural Network
网络:复杂性的视觉语言 Blai Vidiella PDF N/A Networks: The Visual Language of Complexity
林北不讲:新式英语标注的挑战 Lynnette Hui Xian Ng PDF N/A Limpeh ga li gong: Challenges in Singlish Annotations
一个具有传染性越狱能力的捣乱者在诚实的小镇中制造了混乱 Tianyi Men PDF N/A A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns
有限数据下持续学习的无监督重放策略 Anthony Bazhenov PDF N/A Unsupervised Replay Strategies for Continual Learning with Limited Data
泛亚:一个完全开放的多语言多模态大语言模型,支持39种语言 Xiang Yue PDF N/A Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
扭曲扩散:利用图像扩散模型解决视频逆问题 Giannis Daras PDF N/A Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
小贡献,小网络:基于相对重要性的高效神经网络剪枝 Mostafa Hussien PDF N/A Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance
在教师-学生设置中使用受限玻尔兹曼机进行结构化数据学习的建模 Robin Thériault PDF N/A Modelling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting
PODTILE:利用自动生成的章节促进播客剧集浏览 Azin Ghazimatin PDF N/A PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters
面向对抗领域泛化中的频率简单性偏置学习 Xilin He PDF N/A Towards Combating Frequency Simplicity-biased Learning for Domain Generalization
1-bit AI 基础设施:第1.1部分,在CPU上快速且无损的BitNet b1.58推理 Jinheng Wang PDF N/A 1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs
一种基于可解释对比的扩张卷积网络,结合Transformer用于儿童肺炎检测 Chandravardhan Singh Raghaw PDF N/A An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection
语言模型对论元角色敏感性的心理语言学评估 Eun-Kyoung Rosa Lee PDF N/A A Psycholinguistic Evaluation of Language Models' Sensitivity to Argument Roles
图学习中线图变换的理论洞察 Fan Yang PDF N/A Theoretical Insights into Line Graph Transformation on Graph Learning
通过结合自然视频刺激和与刺激无关的潜在因素来建模动态神经活动 Finn Schmidt PDF N/A Modeling dynamic neural activity by combining naturalistic video stimuli and stimulus-independent latent factors
超越2:4:探索V:N:M稀疏性以在GPU上实现高效的Transformer推理 Kang Zhao PDF N/A Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs
一种数据驱动的群体模拟框架,结合了物理信息机器学习与导航势场 Runkang Guo PDF N/A A Data-driven Crowd Simulation Framework Integrating Physics-informed Machine Learning with Navigation Potential Fields
大型音频-语言模型真的能听懂吗?通过多任务评估和逐步音频推理解决幻觉问题 Chun-Yi Kuan PDF N/A Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning
SMART:用于推理任务的自学习元策略代理 Rongxing Liu PDF N/A SMART: Self-learning Meta-strategy Agent for Reasoning Tasks
MNIST-Nd:一组用于跨维度基准聚类的自然主义数据集 Polina Turishcheva PDF N/A MNIST-Nd: a set of naturalistic datasets to benchmark clustering across dimensions
分子机器学习中无监督训练集选择的整数线性规划 Matthieu Haeberle PDF N/A Integer linear programming for unsupervised training set selection in molecular machine learning
从大型语言模型中提取时空数据 Lele Zheng PDF N/A Extracting Spatiotemporal Data from Gradients with Large Language Models
SeaDAG:用于有条件有向无环图生成的半自回归扩散模型 Xinyi Zhou PDF N/A SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation
多模态耀斑预测与深度学习 Grégoire Francisco PDF N/A Multimodal Flare Forecasting with Deep Learning
通过近似人类视觉显著性来提高神经网络的可解释性 Aidan Boyd PDF N/A Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency
大型语言模型写作是否像人类?语法和修辞风格的变化 Alex Reinhart PDF N/A Do LLMs write like humans? Variation in grammatical and rhetorical styles
线性函数逼近下的时序差分学习的统计推断 Weichen Wu PDF N/A Statistical Inference for Temporal Difference Learning with Linear Function Approximation
通过多级深度学习解决深度神经网络的光谱偏差问题 Ronglong Fang PDF N/A Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep Learning
LDAdam:从低维梯度统计中自适应优化 Thomas Robert PDF N/A LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
ExDBN:动态贝叶斯网络的精确学习 Pavel Rytíř PDF N/A ExDBN: Exact learning of Dynamic Bayesian Networks
LMHaze:基于强度感知的图像去雾方法,采用大规模多强度真实雾霾数据集 Ruikun Zhang PDF N/A LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset
CHESS最终报告:面向科学和安全的云、高性能计算与边缘计算 Nathan Tallent PDF N/A Final Report for CHESS: Cloud, High-Performance Computing, and Edge for Science and Security
用于驱动耗散量子动力学的神经量子传播器 Jiaji Zhang PDF N/A Neural Quantum Propagators for Driven-Dissipative Quantum Dynamics
分析语言模型在知识冲突下的残差流 Yu Zhao PDF N/A Analysing the Residual Stream of Language Models Under Knowledge Conflicts
基于图像和雷达数据特征图的无人机分类多传感器融合 Nikos Sakellariou PDF N/A Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data
微调大型语言模型以提供可靠的医疗问答服务 Ali Anaissi PDF N/A Fine-Tuning LLMs for Reliable Medical Question-Answering Services
基于流生成模型的车辆轨迹预测关键示例挖掘 Zhezhang Ding PDF N/A Critical Example Mining for Vehicle Trajectory Prediction using Flow-based Generative Models
CartesianMoE:通过专家混合中的笛卡尔积路由提升专家间的知识共享 Zhenpeng Su PDF N/A CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts
对抗训练中的正则化几何:高维渐近性和泛化界限 Matteo Vilucchio PDF N/A On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds
中小企业设备上的大型语言模型:挑战与机遇 Jeremy Stephen Gabriel Yee Zhi Wen PDF N/A On-Device LLMs for SMEs: Challenges and Opportunities
滚动语言模型(LLMs)在习语理解上的骰子:它们如何未能把握语境 Maggie Mi PDF N/A Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context
使用随机滴定常数-pH元动力学模拟对RNA寡聚体进行表征 Tomas F. D. Silva PDF N/A Characterizing RNA oligomers using Stochastic Titration Constant-pH Metadynamics simulations
基于半监督学习的小样本实例分割的综合图像-文本方法 Ruting Chi PDF N/A Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation
惊喜!统一信息密度并非全部:预测长篇话语中的意外轮廓 Eleftheria Tsipidi PDF N/A Surprise! Uniform Information Density Isn't the Whole Story: Predicting Surprisal Contours in Long-form Discourse
通过混合监督进行标签填充以从噪声标注中进行医学图像分割 Ming Li PDF N/A Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations
非平稳核化多臂老虎机的近似最优算法 Shogo Iwazaki PDF N/A Near-Optimal Algorithm for Non-Stationary Kernelized Bandits
大型语言模型知道该说什么,但不知道何时该说话。 Muhammad Umair PDF N/A Large Language Models Know What To Say But Not When To Speak
用于群中相容算子哈密顿量分解的GFlowNets Isaac L. Huidobro-Meezs PDF N/A GFlowNets for Hamiltonian decomposition in groups of compatible operators
基准化病理学基础模型:适应策略与场景 Jeaung Lee PDF N/A Benchmarking Pathology Foundation Models: Adaptation Strategies and Scenarios
通过鲁棒视觉特征和高级注意力机制改进多标签原子活动识别 @ ROAD++ 原子活动识别 2024 Jiamin Cao PDF N/A Improving the Multi-label Atomic Activity Recognition by Robust Visual Feature and Advanced Attention @ ROAD++ Atomic Activity Recognition 2024
TimeMixer++:一种通用的时间序列模式机器,用于普遍的预测分析 Shiyu Wang PDF N/A TimeMixer++: A General Time Series Pattern Machine for Universal Predictive Analysis
自然GaLore:加速GaLore以实现内存高效的LLM训练与微调 Arijit Das PDF N/A Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning
基于开放词汇目标检测模型的少样本目标驱动实例检测 Ben Crulis PDF N/A Few-shot target-driven instance detection based on open-vocabulary object detection models
ComPO:社区对语言模型个性化的偏好 Sachin Kumar PDF N/A ComPO: Community Preferences for Language Model Personalization
解决SMAC任务的新方法:从大型语言模型生成决策树代码 Yue Deng PDF N/A A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models
开始:一种具有显著性驱动令牌感知变换的广义状态空间模型 Jintao Guo PDF N/A START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation
使用RGB卷积神经网络的多光谱纹理合成 Sélim Ollivier PDF N/A Multispectral Texture Synthesis using RGB Convolutional Neural Networks
基于对偶的信息论极小极大后悔界限用于强化学习 Raghav Bongole PDF N/A Information-Theoretic Minimax Regret Bounds for Reinforcement Learning based on Duality
Massimo:基于质量-弹簧模型的公共队列监控与管理 Abhijeet Kumar PDF N/A Massimo: Public Queue Monitoring and Management using Mass-Spring Model
CA*:解决计算感知延迟在同时语音翻译中的评估陷阱 Xi Xu PDF N/A CA*: Addressing Evaluation Pitfalls in Computation-Aware Latency for Simultaneous Speech Translation
3D-GANTex:基于StyleGAN3的多视图图像和3DDFA网格生成的3D人脸重建 Rohit Das PDF N/A 3D-GANTex: 3D Face Reconstruction with StyleGAN3-based Multi-View Images and 3DDFA based Mesh Generation
在拓扑结构不准确的情况下,弹性时间图卷积网络用于智能电网状态估计 Seyed Hamed Haghshenas PDF N/A Resilient Temporal GCN for Smart Grid State Estimation Under Topology Inaccuracies
语言模型输出的对数概率是否经过校准? Charles Lovering PDF N/A Are Language Model Logits Calibrated?
探索持续微调以提升大型语言模型的语言能力 Divyanshu Aggarwal PDF N/A Exploring Continual Fine-Tuning for Enhancing Language Ability in Large Language Model
通过基于SAE的表示工程引导LLMs的知识选择行为 Yu Zhao PDF N/A Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
在SMM4H 2024的1024m任务3、5和6中:用于医学文本分类的Transformer和大型语言模型集成 Ram Mohan Rao Kadiyala PDF N/A 1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification
MultiRC:联合学习用于多尺度重构对比的时间序列异常预测与检测 Shiyan Hu PDF N/A MultiRC: Joint Learning for Time Series Anomaly Prediction and Detection with Multi-scale Reconstructive Contrast
利用基于大语言模型的自然语言推理增强法律决策支持系统,以分析社交媒体证据 Ram Mohan Rao Kadiyala PDF N/A Augmenting Legal Decision Support Systems with LLM-based NLI for Analyzing Social Media Evidence
分析自动驾驶高速公路驾驶模拟中用于真实交通代理模型训练的闭环训练技术 Matthias Bitzer PDF N/A Analyzing Closed-loop Training Techniques for Realistic Traffic Agent Models in Autonomous Highway Driving Simulations
一个定量的Robbins-Siegmund定理 Morenikeji Neri PDF N/A A quantitative Robbins-Siegmund theorem
使用稀疏DEIM和循环神经网络的状态估计 Mohammad Farazmand PDF N/A State Estimation Using Sparse DEIM and Recurrent Neural Networks
多模态先验知识引导的视觉表示学习 Hongkuan Zhou PDF N/A Visual Representation Learning Guided By Multi-modal Prior Knowledge
在长尾学习中,粒度至关重要 Shizhen Zhao PDF N/A Granularity Matters in Long-Tail Learning
PROMPTHEUS:一种以人为中心的管道,利用大型语言模型简化系统文献综述流程 João Pedro Fernandes Torres PDF N/A PROMPTHEUS: A Human-Centered Pipeline to Streamline SLRs with LLMs
在忆阻器交叉阵列上实现大型语言模型的能效部署:大与小的协同作用 Zhehui Wang PDF N/A Enabling Energy-Efficient Deployment of Large Language Models on Memristor Crossbar: A Synergy of Large and Small
用于跨语言情感检测的大型语言模型 Ram Mohan Rao Kadiyala PDF N/A Large Language Models for Cross-lingual Emotion Detection
卡鲁什-库恩-塔克条件训练神经网络(KKT Nets) Shreya Arvind PDF N/A Karush-Kuhn-Tucker Condition-Trained Neural Networks (KKT Nets)
利用深度先验组件从单张图像进行零样本场景重建 Junsheng Zhou PDF N/A Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly
基于文档的对话中的政策驱动知识选择与回复生成 Longxuan Ma PDF N/A Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue
自解释关键词赋能大型语言模型进行代码生成 Lishui Fan PDF N/A Self-Explained Keywords Empower Large Language Models for Code Generation
系统探索对话摘要方法:可重复性、比较评估及推进自然语言处理在抽象摘要中的方法论创新 Yugandhar Reddy Gogireddy PDF N/A Systematic Exploration of Dialogue Summarization Approaches for Reproducibility, Comparative Assessment, and Methodological Innovations for Advancing Natural Language Processing in Abstractive Summarization
莫扎地图矢量化中的范式转变:人机协作方法 Mahir Shahriar Dhrubo PDF N/A A Paradigm Shift in Mouza Map Vectorization: A Human-Machine Collaboration Approach
现代云计算中的AI驱动创新 Animesh Kumar PDF N/A AI-Driven Innovations in Modern Cloud Computing
扩散变换器策略 Zhi Hou PDF N/A Diffusion Transformer Policy
CamI2V:相机控制的图像到视频扩散模型 Guangcong Zheng PDF N/A CamI2V: Camera-Controlled Image-to-Video Diffusion Model
大型语言模型是否带有英语口音?评估和提升多语言LLM的自然性 Yanzhu Guo PDF N/A Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs
TS-ACL:一种用于隐私保护和类增量模式识别的时间序列分析持续学习框架 Kejia Fan PDF N/A TS-ACL: A Time Series Analytic Continual Learning Framework for Privacy-Preserving and Class-Incremental Pattern Recognition
以用户为中心的AI可解释性评估:人与AI协同的全面实证研究 Szymon Bobek PDF N/A User-centric evaluation of explainability of AI with and for humans: a comprehensive empirical study
重新定义金融:人工智能(AI)与机器学习(ML)的影响 Animesh Kumar PDF N/A Redefining Finance: The Influence of Artificial Intelligence (AI) and Machine Learning (ML)
第三届多语言指代消解共享任务的结果 Michal Novák PDF N/A Findings of the Third Shared Task on Multilingual Coreference Resolution
青光眼检测的AI驱动方法 -- 全面综述 Yuki Hagiwara PDF N/A AI-Driven Approaches for Glaucoma Detection -- A Comprehensive Review
从PDF开发基于检索增强生成(RAG)的大型语言模型系统:一份经验报告 Ayman Asad Khan PDF N/A Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience Report
MBPU:一种即插即用的点云上采样状态空间模型,支持快速点渲染 Jiayi Song PDF N/A MBPU: A Plug-and-Play State Space Model for Point Cloud Upsamping with Fast Point Rendering
研究无序蛋白质的机器学习方法 Sören von Bülow PDF N/A Machine learning methods to study disordered proteins
CausalGraph2LLM:评估大型语言模型对因果查询的能力 Ivaxi Sheth PDF N/A CausalGraph2LLM: Evaluating LLMs for Causal Queries
专注于鸟瞰图:用于单目鸟瞰图分割的自校准循环视图变换 Jiawei Zhao PDF N/A Focus on BEV: Self-calibrated Cycle View Transformation for Monocular Birds-Eye-View Segmentation
中心化感知的产品检索与排序 Hadeel Saadany PDF N/A Centrality-aware Product Retrieval and Ranking
是的,嗯,哦:通过微调语音活动投影实现连续和实时反馈预测 Koji Inoue PDF N/A Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection
GReFEL:在偏差和不平衡数据分布下,基于几何感知的可靠面部表情学习 Azmine Toushik Wasi PDF N/A GReFEL: Geometry-Aware Reliable Facial Expression Learning under Bias and Imbalanced Data Distribution
通过同心因果注意力缓解对象幻觉 Yun Xing PDF N/A Mitigating Object Hallucination via Concentric Causal Attention
时间变化更新优化算法的自动微分 Sheheryar Mehmood PDF N/A Automatic Differentiation of Optimization Algorithms with Time-Varying Updates
大规模软标签对于大规模数据集蒸馏是否必要? Lingao Xiao PDF N/A Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation?
利用CORAL-相关一致性网络进行半监督左心房MRI分割 Xinze Li PDF N/A Leveraging CORAL-Correlation Consistency Network for Semi-Supervised Left Atrium MRI Segmentation
Bench4Merge:一个综合基准,用于在具有微交互车辆的现实密集交通中进行合并 Zhengming Wang PDF N/A Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles
DefVerify:仇恨言论模型是否反映了其数据集的定义? Urja Khurana PDF N/A DefVerify: Do Hate Speech Models Reflect Their Dataset's Definition?
多样性策略通过点对点互信息加权模仿学习实现恢复 Hanlin Yang PDF N/A Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning
实时视频异常检测的混合架构:整合空间与时间分析 Fabien Poirier PDF N/A Hybrid Architecture for Real-Time Video Anomaly Detection: Integrating Spatial and Temporal Analysis
地震相位拾取 Yuchen Wang PDF N/A Seismic Phase Picking
基于机器学习的纠错解码器的设计与性能 Yuncheng Yuan PDF N/A On the Design and Performance of Machine Learning Based Error Correcting Decoders
IGMaxHS -- 一种支持XOR子句的增量最大SAT求解器 Ole Lübke PDF N/A IGMaxHS -- An Incremental MaxSAT Solver with Support for XOR Clauses
基于模拟的单分子实验推断 Lars Dingeldein PDF N/A Simulation-based inference of single-molecule experiments
TexPro:基于文本指导的PBR纹理生成与程序化材质建模 Ziqiang Dang PDF N/A TexPro: Text-guided PBR Texturing with Procedural Material Modeling
模型模仿攻击:可证明可转移对抗样本的知识蒸馏 Kirill Lukyanov PDF N/A Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples
用于数字病理学中幻灯片级癌症亚型分类的基础模型 Pablo Meseguer PDF N/A Foundation Models for Slide-level Cancer Subtyping in Digital Pathology
如何构建一个用于同时聊天和决策的预训练多模态模型? Zuojin Tang PDF N/A How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?
使用GPT模型进行2024年美国总统选举过程中的定性与定量新闻分析 Bohdan M. Pavlyshenko PDF N/A Using GPT Models for Qualitative and Quantitative News Analytics in the 2024 US Presidental Election Process
无人机集群的分布式学习 Chen Hu PDF N/A Distributed Learning for UAV Swarms
MI-VisionShot:用于组织病理学图像幻灯片级分类的视觉-语言模型的少样本适应 Pablo Meseguer PDF N/A MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images
闪烁融合:轨迹内领域泛化的多智能体强化学习 Woosung Koh PDF N/A FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
在多任务学习中通过自辅助实现非对称知识迁移 Olivier Graffeuille PDF N/A Enabling Asymmetric Knowledge Transfer in Multi-Task Learning with Self-Auxiliaries
视觉主题识别:精心策划的比较数据集和分类方法的详细阐述 Adam Phillips PDF N/A Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods
语法模式中语义和功能效率的原则 Emily Cheng PDF N/A Principles of semantic and functional efficiency in grammatical patterning
Mesa-外推法:一种用于增强大型语言模型外推能力的编织位置编码方法 Xin Ma PDF N/A Mesa-Extrapolation: A Weave Position Encoding Method for Enhanced Extrapolation in LLMs
面向高效迁移学习的最佳适配器放置策略 Aleksandra I. Nowak PDF N/A Towards Optimal Adapter Placement for Efficient Transfer Learning
TEXEL:一种具有片上学习功能的神经形态处理器,适用于超越CMOS器件的集成 Hugh Greatorex PDF N/A TEXEL: A neuromorphic processor with on-chip learning for beyond-CMOS device integration
R2I-rPPG:一种用于远程光电容积脉搏波描记法提取心率的鲁棒感兴趣区域选择方法 Sandeep Nagar PDF N/A R2I-rPPG: A Robust Region of Interest Selection Method for Remote Photoplethysmography to Extract Heart Rate
聚焦关键:图选择性状态聚焦注意力网络 Shikhar Vashistha PDF N/A Focus Where It Matters: Graph Selective State Focused Attention Networks
多视角医学诊断的随机令牌融合 Jingyu Guo PDF N/A Random Token Fusion for Multi-View Medical Diagnosis
为实时通信中的端到端服务质量预测建模并发RTP流 Tailai Song PDF N/A Modelling Concurrent RTP Flows for End-to-end Predictions of QoS in Real Time Communications
通过图模型实现强化学习中的高效协作 Wenzhe Fan PDF N/A Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning
私密、高效且可扩展的医学图像分析内核学习 Anika Hannemann PDF N/A Private, Efficient and Scalable Kernel Learning for Medical Image Analysis
在GNSS缺失环境下利用深度强化学习进行远程地磁导航 Wenqi Bai PDF N/A Long-distance Geomagnetic Navigation in GNSS-denied Environments with Deep Reinforcement Learning
LiOn-XA:通过仅使用LiDAR的跨模态对抗训练实现无监督领域自适应 Thomas Kreutz PDF N/A LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training
LLM4GRN:利用大型语言模型发现因果基因调控网络——通过合成数据生成进行评估 Tejumade Afonja PDF N/A LLM4GRN: Discovering Causal Gene Regulatory Networks with LLMs -- Evaluation through Synthetic Data Generation
高度相关模糊流失模式在二分类中的可解释性 D. Y. C. Wang PDF N/A Explainability of Highly Associated Fuzzy Churn Patterns in Binary Classification
有人提到“Gest-IT”了吗?这是对多模态数据管理的一次试点探索。 Ludovica Pannitto PDF N/A Did somebody say "Gest-IT"? A pilot exploration of multimodal data management
微调对语言模型毒性的影响 Will Hawkins PDF N/A The effect of fine-tuning on language model toxicity
MAC Revivo:人工智能铺就道路 Jinzhe Pan PDF N/A MAC Revivo: Artificial Intelligence Paves the Way
LiMTR:通过多模态特征融合实现多样化道路用户的时间序列运动预测 Camiel Oerlemans PDF N/A LiMTR: Time Series Motion Prediction for Diverse Road Users through Multimodal Feature Integration
从神经热力学积分中获得的溶剂化自由能 Bálint Máté PDF N/A Solvation Free Energies from Neural Thermodynamic Integration
Kaninfradet3D:基于非线性特征提取与内在关联的路边相机-激光雷达融合3D感知模型 Pei Liu PDF N/A Kaninfradet3D:A Road-side Camera-LiDAR Fusion 3D Perception Model based on Nonlinear Feature Extraction and Intrinsic Correlation
FusionLungNet:用于肺部CT图像分割的多尺度融合卷积与细化网络 Sadjad Rezvani PDF N/A FusionLungNet: Multi-scale Fusion Convolution with Refinement Network for Lung CT Image Segmentation
数据高效的CLIP驱动的双分支网络用于无源无监督领域自适应 Yongguang Li PDF N/A Data-Efficient CLIP-Powered Dual-Branch Networks for Source-Free Unsupervised Domain Adaptation
基于平均场模拟的宇宙初始条件推断 Oleg Savchenko PDF N/A Mean-Field Simulation-Based Inference for Cosmological Initial Conditions
RAG4ITOps:一种面向IT运维与维护的监督式微调与综合性RAG框架 Tianyang Zhang PDF N/A RAG4ITOps: A Supervised Fine-Tunable and Comprehensive RAG Framework for IT Operations and Maintenance
深度学习与数据增强技术在检测自我承认的技术债务中的应用 Edi Sutoyo PDF N/A Deep Learning and Data Augmentation for Detecting Self-Admitted Technical Debt
辅助物理交互:配备神经网络检测、导航和安全层的自主空中机器人 Andrea Berra PDF N/A Assisted Physical Interaction: Autonomous Aerial Robots with Neural Network Detection, Navigation, and Safety Layers
通过蕴含调优改进密集段落检索 Lu Dai PDF N/A Improve Dense Passage Retrieval with Entailment Tuning
深度群卷积神经网络的VC维度 Anna Sepliarskaia PDF N/A On the VC dimension of deep group convolutional neural networks