跳转至

Arxiv 2024-10-24 Papers

标题 作者 PDF链接 代码仓库 Title
PixelGaussian:从任意视角进行可泛化的三维高斯重建 Xin Fei PDF N/A PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views
Framer:交互式帧插值 Wen Wang PDF N/A Framer: Interactive Frame Interpolation
MotionCLR:通过理解注意力机制实现运动生成和无需训练的编辑 Ling-Hao Chen PDF N/A MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms
CAMEL-Bench:一个全面的阿拉伯大型语言模型基准测试 Sara Ghaboura PDF N/A CAMEL-Bench: A Comprehensive Arabic LMM Benchmark
无界:角色生活模拟的生成性无限游戏 Jialu Li PDF N/A Unbounded: A Generative Infinite Game of Character Life Simulation
3D-Adapter: 几何一致的多视角扩散用于高质量3D生成 Hansheng Chen PDF N/A 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation
无需调参的核心集马尔可夫链蒙特卡罗方法 Naitong Chen PDF N/A Tuning-free coreset Markov chain Monte Carlo
深入洞察认知衰退:利用深度学习技术进行非侵入式模式调查 David Ortiz-Perez PDF N/A Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques
概念漂移:通过基础模型的视角揭示偏见 Cristian Daniel Păduraru PDF N/A ConceptDrift: Uncovering Biases through the Lens of Foundational Models
Ferret-UI 2:掌握跨平台通用用户界面理解 Zhangheng Li PDF N/A Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms
数据污染检测对大型语言模型有效吗?关于检测假设的调查与评估 Yujuan Fu PDF N/A Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions
初始化在矩阵分解中的关键作用 Bingcong Li PDF N/A On the Crucial Role of Initialization for Matrix Factorization
学会观察:通过策略分解寻求决策信息 Shivin Dass PDF N/A Learning to Look: Seeking Information for Decision Making via Policy Factorization
OSCAR:通过状态感知推理和重新规划实现操作系统控制 Xiaoqiang Wang PDF N/A OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning
我在哪里以及我将看到什么:一种用于空间定位和视角预测的自回归模型 Junyi Chen PDF N/A Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
上下文是关键:基于重要文本信息的预测基准 Andrew Robert Williams PDF N/A Context is Key: A Benchmark for Forecasting with Essential Textual Information
稳定一致性调整:理解与提升一致性模型 Fu-Yun Wang PDF N/A Stable Consistency Tuning: Understanding and Improving Consistency Models
Bridge-Coder:解锁大型语言模型在低资源代码中跨越语言障碍的潜力 Jipeng Zhang PDF N/A Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code
大型空间模型:从无姿态图像到语义三维的端到端处理 Zhiwen Fan PDF N/A Large Spatial Model: End-to-end Unposed Images to Semantic 3D
BioMistral-NLU:通过指令微调实现更通用的医学语言理解 Yujuan Velvin Fu PDF N/A BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning
学习结构化压缩感知与自动资源分配 Han Wang PDF N/A Learning Structured Compressed Sensing with Automatic Resource Allocation
早期退出大型语言模型中的动态词汇剪枝 Jort Vincenti PDF N/A Dynamic Vocabulary Pruning in Early-Exit LLMs
调整过拟合回归 Dylan Wilson PDF N/A Adjusted Overfitting Regression
从随机矩阵理论视角看学习特征的谱及渐近泛化能力 Yatin Dandi PDF N/A A Random Matrix Theory Perspective on the Spectrum of Learned Features and Asymptotic Generalization Capabilities
具有多智能体角色扮演的架构引导文化感知复杂事件模拟 Sha Li PDF N/A Schema-Guided Culture-Aware Complex Event Simulation with Multi-Agent Role-Play
ANAVI:利用室内环境的视觉信息进行导航的音频噪音感知系统 Vidhi Jain PDF N/A ANAVI: Audio Noise Awareness using Visuals of Indoor environments for NAVIgation
通过加权求和渲染实现的无排序高斯溅射 Qiqi Hou PDF N/A Sort-free Gaussian Splatting via Weighted Sum Rendering
AutoStep:局部自适应的隐式MCMC Tiange Liu PDF N/A AutoStep: Locally adaptive involutive MCMC
通过压缩感知学习$k$-体哈密顿量 Muzhou Ma PDF N/A Learning $k$-body Hamiltonians via compressed sensing
LoRANN:用于近似最近邻搜索的低秩矩阵分解 Elias Jääsaari PDF N/A LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search
SegLLM:多轮推理分割 XuDong Wang PDF N/A SegLLM: Multi-round Reasoning Segmentation
从盲解者到逻辑思考者:在有缺陷的数学问题上对大语言模型逻辑完整性的基准测试 A M Muntasir Rahman PDF N/A From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems
优化边缘卸载决策以进行对象检测 Jiaming Qiu PDF N/A Optimizing Edge Offloading Decisions for Object Detection
MissNODAG: 从不完全数据中学习可微分循环因果图 Muralikrishnna G. Sethuraman PDF N/A MissNODAG: Differentiable Cyclic Causal Graph Learning from Incomplete Data
使用参数化物理信息神经网络预测内部和外部湍流流动 Shinjan Ghosh PDF N/A Using Parametric PINNs for Predicting Internal and External Turbulent Flows
更高效地测试支持大小而非学习直方图 Renato Ferreira Pinto Jr. PDF N/A Testing Support Size More Efficiently Than Learning Histograms
动态三维高斯追踪用于基于图的神经动力学建模 Mingtong Zhang PDF N/A Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling
技能模仿生成器(SkillMimicGen):自动生成演示,以实现高效技能学习和部署 Caelan Garrett PDF N/A SkillMimicGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment
PRISM:一种用于审计大型语言模型中偏见的方法 Leif Azzopardi PDF N/A PRISM: A Methodology for Auditing Biases in Large Language Models
调制自适应傅里叶神经算子用于气象预报的时间插值 Jussi Leinonen PDF N/A Modulated Adaptive Fourier Neural Operators for Temporal Interpolation of Weather Forecasts
用于极低资源芬兰-乌戈尔语族语言的大型语言模型 Taido Purason PDF N/A LLMs for Extremely Low-Resource Finno-Ugric Languages
多目标多样性优化指标的比较分析 Ksenia Pereverdieva PDF N/A Comparative Analysis of Indicators for Multiobjective Diversity Optimization
动脉网络:利用可穿戴脉搏信号重建动脉血压波形,一种群体感知方法 Sicong Huang PDF N/A ArterialNet: Reconstructing Arterial Blood Pressure Waveform with Wearable Pulsatile Signals, a Cohort-Aware Approach
使用异构任务的元学习 Zhaofeng Si PDF N/A Meta-Learning with Heterogeneous Tasks
在开放世界领域中创建和修复机器人程序 Claire Schlesinger PDF N/A Creating and Repairing Robot Programs in Open-World Domains
改进小规模大型语言模型在推理任务中的函数调用功能 Graziano A. Manduzio PDF N/A Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks
大型语言模型真的如报告所说那么优秀吗?检测标签错误并减轻其对模型性能的影响 Omer Nahum PDF N/A Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance
多模态讽刺检测综述 Shafkat Farabi PDF N/A A Survey of Multimodal Sarcasm Detection
Diff-Instruct++:训练一步文本到图像生成模型以符合人类偏好 Weijian Luo PDF N/A Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
使用深度学习进行视频胶囊内窥镜中的多类别异常分类 Arnav Samal PDF N/A Multi-Class Abnormality Classification in Video Capsule Endoscopy Using Deep Learning
引导赋权模式:在线高等教育中释放神经多样性 Hannah Beaux PDF N/A Guiding Empowerment Model: Liberating Neurodiversity in Online Higher Education
使用SNAD探索宇宙:天文学中的异常检测 Alina A. Volnova PDF N/A Exploring the Universe with SNAD: Anomaly Detection in Astronomy
学习在分集式、库存受限市场中的勾结行为 Paul Friedrich PDF N/A Learning Collusion in Episodic, Inventory-Constrained Markets
基于语言用户档案的端到端推荐训练 Zhaolin Gao PDF N/A End-to-end Training for Recommendation with Language-based User Profiles
一个用于学习降阶拉格朗日动力学的黎曼框架 Katharina Friedl PDF N/A A Riemannian Framework for Learning Reduced-order Lagrangian Dynamics
猫鼠游戏:扩散模型与检测方法之间的持续军备竞赛 Linda Laurier PDF N/A The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods
基于组学的生物过程混合动态建模及不确定性估计 Sebastián Espinel-Ríos PDF N/A Omics-driven hybrid dynamic modeling of bioprocesses with uncertainty estimation
FedSPD:一种个性化去中心化联邦学习中的软聚类方法 I-Cheng Lin PDF N/A FedSPD: A Soft-clustering Approach for Personalized Decentralized Federated Learning
开源语言模型的可验证鲁棒水印 Miranda Christ PDF N/A Provably Robust Watermarks for Open-Source Language Models
DeCoRe:通过对比检索头来解码以减轻幻觉 Aryo Pradipta Gema PDF N/A DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
双线性序列回归:一种从长序列高维令牌中学习的模型 Vittorio Erba PDF N/A Bilinear Sequence Regression: A Model for Learning from Long Sequences of High-dimensional Tokens
概率性语言-图像预训练 Sanghyuk Chun PDF N/A Probabilistic Language-Image Pre-Training
揭开医学领域大型语言模型的神秘面纱:入门指南 Qiao Jin PDF N/A Demystifying Large Language Models for Medicine: A Primer
DL-Polycube: 深度学习增强的多面体方法,用于高质量六面体网格生成和体积样条构造 Yuxuan Yu PDF N/A DL-Polycube: Deep learning enhanced polycube method for high-quality hexahedral mesh generation and volumetric spline construction
我们用kNN增强了Whisper,接下来发生的事情你绝对想不到 Maya K. Nachesa PDF N/A We Augmented Whisper With kNN and You Won't Believe What Came Next
通过日常与人工智能互动来提升人工智能意识:反思日记研究 Ashish Hingle PDF N/A Expanding AI Awareness Through Everyday Interactions with AI: A Reflective Journal Study
学习在未知线性约束下使用拉格朗日方法进行探索的 bandits 问题 Udvas Das PDF N/A Learning to Explore with Lagrangians for Bandits under Unknown Linear Constraints
从效率到公平:衡量偏好学习中的公平性 Shreeyash Gowaikar PDF N/A From Efficiency to Equity: Measuring Fairness in Preference Learning
高维知识蒸馏分析:从弱到强的泛化能力和缩放定律 M. Emrullah Ildiz PDF N/A High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws
从以英语为中心到有效双语:为弱势语言定制分词器的语言模型 Artur Kiulian PDF N/A From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages
在k空间中高效进行非刚性配准及其在心脏磁共振成像中的应用 Aya Ghoul PDF N/A Highly efficient non-rigid registration in k-space with application to cardiac Magnetic Resonance Imaging
MazeNet:一种精确、快速且可扩展的深度学习解决方案,用于斯坦纳最小树 Gabriel Díaz Ramos PDF N/A MazeNet: An Accurate, Fast, and Scalable Deep Learning Solution for Steiner Minimum Trees
多尺度扩散:增强高分辨率全景图像生成中的空间布局 Xiaoyu Zhang PDF N/A Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation
面向跨语言视觉文本设计的迁移 Yejin Choi PDF N/A Towards Visual Text Design Transfer Across Languages
双目引导的三维高斯溅射与视图一致性用于稀疏视图合成 Liang Han PDF N/A Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis
从模仿到内省:探究语言模型中的自我意识 Sirui Chen PDF N/A From Imitation to Introspection: Probing Self-Consciousness in Language Models
通过解耦的槽注意力学习全局以对象为中心的表示 Tonglin Chen PDF N/A Learning Global Object-Centric Representations via Disentangled Slot Attention
深入探究反转诅咒:大型语言模型能泛化到何种程度? Zhengkai Lin PDF N/A Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
一种组合方法用于神经涌现通信 Zheyuan Zhang PDF N/A A Combinatorial Approach to Neural Emergent Communication
在预训练扩散模型中进行快速约束采样 Alexandros Graikos PDF N/A Fast constrained sampling in pre-trained diffusion models
跨语言建模维基百科来源的可靠性 Jacopo D'Ignazi PDF N/A Language-Agnostic Modeling of Source Reliability on Wikipedia
PointPatchRL -- 掩码重建提升点云强化学习 Balázs Gyenes PDF N/A PointPatchRL -- Masked Reconstruction Improves Reinforcement Learning on Point Clouds
从大型语言模型(LLMs)中提炼视觉图表推理能力到多模态大型语言模型(MLLMs) Wei He PDF N/A Distill Visual Chart Reasoning Ability from LLMs to MLLMs
从图像中学习几何形状变形的测地线 Nian Wu PDF N/A Learning Geodesics of Geometric Shape Deformations From Images
WARP-LCA:利用局部竞争算法实现高效卷积稀疏编码 Geoffrey Kasenbacher PDF N/A WARP-LCA: Efficient Convolutional Sparse Coding with Locally Competitive Algorithm
适应6G时代多样化的网络内智能的MLOps:挑战与解决方案 Peizheng Li PDF N/A Adapting MLOps for Diverse In-Network Intelligence in 6G Era: Challenges and Solutions
一个用于自动地理空间数据分析的大型语言模型代理 Yuxing Chen PDF N/A An LLM Agent for Automatic Geospatial Data Analysis
通过超表面光学实现深湍流中的单次相位多样性波前传感 Arturo Martin Jimenez PDF N/A Single-Shot Phase Diversity Wavefront Sensing in Deep Turbulence via Metasurface Optics
将神经蒙特卡罗树搜索应用于自动驾驶车辆的非信号化多交叉口调度 Yucheng Shi PDF N/A Applying Neural Monte Carlo Tree Search to Unsignalized Multi-intersection Scheduling for Autonomous Vehicles
我们真的应该编辑语言模型吗?关于编辑语言模型的评估 Qi Li PDF N/A Should We Really Edit Language Models? On the Evaluation of Edited Language Models
去噪扩散概率模型能够最优地适应未知的低维度情况。 Zhihan Huang PDF N/A Denoising diffusion probabilistic models are optimally adaptive to unknown low dimensionality
小帮助大有裨益:通过利用小型语言模型实现高效的LLM训练 Ankit Singh Rawat PDF N/A A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
利用生成先验对抗图像编辑的鲁棒水印技术:从基准测试到进展 Shilin Lu PDF N/A Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
随机图上非凸优化的全随机原始-对偶梯度算法 Chung-Yiu Yau PDF N/A Fully Stochastic Primal-dual Gradient Algorithm for Non-convex Optimization on Random Graphs
考虑城市区域和动态影响的基于注意力的城市电动汽车充电需求预测方法 Haoxuan Kuang PDF N/A Attention-based Citywide Electric Vehicle Charging Demand Prediction Approach Considering Urban Region and Dynamic Influences
任务校准:在推理任务上校准大型语言模型 Yingjie Li PDF N/A Task Calibration: Calibrating Large Language Models on Inference Tasks
安排您的编辑:一种简单而有效的图像编辑扩散噪声调度 Haonan Lin PDF N/A Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing
差分隐私会影响预训练自然语言模型中的偏差吗? Md. Khairul Islam PDF N/A Does Differential Privacy Impact Bias in Pretrained NLP Models?
为什么大型语言模型的有效上下文长度不足? Chenxin An PDF N/A Why Does the Effective Context Length of LLMs Fall Short?
Cellpose+是一种用于染色细胞图像特征提取的形态学分析工具。 Israel A. Huaman PDF N/A Cellpose+, a morphological analysis tool for feature extraction of stained cell images
条件生成的修正扩散引导 Mengfei Xia PDF N/A Rectified Diffusion Guidance for Conditional Generation
通过叙事性XAI实现医疗保健中的AI准备 Akshat Dubey PDF N/A AI Readiness in Healthcare through Storytelling XAI
VoxelKeypointFusion:可泛化的多视角多人姿态估计 Daniel Bermuth PDF N/A VoxelKeypointFusion: Generalizable Multi-View Multi-Person Pose Estimation
GeoLoRA:几何集成用于参数高效微调 Steffen Schotthöfer PDF N/A GeoLoRA: Geometric integration for parameter efficient fine-tuning
基于大语言模型的时变图信号在线预测 Dayu Qin PDF N/A LLM-based Online Prediction of Time-varying Graph Signals
低延迟视频匿名化用于人群异常检测:隐私与性能的权衡 Mulugeta Weldezgina Asres PDF N/A Low-Latency Video Anonymization for Crowd Anomaly Detection: Privacy vs. Performance
ChatSearch:一个用于通用对话图像检索的数据集及生成式检索模型 Zijia Zhao PDF N/A ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval
用于时间序列预测的检索增强扩散模型 Jingwei Liu PDF N/A Retrieval-Augmented Diffusion Models for Time Series Forecasting
利用可解释能力:概念增强扩散与原型网络 Alba Carballo-Castro PDF N/A Exploiting Interpretable Capabilities with Concept-Enhanced Diffusion and Prototype Networks
GrammaMT:利用语法引导的上下文学习改进机器翻译 Rita Ramos PDF N/A GrammaMT: Improving Machine Translation with Grammar-Informed In-Context Learning
BATON:通过动态重批处理提升大型语言模型的批量推理效率 Peizhuang Cong PDF N/A BATON: Enhancing Batch-wise Inference Efficiency for Large Language Models via Dynamic Re-batching
将知识从高质量MRI转移到低质量MRI用于成人胶质瘤诊断 Yanguang Zhao PDF N/A Transferring Knowledge from High-Quality to Low-Quality MRI for Adult Glioma Diagnosis
大型语言模型在文学翻译中的表现究竟如何?人类与大型语言模型在文学翻译评估中的对比 Ran Zhang PDF N/A How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs
PESFormer:通过直接时间戳编码提升宏观和微表情识别 Wang-Wang Yu PDF N/A PESFormer: Boosting Macro- and Micro-expression Spotting with Direct Timestamp Encoding
从零开始通过可扩展的问题合成释放大语言模型的推理能力 Yuyang Ding PDF N/A Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch
ODDN:解决在线社交网络上开放世界深度伪造检测中的未配对数据挑战 Renshuai Tao PDF N/A ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks
具有语义空间对齐的分层多模态大型语言模型用于增强时间序列分类 Xiaoyu Tao PDF N/A Hierarchical Multimodal LLMs with Semantic Space Alignment for Enhanced Time Series Classification
每个组件都至关重要:重新思考多实例分割任务中医学语义分割的成功衡量标准 Alexander Jaus PDF N/A Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks
通过旋转等变2D/3D特征匹配实现刚性单切片-体注册 Stefan Brandstätter PDF N/A Rigid Single-Slice-in-Volume registration via rotation-equivariant 2D/3D feature matching
Ali-AUG:利用一步扩散模型进行标注数据增强的创新方法 Ali Hamza PDF N/A Ali-AUG: Innovative Approaches to Labeled Data Augmentation using One-Step Diffusion Model
通过可迁移性度量提升医学图像分割的预训练效率 Gábor Hidy PDF N/A Enhancing pretraining efficiency for medical image segmentation via transferability metrics
同态计数作为图学习的结构编码 Linus Bao PDF N/A Homomorphism Counts as Structural Encodings for Graph Learning
社交网络中的健康错误信息:IT方法综述 Vasiliki Papanikou PDF N/A Health Misinformation in Social Networks: A Survey of IT Approaches
测试时训练的三维形状补全 Michael Schopf-Kuester PDF N/A 3D Shape Completion with Test-Time Training
DreamClear:高容量真实世界图像修复与隐私安全数据集构建 Yuang Ai PDF N/A DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
使用滑动时间窗口数据处理的可训练激活神经网络及其泛化能力 Anton Raskovalov PDF N/A NIDS Neural Networks Using Sliding Time Window Data Processing with Trainable Activations and its Generalization Capability
学习具有再生核希尔伯特空间和随机傅里叶特征的耗散哈密顿动力学 Torbjørn Smith PDF N/A Learning dissipative Hamiltonian dynamics with reproducing kernel Hilbert spaces and random Fourier features
面向更好的开放式文本生成:多标准评估框架 Esteban Garces Arias PDF N/A Towards Better Open-Ended Text Generation: A Multicriteria Evaluation Framework
$C^2$:基于LLM的图表生成的可扩展自动反馈 Woosung Koh PDF N/A $C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
GADT:通过梯度引导的对抗性数据转换增强可迁移的对抗性攻击 Yating Ma PDF N/A GADT: Enhancing Transferable Adversarial Attacks through Gradient-guided Adversarial Data Transformation
智能ETL与基于大语言模型的内容分类:欧洲智能旅游工具观测站的经验 Diogo Cosme PDF N/A Smart ETL and LLM-based contents classification: the European Smart Tourism Tools Observatory experience
弱到强偏好优化:从弱对齐模型中窃取奖励 Wenhong Zhu PDF N/A Weak-to-Strong Preference Optimization: Stealing Reward from Weak Aligned Model
扩散归因分数:评估扩散模型中的训练数据影响 Jinxu Lin PDF N/A Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Model
使用隐马尔可夫模型对点云数据中的移动物体进行分割 Vedant Bhandari PDF N/A Moving Object Segmentation in Point Cloud Data using Hidden Markov Models
远程检测应用程序以改进毫米波/亚太赫兹5G/6G系统中的波束跟踪 Alexander Shurakov PDF N/A Remote Detection of Applications for Improved Beam Tracking in mmWave/sub-THz 5G/6G Systems
通过学习感知策略梯度的多智能体合作 Alexander Meulemans PDF N/A Multi-agent cooperation through learning-aware policy gradients
小巨人:大规模合成高质量嵌入数据 Haonan Chen PDF N/A Little Giants: Synthesizing High-Quality Embedding Data at Scale
利用图神经网络和多智能体强化学习进行供应链中的库存控制 Niki Kotecha PDF N/A Leveraging Graph Neural Networks and Multi-Agent Reinforcement Learning for Inventory Control in Supply Chains
基于颅骨特征的机器人显微操作注册方案,采用显微立体摄像系统 Xiaofeng Lin PDF N/A A Cranial-Feature-Based Registration Scheme for Robotic Micromanipulation Using a Microscopic Stereo Camera System
利用问题SAPPhIRE概念支持设计新颖性评估 Sanjay Singh PDF N/A Supporting Assessment of Novelty of Design Problems Using Concept of Problem SAPPhIRE
基于语义标签的音色控制使用CVAE进行波表合成 Tsugumasa Yutani PDF N/A Wavetable Synthesis Using CVAE for Timbre Control Based on Semantic Label
SAMG:基于状态-动作感知的离线到在线强化学习与离线模型引导 Liyu Zhang PDF N/A SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance
小型语言模型的提示与微调:实现长度可控的电话通话摘要 David Thulke PDF N/A Prompting and Fine-Tuning of Small LLMs for Length-Controllable Telephone Call Summarization
使用逆向渲染和对抗性隐式函数的环境贴图编辑 Antonio D'Orazio PDF N/A Environment Maps Editing using Inverse Rendering and Adversarial Implicit Functions
通过多智能体深度强化学习实现生态物种的进化扩散 Wonhyung Choi PDF N/A Evolutionary Dispersal of Ecological Species via Multi-Agent Deep Reinforcement Learning
FairQueue:重新思考用于公平文本到图像生成的提示学习 Christopher T. H Teo PDF N/A FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation
重新思考Softmax:基于多项式激活的自注意力机制 Hemanth Saratchandran PDF N/A Rethinking Softmax: Self-Attention with Polynomial Activations
TripCast:用于行程时间序列预测的掩码2D变换器预训练 Yuhua Liao PDF N/A TripCast: Pre-training of Masked 2D Transformers for Trip Time Series Forecasting
使用连续和离散特征的联合表示方法用于胸部CT扫描心血管疾病风险预测 Minfeng Xu PDF N/A A Joint Representation Using Continuous and Discrete Features for Cardiovascular Diseases Risk Prediction on Chest CT Scans
STTATTS:统一的语音转文本与文本转语音模型 Hawau Olamide Toyin PDF N/A STTATTS: Unified Speech-To-Text And Text-To-Speech Model
理解玩家如同他们用定制语言与游戏对话:一项初步研究 Tianze Wang PDF N/A Understanding Players as if They Are Talking to the Game in a Customized Language: A Pilot Study
AgentStore:异构代理的可扩展集成,作为专业化的通用计算机助手 Chengyou Jia PDF N/A AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
微分信息自编码器 Jinrui Zhang PDF N/A Differential Informed Auto-Encoder
语音感知:词汇识别模型 Jean-Marc Luck PDF N/A Speech perception: a model of word recognition
使用前沿开源大型语言模型进行知识蒸馏:泛化能力与合成数据的作用 Anup Shirgaonkar PDF N/A Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data
将代码大型语言模型与直接偏好优化对齐 Yibo Miao PDF N/A Aligning CodeLLMs with Direct Preference Optimization
基准测试图学习用于药物-药物相互作用预测 Zhenqian Shen PDF N/A Benchmarking Graph Learning for Drug-Drug Interaction Prediction
时空搜索用于脉冲神经网络 Kaiwei Che PDF N/A Spatial-Temporal Search for Spiking Neural Networks
SIKeD:用于数学推理的自引导迭代知识蒸馏 Shivam Adarsh PDF N/A SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning
无模型视觉位置识别的重新排序方法,利用深度学习局部特征 Tomáš Pivoňka PDF N/A On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features
Taipan:具有选择性注意力的高效且富有表现力的状态空间语言模型 Chien Van Nguyen PDF N/A Taipan: Efficient and Expressive State Space Language Models with Selective Attention
零样本目标导航与视觉语言模型推理 Congcong Wen PDF N/A Zero-shot Object Navigation with Vision-Language Models Reasoning
对谁来说困难?一项关于日语词汇复杂性的研究 Adam Nohejl PDF N/A Difficult for Whom? A Study of Japanese Lexical Complexity
Bielik 7B v0.1:波兰语言模型 -- 开发、洞察与评估 Krzysztof Ociepa PDF N/A Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation
可解释的新闻摘要 -- 分析与解决分歧问题 Seema Aswani PDF N/A Explainable News Summarization -- Analysis and mitigation of Disagreement Problem
Infinity-MM:通过大规模高质量指令数据扩展多模态性能 Shuhao Gu PDF N/A Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data
基于SEDCNN-SVM的手势识别方法研究 Mingjin Zhang PDF N/A Research on gesture recognition method based on SEDCNN-SVM
复杂性问题:有效维度作为对抗鲁棒性的度量 David Khachaturov PDF N/A Complexity Matters: Effective Dimensionality as a Measure for Adversarial Robustness
局部和全局图建模与边加权图注意力网络用于手写数学表达式识别 Yejing Xie PDF N/A Local and Global Graph Modeling with Edge-weighted Graph Attention Network for Handwritten Mathematical Expression Recognition
从矩阵元素似然性的对称性中得到的最优等变架构 Daniel Maître PDF N/A Optimal Equivariant Architectures from the Symmetries of Matrix-Element Likelihoods
IMAN:一种用于鲁棒性NPC死亡率预测的适应性网络,处理缺失模态问题 Yejing Huo PDF N/A IMAN: An Adaptive Network for Robust NPC Mortality Prediction with Missing Modalities
关于使用注意力矩阵进行解释 Omar Naim PDF N/A On Explaining with Attention Matrices
使用非线性先验从视频中进行可解释的表示学习 Marian Longa PDF N/A Interpretable Representation Learning from Videos using Nonlinear Priors
SMITE:时间分割我 Amirhossein Alimohammadi PDF N/A SMITE: Segment Me In TimE
超越色彩与线条:基于协调语义的零样本特定风格图像变体 Jinghao Hu PDF N/A Beyond Color and Lines: Zero-Shot Style-Specific Image Variations with Coordinated Semantics
LOGO -- 通过高效偏好优化实现长上下文对齐 Zecheng Tang PDF N/A LOGO -- Long cOntext aliGnment via efficient preference Optimization
关于教学文本的系统性综述:从表征到下游自然语言处理任务 Abdulfattah Safa PDF N/A A Systematic Survey on Instructional Text: From Representation and Downstream NLP Tasks
实践:优化大型语言模型代理的原则性推理和行动 Zhiwei Liu PDF N/A PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
探究排名型大型语言模型:信息检索中的机制性可解释性 Tanya Chowdhury PDF N/A Probing Ranking LLMs: Mechanistic Interpretability in Information Retrieval
城市高密度多光谱点云的无监督语义分割 Oona Oinonen PDF N/A Unsupervised semantic segmentation of urban high-density multispectral point clouds
KVSharer:通过逐层不同KV缓存共享实现高效推理 Yifei Yang PDF N/A KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing
在文本上扩展掩码扩散模型 Shen Nie PDF N/A Scaling up Masked Diffusion Models on Text
关于多相机和投影仪几何校准的说明 Tomislav Petkovic PDF N/A A Note on Geometric Calibration of Multiple Cameras and Projectors
一个基于GNSS的ERTMS解决方案性能分析框架 Juliette Marais PDF N/A A framework for GNSS-based solutions performance analysis in an ERTMS context
通过大规模增强格兰杰因果关系(lsAGC)分析功能性MR图像,提升图注意力神经网络在大麻消费分类中的性能 Ali Vosoughi PDF N/A Enhancing Graph Attention Neural Network Performance for Marijuana Consumption Classification through Large-scale Augmented Granger Causality (lsAGC) Analysis of Functional MR Images
CCI3.0-HQ:一个为预训练大型语言模型设计的高质量大规模中文数据集 Liangdong Wang PDF N/A CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models
用于心脏分割的SFB-net:通过注意力机制弥合语义鸿沟 Nicolas Portal PDF N/A SFB-net for cardiac segmentation: Bridging the semantic gap with attention
通过大型语言模型实现可靠的自动编程 Martin Mirchev PDF N/A Assured Automatic Programming via Large Language Models
ChineseSafe:一个用于评估大型语言模型安全性的中文基准 Hengxiang Zhang PDF N/A ChineseSafe: A Chinese Benchmark for Evaluating Safety in Large Language Models
Synth4Seg -- 利用双层优化学习缺陷数据合成以进行缺陷分割 Shancong Mou PDF N/A Synth4Seg -- Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization
在敏捷模型驱动开发中,大型语言模型作为代码生成器 Ahmed R. Sadik PDF N/A LLM as a code generator in Agile Model Driven Development
图预训练模型是强大的异常检测器 Jiashun Cheng PDF N/A Graph Pre-Training Models Are Strong Anomaly Detectors
基于时间泊松分解的进化声音 Jan Vávra PDF N/A Evolving Voices Based on Temporal Poisson Factorisation
Dialog2Flow:为自动对话流程提取预训练软对比动作驱动句子嵌入 Sergio Burdisso PDF N/A Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction
分类器聚类与特征对齐在分布式概念漂移下的联邦学习 Junbao Chen PDF N/A Classifier Clustering and Feature Alignment for Federated Learning under Distributed Concept Drift
蒙日-安培正则化用于从点云学习任意形状 Chuanxiang Yang PDF N/A Monge-Ampere Regularization for Learning Arbitrary Shapes from Point Clouds
基因-代谢物关联预测与代谢物生产增强图的交互知识转移 Kexuan Xin PDF N/A Gene-Metabolite Association Prediction with Interactive Knowledge Transfer Enhanced Graph for Metabolite Production
如果输入在OOD检测中被扩展会怎样? Boxuan Zhang PDF N/A What If the Input is Expanded in OOD Detection?
迭代自调优大型语言模型以增强越狱能力 Chung-En Sun PDF N/A Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities
学习愤怒:体验强化学习中的情感过山车 Lachlan Mares PDF N/A Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning