跳转至

Arxiv 2024-11-25 Papers

标题 作者 PDF链接 代码仓库 Title
生成式全息图:学习将视频分解成图层 Yao-Chih Lee PDF N/A Generative Omnimatte: Learning to Decompose Video into Layers
因子分解视觉标记化和生成 Zechen Bai PDF N/A Factorized Visual Tokenization and Generation
夸克:实时、高分辨率及通用的神经视图合成 John Flynn PDF N/A Quark: Real-time, High-resolution, and General Neural View Synthesis
大型语言模型是否在不利用捷径的情况下执行潜在的多步推理? Sohee Yang PDF N/A Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
用于零样本6DoF物体姿态估计的扩散特征 Bernd Von Gimborn PDF N/A Diffusion Features for Zero-Shot 6DoF Object Pose Estimation
OPMOS:有序并行多目标最短路径 Leo Gold PDF N/A OPMOS: Ordered Parallel Multi-Objective Shortest-Path
CatNet:在LSTM中使用高斯镜像和SHAP特征重要性实现有效的FDR控制 Jiaan Han PDF N/A CatNet: Effective FDR Control in LSTM with Gaussian Mirrors and SHAP Feature Importance
边缘权重预测用于类别无关的姿态估计 Or Hirschorn PDF N/A Edge Weight Prediction For Category-Agnostic Pose Estimation
用于线性偏微分方程边值问题的高斯过程先验 Jianlei Huang PDF N/A Gaussian Process Priors for Boundary Value Problems of Linear Partial Differential Equations
使用延迟投影快速训练大核模型 Amirhesam Abedsoltan PDF N/A Fast training of large kernel models with delayed projections
DreamRunner:通过检索增强的运动适应实现细粒度故事叙述视频生成 Zun Wang PDF N/A DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation
自我生成的批评提升语言模型的奖励建模 Yue Yu PDF N/A Self-Generated Critiques Boost Reward Modeling for Language Models
推荐系统为善(RS4Good):使用案例调查及对重要研究行动的呼吁 Dietmar Jannach PDF N/A Recommender Systems for Good (RS4Good): Survey of Use Cases and a Call to Action for Research that Matters
探索用于从头生成3D分子的离散流匹配方法 Ian Dunn PDF N/A Exploring Discrete Flow Matching for 3D De Novo Molecule Generation
防止越狱提示作为网络犯罪分子的恶意工具:网络防御视角 Jean Marie Tshimula PDF N/A Preventing Jailbreak Prompts as Malicious Tools for Cybercriminals: A Cyber Defense Perspective
自动事实性指标真的能衡量事实性吗?一项批判性评估 Sanjana Ramprasad PDF N/A Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation
LegoPET:用于PET图像重建的分层特征引导条件扩散 Yiran Sun PDF N/A LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction
通过人类互动进行推理时策略调整 Yanwei Wang PDF N/A Inference-Time Policy Steering through Human Interactions
泄漏鲁棒的贝叶斯劝说 Nika Haghtalab PDF N/A Leakage-Robust Bayesian Persuasion
物理世界中的不可感知对抗样本 Weilin Xu PDF N/A Imperceptible Adversarial Examples in the Physical World
人类活动AGV质量评估:基准数据集与客观评价指标 Zhichao Zhang PDF N/A Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
StructFormer:基于文档结构的掩码注意力及其对语言模型预训练的影响 Kaustubh Ponkshe PDF N/A StructFormer: Document Structure-based Masked Attention and its Impact on Language Model Pre-Training
GeoFormer:一种多边形分割变换器 Maxim Khomiakov PDF N/A GeoFormer: A Multi-Polygon Segmentation Transformer
局部聚类选择的图池化 Yizhu Chen PDF N/A Graph Pooling with Local Cluster Selection
线性文本分割的最新趋势:一项调查 Iacopo Ghinassi PDF N/A Recent Trends in Linear Text Segmentation: a Survey
F -- 基于基础本体DOLCE+DnS Ultralite的事件模型 Ansgar Scherp PDF N/A F -- A Model of Events based on the Foundational Ontology DOLCE+DnS Ultralite
Chat2SVG:利用大型语言模型和图像扩散模型生成矢量图形 Ronghuan Wu PDF N/A Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models
组合优化预测的近似算法 Antonios Antoniadis PDF N/A Approximation Algorithms for Combinatorial Optimization with Predictions
解锁基于扩散的净化中自适应攻击的潜力 Andre Kassis PDF N/A Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification
从生成到判断:LLM作为法官的机遇与挑战 Dawei Li PDF N/A From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
对抗性攻击用于漂移检测 Fabian Hinder PDF N/A Adversarial Attacks for Drift Detection
基于新信息的贝叶斯优化中的Alpha熵搜索 Daniel Fernández-Sánchez PDF N/A Alpha Entropy Search for New Information-based Bayesian Optimization
通过在测试时和训练时监督下使用批判模型来增强大语言模型的推理能力 Zhiheng Xi PDF N/A Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
重新思考用于文本驱动的生成人类运动扩散模型 Zichong Meng PDF N/A Rethinking Diffusion for Text-Driven Human Motion Generation
朴素算法共谋:当强盗学习者何时合作,何时竞争? Connor Douglas PDF N/A Naive Algorithmic Collusion: When Do Bandit Learners Cooperate and When Do They Compete?
J-CaPA:联合通道和金字塔注意力提升医学图像分割 Marzia Binta Nizam PDF N/A J-CaPA : Joint Channel and Pyramid Attention Improves Medical Image Segmentation
通过整合数据和GAN模型方法提升少样本学习能力 Yinqiu Feng PDF N/A Enhancing Few-Shot Learning with Integrated Data and GAN Model Approaches
EnStack:一种大型语言模型集成堆叠框架,用于增强源代码中的漏洞检测 Shahriyar Zaman Ridoy PDF N/A EnStack: An Ensemble Stacking Framework of Large Language Models for Enhanced Vulnerability Detection in Source Code
基于生长架构的量子电路训练 Callum Duffy PDF N/A Quantum Circuit Training with Growth-Based Architectures
窄带射电技术信号搜索中的异常检测与RFI分类:基于无监督学习的研究 Ben Jacobson-Bell PDF N/A Anomaly Detection and RFI Classification with Unsupervised Learning in Narrowband Radio Technosignature Searches
使用语言模型生成分布外场景 Erfan Aasi PDF N/A Generating Out-Of-Distribution Scenarios Using Language Models
向量量化中的表示崩溃问题 Wenhao Zhao PDF N/A Representation Collapsing Problems in Vector Quantization
Transformer 是深度优化器:深度模型训练的可证明的上下文学习 Weimin Wu PDF N/A Transformers are Deep Optimizers: Provable In-Context Learning for Deep Model Training
RoboSpatial:为2D和3D视觉-语言模型教授空间理解,以应用于机器人技术 Chan Hee Song PDF N/A RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
使用任务无关策略蒸馏的持续深度强化学习 Muhammad Burhan Hafez PDF N/A Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation
大型语言模型中的偏见分析:上下文词嵌入中的刻板印象维度 Carolin M. Schuster PDF N/A Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings
提示调优变压器的根本限制:普遍性、容量和效率 Jerry Yao-Chieh Hu PDF N/A Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiency
LaB-RAG:用于放射报告生成的标签增强检索增强生成 Steven Song PDF N/A LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation
PriorPath:用于受控从头病理语义掩码生成的由粗到细方法 Nati Daniel PDF N/A PriorPath: Coarse-To-Fine Approach for Controlled De-Novo Pathology Semantic Masks Generation
守门:概念防护——在概念瓶颈模型中抵御概念级后门 Songning Lai PDF N/A Guarding the Gate: ConceptGuard Battles Concept-Level Backdoors in Concept Bottleneck Models
Jaya R 包——一种无需参数的先进单目标和多目标优化解决方案 Neeraj Dhanraj Bokde PDF N/A Jaya R Package -- A Parameter-Free Solution for Advanced Single and Multi-Objective Optimization
所有语言都重要:评估大型多语言模型在文化多样化的100种语言上的表现 Ashmal Vayani PDF N/A All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
终身多智能体路径寻找的在线指导图优化 Hongzhi Zang PDF N/A Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding
用于增强文本到图像合成中语义忠实度的噪声扩散技术 Boming Miao PDF N/A Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis
通过对比解释解读语言奖励模型 Junqi Jiang PDF N/A Interpreting Language Reward Models via Contrastive Explanations
从有限数据中生成人类动作的多分辨率建模 David Eduardo Moreno-Villamarín PDF N/A Multi-Resolution Generative Modeling of Human Motion from Limited Data
AtomR:基于原子操作的大型语言模型,用于异构知识推理 Amy Xin PDF N/A AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
O1复制之旅 -- 第二部分:通过简单蒸馏超越O1-preview,是大进步还是苦涩教训? Zhen Huang PDF N/A O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?
婴儿教婴儿:学生知识共享能否在小数据集上超越教师指导的蒸馏? Srikrishna Iyer PDF N/A When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets?
用于精确能带结构预测的图变换网络:一种端到端的方法 Weiyi Gong PDF N/A Graph Transformer Networks for Accurate Band Structure Prediction: An End-to-End Approach
可变形Mamba用于广角视场分割 Jie Hu PDF N/A Deformable Mamba for Wide Field of View Segmentation
分布式、通信高效且满足差分隐私的KL散度估计 Mary Scott PDF N/A Distributed, communication-efficient, and differentially private estimation of KL divergence
具有随机代理可用性的分布式在线优化 Juliette Achddou PDF N/A Distributed Online Optimization with Stochastic Agent Availability
NonSysId:一个非线性系统识别包,针对NARMAX模型改进了模型项选择 Rajintha Gunawardena PDF N/A NonSysId: A nonlinear system identification package with improved model term selection for NARMAX models
高效视频人脸增强与增强的空间-时间一致性 Yutong Wang PDF N/A Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency
无身份,无问题:通过检测实现的人员追踪运动 Martin Engilberge PDF N/A No Identity, no problem: Motion through detection for people tracking
幼狮:分布式系统中通信开销的最小化 Satoki Ishikawa PDF N/A Lion Cub: Minimizing Communication Overhead in Distributed Lion
从群不变网络重建训练数据 Ran Elbaz PDF N/A On the Reconstruction of Training Data from Group Invariant Networks
用于增强自动驾驶轨迹预测的特征扩散网络 Haoming Li PDF N/A Characterized Diffusion Networks for Enhanced Autonomous Driving Trajectory Prediction
类比学习:通过基于计算图的检索增强数学应用题解决中的少样本提示 Xiaocong Yang PDF N/A Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval
VQ-SGen:一种用于草图生成的向量量化笔画表示方法 Jiawei Wang PDF N/A VQ-SGen: A Vector Quantized Stroke Representation for Sketch Generation
塑料树:一种现代的突触可塑性模拟框架——从单个突触到形态神经元网络 Jannik Luboeinski PDF N/A Plastic Arbor: a modern simulation framework for synaptic plasticity $\unicode{x2013}$ from single synapses to networks of morphological neurons
SplatFlow:用于3D高斯喷洒合成的多视图校正流模型 Hyojun Go PDF N/A SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis
TIFeD:一种基于小整数的直接反馈对齐联邦学习算法 Luca Colombo PDF N/A TIFeD: a Tiny Integer-based Federated learning algorithm with Direct feedback alignment
AnonyNoise:利用智能噪声匿名化事件数据,以超越重识别并保护隐私 Katharina Bendig PDF N/A AnonyNoise: Anonymizing Event Data with Smart Noise to Outsmart Re-Identification and Preserve Privacy
利用超类从分层数据库中学习 Nicolas Urbani PDF N/A Harnessing Superclasses for Learning from Hierarchical Databases
通过定向交叉注意力对抗攻击实现个性化扩散模型中的隐私保护 Xide Xu PDF N/A Privacy Protection in Personalized Diffusion Models via Targeted Cross-Attention Adversarial Attack
在语言模型中寻找结构 Jaap Jumelet PDF N/A Finding Structure in Language Models
连续时间中的无监督事件异常检测 Somjit Nath PDF N/A Unsupervised Event Outlier Detection in Continuous Time
TopV-Nav:释放MLLM在零样本目标导航中的顶视图空间推理潜力 Linqing Zhong PDF N/A TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation
基于双向长短期记忆网络(BLSTM)的涡轮风扇发动机剩余使用寿命(RUL)预测 Abedin Sherifi PDF N/A Turbofan Engine Remaining Useful Life (RUL) Prediction Based on Bi-Directional Long Short-Term Memory (BLSTM)
数字台风数据集的机器学习:扩展至多个流域及表示与任务的新进展 Asanobu Kitamoto PDF N/A Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks
湍流建模生成学习方法的比较 Claudia Drygala PDF N/A Comparison of Generative Learning Methods for Turbulence Modeling
低数据历史音乐手稿分类:一种少样本学习方法 Elona Shatri PDF N/A Low-Data Classification of Historical Music Manuscripts: A Few-Shot Learning Approach
视觉-语言模型时代下语义分割的无监督域适应研究 Manuel Schwonberg PDF N/A A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models
使用GAN生成手写乐谱:对CycleWGAN、ProGAN和DCGAN的综合评估 Elona Shatri PDF N/A Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGAN
基于适配器的知识增强语言模型方法综述 Alexander Fichtl PDF N/A Adapter-based Approaches to Knowledge-enhanced Language Models -- A Survey
耦合细胞系统中的叉式分岔 Shikhar Raj PDF N/A Pitchfork Bifurcation In A Coupled Cell System
量子奇异模型的统计推断 Hiroshi Yano PDF N/A Statistical inference for quantum singular models
用于高效且细致表面重建的二次高斯散射技术 Ziyu Zhang PDF N/A Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction
人类校准的生成语言模型的自动化测试与验证 Agus Sudjianto PDF N/A Human-Calibrated Automated Testing and Validation of Generative Language Models
FineWeb-zhtw:可扩展的从网络获取中文文本数据并进行整理 Cheng-Wei Lin PDF N/A FineWeb-zhtw: Scalable Curation of Traditional Chinese Text Data from the Web
隐私保护的联邦基础模型用于通用超声人工智能 Yuncheng Jiang PDF N/A Privacy-Preserving Federated Foundation Model for Generalist Ultrasound Artificial Intelligence
Ca2-VDM:具有因果生成和缓存共享的高效自回归视频扩散模型 Kaifeng Gao PDF N/A Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing
虾体内的魔鬼阶梯揭示了平台尖峰和爆发的周期性 Luiz F. B. Caixeta PDF N/A Devil's staircase inside shrimps reveals periodicity of plateau spikes and bursts
深度概率图像分割中的贝叶斯不确定性量化综述 M. M. A. Valiuddin PDF N/A A Review of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation
多模态检索增强多模态生成:一个基准测试,评估指标和强基线 Zi-Ao Ma PDF N/A Multi-modal Retrieval Augmented Multi-modal Generation: A Benchmark, Evaluate Metrics and Strong Baselines
基于图神经网络的大规模超导量子电路参数设计以减轻串扰 Hao Ai PDF N/A Graph Neural Networks-based Parameter Design towards Large-Scale Superconducting Quantum Circuits for Crosstalk Mitigation
两跳诅咒:在训练中仅基于A->B、B->C的LLMs无法学会A-->C Mikita Balesni PDF N/A The Two-Hop Curse: LLMs trained on A->B, B->C fail to learn A-->C
用于脑部血管畸形的机器学习 Irem Topal PDF N/A Machine learning for cerebral blood vessels' malformations
面向重症监护时间序列的基础模型 Manuel Burger PDF N/A Towards Foundation Models for Critical Care Time Series
伪反馈推理的偏好优化 Fangkai Jiao PDF N/A Preference Optimization for Reasoning with Pseudo Feedback
一种基于数据驱动的数据流感知图神经网络推理在线调度方法 Pol Puigdemont PDF N/A A Data-Driven Approach to Dataflow-Aware Online Scheduling for Graph Neural Network Inference
Solaris:太阳的基础模型 Harris Abdul Majid PDF N/A Solaris: A Foundation Model of the Sun
AI能给你的作文打分吗?大型语言模型与教师评分在多维度作文评分中的比较分析 Kathrin Seßler PDF N/A Can AI grade your essays? A comparative analysis of large language models and teacher ratings in multidimensional essay scoring
WTDUN:用于图像压缩感知的基于小波树结构采样和深度展开网络 Kai Han PDF N/A WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing
基于聚类的半监督策略用于提升液体活检中循环肿瘤细胞的机器学习检测效果 Hümeyra Husseini-Wüsthoff PDF N/A Cluster-based human-in-the-loop strategy for improving machine learning-based circulating tumor cell detection in liquid biopsy
CapHDR2IR:从可见光到红外域的标题驱动传输 Jingchao Peng PDF N/A CapHDR2IR: Caption-Driven Transfer from Visible Light to Infrared Domain
深度网络中的类脑涌现特性:网络架构、数据集和训练的影响 Niranjan Rajesh PDF N/A Brain-like emergent properties in deep networks: impact of network architecture, datasets and training
曝光校正的亮度分量分析 Jingchao Peng PDF N/A Luminance Component Analysis for Exposure Correction
CutS3D: 在3D中切割语义以实现2D无监督实例分割 Leon Sick PDF N/A CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation
一次扩散生成一切 Duong H. Le PDF N/A One Diffusion to Generate Them All
基于深度学习的单目车道线检测:综述 Xin He PDF N/A Monocular Lane Detection Based on Deep Learning: A Survey
潜在变量非参数因果效应估计中的协变量选择局部学习 Zheng Li PDF N/A Local Learning for Covariate Selection in Nonparametric Causal Effect Estimation with Latent Variables
基于定向直方图的矢量场嵌入用于表征放射治疗中的4D CT数据集 Frederic Madesta PDF N/A Oriented histogram-based vector field embedding for characterizing 4D CT data sets in radiotherapy
CATP-LLM:赋能大型语言模型进行成本意识工具规划 Duo Wu PDF N/A CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning
EPS:深度超分辨率模型训练中视频过拟合的高效补丁采样 Yiying Wei PDF N/A EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training
三维场景中的功能理解与分割 Jaime Corsetti PDF N/A Functionality understanding and segmentation in 3D scenes
一种端到端鲁棒点云语义分割网络,采用单步条件扩散模型 Wentao Qu PDF N/A An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models
通过迭代训练从成功对话的相关子目标中学习,以实现面向任务的对话系统 Magdalena Kaiser PDF N/A Learning from Relevant Subgoals in Successful Dialogs using Iterative Training for Task-oriented Dialog Systems
理解联邦学习的泛化性:模型稳定性与优化之间的权衡 Dun Zeng PDF N/A Understanding Generalization of Federated Learning: the Trade-off between Model Stability and Optimization
DiffDesign:结合元先验的可控扩散,实现高效室内设计生成 Yuxuan Yang PDF N/A DiffDesign: Controllable Diffusion with Meta Prior for Efficient Interior Design Generation
BayLing 2:一种高效语言对齐的多语言大型语言模型 Shaolei Zhang PDF N/A BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment
评估Rank-N-Contrast:回归任务中的连续且鲁棒的表示 Six Valentin PDF N/A Evaluating Rank-N-Contrast: Continuous and Robust Representations for Regression
一种针对受损道路低分辨率图像语义分割的性能提升策略 Rafael S. Toledo PDF N/A A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads
利用二维姿态检测器中的不确定性进行概率性三维人体网格重建 Tom Wehrbein PDF N/A Utilizing Uncertainty in 2D Pose Detectors for Probabilistic 3D Human Mesh Recovery
一种用于识别社交媒体中机器人的图神经架构搜索方法 Georgios Tzoumanekas PDF N/A A Graph Neural Architecture Search Approach for Identifying Bots in Social Media
甚至更稀疏的图变换器 Hamed Shirzad PDF N/A Even Sparser Graph Transformers
用于文本相关说话人验证(TdSV)AAIC挑战赛2024的SVASR系统 Mohammadreza Molavi PDF N/A The SVASR System for Text-dependent Speaker Verification (TdSV) AAIC Challenge 2024
使用表面肌电图和惯性测量单元信号进行踝关节外骨骼运动分类的深度学习 Silas Ruhrberg Estévez PDF N/A Deep Learning for Motion Classification in Ankle Exoskeletons Using Surface EMG and IMU Signals
具有崩溃约束的控制器调优的局部贝叶斯优化 Alexander von Rohr PDF N/A Local Bayesian Optimization for Controller Tuning with Crash Constraints
探索机器中的意识 Mathis Immertreu PDF N/A Probing for Consciousness in Machines
解析大型语言模型中的算术:代数结构的作用 Fu-Chieh Chang PDF N/A Unraveling Arithmetic in Large Language Models: The Role of Algebraic Structures
气体背景对XFEL单粒子成像的影响 Tong You PDF N/A Impact of gas background on XFEL single-particle imaging
开放词汇八叉树图用于三维场景理解 Zhigang Wang PDF N/A Open-Vocabulary Octree-Graph for 3D Scene Understanding
NormXLogit:头顶上的真相永不撒谎 Sina Abbasi PDF N/A NormXLogit: The Head-on-Top Never Lies
文本分类器解释的透明邻域近似 Yi Cai PDF N/A Transparent Neighborhood Approximation for Text Classifier Explanation
使用机器学习与深度学习技术诊断糖尿病视网膜病变 Eric Shah PDF N/A Diagnosis of diabetic retinopathy using machine learning & deep learning technique
通过核嵌入实现预测的高效池化 Sam Allen PDF N/A Efficient pooling of predictions via kernel embeddings
DoubleCCA: 使用随机句子嵌入提升基础模型群体鲁棒性 Hong Liu PDF N/A DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings
流退火重要性采样自举法与可微粒子物理学的结合 Annalena Kofler PDF N/A Flow Annealed Importance Sampling Bootstrap meets Differentiable Particle Physics
有效的非随机极限学习机 Daniela De Canditiis PDF N/A Effective Non-Random Extreme Learning Machine
特征之心:利用特征脸方法进行心脏疾病分类 Nourelhouda Groun PDF N/A EigenHearts: Cardiac Diseases Classification Using EigenFaces Approach
UltraSam:利用大规模开放访问分割数据集构建的超声基础模型 Adrien Meyer PDF N/A UltraSam: A Foundation Model for Ultrasound using Large Open-Access Segmentation Datasets
弱监督图像分割用于新鲜农产品的基于缺陷的分级 Manuel Knott PDF N/A Weakly supervised image segmentation for defect-based grading of fresh produce
通过局部动态优化和条件嵌入实现混合退化图像恢复 Yubin Gu PDF N/A Mixed Degradation Image Restoration via Local Dynamic Optimization and Conditional Embedding
SMGDiff:使用扩散概率模型生成足球运动 Hongdi Yang PDF N/A SMGDiff: Soccer Motion Generation using diffusion probabilistic models
SAVEn-Vid:长视频背景下增强理解的视听协同整合 Jungang Li PDF N/A SAVEn-Vid: Synergistic Audio-Visual Integration for Enhanced Understanding in Long Video Context
批量贝叶斯优化通过期望子空间改进 Dawei Zhan PDF N/A Batch Bayesian Optimization via Expected Subspace Improvement
MH-MoE:多头部专家混合模型 Shaohan Huang PDF N/A MH-MoE:Multi-Head Mixture-of-Experts
多AI反馈的视频-文本数据集构建:推动视频大语言模型的弱至强偏好学习 Hao Yi PDF N/A Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models
基于神经网络的高指数鞍点动力学方法用于搜索鞍点和解景观 Yuankai Liu PDF N/A Neural Network-based High-index Saddle Dynamics Method for Searching Saddle Points and Solution Landscape
VIRES:基于草图和文本引导的视频实例重绘 Shuchen Weng PDF N/A VIRES: Video Instance Repainting with Sketch and Text Guidance
通过视觉精度搜索解释对象级基础模型 Ruoyu Chen PDF N/A Interpreting Object-level Foundation Models via Visual Precision Search
从基础模型学习:无需手动标注的水果检测模型 Yanan Wang PDF N/A Learn from Foundation Model: Fruit Detection Model without Manual Annotation
关于连续投影算法鲁棒性的研究 Giovanni Barbarino PDF N/A On the Robustness of the Successive Projection Algorithm
通过第三方大语言模型集成增强多智能体共识:分析不确定性并减轻大语言模型中的幻觉现象 Zhihua Duan PDF N/A Enhancing Multi-Agent Consensus through Third-Party LLM Integration: Analyzing Uncertainty and Mitigating Hallucinations in Large Language Models
Fancy123:通过即插即用变形技术实现从单张图像到高质量3D网格生成的过程 Qiao Yu PDF N/A Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation
Any3DIS:通过2D掩码跟踪实现类无关的3D实例分割 Phuc Nguyen PDF N/A Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking
事件增强的可变形三维高斯分布用于快速动态场景重建 Wenhao Xu PDF N/A Event-boosted Deformable 3D Gaussians for Fast Dynamic Scene Reconstruction
高分辨率需警惕!改进自监督真实世界超分辨率 Yuehan Zhang PDF N/A High-Resolution Be Aware! Improving the Self-Supervised Real-World Super-Resolution
SALOVA:面向长视频分析的目标检索与路由的分段增强长视频助手 Junho Kim PDF N/A SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis
U2NeRF:无监督水下图像复原与神经辐射场 Vinayak Gupta PDF N/A U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields
图像生成多样性问题及如何解决它们 Mischa Dombrowski PDF N/A Image Generation Diversity Issues and How to Tame Them
CARE Transformer:通过解耦双重交互实现移动友好的线性视觉Transformer Yuan Zhou PDF N/A CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
局部与全局特征注意力融合网络用于人脸识别 Wang Yu PDF N/A Local and Global Feature Attention Fusion Network for Face Recognition
BadSFL:针对Scaffold联邦学习的后门攻击 Xingshuo Han PDF N/A BadSFL: Backdoor Attack against Scaffold Federated Learning
文本到图像合成:十年回顾 Nonghai Zhang PDF N/A Text-to-Image Synthesis: A Decade Survey
稀疏补丁对抗攻击通过外推逐点信息 Yaniv Nemcovsky PDF N/A Sparse patches adversarial attacks via extrapolating point-wise information
MixPE:高效LLM推理的量化与硬件协同设计 Yu Zhang PDF N/A MixPE: Quantization and Hardware Co-design for Efficient LLM Inference
MVGenMaster:通过增强的3D先验扩散模型从任意图像扩展多视图生成 Chenjie Cao PDF N/A MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
VideoOrion:视频中对象动态的代币化 Yicheng Feng PDF N/A VideoOrion: Tokenizing Object Dynamics in Videos
脑电图基础模型参数高效微调的图适配器 Toyotaro Suzumura PDF N/A Graph Adapter of EEG Foundation Models for Parameter Efficient Fine Tuning
DeDe:通过解码器检测SSL编码器的后门样本 Sizai Hou PDF N/A DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders
回顾Marr在人脸中的应用:深度神经网络中2D--2.5D--3D表示的构建 Xiangyu Zhu PDF N/A Revisiting Marr in Face: The Building of 2D--2.5D--3D Representations in Deep Neural Networks
SKQVC:通过K-均值量化与自监督语音表示实现的一次性语音转换 Youngjun Sim PDF N/A SKQVC: One-Shot Voice Conversion by K-Means Quantization with Self-Supervised Speech Representations
动态图嵌入的局部内在维度 Dušica Knežević PDF N/A Local Intrinsic Dimensionality for Dynamic Graph Embeddings
利用无人机群扑灭野火:一种先预测后优化的方法 Shijie Pan PDF N/A Using Drone Swarm to Stop Wildfire: A Predict-then-optimize Approach
图上时空预测的因果邻近学习 Zhaobin Mo PDF N/A Causal Adjacency Learning for Spatiotemporal Prediction Over Graphs
超越任务向量:基于重要性度量的选择性任务算术 Tian Bowen PDF N/A Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics
上下文感知门控用于检索增强生成 Mohammad Hassan Heydari PDF N/A Context Awareness Gate For Retrieval Augmented Generation
TreeFormer:通过树约束图生成实现单视图植物骨架估计 Xinpeng Liu PDF N/A TreeFormer: Single-view Plant Skeleton Estimation via Tree-constrained Graph Generation
通过条件模仿协同学习实现自动驾驶车辆的端到端转向控制 Mahmoud M. Kishky PDF N/A End-to-End Steering for Autonomous Vehicles via Conditional Imitation Co-Learning
三辆车接近100米内!通过基于相机的三轴体素扫描增强远距离几何细节,实现语义场景补全 Jongseong Bae PDF N/A Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
CIA:基于稳定扩散的可控图像增强框架 Mohamed Benkedadra PDF N/A CIA: Controllable Image Augmentation Framework Based on Stable Diffusion
DF-GNN:面向GPU的注意力图神经网络动态融合框架 Jiahui Liu PDF N/A DF-GNN: Dynamic Fusion Framework for Attention Graph Neural Networks on GPUs
Med-PerSAM:面向医疗领域的个性化分割一切模型的一次性视觉提示调优 Hangyul Yoon PDF N/A Med-PerSAM: One-Shot Visual Prompt Tuning for Personalized Segment Anything Model in Medical Domain
DP-CDA:一种通过随机混合增强数据集合成中隐私保护的算法 Utsab Saha PDF N/A DP-CDA: An Algorithm for Enhanced Privacy Preservation in Dataset Synthesis Through Randomized Mixing
为什么代理会做出这个决定:用视觉掩码解释深度强化学习 Rui Zuo PDF N/A Why the Agent Made that Decision: Explaining Deep Reinforcement Learning with Vision Masks
学习用于端到端神经图像压缩的最优格点向量量化器 Xi Zhang PDF N/A Learning Optimal Lattice Vector Quantizers for End-to-end Neural Image Compression
支持多文档分析推理的LLM增强方法 Raquib Bin Yousuf PDF N/A LLM Augmentations to support Analytical Reasoning over Multiple Documents
LLMPirate:用于黑箱硬件IP盗版的LLMs Vasudev Gohil PDF N/A LLMPirate: LLMs for Black-box Hardware IP Piracy
FUN-AD:针对含噪训练数据的完全无监督异常检测学习 Jiin Im PDF N/A FUN-AD: Fully Unsupervised Learning for Anomaly Detection with Noisy Training Data
UNOPose:利用未配准的RGB-D参考图像进行未见物体的姿态估计 Xingyu Liu PDF N/A UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image
自适应电路行为与机制可解释性中的泛化 Jatin Nainani PDF N/A Adaptive Circuit Behavior and Generalization in Mechanistic Interpretability
BlendServe:通过资源感知的批处理优化自回归大型模型的离线推理 Yilong Zhao PDF N/A BlendServe: Optimizing Offline Inference for Auto-regressive Large Models with Resource-aware Batching
使用联邦学习进行漏洞检测的实证研究 Peiheng Zhou PDF N/A An Empirical Study of Vulnerability Detection using Federated Learning
ENCLIP:基于集成和聚类的对比语言-图像预训练,用于在数据有限和图像质量低下的情况下进行时尚多模态搜索 Prithviraj Purushottam Naik PDF N/A ENCLIP: Ensembling and Clustering-Based Contrastive Language-Image Pretraining for Fashion Multimodal Search with Limited Data and Low-Quality Images
LDACP:针对竞价策略的长延迟广告转化预测模型 Peng Cui PDF N/A LDACP: Long-Delayed Ad Conversions Prediction Model for Bidding Strategy
张量的图形符号基础:展开、计算与分解 Tatsuya Yokota PDF N/A Very Basics of Tensors with Graphical Notations: Unfolding, Calculations, and Decompositions