跳转至

Arxiv 2024-12-23 Papers

标题 作者 PDF链接 代码仓库 Title
FaceLift:单张图像生成3D头部模型并结合视图生成与GS-LRM技术 Weijie Lyu PDF N/A FaceLift: Single Image to 3D Head with View Generation and GS-LRM
ChatGarment:通过大型语言模型实现服装估算、生成与编辑 Siyuan Bian PDF N/A ChatGarment: Garment Estimation, Generation and Editing via Large Language Models
令牌统计变换器:通过变分速率降低实现线性时间注意力 Ziyang Wu PDF N/A Token Statistics Transformer: Linear-Time Attention via Variational Rate Reduction
Dora: 三维形状变分自编码器的采样与基准测试 Rui Chen PDF N/A Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders
跨视角参考多目标追踪 Sijia Chen PDF N/A Cross-View Referring Multi-Object Tracking
重建人物、地点和摄像机 Lea Müller PDF N/A Reconstructing People, Places, and Cameras
大动作视频自动编码与跨模态视频变分自编码器 Yazhou Xing PDF N/A Large Motion Video Autoencoding with Cross-modal Video VAE
GauSim:通过高斯模拟器将弹性物体注册到数字世界 Yidi Shao PDF N/A GauSim: Registering Elastic Objects into Digital World by Gaussian Simulator
探究不平衡效应对临床语言模型性能及人口统计学公平性的影响 Precious Jones PDF N/A Examining Imbalance Effects on Performance and Demographic Fairness of Clinical Language Models
综合多模态原型是用于大规模词汇目标检测的简单而有效的分类器 Yitong Chen PDF N/A Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection
使用基础模型自动化搜索人工生命 Akarsh Kumar PDF N/A Automating the Search for Artificial Life with Foundation Models
稳态变种的通用几何结构 Elisenda Feliu PDF N/A The generic geometry of steady state varieties
部分可观测协助游戏中的观察干扰 Scott Emmons PDF N/A Observation Interference in Partially Observable Assistance Games
记忆使计算具有普适性,还记得吗? Erik Garrison PDF N/A Memory makes computation universal, remember?
跨语言文本丰富的视觉理解:信息论视角 Xinmiao Yu PDF N/A Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective
PepTune:基于多目标引导离散扩散的全新治疗性肽生成 Sophia Tang PDF N/A PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion
一项关于KAN在语音增强中潜力的研究 Haoyang Li PDF N/A An Investigation on the Potential of KAN in Speech Enhancement
朝向结构保持的量子编码 Arthur J. Parzygnat PDF N/A Towards structure-preserving quantum encodings
ActiveGS:使用高斯喷洒进行主动场景重建 Liren Jin PDF N/A ActiveGS: Active Scene Reconstruction using Gaussian Splatting
研究小镇:人类研究社区的模拟器 Haofei Yu PDF N/A ResearchTown: Simulator of Human Research Community
HyperQ-Opt:用于超参数优化的Q学习 Md. Tarek Hasan PDF N/A HyperQ-Opt: Q-learning for Hyperparameter Optimization
使用伊藤密度估计器叠加扩散模型 Marta Skreta PDF N/A The Superposition of Diffusion Models Using the Itô Density Estimator
大型多模态模型数据集、应用类别及分类调查 Priyaranjan Pattnayak PDF N/A Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy
万一你错过了:ARC的“挑战”并没有那么具有挑战性 Łukasz Borchmann PDF N/A In Case You Missed It: ARC 'Challenge' Is Not That Challenging
在两臂最佳臂识别中的极小极大最优简单遗憾 Masahiro Kato PDF N/A Minimax Optimal Simple Regret in Two-Armed Best-Arm Identification
在潜在空间中通过可微缓存增强进行审议 Luyang Liu PDF N/A Deliberation in Latent Space via Differentiable Cache Augmentation
RepoTransBench:一个用于仓库级代码翻译的真实世界基准 Yanli Wang PDF N/A RepoTransBench: A Real-World Benchmark for Repository-Level Code Translation
YuLan-Mini:一个开放的高效数据语言模型 Yiwen Hu PDF N/A YuLan-Mini: An Open Data-efficient Language Model
参加推理:尝试理解标记的工作原理 Rui Qian PDF N/A Reasoning to Attend: Try to Understand How Token Works
敏感度曲线最大化:攻击分布式学习中的鲁棒聚合器 Christian A. Schroth PDF N/A Sensitivity Curve Maximization: Attacking Robust Aggregators in Distributed Learning
傅里叶位置嵌入:增强注意力周期扩展以实现长度泛化 Ermo Hua PDF N/A Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
上下文反向传播循环:通过迭代自上而下的反馈增强深度推理能力 Jacob Fein-Ashley PDF N/A Contextual Backpropagation Loops: Amplifying Deep Reasoning with Iterative Top-Down Feedback
LASE:学习邻接谱嵌入 Sofía Pérez Casulo PDF N/A LASE: Learned Adjacency Spectral Embeddings
Mimicking-Bench:通过模仿人类行为进行通用型人形-场景交互学习的基准测试 Yun Liu PDF N/A Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking
Chumor 2.0:迈向中文幽默理解基准测试 Ruiqi He PDF N/A Chumor 2.0: Towards Benchmarking Chinese Humor Understanding
通过思维链进行知识编辑 Changyue Wang PDF N/A Knowledge Editing through Chain-of-Thought
VidTwin: 视频变分自编码器与解耦结构和动态 Yuchi Wang PDF N/A VidTwin: Video VAE with Decoupled Structure and Dynamics
异步联邦学习:一种适用于去中心化机器学习的可扩展方法 Ali Forootani PDF N/A Asynchronous Federated Learning: A Scalable Approach for Decentralized Machine Learning
通过近似基于核的广义评分函数实现快速因果发现,具有线性计算复杂度 Yixin Ren PDF N/A Fast Causal Discovery by Approximate Kernel-based Generalized Score Functions with Linear Computational Complexity
GaussianPainter:通过法线引导将点云绘制成3D高斯分布 Jingqiu Zhou PDF N/A GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
SMAC-Hard:在SMAC上启用混合对手策略脚本和自我对弈 Yue Deng PDF N/A SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC
从模型到微观理论:提炼模型的主题知识以用于基于事实的问题回答 Nathaniel Weir PDF N/A From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering
MRANet:一种用于肺和结肠癌分类的改进残差注意力网络 Diponkor Bala PDF N/A MRANet: A Modified Residual Attention Networks for Lung and Colon Cancer Classification
在城市数字孪生中建立现实与虚拟的互联,以实现卓越的智能道路检测 Yikang Zhang PDF N/A Establishing Reality-Virtuality Interconnections in Urban Digital Twins for Superior Intelligent Road Inspection
通过逻辑理解直接偏好对齐 Kyle Richardson PDF N/A Understanding the Logic of Direct Preference Alignment through Logic
FedTLU:具有目标层更新的联邦学习 Jong-Ik Park PDF N/A FedTLU: Federated Learning with Targeted Layer Updates
RAGONITE:基于诱导数据库和口语化RDF的迭代检索,用于在知识图谱上进行对话式问答 Rishiraj Saha Roy PDF N/A RAGONITE: Iterative Retrieval on Induced Databases and Verbalized RDF for Conversational QA over KGs with RAG
大型语言模型安全性:全面综述 Dan Shi PDF N/A Large Language Model Safety: A Holistic Survey
COBRA:用于少样本学习的组合检索增强 Arnav M. Das PDF N/A COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Learning
EPE-P:基于证据的参数高效提示,用于多模态学习中的缺失模态处理 Zhe Chen PDF N/A EPE-P: Evidence-based Parameter-efficient Prompting for Multimodal Learning with Missing Modalities
一种无偏训练范式,用于更通用的AI生成图像检测 Fabrizio Guillaro PDF N/A A Bias-Free Training Paradigm for More General AI-generated Image Detection
使用大型语言模型生成布洛卡失语症碎片句的完整句子 Sijbren van Vaals PDF N/A Generating Completions for Fragmented Broca's Aphasic Sentences Using Large Language Models
增强尖峰神经网络中的时间处理能力以利用三维卷积进行静态物体检测 Huaxu He PDF N/A Enhanced Temporal Processing in Spiking Neural Networks for Static Object Detection Using 3D Convolutions
基准测试用于深度学习测试输入生成的生成式AI模型 Maryam PDF N/A Benchmarking Generative AI Models for Deep Learning Test Input Generation
检测对话中的焦虑和抑郁:一种多标签且可解释的方法 Francisco de Arriba-Pérez PDF N/A Detecting anxiety and depression in dialogues: a multi-label and explainable approach
一个利用条件熵优化的多视图聚类自适应框架 Lijian Li PDF N/A An Adaptive Framework for Multi-View Clustering Leveraging Conditional Entropy Optimization
递归训练中的模型崩溃率 Ananda Theertha Suresh PDF N/A Rate of Model Collapse in Recursive Training
DreamFit:通过轻量级Anything-Dressing编码器实现以服装为中心的人体生成 Ente Lin PDF N/A DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder
利用知识图谱推进机器学习研究 Jing Si PDF N/A Advances in Machine Learning Research Using Knowledge Graphs
无监督动作分割的分层向量量化 Federico Spurio PDF N/A Hierarchical Vector Quantization for Unsupervised Action Segmentation
SCBench:一个面向视频大型语言模型的体育解说基准 Kuangzhi Ge PDF N/A SCBench: A Sports Commentary Benchmark for Video LLMs
LangSurf: 用于三维场景理解的语言嵌入表面高斯方法 Hao Li PDF N/A LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding
ANID:我们还有多远?通过多模态指导评估AI合成图像与自然图像之间的差异 Renyang Liu PDF N/A ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance
细节保留的潜在扩散模型用于稳定阴影去除 Jiamin Xu PDF N/A Detail-Preserving Latent Diffusion for Stable Shadow Removal
图神经网络是进化算法 Kaichen Ouyang PDF N/A Graph Neural Networks Are Evolutionary Algorithms
编辑辐射场的隐式与显式表示:一项综述 Arthur Hubert PDF N/A Editing Implicit and Explicit Representations of Radiance Fields: A Survey
追踪LLM训练中的特征动态:一项机制性研究 Yang Xu PDF N/A Tracking the Feature Dynamics in LLM Training: A Mechanistic Study
迈向一种高效求解参数化混合整数规划的无监督学习方案 Shiyuan Qu PDF N/A Towards An Unsupervised Learning Scheme for Efficiently Solving Parameterized Mixed-Integer Programs
比最多样化更进一步:生成模型的多样化混合在线选择 Parham Rezaei PDF N/A Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models
面向内核的图提示学习在小样本异常检测中的应用 Fenfang Tao PDF N/A Kernel-Aware Graph Prompt Learning for Few-Shot Anomaly Detection
面部表情分析及其在物联网系统中的潜力:当代综述 Zixuan Shanggua PDF N/A Facial Expression Analysis and Its Potentials in IoT Systems: A Contemporary Survey
大型语言模型的安全挑战初现 Herve Debar PDF N/A Emerging Security Challenges of Large Language Models
稳定性是否可能有害?通过梯度下降的不稳定性实现更好的泛化 Lawrence Wang PDF N/A Can Stability be Detrimental? Better Generalization through Gradient Descent Instabilities
CoSurfGS:基于分布式学习的大规模场景重建协同三维表面高斯光栅化技术 Yuanyuan Gao PDF N/A CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction
个性化大型视觉-语言模型 Chau Pham PDF N/A Personalized Large Vision-Language Models
面向图的基础模型:预训练图神经网络跨数据集迁移的分析 Fabrizio Frasca PDF N/A Towards Foundation Models on Graphs: An Analysis on Cross-Dataset Transfer of Pretrained GNNs
SBS数据:从分阶段合成的图像中进行预训练的图表问答 Risa Shinoda PDF N/A SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images
EasyTime:让时间序列预测变得简单 Xiangfei Qiu PDF N/A EasyTime: Time Series Forecasting Made Easy
AFANet:用于弱监督少样本语义分割的自适应频率感知网络 Jiaqi Ma PDF N/A AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation
LiveIdeaBench:通过极少上下文评估大型语言模型的科学创造力和创意生成能力 Kai Ruan PDF N/A LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context
V$^2$-SfMLearner:为多模态无线胶囊内窥镜学习单目深度和自我运动 Long Bai PDF N/A V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy
调查文档级机器翻译中的长度问题 Ziqian Peng PDF N/A Investigating Length Issues in Document-level Machine Translation
图大小不平衡学习与能量引导结构平滑 Jiawen Qin PDF N/A Graph Size-imbalanced Learning with Energy-guided Structural Smoothing
PC代理:在你沉睡时,AI正在工作——一场深入数字世界的认知之旅 Yanheng He PDF N/A PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World
使用参数高效的深度学习框架改进棉花叶病分类 Aswini Kumar Patra PDF N/A Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework
通过模型和度量集成提升脑部MRI中的基于重建的分布外检测 Evi M. C. Huijben PDF N/A Enhancing Reconstruction-Based Out-of-Distribution Detection in Brain MRI with Model and Metric Ensembles
使用进化算法进行量子时间序列学习 Vignesh Anantharamakrishnan PDF N/A Quantum Time-Series Learning with Evolutionary Algorithms
HumanVBench:利用合成基准数据探索MLLMs在以人为中心的视频理解方面的能力 Ting Zhou PDF N/A HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data
URoadNet:用于多尺度道路网络提取的双稀疏注意力U-Net Jie Song PDF N/A URoadNet: Dual Sparse Attentive U-Net for Multiscale Road Network Extraction
使用情感偏好优化和Mamba压缩器在视听对话中实现共情响应 Yeonju Kim PDF N/A Empathetic Response in Audio-Visual Conversations Using Emotion Preference Optimization and MambaCompressor
HPCNeuroNet:一种将SNN时间动态与Transformer注意力机制融合的神经形态方法,用于基于FPGA的粒子物理学研究 Murat Isik PDF N/A HPCNeuroNet: A Neuromorphic Approach Merging SNN Temporal Dynamics with Transformer Attention for FPGA-based Particle Physics
高级掩码自编码器学习的动态双雄:协作掩码与目标 Shentong Mo PDF N/A The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning
在不同学习环境下评估生物启发模型在网络流量预测中的能效 Theodoros Tsiolakis PDF N/A Evaluation of Bio-Inspired Models under Different Learning Settings For Energy Efficiency in Network Traffic Prediction
ERUPD -- 英语到罗马乌尔都语平行数据集 Mohammed Furqan PDF N/A ERUPD -- English to Roman Urdu Parallel Dataset
S-INF:通过场景隐式神经场实现逼真的室内场景合成 Zixi Liang PDF N/A S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field
GQSA:用于加速大型语言模型推理的组量化与稀疏化 Chao Zeng PDF N/A GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
一种基于卷积神经网络的多基因风险预测肾结石形成的方法 Amr Salem PDF N/A A CNN Approach to Polygenic Risk Prediction of Kidney Stone Formation
大型语言模型中的查询优化研究综述 Mingyang Song PDF N/A A Survey of Query Optimization in Large Language Models
莎士比亚十四行诗与泰勒·斯威夫特歌词相似度评分中文档级嵌入方法的比较分析 Klara Kramer PDF N/A Comparative Analysis of Document-Level Embedding Methods for Similarity Scoring on Shakespeare Sonnets and Taylor Swift Lyrics
资源感知的阿拉伯语大型语言模型创建:模型适配、集成与多领域测试 Prakash Aryan PDF N/A Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing
概率密度感知半监督学习 Shuyang Liu PDF N/A Probability-density-aware Semi-supervised Learning
保留分数:量化视觉语言模型的越狱风险 Zaitang Li PDF N/A Retention Score: Quantifying Jailbreak Risks for Vision Language Models
利用心血管模拟进行心脏生物标志物的体内预测 Laura Manduchi PDF N/A Leveraging Cardiovascular Simulations for In-Vivo Prediction of Cardiac Biomarkers
深度神经网络中的概念发现用于可解释的人脸反欺骗 Haoyuan Zhang PDF N/A Concept Discovery in Deep Neural Networks for Explainable Face Anti-Spoofing
WildPPG:一个包含长时间连续记录的真实世界PPG数据集 Manuel Meier PDF N/A WildPPG: A Real-World PPG Dataset of Long Continuous Recordings
领域适应机器翻译:灾难性遗忘遗忘了什么以及为什么? Danielle Saunders PDF N/A Domain adapted machine translation: What does catastrophic forgetting forget and why?
CiteBART:学习为本地引文推荐生成引文 Ege Yiğit Çelik PDF N/A CiteBART: Learning to Generate Citations for Local Citation Recommendation
《闭门之语:创建与探索波兰情色话语的forePLay注释数据集》 Anna Kołos PDF N/A Behind Closed Words: Creating and Investigating the forePLay Annotated Dataset for Polish Erotic Discourse
探索电影制作中的动态新颖视角合成技术 Adrian Azzarelli PDF N/A Exploring Dynamic Novel View Synthesis Technologies for Cinematography
双重地雷:基于双触发机制的隐形文本后门攻击 Yang Hou PDF N/A Double Landmines: Invisible Textual Backdoor Attacks based on Dual-Trigger
通过可解释且可信赖的深度学习模型提升癌症诊断 Badaru I. Olumuyiwa PDF N/A Enhancing Cancer Diagnosis with Explainable & Trustworthy Deep Learning Models
STAHGNet:高效建模混合粒度异质依赖性以用于交通预测 Jiyao Wang PDF N/A STAHGNet: Modeling Hybrid-grained Heterogenous Dependency Efficiently for Traffic Prediction
构建公平的潜在空间以实现公平性与可解释性的交叉 Hyungjun Joo PDF N/A Constructing Fair Latent Space for Intersection of Fairness and Explainability
DiffusionAttacker:用于LLM越狱的扩散驱动提示操控 Hao Wang PDF N/A DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak
神经算子的最优收敛速度 Mike Nguyen PDF N/A Optimal Convergence Rates for Neural Operators
用于基于FMCW毫米波雷达的现实世界人体动作检测的数据集 Dylan jayabahu PDF N/A Dataset for Real-World Human Action Detection Using FMCW mmWave Radar
BEE:通过基线探索-利用实现度量适应性解释 Oren Barkan PDF N/A BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation
一种利用多元信息评分进行祖先图的高效搜索评分算法 Nikita Lagrange PDF N/A An efficient search-and-score algorithm for ancestral graphs using multivariate information scores
基于深度学习的卫星基本气候变量不确定性 Junyang Gou PDF N/A Uncertainties of Satellite-based Essential Climate Variables from Deep Learning
多即是少?基于模拟的方法探讨多模态模型中偏差间的动态交互 Mounia Drissi PDF N/A More is Less? A Simulation-Based Approach to Dynamic Interactions between Biases in Multimodal Models
基于人类反馈和产品一致性的产品图像背景修复评估框架 Yuqi Liang PDF N/A An Evaluation Framework for Product Images Background Inpainting based on Human Feedback and Product Consistency
改进潜在神经随机微分方程的噪声估计 Linus Heck PDF N/A Improving the Noise Estimation of Latent Neural Stochastic Differential Equations
DRT-o1:通过长链思维优化深度推理翻译 Jiaan Wang PDF N/A DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought
使用YCbCr色彩空间进行引导的真实图像去雾 Wenxuan Fang PDF N/A Guided Real Image Dehazing using YCbCr Color Space
虚拟现实数据收集工具包 Tim Rolff PDF N/A A Toolkit for Virtual Reality Data Collection
DeepMF:闭环安全关键驾驶场景仿真的深度运动分解 Yizhe Li PDF N/A DeepMF: Deep Motion Factorization for Closed-Loop Safety-Critical Driving Scenario Simulation
当前学生是否大规模使用ChatGPT?关于ChatGPT等大型语言模型在教育环境中使用情况的调查 Jérémie Sublime PDF N/A Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings
面向GPU数据中心的功耗与碎片感知的在线调度 Francesco Lettich PDF N/A Power- and Fragmentation-aware Online Scheduling for GPU Datacenters
银弹还是全神贯注的妥协?基于Gist Token的上下文压缩全面研究 Chenlong Deng PDF N/A A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression
《多生成智能体系统综述:最新进展与新前沿》 Shuaihang Chen PDF N/A A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers
信号转换在多通道信号处理中的有效性 Sunil Kumar Kopparapu PDF N/A Signal Transformation for Effective Multi-Channel Signal Processing
预测压缩图像的满意用户与机器比例:一种统一的方法 Qi Zhang PDF N/A Predicting Satisfied User and Machine Ratio for Compressed Images: A Unified Approach
线图Vietoris-Rips持久性图用于拓扑图表示学习 Jaesun Shin PDF N/A Line Graph Vietoris-Rips Persistence Diagram for Topological Graph Representation Learning
CALLIC:无损图像压缩的内容自适应学习 Daxin Li PDF N/A CALLIC: Content Adaptive Learning for Lossless Image Compression
工业异常检测中的渐进边界引导异常合成 Qiyu Chen PDF N/A Progressive Boundary Guided Anomaly Synthesis for Industrial Anomaly Detection
早期婴儿单语和双语语音持续学习的发展性预测编码模型 Xiaodan Chen PDF N/A Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning
从总结数据中学习:基于样本准似然的Gaussian过程回归 Yuta Shikuri PDF N/A Learning from Summarized Data: Gaussian Process Regression with Sample Quasi-Likelihood
基于时间卷积网络的网络入侵检测方法 Rukmini Nazre PDF N/A A Temporal Convolutional Network-based Approach for Network Intrusion Detection
深入探讨多模态推理的自进化训练 Wei Liu PDF N/A Diving into Self-Evolving Training for Multimodal Reasoning
在心理治疗环境中应用大语言模型与主题建模 Alexander Vanin PDF N/A Applying LLM and Topic Modelling in Psychotherapeutic Contexts
XAI在转变航空航天系统中的作用 Francisco Javier Cantero Zorita PDF N/A The Role of XAI in Transforming Aeronautics and Aerospace Systems
基于马尔可夫过程的图卷积网络用于知识图谱中的实体分类 Johannes Mäkelburg PDF N/A Markov Process-Based Graph Convolutional Networks for Entity Classification in Knowledge Graphs
神经连续时间上鞅证书 Grigory Neustroev PDF N/A Neural Continuous-Time Supermartingale Certificates
衡量面向儿童的文本中的上下文信息量 Maria Valentini PDF N/A Measuring Contextual Informativeness in Child-Directed Text
多模态偏好数据与奖励模型的合成对齐 Robert Wijaya PDF N/A Multimodal Preference Data Synthetic Alignment with Reward Model
VidCtx:利用图像模型实现上下文感知的视频问答 Andreas Goulas PDF N/A VidCtx: Context-aware Video Question Answering with Image Models
使用随机噪声进行预训练以实现不确定性校准 Jeonghwan Cheon PDF N/A Pretraining with random noise for uncertainty calibration
正是你所期望的:通过自我反思实现约束时间线摘要,以增强相关性 Muhammad Reza Qorib PDF N/A Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance
证据理论不确定性对训练目标检测模型的影响 M. Tahasanul Ibrahim PDF N/A Impact of Evidence Theory Uncertainty on Training Object Detection Models
BrainMAP:在大脑网络中学习多重激活路径 Song Wang PDF N/A BrainMAP: Learning Multiple Activation Pathways in Brain Networks
学习红外小目标检测的动态局部上下文表示 Guoyi Zhang PDF N/A Learning Dynamic Local Context Representations for Infrared Small Target Detection
通过迭代偏好学习增强蒙特卡洛树搜索推理中的内在自我修正能力 Huchen Jiang PDF N/A Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning
WarriorCoder:从专家对决中学习以增强代码大型语言模型 Huawen Feng PDF N/A WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models
PointVoxelFormer -- 复兴点云网络用于三维医学影像 Mattias Paul Heinrich PDF N/A PointVoxelFormer -- Reviving point cloud networks for 3D medical imaging
奇异值缩放:通过剪枝权重精炼实现高效生成模型压缩 Hyeonjin Kim PDF N/A Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement
交织记忆:暹罗大型语言模型 Xin Song PDF N/A Interweaving Memories of a Siamese Large Language Model
平衡的3DGS:基于高斯并行性的精细分块渲染 Hao Gui PDF N/A Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling
一种即插即用的野外高难度动作物理恢复方法 Youliang Zhang PDF N/A A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions
人工智能能有多环保?一项关于机器学习环境影响趋势的研究 Clément Morand PDF N/A How Green Can AI Be? A Study of Trends in Machine Learning Environmental Impacts
FRTP:联合路由搜索记录以增强长期交通预测 Hangli Ge PDF N/A FRTP: Federating Route Search Records to Enhance Long-term Traffic Prediction
FlowMamba:通过全局运动传播学习点云场景流 Min Lin PDF N/A FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation
通过迭代和选择性地从数据中学习来提升大语言模型 Qi Jia PDF N/A Boosting LLM via Learning from Data Iteratively and Selectively
用于信息检索的文本嵌入模型高效微调方法:对比学习惩罚(CLP) Jeongsu Yu PDF N/A Efficient fine-tuning methodology of text embedding models for information retrieval: contrastive learning penalty (clp)
一种基于情感的文本分类中日语分词器的实验评估 Andre Rusli PDF N/A An Experimental Evaluation of Japanese Tokenizers for Sentiment-Based Text Classification
分层获取受限贝叶斯优化:应用于模拟电路 Ria Rashid PDF N/A Tiered Acquisition for Constrained Bayesian Optimization: An Application to Analog Circuits
通过信息瓶颈实现的双向多尺度图数据集压缩 Xingcheng Fu PDF N/A Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck
DiffFormer:一种用于高光谱图像分类的微分空间-光谱变换器 Muhammad Ahmad PDF N/A DiffFormer: a Differential Spatial-Spectral Transformer for Hyperspectral Image Classification
蛋白质组学信息学中的深度学习:应用、挑战与未来方向 Yindan Luo PDF N/A Deep Learning in Proteomics Informatics: Applications, Challenges, and Future Directions
折纸:一种用于从半结构化数据进行预测的生成式变压器架构 Thomas Rückstieß PDF N/A ORIGAMI: A generative transformer architecture for predictions from semi-structured data
基于LSTM的三分类文本情感分析 Yin Qixuan PDF N/A Three-Class Text Sentiment Analysis Based on LSTM
FFA Sora,将视频生成作为眼底荧光素血管造影模拟器 Xinyuan Wu PDF N/A FFA Sora, video generation as fundus fluorescein angiography simulator
关于描述逻辑概念的示例的效力与局限性 Balder ten Cate PDF N/A On the Power and Limitations of Examples for Description Logic Concepts
专注于调整策略以达到目标的强化学习 Akane Tsuboya PDF N/A Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets
MineAgent:利用多模态大型语言模型进行遥感矿产勘探 Beibei Yu PDF N/A MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models
通过主题对比学习提升神经主题模型的主题可解释性 Xin Gao PDF N/A Enhancing Topic Interpretability for Neural Topic Modeling through Topic-wise Contrastive Learning
神经-MCRL:基于脑电图的视觉解码的多模态对比表示学习 Yueyang Li PDF N/A Neural-MCRL: Neural Multimodal Contrastive Representation Learning for EEG-based Visual Decoding
APEX$^2$:个性化知识图谱的自适应和极值摘要 Zihao Li PDF N/A APEX$^2$: Adaptive and Extreme Summarization for Personalized Knowledge Graphs
完整实现WXF中国象棋规则 Daniel Tan PDF N/A Complete Implementation of WXF Chinese Chess Rules
基于扩散模型的宽带地面运动合成,条件极简 Jaeheun Jung PDF N/A Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition
使用大型语言模型的双视角隐喻检测框架 Yujie Lin PDF N/A A Dual-Perspective Metaphor Detection Framework Using Large Language Models
用于半监督语义分割的不确定性-参与上下文一致性学习 Jianjian Yin PDF N/A Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation
EcoSearch:一种用于程序合成的恒定延迟最佳优先搜索算法 Théo Matricon PDF N/A EcoSearch: A Constant-Delay Best-First Search Algorithm for Program Synthesis
基于特征的方法在目标检测中的领域自适应:综述论文 Helia Mohamadi PDF N/A Feature Based Methods Domain Adaptation for Object Detection: A Review Paper
xPatch:基于指数季节性趋势分解的双流时间序列预测 Artyom Stitsyuk PDF N/A xPatch: Dual-Stream Time Series Forecasting with Exponential Seasonal-Trend Decomposition
通过基于压缩的编辑距离评估人类对LLM生成文本的编辑工作量 Nicolas Devatine PDF N/A Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance
更好的知识增强用于保护隐私的跨项目缺陷预测 Yuying Wang PDF N/A Better Knowledge Enhancement for Privacy-Preserving Cross-Project Defect Prediction
快速计算RoPE注意力的时间复杂度接近线性 Yifang Chen PDF N/A Fast Gradient Computation for RoPE Attention in Almost Linear Time
CodeV:通过视觉数据解决问题 Linhao Zhang PDF N/A CodeV: Issue Resolving with Visual Data
通过深度学习和ResNeXt进行金融数据挖掘的协作优化 Pengbin Feng PDF N/A Collaborative Optimization in Financial Data Mining Through Deep Learning and ResNeXt
通过Stein变分超网络改进昂贵的多目标优化的Pareto集学习 Minh-Duc Nguyen PDF N/A Improving Pareto Set Learning for Expensive Multi-objective Optimization via Stein Variational Hypernetworks
基于内容和上下文嵌入的流行度估计和新捆绑包生成 Ashutosh Nayak PDF N/A Popularity Estimation and New Bundle Generation using Content and Context based Embeddings
多重一致性引导的无监督音频测试时适应对比音频-语言模型 Gongyu Chen PDF N/A Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio
FedLEC:在标签偏斜情况下,利用脉冲神经网络实现有效联邦学习的算法 Di Yu PDF N/A FedLEC: Effective Federated Learning Algorithm with Spiking Neural Networks Under Label Skews
视觉-语言模型在时间序列分类中的可行性研究 Vinay Prithyani PDF N/A On the Feasibility of Vision-Language Models for Time-Series Classification
用于红外小目标检测的神经时空张量表示 Fengyi Wu PDF N/A Neural Spatial-Temporal Tensor Representation for Infrared Small Target Detection
计算环境中的资源优化动态调度策略 Xiaoye Wang PDF N/A Dynamic Scheduling Strategies for Resource Optimization in Computing Environments
从架构角度重新审视用于3D异常检测的多模态融合 Kaifang Long PDF N/A Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective
Friends-MMC:一个用于多模态多方对话理解的数据集 Yueqian Wang PDF N/A Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding
AV-EmoDialog:利用情感线索与视听用户进行对话 Se Jin Park PDF N/A AV-EmoDialog: Chat with Audio-Visual Users Leveraging Emotional Cues
自由视角人体动画与姿态相关参考选择 Fa-Ting Hong PDF N/A Free-viewpoint Human Animation with Pose-correlated Reference Selection