| FaceLift:单张图像生成3D头部模型并结合视图生成与GS-LRM技术 |
Weijie Lyu |
PDF |
N/A |
FaceLift: Single Image to 3D Head with View Generation and GS-LRM |
| ChatGarment:通过大型语言模型实现服装估算、生成与编辑 |
Siyuan Bian |
PDF |
N/A |
ChatGarment: Garment Estimation, Generation and Editing via Large Language Models |
| 令牌统计变换器:通过变分速率降低实现线性时间注意力 |
Ziyang Wu |
PDF |
N/A |
Token Statistics Transformer: Linear-Time Attention via Variational Rate Reduction |
| Dora: 三维形状变分自编码器的采样与基准测试 |
Rui Chen |
PDF |
N/A |
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders |
| 跨视角参考多目标追踪 |
Sijia Chen |
PDF |
N/A |
Cross-View Referring Multi-Object Tracking |
| 重建人物、地点和摄像机 |
Lea Müller |
PDF |
N/A |
Reconstructing People, Places, and Cameras |
| 大动作视频自动编码与跨模态视频变分自编码器 |
Yazhou Xing |
PDF |
N/A |
Large Motion Video Autoencoding with Cross-modal Video VAE |
| GauSim:通过高斯模拟器将弹性物体注册到数字世界 |
Yidi Shao |
PDF |
N/A |
GauSim: Registering Elastic Objects into Digital World by Gaussian Simulator |
| 探究不平衡效应对临床语言模型性能及人口统计学公平性的影响 |
Precious Jones |
PDF |
N/A |
Examining Imbalance Effects on Performance and Demographic Fairness of Clinical Language Models |
| 综合多模态原型是用于大规模词汇目标检测的简单而有效的分类器 |
Yitong Chen |
PDF |
N/A |
Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection |
| 使用基础模型自动化搜索人工生命 |
Akarsh Kumar |
PDF |
N/A |
Automating the Search for Artificial Life with Foundation Models |
| 稳态变种的通用几何结构 |
Elisenda Feliu |
PDF |
N/A |
The generic geometry of steady state varieties |
| 部分可观测协助游戏中的观察干扰 |
Scott Emmons |
PDF |
N/A |
Observation Interference in Partially Observable Assistance Games |
| 记忆使计算具有普适性,还记得吗? |
Erik Garrison |
PDF |
N/A |
Memory makes computation universal, remember? |
| 跨语言文本丰富的视觉理解:信息论视角 |
Xinmiao Yu |
PDF |
N/A |
Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective |
| PepTune:基于多目标引导离散扩散的全新治疗性肽生成 |
Sophia Tang |
PDF |
N/A |
PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion |
| 一项关于KAN在语音增强中潜力的研究 |
Haoyang Li |
PDF |
N/A |
An Investigation on the Potential of KAN in Speech Enhancement |
| 朝向结构保持的量子编码 |
Arthur J. Parzygnat |
PDF |
N/A |
Towards structure-preserving quantum encodings |
| ActiveGS:使用高斯喷洒进行主动场景重建 |
Liren Jin |
PDF |
N/A |
ActiveGS: Active Scene Reconstruction using Gaussian Splatting |
| 研究小镇:人类研究社区的模拟器 |
Haofei Yu |
PDF |
N/A |
ResearchTown: Simulator of Human Research Community |
| HyperQ-Opt:用于超参数优化的Q学习 |
Md. Tarek Hasan |
PDF |
N/A |
HyperQ-Opt: Q-learning for Hyperparameter Optimization |
| 使用伊藤密度估计器叠加扩散模型 |
Marta Skreta |
PDF |
N/A |
The Superposition of Diffusion Models Using the Itô Density Estimator |
| 大型多模态模型数据集、应用类别及分类调查 |
Priyaranjan Pattnayak |
PDF |
N/A |
Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy |
| 万一你错过了:ARC的“挑战”并没有那么具有挑战性 |
Łukasz Borchmann |
PDF |
N/A |
In Case You Missed It: ARC 'Challenge' Is Not That Challenging |
| 在两臂最佳臂识别中的极小极大最优简单遗憾 |
Masahiro Kato |
PDF |
N/A |
Minimax Optimal Simple Regret in Two-Armed Best-Arm Identification |
| 在潜在空间中通过可微缓存增强进行审议 |
Luyang Liu |
PDF |
N/A |
Deliberation in Latent Space via Differentiable Cache Augmentation |
| RepoTransBench:一个用于仓库级代码翻译的真实世界基准 |
Yanli Wang |
PDF |
N/A |
RepoTransBench: A Real-World Benchmark for Repository-Level Code Translation |
| YuLan-Mini:一个开放的高效数据语言模型 |
Yiwen Hu |
PDF |
N/A |
YuLan-Mini: An Open Data-efficient Language Model |
| 参加推理:尝试理解标记的工作原理 |
Rui Qian |
PDF |
N/A |
Reasoning to Attend: Try to Understand How Token Works |
| 敏感度曲线最大化:攻击分布式学习中的鲁棒聚合器 |
Christian A. Schroth |
PDF |
N/A |
Sensitivity Curve Maximization: Attacking Robust Aggregators in Distributed Learning |
| 傅里叶位置嵌入:增强注意力周期扩展以实现长度泛化 |
Ermo Hua |
PDF |
N/A |
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization |
| 上下文反向传播循环:通过迭代自上而下的反馈增强深度推理能力 |
Jacob Fein-Ashley |
PDF |
N/A |
Contextual Backpropagation Loops: Amplifying Deep Reasoning with Iterative Top-Down Feedback |
| LASE:学习邻接谱嵌入 |
Sofía Pérez Casulo |
PDF |
N/A |
LASE: Learned Adjacency Spectral Embeddings |
| Mimicking-Bench:通过模仿人类行为进行通用型人形-场景交互学习的基准测试 |
Yun Liu |
PDF |
N/A |
Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking |
| Chumor 2.0:迈向中文幽默理解基准测试 |
Ruiqi He |
PDF |
N/A |
Chumor 2.0: Towards Benchmarking Chinese Humor Understanding |
| 通过思维链进行知识编辑 |
Changyue Wang |
PDF |
N/A |
Knowledge Editing through Chain-of-Thought |
| VidTwin: 视频变分自编码器与解耦结构和动态 |
Yuchi Wang |
PDF |
N/A |
VidTwin: Video VAE with Decoupled Structure and Dynamics |
| 异步联邦学习:一种适用于去中心化机器学习的可扩展方法 |
Ali Forootani |
PDF |
N/A |
Asynchronous Federated Learning: A Scalable Approach for Decentralized Machine Learning |
| 通过近似基于核的广义评分函数实现快速因果发现,具有线性计算复杂度 |
Yixin Ren |
PDF |
N/A |
Fast Causal Discovery by Approximate Kernel-based Generalized Score Functions with Linear Computational Complexity |
| GaussianPainter:通过法线引导将点云绘制成3D高斯分布 |
Jingqiu Zhou |
PDF |
N/A |
GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance |
| SMAC-Hard:在SMAC上启用混合对手策略脚本和自我对弈 |
Yue Deng |
PDF |
N/A |
SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC |
| 从模型到微观理论:提炼模型的主题知识以用于基于事实的问题回答 |
Nathaniel Weir |
PDF |
N/A |
From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering |
| MRANet:一种用于肺和结肠癌分类的改进残差注意力网络 |
Diponkor Bala |
PDF |
N/A |
MRANet: A Modified Residual Attention Networks for Lung and Colon Cancer Classification |
| 在城市数字孪生中建立现实与虚拟的互联,以实现卓越的智能道路检测 |
Yikang Zhang |
PDF |
N/A |
Establishing Reality-Virtuality Interconnections in Urban Digital Twins for Superior Intelligent Road Inspection |
| 通过逻辑理解直接偏好对齐 |
Kyle Richardson |
PDF |
N/A |
Understanding the Logic of Direct Preference Alignment through Logic |
| FedTLU:具有目标层更新的联邦学习 |
Jong-Ik Park |
PDF |
N/A |
FedTLU: Federated Learning with Targeted Layer Updates |
| RAGONITE:基于诱导数据库和口语化RDF的迭代检索,用于在知识图谱上进行对话式问答 |
Rishiraj Saha Roy |
PDF |
N/A |
RAGONITE: Iterative Retrieval on Induced Databases and Verbalized RDF for Conversational QA over KGs with RAG |
| 大型语言模型安全性:全面综述 |
Dan Shi |
PDF |
N/A |
Large Language Model Safety: A Holistic Survey |
| COBRA:用于少样本学习的组合检索增强 |
Arnav M. Das |
PDF |
N/A |
COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Learning |
| EPE-P:基于证据的参数高效提示,用于多模态学习中的缺失模态处理 |
Zhe Chen |
PDF |
N/A |
EPE-P: Evidence-based Parameter-efficient Prompting for Multimodal Learning with Missing Modalities |
| 一种无偏训练范式,用于更通用的AI生成图像检测 |
Fabrizio Guillaro |
PDF |
N/A |
A Bias-Free Training Paradigm for More General AI-generated Image Detection |
| 使用大型语言模型生成布洛卡失语症碎片句的完整句子 |
Sijbren van Vaals |
PDF |
N/A |
Generating Completions for Fragmented Broca's Aphasic Sentences Using Large Language Models |
| 增强尖峰神经网络中的时间处理能力以利用三维卷积进行静态物体检测 |
Huaxu He |
PDF |
N/A |
Enhanced Temporal Processing in Spiking Neural Networks for Static Object Detection Using 3D Convolutions |
| 基准测试用于深度学习测试输入生成的生成式AI模型 |
Maryam |
PDF |
N/A |
Benchmarking Generative AI Models for Deep Learning Test Input Generation |
| 检测对话中的焦虑和抑郁:一种多标签且可解释的方法 |
Francisco de Arriba-Pérez |
PDF |
N/A |
Detecting anxiety and depression in dialogues: a multi-label and explainable approach |
| 一个利用条件熵优化的多视图聚类自适应框架 |
Lijian Li |
PDF |
N/A |
An Adaptive Framework for Multi-View Clustering Leveraging Conditional Entropy Optimization |
| 递归训练中的模型崩溃率 |
Ananda Theertha Suresh |
PDF |
N/A |
Rate of Model Collapse in Recursive Training |
| DreamFit:通过轻量级Anything-Dressing编码器实现以服装为中心的人体生成 |
Ente Lin |
PDF |
N/A |
DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder |
| 利用知识图谱推进机器学习研究 |
Jing Si |
PDF |
N/A |
Advances in Machine Learning Research Using Knowledge Graphs |
| 无监督动作分割的分层向量量化 |
Federico Spurio |
PDF |
N/A |
Hierarchical Vector Quantization for Unsupervised Action Segmentation |
| SCBench:一个面向视频大型语言模型的体育解说基准 |
Kuangzhi Ge |
PDF |
N/A |
SCBench: A Sports Commentary Benchmark for Video LLMs |
| LangSurf: 用于三维场景理解的语言嵌入表面高斯方法 |
Hao Li |
PDF |
N/A |
LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding |
| ANID:我们还有多远?通过多模态指导评估AI合成图像与自然图像之间的差异 |
Renyang Liu |
PDF |
N/A |
ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance |
| 细节保留的潜在扩散模型用于稳定阴影去除 |
Jiamin Xu |
PDF |
N/A |
Detail-Preserving Latent Diffusion for Stable Shadow Removal |
| 图神经网络是进化算法 |
Kaichen Ouyang |
PDF |
N/A |
Graph Neural Networks Are Evolutionary Algorithms |
| 编辑辐射场的隐式与显式表示:一项综述 |
Arthur Hubert |
PDF |
N/A |
Editing Implicit and Explicit Representations of Radiance Fields: A Survey |
| 追踪LLM训练中的特征动态:一项机制性研究 |
Yang Xu |
PDF |
N/A |
Tracking the Feature Dynamics in LLM Training: A Mechanistic Study |
| 迈向一种高效求解参数化混合整数规划的无监督学习方案 |
Shiyuan Qu |
PDF |
N/A |
Towards An Unsupervised Learning Scheme for Efficiently Solving Parameterized Mixed-Integer Programs |
| 比最多样化更进一步:生成模型的多样化混合在线选择 |
Parham Rezaei |
PDF |
N/A |
Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models |
| 面向内核的图提示学习在小样本异常检测中的应用 |
Fenfang Tao |
PDF |
N/A |
Kernel-Aware Graph Prompt Learning for Few-Shot Anomaly Detection |
| 面部表情分析及其在物联网系统中的潜力:当代综述 |
Zixuan Shanggua |
PDF |
N/A |
Facial Expression Analysis and Its Potentials in IoT Systems: A Contemporary Survey |
| 大型语言模型的安全挑战初现 |
Herve Debar |
PDF |
N/A |
Emerging Security Challenges of Large Language Models |
| 稳定性是否可能有害?通过梯度下降的不稳定性实现更好的泛化 |
Lawrence Wang |
PDF |
N/A |
Can Stability be Detrimental? Better Generalization through Gradient Descent Instabilities |
| CoSurfGS:基于分布式学习的大规模场景重建协同三维表面高斯光栅化技术 |
Yuanyuan Gao |
PDF |
N/A |
CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction |
| 个性化大型视觉-语言模型 |
Chau Pham |
PDF |
N/A |
Personalized Large Vision-Language Models |
| 面向图的基础模型:预训练图神经网络跨数据集迁移的分析 |
Fabrizio Frasca |
PDF |
N/A |
Towards Foundation Models on Graphs: An Analysis on Cross-Dataset Transfer of Pretrained GNNs |
| SBS数据:从分阶段合成的图像中进行预训练的图表问答 |
Risa Shinoda |
PDF |
N/A |
SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images |
| EasyTime:让时间序列预测变得简单 |
Xiangfei Qiu |
PDF |
N/A |
EasyTime: Time Series Forecasting Made Easy |
| AFANet:用于弱监督少样本语义分割的自适应频率感知网络 |
Jiaqi Ma |
PDF |
N/A |
AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation |
| LiveIdeaBench:通过极少上下文评估大型语言模型的科学创造力和创意生成能力 |
Kai Ruan |
PDF |
N/A |
LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context |
| V$^2$-SfMLearner:为多模态无线胶囊内窥镜学习单目深度和自我运动 |
Long Bai |
PDF |
N/A |
V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy |
| 调查文档级机器翻译中的长度问题 |
Ziqian Peng |
PDF |
N/A |
Investigating Length Issues in Document-level Machine Translation |
| 图大小不平衡学习与能量引导结构平滑 |
Jiawen Qin |
PDF |
N/A |
Graph Size-imbalanced Learning with Energy-guided Structural Smoothing |
| PC代理:在你沉睡时,AI正在工作——一场深入数字世界的认知之旅 |
Yanheng He |
PDF |
N/A |
PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World |
| 使用参数高效的深度学习框架改进棉花叶病分类 |
Aswini Kumar Patra |
PDF |
N/A |
Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework |
| 通过模型和度量集成提升脑部MRI中的基于重建的分布外检测 |
Evi M. C. Huijben |
PDF |
N/A |
Enhancing Reconstruction-Based Out-of-Distribution Detection in Brain MRI with Model and Metric Ensembles |
| 使用进化算法进行量子时间序列学习 |
Vignesh Anantharamakrishnan |
PDF |
N/A |
Quantum Time-Series Learning with Evolutionary Algorithms |
| HumanVBench:利用合成基准数据探索MLLMs在以人为中心的视频理解方面的能力 |
Ting Zhou |
PDF |
N/A |
HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data |
| URoadNet:用于多尺度道路网络提取的双稀疏注意力U-Net |
Jie Song |
PDF |
N/A |
URoadNet: Dual Sparse Attentive U-Net for Multiscale Road Network Extraction |
| 使用情感偏好优化和Mamba压缩器在视听对话中实现共情响应 |
Yeonju Kim |
PDF |
N/A |
Empathetic Response in Audio-Visual Conversations Using Emotion Preference Optimization and MambaCompressor |
| HPCNeuroNet:一种将SNN时间动态与Transformer注意力机制融合的神经形态方法,用于基于FPGA的粒子物理学研究 |
Murat Isik |
PDF |
N/A |
HPCNeuroNet: A Neuromorphic Approach Merging SNN Temporal Dynamics with Transformer Attention for FPGA-based Particle Physics |
| 高级掩码自编码器学习的动态双雄:协作掩码与目标 |
Shentong Mo |
PDF |
N/A |
The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning |
| 在不同学习环境下评估生物启发模型在网络流量预测中的能效 |
Theodoros Tsiolakis |
PDF |
N/A |
Evaluation of Bio-Inspired Models under Different Learning Settings For Energy Efficiency in Network Traffic Prediction |
| ERUPD -- 英语到罗马乌尔都语平行数据集 |
Mohammed Furqan |
PDF |
N/A |
ERUPD -- English to Roman Urdu Parallel Dataset |
| S-INF:通过场景隐式神经场实现逼真的室内场景合成 |
Zixi Liang |
PDF |
N/A |
S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field |
| GQSA:用于加速大型语言模型推理的组量化与稀疏化 |
Chao Zeng |
PDF |
N/A |
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference |
| 一种基于卷积神经网络的多基因风险预测肾结石形成的方法 |
Amr Salem |
PDF |
N/A |
A CNN Approach to Polygenic Risk Prediction of Kidney Stone Formation |
| 大型语言模型中的查询优化研究综述 |
Mingyang Song |
PDF |
N/A |
A Survey of Query Optimization in Large Language Models |
| 莎士比亚十四行诗与泰勒·斯威夫特歌词相似度评分中文档级嵌入方法的比较分析 |
Klara Kramer |
PDF |
N/A |
Comparative Analysis of Document-Level Embedding Methods for Similarity Scoring on Shakespeare Sonnets and Taylor Swift Lyrics |
| 资源感知的阿拉伯语大型语言模型创建:模型适配、集成与多领域测试 |
Prakash Aryan |
PDF |
N/A |
Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing |
| 概率密度感知半监督学习 |
Shuyang Liu |
PDF |
N/A |
Probability-density-aware Semi-supervised Learning |
| 保留分数:量化视觉语言模型的越狱风险 |
Zaitang Li |
PDF |
N/A |
Retention Score: Quantifying Jailbreak Risks for Vision Language Models |
| 利用心血管模拟进行心脏生物标志物的体内预测 |
Laura Manduchi |
PDF |
N/A |
Leveraging Cardiovascular Simulations for In-Vivo Prediction of Cardiac Biomarkers |
| 深度神经网络中的概念发现用于可解释的人脸反欺骗 |
Haoyuan Zhang |
PDF |
N/A |
Concept Discovery in Deep Neural Networks for Explainable Face Anti-Spoofing |
| WildPPG:一个包含长时间连续记录的真实世界PPG数据集 |
Manuel Meier |
PDF |
N/A |
WildPPG: A Real-World PPG Dataset of Long Continuous Recordings |
| 领域适应机器翻译:灾难性遗忘遗忘了什么以及为什么? |
Danielle Saunders |
PDF |
N/A |
Domain adapted machine translation: What does catastrophic forgetting forget and why? |
| CiteBART:学习为本地引文推荐生成引文 |
Ege Yiğit Çelik |
PDF |
N/A |
CiteBART: Learning to Generate Citations for Local Citation Recommendation |
| 《闭门之语:创建与探索波兰情色话语的forePLay注释数据集》 |
Anna Kołos |
PDF |
N/A |
Behind Closed Words: Creating and Investigating the forePLay Annotated Dataset for Polish Erotic Discourse |
| 探索电影制作中的动态新颖视角合成技术 |
Adrian Azzarelli |
PDF |
N/A |
Exploring Dynamic Novel View Synthesis Technologies for Cinematography |
| 双重地雷:基于双触发机制的隐形文本后门攻击 |
Yang Hou |
PDF |
N/A |
Double Landmines: Invisible Textual Backdoor Attacks based on Dual-Trigger |
| 通过可解释且可信赖的深度学习模型提升癌症诊断 |
Badaru I. Olumuyiwa |
PDF |
N/A |
Enhancing Cancer Diagnosis with Explainable & Trustworthy Deep Learning Models |
| STAHGNet:高效建模混合粒度异质依赖性以用于交通预测 |
Jiyao Wang |
PDF |
N/A |
STAHGNet: Modeling Hybrid-grained Heterogenous Dependency Efficiently for Traffic Prediction |
| 构建公平的潜在空间以实现公平性与可解释性的交叉 |
Hyungjun Joo |
PDF |
N/A |
Constructing Fair Latent Space for Intersection of Fairness and Explainability |
| DiffusionAttacker:用于LLM越狱的扩散驱动提示操控 |
Hao Wang |
PDF |
N/A |
DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak |
| 神经算子的最优收敛速度 |
Mike Nguyen |
PDF |
N/A |
Optimal Convergence Rates for Neural Operators |
| 用于基于FMCW毫米波雷达的现实世界人体动作检测的数据集 |
Dylan jayabahu |
PDF |
N/A |
Dataset for Real-World Human Action Detection Using FMCW mmWave Radar |
| BEE:通过基线探索-利用实现度量适应性解释 |
Oren Barkan |
PDF |
N/A |
BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation |
| 一种利用多元信息评分进行祖先图的高效搜索评分算法 |
Nikita Lagrange |
PDF |
N/A |
An efficient search-and-score algorithm for ancestral graphs using multivariate information scores |
| 基于深度学习的卫星基本气候变量不确定性 |
Junyang Gou |
PDF |
N/A |
Uncertainties of Satellite-based Essential Climate Variables from Deep Learning |
| 多即是少?基于模拟的方法探讨多模态模型中偏差间的动态交互 |
Mounia Drissi |
PDF |
N/A |
More is Less? A Simulation-Based Approach to Dynamic Interactions between Biases in Multimodal Models |
| 基于人类反馈和产品一致性的产品图像背景修复评估框架 |
Yuqi Liang |
PDF |
N/A |
An Evaluation Framework for Product Images Background Inpainting based on Human Feedback and Product Consistency |
| 改进潜在神经随机微分方程的噪声估计 |
Linus Heck |
PDF |
N/A |
Improving the Noise Estimation of Latent Neural Stochastic Differential Equations |
| DRT-o1:通过长链思维优化深度推理翻译 |
Jiaan Wang |
PDF |
N/A |
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought |
| 使用YCbCr色彩空间进行引导的真实图像去雾 |
Wenxuan Fang |
PDF |
N/A |
Guided Real Image Dehazing using YCbCr Color Space |
| 虚拟现实数据收集工具包 |
Tim Rolff |
PDF |
N/A |
A Toolkit for Virtual Reality Data Collection |
| DeepMF:闭环安全关键驾驶场景仿真的深度运动分解 |
Yizhe Li |
PDF |
N/A |
DeepMF: Deep Motion Factorization for Closed-Loop Safety-Critical Driving Scenario Simulation |
| 当前学生是否大规模使用ChatGPT?关于ChatGPT等大型语言模型在教育环境中使用情况的调查 |
Jérémie Sublime |
PDF |
N/A |
Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings |
| 面向GPU数据中心的功耗与碎片感知的在线调度 |
Francesco Lettich |
PDF |
N/A |
Power- and Fragmentation-aware Online Scheduling for GPU Datacenters |
| 银弹还是全神贯注的妥协?基于Gist Token的上下文压缩全面研究 |
Chenlong Deng |
PDF |
N/A |
A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression |
| 《多生成智能体系统综述:最新进展与新前沿》 |
Shuaihang Chen |
PDF |
N/A |
A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers |
| 信号转换在多通道信号处理中的有效性 |
Sunil Kumar Kopparapu |
PDF |
N/A |
Signal Transformation for Effective Multi-Channel Signal Processing |
| 预测压缩图像的满意用户与机器比例:一种统一的方法 |
Qi Zhang |
PDF |
N/A |
Predicting Satisfied User and Machine Ratio for Compressed Images: A Unified Approach |
| 线图Vietoris-Rips持久性图用于拓扑图表示学习 |
Jaesun Shin |
PDF |
N/A |
Line Graph Vietoris-Rips Persistence Diagram for Topological Graph Representation Learning |
| CALLIC:无损图像压缩的内容自适应学习 |
Daxin Li |
PDF |
N/A |
CALLIC: Content Adaptive Learning for Lossless Image Compression |
| 工业异常检测中的渐进边界引导异常合成 |
Qiyu Chen |
PDF |
N/A |
Progressive Boundary Guided Anomaly Synthesis for Industrial Anomaly Detection |
| 早期婴儿单语和双语语音持续学习的发展性预测编码模型 |
Xiaodan Chen |
PDF |
N/A |
Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning |
| 从总结数据中学习:基于样本准似然的Gaussian过程回归 |
Yuta Shikuri |
PDF |
N/A |
Learning from Summarized Data: Gaussian Process Regression with Sample Quasi-Likelihood |
| 基于时间卷积网络的网络入侵检测方法 |
Rukmini Nazre |
PDF |
N/A |
A Temporal Convolutional Network-based Approach for Network Intrusion Detection |
| 深入探讨多模态推理的自进化训练 |
Wei Liu |
PDF |
N/A |
Diving into Self-Evolving Training for Multimodal Reasoning |
| 在心理治疗环境中应用大语言模型与主题建模 |
Alexander Vanin |
PDF |
N/A |
Applying LLM and Topic Modelling in Psychotherapeutic Contexts |
| XAI在转变航空航天系统中的作用 |
Francisco Javier Cantero Zorita |
PDF |
N/A |
The Role of XAI in Transforming Aeronautics and Aerospace Systems |
| 基于马尔可夫过程的图卷积网络用于知识图谱中的实体分类 |
Johannes Mäkelburg |
PDF |
N/A |
Markov Process-Based Graph Convolutional Networks for Entity Classification in Knowledge Graphs |
| 神经连续时间上鞅证书 |
Grigory Neustroev |
PDF |
N/A |
Neural Continuous-Time Supermartingale Certificates |
| 衡量面向儿童的文本中的上下文信息量 |
Maria Valentini |
PDF |
N/A |
Measuring Contextual Informativeness in Child-Directed Text |
| 多模态偏好数据与奖励模型的合成对齐 |
Robert Wijaya |
PDF |
N/A |
Multimodal Preference Data Synthetic Alignment with Reward Model |
| VidCtx:利用图像模型实现上下文感知的视频问答 |
Andreas Goulas |
PDF |
N/A |
VidCtx: Context-aware Video Question Answering with Image Models |
| 使用随机噪声进行预训练以实现不确定性校准 |
Jeonghwan Cheon |
PDF |
N/A |
Pretraining with random noise for uncertainty calibration |
| 正是你所期望的:通过自我反思实现约束时间线摘要,以增强相关性 |
Muhammad Reza Qorib |
PDF |
N/A |
Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance |
| 证据理论不确定性对训练目标检测模型的影响 |
M. Tahasanul Ibrahim |
PDF |
N/A |
Impact of Evidence Theory Uncertainty on Training Object Detection Models |
| BrainMAP:在大脑网络中学习多重激活路径 |
Song Wang |
PDF |
N/A |
BrainMAP: Learning Multiple Activation Pathways in Brain Networks |
| 学习红外小目标检测的动态局部上下文表示 |
Guoyi Zhang |
PDF |
N/A |
Learning Dynamic Local Context Representations for Infrared Small Target Detection |
| 通过迭代偏好学习增强蒙特卡洛树搜索推理中的内在自我修正能力 |
Huchen Jiang |
PDF |
N/A |
Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning |
| WarriorCoder:从专家对决中学习以增强代码大型语言模型 |
Huawen Feng |
PDF |
N/A |
WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models |
| PointVoxelFormer -- 复兴点云网络用于三维医学影像 |
Mattias Paul Heinrich |
PDF |
N/A |
PointVoxelFormer -- Reviving point cloud networks for 3D medical imaging |
| 奇异值缩放:通过剪枝权重精炼实现高效生成模型压缩 |
Hyeonjin Kim |
PDF |
N/A |
Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement |
| 交织记忆:暹罗大型语言模型 |
Xin Song |
PDF |
N/A |
Interweaving Memories of a Siamese Large Language Model |
| 平衡的3DGS:基于高斯并行性的精细分块渲染 |
Hao Gui |
PDF |
N/A |
Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling |
| 一种即插即用的野外高难度动作物理恢复方法 |
Youliang Zhang |
PDF |
N/A |
A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions |
| 人工智能能有多环保?一项关于机器学习环境影响趋势的研究 |
Clément Morand |
PDF |
N/A |
How Green Can AI Be? A Study of Trends in Machine Learning Environmental Impacts |
| FRTP:联合路由搜索记录以增强长期交通预测 |
Hangli Ge |
PDF |
N/A |
FRTP: Federating Route Search Records to Enhance Long-term Traffic Prediction |
| FlowMamba:通过全局运动传播学习点云场景流 |
Min Lin |
PDF |
N/A |
FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation |
| 通过迭代和选择性地从数据中学习来提升大语言模型 |
Qi Jia |
PDF |
N/A |
Boosting LLM via Learning from Data Iteratively and Selectively |
| 用于信息检索的文本嵌入模型高效微调方法:对比学习惩罚(CLP) |
Jeongsu Yu |
PDF |
N/A |
Efficient fine-tuning methodology of text embedding models for information retrieval: contrastive learning penalty (clp) |
| 一种基于情感的文本分类中日语分词器的实验评估 |
Andre Rusli |
PDF |
N/A |
An Experimental Evaluation of Japanese Tokenizers for Sentiment-Based Text Classification |
| 分层获取受限贝叶斯优化:应用于模拟电路 |
Ria Rashid |
PDF |
N/A |
Tiered Acquisition for Constrained Bayesian Optimization: An Application to Analog Circuits |
| 通过信息瓶颈实现的双向多尺度图数据集压缩 |
Xingcheng Fu |
PDF |
N/A |
Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck |
| DiffFormer:一种用于高光谱图像分类的微分空间-光谱变换器 |
Muhammad Ahmad |
PDF |
N/A |
DiffFormer: a Differential Spatial-Spectral Transformer for Hyperspectral Image Classification |
| 蛋白质组学信息学中的深度学习:应用、挑战与未来方向 |
Yindan Luo |
PDF |
N/A |
Deep Learning in Proteomics Informatics: Applications, Challenges, and Future Directions |
| 折纸:一种用于从半结构化数据进行预测的生成式变压器架构 |
Thomas Rückstieß |
PDF |
N/A |
ORIGAMI: A generative transformer architecture for predictions from semi-structured data |
| 基于LSTM的三分类文本情感分析 |
Yin Qixuan |
PDF |
N/A |
Three-Class Text Sentiment Analysis Based on LSTM |
| FFA Sora,将视频生成作为眼底荧光素血管造影模拟器 |
Xinyuan Wu |
PDF |
N/A |
FFA Sora, video generation as fundus fluorescein angiography simulator |
| 关于描述逻辑概念的示例的效力与局限性 |
Balder ten Cate |
PDF |
N/A |
On the Power and Limitations of Examples for Description Logic Concepts |
| 专注于调整策略以达到目标的强化学习 |
Akane Tsuboya |
PDF |
N/A |
Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets |
| MineAgent:利用多模态大型语言模型进行遥感矿产勘探 |
Beibei Yu |
PDF |
N/A |
MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models |
| 通过主题对比学习提升神经主题模型的主题可解释性 |
Xin Gao |
PDF |
N/A |
Enhancing Topic Interpretability for Neural Topic Modeling through Topic-wise Contrastive Learning |
| 神经-MCRL:基于脑电图的视觉解码的多模态对比表示学习 |
Yueyang Li |
PDF |
N/A |
Neural-MCRL: Neural Multimodal Contrastive Representation Learning for EEG-based Visual Decoding |
| APEX$^2$:个性化知识图谱的自适应和极值摘要 |
Zihao Li |
PDF |
N/A |
APEX$^2$: Adaptive and Extreme Summarization for Personalized Knowledge Graphs |
| 完整实现WXF中国象棋规则 |
Daniel Tan |
PDF |
N/A |
Complete Implementation of WXF Chinese Chess Rules |
| 基于扩散模型的宽带地面运动合成,条件极简 |
Jaeheun Jung |
PDF |
N/A |
Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition |
| 使用大型语言模型的双视角隐喻检测框架 |
Yujie Lin |
PDF |
N/A |
A Dual-Perspective Metaphor Detection Framework Using Large Language Models |
| 用于半监督语义分割的不确定性-参与上下文一致性学习 |
Jianjian Yin |
PDF |
N/A |
Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation |
| EcoSearch:一种用于程序合成的恒定延迟最佳优先搜索算法 |
Théo Matricon |
PDF |
N/A |
EcoSearch: A Constant-Delay Best-First Search Algorithm for Program Synthesis |
| 基于特征的方法在目标检测中的领域自适应:综述论文 |
Helia Mohamadi |
PDF |
N/A |
Feature Based Methods Domain Adaptation for Object Detection: A Review Paper |
| xPatch:基于指数季节性趋势分解的双流时间序列预测 |
Artyom Stitsyuk |
PDF |
N/A |
xPatch: Dual-Stream Time Series Forecasting with Exponential Seasonal-Trend Decomposition |
| 通过基于压缩的编辑距离评估人类对LLM生成文本的编辑工作量 |
Nicolas Devatine |
PDF |
N/A |
Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance |
| 更好的知识增强用于保护隐私的跨项目缺陷预测 |
Yuying Wang |
PDF |
N/A |
Better Knowledge Enhancement for Privacy-Preserving Cross-Project Defect Prediction |
| 快速计算RoPE注意力的时间复杂度接近线性 |
Yifang Chen |
PDF |
N/A |
Fast Gradient Computation for RoPE Attention in Almost Linear Time |
| CodeV:通过视觉数据解决问题 |
Linhao Zhang |
PDF |
N/A |
CodeV: Issue Resolving with Visual Data |
| 通过深度学习和ResNeXt进行金融数据挖掘的协作优化 |
Pengbin Feng |
PDF |
N/A |
Collaborative Optimization in Financial Data Mining Through Deep Learning and ResNeXt |
| 通过Stein变分超网络改进昂贵的多目标优化的Pareto集学习 |
Minh-Duc Nguyen |
PDF |
N/A |
Improving Pareto Set Learning for Expensive Multi-objective Optimization via Stein Variational Hypernetworks |
| 基于内容和上下文嵌入的流行度估计和新捆绑包生成 |
Ashutosh Nayak |
PDF |
N/A |
Popularity Estimation and New Bundle Generation using Content and Context based Embeddings |
| 多重一致性引导的无监督音频测试时适应对比音频-语言模型 |
Gongyu Chen |
PDF |
N/A |
Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio |
| FedLEC:在标签偏斜情况下,利用脉冲神经网络实现有效联邦学习的算法 |
Di Yu |
PDF |
N/A |
FedLEC: Effective Federated Learning Algorithm with Spiking Neural Networks Under Label Skews |
| 视觉-语言模型在时间序列分类中的可行性研究 |
Vinay Prithyani |
PDF |
N/A |
On the Feasibility of Vision-Language Models for Time-Series Classification |
| 用于红外小目标检测的神经时空张量表示 |
Fengyi Wu |
PDF |
N/A |
Neural Spatial-Temporal Tensor Representation for Infrared Small Target Detection |
| 计算环境中的资源优化动态调度策略 |
Xiaoye Wang |
PDF |
N/A |
Dynamic Scheduling Strategies for Resource Optimization in Computing Environments |
| 从架构角度重新审视用于3D异常检测的多模态融合 |
Kaifang Long |
PDF |
N/A |
Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective |
| Friends-MMC:一个用于多模态多方对话理解的数据集 |
Yueqian Wang |
PDF |
N/A |
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding |
| AV-EmoDialog:利用情感线索与视听用户进行对话 |
Se Jin Park |
PDF |
N/A |
AV-EmoDialog: Chat with Audio-Visual Users Leveraging Emotional Cues |
| 自由视角人体动画与姿态相关参考选择 |
Fa-Ting Hong |
PDF |
N/A |
Free-viewpoint Human Animation with Pose-correlated Reference Selection |