| 闪存缓存:基于辐射缓存逆向渲染中的偏差减少 |
Benjamin Attal |
PDF |
N/A |
Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering |
| 一个从个人决策角度评估PM2.5预测的框架 |
Renato Berlinghieri |
PDF |
N/A |
A Framework for Evaluating PM2.5 Forecasts from the Perspective of Individual Decision Making |
| 机器人实用模型:在新环境中零样本部署的通用策略 |
Haritheja Etukuru |
PDF |
N/A |
Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments |
| 神经MP:一种通用型神经运动规划器 |
Murtaza Dalal |
PDF |
N/A |
Neural MP: A Generalist Neural Motion Planner |
| 可提示的闭环交通模拟 |
Shuhan Tan |
PDF |
N/A |
Promptable Closed-loop Traffic Simulation |
| 评估人类和图像模型中的多视角物体一致性 |
Tyler Bonnen |
PDF |
N/A |
Evaluating Multiview Object Consistency in Humans and Image Models |
| LSVOS挑战报告:大规模复杂长视频对象分割 |
Henghui Ding |
PDF |
N/A |
LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation |
| 量子强化学习(QRL)简介 |
Samuel Yen-Chi Chen |
PDF |
N/A |
An Introduction to Quantum Reinforcement Learning (QRL) |
| MMEvol:通过Evol-Instruct赋能多模态大型语言模型 |
Run Luo |
PDF |
N/A |
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct |
| 视觉驱动的二维监督微调框架用于鸟瞰感知 |
Lei He |
PDF |
N/A |
Vision-Driven 2D Supervised Fine-Tuning Framework for Bird's Eye View Perception |
| 在真相发现定量双极论证框架中应用归因解释 |
Xiang Yin |
PDF |
N/A |
Applying Attribution Explanations in Truth-Discovery Quantitative Bipolar Argumentation Frameworks |
| 非平衡生物物理过程的计算表达能力限制 |
Carlos Floyd |
PDF |
N/A |
Limits on the computational expressivity of non-equilibrium biophysical processes |
| GASP:基于物理模拟的高斯样条方法 |
Piotr Borycki |
PDF |
N/A |
GASP: Gaussian Splatting for Physic-Based Simulations |
| VFA:基础模型与人类的视觉频率分析 |
Mohammad-Javad Darvishi-Bayazi |
PDF |
N/A |
VFA: Vision Frequency Analysis of Foundation Models and Human |
| 利用困惑度相关性改进预训练数据 |
Tristan Thrush |
PDF |
N/A |
Improving Pretraining Data Using Perplexity Correlations |
| 通过自动透镜库生成和领域自适应实现通用计算像差校正的灵活框架 |
Qi Jiang |
PDF |
N/A |
A Flexible Framework for Universal Computational Aberration Correction via Automatic Lens Library Generation and Domain Adaptation |
| 软件测试的未来:AI驱动的测试用例生成与验证 |
Mohammad Baqar |
PDF |
N/A |
The Future of Software Testing: AI-Powered Test Case Generation and Validation |
| 在大语言模型中对中国知识进行基准修正 |
Tianhe Lu |
PDF |
N/A |
Benchmarking Chinese Knowledge Rectification in Large Language Models |
| Celcomen:用于单细胞和组织扰动建模的空间因果解耦 |
Stathis Megas |
PDF |
N/A |
Celcomen: spatial causal disentanglement for single-cell and tissue perturbation modeling |
| 输入空间模式连通性在深度神经网络中的应用 |
Jakub Vrabel |
PDF |
N/A |
Input Space Mode Connectivity in Deep Neural Networks |
| PDAF:一种用于说话人验证的语音去偏注意力框架 |
Massa Baali |
PDF |
N/A |
PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification |
| 通过人类反应时间提升基于偏好的线性多臂赌博机 |
Shen Li |
PDF |
N/A |
Enhancing Preference-based Linear Bandits via Human Response Time |
| 使用条件变分自编码器和深度神经网络进行不确定性量化和领域泛化的临界热流预测 |
Farah Alsafadi |
PDF |
N/A |
Predicting Critical Heat Flux with Uncertainty Quantification and Domain Generalization Using Conditional Variational Autoencoders and Deep Neural Networks |
| 利用对象先验进行点跟踪 |
Bikram Boote |
PDF |
N/A |
Leveraging Object Priors for Point Tracking |
| NeurLZ:基于神经学习与误差控制的科学数据有损压缩性能系统性提升研究 |
Wenqi Jia |
PDF |
N/A |
NeurLZ: On Systematically Enhancing Lossy Compression Performance for Scientific Data based on Neural Learning with Error Control |
| 统一神经网络缩放定律与尺度-时间等效性 |
Akhilan Boopathy |
PDF |
N/A |
Unified Neural Network Scaling Laws and Scale-time Equivalence |
| 通过模块化打破神经网络的缩放法则 |
Akhilan Boopathy |
PDF |
N/A |
Breaking Neural Network Scaling Laws with Modularity |
| 使用机器学习技术预测特定行业ETF方向变化的先进LSTM神经网络 |
Rifa Gowani |
PDF |
N/A |
Advanced LSTM Neural Networks for Predicting Directional Changes in Sector-Specific ETFs Using Machine Learning Techniques |
| 从机器到音乐家的创造力与视觉传达:通过机器人相机分享乐谱 |
Ross Greer |
PDF |
N/A |
Creativity and Visual Communication from Machine to Musician: Sharing a Score through a Robotic Camera |
| 来自fMRI的证据支持语言模型中存在两阶段抽象过程 |
Emily Cheng |
PDF |
N/A |
Evidence from fMRI Supports a Two-Phase Abstraction Process in Language Models |
| 基于共识的分布式量子核学习用于语音识别 |
Kuan-Cheng Chen |
PDF |
N/A |
Consensus-based Distributed Quantum Kernel Learning for Speech Recognition |
| 异质性特定的图神经网络和同质性度量真的有效吗?评估陷阱与新基准 |
Sitao Luan |
PDF |
N/A |
Are Heterophily-Specific GNNs and Homophily Metrics Really Effective? Evaluation Pitfalls and New Benchmarks |
| ReL-SAR:基于卷积Transformer和BYOL的骨架动作识别表示学习 |
Safwen Naimi |
PDF |
N/A |
ReL-SAR: Representation Learning for Skeleton Action Recognition with Convolutional Transformers and BYOL |
| 一种利用结构化对话人工智能(CAI)系统的新颖创意生成工具 |
B. Sankar |
PDF |
N/A |
A Novel Idea Generation Tool using a Structured Conversational AI (CAI) System |
| 大型语言模型(LLMs)总会产生幻觉,我们需要学会与之共存。 |
Sourav Banerjee |
PDF |
N/A |
LLMs Will Always Hallucinate, and We Need to Live With This |
| 在有限地面实况条件下进行物体抓取的鲁棒损失函数 |
Yangfan Deng |
PDF |
N/A |
Robust Loss Functions for Object Grasping under Limited Ground Truth |
| 基于大语言模型的异构数据问答系统与基准测试 |
Achille Fokoue |
PDF |
N/A |
A System and Benchmark for LLM-based Q\&A on Heterogeneous Data |
| 通过两阶段指令微调方法,推动多语言大型语言模型在医学领域的民主化 |
Meng Zhou |
PDF |
N/A |
Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-Stage Instruction Fine-tuning Approach |
| 我的车说了什么?自动驾驶车辆解释错误、情境及个人特质对舒适度、依赖性、满意度及驾驶信心的影响 |
Robert Kaufman |
PDF |
N/A |
What Did My Car Say? Autonomous Vehicle Explanation Errors, Context, and Personal Traits Impact Comfort, Reliance, Satisfaction, and Driving Confidence |
| 视觉基础对话中基于话语理解引导的指代表达生成 |
Bram Willemsen |
PDF |
N/A |
Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding |
| 基于深度学习的降阶模型实现高维参数化系统的实时最优控制 |
Matteo Tomasetto |
PDF |
N/A |
Real-time optimal control of high-dimensional parametrized systems by deep learning-based reduced order models |
| pFedGPA:基于扩散的个性化联邦学习生成参数聚合方法 |
Jiahao Lai |
PDF |
N/A |
pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning |
| 利用可学习的松弛标签提升基于CNN的手写识别系统 |
Sara Ferro |
PDF |
N/A |
Boosting CNN-based Handwriting Recognition Systems with Learnable Relaxation Labeling |
| MANA-Net:通过新闻加权缓解聚合情感同质化,提升市场预测能力 |
Mengyu Wang |
PDF |
N/A |
MANA-Net: Mitigating Aggregated Sentiment Homogenization with News Weighting for Enhanced Market Prediction |
| 通过因子分解进行分割:利用基础模型特征分解实现病理学的无监督语义分割 |
Jacob Gildenblat |
PDF |
N/A |
Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features |
| 从OpenStreetMap数据中提取美国建筑类型 |
Henrique F. de Arruda |
PDF |
N/A |
Extracting the U.S. building types from OpenStreetMap data |
| LayeredFlow:用于非朗伯多层光流的现实世界基准 |
Hongyu Wen |
PDF |
N/A |
LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow |
| SX-Stitch:一种基于VMS-UNet的高效框架,用于术中脊柱X光图像拼接 |
Yi Li |
PDF |
N/A |
SX-Stitch: An Efficient VMS-UNet Based Framework for Intraoperative Scoliosis X-Ray Image Stitching |
| 切伦科夫成像生物形态特征验证可变形组织移位下的乳腺癌放疗患者定位 |
Yao Chen |
PDF |
N/A |
Cherenkov Imaged Bio-morphological Features Verify Patient Positioning with Deformable Tissue Translocation in Breast Radiotherapy |
| AnomalyCD:一种用于高分辨率和时间序列观测地球异常变化检测的基准 |
Jingtao Li |
PDF |
N/A |
AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations |
| RegNLP实战:通过自动化信息检索和答案生成促进合规性 |
Tuba Gokhan |
PDF |
N/A |
RegNLP in Action: Facilitating Compliance Through Automated Information Retrieval and Answer Generation |
| 使用端到端ASR模型对实时转录进行评估 |
Carlos Arriaga |
PDF |
N/A |
Evaluation of real-time transcriptions using end-to-end ASR models |
| 通过先验数据拟合网络实现零样本异常检测:模型选择已成为过去! |
Yuchen Shen |
PDF |
N/A |
Zero-shot Outlier Detection via Prior-data Fitted Networks: Model Selection Bygone! |
| 遗忘还是隐藏?扩散模型中遗忘机制的批判性分析与评估指标 |
Aakash Sen Sharma |
PDF |
N/A |
Unlearning or Concealment? A Critical Analysis and Evaluation Metrics for Unlearning in Diffusion Models |
| 通过深度学习实现放射治疗中人体切伦科夫成像的生物形态特征的鲁棒实时分割 |
Shiru Wang |
PDF |
N/A |
Robust Real-time Segmentation of Bio-Morphological Features in Human Cherenkov Imaging during Radiotherapy via Deep Learning |
| K折因果BART用于CATE估计 |
Hugo Gobato Souto |
PDF |
N/A |
K-Fold Causal BART for CATE Estimation |
| 嵌入式平台上的实时人体动作识别 |
Ruiqi Wang |
PDF |
N/A |
Real-Time Human Action Recognition on Embedded Platforms |
| 数据归属的对抗性攻击 |
Xinhe Wang |
PDF |
N/A |
Adversarial Attacks on Data Attribution |
| 交互式增量学习具有可推广技能的局部轨迹调制 |
Markus Knauer |
PDF |
N/A |
Interactive incremental learning of generalizable skills with local trajectory modulation |
| 重新审视英语Winogender模式以确保一致性、覆盖范围和语法格 |
Vagrant Gautam |
PDF |
N/A |
Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical Case |
| 基于标签传播的持续目标检测中的重放整合 |
Riccardo De Monte |
PDF |
N/A |
Replay Consolidation with Label Propagation for Continual Object Detection |
| 原型驱动的可见光-红外行人重识别多特征生成 |
Jiarui Li |
PDF |
N/A |
Prototype-Driven Multi-Feature Generation for Visible-Infrared Person Re-identification |
| 三维合成孔径雷达断层成像与机器学习在高分辨率树高估算中的应用 |
Grace Colverd |
PDF |
N/A |
3D-SAR Tomography and Machine Learning for High-Resolution Tree Height Estimation |
| 朴素贝叶斯分类的最佳投影 |
David P. Hofmeyr |
PDF |
N/A |
Optimal Projections for Classification with Naive Bayes |
| 卫星图像中尺度偏好目标检测的重整化连接 |
Fan Zhang |
PDF |
N/A |
Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery |
| 前向KL正则化偏好优化用于对齐扩散策略 |
Zhao Shan |
PDF |
N/A |
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies |
| 联合输入与输出协调的类增量学习 |
Shuai Wang |
PDF |
N/A |
Joint Input and Output Coordination for Class-Incremental Learning |
| G-NeLF:用于新视角合成的内存和数据高效混合神经光场 |
Lutao Jiang |
PDF |
N/A |
G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis |
| Adapted-MoE:结合测试时适应的专家混合模型用于异常检测 |
Tianwu Lei |
PDF |
N/A |
Adapted-MoE: Mixture of Experts with Test-Time Adaption for Anomaly Detection |
| 自定义对比度:一种多层次对比视角下的主体驱动文本到图像定制 |
Nan Chen |
PDF |
N/A |
CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization |
| 标准化硬件无关评估的能耗 |
Constance Douwes |
PDF |
N/A |
Normalizing Energy Consumption for Hardware-Independent Evaluation |
| 长未必强:间断长序列训练提升语音识别与翻译效果 |
Nithin Rao Koluguri |
PDF |
N/A |
Longer is (Not Necessarily) Stronger: Punctuated Long-Sequence Training for Enhanced Speech Recognition and Translation |
| 当重采样/重加权改善不平衡分类中的特征学习?:一个玩具模型研究 |
Tomoyuki Obuchi |
PDF |
N/A |
When resampling/reweighting improves feature learning in imbalanced classification?: A toy-model study |
| SynMorph:生成带有匹配样本的合成人脸变形数据集 |
Haoyu Zhang |
PDF |
N/A |
SynMorph: Generating Synthetic Face Morphing Dataset with Mated Samples |
| ExDDI:用自然语言解释药物-药物相互作用预测 |
Zhaoyue Sun |
PDF |
N/A |
ExDDI: Explaining Drug-Drug Interaction Predictions with Natural Language |
| MemoRAG:通过记忆启发的知识发现迈向新一代RAG |
Hongjin Qian |
PDF |
N/A |
MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery |
| DSDFormer:一种创新的Transformer-Mamba框架,用于鲁棒的高精度驾驶员分心识别 |
Junzhou Chen |
PDF |
N/A |
DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification |
| 可解释的责任分担作为任务和运动规划的启发式方法 |
Arda Sarp Yenicesu |
PDF |
N/A |
Interpretable Responsibility Sharing as a Heuristic for Task and Motion Planning |
| 潜在的三维脑部MRI反事实 |
Wei Peng |
PDF |
N/A |
Latent 3D Brain MRI Counterfactual |
| 空间感知型讲解员用于视觉与语言导航指令生成 |
Muraleekrishna Gopinathan |
PDF |
N/A |
Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation |
| 递归神经网络的逼近界限及其在回归中的应用 |
Yuling Jiao |
PDF |
N/A |
Approximation Bounds for Recurrent Neural Networks with Application to Regression |
| 通过图结构自对比学习在多层感知机中建模图结构信息 |
Lirong Wu |
PDF |
N/A |
Learning to Model Graph Structural Information on MLPs via Graph Structure Self-Contrasting |
| 关于Sigmoid和tanh模糊广义灰色认知图的收敛性 |
Xudong Gao |
PDF |
N/A |
On the Convergence of Sigmoid and tanh Fuzzy General Grey Cognitive Maps |
| LEROjD:仅雷达扩展的激光雷达目标检测 |
Patrick Palmer |
PDF |
N/A |
LEROjD: Lidar Extended Radar-Only Object Detection |
| CauseJudger:利用大型语言模型进行溯因逻辑推理以识别原因 |
Jinwei He |
PDF |
N/A |
CauseJudger: Identifying the Cause with LLMs for Abductive Logical Reasoning |
| 透过面具看本质:重新思考对抗样本在验证码中的应用 |
Yahya Jabary |
PDF |
N/A |
Seeing Through the Mask: Rethinking Adversarial Examples for CAPTCHAs |
| SciAgents:通过多智能体智能图推理实现科学发现的自动化 |
Alireza Ghafarollahi |
PDF |
N/A |
SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning |
| 眼见为实?利用视觉扰动增强视觉-语言导航 |
Xuesong Zhang |
PDF |
N/A |
Seeing is Believing? Enhancing Vision-Language Navigation using Visual Perturbations |
| 探索野外图像质量评估中的丰富主观质量信息 |
Xiongkuo Min |
PDF |
N/A |
Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild |
| CoBo:通过双层优化实现协作学习 |
Diba Hashemi |
PDF |
N/A |
CoBo: Collaborative Learning via Bilevel Optimization |
| 在降低温度时,蓝细菌体内的生物钟通过霍普夫分岔机制,不仅跟随而且超越了体外蛋白质钟的节奏。 |
I. Mihalcescu |
PDF |
N/A |
When lowering temperature, the in vivo circadian clock in cyanobacteria follows and surpasses the in vitro protein clock trough the Hopf bifurcation |
| HMAFlow:通过分层运动场对齐学习更准确的光流 |
Dianbo Ma |
PDF |
N/A |
HMAFlow: Learning More Accurate Optical Flow via Hierarchical Motion Field Alignment |
| QiBERT -- 使用BERT作为特征对在线对话消息进行分类 |
Bruno D. Ferreira-Saraiva |
PDF |
N/A |
QiBERT -- Classifying Online Conversations Messages with BERT as a Feature |
| 使用二次无约束二值优化对论证问题进行编码 |
Marco Baioletti |
PDF |
N/A |
An encoding of argumentation problems using quadratic unconstrained binary optimization |
| 大型语言模型中的谐波推理 |
Anna Kruspe |
PDF |
N/A |
Harmonic Reasoning in Large Language Models |
| 插值、外推、超插值:向新维度泛化 |
Toby Ord |
PDF |
N/A |
Interpolation, Extrapolation, Hyperpolation: Generalising into new dimensions |
| 一种用于复杂空间域上时空预测学习的通用降阶神经算子 |
Qinglu Meng |
PDF |
N/A |
A general reduced-order neural operator for spatio-temporal predictive learning on complex spatial domains |
| 优化VarLiNGAM以实现可扩展和高效的时间序列因果发现 |
Ziyang Jiao |
PDF |
N/A |
Optimizing VarLiNGAM for Scalable and Efficient Time Series Causal Discovery |
| 使用机器学习进行灯塔灯光传感器故障检测 |
Michael Kampouridis |
PDF |
N/A |
Using machine learning for fault detection in lighthouse light sensors |
| 高分辨率卫星影像的大气校正与土地利用/土地覆盖分类集成模型 |
Soham Mukherjee |
PDF |
N/A |
An Atmospheric Correction Integrated LULC Segmentation Model for High-Resolution Satellite Imagery |
| 神经压缩中的图像取证:误压缩分类法 |
Nora Hofer |
PDF |
N/A |
A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression |
| 爱思唯尔竞技场:化学/生物/健康基础大语言模型的人类评估 |
Camilo Thorne |
PDF |
N/A |
Elsevier Arena: Human Evaluation of Chemistry/Biology/Health Foundational Large Language Models |
| CRADLE-VAE:通过基于反事实推理的伪影解耦增强单细胞基因扰动建模 |
Seungheun Baek |
PDF |
N/A |
CRADLE-VAE: Enhancing Single-Cell Gene Perturbation Modeling with Counterfactual Reasoning-based Artifact Disentanglement |
| 推进用于恒星活动和系外行星周期旋转的机器学习 |
Fatemeh Fazel Hesar |
PDF |
N/A |
Advancing Machine Learning for Stellar Activity and Exoplanet Period Rotation |
| 将时间图神经网络与Transformer进行改造 |
Qiang Huang |
PDF |
N/A |
Retrofitting Temporal Graph Neural Networks with Transformer |
| 变分量子电路设计的强化学习 |
Simone Foderà |
PDF |
N/A |
Reinforcement Learning for Variational Quantum Circuits Design |
| PVP-Recon:通过扭曲一致性实现稀疏视图表面重建的渐进视图规划 |
Sheng Ye |
PDF |
N/A |
PVP-Recon: Progressive View Planning via Warping Consistency for Sparse-View Surface Reconstruction |
| 原型OOD:通过原型特征相似性增强OOD目标检测 |
Junkun Chen |
PDF |
N/A |
Proto-OOD: Enhancing OOD Object Detection with Prototype Feature Similarity |
| DriveScape:面向高分辨率可控多视角驾驶视频生成 |
Wei Wu |
PDF |
N/A |
DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation |
| 超越二维平面:治疗效果估计匹配方法的几何视角 |
Melanie F. Pradier |
PDF |
N/A |
Beyond Flatland: A Geometric Take on Matching Methods for Treatment Effect Estimation |
| 选择差异剪接方法:实际考量 |
Ben J Draper |
PDF |
N/A |
Selecting Differential Splicing Methods: Practical Considerations |
| 将论证框架的扩展可视化为分层图 |
Martin Nöllenburg |
PDF |
N/A |
Visualizing Extensions of Argumentation Frameworks as Layered Graphs |
| 大型语言模型中的绑定表示分析 |
Qin Dai |
PDF |
N/A |
Representational Analysis of Binding in Large Language Models |
| EndoOmni:通过从噪声标签中鲁棒自学习实现内窥镜中的零样本跨数据集深度估计 |
Qingyao Tian |
PDF |
N/A |
EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels |
| 强化学习的半事实解释 |
Jasmina Gajcin |
PDF |
N/A |
Semifactual Explanations for Reinforcement Learning |
| 在深度强化学习中的状态-新颖性引导动作持续性 |
Jianshu Hu |
PDF |
N/A |
State-Novelty Guided Action Persistence in Deep Reinforcement Learning |
| TextToucher:细粒度文本到触觉生成 |
Jiahang Tu |
PDF |
N/A |
TextToucher: Fine-Grained Text-to-Touch Generation |
| 用于主动三维物体检测的分布差异和特征异质性 |
Huang-Yu Chen |
PDF |
N/A |
Distribution Discrepancy and Feature Heterogeneity for Active 3D Object Detection |
| STLM工程报告:丢失 |
Dylan Hillier |
PDF |
N/A |
STLM Engineering Report: Dropout |
| AD-Net:基于注意力的扩张卷积残差网络与引导解码器,用于鲁棒的皮肤病变分割 |
Asim Naveed |
PDF |
N/A |
AD-Net: Attention-based dilated convolutional residual network with guided decoder for robust skin lesion segmentation |
| CipherDM:扩散模型采样的安全三方推理 |
Xin Zhao |
PDF |
N/A |
CipherDM: Secure Three-Party Inference for Diffusion Model Sampling |
| 从文字到姿态:利用视觉语言模型提升新物体姿态估计 |
Tessa Pulli |
PDF |
N/A |
From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models |
| KRONC:基于关键点的稳健相机优化用于3D汽车重建 |
Davide Di Nucci |
PDF |
N/A |
KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction |
| 多模态复合编辑与检索调查 |
Suyan Li |
PDF |
N/A |
A Survey of Multimodal Composite Editing and Retrieval |
| HyperSMOTE:一种基于超图的不平衡节点分类过采样方法 |
Ziming Zhao |
PDF |
N/A |
HyperSMOTE: A Hypergraph-based Oversampling Approach for Imbalanced Node Classifications |
| NLLB-E5:一种可扩展的多语言检索模型 |
Arkadeep Acharya |
PDF |
N/A |
NLLB-E5: A Scalable Multilingual Retrieval Model |
| 顺序后验采样与扩散模型 |
Tristan S. W. Stevens |
PDF |
N/A |
Sequential Posterior Sampling with Diffusion Models |
| FacialFlowNet:通过多样化数据集和分解模型推进面部光流估计 |
Jianzhi Lu |
PDF |
N/A |
FacialFlowNet: Advancing Facial Optical Flow Estimation with a Diverse Dataset and a Decomposed Model |
| 颠覆视觉与语言模型:对比Transformer与结构化状态空间模型在视觉与语言建模中的应用 |
Georgios Pantazopoulos |
PDF |
N/A |
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling |
| TAVP:跨域少样本分割的任务自适应视觉提示 |
Jiaqi Yang |
PDF |
N/A |
TAVP: Task-Adaptive Visual Prompt for Cross-domain Few-shot Segmentation |
| 一种新的周期模式表示方法及其在无训练异常检测中的应用 |
Peng Ye |
PDF |
N/A |
A Novel Representation of Periodic Pattern and Its Application to Untrained Anomaly Detection |
| 解耦接触以实现细粒度运动风格迁移 |
Xiangjun Tang |
PDF |
N/A |
Decoupling Contact for Fine-Grained Motion Style Transfer |
| 朝着构建一个强大的知识密集型问答模型迈进:利用大型语言模型 |
Hong Xingyun Hong |
PDF |
N/A |
Towards Building a Robust Knowledge Intensive Question Answering Model with Large Language Models |
| 外观与更多:蒸馏混合顺序关系知识用于跨分辨率图像识别 |
Shiming Ge |
PDF |
N/A |
Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition |
| 深度学习在视频异常检测中的应用:综述 |
Peng Wu |
PDF |
N/A |
Deep Learning for Video Anomaly Detection: A Review |
| 通过元提示学习和梯度正则化提升CLIP在图像质量评估中的适应性 |
Xudong Li |
PDF |
N/A |
Boosting CLIP Adaptation for Image Quality Assessment via Meta-Prompt Learning and Gradient Regularization |
| Prim2Room:从基本图形生成可控布局的房间网格 |
Chengzeng Feng |
PDF |
N/A |
Prim2Room: Layout-Controllable Room Mesh Generation from Primitives |
| PersonaTalk:在视觉配音中引人注目 |
Longhao Zhang |
PDF |
N/A |
PersonaTalk: Bring Attention to Your Persona in Visual Dubbing |
| 通过学生-教师网络和符号距离学习的无记忆多模态异常检测 |
Zhongbin Sun |
PDF |
N/A |
Memoryless Multimodal Anomaly Detection via Student-Teacher Network and Signed Distance Learning |
| KARGEN:利用大型语言模型实现知识增强的自动化放射报告生成 |
Yingshu Li |
PDF |
N/A |
KARGEN: Knowledge-enhanced Automated Radiology Report Generation Using Large Language Models |
| 深度学习模型的应用特定压缩 |
Rohit Raj Rai |
PDF |
N/A |
Application Specific Compression of Deep Learning Models |
| 自然语言诊断推理:计算模型及应用 |
Nils Dycke |
PDF |
N/A |
Diagnostic Reasoning in Natural Language: Computational Model and Application |
| FedBrain-Distill:基于非IID数据的联邦脑肿瘤分类中使用集成知识蒸馏实现高效通信 |
Rasoul Jafari Gohari |
PDF |
N/A |
FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data |
| BAMDP塑造:一个统一的内生动机和奖励塑造理论框架 |
Aly Lidayan |
PDF |
N/A |
BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping |
| 基于注意力机制的机器学习方法用于数据降维,并保证误差界限 |
Xiao Li |
PDF |
N/A |
Attention Based Machine Learning Methods for Data Reduction with Guaranteed Error Bounds |
| IndicVoices-R:解锁大规模多语言多说话人语音语料库,助力印度TTS扩展 |
Ashwin Sankar |
PDF |
N/A |
IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS |
| 递归嵌套过滤:高效摊销贝叶斯实验设计 |
Sahel Iqbal |
PDF |
N/A |
Recursive Nested Filtering for Efficient Amortized Bayesian Experimental Design |
| 使用先验地图驾驶:为自动驾驶车辆映射提供统一向量先验编码 |
Shuang Zeng |
PDF |
N/A |
Driving with Prior Maps: Unified Vector Prior Encoding for Autonomous Vehicle Mapping |
| 过度参数化的变分自编码器的收敛性分析:一种神经切线核视角 |
Li Wang |
PDF |
N/A |
On the Convergence Analysis of Over-Parameterized Variational Autoencoders: A Neural Tangent Kernel Perspective |
| TriplePlay:通过CLIP增强非IID数据和资源效率的联邦学习 |
Ahmed Imteaj |
PDF |
N/A |
TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and Resource Efficiency |
| GDFlow:基于NCDE的归一化流用于高级驾驶辅助系统的异常检测 |
Kangjun Lee |
PDF |
N/A |
GDFlow: Anomaly Detection with NCDE-based Normalizing Flow for Advanced Driver Assistance System |
| 在组员身份规范中存在错误情况下的稳健非自适应组测试 |
Shuvayan Banerjee |
PDF |
N/A |
Robust Non-adaptive Group Testing under Errors in Group Membership Specifications |
| 内禀随机反应系统的非爆炸性 |
Chuang Xu |
PDF |
N/A |
Non-explosivity of endotactic stochastic reaction systems |
| 格拉芬:在节点分类不平衡的情况下支持尾部类别 |
Xiaorui Qi |
PDF |
N/A |
Graffin: Stand for Tails in Imbalanced Node Classification |
| 早期退出卷积神经网络 |
Edanur Demir |
PDF |
N/A |
Early-exit Convolutional Neural Networks |
| 基于多模态深度学习的房价预测方法 |
Md Hasebul Hasan |
PDF |
N/A |
A Multi-Modal Deep Learning Based Approach for House Price Prediction |
| 用于压缩神经场表示的拉格朗日哈希 |
Shrisudhan Govindarajan |
PDF |
N/A |
Lagrangian Hashing for Compressed Neural Field Representations |
| 细胞极性-极性及极性-非极性相互作用中的运动顺序 |
Katsuyoshi Matsushita |
PDF |
N/A |
Motion Ordering in Cellular Polar-polar and Polar-nonpolar Interactions |
| 基于KAN的双域融合用于音频驱动的面部标志生成 |
Hoang-Son Vo-Thanh |
PDF |
N/A |
KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation |
| ICPR 2024 非结构化交通及恶劣天气条件下的安全驾驶场景分割竞赛 |
Furqan Ahmed Shaik |
PDF |
N/A |
ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions |
| 具有迁移学习的异构搜索空间的样本高效贝叶斯优化 |
Aryan Deshwal |
PDF |
N/A |
Sample-Efficient Bayesian Optimization with Transfer Learning for Heterogeneous Search Spaces |
| FIF-UNet:一种利用特征交互与融合的高效UNet用于医学图像分割 |
Xiaolin Gou |
PDF |
N/A |
FIF-UNet: An Efficient UNet Using Feature Interaction and Fusion for Medical Image Segmentation |
| 使用基于特定机器滤波器的频谱-时间调制表示进行机器异常声音检测 |
Kai Li |
PDF |
N/A |
Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks |
| 电信领域专用大型语言模型系列:Tele-LLMs |
Ali Maatouk |
PDF |
N/A |
Tele-LLMs: A Series of Specialized Large Language Models for Telecommunications |
| 开放世界动态提示与持续视觉表征学习 |
Youngeun Kim |
PDF |
N/A |
Open-World Dynamic Prompt and Continual Visual Representation Learning |
| 通过基于图的学习来拟合骨骼模型 |
Nicolás Gaggion |
PDF |
N/A |
Fitting Skeletal Models via Graph-based Learning |
| 用于激光雷达-视觉系统的神经表面重建与渲染 |
Jianheng Liu |
PDF |
N/A |
Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems |
| RAL:基于对称视图微分学习的冗余感知唇读模型 |
Zejun gu |
PDF |
N/A |
RAL:Redundancy-Aware Lipreading Model Based on Differential Learning with Symmetric Views |
| 神经网络潜在空间的闭式解释与符号梯度 |
Zakaria Patel |
PDF |
N/A |
Closed-Form Interpretation of Neural Network Latent Spaces with Symbolic Gradients |
| 资源高效型生成式AI模型在移动边缘网络中的部署 |
Yuxin Liang |
PDF |
N/A |
Resource-Efficient Generative AI Model Deployment in Mobile Edge Networks |
| TERD:一种保护扩散模型免受后门攻击的统一框架 |
Yichuan Mo |
PDF |
N/A |
TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors |
| Instagram 上的 Mpox 叙事:一个标注的多语言 Instagram Mpox 帖子数据集,用于情感、仇恨言论和焦虑分析 |
Nirmalya Thakur |
PDF |
N/A |
Mpox Narrative on Instagram: A Labeled Multilingual Dataset of Instagram Posts on Mpox for Sentiment, Hate Speech, and Anxiety Analysis |
| 面向联邦学习和多任务强化学习的快速学习率 |
Feng Zhu |
PDF |
N/A |
Towards Fast Rates for Federated and Multi-Task Reinforcement Learning |
| 寻求与解决:表格问答的推理 |
Ruya Jiang |
PDF |
N/A |
Seek and Solve Reasoning for Table Question Answering |
| 从动力学中高效学习马尔可夫随机场 |
Jason Gaitonde |
PDF |
N/A |
Efficiently Learning Markov Random Fields from Dynamics |
| 语言模型中真理与政治偏见之间的关系 |
Suyash Fulay |
PDF |
N/A |
On the Relationship between Truth and Political Bias in Language Models |
| RotCAtt-TransUNet++:一种用于复杂心脏分割的新型深度神经网络 |
Quoc-Bao Nguyen-Le |
PDF |
N/A |
RotCAtt-TransUNet++: Novel Deep Neural Network for Sophisticated Cardiac Segmentation |
| 脑解码器:基于风格的脑电信号视觉解码 |
Minsuk Choi |
PDF |
N/A |
BrainDecoder: Style-Based Visual Decoding of EEG Signals |
| 短期和长期人物再识别的去耦表示 |
Chanho Eom |
PDF |
N/A |
Disentangled Representations for Short-Term and Long-Term Person Re-Identification |
| RexUniNLU:通用自然语言理解中的递归方法与显式模式指导器 |
Chengyuan Liu |
PDF |
N/A |
RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU |
| 重新思考通过通道和伽马校正先验的低光图像增强中的大气散射驱动注意力 |
Shyang-En Weng |
PDF |
N/A |
Rethinking the Atmospheric Scattering-driven Attention via Channel and Gamma Correction Priors for Low-Light Image Enhancement |
| 从样本中学习子模态序列 |
Jing Yuan |
PDF |
N/A |
Learning Submodular Sequencing from Samples |
| 可扩展帧采样用于视频分类:一种减少搜索空间的半最优策略方法 |
Junho Lee |
PDF |
N/A |
Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space |
| 迈向自动化机器学习研究 |
Shervin Ardeshir |
PDF |
N/A |
Towards Automated Machine Learning Research |
| UPCS:用于对话生成的无偏见人格构建 |
Kuiyun Chen |
PDF |
N/A |
UPCS: Unbiased Persona Construction for Dialogue Generation |
| 使用虚拟染色技术对肺和心脏移植活检进行无标记评估 |
Yuzhu Li |
PDF |
N/A |
Label-free evaluation of lung and heart transplant biopsies using virtual staining |
| MRStyle:一种多模态参考色彩风格迁移的统一框架 |
Jiancheng Huang |
PDF |
N/A |
MRStyle: A Unified Framework for Color Style Transfer with Multi-Modality Reference |