| 印度公务员模拟面试中的性别表现与偏见 |
Somonnoy Banerjee |
PDF |
N/A |
Gender Representation and Bias in Indian Civil Service Mock Interviews |
| Vista3D:揭秘单张图像的3D暗面 |
Qiuhong Shen |
PDF |
N/A |
Vista3D: Unravel the 3D Darkside of a Single Image |
| DynaMo:视觉-运动控制中的领域内动态预训练 |
Zichen Jeff Cui |
PDF |
N/A |
DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control |
| Qwen2-VL:提升视觉-语言模型对任意分辨率世界的感知能力 |
Peng Wang |
PDF |
N/A |
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution |
| 急切模式下的捆绑调整 |
Zitong Zhan |
PDF |
N/A |
Bundle Adjustment in the Eager Mode |
| 大规模多人3D人体运动预测与场景上下文 |
Felix B Mueller |
PDF |
N/A |
Massively Multi-Person 3D Human Motion Forecasting with Scene Context |
| Qwen2.5-Coder 技术报告 |
Binyuan Hui |
PDF |
N/A |
Qwen2.5-Coder Technical Report |
| 是否采用思维链(Chain-of-thought)?思维链主要在数学和符号推理中发挥作用。 |
Zayne Sprague |
PDF |
N/A |
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning |
| 关于大语言模型中长上下文扩展与泛化的控制研究 |
Yi Lu |
PDF |
N/A |
A Controlled Study on Long Context Extension and Generalization in LLMs |
| 微调语言模型以生成不确定性语言表达 |
Arslan Chaudhry |
PDF |
N/A |
Finetuning Language Models to Emit Linguistic Expressions of Uncertainty |
| 计算动力系统 |
Jordan Cotler |
PDF |
N/A |
Computational Dynamical Systems |
| 你只需阅读一次(YORO):学习将数据库知识内化以实现文本到SQL的转换 |
Hideo Kobayashi |
PDF |
N/A |
You Only Read Once (YORO): Learning to Internalize Database Knowledge for Text-to-SQL |
| multiPI-TransBTS:基于多物理信息的脑肿瘤图像分割多路径学习框架 |
Hongjun Zhu |
PDF |
N/A |
multiPI-TransBTS: A Multi-Path Learning Framework for Brain Tumor Image Segmentation Based on Multi-Physical Information |
| 使用空间扭曲进行精确的天空图像预测 |
Leron Julian |
PDF |
N/A |
Precise Forecasting of Sky Images Using Spatial Warping |
| JEAN:联合表达与音频引导的基于NeRF的说话人脸生成 |
Sai Tanmay Reddy Chakkera |
PDF |
N/A |
JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation |
| Autopet III挑战:将解剖学知识融入nnUNet以进行PET/CT中的病变分割 |
Hamza Kalisch |
PDF |
N/A |
Autopet III challenge: Incorporating anatomical knowledge into nnUNet for lesion segmentation in PET/CT |
| 受限条件下分类器的溯因解释:复杂性与性质 |
Martin Cooper |
PDF |
N/A |
Abductive explanations of classifiers under constraints: Complexity and properties |
| 解码风格:利用偏好高效微调大型语言模型进行图像引导的服装推荐 |
Najmeh Forouzandehmehr |
PDF |
N/A |
Decoding Style: Efficient Fine-Tuning of LLMs for Image-Guided Outfit Recommendation with Preference |
| MAgICoRe:多智能体、迭代、由粗到细的推理优化 |
Justin Chih-Yao Chen |
PDF |
N/A |
MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning |
| MoRAG——用于人体运动的多融合检索增强生成 |
Kalakonda Sai Shashank |
PDF |
N/A |
MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion |
| Takin:一系列高质量的零样本语音生成模型 |
EverestAI |
PDF |
N/A |
Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models |
| GRIN:梯度引导的专家混合网络 |
Liyuan Liu |
PDF |
N/A |
GRIN: GRadient-INformed MoE |
| 线性时序差分学习的几乎必然收敛性与任意特征 |
Jiuqi Wang |
PDF |
N/A |
Almost Sure Convergence of Linear Temporal Difference Learning with Arbitrary Features |
| BERT-VBD:越南语多文档摘要框架 |
Tuan-Cuong Vuong |
PDF |
N/A |
BERT-VBD: Vietnamese Multi-Document Summarization Framework |
| Linguini:一种语言无关的语义推理基准 |
Eduardo Sánchez |
PDF |
N/A |
Linguini: A benchmark for language-agnostic linguistic reasoning |
| 使用高度启发式决策规则进行最佳视觉搜索 |
Anqi Zhang |
PDF |
N/A |
Optimal Visual Search with Highly Heuristic Decision Rules |
| Qwen2.5-数学技术报告:通过自我改进迈向数学专家模型 |
An Yang |
PDF |
N/A |
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement |
| 低帧率语音编解码器:专为快速高质量语音大语言模型训练与推理设计的编解码器 |
Edresson Casanova |
PDF |
N/A |
Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference |
| 更强大的基线模型——使机器学习研究与临床实用性相一致的关键要求 |
Nathan Wolfrath |
PDF |
N/A |
Stronger Baseline Models -- A Key Requirement for Aligning Machine Learning Research with Clinical Utility |
| 帕累托数据框架:迈向资源高效决策的步骤——使用最小可行数据(MVD) |
Tashfain Ahmed |
PDF |
N/A |
Pareto Data Framework: Steps Towards Resource-Efficient Decision Making Using Minimum Viable Data (MVD) |
| 知识蒸馏在遥感中的应用:综述 |
Yassine Himeur |
PDF |
N/A |
Applications of Knowledge Distillation in Remote Sensing: A Survey |
| SPRMamba:基于Mamba的内镜黏膜下剥离手术阶段识别 |
Xiangning Zhang |
PDF |
N/A |
SPRMamba: Surgical Phase Recognition for Endoscopic Submucosal Dissection with Mamba |
| 基于大型语言模型的生成心理测量法评估人类与AI的价值观 |
Haoran Ye |
PDF |
N/A |
Measuring Human and AI Values based on Generative Psychometrics with Large Language Models |
| FedLF:联邦长尾学习中的自适应Logit调整与特征优化 |
Xiuhua Lu |
PDF |
N/A |
FedLF: Adaptive Logit Adjustment and Feature Optimization in Federated Long-Tailed Learning |
| 对称性增强学习:一种基于范畴论的鲁棒机器学习模型框架 |
Ronald Katende |
PDF |
N/A |
Symmetry-Enriched Learning: A Category-Theoretic Framework for Robust Machine Learning Models |
| 脑流:基于多模态引导的fMRI-to-图像重建 |
Jaehoon Joo |
PDF |
N/A |
Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance |
| 大规模技能匹配:自由职业者与项目的精准对接,实现高效的多语言候选人检索 |
Warren Jouanneau |
PDF |
N/A |
Skill matching at scale: freelancer-project alignment for efficient multilingual candidate retrieval |
| IMRL:整合视觉、物理、时间及几何表示,以增强食物获取能力 |
Rui Liu |
PDF |
N/A |
IMRL: Integrating Visual, Physical, Temporal, and Geometric Representations for Enhanced Food Acquisition |
| 元素顺序对语言模型代理性能的影响 |
Wayne Chi |
PDF |
N/A |
The Impact of Element Ordering on LM Agent Performance |
| 面向可解释的终末期肾病(ESRD)预测:利用行政索赔数据与可解释的人工智能技术 |
Yubo Li |
PDF |
N/A |
Towards Interpretable End-Stage Renal Disease (ESRD) Prediction: Utilizing Administrative Claims Data with Explainable AI Techniques |
| 原子流匹配的配体结合蛋白设计 |
Junqi Liu |
PDF |
N/A |
Design of Ligand-Binding Proteins with Atomic Flow Matching |
| 用于高分辨率显微图像恢复的去噪扩散模型 |
Pamela Osuna-Vargas |
PDF |
N/A |
Denoising diffusion models for high-resolution microscopy image restoration |
| 通过数据修剪实现无监督领域自适应 |
Andrea Napoli |
PDF |
N/A |
Unsupervised Domain Adaptation Via Data Pruning |
| 视觉惯性里程计中的在线折射相机模型标定 |
Mohit Singh |
PDF |
N/A |
Online Refractive Camera Model Calibration in Visual Inertial Odometry |
| PAD-FT:一种通过数据净化和微调实现轻量级防御后门攻击的方法 |
Yukai Xu |
PDF |
N/A |
PAD-FT: A Lightweight Defense for Backdoor Attacks via Data Purification and Fine-Tuning |
| 拟合多层次因子模型 |
Tetiana Parshakova |
PDF |
N/A |
Fitting Multilevel Factor Models |
| 通用机器人学习框架 |
Jiahuan Yan |
PDF |
N/A |
Generalized Robot Learning Framework |
| PARAPHRASUS:一个全面评估释义检测模型的基准 |
Andrianos Michail |
PDF |
N/A |
PARAPHRASUS : A Comprehensive Benchmark for Evaluating Paraphrase Detection Models |
| 大型语言模型的双层训练与解码:同时思考与表达 |
Ningyuan Xi |
PDF |
N/A |
Dual-Layer Training and Decoding of Large Language Model with Simultaneously Thinking and Speaking |
| Cartan移动标架与数据流形 |
Eliot Tron |
PDF |
N/A |
Cartan moving frames and the data manifolds |
| 扩展的深度子模块函数 |
Seyed Mohammad Hosseini |
PDF |
N/A |
Extended Deep Submodular Functions |
| 使用大型语言模型生成临床试验表格和图表 |
Yumeng Yang |
PDF |
N/A |
Using Large Language Models to Generate Clinical Trial Tables and Figures |
| 在安全强化学习中处理长期安全和不确定性 |
Jonas Günster |
PDF |
N/A |
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning |
| 理解百度-ULTR日志记录策略对双塔模型的影响 |
Morris de Haan |
PDF |
N/A |
Understanding the Effects of the Baidu-ULTR Logging Policy on Two-Tower Models |
| ASR基准测试:需要一个更具代表性的对话数据集 |
Gaurav Maheshwari |
PDF |
N/A |
ASR Benchmarking: Need for a More Representative Conversational Dataset |
| SFDA-rPPG:无源域自适应远程生理测量与时空一致性 |
Yiping Xie |
PDF |
N/A |
SFDA-rPPG: Source-Free Domain Adaptive Remote Physiological Measurement with Spatio-Temporal Consistency |
| 一个统一的时间神经计算与学习框架 |
Stefano Melacci |
PDF |
N/A |
A Unified Framework for Neural Computation and Learning Over Time |
| 多传感器深度学习用于冰川制图 |
Codruţ-Andrei Diaconu |
PDF |
N/A |
Multi-Sensor Deep Learning for Glacier Mapping |
| 拓扑深度学习与状态空间模型:一种针对单纯复形的Mamba方法 |
Marco Montagna |
PDF |
N/A |
Topological Deep Learning with State-Space Models: A Mamba Approach for Simplicial Complexes |
| PhysMamba:利用SlowFast时间差异Mamba进行高效远程生理测量 |
Chaoqi Luo |
PDF |
N/A |
PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba |
| 侧扫声纳图像分类任务中的视觉变换器 |
BW Sheffield |
PDF |
N/A |
On Vision Transformers for Classification Tasks in Side-Scan Sonar Imagery |
| LEMON:结合网格优化与神经着色器的局部化编辑 |
Furkan Mert Algan |
PDF |
N/A |
LEMON: Localized Editing with Mesh Optimization and Neural Shaders |
| 协作代码生成模型的承诺与风险:平衡效果与记忆 |
Zhi Chen |
PDF |
N/A |
Promise and Peril of Collaborative Code Generation Models: Balancing Effectiveness and Memorization |
| 用于长期预测太阳辐照度的计算成像 |
Leron Julian |
PDF |
N/A |
Computational Imaging for Long-Term Prediction of Solar Irradiance |
| 跨量子化学层次的一体化基础模型学习 |
Yuxinxin Chen |
PDF |
N/A |
All-in-one foundational models learning across quantum chemical levels |
| BRDF-NeRF:基于光学卫星图像和BRDF建模的神经辐射场 |
Lulin Zhang |
PDF |
N/A |
BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling |
| 视觉语言模型的提示学习混合方法 |
Yu Du |
PDF |
N/A |
Mixture of Prompt Learning for Vision Language Models |
| ChefFusion:融合食谱与食物图像生成的多模态基础模型 |
Peiyu Li |
PDF |
N/A |
ChefFusion: Multimodal Foundation Model Integrating Recipe and Food Image Generation |
| 全景深度预测 |
Juana Valeria Hurtado |
PDF |
N/A |
Panoptic-Depth Forecasting |
| 在生成世界模型中表示物体操作的位置信息 |
Stefano Ferraro |
PDF |
N/A |
Representing Positional Information in Generative World Models for Object Manipulation |
| 使用多模态目标实例重识别实现全球定位 |
Aneesh Chavan |
PDF |
N/A |
Towards Global Localization using Multi-Modal Object-Instance Re-Identification |
| 将数据置于离线多智能体强化学习的中心 |
Claude Formanek |
PDF |
N/A |
Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning |
| “它在技术上可能令人印象深刻,但对我们的实际应用毫无用处”:新闻业中围绕人工智能跨职能协作的实践、挑战与机遇 |
Qing Xiao |
PDF |
N/A |
"It Might be Technically Impressive, But It's Practically Useless to Us": Practices, Challenges, and Opportunities for Cross-Functional Collaboration around AI within the News Industry |
| 解开Hessian之谜:平滑收敛损失函数景观的关键 |
Nikita Kiselev |
PDF |
N/A |
Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes |
| 加性特征归因方法:流体动力学和传热领域可解释人工智能综述 |
Andrés Cremades |
PDF |
N/A |
Additive-feature-attribution methods: a review on explainable artificial intelligence for fluid dynamics and heat transfer |
| 一种高效的数据受限地统计应用中的不确定性估计模型无关方法 |
Viacheslav Barkov |
PDF |
N/A |
An Efficient Model-Agnostic Approach for Uncertainty Estimation in Data-Restricted Pedometric Applications |
| 术中通过跨模态逆神经渲染进行配准 |
Maximilian Fehrentz |
PDF |
N/A |
Intraoperative Registration by Cross-Modal Inverse Neural Rendering |
| MitoSeg:线粒体分割工具 |
Faris Serdar Taşel |
PDF |
N/A |
MitoSeg: Mitochondria Segmentation Tool |
| 基于图神经网络的度量-语义因子图生成 |
Jose Andres Millan-Romera |
PDF |
N/A |
Metric-Semantic Factor Graph Generation based on Graph Neural Networks |
| 从LLM衍生的嵌入表示中采样潜在材料属性信息 |
Luke P. J. Gilligan |
PDF |
N/A |
Sampling Latent Material-Property Information From LLM-Derived Embedding Representations |
| 揭开黑箱:鸟瞰图感知模型的独立功能模块评估 |
Ludan Zhang |
PDF |
N/A |
Unveiling the Black Box: Independent Functional Module Evaluation for Bird's-Eye-View Perception Model |
| 合成数据作为基准的有效性 |
Gaurav Maheshwari |
PDF |
N/A |
Efficacy of Synthetic Data as a Benchmark |
| 使用教师指导的混淆类指令进行数据高效声场景分类 |
Jin Jie Sean Yeo |
PDF |
N/A |
Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction |
| 基于复杂环境的中文连续手语数据集 |
Qidan Zhu |
PDF |
N/A |
A Chinese Continuous Sign Language Dataset Based on Complex Environments |
| 使用帧事件融合网络在高帧率下跟踪任意点 |
Jiaxiong Liu |
PDF |
N/A |
Tracking Any Point with Frame-Event Fusion Network at High Frame Rate |
| GaussianHeads:从粗到细表示中端到端学习可驾驶的高斯头虚拟形象 |
Kartik Teotia |
PDF |
N/A |
GaussianHeads: End-to-End Learning of Drivable Gaussian Head Avatars from Coarse-to-fine Representations |
| 可微分碰撞监督的牙齿排列网络:基于解耦视角 |
Zhihui He |
PDF |
N/A |
Differentiable Collision-Supervised Tooth Arrangement Network with a Decoupling Perspective |
| 使用李群方向的强化学习用于机器人 |
Martin Schuck |
PDF |
N/A |
Reinforcement Learning with Lie Group Orientations for Robotics |
| 将强化学习作为一种改进启发式算法用于实际生产调度 |
Arthur Müller |
PDF |
N/A |
Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling |
| 一种可解释的机器学习方法用于交通事故死亡预测 |
Md. Asif Khan Rifat |
PDF |
N/A |
An Explainable Machine Learning Approach to Traffic Accident Fatality Prediction |
| 凝聚式令牌聚类 |
Joakim Bruslund Haurum |
PDF |
N/A |
Agglomerative Token Clustering |
| 通过时间与空间组合扩散模型生成复杂的三维人体动作 |
Lorenzo Mandelli |
PDF |
N/A |
Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models |
| LLM-wrapper: 视觉-语言基础模型的黑箱语义感知适应 |
Amaia Cardiel |
PDF |
N/A |
LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Foundation Models |
| 教育中的大型语言模型:新视角、挑战与机遇 |
Bashar Alhafni |
PDF |
N/A |
LLMs in Education: Novel Perspectives, Challenges, and Opportunities |
| 肿瘤感知的多患者间可变形图像配准的计算机断层扫描图像,用于肺癌 |
Jue Jiang |
PDF |
N/A |
Tumor aware recurrent inter-patient deformable image registration of computed tomography scans with lung cancer |
| AlignBot:通过微调实现家用机器人与用户提醒的视觉语言模型驱动的定制任务规划对齐 |
Zhaxizhuoma |
PDF |
N/A |
AlignBot: Aligning VLM-powered Customized Task Planning with User Reminders Through Fine-Tuning for Household Robots |
| 寻找主观真相:为全面生成式人工智能模型评估收集200万张选票 |
Dimitrios Christodoulou |
PDF |
N/A |
Finding the Subjective Truth: Collecting 2 Million Votes for Comprehensive Gen-AI Model Evaluation |
| 更少的内存意味着更小的GPU:使用压缩激活进行反向传播 |
Daniel Barley |
PDF |
N/A |
Less Memory Means smaller GPUs: Backpropagation with Compressed Activations |
| LLMs + Persona-Plug = 个性化LLMs |
Jiongnan Liu |
PDF |
N/A |
LLMs + Persona-Plug = Personalized LLMs |
| 多网格图神经网络与自注意力机制在计算力学中的应用 |
Paul Garnier |
PDF |
N/A |
Multi-Grid Graph Neural Networks with Self-Attention for Computational Mechanics |
| 针对网络攻击的自主四旋翼无人机安全控制系统 |
Samuel Belkadi |
PDF |
N/A |
Secure Control Systems for Autonomous Quadrotors against Cyber-Attacks |
| DocMamba:利用状态空间模型实现高效文档预训练 |
Pengfei Hu |
PDF |
N/A |
DocMamba: Efficient Document Pre-training with State Space Model |
| OOD检测的最新进展:问题与方法 |
Shuo Lu |
PDF |
N/A |
Recent Advances in OOD Detection: Problems and Approaches |
| ABHINAW:一种用于自动评估AI生成图像中排版的方法 |
Abhinaw Jagtap |
PDF |
N/A |
ABHINAW: A method for Automatic Evaluation of Typography within AI-Generated Images |
| SpheriGait:通过球面投影丰富空间表示,用于基于激光雷达的步态识别 |
Yanxi Wang |
PDF |
N/A |
SpheriGait: Enriching Spatial Representation via Spherical Projection for LiDAR-based Gait Recognition |
| 无需蒸馏的图像和视频大型SSM模型扩展 |
Hamid Suleman |
PDF |
N/A |
Distillation-free Scaling of Large SSMs for Images and Videos |
| 从多模态演示中学习多阶段接触密集操作的任务规划 |
Kejia Chen |
PDF |
N/A |
Learning Task Planning from Multi-Modal Demonstration for Multi-Stage Contact-Rich Manipulation |
| 基于位置的概率性电动汽车充电站负荷预测:采用多分位数时间卷积网络的深度迁移学习 |
Mohammad Wazed Ali |
PDF |
N/A |
Location based Probabilistic Load Forecasting of EV Charging Sites: Deep Transfer Learning with Multi-Quantile Temporal Convolutional Network |
| 检索、注释、评估、重复:利用多模态大型语言模型进行大规模产品检索评估 |
Kasra Hosseini |
PDF |
N/A |
Retrieve, Annotate, Evaluate, Repeat: Leveraging Multimodal LLMs for Large-Scale Product Retrieval Evaluation |
| 卷积层谱范数的上界紧致且高效 |
Ekaterina Grishina |
PDF |
N/A |
Tight and Efficient Upper Bound on Spectral Norm of Convolutional Layers |
| 基于边缘的图组件池化 |
T. Snelleman |
PDF |
N/A |
Edge-Based Graph Component Pooling |
| 基于物理光度学的非朗伯环境下的捆绑调整 |
Lei Cheng |
PDF |
N/A |
Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments |
| XP-MARL:在多智能体强化学习中辅助优先级排序以解决非平稳性问题 |
Jianye Xu |
PDF |
N/A |
XP-MARL: Auxiliary Prioritization in Multi-Agent Reinforcement Learning to Address Non-Stationarity |
| 一种基于小波的高效物理信息神经网络用于奇异摄动问题 |
Himanshu Pandey |
PDF |
N/A |
An efficient wavelet-based physics-informed neural networks for singularly perturbed problems |
| 喵:通过反转事实实现记忆监督的LLM遗忘 |
Tianle Gu |
PDF |
N/A |
MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts |
| 用于学习分子热力学和动力学的图神经网络-状态预测信息瓶颈(GNN-SPIB)方法 |
Ziyue Zou |
PDF |
N/A |
Graph Neural Network-State Predictive Information Bottleneck (GNN-SPIB) approach for learning molecular thermodynamics and kinetics |
| NT-ViT:用于EEG-to-fMRI合成的神经转码视觉变换器 |
Romeo Lanzino |
PDF |
N/A |
NT-ViT: Neural Transcoding Vision Transformers for EEG-to-fMRI Synthesis |
| DPI-TTS:用于快速收敛和风格时间建模的文本到语音中的定向补丁交互 |
Xin Qi |
PDF |
N/A |
DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech |
| RaggeDi:基于扩散的无序布料、床单、毛巾和毯子的状态估计 |
Jikai Ye |
PDF |
N/A |
RaggeDi: Diffusion-based State Estimation of Disordered Rags, Sheets, Towels and Blankets |
| 提取与摘要的统一:在单一编码器-解码器框架内融合抽取式与生成式摘要 |
Yuping Wu |
PDF |
N/A |
Extract-and-Abstract: Unifying Extractive and Abstractive Summarization within Single Encoder-Decoder Framework |
| 优化家具行业作业车间调度:一种考虑机器设置、批次变异性和内部物流的强化学习方法 |
Malte Schneevogt |
PDF |
N/A |
Optimizing Job Shop Scheduling in the Furniture Industry: A Reinforcement Learning Approach Considering Machine Setup, Batch Variability, and Intralogistics |
| 端到端概率几何引导回归用于6自由度物体姿态估计 |
Thomas Pöllabauer |
PDF |
N/A |
End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation |
| EFCM:在医疗图像分析中部署大型模型的压缩模型高效微调 |
Shaojie Li |
PDF |
N/A |
EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis |
| SymFace:深度人脸识别中的额外面部对称性损失 |
Pritesh Prakash |
PDF |
N/A |
SymFace: Additional Facial Symmetry Loss for Deep Face Recognition |
| EventAug:面向基于事件学习的多维度时空数据增强方法 |
Yukun Tian |
PDF |
N/A |
EventAug: Multifaceted Spatio-Temporal Data Augmentation Methods for Event-based Learning |
| 通过主动学习加速训练并提高强非谐材料机器学习原子间势的可靠性 |
Kisung Kang |
PDF |
N/A |
Accelerating the Training and Improving the Reliability of Machine-Learned Interatomic Potentials for Strongly Anharmonic Materials through Active Learning |
| 约束引导的自编码器在机器状态监测中联合优化状态指标估计与异常检测 |
Maarten Meire |
PDF |
N/A |
Constraint Guided AutoEncoders for Joint Optimization of Condition Indicator Estimation and Anomaly Detection in Machine Condition Monitoring |
| 潜在指纹增强以实现精确细节检测 |
Abdul Wahab |
PDF |
N/A |
Latent fingerprint enhancement for accurate minutiae detection |
| 大型语言模型在法律领域的事实性 |
Rajaa El Hamdani |
PDF |
N/A |
The Factuality of Large Language Models in the Legal Domain |
| 通过桥梁蒸馏实现高效低分辨率人脸识别 |
Shiming Ge |
PDF |
N/A |
Efficient Low-Resolution Face Recognition via Bridge Distillation |
| 提取通道以实现高效的深度跟踪 |
Shiming Ge |
PDF |
N/A |
Distilling Channels for Efficient Deep Tracking |
| 在合理有限的计算资源下开发和双语评估日本医疗大型语言模型 |
Issey Sukeda |
PDF |
N/A |
Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources |
| 智能数据驱动的GRU预测器用于SnO$_2$薄膜特性 |
Faiza Bouamra |
PDF |
N/A |
Smart Data-Driven GRU Predictor for SnO$_2$ Thin films Characteristics |
| 使用道义逻辑的论证理论解释非单调规范推理 |
Zhe Yu |
PDF |
N/A |
Explaining Non-monotonic Normative Reasoning using Argumentation Theory with Deontic Logic |
| 基于对称性的结构化矩阵用于高效近似等变网络 |
Ashwin Samudre |
PDF |
N/A |
Symmetry-Based Structured Matrices for Efficient Approximately Equivariant Networks |
| 知识适应网络用于少样本类增量学习 |
Ye Wang |
PDF |
N/A |
Knowledge Adaptation Network for Few-Shot Class-Incremental Learning |
| 一张地图找到所有:零样本多对象导航的实时开放词汇映射 |
Finn Lukas Busch |
PDF |
N/A |
One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation |
| 一致估计一类协方差矩阵间距离的方法 |
Roberto Pereira |
PDF |
N/A |
Consistent Estimation of a Class of Distances Between Covariance Matrices |
| 为自主系统合成演变的符号表示 |
Gabriele Sartor |
PDF |
N/A |
Synthesizing Evolving Symbolic Representations for Autonomous Systems |
| NPAT 零空间投影对抗训练:实现零退化 |
Hanyi Hu |
PDF |
N/A |
NPAT Null-Space Projected Adversarial Training Towards Zero Deterioration |
| 使用Rein对跨组织和跨扫描仪的腺癌进行细粒度分割以微调视觉基础模型 |
Pengzhou Cai |
PDF |
N/A |
Cross-Organ and Cross-Scanner Adenocarcinoma Segmentation using Rein to Fine-tune Vision Foundation Models |
| 图像回忆的神经编码:类人记忆 |
Virgile Foussereau |
PDF |
N/A |
Neural Encoding for Image Recall: Human-Like Memory |
| RockTrack:一种3D鲁棒多相机多目标跟踪框架 |
Xiaoyu Li |
PDF |
N/A |
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework |
| 探索自闭症儿童的注视模式:聚类、可视化和预测 |
Weiyan Shi |
PDF |
N/A |
Exploring Gaze Pattern in Autistic Children: Clustering, Visualization, and Prediction |
| HARP:结合人类辅助重组与排列不变评论器的多智能体强化学习 |
Huawen Hu |
PDF |
N/A |
HARP: Human-Assisted Regrouping with Permutation Invariant Critic for Multi-Agent Reinforcement Learning |
| 自适应选择傅里叶压缩感知中的采样-重构方法 |
Seongmin Hong |
PDF |
N/A |
Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing |
| InverseMeetInsert:通过引导扩散模型中的几何累积反演实现鲁棒的实图像编辑 |
Yan Zheng |
PDF |
N/A |
InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation Inversion in Guided Diffusion Models |
| 基础模型中的人类情感认知 |
Kanishk Gandhi |
PDF |
N/A |
Human-like Affective Cognition in Foundation Models |
| DETECLAP:利用对象信息增强视听表示学习 |
Shota Nakada |
PDF |
N/A |
DETECLAP: Enhancing Audio-Visual Representation Learning with Object Information |
| 实现低培训成本的实时对话 |
Wang Xu |
PDF |
N/A |
Enabling Real-Time Conversations with Minimal Training Costs |
| 揭示在大型语言模型角色扮演中检测角色知识错误所面临的挑战 |
Wenyuan Zhang |
PDF |
N/A |
Revealing the Challenge of Detecting Character Knowledge Errors in LLM Role-Playing |
| TART:一个用于可解释表格推理的开源工具增强框架 |
Xinyuan Lu |
PDF |
N/A |
TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning |
| Free-VSC:从视觉基础模型中释放语义,实现无监督视频语义压缩 |
Yuan Tian |
PDF |
N/A |
Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression |
| 从指数稳定到有限/固定时间稳定:优化中的应用 |
Ibrahim K. Ozaslan |
PDF |
N/A |
From exponential to finite/fixed-time stability: Applications to optimization |
| LFIC-DRASC:使用解耦表示和非对称条带卷积的深度光场图像压缩 |
Shiyu Feng |
PDF |
N/A |
LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution |
| 多机器人连接以实现集体障碍场遍历 |
Haodi Hu |
PDF |
N/A |
Multi-robot connection towards collective obstacle field traversal |
| RopeBEV:一种基于多摄像头鸟瞰视角的路侧感知网络 |
Jinrang Jia |
PDF |
N/A |
RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View |
| 从列表到表情符号:格式偏差如何影响模型对齐 |
Xuanchang Zhang |
PDF |
N/A |
From Lists to Emojis: How Format Bias Affects Model Alignment |
| 利用大型语言模型进行API交互:分类与合成数据生成的框架 |
Chunliang Tao |
PDF |
N/A |
Harnessing LLMs for API Interactions: A Framework for Classification and Synthetic Data Generation |
| 使用解析本体模板发现可表述对象的概念知识 |
Jianhua Sun |
PDF |
N/A |
Discovering Conceptual Knowledge with Analytic Ontology Templates for Articulated Objects |
| FLARE:融合语言模型与协作架构以增强推荐系统 |
Liam Hebert |
PDF |
N/A |
FLARE: Fusing Language Models and Collaborative Architectures for Recommender Enhancement |
| 单项式矩阵群等变神经函数网络 |
Hoang V. Tran |
PDF |
N/A |
Monomial Matrix Group Equivariant Neural Functional Networks |
| ORB-SfMLearner:基于ORB引导的自监督视觉里程计与选择性在线适应 |
Yanlin Jin |
PDF |
N/A |
ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation |
| GUNet:一种结合扩散模型的图卷积网络,用于稳定且多样化的姿态生成 |
Shuowen Liang |
PDF |
N/A |
GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation |
| SLAM辅助的腹腔镜手术三维跟踪系统 |
Jingwei Song |
PDF |
N/A |
SLAM assisted 3D tracking system for laparoscopic surgery |
| 利用基于深度学习的偶发性CT影像检测漏诊的医疗状况 |
Asad Aali |
PDF |
N/A |
Detecting Underdiagnosed Medical Conditions with Deep Learning-Based Opportunistic CT Imaging |
| 概率时间序列预测的递归插值器 |
Yu Chen |
PDF |
N/A |
Recurrent Interpolants for Probabilistic Time Series Prediction |
| 基于k-mer的方法用于连接泛基因组学和群体遗传学 |
Miles D. Roberts |
PDF |
N/A |
k-mer-based approaches to bridging pangenomics and population genetics |
| SRIF:基于扩散图像形变和流估计的语义形状配准 |
Mingze Sun |
PDF |
N/A |
SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation |
| 使用二维掩码的梯度驱动三维分割和高斯喷洒中的功能转移 |
Joji Joseph |
PDF |
N/A |
Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks |
| 一种用于大规模推荐系统中多任务融合的增强状态强化学习算法 |
Peng Liu |
PDF |
N/A |
An Enhanced-State Reinforcement Learning Algorithm for Multi-Task Fusion in Large-Scale Recommender Systems |
| 增强复杂公式识别的分层细节聚焦网络 |
Jiale Wang |
PDF |
N/A |
Enhancing Complex Formula Recognition with Hierarchical Detail-Focused Network |
| 基于超图的运动生成与多模态交互关系推理 |
Keshu Wu |
PDF |
N/A |
Hypergraph-based Motion Generation with Multi-modal Interaction Relational Reasoning |
| 基于证据权重的可解释目标识别方法(WoE):一种以人为中心的方法 |
Abeer Alshehri |
PDF |
N/A |
Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach |
| RUIE:基于检索的大语言模型统一信息抽取 |
Xincheng Liao |
PDF |
N/A |
RUIE: Retrieval-based Unified Information Extraction using Large Language Model |
| 在随机博弈中预见对手的疏忽 |
Shadi Tasdighi Kalat |
PDF |
N/A |
Anticipating Oblivious Opponents in Stochastic Games |
| 具有掩码去噪机制的代理聚合器用于病理全切片图像分析 |
Xitong Ling |
PDF |
N/A |
Agent Aggregator with Mask Denoise Mechanism for Histopathology Whole Slide Image Analysis |
| GReDP:一种更鲁棒的差分隐私训练方法,通过梯度保持噪声减少 |
Haodi Wang |
PDF |
N/A |
GReDP: A More Robust Approach for Differential Privacy Training with Gradient-Preserving Noise Reduction |
| 缩小飞行就绪星载视觉的领域差距 |
Tae Ha Park |
PDF |
N/A |
Bridging Domain Gap for Flight-Ready Spaceborne Vision |
| 非独立同分布去中心化数据下的少样本类增量学习 |
Cuiwei Liu |
PDF |
N/A |
Few-Shot Class-Incremental Learning with Non-IID Decentralized Data |
| VL-Reader:视觉与语言重构器是一种高效的场景文本识别器。 |
Humen Zhong |
PDF |
N/A |
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer |
| 如何利用人工智能构建虚拟细胞:优先事项与机遇 |
Charlotte Bunne |
PDF |
N/A |
How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities |
| 通过代表性和多样性样本选择增强半监督学习 |
Qian Shao |
PDF |
N/A |
Enhancing Semi-Supervised Learning via Representative and Diverse Sample Selection |
| 放松DARTS:放松可微架构搜索在眼动识别中的约束 |
Hongyu Zhu |
PDF |
N/A |
Relax DARTS: Relaxing the Constraints of Differentiable Architecture Search for Eye Movement Recognition |
| 大规模模型量化的艺术与科学:全面概述 |
Yanshu Wang |
PDF |
N/A |
Art and Science of Quantizing Large-Scale Models: A Comprehensive Overview |
| 硬标签密码分析提取神经网络模型 |
Yi Chen |
PDF |
N/A |
Hard-Label Cryptanalytic Extraction of Neural Network Models |
| 基于胸部X光图像的肺结核分类少样本学习方法 |
A. A. G. Yogi Pramana |
PDF |
N/A |
Few-Shot Learning Approach on Tuberculosis Classification Based on Chest X-Ray Images |
| 基于大语言模型检测的电话诈骗对抗:我们目前处于什么阶段? |
Zitong Shen |
PDF |
N/A |
Combating Phone Scams with LLM-based Detection: Where Do We Stand? |
| DAF-Net:一种具有域自适应的双分支特征分解融合网络,用于红外与可见光图像融合 |
Jian Xu |
PDF |
N/A |
DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion |
| 利用KNN-SINDy混合模型增强空气质量监测网络中的PM2.5数据插补与预测 |
Yohan Choi |
PDF |
N/A |
Enhancing PM2.5 Data Imputation and Prediction in Air Quality Monitoring Networks Using a KNN-SINDy Hybrid Model |
| BanStereoSet:一个用于衡量大型语言模型中对孟加拉语的刻板社会偏见的数据集 |
Mahammed Kamruzzaman |
PDF |
N/A |
BanStereoSet: A Dataset to Measure Stereotypical Social Biases in LLMs for Bangla |
| “女性比男性更具文化知识?”:角色设定对大型语言模型文化规范解读的影响 |
Mahammed Kamruzzaman |
PDF |
N/A |
"A Woman is More Culturally Knowledgeable than A Man?": The Effect of Personas on Cultural Norm Interpretation in LLMs |
| PainDiffusion: 机器人能表达疼痛吗? |
Quang Tien Dam |
PDF |
N/A |
PainDiffusion: Can robot express pain? |
| 一种度量混合规划方法,用于解决基于简单SIR模型的疫情规划问题 |
Ari Gestetner |
PDF |
N/A |
A Metric Hybrid Planning Approach to Solving Pandemic Planning Problems with Simple SIR Models |
| 多模态广义类别发现 |
Yuchang Su |
PDF |
N/A |
Multimodal Generalized Category Discovery |
| 基于更快残差多分支脉冲神经网络的高光谱图像分类 |
Yang Liu |
PDF |
N/A |
Hyperspectral Image Classification Based on Faster Residual Multi-branch Spiking Neural Network |
| PieClam:基于重叠包容性和排他性社区的通用图自编码器 |
Daniel Zilberg |
PDF |
N/A |
PieClam: A Universal Graph Autoencoder Based on Overlapping Inclusive and Exclusive Communities |
| HRA:一种用于排序元启发式优化算法的多准则框架 |
Evgenia-Maria K. Goula |
PDF |
N/A |
HRA: A Multi-Criteria Framework for Ranking Metaheuristic Optimization Algorithms |
| 基于CMOS的时间域模拟尖峰神经元的物理储备计算硬件友好实现 |
Nanako Kimura |
PDF |
N/A |
Hardware-Friendly Implementation of Physical Reservoir Computing with CMOS-based Time-domain Analog Spiking Neurons |