| ACING:黑箱大型语言模型中的指令学习演员-评论家方法 |
Salma Kharrat |
PDF |
N/A |
ACING: Actor-Critic for Instruction Learning in Black-Box Large Language Models |
| 多多益善:论演变的五值光谱布尔函数 |
Claude Carlet |
PDF |
N/A |
The More the Merrier: On Evolving Five-valued Spectra Boolean Functions |
| 基准测试GNN和图变换器的定位编码 |
Florian Grötschla |
PDF |
N/A |
Benchmarking Positional Encodings for GNNs and Graph Transformers |
| 从量子数据中测试经典性质 |
Matthias C. Caro |
PDF |
N/A |
Testing classical properties from quantum data |
| 有意义交流的信息论 |
Doron Sivan |
PDF |
N/A |
Information Theory of Meaningful Communication |
| LazyDINO:通过结构利用和代理驱动的测度传输实现快速、可扩展且高效分摊的贝叶斯反演 |
Lianghao Cao |
PDF |
N/A |
LazyDINO: Fast, scalable, and efficiently amortized Bayesian inversion via structure-exploiting and surrogate-driven measure transport |
| 无启发式多教师学习 |
Huy Thong Nguyen |
PDF |
N/A |
Heuristic-Free Multi-Teacher Learning |
| 用于语音的非线性动力学模型的缩放法则 |
Sam Kirkham |
PDF |
N/A |
Scaling laws for nonlinear dynamical models of speech |
| 重新思考MUSHRA:应对文本到语音评估中的现代挑战 |
Praveen Srinivasa Varadhan |
PDF |
N/A |
Rethinking MUSHRA: Addressing Modern Challenges in Text-to-Speech Evaluation |
| CATCH:互补自适应令牌级对比解码,以减轻大型视觉语言模型中的幻觉现象 |
Zhehan Kan |
PDF |
N/A |
CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs |
| 增强多类别疾病分类:利用先进大型语言模型对肿瘤、心血管、神经系统及消化系统疾病进行分类 |
Ahmed Akib Jawad Karim |
PDF |
N/A |
Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs |
| 调酒师:一种可亲近且可解释的方法,用于比较医学影像与非影像数据 |
Ayush Singla |
PDF |
N/A |
Barttender: An approachable & interpretable way to compare medical imaging and non-imaging data |
| 强化虚假新闻检测:利用支持向量机与复杂文本向量化技术。挑战BERT? |
Ahmed Akib Jawad Karim |
PDF |
N/A |
Strengthening Fake News Detection: Leveraging SVM and Sophisticated Text Vectorization Techniques. Defying BERT? |
| 当后门发声:通过模型生成的解释理解大语言模型后门攻击 |
Huaizhi Ge |
PDF |
N/A |
When Backdoors Speak: Understanding LLM Backdoor Attacks Through Model-Generated Explanations |
| 学习带有不完美建议的多变量高斯分布 |
Arnab Bhattacharyya |
PDF |
N/A |
Learning multivariate Gaussians with imperfect advice |
| 属性推理攻击在联邦回归任务中的应用 |
Francesco Diana |
PDF |
N/A |
Attribute Inference Attacks for Federated Regression Tasks |
| AdaCM$^2$:通过自适应跨模态记忆缩减理解极长期视频 |
Yuanbin Man |
PDF |
N/A |
AdaCM$^2$: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction |
| IMUVIE:通过运动电影进行拾取时间线动作定位 |
John Clapham |
PDF |
N/A |
IMUVIE: Pickup Timeline Action Localization via Motion Movies |
| 利用大型语言模型增强美国手语(ASL)与印度手语(ISL)之间的翻译 |
Malay Kumar |
PDF |
N/A |
Enhanced Sign Language Translation between American Sign Language (ASL) and Indian Sign Language (ISL) Using LLMs |
| AI引导的宫颈癌早期筛查 |
Dharanidharan S I |
PDF |
N/A |
AI Guided Early Screening of Cervical Cancer |
| 深度学习驱动的损伤皮肤层厚度评估热图分析 |
Devakumar GR |
PDF |
N/A |
Deep Learning-Driven Heat Map Analysis for Evaluating thickness of Wounded Skin Layers |
| 基于物联网的运动员三维姿态估计与动作优化:C3D与OpenPose的应用 |
Fei Ren |
PDF |
N/A |
IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose |
| 基于世界模型的神经符号图谱丰富 |
Stefano De Giorgis |
PDF |
N/A |
Neurosymbolic Graph Enrichment for Grounded World Models |
| 作物模式识别中的机器学习方法:比较分析 |
Kazi Hasibul Kabir |
PDF |
N/A |
Machine Learning Approaches on Crop Pattern Recognition a Comparative Analysis |
| 通过后验回归进行少标签的自动评估 |
Benjamin Eyre |
PDF |
N/A |
Auto-Evaluation with Few Labels through Post-hoc Regression |
| PoM:利用多项式混合器实现高效图像和视频生成 |
David Picard |
PDF |
N/A |
PoM: Efficient Image and Video Generation with the Polynomial Mixer |
| 利用边缘计算的微服务优化航空公司预订系统:实时数据处理与提升用户响应性的框架 |
Biman Barua |
PDF |
N/A |
Optimizing Airline Reservation Systems with Edge-Enabled Microservices: A Framework for Real-Time Data Processing and Enhanced User Responsiveness |
| CodeXEmbed:一种面向多语言和多任务代码检索的通用嵌入模型家族 |
Ye Liu |
PDF |
N/A |
CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval |
| DLBacktrace:一种适用于任何深度学习模型的模型无关可解释性方法 |
Vinay Kumar Sankarapu |
PDF |
N/A |
DLBacktrace: A Model Agnostic Explainability for any Deep Learning Models |
| Leadsee-Precip:一种用于降水预测的深度学习诊断模型 |
Weiwen Ji |
PDF |
N/A |
Leadsee-Precip: A Deep Learning Diagnostic Model for Precipitation |
| PyAWD:一个用于生成带有Devito的大规模声波传播合成数据集的库 |
Pascal Tribel |
PDF |
N/A |
PyAWD: A Library for Generating Large Synthetic Datasets of Acoustic Wave Propagation with Devito |
| M3D:双流选择性状态空间与深度驱动框架,用于高保真单视图三维重建 |
Luoxi Zhang |
PDF |
N/A |
M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction |
| 即时策略:通过图扩散进行上下文模仿学习 |
Vitalis Vosylius |
PDF |
N/A |
Instant Policy: In-Context Imitation Learning via Graph Diffusion |
| 使用图神经网络估计模拟星系团中的暗物质晕质量 |
Nikhil Garuda |
PDF |
N/A |
Estimating Dark Matter Halo Masses in Simulated Galaxy Clusters with Graph Neural Networks |
| 利用扩散几何探索神经网络的多面性 |
Elliott Abel |
PDF |
N/A |
Exploring the Manifold of Neural Networks Using Diffusion Geometry |
| 运动地图(MfM):从稀疏多视角图像生成二维语义地图 |
Matteo Toso |
PDF |
N/A |
Maps from Motion (MfM): Generating 2D Semantic Maps from Sparse Multi-view Images |
| 利用虚拟现实和人工智能辅导进行语言学习:一个虚拟校园环境的案例研究,结合了OpenAI GPT与Unity 3D的集成 |
Adithya TG |
PDF |
N/A |
Leveraging Virtual Reality and AI Tutoring for Language Learning: A Case Study of a Virtual Campus Environment with OpenAI GPT Integration with Unity 3D |
| 一种结合结构和跨域文本指导的弱监督OCT分割多模态方法 |
Jiaqi Yang |
PDF |
N/A |
A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation |
| 奖励驱动的工作流程,用于从原子分辨率成像数据中进行无监督的可解释相位和铁电变体的分析 |
Kamyar Barakati |
PDF |
N/A |
Reward driven workflows for unsupervised explainable analysis of phases and ferroic variants from atomically resolved imaging data |
| SG-LRA:基于低秩近似的自生成自动脊柱侧弯Cobb角测量 |
Zhiwen Shao |
PDF |
N/A |
SG-LRA: Self-Generating Automatic Scoliosis Cobb Angle Measurement with Low-Rank Approximation |
| STREAM:一种适用于稀疏几何数据的通用状态空间模型 |
Mark Schöne |
PDF |
N/A |
STREAM: A Universal State-Space Model for Sparse Geometric Data |
| SAM 承担重任:一种半监督方法,用于优化医学分割中的伪标签 |
Ron Keuth |
PDF |
N/A |
SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation |
| 超图 $p$-Laplacian 方程用于数据插值和半监督学习 |
Kehan Shi |
PDF |
N/A |
Hypergraph $p$-Laplacian equations for data interpolation and semi-supervised learning |
| 主题建模和下游任务中的可证明遗忘 |
Stanley Wei |
PDF |
N/A |
Provable unlearning in topic modeling and downstream tasks |
| GNNAS-Dock:基于图神经网络的分子对接预算感知算法选择 |
Yiliang Yuan |
PDF |
N/A |
GNNAS-Dock: Budget Aware Algorithm Selection with Graph Neural Networks for Molecular Docking |
| 尼泊尔语上的Whisper微调 |
Sanjay Rijal |
PDF |
N/A |
Whisper Finetuning on Nepali Language |
| 预训练中的程序性知识推动大型语言模型中的推理 |
Laura Ruis |
PDF |
N/A |
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models |
| 随机BIQA:用于认证盲图像质量评估的中值随机平滑 |
Ekaterina Shumitskaya |
PDF |
N/A |
Stochastic BIQA: Median Randomized Smoothing for Certified Blind Image Quality Assessment |
| 用于设计结构矩阵组合优化的大型语言模型 |
Shuo Jiang |
PDF |
N/A |
Large Language Models for Combinatorial Optimization of Design Structure Matrix |
| 一种基于数据驱动的方法,用于根据描述符在将噪声轨迹转化为物理相关信息方面的效率对其进行分类 |
Simone Martino |
PDF |
N/A |
A data driven approach to classify descriptors based on their efficiency in translating noisy trajectories into physically-relevant information |
| 基于流的主动学习在过程监控中的应用 |
Christian Capezza |
PDF |
N/A |
Stream-Based Active Learning for Process Monitoring |
| 拓扑对称增强图卷积用于基于骨架的动作识别 |
Zeyu Liang |
PDF |
N/A |
Topological Symmetry Enhanced Graph Convolution for Skeleton-Based Action Recognition |
| 回忆与精炼:一种简单但有效的无源开放集域适应框架 |
Ismail Nejjar |
PDF |
N/A |
Recall and Refine: A Simple but Effective Source-free Open-set Domain Adaptation Framework |
| UMGAD:无监督多重图异常检测 |
Xiang Li |
PDF |
N/A |
UMGAD: Unsupervised Multiplex Graph Anomaly Detection |
| S3TU-Net:结构化卷积与超像素变换器用于肺结节分割 |
Yuke Wu |
PDF |
N/A |
S3TU-Net: Structured Convolution and Superpixel Transformer for Lung Nodule Segmentation |
| 通过复制调查回复分布来预测客户满意度 |
Etienne Manderscheid |
PDF |
N/A |
Predicting Customer Satisfaction by Replicating the Survey Response Distribution |
| 通过负特征值解锁线性RNN中的状态跟踪 |
Riccardo Grazzi |
PDF |
N/A |
Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues |
| 用于热光谱分布正则化的红外图像超分辨率的轮廓波细化门控框架 |
Yang Zou |
PDF |
N/A |
Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution |
| 重新思考多视角下的最高概率以进行驾驶员分心行为定位 |
Quang Vinh Nguyen |
PDF |
N/A |
Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization |
| 生成扩散模型中的数据修剪 |
Rania Briq |
PDF |
N/A |
Data Pruning in Generative Diffusion Models |
| VMGNet:一种基于VMamba的低计算复杂度机器人抓取网络,具备多尺度特征融合功能 |
Yuhao Jin |
PDF |
N/A |
VMGNet: A Low Computational Complexity Robotic Grasping Network Based on VMamba with Multi-Scale Feature Fusion |
| AI的诠释学转向:机器能否进行解释? |
Remy Demichelis |
PDF |
N/A |
The Hermeneutic Turn of AI: Is the Machine Capable of Interpreting? |
| MAViS:用于二维半导体量子点阵列的模块化自主虚拟化系统 |
Anantha S. Rao |
PDF |
N/A |
MAViS: Modular Autonomous Virtualization System for Two-Dimensional Semiconductor Quantum Dot Arrays |
| 通过观察进行三维重建:室内SLAM的即时盲点检测器通过混合现实实现 |
Hanbeom Chang |
PDF |
N/A |
3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality |
| PR-ENDO:基于物理的可重光照高斯溅射技术在内窥镜中的应用 |
Joanna Kaleta |
PDF |
N/A |
PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy |
| Transformer神经过程--核回归 |
Daniel Jenson |
PDF |
N/A |
Transformer Neural Processes -- Kernel Regression |
| 通过基于原则的合成逻辑语料库增强大型语言模型的推理能力 |
Terufumi Morishita |
PDF |
N/A |
Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus |
| 无偏见情感分析 |
Hubert Plisiecki |
PDF |
N/A |
Bias Free Sentiment Analysis |
| 用于远距离标签交互的正则模式敏感条件随机场 |
Sean Papay |
PDF |
N/A |
Regular-pattern-sensitive CRFs for Distant Label Interactions |
| 分析协作感知-认知-沟通-行动中的解释相关互动 |
Marc Roig Vilamala |
PDF |
N/A |
Analysing Explanation-Related Interactions in Collaborative Perception-Cognition-Communication-Action |
| 比较时间序列变换器模型中先验时间表示与学习时间表示的差异 |
Natalia Koliou |
PDF |
N/A |
Comparing Prior and Learned Time Representations in Transformer Models of Timeseries |
| NMT-混淆攻击:在翻译中忽略仅含一个词的句子 |
Sahar Sadrizadeh |
PDF |
N/A |
NMT-Obfuscator Attack: Ignore a sentence in translation with only one word |
| SCIGS: 从快照压缩图像中进行3D高斯喷洒 |
Zixu Wang |
PDF |
N/A |
SCIGS: 3D Gaussians Splatting from a Snapshot Compressive Image |
| 网络边缘的AI流 |
Jiawei Shao |
PDF |
N/A |
AI Flow at the Network Edge |
| 可控摘要解释指南 |
Sangwon Ryu |
PDF |
N/A |
Guide-to-Explain for Controllable Summarization |
| 不同主题下可信与不可信新闻的差异 |
Emilie Francis |
PDF |
N/A |
Variation between Credible and Non-Credible News Across Topics |
| GaussianPretrain:一种用于自动驾驶视觉预训练的简单统一3D高斯表示 |
Shaoqing Xu |
PDF |
N/A |
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving |
| 生成和预测机器学习模型的经验隐私评估——综述与实践挑战 |
Flavio Hafner |
PDF |
N/A |
Empirical Privacy Evaluations of Generative and Predictive Machine Learning Models -- A review and challenges for practice |
| 通过扩散模型实现盲图像复原的频率感知引导 |
Jun Xiao |
PDF |
N/A |
Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models |
| \textsc{霓虹}:新闻实体互动提取,增强问答能力 |
Sneha Singhania |
PDF |
N/A |
\textsc{Neon}: News Entity-Interaction Extraction for Enhanced Question Answering |
| 用于无损图像压缩的大型语言模型:语言空间中的下一像素预测就是你所需要的 |
Kecheng Chen |
PDF |
N/A |
Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need |
| 超越高斯:使用线性核实现快速且高质量的三维点云渲染 |
Haodong Chen |
PDF |
N/A |
Beyond Gaussians: Fast and High-Fidelity 3D Splatting with Linear Kernels |
| 通过平方和降维与非球形混合聚类算法的改进 |
Prashanti Anderson |
PDF |
N/A |
Dimension Reduction via Sum-of-Squares and Improved Clustering Algorithms for Non-Spherical Mixtures |
| STRisk:一种评估黑客入侵风险的社技结合方法 |
Hicham Hammouchi |
PDF |
N/A |
STRisk: A Socio-Technical Approach to Assess Hacking Breaches Risk |
| 偏好条件下的多目标质量多样性梯度变化 |
Hannah Janmohamed |
PDF |
N/A |
Preference-Conditioned Gradient Variations for Multi-Objective Quality-Diversity |
| CV-城市:在全球城市中推进跨视角地理定位 |
Gaoshuang Huang |
PDF |
N/A |
CV-Cities: Advancing Cross-View Geo-Localization in Global Cities |
| 主题通道在白盒中开启:通过主题相关图进行立体匹配 |
Ziyang Chen |
PDF |
N/A |
Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph |
| 利用卷积神经网络和迁移学习进行地理地貌结构的分类 |
Mustafa M. Abd Zaid |
PDF |
N/A |
Classification of Geographical Land Structure Using Convolution Neural Network and Transfer Learning |
| 评估大型语言模型的提示可控性 |
Erik Miehling |
PDF |
N/A |
Evaluating the Prompt Steerability of Large Language Models |
| 大语言模型是否理解文本中的歧义?开放世界问答中的案例研究 |
Aryan Keluskar |
PDF |
N/A |
Do LLMs Understand Ambiguity in Text? A Case Study in Open-world Question Answering |
| 在SIMSSA项目中自动进行员工重建 |
Lorenzo J. Tardon |
PDF |
N/A |
Automatic staff reconstruction within SIMSSA proect |
| 联邦学习中的非独立同分布数据:系统综述与分类、度量、方法、框架及未来方向 |
Daniel M. Jimenez G. |
PDF |
N/A |
Non-IID data in Federated Learning: A Systematic Review with Taxonomy, Metrics, Methods, Frameworks and Future Directions |
| RedPajama:用于训练大型语言模型的开源数据集 |
Maurice Weber |
PDF |
N/A |
RedPajama: an Open Dataset for Training Large Language Models |
| 利用AlphaFold 3辅助拓扑深度学习快速应对病毒快速进化 |
JunJie Wee |
PDF |
N/A |
Rapid response to fast viral evolution using AlphaFold 3-assisted topological deep learning |
| 超稀疏内存网络 |
Zihao Huang |
PDF |
N/A |
Ultra-Sparse Memory Network |
| 无言:一场8小时表演,对比人类与机器的表现力 |
Catie Cuan |
PDF |
N/A |
Breathless: An 8-hour Performance Contrasting Human and Robot Expressiveness |
| 一种用于开发和增强基于大型语言模型的软件系统的分层架构 |
Dawen Zhang |
PDF |
N/A |
A Layered Architecture for Developing and Enhancing Capabilities in Large Language Model-based Software Systems |
| DynFocus:动态合作网络赋予大型语言模型视频理解能力 |
Yudong Han |
PDF |
N/A |
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding |
| 使用锐度感知训练完善不完美的物理神经网络并利用可迁移的鲁棒性 |
Tengji Xu |
PDF |
N/A |
Perfecting Imperfect Physical Neural Networks with Transferable Robustness using Sharpness-Aware Training |
| DiM:半监督医学图像分割中基于$f$-散度最小化的锐度感知优化引导 |
Bingli Wang |
PDF |
N/A |
DiM: $f$-Divergence Minimization Guided Sharpness-Aware Optimization for Semi-supervised Medical Image Segmentation |
| 使用单个声学相机进行目标高度估计以补偿二维海底拼接 |
Xiaoteng Zhou |
PDF |
N/A |
Target Height Estimation Using a Single Acoustic Camera for Compensation in 2D Seabed Mosaicking |
| 学习标签比例和协变量偏移实例 |
Sagalpreet Singh |
PDF |
N/A |
Learning from Label Proportions and Covariate-shifted Instances |
| 硅烷化策略用于定制硅表面上的肽功能化:对增强干细胞粘附的启示 |
Melissa Kosovari |
PDF |
N/A |
Silanization Strategies for Tailoring Peptide Functionalization on Silicon Surfaces: Implications for Enhancing Stem Cell Adhesion |
| 通过谱粗化加速大规模数据集的UMAP |
Yongyu Wang |
PDF |
N/A |
Accelerating UMAP for Large-Scale Datasets Through Spectral Coarsening |
| 图作为特征:利用非神经图感知逻辑回归提升节点分类 |
Simon Delarue |
PDF |
N/A |
Graph as a feature: improving node classification with non-neural graph-aware logistic regression |
| 协作环境下的属性图聚类 |
Rui Zhang |
PDF |
N/A |
Attributed Graph Clustering in Collaborative Settings |
| 通过解离主成分分析增强盲源分离 |
Muhammad Usman Khalid |
PDF |
N/A |
Enhancing Blind Source Separation with Dissociative Principal Component Analysis |
| CLIP在单次人脸识别中展现出的不合理潜力 |
Nhan T. Luu |
PDF |
N/A |
CLIP Unreasonable Potential in Single-Shot Face Recognition |
| C$^{2}$INet:利用先验感知持续因果干预实现增量轨迹预测 |
Xiaohe Li |
PDF |
N/A |
C$^{2}$INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention |
| DGTR:用于稀疏视图广阔场景的分布式高斯涡轮重建 |
Hao Li |
PDF |
N/A |
DGTR: Distributed Gaussian Turbo-Reconstruction for Sparse-View Vast Scenes |
| 基于SNN的开放世界中概念和行动规律的在线学习 |
Christel Grimaud |
PDF |
N/A |
SNN-Based Online Learning of Concepts and Action Laws in an Open World |
| 在生产环境中,为基于大型语言模型(LLM)的对话系统进行多轮意图分类时,平衡准确性与效率 |
Junhua Liu |
PDF |
N/A |
Balancing Accuracy and Efficiency in Multi-Turn Intent Classification for LLM-Powered Dialog Systems in Production |
| 扩散产品量化 |
Jie Shao |
PDF |
N/A |
Diffusion Product Quantization |
| 凡人代理隐式世界模型的涌现 |
Kazuya Horibe |
PDF |
N/A |
Emergence of Implicit World Models from Mortal Agents |
| 物理引导的合成孔径雷达飞机检测器 |
Zhongling Huang |
PDF |
N/A |
Physics-Guided Detector for SAR Airplanes |
| 用于指导视觉组装的生成时间线 |
Alejandro Pardo |
PDF |
N/A |
Generative Timelines for Instructed Visual Assembly |
| SSEditor:利用扩散模型实现可控的掩码到场景生成 |
Haowen Zheng |
PDF |
N/A |
SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model |
| CUE-M:基于多模态大语言模型的上下文理解和增强搜索 |
Dongyoung Go |
PDF |
N/A |
CUE-M: Contextual Understanding and Enhanced Search with Multimodal Large Language Model |
| GLOVER:面向任务的可泛化开放词汇功能性推理抓取方法 |
Teli Ma |
PDF |
N/A |
GLOVER: Generalizable Open-Vocabulary Affordance Reasoning for Task-Oriented Grasping |
| HouseLLM:基于LLM辅助的两阶段文本到楼层平面图生成 |
Ziyang Zong |
PDF |
N/A |
HouseLLM: LLM-Assisted Two-Phase Text-to-Floorplan Generation |
| 利用未配对的白内障和高质量图像的多功能白内障眼底图像恢复模型 |
Zheng Gong |
PDF |
N/A |
Versatile Cataract Fundus Image Restoration Model Utilizing Unpaired Cataract and High-quality Images |
| libcll:一个可扩展的Python工具包,用于互补标签学习 |
Nai-Xuan Ye |
PDF |
N/A |
libcll: an Extendable Python Toolkit for Complementary-Label Learning |
| 建立信任:人工智能中的安全、保障和透明度的基础 |
Huzaifa Sidhpurwala |
PDF |
N/A |
Building Trust: Foundations of Security, Safety and Transparency in AI |
| 关于生成式AI模型在合成医学文本、时间序列和纵向数据方面的综述 |
Mohammad Loni |
PDF |
N/A |
A Review on Generative AI Models for Synthetic Medical Text, Time Series, and Longitudinal Data |
| 获取精确且可比的视网膜图像质量评分:FTHNet与FQS数据集 |
Zheng Gong |
PDF |
N/A |
Acquire Precise and Comparable Fundus Image Quality Score: FTHNet and FQS Dataset |
| KDC-MAE:知识蒸馏对比掩码自动编码器 |
Maheswar Bora |
PDF |
N/A |
KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder |
| 移动平均法在估计Wi-Fi链路质量时的准确性与精确性 |
Gianluca Cena |
PDF |
N/A |
On the Accuracy and Precision of Moving Averages to Estimate Wi-Fi Link Quality |
| 低资源机器翻译:为何而设?为谁而设?针对专门提供Tetun语言翻译服务的观察研究 |
Raphael Merx |
PDF |
N/A |
Low-resource Machine Translation: what for? who for? An observational study on a dedicated Tetun language translation service |
| 基于神经ODE的小样本学习原型优化 |
Baoquan Zhang |
PDF |
N/A |
Prototype Optimization with Neural ODE for Few-Shot Learning |
| 重构易处理的概率电路 |
Honghua Zhang |
PDF |
N/A |
Restructuring Tractable Probabilistic Circuits |
| 基于双边控制模仿学习的输出校正错误反馈模型 |
Hiroshi Sato |
PDF |
N/A |
Error-Feedback Model for Output Correction in Bilateral Control-Based Imitation Learning |
| 从音乐探索对话中预测用户意图和音乐属性 |
Daeyong Kwon |
PDF |
N/A |
Predicting User Intents and Musical Attributes from Music Discovery Conversations |
| ADV2E:在视频到事件模拟器中弥合模拟电路与离散帧之间的差距 |
Xiao Jiang |
PDF |
N/A |
ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator |
| 神经-3D:从脑电信号实现三维视觉解码 |
Zhanqiang Guo |
PDF |
N/A |
Neuro-3D: Towards 3D Visual Decoding from EEG Signals |
| 多智能体强化学习中的高效训练:针对推箱子问题的无通信框架 |
David Ge |
PDF |
N/A |
Efficient Training in Multi-Agent Reinforcement Learning: A Communication-Free Framework for the Box-Pushing Problem |
| 具有逐步自适应机制的联邦学习超参数优化 |
Yasaman Saadati |
PDF |
N/A |
Hyper-parameter Optimization for Federated Learning with Step-wise Adaptive Mechanism |
| 评估大型语言模型在官方印度语言中的分词器性能 |
S. Tamang |
PDF |
N/A |
Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages |
| 布尔问题:稠密检索是否理解语言中的布尔逻辑? |
Zongmeng Zhang |
PDF |
N/A |
BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language? |
| 对比相似性感知的双路径Mamba用于多元时间序列节点分类 |
Mingsen Du |
PDF |
N/A |
Contrast Similarity-Aware Dual-Pathway Mamba for Multivariate Time Series Node Classification |
| DeTrigger:一种基于梯度的联邦学习后门攻击缓解方法 |
Kichang Lee |
PDF |
N/A |
DeTrigger: A Gradient-Centric Approach to Backdoor Attack Mitigation in Federated Learning |
| 不变形表示学习在图像分类中的应用 |
Tonmoy Hossain |
PDF |
N/A |
Invariant Shape Representation Learning For Image Classification |
| RoSIS:使用视觉-语言融合的文本提示手术器械分割的鲁棒框架 |
Tae-Min Choi |
PDF |
N/A |
RoSIS: Robust Framework for Text-Promptable Surgical Instrument Segmentation Using Vision-Language Fusion |
| CCIS-Diff:一种基于稳定扩散先验的受控结肠镜图像生成模型 |
Yifan Xie |
PDF |
N/A |
CCIS-Diff: A Generative Model with Stable Diffusion Prior for Controlled Colonoscopy Image Synthesis |
| MTFusion:利用多词文本反转从单张图像重建任意3D物体 |
Yu Liu |
PDF |
N/A |
MTFusion: Reconstructing Any 3D Object from Single Image Using Multi-word Textual Inversion |
| 基于LLM代理和图的更高级群体极化测量方法 |
Zixin Liu |
PDF |
N/A |
A More Advanced Group Polarization Measurement Approach Based on LLM-Based Agents and Graphs |
| 医学视觉与语言应用及其技术综述 |
Qi Chen |
PDF |
N/A |
A Survey of Medical Vision-and-Language Applications and Their Techniques |
| 分层时空不确定性量化在分布式能源采用中的应用 |
Wenbin Zhou |
PDF |
N/A |
Hierarchical Spatio-Temporal Uncertainty Quantification for Distributed Energy Adoption |
| 恒定速率计划:扩散模型中用于高效训练和采样的恒定速率分布变化 |
Shuntaro Okada |
PDF |
N/A |
Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models |
| 工具变量在加性非线性、非恒定效应模型中的可测试性 |
Xichen Guo |
PDF |
N/A |
Testability of Instrumental Variables in Additive Nonlinear, Non-Constant Effects Models |
| 动作关注型深度强化学习用于光束线自主对准 |
Siyu Wang |
PDF |
N/A |
Action-Attentive Deep Reinforcement Learning for Autonomous Alignment of Beamlines |
| 基于扩散模型的计算机化自适应测试中的充分先验冷启动 |
Haiping Ma |
PDF |
N/A |
Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive Testing |
| 使用一致性训练技术增强低剂量计算机断层扫描图像 |
Mahmut S. Gokmen |
PDF |
N/A |
Enhancing Low Dose Computed Tomography Images Using Consistency Training Techniques |
| 无需校准的空间变换的鲁棒三维语义占用预测 |
Zhuangwei Zhuang |
PDF |
N/A |
Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation |
| AsynEIO:使用高斯过程回归的异步单目事件惯性里程计 |
Zhixiang Wang |
PDF |
N/A |
AsynEIO: Asynchronous Monocular Event-Inertial Odometry Using Gaussian Process Regression |
| 只是开玩笑:知识注入与蒸馏用于检测不当表情包 |
Rahul Garg |
PDF |
N/A |
Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes |
| 技能树:针对长时程控制任务的可解释基于技能的深度强化学习 |
Yongyan Wen |
PDF |
N/A |
SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks |
| 基于草图引导的笼状三维高斯散射变形 |
Tianhao Xie |
PDF |
N/A |
Sketch-guided Cage-based 3D Gaussian Splatting Deformation |
| UrbanDiT:一种面向开放世界城市时空学习的基石模型 |
Yuan Yuan |
PDF |
N/A |
UrbanDiT: A Foundation Model for Open-World Urban Spatio-Temporal Learning |
| 基于传感器融合的复杂工程系统多故障模式预测框架 |
Benjamin Peters |
PDF |
N/A |
Sensor-fusion based Prognostics Framework for Complex Engineering Systems Exhibiting Multiple Failure Modes |
| 一种结合编码器和变压器的方法用于生成连贯且高质量的文本 |
Jiajing Chen |
PDF |
N/A |
A Combined Encoder and Transformer Approach for Coherent and High-Quality Text Generation |
| HNCSE:通过混合对比学习与硬负样本提升句子嵌入 |
Wenxiao Liu |
PDF |
N/A |
HNCSE: Advancing Sentence Embeddings via Hybrid Contrastive Learning with Hard Negatives |
| 使用动作序列的强化学习以实现数据高效机器人学习 |
Younggyo Seo |
PDF |
N/A |
Reinforcement Learning with Action Sequence for Data-Efficient Robot Learning |
| 线性 bandits 中的切向随机化 (TRAiL): 保证的推断和遗憾界限 |
Arda Güçlü |
PDF |
N/A |
Tangential Randomization in Linear Bandits (TRAiL): Guaranteed Inference and Regret Bounds |
| 深度网络中的自监督学习:通向鲁棒小样本分类的路径 |
Yuyang Xiao |
PDF |
N/A |
Self-Supervised Learning in Deep Networks: A Pathway to Robust Few-Shot Classification |
| 高度:用于在拥挤和受限环境中进行机器人导航的异构交互图变换器 |
Shuijing Liu |
PDF |
N/A |
HEIGHT: Heterogeneous Interaction Graph Transformer for Robot Navigation in Crowded and Constrained Environments |
| CoMeDi 共享任务:词汇语义分歧中的模型作为注释器 |
Zhu Liu |
PDF |
N/A |
CoMeDi Shared Task: Models as Annotators in Lexical Semantics Disagreements |
| 自监督视野数据去噪提高了青光眼进展的检测 |
Sean Wu |
PDF |
N/A |
Self-supervised denoising of visual field data improves detection of glaucoma progression |
| 一种用于测量定性分析中“开放编码”的计算方法 |
John Chen |
PDF |
N/A |
A Computational Method for Measuring "Open Codes" in Qualitative Analysis |
| 将损失函数可视化为拓扑地貌剖面 |
Caleb Geniesse |
PDF |
N/A |
Visualizing Loss Functions as Topological Landscape Profiles |
| 高维空间中signSGD的精确风险曲线:量化预处理与噪声压缩效应 |
Ke Liang Xiao |
PDF |
N/A |
Exact Risk Curves of signSGD in High-Dimensions: Quantifying Preconditioning and Noise-Compression Effects |