| 自演化深度监督的三维高斯光栅化技术从渲染的立体图像对中生成 |
Sadra Safadoust |
PDF |
N/A |
Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs |
| DreamMesh:联合操作和纹理三角网格以实现文本到3D生成 |
Haibo Yang |
PDF |
N/A |
DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation |
| “我的分数不对!”:一种可争议的AI框架,用于在评估学生作文时进行互动反馈 |
Shengxin Hong |
PDF |
N/A |
"My Grade is Wrong!": A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays |
| Hi3D:追求高分辨率图像到3D生成的视频扩散模型 |
Haibo Yang |
PDF |
N/A |
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models |
| FreeEnhance:通过内容一致的噪声添加与去噪过程实现无需调整的图像增强 |
Yang Luo |
PDF |
N/A |
FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process |
| VMAS:通过网络音乐视频中的语义对齐实现视频到音乐的生成 |
Yan-Bo Lin |
PDF |
N/A |
VMAS: Video-to-Music Generation via Semantic Alignment in Web Music Videos |
| 引入扰动能力评分(PS)以增强对抗逃避性对抗攻击的鲁棒性于ML-NIDS |
Mohamed elShehaby |
PDF |
N/A |
Introducing Perturb-ability Score (PS) to Enhance Robustness Against Evasion Adversarial Attacks on ML-NIDS |
| StereoCrafter:基于扩散模型的单目视频生成长时间高质量立体3D内容 |
Sijie Zhao |
PDF |
N/A |
StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos |
| 长尾类增量学习的自适应适配器路由 |
Zhi-Hong Qi |
PDF |
N/A |
Adaptive Adapter Routing for Long-Tailed Class-Incremental Learning |
| SUPER:评估代理在从研究仓库中设置和执行任务的能力 |
Ben Bogin |
PDF |
N/A |
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories |
| 一套用于声学语言模型评估的工具包 |
Gallil Maimon |
PDF |
N/A |
A Suite for Acoustic Language Model Evaluation |
| 线性模型中带有Dropout正则化的随机梯度下降的渐近性 |
Jiaqi Li |
PDF |
N/A |
Asymptotics of Stochastic Gradient Descent with Dropout Regularization in Linear Models |
| 合成连续预训练 |
Zitong Yang |
PDF |
N/A |
Synthetic continued pretraining |
| 代理工作流程记忆 |
Zora Zhiruo Wang |
PDF |
N/A |
Agent Workflow Memory |
| 基于深度神经网络的手语识别:一种利用迁移学习与可解释性的综合方法 |
A. E. M Ridwan |
PDF |
N/A |
Deep Neural Network-Based Sign Language Recognition: A Comprehensive Approach Using Transfer Learning with Explainability |
| 迈向更公平的健康建议:通过词义消歧寻找信息量丰富的无偏样本 |
Gavin Butts |
PDF |
N/A |
Towards Fairer Health Recommendations: finding informative unbiased samples via Word Sense Disambiguation |
| 通过解释增强自然语言推理中的对抗鲁棒性 |
Alexandros Koulakos |
PDF |
N/A |
Enhancing adversarial robustness in Natural Language Inference using explanations |
| 利用条件StyleGAN和潜在空间操作的可控视网膜图像合成,以改进糖尿病视网膜病变的诊断和分级 |
Somayeh Pakdelmoez |
PDF |
N/A |
Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy |
| 高效的一步扩散优化用于快照压缩成像 |
Yunzhen Wang |
PDF |
N/A |
Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging |
| 用于列表推荐时间抽象的分层强化学习 |
Luo Ji |
PDF |
N/A |
Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation |
| SoK: 医疗人工智能的安全与隐私风险 |
Yuanhaur Chang |
PDF |
N/A |
SoK: Security and Privacy Risks of Medical AI |
| NVRC:神经视频表示压缩 |
Ho Man Kwan |
PDF |
N/A |
NVRC: Neural Video Representation Compression |
| 通过叶状结构和知识迁移进行流形学习 |
E. Tron |
PDF |
N/A |
Manifold Learning via Foliations and Knowledge Transfer |
| 稳健的机器人行走者:学习在微小陷阱上敏捷移动 |
Shaoting Zhu |
PDF |
N/A |
Robust Robot Walker: Learning Agile Locomotion over Tiny Traps |
| CLNX:连接代码与自然语言,助力C/C++漏洞贡献提交的识别 |
Zeqing Qin |
PDF |
N/A |
CLNX: Bridging Code and Natural Language for C/C++ Vulnerability-Contributing Commits Identification |
| 多模态对比学习中应如何对齐? |
Benoit Dufumier |
PDF |
N/A |
What to align in multimodal contrastive learning? |
| 连续时间随机梯度下降的收敛性及其在深度线性神经网络中的应用 |
Gabor Lugosi |
PDF |
N/A |
Convergence of continuous-time stochastic gradient descent with applications to linear deep neural networks |
| 重新审视基于静态特征的安卓恶意软件检测 |
Md Tanvirul Alam |
PDF |
N/A |
Revisiting Static Feature-Based Android Malware Detection |
| AdaCAD:自适应解码以平衡上下文知识与参数化知识之间的冲突 |
Han Wang |
PDF |
N/A |
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge |
| 一种可扩展的主动学习算法 |
Youguang Chen |
PDF |
N/A |
A Scalable Algorithm for Active Learning |
| D-CAPTCHA++:深度伪造验证码在可转移的不可感知对抗攻击下的韧性研究 |
Hong-Hanh Nguyen-Le |
PDF |
N/A |
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack |
| 多模态情感计算的最新趋势:从自然语言处理角度进行的调查 |
Guimin Hu |
PDF |
N/A |
Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective |
| 一种用于持续学习任务的对称前向前向算法(SFFA)对比研究 |
Erik B. Terres-Escudero |
PDF |
N/A |
A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks |
| 多变量控制以减轻CRISPRa网络中的负载 |
Krishna Manoj |
PDF |
N/A |
Multi-variable control to mitigate loads in CRISPRa networks |
| FIRAL:一种用于多项逻辑回归的主动学习算法 |
Youguang Chen |
PDF |
N/A |
FIRAL: An Active Learning Algorithm for Multinomial Logistic Regression |
| 唤醒幻灯片:一种通过语言模型协调的无调优与知识调控的AI辅导系统 |
Daniel Zhang-Li |
PDF |
N/A |
Awaking the Slides: A Tuning-free and Knowledge-regulated AI Tutoring System via Language Model Coordination |
| 演示:SGCode:一个灵活的提示优化系统,用于安全生成代码 |
Khiem Ton |
PDF |
N/A |
Demo: SGCode: A Flexible Prompt-Optimizing System for Secure Generation of Code |
| 基于事件的拼接捆绑调整 |
Shuang Guo |
PDF |
N/A |
Event-based Mosaicing Bundle Adjustment |
| 量化膝关节软骨形态和损伤:从图像到指标 |
Yongcheng Yao |
PDF |
N/A |
Quantifying Knee Cartilage Shape and Lesion: From Image to Metrics |
| 无需训练的离散扩散模型分子生成指导 |
Thomas J. Kerby |
PDF |
N/A |
Training-Free Guidance for Discrete Diffusion Models for Molecular Generation |
| 共同思考,协作更佳:结合人类与大语言模型(LLMs)的出声思考结果,实现有效文本评估 |
SeongYeub Chu |
PDF |
N/A |
Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation |
| 通过一个强大的编码器保护视觉-语言模型,以抵御越狱和对抗性攻击 |
Md Zarif Hossain |
PDF |
N/A |
Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks |
| 联合印象用于处理分布式异构数据的学习 |
Sana Ayromlou |
PDF |
N/A |
Federated Impression for Learning with Distributed Heterogeneous Data |
| 可解释人工智能在革新人类健康监测中的作用 |
Abdullah Alharthi |
PDF |
N/A |
The Role of Explainable AI in Revolutionizing Human Health Monitoring |
| 在线决策元形变器:一种基于通用具身智能的强化学习框架 |
Luo Ji |
PDF |
N/A |
Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence |
| 通过元数据发现预测游戏平衡性变化影响的框架 |
Akash Saravanan |
PDF |
N/A |
A Framework for Predicting the Impact of Game Balance Changes through Meta Discovery |
| 基准测试二维自我中心手势数据集 |
Olga Taran |
PDF |
N/A |
Benchmarking 2D Egocentric Hand Pose Datasets |
| 解释、辩论、对齐:一种从弱到强的语言模型泛化框架 |
Mehrdad Zakershahrak |
PDF |
N/A |
Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization |
| 学习压缩上下文以实现基于知识的视觉问答的高效性 |
Weixi Weng |
PDF |
N/A |
Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering |
| 当前用于表示学习的对称群等变卷积框架 |
Ramzan Basheer |
PDF |
N/A |
Current Symmetry Group Equivariant Convolution Frameworks for Representation Learning |
| ART:用于重建无噪声多通道脑电信号的去除伪影变压器 |
Chun-Hsiang Chuang |
PDF |
N/A |
ART: Artifact Removal Transformer for Reconstructing Noise-Free Multichannel Electroencephalographic Signals |
| 通过多重假设检验实现的统计有效信息瓶颈 |
Amirmohammad Farzaneh |
PDF |
N/A |
Statistically Valid Information Bottleneck via Multiple Hypothesis Testing |
| 通过一致性模型实现玻尔兹曼分布的高效且无偏采样 |
Fengzhe Zhang |
PDF |
N/A |
Efficient and Unbiased Sampling of Boltzmann Distributions via Consistency Models |
| 用于机器学习应用的三维多模态同步辐射数据 |
Calum Green |
PDF |
N/A |
Three-Dimensional, Multimodal Synchrotron Data for Machine Learning Applications |
| 模块化自适应对抗训练用于端到端自动驾驶 |
Tianyuan Zhang |
PDF |
N/A |
Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving |
| MEDIC:面向临床应用中评估大型语言模型的综合框架 |
Praveen K Kanithi |
PDF |
N/A |
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications |
| 使用丢番图方程编码优化神经网络性能和可解释性 |
Ronald Katende |
PDF |
N/A |
Optimizing Neural Network Performance and Interpretability with Diophantine Equation Encoding |
| 基于混合线性模型和元森林的非侵入式血糖预测系统,用于领域泛化 |
Yuyang Sun |
PDF |
N/A |
Non-Invasive Glucose Prediction System Enhanced by Mixed Linear Models and Meta-Forests for Domain Generalization |
| 通过潜在扩散进行数据增强以进行显著性预测 |
Bahar Aydemir |
PDF |
N/A |
Data Augmentation via Latent Diffusion for Saliency Prediction |
| BLS-GAN:一种用于消除常规放射照片中骨骼重叠的深度分层框架 |
Haolin Wang |
PDF |
N/A |
BLS-GAN: A Deep Layer Separation Framework for Eliminating Bone Overlap in Conventional Radiographs |
| PaveSAM路面病害分割 |
Neema Jakisa Owor |
PDF |
N/A |
PaveSAM Segment Anything for Pavement Distress |
| 一个统一的对比损失用于自训练 |
Aurelien Gauffre |
PDF |
N/A |
A Unified Contrastive Loss for Self-Training |
| 探索带有扩散先验的用户级梯度反演 |
Zhuohang Li |
PDF |
N/A |
Exploring User-level Gradient Inversion with a Diffusion Prior |
| 使用生成式代理创建调查数据报道的提示表 |
Joris Veerbeek |
PDF |
N/A |
Using Generative Agents to Create Tip Sheets for Investigative Data Reporting |
| TLD-READY: 交通灯检测 -- 相关性评估与部署分析 |
Nikolai Polley |
PDF |
N/A |
TLD-READY: Traffic Light Detection -- Relevance Estimation and Deployment Analysis |
| 无需调参的在线鲁棒主成分分析通过隐式正则化 |
Lakshmi Jayalal |
PDF |
N/A |
Tuning-Free Online Robust Principal Component Analysis through Implicit Regularization |
| 重放:一个用于实验和生产使用的推荐框架 |
Alexey Vasilev |
PDF |
N/A |
RePlay: a Recommendation Framework for Experimentation and Production Use |
| CCFExp:面向面瘫个体的循环交叉融合扩散模型面部图像合成 |
Weixiang Gao |
PDF |
N/A |
CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals |
| 现实且高效的人脸交换:基于扩散模型的统一方法 |
Sanoojan Baliah |
PDF |
N/A |
Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models |
| 多类型偏好学习:赋予基于偏好的强化学习以平等偏好 |
Ziang Liu |
PDF |
N/A |
Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences |
| MiniDrive:通过将多层次2D特征作为文本标记,为自动驾驶提供更高效视觉语言模型 |
Enming Zhang |
PDF |
N/A |
MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving |
| 跨方言文本到语音转换在音调重音语言中结合多方言音素级BERT |
Kazuki Yamauchi |
PDF |
N/A |
Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT |
| TopoMap++:一种更快且更节省空间的计算投影技术,具有拓扑保证 |
Vitoria Guardieiro |
PDF |
N/A |
TopoMap++: A faster and more space efficient technique to compute projections with topological guarantees |
| MRAC 轨道1:第二届多模态、生成与负责任情感计算研讨会 |
Shreya Ghosh |
PDF |
N/A |
MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing |
| EMOdiffhead:通过扩散实现连续情感控制的说话头生成 |
Jian Zhang |
PDF |
N/A |
EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion |
| 扩散模型对齐:基础、挑战与未来 |
Buhua Liu |
PDF |
N/A |
Alignment of Diffusion Models: Fundamentals, Challenges, and Future |
| 具有灵活个性化功能的联邦$\mathcal{X}$-臂老虎机 |
Ali Arabzadeh |
PDF |
N/A |
Federated $\mathcal{X}$-armed Bandit with Flexible Personalisation |
| 仇恨宣传:对阿拉伯语模因的多模态分析与多智能体大型语言模型 |
Firoj Alam |
PDF |
N/A |
Propaganda to Hate: A Multimodal Analysis of Arabic Memes with Multi-Agent LLMs |
| 通过SO(2)-等变高斯雕刻网络进行单视图三维重建 |
Ruihan Xu |
PDF |
N/A |
Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks |
| PiTe:大型视频-语言模型的像素-时间对齐 |
Yang Liu |
PDF |
N/A |
PiTe: Pixel-Temporal Alignment for Large Video-Language Model |
| Diff-VPS:通过多任务扩散网络与对抗性时间推理实现视频息肉分割 |
Yingling Lu |
PDF |
N/A |
Diff-VPS: Video Polyp Segmentation via a Multi-task Diffusion Network with Adversarial Temporal Reasoning |
| 3DGCQA:一个用于3D AI生成内容的质量评估数据库 |
Yingjie Zhou |
PDF |
N/A |
3DGCQA: A Quality Assessment Database for 3D AI-Generated Contents |
| 通过平均梯度流进行黎曼联邦学习 |
Zhenwei Huang |
PDF |
N/A |
Riemannian Federated Learning via Averaging Gradient Stream |
| 观察名单挑战:第三届开放集人脸检测与识别 |
Furkan Kasım |
PDF |
N/A |
Watchlist Challenge: 3rd Open-set Face Detection and Identification |
| 行为克隆模型:自动驾驶现实检验 |
Mustafa Yildirim |
PDF |
N/A |
Behavioral Cloning Models Reality Check for Autonomous Driving |
| 合并是否值得?安全地评估因果数据集获取的信息增益 |
Jake Fawkes |
PDF |
N/A |
Is merging worth it? Securely evaluating the information gain for causal dataset acquisition |
| 增强基于CTC的视觉语音识别 |
Hendrik Laux |
PDF |
N/A |
Enhancing CTC-Based Visual Speech Recognition |
| 在线扩展图上的图滤波 |
Bishwadeep Das |
PDF |
N/A |
Online Graph Filtering Over Expanding Graphs |
| 通过拼接预训练块实现联邦学习的异质性感知协调 |
Shichen Zhan |
PDF |
N/A |
Heterogeneity-Aware Coordination for Federated Learning via Stitching Pre-trained blocks |
| ThermalGaussian:热成像3D高斯散射 |
Rongfeng Lu |
PDF |
N/A |
ThermalGaussian: Thermal 3D Gaussian Splatting |
| 网络欺骗:现状、趋势与开放挑战 |
Pedro Beltrán López |
PDF |
N/A |
Cyber Deception: State of the art, Trends and Open challenges |
| 基于人工智能系统的需求工程成熟度如何?关于实践、挑战和未来研究方向的系统映射研究 |
Umm-e- Habiba |
PDF |
N/A |
How Mature is Requirements Engineering for AI-based Systems? A Systematic Mapping Study on Practices, Challenges, and Future Research Directions |
| 在化学领域应用多保真贝叶斯优化:开放挑战与主要考虑 |
Edmund Judge |
PDF |
N/A |
Applying Multi-Fidelity Bayesian Optimization in Chemistry: Open Challenges and Major Considerations |
| AI引导的分子模拟在虚拟现实中的视角:探索高维分子系统中的模仿学习策略 |
Mohamed Dhouioui |
PDF |
N/A |
A Perspective on AI-Guided Molecular Simulations in VR: Exploring Strategies for Imitation Learning in Hyperdimensional Molecular Systems |
| 伏羲-2.0:推进机器学习天气预报模型以实现实际应用 |
Xiaohui Zhong |
PDF |
N/A |
FuXi-2.0: Advancing machine learning weather forecasting model for practical applications |
| 通过方向性编码和几何约束提升脑扩散张量成像中的角度分辨率 |
Sheng Chen |
PDF |
N/A |
Enhancing Angular Resolution via Directionality Encoding and Geometric Constraints in Brain Diffusion Tensor Imaging |
| Phy124:从单张图像快速生成物理驱动的4D内容 |
Jiajing Lin |
PDF |
N/A |
Phy124: Fast Physics-Driven 4D Content Generation from a Single Image |
| 结合机器学习局部预测与计算流体动力学求解器,加速瞬态浮力羽流模拟 |
Clément Caron |
PDF |
N/A |
Coupling Machine Learning Local Predictions with a Computational Fluid Dynamics Solver to Accelerate Transient Buoyant Plume Simulations |
| Swin-LiteMedSAM:一种基于轻量级框的分割任意模型,适用于大规模医学图像数据集 |
Ruochen Gao |
PDF |
N/A |
Swin-LiteMedSAM: A Lightweight Box-Based Segment Anything Model for Large-Scale Medical Image Datasets |
| AC-IND:基于衰减系数估计和隐式神经分布的稀疏CT重建 |
Wangduo Xie |
PDF |
N/A |
AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution |
| 通过强化学习学习高效的递归数字系统 |
Jonathan D. Thomas |
PDF |
N/A |
Learning Efficient Recursive Numeral Systems via Reinforcement Learning |
| 具有SummaryMixing的线性时间复杂度一致器用于流式语音识别 |
Titouan Parcollet |
PDF |
N/A |
Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition |
| 曼巴策略:基于混合选择性状态模型的高效三维扩散策略 |
Jiahang Cao |
PDF |
N/A |
Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models |
| 使用大型语言模型对应用评论进行细粒度情感分析:一项评估研究 |
Faiz Ali Shah |
PDF |
N/A |
A Fine-grained Sentiment Analysis of App Reviews using Large Language Models: An Evaluation Study |
| 神经算法推理中的循环聚合器 |
Kaijia Xu |
PDF |
N/A |
Recurrent Aggregators in Neural Algorithmic Reasoning |
| 零样本文本到语音作为黄金语音生成器:一个系统框架及其在自动发音评估中的适用性 |
Tien-Hong Lo |
PDF |
N/A |
Zero-Shot Text-to-Speech as Golden Speech Generator: A Systematic Framework and its Applicability in Automatic Pronunciation Assessment |
| 门控槽注意力:高效线性时间序列建模 |
Yu Zhang |
PDF |
N/A |
Gated Slot Attention for Efficient Linear-Time Sequence Modeling |
| 在稀疏观测数据上的端到端学习与动力学和同化联合优化 |
Vadim Zinchenko |
PDF |
N/A |
Combined Optimization of Dynamics and Assimilation with End-to-End Learning on Sparse Observations |
| 利用非结构化文本数据进行大型语言模型的联邦指令微调 |
Rui Ye |
PDF |
N/A |
Leveraging Unstructured Text Data for Federated Instruction Tuning of Large Language Models |
| 无监督新奇检测方法基准测试与小波分解 |
Ariel Priarone |
PDF |
N/A |
Unsupervised Novelty Detection Methods Benchmarking with Wavelet Decomposition |
| 基于大型语言模型的文本特征生成,用于可解释的机器学习 |
Vojtěch Balek |
PDF |
N/A |
LLM-based feature generation from text for interpretable machine learning |
| 语言生成中的重排序法则:一种通信理论的视角 |
António Farinhas |
PDF |
N/A |
Reranking Laws for Language Generation: A Communication-Theoretic Perspective |
| MVLLaVA:一种用于统一和灵活的新视角合成的智能代理 |
Hanyu Jiang |
PDF |
N/A |
MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis |
| 深度学习技术在手静脉生物识别中的应用:全面综述 |
Mustapha Hemis |
PDF |
N/A |
Deep Learning Techniques for Hand Vein Biometrics: A Comprehensive Review |
| DCMAC:通过上界训练实现需求感知的定制化多智能体通信 |
Dongkun Huo |
PDF |
N/A |
DCMAC: Demand-aware Customized Multi-Agent Communication via Upper Bound Training |
| 交叉精炼:通过联合学习提升自然语言解释生成 |
Qianli Wang |
PDF |
N/A |
Cross-Refine: Improving Natural Language Explanation Generation by Learning in Tandem |
| 知识空间的可信度受限修正 |
Kai Sauerwald |
PDF |
N/A |
Credibility-Limited Revision for Epistemic Spaces |
| 盲图像质量评估的注意力下采样变换器、相对排序和自一致性 |
Mohammed Alsaafin |
PDF |
N/A |
Attention Down-Sampling Transformer, Relative Ranking and Self-Consistency for Blind Image Quality Assessment |
| 使用数据集蒸馏和模型尺寸适应的TinyML设备上训练的持续和增量学习方法 |
Marcus Rüb |
PDF |
N/A |
A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption |
| 使用TinyPropv2推进设备上神经网络训练:动态、稀疏和高效的反向传播 |
Marcus Rüb |
PDF |
N/A |
Advancing On-Device Neural Network Training with TinyPropv2: Dynamic, Sparse, and Efficient Backpropagation |
| 通过元学习隐式神经表示实现快速医学形状重建 |
Gaia Romana De Paolis |
PDF |
N/A |
Fast Medical Shape Reconstruction via Meta-learned Implicit Neural Representations |
| 冗余感知相机选择用于室内场景神经渲染 |
Zehao Wang |
PDF |
N/A |
Redundancy-Aware Camera Selection for Indoor Scene Neural Rendering |
| 深度的术中高光谱相机照明校准 |
Alexander Baumann |
PDF |
N/A |
Deep intra-operative illumination calibration of hyperspectral cameras |
| CWT-Net:利用跨尺度小波变换的Transformer实现病理图像超分辨率 |
Feiyang Jia |
PDF |
N/A |
CWT-Net: Super-resolution of Histopathology Images Using a Cross-scale Wavelet-based Transformer |
| TrialSynth:生成合成顺序临床试验数据 |
Chufan Gao |
PDF |
N/A |
TrialSynth: Generation of Synthetic Sequential Clinical Trial Data |
| 无本体自由泛领域知识图谱到文本生成数据集合成使用大型语言模型 |
Daehee Kim |
PDF |
N/A |
Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Model |
| 通过错误信息理解大型语言模型中的知识漂移 |
Alina Fastowski |
PDF |
N/A |
Understanding Knowledge Drift in LLMs through Misinformation |
| 多模态情感识别与视觉-语言提示及模态缺失 |
Anbin QI |
PDF |
N/A |
Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout |
| 潜在空间解释用于风格分析和可解释的作者归属 |
Milad Alshomary |
PDF |
N/A |
Latent Space Interpretation for Stylistic Analysis and Explainable Authorship Attribution |
| 边缘建模激活自由傅里叶网络用于航天器图像去噪 |
Jingfan Yang |
PDF |
N/A |
Edge Modeling Activation Free Fourier Network for Spacecraft Image Denoising |
| 基于图模型的口语响应连贯性自动评估对话测试 |
Jiun-Ting Li |
PDF |
N/A |
Automated Speaking Assessment of Conversation Tests with Novel Graph-based Modeling on Spoken Response Coherence |
| 法律事实预测:任务定义与数据集构建 |
Junkai Liu |
PDF |
N/A |
Legal Fact Prediction: Task Definition and Dataset Construction |
| 母语与非母语提示:一项比较分析 |
Mohamed Bayan Kmainasi |
PDF |
N/A |
Native vs Non-Native Language Prompting: A Comparative Analysis |
| 在遥感领域中推动视觉-语言模型的发展,无需人工标注 |
Keumgang Cha |
PDF |
N/A |
Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations |
| 超越独立同分布(IID):从指令交互和依赖的角度优化指令学习 |
Hanyu Zhao |
PDF |
N/A |
Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency |
| 软影:利用半影感知软掩码进行阴影去除 |
Xinrui Wang |
PDF |
N/A |
SoftShadow: Leveraging Penumbra-Aware Soft Masks for Shadow Removal |
| Retinex-RAWMamba:为低光RAW图像增强架起去马赛克与去噪的桥梁 |
Xianmin Chen |
PDF |
N/A |
Retinex-RAWMamba: Bridging Demosaicing and Denoising for Low-Light RAW Image Enhancement |
| 基于语义挖掘和神经网络的电子商务网页推荐方案 |
Wenchao Zhao |
PDF |
N/A |
E-commerce Webpage Recommendation Scheme Base on Semantic Mining and Neural Networks |
| 从最优得分匹配到最优采样 |
Zehao Dou |
PDF |
N/A |
From optimal score matching to optimal sampling |
| 神经网络压缩中的动态误差有界分层矩阵 |
John Mango |
PDF |
N/A |
Dynamic Error-Bounded Hierarchical Matrices in Neural Network Compression |
| CPSample:分类器保护采样,用于在扩散过程中保护训练数据 |
Joshua Kazdan |
PDF |
N/A |
CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion |
| SCLNet:一种用于无人机图像目标检测的尺度鲁棒互补学习网络 |
Xuexue Li |
PDF |
N/A |
SCLNet: A Scale-Robust Complementary Learning Network for Object Detection in UAV Images |
| 洞察任意实例:可提示实例分割用于遥感图像 |
Xuexue Li |
PDF |
N/A |
Insight Any Instance: Promptable Instance Segmentation for Remote Sensing Images |
| EVENet:基于证据的集成学习用于使用扩散MRI进行不确定性感知的脑部分割 |
Chenjun Li |
PDF |
N/A |
EVENet: Evidence-based Ensemble Learning for Uncertainty-aware Brain Parcellation Using Diffusion MRI |
| 通过预训练音频模型的低秩适应微调提升异常声音检测 |
Xinhu Zheng |
PDF |
N/A |
Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models |
| 选择性学习中的泛化实用理论 |
Peizhi Wu |
PDF |
N/A |
A Practical Theory of Generalization in Selectivity Learning |
| 基于电子健康记录预测患者胸部X光图像的时间变化 |
Daeun Kyung |
PDF |
N/A |
Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records |
| 二维FS声呐图像特征检测方法的性能评估 |
Hitesh Kyatham |
PDF |
N/A |
Performance Assessment of Feature Detection Methods for 2-D FS Sonar Imagery |
| ODYSSEE:边缘电子传感器系统检测到的牡蛎产量 |
Xiaomin Lin |
PDF |
N/A |
ODYSSEE: Oyster Detection Yielded by Sensor Systems on Edge Electronics |
| AdvLogo:基于扩散模型的目标检测器对抗性补丁攻击 |
Boming Miao |
PDF |
N/A |
AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models |
| 学习在异质性下的图神经网络的个性化范围 |
Gangda Deng |
PDF |
N/A |
Learning Personalized Scoping for Graph Neural Networks under Heterophily |
| 预测-再优化任务之间的正确距离概念是什么? |
Paula Rodriguez-Diaz |
PDF |
N/A |
What is the Right Notion of Distance between Predict-then-Optimize Tasks? |
| RICAU-Net:用于心脏CT中分割小而稀疏钙化病变的残差块启发坐标注意力U-Net |
Doyoung Park |
PDF |
N/A |
RICAU-Net: Residual-block Inspired Coordinate Attention U-Net for Segmentation of Small and Sparse Calcium Lesions in Cardiac CT |
| 1M-Deepfakes检测挑战赛 |
Zhixi Cai |
PDF |
N/A |
1M-Deepfakes Detection Challenge |
| 增强跨领域预训练决策变换器与自适应注意力 |
Wenhao Zhao |
PDF |
N/A |
Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention |
| PanAdapter:基于空间-光谱先验注入的两阶段微调技术用于全色锐化 |
RuoCheng Wu |
PDF |
N/A |
PanAdapter: Two-Stage Fine-Tuning with Spatial-Spectral Priors Injecting for Pansharpening |
| 大型语言模型与扩展的丘奇-图灵论题 |
Jiří Wiedermann |
PDF |
N/A |
Large Language Models and the Extended Church-Turing Thesis |
| 脑启发式分步补丁合并用于视觉变换器 |
Yonghao Yu |
PDF |
N/A |
Brain-Inspired Stepwise Patch Merging for Vision Transformers |
| 使用数据驱动信号区域进行模型无关的新物理检测 |
Soheun Yi |
PDF |
N/A |
Toward Model-Agnostic Detection of New Physics Using Data-Driven Signal Regions |
| RLHF中对策略的过滤以微调LLM进行代码生成 |
Wei Shen |
PDF |
N/A |
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation |
| 通过自监督几何增强弥合点云表示的领域差异 |
Li Yu |
PDF |
N/A |
Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation |
| 通过使用条件生成器进行知识蒸馏实现隐私保护的联邦学习与一致性 |
Kangyang Luo |
PDF |
N/A |
Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator |
| 具有多个正确解的神经算法推理 |
Zeno Kujawa |
PDF |
N/A |
Neural Algorithmic Reasoning with Multiple Correct Solutions |
| 你有十三小时来解开迷宫:通过函数调用增强AI游戏主持人 |
Jaewoo Song |
PDF |
N/A |
You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling |
| FSMDet:用于全稀疏三维检测器的视觉引导特征扩散 |
Tianran Liu |
PDF |
N/A |
FSMDet: Vision-guided feature diffusion for fully sparse 3D detector |
| 使用DAFS Express在L3椎体水平的2D MRI切片上进行自动体成分分析 |
Varun Akella |
PDF |
N/A |
Automated Body Composition Analysis Using DAFS Express on 2D MRI Slices at L3 Vertebral Level |
| FreeRide:在流水线并行中收获泡沫 |
Jiashu Zhang |
PDF |
N/A |
FreeRide: Harvesting Bubbles in Pipeline Parallelism |
| k-MLE、k-Bregman、k-VARs:理论、收敛性、计算 |
Zuogong Yue |
PDF |
N/A |
k-MLE, k-Bregman, k-VARs: Theory, Convergence, Computation |
| 产时超声图像分割:利用双学生-教师框架结合CNN-ViT协同学习技术对耻骨联合和胎儿头部进行分割 |
Jianmei Jiang |
PDF |
N/A |
Intrapartum Ultrasound Image Segmentation of Pubic Symphysis and Fetal Head Using Dual Student-Teacher Framework with CNN-ViT Collaborative Learning |
| 表示调优 |
Christopher M. Ackerman |
PDF |
N/A |
Representation Tuning |
| 重新思考神经隐式曲面重建中的方向参数化 |
Zijie Jiang |
PDF |
N/A |
Rethinking Directional Parameterization in Neural Implicit Surface Reconstruction |