跳转至

Arxiv 2024-09-11 Papers

标题 作者 PDF链接 代码仓库 Title
自演化深度监督的三维高斯光栅化技术从渲染的立体图像对中生成 Sadra Safadoust PDF N/A Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs
DreamMesh:联合操作和纹理三角网格以实现文本到3D生成 Haibo Yang PDF N/A DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation
“我的分数不对!”:一种可争议的AI框架,用于在评估学生作文时进行互动反馈 Shengxin Hong PDF N/A "My Grade is Wrong!": A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays
Hi3D:追求高分辨率图像到3D生成的视频扩散模型 Haibo Yang PDF N/A Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models
FreeEnhance:通过内容一致的噪声添加与去噪过程实现无需调整的图像增强 Yang Luo PDF N/A FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process
VMAS:通过网络音乐视频中的语义对齐实现视频到音乐的生成 Yan-Bo Lin PDF N/A VMAS: Video-to-Music Generation via Semantic Alignment in Web Music Videos
引入扰动能力评分(PS)以增强对抗逃避性对抗攻击的鲁棒性于ML-NIDS Mohamed elShehaby PDF N/A Introducing Perturb-ability Score (PS) to Enhance Robustness Against Evasion Adversarial Attacks on ML-NIDS
StereoCrafter:基于扩散模型的单目视频生成长时间高质量立体3D内容 Sijie Zhao PDF N/A StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
长尾类增量学习的自适应适配器路由 Zhi-Hong Qi PDF N/A Adaptive Adapter Routing for Long-Tailed Class-Incremental Learning
SUPER:评估代理在从研究仓库中设置和执行任务的能力 Ben Bogin PDF N/A SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories
一套用于声学语言模型评估的工具包 Gallil Maimon PDF N/A A Suite for Acoustic Language Model Evaluation
线性模型中带有Dropout正则化的随机梯度下降的渐近性 Jiaqi Li PDF N/A Asymptotics of Stochastic Gradient Descent with Dropout Regularization in Linear Models
合成连续预训练 Zitong Yang PDF N/A Synthetic continued pretraining
代理工作流程记忆 Zora Zhiruo Wang PDF N/A Agent Workflow Memory
基于深度神经网络的手语识别:一种利用迁移学习与可解释性的综合方法 A. E. M Ridwan PDF N/A Deep Neural Network-Based Sign Language Recognition: A Comprehensive Approach Using Transfer Learning with Explainability
迈向更公平的健康建议:通过词义消歧寻找信息量丰富的无偏样本 Gavin Butts PDF N/A Towards Fairer Health Recommendations: finding informative unbiased samples via Word Sense Disambiguation
通过解释增强自然语言推理中的对抗鲁棒性 Alexandros Koulakos PDF N/A Enhancing adversarial robustness in Natural Language Inference using explanations
利用条件StyleGAN和潜在空间操作的可控视网膜图像合成,以改进糖尿病视网膜病变的诊断和分级 Somayeh Pakdelmoez PDF N/A Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy
高效的一步扩散优化用于快照压缩成像 Yunzhen Wang PDF N/A Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging
用于列表推荐时间抽象的分层强化学习 Luo Ji PDF N/A Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation
SoK: 医疗人工智能的安全与隐私风险 Yuanhaur Chang PDF N/A SoK: Security and Privacy Risks of Medical AI
NVRC:神经视频表示压缩 Ho Man Kwan PDF N/A NVRC: Neural Video Representation Compression
通过叶状结构和知识迁移进行流形学习 E. Tron PDF N/A Manifold Learning via Foliations and Knowledge Transfer
稳健的机器人行走者:学习在微小陷阱上敏捷移动 Shaoting Zhu PDF N/A Robust Robot Walker: Learning Agile Locomotion over Tiny Traps
CLNX:连接代码与自然语言,助力C/C++漏洞贡献提交的识别 Zeqing Qin PDF N/A CLNX: Bridging Code and Natural Language for C/C++ Vulnerability-Contributing Commits Identification
多模态对比学习中应如何对齐? Benoit Dufumier PDF N/A What to align in multimodal contrastive learning?
连续时间随机梯度下降的收敛性及其在深度线性神经网络中的应用 Gabor Lugosi PDF N/A Convergence of continuous-time stochastic gradient descent with applications to linear deep neural networks
重新审视基于静态特征的安卓恶意软件检测 Md Tanvirul Alam PDF N/A Revisiting Static Feature-Based Android Malware Detection
AdaCAD:自适应解码以平衡上下文知识与参数化知识之间的冲突 Han Wang PDF N/A AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
一种可扩展的主动学习算法 Youguang Chen PDF N/A A Scalable Algorithm for Active Learning
D-CAPTCHA++:深度伪造验证码在可转移的不可感知对抗攻击下的韧性研究 Hong-Hanh Nguyen-Le PDF N/A D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack
多模态情感计算的最新趋势:从自然语言处理角度进行的调查 Guimin Hu PDF N/A Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective
一种用于持续学习任务的对称前向前向算法(SFFA)对比研究 Erik B. Terres-Escudero PDF N/A A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks
多变量控制以减轻CRISPRa网络中的负载 Krishna Manoj PDF N/A Multi-variable control to mitigate loads in CRISPRa networks
FIRAL:一种用于多项逻辑回归的主动学习算法 Youguang Chen PDF N/A FIRAL: An Active Learning Algorithm for Multinomial Logistic Regression
唤醒幻灯片:一种通过语言模型协调的无调优与知识调控的AI辅导系统 Daniel Zhang-Li PDF N/A Awaking the Slides: A Tuning-free and Knowledge-regulated AI Tutoring System via Language Model Coordination
演示:SGCode:一个灵活的提示优化系统,用于安全生成代码 Khiem Ton PDF N/A Demo: SGCode: A Flexible Prompt-Optimizing System for Secure Generation of Code
基于事件的拼接捆绑调整 Shuang Guo PDF N/A Event-based Mosaicing Bundle Adjustment
量化膝关节软骨形态和损伤:从图像到指标 Yongcheng Yao PDF N/A Quantifying Knee Cartilage Shape and Lesion: From Image to Metrics
无需训练的离散扩散模型分子生成指导 Thomas J. Kerby PDF N/A Training-Free Guidance for Discrete Diffusion Models for Molecular Generation
共同思考,协作更佳:结合人类与大语言模型(LLMs)的出声思考结果,实现有效文本评估 SeongYeub Chu PDF N/A Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation
通过一个强大的编码器保护视觉-语言模型,以抵御越狱和对抗性攻击 Md Zarif Hossain PDF N/A Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks
联合印象用于处理分布式异构数据的学习 Sana Ayromlou PDF N/A Federated Impression for Learning with Distributed Heterogeneous Data
可解释人工智能在革新人类健康监测中的作用 Abdullah Alharthi PDF N/A The Role of Explainable AI in Revolutionizing Human Health Monitoring
在线决策元形变器:一种基于通用具身智能的强化学习框架 Luo Ji PDF N/A Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence
通过元数据发现预测游戏平衡性变化影响的框架 Akash Saravanan PDF N/A A Framework for Predicting the Impact of Game Balance Changes through Meta Discovery
基准测试二维自我中心手势数据集 Olga Taran PDF N/A Benchmarking 2D Egocentric Hand Pose Datasets
解释、辩论、对齐:一种从弱到强的语言模型泛化框架 Mehrdad Zakershahrak PDF N/A Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization
学习压缩上下文以实现基于知识的视觉问答的高效性 Weixi Weng PDF N/A Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering
当前用于表示学习的对称群等变卷积框架 Ramzan Basheer PDF N/A Current Symmetry Group Equivariant Convolution Frameworks for Representation Learning
ART:用于重建无噪声多通道脑电信号的去除伪影变压器 Chun-Hsiang Chuang PDF N/A ART: Artifact Removal Transformer for Reconstructing Noise-Free Multichannel Electroencephalographic Signals
通过多重假设检验实现的统计有效信息瓶颈 Amirmohammad Farzaneh PDF N/A Statistically Valid Information Bottleneck via Multiple Hypothesis Testing
通过一致性模型实现玻尔兹曼分布的高效且无偏采样 Fengzhe Zhang PDF N/A Efficient and Unbiased Sampling of Boltzmann Distributions via Consistency Models
用于机器学习应用的三维多模态同步辐射数据 Calum Green PDF N/A Three-Dimensional, Multimodal Synchrotron Data for Machine Learning Applications
模块化自适应对抗训练用于端到端自动驾驶 Tianyuan Zhang PDF N/A Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving
MEDIC:面向临床应用中评估大型语言模型的综合框架 Praveen K Kanithi PDF N/A MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications
使用丢番图方程编码优化神经网络性能和可解释性 Ronald Katende PDF N/A Optimizing Neural Network Performance and Interpretability with Diophantine Equation Encoding
基于混合线性模型和元森林的非侵入式血糖预测系统,用于领域泛化 Yuyang Sun PDF N/A Non-Invasive Glucose Prediction System Enhanced by Mixed Linear Models and Meta-Forests for Domain Generalization
通过潜在扩散进行数据增强以进行显著性预测 Bahar Aydemir PDF N/A Data Augmentation via Latent Diffusion for Saliency Prediction
BLS-GAN:一种用于消除常规放射照片中骨骼重叠的深度分层框架 Haolin Wang PDF N/A BLS-GAN: A Deep Layer Separation Framework for Eliminating Bone Overlap in Conventional Radiographs
PaveSAM路面病害分割 Neema Jakisa Owor PDF N/A PaveSAM Segment Anything for Pavement Distress
一个统一的对比损失用于自训练 Aurelien Gauffre PDF N/A A Unified Contrastive Loss for Self-Training
探索带有扩散先验的用户级梯度反演 Zhuohang Li PDF N/A Exploring User-level Gradient Inversion with a Diffusion Prior
使用生成式代理创建调查数据报道的提示表 Joris Veerbeek PDF N/A Using Generative Agents to Create Tip Sheets for Investigative Data Reporting
TLD-READY: 交通灯检测 -- 相关性评估与部署分析 Nikolai Polley PDF N/A TLD-READY: Traffic Light Detection -- Relevance Estimation and Deployment Analysis
无需调参的在线鲁棒主成分分析通过隐式正则化 Lakshmi Jayalal PDF N/A Tuning-Free Online Robust Principal Component Analysis through Implicit Regularization
重放:一个用于实验和生产使用的推荐框架 Alexey Vasilev PDF N/A RePlay: a Recommendation Framework for Experimentation and Production Use
CCFExp:面向面瘫个体的循环交叉融合扩散模型面部图像合成 Weixiang Gao PDF N/A CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals
现实且高效的人脸交换:基于扩散模型的统一方法 Sanoojan Baliah PDF N/A Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models
多类型偏好学习:赋予基于偏好的强化学习以平等偏好 Ziang Liu PDF N/A Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences
MiniDrive:通过将多层次2D特征作为文本标记,为自动驾驶提供更高效视觉语言模型 Enming Zhang PDF N/A MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving
跨方言文本到语音转换在音调重音语言中结合多方言音素级BERT Kazuki Yamauchi PDF N/A Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT
TopoMap++:一种更快且更节省空间的计算投影技术,具有拓扑保证 Vitoria Guardieiro PDF N/A TopoMap++: A faster and more space efficient technique to compute projections with topological guarantees
MRAC 轨道1:第二届多模态、生成与负责任情感计算研讨会 Shreya Ghosh PDF N/A MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing
EMOdiffhead:通过扩散实现连续情感控制的说话头生成 Jian Zhang PDF N/A EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion
扩散模型对齐:基础、挑战与未来 Buhua Liu PDF N/A Alignment of Diffusion Models: Fundamentals, Challenges, and Future
具有灵活个性化功能的联邦$\mathcal{X}$-臂老虎机 Ali Arabzadeh PDF N/A Federated $\mathcal{X}$-armed Bandit with Flexible Personalisation
仇恨宣传:对阿拉伯语模因的多模态分析与多智能体大型语言模型 Firoj Alam PDF N/A Propaganda to Hate: A Multimodal Analysis of Arabic Memes with Multi-Agent LLMs
通过SO(2)-等变高斯雕刻网络进行单视图三维重建 Ruihan Xu PDF N/A Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks
PiTe:大型视频-语言模型的像素-时间对齐 Yang Liu PDF N/A PiTe: Pixel-Temporal Alignment for Large Video-Language Model
Diff-VPS:通过多任务扩散网络与对抗性时间推理实现视频息肉分割 Yingling Lu PDF N/A Diff-VPS: Video Polyp Segmentation via a Multi-task Diffusion Network with Adversarial Temporal Reasoning
3DGCQA:一个用于3D AI生成内容的质量评估数据库 Yingjie Zhou PDF N/A 3DGCQA: A Quality Assessment Database for 3D AI-Generated Contents
通过平均梯度流进行黎曼联邦学习 Zhenwei Huang PDF N/A Riemannian Federated Learning via Averaging Gradient Stream
观察名单挑战:第三届开放集人脸检测与识别 Furkan Kasım PDF N/A Watchlist Challenge: 3rd Open-set Face Detection and Identification
行为克隆模型:自动驾驶现实检验 Mustafa Yildirim PDF N/A Behavioral Cloning Models Reality Check for Autonomous Driving
合并是否值得?安全地评估因果数据集获取的信息增益 Jake Fawkes PDF N/A Is merging worth it? Securely evaluating the information gain for causal dataset acquisition
增强基于CTC的视觉语音识别 Hendrik Laux PDF N/A Enhancing CTC-Based Visual Speech Recognition
在线扩展图上的图滤波 Bishwadeep Das PDF N/A Online Graph Filtering Over Expanding Graphs
通过拼接预训练块实现联邦学习的异质性感知协调 Shichen Zhan PDF N/A Heterogeneity-Aware Coordination for Federated Learning via Stitching Pre-trained blocks
ThermalGaussian:热成像3D高斯散射 Rongfeng Lu PDF N/A ThermalGaussian: Thermal 3D Gaussian Splatting
网络欺骗:现状、趋势与开放挑战 Pedro Beltrán López PDF N/A Cyber Deception: State of the art, Trends and Open challenges
基于人工智能系统的需求工程成熟度如何?关于实践、挑战和未来研究方向的系统映射研究 Umm-e- Habiba PDF N/A How Mature is Requirements Engineering for AI-based Systems? A Systematic Mapping Study on Practices, Challenges, and Future Research Directions
在化学领域应用多保真贝叶斯优化:开放挑战与主要考虑 Edmund Judge PDF N/A Applying Multi-Fidelity Bayesian Optimization in Chemistry: Open Challenges and Major Considerations
AI引导的分子模拟在虚拟现实中的视角:探索高维分子系统中的模仿学习策略 Mohamed Dhouioui PDF N/A A Perspective on AI-Guided Molecular Simulations in VR: Exploring Strategies for Imitation Learning in Hyperdimensional Molecular Systems
伏羲-2.0:推进机器学习天气预报模型以实现实际应用 Xiaohui Zhong PDF N/A FuXi-2.0: Advancing machine learning weather forecasting model for practical applications
通过方向性编码和几何约束提升脑扩散张量成像中的角度分辨率 Sheng Chen PDF N/A Enhancing Angular Resolution via Directionality Encoding and Geometric Constraints in Brain Diffusion Tensor Imaging
Phy124:从单张图像快速生成物理驱动的4D内容 Jiajing Lin PDF N/A Phy124: Fast Physics-Driven 4D Content Generation from a Single Image
结合机器学习局部预测与计算流体动力学求解器,加速瞬态浮力羽流模拟 Clément Caron PDF N/A Coupling Machine Learning Local Predictions with a Computational Fluid Dynamics Solver to Accelerate Transient Buoyant Plume Simulations
Swin-LiteMedSAM:一种基于轻量级框的分割任意模型,适用于大规模医学图像数据集 Ruochen Gao PDF N/A Swin-LiteMedSAM: A Lightweight Box-Based Segment Anything Model for Large-Scale Medical Image Datasets
AC-IND:基于衰减系数估计和隐式神经分布的稀疏CT重建 Wangduo Xie PDF N/A AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution
通过强化学习学习高效的递归数字系统 Jonathan D. Thomas PDF N/A Learning Efficient Recursive Numeral Systems via Reinforcement Learning
具有SummaryMixing的线性时间复杂度一致器用于流式语音识别 Titouan Parcollet PDF N/A Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition
曼巴策略:基于混合选择性状态模型的高效三维扩散策略 Jiahang Cao PDF N/A Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models
使用大型语言模型对应用评论进行细粒度情感分析:一项评估研究 Faiz Ali Shah PDF N/A A Fine-grained Sentiment Analysis of App Reviews using Large Language Models: An Evaluation Study
神经算法推理中的循环聚合器 Kaijia Xu PDF N/A Recurrent Aggregators in Neural Algorithmic Reasoning
零样本文本到语音作为黄金语音生成器:一个系统框架及其在自动发音评估中的适用性 Tien-Hong Lo PDF N/A Zero-Shot Text-to-Speech as Golden Speech Generator: A Systematic Framework and its Applicability in Automatic Pronunciation Assessment
门控槽注意力:高效线性时间序列建模 Yu Zhang PDF N/A Gated Slot Attention for Efficient Linear-Time Sequence Modeling
在稀疏观测数据上的端到端学习与动力学和同化联合优化 Vadim Zinchenko PDF N/A Combined Optimization of Dynamics and Assimilation with End-to-End Learning on Sparse Observations
利用非结构化文本数据进行大型语言模型的联邦指令微调 Rui Ye PDF N/A Leveraging Unstructured Text Data for Federated Instruction Tuning of Large Language Models
无监督新奇检测方法基准测试与小波分解 Ariel Priarone PDF N/A Unsupervised Novelty Detection Methods Benchmarking with Wavelet Decomposition
基于大型语言模型的文本特征生成,用于可解释的机器学习 Vojtěch Balek PDF N/A LLM-based feature generation from text for interpretable machine learning
语言生成中的重排序法则:一种通信理论的视角 António Farinhas PDF N/A Reranking Laws for Language Generation: A Communication-Theoretic Perspective
MVLLaVA:一种用于统一和灵活的新视角合成的智能代理 Hanyu Jiang PDF N/A MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis
深度学习技术在手静脉生物识别中的应用:全面综述 Mustapha Hemis PDF N/A Deep Learning Techniques for Hand Vein Biometrics: A Comprehensive Review
DCMAC:通过上界训练实现需求感知的定制化多智能体通信 Dongkun Huo PDF N/A DCMAC: Demand-aware Customized Multi-Agent Communication via Upper Bound Training
交叉精炼:通过联合学习提升自然语言解释生成 Qianli Wang PDF N/A Cross-Refine: Improving Natural Language Explanation Generation by Learning in Tandem
知识空间的可信度受限修正 Kai Sauerwald PDF N/A Credibility-Limited Revision for Epistemic Spaces
盲图像质量评估的注意力下采样变换器、相对排序和自一致性 Mohammed Alsaafin PDF N/A Attention Down-Sampling Transformer, Relative Ranking and Self-Consistency for Blind Image Quality Assessment
使用数据集蒸馏和模型尺寸适应的TinyML设备上训练的持续和增量学习方法 Marcus Rüb PDF N/A A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption
使用TinyPropv2推进设备上神经网络训练:动态、稀疏和高效的反向传播 Marcus Rüb PDF N/A Advancing On-Device Neural Network Training with TinyPropv2: Dynamic, Sparse, and Efficient Backpropagation
通过元学习隐式神经表示实现快速医学形状重建 Gaia Romana De Paolis PDF N/A Fast Medical Shape Reconstruction via Meta-learned Implicit Neural Representations
冗余感知相机选择用于室内场景神经渲染 Zehao Wang PDF N/A Redundancy-Aware Camera Selection for Indoor Scene Neural Rendering
深度的术中高光谱相机照明校准 Alexander Baumann PDF N/A Deep intra-operative illumination calibration of hyperspectral cameras
CWT-Net:利用跨尺度小波变换的Transformer实现病理图像超分辨率 Feiyang Jia PDF N/A CWT-Net: Super-resolution of Histopathology Images Using a Cross-scale Wavelet-based Transformer
TrialSynth:生成合成顺序临床试验数据 Chufan Gao PDF N/A TrialSynth: Generation of Synthetic Sequential Clinical Trial Data
无本体自由泛领域知识图谱到文本生成数据集合成使用大型语言模型 Daehee Kim PDF N/A Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Model
通过错误信息理解大型语言模型中的知识漂移 Alina Fastowski PDF N/A Understanding Knowledge Drift in LLMs through Misinformation
多模态情感识别与视觉-语言提示及模态缺失 Anbin QI PDF N/A Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout
潜在空间解释用于风格分析和可解释的作者归属 Milad Alshomary PDF N/A Latent Space Interpretation for Stylistic Analysis and Explainable Authorship Attribution
边缘建模激活自由傅里叶网络用于航天器图像去噪 Jingfan Yang PDF N/A Edge Modeling Activation Free Fourier Network for Spacecraft Image Denoising
基于图模型的口语响应连贯性自动评估对话测试 Jiun-Ting Li PDF N/A Automated Speaking Assessment of Conversation Tests with Novel Graph-based Modeling on Spoken Response Coherence
法律事实预测:任务定义与数据集构建 Junkai Liu PDF N/A Legal Fact Prediction: Task Definition and Dataset Construction
母语与非母语提示:一项比较分析 Mohamed Bayan Kmainasi PDF N/A Native vs Non-Native Language Prompting: A Comparative Analysis
在遥感领域中推动视觉-语言模型的发展,无需人工标注 Keumgang Cha PDF N/A Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations
超越独立同分布(IID):从指令交互和依赖的角度优化指令学习 Hanyu Zhao PDF N/A Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency
软影:利用半影感知软掩码进行阴影去除 Xinrui Wang PDF N/A SoftShadow: Leveraging Penumbra-Aware Soft Masks for Shadow Removal
Retinex-RAWMamba:为低光RAW图像增强架起去马赛克与去噪的桥梁 Xianmin Chen PDF N/A Retinex-RAWMamba: Bridging Demosaicing and Denoising for Low-Light RAW Image Enhancement
基于语义挖掘和神经网络的电子商务网页推荐方案 Wenchao Zhao PDF N/A E-commerce Webpage Recommendation Scheme Base on Semantic Mining and Neural Networks
从最优得分匹配到最优采样 Zehao Dou PDF N/A From optimal score matching to optimal sampling
神经网络压缩中的动态误差有界分层矩阵 John Mango PDF N/A Dynamic Error-Bounded Hierarchical Matrices in Neural Network Compression
CPSample:分类器保护采样,用于在扩散过程中保护训练数据 Joshua Kazdan PDF N/A CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion
SCLNet:一种用于无人机图像目标检测的尺度鲁棒互补学习网络 Xuexue Li PDF N/A SCLNet: A Scale-Robust Complementary Learning Network for Object Detection in UAV Images
洞察任意实例:可提示实例分割用于遥感图像 Xuexue Li PDF N/A Insight Any Instance: Promptable Instance Segmentation for Remote Sensing Images
EVENet:基于证据的集成学习用于使用扩散MRI进行不确定性感知的脑部分割 Chenjun Li PDF N/A EVENet: Evidence-based Ensemble Learning for Uncertainty-aware Brain Parcellation Using Diffusion MRI
通过预训练音频模型的低秩适应微调提升异常声音检测 Xinhu Zheng PDF N/A Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models
选择性学习中的泛化实用理论 Peizhi Wu PDF N/A A Practical Theory of Generalization in Selectivity Learning
基于电子健康记录预测患者胸部X光图像的时间变化 Daeun Kyung PDF N/A Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records
二维FS声呐图像特征检测方法的性能评估 Hitesh Kyatham PDF N/A Performance Assessment of Feature Detection Methods for 2-D FS Sonar Imagery
ODYSSEE:边缘电子传感器系统检测到的牡蛎产量 Xiaomin Lin PDF N/A ODYSSEE: Oyster Detection Yielded by Sensor Systems on Edge Electronics
AdvLogo:基于扩散模型的目标检测器对抗性补丁攻击 Boming Miao PDF N/A AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models
学习在异质性下的图神经网络的个性化范围 Gangda Deng PDF N/A Learning Personalized Scoping for Graph Neural Networks under Heterophily
预测-再优化任务之间的正确距离概念是什么? Paula Rodriguez-Diaz PDF N/A What is the Right Notion of Distance between Predict-then-Optimize Tasks?
RICAU-Net:用于心脏CT中分割小而稀疏钙化病变的残差块启发坐标注意力U-Net Doyoung Park PDF N/A RICAU-Net: Residual-block Inspired Coordinate Attention U-Net for Segmentation of Small and Sparse Calcium Lesions in Cardiac CT
1M-Deepfakes检测挑战赛 Zhixi Cai PDF N/A 1M-Deepfakes Detection Challenge
增强跨领域预训练决策变换器与自适应注意力 Wenhao Zhao PDF N/A Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention
PanAdapter:基于空间-光谱先验注入的两阶段微调技术用于全色锐化 RuoCheng Wu PDF N/A PanAdapter: Two-Stage Fine-Tuning with Spatial-Spectral Priors Injecting for Pansharpening
大型语言模型与扩展的丘奇-图灵论题 Jiří Wiedermann PDF N/A Large Language Models and the Extended Church-Turing Thesis
脑启发式分步补丁合并用于视觉变换器 Yonghao Yu PDF N/A Brain-Inspired Stepwise Patch Merging for Vision Transformers
使用数据驱动信号区域进行模型无关的新物理检测 Soheun Yi PDF N/A Toward Model-Agnostic Detection of New Physics Using Data-Driven Signal Regions
RLHF中对策略的过滤以微调LLM进行代码生成 Wei Shen PDF N/A Policy Filtration in RLHF to Fine-Tune LLM for Code Generation
通过自监督几何增强弥合点云表示的领域差异 Li Yu PDF N/A Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation
通过使用条件生成器进行知识蒸馏实现隐私保护的联邦学习与一致性 Kangyang Luo PDF N/A Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator
具有多个正确解的神经算法推理 Zeno Kujawa PDF N/A Neural Algorithmic Reasoning with Multiple Correct Solutions
你有十三小时来解开迷宫:通过函数调用增强AI游戏主持人 Jaewoo Song PDF N/A You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling
FSMDet:用于全稀疏三维检测器的视觉引导特征扩散 Tianran Liu PDF N/A FSMDet: Vision-guided feature diffusion for fully sparse 3D detector
使用DAFS Express在L3椎体水平的2D MRI切片上进行自动体成分分析 Varun Akella PDF N/A Automated Body Composition Analysis Using DAFS Express on 2D MRI Slices at L3 Vertebral Level
FreeRide:在流水线并行中收获泡沫 Jiashu Zhang PDF N/A FreeRide: Harvesting Bubbles in Pipeline Parallelism
k-MLE、k-Bregman、k-VARs:理论、收敛性、计算 Zuogong Yue PDF N/A k-MLE, k-Bregman, k-VARs: Theory, Convergence, Computation
产时超声图像分割:利用双学生-教师框架结合CNN-ViT协同学习技术对耻骨联合和胎儿头部进行分割 Jianmei Jiang PDF N/A Intrapartum Ultrasound Image Segmentation of Pubic Symphysis and Fetal Head Using Dual Student-Teacher Framework with CNN-ViT Collaborative Learning
表示调优 Christopher M. Ackerman PDF N/A Representation Tuning
重新思考神经隐式曲面重建中的方向参数化 Zijie Jiang PDF N/A Rethinking Directional Parameterization in Neural Implicit Surface Reconstruction