跳转至

Arxiv 2024-09-24 Papers

标题 作者 PDF链接 代码仓库 Title
使用基于SAM2的跟踪进行在线轴估计的关节物体操作 Xi Wang PDF N/A Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking
通过对比随机游走实现的自监督任意点跟踪 Ayush Shrivastava PDF N/A Self-Supervised Any-Point Tracking by Contrastive Random Walks
Gen2Act:在新场景中生成人类视频,实现可泛化的机器人操作 Homanga Bharadhwaj PDF N/A Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation
MonoFormer:一个Transformer同时适用于扩散和自回归 Chuyang Zhao PDF N/A MonoFormer: One Transformer for Both Diffusion and Autoregression
语义重聚焦调优用于开放词汇全景分割 Yong Xien Chng PDF N/A Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation
压缩深度图超分辨率与恢复:AIM 2024挑战赛结果 Marcos V. Conde PDF N/A Compressed Depth Map Super-Resolution and Restoration: AIM 2024 Challenge Results
AIM 2024超高清盲照片质量评估挑战赛 Vlad Hosu PDF N/A AIM 2024 Challenge on UHD Blind Photo Quality Assessment
CDChat:一种用于遥感变化描述的大型多模态模型 Mubashir Noman PDF N/A CDChat: A Large Multimodal Model for Remote Sensing Change Description
学习如何帮助:训练模型以协助旧设备 Yu Wu PDF N/A Learning To Help: Training Models to Assist Legacy Devices
全球农业田地边界分割的机器学习基准数据集:世界田地 Hannah Kerner PDF N/A Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation
一种快速且可靠的非连续命名实体识别标注方法 Caio Corro PDF N/A A fast and sound tagging method for discontinuous named-entity recognition
LLM回音室:个性化与自动化的虚假信息传播 Tony Ma PDF N/A LLM Echo Chamber: personalized and automated disinformation
标签增强的数据集蒸馏 Seoungyoon Kang PDF N/A Label-Augmented Dataset Distillation
通过廉价排序挖掘规则高效学习概率逻辑模型 Jonathan Feldstein PDF N/A Efficiently Learning Probabilistic Logical Models by Cheaply Ranking Mined Rules
EuroLLM:欧洲多语言语言模型 Pedro Henrique Martins PDF N/A EuroLLM: Multilingual Language Models for Europe
使用生存变压器、极端梯度提升和Cox比例风险模型预测轻度认知障碍的恶化 Henry Musto PDF N/A Predicting Deterioration in Mild Cognitive Impairment with Survival Transformers, Extreme Gradient Boosting and Cox Proportional Hazard Modelling
VideoPatchCore:一种有效的记忆正常视频以进行异常检测的方法 Sunghyun Ahn PDF N/A VideoPatchCore: An Effective Method to Memorize Normality for Video Anomaly Detection
微调是好的,只要校准得当 Zheda Mai PDF N/A Fine-Tuning is Fine, if Calibrated
利用大型语言模型提升对话式用户界面中的关联数据检索 Omar Mussa PDF N/A Towards Enhancing Linked Data Retrieval in Conversational UIs using Large Language Models
面向问题的聚类自动机器学习 Matheus Camilo da Silva PDF N/A Problem-oriented AutoML in Clustering
微型机器人数据集与持续目标检测基准 Francesco Pasti PDF N/A Tiny Robotics Dataset and Benchmark for Continual Object Detection
深度学习在精准农业中的应用:喷洒后评估与沉积量估算 Harry Rogers PDF N/A Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation
MaskBit:通过位标记实现的无嵌入图像生成 Mark Weber PDF N/A MaskBit: Embedding-free Image Generation via Bit Tokens
LLMCount:利用多模态大语言模型增强静态毫米波检测 Boyan Li PDF N/A LLMCount: Enhancing Stationary mmWave Detection with Multimodal-LLM
深度学习在前列腺癌诊断中的分割策略:Mamba、SAM 和 YOLO 的比较研究 Ali Badiezadeh PDF N/A Segmentation Strategies in Deep Learning for Prostate Cancer Diagnosis: A Comparative Study of Mamba, SAM, and YOLO
AUGUR,一种用于识别最佳吸附位点的灵活且高效的优化算法 Ioannis Kouroudis PDF N/A AUGUR, A flexible and efficient optimization algorithm for identification of optimal adsorption sites
表情增强型TTS:结合面部表情表示与情感强度实现自适应语音合成 Yunji Chu PDF N/A Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech
CJEval:一个使用中国初中考试数据评估大型语言模型的基准 Qianwen Zhang PDF N/A CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data
应用上肢自由呼吸磁共振指纹技术定量水T1和脂肪分数 Constantin Slioussarenko PDF N/A Upper-body free-breathing Magnetic Resonance Fingerprinting applied to the quantification of water T1 and fat fraction
利用估计的可迁移性优于人类直觉进行文本排序中的模型选择 Jun Bai PDF N/A Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking
具有函数逼近的上下文老虎机的二阶边界 Aldo Pacchiano PDF N/A Second Order Bounds for Contextual Bandits with Function Approximation
HelloBench:评估大型语言模型的长文本生成能力 Haoran Que PDF N/A HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
专家级视觉语言基础模型,适用于实际放射学应用及全面评估 Xiaohong Liu PDF N/A Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation
SDFit:通过将可变形SDF拟合到单张图像来实现3D物体姿态和形状的估计 Dimitrije Antić PDF N/A SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image
使用大型语言模型进行网络知识补全 Braden K Webb PDF N/A Cyber Knowledge Completion Using Large Language Models
将稳定且流行的匹配算法从二部图扩展到任意实例 Gergely Csáji PDF N/A Extending Stable and Popular Matching Algorithms from Bipartite to Arbitrary Instances
像玩乐高一样合并LoRA:通过秩级聚类将LoRA的模块化推向极致 Ziyu Zhao PDF N/A Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
EnIGMA:增强型交互式生成模型代理,用于CTF挑战赛 Talor Abramovich PDF N/A EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
MIMO:基于空间分解建模的可控角色视频合成 Yifang Men PDF N/A MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
ComiCap:一种用于漫画分镜密集标注的视觉语言模型流水线 Emanuele Vivoli PDF N/A ComiCap: A VLMs pipeline for dense captioning of Comic Panels
高效运动预测:一种轻量级且精确的轨迹预测模型,具备快速训练和推理速度 Alexander Prutsch PDF N/A Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed
控制检索增强生成的风险:一种反事实提示框架 Lu Chen PDF N/A Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting Framework
事物中的面孔:一种模型和数据集用于幻想性视错觉 Mark Hamilton PDF N/A Seeing Faces in Things: A Model and Dataset for Pareidolia
DiffPaSS -- 使用软分数的高性能可微分蛋白质序列配对 Umberto Lupo PDF N/A DiffPaSS -- High-performance differentiable pairing of protein sequences using soft scores
HA-FGOVD:通过显式线性组合突出细粒度属性以实现开放词汇对象检测 Yuqi Ma PDF N/A HA-FGOVD: Highlighting Fine-grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detection
评估最先进的自动语音识别模型在儿童-成人互动中的表现 Aditya Ashvin PDF N/A Evaluation of state-of-the-art ASR Models in Child-Adult Interactions
在练习过程中对语言学习的隐性评估与显性测试一样准确 Jue Hou PDF N/A Implicit assessment of language learning during practice as accurate as explicit testing
VisioPhysioENet:利用视觉和生理信号进行多模态参与度检测 Alakhsimar Singh PDF N/A VisioPhysioENet: Multimodal Engagement Detection using Visual and Physiological Signals
分析评估智能体能力的概率方法 Axel Højmark PDF N/A Analyzing Probabilistic Methods for Evaluating Agent Capabilities
MOSS:为AI代理提供代码驱动的演进与上下文管理 Ming Zhu PDF N/A MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents
TabEBM:一种基于不同类特定能量模型的表格数据增强方法 Andrei Margeloiu PDF N/A TabEBM: A Tabular Data Augmentation Method with Distinct Class-Specific Energy-Based Models
自注意力机制作为吸引子网络:无需反向传播的瞬态记忆 Francesco D'Amico PDF N/A Self-attention as an attractor network: transient memories without backpropagation
CloudTrack:基于云语义的可扩展无人机追踪 Yannik Blei PDF N/A CloudTrack: Scalable UAV Tracking with Cloud Semantics
使用场景方案:医疗领域中保护说话者隐私的威胁模型规范 Mehtab Ur Rahman PDF N/A Scenario of Use Scheme: Threat Model Specification for Speaker Privacy Protection in the Medical Domain
神经形态无人机检测:一种事件-RGB多模态方法 Gabriele Magrini PDF N/A Neuromorphic Drone Detection: an Event-RGB Multimodal Approach
数字化转型在医疗领域的应用:人工智能如何提升医疗系统的效能 África Periáñez PDF N/A The Digital Transformation in Health: How AI Can Improve the Performance of Health Systems
探索开放领域问答中的提示生成方法 Jamshid Mozafari PDF N/A Exploring Hint Generation Approaches in Open-Domain Question Answering
从像素到文字:通过交互式自然语言处理利用人脸识别中的可解释性 Ivan DeAndres-Tame PDF N/A From Pixels to Words: Leveraging Explainability in Face Recognition through Interactive Natural Language Processing
评估神经网络中的简化水平:超参数配置对复杂性和敏感性的影响 Huixin Guan PDF N/A Assessing Simplification Levels in Neural Networks: The Impact of Hyperparameter Configurations on Complexity and Sensitivity
MM-CamObj:一个全面的多模态数据集,适用于伪装物体场景 Jiacheng Ruan PDF N/A MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios
多模型集成方法用于心房颤动患者LGE-MRI中准确的双心房分割 Lucas Beveridge PDF N/A Multi-Model Ensemble Approach for Accurate Bi-Atrial Segmentation in LGE-MRI of Atrial Fibrillation Patients
GS-Net:面向多阶段青光眼分类的全球自注意力引导CNN Dipankar Das PDF N/A GS-Net: Global Self-Attention Guided CNN for Multi-Stage Glaucoma Classification
在线多层次对比表示蒸馏用于跨受试者fNIRS情绪识别 Zhili Lai PDF N/A Online Multi-level Contrastive Representation Distillation for Cross-Subject fNIRS Emotion Recognition
利用专家混合技术提升语音深度伪造检测 Viola Negroni PDF N/A Leveraging Mixture of Experts for Improved Speech Deepfake Detection
在FPGA上实现的极低延迟量子启发式机器学习预测器 Lorenzo Borella PDF N/A Ultra-low latency quantum-inspired machine learning predictors implemented on FPGA
开放世界目标检测与实例表示学习 Sunoh Lee PDF N/A Open-World Object Detection with Instance Representation Learning
自信学习:从软标签训练更好的分类器 Sjoerd de Vries PDF N/A Learning with Confidence: Training Better Classifiers from Soft Labels
用于光伏系统自动缺陷检测的机器学习方法 Swayam Rajat Mohanty PDF N/A Machine learning approaches for automatic defect detection in photovoltaic systems
一个关于委托-代理协作学习问题的决策理论模型 Getachew K Befekadu PDF N/A A decision-theoretic model for a principal-agent collaborative learning problem
使用合成损坏数据评估内窥镜深度估计的鲁棒性 An Wang PDF N/A Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data
生成三维心脏形状建模用于计算机模拟试验 Andrei Gasparovici PDF N/A Generative 3D Cardiac Shape Modelling for In-Silico Trials
面向鲁棒目标检测:通过模块不一致性分析识别和移除后门 Xianda Zhang PDF N/A Towards Robust Object Detection: Identifying and Removing Backdoors via Module Inconsistency Analysis
人脸识别的对抗性水印 Yuguang Yao PDF N/A Adversarial Watermarking for Face Recognition
去噪图超分辨率以改进对撞机事件重建 Nilotpal Kakati PDF N/A Denoising Graph Super-Resolution towards Improved Collider Event Reconstruction
全身末端执行器姿态跟踪 Tifanny Portela PDF N/A Whole-body end-effector pose tracking
LTNtorch:逻辑张量网络的PyTorch实现 Tommaso Carraro PDF N/A LTNtorch: PyTorch Implementation of Logic Tensor Networks
使用对比学习和方向梯度直方图增强无监督图像到图像翻译 Wanchen Zhao PDF N/A Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients
时间混合专家模型(Time-MoE):基于混合专家的十亿级时间序列基础模型 Xiaoming Shi PDF N/A Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
接地计算与意识:探索机器及其他生物意识的一个框架 Ryan Williams PDF N/A Grounded Computation & Consciousness: A Framework for Exploring Consciousness in Machines & Other Organisms
色调映射图像的深度色度压缩 Xenios Milidonis PDF N/A Deep chroma compression of tone-mapped images
解锁市场:跨市场问答的多语言基准 Yifei Yuan PDF N/A Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering
通过渲染函数和视觉-语言模型连接环境和语言 Theo Cachet PDF N/A Bridging Environments and Language with Rendering Functions and Vision-Language Models
AI可能存在认知偏见:基于LLM的批量相关性评估中的阈值启动探索性研究 Nuo Chen PDF N/A AI Can Be Cognitively Biased: An Exploratory Study on Threshold Priming in LLM-Based Batch Relevance Assessment
VascX 模型:用于彩色眼底图像视网膜血管分析的模型集成 Jose Vargas Quiros PDF N/A VascX Models: Model Ensembles for Retinal Vascular Analysis from Color Fundus Images
鲁棒神经IDA-PBC:基于耗散性的稳定化在近似条件下的应用 Santiago Sanchez-Escalonilla PDF N/A Robust Neural IDA-PBC: passivity-based stabilization under approximations
跨越语音与文本的界限:在大型语言模型中利用拼音到汉字的预训练提升自动语音识别 Yang Yuhang PDF N/A Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs
释放合成图像的潜力:一项关于病理图像分类的研究 Leire Benito-Del-Valle PDF N/A Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification
人工智能:人类在开发下一代人工智能中的作用 Suayb S. Arslan PDF N/A Artificial Human Intelligence: The role of Humans in the Development of Next Generation AI
NovelAI Diffusion V3中对SDXL的改进 Juan Ossa PDF N/A Improvements to SDXL in NovelAI Diffusion V3
具有重启和局部搜索机制的多算子集成LSHADE用于单目标优化 Dikshit Chauhan PDF N/A A Multi-operator Ensemble LSHADE with Restart and Local Search Mechanisms for Single-objective Optimization
比特币和推特的半强有效市场:提取关键词的语义向量空间与轻梯度提升机模型的分析 Fang Wang PDF N/A Semi-strong Efficient Market of Bitcoin and Twitter: an Analysis of Semantic Vector Spaces of Extracted Keywords and Light Gradient Boosting Machine Models
探索异常值变异性对异常检测评估指标的影响 Minjae Ok PDF N/A Exploring the Impact of Outlier Variability on Anomaly Detection Evaluation Metrics
DataGpt-SQL-7B:一个用于文本到SQL的开源语言模型 Lixia Wu PDF N/A DataGpt-SQL-7B: An Open-Source Language Model for Text-to-SQL
利用无监督学习实现成本效益高的视觉异常检测 Yunbo Long PDF N/A Leveraging Unsupervised Learning for Cost-Effective Visual Anomaly Detection
微调大型语言模型以进行比较评估任务 Vatsal Raina PDF N/A Finetuning LLMs for Comparative Assessment Tasks
StyleSinger 2:基于风格迁移和多层次风格控制的无监督歌声合成 Yu Zhang PDF N/A StyleSinger 2: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
解耦年龄和身份:一种基于互信息最小化方法的跨年龄说话人验证 Fengrun Zhang PDF N/A Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification
边缘设备协同计算用于多视图分类 Marco Palena PDF N/A Edge-device Collaborative Computing for Multi-view Classification
创造健康摩擦:确定利益相关者对工作推荐解释的需求 Roan Schellingerhout PDF N/A Creating Healthy Friction: Determining Stakeholder Requirements of Job Recommendation Explanations
CLIP中的对抗性后门防御 Junhao Kuang PDF N/A Adversarial Backdoor Defense in CLIP
在逆约束强化学习中可证明高效探索 Bo Yue PDF N/A Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
语义控制的虚拟现实户外场景重建与渲染中的高斯溅射 Hannah Schieber PDF N/A Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction and Rendering in Virtual Reality
混合量子卷积神经网络的集成框架方法用于乳腺癌图像分类 Dibyasree Guha PDF N/A An ensemble framework approach of hybrid Quantum convolutional neural networks for classification of breast cancer images
ASD-扩散:基于扩散模型的异常声音检测 Fengrun Zhang PDF N/A ASD-Diffusion: Anomalous Sound Detection with Diffusion Models
历史轨迹辅助的零阶联邦优化 Xiaoyu He PDF N/A Historical Trajectory Assisted Zeroth-Order Federated Optimization
注意提示:基于提示的类无关计数的新基准 Luca Ciampi PDF N/A Mind the Prompt: A Novel Benchmark for Prompt-based Class-Agnostic Counting
偏见之声:通过主题建模和性别偏见测量分析歌词 Danqing Chen PDF N/A Beats of Bias: Analyzing Lyrics with Topic Modeling and Gender Bias Measurements
TSFeatLIME:在单变量时间序列预测中增强可解释性的在线用户研究 Hongnan Ma PDF N/A TSFeatLIME: An Online User Study in Enhancing Explainability in Univariate Time Series Forecasting
CMA-ES中的采样:低数量的低差异点 Jacob de Nobel PDF N/A Sampling in CMA-ES: Low Numbers of Low Discrepancy Points
通过区域合并实现图像矢量化的形式化 Roy Y. He PDF N/A A Formalization of Image Vectorization by Region Merging
通过内卷和隐式对应实现的自监督形状补全 Mengya Liu PDF N/A Self-supervised Shape Completion via Involution and Implicit Correspondences
利用随机归一化流确定有效弦的宽度和形状的数值方法 Michele Caselle PDF N/A Numerical determination of the width and shape of the effective string using Stochastic Normalizing Flows
DepMamba:用于多模态抑郁症检测的渐进融合Mamba Jiaxin Ye PDF N/A DepMamba: Progressive Fusion Mamba for Multimodal Depression Detection
自动生成测试以评估工具增强的大型语言模型作为对话式AI代理 Samuel Arcadinho PDF N/A Automated test generation to evaluate tool-augmented LLMs as conversational AI agents
SLIMER-IT:意大利语零样本命名实体识别 Andrew Zamai PDF N/A SLIMER-IT: Zero-Shot NER on Italian Language
基于特征的初始对齐和基于强度的实例优化实现SHG与H&E图像的自动配准:对COMULIS挑战的贡献 Marek Wodzinski PDF N/A Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge
面对不对称——利用合成干预揭示面部对称性与表情分类器之间的因果关系 Tim Büchner PDF N/A Facing Asymmetry -- Uncovering the Causal Link between Facial Symmetry and Expression Classifiers using Synthetic Interventions
西班牙低资源语言的多语言迁移与领域适应 Yuanchang Luo PDF N/A Multilingual Transfer and Domain Adaptation for Low-Resource Languages of Spain
在指导性强化学习中克服奖励模型噪声 Sukai Huang PDF N/A Overcoming Reward Model Noise in Instruction-Guided Reinforcement Learning
学习用于激光雷达地点识别的紧凑通道相关性表示 Saimunur Rahman PDF N/A Learning Compact Channel Correlation Representation for LiDAR Place Recognition
深度卷积框架用于使用Compton相机探测器的BNCT剂量重建 Angelo Didonna PDF N/A Deep convolutional framelets for dose reconstruction in BNCT with Compton camera detector
黑暗中的规划:无专家参与的LLM-符号规划流水线 Sukai Huang PDF N/A Planning in the Dark: LLM-Symbolic Planning Pipeline without Experts
探索合作无人机3D测绘在肯尼亚稀树草原野生动物研究中的潜力 Vandita Shukla PDF N/A Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research
完美保真地解释词嵌入:研究影响预测案例研究 Lucie Dvorackova PDF N/A Explaining word embeddings with perfect fidelity: Case study in research impact prediction
基于模块化的策略用于缓解同时语音翻译中的梯度冲突 Xiaoqian Liu PDF N/A A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation
通过使用大型语言模型和移动应用程序实现先进的人机植物交互,提升基于物联网的植物健康监测 Kriti Agarwal PDF N/A Enhancing IoT based Plant Health Monitoring through Advanced Human Plant Interaction using Large Language Models and Mobile Applications
通过领域数据库知识注入增强大型语言模型的文本到SQL能力 Xingyu Ma PDF N/A Enhancing Text-to-SQL Capabilities of Large Language Models via Domain Database Knowledge Injection
利用专家混合增强的语音条件大语言模型提升代码转换自动语音识别 Fengrun Zhang PDF N/A Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM
Unimotion:统一3D人体运动合成与理解 Chuqiao Li PDF N/A Unimotion: Unifying 3D Human Motion Synthesis and Understanding
关于人工智能的五个问答 Alberto Prieto PDF N/A Five questions and answers about artificial intelligence
构造器:简单知识图谱问答的一个强大基线 Maria Lysyuk PDF N/A Konstruktor: A Strong Baseline for Simple Knowledge Graph Question Answering
FedRepOpt:联邦学习中的梯度重参数化优化器 Kin Wai Lau PDF N/A FedRepOpt: Gradient Re-parametrized Optimizers in Federated Learning
基于无监督注意力正则化的领域自适应甲骨文识别 Mei Wang PDF N/A Unsupervised Attention Regularization Based Domain Adaptation for Oracle Character Recognition
对称性和表达需求对于学习通用策略的影响 Dominik Drexler PDF N/A Symmetries and Expressive Requirements for Learning General Policies
HLB: 评估大型语言模型在语言使用中的人性化程度 Xufeng Duan PDF N/A HLB: Benchmarking LLMs' Humanlikeness in Language Use
CAD: 用于分割任何事物的内存高效卷积适配器 Joohyeok Kim PDF N/A CAD: Memory Efficient Convolutional Adapter for Segment Anything
研究解剖学先验知识在淋巴结分割中的性别偏见 Ricardo Coimbra Brioso PDF N/A Investigating Gender Bias in Lymph-node Segmentation with Anatomical Priors
自监督图嵌入聚类 Fangfang Li PDF N/A Self-Supervised Graph Embedding Clustering
关于powerset说话人日志模型校准的研究 Alexis Plaquet PDF N/A On the calibration of powerset speaker diarization models
通过角度分辨率增强和循环一致性学习实现无监督dMRI伪影检测 Sheng Chen PDF N/A Unsupervised dMRI Artifact Detection via Angular Resolution Enhancement and Cycle Consistency Learning
探索使用韵律参数的VQ-VAE用于说话人匿名化 Sotheara Leang PDF N/A Exploring VQ-VAE with Prosody Parameters for Speaker Anonymization
通过迁移学习实现的低资源印度语言机器翻译进展 Bin Wei PDF N/A Machine Translation Advancements of Low-Resource Indian Languages by Transfer Learning
零样本检测AI生成的图像 Davide Cozzolino PDF N/A Zero-Shot Detection of AI-Generated Images
血管细胞中的甾醇类物质及其在动脉粥样硬化中的作用 Celine Luquain-Costaz PDF N/A Oxysterols in Vascular Cells and Role in Atherosclerosis
蛇发女妖的低语:基于Transformer的ASR的多头高效解码 Yael Segal-Feldman PDF N/A Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASR
自然语言处理模型的隐私评估基准 Wei Huang PDF N/A Privacy Evaluation Benchmarks for NLP Models
上下文集成改进了视频-语言模型,用于从人类演示中理解低层次工作流程 Moucheng Xu PDF N/A In-Context Ensemble Improves Video-Language Models for Low-Level Workflow Understanding from Human Demonstrations
多无人机在未知环境中的在线规划追逃问题通过深度强化学习解决 Jiayu Chen PDF N/A Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning
BeSimulator:基于大型语言模型的文本行为模拟器 Jianan Wang PDF N/A BeSimulator: A Large Language Model Powered Text-based Behavior Simulator
一个零样本开放词汇对话理解管道 Abdulfattah Safa PDF N/A A Zero-Shot Open-Vocabulary Pipeline for Dialogue Understanding
基于神经网络的控制识别:近似线性化模型 Maxime Thieffry PDF N/A Identification For Control Based on Neural Networks: Approximately Linearizable Models
双网络增强:一种改进脉冲神经网络和高效权重量化的创新训练策略 Lucas Deckers PDF N/A Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization
iGAiVA:在文本分类的机器学习工作流程中集成生成式AI与可视化分析 Yuanzhe Jin PDF N/A iGAiVA: Integrated Generative AI and Visual Analytics in a Machine Learning Workflow for Text Classification
基于行为改变的视觉风险对象识别的场景可供性:势场 Pang-Yuan Pao PDF N/A Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification
自适应学习-测试:统计上有效且高效的超参数选择 Matteo Zecchin PDF N/A Adaptive Learn-then-Test: Statistically Valid and Efficient Hyperparameter Selection
从被动观看到主动学习:借助AI视频助手在数字课堂中实现积极主动的参与 Anna Bodonhelyi PDF N/A From Passive Watching to Active Learning: Empowering Proactive Participation in Digital Classrooms with AI Video Assistant
FSF-Net:利用粗略BEV场景流增强4D占用预测,助力自动驾驶 Erxin Guo PDF N/A FSF-Net: Enhance 4D Occupancy Forecasting with Coarse BEV Scene Flow for Autonomous Driving
深度学习技术在自动侧位X线头影测量标志点检测中的应用:问题是否已解决? Hongyuan Zhang PDF N/A Deep Learning Techniques for Automatic Lateral X-ray Cephalometric Landmark Detection: Is the Problem Solved?
PseudoNeg-MAE:使用条件伪负嵌入的自我监督点云学习 Sutharsan Mahendren PDF N/A PseudoNeg-MAE: Self-Supervised Point Cloud Learning using Conditional Pseudo-Negative Embeddings
介绍各向异性场以增强人群模拟中的多样性 Yihao Li PDF N/A Introducing Anisotropic Fields for Enhanced Diversity in Crowd Simulation
揭示语言能力神经元:一种心理语言学方法来建模可解释性 Xufeng Duan PDF N/A Unveiling Language Competence Neurons: A Psycholinguistic Approach to Model Interpretability
关于微调大型语言模型用于问答任务的实证见解 Junjie Ye PDF N/A Empirical Insights on Fine-Tuning Large Language Models for Question-Answering
监督微调:一种针对注意力头的激活模式优化过程 Yang Zhao PDF N/A Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads
SwiftDossier:基于LLMs和代理的定制化药物发现档案 Gabriele Fossi PDF N/A SwiftDossier: Tailored Automatic Dossier for Drug Discovery with LLMs and Agents
AsthmaBot:用于哮喘患者支持的多模态、多语言检索增强生成系统 Adil Bahaj PDF N/A AsthmaBot: Multi-modal, Multi-Lingual Retrieval Augmented Generation For Asthma Patient Support
交互式基于示例的解释,以提升健康专业人员在使用人工智能进行人机协作决策时的入职培训 Min Hun Lee PDF N/A Interactive Example-based Explanations to Improve Health Professionals' Onboarding with AI for Human-AI Collaborative Decision Making
分层模型合并用于分割任务中的无监督领域自适应 Roberto Alcover-Couso PDF N/A Layer-wise Model Merging for Unsupervised Domain Adaptation in Segmentation Tasks
基于Stable Diffusion微调的桥梁美学辅助设计 Leye Zhang PDF N/A Aided design of bridge aesthetics based on Stable Diffusion fine-tuning
用于三维分类的双曲图像与点云对比学习 Naiwen Hu PDF N/A Hyperbolic Image-and-Pointcloud Contrastive Learning for 3D Classification
一种使自动驾驶汽车在施工区域安全行驶的计算机视觉方法 Abu Shad Ahammed PDF N/A A Computer Vision Approach for Autonomous Cars to Drive Safe at Construction Zone
CLSP:用于智能体状态表示的高保真对比语言-状态预训练 Fuxian Huang PDF N/A CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation
NER-奢侈品:时尚与奢侈品领域的命名实体识别 Akim Mousterou PDF N/A NER-Luxury: Named entity recognition for the fashion and luxury domain
3D-JEPA:一种用于三维自监督表示学习的联合嵌入预测架构 Naiwen Hu PDF N/A 3D-JEPA: A Joint Embedding Predictive Architecture for 3D Self-Supervised Representation Learning
用于远程工业4.0应用的联邦学习中类别不平衡问题的多层次方法 Razin Farhan Hussain PDF N/A A Multi-Level Approach for Class Imbalance Problem in Federated Learning for Remote Industry 4.0 Applications
DIAL:用于弱监督语义分割的密集图像文本对齐 Soojin Jang PDF N/A DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation
面向大规模基础模型的天然气需求预测 Xinxing Zhou PDF N/A Towards Universal Large-Scale Foundational Model for Natural Gas Demand Forecasting
小型语言模型:综述、测量与洞察 Zhenyan Lu PDF N/A Small Language Models: Survey, Measurements, and Insights
深度学习实时相位检索:从X射线自由电子激光器获取不完美衍射图案 Sung Yun Lee PDF N/A Deep-learning real-time phase retrieval of imperfect diffraction patterns from X-ray free-electron lasers
训练数据归属:你的模型是否秘密地使用了由我创建的数据进行训练? Likun Zhang PDF N/A Training Data Attribution: Was Your Model Secretly Trained On Data Created By Mine?
混沌系统的零样本预测 Yuanzhao Zhang PDF N/A Zero-shot forecasting of chaotic systems
CHBench:一个用于评估大型语言模型健康状况的中文数据集 Chenlu Guo PDF N/A CHBench: A Chinese Dataset for Evaluating Health in Large Language Models
时空混合图专家模型用于多类型犯罪预测 Ziyang Wu PDF N/A Spatial-Temporal Mixture-of-Graph-Experts for Multi-Type Crime Prediction
IRSC:在检索增强生成场景中,通过语义理解进行信息检索的零样本评估基准 Hai Lin PDF N/A IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through Semantic Comprehension in Retrieval-Augmented Generation Scenarios
XTRUST:关于大型语言模型多语言可信度的研究 Yahan Li PDF N/A XTRUST: On the Multilingual Trustworthiness of Large Language Models
TFG:扩散模型的统一无训练指导 Haotian Ye PDF N/A TFG: Unified Training-Free Guidance for Diffusion Models
杂技机器人分阶段奖励塑造:一种约束多目标强化学习方法 Dohyeong Kim PDF N/A Stage-Wise Reward Shaping for Acrobatic Robots: A Constrained Multi-Objective Reinforcement Learning Approach
使用离线强化学习算法开发和验证肝素剂量策略 Yooseok Lim PDF N/A Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm
生成式人工智能在电动汽车互联网中的作用 Hanwen Zhang PDF N/A The Roles of Generative Artificial Intelligence in Internet of Electric Vehicles
STEM领域多模态答题卡的自动化评估 Rajlaxmi Patil PDF N/A Automated Assessment of Multimodal Answer Sheets in the STEM domain
训练神经网络以实现模块化有助于提高可解释性 Satvik Golechha PDF N/A Training Neural Networks for Modularity aids Interpretability
ManiNeg:用于乳腺X线摄影分类的表现指导多模态预训练 Xujun Li PDF N/A ManiNeg: Manifestation-guided Multimodal Pretraining for Mammography Classification
ViKL:一种通过视觉-知识-语言特征多模态聚合的乳腺X线摄影解读框架 Xin Wei PDF N/A ViKL: A Mammography Interpretation Framework via Multimodal Aggregation of Visual-knowledge-linguistic Features
物联网边缘设备上的实时行人检测:一种轻量级深度学习方法 Muhammad Dany Alfikri PDF N/A Real-Time Pedestrian Detection on IoT Edge Devices: A Lightweight Deep Learning Approach
因材施教:通过提示池和深度-任意约束进行恶劣天气恢复 Sixiang Chen PDF N/A Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint
随机优化中基于随机模型的信赖域序列二次规划方法 Yuchen Fang PDF N/A Trust-Region Sequential Quadratic Programming for Stochastic Optimization with Random Models
EvoFA:可进化的快速适应用于脑电情绪识别 Ming Jin PDF N/A EvoFA: Evolvable Fast Adaptation for EEG Emotion Recognition
假设聚类与合并:基于说话人标记的新型多说话人语音识别 Yosuke Kashiwagi PDF N/A Hypothesis Clustering and Merging: Novel MultiTalker Speech Recognition with Speaker Tokens
从自动驾驶中的潜在世界模型学习多个概率决策 Lingyu Xiao PDF N/A Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving
密集联想记忆中的顺序学习 Hayden McAlister PDF N/A Sequential Learning in the Dense Associative Memory
LaPose:基于RGB的类别级物体姿态估计的拉普拉斯混合形状建模 Ruida Zhang PDF N/A LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation