跳转至

Arxiv 2024-11-14 Papers

标题 作者 PDF链接 代码仓库 Title
魔法羽毛笔:智能交互式图像编辑系统 Zichen Liu PDF N/A MagicQuill: An Intelligent Interactive Image Editing System
视觉变换器中注意力转移的惊人有效性 Alexander C. Li PDF N/A On the Surprising Effectiveness of Attention Transfer for Vision Transformers
一种基于贝叶斯优化的机器翻译重排序方法 Julius Cheng PDF N/A A Bayesian Optimization Approach to Machine Translation Reranking
CropCraft:用于作物植物三维重建的逆向程序建模 Albert J. Zhai PDF N/A CropCraft: Inverse Procedural Modeling for 3D Reconstruction of Crop Plants
利用多模态模型中的多尺度对齐推进细粒度视觉理解 Wei Wang PDF N/A Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models
零样本知识测试的LLM幻觉推理 Seongmin Lee PDF N/A LLM Hallucination Reasoning with Zero-shot Knowledge Test
压缩注意力:加速长上下文长度大型语言模型推理 Coleman Hooper PDF N/A Squeezed Attention: Accelerating Long Context Length LLM Inference
非线性单变量模型的条件回归 Yantao Wu PDF N/A Conditional regression for the Nonlinear Single-Variable Model
面向软件工程的开源机器学习模型和数据集分类研究 Alexandra González PDF N/A Towards a Classification of Open-Source ML Models and Datasets for Software Engineering
神经DEM——工业颗粒流的实时模拟 Benedikt Alkin PDF N/A NeuralDEM -- Real-time Simulation of Industrial Particulate Flows
通过潜在偏好优化进行自适应解码 Shehzaad Dhuliawala PDF N/A Adaptive Decoding via Latent Preference Optimization
Med-Bot:一款人工智能驱动的助手,提供准确可靠的医疗信息 Ahan Bhatt PDF N/A Med-Bot: An AI-Powered Assistant to Provide Accurate and Reliable Medical Information
机器学习模型是如何变化的? Joel Castaño PDF N/A How do Machine Learning Models Change?
神经算子可以玩动态斯塔克尔伯格博弈 Guillermo Alvarez PDF N/A Neural Operators Can Play Dynamic Stackelberg Games
语言生成的局限性:幻觉与模式崩溃之间的权衡 Alkis Kalavasis PDF N/A On the Limits of Language Generation: Trade-Offs Between Hallucination and Mode Collapse
MCCE:缺失感知因果概念解释器 Jifan Gao PDF N/A MCCE: Missingness-aware Causal Concept Explainer
一类二次-双线性Wasserstein分布鲁棒博弈的纳什均衡寻求 Georgios Pantazis PDF N/A Nash equilibrium seeking for a class of quadratic-bilinear Wasserstein distributionally robust games
从治疗前后重复测量随机对照试验中,对事实效能估算的反事实不确定性量化 Xingya Wang PDF N/A Counterfactual Uncertainty Quantification of Factual Estimand of Efficacy from Before-and-After Treatment Repeated Measures Randomized Controlled Trials
一次性操作策略学习:通过建立接触类比 Yuyao Liu PDF N/A One-Shot Manipulation Strategy Learning by Making Contact Analogies
在商用硬件上本地部署大规模音乐AI模型 Xun Zhou PDF N/A Local deployment of large-scale music AI models on commodity hardware
基于视觉的工业环境中透明塑料袋操作 F. Adetunji PDF N/A Vision-based Manipulation of Transparent Plastic Bags in Industrial Setups
MICCAI-CDMRI 2023 QuantConn挑战赛成果:通过协调扩散MRI预处理实现稳健定量连接 Nancy R. Newlin PDF N/A MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI
PTR:面向大型语言模型的精准驱动工具推荐 Hang Gao PDF N/A PTR: Precision-Driven Tool Recommendation for Large Language Models
道德基础微博语料库 Renjie Cao PDF N/A The Moral Foundations Weibo Corpus
TREC 2024 RAG赛道初始Nugget评估结果与AutoNuggetizer框架 Ronak Pradeep PDF N/A Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework
局部-全局注意力:一种用于多尺度特征融合的自适应机制 Yifan Shao PDF N/A Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration
利用大型语言模型加速知识图谱与本体工程 Cogan Shimizu PDF N/A Accelerating Knowledge Graph and Ontology Engineering with Large Language Models
混合波束图案和干扰控制下的低轨卫星通信延迟优化 Qianqian Zhang PDF N/A Latency Optimization in LEO Satellite Communications with Hybrid Beam Pattern and Interference Control
评估DINOv2自监督学习视觉Transformer模型在从MRI图像中分割左心房方面的性能 Bipasha Kundu PDF N/A Assessing the Performance of the DINOv2 Self-supervised Learning Vision Transformer Model for the Segmentation of the Left Atrium from MRI Images
LLaMA-Mesh:将3D网格生成与语言模型统一 Zhengyi Wang PDF N/A LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
SMILE-乌胡拉挑战赛——从超高分辨率7T磁共振血管造影中进行介观尺度的小血管分割 Soumick Chatterjee PDF N/A SMILE-UHURA Challenge -- Small Vessel Segmentation at Mesoscopic Scale from Ultra-High Resolution 7T Magnetic Resonance Angiograms
缺失数据可解释机器学习模型的专家研究 Lena Stempfle PDF N/A Expert Study on Interpretable Machine Learning Models with Missing Data
采用RAG进行LLM辅助的未来车辆设计 Vahid Zolfaghari PDF N/A Adopting RAG for LLM-Aided Future Vehicle Design
蜘蛛:任意到多模态大语言模型 Jinxiang Lai PDF N/A Spider: Any-to-Many Multimodal LLM
BabyLM挑战赛:探索变异集对语言模型训练效率的影响 Akari Haga PDF N/A BabyLM Challenge: Exploring the Effect of Variation Sets on Language Model Training Efficiency
基础模型驱动软件(FMware)的软件性能工程 Haoxiang Zhang PDF N/A Software Performance Engineering for Foundation Model-Powered Software (FMware)
通过图重写自动重构本质规范 Ian Miguel PDF N/A Automating Reformulation of Essence Specifications via Graph Rewriting
动态重建手-物体交互的分布式力感知接触表示 Zhenjun Yu PDF N/A Dynamic Reconstruction of Hand-Object Interaction with Distributed Force-aware Contact Representation
VPBSD:基于船舶模式的半监督蒸馏方法,用于高效的3D显微脑血管分割 Xi Lin PDF N/A VPBSD:Vessel-Pattern-Based Semi-Supervised Distillation for Efficient 3D Microscopic Cerebrovascular Segmentation
自适应偏差学习用于视觉异常检测与数据污染 Anindya Sundar Das PDF N/A Adaptive Deviation Learning for Visual Anomaly Detection with Data Contamination
运动放大图像处理 Nadaniela Egidi PDF N/A Image Processing for Motion Magnification
OOD-SEG:利用稀疏多类正样本标注进行图像分割的分布外检测 Junwen Wang PDF N/A OOD-SEG: Out-Of-Distribution detection for image SEGmentation with sparse multi-class positive-only annotations
MFTIQ:具有独立匹配质量评估的多流追踪器 Jonas Serych PDF N/A MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation
拼凑一切:验证多跳多模态声明 Haoran Wang PDF N/A Piecing It All Together: Verifying Multi-Hop Multimodal Claims
基于方程的数据驱动流动预算和动力学识别 Nataliya Sevryugina PDF N/A Equation-informed data-driven identification of flow budgets and dynamics
OpenGeMM:一种高利用率GeMM加速器生成器,配备轻量级RISC-V控制与紧密内存耦合 Xiaoling Yi PDF N/A OpenGeMM: A High-Utilization GeMM Accelerator Generator with Lightweight RISC-V Control and Tight Memory Coupling
提示未知:检测黑盒模型中的隐藏后门 Zi-Xuan Huang PDF N/A Prompting the Unseen: Detecting Hidden Backdoors in Black-Box Models
《有限数据下的语言模型微调实用指南》 Márton Szép PDF N/A A Practical Guide to Fine-tuning Language Models with Limited Data
使用智能边缘传感器系统进行无标记人体步态分析 Eva Katharina Bauer PDF N/A Marker-free Human Gait Analysis using a Smart Edge Sensor System
导航风险:基于大语言模型代理的安全、隐私和伦理威胁调查 Yuyou Gan PDF N/A Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents
随机化诚实拍卖与学习代理 Gagan Aggarwal PDF N/A Randomized Truthful Auctions with Learning Agents
基于生成对抗网络的低剂量计算机断层扫描图像去噪架构 Yunuo Wang PDF N/A GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising
张量并行大型语言模型推理中的通信压缩 Jan Hansen-Palmus PDF N/A Communication Compression for Tensor Parallel LLM Inference
迈向科学创新的凝聚性人工智能与仿真软件生态系统 Michael A. Heroux PDF N/A Toward a Cohesive AI and Simulation Software Ecosystem for Scientific Innovation
扩散模型的黄金噪声:一种学习框架 Zikai Zhou PDF N/A Golden Noise for Diffusion Models: A Learning Framework
基于强化学习的侧梁设计优化方法的发展 Aditya Borse PDF N/A Developement of Reinforcement Learning based Optimisation Method for Side-Sill Design
法律文本中可读性指标的应用:一项系统性文献综述 Yu Han PDF N/A The Use of Readability Metrics in Legal Text: A Systematic Literature Review
战略性牺牲:自组织机器人集群定位以提升检测效率 Sneha Ramshanker PDF N/A Strategic Sacrifice: Self-Organized Robot Swarm Localization for Inspection Productivity
MM-Eval:一个用于现代蒙古语评估的分层基准 Mengyuan Zhang PDF N/A MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs
图像匹配滤波与平面及超越平面的精细化处理 Fabio Bellavia PDF N/A Image Matching Filtering and Refinement by Planes and Beyond
用于压缩感知的稀疏贝叶斯生成模型 Benedikt Böck PDF N/A Sparse Bayesian Generative Modeling for Compressive Sensing
什么是好的BIM设计:设计行为与质量之间的量化联系 Xiang-Rui Ni PDF N/A What makes a good BIM design: quantitative linking between design behavior and quality
图神经网络与微分方程:一种用于流体流动数据同化的混合方法 M. Quattromini PDF N/A Graph Neural Networks and Differential Equations: A hybrid approach for data assimilation of fluid flows
残差下降路径:增强残差连接上的特征重用 Sejik Park PDF N/A ResidualDroppath: Enhancing Feature Reuse over Residual Connections
肾细胞癌亚型分类:从多分辨率定位中学习 Mohamad Mohamad PDF N/A Renal Cell Carcinoma subtyping: learning from multi-resolution localization
使用阴道镜图像进行宫颈癌前风险分类的可解释注意力模型 Smith K. Khare PDF N/A An Explainable Attention Model for Cervical Precancer Risk Classification using Colposcopic Images
利用机器学习实现自由电子激光脉冲功率的单发测量 Till Korten PDF N/A Harnessing Machine Learning for Single-Shot Measurement of Free Electron Laser Pulse Power
SINETRA:一种用于评估行为动物中单个神经元追踪的多功能框架 Raphael Reme PDF N/A SINETRA: a Versatile Framework for Evaluating Single Neuron Tracking in Behaving Animals
Caravan MultiMet:通过整合多个天气现报和预报扩展Caravan功能 Guy Shalev PDF N/A Caravan MultiMet: Extending Caravan with Multiple Weather Nowcasts and Forecasts
长尾目标检测预训练:动态重平衡对比学习与双重重构 Chen-Long Duan PDF N/A Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction
DiffRoad:为自动驾驶车辆测试生成真实且多样化的道路场景 Junjie Zhou PDF N/A DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing
图像重现:通过多模态大语言模型生成相同图像来评估文本到图像模型 Chutian Meng PDF N/A Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
学习高效且可证明收敛的分割方法 L. M. Kreusser PDF N/A Learning efficient and provably convergent splitting methods
从自然语言指令中提取模糊时间要求的机器人任务 Sascha Sucker PDF N/A Robot Tasks with Fuzzy Time Requirements from Natural Language Instructions
每个人都应被倾听:分析应用于荷兰语音数据的自动语音识别模型中的预测性别偏见 Rik Raes PDF N/A Everyone deserves their voice to be heard: Analyzing Predictive Gender Bias in ASR Models Applied to Dutch Speech Data
材料的人工智能驱动逆向设计:过去、现在与未来 Xiao-Qi Han PDF N/A AI-driven inverse design of materials: Past, present and future
一个适用于逻辑综合机器学习任务的自适应开源数据集生成框架 Liwei Ni PDF N/A An Adaptive Open-Source Dataset Generation Framework for Machine Learning Tasks in Logic Synthesis
SAG-ViT:一种基于图注意力机制的尺度感知、高保真补丁化方法,适用于视觉变换器 Shravan Venkatraman PDF N/A SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers
以脚本为中心的行为理解助力自闭症谱系障碍诊断 Wenxing Liu PDF N/A Script-centric behavior understanding for assisted autism spectrum disorder diagnosis
利用卫星影像中的阴影长度进行建筑物高度估计 Mahd Qureshi PDF N/A Building Height Estimation Using Shadow Length in Satellite Imagery
量子机器学习:量子计算与机器学习的交融 Jun Qi PDF N/A Quantum Machine Learning: An Interplay Between Quantum Computing and Machine Learning
非对比计算机断层扫描图像中缺血性脑卒中病变的自动分割,以提升治疗效果和预后 Toufiq Musah PDF N/A Automated Segmentation of Ischemic Stroke Lesions in Non-Contrast Computed Tomography Images for Enhanced Treatment and Prognosis
想象中的言语和视觉意象作为脑机接口的直观范式 Seo-Hyun Lee PDF N/A Imagined Speech and Visual Imagery as Intuitive Paradigms for Brain-Computer Interfaces
用于网络安全问题在线学习的固有可解释性与不确定性感知模型 Benjamin Kolicic PDF N/A Inherently Interpretable and Uncertainty-Aware Models for Online Learning in Cyber-Security Problems
少即是多:通过因果传播子结构检测未见领域虚假新闻 Shuzhi Gong PDF N/A Less is More: Unseen Domain Fake News Detection via Causal Propagation Substructures
分子模拟的概率生成框架调查 Richard John PDF N/A A survey of probabilistic generative frameworks for molecular simulations
指令驱动的红外-可见光图像融合:为多样化的下游任务量身定制 Zengyi Yang PDF N/A Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks
核掩码是否足以提升域外泛化能力?深入探讨组织病理学中的癌症分类问题 Dhananjay Tomar PDF N/A Are nuclear masks all you need for improved out-of-domain generalisation? A closer look at cancer classification in histopathology
DSCformer:一种双分支网络,结合增强型动态蛇卷积和SegFormer用于裂缝分割 Kaiwei Yu PDF N/A DSCformer: A Dual-Branch Network Integrating Enhanced Dynamic Snake Convolution and SegFormer for Crack Segmentation
LTLf+ 和 PPLTL+:将LTLf和PPLTL扩展至无限轨迹 Benjamin Aminof PDF N/A LTLf+ and PPLTL+: Extending LTLf and PPLTL to Infinite Traces
分布式随机梯度下降平均算法的稳定性和泛化性 Miaoxi Zhu PDF N/A Stability and Generalization for Distributed SGDA
3D医学影像的时间到事件预训练 Zepeng Huo PDF N/A Time-to-Event Pretraining for 3D Medical Imaging
您的固定水印易碎:面向EaaS版权保护的语义感知水印 Zekun Fei PDF N/A Your Fixed Watermark is Fragile: Towards Semantic-Aware Watermark for EaaS Copyright Protection
多尺度生成模型用于快速采样 Xiongye Xiao PDF N/A Multi-scale Generative Modeling for Fast Sampling
自适应增强一致性学习:一种用于遥感数据的半监督分割框架 Hui Ye PDF N/A Adaptively Augmented Consistency Learning: A Semi-supervised Segmentation Framework for Remote Sensing
近似变分贝叶斯逆强化学习用于大规模语言模型对齐 Yuang Cai PDF N/A Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment
轻量级Transformer在设备端语音情感识别中的重参数化 Zixing Zhang PDF N/A Re-Parameterization of Lightweight Transformer for On-Device Speech Emotion Recognition
改进用于稳态对流占优问题的hp-变分物理信息神经网络 Thivin Anandh PDF N/A Improving hp-Variational Physics-Informed Neural Networks for Steady-State Convection-Dominated Problems
DriveThru:一个用于印度尼西亚地方语言档案的文档提取平台和基准数据集 MohammadRifqi Farhansyah PDF N/A DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language Archives
Pie:为大型语言模型推理汇聚CPU内存 Yi Xu PDF N/A Pie: Pooling CPU Memory for LLM Inference
时间序列数据的近似概率推断:一种具有时间感知能力的鲁棒潜高斯模型 Anton Johansson PDF N/A Approximate Probabilistic Inference forTime-Series Data A Robust Latent Gaussian Model With Temporal Awareness
从Hinode SOT/SP观测中收集的太阳偏振光谱的压缩方法 Jargalmaa Batmunkh PDF N/A Compression Method for Solar Polarization Spectra Collected from Hinode SOT/SP Observations
探索在医学影像中利用CLIP进行零样本异常检测:我们是否已经达到目标? Aldo Marzullo PDF N/A Exploring Zero-Shot Anomaly Detection with CLIP in Medical Imaging: Are We There Yet?
DT-JRD:基于深度变换器的机器视频编码可识别差异预测模型 Junqi Liu PDF N/A DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines
基于脑电图的语音解码:一种利用多核集成扩散模型的新方法 Soowon Kim PDF N/A EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models
LHRS-Bot-Nova:用于遥感视觉语言解释的改进型多模态大型语言模型 Zhenshi Li PDF N/A LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation
DTELS:面向时间轴摘要动态粒度的研究 Chenlong Zhang PDF N/A DTELS: Towards Dynamic Granularity of Timeline Summarization
使用白盒对抗攻击增强高能物理中的泛化能力 Franck Rothen PDF N/A Enhancing generalization in high energy physics using white-box adversarial attacks
学习轻型外骨骼的手部状态估计 Gabriele Abbate PDF N/A Learning Hand State Estimation for a Light Exoskeleton
LLV-FSR:利用大规模语言-视觉先验进行人脸超分辨率 Chenyang Wang PDF N/A LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution
StreamAdapter:从上下文流中进行高效测试时间适应 Dilxat Muhtar PDF N/A StreamAdapter: Efficient Test Time Adaptation from Contextual Streams
基于多源异构迁移学习的跨域推荐集中-分布式迁移模型 Ke Xu PDF N/A A Centralized-Distributed Transfer Model for Cross-Domain Recommendation Based on Multi-Source Heterogeneous Transfer Learning
利用辅助分类进行肋骨骨折分割 Harini G. PDF N/A Leveraging Auxiliary Classification for Rib Fracture Segmentation
多模态大型语言模型中的跨模态一致性 Xiang Zhang PDF N/A Cross-Modal Consistency in Multimodal Large Language Models
利用多个大型语言模型进行信息检索:以生物多样性出版物中的深度学习方法为例的研究 Vamsi Krishna Kommineni PDF N/A Harnessing multiple LLMs for Information Retrieval: A case study on Deep Learning methodologies in Biodiversity publications
LES-Talker:线性情感空间中用于生成说话人头部的细粒度情感编辑 Guanwen Feng PDF N/A LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space
面向基于原型的去中心化学习的有效压缩与通信 Pablo Fernández-Piñeiro PDF N/A Towards efficient compression and communication for prototype-based decentralized learning
ChatGPT在视听深度伪造检测中的表现如何:ChatGPT、AI模型与人类感知能力的比较研究 Sahibzada Adil Shahzad PDF N/A How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception
胡须:数据集蒸馏对抗鲁棒性基准测试 Zheng Zhou PDF N/A BEARD: Benchmarking the Adversarial Robustness for Dataset Distillation
重新思考加权平均模型合并 Hu Wang PDF N/A Rethinking Weight-Averaged Model-merging
自动化自动评分:大型语言模型作为入门编程测试套件生成器 Umar Alkafaween PDF N/A Automating Autograding: Large Language Models as Test Suite Generators for Introductory Programming
越狱攻击与多模态生成模型防御:综述 Xuannan Liu PDF N/A Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey
DAHL:通过生物医学基准数据集,对长篇文本进行领域特定自动幻觉评估 Jean Seo PDF N/A DAHL: Domain-specific Automated Hallucination Evaluation of Long-Form Text through a Benchmark Dataset in Biomedicine
跨时空:一种时空单元化模型用于交通流量预测 Weilin Ruan PDF N/A Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting
嵌入空间分配与角度-范数联合分类器用于少样本类增量学习 Dunwei Tu PDF N/A Embedding Space Allocation with Angle-Norm Joint Classifiers for Few-Shot Class-Incremental Learning
通过模型增强提升语言模型在金融领域的适应性 Kota Tanabe PDF N/A Enhancing Financial Domain Adaptation of Language Models via Model Augmentation
统一神经解码:从脑电信号中感知、口语和想象语音的解码 Jung-Sun Lee PDF N/A Towards Unified Neural Decoding of Perceived, Spoken and Imagined Speech from EEG Signals
FluidML:快速且内存高效的推理优化 Jinjie Liu PDF N/A FluidML: Fast and Memory Efficient Inference Optimization
重新思考“热图+蒙特卡洛树搜索”范式用于解决大规模旅行商问题 Xuanhao Pan PDF N/A Rethinking the "Heatmap + Monte Carlo Tree Search" Paradigm for Solving Large Scale TSP
使用AI编程:评估ChatGPT、Gemini、AlphaCode和GitHub Copilot对程序员的效果 Md Kamrul Siam PDF N/A Programming with AI: Evaluating ChatGPT, Gemini, AlphaCode, and GitHub Copilot for Programmers
针对自动语音识别系统的可转移对抗攻击 Xiaoxue Gao PDF N/A Transferable Adversarial Attacks against ASR
利用视觉基础模型实现高性能、无需训练的开放词汇分割 Yuheng Shi PDF N/A Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation
HateGPT:利用GPT-3.5 Turbo在X平台上对抗仇恨言论 Aniket Deroy PDF N/A HateGPT: Unleashing GPT-3.5 Turbo to Combat Hate Speech on X
全面实用的检索增强生成系统在医疗问答中的评估 Nghia Trung Ngo PDF N/A Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering
动态神经通信:计算机视觉与脑机接口的融合 Ji-Ha Park PDF N/A Dynamic Neural Communication: Convergence of Computer Vision and Brain-Computer Interface
经典验证量子学习优势与噪声 Yinghao Ma PDF N/A Classical Verification of Quantum Learning Advantages with Noises
JoyVASA:基于扩散的音频驱动面部动态和头部运动生成的人物和动物图像动画 Xuyang Cao PDF N/A JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation
RibCageImp:一种用于3D肋骨植入物生成的深度学习框架 Gyanendra Chaubey PDF N/A RibCageImp: A Deep Learning Framework for 3D Ribcage Implant Generation
Ghost-Connect Net:分布偏移下稀疏深度网络的泛化增强引导 Mary Isabelle Wisell PDF N/A Ghost-Connect Net: A Generalization-Enhanced Guidance For Sparse Deep Networks Under Distribution Shifts
信息性期权 Andrew Koh PDF N/A Informational Puts
基于双层LSTM的语音情感识别模型的改进与实现 Xiaoran Yang PDF N/A Improvement and Implementation of a Speech Emotion Recognition Model Based on Dual-Layer LSTM
动态技术影响分析:基于多任务学习的专利引用预测方法 Youngjin Seol PDF N/A Dynamic technology impact analysis: A multi-task learning approach to patent citation prediction
DeBaTeR:用于推荐的降噪二分时间图 Xinyu He PDF N/A DeBaTeR: Denoising Bipartite Temporal Graph for Recommendation
LEAP:D -- 一种新颖的基于提示的领域泛化航空目标检测方法 Chanyeong Park PDF N/A LEAP:D -- A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection
SAFES:负责任人工智能的顺序隐私和公平增强数据合成 Spencer Giddens PDF N/A SAFES: Sequential Privacy and Fairness Enhancing Data Synthesis for Responsible AI
凝视奖励:眼动作为混合视觉觅食中人类与AI决策的透镜 Bo Wang PDF N/A Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging
混合深度加性神经网络 Gyu Min Kim PDF N/A Hybrid deep additive neural networks
推进扩散模型:无别名重采样与增强旋转等变性 Md Fahim Anjum PDF N/A Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance
通过脑电图解码和潜在嵌入整合实现可扩展的手写交流 Jun-Young Kim PDF N/A Towards Scalable Handwriting Communication via EEG Decoding and Latent Embedding Integration
人工智能理论思维与自我引导的社会组织 Michael S. Harré PDF N/A Artificial Theory of Mind and Self-Guided Social Organisation
心智理论增强集体智慧 Michael S. Harré PDF N/A Theory of Mind Enhances Collective Intelligence
非结构化文本增强的开放域对话系统:系统性综述 Longxuan Ma PDF N/A Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
基于理性与先天价值驱动的强化学习 Qin Yang PDF N/A Rationality based Innate-Values-driven Reinforcement Learning
《乐观主义者》:迈向全自动图论研究 Randy Davila PDF N/A The \emph{Optimist}: Towards Fully Automated Graph Theory Research
DyGASR:基于表面对齐的动态广义指数溅射技术加速三维网格重建 Shengchao Zhao PDF N/A DyGASR: Dynamic Generalized Exponential Splatting with Surface Alignment for Accelerated 3D Mesh Reconstruction
VidMan:利用视频扩散模型中的隐式动态实现有效的机器人操作 Youpeng Wen PDF N/A VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation
GRAINRec:基于图和注意力集成的实时会话推荐方法 Bhavtosh Rath PDF N/A GRAINRec: Graph and Attention Integrated Approach for Real-Time Session-Based Item Recommendations
Mono2Stereo:单目知识迁移以增强立体匹配 Yuran Wang PDF N/A Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching
UniHOI:学习快速、密集且可泛化的第一人称手部物体交互视频的4D重建 Chengbo Yuan PDF N/A UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos
微分隐私的拉普拉斯变换解释 Rishav Chourasia PDF N/A Laplace Transform Interpretation of Differential Privacy
早产儿视网膜病变诊断中的对抗性血管揭示半监督分割 Gozde Merve Demirci PDF N/A Adversarial Vessel-Unveiling Semi-Supervised Segmentation for Retinopathy of Prematurity Diagnosis
快速概率蛇形算法 Jérôme Gilles PDF N/A Fast probabilistic snake algorithm
ABCI 3.0:日本领先AI基础设施的演进 Ryousei Takano PDF N/A ABCI 3.0: Evolution of the leading AI infrastructure in Japan
用于成像的计算超表面光学元件 Charles Roques-Carmes PDF N/A Computational metaoptics for imaging
深度神经网络最优结构发现的复杂度感知训练 Valentin Frank Ingmar Guenter PDF N/A Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery
扫描:为提高数据效率的自举对比预训练 Yangyang Guo PDF N/A SCAN: Bootstrapping Contrastive Pre-training for Data Efficiency
DROJ:针对大型语言模型的提示驱动攻击 Leyang Hu PDF N/A DROJ: A Prompt-Driven Attack against Large Language Models
复杂系统神经图模拟器 Hoyun Choi PDF N/A Neural Graph Simulator for Complex Systems
FxTS-Net:神经ODE的固定时间稳定学习框架 Chaoyang Luo PDF N/A FxTS-Net: Fixed-Time Stable Learning Framework for Neural ODEs
基于数据初始化的多模态分布高效学习和采样 Frederic Koehler PDF N/A Efficiently learning and sampling multimodal distributions with data-based initialization
P-MMEval:一种并行多语言多任务基准,用于对大型语言模型进行一致性评估 Yidan Zhang PDF N/A P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
降低推理成本——通过稀疏注意力机制优化思维链的路径 Libo Wang PDF N/A Reducing Reasoning Costs -- The Path of Optimization for Chain of Thought via Sparse Attention Mechanism
星际物体探索中的信息最优多航天器定位 Arna Bhardwaj PDF N/A Information-Optimal Multi-Spacecraft Positioning for Interstellar Object Exploration
个性化帮助优化低技能用户的策略 Feng Gu PDF N/A Personalized Help for Optimizing Low-Skilled Users' Strategy
VCBench:一个可控的基准测试,用于评估视频认知中的符号和抽象挑战 Chenglin Li PDF N/A VCBench: A Controllable Benchmark for Symbolic and Abstract Challenges in Video Cognition
挑衅性问题:在生成式人工智能中,“包容性”让谁受益? Nari Johnson PDF N/A Provocation: Who benefits from "inclusion" in Generative AI?
遥感影像语义分割中视觉变换器与卷积神经网络的启发式比较 Ashim Dahal PDF N/A Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery