| 魔法羽毛笔:智能交互式图像编辑系统 |
Zichen Liu |
PDF |
N/A |
MagicQuill: An Intelligent Interactive Image Editing System |
| 视觉变换器中注意力转移的惊人有效性 |
Alexander C. Li |
PDF |
N/A |
On the Surprising Effectiveness of Attention Transfer for Vision Transformers |
| 一种基于贝叶斯优化的机器翻译重排序方法 |
Julius Cheng |
PDF |
N/A |
A Bayesian Optimization Approach to Machine Translation Reranking |
| CropCraft:用于作物植物三维重建的逆向程序建模 |
Albert J. Zhai |
PDF |
N/A |
CropCraft: Inverse Procedural Modeling for 3D Reconstruction of Crop Plants |
| 利用多模态模型中的多尺度对齐推进细粒度视觉理解 |
Wei Wang |
PDF |
N/A |
Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models |
| 零样本知识测试的LLM幻觉推理 |
Seongmin Lee |
PDF |
N/A |
LLM Hallucination Reasoning with Zero-shot Knowledge Test |
| 压缩注意力:加速长上下文长度大型语言模型推理 |
Coleman Hooper |
PDF |
N/A |
Squeezed Attention: Accelerating Long Context Length LLM Inference |
| 非线性单变量模型的条件回归 |
Yantao Wu |
PDF |
N/A |
Conditional regression for the Nonlinear Single-Variable Model |
| 面向软件工程的开源机器学习模型和数据集分类研究 |
Alexandra González |
PDF |
N/A |
Towards a Classification of Open-Source ML Models and Datasets for Software Engineering |
| 神经DEM——工业颗粒流的实时模拟 |
Benedikt Alkin |
PDF |
N/A |
NeuralDEM -- Real-time Simulation of Industrial Particulate Flows |
| 通过潜在偏好优化进行自适应解码 |
Shehzaad Dhuliawala |
PDF |
N/A |
Adaptive Decoding via Latent Preference Optimization |
| Med-Bot:一款人工智能驱动的助手,提供准确可靠的医疗信息 |
Ahan Bhatt |
PDF |
N/A |
Med-Bot: An AI-Powered Assistant to Provide Accurate and Reliable Medical Information |
| 机器学习模型是如何变化的? |
Joel Castaño |
PDF |
N/A |
How do Machine Learning Models Change? |
| 神经算子可以玩动态斯塔克尔伯格博弈 |
Guillermo Alvarez |
PDF |
N/A |
Neural Operators Can Play Dynamic Stackelberg Games |
| 语言生成的局限性:幻觉与模式崩溃之间的权衡 |
Alkis Kalavasis |
PDF |
N/A |
On the Limits of Language Generation: Trade-Offs Between Hallucination and Mode Collapse |
| MCCE:缺失感知因果概念解释器 |
Jifan Gao |
PDF |
N/A |
MCCE: Missingness-aware Causal Concept Explainer |
| 一类二次-双线性Wasserstein分布鲁棒博弈的纳什均衡寻求 |
Georgios Pantazis |
PDF |
N/A |
Nash equilibrium seeking for a class of quadratic-bilinear Wasserstein distributionally robust games |
| 从治疗前后重复测量随机对照试验中,对事实效能估算的反事实不确定性量化 |
Xingya Wang |
PDF |
N/A |
Counterfactual Uncertainty Quantification of Factual Estimand of Efficacy from Before-and-After Treatment Repeated Measures Randomized Controlled Trials |
| 一次性操作策略学习:通过建立接触类比 |
Yuyao Liu |
PDF |
N/A |
One-Shot Manipulation Strategy Learning by Making Contact Analogies |
| 在商用硬件上本地部署大规模音乐AI模型 |
Xun Zhou |
PDF |
N/A |
Local deployment of large-scale music AI models on commodity hardware |
| 基于视觉的工业环境中透明塑料袋操作 |
F. Adetunji |
PDF |
N/A |
Vision-based Manipulation of Transparent Plastic Bags in Industrial Setups |
| MICCAI-CDMRI 2023 QuantConn挑战赛成果:通过协调扩散MRI预处理实现稳健定量连接 |
Nancy R. Newlin |
PDF |
N/A |
MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI |
| PTR:面向大型语言模型的精准驱动工具推荐 |
Hang Gao |
PDF |
N/A |
PTR: Precision-Driven Tool Recommendation for Large Language Models |
| 道德基础微博语料库 |
Renjie Cao |
PDF |
N/A |
The Moral Foundations Weibo Corpus |
| TREC 2024 RAG赛道初始Nugget评估结果与AutoNuggetizer框架 |
Ronak Pradeep |
PDF |
N/A |
Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework |
| 局部-全局注意力:一种用于多尺度特征融合的自适应机制 |
Yifan Shao |
PDF |
N/A |
Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration |
| 利用大型语言模型加速知识图谱与本体工程 |
Cogan Shimizu |
PDF |
N/A |
Accelerating Knowledge Graph and Ontology Engineering with Large Language Models |
| 混合波束图案和干扰控制下的低轨卫星通信延迟优化 |
Qianqian Zhang |
PDF |
N/A |
Latency Optimization in LEO Satellite Communications with Hybrid Beam Pattern and Interference Control |
| 评估DINOv2自监督学习视觉Transformer模型在从MRI图像中分割左心房方面的性能 |
Bipasha Kundu |
PDF |
N/A |
Assessing the Performance of the DINOv2 Self-supervised Learning Vision Transformer Model for the Segmentation of the Left Atrium from MRI Images |
| LLaMA-Mesh:将3D网格生成与语言模型统一 |
Zhengyi Wang |
PDF |
N/A |
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models |
| SMILE-乌胡拉挑战赛——从超高分辨率7T磁共振血管造影中进行介观尺度的小血管分割 |
Soumick Chatterjee |
PDF |
N/A |
SMILE-UHURA Challenge -- Small Vessel Segmentation at Mesoscopic Scale from Ultra-High Resolution 7T Magnetic Resonance Angiograms |
| 缺失数据可解释机器学习模型的专家研究 |
Lena Stempfle |
PDF |
N/A |
Expert Study on Interpretable Machine Learning Models with Missing Data |
| 采用RAG进行LLM辅助的未来车辆设计 |
Vahid Zolfaghari |
PDF |
N/A |
Adopting RAG for LLM-Aided Future Vehicle Design |
| 蜘蛛:任意到多模态大语言模型 |
Jinxiang Lai |
PDF |
N/A |
Spider: Any-to-Many Multimodal LLM |
| BabyLM挑战赛:探索变异集对语言模型训练效率的影响 |
Akari Haga |
PDF |
N/A |
BabyLM Challenge: Exploring the Effect of Variation Sets on Language Model Training Efficiency |
| 基础模型驱动软件(FMware)的软件性能工程 |
Haoxiang Zhang |
PDF |
N/A |
Software Performance Engineering for Foundation Model-Powered Software (FMware) |
| 通过图重写自动重构本质规范 |
Ian Miguel |
PDF |
N/A |
Automating Reformulation of Essence Specifications via Graph Rewriting |
| 动态重建手-物体交互的分布式力感知接触表示 |
Zhenjun Yu |
PDF |
N/A |
Dynamic Reconstruction of Hand-Object Interaction with Distributed Force-aware Contact Representation |
| VPBSD:基于船舶模式的半监督蒸馏方法,用于高效的3D显微脑血管分割 |
Xi Lin |
PDF |
N/A |
VPBSD:Vessel-Pattern-Based Semi-Supervised Distillation for Efficient 3D Microscopic Cerebrovascular Segmentation |
| 自适应偏差学习用于视觉异常检测与数据污染 |
Anindya Sundar Das |
PDF |
N/A |
Adaptive Deviation Learning for Visual Anomaly Detection with Data Contamination |
| 运动放大图像处理 |
Nadaniela Egidi |
PDF |
N/A |
Image Processing for Motion Magnification |
| OOD-SEG:利用稀疏多类正样本标注进行图像分割的分布外检测 |
Junwen Wang |
PDF |
N/A |
OOD-SEG: Out-Of-Distribution detection for image SEGmentation with sparse multi-class positive-only annotations |
| MFTIQ:具有独立匹配质量评估的多流追踪器 |
Jonas Serych |
PDF |
N/A |
MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation |
| 拼凑一切:验证多跳多模态声明 |
Haoran Wang |
PDF |
N/A |
Piecing It All Together: Verifying Multi-Hop Multimodal Claims |
| 基于方程的数据驱动流动预算和动力学识别 |
Nataliya Sevryugina |
PDF |
N/A |
Equation-informed data-driven identification of flow budgets and dynamics |
| OpenGeMM:一种高利用率GeMM加速器生成器,配备轻量级RISC-V控制与紧密内存耦合 |
Xiaoling Yi |
PDF |
N/A |
OpenGeMM: A High-Utilization GeMM Accelerator Generator with Lightweight RISC-V Control and Tight Memory Coupling |
| 提示未知:检测黑盒模型中的隐藏后门 |
Zi-Xuan Huang |
PDF |
N/A |
Prompting the Unseen: Detecting Hidden Backdoors in Black-Box Models |
| 《有限数据下的语言模型微调实用指南》 |
Márton Szép |
PDF |
N/A |
A Practical Guide to Fine-tuning Language Models with Limited Data |
| 使用智能边缘传感器系统进行无标记人体步态分析 |
Eva Katharina Bauer |
PDF |
N/A |
Marker-free Human Gait Analysis using a Smart Edge Sensor System |
| 导航风险:基于大语言模型代理的安全、隐私和伦理威胁调查 |
Yuyou Gan |
PDF |
N/A |
Navigating the Risks: A Survey of Security, Privacy, and Ethics Threats in LLM-Based Agents |
| 随机化诚实拍卖与学习代理 |
Gagan Aggarwal |
PDF |
N/A |
Randomized Truthful Auctions with Learning Agents |
| 基于生成对抗网络的低剂量计算机断层扫描图像去噪架构 |
Yunuo Wang |
PDF |
N/A |
GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising |
| 张量并行大型语言模型推理中的通信压缩 |
Jan Hansen-Palmus |
PDF |
N/A |
Communication Compression for Tensor Parallel LLM Inference |
| 迈向科学创新的凝聚性人工智能与仿真软件生态系统 |
Michael A. Heroux |
PDF |
N/A |
Toward a Cohesive AI and Simulation Software Ecosystem for Scientific Innovation |
| 扩散模型的黄金噪声:一种学习框架 |
Zikai Zhou |
PDF |
N/A |
Golden Noise for Diffusion Models: A Learning Framework |
| 基于强化学习的侧梁设计优化方法的发展 |
Aditya Borse |
PDF |
N/A |
Developement of Reinforcement Learning based Optimisation Method for Side-Sill Design |
| 法律文本中可读性指标的应用:一项系统性文献综述 |
Yu Han |
PDF |
N/A |
The Use of Readability Metrics in Legal Text: A Systematic Literature Review |
| 战略性牺牲:自组织机器人集群定位以提升检测效率 |
Sneha Ramshanker |
PDF |
N/A |
Strategic Sacrifice: Self-Organized Robot Swarm Localization for Inspection Productivity |
| MM-Eval:一个用于现代蒙古语评估的分层基准 |
Mengyuan Zhang |
PDF |
N/A |
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs |
| 图像匹配滤波与平面及超越平面的精细化处理 |
Fabio Bellavia |
PDF |
N/A |
Image Matching Filtering and Refinement by Planes and Beyond |
| 用于压缩感知的稀疏贝叶斯生成模型 |
Benedikt Böck |
PDF |
N/A |
Sparse Bayesian Generative Modeling for Compressive Sensing |
| 什么是好的BIM设计:设计行为与质量之间的量化联系 |
Xiang-Rui Ni |
PDF |
N/A |
What makes a good BIM design: quantitative linking between design behavior and quality |
| 图神经网络与微分方程:一种用于流体流动数据同化的混合方法 |
M. Quattromini |
PDF |
N/A |
Graph Neural Networks and Differential Equations: A hybrid approach for data assimilation of fluid flows |
| 残差下降路径:增强残差连接上的特征重用 |
Sejik Park |
PDF |
N/A |
ResidualDroppath: Enhancing Feature Reuse over Residual Connections |
| 肾细胞癌亚型分类:从多分辨率定位中学习 |
Mohamad Mohamad |
PDF |
N/A |
Renal Cell Carcinoma subtyping: learning from multi-resolution localization |
| 使用阴道镜图像进行宫颈癌前风险分类的可解释注意力模型 |
Smith K. Khare |
PDF |
N/A |
An Explainable Attention Model for Cervical Precancer Risk Classification using Colposcopic Images |
| 利用机器学习实现自由电子激光脉冲功率的单发测量 |
Till Korten |
PDF |
N/A |
Harnessing Machine Learning for Single-Shot Measurement of Free Electron Laser Pulse Power |
| SINETRA:一种用于评估行为动物中单个神经元追踪的多功能框架 |
Raphael Reme |
PDF |
N/A |
SINETRA: a Versatile Framework for Evaluating Single Neuron Tracking in Behaving Animals |
| Caravan MultiMet:通过整合多个天气现报和预报扩展Caravan功能 |
Guy Shalev |
PDF |
N/A |
Caravan MultiMet: Extending Caravan with Multiple Weather Nowcasts and Forecasts |
| 长尾目标检测预训练:动态重平衡对比学习与双重重构 |
Chen-Long Duan |
PDF |
N/A |
Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction |
| DiffRoad:为自动驾驶车辆测试生成真实且多样化的道路场景 |
Junjie Zhou |
PDF |
N/A |
DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing |
| 图像重现:通过多模态大语言模型生成相同图像来评估文本到图像模型 |
Chutian Meng |
PDF |
N/A |
Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models |
| 学习高效且可证明收敛的分割方法 |
L. M. Kreusser |
PDF |
N/A |
Learning efficient and provably convergent splitting methods |
| 从自然语言指令中提取模糊时间要求的机器人任务 |
Sascha Sucker |
PDF |
N/A |
Robot Tasks with Fuzzy Time Requirements from Natural Language Instructions |
| 每个人都应被倾听:分析应用于荷兰语音数据的自动语音识别模型中的预测性别偏见 |
Rik Raes |
PDF |
N/A |
Everyone deserves their voice to be heard: Analyzing Predictive Gender Bias in ASR Models Applied to Dutch Speech Data |
| 材料的人工智能驱动逆向设计:过去、现在与未来 |
Xiao-Qi Han |
PDF |
N/A |
AI-driven inverse design of materials: Past, present and future |
| 一个适用于逻辑综合机器学习任务的自适应开源数据集生成框架 |
Liwei Ni |
PDF |
N/A |
An Adaptive Open-Source Dataset Generation Framework for Machine Learning Tasks in Logic Synthesis |
| SAG-ViT:一种基于图注意力机制的尺度感知、高保真补丁化方法,适用于视觉变换器 |
Shravan Venkatraman |
PDF |
N/A |
SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers |
| 以脚本为中心的行为理解助力自闭症谱系障碍诊断 |
Wenxing Liu |
PDF |
N/A |
Script-centric behavior understanding for assisted autism spectrum disorder diagnosis |
| 利用卫星影像中的阴影长度进行建筑物高度估计 |
Mahd Qureshi |
PDF |
N/A |
Building Height Estimation Using Shadow Length in Satellite Imagery |
| 量子机器学习:量子计算与机器学习的交融 |
Jun Qi |
PDF |
N/A |
Quantum Machine Learning: An Interplay Between Quantum Computing and Machine Learning |
| 非对比计算机断层扫描图像中缺血性脑卒中病变的自动分割,以提升治疗效果和预后 |
Toufiq Musah |
PDF |
N/A |
Automated Segmentation of Ischemic Stroke Lesions in Non-Contrast Computed Tomography Images for Enhanced Treatment and Prognosis |
| 想象中的言语和视觉意象作为脑机接口的直观范式 |
Seo-Hyun Lee |
PDF |
N/A |
Imagined Speech and Visual Imagery as Intuitive Paradigms for Brain-Computer Interfaces |
| 用于网络安全问题在线学习的固有可解释性与不确定性感知模型 |
Benjamin Kolicic |
PDF |
N/A |
Inherently Interpretable and Uncertainty-Aware Models for Online Learning in Cyber-Security Problems |
| 少即是多:通过因果传播子结构检测未见领域虚假新闻 |
Shuzhi Gong |
PDF |
N/A |
Less is More: Unseen Domain Fake News Detection via Causal Propagation Substructures |
| 分子模拟的概率生成框架调查 |
Richard John |
PDF |
N/A |
A survey of probabilistic generative frameworks for molecular simulations |
| 指令驱动的红外-可见光图像融合:为多样化的下游任务量身定制 |
Zengyi Yang |
PDF |
N/A |
Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks |
| 核掩码是否足以提升域外泛化能力?深入探讨组织病理学中的癌症分类问题 |
Dhananjay Tomar |
PDF |
N/A |
Are nuclear masks all you need for improved out-of-domain generalisation? A closer look at cancer classification in histopathology |
| DSCformer:一种双分支网络,结合增强型动态蛇卷积和SegFormer用于裂缝分割 |
Kaiwei Yu |
PDF |
N/A |
DSCformer: A Dual-Branch Network Integrating Enhanced Dynamic Snake Convolution and SegFormer for Crack Segmentation |
| LTLf+ 和 PPLTL+:将LTLf和PPLTL扩展至无限轨迹 |
Benjamin Aminof |
PDF |
N/A |
LTLf+ and PPLTL+: Extending LTLf and PPLTL to Infinite Traces |
| 分布式随机梯度下降平均算法的稳定性和泛化性 |
Miaoxi Zhu |
PDF |
N/A |
Stability and Generalization for Distributed SGDA |
| 3D医学影像的时间到事件预训练 |
Zepeng Huo |
PDF |
N/A |
Time-to-Event Pretraining for 3D Medical Imaging |
| 您的固定水印易碎:面向EaaS版权保护的语义感知水印 |
Zekun Fei |
PDF |
N/A |
Your Fixed Watermark is Fragile: Towards Semantic-Aware Watermark for EaaS Copyright Protection |
| 多尺度生成模型用于快速采样 |
Xiongye Xiao |
PDF |
N/A |
Multi-scale Generative Modeling for Fast Sampling |
| 自适应增强一致性学习:一种用于遥感数据的半监督分割框架 |
Hui Ye |
PDF |
N/A |
Adaptively Augmented Consistency Learning: A Semi-supervised Segmentation Framework for Remote Sensing |
| 近似变分贝叶斯逆强化学习用于大规模语言模型对齐 |
Yuang Cai |
PDF |
N/A |
Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment |
| 轻量级Transformer在设备端语音情感识别中的重参数化 |
Zixing Zhang |
PDF |
N/A |
Re-Parameterization of Lightweight Transformer for On-Device Speech Emotion Recognition |
| 改进用于稳态对流占优问题的hp-变分物理信息神经网络 |
Thivin Anandh |
PDF |
N/A |
Improving hp-Variational Physics-Informed Neural Networks for Steady-State Convection-Dominated Problems |
| DriveThru:一个用于印度尼西亚地方语言档案的文档提取平台和基准数据集 |
MohammadRifqi Farhansyah |
PDF |
N/A |
DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language Archives |
| Pie:为大型语言模型推理汇聚CPU内存 |
Yi Xu |
PDF |
N/A |
Pie: Pooling CPU Memory for LLM Inference |
| 时间序列数据的近似概率推断:一种具有时间感知能力的鲁棒潜高斯模型 |
Anton Johansson |
PDF |
N/A |
Approximate Probabilistic Inference forTime-Series Data A Robust Latent Gaussian Model With Temporal Awareness |
| 从Hinode SOT/SP观测中收集的太阳偏振光谱的压缩方法 |
Jargalmaa Batmunkh |
PDF |
N/A |
Compression Method for Solar Polarization Spectra Collected from Hinode SOT/SP Observations |
| 探索在医学影像中利用CLIP进行零样本异常检测:我们是否已经达到目标? |
Aldo Marzullo |
PDF |
N/A |
Exploring Zero-Shot Anomaly Detection with CLIP in Medical Imaging: Are We There Yet? |
| DT-JRD:基于深度变换器的机器视频编码可识别差异预测模型 |
Junqi Liu |
PDF |
N/A |
DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines |
| 基于脑电图的语音解码:一种利用多核集成扩散模型的新方法 |
Soowon Kim |
PDF |
N/A |
EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models |
| LHRS-Bot-Nova:用于遥感视觉语言解释的改进型多模态大型语言模型 |
Zhenshi Li |
PDF |
N/A |
LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation |
| DTELS:面向时间轴摘要动态粒度的研究 |
Chenlong Zhang |
PDF |
N/A |
DTELS: Towards Dynamic Granularity of Timeline Summarization |
| 使用白盒对抗攻击增强高能物理中的泛化能力 |
Franck Rothen |
PDF |
N/A |
Enhancing generalization in high energy physics using white-box adversarial attacks |
| 学习轻型外骨骼的手部状态估计 |
Gabriele Abbate |
PDF |
N/A |
Learning Hand State Estimation for a Light Exoskeleton |
| LLV-FSR:利用大规模语言-视觉先验进行人脸超分辨率 |
Chenyang Wang |
PDF |
N/A |
LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution |
| StreamAdapter:从上下文流中进行高效测试时间适应 |
Dilxat Muhtar |
PDF |
N/A |
StreamAdapter: Efficient Test Time Adaptation from Contextual Streams |
| 基于多源异构迁移学习的跨域推荐集中-分布式迁移模型 |
Ke Xu |
PDF |
N/A |
A Centralized-Distributed Transfer Model for Cross-Domain Recommendation Based on Multi-Source Heterogeneous Transfer Learning |
| 利用辅助分类进行肋骨骨折分割 |
Harini G. |
PDF |
N/A |
Leveraging Auxiliary Classification for Rib Fracture Segmentation |
| 多模态大型语言模型中的跨模态一致性 |
Xiang Zhang |
PDF |
N/A |
Cross-Modal Consistency in Multimodal Large Language Models |
| 利用多个大型语言模型进行信息检索:以生物多样性出版物中的深度学习方法为例的研究 |
Vamsi Krishna Kommineni |
PDF |
N/A |
Harnessing multiple LLMs for Information Retrieval: A case study on Deep Learning methodologies in Biodiversity publications |
| LES-Talker:线性情感空间中用于生成说话人头部的细粒度情感编辑 |
Guanwen Feng |
PDF |
N/A |
LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space |
| 面向基于原型的去中心化学习的有效压缩与通信 |
Pablo Fernández-Piñeiro |
PDF |
N/A |
Towards efficient compression and communication for prototype-based decentralized learning |
| ChatGPT在视听深度伪造检测中的表现如何:ChatGPT、AI模型与人类感知能力的比较研究 |
Sahibzada Adil Shahzad |
PDF |
N/A |
How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception |
| 胡须:数据集蒸馏对抗鲁棒性基准测试 |
Zheng Zhou |
PDF |
N/A |
BEARD: Benchmarking the Adversarial Robustness for Dataset Distillation |
| 重新思考加权平均模型合并 |
Hu Wang |
PDF |
N/A |
Rethinking Weight-Averaged Model-merging |
| 自动化自动评分:大型语言模型作为入门编程测试套件生成器 |
Umar Alkafaween |
PDF |
N/A |
Automating Autograding: Large Language Models as Test Suite Generators for Introductory Programming |
| 越狱攻击与多模态生成模型防御:综述 |
Xuannan Liu |
PDF |
N/A |
Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey |
| DAHL:通过生物医学基准数据集,对长篇文本进行领域特定自动幻觉评估 |
Jean Seo |
PDF |
N/A |
DAHL: Domain-specific Automated Hallucination Evaluation of Long-Form Text through a Benchmark Dataset in Biomedicine |
| 跨时空:一种时空单元化模型用于交通流量预测 |
Weilin Ruan |
PDF |
N/A |
Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting |
| 嵌入空间分配与角度-范数联合分类器用于少样本类增量学习 |
Dunwei Tu |
PDF |
N/A |
Embedding Space Allocation with Angle-Norm Joint Classifiers for Few-Shot Class-Incremental Learning |
| 通过模型增强提升语言模型在金融领域的适应性 |
Kota Tanabe |
PDF |
N/A |
Enhancing Financial Domain Adaptation of Language Models via Model Augmentation |
| 统一神经解码:从脑电信号中感知、口语和想象语音的解码 |
Jung-Sun Lee |
PDF |
N/A |
Towards Unified Neural Decoding of Perceived, Spoken and Imagined Speech from EEG Signals |
| FluidML:快速且内存高效的推理优化 |
Jinjie Liu |
PDF |
N/A |
FluidML: Fast and Memory Efficient Inference Optimization |
| 重新思考“热图+蒙特卡洛树搜索”范式用于解决大规模旅行商问题 |
Xuanhao Pan |
PDF |
N/A |
Rethinking the "Heatmap + Monte Carlo Tree Search" Paradigm for Solving Large Scale TSP |
| 使用AI编程:评估ChatGPT、Gemini、AlphaCode和GitHub Copilot对程序员的效果 |
Md Kamrul Siam |
PDF |
N/A |
Programming with AI: Evaluating ChatGPT, Gemini, AlphaCode, and GitHub Copilot for Programmers |
| 针对自动语音识别系统的可转移对抗攻击 |
Xiaoxue Gao |
PDF |
N/A |
Transferable Adversarial Attacks against ASR |
| 利用视觉基础模型实现高性能、无需训练的开放词汇分割 |
Yuheng Shi |
PDF |
N/A |
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation |
| HateGPT:利用GPT-3.5 Turbo在X平台上对抗仇恨言论 |
Aniket Deroy |
PDF |
N/A |
HateGPT: Unleashing GPT-3.5 Turbo to Combat Hate Speech on X |
| 全面实用的检索增强生成系统在医疗问答中的评估 |
Nghia Trung Ngo |
PDF |
N/A |
Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering |
| 动态神经通信:计算机视觉与脑机接口的融合 |
Ji-Ha Park |
PDF |
N/A |
Dynamic Neural Communication: Convergence of Computer Vision and Brain-Computer Interface |
| 经典验证量子学习优势与噪声 |
Yinghao Ma |
PDF |
N/A |
Classical Verification of Quantum Learning Advantages with Noises |
| JoyVASA:基于扩散的音频驱动面部动态和头部运动生成的人物和动物图像动画 |
Xuyang Cao |
PDF |
N/A |
JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation |
| RibCageImp:一种用于3D肋骨植入物生成的深度学习框架 |
Gyanendra Chaubey |
PDF |
N/A |
RibCageImp: A Deep Learning Framework for 3D Ribcage Implant Generation |
| Ghost-Connect Net:分布偏移下稀疏深度网络的泛化增强引导 |
Mary Isabelle Wisell |
PDF |
N/A |
Ghost-Connect Net: A Generalization-Enhanced Guidance For Sparse Deep Networks Under Distribution Shifts |
| 信息性期权 |
Andrew Koh |
PDF |
N/A |
Informational Puts |
| 基于双层LSTM的语音情感识别模型的改进与实现 |
Xiaoran Yang |
PDF |
N/A |
Improvement and Implementation of a Speech Emotion Recognition Model Based on Dual-Layer LSTM |
| 动态技术影响分析:基于多任务学习的专利引用预测方法 |
Youngjin Seol |
PDF |
N/A |
Dynamic technology impact analysis: A multi-task learning approach to patent citation prediction |
| DeBaTeR:用于推荐的降噪二分时间图 |
Xinyu He |
PDF |
N/A |
DeBaTeR: Denoising Bipartite Temporal Graph for Recommendation |
| LEAP:D -- 一种新颖的基于提示的领域泛化航空目标检测方法 |
Chanyeong Park |
PDF |
N/A |
LEAP:D -- A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection |
| SAFES:负责任人工智能的顺序隐私和公平增强数据合成 |
Spencer Giddens |
PDF |
N/A |
SAFES: Sequential Privacy and Fairness Enhancing Data Synthesis for Responsible AI |
| 凝视奖励:眼动作为混合视觉觅食中人类与AI决策的透镜 |
Bo Wang |
PDF |
N/A |
Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging |
| 混合深度加性神经网络 |
Gyu Min Kim |
PDF |
N/A |
Hybrid deep additive neural networks |
| 推进扩散模型:无别名重采样与增强旋转等变性 |
Md Fahim Anjum |
PDF |
N/A |
Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance |
| 通过脑电图解码和潜在嵌入整合实现可扩展的手写交流 |
Jun-Young Kim |
PDF |
N/A |
Towards Scalable Handwriting Communication via EEG Decoding and Latent Embedding Integration |
| 人工智能理论思维与自我引导的社会组织 |
Michael S. Harré |
PDF |
N/A |
Artificial Theory of Mind and Self-Guided Social Organisation |
| 心智理论增强集体智慧 |
Michael S. Harré |
PDF |
N/A |
Theory of Mind Enhances Collective Intelligence |
| 非结构化文本增强的开放域对话系统:系统性综述 |
Longxuan Ma |
PDF |
N/A |
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey |
| 基于理性与先天价值驱动的强化学习 |
Qin Yang |
PDF |
N/A |
Rationality based Innate-Values-driven Reinforcement Learning |
| 《乐观主义者》:迈向全自动图论研究 |
Randy Davila |
PDF |
N/A |
The \emph{Optimist}: Towards Fully Automated Graph Theory Research |
| DyGASR:基于表面对齐的动态广义指数溅射技术加速三维网格重建 |
Shengchao Zhao |
PDF |
N/A |
DyGASR: Dynamic Generalized Exponential Splatting with Surface Alignment for Accelerated 3D Mesh Reconstruction |
| VidMan:利用视频扩散模型中的隐式动态实现有效的机器人操作 |
Youpeng Wen |
PDF |
N/A |
VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation |
| GRAINRec:基于图和注意力集成的实时会话推荐方法 |
Bhavtosh Rath |
PDF |
N/A |
GRAINRec: Graph and Attention Integrated Approach for Real-Time Session-Based Item Recommendations |
| Mono2Stereo:单目知识迁移以增强立体匹配 |
Yuran Wang |
PDF |
N/A |
Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching |
| UniHOI:学习快速、密集且可泛化的第一人称手部物体交互视频的4D重建 |
Chengbo Yuan |
PDF |
N/A |
UniHOI: Learning Fast, Dense and Generalizable 4D Reconstruction for Egocentric Hand Object Interaction Videos |
| 微分隐私的拉普拉斯变换解释 |
Rishav Chourasia |
PDF |
N/A |
Laplace Transform Interpretation of Differential Privacy |
| 早产儿视网膜病变诊断中的对抗性血管揭示半监督分割 |
Gozde Merve Demirci |
PDF |
N/A |
Adversarial Vessel-Unveiling Semi-Supervised Segmentation for Retinopathy of Prematurity Diagnosis |
| 快速概率蛇形算法 |
Jérôme Gilles |
PDF |
N/A |
Fast probabilistic snake algorithm |
| ABCI 3.0:日本领先AI基础设施的演进 |
Ryousei Takano |
PDF |
N/A |
ABCI 3.0: Evolution of the leading AI infrastructure in Japan |
| 用于成像的计算超表面光学元件 |
Charles Roques-Carmes |
PDF |
N/A |
Computational metaoptics for imaging |
| 深度神经网络最优结构发现的复杂度感知训练 |
Valentin Frank Ingmar Guenter |
PDF |
N/A |
Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery |
| 扫描:为提高数据效率的自举对比预训练 |
Yangyang Guo |
PDF |
N/A |
SCAN: Bootstrapping Contrastive Pre-training for Data Efficiency |
| DROJ:针对大型语言模型的提示驱动攻击 |
Leyang Hu |
PDF |
N/A |
DROJ: A Prompt-Driven Attack against Large Language Models |
| 复杂系统神经图模拟器 |
Hoyun Choi |
PDF |
N/A |
Neural Graph Simulator for Complex Systems |
| FxTS-Net:神经ODE的固定时间稳定学习框架 |
Chaoyang Luo |
PDF |
N/A |
FxTS-Net: Fixed-Time Stable Learning Framework for Neural ODEs |
| 基于数据初始化的多模态分布高效学习和采样 |
Frederic Koehler |
PDF |
N/A |
Efficiently learning and sampling multimodal distributions with data-based initialization |
| P-MMEval:一种并行多语言多任务基准,用于对大型语言模型进行一致性评估 |
Yidan Zhang |
PDF |
N/A |
P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs |
| 降低推理成本——通过稀疏注意力机制优化思维链的路径 |
Libo Wang |
PDF |
N/A |
Reducing Reasoning Costs -- The Path of Optimization for Chain of Thought via Sparse Attention Mechanism |
| 星际物体探索中的信息最优多航天器定位 |
Arna Bhardwaj |
PDF |
N/A |
Information-Optimal Multi-Spacecraft Positioning for Interstellar Object Exploration |
| 个性化帮助优化低技能用户的策略 |
Feng Gu |
PDF |
N/A |
Personalized Help for Optimizing Low-Skilled Users' Strategy |
| VCBench:一个可控的基准测试,用于评估视频认知中的符号和抽象挑战 |
Chenglin Li |
PDF |
N/A |
VCBench: A Controllable Benchmark for Symbolic and Abstract Challenges in Video Cognition |
| 挑衅性问题:在生成式人工智能中,“包容性”让谁受益? |
Nari Johnson |
PDF |
N/A |
Provocation: Who benefits from "inclusion" in Generative AI? |
| 遥感影像语义分割中视觉变换器与卷积神经网络的启发式比较 |
Ashim Dahal |
PDF |
N/A |
Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery |