| FrugalNeRF:无需学习先验知识,快速收敛的小样本新视角合成方法 |
Chin-Yang Lin |
PDF |
N/A |
FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors |
| MvDrag3D:基于拖拽的多视角生成-重构先验的创意3D编辑 |
Honghua Chen |
PDF |
N/A |
MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors |
| 反思-长凳:通过反思探究人工智能的智能 |
Lingyu Li |
PDF |
N/A |
Reflection-Bench: probing AI intelligence with reflection |
| SAM2Long:利用无训练记忆树增强SAM 2的长视频分割能力 |
Shuangrui Ding |
PDF |
N/A |
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree |
| xGen-MM-Vid (BLIP-3-Video): 即使在视觉语言模型中,你只需要32个标记就能表示一段视频 |
Michael S. Ryoo |
PDF |
N/A |
xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs |
| 3DGS-Enhancer:通过视图一致的2D扩散先验增强无界3D高斯喷洒 |
Xi Liu |
PDF |
N/A |
3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors |
| Mini-InternVL:一个灵活迁移的口袋多模态模型,参数减少5%,性能保持90% |
Zhangwei Gao |
PDF |
N/A |
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance |
| 代理对模拟:从随意的纵向视频中学习交互行为模型 |
Gengshan Yang |
PDF |
N/A |
Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos |
| 阐明用于图像生成的语言模型的设计空间 |
Xuantong Liu |
PDF |
N/A |
Elucidating the design space of language models for image generation |
| 指南针评判者-1:一体化评判模型助力模型评估与进化 |
Maosong Cao |
PDF |
N/A |
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution |
| 重新审视深度特征重构在逻辑和结构工业异常检测中的应用 |
Sukanya Patra |
PDF |
N/A |
Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection |
| 具有有效输出的分布学习超越最坏情况 |
Nick Rittler |
PDF |
N/A |
Distribution Learning with Valid Outputs Beyond the Worst-Case |
| 知识编辑真的能纠正幻觉吗? |
Baixiang Huang |
PDF |
N/A |
Can Knowledge Editing Really Correct Hallucinations? |
| 通过梯度下降实现管状张量分解的隐式正则化 |
Santhosh Karnik |
PDF |
N/A |
Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent |
| 分析基于大型语言模型的机器翻译中的上下文贡献 |
Emmanouil Zaranis |
PDF |
N/A |
Analyzing Context Contributions in LLM-based Machine Translation |
| MoRE:在X光片、心电图和诊断报告上使用多模态对比预训练的Transformer模型 |
Samrajya Thapa |
PDF |
N/A |
MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report |
| 多中心MRI临床显著性前列腺癌深度放射组学检测:初步对比PI-RADS评估 |
G. A. Nketiah |
PDF |
N/A |
Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment |
| IBGP:通信多智能体系统中零样本鲁棒性的不完全拜占庭将军问题 |
Yihuan Mao |
PDF |
N/A |
IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems |
| LLaVA-KD:一种多模态大语言模型蒸馏框架 |
Yuxuan Cai |
PDF |
N/A |
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models |
| ToW:词语思考提升大型语言模型的推理能力 |
Zhikun Xu |
PDF |
N/A |
ToW: Thoughts of Words Improve Reasoning in Large Language Models |
| Sketch2Code:评估视觉语言模型在交互式网页设计原型制作中的应用 |
Ryan Li |
PDF |
N/A |
Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping |
| 通过检索增强语言模型构建编码助手 |
Xinze Li |
PDF |
N/A |
Building A Coding Assistant via the Retrieval-Augmented Language Model |
| 管理带宽:云辅助自动驾驶的关键 |
Alexander Krentsel |
PDF |
N/A |
Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving |
| 大型语言模型越狱的现实威胁模型 |
Valentyn Boreiko |
PDF |
N/A |
A Realistic Threat Model for Large Language Model Jailbreaks |
| 在医疗领域创建英泰代码转换机器翻译 |
Parinthapat Pengpun |
PDF |
N/A |
On Creating an English-Thai Code-switched Machine Translation in Medical Domain |
| 大型语言模型预训练蒸馏:设计空间探索 |
Hao Peng |
PDF |
N/A |
Pre-training Distillation for Large Language Models: A Design Space Exploration |
| 全面基准测试大型语言模型用于RNA二级结构预测 |
L. I. Zablocki |
PDF |
N/A |
Comprehensive benchmarking of large language models for RNA secondary structure prediction |
| 计算约束的数据选择 |
Junjie Oscar Yin |
PDF |
N/A |
Compute-Constrained Data Selection |
| CoT-TL:利用思维链推理进行低资源规划指令的时间知识表示 |
Kumar Manas |
PDF |
N/A |
CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning |
| 系统综述:用于社交媒体心理健康检测的机器学习与深度学习中的文本处理算法 |
Yuchen Cao |
PDF |
N/A |
Systematic Review: Text Processing Algorithms in Machine Learning and Deep Learning for Mental Health Detection on Social Media |
| 在过度参数化时代中集成方法的理论局限性 |
Niclas Dern |
PDF |
N/A |
Theoretical Limitations of Ensembles in the Age of Overparameterization |
| 改进视觉语言模型链式思维推理 |
Ruohong Zhang |
PDF |
N/A |
Improve Vision Language Model Chain-of-thought Reasoning |
| LASER:自主代理执行脚本以实现按需交通模拟 |
Hao Gao |
PDF |
N/A |
LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation |
| 对话生成信息:利用知识图谱的建议 |
Alex Clay |
PDF |
N/A |
Information for Conversation Generation: Proposals Utilising Knowledge Graphs |
| 一种用于图形化Stein变分推理的信赖域方法 |
Liam Pavlovic |
PDF |
N/A |
A Trust-Region Method for Graphical Stein Variational Inference |
| 利用人类视觉显著性训练更好的深度学习模型 |
Aidan Boyd |
PDF |
N/A |
Training Better Deep Learning Models Using Human Saliency |
| 多语言基准测试的污染报告 |
Sanchit Ahuja |
PDF |
N/A |
Contamination Report for Multilingual Benchmarks |
| RM-Bench:以微妙和风格为语言模型的奖励模型进行基准测试 |
Yantao Liu |
PDF |
N/A |
RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style |
| 魔法猪:LSH采样,用于高效的大型语言模型生成 |
Zhuoming Chen |
PDF |
N/A |
MagicPIG: LSH Sampling for Efficient LLM Generation |
| 一个利用合成图像协变量和纵向数据评估预测模型的框架 |
Simon Deltadahl |
PDF |
N/A |
A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data |
| 脉冲神经网络作为涌现群体代理的控制器 |
Kevin Zhu |
PDF |
N/A |
Spiking Neural Networks as a Controller for Emergent Swarm Agents |
| 学习如何按原则投票:神经网络集体决策的公理性洞察 |
Levin Hornischer |
PDF |
N/A |
Learning How to Vote With Principles: Axiomatic Insights Into the Collective Decisions of Neural Networks |
| 身体活动、蛋白质摄入与睡眠质量在肌肉蛋白质合成中的相互作用 |
Ayush Devkota |
PDF |
N/A |
The Interplay Between Physical Activity, Protein Consumption, and Sleep Quality in Muscle Protein Synthesis |
| 探索通过主动遗忘来改进解码器语言模型的跨语言迁移的预训练方法 |
Divyanshu Aggarwal |
PDF |
N/A |
Exploring Pretraining via Active Forgetting for Improving Cross Lingual Transfer for Decoder Language Models |
| 超越过滤:面向多模态大语言模型预训练的自适应图文质量增强 |
Han Huang |
PDF |
N/A |
Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining |
| 从标记到材料:利用语言模型助力科学发现 |
Yuwei Wan |
PDF |
N/A |
From Tokens to Materials: Leveraging Language Models for Scientific Discovery |
| 生成式人工智能辅助医学培训 |
Stefan Fritsch |
PDF |
N/A |
GenAI Assisting Medical Training |
| Griffon-G:通过大型多模态模型连接视觉-语言与视觉中心任务 |
Yufei Zhan |
PDF |
N/A |
Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models |
| Sparkle:掌握视觉语言模型中的基本空间能力可激发对复合空间推理的泛化能力 |
Yihong Tang |
PDF |
N/A |
Sparkle: Mastering Basic Spatial Capabilities in Vision Language Models Elicits Generalization to Composite Spatial Reasoning |
| DMM:使用打包秘密共享的差分隐私联邦学习分布式矩阵机制 |
Alexander Bienstock |
PDF |
N/A |
DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing |
| 度量作为变换:探索超越仿射变换的可解释神经网络 |
Suman Sapkota |
PDF |
N/A |
Metric as Transform: Exploring beyond Affine Transform for Interpretable Neural Network |
| 网络:复杂性的视觉语言 |
Blai Vidiella |
PDF |
N/A |
Networks: The Visual Language of Complexity |
| 林北不讲:新式英语标注的挑战 |
Lynnette Hui Xian Ng |
PDF |
N/A |
Limpeh ga li gong: Challenges in Singlish Annotations |
| 一个具有传染性越狱能力的捣乱者在诚实的小镇中制造了混乱 |
Tianyi Men |
PDF |
N/A |
A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns |
| 有限数据下持续学习的无监督重放策略 |
Anthony Bazhenov |
PDF |
N/A |
Unsupervised Replay Strategies for Continual Learning with Limited Data |
| 泛亚:一个完全开放的多语言多模态大语言模型,支持39种语言 |
Xiang Yue |
PDF |
N/A |
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages |
| 扭曲扩散:利用图像扩散模型解决视频逆问题 |
Giannis Daras |
PDF |
N/A |
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models |
| 小贡献,小网络:基于相对重要性的高效神经网络剪枝 |
Mostafa Hussien |
PDF |
N/A |
Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance |
| 在教师-学生设置中使用受限玻尔兹曼机进行结构化数据学习的建模 |
Robin Thériault |
PDF |
N/A |
Modelling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting |
| PODTILE:利用自动生成的章节促进播客剧集浏览 |
Azin Ghazimatin |
PDF |
N/A |
PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters |
| 面向对抗领域泛化中的频率简单性偏置学习 |
Xilin He |
PDF |
N/A |
Towards Combating Frequency Simplicity-biased Learning for Domain Generalization |
| 1-bit AI 基础设施:第1.1部分,在CPU上快速且无损的BitNet b1.58推理 |
Jinheng Wang |
PDF |
N/A |
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs |
| 一种基于可解释对比的扩张卷积网络,结合Transformer用于儿童肺炎检测 |
Chandravardhan Singh Raghaw |
PDF |
N/A |
An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection |
| 语言模型对论元角色敏感性的心理语言学评估 |
Eun-Kyoung Rosa Lee |
PDF |
N/A |
A Psycholinguistic Evaluation of Language Models' Sensitivity to Argument Roles |
| 图学习中线图变换的理论洞察 |
Fan Yang |
PDF |
N/A |
Theoretical Insights into Line Graph Transformation on Graph Learning |
| 通过结合自然视频刺激和与刺激无关的潜在因素来建模动态神经活动 |
Finn Schmidt |
PDF |
N/A |
Modeling dynamic neural activity by combining naturalistic video stimuli and stimulus-independent latent factors |
| 超越2:4:探索V:N:M稀疏性以在GPU上实现高效的Transformer推理 |
Kang Zhao |
PDF |
N/A |
Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs |
| 一种数据驱动的群体模拟框架,结合了物理信息机器学习与导航势场 |
Runkang Guo |
PDF |
N/A |
A Data-driven Crowd Simulation Framework Integrating Physics-informed Machine Learning with Navigation Potential Fields |
| 大型音频-语言模型真的能听懂吗?通过多任务评估和逐步音频推理解决幻觉问题 |
Chun-Yi Kuan |
PDF |
N/A |
Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning |
| SMART:用于推理任务的自学习元策略代理 |
Rongxing Liu |
PDF |
N/A |
SMART: Self-learning Meta-strategy Agent for Reasoning Tasks |
| MNIST-Nd:一组用于跨维度基准聚类的自然主义数据集 |
Polina Turishcheva |
PDF |
N/A |
MNIST-Nd: a set of naturalistic datasets to benchmark clustering across dimensions |
| 分子机器学习中无监督训练集选择的整数线性规划 |
Matthieu Haeberle |
PDF |
N/A |
Integer linear programming for unsupervised training set selection in molecular machine learning |
| 从大型语言模型中提取时空数据 |
Lele Zheng |
PDF |
N/A |
Extracting Spatiotemporal Data from Gradients with Large Language Models |
| SeaDAG:用于有条件有向无环图生成的半自回归扩散模型 |
Xinyi Zhou |
PDF |
N/A |
SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation |
| 多模态耀斑预测与深度学习 |
Grégoire Francisco |
PDF |
N/A |
Multimodal Flare Forecasting with Deep Learning |
| 通过近似人类视觉显著性来提高神经网络的可解释性 |
Aidan Boyd |
PDF |
N/A |
Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency |
| 大型语言模型写作是否像人类?语法和修辞风格的变化 |
Alex Reinhart |
PDF |
N/A |
Do LLMs write like humans? Variation in grammatical and rhetorical styles |
| 线性函数逼近下的时序差分学习的统计推断 |
Weichen Wu |
PDF |
N/A |
Statistical Inference for Temporal Difference Learning with Linear Function Approximation |
| 通过多级深度学习解决深度神经网络的光谱偏差问题 |
Ronglong Fang |
PDF |
N/A |
Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep Learning |
| LDAdam:从低维梯度统计中自适应优化 |
Thomas Robert |
PDF |
N/A |
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics |
| ExDBN:动态贝叶斯网络的精确学习 |
Pavel Rytíř |
PDF |
N/A |
ExDBN: Exact learning of Dynamic Bayesian Networks |
| LMHaze:基于强度感知的图像去雾方法,采用大规模多强度真实雾霾数据集 |
Ruikun Zhang |
PDF |
N/A |
LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset |
| CHESS最终报告:面向科学和安全的云、高性能计算与边缘计算 |
Nathan Tallent |
PDF |
N/A |
Final Report for CHESS: Cloud, High-Performance Computing, and Edge for Science and Security |
| 用于驱动耗散量子动力学的神经量子传播器 |
Jiaji Zhang |
PDF |
N/A |
Neural Quantum Propagators for Driven-Dissipative Quantum Dynamics |
| 分析语言模型在知识冲突下的残差流 |
Yu Zhao |
PDF |
N/A |
Analysing the Residual Stream of Language Models Under Knowledge Conflicts |
| 基于图像和雷达数据特征图的无人机分类多传感器融合 |
Nikos Sakellariou |
PDF |
N/A |
Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data |
| 微调大型语言模型以提供可靠的医疗问答服务 |
Ali Anaissi |
PDF |
N/A |
Fine-Tuning LLMs for Reliable Medical Question-Answering Services |
| 基于流生成模型的车辆轨迹预测关键示例挖掘 |
Zhezhang Ding |
PDF |
N/A |
Critical Example Mining for Vehicle Trajectory Prediction using Flow-based Generative Models |
| CartesianMoE:通过专家混合中的笛卡尔积路由提升专家间的知识共享 |
Zhenpeng Su |
PDF |
N/A |
CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts |
| 对抗训练中的正则化几何:高维渐近性和泛化界限 |
Matteo Vilucchio |
PDF |
N/A |
On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds |
| 中小企业设备上的大型语言模型:挑战与机遇 |
Jeremy Stephen Gabriel Yee Zhi Wen |
PDF |
N/A |
On-Device LLMs for SMEs: Challenges and Opportunities |
| 滚动语言模型(LLMs)在习语理解上的骰子:它们如何未能把握语境 |
Maggie Mi |
PDF |
N/A |
Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context |
| 使用随机滴定常数-pH元动力学模拟对RNA寡聚体进行表征 |
Tomas F. D. Silva |
PDF |
N/A |
Characterizing RNA oligomers using Stochastic Titration Constant-pH Metadynamics simulations |
| 基于半监督学习的小样本实例分割的综合图像-文本方法 |
Ruting Chi |
PDF |
N/A |
Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation |
| 惊喜!统一信息密度并非全部:预测长篇话语中的意外轮廓 |
Eleftheria Tsipidi |
PDF |
N/A |
Surprise! Uniform Information Density Isn't the Whole Story: Predicting Surprisal Contours in Long-form Discourse |
| 通过混合监督进行标签填充以从噪声标注中进行医学图像分割 |
Ming Li |
PDF |
N/A |
Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations |
| 非平稳核化多臂老虎机的近似最优算法 |
Shogo Iwazaki |
PDF |
N/A |
Near-Optimal Algorithm for Non-Stationary Kernelized Bandits |
| 大型语言模型知道该说什么,但不知道何时该说话。 |
Muhammad Umair |
PDF |
N/A |
Large Language Models Know What To Say But Not When To Speak |
| 用于群中相容算子哈密顿量分解的GFlowNets |
Isaac L. Huidobro-Meezs |
PDF |
N/A |
GFlowNets for Hamiltonian decomposition in groups of compatible operators |
| 基准化病理学基础模型:适应策略与场景 |
Jeaung Lee |
PDF |
N/A |
Benchmarking Pathology Foundation Models: Adaptation Strategies and Scenarios |
| 通过鲁棒视觉特征和高级注意力机制改进多标签原子活动识别 @ ROAD++ 原子活动识别 2024 |
Jiamin Cao |
PDF |
N/A |
Improving the Multi-label Atomic Activity Recognition by Robust Visual Feature and Advanced Attention @ ROAD++ Atomic Activity Recognition 2024 |
| TimeMixer++:一种通用的时间序列模式机器,用于普遍的预测分析 |
Shiyu Wang |
PDF |
N/A |
TimeMixer++: A General Time Series Pattern Machine for Universal Predictive Analysis |
| 自然GaLore:加速GaLore以实现内存高效的LLM训练与微调 |
Arijit Das |
PDF |
N/A |
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning |
| 基于开放词汇目标检测模型的少样本目标驱动实例检测 |
Ben Crulis |
PDF |
N/A |
Few-shot target-driven instance detection based on open-vocabulary object detection models |
| ComPO:社区对语言模型个性化的偏好 |
Sachin Kumar |
PDF |
N/A |
ComPO: Community Preferences for Language Model Personalization |
| 解决SMAC任务的新方法:从大型语言模型生成决策树代码 |
Yue Deng |
PDF |
N/A |
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models |
| 开始:一种具有显著性驱动令牌感知变换的广义状态空间模型 |
Jintao Guo |
PDF |
N/A |
START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation |
| 使用RGB卷积神经网络的多光谱纹理合成 |
Sélim Ollivier |
PDF |
N/A |
Multispectral Texture Synthesis using RGB Convolutional Neural Networks |
| 基于对偶的信息论极小极大后悔界限用于强化学习 |
Raghav Bongole |
PDF |
N/A |
Information-Theoretic Minimax Regret Bounds for Reinforcement Learning based on Duality |
| Massimo:基于质量-弹簧模型的公共队列监控与管理 |
Abhijeet Kumar |
PDF |
N/A |
Massimo: Public Queue Monitoring and Management using Mass-Spring Model |
| CA*:解决计算感知延迟在同时语音翻译中的评估陷阱 |
Xi Xu |
PDF |
N/A |
CA*: Addressing Evaluation Pitfalls in Computation-Aware Latency for Simultaneous Speech Translation |
| 3D-GANTex:基于StyleGAN3的多视图图像和3DDFA网格生成的3D人脸重建 |
Rohit Das |
PDF |
N/A |
3D-GANTex: 3D Face Reconstruction with StyleGAN3-based Multi-View Images and 3DDFA based Mesh Generation |
| 在拓扑结构不准确的情况下,弹性时间图卷积网络用于智能电网状态估计 |
Seyed Hamed Haghshenas |
PDF |
N/A |
Resilient Temporal GCN for Smart Grid State Estimation Under Topology Inaccuracies |
| 语言模型输出的对数概率是否经过校准? |
Charles Lovering |
PDF |
N/A |
Are Language Model Logits Calibrated? |
| 探索持续微调以提升大型语言模型的语言能力 |
Divyanshu Aggarwal |
PDF |
N/A |
Exploring Continual Fine-Tuning for Enhancing Language Ability in Large Language Model |
| 通过基于SAE的表示工程引导LLMs的知识选择行为 |
Yu Zhao |
PDF |
N/A |
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering |
| 在SMM4H 2024的1024m任务3、5和6中:用于医学文本分类的Transformer和大型语言模型集成 |
Ram Mohan Rao Kadiyala |
PDF |
N/A |
1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification |
| MultiRC:联合学习用于多尺度重构对比的时间序列异常预测与检测 |
Shiyan Hu |
PDF |
N/A |
MultiRC: Joint Learning for Time Series Anomaly Prediction and Detection with Multi-scale Reconstructive Contrast |
| 利用基于大语言模型的自然语言推理增强法律决策支持系统,以分析社交媒体证据 |
Ram Mohan Rao Kadiyala |
PDF |
N/A |
Augmenting Legal Decision Support Systems with LLM-based NLI for Analyzing Social Media Evidence |
| 分析自动驾驶高速公路驾驶模拟中用于真实交通代理模型训练的闭环训练技术 |
Matthias Bitzer |
PDF |
N/A |
Analyzing Closed-loop Training Techniques for Realistic Traffic Agent Models in Autonomous Highway Driving Simulations |
| 一个定量的Robbins-Siegmund定理 |
Morenikeji Neri |
PDF |
N/A |
A quantitative Robbins-Siegmund theorem |
| 使用稀疏DEIM和循环神经网络的状态估计 |
Mohammad Farazmand |
PDF |
N/A |
State Estimation Using Sparse DEIM and Recurrent Neural Networks |
| 多模态先验知识引导的视觉表示学习 |
Hongkuan Zhou |
PDF |
N/A |
Visual Representation Learning Guided By Multi-modal Prior Knowledge |
| 在长尾学习中,粒度至关重要 |
Shizhen Zhao |
PDF |
N/A |
Granularity Matters in Long-Tail Learning |
| PROMPTHEUS:一种以人为中心的管道,利用大型语言模型简化系统文献综述流程 |
João Pedro Fernandes Torres |
PDF |
N/A |
PROMPTHEUS: A Human-Centered Pipeline to Streamline SLRs with LLMs |
| 在忆阻器交叉阵列上实现大型语言模型的能效部署:大与小的协同作用 |
Zhehui Wang |
PDF |
N/A |
Enabling Energy-Efficient Deployment of Large Language Models on Memristor Crossbar: A Synergy of Large and Small |
| 用于跨语言情感检测的大型语言模型 |
Ram Mohan Rao Kadiyala |
PDF |
N/A |
Large Language Models for Cross-lingual Emotion Detection |
| 卡鲁什-库恩-塔克条件训练神经网络(KKT Nets) |
Shreya Arvind |
PDF |
N/A |
Karush-Kuhn-Tucker Condition-Trained Neural Networks (KKT Nets) |
| 利用深度先验组件从单张图像进行零样本场景重建 |
Junsheng Zhou |
PDF |
N/A |
Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly |
| 基于文档的对话中的政策驱动知识选择与回复生成 |
Longxuan Ma |
PDF |
N/A |
Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue |
| 自解释关键词赋能大型语言模型进行代码生成 |
Lishui Fan |
PDF |
N/A |
Self-Explained Keywords Empower Large Language Models for Code Generation |
| 系统探索对话摘要方法:可重复性、比较评估及推进自然语言处理在抽象摘要中的方法论创新 |
Yugandhar Reddy Gogireddy |
PDF |
N/A |
Systematic Exploration of Dialogue Summarization Approaches for Reproducibility, Comparative Assessment, and Methodological Innovations for Advancing Natural Language Processing in Abstractive Summarization |
| 莫扎地图矢量化中的范式转变:人机协作方法 |
Mahir Shahriar Dhrubo |
PDF |
N/A |
A Paradigm Shift in Mouza Map Vectorization: A Human-Machine Collaboration Approach |
| 现代云计算中的AI驱动创新 |
Animesh Kumar |
PDF |
N/A |
AI-Driven Innovations in Modern Cloud Computing |
| 扩散变换器策略 |
Zhi Hou |
PDF |
N/A |
Diffusion Transformer Policy |
| CamI2V:相机控制的图像到视频扩散模型 |
Guangcong Zheng |
PDF |
N/A |
CamI2V: Camera-Controlled Image-to-Video Diffusion Model |
| 大型语言模型是否带有英语口音?评估和提升多语言LLM的自然性 |
Yanzhu Guo |
PDF |
N/A |
Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs |
| TS-ACL:一种用于隐私保护和类增量模式识别的时间序列分析持续学习框架 |
Kejia Fan |
PDF |
N/A |
TS-ACL: A Time Series Analytic Continual Learning Framework for Privacy-Preserving and Class-Incremental Pattern Recognition |
| 以用户为中心的AI可解释性评估:人与AI协同的全面实证研究 |
Szymon Bobek |
PDF |
N/A |
User-centric evaluation of explainability of AI with and for humans: a comprehensive empirical study |
| 重新定义金融:人工智能(AI)与机器学习(ML)的影响 |
Animesh Kumar |
PDF |
N/A |
Redefining Finance: The Influence of Artificial Intelligence (AI) and Machine Learning (ML) |
| 第三届多语言指代消解共享任务的结果 |
Michal Novák |
PDF |
N/A |
Findings of the Third Shared Task on Multilingual Coreference Resolution |
| 青光眼检测的AI驱动方法 -- 全面综述 |
Yuki Hagiwara |
PDF |
N/A |
AI-Driven Approaches for Glaucoma Detection -- A Comprehensive Review |
| 从PDF开发基于检索增强生成(RAG)的大型语言模型系统:一份经验报告 |
Ayman Asad Khan |
PDF |
N/A |
Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience Report |
| MBPU:一种即插即用的点云上采样状态空间模型,支持快速点渲染 |
Jiayi Song |
PDF |
N/A |
MBPU: A Plug-and-Play State Space Model for Point Cloud Upsamping with Fast Point Rendering |
| 研究无序蛋白质的机器学习方法 |
Sören von Bülow |
PDF |
N/A |
Machine learning methods to study disordered proteins |
| CausalGraph2LLM:评估大型语言模型对因果查询的能力 |
Ivaxi Sheth |
PDF |
N/A |
CausalGraph2LLM: Evaluating LLMs for Causal Queries |
| 专注于鸟瞰图:用于单目鸟瞰图分割的自校准循环视图变换 |
Jiawei Zhao |
PDF |
N/A |
Focus on BEV: Self-calibrated Cycle View Transformation for Monocular Birds-Eye-View Segmentation |
| 中心化感知的产品检索与排序 |
Hadeel Saadany |
PDF |
N/A |
Centrality-aware Product Retrieval and Ranking |
| 是的,嗯,哦:通过微调语音活动投影实现连续和实时反馈预测 |
Koji Inoue |
PDF |
N/A |
Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection |
| GReFEL:在偏差和不平衡数据分布下,基于几何感知的可靠面部表情学习 |
Azmine Toushik Wasi |
PDF |
N/A |
GReFEL: Geometry-Aware Reliable Facial Expression Learning under Bias and Imbalanced Data Distribution |
| 通过同心因果注意力缓解对象幻觉 |
Yun Xing |
PDF |
N/A |
Mitigating Object Hallucination via Concentric Causal Attention |
| 时间变化更新优化算法的自动微分 |
Sheheryar Mehmood |
PDF |
N/A |
Automatic Differentiation of Optimization Algorithms with Time-Varying Updates |
| 大规模软标签对于大规模数据集蒸馏是否必要? |
Lingao Xiao |
PDF |
N/A |
Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation? |
| 利用CORAL-相关一致性网络进行半监督左心房MRI分割 |
Xinze Li |
PDF |
N/A |
Leveraging CORAL-Correlation Consistency Network for Semi-Supervised Left Atrium MRI Segmentation |
| Bench4Merge:一个综合基准,用于在具有微交互车辆的现实密集交通中进行合并 |
Zhengming Wang |
PDF |
N/A |
Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles |
| DefVerify:仇恨言论模型是否反映了其数据集的定义? |
Urja Khurana |
PDF |
N/A |
DefVerify: Do Hate Speech Models Reflect Their Dataset's Definition? |
| 多样性策略通过点对点互信息加权模仿学习实现恢复 |
Hanlin Yang |
PDF |
N/A |
Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning |
| 实时视频异常检测的混合架构:整合空间与时间分析 |
Fabien Poirier |
PDF |
N/A |
Hybrid Architecture for Real-Time Video Anomaly Detection: Integrating Spatial and Temporal Analysis |
| 地震相位拾取 |
Yuchen Wang |
PDF |
N/A |
Seismic Phase Picking |
| 基于机器学习的纠错解码器的设计与性能 |
Yuncheng Yuan |
PDF |
N/A |
On the Design and Performance of Machine Learning Based Error Correcting Decoders |
| IGMaxHS -- 一种支持XOR子句的增量最大SAT求解器 |
Ole Lübke |
PDF |
N/A |
IGMaxHS -- An Incremental MaxSAT Solver with Support for XOR Clauses |
| 基于模拟的单分子实验推断 |
Lars Dingeldein |
PDF |
N/A |
Simulation-based inference of single-molecule experiments |
| TexPro:基于文本指导的PBR纹理生成与程序化材质建模 |
Ziqiang Dang |
PDF |
N/A |
TexPro: Text-guided PBR Texturing with Procedural Material Modeling |
| 模型模仿攻击:可证明可转移对抗样本的知识蒸馏 |
Kirill Lukyanov |
PDF |
N/A |
Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples |
| 用于数字病理学中幻灯片级癌症亚型分类的基础模型 |
Pablo Meseguer |
PDF |
N/A |
Foundation Models for Slide-level Cancer Subtyping in Digital Pathology |
| 如何构建一个用于同时聊天和决策的预训练多模态模型? |
Zuojin Tang |
PDF |
N/A |
How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making? |
| 使用GPT模型进行2024年美国总统选举过程中的定性与定量新闻分析 |
Bohdan M. Pavlyshenko |
PDF |
N/A |
Using GPT Models for Qualitative and Quantitative News Analytics in the 2024 US Presidental Election Process |
| 无人机集群的分布式学习 |
Chen Hu |
PDF |
N/A |
Distributed Learning for UAV Swarms |
| MI-VisionShot:用于组织病理学图像幻灯片级分类的视觉-语言模型的少样本适应 |
Pablo Meseguer |
PDF |
N/A |
MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images |
| 闪烁融合:轨迹内领域泛化的多智能体强化学习 |
Woosung Koh |
PDF |
N/A |
FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL |
| 在多任务学习中通过自辅助实现非对称知识迁移 |
Olivier Graffeuille |
PDF |
N/A |
Enabling Asymmetric Knowledge Transfer in Multi-Task Learning with Self-Auxiliaries |
| 视觉主题识别:精心策划的比较数据集和分类方法的详细阐述 |
Adam Phillips |
PDF |
N/A |
Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods |
| 语法模式中语义和功能效率的原则 |
Emily Cheng |
PDF |
N/A |
Principles of semantic and functional efficiency in grammatical patterning |
| Mesa-外推法:一种用于增强大型语言模型外推能力的编织位置编码方法 |
Xin Ma |
PDF |
N/A |
Mesa-Extrapolation: A Weave Position Encoding Method for Enhanced Extrapolation in LLMs |
| 面向高效迁移学习的最佳适配器放置策略 |
Aleksandra I. Nowak |
PDF |
N/A |
Towards Optimal Adapter Placement for Efficient Transfer Learning |
| TEXEL:一种具有片上学习功能的神经形态处理器,适用于超越CMOS器件的集成 |
Hugh Greatorex |
PDF |
N/A |
TEXEL: A neuromorphic processor with on-chip learning for beyond-CMOS device integration |
| R2I-rPPG:一种用于远程光电容积脉搏波描记法提取心率的鲁棒感兴趣区域选择方法 |
Sandeep Nagar |
PDF |
N/A |
R2I-rPPG: A Robust Region of Interest Selection Method for Remote Photoplethysmography to Extract Heart Rate |
| 聚焦关键:图选择性状态聚焦注意力网络 |
Shikhar Vashistha |
PDF |
N/A |
Focus Where It Matters: Graph Selective State Focused Attention Networks |
| 多视角医学诊断的随机令牌融合 |
Jingyu Guo |
PDF |
N/A |
Random Token Fusion for Multi-View Medical Diagnosis |
| 为实时通信中的端到端服务质量预测建模并发RTP流 |
Tailai Song |
PDF |
N/A |
Modelling Concurrent RTP Flows for End-to-end Predictions of QoS in Real Time Communications |
| 通过图模型实现强化学习中的高效协作 |
Wenzhe Fan |
PDF |
N/A |
Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning |
| 私密、高效且可扩展的医学图像分析内核学习 |
Anika Hannemann |
PDF |
N/A |
Private, Efficient and Scalable Kernel Learning for Medical Image Analysis |
| 在GNSS缺失环境下利用深度强化学习进行远程地磁导航 |
Wenqi Bai |
PDF |
N/A |
Long-distance Geomagnetic Navigation in GNSS-denied Environments with Deep Reinforcement Learning |
| LiOn-XA:通过仅使用LiDAR的跨模态对抗训练实现无监督领域自适应 |
Thomas Kreutz |
PDF |
N/A |
LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training |
| LLM4GRN:利用大型语言模型发现因果基因调控网络——通过合成数据生成进行评估 |
Tejumade Afonja |
PDF |
N/A |
LLM4GRN: Discovering Causal Gene Regulatory Networks with LLMs -- Evaluation through Synthetic Data Generation |
| 高度相关模糊流失模式在二分类中的可解释性 |
D. Y. C. Wang |
PDF |
N/A |
Explainability of Highly Associated Fuzzy Churn Patterns in Binary Classification |
| 有人提到“Gest-IT”了吗?这是对多模态数据管理的一次试点探索。 |
Ludovica Pannitto |
PDF |
N/A |
Did somebody say "Gest-IT"? A pilot exploration of multimodal data management |
| 微调对语言模型毒性的影响 |
Will Hawkins |
PDF |
N/A |
The effect of fine-tuning on language model toxicity |
| MAC Revivo:人工智能铺就道路 |
Jinzhe Pan |
PDF |
N/A |
MAC Revivo: Artificial Intelligence Paves the Way |
| LiMTR:通过多模态特征融合实现多样化道路用户的时间序列运动预测 |
Camiel Oerlemans |
PDF |
N/A |
LiMTR: Time Series Motion Prediction for Diverse Road Users through Multimodal Feature Integration |
| 从神经热力学积分中获得的溶剂化自由能 |
Bálint Máté |
PDF |
N/A |
Solvation Free Energies from Neural Thermodynamic Integration |
| Kaninfradet3D:基于非线性特征提取与内在关联的路边相机-激光雷达融合3D感知模型 |
Pei Liu |
PDF |
N/A |
Kaninfradet3D:A Road-side Camera-LiDAR Fusion 3D Perception Model based on Nonlinear Feature Extraction and Intrinsic Correlation |
| FusionLungNet:用于肺部CT图像分割的多尺度融合卷积与细化网络 |
Sadjad Rezvani |
PDF |
N/A |
FusionLungNet: Multi-scale Fusion Convolution with Refinement Network for Lung CT Image Segmentation |
| 数据高效的CLIP驱动的双分支网络用于无源无监督领域自适应 |
Yongguang Li |
PDF |
N/A |
Data-Efficient CLIP-Powered Dual-Branch Networks for Source-Free Unsupervised Domain Adaptation |
| 基于平均场模拟的宇宙初始条件推断 |
Oleg Savchenko |
PDF |
N/A |
Mean-Field Simulation-Based Inference for Cosmological Initial Conditions |
| RAG4ITOps:一种面向IT运维与维护的监督式微调与综合性RAG框架 |
Tianyang Zhang |
PDF |
N/A |
RAG4ITOps: A Supervised Fine-Tunable and Comprehensive RAG Framework for IT Operations and Maintenance |
| 深度学习与数据增强技术在检测自我承认的技术债务中的应用 |
Edi Sutoyo |
PDF |
N/A |
Deep Learning and Data Augmentation for Detecting Self-Admitted Technical Debt |
| 辅助物理交互:配备神经网络检测、导航和安全层的自主空中机器人 |
Andrea Berra |
PDF |
N/A |
Assisted Physical Interaction: Autonomous Aerial Robots with Neural Network Detection, Navigation, and Safety Layers |
| 通过蕴含调优改进密集段落检索 |
Lu Dai |
PDF |
N/A |
Improve Dense Passage Retrieval with Entailment Tuning |
| 深度群卷积神经网络的VC维度 |
Anna Sepliarskaia |
PDF |
N/A |
On the VC dimension of deep group convolutional neural networks |