| 本地策略实现零样本长时程操作 |
Murtaza Dalal |
PDF |
N/A |
Local Policies Enable Zero-shot Long-horizon Manipulation |
| 任务向量是跨模态的 |
Grace Luo |
PDF |
N/A |
Task Vectors are Cross-Modal |
| 机器人预训练机器人:基于大规模机器人数据集的以操控为中心的机器人表征 |
Guangqi Jiang |
PDF |
N/A |
Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Dataset |
| 通过求根法优化贝叶斯优化的后验样本 |
Taiwo A. Adebiyi |
PDF |
N/A |
Optimizing Posterior Samples for Bayesian Optimization via Rootfinding |
| 通过下注进行顺序假设检验在线检测由大型语言模型生成的文本 |
Can Chen |
PDF |
N/A |
Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting |
| 多类别文本反转暗中产生了一个语义无关的分类器 |
Kai Wang |
PDF |
N/A |
Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier |
| 通过检索头理解合成上下文扩展 |
Xinyu Zhao |
PDF |
N/A |
Understanding Synthetic Context Extension via Retrieval Heads |
| 自然语言推理提升视觉-语言模型的组合性 |
Paola Cascante-Bonilla |
PDF |
N/A |
Natural Language Inference Improves Compositionality in Vision-Language Models |
| 一种通过激光雷达-相机-高精度地图融合生成安全可行驶空间的高效方法 |
Minghao Ning |
PDF |
N/A |
An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion |
| Senna: 连接大规模视觉语言模型与端到端自动驾驶 |
Bo Jiang |
PDF |
N/A |
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving |
| 通过简单的“是-否”标注,实现对模型关注的有效引导 |
Seongmin Lee |
PDF |
N/A |
Effective Guidance for Model Attention with Simple Yes-no Annotations |
| 用于训练两层ReLU神经网络的凸优化公式 |
Karthik Prakhya |
PDF |
N/A |
Convex Formulations for Training Two-Layer ReLU Neural Networks |
| SVIP:面向开源大型语言模型的可验证推理 |
Yifan Sun |
PDF |
N/A |
SVIP: Towards Verifiable Inference of Open-source Large Language Models |
| 多对象三维定位与动态模块及语言引导的空间注意力 |
Haomeng Zhang |
PDF |
N/A |
Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial Attention |
| Flow-DPO:通过在线多智能体学习提升大语言模型的数学推理能力 |
Yihe Deng |
PDF |
N/A |
Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning |
| $\mathsf{OPA}$:单次交互下的单客户端隐私聚合及其在联邦学习中的应用 |
Harish Karthikeyan |
PDF |
N/A |
$\mathsf{OPA}$: One-shot Private Aggregation with Single Client Interaction and its Applications to Federated Learning |
| 情感引导的图像到音乐生成 |
Souraja Kundu |
PDF |
N/A |
Emotion-Guided Image to Music Generation |
| 大型语言模型是高度受限的生物物理序列优化器 |
Angelica Chen |
PDF |
N/A |
LLMs are Highly-Constrained Biophysical Sequence Optimizers |
| 批量处理、匹配和修补:基于分数的变分推断的低秩近似 |
Chirag Modi |
PDF |
N/A |
Batch, match, and patch: low-rank approximations for score-based variational inference |
| 运动图谱释放:一种新颖的视频预测方法 |
Yiqi Zhong |
PDF |
N/A |
Motion Graph Unleashed: A Novel Approach to Video Prediction |
| 从旋律音符序列到音高使用word2vec |
Daniel Defays |
PDF |
N/A |
From melodic note sequences to pitches using word2vec |
| 基于嵌入的分类器可以检测提示注入攻击 |
Md. Ahsan Ayub |
PDF |
N/A |
Embedding-based classifiers can detect prompt injection attacks |
| 利用循环神经网络从灵长类动物运动皮层神经记录中预测运动动作 |
Yuanxi Wang |
PDF |
N/A |
Leveraging Recurrent Neural Networks for Predicting Motor Movements from Primate Motor Cortex Neural Recordings |
| 单目距离估计的主动事件对齐 |
Nan Cai |
PDF |
N/A |
Active Event Alignment for Monocular Distance Estimation |
| 利用混响和视觉深度线索进行声事件定位与检测,并估计距离 |
Davide Berghi |
PDF |
N/A |
Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation |
| 傅里叶头:帮助大型语言模型学习复杂概率分布 |
Nate Gillman |
PDF |
N/A |
Fourier Head: Helping Large Language Models Learn Complex Probability Distributions |
| NCA-Morph:基于神经元细胞自动机的医学图像配准 |
Amin Ranem |
PDF |
N/A |
NCA-Morph: Medical Image Registration with Neural Cellular Automata |
| 元学习可适应的基础模型 |
Jacob L. Block |
PDF |
N/A |
Meta-Learning Adaptable Foundation Models |
| LipKernel: 通过耗散层实现Lipschitz有界卷积神经网络 |
Patricia Pauli |
PDF |
N/A |
LipKernel: Lipschitz-Bounded Convolutional Neural Networks via Dissipative Layers |
| FactBench:一个用于实际语言模型事实性评估的动态基准 |
Farima Fatahi Bayat |
PDF |
N/A |
FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation |
| 基于超图的多尺度时空图卷积网络用于时间序列异常检测 |
Hongyi Xu |
PDF |
N/A |
Hypergraph-based multi-scale spatio-temporal graph convolution network for Time-Series anomaly detection |
| 推动基于深度神经网络的推荐系统在GPU上的推理性能极限 |
Rishabh Jain |
PDF |
N/A |
Pushing the Performance Envelope of DNN-based Recommendation Systems Inference on GPUs |
| 变压器中的突变学习:矩阵补全案例研究 |
Pulkit Gopalani |
PDF |
N/A |
Abrupt Learning in Transformers: A Case Study on Matrix Completion |
| DISCERN:解码文本分类器中的系统性错误 |
Rakesh R. Menon |
PDF |
N/A |
DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers |
| 在一次运行中审计$f$-差分隐私 |
Saeed Mahloujifar |
PDF |
N/A |
Auditing $f$-Differential Privacy in One Run |
| ContextIQ:一种基于多模态专家系统的视频检索系统,用于情境广告 |
Ashutosh Chaubey |
PDF |
N/A |
ContextIQ: A Multimodal Expert-Based Video Retrieval System for Contextual Advertising |
| Cora:利用智能网卡加速有状态网络应用程序 |
Shaoke Xi |
PDF |
N/A |
Cora: Accelerating Stateful Network Applications with SmartNICs |
| 图数据上的分布外泛化子图聚合 |
Bowen Liu |
PDF |
N/A |
Subgraph Aggregation for Out-of-Distribution Generalization on Graphs |
| Guide3D:一种用于三维形状重建的双平面X射线数据集 |
Tudor Jianu |
PDF |
N/A |
Guide3D: A Bi-planar X-ray Dataset for 3D Shape Reconstruction |
| MAPUNetR:一种用于高效和可解释医学图像分割的混合视觉Transformer和U-Net架构 |
Ovais Iqbal Shah |
PDF |
N/A |
MAPUNetR: A Hybrid Vision Transformer and U-Net Architecture for Efficient and Interpretable Medical Image Segmentation |
| 在视觉基础模型时代统一理解和生成:从自回归角度进行的综述 |
Shenghao Xie |
PDF |
N/A |
Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective |
| LiVisSfM:结合激光雷达和视觉线索的精确且鲁棒的从运动中恢复结构方法 |
Hanqing Jiang |
PDF |
N/A |
LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues |
| ProMQA:用于多模态程序性活动理解的问题回答数据集 |
Kimihiro Hasegawa |
PDF |
N/A |
ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding |
| 一种在不完全信息下逐步构建结构化论证语义的方法 |
Antonio Rago |
PDF |
N/A |
A Methodology for Gradual Semantics for Structured Argumentation under Incomplete Information |
| 无人机声学分析通过人工神经网络预测心理声学烦恼 |
Andrea Vaiuso |
PDF |
N/A |
Drone Acoustic Analysis for Predicting Psychoacoustic Annoyance via Artificial Neural Networks |
| 民主化个人和代表性价值一致的奖励设计 |
Carter Blair |
PDF |
N/A |
Democratizing Reward Design for Personal and Representative Value-Alignment |
| 类感知对比优化用于不平衡文本分类 |
Grigorii Khvatskii |
PDF |
N/A |
Class-Aware Contrastive Optimization for Imbalanced Text Classification |
| ADAM:开放世界环境中的具身因果智能体 |
Shu Yu |
PDF |
N/A |
ADAM: An Embodied Causal Agent in Open-World Environments |
| GRINNs:用于学习双曲守恒律的Godunov-Riemann信息神经网络 |
Dimitrios G. Patsatzis |
PDF |
N/A |
GRINNs: Godunov-Riemann Informed Neural Networks for Learning Hyperbolic Conservation Laws |
| $r$年龄-$k$:利用年龄因子实现通信高效的联邦学习 |
Matin Mortaheb |
PDF |
N/A |
$r$Age-$k$: Communication-Efficient Federated Learning Using Age Factor |
| 视觉-语言模型的主动学习 |
Bardia Safaei |
PDF |
N/A |
Active Learning for Vision-Language Models |
| 多层次特征蒸馏:在不同图像数据集上训练的联合教师模型 |
Adrian Iordache |
PDF |
N/A |
Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets |
| 用于分析电子健康记录和癌症研究中临床笔记的自然语言处理:综述 |
Muhammad Bilal |
PDF |
N/A |
Natural Language Processing for Analyzing Electronic Health Records and Clinical Notes in Cancer Research: A Review |
| 非常专注的Tacotron:基于自回归Transformer的文本到语音转换中的鲁棒性和无界长度泛化 |
Eric Battenberg |
PDF |
N/A |
Very Attentive Tacotron: Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speech |
| 分析多模态交互策略以辅助大型语言模型(LLM)进行3D场景操控 |
Junlong Chen |
PDF |
N/A |
Analyzing Multimodal Interaction Strategies for LLM-Assisted Manipulation of 3D Scenes |
| EconoJax:一个快速且可扩展的基于Jax的经济模拟框架 |
Koen Ponse |
PDF |
N/A |
EconoJax: A Fast & Scalable Economic Simulation in Jax |
| 评估大型语言模型在处理多语言毒性方面的防护措施 |
Yahan Yang |
PDF |
N/A |
Benchmarking LLM Guardrails in Handling Multilingual Toxicity |
| 先进人工智能安全与可信技术标准化趋势 |
Jonghong Jeon |
PDF |
N/A |
Standardization Trends on Safety and Trustworthiness Technology for Advanced AI |
| 利用夜间灯光数据评估飓风破坏程度:预处理至关重要 |
Nancy Thomas |
PDF |
N/A |
Shining a Light on Hurricane Damage Estimation via Nighttime Light Data: Pre-processing Matters |
| 容量控制是文本条件扩散模型中一种有效的记忆缓解机制 |
Raman Dutt |
PDF |
N/A |
Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models |
| AmpleGCG-Plus:一种强大的生成模型,用于对抗性后缀,以更少的尝试实现更高的成功率来破解大型语言模型 |
Vishal Kumar |
PDF |
N/A |
AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts |
| Lighten CARAFE:动态轻量级上采样与引导重装配核 |
Ruigang Fu |
PDF |
N/A |
Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels |
| ProMoE:使用主动缓存实现基于MoE的LLM快速服务 |
Xiaoniu Song |
PDF |
N/A |
ProMoE: Fast MoE-based LLM Serving using Proactive Caching |
| 轻量级频率掩码器用于跨域少样本语义分割 |
Jintao Tong |
PDF |
N/A |
Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation |
| 简单学习后继特征 |
Raymond Chua |
PDF |
N/A |
Learning Successor Features the Simple Way |
| 使用生成-传播-测试方法解决认知逻辑程序 |
Jorge Fandinno |
PDF |
N/A |
Solving Epistemic Logic Programs using Generate-and-Test with Propagation |
| 提升商用AI产品在多智能体配置中的性能 |
Cory Hymel |
PDF |
N/A |
Improving Performance of Commercially Available AI Products in a Multi-Agent Configuration |
| PF3plat:无姿态前馈三维高斯喷射 |
Sunghwan Hong |
PDF |
N/A |
PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting |
| RankUp:通过辅助排序分类器提升半监督回归性能 |
Pin-Yen Huang |
PDF |
N/A |
RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier |
| 愿景文件:根据《欧洲人工智能法案》设计图神经网络 |
Barbara Hoffmann |
PDF |
N/A |
Vision Paper: Designing Graph Neural Networks in Compliance with the European Artificial Intelligence Act |
| 深度Q指数过程 |
Zhi Chang |
PDF |
N/A |
Deep Q-Exponential Processes |
| 推理加速策略对大型语言模型偏差的影响 |
Elisabeth Kirsten |
PDF |
N/A |
The Impact of Inference Acceleration Strategies on Bias of LLMs |
| 鲁棒马尔可夫决策过程的策略梯度 |
Qiuhao Wang |
PDF |
N/A |
Policy Gradient for Robust Markov Decision Processes |
| 大学习率将我们引向何方? |
Ildus Sadrtdinov |
PDF |
N/A |
Where Do Large Learning Rates Lead Us? |
| 硬件友好型训练后量化的数据生成 |
Lior Dikstein |
PDF |
N/A |
Data Generation for Hardware-Friendly Post-Training Quantization |
| 使用MLLMU-Bench保护多模态大语言模型中的隐私 |
Zheyuan Liu |
PDF |
N/A |
Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench |
| DAGE:通过带有逻辑约束的关系组合器进行DAG查询回答 |
Yunjie He |
PDF |
N/A |
DAGE: DAG Query Answering via Relational Combinator with Logical Constraints |
| 丹麦职业匹配中的能力联合提取与分类 |
Qiuchi Li |
PDF |
N/A |
Joint Extraction and Classification of Danish Competences for Job Matching |
| 基于高光谱成像的自动驾驶场景感知:基准语义分割模型评估 |
Imad Ali Shah |
PDF |
N/A |
Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models |
| TractShapeNet:利用3D纤维束点云进行高效的多形状学习 |
Yui Lo |
PDF |
N/A |
TractShapeNet: Efficient Multi-Shape Learning with 3D Tractography Point Clouds |
| InLINE:异构图上多任务学习的内层信息交换 |
Xinyue Feng |
PDF |
N/A |
InLINE: Inner-Layer Information Exchange for Multi-task Learning on Heterogeneous Graphs |
| 基于相对论图像处理的4D机器人导航 |
Simone Müller |
PDF |
N/A |
4D-based Robot Navigation Using Relativistic Image Processing |
| 多任务优化的去学习:一种自适应学习率的归一化梯度差方法 |
Zhiqi Bu |
PDF |
N/A |
Unlearning as multi-task optimization: A normalized gradient difference approach with an adaptive learning rate |
| 挑剔的宝宝需要一位教练:利用反向KL散度引导BabyLlama的模式探索行为 |
Shaozhen Shi |
PDF |
N/A |
Choosy Babies Need One Coach: Inducing Mode-Seeking Behavior in BabyLlama with Reverse KL Divergence |
| HRPVT:用于中、小规模人体姿态估计的高分辨率金字塔视觉Transformer |
Zhoujie Xu |
PDF |
N/A |
HRPVT: High-Resolution Pyramid Vision Transformer for medium and small-scale human pose estimation |
| DINeuro:通过可变形管状传输策略从2D自然图像中提取知识用于3D神经元重建 |
Yik San Cheng |
PDF |
N/A |
DINeuro: Distilling Knowledge from 2D Natural Images via Deformable Tubular Transferring Strategy for 3D Neuron Reconstruction |
| 通过架构映射神经符号人工智能领域:一本关于通过符号推理增强深度学习的指南 |
Jonathan Feldstein |
PDF |
N/A |
Mapping the Neuro-Symbolic AI Landscape by Architectures: A Handbook on Augmenting Deep Learning Through Symbolic Reasoning |
| 使用扩散模型进行强子对撞机上堆积事件的变分推断 |
Malte Algren |
PDF |
N/A |
Variational inference for pile-up removal at hadron colliders with diffusion models |
| 在大型语言模型(LLM)的幻觉现象中区分无知与错误 |
Adi Simhi |
PDF |
N/A |
Distinguishing Ignorance from Error in LLM Hallucinations |
| FreeGaussian:基于流导数的无引导可控3D高斯散射 |
Qizhi Chen |
PDF |
N/A |
FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives |
| 边际的味道:同质神经网络中梯度下降的隐性偏见 |
Nikolaos Tsilivis |
PDF |
N/A |
Flavors of Margin: Implicit Bias of Steepest Descent in Homogeneous Neural Networks |
| 唱出来,讲述它:高质量音乐歌词翻译 |
Zhuorui Ye |
PDF |
N/A |
Sing it, Narrate it: Quality Musical Lyrics Translation |
| 在ReLU神经网络上使用哈密顿蒙特卡洛方法效率低下 |
Vu C. Dinh |
PDF |
N/A |
Hamiltonian Monte Carlo on ReLU Neural Networks is Inefficient |
| PACA:面向视角的交叉注意力表示,用于零样本场景重排 |
Shutong Jin |
PDF |
N/A |
PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement |
| FANCL:基于特征引导注意力网络与课程学习的脑转移瘤分割 |
Zijiang Liu |
PDF |
N/A |
FANCL: Feature-Guided Attention Network with Curriculum Learning for Brain Metastases Segmentation |
| 在Segment Anything模型中对人类和自动化提示进行基准测试 |
Jorge Quesada |
PDF |
N/A |
Benchmarking Human and Automated Prompting in the Segment Anything Model |
| 和弦宝典:包含666,000首歌曲及其和弦进程的数据集 |
Spyridon Kantarelis |
PDF |
N/A |
CHORDONOMICON: A Dataset of 666,000 Songs and their Chord Progressions |
| NetAurHPD:网络听觉化超链接预测模型,用于从代谢组学数据中识别代谢途径 |
Tamir Bar-Tov |
PDF |
N/A |
NetAurHPD: Network Auralization Hyperlink Prediction Model to Identify Metabolic Pathways from Metabolomics Data |
| VLMs真的盲吗 |
Ayush Singh |
PDF |
N/A |
Are VLMs Really Blind |
| 通过二阶池化增强双曲表示学习 |
Kun Song |
PDF |
N/A |
Enhance Hyperbolic Representation Learning via Second-order Pooling |
| 语音情感识别的特征分布自适应网络 |
Shaokai Li |
PDF |
N/A |
Feature distribution Adaptation Network for Speech Emotion Recognition |
| 基于路径的图推荐系统摘要解释 -- 扩展版本 |
Danae Pla Karidi |
PDF |
N/A |
Path-based summary explanations for graph recommenders -- extended version |
| 为序列推荐建模时间上的正负激励 |
Chengkai Huang |
PDF |
N/A |
Modeling Temporal Positive and Negative Excitation for Sequential Recommendation |
| 结构化模型学习中的唯一性问题 |
Martin Holler |
PDF |
N/A |
On uniqueness in structured model learning |
| 基于机器学习的面部验证安全方案及其在数字监控中的应用 |
Huan-Chih Wang |
PDF |
N/A |
A Machine Learning-Based Secure Face Verification Scheme and Its Applications to Digital Surveillance |
| 从显式规则到隐式推理:可解释暴力监控系统中的转变 |
Wen-Dong Jiang |
PDF |
N/A |
From Explicit Rules to Implicit Reasoning in an Interpretable Violence Monitoring System |
| 通过局部平均在潜在位置随机图上的节点回归 |
Martin Gjorgjevski |
PDF |
N/A |
Node Regression on Latent Position Random Graphs via Local Averaging |
| 使用学习图的概率分布之间的距离,对行动受阻患者的个性化康复轨迹 |
Chuqiao Zhang |
PDF |
N/A |
Individualised recovery trajectories of patients with impeded mobility, using distance between probability distributions of learnt graphs |
| 关于无监督工业异常检测的RGB、3D及多模态方法综述 |
Yuxuan Lin |
PDF |
N/A |
A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Anomaly Detection |
| 并非所有语言都平等:多语言检索增强生成之洞察 |
Suhang Wu |
PDF |
N/A |
Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation |
| BenchX:一个统一的胸部X光影像语言预训练基准框架 |
Yang Zhou |
PDF |
N/A |
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays |
| 使用深度学习技术进行自动化漏洞检测 |
Guan-Yan Yang |
PDF |
N/A |
Automated Vulnerability Detection Using Deep Learning Technique |
| 用于序列推荐的二重条件扩散模型 |
Hongtao Huang |
PDF |
N/A |
Dual Conditional Diffusion Models for Sequential Recommendation |
| PrefPaint:将图像修复扩散模型与人类偏好对齐 |
Kendong Liu |
PDF |
N/A |
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference |
| SG-Bench: 评估LLM在多样化任务和提示类型中的安全泛化能力 |
Yutao Mou |
PDF |
N/A |
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types |
| FakeFormer:用于可泛化深度伪造检测的高效漏洞驱动型Transformer |
Dat Nguyen |
PDF |
N/A |
FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake Detection |
| 使用事件相机进行动作单元分类的时空变换器 |
Luca Cultrera |
PDF |
N/A |
Spatio-temporal Transformers for Action Unit Classification with Event Cameras |
| ActiveSplat:通过主动高斯溅射实现高保真场景重建 |
Yuetao Li |
PDF |
N/A |
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting |
| 对抗训练在不确定性攻击下的鲁棒性研究 |
Emanuele Ledda |
PDF |
N/A |
On the Robustness of Adversarial Training Against Uncertainty Attacks |
| 通过推测解码实现快速且高质量的自回归语音合成 |
Bohan Li |
PDF |
N/A |
Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding |
| 分析噪声模型和图像增强的高级滤波算法 |
Sahil Ali Akbar |
PDF |
N/A |
Analyzing Noise Models and Advanced Filtering Algorithms for Image Enhancement |
| 超越文本:优化工业应用中的RAG与多模态输入 |
Monica Riedler |
PDF |
N/A |
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications |
| 使用批评者调节进化的强化学习代理的人类可读程序 |
Senne Deproost |
PDF |
N/A |
Human-Readable Programs as Actors of Reinforcement Learning Agents Using Critic-Moderated Evolution |
| 基准测试OpenAI o1在网络安全中的表现 |
Dan Ristea |
PDF |
N/A |
Benchmarking OpenAI o1 in Cyber Security |
| ReMix:在混合数据上训练广义人员重识别 |
Timur Mamedov |
PDF |
N/A |
ReMix: Training Generalized Person Re-identification on a Mixture of Data |
| LogSHIELD:一种基于图的实时异常检测框架,利用频率分析 |
Krishna Chandra Roy |
PDF |
N/A |
LogSHIELD: A Graph-based Real-time Anomaly Detection Framework using Frequency Analysis |
| CT到PET翻译:大规模数据集与领域知识引导的扩散方法 |
Dac Thai Nguyen |
PDF |
N/A |
CT to PET Translation: A Large-scale Dataset and Domain-Knowledge-Guided Diffusion Approach |
| 可微归纳逻辑编程用于欺诈检测 |
Boris Wolfson |
PDF |
N/A |
Differentiable Inductive Logic Programming for Fraud Detection |
| 可靠的语义理解用于现实世界零样本目标物体导航 |
Halil Utku Unlu |
PDF |
N/A |
Reliable Semantic Understanding for Real World Zero-shot Object Goal Navigation |
| 神经网络深对流参数化在ARP-GEM1中的在线测试 |
Blanka Balogh |
PDF |
N/A |
Online Test of a Neural Network Deep Convection Parameterization in ARP-GEM1 |
| 具有隐藏混杂因素的线性常微分方程系统的可识别性分析 |
Yuanyuan Wang |
PDF |
N/A |
Identifiability Analysis of Linear ODE Systems with Hidden Confounders |
| 历史手写密码中字母的结构化分析与比较 |
Martín Méndez |
PDF |
N/A |
Structured Analysis and Comparison of Alphabets in Historical Handwritten Ciphers |
| SceneGenAgent:通过编码代理实现精准工业场景生成 |
Xiao Xia |
PDF |
N/A |
SceneGenAgent: Precise Industrial Scene Generation with Coding Agent |
| 多步骤特征融合用于卫星图像上的自然灾害损害评估 |
Mateusz Żarski |
PDF |
N/A |
Multi-step feature fusion for natural disaster damage assessment on satellite images |
| 《纽约时报》和《福克斯新闻》图片与文章中种族和性别偏见的纵向分析 |
Hazem Ibrahim |
PDF |
N/A |
A Longitudinal Analysis of Racial and Gender Bias in New York Times and Fox News Images and Articles |
| 半监督自学习增强的音乐情感识别 |
Yifu Sun |
PDF |
N/A |
Semi-Supervised Self-Learning Enhanced Music Emotion Recognition |
| 评估基于Transformer的符号回归模型的K折交叉验证 |
Kaustubh Kislay |
PDF |
N/A |
Evaluating K-Fold Cross Validation for Transformer Based Symbolic Regression Models |
| 神经网络超参数调优的贝叶斯优化 |
Gabriele Onorato |
PDF |
N/A |
Bayesian Optimization for Hyperparameters Tuning in Neural Networks |
| 自放松联合训练:基于有序噪声标签的严重程度估计样本选择 |
Shumpei Takezaki |
PDF |
N/A |
Self-Relaxed Joint Training: Sample Selection for Severity Estimation with Ordinal Noisy Labels |
| 构建具有脑启发情感共情机制的利他道德人工智能代理 |
Feifei Zhao |
PDF |
N/A |
Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms |
| SCGNet-基于门控循环单元网络的堆叠卷积网络用于网络入侵检测及入侵类型分类 |
Rajana Akter |
PDF |
N/A |
SCGNet-Stacked Convolution with Gated Recurrent Unit Network for Cyber Network Intrusion Detection and Intrusion Type Classification |
| 推进高效脑肿瘤多类分类——迁移学习中Vision Mamba模型的新见解 |
Yinyi Lai |
PDF |
N/A |
Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning |
| 交叉熵足以反转数据生成过程 |
Patrik Reizinger |
PDF |
N/A |
Cross-Entropy Is All You Need To Invert the Data Generating Process |
| 通过小型语言模型集成提升上下文学习 |
M. Mehdi Mojarradi |
PDF |
N/A |
Improving In-Context Learning with Small Language Model Ensembles |
| 分层混合的Unigram模型用于短文本聚类:Beta-Liouville先验的作用 |
Massimo Bilancia |
PDF |
N/A |
Hierarchical mixtures of Unigram models for short text clustering: the role of Beta-Liouville priors |
| HRGR:通过分层区域感知图推理增强图像篡改检测 |
Xudong Wang |
PDF |
N/A |
HRGR: Enhancing Image Manipulation Detection via Hierarchical Region-aware Graph Reasoning |
| 非平衡面板的条件均值和协方差联合估计 |
Damir Filipovic |
PDF |
N/A |
Joint Estimation of Conditional Mean and Covariance for Unbalanced Panels |
| 基于微结构图的点云配准方法,旨在平衡效率与精度 |
Rongling Zhang |
PDF |
N/A |
Micro-Structures Graph-Based Point Cloud Registration for Balancing Efficiency and Accuracy |
| 从数据中学习连续对称的无穷小生成元 |
Gyeonghoon Ko |
PDF |
N/A |
Learning Infinitesimal Generators of Continuous Symmetries from Data |
| 联合波束成形与说话人属性自动语音识别用于真实远场麦克风会议转录 |
Can Cui |
PDF |
N/A |
Joint Beamforming and Speaker-Attributed ASR for Real Distant-Microphone Meeting Transcription |
| 通过人机协作强化学习实现精准灵巧的机器人操作 |
Jianlan Luo |
PDF |
N/A |
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning |
| 扩散作为推理:利用LLM偏置扩散模型增强目标导向导航 |
Yiming Ji |
PDF |
N/A |
Diffusion as Reasoning: Enhancing Object Goal Navigation with LLM-Biased Diffusion Model |
| 通过归纳对话系统进行多方面抑郁症严重程度评估 |
Chaebin Lee |
PDF |
N/A |
Multi-aspect Depression Severity Assessment via Inductive Dialogue System |
| 利用卷积块注意力和多模态数据融合提升头颈部癌症的生存预测 |
Aiman Farooq |
PDF |
N/A |
Enhanced Survival Prediction in Head and Neck Cancer Using Convolutional Block Attention and Multimodal Data Fusion |
| 体积条件模块用于控制预训练扩散模型进行三维医学图像处理 |
Suhyun Ahn |
PDF |
N/A |
Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images |
| PK-YOLO:预训练知识引导的YOLO用于多平面MRI切片中的脑肿瘤检测 |
Ming Kang |
PDF |
N/A |
PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices |
| LLM作为裁判中的自我偏好偏差 |
Koki Wataoka |
PDF |
N/A |
Self-Preference Bias in LLM-as-a-Judge |
| “认识自己”:在黑箱模型中赋予信仰者自我解释能力 |
Shaobo Wang |
PDF |
N/A |
Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models |
| SAM-Swin:基于SAM驱动的双Swin变换器,具有自适应病变增强功能,用于咽喉肿瘤检测 |
Jia Wei |
PDF |
N/A |
SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection |
| 通过非负矩阵分解重新审视广义类别发现 |
Zhong Ji |
PDF |
N/A |
A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization |
| 高效且有效的多任务模型合并的权重集成专家混合方法 |
Li Shen |
PDF |
N/A |
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging |
| SimSiam命名游戏:一种统一的表征学习与涌现通信方法 |
Nguyen Le Hoang |
PDF |
N/A |
SimSiam Naming Game: A Unified Approach for Representation Learning and Emergent Communication |
| 文本引导的注意力机制足以实现视觉-语言模型中的零样本鲁棒性 |
Lu Yu |
PDF |
N/A |
Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models |
| 具有分布不确定性的连续序列的指数一致统计分类 |
Lina Zhu |
PDF |
N/A |
Exponentially Consistent Statistical Classification of Continuous Sequences with Distribution Uncertainty |
| 带有时间最优传输奖励的机器人策略学习 |
Yuwei Fu |
PDF |
N/A |
Robot Policy Learning with Temporal Optimal Transport Reward |
| 多智能体系统中的逆向注意力代理 |
Qian Long |
PDF |
N/A |
Inverse Attention Agent for Multi-Agent System |
| 通过思维链增强对抗性攻击 |
Jingbo Su |
PDF |
N/A |
Enhancing Adversarial Attacks through Chain of Thought |
| HairDiffusion: 通过潜在扩散实现生动的多色发型编辑 |
Yu Zeng |
PDF |
N/A |
HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion |
| MARCO:多智能体实时聊天编排 |
Anubhav Shrimal |
PDF |
N/A |
MARCO: Multi-Agent Real-time Chat Orchestration |
| 利用大型语言模型(LLMs)进行逻辑推理中的假设演绎:一种神经符号方法 |
Qingchuan Li |
PDF |
N/A |
Leveraging LLMs for Hypothetical Deduction in Logical Inference: A Neuro-Symbolic Approach |
| RELATE:一个现代化的罗马尼亚语处理平台 |
Vasile Păiş |
PDF |
N/A |
RELATE: A Modern Processing Platform for Romanian Language |
| 在线镜像下降法用于多目标优化中的Tchebycheff标量化 |
Meitong Liu |
PDF |
N/A |
Online Mirror Descent for Tchebycheff Scalarization in Multi-Objective Optimization |
| Fast-OMRA:神经B帧编码的快速在线运动分辨率适应 |
Sang NguyenQuang |
PDF |
N/A |
Fast-OMRA: Fast Online Motion Resolution Adaptation for Neural B-Frame Coding |
| IntLoRA:量化扩散模型的积分低秩适应 |
Hang Guo |
PDF |
N/A |
IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models |
| DOFS:一个真实世界的3D可变形物体数据集,具有完整的空间信息,用于动力学模型学习 |
Zhen Zhang |
PDF |
N/A |
DOFS: A Real-world 3D Deformable Object Dataset with Full Spatial Information for Dynamics Model Learning |
| 通过重叠区域采样实现内存高效的点云配准 |
Tomoyasu Shimada |
PDF |
N/A |
Memory-Efficient Point Cloud Registration via Overlapping Region Sampling |
| 语言模型中对编造知识的习得与遗忘 |
Chen Sun |
PDF |
N/A |
Learning and Unlearning of Fabricated Knowledge in Language Models |
| 通过GraphSparse提示实现可靠且紧凑的图微调 |
Bo Jiang |
PDF |
N/A |
Reliable and Compact Graph Fine-tuning via GraphSparse Prompting |
| MotionGPT-2:一种用于运动生成与理解的多功能运动-语言模型 |
Yuan Wang |
PDF |
N/A |
MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding |
| 一种基于图的鲁棒聚类双适应分配方法 |
Yang Xiang |
PDF |
N/A |
A Dual Adaptive Assignment Approach for Robust Graph-Based Clustering |
| EI-Nexus:面向无中介且灵活的事件-图像数据跨模态局部特征提取与匹配 |
Zhonghua Yi |
PDF |
N/A |
EI-Nexus: Towards Unmediated and Flexible Inter-Modality Local Feature Extraction and Matching for Event-Image Data |
| 通过多智能体反思框架提升金融问答能力 |
Sorouralsadat Fatemi |
PDF |
N/A |
Enhancing Financial Question Answering with a Multi-Agent Reflection Framework |
| SS3DM:使用合成3D网格数据集对街景表面重建进行基准测试 |
Yubin Hu |
PDF |
N/A |
SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset |
| 高效重编程忆阻交叉阵列用于DNN:权重排序与比特粘连 |
Matheus Farias |
PDF |
N/A |
Efficient Reprogramming of Memristive Crossbars for DNNs: Weight Sorting and Bit Stucking |
| 让我们通过逐步推理实现自我生成:一种基于课程学习的自动化推理方法,利用大型语言模型 |
Kangyang Luo |
PDF |
N/A |
Let's Be Self-generated via Step by Step: A Curriculum Learning Approach to Automated Reasoning with Large Language Models |
| DiffSTR:用于场景文本去除的受控扩散模型 |
Sanhita Pathak |
PDF |
N/A |
DiffSTR: Controlled Diffusion Models for Scene Text Removal |
| 从经验数据估计VENDI分数的统计复杂性 |
Azim Ospanov |
PDF |
N/A |
On the Statistical Complexity of Estimating VENDI Scores from Empirical Data |
| 使用大型语言模型生成逼真的表格数据 |
Dang Nguyen |
PDF |
N/A |
Generating Realistic Tabular Data with Large Language Models |
| 一种利用大型语言模型在作者归属中发挥作用的贝叶斯方法 |
Zhengmian Hu |
PDF |
N/A |
A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution |
| 基于切片沃瑟斯坦的异常检测与局部关键峰值回扣的开放数据集 |
Julien Pallage |
PDF |
N/A |
Sliced-Wasserstein-based Anomaly Detection and Open Dataset for Localized Critical Peak Rebates |
| 多视角聚类整合锚点属性和结构信息 |
Xuetong Li |
PDF |
N/A |
Multi-view clustering integrating anchor attribute and structural information |
| 使用文本到图像扩散模型进行语义分割的无监督模态适应 |
Ruihao Xia |
PDF |
N/A |
Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation |
| AdaptGCD:用于广义类别发现的多元专家适配器调优 |
Yuxun Qu |
PDF |
N/A |
AdaptGCD: Multi-Expert Adapter Tuning for Generalized Category Discovery |
| 带有无界马尔可夫噪声的随机逼近:一个通用定理 |
Shaan Ul Haque |
PDF |
N/A |
Stochastic Approximation with Unbounded Markovian Noise: A General-Purpose Theorem |
| 依赖数据上的深度神经网络的极小极大最优性通过PAC-贝叶斯界 |
Pierre Alquier |
PDF |
N/A |
Minimax optimality of deep neural networks on dependent data via PAC-Bayes bounds |
| 深度和循环在任务多样性下的上下文学习中的作用 |
Khashayar Gatmiry |
PDF |
N/A |
On the Role of Depth and Looping for In-Context Learning with Task Diversity |
| 多任务学习对ReLU神经网络函数的影响 |
Julia Nakhleh |
PDF |
N/A |
The Effects of Multi-Task Learning on ReLU Neural Network Functions |
| CFSafety:针对大型语言模型的全面细粒度安全评估 |
Zhihao Liu |
PDF |
N/A |
CFSafety: Comprehensive Fine-grained Safety Assessment for LLMs |
| 推动全原子几何图神经网络的极限:预训练、扩展和零样本迁移 |
Zihan Pengmei |
PDF |
N/A |
Pushing the Limits of All-Atom Geometric Graph Neural Networks: Pre-Training, Scaling and Zero-Shot Transfer |
| 回顾大规模机器学习研究集群中的可靠性 |
Apostolos Kokolis |
PDF |
N/A |
Revisiting Reliability in Large-Scale Machine Learning Research Clusters |