| GeoCalib:通过几何优化学习单张图像的标定 |
Alexander Veicht |
PDF |
N/A |
GeoCalib: Learning Single-image Calibration with Geometric Optimization |
| LEIA:隐式三维关节的潜在视图不变嵌入 |
Archana Swaminathan |
PDF |
N/A |
LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation |
| 提示-AD:端到端自动驾驶中的整体对齐可解释性 |
Kairui Ding |
PDF |
N/A |
Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving |
| 关于乳腺癌检测的深度卷积神经网络、迁移学习和集成模型的研究 |
Md Taimur Ahad |
PDF |
N/A |
A study on Deep Convolutional Neural Networks, Transfer Learning and Ensemble Model for Breast Cancer Detection |
| DANCE:使用混沌增强万花筒图像的深度学习辅助蛋白质序列分析 |
Taslim Murad |
PDF |
N/A |
DANCE: Deep Learning-Assisted Analysis of Protein Sequences Using Chaos Enhanced Kaleidoscopic Images |
| HybridFC:一种用于知识图谱的混合事实核查方法 |
Umair Qudus |
PDF |
N/A |
HybridFC: A Hybrid Fact-Checking Approach for Knowledge Graphs |
| 几何平均偏好优化用于软偏好标签 |
Hiroki Furuta |
PDF |
N/A |
Geometric-Averaged Preference Optimization for Soft Preference Labels |
| 主舞台舞蹈音乐子类型分类基准测试 |
Hongzhi Shu |
PDF |
N/A |
Benchmarking Sub-Genre Classification For Mainstage Dance Music |
| 使用卷积神经网络进行血液癌症检测与分类的综合研究 |
Md Taimur Ahad |
PDF |
N/A |
A comprehensive study on Blood Cancer detection and classification using Convolutional Neural Network |
| 深度特征提取用于检测和分类急性淋巴细胞白血病(ALL)的研究 |
Sabit Ahamed Preanto |
PDF |
N/A |
A study on deep feature extraction to detect and classify Acute Lymphoblastic Leukemia (ALL) |
| GigaGS:基于平面的3D高斯分布在大规模场景表面重建中的扩展 |
Junyi Chen |
PDF |
N/A |
GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction |
| Alignist:通过融合形状和对应关系进行CAD引导的方向分布估计 |
Shishir Reddy Vutukur |
PDF |
N/A |
Alignist: CAD-Informed Orientation Distribution Estimation by Fusing Shape and Correspondences |
| E2LLM:用于长上下文理解和推理的编码器延伸大型语言模型 |
Zihan Liao |
PDF |
N/A |
E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning |
| 通过展开图拉普拉斯正则化器构建可解释的深度降噪器 |
Seyed Alireza Hosseini |
PDF |
N/A |
Constructing an Interpretable Deep Denoiser by Unrolling Graph Laplacian Regularizer |
| 灾难性损失的责任与保险:核电先例及其对人工智能的启示 |
Cristian Trout |
PDF |
N/A |
Liability and Insurance for Catastrophic Losses: the Nuclear Power Precedent and Lessons for AI |
| 为人工智能无法承保的风险提供保险:国家作为最后的保险人 |
Cristian Trout |
PDF |
N/A |
Insuring Uninsurable Risks from AI: The State as Insurer of Last Resort |
| 利用YOLO进行甜橙叶病害检测的语义分割方法 |
Sabit Ahamed Preanto |
PDF |
N/A |
A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO |
| DA-MoE:面向混合专家模型动态专家分配 |
Maryam Akhavan Aghdam |
PDF |
N/A |
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models |
| LLaMA-Omni:与大型语言模型实现无缝语音交互 |
Qingkai Fang |
PDF |
N/A |
LLaMA-Omni: Seamless Speech Interaction with Large Language Models |
| 无数据收集的掩码视频建模 |
Yuchi Ishikawa |
PDF |
N/A |
Data Collection-free Masked Video Modeling |
| 基于重力视角坐标的世界接地式人体运动恢复 |
Zehong Shen |
PDF |
N/A |
World-Grounded Human Motion Recovery via Gravity-View Coordinates |
| Sortformer:通过桥接时间戳和标记实现说话人分割和自动语音识别的无缝集成 |
Taejin Park |
PDF |
N/A |
Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens |
| KANtrol:一种基于物理信息的Kolmogorov-Arnold网络框架,用于求解多维和分数阶最优控制问题 |
Alireza Afzal Aghaei |
PDF |
N/A |
KANtrol: A Physics-Informed Kolmogorov-Arnold Network Framework for Solving Multi-Dimensional and Fractional Optimal Control Problems |
| 图像矢量化与深度:具有深度排序的凸化形状层 |
Ho Law |
PDF |
N/A |
Image Vectorization with Depth: convexified shape layers with depth ordering |
| EyeCLIP:一种用于多模态眼科图像分析的视觉-语言基础模型 |
Danli Shi |
PDF |
N/A |
EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis |
| TeXBLEU:自动评估LaTeX格式的度量标准 |
Kyudan Jung |
PDF |
N/A |
TeXBLEU: Automatic Metric for Evaluate LaTeX Format |
| MoWE-音频:多任务音频LLMs与弱编码器混合 |
Wenyu Zhang |
PDF |
N/A |
MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders |
| SaRA:使用渐进稀疏低秩适应进行高效扩散模型微调 |
Teng Hu |
PDF |
N/A |
SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation |
| 面向局部结构元素:在RGB-D数据中融合几何检测与语义验证 |
Ali Tourani |
PDF |
N/A |
Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data |
| 对Llama-3 70B进行后训练的实践:最佳选择附加语言混合比例 |
Ningyuan Xi |
PDF |
N/A |
A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio |
| 通过多任务处理探索意大利语句子嵌入的特性 |
Vivi Nastase |
PDF |
N/A |
Exploring Italian sentence embeddings properties through multi-tasking |
| MVGaussian:利用多视角引导和表面密度增强实现高保真文本到3D内容生成 |
Phu Pham |
PDF |
N/A |
MVGaussian: High-Fidelity text-to-3D Content Generation with Multi-View Guidance and Surface Densification |
| 具有缺失信息的海底栖息地图像分层多标签分类 |
Isaac Xu |
PDF |
N/A |
Hierarchical Multi-Label Classification with Missing Information for Benthic Habitat Imagery |
| 何时提取ReID特征:一种选择性方法以改进多目标跟踪 |
Emirhan Bayar |
PDF |
N/A |
When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking |
| 在执行不匹配情况下的单次模仿 |
Kushal Kedia |
PDF |
N/A |
One-Shot Imitation under Mismatched Execution |
| DemoStart:应用于多指机器人仿真到现实中的演示引导式自动课程 |
Maria Bauza |
PDF |
N/A |
DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots |
| 无标签监控自监督学习进度 |
Isaac Xu |
PDF |
N/A |
Label-free Monitoring of Self-Supervised Learning Progress |
| 提高卷积神经网络在磁共振频谱建模中的精度 |
John LaMaster |
PDF |
N/A |
Improving the Precision of CNNs for Magnetic Resonance Spectral Modeling |
| 基于模拟的场景生成,用于自主系统的鲁棒混合人工智能 |
Hambisa Keno |
PDF |
N/A |
Simulation-based Scenario Generation for Robust Hybrid AI for Autonomy |
| 基于本体的方法在自动驾驶中实现可追溯行为规范 |
Nayel Fabian Salem |
PDF |
N/A |
An Ontology-based Approach Towards Traceable Behavior Specifications in Automated Driving |
| 口咽癌原发大体肿瘤体积的交互式三维分割 |
Mikko Saukkoriipi |
PDF |
N/A |
Interactive 3D Segmentation for Primary Gross Tumor Volume in Oropharyngeal Cancer |
| 一种实用的门控循环变换器网络,结合多种融合技术用于视频去噪 |
Kai Guo |
PDF |
N/A |
A Practical Gated Recurrent Transformer Network Incorporating Multiple Fusions for Video Denoising |
| 通过怀疑建模缓解大型语言模型中的幻觉现象 |
Yetao Wu |
PDF |
N/A |
Alleviating Hallucinations in Large Language Models with Scepticism Modeling |
| GroUSE:一个用于评估接地问答中评估器的基准 |
Sacha Muller |
PDF |
N/A |
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering |
| 推进因果推断:一种非参数方法用于连续处理的ATE和CATE估计 |
Hugo Gobato Souto |
PDF |
N/A |
Advancing Causal Inference: A Nonparametric Approach to ATE and CATE Estimation with Continuous Treatments |
| 基于双分支卷积与Transformer的轻量级多尺度特征融合超分辨率网络 |
Li Ke |
PDF |
N/A |
Lightweight Multiscale Feature Fusion Super-Resolution Network Based on Two-branch Convolution and Transformer |
| Seg-HGNN:基于双曲图神经网络的无监督轻量级图像分割 |
Debjyoti Mondal |
PDF |
N/A |
Seg-HGNN: Unsupervised and Light-Weight Image Segmentation with Hyperbolic Graph Neural Networks |
| 开发时间图卷积神经网络模型以利用电子健康记录预测髋关节置换 |
Zoe Hancox |
PDF |
N/A |
Developing the Temporal Graph Convolutional Neural Network Model to Predict Hip Replacement using Electronic Health Records |
| Transtreaming:实时流媒体感知中的自适应延迟感知Transformer |
Xiang Zhang |
PDF |
N/A |
Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception |
| 半监督三维物体检测与变换等变性通道增强 |
Minju Kang |
PDF |
N/A |
Semi-Supervised 3D Object Detection with Chanel Augmentation using Transformation Equivariance |
| 量化并提升类似CLIP模型的可解释性 |
Avinash Madasu |
PDF |
N/A |
Quantifying and Enabling the Interpretability of CLIP-like Models |
| 通过多语言主谓一致性探索句子嵌入中的句法信息 |
Vivi Nastase |
PDF |
N/A |
Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement |
| 纳什需求博弈中的间接动态谈判 |
Tatiana V. Guy |
PDF |
N/A |
Indirect Dynamic Negotiation in the Nash Demand Game |
| ChatGPT在密码学误用检测中的潜力:与静态分析工具的比较分析 |
Ehsan Firouzi |
PDF |
N/A |
ChatGPT's Potential in Cryptography Misuse Detection: A Comparative Analysis with Static Analysis Tools |
| 用于物理信息深度生成建模的变分推断入门 |
Alex Glyn-Davies |
PDF |
N/A |
A Primer on Variational Inference for Physics-Informed Deep Generative Modelling |
| 学习聚合:利用图神经网络生成Chvátal-Gomory割的监督生成方法 |
Arnaud Deza |
PDF |
N/A |
Learn2Aggregate: Supervised Generation of Chvátal-Gomory Cuts Using Graph Neural Networks |
| 深度神经网络:多分类与通用逼近 |
Martín Hernández |
PDF |
N/A |
Deep Neural Networks: Multi-Classification and Universal Approximation |
| 使用最优运输模型全球贸易 |
Thomas Gaskin |
PDF |
N/A |
Modelling Global Trade with Optimal Transport |
| 从LIMA到DeepLIMA:遵循互操作性的新路径 |
Victor Bocharov |
PDF |
N/A |
From LIMA to DeepLIMA: following a new path of interoperability |
| 基于平静终端吸引子的梯度下降算法的动态解耦 |
Jinwei Zhao |
PDF |
N/A |
Dynamic Decoupling of Placid Terminal Attractor-based Gradient Descent Algorithm |
| 利用大型语言模型和叙事结构化文本嵌入映射新闻叙事 |
Jan Elfes |
PDF |
N/A |
Mapping News Narratives Using LLMs and Narrative-Structured Text Embeddings |
| PoseEmbroider:迈向一种三维、视觉、语义感知的人体姿态表示 |
Ginger Delmas |
PDF |
N/A |
PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation |
| 功能受限算法解决凸简单双层问题 |
Huaqing Zhang |
PDF |
N/A |
Functionally Constrained Algorithm Solves Convex Simple Bilevel Problems |
| MENSA:一种用于在信息性删失下进行生存分析的多事件网络 |
Christian Marius Lillelund |
PDF |
N/A |
MENSA: A Multi-Event Network for Survival Analysis under Informative Censoring |
| 理想化大气动力学中Koopman算子估计的深度学习方法 |
David Millard |
PDF |
N/A |
Deep Learning for Koopman Operator Estimation in Idealized Atmospheric Dynamics |
| 轻型机载推扫式成像光谱仪飞行中视轴校正 |
Julien Yuuki Burkhard |
PDF |
N/A |
In Flight Boresight Rectification for Lightweight Airborne Pushbroom Imaging Spectrometry |
| 通过奥林匹克运动会的视角质疑大型语言模型的内部知识结构 |
Juhwan Choi |
PDF |
N/A |
Questioning Internal Knowledge Structure of Large Language Models Through the Lens of the Olympic Games |
| 限价订单簿模拟与交易评估,采用$K$-近邻重采样方法 |
Michael Giegrich |
PDF |
N/A |
Limit Order Book Simulation and Trade Evaluation with $K$-Nearest-Neighbor Resampling |
| 钢琴音符的正弦、瞬态、噪声神经建模 |
Riccardo Simionato |
PDF |
N/A |
Sine, Transient, Noise Neural Modeling of Piano Notes |
| 在抽象层次上对齐机器和人类视觉表示 |
Lukas Muttenthaler |
PDF |
N/A |
Aligning Machine and Human Visual Representations across Abstraction Levels |
| 用于三维点云的神经拉普拉斯算子 |
Bo Pang |
PDF |
N/A |
Neural Laplacian Operator for 3D Point Clouds |
| 从精确的交换-相关势能和能量中学习局域和半局域密度泛函 |
Bikash Kanungo |
PDF |
N/A |
Learning local and semi-local density functionals from exact exchange-correlation potentials and energies |
| 通过重新平衡对比解码来缓解视觉-语言模型中的幻觉现象 |
Xiaoyu Liang |
PDF |
N/A |
Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding |
| 采用模型预测控制、强化学习和回放技术的高级计算机象棋 |
Atharva Gundawar |
PDF |
N/A |
Superior Computer Chess with Model Predictive Control, Reinforcement Learning, and Rollout |
| 动态平面图中的多尺度循环追踪 |
Farhan Rasheed |
PDF |
N/A |
Multi-scale Cycle Tracking in Dynamic Planar Graphs |
| 弱监督的地面到卫星图像配准相机定位 |
Yujiao Shi |
PDF |
N/A |
Weakly-supervised Camera Localization by Ground-to-satellite Image Registration |
| 一种有效的长尾语音识别上下文平衡适应方法 |
Yi-Cheng Wang |
PDF |
N/A |
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition |
| 一种基于机器学习的爆震胞格统计分析方法,数据来源于烟灰箔 |
Vansh Sharma |
PDF |
N/A |
A Machine Learning Based Approach for Statistical Analysis of Detonation Cells from Soot Foils |
| 持续领域增量学习在隐私保护的数字病理学中的应用 |
Pratibha Kumari |
PDF |
N/A |
Continual Domain Incremental Learning for Privacy-aware Digital Pathology |
| 使用机器学习在Linux内核中进行勒索软件检测 |
Adrian Brodzik |
PDF |
N/A |
Ransomware Detection Using Machine Learning in the Linux Kernel |
| 多模态大语言模型驱动的自动驾驶车辆场景测试 |
Qiujing Lu |
PDF |
N/A |
Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles |
| HexaCoder:通过Oracle引导的合成训练数据实现安全代码生成 |
Hossein Hajipour |
PDF |
N/A |
HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data |
| 通过训练的智能体探索学习生成互动环境 |
Naser Kazemi |
PDF |
N/A |
Learning Generative Interactive Environments By Trained Agent Exploration |
| 通过查询选择进行知识蒸馏的检测变压器 |
Yi Liu |
PDF |
N/A |
Knowledge Distillation via Query Selection for Detection Transformer |
| Prompt2Fashion:一个自动生成的时尚数据集 |
Georgia Argyro |
PDF |
N/A |
Prompt2Fashion: An automatically generated fashion dataset |
| 将可解释集成树(E2Tree)扩展到回归场景 |
Massimo Aria |
PDF |
N/A |
Extending Explainable Ensemble Trees (E2Tree) to regression contexts |
| 线性自回归学习的信息论简要分析 |
Ingvar Ziemann |
PDF |
N/A |
A Short Information-Theoretic Analysis of Linear Auto-Regressive Learning |
| 利用认知知识图谱进行学术知识组织的微调与提示工程 |
Gollam Rabby |
PDF |
N/A |
Fine-tuning and Prompt Engineering with Cognitive Knowledge Graphs for Scholarly Knowledge Organization |
| 慢集体变量的谱图、马尔可夫动力学及过渡态集合 |
Jakub Rydzewski |
PDF |
N/A |
Spectral Map for Slow Collective Variables, Markovian Dynamics, and Transition State Ensembles |
| GeMuCo:用于身体图式学习的广义多感官相关模型 |
Kento Kawaharazuka |
PDF |
N/A |
GeMuCo: Generalized Multisensory Correlational Model for Body Schema Learning |
| 一种基于似然比的未知物体分割方法 |
Nazir Nayal |
PDF |
N/A |
A Likelihood Ratio-Based Approach to Segmenting Unknown Objects |
| 未揭示的威胁:水下图像增强模型对抗鲁棒性的综合研究 |
Siyu Zhai |
PDF |
N/A |
Unrevealed Threats: A Comprehensive Study of the Adversarial Robustness of Underwater Image Enhancement Models |
| 探索大型语言模型在工业测试维护流程中的整合 |
Ludvig Lemner |
PDF |
N/A |
Exploring the Integration of Large Language Models in Industrial Test Maintenance Processes |
| 长度去敏化在定向偏好优化中的应用 |
Wei Liu |
PDF |
N/A |
Length Desensitization in Directed Preference Optimization |
| 三维场景重建中的不确定性来源 |
Marcus Klasson |
PDF |
N/A |
Sources of Uncertainty in 3D Scene Reconstruction |
| 神经网络优化中的对称性破缺:从输入维度扩展中获得的见解 |
Jun-Jie Zhang |
PDF |
N/A |
Symmetry Breaking in Neural Network Optimization: Insights from Input Dimension Expansion |
| 基于英语词典语义匹配的粗粒度感官词库 |
Masato Kikuchi |
PDF |
N/A |
Coarse-Grained Sense Inventories Based on Semantic Matching between English Dictionaries |
| AMNS:用于文本到图像人物检索的注意力加权选择性掩码与噪声标签抑制 |
Runqing Zhang |
PDF |
N/A |
AMNS: Attention-Weighted Selective Mask and Noise Label Suppression for Text-to-Image Person Retrieval |
| 一种用于识别未释读甲骨文的多字体图像检索网络 |
Zhicong Wu |
PDF |
N/A |
A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions |
| 通过多视角反思与迭代提升序列推荐 |
Weicong Qin |
PDF |
N/A |
Enhancing Sequential Recommendations through Multi-Perspective Reflections and Iteration |
| SpeechTaxi:多语言语义语音分类 |
Lennart Keller |
PDF |
N/A |
SpeechTaxi: On Multilingual Semantic Speech Classification |
| 蒸馏生成-判别表示用于极低分辨率人脸识别 |
Junzheng Zhang |
PDF |
N/A |
Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition |
| Texture-AD:一个用于真实算法开发的异常检测数据集和基准 |
Tianwu Lei |
PDF |
N/A |
Texture-AD: An Anomaly Detection Dataset and Benchmark for Real Algorithm Development |
| “一策统御”:一种端到端学习的多实体运动方法 |
Nico Bohlinger |
PDF |
N/A |
One Policy to Run Them All: an End-to-end Learning Approach to Multi-Embodiment Locomotion |
| 当你的模型是有条件的时候,扩散模型的似然性会发生什么变化? |
Mattias Cross |
PDF |
N/A |
What happens to diffusion model likelihood when your model is conditional? |
| 在深度神经网络中连接概念凸性和人机对齐 |
Teresa Dorszewski |
PDF |
N/A |
Connecting Concept Convexity and Human-Machine Alignment in Deep Neural Networks |
| 双重连续过松弛Q学习及其在深度强化学习中的扩展 |
Shreyas S R |
PDF |
N/A |
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning |
| DiffQRCoder:基于扩散的审美二维码生成,通过扫描鲁棒性引导的迭代优化实现 |
Jia-Wei Liao |
PDF |
N/A |
DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement |
| MAGDA:多智能体指南驱动的诊断辅助 |
David Bani-Harouni |
PDF |
N/A |
MAGDA: Multi-agent guideline-driven diagnostic assistance |
| 在三消游戏中利用自动化验证改进条件关卡生成 |
Monica Villanueva Aylagas |
PDF |
N/A |
Improving Conditional Level Generation using Automated Validation in Match-3 Games |
| 语音悟空:深度伪造语音检测基准测试 |
Ziwei Yan |
PDF |
N/A |
VoiceWukong: Benchmarking Deepfake Voice Detection |
| Foragax:一个基于JAX的基于代理的建模框架 |
Siddharth Chaturvedi |
PDF |
N/A |
Foragax: An Agent Based Modelling framework based on JAX |
| 计算-更新联邦学习:一种格点编码方法 |
Seyed Mohammad Azimi-Abarghouyi |
PDF |
N/A |
Compute-Update Federated Learning: A Lattice Coding Approach |
| 检索还是整体理解?Dolce:区分我们的长上下文评估任务 |
Zi Yang |
PDF |
N/A |
Retrieval Or Holistic Understanding? Dolce: Differentiate Our Long Context Evaluation Tasks |
| 粒子加速器上的自主人工智能 |
Antonin Sulc |
PDF |
N/A |
Towards Agentic AI on Particle Accelerators |
| 基于直方图的Transformer特征增强的多天气图像复原 |
Yang Wen |
PDF |
N/A |
Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement |
| 线性 bandits 的改进元-Thompson 采样及其贝叶斯遗憾分析 |
Hao Li |
PDF |
N/A |
Modified Meta-Thompson Sampling for Linear Bandits and Its Bayes Regret Analysis |
| 从LLM令牌激活中提取段落 |
Nicholas Pochinkov |
PDF |
N/A |
Extracting Paragraphs from LLM Token Activations |
| SDF-Net:一种用于对比CT图像上纵隔淋巴结检测的混合检测网络 |
Jiuli Xiong |
PDF |
N/A |
SDF-Net: A Hybrid Detection Network for Mediastinal Lymph Node Detection on Contrast CT Images |
| LAMP:可学习的元路径引导对抗对比学习用于异质图 |
Siqing Li |
PDF |
N/A |
LAMP: Learnable Meta-Path Guided Adversarial Contrastive Learning for Heterogeneous Graphs |
| G3PT:通过跨尺度查询Transformer释放自回归建模在3D生成中的力量 |
Jinzhi Zhang |
PDF |
N/A |
G3PT: Unleash the power of Autoregressive Modeling in 3D Generation via Cross-scale Querying Transformer |
| 速率受限量化以实现通信高效的联邦学习 |
Shayan Mohajer Hamidi |
PDF |
N/A |
Rate-Constrained Quantization for Communication-Efficient Federated Learning |
| PharmacoMatch:通过神经子图匹配实现高效的三维药效团筛选 |
Daniel Rose |
PDF |
N/A |
PharmacoMatch: Efficient 3D Pharmacophore Screening through Neural Subgraph Matching |
| 在卷积神经网络(CNN)中使用Seam Carving作为特征池化 |
Mohammad Imrul Jubair |
PDF |
N/A |
Seam Carving as Feature Pooling in CNN |
| PPMamba:一种基于金字塔池化局部辅助SSM的遥感图像语义分割模型 |
Yin Hu |
PDF |
N/A |
PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation |
| 一种端到端的和弦条件歌曲生成方法 |
Shuochen Gao |
PDF |
N/A |
An End-to-End Approach for Chord-Conditioned Song Generation |
| 基于基础模型的高性能少样本分割:一项实证研究 |
Shijie Chang |
PDF |
N/A |
High-Performance Few-Shot Segmentation with Foundation Models: An Empirical Study |
| 一个属性丰富的数据集和开放检测的自动标注管道 |
Pengfei Qi |
PDF |
N/A |
An Attribute-Enriched Dataset and Auto-Annotated Pipeline for Open Detection |
| 通过基于层次事件的记忆增强长视频理解 |
Dingxin Cheng |
PDF |
N/A |
Enhancing Long Video Understanding via Hierarchical Event-Based Memory |
| 用户对大型语言模型与基于模板的电影推荐解释的偏好:一项初步研究 |
Julien Albert |
PDF |
N/A |
User Preferences for Large Language Model versus Template-Based Explanations of Movie Recommendations: A Pilot Study |
| EntAugment:基于熵驱动的自适应数据增强框架,用于图像分类 |
Suorong Yang |
PDF |
N/A |
EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification |
| 使用LLM自动化量化投资中的策略发现 |
Zhizhuo Kou |
PDF |
N/A |
Automate Strategy Finding with LLM in Quant investment |
| 使用重建作为序列的上下文增强的统一无监督异常检测 |
Hui-Yue Yang |
PDF |
N/A |
Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection |
| 从时间序列预测模型库中学习增强策略 |
Haochen Yuan |
PDF |
N/A |
Learning Augmentation Policies from A Model Zoo for Time Series Forecasting |
| 《猫捉老鼠》:检测深度学习模型中的未授权数据使用 |
Zitao Chen |
PDF |
N/A |
Catch Me if You Can: Detecting Unauthorized Data Use in Deep Learning Models |
| Ferret: 大规模语言模型的联邦全参数微调 |
Yao Shu |
PDF |
N/A |
Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models |
| 全球敏感性分析的新范式 |
Gildas Mazo |
PDF |
N/A |
A new paradigm for global sensitivity analysis |
| 面向鲁棒不确定性感知的不完全多视图分类 |
Mulin Chen |
PDF |
N/A |
Towards Robust Uncertainty-Aware Incomplete Multi-View Classification |
| 马氏距离k-NN:一种用于鲁棒点云配准的统计视角 |
Tejas Anvekar |
PDF |
N/A |
Mahalanobis k-NN: A Statistical Lens for Robust Point-Cloud Registrations |
| 关键词感知的自动语音识别错误增强,用于鲁棒的对话状态跟踪 |
Jihyun Lee |
PDF |
N/A |
Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking |
| ALSS-YOLO:一种适用于无人机影像中红外野生动物检测的自适应轻量级通道分割与混洗网络 |
Ang He |
PDF |
N/A |
ALSS-YOLO: An Adaptive Lightweight Channel Split and Shuffling Network for TIR Wildlife Detection in UAV Imagery |
| 供应链网络中新闻流的市场反应 |
Hiroyasu Inoue |
PDF |
N/A |
Market Reaction to News Flows in Supply Chain Networks |
| 推理即一切:基于ChatGPT的跨领域对话状态追踪自示例检索器 |
Jihyun Lee |
PDF |
N/A |
Inference is All You Need: Self Example Retriever for Cross-domain Dialogue State Tracking with ChatGPT |
| DiPT:通过多样化视角提升大型语言模型的推理能力 |
Hoang Anh Just |
PDF |
N/A |
DiPT: Enhancing LLM reasoning through diversified perspective-taking |
| 测试时可验证的自监督学习方法用于弥合基于事件的卫星姿态估计中的仿真与现实差距 |
Mohsi Jawaid |
PDF |
N/A |
Test-Time Certifiable Self-Supervision to Bridge the Sim2Real Gap in Event-Based Satellite Pose Estimation |
| 用于静态图像的循环神经网络 |
Dmitri |
PDF |
N/A |
Recurrent Neural Networks for Still Images |
| 一种用于多层次细节的潜在隐式三维形状模型 |
Benoit Guillard |
PDF |
N/A |
A Latent Implicit 3D Shape Model for Multiple Levels of Detail |
| 基于自然语言处理的学术论文库与搜索引擎:以网络风险文献为例——CyLit案例研究 |
Linfeng Zhang |
PDF |
N/A |
NLP-Powered Repository and Search Engine for Academic Papers: A Case Study on Cyber Risk Literature with CyLit |
| MIP-GAF:一个用于最重要人物定位和群体上下文理解的MLLM注释基准 |
Surbhi Madan |
PDF |
N/A |
MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context Understanding |
| 增强大型音频语言模型在音频问答中的时间理解能力 |
Arvind Krishna Sridhar |
PDF |
N/A |
Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models |
| 利用多语言语义嵌入推进广播语音的主题分割 |
Sakshi Deo Shukla |
PDF |
N/A |
Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings |
| CerviXpert:一种用于预测宫颈类型和宫颈细胞异常的多结构卷积神经网络 |
Rashik Shahriar Akash |
PDF |
N/A |
CerviXpert: A Multi-Structural Convolutional Neural Network for Predicting Cervix Type and Cervical Cell Abnormalities |
| 去噪:成像、逆问题和机器学习中的强大基础组件 |
Peyman Milanfar |
PDF |
N/A |
Denoising: A Powerful Building-Block for Imaging, Inverse Problems, and Machine Learning |
| DACAT:用于鲁棒在线手术阶段识别的双流自适应剪辑感知时间建模 |
Kaixiang Yang |
PDF |
N/A |
DACAT: Dual-stream Adaptive Clip-aware Time Modeling for Robust Online Surgical Phase Recognition |
| SubRegWeigh:利用子词正则化实现有效且高效的标注权重分配 |
Kohei Tsuji |
PDF |
N/A |
SubRegWeigh: Effective and Efficient Annotation Weighing with Subword Regularization |
| 面向泛化场景变化检测 |
Jaewoo Kim |
PDF |
N/A |
Towards Generalizable Scene Change Detection |
| STUN:用于可扩展MoE剪枝的结构化-然后-非结构化剪枝 |
Jaeseong Lee |
PDF |
N/A |
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning |
| INTRA:交互关系感知的弱监督功能基础 |
Ji Ha Jang |
PDF |
N/A |
INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding |
| 自适应变换器密度函数建模在非参数生存分析中的应用 |
Xin Zhang |
PDF |
N/A |
Adaptive Transformer Modelling of Density Function for Nonparametric Survival Analysis |
| AgileIR:用于敏捷图像恢复的内存高效组移位窗口注意力机制 |
Hongyi Cai |
PDF |
N/A |
AgileIR: Memory-Efficient Group Shifted Windows Attention for Agile Image Restoration |
| SHAPE-IT:利用大型语言模型探索生成形状变化行为的文本到形状显示 |
Wanli Qian |
PDF |
N/A |
SHAPE-IT: Exploring Text-to-Shape-Display for Generative Shape-Changing Behaviors with LLMs |
| RealisDance:为可控角色动画配备逼真的手部动作 |
Jingkai Zhou |
PDF |
N/A |
RealisDance: Equip controllable character animation with realistic hands |
| 用于低剂量PET-MR成像的潜在空间特征的深度核表示,对剂量减少变化具有鲁棒性 |
Cameron Dennis Pain |
PDF |
N/A |
Deep kernel representations of latent space features for low-dose PET-MR imaging robust to variable dose reduction |
| UdeerLID+:结合激光雷达、图像和相对深度与半监督 |
Tao Ni |
PDF |
N/A |
UdeerLID+: Integrating LiDAR, Image, and Relative Depth with Semi-Supervised |
| MTDA-HSED:异质声音事件检测中的互助调优与双分支聚合 |
Zehao Wang |
PDF |
N/A |
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection |
| NOVI:基于BERT和大型语言模型的大学新生聊天机器人系统 |
Yoonji Nam |
PDF |
N/A |
NOVI : Chatbot System for University Novice with BERT and LLMs |
| 多源音乐生成与潜在扩散 |
Zhongweiyang Xu |
PDF |
N/A |
Multi-Source Music Generation with Latent Diffusion |
| MyGo:通过相机控制实现一致且可控的多视角驾驶视频生成 |
Yining Yao |
PDF |
N/A |
MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control |
| 基于瓶颈的编码器-解码器架构(BEAR)用于学习无偏的消费者对消费者图像表示 |
Pablo Rivas |
PDF |
N/A |
Bottleneck-based Encoder-decoder ARchitecture (BEAR) for Learning Unbiased Consumer-to-Consumer Image Representations |
| 大型语言模型能否解锁新颖的科学研究思路? |
Sandeep Kumar |
PDF |
N/A |
Can Large Language Models Unlock Novel Scientific Research Ideas? |
| EDADepth:用于单目深度估计的增强数据增强 |
Nischal Khanal |
PDF |
N/A |
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation |
| 优化批量转录组测序的监督机器学习样本量:一种学习曲线方法 |
Yunhui Qi |
PDF |
N/A |
Optimizing Sample Size for Supervised Machine Learning with Bulk Transcriptomic Sequencing: A Learning Curve Approach |
| 负责任的区块链:STEADI原则与基于行动者网络理论的开发方法论(ANT-RDM) |
Yibai Li |
PDF |
N/A |
Responsible Blockchain: STEADI Principles and the Actor-Network Theory-based Development Methodology (ANT-RDM) |
| SQLucid:通过交互式解释实现自然语言数据库查询的接地 |
Yuan Tian |
PDF |
N/A |
SQLucid: Grounding Natural Language Database Queries with Interactive Explanations |
| 更大的语言模型并不关心你的思考方式:为何思维链提示在主观任务中失效 |
Georgios Chochlakis |
PDF |
N/A |
Larger Language Models Don't Care How You Think: Why Chain-of-Thought Prompting Fails in Subjective Tasks |
| 通过梯度匹配实现点云补全的损失蒸馏,使用加权倒角距离 |
Fangzhou Lin |
PDF |
N/A |
Loss Distillation via Gradient Matching for Point Cloud Completion with Weighted Chamfer Distance |
| VE:利用变量嵌入建模多元时间序列的相关性 |
Shangjiong Wang |
PDF |
N/A |
VE: Modeling Multivariate Time Series Correlation with Variate Embedding |
| 回顾视觉-语言模型的提示预训练 |
Zhenyuan Chen |
PDF |
N/A |
Revisiting Prompt Pretraining of Vision-Language Models |
| 深度学习与大型语言模型在预测中国心理支持热线中的自杀行为中的音频与文本分析应用 |
Yining Chen |
PDF |
N/A |
Deep Learning and Large Language Models for Audio and Text Analysis in Predicting Suicidal Acts in Chinese Psychological Support Hotlines |
| MCDGLN:基于掩码连接的动态图学习网络用于自闭症谱系障碍 |
Peng Wang |
PDF |
N/A |
MCDGLN: Masked Connection-based Dynamic Graph Learning Network for Autism Spectrum Disorder |
| Shapley值的因果分析:条件与边际 |
Ilya Rozenfeld |
PDF |
N/A |
Causal Analysis of Shapley Values: Conditional vs. Marginal |
| UniLearn:通过在图像和视频上进行统一预训练和微调,增强动态面部表情识别 |
Yin Chen |
PDF |
N/A |
UniLearn: Enhancing Dynamic Facial Expression Recognition through Unified Pre-Training and Fine-Tuning on Images and Videos |
| 多类别心律失常分类:利用智能手表光电容积脉搏波信号在真实生活场景中采集的数据 |
Dong Han |
PDF |
N/A |
Multiclass Arrhythmia Classification using Smartwatch Photoplethysmography Signals Collected in Real-life Settings |
| 配置相互作用引导的采样与可解释的受限玻尔兹曼机 |
Jorge I. Hernandez-Martinez |
PDF |
N/A |
Configuration Interaction Guided Sampling with Interpretable Restricted Boltzmann Machine |
| 变分搜索分布 |
Daniel M. Steinberg |
PDF |
N/A |
Variational Search Distributions |
| 绘制音频:利用多指令进行视频到音频的合成 |
Qi Yang |
PDF |
N/A |
Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis |
| 通过LFR教学法加速大型语言模型预训练:学习、专注和复习 |
Neha Prakriya |
PDF |
N/A |
Accelerating Large Language Model Pretraining via LFR Pedagogy: Learn, Focus, and Review |
| 基于后门模型的水印技术弱点:信息论视角 |
Aoting Hu |
PDF |
N/A |
On the Weaknesses of Backdoor-based Model Watermarking: An Information-theoretic Perspective |
| DECOLLAGE:通过可控、局部化和学习的几何增强实现3D细节化 |
Qimin Chen |
PDF |
N/A |
DECOLLAGE: 3D Detailization by Controllable, Localized, and Learned Geometry Enhancement |
| 对比联邦学习与表格数据孤岛 |
Achmad Ginanjar |
PDF |
N/A |
Contrastive Federated Learning with Tabular Data Silos |
| 案例研究:利用生成式人工智能构建基于人工智能的代理模型和回归模型,用于模拟聚变能源科学中的射频加热 |
E. Wes Bethel |
PDF |
N/A |
Case Study: Leveraging GenAI to Build AI-based Surrogates and Regressors for Modeling Radio Frequency Heating in Fusion Energy Science |