Arxiv 2024-11-21 Papers

标题	作者	PDF链接	代码仓库	Title
Insight-V：借助多模态大型语言模型探索长链视觉推理	Yuhao Dong	PDF	N/A	Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
稳定流：无需训练的图像编辑的关键层	Omri Avrahami	PDF	N/A	Stable Flow: Vital Layers for Training-Free Image Editing
回顾卷积与注意力在视觉骨干网络中的融合	Lei Zhu	PDF	N/A	Revisiting the Integration of Convolution and Attention for Vision Backbone
敲击芯片：硬件中心出口管制的徒劳无功	Ritwik Gupta	PDF	N/A	Whack-a-Chip: The Futility of Hardware-Centric Export Controls
通过领域混合学习公平鲁棒性	Meiyu Zhong	PDF	N/A	Learning Fair Robustness via Domain Mixup
释放多模态基础模型和视频扩散在4D动态物理场景模拟中的潜力	Zhuoman Liu	PDF	N/A	Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation
从循环神经网络到基础模型：商业建筑能耗的实证研究	Shourya Bose	PDF	N/A	From RNNs to Foundation Models: An Empirical Study on Commercial Building Energy Consumption
多模态三维脑肿瘤分割与对抗训练和条件随机场	Lan Jiang	PDF	N/A	Multimodal 3D Brain Tumor Segmentation with Adversarial Training and Conditional Random Field
量子机器学习模型的对抗性投毒攻击	Satwik Kundu	PDF	N/A	Adversarial Poisoning Attack on Quantum Machine Learning Models
用于车辆路径问题的多智能体环境	Ricardo Gama	PDF	N/A	Multi-Agent Environments for Vehicle Routing Problems
Marco-o1：面向开放式解决方案的开放推理模型	Yu Zhao	PDF	N/A	Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
解决假设驱动信念-MDP中的多动态模型不确定性	Ofer Dagan	PDF	N/A	Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs
基于生成对抗网络的无人机着陆轨迹预测	Jun Xiang	PDF	N/A	Landing Trajectory Prediction for UAS Based on Generative Adversarial Network
大型视觉编码器多模态自回归预训练	Enrico Fini	PDF	N/A	Multimodal Autoregressive Pre-training of Large Vision Encoders
超越训练：动态令牌合并用于零样本视频理解	Yiming Zhang	PDF	N/A	Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding
使用微调BERT嵌入的轻量级安全护栏	Aaron Zheng	PDF	N/A	Lightweight Safety Guardrails Using Fine-tuned BERT Embeddings
词性标注以突出句子的骨架结构	Grigorii Churakov	PDF	N/A	POS-tagging to highlight the skeletal structure of sentences
无序系统结构表征的持久性同调	An Wang	PDF	N/A	Persistent Homology for Structural Characterization in Disordered Systems
通过自动化病变分割提升胃出血诊断精准度：一种深度DuS-KFCM方法	Xian-Xian Liu	PDF	N/A	Enhancing Diagnostic Precision in Gastric Bleeding through Automated Lesion Segmentation: A Deep DuS-KFCM Approach
将高斯喷溅烘焙进扩散去噪器，实现快速且可扩展的单阶段图像到3D生成	Yuanhao Cai	PDF	N/A	Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation
CoNFiLD-inlet：使用生成性潜在扩散模型与神经场的合成湍流入口	Xin-Yang Liu	PDF	N/A	CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields
自动驾驶中的强化学习模型检查：你能做的比你想象的更多！	Rong Gu	PDF	N/A	Model Checking for Reinforcement Learning in Autonomous Driving: One Can Do More Than You Think!
使用形式化模型、安全防护和认证控制来验证基于人工智能的列车系统	Jan Gruteser	PDF	N/A	Using Formal Models, Safety Shields and Certified Control to Validate AI-Based Train Systems
合成针对具有循环任务的机器人集体的鲁棒控制器：一个案例研究	Till Schnittka	PDF	N/A	Synthesising Robust Controllers for Robot Collectives with Recurrent Tasks: A Case Study
协作机器人焊接同步特性的模型检验与验证	Yvonne Murray	PDF	N/A	Model Checking and Verification of Synchronisation Properties of Cobot Welding
RV4Chatbot：聊天机器人是否能梦见电子羊？	Andrea Gatti	PDF	N/A	RV4Chatbot: Are Chatbots Allowed to Dream of Electric Sheep?
ROSMonitoring 2.0：将ROS运行时验证扩展到服务和有序主题	Maryam Ghaffari Saadat	PDF	N/A	ROSMonitoring 2.0: Extending ROS Runtime Verification to Services and Ordered Topics
InCrowd-VI：一个用于评估室内行人密集空间中SLAM（同步定位与地图构建）系统在人类导航中的真实视觉惯性数据集	Marziyeh Bamdad	PDF	N/A	InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation
利用机器学习和卫星数据进行局部与全局建模的对比研究：以非洲萨瓦纳地区树冠高度估算为例	Esther Rolf	PDF	N/A	Contrasting local and global modeling with machine learning and satellite data: A case study estimating tree canopy height in African savannas
利用深度学习和扩散模型提升医学图像分割	Houze Liu	PDF	N/A	Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models
无差别干扰多元高斯分布条件推断	William N. Caballero	PDF	N/A	Indiscriminate Disruption of Conditional Inference on Multivariate Gaussians
在具有高斯边缘分布的情况下，对任意ReLU激活函数的不可知学习	Anxin Guo	PDF	N/A	Agnostic Learning of Arbitrary ReLU Activation under Gaussian Marginals
DINO-X: 一种用于开放世界物体检测与理解的统一视觉模型	Tianhe Ren	PDF	N/A	DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
共识层剪枝：三赢解决方案	Leandro Giusti Mugnaini	PDF	N/A	Layer Pruning with Consensus: A Triple-Win Solution
通过Koszul-Young展平实现的超完备张量分解	Pravesh K. Kothari	PDF	N/A	Overcomplete Tensor Decomposition via Koszul-Young Flattenings
统一爬取：为低资源语言上的大型语言模型提供经济实惠的适应性的综合通用爬取	Bethel Melesse Tessema	PDF	N/A	UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages
自适应估计平均处理效应的对数尼曼后悔	Ojash Neopane	PDF	N/A	Logarithmic Neyman Regret for Adaptive Estimation of the Average Treatment Effect
SplatR：利用3D高斯喷射和密集特征匹配实现目标视觉重排	Arjun P S	PDF	N/A	SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching
Velocitune：一种基于速度的持续预训练动态域重加权方法	Zheheng Luo	PDF	N/A	Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
无模型概率流学习：阐明群体行为的非平衡动力学	Nicholas M. Boffi	PDF	N/A	Model-free learning of probability flows: Elucidating the nonequilibrium dynamics of flocking
通过平方和实现接近崩溃点的离群点鲁棒均值估计	Hongjie Chen	PDF	N/A	Outlier-robust Mean Estimation near the Breakdown Point via Sum-of-Squares
代码调试练习的自动生成	Victor-Alexandru Pădurean	PDF	N/A	Automated Generation of Code Debugging Exercises
通过使用平滑的一次性增强预测器，利用神经架构搜索（NAS）改进布线预测	Arjun Sridhar	PDF	N/A	Improving Routability Prediction via NAS Using a Smooth One-shot Augmented Predictor
StereoCrafter-Zero：通过噪声重启实现零样本立体视频生成	Jian Shi	PDF	N/A	StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart
机器学习框架用于预测脂质纳米粒子在核酸递送中的表现	Gaurav Kumar	PDF	N/A	Machine learning framework to predict the performance of lipid nanoparticles for nucleic acid delivery
通过非平衡熵产生实现细胞骨架结构的适应性灵活性	Yuika Ueda	PDF	N/A	Adaptive flexibility of cytoskeletal structures through nonequilibrium entropy production
关于具有等变性、局部性和权重共享的一隐层网络的样本复杂度	Arash Behboodi	PDF	N/A	On the Sample Complexity of One Hidden Layer Networks with Equivariance, Locality and Weight Sharing
EasyHOI：释放大型模型在野外重建手-物交互中的力量	Yumeng Liu	PDF	N/A	EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild
超越文本：通过多模态双重注意力和软图像引导减少大型视觉-语言模型中的语言偏见	Haozhe Zhao	PDF	N/A	Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance
知识图谱中的神经符号查询优化	Maribel Acosta	PDF	N/A	Neuro-Symbolic Query Optimization in Knowledge Graphs
使用小型语言模型高效地进行基于方面的气候变化报告摘要	Iacopo Ghinassi	PDF	N/A	Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models
通过薛定谔桥引导的磁共振成像重建	Yue Wang	PDF	N/A	Guided MRI Reconstruction via Schrödinger Bridge
使用变分自编码器生成逼真的业务流程对抗样本	Alexander Stevens	PDF	N/A	Generating Realistic Adversarial Examples for Business Processes using Variational Autoencoders
知识图谱、大型语言模型与幻觉：一个自然语言处理视角	Ernests Lavrinovics	PDF	N/A	Knowledge Graphs, Large Language Models, and Hallucinations: An NLP Perspective
我了解这个实体吗？语言模型中的知识意识与幻觉	Javier Ferrando	PDF	N/A	Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models
基于BERT的方法，利用可解释的人工智能自动化构建课程衔接矩阵	Natenaile Asmamaw Shiferaw	PDF	N/A	BERT-Based Approach for Automating Course Articulation Matrix Construction with Explainable AI
意图感知对话生成与多任务对比学习在多轮意图分类中的应用	Junhua Liu	PDF	N/A	Intent-Aware Dialogue Generation and Multi-Task Contrastive Learning for Multi-Turn Intent Classification
自然语言强化学习	Xidong Feng	PDF	N/A	Natural Language Reinforcement Learning
CP-UNet：基于轮廓的概率模型用于医学超声图像分割	Ruiguo Yu	PDF	N/A	CP-UNet: Contour-based Probabilistic Model for Medical Ultrasound Images Segmentation
黑箱机器人学习的仿真辅助策略调优	Shiming He	PDF	N/A	Simulation-Aided Policy Tuning for Black-Box Robot Learning
AnywhereDoor：针对目标检测的多目标后门攻击	Jialin Lu	PDF	N/A	AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection
FocusLLaVA：一种高效且有效的视觉令牌压缩的由粗到细方法	Yuke Zhu	PDF	N/A	FocusLLaVA: A Coarse-to-Fine Approach for Efficient and Effective Visual Token Compression
迈向情境丰富的自动化生物多样性评估：从相机陷阱数据中提取人工智能驱动的洞察	Paul Fergus	PDF	N/A	Towards Context-Rich Automated Biodiversity Assessments: Deriving AI-Powered Insights from Camera Trap Data
评估大型语言模型中类比推理的鲁棒性	Martha Lewis	PDF	N/A	Evaluating the Robustness of Analogical Reasoning in Large Language Models
用于电力电子系统中自动调制设计的物理信息引导的大型语言模型代理	Junhua Liu	PDF	N/A	Physics-Informed LLM-Agent for Automated Modulation Design in Power Electronics Systems
生成式外延以增强短视频的记忆性	Alan Byju	PDF	N/A	Generative Outpainting To Enhance the Memorability of Short-Form Videos
HARP：一个大规模的高阶Ambisonic房间脉冲响应数据集	Shivam Saini	PDF	N/A	HARP: A Large-Scale Higher-Order Ambisonic Room Impulse Response Dataset
基于视频扩散先验的视角外推	Kunhao Liu	PDF	N/A	Novel View Extrapolation with Video Diffusion Priors
这个生成的人物在现实世界中存在吗？细粒度检测和校准异常人体	Zeqing Wang	PDF	N/A	Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body
通过贝叶斯神经网络中基于相关性的参数更新实现高效持续学习的修正正则化	Sanchar Palit	PDF	N/A	Revised Regularization for Efficient Continual Learning through Correlation-Based Parameter Update in Bayesian Neural Networks
区域注意力用于阴影去除	Hengxing Liu	PDF	N/A	Regional Attention for Shadow Removal
OpenScholar：利用检索增强型语言模型合成科学文献	Akari Asai	PDF	N/A	OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs
为什么语言模型在形态复杂的语言上表现更差？	Catherine Arnett	PDF	N/A	Why do language models perform worse for morphologically complex languages?
通过矩神经网络对工作记忆中的不确定性量化	Hengyuan Ma	PDF	N/A	Uncertainty Quantification in Working Memory via Moment Neural Networks
ComfyGI：图像生成工作流程的自动改进	Dominik Sobania	PDF	N/A	ComfyGI: Automatic Improvement of Image Generation Workflows
学习从实验数据中利用图神经网络进行孔隙尺度多相流模拟	Yuxuan Gu	PDF	N/A	Learning Pore-scale Multi-phase Flow from Experimental Data with Graph Neural Network
深度学习方法结合LIME可解释AI技术用于增强口腔鳞状细胞癌的诊断	Samiha Islam	PDF	N/A	Deep Learning Approach for Enhancing Oral Squamous Cell Carcinoma with LIME Explainable AI Technique
竞争对手Former：用于3D实例分割的竞争对手Transformer	Duanchu Wang	PDF	N/A	CompetitorFormer: Competitor Transformer for 3D Instance Segmentation
时空解耦用于高效基于视觉的占用预测	Jingyi Xu	PDF	N/A	Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting
多机混合事件-B的自治系统安全属性	Richard Banach	PDF	N/A	Autonomous System Safety Properties with Multi-Machine Hybrid Event-B
SPARKLE：一个统一的单循环主对偶框架，用于去中心化的双层优化	Shuchen Zhu	PDF	N/A	SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization
FoPru: 高效大型视觉语言模型的焦点剪枝	Lei Jiang	PDF	N/A	FoPru: Focal Pruning for Efficient Large Vision-Language Models
创建经过形式验证的神经网络用于自主导航：经验报告	Syed Ali Asadullah Bukhari	PDF	N/A	Creating a Formally Verified Neural Network for Autonomous Navigation: An Experience Report
点云去噪与细粒度动态图卷积网络	Wenqiang Xu	PDF	N/A	Point Cloud Denoising With Fine-Granularity Dynamic Graph Convolutional Networks
基于Moore-Penrose伪逆的可微分奇异值分解用于逆成像问题	Yinghao Zhang	PDF	N/A	Differentiable SVD based on Moore-Penrose Pseudoinverse for Inverse Imaging Problems
视觉上下文澄清含糊表达：基准数据集	Heejeong Nam	PDF	N/A	Visual Contexts Clarify Ambiguous Expressions: A Benchmark Dataset
GASP：高效生成用于越狱LLM的黑盒对抗性后缀	Advik Raj Basani	PDF	N/A	GASP: Efficient Black-Box Generation of Adversarial Suffixes for Jailbreaking LLMs
表观遗传性癌发生的证据：癌症研究的一个转折点	Jean-Pascal Capp	PDF	N/A	Evidence of epigenetic oncogenesis: a turning point in cancer research
RestorerID：实现无调优人脸修复与身份保留	Jiacheng Ying	PDF	N/A	RestorerID: Towards Tuning-Free Face Restoration with ID Preservation
从“傻瓜”问题中学习能提升大型语言模型，但效果仅微乎其微	Tingyuan Zhu	PDF	N/A	Learning from "Silly" Questions Improves Large Language Models, But Only Slightly
点云重采样与可学习的热扩散	Wenqiang Xu	PDF	N/A	Point Cloud Resampling with Learnable Heat Diffusion
基于多视角遥感的不确定性感知回归用于社会经济估计	Fan Yang	PDF	N/A	Uncertainty-Aware Regression for Socio-Economic Estimation via Multi-View Remote Sensing
伞形强化学习——解决复杂非线性问题的计算高效工具	Egor E. Nuzhin	PDF	N/A	Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems
基于伴随的两层准地转斜压湍流在线学习	Fei Er Yan	PDF	N/A	Adjoint-based online learning of two-layer quasi-geostrophic baroclinic turbulence
迷失在推理中：重新发现自然语言推理在大语言模型中的作用	Lovish Madaan	PDF	N/A	Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models
BEST-STD：面向语音检索的双向Mamba增强语音分词技术	Anup Singh	PDF	N/A	BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection
WARLearn：天气自适应表示学习	Shubham Agarwal	PDF	N/A	WARLearn: Weather-Adaptive Representation Learning
GNN-MultiFix：解决GNN在多标签节点分类中的缺陷	Tianqi Zhao	PDF	N/A	GNN-MultiFix: Addressing the pitfalls for GNNs for multi-label node classification
MetaCropFollow：利用元学习进行少样本适应的冠层下导航	Thomas Woehrle	PDF	N/A	MetaCropFollow: Few-Shot Adaptation with Meta-Learning for Under-Canopy Navigation
通过逃离过去来探索	Paul-Antoine Le Tolguenec	PDF	N/A	Exploration by Running Away from the Past
使用递归特征机和多尺度指纹的可解释定量结构-性质关系建模	Jiaxuan Shen	PDF	N/A	Interpretable QSPR Modeling using Recursive Feature Machines and Multi-scale Fingerprints
用于射电天文学源分类的自监督学习：一个基准	Thomas Cecconello	PDF	N/A	Self-supervised learning for radio-astronomy source classification: a benchmark
在普朗克尺度上的意义？科学史、哲学和社会学的语境化词嵌入	Arno Simons	PDF	N/A	Meaning at the Planck scale? Contextualized word embeddings for doing history, philosophy, and sociology of science
用于改进专利文本摘要的主从编码器模型：一种结合说明书和权利要求的新方法	Shu Zhou	PDF	N/A	The Master-Slave Encoder Model for Improving Patent Text Summarization: A New Approach to Combining Specifications and Claims
多任务LoRA与视觉的结合：通过合并多个适配器来构建一个多任务模型	Ege Kesim	PDF	N/A	Multi LoRA Meets Vision: Merging multiple adapters to create a multi task model
MMGenBench：从文本到图像生成的角度评估大模型（LMMs）的极限	Hailang Huang	PDF	N/A	MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective
DRPruning：通过分布稳健优化实现高效的大型语言模型剪枝	Hexuan Deng	PDF	N/A	DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
功能聊天-基准：全面评估语言模型在韩语工具使用对话中的生成能力	Shinbok Lee	PDF	N/A	FunctionChat-Bench: Comprehensive Evaluation of Language Models' Generative Capabilities in Korean Tool-use Dialogs
立体任意：统一立体匹配与大规模混合数据	Xianda Guo	PDF	N/A	Stereo Anything: Unifying Stereo Matching with Large-Scale Mixed Data
分布外检测与多样化（可证明地）	Haiyun Yao	PDF	N/A	Out-Of-Distribution Detection with Diversification (Provably)
REFOL：面向交通流量预测的资源高效联邦在线学习	Qingxiang Liu	PDF	N/A	REFOL: Resource-Efficient Federated Online Learning for Traffic Flow Forecasting
预测未来国际事件：基于文本的事件建模的可靠数据集	Daehoon Gwak	PDF	N/A	Forecasting Future International Events: A Reliable Dataset for Text-Based Event Modeling
使用深度学习技术进行子宫超声图像的描述	Abdennour Boulesnane	PDF	N/A	Uterine Ultrasound Image Captioning Using Deep Learning Techniques
训练多层感知器掌握异构图结构知识，以实现高效且准确的推理	Yunhui Liu	PDF	N/A	Teaching MLPs to Master Heterogeneous Graph-Structured Knowledge for Efficient and Accurate Inference
评估透明导电材料带隙和电导率的数据驱动预测	Federico Ottomano	PDF	N/A	Assessing data-driven predictions of band gap and electrical conductivity for transparent conducting materials
多LLM代理系统：技术与商业视角	Yingxuan Yang	PDF	N/A	Multi-LLM-Agent Systems: Techniques and Business Perspectives
Q-Learning中的时间尺度分离：扩展TD($\triangle$)以实现动作值函数分解	Mahammad Humayoo	PDF	N/A	Time-Scale Separation in Q-Learning: Extending TD($\triangle$) for Action-Value Function Decomposition
使用MRI肿瘤标注对2D术中超声图像进行自动脑肿瘤分割	Mathilde Faanes	PDF	N/A	Automatic brain tumor segmentation in 2D intra-operative ultrasound images using MRI tumor annotations
道路网络和网格上的轨迹表示学习与时空动态	Stefan Schestakov	PDF	N/A	Trajectory Representation Learning on Road Networks and Grids with Spatio-Temporal Dynamics
通过声码器指纹在开放世界环境中对伪造语音进行单模型归因	Matías Pizarro	PDF	N/A	Single-Model Attribution for Spoofed Speech via Vocoder Fingerprints in an Open-World Setting
逻辑增强生成	Aldo Gangemi	PDF	N/A	Logic Augmented Generation
GPT与人类：揭示在对话生成型AI赋能的多机器人系统中的伦理问题	Rebekah Rousi	PDF	N/A	GPT versus Humans: Uncovering Ethical Concerns in Conversational Generative AI-empowered Multi-Robot Systems
基于图的近似最近邻搜索算法在边缘设备上的实验比较	Ali Ganbarov	PDF	N/A	Experimental comparison of graph-based approximate nearest neighbor search algorithms on edge devices
用于因果扰动建模的生成干预模型	Nora Schneider	PDF	N/A	Generative Intervention Models for Causal Perturbation Modeling
SEMPose：一种用于多目标姿态估计的单端到端网络	Xin Liu	PDF	N/A	SEMPose: A Single End-to-end Network for Multi-object Pose Estimation
基于全切片图像的生存预测中的图域自适应：双分支编码器与双层对齐	Yuntao Shou	PDF	N/A	Graph Domain Adaptation with Dual-branch Encoder and Two-level Alignment for Whole Slide Image-based Survival Prediction
在高阶平滑和过度参数化条件下的加速零阶随机梯度下降	Georgii Bychkov	PDF	N/A	Accelerated zero-order SGD under high-order smoothness and overparameterized regime
镜像目标YOLO：一种改进的YOLOv8方法，结合间接视觉用于文化遗产建筑火灾检测	Jian Liang	PDF	N/A	Mirror Target YOLO: An Improved YOLOv8 Method with Indirect Vision for Heritage Buildings Fire Detection
无悔做市	Nicolò Cesa-Bianchi	PDF	N/A	Market Making without Regret
学习从广义纳什均衡中推导出的双智能体运动规划策略，用于模型预测控制	Hansung Kim	PDF	N/A	Learning Two-agent Motion Planning Strategies from Generalized Nash Equilibrium for Model Predictive Control
无语义破坏的安全性：通过保留上下文的双重潜在重构实现无需编辑的安全图像生成	Jordan Vice	PDF	N/A	Safety Without Semantic Disruptions: Editing-free Safe Image Generation via Context-preserving Dual Latent Reconstruction
关于文本到图像生成模型的公平性、多样性和可靠性	Jordan Vice	PDF	N/A	On the Fairness, Diversity and Reliability of Text-to-Image Generative Models
FedRAV：用于自动驾驶车辆交通目标分类的分层联邦区域学习	Yijun Zhai	PDF	N/A	FedRAV: Hierarchically Federated Region-Learning for Traffic Object Classification of Autonomous Vehicles
使用生成模型将静态图像转换为视频显著目标检测	Suhwan Cho	PDF	N/A	Transforming Static Images Using Generative Models for Video Salient Object Detection
配备可移动天线的无人机在反向散射传感器网络中进行数据收集：一种基于深度强化学习的方法	Yu Bai	PDF	N/A	Movable Antenna-Equipped UAV for Data Collection in Backscatter Sensor Networks: A Deep Reinforcement Learning-based Approach
通过联合频域先验引导扩散实现零样本低光图像增强	Jinhong He	PDF	N/A	Zero-Shot Low-Light Image Enhancement via Joint Frequency Domain Priors Guided Diffusion
经济文本情感分析：基于词典的方法	Luca Barbaglia	PDF	N/A	Sentiment Analysis of Economic Text: A Lexicon-Based Approach
通过机器学习指导的模拟进行材料合成：立场论文	Usman Syed	PDF	N/A	Material synthesis through simulations guided by machine learning: a position paper
用于评估离散多元时间序列在线异常检测方法的数据集	Lucas Correia	PDF	N/A	A Dataset for Evaluating Online Anomaly Detection Approaches for Discrete Multivariate Time Series
可分离的低秩适应混合模型用于持续视觉指令调优	Ziqi Wang	PDF	N/A	Separable Mixture of Low-Rank Adaptation for Continual Visual Instruction Tuning
神经形态姿态估计与控制	Stein Stroobants	PDF	N/A	Neuromorphic Attitude Estimation and Control
大型语言模型作为持续学习者：改进软件问题中缺陷代码的再现	Yalan Lin	PDF	N/A	LLMs as Continuous Learners: Improving the Reproduction of Defective Code in Software Issues
学习与人类合作使用生成代理	Yancheng Liang	PDF	N/A	Learning to Cooperate with Humans using Generative Agents
XAgents：一种基于可解释规则的多智能体合作框架	Hailong Yang	PDF	N/A	XAgents: A Framework for Interpretable Rule-Based Multi-Agents Cooperation
工程图转换：一种利用变压器进行P&ID数字化的创新方法	Jan Marius Stürmer	PDF	N/A	Transforming Engineering Diagrams: A Novel Approach for P&ID Digitization using Transformers
多模态3D复杂场景推理分割	Xueying Jiang	PDF	N/A	Multimodal 3D Reasoning Segmentation with Complex Scenes
数据流指数一致性非参数聚类	Bhupender Singh	PDF	N/A	Exponentially Consistent Nonparametric Clustering of Data Streams
NBMLSS：基于神经基模型进行位置、尺度和形状的概率电价预测	Alessandro Brusaferri	PDF	N/A	NBMLSS: probabilistic forecasting of electricity prices via Neural Basis Models for Location Scale and Shape
高压工业压缩机预测性维护研究：混合聚类模型	Alessandro Costa	PDF	N/A	Predictive Maintenance Study for High-Pressure Industrial Compressors: Hybrid Clustering Models
无泪量化	Minghao Fu	PDF	N/A	Quantization without Tears
ICODE：利用外部输入信息建模动态系统	Zhaoyi Li	PDF	N/A	ICODE: Modeling Dynamical Systems with Extrinsic Input Information
黑豹：通过指令引导的视觉提示照亮多模态大语言模型的视野	Honglin Li	PDF	N/A	Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts
异构边缘设备上的分割联邦学习：算法与优化	Yunrui Sun	PDF	N/A	Split Federated Learning Over Heterogeneous Edge Devices: Algorithm and Optimization
迈向全面委托：为旅行规划设计理想的代理行为	Song Jiang	PDF	N/A	Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning
AmpliNetECG12：一种基于轻量级SoftMax的相对论振幅放大架构，用于12导联心电图分类	Shreya Srivastava	PDF	N/A	AmpliNetECG12: A lightweight SoftMax-based relativistic amplitude amplification architecture for 12 lead ECG classification
PIORS：基于大型语言模型与多智能体医疗场景模拟的个性化智能门诊接待系统	Zhijie Bao	PDF	N/A	PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation
装扮想象力：一个用于将文本转化为时尚服装的AI驱动翻译数据集及一种新型KAN适配器，用于增强特征适应	Gayatri Deshmukh	PDF	N/A	Dressing the Imagination: A Dataset for AI-Powered Translation of Text into Fashion Outfits and A Novel KAN Adapter for Enhanced Feature Adaptation
Schemato -- 用于网表到原理图转换的LLM	Ryoga Matsuo	PDF	N/A	Schemato -- An LLM for Netlist-to-Schematic Conversion
GraCo -- 一种用于集成电路的图形化设计工具	Stefan Uhlich	PDF	N/A	GraCo -- A Graph Composer for Integrated Circuits
CLFace：一种可扩展且资源高效的持续学习框架，用于终身人脸识别	Md Mahedi Hasan	PDF	N/A	CLFace: A Scalable and Resource-Efficient Continual Learning Framework for Lifelong Face Recognition
当在线算法影响环境：对意外后果的动态系统分析	Prabhat Lankireddy	PDF	N/A	When Online Algorithms Influence the Environment: A Dynamical Systems Analysis of the Unintended Consequences
探索拓扑数据分析在股票指数走势预测中的应用	Dazhi Huang	PDF	N/A	Exploring applications of topological data analysis in stock index movement prediction
下一代钓鱼攻击：LLM代理如何赋能网络攻击者	Khalifa Afane	PDF	N/A	Next-Generation Phishing: How LLM Agents Empower Cyber Attackers
Sli2Vol+：基于目标估计引导的对应流网络的3D医学图像分割	Delin An	PDF	N/A	Sli2Vol+: Segmenting 3D Medical Images Based on an Object Estimation Guided Correspondence Flow Network
考虑构件连接性的机器学习在指定机械性能下对周期性点阵结构进行拓扑优化	Tomoya Matsuoka	PDF	N/A	Topology optimization of periodic lattice structures for specified mechanical properties using machine learning considering member connectivity
在人类编辑下对大型语言模型进行鲁棒水印检测	Xiang Li	PDF	N/A	Robust Detection of Watermarks for Large Language Models Under Human Edits
用于序列生成的生成模糊系统	Hailong Yang	PDF	N/A	Generative Fuzzy System for Sequence Generation
HARec：推荐系统中探索与利用的双曲图-LLM对齐	Qiyao Ma	PDF	N/A	HARec: Hyperbolic Graph-LLM Alignment for Exploration and Exploitation in Recommender Systems
使用新颖视图合成先验的图像压缩	Luyuan Peng	PDF	N/A	Image Compression Using Novel View Synthesis Priors
解耦稀疏先验引导的扩散压缩模型用于点云	Xiaoge Zhang	PDF	N/A	Decoupled Sparse Priors Guided Diffusion Compression Model for Point Clouds
一种多模态方法用于皮肤疾病的检测和分类	Allen Yang	PDF	N/A	A Multimodal Approach to The Detection and Classification of Skin Diseases
处理在线持续学习中的合成数据污染	Maorong Wang	PDF	N/A	Dealing with Synthetic Data Contamination in Online Continual Learning
物理信息神经网络的精确误差界限和近似误差界限	Augusto T. Chantada	PDF	N/A	Exact and approximate error bounds for physics-informed neural networks
多任务学习用于SAR船舶检测与高斯掩码联合分割	Ming Zhao	PDF	N/A	Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation
印度斯坦音乐人机交互探索性研究	Nithya Shikarpur	PDF	N/A	Exploratory Study Of Human-AI Interaction For Hindustani Music
从文本到图像模型中检测人类制品	Kaihong Wang	PDF	N/A	Detecting Human Artifacts from Text-to-Image Models
通过约束提示实现光场中的实时应用分割	Nikolai Goncharov	PDF	N/A	Segment Anything in Light Fields for Real-Time Applications via Constrained Prompting
CLIPer：通过分层改进CLIP的空间表示以实现开放词汇语义分割	Lin Sun	PDF	N/A	CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation
交互式与表现力增强的代码辅助规划与大型语言模型	Anthony Z. Liu	PDF	N/A	Interactive and Expressive Code-Augmented Planning with Large Language Models
异质图神经网络优化与因果消息传递	Botao Wang	PDF	N/A	Heterophilic Graph Neural Networks Optimization with Causal Message-passing
InstCache：一种用于LLM服务的预测性缓存	Longwei Zou	PDF	N/A	InstCache: A Predictive Cache for LLM Serving
FLRNet：一种用于从有限传感器测量中回归重建流场的深度学习方法	Phong C. H. Nguyen	PDF	N/A	FLRNet: A Deep Learning Method for Regressive Reconstruction of Flow Field From Limited Sensor Measurements
AutoMixQ：高性能内存高效微调的自适应量化	Changhai Zhou	PDF	N/A	AutoMixQ: Self-Adjusting Quantization for High Performance Memory-Efficient Fine-Tuning
MagicDriveDiT：为自动驾驶设计的高分辨率长视频生成，具备自适应控制功能	Ruiyuan Gao	PDF	N/A	MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control
SemiKong：策划、训练与评估半导体行业专用大型语言模型	Christopher Nguyen	PDF	N/A	SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model
使用机器行为分析解释GPT-4的抑郁模式	Adithya V Ganesan	PDF	N/A	Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis
拥抱雨人：自闭症谱系障碍儿童非典型面部表情分析的小说面部动作单元数据集	Yanfeng Ji	PDF	N/A	Hugging Rain Man: A Novel Facial Action Units Dataset for Analyzing Atypical Facial Expressions in Children with Autism Spectrum Disorder
GalaxyEdit：具有增强扩散适配器的大规模图像编辑数据集	Aniruddha Bala	PDF	N/A	GalaxyEdit: Large-Scale Image Editing Dataset with Enhanced Diffusion Adapter
边缘-云端路由用于文本到图像模型的基于令牌的多指标预测	Zewei Xin	PDF	N/A	Edge-Cloud Routing for Text-to-Image Model with Token-Level Multi-Metric Prediction
自适应嵌入网络（AEN）	Stan Loosmore	PDF	N/A	Adaptable Embeddings Network (AEN)
新闻采访：一个数据集和评估大型语言模型基础差距的实验平台	Michael Lu	PDF	N/A	NewsInterview: a Dataset and a Playground to Evaluate LLMs' Ground Gap via Informational Interviews
自动驾驶车辆中基于激光雷达的机器学习感知对抗鲁棒性研究综述	Junae Kim	PDF	N/A	A Survey on Adversarial Robustness of LiDAR-based Machine Learning Perception in Autonomous Vehicles
将GPT-4与人类翻译进行对比：跨语言、领域和专业水平的全面评估	Jianhao Yan	PDF	N/A	Benchmarking GPT-4 against Human Translators: A Comprehensive Evaluation Across Languages, Domains, and Expertise Levels
任意类别分割（SAC）：通过类别区域提议实现多类别少样本语义分割	Hussni Mohd Zakir	PDF	N/A	Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals
FastRAG：用于半结构化数据的检索增强生成	Amar Abane	PDF	N/A	FastRAG: Retrieval Augmented Generation for Semi-structured Data
一种基于评估驱动的LLM代理设计方法：过程与架构	Boming Xia	PDF	N/A	An Evaluation-Driven Approach to Designing LLM Agents: Process and Architecture
Tiny-Align：在边缘设备上连接自动语音识别与大型语言模型	Ruiyang Qin	PDF	N/A	Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
在任务不确定性下评估大型语言模型的框架	Luke Guerdan	PDF	N/A	A Framework for Evaluating LLMs Under Task Indeterminacy
AttentionBreaker：通过位翻转攻击揭示大语言模型漏洞的自适应进化优化	Sanjay Das	PDF	N/A	AttentionBreaker: Adaptive Evolutionary Optimization for Unmasking Vulnerabilities in LLMs through Bit-Flip Attacks