{
  "version": 1,
  "date": "2026-06-13",
  "count": 20,
  "sources": {
    "arxiv": {
      "source_label": "arXiv",
      "count": 10,
      "page_path": "daily-papers/2026-06-13-arxiv.md"
    },
    "biorxiv": {
      "source_label": "bioRxiv",
      "count": 2,
      "page_path": "daily-papers/2026-06-13-biorxiv.md"
    },
    "medrxiv": {
      "source_label": "medRxiv",
      "count": 8,
      "page_path": "daily-papers/2026-06-13-medrxiv.md"
    }
  },
  "page_path": "daily-papers/2026-06-13.md",
  "papers": [
    {
      "id": "b2758cdf1f01f69c",
      "date": "2026-06-13",
      "source": "arxiv",
      "source_label": "arXiv",
      "title": "EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments",
      "translated_title": "EvoArena：在动态环境中追踪记忆演化以实现鲁棒的大语言模型智能体",
      "author": "Jundong Xu",
      "summary": "Large language model (LLM) agents have achieved strong performance on a wide range of benchmarks, yet most evaluations assume static environments. In contrast, real-world deployment is inherently dynamic, requiring agents to continually align their knowledge, skills, and behavior with changing environments and updated task conditions. To address this gap, we introduce EvoArena, a benchmark suite that models environment changes as sequences of progressive updates across terminal, software, and social domains. We further propose EvoMem, a patch-based memory paradigm that records memory evolution as structured update histories, enabling agents to reason about environmental evolution through changes in their memory. Experiments show that current agents struggle on EvoArena, achieving an average accuracy of 39.6% across evolving terminal, software, and social-preference domains. EvoMem consistently improves performance, yielding an average gain of 1.5% on EvoArena and also improving standard benchmarks such as GAIA and LoCoMo by 6.1% and 4.8%. Beyond individual tasks, EvoMem further improves chain-level accuracy by 3.7% on EvoArena, where success requires completing a consecutive sequence of related evolutionary subtasks. Mechanistic analysis shows that EvoMem improves evidence capture in the memory, indicating better preservation of complete evolving environment states. Our results highlight the importance of modeling evolution in both evaluation and memory for reliable agent deployment.",
      "translated_summary": "大语言模型（LLM）智能体在多项基准测试中展现出强劲性能，但多数评估假设环境是静态的。相比之下，实际部署环境本质上是动态的，要求智能体持续调整其知识、技能和行为以适应不断变化的环境与更新的任务条件。为弥补这一差距，我们提出EvoArena基准套件，将环境变化建模为终端、软件和社交领域中的渐进式更新序列。我们进一步提出EvoMem，一种基于补丁的记忆范式，将记忆演化记录为结构化的更新历史，使智能体能够通过记忆变化推理环境演化。实验表明，当前智能体在EvoArena上表现不佳，在演化的终端、软件和社交偏好领域平均准确率仅为39.6%。EvoMem持续提升性能，在EvoArena上平均提升1.5%，同时将GAIA和LoCoMo等标准基准测试分别提升6.1%和4.8%。除单个任务外，EvoMem在EvoArena上将链级准确率提升3.7%，该场景要求成功完成一系列连续相关的演化子任务。机制分析显示，EvoMem改善了记忆中的证据捕获，表明能更完整地保存演化环境状态。我们的结果凸显了在评估和记忆中对演化进行建模对于智能体可靠部署的重要性。",
      "pdf_url": "https://arxiv.org/pdf/2606.13681v1",
      "paper_url": "http://arxiv.org/abs/2606.13681v1",
      "rank": 1,
      "published_date": "2026-06-11"
    },
    {
      "id": "5912ae2e43f738ed",
      "date": "2026-06-13",
      "source": "arxiv",
      "source_label": "arXiv",
      "title": "Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning",
      "translated_title": "通过检索增强的强化微调学习类比推理",
      "author": "Zilin Xiao",
      "summary": "Retrieval-augmented generation (RAG) has become a standard mechanism for grounding language models in external knowledge, yet conventional retrieval based on lexical or semantic similarity is poorly suited for complex reasoning tasks: a semantically similar problem may demand an entirely different solution strategy, while a superficially different problem may share the same underlying reasoning pattern. We propose Retrieval-Augmented Reinforcement Fine-Tuning (RA-RFT), a post-training framework that teaches language models to reason by analogy. RA-RFT uses gold-relevance distillation to train a retriever that ranks contexts by expected reasoning benefit rather than semantic overlap, and then fine-tunes the policy model via reinforcement fine-tuning methods with retrieved analogous demonstrations, so the model learns to leverage reasoning traces under verifiable outcome rewards. We further analyze the diversity of retrieved contexts and find that reasoning-aware retrieval surfaces complementary solution strategies that provide distinct reasoning scaffolds for individual problems. Across challenging mathematical reasoning benchmarks, RA-RFT consistently outperforms standard reinforcement fine-tuning methods. For example, it improves AIME 2025 average@32 accuracy by 7.1 and 2.8 points over GRPO for Qwen3-1.7B and Qwen3-4B respectively -- suggesting that reasoning-aware retrieval is a complementary axis of improvement and orthogonal to advances in reward design or training curricula.",
      "translated_summary": "检索增强生成（RAG）已成为将语言模型锚定于外部知识的标准机制，然而基于词汇或语义相似性的传统检索方法难以胜任复杂推理任务：语义相似的问题可能需要完全不同的解决策略，而表面不同的问题却可能共享相同的底层推理模式。我们提出检索增强强化微调（RA-RFT），这是一种通过类比推理训练语言模型的后训练框架。RA-RFT利用黄金相关性蒸馏训练检索器，使其根据预期推理收益而非语义重叠对上下文进行排序，随后通过强化微调方法结合检索到的类比演示对策略模型进行微调，从而使模型学会在可验证结果奖励下利用推理轨迹。我们进一步分析了检索上下文的多样性，发现推理感知检索能够呈现互补的解决策略，为不同问题提供独特的推理支架。在多个具有挑战性的数学推理基准测试中，RA-RFT始终优于标准强化微调方法。例如，在AIME 2025平均@32准确率上，针对Qwen3-1.7B和Qwen3-4B模型，RA-RFT较GRPO分别提升7.1和2.8个百分点——这表明推理感知检索是独立于奖励设计或训练课程改进的互补优化维度。",
      "pdf_url": "https://arxiv.org/pdf/2606.13680v1",
      "paper_url": "http://arxiv.org/abs/2606.13680v1",
      "rank": 2,
      "published_date": "2026-06-11"
    },
    {
      "id": "c796118f69e03208",
      "date": "2026-06-13",
      "source": "arxiv",
      "source_label": "arXiv",
      "title": "InterleaveThinker: Reinforcing Agentic Interleaved Generation",
      "translated_title": "InterleaveThinker：强化智能体交错生成",
      "author": "Dian Zheng",
      "summary": "Recent image generators have demonstrated impressive photorealism and instruction-following capabilities in single-image generation and editing. However, constrained by their architectures, they cannot achieve interleaved generation (text-image sequence), which has crucial applications in visual narratives, guidance, and embodied manipulation. Even the latest open-source Unified Multimodal Models (UMMs) exhibit limited performance in this regard. In this paper, we introduce InterleaveThinker, the first multi-agent pipeline designed to endow any existing image generator with interleaved generation capabilities. Specifically, we employ a planner agent to organize the image-text input sequence, instructing the image generator on the required execution at each step. Subsequently, we introduce a critic agent to evaluate the generator's outputs, identify samples that deviate from the planned instructions, and refine the instructions for regeneration. To implement this pipeline, we construct the Interleave-Planner-SFT-80k and Interleave-Critic-SFT-112k to perform a format cold-start. Then we develop Interleave-Critic-RL-13k to reinforce the step-wise instruction correction capability within a generation trajectory using GRPO. Since a single interleaved generation trajectory may involve over 25 generator calls, optimizing the entire trajectory is computationally impractical. Therefore, we propose accuracy reward and step-wise reward, allowing single-step RL to effectively guide the entire generation trajectory. The results show that InterleaveThinker improves performance across various image generators. On interleaved generation benchmarks, it achieves performance comparable to Nano Banana and GPT-5. Surprisingly, it also significantly enhances the base model on reasoning-based benchmarks; for example, on 4-step FLUX.2-klein, we observe substantial gains on WISE and RISE.",
      "translated_summary": "近期图像生成器在单图生成与编辑中展现出惊人的照片真实感与指令遵循能力。然而受限于架构设计，它们无法实现交错生成（文本-图像序列），而这一能力在视觉叙事、引导与具身操作中具有关键应用。即便是最新的开源统一多模态模型（UMMs）在此方面表现也相当有限。本文提出InterleaveThinker——首个旨在赋予任意现有图像生成器交错生成能力的多智能体流水线。具体而言，我们采用规划智能体组织图像-文本输入序列，指导图像生成器执行每一步所需操作；随后引入批评智能体评估生成器输出，识别偏离规划指令的样本并优化指令以重新生成。为实现该流水线，我们构建了Interleave-Planner-SFT-80k与Interleave-Critic-SFT-112k数据集以完成格式冷启动，进而开发Interleave-Critic-RL-13k数据集，利用GRPO强化生成轨迹中逐步指令修正能力。由于单次交错生成轨迹可能涉及超过25次生成器调用，优化完整轨迹在计算上不可行，因此我们提出准确率奖励与逐步奖励，使单步强化学习能有效引导整个生成轨迹。实验表明，InterleaveThinker能提升多种图像生成器的性能。在交错生成基准测试中，其表现可媲美Nano Banana与GPT-5。令人惊讶的是，它在基于推理的基准测试中也显著增强了基础模型——例如在4步FLUX.2-klein上，我们在WISE与RISE指标中观察到大幅提升。",
      "pdf_url": "https://arxiv.org/pdf/2606.13679v1",
      "paper_url": "http://arxiv.org/abs/2606.13679v1",
      "rank": 3,
      "published_date": "2026-06-11"
    },
    {
      "id": "200e949c69e4fada",
      "date": "2026-06-13",
      "source": "arxiv",
      "source_label": "arXiv",
      "title": "Mana: Dexterous Manipulation of Articulated Tools",
      "translated_title": "Mana：铰接工具的灵巧操控",
      "author": "Zhao-Heng Yin",
      "summary": "Articulated tool manipulation remains a major challenge in dexterous robotics due to the need to coordinate internal degrees of freedom and contact-rich interactions. While prior work has largely focused on rigid objects, articulated tool use remains underexplored because of its physical complexity and the difficulty of learning functional grasping and manipulation policies. We present Mana (Manipulation Animator), a general sim-to-real framework that reinterprets dexterous manipulation as an animation problem. Inspired by computer animation, Mana employs a coarse-to-fine pipeline that transforms procedurally-generated grasp keyframes into manipulation trajectories through motion planning and reinforcement learning. The data generation process is largely automatic, requiring only a few mouse clicks to specify functional affordances (<1 minute per tool). Across four articulated tools spanning different scales and joint types, Mana achieves zero-shot sim-to-real transfer for both grasping and in-hand manipulation, demonstrating a scalable approach to dexterous articulated tool use.",
      "translated_summary": "铰接工具的操作仍是灵巧机器人领域的一项重大挑战，这源于需要协调内部自由度与接触丰富的交互过程。尽管先前研究主要聚焦于刚性物体，但铰接工具的使用因其物理复杂性以及功能性抓取与操作策略的学习难度，至今仍未被充分探索。我们提出Mana（操作动画师），这是一个通用的仿真到现实框架，将灵巧操作重新诠释为动画问题。受计算机动画启发，Mana采用从粗到细的流水线，通过运动规划与强化学习，将程序化生成的抓取关键帧转化为操作轨迹。数据生成过程基本实现自动化，仅需几次鼠标点击即可指定功能可供性（每件工具耗时<1分钟）。在涵盖不同尺度与关节类型的四种铰接工具上，Mana实现了抓取与手内操作的零样本仿真到现实迁移，为灵巧铰接工具的使用提供了可扩展的解决方案。",
      "pdf_url": "https://arxiv.org/pdf/2606.13677v1",
      "paper_url": "http://arxiv.org/abs/2606.13677v1",
      "rank": 4,
      "published_date": "2026-06-11"
    },
    {
      "id": "3d97d888c25dc68c",
      "date": "2026-06-13",
      "source": "arxiv",
      "source_label": "arXiv",
      "title": "Modality Forcing for Scalable Spatial Generation",
      "translated_title": "面向可扩展空间生成的模态强制",
      "author": "Bardienus Pieter Duisterhof",
      "summary": "Text-to-image (T2I) models contain rich spatial priors. Synthesizing photorealistic, cluttered scenes requires an understanding of geometry, including perspective and relative scale. Prior works adapt T2I models to leverage this prior for depth prediction, but they require dense depth data and involve complex recipes. We propose Modality Forcing, a simple, scalable post-training recipe for joint image-depth generation using a single DiT trained on sparse depth data. Modality Forcing enables conditional and joint generation of image and depth in any permutation by assigning separate noise levels per modality. Per-modality decoders let us train on sparse, real-world depth and achieve strong, generalizable depth prediction. We further show that Modality Forcing inherits the scalability of T2I pre-training: by training a set of T2I models from scratch (370M to 3.3B parameters), we find that larger models trained on more image data produce more accurate depth. Our strongest model is competitive with state-of-the-art monocular depth estimators and reduces AbsRel by 57% relative to existing joint image-depth generative models. These results provide strong evidence that image generation is a scalable pre-training objective for spatial perception. https://modality-forcing.github.io/",
      "translated_summary": "文本到图像（T2I）模型包含丰富的空间先验知识。合成逼真且杂乱的场景需要理解几何结构，包括透视和相对尺度。先前的研究通过调整T2I模型利用这一先验进行深度预测，但需要密集的深度数据并涉及复杂的流程。我们提出模态强制（Modality Forcing）方法，这是一种简单且可扩展的后训练方案，利用在稀疏深度数据上训练的单一DiT模型实现图像与深度的联合生成。模态强制通过为每个模态分配独立的噪声水平，支持图像和深度以任意排列进行条件生成与联合生成。基于各模态的解码器使我们能够使用稀疏的真实世界深度数据进行训练，并实现强泛化能力的深度预测。我们进一步证明模态强制继承了T2I预训练的可扩展性：通过从头训练一组T2I模型（参数规模从3.7亿到33亿），我们发现使用更多图像数据训练的更大模型能产生更精确的深度。我们的最强模型与最先进的单目深度估计器性能相当，相较于现有的图像-深度联合生成模型，绝对相对误差（AbsRel）降低了57%。这些结果有力证明图像生成是空间感知任务中可扩展的预训练目标。https://modality-forcing.github.io/",
      "pdf_url": "https://arxiv.org/pdf/2606.13676v1",
      "paper_url": "http://arxiv.org/abs/2606.13676v1",
      "rank": 5,
      "published_date": "2026-06-11"
    },
    {
      "id": "4895e0c7290581a3",
      "date": "2026-06-13",
      "source": "arxiv",
      "source_label": "arXiv",
      "title": "RepWAM: World Action Modeling with Representation Visual-Action Tokenizers",
      "translated_title": "RepWAM：基于表征视觉-动作分词器的世界动作建模",
      "author": "Junke Wang",
      "summary": "This work presents RepWAM, a representation-centric world action model (WAM) built on representation visual-action tokenizers. Existing WAMs typically inherit reconstruction-oriented video tokenizers from pretrained video generation models. Although these tokenizers preserve visual fidelity, pixel reconstruction alone provides limited guidance for learning instruction-following dynamics that connect future prediction with robot control. To address this, we explore a semantic visual-action latent space for representation-centric world action modeling. Specifically, we train a representation visual-action tokenizer that maps visual inputs into aligned visual and latent action tokens. We then pretrain our WAM to jointly model future visual states and the latent actions that connect them under language instructions, followed by adaptation to real robot trajectories for closed-loop manipulation. Experiments on real-world manipulation tasks and simulation benchmarks show that RepWAM delivers strong performance across diverse manipulation settings, while ablations highlight the value of semantic visual-action tokenization over reconstruction-oriented alternatives. These results establish representation visual-action tokenization as a promising foundation for world action models and a step toward generalist robot policies. Code and weights will be available at https://github.com/wdrink/RepWAM.",
      "translated_summary": "本文提出RepWAM，一种基于表征视觉-动作分词器的表征中心世界动作模型。现有世界动作模型通常继承自预训练视频生成模型的重建导向分词器。尽管这些分词器能保持视觉保真度，但仅靠像素重建难以有效学习连接未来预测与机器人控制的指令跟随动态。为此，我们探索了用于表征中心世界动作建模的语义视觉-动作潜在空间。具体而言，我们训练了一个表征视觉-动作分词器，将视觉输入映射为对齐的视觉与潜在动作标记。随后预训练世界动作模型，使其在语言指令下联合建模未来视觉状态及其连接的潜在动作，并通过适配真实机器人轨迹实现闭环操作。在真实世界操作任务与仿真基准上的实验表明，RepWAM在多种操作场景中均展现出强劲性能，消融实验进一步凸显了语义视觉-动作分词相较于重建导向方案的优越性。这些结果确立了表征视觉-动作分词作为世界动作模型的有力基础，并朝着通用机器人策略迈进一步。代码与权重将发布于https://github.com/wdrink/RepWAM。",
      "pdf_url": "https://arxiv.org/pdf/2606.13674v1",
      "paper_url": "http://arxiv.org/abs/2606.13674v1",
      "rank": 6,
      "published_date": "2026-06-11"
    },
    {
      "id": "f75439508e1a1d6d",
      "date": "2026-06-13",
      "source": "arxiv",
      "source_label": "arXiv",
      "title": "SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning",
      "translated_title": "SpatialClaw：重新思考智能体空间推理的动作接口",
      "author": "Seokju Cho",
      "summary": "Spatial reasoning, the ability to determine where objects are, how they relate, and how they move in 3D, remains a fundamental challenge for vision-language models (VLMs). Tool-augmented agents attempt to address this by augmenting VLMs with specialist perception modules, yet their effectiveness is bounded by the action interface through which those tools are invoked. In this work, we study how the design of this interface shapes the agent's capacity for open-ended spatial reasoning. Existing spatial agents either employ single-pass code execution, which commits to a full analysis strategy before any intermediate result is observed, or rely on a structured tool-call interface that often offers less flexibility for freely composing operations or tailoring the analysis to each task. Both designs offer limited flexibility for open-ended, complex 3D/4D spatial reasoning. We therefore propose SpatialClaw, a training-free framework for spatial reasoning that adopts code as the action interface. SpatialClaw maintains a stateful Python kernel pre-loaded with input frames and a suite of perception and geometry primitives, letting a VLM-backed agent write one executable cell per step conditioned on all prior outputs, enabling the agent to flexibly compose and manipulate perception results and adapt its analysis to both intermediate text and visual observations and the demands of each problem. Evaluated across 20 spatial reasoning benchmarks spanning a broad range of static and dynamic 3D/4D spatial reasoning tasks, SpatialClaw achieves 59.9% average accuracy, outperforming the recent spatial agent by +11.2 points, with consistent gains across six VLM backbones from two model families without any benchmark- or model-specific adaptation.",
      "translated_summary": "空间推理——即确定物体位置、相互关系及其在三维空间中运动的能力——仍是视觉语言模型（VLM）面临的根本性挑战。工具增强型智能体试图通过为VLM配备专业感知模块来解决这一问题，但其有效性受限于调用这些工具的动作接口。本研究探讨了该接口的设计如何塑造智能体进行开放式空间推理的能力。现有空间智能体要么采用单次代码执行策略（在观察到任何中间结果前就确定完整分析方案），要么依赖结构化工具调用接口（这种接口通常缺乏自由组合操作或针对任务定制分析的灵活性）。这两种设计对开放式复杂三维/四维空间推理的灵活性均有限。为此我们提出SpatialClaw——一种无需训练的空间推理框架，采用代码作为动作接口。SpatialClaw维护着预加载输入帧及感知与几何原语集合的有状态Python内核，使基于VLM的智能体能够根据所有先前输出逐步执行可执行代码单元，从而灵活组合与操控感知结果，并根据中间文本/视觉观察及具体问题需求动态调整分析策略。在涵盖静态与动态三维/四维空间推理任务的20个基准测试中，SpatialClaw实现了59.9%的平均准确率，较最新空间智能体提升11.2个百分点，并在两个模型家族的六种VLM骨干网络上保持稳定优势，无需针对特定基准或模型进行适配。",
      "pdf_url": "https://arxiv.org/pdf/2606.13673v1",
      "paper_url": "http://arxiv.org/abs/2606.13673v1",
      "rank": 7,
      "published_date": "2026-06-11"
    },
    {
      "id": "e20ce0a0e613103f",
      "date": "2026-06-13",
      "source": "arxiv",
      "source_label": "arXiv",
      "title": "$\\texttt{WEAVER}$, Better, Faster, Longer: An Effective World Model for Robotic Manipulation",
      "translated_title": "$\\texttt{WEAVER}$：更好、更快、更长——一种有效的机器人操作世界模型",
      "author": "Arnav Kumar Jain",
      "summary": "The potential impacts of world models (WMs, i.e., learned simulators) on robotics are far-reaching -- policy evaluation, policy improvement, and test-time planning -- all with limited real-world interaction. To unlock these downstream capabilities, a WM needs to jointly satisfy three desiderata: $\\textit{(i)}$ fidelity (i.e., producing simulated trajectories that correlate with reality), $\\textit{(ii)}$ consistency (i.e., producing simulated trajectories that are coherent over long horizons), and $\\textit{(iii)}$ efficiency (i.e., producing simulated trajectories quickly). We propose $\\texttt{WEAVER}$ (World Estimation Across Views for Embodied Reasoning): a WM architecture that simultaneously achieves all three desiderata, providing state-of-the-art results on robotic manipulation tasks. $\\texttt{WEAVER}$ is a multi-view WM trained to predict future latents and reward values via a flow-matching loss. We distill the key design decisions across model architecture, memory, and prediction objectives required to unlock the kinds of long-horizon dynamic manipulation tasks that have confounded prior world modeling approaches. We apply $\\texttt{WEAVER}$ in robotic hardware, demonstrating its effectiveness at policy evaluation ($ρ$=0.870 correlation with real-world success rate), policy improvement (real-world success rate improvement of $38\\%$ on top of the $π_{0.5}$ robot foundation model), and test-time planning (real-world success rate improvement of $14\\%$ with a $5-10\\times$ speedup over prior WMs). $\\texttt{WEAVER}$ also demonstrates better performance than prior WMs when evaluated on out-of-distribution scenarios. Code, models, and videos at: https://arnavkj1995.github.io/WEAVER/ .",
      "translated_summary": "世界模型（WMs，即学习型模拟器）对机器人技术的潜在影响深远——包括策略评估、策略改进和测试时规划——所有这些都只需有限的真实世界交互。为解锁这些下游能力，世界模型需同时满足三个必要条件：$\\textit{(i)}$ 保真度（即生成与真实世界相关的模拟轨迹）、$\\textit{(ii)}$ 一致性（即生成在长时间范围内保持连贯的模拟轨迹）以及$\\textit{(iii)}$ 效率（即快速生成模拟轨迹）。我们提出$\\texttt{WEAVER}$（面向具身推理的多视角世界估计）：一种同时实现上述三个必要条件的世界模型架构，在机器人操作任务中达到最先进水平。$\\texttt{WEAVER}$是一种多视角世界模型，通过流匹配损失训练以预测未来潜在状态和奖励值。我们提炼了模型架构、记忆和预测目标中的关键设计决策，这些决策对于解锁此前世界建模方法难以处理的长时间动态操作任务至关重要。我们在机器人硬件上应用$\\texttt{WEAVER}$，展示了其在策略评估（与真实世界成功率的相关系数$ρ$=0.870）、策略改进（在$π_{0.5}$机器人基础模型基础上实现真实世界成功率提升$38\\%$）和测试时规划（相较于先前世界模型实现真实世界成功率提升$14\\%$，同时速度提升$5-10$倍）方面的有效性。$\\texttt{WEAVER}$在分布外场景评估中也展现出优于先前世界模型的性能。代码、模型和视频详见：https://arnavkj1995.github.io/WEAVER/。",
      "pdf_url": "https://arxiv.org/pdf/2606.13672v1",
      "paper_url": "http://arxiv.org/abs/2606.13672v1",
      "rank": 8,
      "published_date": "2026-06-11"
    },
    {
      "id": "331e62c2997c9450",
      "date": "2026-06-13",
      "source": "arxiv",
      "source_label": "arXiv",
      "title": "Understanding Truncated Positional Encodings for Graph Neural Networks",
      "translated_title": "理解图神经网络的截断位置编码",
      "author": "James Flora",
      "summary": "Positional encodings (PEs) enhance the power of graph neural networks (GNNs), both theoretically and empirically. Two of the most popular families of PEs - spectral (e.g., Laplacian eigenspaces, effective resistance) and walk-based (polynomials of the adjacency matrix) - are theoretically equivalent in expressive power, with expressivity between the 1-WL and 3-WL tests. However, this equivalence assumes the GNN uses the \"complete\" version of these PEs, which requires $O(n^3)$ time and space complexity. Instead, practitioners commonly use truncated variants of these encodings, such as the first $k$ eigenspaces or powers of the adjacency matrix. However, the theoretical properties of these truncated PEs are unknown. In this work, we initiate the study of these truncated PEs. Theoretically, we show that, under truncation, several families of PEs are fundamentally different in expressive power. As a corollary, we show that truncated spectral PEs are no longer stronger than the 1-WL test. We also study a family of spectral PEs, the $k$-harmonic distances, to highlight the differences in expressive power of even closely related truncated PEs. Finally, we experimentally show that a mix of truncated PEs is preferable to any single family on real-world datasets.",
      "translated_summary": "位置编码（PEs）在理论和实证上均能增强图神经网络（GNNs）的能力。两类最主流的PEs——谱类（如拉普拉斯特征空间、有效电阻）与游走类（邻接矩阵的多项式）——在表达能力上理论等价，其表达力介于1-WL与3-WL测试之间。然而，这种等价性假设GNN使用这些PEs的\"完整\"版本，这需要$O(n^3)$的时间和空间复杂度。实践中，研究者通常使用这些编码的截断变体，例如前$k$个特征空间或邻接矩阵的幂次。但截断PEs的理论性质尚不明确。本研究首次系统探究截断PEs。理论上，我们证明在截断条件下，多类PEs的表达力存在根本性差异。作为推论，我们表明截断谱PEs不再强于1-WL测试。我们还研究了一类谱PEs——$k$调和距离，以揭示即使密切相关的截断PEs在表达力上的差异。最后，实验表明在真实数据集上，混合使用截断PEs优于任何单一类型。",
      "pdf_url": "https://arxiv.org/pdf/2606.13671v1",
      "paper_url": "http://arxiv.org/abs/2606.13671v1",
      "rank": 9,
      "published_date": "2026-06-11"
    },
    {
      "id": "46683d588a8950c1",
      "date": "2026-06-13",
      "source": "arxiv",
      "source_label": "arXiv",
      "title": "Automated reproducibility assessments in the social and behavioral sciences using large language models",
      "translated_title": "使用大型语言模型进行社会与行为科学的自动化可重复性评估",
      "author": "Tobias Holtdirk",
      "summary": "Reproducibility in the social and behavioral sciences is typically evaluated by independent researchers who reanalyze the original data to assess whether the published findings can be recovered. However, such approaches are resource-intensive and difficult to scale. Here, we show that large language models (LLMs) can automate reproducibility assessments. Using N=76 published studies with predefined claims from the behavioral and social sciences, we compare LLM-generated analysis with the original findings and human reanalysis. For 7 studies, the LLM could not produce a viable effect size estimate. For the remaining studies, our LLM pipeline recovered the original effect sizes in 41% of studies using a +/-0.05 tolerance in Cohen's d. Further, our LLM pipeline reached the same qualitative conclusion as the original study in 96% of cases, where conclusions indicate whether the reanalysis supports the original claim. For comparison, human reanalysts recovered the original effect sizes in 34% of studies and reached the same qualitative conclusion in 74% of cases. Together, these results show that LLMs can serve as a scalable tool for automated reproducibility assessment and provide a foundation for systematic auditing of empirical results in the social and behavioral sciences.",
      "translated_summary": "在社会科学与行为科学中，可重复性通常由独立研究人员通过重新分析原始数据来评估，以判断已发表的研究结果能否复现。然而，这类方法资源消耗大且难以规模化。本研究表明，大型语言模型（LLMs）能够自动化可重复性评估。我们以行为与社会科学领域N=76项包含预设结论的已发表研究为对象，将LLM生成的分析结果与原始发现及人类重新分析结果进行对比。其中7项研究中，LLM无法生成有效的效应量估计值。在其余研究中，采用Cohen's d ±0.05容差标准时，我们的LLM流程在41%的研究中成功复现了原始效应量。此外，在96%的案例中，LLM流程得出了与原始研究一致的定性结论（即重新分析是否支持原始主张）。作为对比，人类重新分析师在34%的研究中复现了原始效应量，并在74%的案例中得出相同定性结论。综合来看，这些结果表明LLM可作为自动化可重复性评估的可扩展工具，为社会科学与行为科学实证结果的系统性审计奠定基础。",
      "pdf_url": "https://arxiv.org/pdf/2606.13670v1",
      "paper_url": "http://arxiv.org/abs/2606.13670v1",
      "rank": 10,
      "published_date": "2026-06-11"
    },
    {
      "id": "5cae677446717a59",
      "date": "2026-06-13",
      "source": "biorxiv",
      "source_label": "bioRxiv",
      "title": "Variant annotation across homologous proteins (Paralogue Annotation) identifies disease-causing missense variants with high precision, and is widely applicable across protein families",
      "translated_title": "跨同源蛋白的变异注释（旁系同源注释）能够以高精度识别致病性错义变异，并广泛适用于各类蛋白质家族。",
      "author": "Li, N.",
      "summary": "Background: Distinguishing pathogenic variants from those that are rare but benign remains a key challenge in clinical genetics, especially for variants not previously observed and characterised in humans. In vitro and in vivo functional characterisation are typically resource intensive, and model systems may not accurately predict influence on human disease. Many in silico tools have been developed to predict which variants are disease-causing, but typically lack precision. Here we demonstrate the applicability of a framework, called Paralogue Annotation, that draws on information from previously-characterised variants in homologous proteins to predict whether variants in a gene of interest are likely disease causing. Methods: We assessed the performance of Paralogue Annotation through three orthogonal approaches: (1) comparison to established in silico variant prediction tools using 47,360 missense variants from ClinVar across 3,524 genes representing a broad range of diverse protein classes, by calculating precision and sensitivity; (2) evaluation against large-scale functional assays of variant effect in TP53 and PPARG; and (3) comparing odd ratios calculated from case-control association tests for inherited cardiac arrhythmia syndromes, and neurodevelopmental disorders with epilepsy, stratifying variants by Paralogue Annotation. Results: Paralogue Annotation correctly annotates 4,328 ClinVar pathogenic variants, with 245 false positives, yielding a precision of 0.95. This increases to 0.99 with more stringent annotation parameters (requiring greater conservation of amino acids between annotated orthologues) at the expense of sensitivity. Compared to established tools, Paralogue Annotation has higher precision for identification of pathogenic variants, albeit with lower sensitivity across diverse test sets. Extending the technique by transferring annotations between homologous protein domains, rather than full-length protein paralogues, increases sensitivity. Rare variants predicted pathogenic by Paralogue Annotation were more strongly disease-associated (increased odds ratio) than unstratified rare variants for six out of eight genes tested with case-control cohort approaches. Conclusions: Paralogue Annotation has high precision for detection of pathogenic missense variants, outperforming in silico methods where data are available to make a prediction. As the number of characterised variants increases in reference datasets such as ClinVar, Paralogue Annotation will further increase in sensitivity and applicability.",
      "translated_summary": "背景：在临床遗传学中，区分致病性变异与罕见但良性的变异仍是一项关键挑战，尤其是针对此前未在人类中观察和表征过的变异。体外和体内功能表征通常耗费大量资源，且模型系统可能无法准确预测其对人类疾病的影响。许多计算机工具已被开发用于预测哪些变异会导致疾病，但通常缺乏精确性。在此，我们展示了一种名为“旁系同源注释”的框架的适用性，该框架利用同源蛋白质中已表征变异的信息，预测目标基因中的变异是否可能致病。方法：我们通过三种正交方法评估了旁系同源注释的性能：（1）使用来自ClinVar的47,360个错义变异（涵盖3,524个基因，代表多种蛋白质类别），通过计算精确度和灵敏度，与已建立的计算机变异预测工具进行比较；（2）针对TP53和PPARG的大规模功能检测评估变异效应；（3）比较遗传性心脏心律失常综合征和神经发育障碍伴癫痫的病例对照关联检验中计算的比值比，按旁系同源注释对变异进行分层。结果：旁系同源注释正确注释了4,328个ClinVar致病性变异，假阳性为245个，精确度为0.95。采用更严格的注释参数（要求注释的直系同源物之间氨基酸保守性更高）时，精确度提升至0.99，但灵敏度有所降低。与已建立的工具相比，旁系同源注释在识别致病性变异方面具有更高的精确度，尽管在不同测试集中的灵敏度较低。通过在同源蛋白质结构域（而非全长蛋白质旁系同源物）之间转移注释来扩展该技术，可提高灵敏度。在病例对照队列方法测试的八个基因中，有六个基因中由旁系同源注释预测为致病性的罕见变异与疾病的关联性更强（比值比增加），优于未分层的罕见变异。结论：旁系同源注释在检测致病性错义变异方面具有高精确度，在可获得预测数据的情况下优于计算机方法。随着ClinVar等参考数据集中表征变异数量的增加，旁系同源注释的灵敏度和适用性将进一步提升。",
      "pdf_url": "https://www.biorxiv.org/content/10.1101/2023.08.07.552236v1.full.pdf",
      "paper_url": "https://doi.org/10.1101/2023.08.07.552236",
      "rank": 1,
      "published_date": "2026-06-13"
    },
    {
      "id": "45c9a34f472ae6e1",
      "date": "2026-06-13",
      "source": "biorxiv",
      "source_label": "bioRxiv",
      "title": "Lipid Flippase Mediated Membrane Asymmetry Governs Extracellular Vesicles Biogenesis and Host Interactions in Cryptococcus neoformans",
      "translated_title": "脂质翻转酶介导的膜不对称性调控新生隐球菌胞外囊泡生物发生及宿主相互作用",
      "author": "Pawar, S.",
      "summary": "Cryptococcus neoformans is the leading cause of fungal meningitis in immunocompromised patients. Alveolar macrophages are the first line of defense against Cryptococcus infection. Our previous study showed that deletion of Cdc50, the regulatory subunit of P4-ATPase (lipid flippase) complex, results in increased phagocytosis and macrophage killing, and avirulence in animal models. However, how fungal flippase dysfunction modulates Cryptococcus-macrophage interaction remains unknown. Here we identify Cdc50 as a central determinant of membrane lipid homeostasis, extracellular vesicle (EV) biogenesis and macrophage responses in C. neoformans. Our whole cell lipidomic analysis revealed that loss of Cdc50 disrupted membrane lipid homeostasis leading to phospholipid enrichment in cdc50{Delta} mutant, and a reduction in fatty acid production accompanied by pronounced ultrastructural defects in membrane architecture. Loss of Cdc50 also induced a hyper-vesiculating phenotype, with cdc50{Delta}producing significantly more extracellular vesicles (EVs) than wild type H99 cells. Lipidomic profiling of cdc50{Delta} EVs revealed enrichment of phospholipids, including phosphatidylserine (PS), indicating active lipid sorting during vesicle biogenesis. Functional analysis showed that EVs from the wildtype H99 suppress phagocytosis whereas cdc50{Delta} EVs enhance phagocytosis, indicating a differential macrophage priming. Despite increased PS externalization in cdc50{Delta} cells and EVs, macrophage recognition and uptake occur independent of PS-mediated efferocytosis pathways, including PS receptor MertK. Following macrophage uptake, cdc50{Delta} were intrinsically vulnerable to macrophage killing due to rapid phagosome acidification. Together, we demonstrate that Cdc50 dependent lipid homeostasis regulates EV production, lipid composition, membrane architecture and drives the intracellular fate of C. neoformans.",
      "translated_summary": "新型隐球菌是免疫功能低下患者真菌性脑膜炎的首要病因。肺泡巨噬细胞是抵御隐球菌感染的第一道防线。我们既往研究表明，P4-ATP酶（脂质翻转酶）复合体调节亚基Cdc50的缺失会导致吞噬作用增强、巨噬细胞杀伤力提升，并在动物模型中呈现无毒力表型。然而，真菌翻转酶功能失调如何调控隐球菌与巨噬细胞的相互作用仍不清楚。本研究揭示Cdc50是新型隐球菌膜脂质稳态、胞外囊泡生物发生及巨噬细胞反应的核心决定因子。全细胞脂质组学分析显示，Cdc50缺失破坏了膜脂质稳态，导致cdc50Δ突变体磷脂富集，脂肪酸产量降低，并伴随显著的膜结构超微结构缺陷。Cdc50缺失还诱导了超囊泡化表型，cdc50Δ产生的胞外囊泡显著多于野生型H99细胞。cdc50Δ胞外囊泡的脂质组学分析显示磷脂（包括磷脂酰丝氨酸）富集，表明囊泡生物发生过程中存在活跃的脂质分选。功能分析表明，野生型H99的胞外囊泡抑制吞噬作用，而cdc50Δ的胞外囊泡则增强吞噬作用，提示存在差异性的巨噬细胞预激活。尽管cdc50Δ细胞和胞外囊泡的磷脂酰丝氨酸外化增加，但巨噬细胞的识别和摄取不依赖于磷脂酰丝氨酸介导的胞葬途径（包括磷脂酰丝氨酸受体MertK）。被巨噬细胞摄取后，cdc50Δ因吞噬体快速酸化而固有地易受巨噬细胞杀伤。综上，我们证明Cdc50依赖性脂质稳态调控胞外囊泡生成、脂质组成、膜结构，并决定新型隐球菌的胞内命运。",
      "pdf_url": "https://www.biorxiv.org/content/10.64898/2026.06.12.731820v1.full.pdf",
      "paper_url": "https://doi.org/10.64898/2026.06.12.731820",
      "rank": 2,
      "published_date": "2026-06-13"
    },
    {
      "id": "d4fc9209a3325d8c",
      "date": "2026-06-13",
      "source": "medrxiv",
      "source_label": "medRxiv",
      "title": "Projected population level impact and cost-effectiveness of clinic and community-based tuberculosis screening approaches",
      "translated_title": "基于诊所和社区的结核病筛查方法的预期人群层面影响及成本效益分析",
      "author": "McCreesh, N.",
      "summary": "The South Africa National Department of Health have set ambitious targets to scale up TB testing, focusing primarily on clinic attendees. In the context of declining funding for TB care and prevention, the most cost-effective approaches for targeting testing should be identified. We developed a mathematical model of TB in South Africa, explicitly incorporating clinic attendance by sex and HIV/ART status. We simulated six screening approaches over 2026-2035 (individually and in combination): three clinic-based (symptom screening, intensified targeted universal TB testing [TUTT, symptom-agnostic sputum testing of clinic attendees in key risk groups], and intensified TUTT allowing saliva samples) and three targeted community-based (community radiographic screening, symptom screening, and universal Xpert Ultra testing), each implemented at a range of coverage levels. Model outputs were combined with a mechanistic cost function to estimate potential impact and cost-effectiveness from a societal perspective. The most cost-effective standalone approach was community radiographic screening at 10% annual population coverage, with an incremental cost-effectiveness ratio (ICER) of $421 per disability-adjusted life year (DALY) averted. 10/11 scenarios along the expansion path included community radiographic screening at progressively higher coverage, combined with a clinic-based approach. Combining complementary approaches to reach both groups at increased risk of TB (e.g. clinic-based screening) and groups with lower screening coverage (e.g. community-based screening) may increase cost-effectiveness of TB screening, compared to standalone approaches. When designing TB screening strategies, both population risk and existing screening coverage should be considered.",
      "translated_summary": "南非国家卫生部设定了扩大结核病检测的宏伟目标，主要聚焦于就诊人群。在结核病防治资金持续缩减的背景下，需确定最具成本效益的靶向检测策略。我们构建了南非结核病数学模型，明确纳入按性别及HIV/抗逆转录病毒治疗状态分层的就诊行为参数。模拟了2026-2035年间六种筛查方案（单独及组合实施）：三种基于诊所的方案（症状筛查、强化靶向全民结核病检测[TUTT，对关键风险群体就诊者进行无症状痰检]、允许唾液样本的强化TUTT）与三种社区靶向方案（社区影像学筛查、症状筛查、通用Xpert Ultra检测），每种方案均在不同覆盖率水平下实施。模型输出与机制性成本函数结合，从社会视角评估潜在影响与成本效益。最具成本效益的独立方案为年度人口覆盖率10%的社区影像学筛查，其增量成本效益比（ICER）为每避免一个伤残调整生命年（DALY）421美元。扩展路径上的10/11种情景均包含逐步提高覆盖率的社区影像学筛查，并联合诊所方案。相较于独立方案，联合互补策略（如覆盖结核病高风险群体的诊所筛查与低筛查覆盖群体的社区筛查）可提升结核病筛查的成本效益。制定结核病筛查策略时，需同时考虑人群风险水平与现有筛查覆盖率。",
      "pdf_url": "https://www.medrxiv.org/content/10.64898/2026.06.10.26355416v1.full.pdf",
      "paper_url": "https://doi.org/10.64898/2026.06.10.26355416",
      "rank": 1,
      "published_date": "2026-06-13"
    },
    {
      "id": "37fbdff091d8bd59",
      "date": "2026-06-13",
      "source": "medrxiv",
      "source_label": "medRxiv",
      "title": "Prevalence and Correlates of Symptoms of Cannabinoid Hyperemesis Syndrome in the United States",
      "translated_title": "美国大麻素剧吐综合征症状的患病率及相关因素分析",
      "author": "Ilgen, M. A.",
      "summary": "Objectives. To determine the national prevalence of Cannabinoid Hyperemesis Syndrome (CHS) symptoms and associated characteristics. Methods. Using data from a nationally representative survey of 7,034 US adults (conducted May-September 2025), we fit separate survey-weighted multinomial logistic regression models with a four-category CHS symptom and cannabis use group variable as the outcome and demographic characteristics, cannabis use behaviors, and cannabis-related problems as predictors. Results. Overall, 2.7% of all respondents and 17.8% of respondents who used cannabis daily reported CHS-like symptoms. Respondents who were younger, female, identified as non-White, had lower income, had lower educational attainment, and endorsed cannabis-related use problems were more likely to be in the CHS symptom group than in the daily cannabis use group. Conclusions. CHS symptoms are not uncommon in the US and those with fewer economic resources and more cannabis-related use problems were more likely to report these symptoms, even compared to others with daily cannabis use. Policy implications. As cannabis use increases, CHS is also likely to become more common, underscoring the importance of expanded education about the identification and treatment of CHS.",
      "translated_summary": "目的。确定大麻素剧吐综合征（CHS）症状的全国患病率及相关特征。方法。利用一项针对7034名美国成年人的全国代表性调查数据（2025年5月至9月实施），我们分别拟合了调查加权多项逻辑回归模型，以四分类CHS症状与大麻使用组变量作为结果变量，以人口学特征、大麻使用行为及大麻相关问题作为预测变量。结果。总体受访者中2.7%报告了CHS样症状，每日使用大麻的受访者中这一比例为17.8%。与每日使用大麻组相比，年龄较轻、女性、非白人、收入较低、教育程度较低以及存在大麻相关使用问题的受访者更可能属于CHS症状组。结论。CHS症状在美国并不罕见，经济资源较少且大麻相关问题较多的个体更可能报告这些症状，即使与每日使用大麻者相比也是如此。政策启示。随着大麻使用增加，CHS也可能变得更加普遍，这凸显了加强CHS识别与治疗教育的重要性。",
      "pdf_url": "https://www.medrxiv.org/content/10.64898/2026.01.25.26344780v1.full.pdf",
      "paper_url": "https://doi.org/10.64898/2026.01.25.26344780",
      "rank": 2,
      "published_date": "2026-06-13"
    },
    {
      "id": "c36e06664b524ca0",
      "date": "2026-06-13",
      "source": "medrxiv",
      "source_label": "medRxiv",
      "title": "Functionally Focused Evaluation: A Novel Comparative Protocol for Wearable Electroencephalography Headsets",
      "translated_title": "功能性聚焦评估：一种用于可穿戴脑电图头套的新型比较方案",
      "author": "Bhuyan, A.",
      "summary": "With the emergence of electroencephalography (EEG) as a tool in the cognitive domain, new demands are being placed on the technology to keep up with functional applications - especially in the context of at-home neural monitoring. New use cases have fostered development of wearable EEG (wEEG) devices: portable, low-cost headsets used for EEG monitoring. This evolution of technology and application has not been accompanied by development in technology evaluation, often relying on function-agnostic markers to assess devices for efficacy in this new space. With current methods limited in scope, this study designed, tested and evaluated a novel functionally-focused comparative protocol for wEEG devices. Eight participants undertook a protocol for the evaluation of four established wEEG devices, assessing cognitive resolution and general usability. Compared to a well-established traditional analysis method (eyes open/eyes closed protocol), the novel design proposed here enabled the same analysis of headset resolution, while also providing additional context into user preferences and opening downstream possibilities for specific cognitive insights. Future research could enable the development of this protocol into a standardised method to ensure the performance of wEEG technology can satisfy emerging clinical needs.",
      "translated_summary": "随着脑电图（EEG）作为认知领域工具的出现，该技术面临满足功能性应用的新需求——尤其是在家庭神经监测场景中。新应用场景推动了可穿戴脑电图（wEEG）设备的发展：用于脑电监测的便携式低成本头戴设备。技术与应用的演进并未伴随技术评估方法的发展，当前仍常依赖功能无关的指标来评估设备在新领域中的效能。鉴于现有方法存在局限性，本研究设计、测试并评估了一种针对wEEG设备的新型功能导向比较方案。八名受试者通过该方案对四款成熟wEEG设备进行认知分辨率与通用可用性评估。与传统成熟分析方法（睁眼/闭眼方案）相比，本研究提出的新设计在实现同等头戴设备分辨率分析的同时，还能提供用户偏好的额外背景信息，并为特定认知洞察开辟下游可能性。未来研究可推动该方案发展为标准化方法，以确保wEEG技术性能满足新兴临床需求。",
      "pdf_url": "https://www.medrxiv.org/content/10.64898/2026.06.03.26354802v1.full.pdf",
      "paper_url": "https://doi.org/10.64898/2026.06.03.26354802",
      "rank": 3,
      "published_date": "2026-06-13"
    },
    {
      "id": "8242e81c00322f76",
      "date": "2026-06-13",
      "source": "medrxiv",
      "source_label": "medRxiv",
      "title": "Elevated suicidal thoughts and behaviors, proximal suicide risk factors, and non-suicidal self-injury among adults reporting symptoms of Cannabinoid Hyperemesis Syndrome: Results from a national survey of US adults",
      "translated_title": "报告大麻素剧吐综合征症状的成年人中自杀意念和行为、近端自杀风险因素及非自杀性自伤的增加：一项美国全国性成人调查的结果",
      "author": "Hicks, B. M. M.",
      "summary": "Background: Cannabinoid hyperemesis syndrome (CHS), a disorder characterized by severe nausea, vomiting, and abdominal pain among heavy cannabis users. We previously showed that CHS symptoms are associated with several behavioral and psychological characteristics linked to psychosocial impairment. We examined links between CHS symptoms and suicidal thoughts, behaviors, and proximal suicide risk factors. Methods: We used data from the National Firearms, Alcohol, Cannabis, and Suicide survey, a nationally representative survey of 7034 US adults. Items assessed symptoms of CHS and suicidal thoughts and behaviors, proximal suicide risk factors, and non-suicidal self-injury. Comparisons focused on: those with daily cannabis use and CHS symptoms (n = 191), those with daily cannabis use without CHS symptoms (n = 882), those with past year cannabis use but not daily use (n = 1288), and those without past year cannabis use (n = 4673). Results: Those with CHS symptoms reported the highest lifetime and past 12 months prevalence of suicidal thoughts and behaviors with nearly all being significantly higher than those with daily cannabis use without CHS symptoms. Those with CHS symptoms also reported higher mean-levels of proximal risk factors for suicide (i.e., perceived burdensomeness, thwarted belongingness, defeat, entrapment) than all the other groups. Conclusions: Those with CHS symptoms reported especially high rates of suicidal thoughts, behaviors, and attempts even when compared to others with daily cannabis use. People with CHS symptoms appear to be at high risk of suicide, possibly related to distress from their gastrointestinal symptoms and psychiatric, substance use, and medical comorbidities.",
      "translated_summary": "背景：大麻素剧吐综合征（CHS）是一种以重度恶心、呕吐和腹痛为特征的疾病，常见于重度大麻使用者。我们此前的研究表明，CHS症状与多种与心理社会功能损害相关的行为和心理特征存在关联。本研究探讨了CHS症状与自杀意念、自杀行为及近端自杀风险因素之间的关系。方法：我们使用了全国枪支、酒精、大麻与自杀调查数据，该调查涵盖7034名美国成年人，具有全国代表性。评估项目包括CHS症状、自杀意念与行为、近端自杀风险因素以及非自杀性自伤行为。比较对象分为四组：每日使用大麻且伴有CHS症状者（n=191）、每日使用大麻但无CHS症状者（n=882）、过去一年使用大麻但非每日使用者（n=1288）、以及过去一年未使用大麻者（n=4673）。结果：伴有CHS症状的群体在终生及过去12个月内自杀意念与行为的报告率最高，几乎所有指标均显著高于每日使用大麻但无CHS症状的群体。该群体在近端自杀风险因素（如感知累赘感、归属受挫、挫败感、陷入困境）的平均水平也高于其他所有组别。结论：即使与每日使用大麻的其他群体相比，伴有CHS症状者报告的自杀意念、行为及尝试率仍显著偏高。CHS症状人群可能面临较高的自杀风险，这可能与胃肠道症状带来的痛苦、以及精神疾病、物质使用障碍和躯体共病有关。",
      "pdf_url": "https://www.medrxiv.org/content/10.64898/2026.02.26.26347185v1.full.pdf",
      "paper_url": "https://doi.org/10.64898/2026.02.26.26347185",
      "rank": 4,
      "published_date": "2026-06-13"
    },
    {
      "id": "f7f862b4847e00f7",
      "date": "2026-06-13",
      "source": "medrxiv",
      "source_label": "medRxiv",
      "title": "Links between Cannabinoid Hyperemesis Syndrome symptoms and drug use, mental health problems, antisocial behavior, and personality in a national survey of adults in the United States",
      "translated_title": "美国全国成人调查中大麻素剧吐综合征症状与药物使用、心理健康问题、反社会行为及人格之间的关联",
      "author": "Hicks, B. M.",
      "summary": "Background: Cannabinoid hyperemesis syndrome (CHS) is characterized by episodes of severe nausea, vomiting, and abdominal pain among those with heavy cannabis use. We estimated differences between those reporting CHS symptoms and those who use cannabis daily and less frequently on drug use, psychiatric problems, other health problems, antisocial behavior, and personality. Methods: The National Firearms, Alcohol, Cannabis, and Suicide survey was administered to 7034 US adults (3427 male, 3607 female) in 2025. Survey items assessed substance use, common psychiatric symptoms, personality traits, and symptoms of CHS. Results: Those with CHS symptoms reported the highest rates and greatest variety of drug use compared to others who used cannabis. Those with CHS symptoms reported higher rates of other drug use than those who used cannabis daily without CHS symptoms across a variety of drug classes, including opioids, hallucinogens, and sedatives, higher rates of drug overdoses, and greater use of all drug classes than those with less-than-daily cannabis use. Those with CHS symptoms also reported more depression, anxiety, sleep problems, chronic pain, antisocial behavior, intimate partner violence, and disinhibited personality traits than those who used daily (mean d = 0.58) and less frequently (mean d = 0.69) and those with no cannabis use in the past 12 months (mean d = 0.99). Conclusions: Those with CHS symptoms exhibit a variety of psychological and behavioral problems including higher rates of other drug use, psychiatric symptoms, antisocial behavior, and dysfunctional personality traits. Results highlight the importance of understanding and addressing the broader psychosocial challenges faced by people experiencing CHS symptoms.",
      "translated_summary": "背景：大麻素剧吐综合征（CHS）的特征是重度大麻使用者出现严重恶心、呕吐和腹痛的发作性症状。本研究评估了报告CHS症状者与每日及非每日使用大麻者在药物使用、精神问题、其他健康问题、反社会行为及人格方面的差异。方法：2025年对7034名美国成年人（男性3427人，女性3607人）进行了全国枪支、酒精、大麻与自杀调查。调查项目评估了物质使用、常见精神症状、人格特质及CHS症状。结果：与其他大麻使用者相比，报告CHS症状者的药物使用率最高且种类最多。与每日使用大麻但无CHS症状者相比，报告CHS症状者在多种药物类别（包括阿片类、致幻剂和镇静剂）中的其他药物使用率更高，药物过量发生率更高，且所有药物类别的使用量均高于非每日使用大麻者。报告CHS症状者还表现出比每日使用者（平均效应量d=0.58）、非频繁使用者（平均d=0.69）及过去12个月未使用大麻者（平均d=0.99）更严重的抑郁、焦虑、睡眠问题、慢性疼痛、反社会行为、亲密伴侣暴力及去抑制性人格特质。结论：报告CHS症状者表现出多种心理和行为问题，包括更高的其他药物使用率、精神症状、反社会行为及功能失调的人格特质。研究结果凸显了理解并应对CHS症状患者所面临的更广泛社会心理挑战的重要性。",
      "pdf_url": "https://www.medrxiv.org/content/10.64898/2026.02.26.26347188v1.full.pdf",
      "paper_url": "https://doi.org/10.64898/2026.02.26.26347188",
      "rank": 5,
      "published_date": "2026-06-13"
    },
    {
      "id": "8f0a88e6ff9a6a83",
      "date": "2026-06-13",
      "source": "medrxiv",
      "source_label": "medRxiv",
      "title": "Mortality, morbidity, and post-operative complications of typhoid intestinal perforations: global systematic review and meta-analysis",
      "translated_title": "伤寒肠穿孔的死亡率、发病率及术后并发症：全球系统综述与荟萃分析",
      "author": "Hagedoorn, N. N.",
      "summary": "Background TIP is a serious and life-threatening complication of typhoid fever that requires emergency surgery and an important driver of typhoid burden. We aimed to review global studies reporting on mortality, morbidity, and post-operative complications in patients with typhoid intestinal perforation (TIP). Methods We searched multiple databases for articles reporting case-fatality ratio (CFR) or complications in patients with TIP undergoing surgery published from 1980 through 11 January 2025. We described the prevalence of each reported complication. Of patients with TIP, we pooled CFR using random-effects meta-analysis and stratified by United Nations region, sex, and number of perforations per patient. Results We included 48 articles reporting on 4,309 patients with TIP. The most prevalent post-operative complications were wound or surgical site infection in 1,553 (50.1%) of 3,100 patients, wound dehiscence in 308 (16.1%) of 1,909, and respiratory infection in 136 (15.6%) of 872. Overall, the pooled CFR (95%CI) of patients with TIP was 16.3% (13.4-19.7%), and was 20.3% (17.0-24.2%) in 32 observations from the African region, 8.5% (5.7-12.6%) in 15 observations from the Asian region. Overall, no significant correlation was observed between median year of data collection and CFR (estimate [95%CI]: 0.00 [-0.02, 0.02]), nor when analyses were stratified by region, including Africa (estimate [95%CI]: -0.01 [-0.03, 0.02]), and Asia (estimate [95%CI]: -0.04 [-0.07,0.00]). Conclusions Disability and death associated with TIP remains substantial. Over time, mortality from TIP did not decline. Efforts to improve access to and quality of surgical services for those with TIP are warranted.",
      "translated_summary": "背景 伤寒肠穿孔（TIP）是伤寒的一种严重且危及生命的并发症，需紧急手术干预，也是导致伤寒疾病负担的重要因素。本研究旨在系统回顾全球关于TIP患者死亡率、并发症及术后结局的研究。方法 我们检索多个数据库，纳入1980年至2025年1月11日期间发表的、报告接受手术治疗的TIP患者病死率或并发症的文献。描述各并发症的发生率，并通过随机效应荟萃分析汇总TIP患者的总体病死率，按联合国区域、性别及穿孔数量进行分层分析。结果 共纳入48篇文献，涉及4,309例TIP患者。最常见的术后并发症为伤口或手术部位感染（1,553/3,100例，50.1%）、伤口裂开（308/1,909例，16.1%）及呼吸道感染（136/872例，15.6%）。总体汇总病死率为16.3%（95%CI：13.4-19.7%），其中非洲地区32项观察数据为20.3%（95%CI：17.0-24.2%），亚洲地区15项观察数据为8.5%（95%CI：5.7-12.6%）。数据收集年份中位数与病死率无显著相关性（总体估计值[95%CI]：0.00[-0.02,0.02]），按区域分层后亦无显著关联（非洲：-0.01[-0.03,0.02]；亚洲：-0.04[-0.07,0.00]）。结论 TIP导致的残疾和死亡负担仍居高不下，且其死亡率随时间推移未见下降。亟需改善TIP患者手术服务的可及性与质量。",
      "pdf_url": "https://www.medrxiv.org/content/10.1101/2024.06.28.24309663v1.full.pdf",
      "paper_url": "https://doi.org/10.1101/2024.06.28.24309663",
      "rank": 6,
      "published_date": "2026-06-13"
    },
    {
      "id": "fb5fe411c2ab427c",
      "date": "2026-06-13",
      "source": "medrxiv",
      "source_label": "medRxiv",
      "title": "Neurovascular instability, impaired cortical recruitment, and network dysconnectivity across the transdiagnostic anxiety spectrum: a functional multi-channel near-infrared spectroscopy study",
      "translated_title": "跨诊断焦虑谱系中的神经血管不稳定性、皮层募集受损及网络连接异常：一项功能性多通道近红外光谱研究",
      "author": "Luo, Y.",
      "summary": "Background: Anxiety-spectrum disorders (ANSD) are highly prevalent, yet the underlying neurovascular mechanisms remain unclear. Functional near-infrared spectroscopy (fNIRS) comprises a non-invasive method to assess cortical hemodynamics, neurovascular coupling, and network organization during cognitive processing. Methods: We investigated healthy controls (HC), generalized anxiety disorder (GAD), anxious depression (AD), and anxiety-depression comorbidity (CO) using multichannel fNIRS during a verbal fluency task. Multiple hemodynamic features were extracted, including peak response, temporal hemodynamic variability, {beta}activation, and HbO, HbR, and HbT signals. Functional connectivity, graph-theoretical network measures, machine-learning classification, and associations with depressive, anxiety and psychosomatic scores were examined. Results: Compared to controls, ANSD patients showed reduced task-evoked HbO and HbT responses, preserved HbR levels, increased temporal hemodynamic variability, and reduced {beta}activation. Activation deficits were most prominent in bilateral frontopolar and medial prefrontal cortices and followed a gradient, with the CO group exhibiting highest abnormalities. Functional connectivity was increased, whereas clustering coefficient, nodal local efficiency, and nodal efficiency were reduced, indicating maladaptive hyperconnectivity accompanied by inefficient network organization. The AD and CO groups showed the greatest network disintegration. Temporal hemodynamic variability emerged as the strongest predictor of anxiety, depressive, and physiosomatic symptom severity. Reduced prefrontal activation was significantly associated with higher symptom domain scores. Machine-learning analyses demonstrated adequate discrimination between HC and ANSD. Conclusions: ANSD are characterized by impaired neurovascular recruitment, increased hemodynamic instability, maladaptive hyperconnectivity, and disrupted cortical network topology. These abnormalities appear to represent transdiagnostic neurovascular processes underlying anxiety, depressive, and physiosomatic symptoms across the anxiety spectrum.",
      "translated_summary": "背景：焦虑谱系障碍（ANSD）患病率较高，但其潜在的神经血管机制尚不明确。功能性近红外光谱（fNIRS）是一种非侵入性方法，可用于评估认知加工过程中的皮层血流动力学、神经血管耦合及网络组织。方法：我们采用多通道fNIRS，在言语流畅性任务中研究了健康对照组（HC）、广泛性焦虑障碍（GAD）、焦虑性抑郁（AD）及焦虑抑郁共病组（CO）。提取了多种血流动力学特征，包括峰值反应、时间血流动力学变异性、β激活值以及HbO、HbR和HbT信号。分析了功能连接、图论网络指标、机器学习分类结果，并探讨了这些指标与抑郁、焦虑及心身症状评分的关联。结果：与对照组相比，ANSD患者表现出任务诱发的HbO和HbT反应减弱、HbR水平保持稳定、时间血流动力学变异性增加以及β激活值降低。激活缺陷在双侧额极和内侧前额叶皮层最为显著，且呈现梯度变化，其中CO组异常程度最高。功能连接增强，而聚类系数、节点局部效率和节点效率降低，提示存在适应不良的超连接伴随低效网络组织。AD组和CO组网络解体最为严重。时间血流动力学变异性成为焦虑、抑郁及心身症状严重程度的最强预测因子。前额叶激活减弱与更高症状领域评分显著相关。机器学习分析显示可有效区分HC与ANSD。结论：ANSD的特征包括神经血管募集受损、血流动力学不稳定性增加、适应不良的超连接以及皮层网络拓扑结构破坏。这些异常可能代表了焦虑谱系中焦虑、抑郁及心身症状背后的跨诊断神经血管过程。",
      "pdf_url": "https://www.medrxiv.org/content/10.64898/2026.06.11.26355427v1.full.pdf",
      "paper_url": "https://doi.org/10.64898/2026.06.11.26355427",
      "rank": 7,
      "published_date": "2026-06-13"
    },
    {
      "id": "e6b710a50769015b",
      "date": "2026-06-13",
      "source": "medrxiv",
      "source_label": "medRxiv",
      "title": "Genome-wide association study in Brazil identifies genetic susceptibility to tuberculosis with single-cell gene effects",
      "translated_title": "巴西全基因组关联研究识别出具有单细胞基因效应的结核病遗传易感性",
      "author": "Dill-McFarland, K. A.",
      "summary": "Background: Although genetic factors contribute to tuberculosis (TB) risk, no cross-population causal variants have been identified by genome-wide association studies (GWAS). Methods: Here, we utilized low-pass whole genome sequencing (lpWGS) with high-pass WGS verified imputation plus detailed epidemiologic risk factors and single-cell expression quantitative loci (sceQTL) to address prior GWAS limitations. Results: Using 947 pulmonary tuberculosis (PTB) cases and 1807 close contact controls in the Regional Prospective Observational Research in TB (RePORT) study in Brazil, we estimated PTB heritability to be 40 - 68%. We identified 17 SNPs associated with PTB (P<5E-8) after adjustment for major risk factors (HIV, diabetes, smoking). Seven of these SNPs were associated with peripheral blood cell-specific sceQTLs in controls. Specifically, SNPs cis to transcription factors ZNF717 and MAML3 were associated with PTB disease and gene expression in myeloid, T cells, or B cells. The nucleoporin-associated genes NUP93 and AGFG1 were also associated with sceQTLs in multiple cell types including dendritic, natural killer, or T cells. Conclusion: Overall, this study utilized lpWGS, in-depth epidemiology, and single-cell analyses to detect population-specific genetic risk factors for PTB in Brazil.",
      "translated_summary": "背景：尽管遗传因素影响结核病（TB）风险，但全基因组关联研究（GWAS）尚未发现跨人群的因果变异。方法：本研究采用低深度全基因组测序（lpWGS）结合高深度全基因组测序验证的基因型填充、详细流行病学风险因素及单细胞表达数量性状位点（sceQTL）分析，以克服既往GWAS的局限性。结果：基于巴西结核病区域前瞻性观察研究（RePORT）中947例肺结核（PTB）病例和1807例密切接触对照，估算PTB遗传力为40%-68%。在校正主要风险因素（HIV、糖尿病、吸烟）后，鉴定出17个与PTB显著关联的SNP（P<5E-8）。其中7个SNP与对照组外周血细胞特异性sceQTL相关。具体而言，转录因子ZNF717和MAML3基因顺式SNP与PTB疾病及髓系细胞、T细胞或B细胞基因表达相关。核孔蛋白相关基因NUP93和AGFG1也与树突状细胞、自然杀伤细胞或T细胞等多种细胞类型的sceQTL相关。结论：本研究综合运用lpWGS、深度流行病学分析和单细胞分析，在巴西人群中发现了群体特异性PTB遗传风险因素。",
      "pdf_url": "https://www.medrxiv.org/content/10.1101/2025.03.13.25323932v1.full.pdf",
      "paper_url": "https://doi.org/10.1101/2025.03.13.25323932",
      "rank": 8,
      "published_date": "2026-06-13"
    }
  ],
  "generated_at": "2026-06-14T00:08:36Z"
}