2026-06-02

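Studies the AI-style features that emerge in LLMs after alignment and proposes PASTA, a method that lowers AI-detection rates through activation ablation, showing that these style features can be localized and causally intervened on. Proposes ImmersiveTTS, a model built on a multimodal diffusion Transformer for environment-aware speech synthesis, which improves naturalness and fidelity through cross-modal interaction and domain-specific representation alignment. LongDS, a new benchmark, tests long-horizon multi-turn data analysis; the best model scores below 50% accuracy, showing that long-horizon state maintenance is the key bottleneck.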

Measuring, Localizing, and Ablating Alignment Signatures in LLMs 85

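Tags: Alignment, AI Safety, Interpretability, Large Models
Source: arXiv Computation and Language

[Abstract]
Studies the AI-style features that emerge in LLMs after alignment and proposes PASTA, a method that lowers AI-detection rates through activation ablation, showing that these style features can be localized and causally intervened on.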
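
For readers unfamiliar with activation ablation, the sketch below shows one generic form of it: removing the component of a hidden vector that lies along a single direction. It is not the PASTA procedure, which the summary does not describe in detail; `ablate_direction`, `style_dir`, and the toy vectors are hypothetical names and values chosen for illustration.

```python
import math

def unit(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("direction must be non-zero")
    return [x / norm for x in v]

def ablate_direction(activation, direction):
    """Project out `direction` from `activation`: h' = h - (h . d) d, with d unit-length."""
    d = unit(direction)
    coeff = sum(a * x for a, x in zip(activation, d))
    return [a - coeff * x for a, x in zip(activation, d)]

# Toy hidden state and a hypothetical "style" direction (assumed, not from the paper).
h = [3.0, 4.0, 1.0]
style_dir = [0.0, 2.0, 0.0]
h_ablated = ablate_direction(h, style_dir)
```

After the projection, `h_ablated` has no component along `style_dir`, while the components orthogonal to it are unchanged.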

ImmersiveTTS: Environment-Aware Text-to-Speech with Multimodal Diffusion Transformer and Domain-Specific Representation Alignment 85

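Tags: Speech Synthesis, Multimodal, Diffusion Models, TTS
Source: arXiv Computation and Language

[Abstract]
Proposes ImmersiveTTS, a model built on a multimodal diffusion Transformer for environment-aware speech synthesis, which improves naturalness and fidelity through cross-modal interaction and domain-specific representation alignment.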

LongDS-Bench: On the Failure of Long-Horizon Agentic Data Analysis 85

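Tags: Benchmarks, AI Agents, Data Analysis
Source: arXiv Computation and Language

[Abstract]
LongDS, a new benchmark, tests long-horizon multi-turn data analysis; the best model scores below 50% accuracy, showing that long-horizon state maintenance is the key bottleneck.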

Counterfactual Evaluation Reveals Hidden Capability Profiles in Clinical LLMs and Agents 85

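Tags: Medical AI, Model Evaluation, AI Safety
Source: arXiv Computation and Language

[Abstract]
Proposes the Causal Sensitivity Score (CSS), which uses counterfactual interventions to evaluate how clinical LLMs and agents respond, and uncovers safety blind spots and model-ranking reversals that coverage metrics (such as CMS) fail to reveal.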

Self-Reflective Generation at Test Time 85

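Tags: Large Models, Inference Optimization, Self-Reflection
Source: arXiv Computation and Language

[Abstract]
SRGen is a lightweight test-time framework that identifies high-uncertainty tokens with a dynamic entropy threshold and trains a corrective vector for self-reflective generation, significantly improving the mathematical reasoning of large language models.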
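
To make the idea of an entropy threshold concrete, here is a minimal sketch that flags decoding steps whose next-token entropy is unusually high relative to recent steps. The summary does not say how SRGen sets its threshold, so the `mean + k * std` rule over a sliding window, the function names, and the toy distributions are all assumptions for illustration.

```python
import math
from statistics import mean, pstdev

def token_entropy(probs):
    """Shannon entropy (in nats) of one next-token distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def flag_uncertain_tokens(distributions, window=4, k=1.0):
    """Return step indices whose entropy exceeds mean + k*std of the
    previous `window` entropies (an assumed rule, not SRGen's)."""
    entropies = [token_entropy(p) for p in distributions]
    flagged = []
    for i, h in enumerate(entropies):
        history = entropies[max(0, i - window):i]
        if len(history) < 2:
            continue  # too little history to set a threshold yet
        threshold = mean(history) + k * pstdev(history)
        if h > threshold:
            flagged.append(i)
    return flagged

# Toy example: three confident steps followed by a near-uniform one.
steps = [
    [0.97, 0.01, 0.01, 0.01],
    [0.94, 0.02, 0.02, 0.02],
    [0.97, 0.01, 0.01, 0.01],
    [0.25, 0.25, 0.25, 0.25],
]
uncertain = flag_uncertain_tokens(steps)
```

In this toy run only the last step is flagged, since its entropy is far above that of the preceding steps.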

SERA: Soft-Verified Efficient Repository Agents 85

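Tags: Coding Agents, Training Methods, Synthetic Data, Open-Source Models
Source: arXiv Computation and Language

[Abstract]
SERA generates synthetic trajectories through soft verification, sharply reducing the cost of training coding agents specialized to private codebases (26x cheaper than RL); its performance matches that of open-source models, and all code and data are released.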

EvoDefense: Co-Evolving Black-Box Defense with Large Language Models 85

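Tags: AI Safety, Large Models, Adversarial Defense, Evolutionary Optimization
Source: arXiv Computation and Language

[Abstract]
EvoDefense proposes an experience-guided, co-evolutionary black-box defense paradigm in which a guard LLM and an attack generator continuously evolve against each other; without retraining, it generalizes to a range of unseen attacks and models and significantly lowers attack success rates.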

Esoteric Language Models: A Family of Any-Order Diffusion LLMs 85

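Tags: Diffusion Models, Language Models, Inference Optimization, Model Architecture
Source: arXiv Computation and Language

[Abstract]
Eso-LMs fuse autoregressive and masked diffusion models (MDMs), using causal attention to bring KV caching to MDMs for the first time alongside parallel generation, and set a new bar for the speed-quality trade-off in unconditional generation.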

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards 85

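Tags: Large Models, Inference Optimization, Long Context, Reinforcement Learning
Source: arXiv Computation and Language

[Abstract]
Proposes LongTraceRL, which builds highly confusable distractors from search-agent trajectories and designs fine-grained, entity-based process rewards, effectively improving the long-context reasoning of large models and outperforming baselines on multiple benchmarks.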

ParisKV: Fast and Drift-Robust KV-Cache Retrieval for Long-Context LLMs 85

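Tags: Large Models, Inference Optimization, KV Cache, Long Context
Source: arXiv Computation and Language

[Abstract]
ParisKV is a drift-robust, GPU-native KV-cache retrieval framework for long-context LLMs. Using collision-based selection and quantized reranking, it cuts decoding latency at million-token lengths by 17x and 44x relative to MagicPIG and PQCache, respectively, and raises throughput to 2.8x.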

Learning Whom to Trust: Market-Feedback Adaptive Retrieval for Frozen LLMs in Event-Driven Financial RAG 85

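Tags: Financial RAG, Retrieval Optimization, Adaptive Learning, Large Models
Source: arXiv Computation and Language

[Abstract]
Proposes a market-feedback adaptive retrieval method based on Bayesian source memory; in event-driven financial RAG, it adjusts the retrieval sources rather than fine-tuning the LLM, significantly improving prediction F1 and portfolio Sharpe ratio.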

The Information Geometry of Softmax: Probing and Steering 85

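Tags: Representation Learning, Model Interpretability, AI Safety
Source: arXiv Computation and Language

[Abstract]
Analyzes softmax representations from an information-geometry perspective and proposes a "dual steering" method that robustly manipulates a model's concept representations while leaving unrelated concepts unchanged, improving controllability and stability.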

Skill is Not One-Size-Fits-All: Model-Aware Skill Alignment for LLM Agents 85

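Tags: Large Models, Agents, Skill Alignment
Source: arXiv Computation and Language

[Abstract]
Proposes MASA, a model-aware skill alignment framework that adapts skills to different LLM backbones through hierarchical evolution and a lightweight rewriter, with gains of up to 25.8 points on multiple interactive tasks.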

Beyond Static Dialogues: Benchmarking Realistic, Heterogeneous, and Evolving Long-Term Memory 85

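Tags: Large Models, Benchmarks, Memory Mechanisms
Source: arXiv Computation and Language

[Abstract]
Proposes the RHELM benchmark, which simulates realistic, heterogeneous, and dynamically evolving long-term memory dialogue scenarios to evaluate LLMs on multi-source aggregation and contextual reasoning, exposing key weaknesses of current models in complex environments.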

Learning to Reason with Insight for Informal Theorem Proving 85

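Tags: Large Models, Inference Optimization, Mathematical Reasoning
Source: arXiv Computation and Language

[Abstract]
Proposes the DeepInsight framework, which uses a hierarchical dataset, progressive multi-stage SFT, and the InsightPO strategy to strengthen the insight-driven reasoning of large language models in informal theorem proving, significantly outperforming baselines on math benchmarks.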

Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging 85

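Tags: LoRA, Model Merging, Multi-Task Learning, Fine-Tuning
Source: arXiv Computation and Language

[Abstract]
Studies interference when merging LoRA fine-tuned models and proposes an orthogonal-subspace method that improves multi-task merging without sacrificing single-task performance and is compatible with existing algorithms.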

Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers 85

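Tags: Mixture of Experts, Bayesian Inference, Uncertainty Quantification, Inference Optimization
Source: arXiv Artificial Intelligence

[Abstract]
Proposes Variational Mixture-of-Experts Routing (VMoER), which introduces Bayesian uncertainty estimation into MoE layers, reducing calibration error by 94% and improving robustness at very low additional compute cost.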
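
As background on what a calibration error measures, the sketch below computes the commonly used expected calibration error (ECE) with equal-width confidence bins. The summary does not state which calibration metric or binning the paper reports, so treat the metric choice, the function name, and the toy data as assumptions.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """ECE: bin-size-weighted mean of |average confidence - accuracy| per bin.

    `confidences` are predicted probabilities in [0, 1]; `correct` are booleans.
    """
    total = len(confidences)
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        # Equal-width bins; a confidence of exactly 1.0 goes in the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, ok))
    ece = 0.0
    for bucket in bins:
        if not bucket:
            continue
        avg_conf = sum(c for c, _ in bucket) / len(bucket)
        accuracy = sum(1 for _, ok in bucket if ok) / len(bucket)
        ece += (len(bucket) / total) * abs(avg_conf - accuracy)
    return ece

# Toy data: a model that says 90% on four predictions but is right on three.
ece = expected_calibration_error([0.9, 0.9, 0.9, 0.9], [True, True, True, False])
```

Here the stated confidence (0.90) exceeds the observed accuracy (0.75), so the ECE is about 0.15; a perfectly calibrated model would score 0.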

CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability 85

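Tags: Code Security, Agents, Model Fine-Tuning, Benchmarks
Source: arXiv Artificial Intelligence

[Abstract]
CVE-Factory is a multi-agent framework that automatically converts CVE metadata into executable security tasks and builds the LiveCVEBench benchmark; fine-tuning Qwen3-32B significantly improves its security capability, surpassing Claude 4.5 Sonnet.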

HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents 85

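Tags: Large Models, Reinforcement Learning, Hierarchical Planning, Multi-Turn Decision-Making
Source: arXiv Artificial Intelligence

[Abstract]
HiPER proposes a hierarchical plan-execute reinforcement learning framework that achieves explicit credit assignment through hierarchical advantage estimation, significantly improving multi-turn LLM agents on sparse-reward, long-horizon tasks.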

ASH: Agents that Self-Hone via Embodied Learning 85

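Tags: Reinforcement Learning, Agents, Imitation Learning, Long-Horizon Tasks
Source: arXiv Artificial Intelligence

[Abstract]
Proposes ASH, an agent system that learns from unlabeled internet videos through a self-improvement loop, without human rewards or annotations, and far outperforms baselines on long-horizon game tasks.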
