Introducing RePo: Language Models with Context Re-Positioning
Website: https://pub.sakana.ai/repo/
Paper: https://arxiv.org/abs/2512.14391
Standard language models process information as a rigid linear sequence where the only signal for structure is a fixed token index, forcing them to treat physical proximity as semantic relevance. Cognitive Load Theory suggests this is inefficient. Just as humans struggle when key facts are buried in noise, models waste finite capacity managing disorganized inputs instead of focusing on deep reasoning.
RePo breaks this bottleneck by allowing models to actively reorganize their context. Instead of using a fixed index, our module learns to assign positions based on content relevance. This lets the model dynamically pull relevant distant information closer and push noise away, effectively reshaping the attention geometry to match the problem structure.
This flexibility yields significant gains in robustness. RePo outperforms standard encodings on noisy contexts, structured data, and long-range dependencies while maintaining competitive general performance. It represents a step toward models that intelligently curate their own working memory rather than passively accepting input order.

Twitter

RePo简介：具有上下文重定位功能的语言模型

网址：https://pub.sakana.ai/repo/

论文：https://arxiv.org/abs/2512.14391

标准语言模型将资讯处理为僵化的线性序列，其中结构的唯一讯号是固定的词元索引，这迫使它们将物理上的接近性视为语义相关性。认知负荷理论表明，这种方法效率低。正如人类在关键资讯被杂讯淹没时难以理解一样，模型也会浪费有限的认知能力来处理杂乱无章的输入，而不是专注于深度推理。

RePo透过允许模型主动重组其上下文来打破这一瓶颈。我们的模组不使用固定的索引，而是学习基于内容相关性来分配位置。这使得模型能够动态地将相关的远距离资讯拉近，并将杂讯推开，从而有效地重塑注意力结构以匹配问题结构。

这种灵活性显著提高了模型的稳健性。 RePo 在处理噪音环境、结构化资料和长程依赖关系时优于标准编码，同时保持了具有竞争力的整体效能。它标志著模型朝著智慧管理自身工作记忆而非被动接受输入顺序的方向迈出了重要一步。

2026 年 2 月，稳定币月交易量达到 7.2 万亿镁，次超越了自动清算系统 (ACH) 网络的 6.8 万亿镁。
ACH是一种对外支付系统……

稳定币的资金量流动量超过了美国核心金融体系。

特朗普总统在2024年4月1日关于伊朗冲突的讲话中预测，未来两到三周将持续遭受猛烈的军事空袭，这将阻碍美国股市的链复苏……

本周，特朗普关于伊朗的讲话对三只美国股票造成了严重影响。

XRP在币安的30天流动性指数已下降历史新低，接近于零。交易量也从2025年1月的超过2000亿镁下挫几乎为零。
这...