Introducing RePo: Language Models with Context Re-Positioning
Website: https://pub.sakana.ai/repo/
Paper: https://arxiv.org/abs/2512.14391
Standard language models process information as a rigid linear sequence where the only signal for structure is a fixed token index, forcing them to treat physical proximity as semantic relevance. Cognitive Load Theory suggests this is inefficient. Just as humans struggle when key facts are buried in noise, models waste finite capacity managing disorganized inputs instead of focusing on deep reasoning.
RePo breaks this bottleneck by allowing models to actively reorganize their context. Instead of using a fixed index, our module learns to assign positions based on content relevance. This lets the model dynamically pull relevant distant information closer and push noise away, effectively reshaping the attention geometry to match the problem structure.
This flexibility yields significant gains in robustness. RePo outperforms standard encodings on noisy contexts, structured data, and long-range dependencies while maintaining competitive general performance. It represents a step toward models that intelligently curate their own working memory rather than passively accepting input order.

Twitter

RePo簡介：具有上下文重定位功能的語言模型

網址：https://pub.sakana.ai/repo/

論文：https://arxiv.org/abs/2512.14391

標準語言模型將資訊處理為僵化的線性序列，其中結構的唯一訊號是固定的詞元索引，這迫使它們將物理上的接近性視為語義相關性。認知負荷理論表明，這種方法效率低。正如人類在關鍵資訊被雜訊淹沒時難以理解一樣，模型也會浪費有限的認知能力來處理雜亂無章的輸入，而不是專注於深度推理。

RePo透過允許模型主動重組其上下文來打破這一瓶頸。我們的模組不使用固定的索引，而是學習基於內容相關性來分配位置。這使得模型能夠動態地將相關的遠距離資訊拉近，並將雜訊推開，從而有效地重塑注意力結構以匹配問題結構。

這種靈活性顯著提高了模型的穩健性。 RePo 在處理噪音環境、結構化資料和長程依賴關係時優於標準編碼，同時保持了具有競爭力的整體效能。它標誌著模型朝著智慧管理自身工作記憶而非被動接受輸入順序的方向邁出了重要一步。

XRP在幣安的30天流動性指數已下降歷史新低，接近於零。交易量也從2025年1月的超過2000億鎂下挫幾乎為零。
這...

XRP在幣安的流動性下挫——價格將如何反應？

2026 年 2 月，穩定幣月交易量達到 7.2 萬億鎂，次超越了自動清算系統 (ACH) 網絡的 6.8 萬億鎂。
ACH是一種對外支付系統……

穩定幣的資金量流動量超過了美國核心金融體系。

特朗普總統在2024年4月1日關於伊朗衝突的講話中預測，未來兩到三週將持續遭受猛烈的軍事空襲，這將阻礙美國股市的鏈復甦……