
<div><p>My initial research shows that it is possible to gain a further 30% storage reduction for contract code, on top of contract deduplication and compressing code at storage time.</p><p>Of necessity, the Solidity compiler and every other compiler will create bytecode with recurring patterns of opcodes. When compressing an individual contract, an ideal compression algorithm learns these patterns after they have occurred once in that contract, and can then refer to them in shortened form throughout the rest of the contract.</p><p>However, this means that the compression algorithm is always surprised by the first occurrence of a pattern in an individual contract: it has no idea that the pattern is common to many smart contracts, because it can only learn from the contract it is working on.</p><p>The solution to this is built into most compression libraries: you can provide a small pre-trained “dictionary” that encodes common patterns already seen in historical data. This allows immediate compression of common patterns the first time they appear in new data.</p><p>As a second benefit, a pre-trained dictionary also remembers large patterns from the overrepresented spam contracts. 
For those contracts that have been spammed tens of thousands of times with minor variations, a dictionary can reduce them to single-digit percentages of their original size.</p><p>Here are the results:</p><div><a href="https://ethresear.ch/uploads/default/original/3X/f/0/f0909407717981d471afd2779553664ebbb126c4.png" title="image" rel="nofollow"><img src="https://ethresear.ch/uploads/default/optimized/3X/f/0/f0909407717981d471afd2779553664ebbb126c4_2_690x299.png" alt="image"></a></div><p>Using the zelliac dataset of all contract bytecode deployed up to early 2025, there are <code>1,539,858</code> deduplicated deployed bytecode sets.</p><p>Using the Zstandard compression library at its default fast compression level of 3, and compressing each bytecode set individually, the total size goes from <code>100%</code> to <code>41.8%</code> of the original size. Adding a 100 KB dictionary, trained at the default settings for that compression level, brings the bytecode down to <code>29.3%</code> of the original size, a <code>30%</code> reduction relative to the compressed size (29.3 / 41.8 ≈ 0.70).</p><p>Increasing the dictionary size or the compression level reduces the final size further. I kept this optimized for speed. 
It’s also quite possible that further tuning of the dictionary training parameters could result in even smaller sizes.</p><p>Total storage space required across different sized contract bytecodes:</p><div><a href="https://ethresear.ch/uploads/default/original/3X/3/d/3d1c0ed89b680c139d145f6daf4e48a053327a99.png" title="image" rel="nofollow"><img src="https://ethresear.ch/uploads/default/optimized/3X/3/d/3d1c0ed89b680c139d145f6daf4e48a053327a99_2_690x314.png" alt="image"></a></div><p>Worst to best compression performance across all contracts:</p><div><a href="https://ethresear.ch/uploads/default/original/3X/9/2/92f21bd69bf92a6430803c6adf2b3ce3f855a5c2.png" title="image" rel="nofollow"><img src="https://ethresear.ch/uploads/default/optimized/3X/9/2/92f21bd69bf92a6430803c6adf2b3ce3f855a5c2_2_690x321.png" alt="image"></a></div><p>Final notes:</p><ul><li>If using this in a client, I would expect to store a single byte with each contract that says whether it was compressed at all and, if so, which dictionary and algorithm were used. This would allow smooth upgrades to better dictionaries in the future, and would let the client leave uncompressed any contract that compression makes larger.</li><li>I used Zstandard as the compression library simply because I’ve had good experiences with it in the past. I’ve not compared different compression libraries or algorithms at this point.</li></ul></div>
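
<p>The one-byte tag and the pre-trained dictionary described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code behind the measurements: it swaps in Python's standard-library <code>zlib</code> (DEFLATE with a preset dictionary via the <code>zdict</code> argument) for Zstandard so that it runs without third-party packages. The tag values, the toy dictionary, and the <code>encode</code> / <code>decode</code> helpers are all hypothetical names made up for this example.</p>

```python
import zlib

# Hypothetical one-byte storage tags (an assumption for illustration only).
TAG_RAW = 0x00      # bytecode stored uncompressed
TAG_DICT_V1 = 0x01  # zlib/DEFLATE with preset dictionary version 1

# Toy stand-in for a trained dictionary: 200 distinct byte values, so plain
# DEFLATE finds no repeats inside them. A real dictionary would be trained
# on historical contract bytecode instead.
TOY_DICT_V1 = bytes((i * 37 + 11) % 256 for i in range(200))
DICTIONARIES = {TAG_DICT_V1: TOY_DICT_V1}


def encode(code: bytes, tag: int = TAG_DICT_V1) -> bytes:
    """Return tag byte + payload; fall back to raw if compression does not help."""
    comp = zlib.compressobj(zdict=DICTIONARIES[tag])
    packed = comp.compress(code) + comp.flush()
    if len(packed) < len(code):
        return bytes([tag]) + packed
    return bytes([TAG_RAW]) + code


def decode(record: bytes) -> bytes:
    """Read the tag byte and undo whatever encoding it names."""
    tag, body = record[0], record[1:]
    if tag == TAG_RAW:
        return body
    decomp = zlib.decompressobj(zdict=DICTIONARIES[tag])
    return decomp.decompress(body) + decomp.flush()


if __name__ == "__main__":
    # A "new contract" that reuses a 160-byte pattern from the dictionary,
    # followed by 8 bytes of its own.
    contract = TOY_DICT_V1[20:180] + bytes.fromhex("deadbeef00c0ffee")
    without_dict = zlib.compress(contract)
    record = encode(contract)
    assert decode(record) == contract
    print("original:", len(contract))
    print("compressed, no dictionary:", len(without_dict))
    print("tagged record with dictionary:", len(record))
```

<p>Because the tag selects the dictionary at decode time, a better dictionary could later be added under a new tag value without rewriting records stored under the old one, and inputs that do not shrink are stored raw under <code>TAG_RAW</code>.</p>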

30% reduction in stored contract code size beyond deduplication and compression

