
<div><p>My initial research shows that contract code storage can be reduced by a further 30%, on top of contract deduplication and compressing code at storage time.</p>
<p>Of necessity, the Solidity compiler, like every other compiler, produces bytecode containing recurring patterns of opcodes. When compressing an individual contract, an ideal compression algorithm learns these patterns after they have occurred once in that contract, and can then refer to them in shortened form throughout the rest of the contract.</p>
<p>However, this means the compression algorithm is always surprised by the first occurrence of a pattern in an individual contract: it has no idea that the pattern is common to many smart contracts, because it can only learn from the individual contract it is working on.</p>
<p>The solution to this is built into most compression libraries - you can provide a small pre-trained “dictionary” that encodes common patterns already seen in historical data. This allows immediate compression of common patterns the first time they appear in new data.</p>
<p>As a second benefit, a pre-trained dictionary also remembers large patterns from the over-represented spam contracts. For contracts that have been spammed tens of thousands of times with minor variations, a dictionary can reduce these to single-digit percentages of their original size.</p>
<p>Here are the results:</p>
<div><a href="https://ethresear.ch/uploads/default/original/3X/f/0/f0909407717981d471afd2779553664ebbb126c4.png" title="image" rel="nofollow"><img src="https://ethresear.ch/uploads/default/optimized/3X/f/0/f0909407717981d471afd2779553664ebbb126c4_2_690x299.png" alt="image"></a></div>
<p>Using the zelliac dataset of all contract bytecode deployed up to early 2025, there are <code>1,539,858</code> deduplicated deployed bytecode sets.</p>
<p>Using the Zstandard compression library at its default fast compression level of 3, and compressing each bytecode set individually, the total size goes from <code>100%</code> to <code>41.8%</code> of the original size. Adding a 100KB dictionary, trained at the default settings for that compression level, brings the bytecode down to <code>29.3%</code> of the original size, a <code>30%</code> reduction from the compressed size (29.3 / 41.8 ≈ 0.70).</p>
<p>Increasing the dictionary size or the compression level reduces the final size further; I kept this optimized for speed. It’s also quite possible that further tuning of the dictionary training parameters could result in even smaller sizes.</p>
<p>Total storage space required across different sized contract bytecodes:</p>
<div><a href="https://ethresear.ch/uploads/default/original/3X/3/d/3d1c0ed89b680c139d145f6daf4e48a053327a99.png" title="image" rel="nofollow"><img src="https://ethresear.ch/uploads/default/optimized/3X/3/d/3d1c0ed89b680c139d145f6daf4e48a053327a99_2_690x314.png" alt="image"></a></div>
<p>Performance across all contracts, from worst to best:</p>
<div><a href="https://ethresear.ch/uploads/default/original/3X/9/2/92f21bd69bf92a6430803c6adf2b3ce3f855a5c2.png" title="image" rel="nofollow"><img src="https://ethresear.ch/uploads/default/optimized/3X/9/2/92f21bd69bf92a6430803c6adf2b3ce3f855a5c2_2_690x321.png" alt="image"></a></div>
<p>Final notes:</p>
<ul><li>If using this in a client, I would presume storing a single byte that tells whether a contract is compressed at all and, if so, which dictionary / algorithm was used. This would allow smooth upgrades to better dictionaries in the future, and would also allow leaving uncompressed any bytecode that compression would make larger.</li>
<li>I used Zstandard as the compression library simply because I’ve had good experiences with it in the past. I’ve not compared different compression libraries or algorithms at this point.</li></ul></div>
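The single-byte tag from the final notes, combined with a shared dictionary, could be sketched roughly as follows. This is only an illustration under stated assumptions, not the post's implementation: it swaps in Python's built-in `zlib` preset dictionary (`zdict`) in place of Zstandard so that it runs without third-party packages, the tag values `TAG_RAW` and `TAG_DICT_V1` are hypothetical, and the "dictionary" is a hand-written opcode sequence assumed to be common rather than one trained on historical bytecode.

```python
import zlib

# Hypothetical one-byte tags; the post only proposes "a single byte".
TAG_RAW = 0x00       # bytecode stored uncompressed
TAG_DICT_V1 = 0x01   # zlib stream primed with shared dictionary version 1

# Illustrative stand-in for a trained dictionary: an opcode sequence
# (PUSH1 0x80 PUSH1 0x40 MSTORE ...) assumed here to be common.
PROLOGUE = bytes.fromhex("6080604052348015600f57600080fd5b50")
DICTIONARIES = {TAG_DICT_V1: PROLOGUE}


def encode(code: bytes, tag: int = TAG_DICT_V1) -> bytes:
    """Compress with the dictionary for `tag`; fall back to raw if not smaller."""
    comp = zlib.compressobj(level=6, zdict=DICTIONARIES[tag])
    packed = comp.compress(code) + comp.flush()
    if len(packed) >= len(code):
        return bytes([TAG_RAW]) + code
    return bytes([tag]) + packed


def decode(blob: bytes) -> bytes:
    """Read the tag byte and undo whatever encode() did."""
    tag, body = blob[0], blob[1:]
    if tag == TAG_RAW:
        return body
    decomp = zlib.decompressobj(zdict=DICTIONARIES[tag])
    return decomp.decompress(body) + decomp.flush()


# Round trip on a toy input built from the dictionary pattern.
sample = PROLOGUE * 3 + b"\xfe"
assert decode(encode(sample)) == sample
```

Because the tag is stored next to each entry, a better dictionary could later be registered under a new tag value while entries written with older tags remain readable.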

30% reduction in stored contract code size beyond deduplication and compression


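RAVE's price rose from around $16.50 to above $28 in the most recent trading day, setting an all-time high, before pulling back to around $26.30. That is a gain of as much as 51.7% within 24 hours and a cumulative gain of 1314.1% over the past year.

RAVE price breaks above $28 to a new all-time high; traded contract value reaches $20.54 million.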

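Iranian Foreign Ministry spokesperson Esmaeil Baghaei responded forcefully in the early hours of the 18th to Trump's "the US will help remove and ship out the enriched uranium […]
The article "Iran: We will not accept enriched uranium being shipped out! A rebuff to Trump, while acknowledging a draft 60-day negotiation memorandum with the US" was first published on 动区BlockTempo (动区动趋, "the most influential blockchain news media").

Iran: We will not accept enriched uranium being shipped out! A rebuff to Trump, while acknowledging a draft 60-day negotiation memorandum with the US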

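Iranian Foreign Minister Abbas Araghchi said that, under the agreement between Israel and Lebanon, the Strait of Hormuz is "fully open" during the ceasefire.

After Tehran made the announcement, US President Donald Trump immediately…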