bytecode compression #91

axic · 2016-04-07T21:41:48Z

I've run a small experiment to see how compressible EVM bytecode is and the result is: very.

The two examples I've taken is the Multisig Wallet contract and Shapeshiftbot:

Wallet: 6993 bytes to 2706 bytes
Shapeshift: 6018 bytes to 1486 bytes

The compressor used was zlib on level 1 (best speed). Level 9 (slowest) only gives an improvement of around 10% in the case of the Wallet.

It might be early discussing this and may become more important when blockchain rent comes into effect. Using compression would optimise from storage costs to paying decompression costs.

Would that make sense? In two cases it might:

very compressible code or
something which is rarely executed

This could implemented either:

on the blockchain level
as a self-decompressing contract (i.e. in bytecode only). However this would require jumping to a memory location, which is prohibited in EVM.

chriseth · 2016-04-08T08:08:59Z

Haha, interesting. If you fancy, take a look at the very last item on the Solidity backlog:
https://www.pivotaltracker.com/n/projects/1189488

Arachnid · 2016-08-18T11:10:53Z

I've done some experiments with doing LZF decompression in the EVM. Here's a simple decompressor: https://gist.github.com/Arachnid/4bf0dc27432e325505ca7b06f6366114

It averages about 40 gas per byte decompressing bytecode, and the compressed bytecode is about 60% of the plaintext size. That's not quite enough to make it more efficient to compress transaction data (which costs 68 gas per byte), unfortunately.

More optimal decompressors are probably possible by reading the input data directly from calldata, instead of memory, but I wouldn't expect more than perhaps another 25% improvement in speed from that.

chriseth · 2016-08-18T16:14:32Z

Oh and by the way, @axic: "Jumping to a memory location is prohibited by the EVM" - not quite: You can create a contract and then delegatecall into it.

axic · 2016-08-29T19:23:31Z

You can create a contract and then delegatecall into it.

True, but that is a second instance of a contract. You cannot do it properly within a single instance AFAIK.

gcolvin · 2017-07-11T23:09:49Z

Can't a client store the blockchain data any way it likes?

nicksavers · 2017-11-17T23:10:46Z

@axic Is this still relevant with #706 Snappy compression for DEVp2p?
Like @gcolvin said, for storage, each client can solve this in their own way.

poemm · 2020-01-20T19:05:28Z

There is new interest stateless blocks, where bytecode is transmitted with each block that executes it. Recent experiments show that bytecode is one of the size bottlenecks (but not the biggest bottleneck). A proposal to improve the bytecode size bottleneck is to merkleize bytecode.

In curiosity, I compressed 124 EVM contracts, including CryptoKitties, Uniswap, and other dapps which I greedily found on rankings lists and in recent transactions, until I had ~1 MB total. Then I measured sizes before and after compression.

Total size without compression: 1004806 bytes
Total size with off-the-shelf zstd compression: 364759 bytes
Total size compressed with zstd using custom dictionary: 235683 bytes

So EVM bytecode compressed to 36% without much effort, and 23% with a little more effort. Decompression of everything took around 250ms on my slow computer.

Compression may improve further by doing some of the following.

Separately compress opcodes and immediates, i.e. split-stream compression.
Tune the dictionary for popular contracts.
Use many dictionaries, each tuned for different opcode frequency histograms.
Tune algorithms to be more aggressive, being aware of space-time trade-offs.
Compress leaves of merkleized bytecode.

Arachnid · 2020-01-21T02:15:32Z

Be careful when considering compression as a solution; an attacker can easily make a maximally-uncompressible contract in order to cause a DoS.

poemm · 2020-01-21T15:15:14Z

Yes, good point. Depending on how you define "valid block", there could be a DoS attack.

Merkleizeing bytecode only gives 50% bytecode size reduction (early estimate, ref: see section: Code "chunking"). My post was to remind people that compression can give closer to 30% on average, sometime better.

Perhaps it is wise to allow individual contracts to choose whether/how to reduce their bytecode size, and the decompress/merkleize/bytecode_recover operations can be metered as EVM code or as precompiles, as wisely described by @Arachnid above.

Perhaps it is wise to allow popular contracts to be stored by every node, maybe as consensus cache of recent blocks, or with storage deposit/rent costs. Then only infrequently used bytecode will need to be transmitted with blocks.

github-actions · 2022-01-16T10:12:35Z

There has been no activity on this issue for two months. It will be closed in a week if no further activity occurs. If you would like to move this EIP forward, please respond to any outstanding feedback or add a comment indicating that you have addressed all required feedback and are ready for a review.

github-actions · 2022-01-30T11:08:16Z

This issue was closed due to inactivity. If you are still pursuing it, feel free to reopen it and respond to any feedback or request a review in a comment.

wanderer added the ERC label Apr 14, 2016

wanderer changed the title ~~RFC: bytecode compression~~ bytecode compression Apr 14, 2016

Souptacular added the editor-needs-to-review label Feb 10, 2017

axic mentioned this issue Jan 24, 2018

Introduce bytecode compression for gigantic contracts ethereum/solidity#3432

Closed

Arachnid removed the editor-needs-to-review label Mar 27, 2018

github-actions bot added the stale label Jan 16, 2022

github-actions bot closed this as completed Jan 30, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bytecode compression #91

bytecode compression #91

axic commented Apr 7, 2016

chriseth commented Apr 8, 2016

Arachnid commented Aug 18, 2016

chriseth commented Aug 18, 2016

axic commented Aug 29, 2016

gcolvin commented Jul 11, 2017

nicksavers commented Nov 17, 2017

poemm commented Jan 20, 2020

Arachnid commented Jan 21, 2020

poemm commented Jan 21, 2020

github-actions bot commented Jan 16, 2022

github-actions bot commented Jan 30, 2022

bytecode compression #91

bytecode compression #91

Comments

axic commented Apr 7, 2016

chriseth commented Apr 8, 2016

Arachnid commented Aug 18, 2016

chriseth commented Aug 18, 2016

axic commented Aug 29, 2016

gcolvin commented Jul 11, 2017

nicksavers commented Nov 17, 2017

poemm commented Jan 20, 2020

Arachnid commented Jan 21, 2020

poemm commented Jan 21, 2020

github-actions bot commented Jan 16, 2022

github-actions bot commented Jan 30, 2022