DISL: Fueling Research with A Large Dataset of Solidity Smart Contracts

25 Mar 2024 · Gabriele Morello, Mojtaba Eshghie, Sofia Bobadilla, Martin Monperrus

The DISL dataset contains 514,506 unique Solidity files that have been deployed to the Ethereum mainnet. It addresses the need for a large and diverse dataset of real-world smart contracts. DISL serves as a resource for developing machine learning systems and for benchmarking software engineering tools designed for smart contracts. By aggregating every verified smart contract from Etherscan up to January 15, 2024, DISL surpasses existing datasets in size and recency.
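
To make the notion of "unique Solidity files" concrete, the following is a minimal sketch of exact-duplicate removal by content hash. It is an illustration under assumptions, not the authors' pipeline: the abstract does not say how uniqueness was determined, and the function name `deduplicate_sources` and the sample contracts are hypothetical.

```python
import hashlib


def deduplicate_sources(sources):
    """Return the input source strings with exact duplicates removed.

    Assumption: two files count as duplicates only if their text is
    byte-for-byte identical. The first occurrence of each file is kept.
    """
    seen = set()
    unique = []
    for text in sources:
        # Hash the file content so only fixed-size digests are stored.
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(text)
    return unique


# Hypothetical sample data standing in for verified contract sources.
contracts = [
    "pragma solidity ^0.8.0; contract A {}",
    "pragma solidity ^0.8.0; contract B {}",
    "pragma solidity ^0.8.0; contract A {}",  # exact duplicate of the first
]
unique_contracts = deduplicate_sources(contracts)
print(len(unique_contracts))
```

Exact hashing treats files that differ only in whitespace or comments as distinct; a stricter notion of uniqueness would need normalization before hashing.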


Datasets

Introduced in the Paper: DISL
