BinaryCorp is built for binary similarity detection based on the ArchLinux official repositories and Arch User Repository. BinaryCorp contains tens of thousands of software, including editors, instant messenger, HTTP server, web browser, compiler, graphics library, cryptographic library, etc. The binary code similarity task requires a large number of labeled data, thus we use the infrastructures provided by ArchLinux to construct our dataset with different optimization levels (e.g O0, O1, O2, O3, Os).

Get BinaryCorp at here . For more details, please check our official repo: https://github.com/vul337/jTrans

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


  • MIT License

Modalities


Languages