This dataset was made from a set of 482 not packed cleanware including executable and object files coming from /usr/bin
from a fresh installation of Ubuntu 20.04. From this folder of not packed binaries, we built the 6 subsets with the packers integrated in the Packing Box with the dataset make
command, using the --pack-all
option. We end up with 200 binaries randomly chosen from the not packed folder per packer. This dataset can be used for training machine learning models tailored to ELF executable packing.
You may also like these:
- Awesome Executable Packing: A curated list of awesome resources related to executable packing.
- Bintropy: Analysis tool for estimating the likelihood that a binary contains compressed or encrypted bytes (inspired from this paper).
- Dataset of packed PE files: Dataset of PE samples packed with many different packers (fork of this repository).
- Docker Packing Box: Docker image gathering packers and tools for making datasets of packed executables.
- DSFF: Library implementing the DataSet File Format (DSFF).
- PEiD: Python implementation of the well-known Packed Executable iDentifier (PEiD).
- PyPackerDetect: Packing detection tool for PE files (fork of this repository).
- REMINDer: Packing detector using a simple heuristic (inspired from this paper).