Skip to content

Latest commit

 

History

History

executables

EXECUTABLES

Malware detection algorithms need to extract features from executable files (PE, ELF, MachO, OAT, DEX, VDEX, and ART formats). The LIEF package (https://github.com/lief-project/LIEF) has tools to parse these files and create features that can be used in training.

PE Features

Features for PE files are based on EMBER (https://arxiv.org/abs/1804.04637) (https://github.com/endgameinc/ember).

PEGeneralFeatures

➡️ Code

➡️ Description

Extracts general features from PE files such as size, import/export counts, and other basic features.

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • Multiple numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PEHeaderFeatures

➡️ Code

➡️ Description

Features derived from the PE file header and option header.

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • 63 numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PESectionCharacteristics

➡️ Code

➡️ Description

Extracts section characteristics from PE files.

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • Multiple numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PENormalizedByteCount

➡️ Code

➡️ Description

The counts for each byte value in a PE file. These counts are then normalized.

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • 256 numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PEDataDirectoryFeatures

➡️ Code

➡️ Description

Features derived from the PE file data directory

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • 30 numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PEImportsFeatures

➡️ Code

➡️ Description

Features derived from the PE file imports

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • 1280 numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PEExportsFeatures

➡️ Code

➡️ Description

Features derived from the export data section of the PE file.

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • 128 numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief