Npm package manager security for developers using metadata analysis

This project was done as my final practicum for completing MS in Cybersecurit @ Georgia Tech. I worked on it during January-May 2022, and completed it in May 2022.

The paper for this project is available here: Npm package manager security for developers using metadata analysis.

It takes much inspiration from the work of Maloss. Their paper is available at Towards Measuring Supply Chain Attacks on Package Managers for Interpreted Languages.

This is the code which I have made publicly available for my project. The project is described in the paper in much more detail.

How to use

You can use the tool in order to perform a standalon analysis using the following methods:

./npm-sec.sh: You can use the script present in the base folder by giving it a package name as an argument. This makes use of the npm-sec.sh located in the util/eval/scripts/ folder. It will perform the tests and then ask for confirmation to continue with installation of the npm package.
util/eval/scripts/npm-sec.sh: You can navigate to this script and then run it with an argument package name.
util/eval/scripts/tool.py: This will perform the analysis for the package name but it will not ask for confirmation to continue with install. It will do nothing more than perform the metadata tests and print the results.

The tool performs various metadata tests in order to give the user an idea about whether the requested package could be malicious or not. It will print out information for the tests if they raise a positive. This info can be used to make a decision about continuing the installation.

Structure

The project is structered into two modules of metadata tests that it performs:

Typo tests: Tests which look at attributes across all the packages in the typo radius. These are implemented in typo_eval_framework.py under the Typo_Framework class.
Anly tests: Tests which look at attributes of a package without the context of other packages in the typo radius. These are implemented in analyse_framework.py under the Analysis_Framework class. (Anly is short for 'self-analysis')

The collective_analysis.py file contains the Collective_Analysis class. It makes use of both the above classes to perform the full range of metadata tests for the packages. It also stores state and results for the packages that are testes, once their tests are completed.

The evaluator.py contains the Evaluator class. It is used to perform the metadata tests for a bulk of packages. This is mainly useful for the evaluation of the tool developed in this project.

Metadata tests

The tests performed for each module are listed below. A more in-depth desciprtion and discussion of these tests is given in the paper linked at the beginning. It also describes and discusses the evaluation results and the effectiveness of each of these tests.

Anly

Version Skipping
Immature Package
Strictly Increasing Version Numbers
Dist-tag is the latest version
First Version
Maintainer Changes
Author Changes
Package Popularity
Malicious Authors Involved (blacklist)
Malicious Maintainers Involved (blacklist)

Typo

Popularity Comparison
Age Comparison
Same Author Check

Contact

Feel free to contact me for any information that might be missing in the documentation of this project. I would be more than happy to provide any help.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Npm package manager security for developers using metadata analysis

How to use

Structure

Metadata tests

Anly

Typo

Contact

Files

README.md

Latest commit

History

README.md

File metadata and controls

Npm package manager security for developers using metadata analysis

How to use

Structure

Metadata tests

Anly

Typo

Contact