Guarddog integration #2499

hugo-glez · 2024-09-06T19:31:04Z

adding GuardDog CI closes #2483

Description

GuardDog tool was added to the Docker of malware_tools_analyzers, because the requirement of python 3.10, libraries for thug were also updated. The decision to use docker instead of directly integration is because is the only way to support Windows.

[Guarddog] (https://github.com/DataDog/guarddog/tree/main) scans for libraries in the ecosystem to detect suspicious or malicious content

It supports pypi, npm and go modules. It can scan a binary file with the package or from the name retrieve from the repository and perform the scan. In total six new scanners were added.

Type of change

Please delete options that are not relevant.

[ x] New feature. The use of guarddog scaner for files and observables.

Checklist

Important Rules

If you miss to compile the Checklist properly, your PR won't be reviewed by the maintainers.
Everytime you make changes to the PR and you think the work is done, you should explicitly ask for a review. After being reviewed and received a "change request", you should explicitly ask for a review again once you have made the requested changes.

Results:

report:"Found 0 potentially malicious indicators scanning flask Some rules failed to run while scanning flask: * rules-all: failed to run rule: An error occurred when running Semgrep. command: semgrep --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/shady-links.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/cmd-overwrite.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/obfuscation.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/steganography.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/silent-process-execution.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/bidirectional-characters.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/clipboard-access.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/code-execution.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/exfiltrate-sensitive-data.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/download-executable.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/exec-base64.yml --config /opt/deploy/guarddog/venv/lib/python3.10/site-packages/guarddog/analyzer/sourcecode/dll-hijacking.yml --exclude='helm' --exclude='.idea' --exclude='venv' --exclude='test' --exclude='tests' --exclude='.env' --exclude='dist' --exclude='build' --exclude='semgrep' --exclude='migrations' --exclude='.github' --exclude='.semgrep_logs' --no-git-ignore --json --quiet --max-target-bytes=10000000 /tmp/tmpum2n0sfa/flask status code: 1 output: "
errors:
scan_type:"pypi"

…updated references for thug also

mlodic

thank you for your contribution!

I'll try to point out some possible changes and give you the space to do those if you have any time. If you have some problems about that, feel free to ask for help.

mlodic · 2024-09-09T07:13:41Z

api_app/analyzers_manager/file_analyzers/guarddog_file.py

+    # interval between http request polling (in secs)
+    poll_distance: int = 30


This is the polling between this script in the main docker container and the malware docker container. The problem about this number is that if the analyzers finishes its computation in the middle of those 30 seconds, the client would get the results a lot later than intended. Because of this, this value can usually reduced to a lower number.
How much time does this analyzer require to be completely executed in your experience?

mlodic · 2024-09-09T07:15:51Z

api_app/analyzers_manager/migrations/0121_analyzer_config_guarddogobservable_go.py

+    "name": "GuarddogObservable_go",
+    "description": "GuardDog is a tool that allows to identify malicious PyPI and npm packages or Go modules. It runs a set of heuristics on the package source code (through Semgrep rules) and on the package metadata.\r\n\r\ngo modules observable. Just give the name of the package",


It's enough a single migration for all the 3 cases (npm,pypi, org) and you can call it GuarddogObservable generically

you can remove the other 2 and the merge migration

mlodic · 2024-09-09T07:17:21Z

api_app/analyzers_manager/migrations/0123_analyzer_config_guarddogfile_go.py

+    "name": "GuarddogFile_go",
+    "description": "GuardDog is a tool that allows to identify malicious PyPI and npm packages or Go modules. It runs a set of heuristics on the package source code (through Semgrep rules) and on the package metadata.\r\n\r\nSubmit the go packed file to analize",


same for the "File" type of migrations, 1 python file is enough

mlodic · 2024-09-09T07:18:51Z

integrations/malware_tools_analyzers/Dockerfile

+# Add user for guarddog
+#RUN addgroup --system --gid 1000 guarddog \
+#    && adduser --system --shell /bin/bash --uid 1000 --ingroup guarddog guarddog
+


is this necessary or is it a leftover?

mlodic · 2024-09-09T07:21:15Z

integrations/malware_tools_analyzers/Dockerfile

+# Install Guarddog
+# Not working with virutalenv
+WORKDIR ${PROJECT_PATH}/guarddog
+
+RUN pip3 install --no-cache-dir --upgrade pip \
+    && pip3 install --no-cache-dir guarddog
+
+


the ideal thing would be to use its own python virtual environment like we did for all of the other integrations in this image to avoid conflicts. You can copy how we did that for the others

(you can add a new file in the requirements folder with that guardog dependency and pinning a version for reproducibility)

mlodic · 2024-09-09T07:24:21Z

integrations/malware_tools_analyzers/app.py

+
+shell2http.register_command(
+    endpoint="guarddog",
+    command_name="/usr/local/bin/guarddog",


when adding a virtual env, this path would change. See other tools

hugo-glez added 4 commits September 6, 2024 13:03

Update Dockerfile for malware tools analyzers to integrate guarddog, …

b19c66e

…updated references for thug also

Update app.py to add guarddog command

e581efb

Added classes to handle files and observables for guarddog

afad12e

Add migrations for guarddog and migrations merges

ea3bbac

mlodic requested changes Sep 9, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Guarddog integration #2499

Guarddog integration #2499

hugo-glez commented Sep 6, 2024

mlodic left a comment

mlodic Sep 9, 2024

mlodic Sep 9, 2024

mlodic Sep 9, 2024

mlodic Sep 9, 2024

mlodic Sep 9, 2024

mlodic Sep 9, 2024

mlodic Sep 9, 2024

mlodic Sep 9, 2024

		# interval between http request polling (in secs)
		poll_distance: int = 30

		"name": "GuarddogObservable_go",
		"description": "GuardDog is a tool that allows to identify malicious PyPI and npm packages or Go modules. It runs a set of heuristics on the package source code (through Semgrep rules) and on the package metadata.\r\n\r\ngo modules observable. Just give the name of the package",

		"name": "GuarddogFile_go",
		"description": "GuardDog is a tool that allows to identify malicious PyPI and npm packages or Go modules. It runs a set of heuristics on the package source code (through Semgrep rules) and on the package metadata.\r\n\r\nSubmit the go packed file to analize",

Guarddog integration #2499

Are you sure you want to change the base?

Guarddog integration #2499

Conversation

hugo-glez commented Sep 6, 2024

Description

Type of change

Checklist

Important Rules

mlodic left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment