Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Towards a centralized vulnerability database for Dependency-Track #4122

Open
2 tasks done
nscuro opened this issue Sep 4, 2024 · 6 comments
Open
2 tasks done

Towards a centralized vulnerability database for Dependency-Track #4122

nscuro opened this issue Sep 4, 2024 · 6 comments
Labels
enhancement New feature or request help wanted Extra attention is needed p2 Non-critical bugs, and features that help organizations to identify and reduce risk size/XL Higher effort

Comments

@nscuro
Copy link
Member

nscuro commented Sep 4, 2024

Current Behavior

When Dependency-Track came into existence around 2013, the only public and widely accepted vulnerability database was the NVD. Since its inception, DT has supported
mirroring of the NVD.

Over the past decade, multiple other public vulnerability databases came into existence:

Those databases filled a large gap in the vulnerability intelligence space: They support Package URL. To this date, the NVD still operates exclusively on CPE, which is understood to be problematic.

Further, more and more entities maintain their own databases, which may or may not feed into the larger databases listed above, for example:

The NVD is a database provided by the US government. Other nation states or regions
May provide their own equivalent - The EU discussed this possibility, and China already has one. Vendors selling products in these regions will likely need to analyze their products using such databases to comply with regulations.

The situation is further complicated by the fact that multiple sources can provide information for the same vulnerability: While the NVD will only report CVEs, and GitHub Security Advisories only GHSAs, OSV may report both. This can lead to conflicting information, since vulnerability metadata may vary across data sources: descriptions, severities, risk ratings, affected version ranges, fixed versions, and more.

Currently, DT tries to avoid surfacing conflicting information to users by enforcing which source can modify what kind of vulnerability. For example, the NVD is considered the authoritative source for CVEs, GitHub for GHSAs, and so on. This system fails when, like it happened with the NVD, upstream databases lag behind in providing up-to-date information. A problem which could be avoided by taking the data of all databases into consideration.

Finally, by having each instance of Dependency-Track download all this data over and over again, we’re putting public infrastructure such as the NVD under unnecessary stress. The NVD experienced major availability issues at the beginning of 2024, and the many DT instances downloading data from it certainly played their part in this.

Related issues:

Related discussions:

Proposed Behavior

The primary goal is to establish a centrally managed, pre-compiled vulnerability database, that all DT instances can consume. Users should receive more accurate, more complete data out-of-the-box, without having to tweak any settings.

Further details are documented here: https://docs.google.com/document/d/1DVV4ik7NGOBc6u-fdPlPVKoplNmSpzDT6iC4FJAFYi0/edit?usp=sharing

Note

Please request edit permissions if you'd like to add comments or suggest changes.

This proposal was brought up in the September Community Meeting!
You can watch the recording here: https://www.youtube.com/watch?v=hzelt7jv6dE&t=1188s

Checklist

@nscuro nscuro added enhancement New feature or request help wanted Extra attention is needed p2 Non-critical bugs, and features that help organizations to identify and reduce risk size/XL Higher effort labels Sep 4, 2024
@nscuro nscuro pinned this issue Sep 22, 2024
@Shortfinga
Copy link

Is anybody working on this already?
I'd like to contribute...

What would be a good plan to tackle this?

  • Draft for how the database could look like
  • PoC for updating it (tools, processes)
  • PoC for integating it to dependency-track?

@nscuro
Copy link
Member Author

nscuro commented Oct 21, 2024

Hey @Shortfinga, thanks for throwing your hat into the ring!

Work has not officially started yet, primarily because we were hoping for a bit more community feedback and involvement. So thanks a lot for doing just that!

Draft for how the database could look like

This would be the best way to start for community contributions, yes. The linked Google Doc has quite a few requirements in the Goals section. A large chunk of this work is coming up with a workable solution to building and distributing the database (file format, storage location, consumption etc.). Again the Doc has more details on what we need.

Please feel free to request permissions on the doc and add comments where things are unclear, or when you want to add things. Having a discussion in there, and bouncing ideas around would be a very good start I think.

PoC for integrating it to dependency-track?

I have started drafting the Dependency-Track side of things a while ago, but it needs a lot more fleshing out: https://github.com/DependencyTrack/hyades/blob/f60fa37510ec090d0eed42f8a82842bca6e999ff/docs/architecture/design/vulnerability-db-v2.md

However, the implementation in DT does not necessarily depend on how the centralized database looks. All that matters is that the Goals are met so the DB is useful for DT.

@nscuro
Copy link
Member Author

nscuro commented Oct 21, 2024

FYI, I added a link to the recording of our September community meeting, where we discussed this topic, to the issue description. Perhaps this is helpful for people curious what this is all about.

@nscuro
Copy link
Member Author

nscuro commented Nov 29, 2024

Trivy is facing issues due to rate limiting of services hosting their databases: aquasecurity/trivy#8009

This is certainly relevant to us as well, since we're aiming to follow a similar model as them.

@nscuro

This comment has been minimized.

@nscuro
Copy link
Member Author

nscuro commented Jan 7, 2025

I have created sub-issues for some initial work items we'll need to tackle. It's mostly research at this point, but help would still be much appreciated.

It might just be that there exist one or two efforts by other projects which we can use to significantly slash the work we need to do ourselves.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed p2 Non-critical bugs, and features that help organizations to identify and reduce risk size/XL Higher effort
Projects
None yet
Development

No branches or pull requests

2 participants