Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Checklist items provide guides instead of analysis #27

Open
IlonaShishov opened this issue Dec 9, 2024 · 1 comment
Open

Checklist items provide guides instead of analysis #27

IlonaShishov opened this issue Dec 9, 2024 · 1 comment
Labels
accuracy bug Something isn't working external P0

Comments

@IlonaShishov
Copy link
Contributor

IlonaShishov commented Dec 9, 2024

Sometimes, items in the checklist provide a "How-to" guide or instructions on achieving a task, rather than fulfilling their intended purpose of scanning the code base and reporting actual findings based on the analysis.

An example of such an item is:
{ "input": "Inspect Cleanup Process: Analyze the cleanup process following the decompression of archives. Ensure that it does not follow relative paths that could lead to overwriting or deleting files outside the intended scope. This is crucial as the vulnerability exploits this specific behavior.", "response": "The cleanup process following the decompression of archives should be designed to prevent path traversal attacks. This can be achieved by ensuring that the decompression process does not follow relative paths that could lead to overwriting or deleting files outside the intended scope. The container image should use absolute paths and validate user input to prevent such attacks. However, without more specific information about the container image's implementation, it is difficult to provide a more detailed answer." },

CVE: CVE-2024-1485
Component: openshift4/ose-console
output.json

@exactlyallan exactlyallan added the bug Something isn't working label Jan 8, 2025
@ashsong-nv
Copy link
Collaborator

Hi Ilona, thank you for raising this issue! We discussed this as a team and we think that prompt tuning experiments for the checklist model may be able to improve this issue.

Before running specific checklist model prompt tuning experiments, we are concurrently evaluating llama 3.3 model as a general improvement to the pipeline, and so far we are seeing promising accuracy improvements when using it for checklist generation. Our plan is to work on prompt engineering for this issue after migrating to llama 3.3 to ensure the new prompt is customized to work with the new model.

ruromero added a commit to ruromero/vulnerability-analysis that referenced this issue Mar 3, 2025
feat: add authentication configuration for the client
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
accuracy bug Something isn't working external P0
Projects
None yet
Development

No branches or pull requests

4 participants