Add dataset and baseline A/B results #13
Aditya-Ghatole wants to merge 2 commits into aboutcode-org:main
Signed-off-by: Aditya G <aditya.ghatole05@gmail.com>
@ziadhany I’m currently using the openai/gpt-oss-120b model at temperature 0.0, with both the existing prompt (prompt_v1) and an experimental prompt I’m working on.
I’m aware there are a few issues in the current code. I’m working on a cleaner solution and will update you soon. For now, I’ve shared a beta version that is functional and should be enough to move things forward.
@ziadhany
Summary
Add a curated vulnerability dataset and baseline evaluation results for review.
Details
This PR includes:
- A vulnerability dataset containing summaries with expected severity and CWE labels.
- Baseline evaluation results produced by running the current prompt against this dataset.
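For reviewers, here is a rough sketch of how A/B results of this kind could be scored against the expected labels. It is not the code in this PR: the `Record` shape, the field names, and the `accuracy` and `compare_prompts` helpers are hypothetical, and exact-match accuracy is only one possible metric.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    # Hypothetical record shape; the real dataset's field names may differ.
    summary: str
    expected_severity: str
    expected_cwe: str


def accuracy(records, predictions):
    """Return (severity_accuracy, cwe_accuracy) using exact-match scoring.

    predictions is a list of (severity, cwe) tuples, one per record.
    """
    if len(records) != len(predictions):
        raise ValueError("records and predictions must have the same length")
    if not records:
        return 0.0, 0.0
    # Compare case-insensitively so "High" and "high" count as the same label.
    sev_hits = sum(
        r.expected_severity.lower() == sev.lower()
        for r, (sev, _) in zip(records, predictions)
    )
    cwe_hits = sum(
        r.expected_cwe.upper() == cwe.upper()
        for r, (_, cwe) in zip(records, predictions)
    )
    n = len(records)
    return sev_hits / n, cwe_hits / n


def compare_prompts(records, predictions_a, predictions_b):
    """Score two prompt variants (A and B) against the same records."""
    return {
        "A": accuracy(records, predictions_a),
        "B": accuracy(records, predictions_b),
    }
```

Scoring both prompts on the same records keeps the comparison paired, so a difference in accuracy reflects the prompt rather than the sample.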
Purpose
The goal is to review and validate the dataset first, confirming that it contains sufficient and appropriate information to support CWE prediction, before making further changes to prompts or evaluation logic.
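Part of that review could be automated with a basic structural check, sketched below. The field names (`summary`, `severity`, `cwe`), the allowed severity set, and the `validate_record` helper are assumptions for illustration and may not match the dataset's actual schema. A check like this only catches missing or malformed fields; whether a summary carries enough information to predict a CWE still needs manual review.

```python
import re

# Assumed label set and ID format; adjust to whatever the dataset actually uses.
ALLOWED_SEVERITIES = {"low", "medium", "high", "critical"}
CWE_PATTERN = re.compile(r"^CWE-\d+$")


def validate_record(record):
    """Return a list of problems found in one dataset record (a dict)."""
    problems = []
    summary = record.get("summary") or ""
    if not summary.strip():
        problems.append("empty summary")
    severity = (record.get("severity") or "").strip().lower()
    if severity not in ALLOWED_SEVERITIES:
        problems.append(f"unknown severity: {record.get('severity')!r}")
    cwe = (record.get("cwe") or "").strip()
    if not CWE_PATTERN.match(cwe):
        problems.append(f"malformed CWE id: {record.get('cwe')!r}")
    return problems
```

An empty list means the record passed the structural checks; anything else can be listed per record for follow-up.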
Related issue: #3