Add prometheus eval
Showing
- .gitignore 5 additions, 1 deletion.gitignore
- data_generation/main_qwen.py 0 additions, 1 deletiondata_generation/main_qwen.py
- evaluation/Prometheus Evaluation Results - Base Model.pdf 0 additions, 0 deletionsevaluation/Prometheus Evaluation Results - Base Model.pdf
- evaluation/Prometheus Evaluation Results - Finetuned Model.pdf 0 additions, 0 deletions...ation/Prometheus Evaluation Results - Finetuned Model.pdf
- evaluation/codebud_prometheus_absoulte_eval.py 43 additions, 0 deletionsevaluation/codebud_prometheus_absoulte_eval.py
- evaluation/codebud_prometheus_relative_eval.py 30 additions, 0 deletionsevaluation/codebud_prometheus_relative_eval.py
- evaluation/eval_reports/qwen-base-responses-evaluation-pass-1.csv 24431 additions, 0 deletions...on/eval_reports/qwen-base-responses-evaluation-pass-1.csv
- evaluation/eval_reports/qwen-base-responses-evaluation-pass-10.csv 24447 additions, 0 deletions...n/eval_reports/qwen-base-responses-evaluation-pass-10.csv
- evaluation/eval_reports/qwen-base-responses-evaluation-pass-2.csv 24493 additions, 0 deletions...on/eval_reports/qwen-base-responses-evaluation-pass-2.csv
- evaluation/eval_reports/qwen-base-responses-evaluation-pass-3.csv 24521 additions, 0 deletions...on/eval_reports/qwen-base-responses-evaluation-pass-3.csv
- evaluation/eval_reports/qwen-base-responses-evaluation-pass-4.csv 24511 additions, 0 deletions...on/eval_reports/qwen-base-responses-evaluation-pass-4.csv
- evaluation/eval_reports/qwen-base-responses-evaluation-pass-5.csv 24575 additions, 0 deletions...on/eval_reports/qwen-base-responses-evaluation-pass-5.csv
- evaluation/eval_reports/qwen-base-responses-evaluation-pass-6.csv 24508 additions, 0 deletions...on/eval_reports/qwen-base-responses-evaluation-pass-6.csv
- evaluation/eval_reports/qwen-base-responses-evaluation-pass-7.csv 24468 additions, 0 deletions...on/eval_reports/qwen-base-responses-evaluation-pass-7.csv
- evaluation/eval_reports/qwen-base-responses-evaluation-pass-8.csv 24461 additions, 0 deletions...on/eval_reports/qwen-base-responses-evaluation-pass-8.csv
- evaluation/eval_reports/qwen-base-responses-evaluation-pass-9.csv 24474 additions, 0 deletions...on/eval_reports/qwen-base-responses-evaluation-pass-9.csv
- evaluation/eval_reports/qwen-finetuned-responses-evaluation-pass-1.csv 24487 additions, 0 deletions...al_reports/qwen-finetuned-responses-evaluation-pass-1.csv
- evaluation/eval_reports/qwen-finetuned-responses-evaluation-pass-10.csv 24449 additions, 0 deletions...l_reports/qwen-finetuned-responses-evaluation-pass-10.csv
- evaluation/eval_reports/qwen-finetuned-responses-evaluation-pass-2.csv 24530 additions, 0 deletions...al_reports/qwen-finetuned-responses-evaluation-pass-2.csv
- evaluation/eval_reports/qwen-finetuned-responses-evaluation-pass-3.csv 24510 additions, 0 deletions...al_reports/qwen-finetuned-responses-evaluation-pass-3.csv
Loading
Please register or sign in to comment