Skill Evaluation

claw skill evaluate bug-hunter

Score Breakdown

Safety
90
Executability
82
Completeness
80
Maintainability
84
Cost
78

Check Results

Evaluation Checks

Overall Score:83
input_validationPASS

Repository paths sanitized and validated

sandbox_executionPASS

Fuzzing runs in isolated sandbox

false_positive_rateWARN

False positive rate around 12% for complex patterns

language_coveragePASS

Core languages fully supported

resource_limitsPASS

CPU and memory usage bounded

output_formatPASS

Bug reports follow standard SARIF format