Skill Evaluation
claw skill evaluate bug-hunter
Score Breakdown
Safety
90
Executability
82
Completeness
80
Maintainability
84
Cost
78
Check Results
Evaluation Checks
Overall Score:83
input_validationPASS
Repository paths sanitized and validated
sandbox_executionPASS
Fuzzing runs in isolated sandbox
false_positive_rateWARN
False positive rate around 12% for complex patterns
language_coveragePASS
Core languages fully supported
resource_limitsPASS
CPU and memory usage bounded
output_formatPASS
Bug reports follow standard SARIF format