Skill Evaluation

claw skill evaluate prompt-optimizer

Score Breakdown

Safety: 96
Executability: 88
Completeness: 85
Maintainability: 90
Cost: 75
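
As a rough cross-check on the breakdown above, the sketch below computes the unweighted mean of the five category scores. The report does not say how the overall score is aggregated, so equal weighting is purely an assumption here. The plain mean comes out at 86.8, not the reported overall of 88, so the tool presumably weights or rounds the categories in some way this report does not show.

```python
# Category scores copied from the Score Breakdown above.
scores = {
    "Safety": 96,
    "Executability": 88,
    "Completeness": 85,
    "Maintainability": 90,
    "Cost": 75,
}


def unweighted_mean(values):
    """Plain arithmetic mean. Equal weighting is an assumption,
    not the evaluator's documented aggregation method."""
    values = list(values)
    return sum(values) / len(values)


mean_score = unweighted_mean(scores.values())
print(f"Unweighted mean: {mean_score:.1f}")
```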

Check Results

Evaluation Checks

Overall Score: 88

input_validation: PASS

Prompt text validated for length and format

output_quality: PASS

Optimized prompts show measurable improvement

cost_estimation: WARN

Benchmarking may consume significant tokens

safety_filter: PASS

Rejects prompts designed to bypass safety

reproducibility: PASS

Results are reproducible with same seed
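
One way to picture a seed-based reproducibility check is to run the same stochastic step twice with the same seed and compare the outputs. The sketch below uses Python's standard `random` module. The `sample_candidates` function and the candidate list are hypothetical stand-ins for illustration, not part of the skill itself.

```python
import random


def sample_candidates(candidates, k, seed):
    """Pick k candidates with a private RNG so the seed fully
    determines the result (no shared global state)."""
    rng = random.Random(seed)
    return rng.sample(candidates, k)


# Hypothetical prompt variants, used only for illustration.
candidates = [f"variant-{i}" for i in range(10)]

first = sample_candidates(candidates, 3, seed=42)
second = sample_candidates(candidates, 3, seed=42)

# Same seed and same inputs: the two runs should match exactly.
print(first == second)
```

Using a dedicated `random.Random` instance rather than the module-level functions keeps other code from disturbing the sequence between runs.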

token_tracking: PASS

Token usage logged for cost analysis
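
A minimal sketch of what per-call token logging could look like, assuming the caller already knows the prompt and completion token counts for each request. The `TokenTracker` class, the logger name, and the counts are all hypothetical and are not taken from the skill's implementation.

```python
import logging

logging.basicConfig(level=logging.INFO, format="%(message)s")
log = logging.getLogger("token_usage")  # hypothetical logger name


class TokenTracker:
    """Accumulates per-call token counts so totals can be reviewed later."""

    def __init__(self):
        self.prompt_tokens = 0
        self.completion_tokens = 0

    @property
    def total(self):
        return self.prompt_tokens + self.completion_tokens

    def record(self, prompt_tokens, completion_tokens):
        """Add one call's counts and emit a log line with the running total."""
        self.prompt_tokens += prompt_tokens
        self.completion_tokens += completion_tokens
        log.info(
            "prompt=%d completion=%d total_so_far=%d",
            prompt_tokens, completion_tokens, self.total,
        )


tracker = TokenTracker()
tracker.record(120, 45)  # illustrative counts, not measured values
tracker.record(200, 80)
```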