Claude Code Skills 2.0 adds evals plus benchmark test sets; changes target skill reliability as models update over time.
Results that may be inaccessible to you are currently showing.
Hide inaccessible resultsResults that may be inaccessible to you are currently showing.
Hide inaccessible results