lm-studio
qwen/qwen3.5-9b
Run score 92.9% across 70 completed iterations.
- Score
- 92.9%
- Passed
- 65
- Failed
- 5
- Errors
- 0
- Started
- 5/11/2026, 1:46:42 PM
- Ended
- 5/11/2026, 2:03:24 PM
- Duration
- 1002s
Category Breakdown
| Category | Total | Passed | Failed | Errors | Score |
|---|---|---|---|---|---|
| Basic File Reading | 40 | 40 | 0 | 0 | 100.0% |
| Basic Skills | 30 | 25 | 5 | 0 | 83.3% |
Cases
| Case | iterations | Passed | Failed | Errors | Score |
|---|---|---|---|---|---|
| find-file | 10 total | 10 | 0 | 0 | 100.0% |
| read-exact-file | 10 total | 10 | 0 | 0 | 100.0% |
| read-exact-file-with-at-reference | 10 total | 10 | 0 | 0 | 100.0% |
| read-file | 10 total | 10 | 0 | 0 | 100.0% |
| use-skill | 10 total | 7 | 3 | 0 | 70.0% |
| use-skill-with-refs | 10 total | 8 | 2 | 0 | 80.0% |
| use-skill-with-scripts | 10 total | 10 | 0 | 0 | 100.0% |
Iterations
70 matching iterations
| Iteration | Model | Category | Variant | Status | Duration | Tools | Tokens | Context |
|---|---|---|---|---|---|---|---|---|
| find-file / 001 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 5,592ms | 4 | 7,990 | 5.4% |
| find-file / 002 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 5,736ms | 4 | 8,095 | 5.5% |
| find-file / 003 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 4,749ms | 4 | 8,061 | 5.5% |
| find-file / 004 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 3,955ms | 4 | 7,674 | 5.2% |
| find-file / 005 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 3,779ms | 4 | 7,607 | 5.1% |
| find-file / 006 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 4,054ms | 4 | 7,651 | 5.2% |
| find-file / 007 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 3,607ms | 3 | 5,982 | 5.0% |
| find-file / 008 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 4,077ms | 4 | 7,579 | 5.1% |
| find-file / 009 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 4,837ms | 4 | 7,651 | 5.2% |
| find-file / 010 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 4,069ms | 4 | 7,706 | 5.3% |
| read-exact-file / 001 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 2,850ms | 2 | 4,663 | 5.0% |
| read-exact-file / 002 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 3,613ms | 4 | 8,261 | 5.5% |
| read-exact-file / 003 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 1,973ms | 2 | 4,620 | 4.9% |
| read-exact-file / 004 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 2,994ms | 2 | 4,631 | 5.0% |
| read-exact-file / 005 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 2,280ms | 2 | 4,651 | 5.0% |
| read-exact-file / 006 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 2,217ms | 2 | 4,657 | 5.0% |
| read-exact-file / 007 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 2,202ms | 2 | 4,646 | 5.0% |
| read-exact-file / 008 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 2,356ms | 2 | 4,649 | 5.0% |
| read-exact-file / 009 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 3,058ms | 3 | 6,409 | 5.3% |
| read-exact-file / 010 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 2,136ms | 2 | 4,630 | 4.9% |
| read-exact-file-with-at-reference / 001 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 7,378ms | 2 | 4,443 | 4.8% |
| read-exact-file-with-at-reference / 002 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 12,447ms | 2 | 4,492 | 4.9% |
| read-exact-file-with-at-reference / 003 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 15,484ms | 3 | 6,036 | 4.9% |
| read-exact-file-with-at-reference / 004 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 11,164ms | 2 | 4,452 | 4.8% |
| read-exact-file-with-at-reference / 005 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 8,983ms | 2 | 4,492 | 4.8% |
| read-exact-file-with-at-reference / 006 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 8,935ms | 2 | 4,462 | 4.8% |
| read-exact-file-with-at-reference / 007 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 9,969ms | 2 | 4,638 | 5.0% |
| read-exact-file-with-at-reference / 008 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 9,858ms | 2 | 4,468 | 4.8% |
| read-exact-file-with-at-reference / 009 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 8,293ms | 2 | 4,484 | 4.9% |
| read-exact-file-with-at-reference / 010 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 9,733ms | 2 | 4,445 | 4.8% |
| read-file / 001 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 17,162ms | 4 | 7,704 | 5.3% |
| read-file / 002 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 14,207ms | 4 | 7,642 | 5.2% |
| read-file / 003 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 5,792ms | 4 | 7,651 | 5.1% |
| read-file / 004 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 3,813ms | 4 | 7,634 | 5.2% |
| read-file / 005 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 3,944ms | 4 | 7,618 | 5.2% |
| read-file / 006 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 3,382ms | 4 | 7,568 | 5.1% |
| read-file / 007 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 4,640ms | 4 | 7,920 | 5.4% |
| read-file / 008 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 2,947ms | 3 | 5,931 | 4.9% |
| read-file / 009 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 4,398ms | 4 | 7,720 | 5.3% |
| read-file / 010 | lm-studio / qwen/qwen3.5-9b | Basic File Reading | Baseline (/skills) | passed | 4,175ms | 4 | 7,627 | 5.2% |
| use-skill / 001 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 2,866ms | 2 | 4,724 | 5.0% |
| use-skill / 002 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | failed | 399,910ms | 190 | 3,229,064 | 52.6% |
| use-skill / 003 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 2,607ms | 2 | 4,725 | 5.0% |
| use-skill / 004 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 1,979ms | 2 | 4,674 | 5.0% |
| use-skill / 005 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | failed | 22,314ms | 12 | 31,396 | 10.9% |
| use-skill / 006 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 2,683ms | 2 | 4,789 | 5.1% |
| use-skill / 007 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | failed | 9,388ms | 8 | 17,111 | 7.2% |
| use-skill / 008 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 2,045ms | 2 | 4,673 | 5.0% |
| use-skill / 009 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 2,566ms | 2 | 4,671 | 5.0% |
| use-skill / 010 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 3,055ms | 2 | 4,696 | 5.0% |
| use-skill-with-refs / 001 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 11,022ms | 5 | 11,835 | 7.1% |
| use-skill-with-refs / 002 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 3,945ms | 3 | 6,924 | 5.7% |
| use-skill-with-refs / 003 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 3,319ms | 3 | 6,742 | 5.5% |
| use-skill-with-refs / 004 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 9,394ms | 8 | 17,069 | 7.1% |
| use-skill-with-refs / 005 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 13,156ms | 13 | 26,554 | 8.3% |
| use-skill-with-refs / 006 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | failed | 2,839ms | 2 | 4,826 | 5.2% |
| use-skill-with-refs / 007 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | failed | 6,726ms | 2 | 5,127 | 6.2% |
| use-skill-with-refs / 008 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 7,218ms | 5 | 11,407 | 6.6% |
| use-skill-with-refs / 009 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 31,931ms | 29 | 80,413 | 13.6% |
| use-skill-with-refs / 010 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 10,006ms | 6 | 12,536 | 6.5% |
| use-skill-with-scripts / 001 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 14,763ms | 5 | 11,319 | 6.5% |
| use-skill-with-scripts / 002 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 13,114ms | 5 | 9,569 | 6.4% |
| use-skill-with-scripts / 003 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 20,223ms | 7 | 16,199 | 7.2% |
| use-skill-with-scripts / 004 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 10,893ms | 3 | 7,209 | 5.9% |
| use-skill-with-scripts / 005 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 12,794ms | 4 | 9,215 | 6.2% |
| use-skill-with-scripts / 006 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 11,070ms | 3 | 7,085 | 5.8% |
| use-skill-with-scripts / 007 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 14,124ms | 3 | 7,087 | 5.8% |
| use-skill-with-scripts / 008 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 15,098ms | 3 | 7,093 | 5.8% |
| use-skill-with-scripts / 009 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 14,012ms | 3 | 7,057 | 5.8% |
| use-skill-with-scripts / 010 | lm-studio / qwen/qwen3.5-9b | Basic Skills | Baseline (/skills) | passed | 15,047ms | 3 | 7,043 | 5.8% |