lm-studio
lfm2.5-350m
Run score 2.9% across 70 completed iterations.
- Score
- 2.9%
- Passed
- 2
- Failed
- 68
- Errors
- 0
- Started
- 5/8/2026, 4:27:34 PM
- Ended
- 5/8/2026, 4:28:57 PM
- Duration
- 84s
Category Breakdown
| Category | Total | Passed | Failed | Errors | Score |
|---|---|---|---|---|---|
| Basic File Reading | 40 | 2 | 38 | 0 | 5.0% |
| Basic Skills | 30 | 0 | 30 | 0 | 0.0% |
Cases
| Case | iterations | Passed | Failed | Errors | Score |
|---|---|---|---|---|---|
| find-file | 10 total | 0 | 10 | 0 | 0.0% |
| read-exact-file | 10 total | 0 | 10 | 0 | 0.0% |
| read-exact-file-with-at-reference | 10 total | 2 | 8 | 0 | 20.0% |
| read-file | 10 total | 0 | 10 | 0 | 0.0% |
| use-skill | 10 total | 0 | 10 | 0 | 0.0% |
| use-skill-with-refs | 10 total | 0 | 10 | 0 | 0.0% |
| use-skill-with-scripts | 10 total | 0 | 10 | 0 | 0.0% |
Iterations
70 matching iterations
| Iteration | Model | Category | Variant | Status | Duration | Tools | Tokens | Context |
|---|---|---|---|---|---|---|---|---|
| find-file / 001 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 392ms | 1 | 2,374 | 3.8% |
| find-file / 002 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 232ms | 1 | 2,323 | 3.6% |
| find-file / 003 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 215ms | 1 | 2,318 | 3.6% |
| find-file / 004 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 313ms | 1 | 2,345 | 3.7% |
| find-file / 005 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 310ms | 3 | 2,420 | 3.8% |
| find-file / 006 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 428ms | 2 | 3,663 | 4.0% |
| find-file / 007 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 338ms | 1 | 2,397 | 3.8% |
| find-file / 008 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 252ms | 1 | 2,324 | 3.6% |
| find-file / 009 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 307ms | 1 | 2,353 | 3.7% |
| find-file / 010 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 273ms | 1 | 2,335 | 3.6% |
| read-exact-file / 001 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 686ms | 1 | 2,603 | 4.1% |
| read-exact-file / 002 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 262ms | 1 | 2,591 | 4.0% |
| read-exact-file / 003 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 262ms | 1 | 2,581 | 4.0% |
| read-exact-file / 004 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 247ms | 1 | 2,587 | 4.0% |
| read-exact-file / 005 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | -1,025ms | 1 | 2,588 | 4.0% |
| read-exact-file / 006 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 339ms | 2 | 3,923 | 4.2% |
| read-exact-file / 007 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 243ms | 1 | 2,587 | 4.0% |
| read-exact-file / 008 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 258ms | 1 | 2,588 | 4.0% |
| read-exact-file / 009 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 242ms | 1 | 2,590 | 4.0% |
| read-exact-file / 010 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 209ms | 1 | 2,570 | 4.0% |
| read-exact-file-with-at-reference / 001 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 601ms | 1 | 2,325 | 3.6% |
| read-exact-file-with-at-reference / 002 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 208ms | 1 | 2,320 | 3.6% |
| read-exact-file-with-at-reference / 003 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 259ms | 1 | 2,332 | 3.6% |
| read-exact-file-with-at-reference / 004 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 313ms | 2 | 3,518 | 3.7% |
| read-exact-file-with-at-reference / 005 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 238ms | 1 | 2,329 | 3.6% |
| read-exact-file-with-at-reference / 006 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 301ms | 1 | 2,353 | 3.7% |
| read-exact-file-with-at-reference / 007 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 290ms | 2 | 3,531 | 3.7% |
| read-exact-file-with-at-reference / 008 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | passed | 318ms | 2 | 3,529 | 3.7% |
| read-exact-file-with-at-reference / 009 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | passed | 362ms | 2 | 3,530 | 3.7% |
| read-exact-file-with-at-reference / 010 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 296ms | 1 | 2,358 | 3.7% |
| read-file / 001 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 237ms | 1 | 2,332 | 3.7% |
| read-file / 002 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 254ms | 1 | 2,344 | 3.7% |
| read-file / 003 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 215ms | 1 | 2,316 | 3.6% |
| read-file / 004 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 240ms | 1 | 2,336 | 3.7% |
| read-file / 005 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 285ms | 1 | 2,352 | 3.7% |
| read-file / 006 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 296ms | 1 | 2,362 | 3.7% |
| read-file / 007 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 198ms | 1 | 2,324 | 3.6% |
| read-file / 008 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 234ms | 1 | 2,338 | 3.7% |
| read-file / 009 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 229ms | 1 | 2,333 | 3.7% |
| read-file / 010 | lm-studio / lfm2.5-350m | Basic File Reading | Baseline (/skills) | failed | 257ms | 1 | 2,351 | 3.7% |
| use-skill / 001 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 301ms | 0 | 1,299 | 4.0% |
| use-skill / 002 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 248ms | 0 | 1,278 | 3.9% |
| use-skill / 003 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 234ms | 1 | 2,522 | 3.9% |
| use-skill / 004 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 213ms | 0 | 1,265 | 3.9% |
| use-skill / 005 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 219ms | 0 | 1,267 | 3.9% |
| use-skill / 006 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 242ms | 0 | 1,278 | 3.9% |
| use-skill / 007 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 302ms | 1 | 2,555 | 4.0% |
| use-skill / 008 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 235ms | 0 | 1,270 | 3.9% |
| use-skill / 009 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 204ms | 0 | 1,258 | 3.8% |
| use-skill / 010 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 262ms | 0 | 1,276 | 3.9% |
| use-skill-with-refs / 001 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 311ms | 0 | 1,295 | 4.0% |
| use-skill-with-refs / 002 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 230ms | 0 | 1,287 | 3.9% |
| use-skill-with-refs / 003 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 202ms | 0 | 1,275 | 3.9% |
| use-skill-with-refs / 004 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 209ms | 0 | 1,285 | 3.9% |
| use-skill-with-refs / 005 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 269ms | 0 | 1,297 | 4.0% |
| use-skill-with-refs / 006 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 217ms | 0 | 1,279 | 3.9% |
| use-skill-with-refs / 007 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 203ms | 0 | 1,281 | 3.9% |
| use-skill-with-refs / 008 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 237ms | 0 | 1,292 | 3.9% |
| use-skill-with-refs / 009 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 196ms | 0 | 1,274 | 3.9% |
| use-skill-with-refs / 010 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 198ms | 0 | 1,276 | 3.9% |
| use-skill-with-scripts / 001 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 361ms | 1 | 2,850 | 4.5% |
| use-skill-with-scripts / 002 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 294ms | 0 | 1,447 | 4.4% |
| use-skill-with-scripts / 003 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 291ms | 0 | 1,452 | 4.4% |
| use-skill-with-scripts / 004 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 224ms | 0 | 1,420 | 4.3% |
| use-skill-with-scripts / 005 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 294ms | 0 | 1,447 | 4.4% |
| use-skill-with-scripts / 006 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 241ms | 0 | 1,423 | 4.3% |
| use-skill-with-scripts / 007 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 340ms | 0 | 1,468 | 4.5% |
| use-skill-with-scripts / 008 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 207ms | 0 | 1,419 | 4.3% |
| use-skill-with-scripts / 009 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 487ms | 2 | 4,500 | 4.8% |
| use-skill-with-scripts / 010 | lm-studio / lfm2.5-350m | Basic Skills | Baseline (/skills) | failed | 199ms | 0 | 1,408 | 4.3% |