lm-studio

lfm2.5-350m

Run score 2.9% across 70 completed iterations.

Score
2.9%
Passed
2
Failed
68
Errors
0
Started
5/8/2026, 4:27:34 PM
Ended
5/8/2026, 4:28:57 PM
Duration
84s

Category Breakdown

Category Total Passed Failed Errors Score
Basic File Reading 40 2 38 0 5.0%
Basic Skills 30 0 30 0 0.0%

Cases

Case iterations Passed Failed Errors Score
find-file 10 total 0 10 0 0.0%
read-exact-file 10 total 0 10 0 0.0%
read-exact-file-with-at-reference 10 total 2 8 0 20.0%
read-file 10 total 0 10 0 0.0%
use-skill 10 total 0 10 0 0.0%
use-skill-with-refs 10 total 0 10 0 0.0%
use-skill-with-scripts 10 total 0 10 0 0.0%

Iterations

70 matching iterations

IterationModelCategoryVariantStatusDurationToolsTokensContext
find-file / 001lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed392ms12,3743.8%
find-file / 002lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed232ms12,3233.6%
find-file / 003lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed215ms12,3183.6%
find-file / 004lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed313ms12,3453.7%
find-file / 005lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed310ms32,4203.8%
find-file / 006lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed428ms23,6634.0%
find-file / 007lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed338ms12,3973.8%
find-file / 008lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed252ms12,3243.6%
find-file / 009lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed307ms12,3533.7%
find-file / 010lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed273ms12,3353.6%
read-exact-file / 001lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed686ms12,6034.1%
read-exact-file / 002lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed262ms12,5914.0%
read-exact-file / 003lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed262ms12,5814.0%
read-exact-file / 004lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed247ms12,5874.0%
read-exact-file / 005lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed-1,025ms12,5884.0%
read-exact-file / 006lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed339ms23,9234.2%
read-exact-file / 007lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed243ms12,5874.0%
read-exact-file / 008lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed258ms12,5884.0%
read-exact-file / 009lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed242ms12,5904.0%
read-exact-file / 010lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed209ms12,5704.0%
read-exact-file-with-at-reference / 001lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed601ms12,3253.6%
read-exact-file-with-at-reference / 002lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed208ms12,3203.6%
read-exact-file-with-at-reference / 003lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed259ms12,3323.6%
read-exact-file-with-at-reference / 004lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed313ms23,5183.7%
read-exact-file-with-at-reference / 005lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed238ms12,3293.6%
read-exact-file-with-at-reference / 006lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed301ms12,3533.7%
read-exact-file-with-at-reference / 007lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed290ms23,5313.7%
read-exact-file-with-at-reference / 008lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)passed318ms23,5293.7%
read-exact-file-with-at-reference / 009lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)passed362ms23,5303.7%
read-exact-file-with-at-reference / 010lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed296ms12,3583.7%
read-file / 001lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed237ms12,3323.7%
read-file / 002lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed254ms12,3443.7%
read-file / 003lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed215ms12,3163.6%
read-file / 004lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed240ms12,3363.7%
read-file / 005lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed285ms12,3523.7%
read-file / 006lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed296ms12,3623.7%
read-file / 007lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed198ms12,3243.6%
read-file / 008lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed234ms12,3383.7%
read-file / 009lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed229ms12,3333.7%
read-file / 010lm-studio / lfm2.5-350mBasic File ReadingBaseline (/skills)failed257ms12,3513.7%
use-skill / 001lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed301ms01,2994.0%
use-skill / 002lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed248ms01,2783.9%
use-skill / 003lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed234ms12,5223.9%
use-skill / 004lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed213ms01,2653.9%
use-skill / 005lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed219ms01,2673.9%
use-skill / 006lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed242ms01,2783.9%
use-skill / 007lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed302ms12,5554.0%
use-skill / 008lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed235ms01,2703.9%
use-skill / 009lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed204ms01,2583.8%
use-skill / 010lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed262ms01,2763.9%
use-skill-with-refs / 001lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed311ms01,2954.0%
use-skill-with-refs / 002lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed230ms01,2873.9%
use-skill-with-refs / 003lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed202ms01,2753.9%
use-skill-with-refs / 004lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed209ms01,2853.9%
use-skill-with-refs / 005lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed269ms01,2974.0%
use-skill-with-refs / 006lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed217ms01,2793.9%
use-skill-with-refs / 007lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed203ms01,2813.9%
use-skill-with-refs / 008lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed237ms01,2923.9%
use-skill-with-refs / 009lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed196ms01,2743.9%
use-skill-with-refs / 010lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed198ms01,2763.9%
use-skill-with-scripts / 001lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed361ms12,8504.5%
use-skill-with-scripts / 002lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed294ms01,4474.4%
use-skill-with-scripts / 003lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed291ms01,4524.4%
use-skill-with-scripts / 004lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed224ms01,4204.3%
use-skill-with-scripts / 005lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed294ms01,4474.4%
use-skill-with-scripts / 006lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed241ms01,4234.3%
use-skill-with-scripts / 007lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed340ms01,4684.5%
use-skill-with-scripts / 008lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed207ms01,4194.3%
use-skill-with-scripts / 009lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed487ms24,5004.8%
use-skill-with-scripts / 010lm-studio / lfm2.5-350mBasic SkillsBaseline (/skills)failed199ms01,4084.3%