Table of Results

| LLM Name | Python "Hello World" | Python Clock | Python Application | Python Compliment App | Notes |
| --- | --- | --- | --- | --- | --- |
| mlfoundations-dev_-_mistral_7b_0-3_oh-dcft-v3.1-claude-3-5-haiku-20241022 | Completed in a single pass | Failed | Failed | | Utilized Cline interface |
| llama-3.2-1b-claude-3.7-sonnet-reasoning-distilled | Couldn't use tools | | | | |
| llama-3.2-3b-claude-3.7-sonnet-reasoning-distilled | Not using the tools | | | | |
| meta-llama-3.1-8b-instruct | Completed in a single pass | Tried hard | | | |