Files
MatBench/results/20250602_1706/summary.csv

706 B

1Modelaccuracyprecision_microrecall_microf1_microprecision_macrorecall_macrof1_macroData Count
2qwen-max-2025-01-250.64467005076142140.63366336633663370.6497461928934010.64160401002506260.63887600494743360.65010204081632650.64232342205538197
3gpt-4o0.54822335025380710.56185567010309280.55329949238578680.55754475703324810.57790880503144650.55367346938775510.5600088997453159197
4deepseek-chat0.67005076142131980.6769230769230770.67005076142131980.6734693877551020.68991146934460890.67051020408163260.6754210676562946197
5claude-sonnet-4-202505140.7005076142131980.69346733668341710.7005076142131980.6969696969696970.70721804842444380.70091836734693880.69833034513671197