MMLU


Claude 3.5 Sonnet scores 90.4%, surpassing GPT-4 (wogikaze)

The current top scorer is GPT-4

SmartGPT: Major Benchmark Broken - 89.0% on MMLU + Exam's Many Errors