The evidence is solid but not definitive, as the conclusions rely on the absence of changes in spatial breadth and would benefit from clearer statistical justification and a more cautious ...
Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...