Structure Beats Scale
This post is the story behind the research. If you want the full paper with methodology, statistics, and raw data, see Structure Beats Scale on GitHub.

This shouldn't have worked. I took a model that costs a tenth of a cent per call: Mercury 2, a diffusion-based reasoning model that nobody was talking...