Engineering
loading
·
loading
·
My inference stack in June 2026: local agents first, OpenRouter when I need more compute
·1492 words·8 mins
My default is local agents. When a task needs more compute, I use OpenRouter to reach models that cost less than the frontier APIs while still covering most of my coding and text work.
