MLOps, Observability & Cost/Performance
Essays by the OneMind Strata Team.
The “Two-Model” Pattern for Cost & Reliability
Cross-Industry
Cheap model first, smart model second; route up only when needed.
Observability: What Matters Beyond Tokens
Technology & Software
Answerability, latency budget, and drift, not just spend.
Batch vs. Streaming for AI Workloads
Resources & Utilities
When nightly jobs beat real-time (and vice versa).
Cost Postmortems That Actually Change Things
Public Sector
From “too expensive” to concrete routing/caching fixes.
Versioning Prompts, Policies, and Models Together
Cross-Industry
Ship prompts, policies, and models as one versioned set, not as separate parts; roll forward safely.