Writing
Writing on healthcare interoperability, API ecosystems, and operational execution, plus occasional lab notes. For deeper implementation dives, see Case Studies.
Showing all 7 posts
Professional Articles
Getting Gemma 4 Running on a Radeon 7900 XTX (with and without TurboQuant)
What it took to get Gemma 4 E4B serving cleanly on Radeon through FlexInfer: a stable TRITON lane on a 7900 XTX, an experimental TurboQuant long-context lane on a second node, and the GPTQ pipeline work still underway.
Build Your Own Legs Before the Crutches Fail
AI-assisted development is useful leverage, but only if you convert borrowed competence into real judgment before the support becomes a dependency.
Standing Up a GPU-Ready Private AI Platform (Harvester + K3s + Flux + GitLab)
Field notes from building and operating a small private GPU platform with Harvester, K3s, and a GitLab -> Flux delivery loop.
Optimizing Real-Time Kubernetes Visualizations: From 25ms to 12ms Per Frame
A deep dive into optimizing Canvas 2D and Three.js visualizations for Kubernetes dashboards, covering algorithmic complexity, memory management, and GPU-efficient rendering patterns.
Case Study: When API Docs Omit Contract Details — Patient Matching and the “Enhanced Best Match” Trap
A real-world integration story from both sides of the table: supporting and consuming the same healthcare API.
Healthcare Interoperability in 2025: From “SMART Apps” to TEFCA-Scale Exchange
What changed since Meaningful Use, and what still blocks plug-and-play healthcare apps.
Operationalizing Healthcare API Integrations: The Playbook That Actually Works
A practical blueprint for scaling partner integrations without burning out engineering or support.