Mihir Jadhav
← Back to Writing

Self-Hosting in 2026: Why I Run My Own AI Stack

·3 min read
InfrastructureAWSSelf-Hosted

Everyone's going serverless. Everyone's using managed services. I'm running my own AI stack on a single AWS EC2 t4g.medium instance. Here's why.

The stack

OpenClaw - self-hosted AI platform for private, cost-controlled LLM access
Kong - API gateway routing traffic across all my services
Keycloak - SSO and Google OAuth for every app I build
Novu - self-hosted notification infrastructure
Getlago - self-hosted billing and metering
MeiliSearch - self-hosted search engine for Thumbboard.io

Total monthly cost: under $50 for everything.

The philosophy

Self-hosting isn't about saving money. It's about three things:

1. Understanding your own stack

When you run managed services, you learn their API. When you self-host, you learn the system. I can debug a Kong routing issue at 2am because I configured every route myself. I can't say the same about a managed API gateway.

2. Data ownership

Our LLM chatbot at Futurescape is trained on internal company data. Running that through a third-party API isn't just expensive - it's a data governance risk. OpenClaw on our own EC2 means the data never leaves our infrastructure.

3. Composability

Kong + Keycloak + Novu + Getlago aren't just separate tools - they're a platform. Keycloak handles auth for every app. Kong routes traffic. Novu sends notifications based on Getlago billing events. It's a self-hosted PaaS.

The tradeoffs

I won't pretend there are none:

Ops overhead - I'm the sysadmin. Updates, security patches, monitoring are my responsibility.
Resource sizing - Running multiple services on t4g.medium requires careful memory budgeting.
No auto-scaling - If traffic spikes, I scale manually.

For a solo builder running products with hundreds (not millions) of users, these tradeoffs are worth it. When OneBiz hits scale, I'll move billing to a managed service. Until then, I learn more from running it myself.

ARM-based instances are underrated

The t4g.medium is an ARM-based Graviton instance. It's 20% cheaper than equivalent x86 instances and handles my entire stack without breaking a sweat. If you're self-hosting in 2026, go ARM.

Should you self-host?

If you're building a startup that needs to move fast, probably not. Use managed services and focus on your product.

If you're a builder who wants to deeply understand infrastructure, absolutely. The knowledge compounds. Every service you run teaches you something about networking, auth, databases, and operations that makes you a better engineer.

I'm not anti-managed-services. I'm pro-understanding-your-stack.