FastRoutervs.Helicone
Helicone built one of the cleanest LLM observability products in the category. Mintlify acquired it in March 2026; the cloud product is in maintenance mode — security patches and new model support continue, but no new features. If you're still on it, plan a migration before the gap becomes a problem.
By Andrej Gamser~12 min readShort version
The quick decision
| If you are... | Use... | Why? |
|---|---|---|
| Already on Helicone Cloud and want to keep moving | FastRouter | Active roadmap, parity on observability, plus routing + evals + governance. |
| Need to keep Helicone-specific workflows on your own infra | Helicone OSS | Apache 2.0 repo, Docker/Helm. Frozen at current features but stays on your terms. |
| Want backend-agnostic instrumentation | OpenTelemetry | OTel + Langfuse or another OTel backend keeps you off any single vendor's roadmap. |
| Want the deepest gateway + evals combo | FastRouter | 7 routing strategies, Smart + Auto + GEPA evals, MCP credential vaulting. |
At a glance: Key metrics
| Metric | FastRouter | Helicone |
|---|---|---|
| Roadmap status | Active development | Maintenance only (Mintlify, Mar 2026) |
| Routing strategies | 7 strategies (incl. AI Auto Router) | Provider routing + failover; no new strategies |
| Built-in evals | Smart + Auto + GEPA + Video | LLM-as-judge experiments (frozen) |
| Open-source / self-host | Managed only | Apache 2.0, Docker + Helm |
Feature matrix
| Capability | FastRouter | Helicone |
|---|---|---|
| Multi-provider routing | 7 strategies incl. AI Auto Router | Provider routing + failover (no new strategies coming) |
| AI Auto Model Router | Per-request, eval-driven | Not supported |
| Production evals | Smart + Auto + GEPA | LLM-as-judge shipped, frozen |
| Sessions / multi-step traces | Supported | Strong implementation |
| Prompt management / A/B | Supported | Versioning + A/B testing |
| MCP credential vaulting | Agents never see raw keys | Not supported |
| Open source / self-host | Managed only | Apache 2.0; Docker, Helm |
Three migration paths
Move to a managed alternative — FastRouter. Lowest engineering lift. Both OpenAI-compatible. Sessions, traces, prompts, and per-user cost tracking map cleanly. Hands-on migration support included.
Self-host Helicone OSS. Highest fidelity to your current setup. Apache 2.0; Postgres + Clickhouse + Redis. Frozen at OSS feature set but stays on your infra.
Instrument with OpenTelemetry, stay backend-agnostic. Most defensive long-term. OTel GenAI conventions + Langfuse / FastRouter / your own ClickHouse. Decouples you from any single vendor's roadmap.
What changed in March 2026
Mintlify acquired Helicone. Justin Torre and Cole Gottdank joined Mintlify. Cloud product is in maintenance mode — security patches, bug fixes, new model support continue; no new features. Mintlify's stated rationale is "AI knowledge infrastructure" — they want agents that maintain their documentation.
What that means for current users: cloud product still runs; SLAs unchanged; the OSS repo accepts community PRs but no core team driving feature work; Mintlify-internal use likely deepens.
Side-by-side
Feature matrix
✓ supported, ✗ not supported, ◑ partial, ⏸ frozen (maintenance only).
| Capability | FastRouterActive | HeliconeMaintenance |
|---|---|---|
| Roadmap status | Active development | Maintenance only (Mintlify, March 2026) |
| OpenAI-compatible endpoint | Supported | Supported |
| Async / non-proxy mode | Sidecar logging | Helicone's signature option |
| Multi-provider routing | 7 strategies incl. AI Auto Router | Provider routing + failover; no new strategies coming |
| AI Auto Model Router | Supported | Not supported |
| Smart / Automatic Evaluations | Live production evals | ◑LLM-as-judge experiments shipped, no new evaluators |
| GEPA prompt optimization | Supported | Not supported |
| Video evaluations | Supported | Not supported |
| Sessions / multi-step traces | Supported | Strong implementation |
| Custom properties / metadata | Supported | Supported |
| Prompt management / versioning | Supported | Versioning, A/B testing |
| Caching | Semantic + simple | Cross-provider Redis cache |
| Per-user / per-feature cost tracking | Supported | Supported |
| MCP credential vaulting | Supported | Not supported |
| Workspace governance / RBAC | Supported | 5 orgs on Team plan; SOC-2 / HIPAA on Team+ |
| Open source / self-host | Managed only | Apache 2.0 on GitHub; Docker, Helm |
| Active community / new features | Supported | Frozen at acquisition |
| Migration assistance | Hands-on migration support included | — |
What changed
What "maintenance mode" actually means here
Mintlify's stated rationale for the acquisition is "AI knowledge infrastructure" — they want to build agents that autonomously maintain and update documentation, and the Helicone gateway/observability primitives are useful inputs. Mintlify was already a Helicone customer pre-acquisition.
For current Helicone users:
Cloud product still runs. Free, Pro ($79/mo), Team ($799/mo), and Enterprise tiers continue to be supported. SLAs unchanged.
Security patches and new model support continue. The team has committed to keeping the lights on.
No new features. No new evaluators, no new gateway capabilities, no new analytics surfaces.
OSS repo accepts community PRs. But there's no core team driving feature work.
Mintlify-internal use likely deepens. Some Helicone capabilities may end up tightly coupled to Mintlify's agent platform over time.
Pattern is similar to other "acqui-hire-with-a-product" outcomes: the existing platform doesn't disappear, but it stops being a competitive product. The longer you wait, the more your stack accumulates dependencies on a tool that isn't moving.
If you're on Helicone today, the right window for migration planning is 6–12 months. Long enough to do it without panic, short enough that you're not still on a static product when the next round of evals/routing/MCP capabilities ships across the rest of the category.
Observability parity
The easy part to replace
Helicone's observability layer is genuinely well-designed — sessions, custom properties, per-user/per-feature cost aggregation, request/response tracing across 100+ providers. It's also the part of Helicone with the most parity in the broader market. FastRouter ships an observability stack at feature parity for almost all of those primitives, and the rest of the category (Langfuse, Braintrust, OTel-native backends) covers the same surface.
If your reason for being on Helicone is observability, the migration is structurally low-risk. The data model maps cleanly to FastRouter's traces, sessions, and custom properties; the dashboard surfaces are similar; per-user cost tracking is included; OTel export keeps you backend-agnostic if you'd rather stay portable.
Helicone built the right primitives. The bad news for Helicone is those primitives are commoditized now.
Routing & gateway
Helicone evolved into a gateway. The roadmap stopped there.
Helicone shipped a real Rust-based gateway in late 2025 — provider routing with weighted policies, instant failover on rate limits/timeouts/server errors, Redis-based caching with cross-provider compatibility (cache OpenAI for Anthropic for up to 95% cost reduction on cache hits), per-user rate limiting, ~10K req/sec capacity per node. As of the acquisition, those capabilities are stable but won't extend further.
FastRouter's gateway is actively developed: 7 routing strategies, AI Auto Model Router, MCP credential vaulting, weighted shuffle for canary releases, category-based routing as a first-class strategy. If routing depth and AI-driven model selection are part of your roadmap, you'll outgrow what maintenance-mode Helicone can offer.
Evals & optimization
The widest gap is here
Helicone's experiments and evaluators were genuinely interesting — LLM-as-judge with side-by-side comparison, dataset-driven testing, prompt versioning paired with experimentation. The team had ambitious roadmap signals before the acquisition. Those signals are now frozen.
FastRouter's eval layer is the part of the product that's moving fastest:
Smart Evaluations — AI quality scoring on live production calls.
Automatic Evaluations — background sampler that benchmarks competing models on real traffic.
GEPA — Generative Evolutionary Prompt Architecture iterates across prompt and model combinations toward Pareto-optimal cost/quality.
Video evaluations — compare model output on video inputs, a content type no other gateway in the category currently supports.
If evals were one of the reasons you chose Helicone, the gap will widen meaningfully over the next 6–12 months across the whole category.
Self-host option
Helicone OSS is a real option — with real costs
Helicone is Apache 2.0 on GitHub. There's a Docker Compose for development, a Helm chart for Kubernetes, and the gateway repo is also open source. Self-hosting Helicone keeps the product running indefinitely on your own infrastructure — patches still come, models still get added, and you're not exposed to the cloud product's roadmap freeze.
The trade-off is the same as any self-hosted observability platform. You're operating Postgres, Clickhouse, Redis, container orchestration, log retention, the upgrade path. None of that is unique to Helicone — it's the carrying cost of any stateful platform — but worth pricing honestly. For most teams that originally chose Helicone Cloud to avoid running infrastructure, going OSS is a solution that costs more than it saves.
If self-hosting is what you want anyway (compliance, sovereignty, code visibility), Helicone OSS is a reasonable target. LiteLLM is the more common self-hosted choice for the gateway role; pair with Langfuse for evals.
Migration plan
Three paths off Helicone Cloud
None of these need to be done all at once. Pick the one that matches your constraints, then sequence the work over a quarter.
1. Move to a managed alternative — FastRouter
Lowest engineering lift. Both products are OpenAI-compatible: change the base URL, swap the API key, port your custom property names. Sessions, traces, prompts, and per-user cost tracking all map cleanly. FastRouter offers hands-on migration support for production cutovers and can run in passive audit mode against a slice of your traffic for 7 days first, so you have measured numbers before the switch.
2. Self-host Helicone OSS
Highest fidelity to what you have today. Apache 2.0 repo, Docker Compose for dev, Helm chart for production. You inherit the operational burden — Postgres, Clickhouse, Redis, scaling, upgrades — and you're frozen at the OSS feature set, but the product keeps running on your terms. Best fit if you have a strong DevOps function and Helicone-specific workflows you don't want to retrain.
3. Instrument with OpenTelemetry, stay backend-agnostic
Strategically the most defensive option. Wrap your LLM calls with OpenTelemetry GenAI semantic conventions (or OpenLLMetry / OpenInference), then send to whatever backend you want — Langfuse, FastRouter, your own ClickHouse, multiple at once. You decouple from any single vendor's roadmap. Highest engineering lift up front; you don't have to do this migration again next time someone gets acquired.
You don't have to pick one. A common approach is option 3 (OpenTelemetry instrumentation) + option 1 (FastRouter as the primary backend with the routing/eval layer on top). The OTel layer keeps you portable; FastRouter gives you the active product surface.
Common questions
Not today. Helicone Cloud still runs, the OSS code is still there, and Mintlify is committed to security patches and new model support. The reason to plan a migration now is the trajectory — every quarter you stay on a frozen product, the gap to actively-developed alternatives widens.
Mintlify hasn't announced an end-of-life date, and the public commitment has been to continued operation. We'd plan around a 12–24 month horizon for "still works fine" and a longer horizon for "still competitive." If you have time-sensitive workloads, give yourself runway before the second half of that range.
For most stacks, low effort. Both expose OpenAI-compatible endpoints; the bulk of work is endpoint and key changes. Custom property names port directly. Session and trace data models map well. We help with cutover for production workloads. The 7-day passive audit means you can measure the destination before fully committing.
Almost everything Helicone Cloud ships has parity in FastRouter — sessions, custom properties, per-user cost tracking, prompt management, multi-provider routing, caching. The exceptions are usually edge cases. If you depend on something specific (a particular dashboard surface, a custom integration), tell us and we'll be specific about whether it ports or whether you need a workaround.
If your reason to be on Helicone Cloud was avoiding infrastructure, no — self-hosting trades the acquisition risk for a different kind of operational tax. If you'd be comfortable running Postgres + Clickhouse + Redis anyway and you specifically want to stay on Helicone's data model, yes — Apache 2.0 is a real safety net.
Langfuse is the most common Helicone migration target for teams whose primary need is observability and evals (not gateway routing). Open source, OpenTelemetry-friendly, healthy roadmap. If you want a gateway alongside the observability, Langfuse pairs well with FastRouter — observability in Langfuse, routing/governance/evals in FastRouter.
Mintlify hasn't announced any data retention changes for the cloud product. Data export is available through the existing API, so we'd recommend exporting your historical traces and sessions on whatever cadence makes sense before doing any migration — independent of which destination you pick.