Reflex
Take a robot policy off the training cluster, onto a robot.
Reflex is the deployment layer for vision-language-action models. Take any pi0, pi0.5, SmolVLA, or GR00T checkpoint and run it on a Jetson Orin or a desktop GPU in one command, verified at machine-precision parity with PyTorch. Or just tell the chat agent to do it for you.
How it works
From a HuggingFace model to a robot, in four steps.
reflex go --model <hf_id> runs all four. Each step writes a verifiable artifact and refuses to ship if its check fails — bad exports never reach a robot.
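The gate pattern is simple enough to sketch. A minimal illustration of the idea in Python, with hypothetical helper names (`run`, `check`, and `run_gated` are stand-ins, not Reflex's internals):

```python
from typing import Callable

# Each step is (name, run, check): run() writes an artifact,
# check() verifies it before the pipeline is allowed to continue.
Step = tuple[str, Callable[[], str], Callable[[str], bool]]

def run_gated(steps: list[Step]) -> None:
    for name, run, check in steps:
        artifact = run()          # e.g. a pulled checkpoint or an ONNX export
        if not check(artifact):   # e.g. a checksum or parity verification
            raise SystemExit(f"{name}: check failed, refusing to ship {artifact}")
```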
What it looks like
Talk to your robot fleet in plain English.
reflex chat wraps the entire CLI surface in a natural-language agent. 100 calls/day free, no signup, no API key.
$ reflex chat
connected: chat.fastcrest.com (model=gpt-5-mini)
you › deploy SmolVLA to my desktop GPU and start serving
→ list_targets({})
→ pull_model({"model": "smolvla-base"}) ↓ 900 MB from HuggingFace
→ export_model({"model": "smolvla-base", "target": "desktop"})
→ serve_model({"export_dir": "./reflex_export"})
SmolVLA is exported and serving at http://localhost:8000.
Latency: ~12 ms/call on your GPU. Try:
curl -X POST http://localhost:8000/act ...
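If you'd rather call the endpoint from Python than curl, a hedged equivalent looks like this. The payload shape (the `observation` and `instruction` keys) is an assumption for illustration, not the documented /act schema:

```python
import requests

# POST an observation to the serving endpoint and read back an action.
# The JSON fields below are illustrative guesses, not the real schema.
resp = requests.post(
    "http://localhost:8000/act",
    json={
        "observation": {"image": "<base64-encoded camera frame>"},
        "instruction": "pick up the red block",
    },
    timeout=5,
)
resp.raise_for_status()
print(resp.json())  # the policy's predicted action (or action chunk)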
Composable wedges
Every flag is opt-in. Compose only what you need.
14 runtime wedges layer on top of reflex serve — safety, observability, optimization, transport. Every response surfaces telemetry from the wedges you've enabled.
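As a rough mental model, a wedge behaves like opt-in middleware on the serving app. A minimal sketch in plain FastAPI, assuming a latency-telemetry wedge (the wedge name and response header are invented for illustration, not Reflex's interface):

```python
import time
from fastapi import FastAPI, Request

app = FastAPI()

@app.middleware("http")
async def latency_wedge(request: Request, call_next):
    # Time the request and surface the measurement on the response,
    # the way an enabled wedge surfaces its telemetry.
    start = time.perf_counter()
    response = await call_next(request)
    response.headers["x-wedge-latency-ms"] = f"{(time.perf_counter() - start) * 1e3:.2f}"
    return response
```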
Multi-embodiment
pi0, pi0.5, SmolVLA, GR00T: all four major open VLA families. ONNX export verified at cosine similarity +1.000000 against PyTorch.
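For intuition, this is the shape of such a parity check: run the same input through the PyTorch module and its ONNX export, then compare outputs by cosine similarity. A minimal sketch with generic torch/onnxruntime APIs, assuming a single input and output tensor (this is not Reflex's verification code):

```python
import numpy as np
import onnxruntime as ort
import torch

def cosine_parity(model: torch.nn.Module, onnx_path: str, example: torch.Tensor) -> float:
    # Reference output from the original PyTorch module.
    model.eval()
    with torch.no_grad():
        ref = model(example).flatten().numpy()
    # Same input through the ONNX export.
    sess = ort.InferenceSession(onnx_path, providers=["CPUExecutionProvider"])
    name = sess.get_inputs()[0].name
    out = sess.run(None, {name: example.numpy()})[0].flatten()
    # Cosine similarity; machine-precision parity should print +1.000000.
    return float(np.dot(ref, out) / (np.linalg.norm(ref) * np.linalg.norm(out)))
```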
Edge-first
Jetson Orin Nano (8 GB) → AGX Orin → Thor → desktop NVIDIA GPUs. A hardware probe picks the right model variant; export targets the right precision.
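A minimal sketch of what such a probe can look like, using standard torch CUDA queries (the 16 GB threshold and the `pick_precision` name are assumptions, not Reflex's actual heuristic):

```python
import torch

def pick_precision() -> str:
    # CPU-only hosts fall back to the full-precision CPU path.
    if not torch.cuda.is_available():
        return "fp32"
    props = torch.cuda.get_device_properties(0)
    # Small-memory Jetson-class boards get fp16; larger GPUs can afford fp32.
    return "fp16" if props.total_memory < 16 * 1024**3 else "fp32"
```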
SnapFlow distillation
First open-source SnapFlow reproduction. Distill any pi0 / pi0.5 to a 1-step student that beats its 10-step teacher (64% vs 56% on libero_object).
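The general recipe behind this kind of distillation fits in a few lines: roll the 10-step teacher out from noise to an action, then train a student to jump to that endpoint in one step. This is only the generic one-step flow-distillation idea, not SnapFlow's actual objective; `teacher_velocity` and `student` are stand-in callables:

```python
import torch

def teacher_rollout(teacher_velocity, noise, steps: int = 10):
    # Euler-integrate the teacher's learned velocity field from noise to action.
    x, dt = noise, 1.0 / steps
    for i in range(steps):
        t = torch.full((x.shape[0],), i * dt, device=x.device)
        x = x + dt * teacher_velocity(x, t)
    return x

def distill_step(student, teacher_velocity, noise, opt):
    with torch.no_grad():
        target = teacher_rollout(teacher_velocity, noise)  # 10-step teacher action
    pred = student(noise)                                  # 1-step student action
    loss = torch.nn.functional.mse_loss(pred, target)
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```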
Production runtime
CUDA graphs, cost-weighted batching, A2C2 correction, record-replay traces, real-time chunking — composable wedges on a single FastAPI server.
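Of those, CUDA graphs are the most mechanical to picture: capture one fixed-shape forward pass, then replay it with near-zero kernel-launch overhead. A minimal sketch using the standard torch.cuda.CUDAGraph API, assuming static input shapes (not Reflex's runtime code):

```python
import torch

def capture_policy(policy, example: torch.Tensor):
    static_in = example.clone()
    # Warm up on a side stream before capture, as the CUDA graphs docs advise.
    s = torch.cuda.Stream()
    s.wait_stream(torch.cuda.current_stream())
    with torch.cuda.stream(s):
        for _ in range(3):
            policy(static_in)
    torch.cuda.current_stream().wait_stream(s)

    # Capture one forward pass into a replayable graph.
    g = torch.cuda.CUDAGraph()
    with torch.cuda.graph(g):
        static_out = policy(static_in)

    def run(x: torch.Tensor) -> torch.Tensor:
        static_in.copy_(x)  # replay reuses captured buffers; only data changes
        g.replay()
        return static_out

    return run
```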
Numbers
Verifiable claims, not vibes.
All three headline numbers are reproducible with one command on Modal. See the parity ledger and changelog for full provenance.
vs other tools
Where Reflex fits.
Reflex is deliberately narrow. Here's the honest read.
| | Reflex | Triton | HF Endpoints | Raw ONNX |
|---|---|---|---|---|
| Edge GPU deployment | design center | cloud-first | cloud-only | DIY |
| VLA-specific export (pi0 / pi0.5 / GR00T) | built-in, validated | no | no | manual, error-prone |
| Verified machine-precision parity | automatic | DIY | DIY | DIY |
| Decomposed pi0.5 (9× speedup) | one flag | no | no | ~weeks of work |
| Setup time | 30 seconds | days | minutes | 1–3 weeks |
| Multi-tenant cloud serving at scale | not the design | battle-tested | managed | DIY |
Honest details: vs other tools →
Common questions
FAQ
Why not just use Triton?
Run reflex export and drop the result into Triton if you want both. Full comparison →
Does it work without a GPU?
Yes: pip install 'reflex-vla[serve,onnx]'. The CPU path runs all four supported VLAs at machine-precision parity with PyTorch; SmolVLA is the only one fast enough for real-time control on CPU. And reflex chat needs no GPU at all once the package is installed.
What does BSL 1.1 mean for me?
Does it work on RTX 5090 / Blackwell?
Not yet: reflex go segfaults at startup on Blackwell. Workaround: reflex chat (no GPU needed), reflex doctor, and reflex models list all work; /act needs a non-Blackwell GPU for now (Modal A10G/A100 or an RTX 4090). We're tracking the fix upstream in ONNX Runtime (ORT).