Question 1

What is A3M Router?

Accepted Answer

A3M Router is an open-source LLM routing library and OpenAI-compatible proxy server. It analyzes each query and routes it to the cheapest capable LLM provider using multi-signal classification — no ML model, no GPU, no training. It achieves 99.5% ±1 tier routing accuracy across 40 providers including Groq, Cerebras, Ollama, DeepSeek, Mistral, OpenAI, and Anthropic.

Question 2

How accurate is A3M Router compared to RouteLLM?

Accepted Answer

A3M Router achieves 99.5% ±1 tier accuracy and 64.5% exact tier match on a 200-query benchmark using 4 cost tiers (free/cheap/mid/premium). This uses the same methodology as RouteLLM but with a different test set. A3M Router requires no GPU and no ML model — it uses multi-signal keyword classification including domain detection, complexity scoring, and action verb analysis.

Question 3

How do I use A3M Router with OpenAI SDK?

Accepted Answer

Install with npm install adaptive-memory-multi-model-router, run npx a3m-router serve, then point any OpenAI SDK at http://localhost:8787/v1 with api_key='not-needed'. Works with Python, Node.js, LangChain, LlamaIndex, and any OpenAI-compatible client. Zero code changes required.

Question 4

What LLM providers does A3M Router support?

Accepted Answer

A3M Router supports 40 LLM providers: Free tier includes Groq, Cerebras, Ollama, LM Studio. Cheap tier includes DeepSeek, MiniMax, Mistral, Qwen. Mid tier includes Google Gemini, Cohere, Together AI. Premium tier includes OpenAI GPT-4o/o3, Anthropic Claude, Google Gemini Ultra.

Question 5

Does A3M Router require a GPU?

Accepted Answer

No. A3M Router uses multi-signal keyword classification for routing — no ML model weights, no GPU, no training required. The entire package is 19.5 KB gzipped, approximately 500x smaller than ML-based routers like RouteLLM with BERT (~1.5 GB).

Question 6

How much can I save using A3M Router?

Accepted Answer

A3M Router achieves 61.6% cost savings compared to routing all queries to premium models. Simple queries go to free providers (Groq, Cerebras), medium queries to cheap providers (DeepSeek, Mistral), and only complex queries to premium (GPT-4o, Claude). Real savings depend on your query distribution.

Question 7

Is there a Python SDK for A3M Router?

Accepted Answer

Yes. Install with pip install a3m-router. Use from a3m import A3MRouter for async or from a3m import A3MRouterSync for synchronous usage. The Python SDK supports chat, route, batch routing, streaming, and cost reporting.

Question 8

What is the best lightweight LLM router?

Accepted Answer

A3M Router is the lightest LLM router at 19.5 KB gzipped with 99.5% ±1 tier accuracy. It requires no GPU, no ML model, and no external dependencies beyond nanoid. It includes a built-in OpenAI-compatible proxy, semantic cache, guardrails, and cost analytics — features that other routers lack.

A3M Router

Intelligent LLM Routing

Cost Optimization

Smart Fallback & Retry

Real-time Analytics

Security Guardrails

Semantic Cache

LLM Provider Pricing Tiers

Free Tier

Budget Tier

Mid Tier

Premium Tier

Quick Start: LLM Routing in 30 Seconds

Frequently Asked Questions

What is A3M Router?

How much can I save with A3M Router?

Is A3M Router free?

How do I get started with A3M Router?

What LLM providers does A3M Router support?

How does A3M Router compare to LiteLLM?