Anthropic Cut Off OpenClaw from Claude. How to Switch to Free Models

6 min read

On April 4, 2026, Anthropic cut off access to Claude models for all third-party applications. OpenClaw, Cursor, OpenCode – everything that used a Claude Pro or Max subscription via OAuth stopped working overnight.

In this article: what exactly happened, which alternatives are available right now (including free ones), and a step-by-step guide to switch your AI server in ten minutes.

What Happened with Anthropic#

Here’s how it used to work: you paid for a Claude subscription ($20 – $200/month). OpenClaw used the same models via OAuth. The subscription covered everything – both the chat and the server.

Now the subscription works only in Anthropic’s own products. Claude.ai, Claude Code, Claude Desktop. Period. Third-party applications have been moved to “Extra Usage” – a separate per-request billing system.

Here’s what you see in chat when the bot goes silent:

WARNING

“LLM request rejected: Third-party apps now draw from your extra usage, not your plan limits.”

Why did they do this? Arbitrage. A $200 subscription was covering $1,000–$5,000 worth of compute. Anthropic was subsidizing AI servers thinking everyone was just chatting. OpenClaw’s agent loops consume 5–25x more tokens than a regular conversation. The math didn’t work.

HackerNews was split down the middle. What caught my attention was the timing.

Qwen 3.6 Plus Launched Two Days Before the Ban#

On April 2, Alibaba released Qwen 3.6 Plus. A free model with a million-token context window. On April 4, Anthropic shut down OAuth. Coincidence? Maybe. But the market reacted instantly.

I’m looking at OpenRouter stats – the largest AI model routing platform – and here’s what I see:

Programming: Qwen took 84%.

Qwen dominates programming on OpenRouter — 83.7% of all requests

Not a typo. 83.7% of all programming requests go through Qwen. Claude Opus and Sonnet are somewhere at the bottom. This isn’t an opinion poll. Thousands of developers voted with their time and money.

Technology – 67%. Analytics – 55%. Russian language – also first place.

Qwen in the Technology category — 67.3% market share

Across all OpenRouter categories – from code to Russian-language text processing – Qwen is in first place. Not by a small margin. By a landslide.

Qwen leads the Russian-language segment on OpenRouter

The key takeaway for me – people aren’t “trying” Qwen. They’ve moved. 84% is a migration, not a test.

What Qwen 3.6 Plus Can Do#

Qwen 3.6 PlusClaude Opus 4.6
Price$0$15/$75 per 1M tokens
Context1,000,000 tokens200,000
Speed150–180 tok/s50–70 tok/s
MCPMark (tool use)Above ClaudeBaseline

One million tokens. Free. Three times faster. And it beats Claude on tool usage in the MCPMark benchmark.

OpenClaw — #1 app on OpenRouter, 20.5T tokens processed

OpenClaw has processed 20.5 trillion tokens and holds the #1 spot on OpenRouter by daily rate. 353 models in use. The shift to Qwen has already happened.

Compensation Credits – Check Your Account#

Anthropic automatically credited compensation to affected accounts. The amount depends on your plan:

PlanCredit
Pro ($20/month)$20
Max ($100–200/month)$200

Credits are applied automatically – no action needed to claim them. You can use them for Extra Usage across Claude.ai, Claude Code, Claude Desktop, or third-party apps.

WARNING

Credits expire April 17, 2026. Check your balance at claude.ai/settings/usage and use them before they’re gone.

Can you cancel your Claude subscription? Yes. If you’re switching to free models, the subscription is no longer needed. Credits will stay in your account regardless.

Recovery in 10 Minutes#

You already have Claude Code – we installed it when setting up your server. It still works through your subscription. Let’s use it to fix OpenClaw.

Step 1. Open Terminal#

Mac: Cmd + Space → “Terminal” → Enter.

Step 2. Launch Claude Code#

Terminal window
claude --dangerously-skip-permissions

Step 3. Paste This Command#

Find my OpenClaw server and do the following:
1. Add OpenRouter provider:
- Get a free API key at openrouter.ai
- Add it to openclaw.json as "openrouter" provider
2. Add NVIDIA NIM provider:
- Get a free API key at build.nvidia.com
- Add it to openclaw.json as "nvidia" provider
3. Switch the primary model:
- Primary: openrouter/qwen/qwen3.6-plus:free
- Fallback: nvidia/nemotron-3-super-120b-a12b:free
4. Restart the gateway.
5. Send a test message and verify it works.

Claude Code will find the config, make changes, and restart the server. You just hit Enter a few times.

Step 4. Verify#

Message your bot on Telegram. If it replies – you’re good.

NOTE

Server on a remote machine? Ask Claude Code to connect: “Connect to my server via SSH. Check OpenClaw status, current model, and errors in logs.”

Free Tier Limits#

There’s a catch. OpenRouter’s free tier is 20 requests per minute, 200 per day. Sometimes the bot may say the limit is exceeded.

TIP

Hit the limit? Wait a minute and try again. Sent a voice message? Forward it again – the bot will process it at the next available slot.

If limits are a constant problem – top up OpenRouter with $5–10. Limits are completely removed, and Qwen costs pennies. Ten dollars lasts months.

Right now Qwen is seeing explosive traffic growth – everyone is migrating. Alibaba is scaling servers. The situation will stabilize.

Three Configurations#

Free ($0/month)#

Orchestrator: Qwen 3.6 Plus (OpenRouter)
Fallback: Nemotron 120B (NVIDIA)

80% of OpenClaw users run this right now. Sufficient for daily tasks.

Optimal (~$30/month)#

Orchestrator: Claude Opus 4.6 (Anthropic API)
Sub-agents: Qwen 3.6 Plus + Nemotron (free)
TIP

Putting $30 into the Anthropic API is the best investment. Opus remains the strongest model for complex reasoning. Routine tasks run on free sub-agents. Opus quality at the price of coffee.

Premium ($200+/month)#

Orchestrator: Claude Opus 4.6
Sub-agents: Claude Sonnet 4.6 + Qwen 3.6 Plus

For tasks where every detail matters.

How Orchestration Works – Briefly#

Your AI server is a conductor. It receives a task, breaks it into parts, and delegates to performers.

Orchestrator – the lead agent. Thinks, plans, makes decisions. The CEO. Needs the smartest model.

Sub-agents – the performers. Search, write, analyze, compare. Free models work fine here.

Example. You say: “Prepare a competitor report.”

  1. Orchestrator (Opus) breaks it into sub-tasks
  2. Sub-agent (Qwen, $0) searches for competitors
  3. Sub-agent (Qwen, $0) gathers prices and reviews
  4. Sub-agent (Nemotron, $0) analyzes websites
  5. Orchestrator synthesizes the final report

Opus quality. Cost – pennies. That’s hybrid orchestration.

You Have Claude Code. Use It.#

You pay for a Claude subscription. It includes Claude Code – a tool that runs in the terminal. It’s still available.

Claude Code can connect to servers, edit configs, run scripts, diagnose problems. If something breaks:

Terminal window
claude
Connect to my server, check OpenClaw status,
show the latest errors in logs and suggest a fix.

It’s your Swiss army knife. It works.

Where Things Are Headed#

Here’s how I see it: the Anthropic block is the best thing that could have happened.

Dependency on a single provider is a vulnerability. Anthropic just proved it. Today they change OAuth policy. Tomorrow they raise prices. The day after – something else.

The AI model market is no longer a monopoly. Qwen, Nemotron, DeepSeek, Gemma, Llama – dozens of models. Many free. Many excellent. And getting better every month.

We’re heading toward a future where you own your models. Running them locally, on your own hardware. No dependency on clouds or subscriptions. OpenClaw already supports this via Ollama. And if you want to start from scratch – here’s how to install OpenClaw.

But for now – free cloud models cover 90% of needs. And cost less than a $200 Claude Max subscription.

Switch over. Use your credits before April 17. Keep working.


FAQ#

Do free models actually work?

84% of developers on OpenRouter chose Qwen for code. 67% for technology tasks. People don’t mass-adopt something that doesn’t work. Qwen handles most tasks as well as Claude. Opus is better at non-trivial reasoning – but that’s what the $30 hybrid configuration is for.

Is my data safe?

OpenRouter and NVIDIA process requests on their servers. Security level is standard for any cloud API. For confidential data, I recommend Claude Opus via API or a local model through Ollama.

I don’t understand terminals.

Reach out – I’ll connect remotely and set everything up in 10 minutes.

Should I use my credits?

Absolutely. They expire April 17 – even if you’ve switched to free models, use them as a fallback while they last.


Need help with setup? Contact me – I’ll connect and configure everything.

Поделиться Share