
feat(provider): add adaptive thinking and 1M context support for Claude Opus 4.6 #12342

Open

okhsunrog wants to merge 3 commits into anomalyco:dev from okhsunrog:feat/opus-4-6-adaptive-thinking

Conversation


@okhsunrog okhsunrog commented Feb 5, 2026

Adaptive thinking

  • Add adaptive-thinking-2026-01-28 beta header for Anthropic provider
  • Detect Opus 4.6 models and use adaptive thinking with effort parameter
  • Support all effort levels: low, medium, high, max
  • Older models continue to use manual thinking with budgetTokens

Opus 4.6 uses adaptive thinking where the model decides how much to think based on task complexity, guided by the effort parameter. This is more efficient than fixed budgetTokens as simple tasks use minimal thinking.
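A minimal sketch of how the provider might branch between the two modes (the helper name and the exact option shape are assumptions, not the PR's actual code):

```ts
type Effort = "low" | "medium" | "high" | "max"

// Hypothetical helper: Opus 4.6 gets adaptive thinking guided by `effort`;
// older models keep manual thinking with a fixed token budget.
function thinkingOptions(modelID: string, effort: Effort, budgetTokens: number) {
  if (modelID.includes("claude-opus-4-6")) {
    return { thinking: { type: "adaptive" as const }, effort }
  }
  return { thinking: { type: "enabled" as const, budgetTokens } }
}
```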

1M context window

  • Add context-1m-2025-08-07 beta header for Anthropic provider (API key users)
  • Fix compaction logic: when model.limit.input is set, compare input tokens only against that limit instead of the combined total (input + cache + output)

Without the beta header, Opus 4.6 enforces a 200k input token limit (the context window is still 1M — output/thinking tokens aren't affected). The opencode (OAuth) provider doesn't support the 1M beta header, so it relies on model.limit.input in models.dev to trigger compaction at the right time: anomalyco/models.dev#819
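As a rough sketch of that gating (the helper and auth handling are hypothetical; only the beta header strings come from this PR):

```ts
// Hypothetical helper: which `anthropic-beta` values to send for a request.
// OAuth (subscription) accounts reject the long-context beta, so only
// API key users get the 1M input window.
function betaHeaders(modelID: string, auth: "api-key" | "oauth"): string[] {
  const betas: string[] = []
  if (modelID.includes("claude-opus-4-6")) {
    betas.push("adaptive-thinking-2026-01-28")
    if (auth === "api-key") betas.push("context-1m-2025-08-07")
  }
  return betas
}
// e.g. headers: { "anthropic-beta": betaHeaders(modelID, auth).join(",") }
```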

AI SDK upgrade

  • Upgrade ai v5 → v6, @ai-sdk/anthropic v2 → v3, and all @ai-sdk/* packages
  • Migrate LanguageModelV2 → LanguageModelV3, make toModelMessages async, adopt the renamed tool factories
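An illustrative before/after for one call site (identifiers taken from the list above; the surrounding code is an assumption, not verified against the SDK):

```ts
// ai v5: conversion was synchronous
// const messages = toModelMessages(parts)

// ai v6: conversion is async and must be awaited
const messages = await toModelMessages(parts)
```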

Ref: https://platform.claude.com/docs/en/build-with-claude/adaptive-thinking
Ref: https://platform.claude.com/docs/en/build-with-claude/context-windows#1-m-token-context-window

Tested locally; works with Opus 4.6.

Closes #12323
Closes #12338
Closes #12438


github-actions bot commented Feb 5, 2026

Thanks for your contribution!

This PR doesn't have a linked issue. All PRs must reference an existing issue.

Please:

  1. Open an issue describing the bug/feature (if one doesn't exist)
  2. Add Fixes #<number> or Closes #<number> to this PR description

See CONTRIBUTING.md for details.


github-actions bot commented Feb 5, 2026

The following comment was made by an LLM; it may be inaccurate:

No duplicate PRs found

@okhsunrog okhsunrog force-pushed the feat/opus-4-6-adaptive-thinking branch from 7ffe6a8 to a3ebbc5 on February 5, 2026 20:58
@okhsunrog okhsunrog marked this pull request as draft February 5, 2026 21:00

okhsunrog commented Feb 5, 2026

Adding adaptive thinking requires upgrading @ai-sdk/anthropic, which in turn pulls in upgrades to a bunch of other packages. Marking this as a draft for now.
I've started working on upgrading the deps; I hope it can land as part of this PR.

@okhsunrog okhsunrog force-pushed the feat/opus-4-6-adaptive-thinking branch from a3ebbc5 to 5316c7e on February 5, 2026 21:38
@okhsunrog okhsunrog marked this pull request as ready for review February 5, 2026 21:45
@okhsunrog
Author

Added a commit with the dependency upgrade. Tested locally; switching effort levels with Opus 4.6 now works as expected.


okhsunrog commented Feb 5, 2026

Can anyone test whether the 1M context works for you? It works just fine for me with Opus 4.6; I managed to get up to 340k tokens.

@ItsWendell
Contributor

@okhsunrog what's the easiest way to test this?

@okhsunrog
Author

@ItsWendell make sure you have the latest bun installed, then run these commands:

```sh
git clone -b feat/opus-4-6-adaptive-thinking https://github.com/okhsunrog/opencode.git
cd opencode
bun install
bun dev
```

@ItsWendell
Contributor

Tested and works:
[screenshots]

@ItsWendell
Contributor

A couple of issues that I ran into: at e.g. 400k tokens I again get an error:

[screenshots]

@okhsunrog
Author

[screenshots]

well, shit...

@okhsunrog
Author

@ItsWendell yes, I confirm the issue is there. But if we passed the context-1m-2025-08-07 header, we'd get an error from Anthropic with OAuth saying the long context beta is not available for this subscription. Are you using it with a Claude subscription via OAuth or via the Anthropic API?


okhsunrog commented Feb 6, 2026

Confirmed, the 200k limit is enforced by the Anthropic API. The status bar shows a combined total (input + output + cache tokens), but the API only counts input tokens against the 200k limit. That's why I didn't hit it earlier — actual input tokens stayed under 200k even though the status bar showed 385k.
The context-1m-2025-08-07 beta header is needed for Opus 4.6 to go past 200k, same as Sonnet 4/4.5. The compaction check in opencode uses this combined count:

```ts
// compaction.ts:35
const count = input.tokens.input + input.tokens.cache.read + input.tokens.output
```

So opencode thinks there's plenty of headroom (39% of 1M) while the API is already at the 200k input limit. I'll update my PR to add the context-1m-2025-08-07 beta header for non-OAuth.
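A sketch of the corrected check, following the shape of the snippet above (the `model.limit.input` access and the compaction threshold are illustrative, not the PR's exact code):

```ts
// When the model declares a separate input-token limit, gate on
// input-side tokens only; otherwise keep the combined total.
const combined = input.tokens.input + input.tokens.cache.read + input.tokens.output
const inputSide = input.tokens.input + input.tokens.cache.read
const used = model.limit.input ? inputSide : combined
const limit = model.limit.input ?? model.limit.context
const shouldCompact = used > limit * 0.9 // threshold illustrative
```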

P.S. Worth noting that the 200k limit on Opus 4.6 without the beta header is different from the 200k context window on models like Haiku 4.5. For Haiku 4.5, 200k is the total context window: input tokens, output tokens, everything has to fit within that 200k budget. For Opus 4.6, the context window is actually 1M; the 200k is only a gate on input tokens. Output and thinking tokens live in the larger 1M space and don't count against the 200k.

So even without the context-1m-2025-08-07 header, Opus 4.6 gives you significantly more effective room. For example, if you're using extended thinking with large thinking budgets, those tokens aren't eating into your 200k input limit the way they would eat into Haiku 4.5's 200k context window. And thinking tokens from previous turns are automatically stripped by the API, so they don't accumulate as input at all.
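A worked illustration of the difference (the numbers are hypothetical):

```ts
const inputTokens = 150_000, outputTokens = 60_000 // one hypothetical turn

// Haiku 4.5: 200k is the TOTAL context window
console.log(inputTokens + outputTokens > 200_000) // true → overflow

// Opus 4.6 without the beta header: 200k gates INPUT only;
// output/thinking tokens count against the 1M window instead
console.log(inputTokens > 200_000) // false → request still accepted
```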

- Add adaptive-thinking-2026-01-28 beta header for Anthropic provider
- Detect Opus 4.6 models and use adaptive thinking with effort parameter
- Support all effort levels: low, medium, high, max
- Older models continue to use manual thinking with budgetTokens

Opus 4.6 uses adaptive thinking where the model decides how much to think
based on task complexity, guided by the effort parameter. This is more
efficient than fixed budgetTokens as simple tasks use minimal thinking.

Ref: https://docs.anthropic.com/en/docs/build-with-claude/adaptive-thinking
- Upgrade ai package from 5.0.124 to 6.0.72
- Upgrade @ai-sdk/anthropic from 2.0.58 to 3.0.37 (adds adaptive thinking support)
- Upgrade all @ai-sdk/* packages to v3+/v4+ for compatibility
- Update LanguageModelV2 to LanguageModelV3 across codebase
- Make toModelMessages async per AI SDK 6 requirements
- Update toModelOutput signature to use destructured parameter
- Fix tool factory renames and remove deprecated name property
- Update tests for new LanguageModelUsage type structure

Enables Claude Opus 4.6 to use adaptive thinking with effort levels
(low/medium/high/max) instead of fixed budget tokens.
@okhsunrog
Author

@rekram1-node @thdxr could anyone review this, please?

@okhsunrog okhsunrog force-pushed the feat/opus-4-6-adaptive-thinking branch from cd15179 to b49a992 on February 6, 2026 08:39
@okhsunrog okhsunrog changed the title from "feat(provider): add adaptive thinking support for Claude Opus 4.6" to "feat(provider): add adaptive thinking and 1M context support for Claude Opus 4.6" Feb 6, 2026
@BouquetAntoine

Works well for me, except the displayed usage is wrong. (That's also why you got the 200k limit warning, i.e. the Claude Code limit, at "400k" context.)

Bug: Token Counter Displays ~2× Actual Usage with @ai-sdk/anthropic v3.x

Symptom

When using @ai-sdk/anthropic v3.x, the token counter displays approximately double the actual token usage (e.g., 49,000 shown vs 24,450 in provider logs).

Root Cause

The SDK v3 changed the structure of LanguageModelUsage.inputTokens:

| Version | inputTokens value |
| --- | --- |
| v2.x | input_tokens (excludes cache) |
| v3.x | total = noCache + cacheRead + cacheWrite |

In session/index.ts, the getUsage() function assumes that for Anthropic, inputTokens excludes cached tokens:

```ts
const excludesCachedTokens = !!(input.metadata?.["anthropic"] || input.metadata?.["bedrock"])

const adjustedInputTokens = excludesCachedTokens
  ? (input.usage.inputTokens ?? 0) // ❌ This is now the TOTAL in v3!
  : ...
```

Then in the UI (header.tsx, sidebar.tsx), tokens are displayed as:

```ts
tokens.input + tokens.output + tokens.reasoning + tokens.cache.read + tokens.cache.write
```

Result: Cache tokens are counted twice — once inside tokens.input (which is now the total) and again as tokens.cache.read + tokens.cache.write.
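A numeric illustration of the double count (the ~49,000 vs 24,450 figures come from the symptom above; the cache split is an assumption):

```ts
// Suppose the provider reports 24,450 total input tokens,
// of which ~24,000 were read from cache (split hypothetical).
const inputTokens = 24_450 // v3: already the TOTAL
const cacheRead = 24_000
const displayed = inputTokens + cacheRead // 48,450 ≈ the ~49,000 shown in the UI
```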

Fix

Use the new inputTokenDetails.noCacheTokens field introduced in SDK v3:

```ts
// SDK v3: inputTokens is now TOTAL, use noCacheTokens for pure input
const noCacheInputTokens = input.usage.inputTokenDetails?.noCacheTokens

const adjustedInputTokens = noCacheInputTokens !== undefined
  ? noCacheInputTokens
  : (input.usage.inputTokens ?? 0) - cacheReadInputTokens - cacheWriteInputTokens
```

Also update cache token extraction to use the new structure:

```ts
const cacheReadInputTokens =
  input.usage.cachedInputTokens ??
  input.usage.inputTokenDetails?.cacheReadTokens ??
  0

const cacheWriteInputTokens =
  input.usage.inputTokenDetails?.cacheWriteTokens ??
  0
```

SDK v3 Type Reference

```ts
type LanguageModelUsage = {
  inputTokens: number | undefined;          // Now the TOTAL
  inputTokenDetails: {
    noCacheTokens: number | undefined;      // Pure input without cache
    cacheReadTokens: number | undefined;    // Tokens read from cache
    cacheWriteTokens: number | undefined;   // Tokens written to cache
  };
  cachedInputTokens?: number | undefined;   // @deprecated — use inputTokenDetails.cacheReadTokens
  // ...
}
```
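A small sanity check of the relationship implied above (field names from the type reference; that the total equals the sum of its details is an assumption about the SDK's behavior):

```ts
function checkUsage(usage: LanguageModelUsage) {
  const d = usage.inputTokenDetails
  const recomputed =
    (d.noCacheTokens ?? 0) + (d.cacheReadTokens ?? 0) + (d.cacheWriteTokens ?? 0)
  console.assert(usage.inputTokens === undefined || usage.inputTokens === recomputed)
}
```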

@BouquetAntoine

Can confirm I managed to use the 1M context:

[screenshots]

@reynard93

How did anyone manage to use this branch? I keep getting: "The long context beta is not yet available for this subscription".

…ounting, use plugin fork for OAuth context cap

loop-uh commented Feb 6, 2026

+1

[screenshot]

```diff
@@ -16,15 +16,14 @@ import { gitlabAuthPlugin as GitlabAuthPlugin } from "@gitlab/opencode-gitlab-au
 export namespace Plugin {
   const log = Log.create({ service: "plugin" })
 
-  const BUILTIN = ["opencode-anthropic-auth@0.0.13"]
+  const BUILTIN = ["github:okhsunrog/opencode-anthropic-auth#feat/oauth-context-cap"]
```

opencode core shouldn't have builtin plugins that pull from forks.

Collaborator

correct

Author

It was only intended for testing until the changes to opencode-anthropic-auth are accepted.



Development

Successfully merging this pull request may close these issues.

  • Claude Opus 4.6 context window limits still 200k
  • 1M tokens for Opus 4.6
  • [FEATURE]: Add support of Claude Opus 4.6
