AI Frontier19 sources verified35% positive on Glassdoor

Anthropic Interview Guide

Anthropic has a 6-question recurring bank, a code review HM round that no other company uses, and values screening that fails more candidates than the technical rounds. The company that builds Claude redesigns its own interview questions when Claude beats them. If you can articulate a genuine perspective on AI safety, you have a structural advantage.

~15% easy, 65% medium, 20% hard|6 recurring problems|~20 day timeline

What makes Anthropic different

Anthropic was founded as a safety-conscious fork of OpenAI. Dario and Daniela Amodei left in 2021 over philosophical differences about building safety into models from the beginning. That origin story permeates every hiring decision: the most common reason candidates fail is the culture fit round, not the technical rounds. If you cannot articulate a thoughtful perspective on AI risks, you will not advance regardless of technical ability.

The coding interview uses a tiny, known bank of 6 problems that recycle across candidates: Web Crawler, LRU Cache, Stack Trace, Distributed Mode/Median, Profiler Trace, and Tokenization. These are progressive multi-level problems on CodeSignal where you pass each level's unit tests to unlock the next. With only 6 questions, most candidates know them beforehand — but interviewers drill into concurrency, distributed extensions, and failure modes that pure memorization cannot cover.

Two things make Anthropic's interview structurally unique. First, the hiring manager round is code review, not coding — you analyze someone else's codebase to find bugs and concurrency risks. No other top tech company has this as a dedicated round. Second, Anthropic's engineering team publishes openly about designing “AI-resistant” evaluations and iteratively rebuilds questions when Claude models outperform candidates. When Opus 4 beat their performance engineering take-home, they created v2. When Opus 4.5 matched top human performance, they shifted to Zachtronics-style puzzle problems. The interview format evolves faster here than at any other company.

Everyone carries the title “Member of Technical Staff” — from new hires to co-founders. Internally it's T4/T5/T6+, but LinkedIn cannot differentiate. About 50% of technical staff had no prior ML experience, and about 50% hold PhDs. The company has surged from ~500 employees in late 2023 to ~5,000 in 2026, with pre-IPO equity at a $380B valuation making the comp package among the most equity-leveraged in tech.

The interview loop

6\u20138 total interviews across 4\u20136 stages. Two recruiter calls, coding screen, HM code review, 4-round onsite. Consensus-based hiring with HM final authority.

Recruiter Screens (2x)

30 min each · Phone / Video

First call covers background and motivation. Second call discusses compensation AND distributes culture/values study documents. Read those documents — the values round is where most candidates fail.

Technical Coding Screen

60–90 min · CodeSignalgate

One progressive multi-level problem with escalating complexity. Must pass each level’s unit tests to unlock the next. Six known recurring problems. Google/Stack Overflow allowed; AI tools prohibited.

Hiring Manager Interview

60 min · Code Reviewgate

Code review, not coding. You analyze existing codebases to find bugs, concurrency risks, and architectural issues. Unique in the dossier store — no other company tests this as a dedicated round.

Onsite: System Design

60 min · Whiteboard / Virtual

LLM-infrastructure-native: inference batching, GPU scheduling, token-generation services, distributed search. Not "Design Instagram." Evaluation scales with level: junior = clarity, senior = trade-offs + failure scenarios.

Onsite: Coding

60 min · Your IDE or CodeSignalgate

Real-world progressive problems with escalating parameters. Practical engineering focus — modular, extensible code that adapts to new requirements. Python strongly favored.

Onsite: Experiences & Goals

60 min · Behavioral

Manager-level interviewer probing collaboration, conflict resolution, candor. Values screening is woven throughout. AI ethics and safety questions are not a checkbox — they are a hard gate.

Onsite: Project Presentation

60 min · Presentation

Present a past project showing end-to-end ownership. Interviewers probe organizational impact and tradeoff rationale more than technical depth. Consensus-based hiring with HM final authority.

The code review round — what you actually need to know

The hiring manager round is code review, not code writing. You'll analyze an existing codebase to identify bugs, concurrency risks, and architectural bottlenecks. This is unique among top tech companies — Airbnb's code review round is the closest analog.

How to prepare: Practice reading unfamiliar Python codebases. Look for threading issues (race conditions, deadlocks, GIL implications), error handling gaps, and architectural anti-patterns. The focus is on identifying problems in real code, not solving greenfield puzzles.

Difficulty breakdown

0% easy

100% medium

0% hard

65% medium reflects the progressive multi-level format where problems start approachable and escalate. The 20% hard is driven by concurrency extensions and distributed system follow-ups in the final levels.

Compensation by level

Median total comp. Source: Levels.fyi, April 2026. Zero bonuses at any level. 4-year vesting (25% year 1, 2.08% monthly). Pre-IPO equity at $380B valuation.

Level	Title	Total Comp	Base	RSU/yr	Bonus
T4	Senior MTS	$550K	$318K	$233K/yr	—
T5	Lead MTS	$759K	$329K	$430K/yr	—
T6+	Staff+ MTS	$890K+	~$400K+	~$490K+/yr	—

Stock as % of total comp

42%

57%

T6+

55%

Practice targets from public interview reports

Use these as prep targets, not a ranked or company-confirmed leaderboard. Rows with direct public sources cite them inline.

Web Crawler Multithreadedmedium

Exclusive Time of Functionsmedium

Web Crawlermedium

Find Duplicate File in Systemmedium

What patterns to study

BFS & Graph Traversal

Web crawler, domain-scoped link discovery, threading

Cache Implementation

LRU cache, variable args, bug identification, durability extension

Stack & Trace Processing

Periodic samples to start/end events, recursion handling

Distributed Algorithms

Mode/median across nodes, bandwidth constraints

Concurrency & Threading

ThreadPoolExecutor, async/await, GIL implications, race conditions

Tokenization & Parsing

Code review + implementation, text processing logic

Key difference: Anthropic problems are practical engineering tasks that grow in complexity, not abstract algorithm puzzles. Modular, extensible code that adapts to new requirements is the goal.

8 things nobody tells you

6-question recurring coding bank

Web Crawler, LRU Cache, Stack Trace, Distributed Mode/Median, Profiler Trace, Tokenization. Progressive multi-level on CodeSignal. Smaller than OpenAI’s 8. Most candidates know them — the interview is won on follow-up depth, not recognition.

The HM round is code review, not coding

You analyze existing codebases to find bugs, concurrency risks, and bottlenecks. No other company in the dossier store has this as a dedicated round. Practice reading unfamiliar code, not just writing it.

Values screening is the primary failure point

The recruiter distributes culture documents for mandatory study. Behavioral red flags that directly disqualify: lone wolf mentality, arrogance, purely financial motivation. The most common reason candidates fail is culture fit, not the technical rounds.

They redesign questions when Claude beats them

Anthropic's engineering team publishes about their arms race against their own AI. When Claude Opus 4 beat their take-home, they created v2. When Opus 4.5 matched top humans, they shifted to Zachtronics-style puzzle problems. Expect the format to evolve faster than at any other company.

Everyone is 'Member of Technical Staff'

From new hires to co-founders, the same external title. Internal T4/T5/T6+ system for compensation, but LinkedIn can't differentiate. Deliberate anti-poaching defense + research-lab egalitarianism. If external title progression matters to your career narrative, factor this in.

AI policy reversed May–July 2025

Banned AI in hiring May 2025, reversed by July 2025. Now Claude is allowed for prep and application refinement, prohibited during live interviews. Meanwhile, Anthropic uses Claude to write job descriptions and generate interview questions — a deliberate asymmetry they acknowledge openly.

29% positive SWE vs 100% positive Research Scientist on Glassdoor

The starkest role split in the dossier store. SWE loop is perceived as harsh. Research Scientists report a warmer, more collegial experience. The rapid scaling from ~500 to ~5,000 employees may be straining interview calibration.

The recruiter gives you homework

The second recruiter call distributes culture/values documents you're expected to study. This is not optional — the values round is where most candidates fail. Seven core values: act for the global good, hold light and shade, be good to users, ignite a safety race, do the simple thing, be helpful/honest/harmless, put mission first.

System design questions

LLM-infrastructure-native. Forget “Design Instagram” — expect inference batching, GPU scheduling, and model serving.

Design an Inference Batching System

Design a Scalable Token-Generation Service (100K+ RPS)

Design Distributed Search for 1B Docs at 1M QPS

Design a Prompt Playground for LLMs

Design Model Serving Infrastructure with GPU Load Balancing

Design a Claude Chat Service with Multi-Thread Handling

Unlock the full guide

Complete walkthrough, diagrams, and practice problems — all included with StrongYes Pro.

Unlock with Pro

Values screening — the Anthropic-distinctive section

This is what makes Anthropic interviews fundamentally different from every other company, including OpenAI. Seven core values are explicitly assessed:

Act for the global good
Hold light and shade (acknowledge both risks and benefits)
Be good to users, policymakers, and affected communities
Ignite a race to the top on safety
Do the simple thing that works
Be helpful, honest, and harmless
Put the mission first

Behavioral red flags that directly disqualify: lone wolf mentality, arrogance, purely financial motivation, inability to discuss AI ethics beyond surface-level boilerplate, and discomfort with candid discussion of past failures. The recruiter distributes culture documents early — treat them like a take-home assignment.

New grad entry (MTS)

New grads enter as Member of Technical Staff (MTS) — the same external title everyone carries. Internally likely T3 or T4. H-1B filings show MTS base range $300K–$405K. Zero bonuses. 4-year vesting with pre-IPO RSU-equivalent equity at $380B valuation.

What's different for new grads:

The 6-problem recurring bank applies to all levels. Studying it has high ROI.
Values screening is the primary failure point. Read the culture documents the recruiter provides. Study Anthropic's safety publications. “I want to work on cool AI” is not enough.
The HM code review round tests reading, not writing. Practice spotting bugs in unfamiliar Python.
~50% of technical staff had no prior ML experience. If you're a new grad without ML background, that's fine.
12-month cooldown after rejection. Longer than Meta (3–6 months).
Hybrid Bay Area with monthly visits. Remote employees report feeling “out of the loop.”

SWE vs Research Scientist tracks

Anthropic organizes interviews by two orgs: Applied (SWE) and Research. Your interviewers all come from your target org. Key differences:

Dimension	SWE (Applied)	Research Scientist
Coding focus	Practical systems, progressive complexity	Algorithm reasoning + collaborative problem-solving
Design round	LLM infrastructure, distributed systems	ML theory + system architecture
Unique round	Code review of existing codebases	Research presentation / case study
Glassdoor	29% positive	100% positive
Prep	Study team-specific infra	Review 40+ Anthropic papers

Interview culture

Candidates consistently describe Anthropic's interview as “thoughtful” and less adversarial than FAANG. One interviewing.io respondent noted the process “felt so easy and thoughtful compared to all the other companies I interviewed with. They have their shit together.”

However, the 29% positive rate for SWE interviews on Glassdoor suggests significant variability in execution — possibly reflecting the rapid scaling from ~500 to ~5,000 employees straining interview calibration. Research Scientists report 100% positive, suggesting the research loop is more refined.

The emphasis on practical engineering over algorithmic tricks, the code review round, and the genuine (not performative) values discussion create a distinctive interview experience. CEO approval is 93%, recommend-to-friend is 95%, and comp & benefits rated 4.8/5.0 (highest category). Work-life balance is 3.7/5.0 — some 60+ hour weeks are reported.

Next move

Practice the Anthropic problems.

If Anthropic is a live target, use the sourced guide and practice targets before opening the next rep.

Practice Anthropic problems Review Anthropic practice targets Compare another target

At a glance

Problems: 6 recurring
Difficulty: 15% easy, 65% med, 20% hard
Category: AI Frontier
Timeline: ~20 days
Glassdoor: 35% positive, 3.2/5 difficulty
AI Policy: Nuanced (prep OK, live no)
Sources: 19 verified
Updated: April 2026

Key topics

BFS & Graph TraversalCache ImplementationStack & Trace ProcessingDistributed AlgorithmsConcurrency & ThreadingCode Review

FAQ

Is the Anthropic question bank really only 6 problems?

Yes. Six core problems are consistently reported: Web Crawler, LRU Cache, Stack Trace, Distributed Mode/Median, Profiler Trace, and Tokenization. These are progressive multi-level problems on CodeSignal. Most candidates know them beforehand, but interviewers drill into concurrency models, distributed extensions, and failure modes that pure memorization cannot cover.

What makes the HM round different from other companies?

It's code review, not coding. You analyze someone else's codebase to find bugs, concurrency risks, and architectural issues. No other top tech company has a dedicated code review interview round. Practice reading unfamiliar code and spotting concurrency problems.

What level do new grads enter at?

New grads enter as Member of Technical Staff (MTS) — the same external title everyone carries from new hire to co-founder. Internally likely T3 or T4. The lowest documented band is T4/Senior at ~$550K median TC, but this is not the new-grad number. H-1B filings show MTS base range $300K–$405K. Zero bonuses at any level.

Can I use AI during the interview?

Nuanced. Claude is allowed for application refinement (after writing your own first draft) and interview prep. Prohibited during live interviews and take-home assessments unless explicitly told otherwise. Google and Stack Overflow are allowed during CodeSignal sessions. Anthropic's policy is more permissive than OpenAI's blanket prohibition.

How long does Anthropic's interview process take?

20 days on average (Glassdoor, 120+ submissions). One of the fastest in the store. Standard loops run 2–4 weeks. Complex roles or team matching can stretch to 4 months.

Why do so many candidates fail the culture round?

Values screening is the primary failure point. The recruiter distributes culture documents early — treat them like a take-home assignment. Behavioral red flags that directly disqualify: lone wolf mentality, arrogance, purely financial motivation. You must articulate a thoughtful perspective on AI risks and responsible deployment. "I want to work on cool AI" is not enough.