Common System Design Interview Questions

practice interview-questions

A comprehensive reference of the most common system design interview questions, organized by company type, difficulty, and category. Use this alongside the SDI books and the interview-framework to prepare systematically.


Section 1: Top 25 Most Common Questions Overall

#QuestionDifficultyTypeChapter ReferenceKey Patterns to Mention
1Design a URL Shortener (like bit.ly)EasyStorage, HashVol1 Ch08Consistent hashing, Base62 encoding, redirect, analytics
2Design a Rate LimiterEasyDistributedVol1 Ch04Token bucket, sliding window, Redis, distributed state
3Design a Key-Value StoreMediumDistributedVol1 Ch06Consistent hashing, replication, CAP, vector clocks
4Design a News Feed (like Facebook)MediumSocialVol1 Ch11Fan-out on write/read, Redis, pagination, celebrity problem
5Design a Chat System (like WhatsApp)MediumReal-timeVol1 Ch12WebSocket, message ordering, delivery receipts, presence
6Design a Notification SystemMediumReal-timeVol1 Ch10Push/APNs/FCM, fan-out, deduplication, retry
7Design YouTube / Video StreamingMediumStorage, CDNVol1 Ch14CDN, adaptive bitrate, chunked upload, encoding pipeline
8Design Google Drive / DropboxMediumStorageVol1 Ch15Block storage, delta sync, conflict resolution, versioning
9Design a Web CrawlerMediumDistributedVol1 Ch09URL frontier, politeness, dedup, distributed crawl
10Design a Search AutocompleteMediumReal-timeVol1 Ch13Trie, prefix hash, typeahead, CDN caching
11Design a Unique ID GeneratorEasyDistributedVol1 Ch07Snowflake ID, UUID, ticket server, clock skew
12Design a Payment System (like Stripe)HardFintechVol2 Ch11Idempotency, double-entry ledger, reconciliation, PSPs
13Design a Proximity Service (like Yelp)MediumGeospatialVol2 Ch01Geohash, quadtree, radius search, DB indexing
14Design a Ride-Sharing Service (like Uber)HardGeospatial, Real-timeVol2 Ch01, Ch02Geohash, WebSocket, matching algorithm, surge pricing
15Design Google Maps / NavigationHardGeospatialVol2 Ch03Graph traversal (Dijkstra/A*), ETA, routing tiles, traffic
16Design a Distributed Message Queue (like Kafka)HardDistributedVol2 Ch04Partitioning, consumer groups, offset, durability
17Design a Metrics & Monitoring SystemMediumData pipelineVol2 Ch05Time-series DB, aggregation, alerting, pull vs push
18Design S3 / Object StorageHardStorageVol2 Ch09Multipart upload, erasure coding, bucket/object model
19Design a Gaming LeaderboardMediumReal-time, DataVol2 Ch10Redis Sorted Sets, score normalization, real-time updates
20Design a Stock ExchangeVery HardFintechVol2 Ch13Order book, matching engine, ACID, low latency, sequencer
21Design a Digital WalletHardFintechVol2 Ch12Distributed transactions, idempotency, event sourcing
22Design an Ad Click Aggregation SystemHardData pipelineVol2 Ch06Stream processing, MapReduce, deduplication, watermarks
23Design a Hotel Reservation SystemMediumBookingVol2 Ch07Optimistic locking, double-booking prevention, inventory
24Design a Distributed Email ServiceHardDistributedVol2 Ch08SMTP, MIME, deduplication, spam filtering, mailbox storage
25Design Consistent HashingEasyDistributedVol1 Ch05Virtual nodes, ring, rebalancing, load distribution

Section 2: Questions by Company Type

FAANG / Big Tech

QuestionDifficultyTypeChapter ReferenceKey Patterns to Mention
Design Google Search (crawl + index + serve)Very HardDistributed, StorageVol1 Ch09 (crawl)Crawler, inverted index, PageRank, sharding, serving
Design Google MapsHardGeospatialVol2 Ch03Graph routing, tile serving, ETA, live traffic
Design YouTubeMediumStorage, CDNVol1 Ch14CDN, transcoding, metadata DB, recommendation
Design Gmail / Distributed EmailHardDistributedVol2 Ch08Mailbox sharding, SMTP, spam, threading
Design Facebook News FeedMediumSocialVol1 Ch11Fan-out, Redis, denormalization, pagination
Design Facebook Messenger / WhatsAppMediumReal-timeVol1 Ch12WebSocket, message store, group messaging
Design InstagramMediumSocial, StorageVol1 Ch11, Ch14CDN, fan-out, media storage, explore feed
Design Amazon Search / Product CatalogMediumSearch, StorageVol1 Ch13Elasticsearch, inverted index, faceted search
Design Amazon Order ManagementHardFintech, DistributedVol2 Ch11Saga pattern, inventory, payment, fulfillment
Design Netflix StreamingHardCDN, StorageVol1 Ch14Adaptive bitrate, CDN prefetch, Open Connect
Design Apple Push Notification ServiceMediumReal-timeVol1 Ch10Fan-out, device tokens, delivery guarantee
Design AWS S3HardStorageVol2 Ch09Object store, multipart, metadata, replication

Fintech

QuestionDifficultyTypeChapter ReferenceKey Patterns to Mention
Design Stripe Payment APIHardFintechVol2 Ch11Idempotency keys, PSP integration, webhooks, reconciliation
Design Robinhood Stock Feed / TradingVery HardFintech, Real-timeVol2 Ch13Order book, market data feed, matching engine
Design Coinbase Crypto ExchangeVery HardFintechVol2 Ch13Matching engine, blockchain integration, wallets
Design a Digital Wallet (like Venmo/PayPal)HardFintechVol2 Ch12Distributed transactions, double-entry accounting
Design Fraud Detection SystemHardData pipelineVol2 Ch06Stream processing, ML feature store, rule engine
Design a Rate Limiter for a Payments APIEasyDistributedVol1 Ch04Token bucket, per-customer limits, Redis

Social Media

QuestionDifficultyTypeChapter ReferenceKey Patterns to Mention
Design Twitter / X TimelineMediumSocialVol1 Ch11Fan-out, celebrity problem, Redis sorted sets
Design Twitter SearchHardSearchVol1 Ch09 (concepts)Inverted index, real-time indexing, Elasticsearch
Design Instagram StoriesMediumSocial, StorageVol1 Ch14TTL, CDN, media pipeline, viewer counts
Design TikTok Video FeedHardSocial, CDNVol1 Ch14Recommendation, video pipeline, CDN, infinite scroll
Design Snapchat Stories / MessagingMediumReal-time, StorageVol1 Ch12Ephemeral storage, E2E encryption, media storage
Design a “Who Viewed Your Profile” FeatureMediumSocialVol1 Ch11Count-min sketch, approximate counting, privacy

Storage / Cloud / Productivity

QuestionDifficultyTypeChapter ReferenceKey Patterns to Mention
Design Dropbox / Google DriveMediumStorageVol1 Ch15Block sync, delta diff, conflict resolution, versioning
Design Slack / Team MessagingMediumReal-timeVol1 Ch12WebSocket, channels, threading, search, retention
Design Notion / Collaborative Doc EditorHardReal-time, StorageVol1 Ch15 (concepts)CRDT / OT, versioning, real-time collab, block model
Design a PastebinEasyStorageVol1 Ch08 (concepts)Object store, TTL, short URLs, rate limiting
Design a Cloud File Sync ServiceMediumStorageVol1 Ch15Delta sync, deduplication, conflict resolution

Rideshare / Maps

QuestionDifficultyTypeChapter ReferenceKey Patterns to Mention
Design Uber / LyftHardGeospatial, Real-timeVol2 Ch01, Ch02Geohash, driver matching, WebSocket, surge, ETA
Design Google Maps NavigationHardGeospatialVol2 Ch03Dijkstra/A*, routing tiles, live traffic, ETA
Design Yelp / Proximity SearchMediumGeospatialVol2 Ch01Geohash, quadtree, radius query, business index
Design Nearby Friends (like Facebook)MediumGeospatial, Real-timeVol2 Ch02Location pub/sub, Redis geospatial, privacy
Design a Delivery ETA SystemHardGeospatialVol2 Ch03Historical traffic, ML models, graph routing

Section 3: Question Difficulty Breakdown

Easy — Junior / Mid-Level (L3-L4)

QuestionKey FocusWhat Interviewers Want to See
URL ShortenerHashing, redirects, analyticsBasic CRUD, simple scale, DB design
Unique ID GeneratorDistributed IDs, clock skewSnowflake IDs, trade-offs of approaches
Rate LimiterAlgorithms, distributed stateToken bucket vs sliding window, Redis
Consistent HashingRing, virtual nodesUnderstanding of why it’s needed
PastebinObject storage, TTL, short URLsSimple architecture, basic scale
Design a TinyURL Analytics DashboardMetrics, aggregationTime-series data, aggregation strategies

Medium — Senior (L5)

QuestionKey FocusWhat Interviewers Want to See
News FeedFan-out, caching, scalePush vs pull trade-off, celebrity problem
Chat SystemReal-time, ordering, deliveryWebSocket, message queues, at-least-once
Notification SystemPush delivery, fan-outAPNs/FCM, deduplication, retry strategies
YouTubeVideo pipeline, CDNEncoding, adaptive bitrate, metadata sharding
Google DriveSync, versioningBlock diff, conflict resolution, resumable upload
Web CrawlerDistributed crawl, politenessURL frontier, dedup, robots.txt, scheduling
Search AutocompleteTrie, caching, latencyData structure choice, CDN, update lag
Key-Value StoreCAP, replication, quorumDynamo-style design, gossip, vector clocks
Proximity ServiceGeohash, quadtreeSpatial indexing trade-offs, radius search
Metrics MonitoringTime-series, alertingTSDB, pull vs push, cardinality

Hard — Staff Level (L6)

QuestionKey FocusWhat Interviewers Want to See
Payment SystemIdempotency, ACID, reconciliationDouble-entry, PSP integration, failure recovery
Distributed Message QueuePartitioning, durability, at-least-onceKafka internals, consumer groups, offset management
S3 Object StorageErasure coding, metadata, multipartObject model, durability, replication design
Ad Click AggregationStream + batch, deduplicationLambda architecture, watermarks, fault tolerance
Google MapsGraph routing, live trafficDijkstra/A*, tile serving, ETA accuracy
Uber Driver MatchingReal-time matching, geospatialLocation indexing, WebSocket at scale, surge
Digital WalletDistributed transactions, consistencyEvent sourcing, saga pattern, reconciliation
Distributed EmailMailbox sharding, SMTPDeduplication, spam filtering, threading
Hotel ReservationInventory locking, double-bookingOptimistic locking, idempotency, overbooking
Gaming LeaderboardReal-time ranking, high writeRedis Sorted Sets, approximate counting

Very Hard — Principal / Distinguished (L7+)

QuestionKey FocusWhat Interviewers Want to See
Stock ExchangeOrder book, matching engine, latencyLMAX disruptor, sequencer, sub-ms latency
Google Search (full)Crawl + index + rank + servePageRank, serving at scale, freshness
Global Distributed DBConsensus, partitioning, clock syncPaxos/Raft, TrueTime (Spanner), linearizability
Collaborative Doc EditorCRDT/OT, real-time consistencyOperational transforms, conflict-free merges
Recommendation Engine at ScaleML pipeline, feature store, servingOffline training + online serving, A/B testing

Section 4: Question Types

Storage-Heavy Systems

QuestionStorage ChallengeKey Patterns
YouTubeVideo blobs, metadata, thumbnailsObject store (S3), CDN, chunked upload
Google DriveFile blocks, versioning, delta syncBlock dedup, conflict resolution, resumable uploads
S3 Object StoragePetabyte-scale blob storageErasure coding, consistent hashing, metadata DB
Distributed EmailMailbox storage, attachmentsMIME, mailbox sharding, attachment dedup
Web CrawlerURL data, crawled contentBloom filter dedup, distributed queue, content store

Real-Time Systems

QuestionReal-Time ChallengeKey Patterns
Chat SystemMessage ordering, delivery guaranteeWebSocket, message queue, at-least-once delivery
Notification SystemLow-latency fan-outAPNs/FCM, Kafka fan-out, device token management
Gaming LeaderboardLive score updatesRedis Sorted Sets, pub/sub, debounced updates
Nearby FriendsLive location broadcastLocation pub/sub, geospatial index, privacy controls
Stock ExchangeSub-millisecond matchingLMAX disruptor, single-threaded engine, lock-free

High-Throughput Systems

QuestionThroughput ChallengeKey Patterns
Rate LimiterMillions of checks/secRedis INCR, sliding window, token bucket in memory
Ad Click AggregationBillions of events/dayKafka ingestion, MapReduce, time-window aggregation
Distributed Message QueueHigh-volume pub/subPartitioning, sequential disk I/O, batch writes
Key-Value StoreHigh read/write QPSIn-memory cache, consistent hashing, replication
Metrics MonitoringHigh-cardinality time-seriesTSDB (InfluxDB/Prometheus), downsampling, retention

Geospatial Systems

QuestionGeo ChallengeKey Patterns
Proximity ServiceRadius search at scaleGeohash, quadtree, DB spatial index
Nearby FriendsReal-time location updatesRedis GEORADIUS, pub/sub, location TTL
Google MapsRouting, live trafficGraph algorithms, routing tiles, traffic prediction
Uber Driver MatchingReal-time driver locationGeohash, WebSocket, H3 hexagonal indexing
Delivery ETARoute estimationHistorical traffic, ML models, A/B routing

Financial Systems

QuestionFinance ChallengeKey Patterns
Payment SystemExactly-once, reconciliationIdempotency keys, double-entry ledger, saga
Digital WalletDistributed balance updatesEvent sourcing, CQRS, optimistic locking
Stock ExchangeOrder book, fairness, latencyFIFO queue, price-time priority, sequencer
Fraud DetectionReal-time anomaly detectionStream processing, rule engine, ML scoring
Hotel ReservationInventory + double bookingOptimistic locking, overbooking policy, ACID tx

Section 5: Questions NOT in the SDI Books (Good Stretch Problems)

These questions extend beyond the SDI books. Ideal for senior/staff prep and for companies with unique domains.

QuestionDifficultyTypeHints / Key Areas to Explore
Design a Collaborative Code Editor (like CodePair)HardReal-time, CRDTOperational Transform or CRDT, cursor sync, execution sandbox
Design a Distributed Cron / Job SchedulerHardDistributedLeader election, at-least-once execution, idempotency, sharding jobs
Design an App Store (like Google Play)HardStorage, DistributedBinary distribution, versioning, review pipeline, developer portal
Design a Live Streaming Platform (like Twitch)HardReal-time, CDNRTMP ingest, HLS/DASH distribution, chat at scale, viewer count
Design a CI/CD Pipeline (like GitHub Actions)HardDistributedJob queue, worker isolation (containers), artifact storage, caching
Design a Social Graph (like LinkedIn connections)HardGraph, DistributedGraph DB vs adjacency list, BFS at scale, degree-of-connection
Design a Real-Time Analytics DashboardHardData pipelineStream processing, pre-aggregation, live push via WebSocket
Design an E-commerce Search with PersonalizationHardSearch, MLElasticsearch, personalized ranking, A/B testing, click feedback
Design a Ride-Share Pricing Engine (Surge)HardReal-timeDemand/supply ratio, ML models, geographic zones, fairness
Design a Content Moderation SystemHardML, DistributedAsync media pipeline, classifier queue, human-in-the-loop, appeals
Design a Two-Factor Authentication (2FA) ServiceMediumSecurity, DistributedTOTP (RFC 6238), rate limiting, fallback SMS, device trust
Design a Distributed Tracing System (like Jaeger)HardObservabilityTrace context propagation, span storage, sampling strategies
Design a Feature Flag ServiceMediumDistributedConsistent evaluation, rollout percentages, kill switch, low latency
Design a Real-Time Bidding (RTB) SystemVery HardFintech, Real-timeSub-100ms auction, DSP/SSP, bidding logic, auction result delivery
Design an ML Feature StoreHardData pipeline, MLFeature computation, point-in-time correctness, online/offline store
Design a DNS Resolver at ScaleHardDistributedRecursive resolution, caching, TTL, DDoS protection
Design a CDN from ScratchHardInfrastructureEdge PoPs, origin pull, cache invalidation, anycast routing
Design a Ticket Booking System (like Ticketmaster)HardFintech, DistributedSeat locking, virtual queue, flash sale handling, double-booking
Design a Distributed Configuration Service (like ZooKeeper)HardDistributedConsensus (ZAB/Raft), watch mechanism, sequential consistency
Design a Privacy-Preserving Analytics SystemVery HardData pipeline, SecurityDifferential privacy, k-anonymity, aggregation with noise

Last Updated: 2026-04-13