Design Twitter's News Feed
Twitter's feed system is a canonical system design problem because it centers on an interesting architectural challenge: the fan-out problem. When someone with 50M followers tweets, how do you efficiently deliver that tweet to all of them?
Step 1: Clarify Requirements
Functional requirements:
- Users can post tweets (text, images, videos)
- Users can follow other users
- Home timeline shows tweets from followed users in reverse-chronological order
- Tweets can have likes, replies, and retweets
Non-functional requirements:
- 300M DAU
- 500M tweets/day
- Read-heavy: 300B timeline reads/day vs. 500M writes
- P99 latency for home timeline: < 200ms
- System should be highly available
- Eventual consistency is acceptable (slight delays are OK)
Step 2: Estimate Scale
Writes: 500M tweets/day = ~6,000 tweets/sec (peak: ~15,000/sec)
Reads: 300B timeline reads/day = ~3.5M reads/sec (peak: ~7M/sec)
Read/Write ratio: 600:1 (extremely read-heavy)
Storage:
- Tweet: user_id (8B) + text (280 bytes) + timestamp (8B) + media_url (256B) ≈ 600B
- 500M tweets/day × 600B = 300 GB/day
- 10 years: ~1 PB (tweets are small but there are a lot of them)
Following relationships:
- 300M users × avg 200 follows = 60B follow relationships
- 60B × 16 bytes (follower_id + following_id) = ~1 TB
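These back-of-envelope figures are easy to sanity-check in code. A quick script (pure arithmetic, no external services; every input is an assumption already stated above):

```javascript
// Sanity-check the capacity estimates above.
const SECONDS_PER_DAY = 86_400;

const tweetsPerSec = Math.round(500e6 / SECONDS_PER_DAY);  // ~5,800/sec average
const readsPerSec = Math.round(300e9 / SECONDS_PER_DAY);   // ~3.5M/sec average
const readWriteRatio = 300e9 / 500e6;                      // 600:1

const bytesPerTweet = 8 + 280 + 8 + 256;                   // ~550B, rounded up to 600B
const storagePerDayGB = (500e6 * 600) / 1e9;               // 300 GB/day
const tenYearsPB = (storagePerDayGB * 365 * 10) / 1e6;     // ~1.1 PB

const followEdges = 300e6 * 200;                           // 60B edges
const followStoreTB = (followEdges * 16) / 1e12;           // ~1 TB

console.log({ tweetsPerSec, readsPerSec, readWriteRatio, tenYearsPB, followStoreTB });
```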
Step 3: Core Design Decisions
The Fan-Out Problem
This is the central challenge of Twitter's architecture.
Scenario: User A has 50M followers. They tweet. How do 50M people see it?
You have two strategies:
Option A: Fan-Out on Write (Push Model)
When a tweet is posted, immediately write to each follower's timeline cache.
User A tweets
      │
      ▼
Tweet Service
      │
      ▼
Fan-Out Worker
      │
      ├──► Write to follower 1's timeline cache
      ├──► Write to follower 2's timeline cache
      ├──► Write to follower 3's timeline cache
      │    ...
      └──► Write to follower 50M's timeline cache
Pros:
- Home timeline reads are O(1): just read from cache
- Very fast reads (< 1ms)
Cons:
- Writing a celebrity's tweet requires 50M cache writes, which is expensive and slow
- Storage: 300M users × 100 cached tweets × 600 bytes = 18 TB of cache
- Wasted storage for inactive users
Option B: Fan-Out on Read (Pull Model)
When a user opens their timeline, query tweets from all accounts they follow.
User B opens timeline
      │
      ▼
Timeline Service
      │
      ├──► Fetch tweets from user 1's tweet store
      ├──► Fetch tweets from user 2's tweet store
      ├──► Fetch tweets from user 3's tweet store
      │    ... (for each of user B's 200 follows)
      └──► Merge, sort, return top 20
Pros:
- No fan-out cost on write
- No wasted storage for inactive users
Cons:
- Timeline generation requires N queries (N = number of follows), which is slow
- Hard to maintain low latency for users who follow 1000 accounts
Twitter's Actual Approach: Hybrid
Twitter uses a hybrid strategy based on account type:
User posts tweet
      │
      ▼
Is user a celebrity (>10k followers)?
      │
      ├── YES: Store tweet only in tweet store
      │        (don't fan out to all followers)
      │
      └── NO: Fan out to all followers' timeline caches
              (write tweet_id to each follower's timeline)

User opens timeline
      │
      ▼
Read timeline cache (pre-built for non-celebrity follows)
      │
      ▼
Fetch any celebrity tweets from their tweet stores
      │
      ▼
Merge + sort
      │
      ▼
Return to user
This works because:
- Most people have < 10k followers, so fan-out is cheap
- Most people don't follow many celebrities, so the merge step is small
- Celebrities are the exception, not the rule
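At read time, the hybrid approach boils down to merging two already-sorted lists: the pre-built timeline and the followed celebrities' recent tweets. A sketch of that merge (a two-pointer merge over descending, time-sortable tweet IDs; the function name is illustrative, not Twitter's actual code):

```javascript
// Merge the pre-built timeline (non-celebrity tweets) with celebrity
// tweets fetched at read time. Both inputs are sorted newest-first by
// time-sortable tweet ID, so a single linear merge yields the page.
function mergeTimelines(cachedTweetIds, celebrityTweetIds, limit = 20) {
  const out = [];
  let i = 0, j = 0;
  while (out.length < limit &&
         (i < cachedTweetIds.length || j < celebrityTweetIds.length)) {
    const a = cachedTweetIds[i], b = celebrityTweetIds[j];
    if (b === undefined || (a !== undefined && a > b)) {
      out.push(a); i++;   // cached tweet is newer (larger ID)
    } else {
      out.push(b); j++;   // celebrity tweet is newer
    }
  }
  return out;
}
```

Because the merge touches at most `limit` elements per input, it stays cheap even for users who follow many celebrities.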
Step 4: Detailed Architecture
Data Model
-- Users table
CREATE TABLE users (
    user_id        BIGINT PRIMARY KEY,
    username       VARCHAR(50) UNIQUE,
    bio            TEXT,
    follower_count BIGINT,
    created_at     TIMESTAMP
);

-- Tweets table
CREATE TABLE tweets (
    tweet_id   BIGINT PRIMARY KEY,  -- Snowflake ID (time-sortable)
    user_id    BIGINT NOT NULL,
    content    TEXT,
    media_url  TEXT,
    like_count BIGINT DEFAULT 0,
    created_at TIMESTAMP,
    INDEX(user_id, created_at DESC)
);

-- Follows table
CREATE TABLE follows (
    follower_id  BIGINT NOT NULL,
    following_id BIGINT NOT NULL,
    created_at   TIMESTAMP,
    PRIMARY KEY(follower_id, following_id),
    INDEX(following_id, follower_id)
);
Why Snowflake IDs for tweets?
- Time-sortable: tweet IDs are monotonically increasing with time
- No need to sort by timestamp; sorting by ID gives chronological order
- Distributed: can be generated without a central counter
- Compact: 64-bit integer
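A minimal sketch of how such an ID can be built and decomposed (the 41/10/12-bit layout and the 2010 epoch follow the commonly described Snowflake design; treat the details as illustrative):

```javascript
// Snowflake-style 64-bit ID: 41 bits of milliseconds since a custom
// epoch, 10 bits of worker ID, 12 bits of per-millisecond sequence.
// BigInt is used because the result exceeds Number.MAX_SAFE_INTEGER.
const EPOCH = 1288834974657n; // Twitter's published Snowflake epoch (Nov 2010)

function makeSnowflake(timestampMs, workerId, sequence) {
  return ((BigInt(timestampMs) - EPOCH) << 22n) |
         (BigInt(workerId) << 12n) |
         BigInt(sequence);
}

function snowflakeTimestamp(id) {
  return Number((id >> 22n) + EPOCH); // recover creation time in ms
}
```

Because the timestamp occupies the high bits, comparing two IDs numerically is the same as comparing their creation times.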
Timeline Cache (Redis)
Each user's home timeline is stored as a sorted set in Redis:
// Key: timeline:{user_id}
// Value: sorted set of tweet_ids, score = timestamp

// Fan-out: add tweet to each follower's timeline
async function fanOut(tweetId, authorId, timestamp) {
  const followers = await getFollowers(authorId);
  // Process in batches of 100
  for (const batch of chunk(followers, 100)) {
    await Promise.all(batch.map(async followerId => {
      // 'NX': don't overwrite if already present (ioredis-style arguments)
      await redis.zadd(`timeline:${followerId}`, 'NX', timestamp, tweetId);
      // Keep only the 800 most recent tweets per follower's timeline
      await redis.zremrangebyrank(`timeline:${followerId}`, 0, -801);
    }));
  }
}
// Read: get home timeline
async function getHomeTimeline(userId, cursor = 0, limit = 20) {
  const tweetIds = await redis.zrevrange(
    `timeline:${userId}`,
    cursor,
    cursor + limit - 1
  );
  // Fetch actual tweet data (could be another cache layer)
  const tweets = await Promise.all(tweetIds.map(id => getTweet(id)));
  return tweets.filter(Boolean);
}
Fan-Out Service
class FanOutService {
  async processTweet(tweet) {
    const { tweetId, userId, timestamp } = tweet;
    const followerCount = await getUserFollowerCount(userId);
    if (followerCount < 10000) {
      // Regular user: fan out immediately
      await this.fanOutToFollowers(tweetId, userId, timestamp);
    } else {
      // Celebrity: just store in the tweet store; the timeline service
      // merges their tweets in at read time. Optionally still fan out to
      // a limited set (e.g., "super followers" or verified accounts).
      await tweetStore.save(tweet);
    }
  }
  async fanOutToFollowers(tweetId, userId, timestamp) {
    let cursor = null;
    // Paginate through followers
    while (true) {
      const { followers, nextCursor } = await getFollowersBatch(userId, cursor);
      await Promise.all(followers.map(followerId =>
        redis.zadd(`timeline:${followerId}`, timestamp, tweetId)
      ));
      if (!nextCursor) break;
      cursor = nextCursor;
    }
  }
}
Full System Architecture
┌─────────────────────────────────────────┐
│              Load Balancer              │
└────────┬───────────────────────┬────────┘
         │                       │
┌────────▼───────┐      ┌────────▼───────┐
│   Tweet API    │      │  Timeline API  │
│    Service     │      │    Service     │
└────────┬───────┘      └────────┬───────┘
         │                       │
┌────────▼───────┐      ┌────────▼───────┐
│ Message Queue  │      │ Timeline Cache │
│    (Kafka)     │      │(Redis Cluster) │
└────────┬───────┘      └────────▲───────┘
         │                       │
┌────────▼───────┐               │ (merge)
│    Fan-Out     │               │
│    Workers     ├───────────────┘
└────────┬───────┘
         │
┌────────▼────────────────────────┐
│           Tweet Store           │
│    (Cassandra, time-series)     │
└────────┬────────────────────────┘
         │
┌────────▼────────────────────────┐
│     User/Follow Graph Store     │
│       (MySQL / Graph DB)        │
└─────────────────────────────────┘
Interview Follow-up Questions
Q: How do you handle users with 100M followers?
For mega-celebrities (100M+ followers), fan-out is simply too expensive. Their tweets are not pre-pushed to anyone's timeline. Instead, the timeline service identifies celebrities among the user's follows and fetches their recent tweets at read time, merging with the pre-built timeline.
Q: How do you handle the "thundering herd" problem when a celebrity tweets?
Use a cache with a short TTL for celebrity timeline fetches. Add jitter to prevent all users from invalidating their cache at the same moment. Process fan-outs through an async queue rather than synchronously.
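A sketch of the jitter idea (the 20% jitter ratio is an arbitrary choice):

```javascript
// Jittered TTL: instead of a fixed expiry (which makes every reader of a
// celebrity timeline miss the cache at the same moment), each entry
// expires at base +/- up to `jitterRatio` of the base, spreading refills
// over time.
function jitteredTtlSeconds(baseSeconds, jitterRatio = 0.2) {
  const jitter = (Math.random() * 2 - 1) * jitterRatio * baseSeconds;
  return Math.round(baseSeconds + jitter);
}

// e.g. redis.setex(key, jitteredTtlSeconds(60), payload)  // expires in 48-72s
```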
Q: How would you implement the "trending topics" feature?
Maintain a real-time count of hashtags over a sliding window (e.g., the last hour). Use Redis sorted sets: ZINCRBY trending 1 #hashtag on each use; at read time, ZREVRANGE trending 0 9 returns the top 10. Refresh periodically and apply geographic filtering.
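An in-memory sketch of the same idea (per-minute buckets standing in for per-minute Redis sorted sets; the class and method names are illustrative):

```javascript
// Bucketed sliding-window hashtag counter: counts go into per-minute
// buckets, and "trending" sums the buckets inside the window. In Redis
// this maps to one sorted set per minute (ZINCRBY) combined over the
// window at read time.
class TrendingCounter {
  constructor(windowMinutes = 60) {
    this.windowMinutes = windowMinutes;
    this.buckets = new Map(); // minute -> Map(hashtag -> count)
  }

  record(hashtag, nowMs) {
    const minute = Math.floor(nowMs / 60_000);
    if (!this.buckets.has(minute)) this.buckets.set(minute, new Map());
    const bucket = this.buckets.get(minute);
    bucket.set(hashtag, (bucket.get(hashtag) || 0) + 1);
  }

  top(n, nowMs) {
    const cutoff = Math.floor(nowMs / 60_000) - this.windowMinutes;
    const totals = new Map();
    for (const [minute, bucket] of this.buckets) {
      if (minute <= cutoff) { this.buckets.delete(minute); continue; } // expire old buckets
      for (const [tag, count] of bucket) {
        totals.set(tag, (totals.get(tag) || 0) + count);
      }
    }
    return [...totals.entries()].sort((a, b) => b[1] - a[1]).slice(0, n);
  }
}
```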
Q: How do you keep tweet counts (likes, retweets) accurate at scale?
Count updates are high volume, so don't write to the tweet record on every like. Instead, buffer increments in a separate counter store (Redis) and sync to the primary DB asynchronously. Exact counters can be sharded across keys; where an approximate distinct count is enough (e.g., unique viewers of a tweet), probabilistic structures like HyperLogLog keep memory constant at massive scale.
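A sketch of the write-behind counter (an in-memory Map stands in for the Redis counter store, and the flush would run on a timer in practice; all names are illustrative):

```javascript
// Write-behind counters: increments accumulate as per-tweet deltas in a
// fast store, and a periodic flush applies each accumulated delta to the
// primary DB in a single write, instead of one DB write per like.
class CounterBuffer {
  constructor(flushToDb) {
    this.deltas = new Map();    // tweetId -> pending like-count delta
    this.flushToDb = flushToDb; // async (tweetId, delta) => void
  }

  increment(tweetId, by = 1) {
    this.deltas.set(tweetId, (this.deltas.get(tweetId) || 0) + by);
  }

  async flush() {
    const pending = this.deltas;
    this.deltas = new Map();    // swap first so concurrent increments aren't lost
    for (const [tweetId, delta] of pending) {
      await this.flushToDb(tweetId, delta); // one DB write per tweet, not per like
    }
  }
}
```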