Do not route every action to one core. Serve static catalog, media, manifests, depot chunks, and community reads from regional edge caches while keeping accounts, payments, and entitlements strongly controlled.
Architecture Briefing
System Design: Steam
A tactical architecture map for a Steam-scale gaming platform: global edge delivery, identity and ownership, commerce, content pipelines, multiplayer coordination, data stores, observability, and elastic capacity controls for a billion-user footprint.
Traffic manager shifts users to healthy regions, autoscalers add API/game-service workers, queues absorb spikes, CDN capacity expands by depot popularity, and hot data moves into cache.
Cooldown windows drain queues, remove idle workers, lower cache prewarm, shrink batch fleets, and keep only baseline regional capacity plus standby failover.
Shard accounts and inventories, isolate store traffic from multiplayer traffic, roll out with feature flags, and route around bad POPs or degraded data replicas.
Operation Map
Steam-Scale Platform Battlefield
Clients and Creators
Billions of sessions start here, but most reads stay cached.
Global Traffic Edge
Absorbs reads, routes around regional failure, sheds abusive load.
Trust and Commerce
Authoritative path for identity, money, licenses, and policy.
Content Supply Line
Turns developer builds into regional chunks and manifests.
Community and UGC
High-read social surfaces isolated from payment hot paths.
Multiplayer Command
Session control scales separately from real-time relay traffic.
Data Backbone
Shard writes, replicate reads, and keep hot state in memory.
Elastic War Room
Decides when to add, drain, fail over, or brown out capacity.
Edge gateway checks identity, catalog and commerce, commits the license to account shards, publishes purchase events, then warms read replicas for launch.
Builds become immutable depots and manifests. Popular depots are forecast, prewarmed, and served from CDN/edge capacity instead of the origin.
Friends, lobbies, server heartbeats, and SDR ticketing coordinate the match while relay POPs independently scale bandwidth and route quality.
How It Supports 1B Users
The architecture separates ultra-hot read paths from authoritative write paths. Store pages, media, depots, manifests, profiles, reviews, and public metadata are cached at the edge. Identity, wallet, ownership, inventory and payment writes stay behind stricter control planes with partitioned account shards and audit logs.
How It Scales Up
Traffic spikes trigger more gateway workers, more regional service pools, larger relay bandwidth, wider CDN cache fill, and additional queue consumers. Launch events can prewarm depots and temporarily reserve capacity before players arrive.
How It Scales Down
After queue depth, request rate, and relay utilization drop, the capacity controller drains workers, removes cache prewarm, shifts batch jobs to cheaper windows, and returns regions to baseline plus failover reserves.
Vercel turns Git commits into immutable deployments, then routes global traffic through an edge network that can serve cached static assets, invoke serverless compute, or stream dynamic responses.
Popular pages move to cache, serverless/Fluid Compute workers scale by request load, ISR and data caches reduce origin pressure, and deployments stay immutable so new versions do not mutate old traffic.
Idle compute instances drain, cache entries age out, preview deployments sit cold until requested, and traffic can shift back to stable production aliases without rebuilding.
Preview URLs, atomic aliases, instant rollback, rolling releases, protection checks, observability, and firewall controls keep experiments isolated from production users.
Cloud Operation Map
Vercel Platform Architecture
Developer Inputs
Source changes enter through Git, CLI, APIs, and project settings.
Build Control Plane
Framework detection and build isolation create immutable outputs.
Release Layer
Deployments become URLs, aliases, previews, and guarded rollouts.
Global Edge Network
First request landing zone for cache, routing, security, and compute.
Compute Plane
Dynamic requests split across edge code and regional serverless compute.
Data and Cache Plane
Platform cache and managed stores reduce function and origin work.
Observability and Governance
Feedback loop for performance, reliability, cost, and security.
User Traffic
Requests resolve to cached assets, edge logic, or compute.
Vercel Request Path
A request enters the nearest edge, passes protection and route matching, then either serves a cached asset/page, runs middleware, invokes Functions or Fluid Compute, or reaches a managed/external datastore. Cacheable results feed back into CDN and data cache.
Vercel Deploy Path
Git or CLI events enqueue a build, reuse build cache, produce Build Output API artifacts, register an immutable deployment, create preview URLs, then atomically move aliases for production.
Vercel Scaling Path
Static assets and ISR pages scale through the CDN. Dynamic traffic scales through request-driven compute and regional capacity. When load falls, idle compute drains while cached outputs remain cheap to serve until eviction or revalidation.
Use Vercel as the global experience layer: store, web app, marketing, community reads, preview builds, dashboards, edge routing, cached pages, and lightweight APIs.
Keep authoritative systems outside Vercel: identity ledger, wallet/payment, entitlements, inventory, matchmaking, game relays, content depot storage, and event streaming.
Most users hit Vercel edge cache and CDN. Only authenticated writes, purchases, license checks, matchmaking, and downloads cross into regional platform services.
Scale web reads through Vercel, scale game/content systems through dedicated regional backends, and connect them with async events plus strict ownership APIs.
Combined Target Architecture
Steam-Like Platform Built on Vercel
Player and Developer Entry
Browsers, launchers, games, and creators enter through separate paths.
Vercel Experience Layer
Vercel owns global web delivery, previews, and cache-heavy surfaces.
API Gateway and Trust Boundary
All authoritative operations pass through strict regional gateways.
Commerce and Ownership Core
Money and licenses need strong consistency, audit, and rollback safety.
Content and Download Plane
Vercel serves store media; dedicated depot CDN serves game bytes.
Multiplayer and Social
Real-time game systems bypass Vercel and scale on regional capacity.
Data, Events and Analytics
Queues isolate spikes and let slow work happen outside requests.
Scaling and Operations
Vercel scales the web edge; platform controllers scale game systems.
What Vercel Should Own
Vercel is the global web surface: storefront pages, discovery, marketing, docs, community reads, preview deployments, admin UI, edge middleware, thin BFF functions, cached catalog/read pages, and safe UI releases through aliases and rollbacks.
What Must Stay Outside Vercel
Do not run the authoritative Steam-like core only on Vercel. Payments, wallet, entitlements, account shards, inventory, matchmaking, game relays, depot downloads, event streams, and large object storage need dedicated regional backends and operational control.
1B User Scaling Shape
Anonymous browsing and public reads scale through Vercel CDN and ISR. Authenticated writes go to regional core services. Large downloads use a dedicated game CDN. Multiplayer uses regional lobbies and relay POPs. Queues absorb spikes and let capacity scale down safely.
Research Sources
- Steamworks Authentication documentation
- SteamPipe content system documentation
- Steam Datagram Relay documentation
- Steamworks matchmaking, inventory, cloud, and Web API docs
- Vercel docs: Edge Network, Deployments, Functions, Fluid Compute, Build Output API, Caching, ISR, Rolling Releases, Observability, Firewall and Storage