Top K Problem System Design

This post was originally written by CodingMrWang (author: @Zexian Wang). Please keep the original link if you want to repost it.

Requirements

Scalable (scales out together with increasing amount of data: videos, tweets, posts, etc.)
Highly available (survive hardware/network failure, no single point of failure)
Highly performant
Accurate (there is a trade-off between performance and accuracy)
Data persistence

Top K Service

HashMap + Min Heap
- Use a HashMap to count the frequency of each element.
- When an element arrives, check whether it exists in the map:
  - If yes, increase its frequency and compare it with the frequency of the minimum element in the heap. If it is larger, pop the minimum element from the heap and insert the new element.
  - If no, insert it into the map.

- Time Complexity: O(N log K), which is O(N) when K is treated as a constant
- Space Complexity: O(N)
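The idea above can be sketched in a few lines of Python. This is a simplified batch variant, not the per-event update described in the bullets: it counts everything first and then makes one pass over the map with a min-heap capped at k entries. The function name `top_k` and the sample data are assumptions made for illustration.

```python
import heapq
from collections import Counter


def top_k(stream, k):
    """Return the k most frequent items as (frequency, item), highest first."""
    # HashMap step: count the frequency of every element.
    counts = Counter()
    for item in stream:
        counts[item] += 1

    # Min-heap step: keep at most k entries; the root is the weakest candidate.
    heap = []
    for item, freq in counts.items():
        if len(heap) < k:
            heapq.heappush(heap, (freq, item))
        elif freq > heap[0][0]:
            # The new element beats the current minimum, so replace it.
            heapq.heapreplace(heap, (freq, item))

    return sorted(heap, reverse=True)


print(top_k(["a", "b", "a", "c", "a", "b"], 2))
```

The heap never holds more than k entries, so the heap pass costs O(M log K) for M distinct elements, while the map still needs O(M) memory.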

But if the data size is large, we cannot keep all of the data on a single machine.

Multi-machine HashMap + Min Heap
- Shard the incoming data across multiple machines. On each machine, use a HashMap and a heap to maintain that machine's k most frequent elements.
- Send every k-size heap to a single machine and combine them into a single k-size heap.
This still cannot handle very large data sets, because the memory of each machine is limited.
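The combine step might look like the sketch below. It assumes the data is sharded by element key, so each element is counted on exactly one machine; with random sharding the per-machine counts would have to be summed before taking the top k. The function name `merge_top_k` and the `(item, frequency)` pair layout are illustrative assumptions.

```python
import heapq


def merge_top_k(shard_results, k):
    """Combine per-shard top-k lists of (item, frequency) into one global top k.

    Assumes key-based sharding: no item appears in more than one shard.
    """
    candidates = [pair for shard in shard_results for pair in shard]
    # nlargest keeps only k entries in its internal heap while scanning.
    return heapq.nlargest(k, candidates, key=lambda pair: pair[1])


shards = [
    [("a", 5), ("b", 3)],  # top 2 reported by machine 1
    [("c", 4), ("d", 1)],  # top 2 reported by machine 2
]
print(merge_top_k(shards, 2))
```

Under that assumption, any element in the global top k must also be in its own machine's top k, so sending only k entries per machine loses nothing.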
Database + Heap
- Shard the incoming data and save it to databases.
- Sort each database to get its top k items.
- Combine the k items from each database into a single heap and get the overall top K.
This can store a large amount of data, but if the write QPS is very high, a cache is needed to buffer the writes, and if a machine crashes the cached data is lost. So this approach is not good for write-heavy workloads.
Count-Min Sketch + Heap

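Not perfectly accurate, but highly performant.
- A two-dimensional array.
- Width is usually in the thousands; depth is small (for example 5, one row for each of 5 different hash functions).
  - Whenever an element arrives, compute 5 hash values using the 5 hash functions and increment the counter each one points to (one per row). Take the smallest of those 5 counters as the element's estimated frequency.
  - Update the heap based on the estimated frequency of the new element.
- Count-Min sketch is a fixed-size data structure, so its memory use does not grow with the number of distinct elements.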
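A minimal count-min sketch, under some assumptions, could look like this. The class name, the default width of 2000, and the use of salted `hashlib.blake2b` digests as the 5 hash functions are choices made for this example only; any set of independent hash functions would do.

```python
import hashlib


class CountMinSketch:
    def __init__(self, width=2000, depth=5):
        self.width = width
        self.depth = depth
        # Fixed-size table: one row of counters per hash function.
        self.table = [[0] * width for _ in range(depth)]

    def _cells(self, item):
        # Derive one hash function per row by salting blake2b with the row index.
        for row in range(self.depth):
            digest = hashlib.blake2b(
                item.encode("utf-8"),
                digest_size=8,
                salt=row.to_bytes(8, "little"),
            ).digest()
            yield row, int.from_bytes(digest, "little") % self.width

    def add(self, item, count=1):
        for row, col in self._cells(item):
            self.table[row][col] += count

    def estimate(self, item):
        # Collisions can only inflate a counter, so the minimum is the best guess.
        return min(self.table[row][col] for row, col in self._cells(item))


cms = CountMinSketch()
for _ in range(3):
    cms.add("video-1")
cms.add("video-2")
print(cms.estimate("video-1"))
```

The estimate never undercounts: it is at least the true frequency, and it is larger only when other elements collide with this one in every row.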
Map-Reduce
- We can persist the data in a distributed file system and run a periodic Map-Reduce task to get the k most frequent elements.
- Map-Reduce is time-consuming, so it cannot produce an up-to-date top k list, but it is accurate.
- Two Map-Reduce jobs are needed: one to count the frequency of each element and get the top k for each partition, and one to get the overall top k.
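The two jobs can be simulated in plain Python to show the data flow. This is only an in-process sketch, not a real Map-Reduce program: `job_one`, `job_two`, the reducer count, and the use of `zlib.crc32` as the shuffle hash are all assumptions for illustration.

```python
import heapq
import zlib
from collections import Counter


def job_one(records, num_reducers, k):
    """Job 1: count each element's frequency, then emit a top k per partition."""
    reducers = [Counter() for _ in range(num_reducers)]
    for item in records:
        # Shuffle: a stable hash sends every copy of an item to the same reducer.
        reducers[zlib.crc32(item.encode("utf-8")) % num_reducers][item] += 1
    return [
        heapq.nlargest(k, counts.items(), key=lambda kv: kv[1])
        for counts in reducers
    ]


def job_two(partial_lists, k):
    """Job 2: merge the per-partition lists into the overall top k."""
    merged = [kv for part in partial_lists for kv in part]
    return heapq.nlargest(k, merged, key=lambda kv: kv[1])


records = ["a"] * 5 + ["b"] * 3 + ["c"] * 4 + ["d"]
print(job_two(job_one(records, num_reducers=3, k=2), k=2))
```

Because the shuffle routes all occurrences of an element to one reducer, the counts in the second job are exact.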

Users call the API Gateway, which creates a log.
The distributed messaging system could be Kafka, Kinesis, or SQS, with random partitioning.
Fast Processor
- Creates a count-min sketch and aggregates data over a short period of time.
- Memory is no longer a problem, so there is no need to partition the data.
- Data replication is nice to have, but not strictly required.
Storage can be SQL/NoSQL
- Builds the final count-min sketch and stores a list of the top k elements for a period of time.
- Data replication is required.
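One way the final sketch could be built from the short-interval sketches is cell-by-cell addition, which works because count-min sketches with the same dimensions and hash functions can be merged by summing their counters. The sketch below assumes each sketch is a plain list of rows; `merge_sketches` is a hypothetical helper name.

```python
def merge_sketches(sketches):
    """Merge count-min sketches by summing matching counters.

    Assumes every sketch has the same width, depth, and hash functions.
    """
    depth = len(sketches[0])
    width = len(sketches[0][0])
    merged = [[0] * width for _ in range(depth)]
    for sketch in sketches:
        for row in range(depth):
            for col in range(width):
                merged[row][col] += sketch[row][col]
    return merged


# Two tiny 2x2 sketches from two hypothetical processors.
print(merge_sketches([[[1, 0], [0, 2]], [[3, 1], [1, 0]]]))
```

The merged table gives the same estimates as a single sketch that had seen all of the events.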
Data Partitioner
- Parses batches of events into individual events
- Hash partitioning
- Deals with hot partitions
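The first two responsibilities can be sketched as follows. The function name `partition_batch`, the string event format, and the choice of `zlib.crc32` as the hash are assumptions for this example, and hot-partition handling is deliberately left out.

```python
import zlib
from collections import defaultdict


def partition_batch(batch, num_partitions):
    """Split a batch into individual events and group them by hash partition."""
    partitions = defaultdict(list)
    for event in batch:
        # A stable hash keeps every event for the same key on the same partition.
        index = zlib.crc32(event.encode("utf-8")) % num_partitions
        partitions[index].append(event)
    return dict(partitions)


print(partition_batch(["video-1", "video-2", "video-1", "video-3"], 4))
```

A stable hash is used instead of Python's built-in `hash()`, which is randomized per process for strings and so would route the same key differently on different machines.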
Partition Processor
- Aggregates data in memory over the course of several minutes.
- Generates files of a specified size.
Finally, save the Map-Reduce result to storage as well.

Helpful links: