Scaling to a Distributed Data system introduces substantial overhead. Don’t over-engineer. Make sure that you are designing the system that you need.

You can distribute data 2 ways across a node:

  1. Replication
  2. Partitioning