Natural processes have a preferred progress towards chaos
Goal: "I want to know how many customers I have."
The further your data is from its source, the more annoying it gets to query and manage.
From the Data Science Salary Survey 2015 and The emergence of Spark.
Organizations which design systems ... are constrained to produce designs which are copies of the communication structures of these organizations.
If you don't have big data, here's what your stack looks like:
Infrastructure | Analysis | Visualization |
---|---|---|
Kafka | TensorFlow | Caravel |
Feather | Bayesian Query Language | Jupyter Dashboards |
For more, check out my data post.