One of my favourite questions is: "What problem are we trying to solve?" As techies, we often launch into solutions before we even understand the true nature of the problem. Performance issues on any analytics platform generally fall into one of three categories:
1. Data Load Speed: The ability to load massive volumes of data as quickly as possible.
2. Data Transformation: The ability to maximize throughput and rapidly transform the raw data into a form suitable for queries.
3. Data Query Speed: The ability to minimize the latency of each query and deliver results to business intelligence users as fast as possible.
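
To make the distinction concrete: the first two categories are about throughput (how much data gets through per second), while the third is about latency (how long a single query takes). Here's a minimal sketch of how you might measure each one. The helper names (`timed`, `throughput`, `median_latency`) and the sample numbers are my own illustrations, not part of any particular platform.

```python
import time
from statistics import median
from typing import Callable, Sequence, Tuple


def timed(fn: Callable, *args) -> Tuple[object, float]:
    """Run fn(*args) and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


def throughput(rows_processed: int, elapsed_seconds: float) -> float:
    """Rows per second: the metric for data loading and transformation."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return rows_processed / elapsed_seconds


def median_latency(durations: Sequence[float]) -> float:
    """Median seconds per query: the metric for query speed."""
    return median(durations)


if __name__ == "__main__":
    # Hypothetical figures: a load of 1,000,000 rows that took 20 seconds.
    print(throughput(1_000_000, 20.0))

    # Hypothetical timings (in seconds) for three runs of the same query.
    print(median_latency([0.2, 0.5, 0.3]))
```

Knowing which of these numbers is actually the problem tells you which of the three categories you are dealing with before you reach for a solution.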