Streaming data and method to get a good approximation of COUNT(DISTINCT) problem
So, I am sure you know LogLog / HyperLogLog algorithms to get a good approximation of some interesting business metrics “on-the-fly” (or Streaming Data). From technical side, there are a good technical realization to get well good approximation like database…