The Data Lake and Data Pool are dirty, beware.
Wonder why it requires so much cleaning in order to become useful in Data Analytics?
It has been estimated that 70 to 90% of time is spent on cleaning dirty Data?
Maybe there is a market for an embedded Data cleaning agent?