Dirty data is the bane of the analytics industry. Almost every organization that deals with data has had to deal with some degree of unreliability in its numbers.
According to the Pragmatic Institute, data practitioners spend 80% of their time identifying, cleansing, and arranging data and 20% analyzing it. This 80/20 rule is referred to as the Pareto Principle.