Currently the problem are not the data but to know what to do with them.
Today we have access to a huge amount of data: increasing interactions, the digital lifestyle, the explosion of the so-called Internet of Things (IoT) and technological capabilities in continuous evolution have generated a tsunami of information available. It is estimated that in the world 5 Exabytes of data are created every two days, which is approximately the same amount generated from the beginning of civilization until 2003. We are therefore faced with the paradox of overabundance of data: we have more data than time to analyze them.
The problem is not the data but to know what to do with them. Not so long ago, the problem with many companies was getting information from their customers and knowing what happened. On the contrary, at present the problem is just the opposite: there is an overabundance of data, what some call infoxication. And this excess of information causes the receiver to be unable to understand or assimilate it, preventing decision making based on available data. It is not only the volume and the speed with which these data are generated, but also the variety: the data come from very different sources, some of them very unstructured like text in social networks, images, audio, video, sensors, etc. which do not fit well into commonly used relational databases.