
The growth of data mining is inevitable. It reflects a fundamental change caused by technological advances that, like King Canute’s fabled tide, cannot be stopped or slowed. The opportunity ⎯ or problem, depending on one’s perspective ⎯ derives from two related, yet distinct trends: increases in computing power and decreases in data storage costs.
Many are familiar with the long-term increase in the power of computers. It is most familiarly characterized as Moore’s Law ⎯ named after Intel computer scientist Gordon Moore, who first posited the law in 1965. Moore’s Law predicts that computer chip capacities will double every eighteen to twenty-four months. Moore’s law has been remarkably constant for nearly thirty years, as the graph below demonstrates.

The power of this processing capacity ⎯ which translates almost directly into processing speed ⎯ is immense. It is what drives the information technology tools that power Google and Amazon and make Walmart’s purchasing system a reality. Though no one predicts that processing speed will double indefinitely–surely a physical impossibility ⎯ there is no current expectation that the limits of chip capacity have been reached.
To this trend one must also add the remarkable reduction in the costs of data storage. As the following chart demonstrates, data storage costs have also been decreasing at a logarithmic rate, almost identical to the increases we have experienced in chip capacity, but with an inverse slope.

Here, too, the prospects are for ever-cheaper data storage. One can readily imagine peta-, exa-, or even yottabyte sized personal storage devices. [A petabyte is 10005 bytes, a exabyte is 10006 bytes, and a yottabyte is 10008 bytes.] If that is for the individual, imagine what a large corporation or a government can purchase and maintain.
The story of technology today requires us to answer the question: “What happens when ever-quicker processing power meets ever-cheaper storage capacity?” Data is now pervasively available and pervasively searchable. For large-scale databases of the size maintained by governments or companies, the practical limitations lie in the actual search algorithms used and how they are designed to process the data, not in the chips or the storage units.
For perspective on how science, policy and other events shaped data mining in the United States, please view our interactive timeline feature.