
Data mining is the process of analyzing data to extract information not
o$ered by the raw data alone. Data mining can also begin at a summary
information level (coarse granularity) and progress through increasing levels
of detail (drilling down), or the reverse (drilling up). Data mining occurs on
structured data that are already in a database or a spreadsheet.
Learning Outcome 6.7: De+ne the three organizational methods for
analyzing big data.
Unstructured data do not exist in a fixed location and can include text
documents, PDFs, voice messages, emails, and so on. Three common forms
for mining structured and unstructured data are cluster analysis, association
detection, and statistical analysis. Data mining, big data analytics, and data
visualization are the three methods organizations are using to dissect,
analyze, and understand organizational data. Data mining is the process of
analyzing data to extract information not offered by the raw data alone. Data
mining can also begin at a summary information level (coarse granularity)
and progress through increasing levels of detail (drilling down), or the
reverse (drilling up). Big data is a collection of large, complex data sets,
including structured and unstructured data, which cannot be analyzed using
traditional database methods and tools. Data visualization describes
technologies that allow users to see or visualize data to transform
information into a business perspective.
CLASSROOM OPENER
GREAT BUSINESS DECISIONS – Bill Inmon – The Father of
the Data Warehouse
Bill Inmon, is recognized as the "father of the data warehouse" and
co-creator of the "Corporate Information Factory." He has 35 years of
experience in database technology management and data warehouse
design. He is known globally for his seminars on developing data warehouses
and has been a keynote speaker for every major computing association and
many industry conferences, seminars, and tradeshows.
As an author, Bill has written about a variety of topics on the building, usage,
and maintenance of the data warehouse and the Corporate Information
Factory. He has written more than 650 articles, many of them have been
published in major computer journals such as Datamation, ComputerWorld,
DM Review and Byte Magazine. Bill currently publishes a free weekly
newsletter for the Business Intelligence Network, and has been a major
contributor since its inception. http://www.b-eye-network.com/home/
CLASSROOM EXERCISE
Analyzing Multiple Dimensions of Information