Database Processing, 15e (Kroenke)
Chapter 12: Data Warehouses, Business Intelligence Systems, and Big Data
1) Big Data is the name given to the enormous datasets generated by Web 2.0 applications.
2) Business Intelligence (BI) systems are information systems that help users analyze and use
data.
3) Business Intelligence (BI) systems support operational activities.
4) Business Intelligence (BI) systems obtain data in three different ways.
5) Business Intelligence (BI) reporting systems are used to filter data, sort data, group data, and
make simple calculations based on the data.
6) Business Intelligence (BI) reporting systems can analyze data using standard SQL.
7) Business Intelligence (BI) reporting systems summarize the current status of business
activities and compare that status with past events, but not with predicted future activities.
8) Data mining uses sophisticated statistical and mathematical techniques to perform what-if
analyses, to make predictions, and to facilitate decision making.
9) Report delivery is more important for data mining than it is for reporting systems.
10) A data warehouse is a database system that has data, programs, and personnel specialized in
Business Intelligence (BI) processing.
11) Data warehouses are populated with data prepared by data extraction, transformation, and
load (ETL) programs.
12) Data warehouse data are frequently denormalized.
13) Data warehouses also store the data warehouse metadata.
14) Data warehouses often include data purchased from outside vendors.
15) Metadata about the data’s source, format, assumptions, and constraints are kept in a data
warehouse metadata database.
16) Problematic data are called dirty data.
17) A data mart is a collection of data that addresses a particular component of a functional area
of a business.
18) Operational databases store historical data.
19) Dimensional databases are used for analytical data processing.
20) Dimensional databases use the star schema.
21) Operational databases contain a fact table.
22) A star schema resembles a star, with a dimension table at the center, and fact tables radiating
out from the center.
23) In a snowflake table, unlike tables in a data warehouse, each dimension table is normalized.
24) Business Intelligence (BI) reporting systems are intended to create meaningful information
from disparate data sources and to deliver that information to the proper users on a timely basis.
25) RFM analysis is a way of analyzing and ranking customers based on online survey data.
26) In a common form of RFM analysis, customers are sorted into five groups and given an
associated score depending on their group.
27) In a common form of RFM analysis, customers with an R score of 5 are in the 20% of
customers who have the most recent orders.
28) In RFM analysis, R stands for “how recently.”
29) In RFM analysis, F stands for “how frequently.”
30) In RFM analysis, M stands for “how much money.”
31) In a common form of RFM analysis, a score of 1 is “high” or “good” while a score of 5 is
“low” or “bad.”
32) In a common form of RFM analysis, an RFM score of {5 1 1} means that the customer
orders frequently and orders items of high monetary value but has not ordered anything for some
time.
33) OLAP provides the ability to sum, count, average, and perform other simple arithmetic
operations on groups of data.
34) An OLAP cube is limited to three axes.
35) The term drill down refers to the capability of seeing the data in smaller and smaller units.
36) To create an OLAP report for an SQL Server 2017 database, use the PivotTable tool in SQL
Server 2017.
37) When creating an OLAP report based on SQL Server 2017 data, it is often a good idea to
create a view to organize the data needed for the OLAP report.
38) Microsoft Excel 2016 allows us to connect directly to an SQL Server 2017 database when
building a PivotTable.
39) Microsoft Excel 2016 cannot import SQL Server 2017 data directly into a PivotTable report,
but must first place the data into a worksheet.
40) Although Microsoft Excel 2016 will create a PivotTable report using SQL Server 2017 data,
it does not have formatting tools that can be used with the report.
41) Data mining is the application of mathematical and statistical techniques to find patterns and
relationships that can be used to classify and predict future outcomes.
42) Most data mining techniques are simple and easy to use.
43) The movement that uses different database methods than the relational model and/or SQL is
called the NoSQL movement.
44) Most of NoSQL nonrelational database methodologies are known as structured storage.
45) NoSQL really stands of “Not only SQL.”
46) Facebook uses the Apache Software Foundation’s Cassandra NoSQL database.
47) Amazon.com’s Dynamo was an early example of structured storage.
48) Google’s Bigtable was an early example of structured storage.
49) Structured storage column families are indistinguishable from relational database tables.
50) In the MapReduce process, the Reduce step is followed by the Map step.
51) Business Intelligence (BI) systems do which of the following?
A) Analyze current and past activities
B) Predict future events
C) Record and process transactions
D) Both A and B are correct
52) Business Intelligence (BI) systems fall into which of the following categories?
A) Processing
B) Reporting
C) Data mining
D) Both B and C are correct
53) Business Intelligence (BI) systems obtain their data by all of the following means except
________.
A) read and process data from an operational database
B) process extracts from operational databases
C) process data purchased from data vendors
D) read and process data entered by BI system users
54) Business Intelligence (BI) reporting systems can do which of the following operations?
A) Filter data
B) Group data
C) Modify data
D) Both A and B are correct
55) One Business Intelligence (BI) reporting system that uses extensions to SQL is ________.
A) cluster analysis
B) OLAP
C) regression analysis
D) RFM analysis
56) Data mining applications are used to accomplish all of the following tasks except ________.
A) perform what-if analysis
B) make predications
C) facilitate decision making
D) update the database
57) Which of the following is/are true about data mining applications?
A) They use sophisticated mathematical techniques.
B) They use sophisticated statistical techniques.
C) Their report delivery is more important than report delivery for reporting systems.
D) Both A and B are correct
58) Which of the following is not a reason that operational data are difficult to read?
A) Dirty data
B) Current data
C) Nonintegrated data
D) Missing values
59) We have obtained access to the company’s operational data. In one record, we find that a
customer’s age has been recorded as “337.” This is an example of ________.
A) dirty data
B) inconsistent data
C) nonintegrated data
D) a “wrong format” problem
60) We have obtained access to the company’s operational data. We examine 50 records for
customers with phone numbers that should use the current area code of 345. Of these 50 records,
we find 10 that still use an older area code of 567. This is an example of ________.
A) dirty data
B) inconsistent data
C) nonintegrated data
D) a “too much data” problem
61) We have obtained access to the company’s operational data. We have been asked to produce
a report with an item by item analysis of sales, but the only sales figure available is the total sale
value for each order. This is an example of ________.
A) dirty data
B) inconsistent data
C) nonintegrated data
D) a “wrong format” problem
62) A data warehouse database differs from an operational database because ________.
A) data warehouse data are not stored in tables
B) data warehouse databases do not have metadata
C) data warehouse data are often denormalized
D) Both B and C are correct
63) Which of the following is not a component of a data warehouse?
A) Data extract, transform, and load (ETL) preparation programs
B) Data warehouse data
C) Operational database updates
D) Data warehouse metadata
64) A data mart differs from a data warehouse in that ________.
A) it has a larger database
B) it deals with a particular component or functional area of the business
C) data mart users must have more data management expertise than data warehouse employees
D) it is updated more frequently by the data mart users