All of the following statements about data mining are true EXCEPT
A) the process aspect means that data mining should be a one-step process to results.
B) the novel aspect means that previously unknown patterns are discovered.
C) the potentially useful aspect means that results should lead to some business benefit.
D) the valid aspect means that the discovered patterns should hold true on new data.
In the Big Data and Analytics in Politics case study, which of the following was an
input to the analytic system?
A) census data
B) assessment of sentiment
C) voter mobilization
D) group clustering
In the Clinical Decision Support System case study, what was the system’s output?
A) a diagnosis of the type of tendon injury suffered by the patient
B) a treatment and rehabilitation plan for the patient
C) an explanation of the tendon anatomy of the patient
D) a referral to specialists who could accurately diagnose the tendon injury
In the health sciences, the largest potential source of Big Data comes from
A) accounting systems.
B) human resources.
C) patient monitoring.
D) research administration.
What is Six Sigma?
A) a letter in the Greek alphabet that statisticians use to measure process variability
B) a methodology aimed at reducing the number of defects in a business process
C) a methodology aimed at reducing the amount of variability in a business process
D) a methodology aimed at measuring the amount of variability in a business process
In the heart disease diagnosis case study, what was a benefit of the SIPMES expert
system?
A) Expert systems from other domains were used, saving development time.
B) The SIPMES system agreed with human experts 64% of the time.
C) The SIPMES system could diagnose all types of cardiovascular diseases.
D) No human expert knowledge was needed in development, only textbook knowledge.
How does the use of cloud computing affect the scalability of a data warehouse?
A) Cloud computing vendors bring as much hardware as needed to users’ offices.
B) Hardware resources are dynamically allocated as use increases.
C) Cloud vendors are mostly based overseas where the cost of labor is low.
D) Cloud computing has little effect on a data warehouse’s scalability.
Web site usability may be rated poor if
A) the average number of page views on your Web site is large.
B) the time spent on your Web site is long.
C) Web site visitors download few of your offered PDFs and videos.
D) users fail to click on all pages equally.
Which broad area of data mining applications analyzes data, forming rules to
distinguish between defined classes?
A) associations
B) visualization
C) classification
D) clustering
In developing an artificial neural network, all of the following are important reasons to
pre-select the network architecture and learning method EXCEPT
A) some configurations have better success than others with specific problems.
B) development personnel may be more experienced with certain architectures.
C) most neural networks need special purpose hardware, which may be absent.
D) some neural network software may not be available in the organization.
Big Data often involves a form of distributed storage and processing using Hadoop and
MapReduce. One reason for this is
A) centralized storage creates too many vulnerabilities.
B) the “Big” in Big Data necessitates over 10,000 processing nodes.
C) the processing power needed for the centralized model would overload a single
computer.
D) Big Data systems have to match the geographical spread of social media.
Revenue management systems modify the prices of products and services dynamically
based on
A) intuition, demand, and supply.
B) intuition, competition, and supply.
C) business rules, demand, and supply.
D) business rules, supply, and intuition.
The deployment of large data warehouses with terabytes or even petabytes of data has been
crucial to the growth of decision support. All of the following explain why EXCEPT
A) data warehouses have enabled the affordable collection of data for analytics.
B) data warehouses have enabled the collection of decision makers in one place.
C) data warehouses have assisted the collection of data for data mining.
D) data warehouses have assisted the collection of data from multiple sources.
The fact that many organizations share many similar problems means that in sourcing a
DSS, it is often wiser to acquire a(n)
A) ready-made DSS.
B) custom-made DSS.
C) offshored DSS.
D) consultant-developed DSS.
What BEST describes a simulation model with a limited number of variables, each with
a finite number of values?
A) Monte Carlo simulation
B) discrete event simulation
C) continuous distribution simulation
D) system dynamics simulation
All of the following statements about MapReduce are true EXCEPT
A) MapReduce is a general-purpose execution engine.
B) MapReduce handles the complexities of network communication.
C) MapReduce handles parallel programming.
D) MapReduce runs without fault tolerance.
What does Web structure mining involve?
A) analyzing the uniform resource locators in Web pages
B) analyzing the unstructured content of Web pages
C) analyzing the pattern of visits to a Web site
D) analyzing the PageRank and other metadata of a Web page
Define the term sensitivity analysis as it relates to ANNs.
The data mining in cancer research case study explains that data mining methods are
capable of extracting patterns and ________ hidden deep in large and complex medical
databases.
________ simulation refers to building a model of a system where the interaction
between different entities is studied.
Briefly describe five techniques (or algorithms) that are used for classification
modeling.
Briefly describe three benefits (process gains) derived from working in groups.
At a very high level, the text mining process can be broken down into three consecutive
tasks, the first of which is to establish the ________.
________-based groupware is the norm for anytime/anyplace collaboration.
In sentiment analysis, the task of differentiating between a fact and an opinion can also
be characterized as calculation of ________ polarity.
HBase is a nonrelational ________ that allows for low-latency, quick lookups in
Hadoop.
One way to accomplish privacy and protection of individuals’ rights when data mining
is by ________ of the customer records prior to applying data mining applications, so
that the records cannot be traced to an individual.
Hadoop is primarily a(n) ________ file system and lacks capabilities we’d associate
with a DBMS, such as indexing, random access to data, and support for SQL.
Groupware products provide a way for groups to share resources and opinions.
Groupware implies the use of networks to connect people, even if they are in the same
room.
What are three general features of groupware products that support communication,
collaboration, and coordination?
A scenario is a statement of assumptions about the operating environment of a particular
system at a given time; that is, it is a narrative description of the decision-situation
setting. What does a scenario describe, and what may it also provide?
List four of Mintzberg’s decisional roles of managers.
List four myths associated with data mining.
In the mathematical formulation of SVMs, normalization and/or scaling are
important steps to guard against variables/attributes with ________ that might
otherwise dominate the classification formulae.