Knowledge-based management subsystems provide intelligence to augment the
decision maker’s own intelligence.
The main difference between service level agreements and key performance indicators
is the audience.
In the Army expertise transfer system case, knowledge nuggets from interviewees
needed no further vetting before use in the ETS.
In service-oriented DSS, an application programming interface (API) serves to populate
source systems with raw data and to pull operational reports.
In text mining, creating the term-document matrix includes all the terms that are
included in all documents, making for huge matrices only manageable on computers.
Companies understand that when their product goes “viral,” the content of the online
conversations about their product does not matter, only the volume of conversations.
Simulation does not usually allow decision makers to see how a solution to a complex
problem evolves over (compressed) time, nor can decision makers interact with the
simulation.
The inference engine, also known as the control structure or the rule interpreter (in
rule-based ES), is essentially a computer program that provides a methodology for
reasoning about information in the knowledge base and on the blackboard to formulate
appropriate conclusions.
In the life coach case study, Kaggle recently hosted a competition aimed at identifying
muscle motions that may be used to predict the progression of Alzheimer’s disease.
One comparison typically made when data is presented in business intelligence systems
is a comparison against historical values.
In order to be effective, analysts must use models to solve problems with no regard to
the organizational culture to find optimal results.
ES/DSS were found to improve the performance of new managers but not existing
managers.
A well-designed data warehouse means that user requirements do not have to change as
business needs change.
Internet- and intranet-based group decision support systems (GDSS) are less popular
than special-purpose decision rooms.
Business intelligence systems typically support solving a certain problem or evaluate an
opportunity, while decision support systems monitor situations and identify problems
and/or opportunities, using analytic methods.
The cost of data storage has plummeted recently, making data mining feasible for more
firms.
Modeling can be viewed as a science in its entirety.
Genetic algorithms are heuristic methods that do not guarantee an optimal solution to a
problem.
Using expected value (EV) with decision trees is totally appropriate for situations
where one outcome could lead to an immense loss for the company.
Web site visitors who critique and create content are more engaged than those who join
networks and spectate.
In the Memphis Police Department case study, predictive analytics helped to identify
the best schedule for officers in order to pay the least overtime.
Because the recession has raised interest in low-cost open source software, it is now set
to replace traditional enterprise software.
In the opening vignette, the high accuracy of the models in predicting the outcomes of
complex medical procedures showed that data mining tools are ready to replace experts
in the medical field.
One of the four components of BI systems, business performance management, is a
collection of source data in the data warehouse.
In the Coors case study, genetic algorithms were of little use in solving the flavor
prediction problem.
Big Data uses commodity hardware, which is expensive, specialized hardware that is
custom built for a client or application.
OLTP systems are designed to handle ad hoc analysis and complex queries that deal
with many data items.
The process approach to knowledge management may limit innovation and force
participants into fixed patterns of thinking.
In the opening vignette, the CERN Data Aggregation System (DAS), built on
MongoDB (a Big Data management infrastructure), used relational database
technology.
In the mining industry case study, the input to the neural network is a verbal description
of a hanging rock on the mine wall.
The industry impact of an automated decision system’s use is limited to the company’s
supply chain.
Service-oriented DSS solutions generally offer individual or bundled services to the
user as a service.
Using support vector machines, you must normalize the data before you numericize it.
In the Isle of Capri case, the only capability added by the new software was increased
processing speed of processing reports.
While cloud services are useful for small and midsize analytic applications, they are
still limited in their ability to handle Big Data applications.
In the student retention case study, which of the following variables was MOST
important in determining whether a student dropped out of college?
A) high school GPA and SAT high score math
B) college and major
C) completed credit hours and hours enrolled
D) marital status and hours enrolled
Understanding customers better has helped Amazon and others become more
successful. The understanding comes primarily from
A) collecting data about customers and transactions.
B) developing a philosophy that is data analytics-centric.
C) analyzing the vast data amounts routinely collected.
D) asking the customers what they want.
Traditional data warehouses have not been able to keep up with
A) the evolution of the SQL language.
B) the variety and complexity of data.
C) expert systems that run on them.
D) OLAP.
In which stage of extraction, transformation, and load (ETL) into a data warehouse are
anomalies detected and corrected?
A) transformation
B) extraction
C) load
D) cleanse
Third party providers of publicly available datasets protect the anonymity of the
individuals in the data set primarily by
A) asking data users to use the data ethically.
B) leaving in identifiers (e.g., name), but changing other variables.
C) removing identifiers such as names and social security numbers.
D) letting individuals in the data know their data is being accessed.
A critical emerging trend in analytics is the incorporation of location data. ________
data is the static location data used by these location-based analytic applications.
The advantages of visual interactive simulation include all of the following EXCEPT
A) the ability to see how a simulation works.
B) improved presentation of simulation results.
C) reduced need for decision maker involvement.
D) improvements in training using the simulation.
What is a major drawback to the basic majority voting classification in kNN?
A) It requires frequent human subjective input during computation.
B) Classes that are more clustered tend to dominate prediction.
C) Even the naive version of the algorithm is hard to implement.
D) Classes with more frequent examples tend to dominate prediction.
Which of the following is NOT an attribute of knowledge?
A) Knowledge is not subject to diminishing returns.
B) Knowledge fragments as it grows.
C) There is a need to refresh knowledge periodically for competitive advantage.
D) The value of knowledge is easily quantified.
In a Hadoop “stack,” what node periodically replicates and stores data from the Name
Node should it fail?
A) backup node
B) secondary node
C) substitute node
D) slave node
Which of the following is true about the furtherance of homeland security?
A) There is a lessening of privacy issues.
B) There is a greater need for oversight.
C) The impetus was the need to harvest information related to financial fraud after
2001.
D) Most people regard analytic tools as mostly ineffective in increasing security.
All the following statements about hidden layers in artificial neural networks are true
EXCEPT
A) hidden layers are not direct inputs or outputs.
B) more hidden layers increase required computation exponentially.
C) many top commercial ANNs forgo hidden layers completely.
D) more hidden layers include many more weights.
All of the following statements about decision style are true EXCEPT
A) autocratic styles are authority-based.
B) decision styles are consistent among top managers.
C) heuristic styles can also be democratic.
D) decision styles may vary among lower-level managers.
Knowledge can be best described as
A) facts, measurements, and statistics.
B) an organized collection or set of facts.
C) facts, measurements, and statistics that are validated.
D) organized facts set in context and actionable.
In the InterContinental Hotel Group case study, the mathematical model used to
increase profits was based on
A) a simulation model that tried out many options.
B) a system that collated the subjective inputs of managers.
C) a mathematical model that used two variables: price and day of the week.
D) an optimization model that used multiple variables.
In the Cabela’s case study, what types of models helped the company understand the
value of customers, using a five-point scale?
A) reporting and association models
B) simulation and geographical models
C) simulation and regression models
D) clustering and association models
In a Hadoop “stack,” what is a slave node?
A) a node where bits of programs are stored
B) a node where metadata is stored and used to organize data processing
C) a node where data is stored and processed
D) a node responsible for holding all the source programs
The ________ of Big Data is its potential to contain more useful patterns and
interesting anomalies than “small” data.
Which of the following offers a flexible data integration platform based on a newer
generation of service-oriented standards that enables ubiquitous access to any type of
data?
A) EAI
B) EII
C) IaaS
D) ETL
In which stage of extraction, transformation, and load (ETL) into a data warehouse are
data aggregated?
A) transformation
B) extraction
C) load
D) cleanse
Which data warehouse architecture uses metadata from existing data warehouses to
create a hybrid logical data warehouse comprised of data from the other warehouses?
A) independent data marts architecture
B) centralized data warehouse architecture
C) hub-and-spoke data warehouse architecture
D) federated architecture
Understanding which keywords your users enter to reach your Web site through a
search engine can help you understand
A) the hardware your Web site is running on.
B) the type of Web browser being used by your Web site visitors.
C) most of your Web site visitors’ wants and needs.
D) how well visitors understand your products.
All of the following are suitable problems for genetic algorithms EXCEPT
A) dynamic process control.
B) simulation of biological models.
C) pattern recognition with complex patterns.
D) simple optimization with few variables.
Which of the following statements about expected utility is true?
A) It does not affect decisions made with expected values.
B) Used in decision making, it is an objective value, not subjective.
C) Used in decision making, it can bring huge risk to a small startup with limited
resources.
D) In calculating utility, it assumes the decision will be made thousands of times,
making the probabilities more likely on average.
Clickstream analysis is most likely to be used for all the following types of applications
EXCEPT
A) determining the lifetime value of clients.
B) hiring new functional area managers.
C) designing cross-marketing strategies across products.
D) predicting user behavior.
Sentiment classification usually covers all the following issues EXCEPT
A) classes of sentiment (e.g., positive versus negative).
B) range of polarity (e.g., star ratings for hotels and for restaurants).
C) range in strength of opinion.
D) biometric identification of the consumer expressing the sentiment.
In handling uncertainty in decision modeling, the optimistic approach assumes
A) the best possible outcome of most alternatives will occur.
B) the best possible outcome of some alternatives will occur.
C) the best possible outcome of each alternative will occur.
D) the best possible outcome of one alternative will occur.
In the sport talents identification case study, the expert system was calibrated with
expertise from
A) multiple sports experts.
B) one overall sports expert.
C) ) the system developer.
D) subjects in the cases used to create the ES.
In modeling, an optimal solution is understood to be
A) a solution found in the least possible time and using the least possible computing
resources.
B) a solution that can only be determined by an exhaustive enumeration and testing of
alternatives.
C) a solution that is the best based on criteria defined in the design phase.
D) a solution that requires an algorithm for determination.
The “single version of the truth” embodied in a data warehouse such as Capri Casinos’
means all of the following EXCEPT
A) decision makers get to see the same results to queries.
B) decision makers have the same data available to support their decisions.
C) decision makers get to use more dependable data for their decisions.
D) decision makers have unfettered access to all data in the warehouse.
Describe the Turing test for determining whether a computer exhibits intelligent
behavior.
Web 2.0 is the popular term for describing advanced Web technologies and applications.
Describe four main representative characteristics of the Web 2.0 environment.
When considering Big Data projects and architecture, list and describe five challenges
designers should be mindful of in order to make the journey to analytics competency
less stressful.
Deciding to purchase an FDIC-insured Certificate of Deposit at a U.S. bank can be
viewed as decision making under ________.
In the Hepatitis B case study, Markov models were used to determine the
cost-________ of various governmental interventions for Hepatitis B.
Managers usually make decisions by following a four-step process. What are the steps?
List four well-known search methods used in the choice phase of problem solving.
The ________ Model, also known as the EDW approach, emphasizes top-down
development, employing established database development methodologies and tools,
such as entity-relationship diagrams (ERD), and an adjustment of the spiral
development approach.
The basic idea behind a ________ is that it recursively divides a training set until each
division consists entirely or primarily of examples from one class.
Compare and contrast decision making under uncertainty, risk and certainty.
What is the most common method for treating risk in decision trees and tables?
After an ES system is built, it must be evaluated in a two-step process. The first step,
________, ensures that the resulting knowledge base contains knowledge exactly the
same as that acquired from the expert.
The knowledge possessed by human experts is often lacking in ________ and not
explicitly expressed.
What is the definition of text analytics according to the experts in the field?
Early definitions of a(n) ________ identified it as a system intended to support
managerial decision makers in semistructured and unstructured decision situations.
In the Expedia case study, what three steps were taken to convert drivers of
departmental performance into a scorecard?
________ (or reasoning) is the process of using the rules in the knowledge base along
with the known facts to draw conclusions.
Describe data stream mining and how it is used.
In the opening vignette, the combination of filed infrastructure, geospatial data,
enterprise data warehouse, and analytics has enabled OG&E to manage its customer
demand in such a way that it can optimize its ________ investments.
An agent-based modeling approach focuses on modeling a(n) “________” property
rather than “optimizing” nature.