Big data Big DataDr Marwan

subject Type Homework Help
subject Pages 20
subject Words 1527
subject School N/A
subject Course N/A

Unlock document.

This document is partially blurred.
Unlock all pages and 1 million more documents.
Get Access
Big Data
Dr. Marwan Al-Tawil
Introduction
What is the difference between Big Data, Data Science and Data
analytics ?
Big data: refers to significant volumes of data that cannot be processed
effectively with the traditional applications that are currently used. The
processing of big data begins with raw data that isnt aggregated and is most
often impossible to store in the memory of a single computer.
Data science: Dealing with unstructured and structured data, data science is
a field that comprises everything that is related to data cleansing,
preparation, and analysis.
Data analytics: is the science of examining raw data to reach certain
conclusions.
Data types
Data are raw facts that have not been processed to produce meaning.
Under the notion of Big Data, data can be one of three types:
Structured Data: Data stored in Tabular format (rows and columns that
are related). It is clearly defined and stored in a pre-defined data model.
E.g. Excel files or SQL (relational) databases.
- Two sources of structured data: Machine generated and Human
generated data
Around 20% of data are structured. Proper view and understanding of data
(tables represent related data, and links between table shows relationships)
Data types
Unstructured Data: have not pre-defined structure (it has no Data
Model).
- It is described as irregular and ambiguous.
- around 80% of data are unstructured.
- Examples, satellite images, human generated videos, Audio and Text ,
PDF documents (Social Media Data).
- Two sources of structured data: Machine generated and Human
generated data
Semi-structured data: difficult to categorise, and we can’t store this type
of data in traditional database models
- examples include XML, JSON
Example of structured data
Examples of Unstructured data
page-pf7
Example of Semi-structured Data
An example of a JSON object is:
{
"ID": "22222",
"name": {
"firstname: "Albert",
page-pf8
page-pf9
page-pfa
page-pfb
page-pfc
page-pfd
page-pfe
page-pff
page-pf10
page-pf11
page-pf12
page-pf13
page-pf14
page-pf15
page-pf16
page-pf17
page-pf18
page-pf19
page-pf1a

Trusted by Thousands of
Students

Here are what students say about us.

Copyright ©2022 All rights reserved. | CoursePaper is not sponsored or endorsed by any college or university.