User Tools

Site Tools


big_data

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
big_data [2014/07/23 15:55] hkimscilbig_data [2014/07/23 16:21] (current) hkimscil
Line 9: Line 9:
 Organizations Organizations
   * Relational Data Model: RDBMS (Relational Database Management System), mainly implemently by SQL (Structured Query Language).   * Relational Data Model: RDBMS (Relational Database Management System), mainly implemently by SQL (Structured Query Language).
-  * Entity-Relationship Data Model (ER): . . . It added additional abstraction to increase the usability of the data. In the model, each item was defined independently of its use. Therefore, developers could create new relationships between data sources without complex programming [[FootNote(Big Data For Dummies, 2013)]].+  * Entity-Relationship Data Model (ER): . . . It added additional abstraction to increase the usability of the data. In the model, each item was defined independently of its use. Therefore, developers could create new relationships between data sources without complex programming ((Big Data For Dummies, 2013)).
   * Data warehouse in 90s   * Data warehouse in 90s
   * Beginning of unstructured data use -- BLOBs (Binary Large Objects)   * Beginning of unstructured data use -- BLOBs (Binary Large Objects)
Line 22: Line 22:
 {{:bigdataarchitecture.jpg?direct| Big Data Architecture}} {{:bigdataarchitecture.jpg?direct| Big Data Architecture}}
  
 +Sources of big structured data 
 +  * Computer-generated or Machine-generated Data
 +  * Human-generated Data
  
   * Computer-generated or Machine-generated Data   * Computer-generated or Machine-generated Data
 +    * Sensor data . . . . RFID tags, Smart meters, medical devices, GPS data, etc. 
 +    * Web log data . . . Google analytics, 
 +    * Point-of-sale data . . . Cashiers' swipes . . . . 
 +    * Financial data
   * Human-generated Data   * Human-generated Data
-  Hybrid Data . . . . +    Input data . . . 
 +    * Click stream data . . . .  
 +    * Gaming-related data . . . . 
 + 
 +Sources of unstructured data 
 +Exploring sources of unstructured data  
 + 
 +Unstructured data is everywhere. In fact, most individuals and organizations conduct their lives around unstructured data. Just as with structured data, unstructured data is either machine generated or human generated
  
-  Sensor data . . . . RFID tags, Smart meters, medical devices, GPS dataetc.  +Here are some examples of machine-generated unstructured data:  
-  * Web log data . . . Google analytics,  +  Satellite images: This includes weather data or the data that the government captures in its satellite surveillance imageryJust think about Google Earthand you get the picture (pun intended).  
-  * Point-of-sale data . . . Cashiers' swipes . . . .  +  * Scientific data: This includes seismic imagery, atmospheric data, and high energy physics.  
-  * Financial data+  * Photographs and video: This includes security, surveillance, and traffic video.  
 +  * Radar or sonar data: This includes vehicular, meteorological, and oceanographic seismic profiles. 
  
 +The following list shows a few examples of human-generated unstructured data: 
 +  * Text internal to your company: Think of all the text within documents, logs, survey results, and e-mails. Enterprise information actually represents a large percent of the text information in the world today. 
 +  * Social media data: This data is generated from the social media platforms such as YouTube, Facebook, Twitter, LinkedIn, and Flickr. 
 +  * Mobile data: This includes data such as text messages and location information. 
 +  * Website content: This comes from any site delivering unstructured content, like YouTube, Flickr, or Instagram. 
big_data.txt · Last modified: 2014/07/23 16:21 by hkimscil

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki