book

Q: What is big data?

  • Find FOUR really big datasets. They should be sufficiently different. Cite your sources.
  • Discuss and determine a ranking in terms of "bigness".
  • Fill the template below. Replace all (( )) with your answers.

Rank 1: ((Amazon))

(( These data sets include photos, text, url links, and personal information.)

((Huge ammount of traffic))

Rank 2: ((Google Maps/Twitter/Facebook))

((Photos, Tweets, url links, vidoes, and personal information))

((Photos and videos are large))

Rank 3: ((Government Data Sets))

((Personal information, tax information, work history.))

((There are a large number of people in the U.S. and this data is held on to for a long time.))

Rank 4: ((Scientific Data Sets))

((Scientific data that may be recieved from a scientific mission e.g. satellite mission))

((There is a constant flow of information from a satellite to the Earth.))