Friday, September 7, 2012

Tech Review Big Data: Thursday, September 6, 2012


Big data
  • a collection of data sets so large and complex that it becomes awkward to work with using on-hand database management tools.
  • Difficulties
    • capture
    • storage
    • search
    • sharing
    • analysis
    • visualization
  • What is considered "big data" varies depending on the capabilities of the organization managing the data set.
  • Big data sizes are a constantly moving target; as of 2012 they range from
    • a few dozen terabytes
    • to many petabytes in a single data set
  • A new platform of "big data" tools has arisen to handle data at this scale
    • Apache Hadoop
  • MIKE2.0
  • Doug Laney
    • data growth challenges and opportunities are three-dimensional
      • increasing volume (amount of data)
      • velocity (speed of data in and out)
      • variety (range of data types and sources)
  • Big players in big data
    • Oracle
    • IBM
    • Microsoft
    • SAP
    • HP
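
To get a feel for why data sets in the range noted above (a few dozen terabytes to many petabytes) become awkward for on-hand tools, here is a rough back-of-the-envelope sketch. The 100 MB/s read rate, the worker counts, and the two example sizes are assumptions picked for illustration only, not measurements of any product listed here.

```python
# Back-of-the-envelope: how long does one full pass over the data take?
# All figures below are hypothetical and chosen only for illustration.

TB = 10**12  # bytes in a (decimal) terabyte
PB = 10**15  # bytes in a (decimal) petabyte

# Assumed sequential read rate per worker: 100 MB/s (an assumption).
ASSUMED_READ_RATE = 100 * 10**6


def scan_hours(size_bytes, bytes_per_second, workers=1):
    """Hours to read size_bytes once, split evenly across workers."""
    if workers < 1 or bytes_per_second <= 0:
        raise ValueError("workers and bytes_per_second must be positive")
    return size_bytes / (bytes_per_second * workers) / 3600


if __name__ == "__main__":
    for label, size in [("36 TB", 36 * TB), ("5 PB", 5 * PB)]:
        for workers in (1, 100):
            hours = scan_hours(size, ASSUMED_READ_RATE, workers)
            print(f"{label} with {workers} worker(s): {hours:,.1f} hours")
```

Under these assumptions a single machine needs days just to read a few dozen terabytes once, which is one reason the tools in this space spread the work across many machines.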


10 Steps for Testing and Choosing a Big Data Appliance




Marko Grobelnik



IBM:  What is big data?
  • Spans four dimensions
    • Volume
    • Velocity
    • Variety
    • Veracity
  • big data is more than simply a matter of size
IBM big data platform

Search for "IBM what is big data?" for more reading.

O'Reilly:  What is big data?
  • data that exceeds the processing capacity of conventional database systems
    • too big
    • too fast
    • doesn't fit the structures of database architectures
Google BigQuery


Stanford University:  Data Mining Certificates Online

  • concepts not tools
  • doesn't seem to be hands-on
Texas A&M:  Data Mining Certificate
  • SAS classes
  • heavily statistics-based

Who are the top influencers in Big Data, Analytics, Data Mining?


Big Data on Campus
  • should be called big data in education
Online Education in Analytics, Data Mining and Data Science



Big Data University



Web Intelligence and Big Data



Army: Manning Snuck 'Data-Mining' Software Onto Secret Network
