Big data
- a collection of data sets so large and complex that it becomes awkward to work with using on-hand database management tools.
- Difficulties
- capture
- storage
- search
- sharing
- analysis
- visualization
- What is considered "big data" varies depending on the capabilities of the organization managing the data set.
- Big data sizes are a constantly moving target
- few dozen terabytes
- many petabytes
- new platform of "big data" tools
- Apache Hadoop
- MIKE2.0
- http://en.wikipedia.org/wiki/MIKE2.0_Methodology
- Method for an Integrated Knowledge Environment
- Open source
- delivery methodology for Enterprise information management
- Doug Laney
- data growth challenges and opportunities are three-dimensional
- increasing volume (amount of data)
- velocity (speed of data in and out)
- variety (range of data types and sources)
- Big players in big data
- Oracle
- IBM
- Microsoft
- SAP
- HP
10 Steps for Testing and Choosing a Big Data Appliance
Marko Grobelnik
IBM: What is big data?
- Spans four dimension
- Volume
- Velocity
- Variety
- Veracity
- big data is more than simply a matter of size
IBM big data platform
Do a search on "ibm what is big data?" for some more reading.
O'Reilly: What is big data?
- data that exceeds the processing capacity of conventional db systems.
- too big
- too fast
- doesn't fit structures of db architectures
Google BigQuery
Stanford University: Data Mining Certificates Online
- concepts not tools
- doesn't seem to be hands on
Texas A&M: Data Mining Certificate
- SAS classes
- heavily statistic based
Who are the top influencers in Big Data, Analytics, Data Mining?
Big Data on Campus
- should be called big data in education
Online Education in Analytics, Data Mining and Data Science
Big Data University
Web Intelligence and Big Data
Army: Manning Snuck 'Data-Mining' Software Onto Secret Network
No comments:
Post a Comment