Monday, April 27, 2015

Download Data For Hadoop Practice

I am providing some links below to download large amounts of data for practice:

  • GROUPLENS LABORA

  • INFOCHIMPS

 Stanford network data collection: http://snap.stanford.edu/data/index.html

 Open Flights: crowd-sourced flight data, http://openflights.org/

 Flight arrival data: http://stat-computing.org/dataexpo/2009/the-data.html

 OpenStreetMap.org: OpenStreetMap is a free worldwide map created by its users. The geo and map data is available for download.
openstreetmap.org

 Natural Earth Data: http://www.naturalearthdata.com/downloads/

 Geocomm: http://data.geocomm.com/drg/index.html

 Geonames data: http://www.geonames.org/

 US GIS Data: available from http://libremap.org/

Web data

 Wikipedia data

 Google N-gram data

 Public terabyte data: web crawl data

 Freebase data: a variety of data available from http://www.freebase.com/

 Stack Overflow data: http://blog.stackoverflow.com/category/cc-wiki-dump/

 UCI KDD data: http://kdd.ics.uci.edu/
 Proceedings from Statistical Machine Translation


 World Bank Data: http://datacatalog.worldbank.org/

 Public health data sets: http://phpartners.org/health_stats.html

 National Institutes of Health: http://projectreporter.nih.gov/reporter.cfm

 Aid information: http://www.aidinfo.org/data

 UN Data: http://data.un.org/Explorer.aspx
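
Since many of the data sets listed above are large, it may help to stream each download to disk rather than load it into memory. The following is only a minimal sketch in Python using the standard library. The function name `download_dataset`, the `.part` suffix, and the 1 MiB chunk size are my own assumptions for illustration and are not taken from any of the sites above.

```python
import shutil
import urllib.request
from pathlib import Path


def download_dataset(url: str, dest: Path, chunk_size: int = 1 << 20) -> Path:
    """Stream the resource at `url` to `dest` in chunks and return `dest`.

    Hypothetical helper for illustration; `chunk_size` defaults to 1 MiB.
    """
    # Create the target directory if it does not exist yet.
    dest.parent.mkdir(parents=True, exist_ok=True)

    # Write under a temporary name first, so an interrupted download
    # does not leave a truncated file under the final name.
    partial = dest.with_name(dest.name + ".part")

    with urllib.request.urlopen(url) as response, open(partial, "wb") as out:
        # copyfileobj reads and writes `chunk_size` bytes at a time.
        shutil.copyfileobj(response, out, chunk_size)

    # Rename the finished file into place.
    partial.replace(dest)
    return dest
```

A call might look like `download_dataset(file_url, Path("data/flights.csv"))`, where `file_url` stands for the direct link to one specific file picked from the pages above (the links in this post mostly point to index pages, not to the files themselves). Once the file is on local disk, it can be copied into HDFS with something like `hdfs dfs -put data/flights.csv /user/yourname/`, assuming a configured Hadoop client; the paths here are placeholders.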
