1. HADOOP: is a large scales batch data processing system it is important because all large websites use it to process and store big data.
2.Cloudera: Gives Big companies a data platform a variation of HADOOP and gives the user a more enhanced storage and processing system.
3. PIG: is a platform language is a used for analyzing data sets and lets you perform actions like merging and filtering data sets. and users can create their own functions as well.
4. HIVE: is a data warehouse software built from Hadoop that allows data sets and querying and uses QL language. It does not allow real time and row level query .
5. Cassandra : is a highly scalable distributed and decentralized data store that is very efficient for fast writing and reading. It was created at Facebook
6. Mahout: another variation of hadoop that allows for classification, listing and clusters information, it is what allows some websites to show recommended items.
No comments:
Post a Comment