Need help answering data science questions.
Need to Answer 40 multiple choice questions.
Must be good at:
1. Hadoop and map reduce
2. Distributed computing
3. Parallel computing
4. Cloud computing
5. What is hive and why hive?
6. Basics of probability and machine learning
7. Linear regression, classification and clustering