Explain the five vs of big data?
What are Apache Spark, Flume, Lucene, Hama, HCatalog, Mahout, Drill, Crunch and Thrift?
Why big data use?
How would you pipeline large amounts of data?
Name some Big Data products?
What is big data lake?
Name the components of hdfs and yarn respectively
Where does Big Data come from?
Define role of variety in big data?
Can you define big data analytics?
How do big data solutions interact with the existing enterprise infrastructure?
What are the three characteristics of big data according to ibm?
Can you define a udf?
How much data is enough to get valid outcome?
Can you explain the benefits of big data?