What does the high availability of a name-node means?
Explain the level of parallelism in spark streaming?
How can Flume be used with HBase?
What are the Data types in Pig?
What is throughput in HDFS?
Explain the master class and the output class do?
Is hive similar to sql?
What will be the output of cast ('XYZ' as INT)?
What are the components of spark?
Explain distnct(),union(),intersection() and substract() transformation in Spark?
Name the most common input formats defined in hadoop?
What is Your Cluster size ?
How can the columns of a table in hive be written to a file?
Explain what is a sequence file in hadoop?
Give the difference between Drop and Truncate in CQLSH?