Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What happens to a namenode, when job tracker is down?
Can you define a udf?
List the functions of Spark SQL?
Is spark distributed computing?
Explain the various types of partitioners in cassandra?
What is the required action you need to perform if you opt for scheduled maintenance on the cluster nodes?
How is HCatalog different from Hive?
What are the fundamental key structures of HBase?
What is CQL?
What do you mean by meta data in hdfs? List the files associated with metadata.
How the SSTable is different from other relational tables?
Explain the use of .mecia class?
Why would nosql be better than using a sql database? And how much better is it?
What is a commodity hardware? Does commodity hardware include RAM?
In MapReduce, ideally how many mappers should be configured on a slave?