Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What are the tools that are used in ambari monitoring?
What is the process of creating ambari client?
What is a IdentityMapper and IdentityReducer in MapReduce ?
When executing Hive queries in different directories, why is metastore_db created in all places from where Hive is launched?
When to use –target-dir and when to use –warehouse-dir while importing data?
Explain Spark coalesce() operation?
What is hbase in hadoop?
What are the features and characteristics of Apache Spark?
What are Guarantees provided by Kafka?
what is the Hadoop MapReduce APIs contract for a key and value class?
What is Spark Dataset?
What is the sequencefileinputformat in hadoop?
What is structured data?
What do sorting and shuffling do?
How hdfs is different from traditional file systems?