Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What do you mean by the NameNode High Availability in hadoop?
What is sparksession and sparkcontext?
What are the features of apache mahout?
How is HCatalog different from Hive?
When executing Hive queries in different directories, why is metastore_db created in all places from where Hive is launched?
What combiners are and when you should utilize a combiner in a map reduce job?
Wherever (Different Directory) I run the hive query, it creates new metastore_db, please explain the reason for it?
What is pyarrow?
How does apache spark work?
Name the operating system(s) which are supported for production hadoop deployment?
What do we mean by Partitions or slices?
What are the drawbacks of Pig?
What is HDFS - Hadoop Distributed File System?
What is PageRank in Spark?
How tasks are created in spark?