Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) When executing Hive queries in different directories, why is metastore_db created in all places from where Hive is launched?
837What kind of data warehouse application is suitable for Hive? What are the types of tables in Hive?
724How can you prevent a large job from running for a long time? What do u think is more popular among the developers - Pig or Hive?
662
Why do we use spark?
Do I need to know scala to learn spark?
How is dag created in spark?
What is the use of “void close()” method?
Can we say a COGROUP is a group of more than 1 data set?
Explain the filter transformation?
Is there a dual table?
What is a generic UDF in the hive?
Explain mappartitions() and mappartitionswithindex()?
What is the throughput? How does hdfs give great throughput?
Wherever (Different Directory) I run hive query, it creates new metastore_db, please explain the reason for it?
In which language apache kafka is written?
Where is Mapper output stored?
What is Disk Balancer in Hadoop?
Define streaming?