Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Explain foreach() operation in apache spark?
What is the difference between Gen1 and Gen2 Hadoop with regards to the Namenode?
What happens if you don?t override the mapper methods and keep them as it is?
What are the default read and write classes in Hive?
What is the default value of map and reduce max attempts?
How to setup the local repository manually?
Before deploying the hadoop instance, what are the checks that an individual should do?
Why not just use zookeeper for everything?
What are the advantage of spark?
Is it possible to use same metastore by multiple users, in case of embedded hive?
Mention some instances where zookeeper is using?
How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
What is the function of Cluster.Builder class in Cassandra?
Explain ALTER Table statement in Hive?
Discuss writeahead logging in Apache Spark Streaming?