Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Is there another way to check whether Namenode is working?
Explain keys() operation in Apache spark?
Can you give us some more details about ssh communication between masters and the slaves?
What are the default read and write classes in Hive?
What are the main properties of hdfs-site.xml file?
How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
What port does spark use?
Explain the Constituents of Apache ZooKeeper Architecture?
Mention what are the main configuration parameters that user need to specify to run mapreduce job?
What is graph db? Explain with an example.
How to set up local repository manually?
Which method is used to access HFile directly without using HBase?
Does google use hadoop?
What do you understand by logging in cassandra?
Can you define rdd lineage?