How does Hadoop achieve fault tolerance?
Is it possible to write Hadoop job output to multiple directories?
What is repartition in Spark?
How can we drop a table in HCatalog?
What is data replication?
What is a RecordReader in Hadoop?
Why are transformations lazy operations in Apache Spark RDDs? How is this useful?
What does JMX stand for?
What are the different methods to run Spark over Apache Hadoop?
What are partitions in Hive?
How is security achieved in Apache Hadoop?
What are the consistency levels for read operations in Cassandra?
In which language is Apache Kafka written?
What is the default replication factor, and how can you change it?
What is the JobTracker in Hadoop? What actions does Hadoop follow?