What benefits do Hadoop users get from using Apache Ambari?
How can an application connect to Hive when it is run as a server?
What features of RDD make it an important abstraction in Spark?
How does InputSplit in MapReduce determine record boundaries correctly?
What is the Client API?
What is Distributed Cache in Hadoop?
Explain transformations and actions on RDDs in Apache Spark.
Is Apache Flume a real-time processing framework?
How is data or a file written into Hadoop HDFS?
Suppose Hadoop spawned 100 tasks for a job and one of the tasks failed. What will Hadoop do?
What happens if a block in HDFS is corrupted?
How does Spark work with Python?
Which relational operators in Pig Latin are used for combining and splitting data?
What is the difference between Hive and Impala?
Does the MapReduce programming model provide a way for reducers to communicate with each other? In a MapReduce job, can a reducer communicate with another reducer?