Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Why do we need Hadoop?
What is the spark driver?
Write a Mapreduce Program for Character Count ?
What does streams api in kafka?
What is the future of apache spark?
Why can aggregation not be done in Mapper in MapReduce?
Is avro supported?
Mention what is data cleansing?
Explain about the different channel types in Flume.
Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
What is the function of Cluster.Builder class in Cassandra?
What are the different catalog tables in hbase?
Discuss the precautions that are needed to take care while adding a column?
What do you understand by receivers in Spark Streaming ?
Explain what are the various types of Transformation on DStream?