How can we create a table using a command?
How would you import data from MySQL into HDFS?
Which method is used to access HFile directly without using HBase?
What is the use of a custom WritableComparable class in MapReduce code?
What is the maximum size of a message that Kafka can receive?
When should you use Spark cache?
Explain various Apache Spark ecosystem components. In which scenarios can we use these components?
What are the features of Hive?
How would you calculate the number of unique visitors for each hour by mining a huge Apache log? You can use post-processing on the output of the MapReduce job.
How can we ensure that all values for a particular key go to the same reducer?
What are the configuration files in Hadoop?
What is Hadoop technology?
What is the Job OutputFormat?
What does the Connector API do in Kafka?
What is the Cassandra database software?