Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What are spark stages?
Which language is not supported by spark?
How can you set an arbitrary number of mappers to be created for a job in Hadoop?
What is hfile ?
Explain the sequence of execution of all the components of MapReduce like a map, reduce, recordReader, split, combiner, partitioner, sort, shuffle.
Should we use RAID in Hadoop or not?
Explain tajo configuration files?
How to add/delete a Node to the existing cluster?
Can we use kafka without zookeeper?
What problems have you faced when you are working on Hadoop code?
Explain the role of offset in kafka?
when to choose “internal table” and “external table” in hive?
What is a “Distributed Cache” in Apache Hadoop?
How can you send large messages with kafka (over 15mb)?
Who invented the first spark plug?