Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How Hadoop is cost-effective?
What is Flatten?
What is a Heartbeat in Hadoop?
Do we need scala for spark?
Can you explain sequence file in hadoop?
Explain about the partitioning, shuffle and sort phase in MapReduce?
Clarify Memtable?
Why do we use spark?
What is Spark MLlib?
Map reduce jobs are failing on a cluster that was just restarted. They worked before restart. What could be wrong?
Explain what is big data?
What is HDFS ? How it is different from traditional file systems?
Different running modes for running Pig?
What is an input reader in reference to mapreduce?
Which command is used to show the current hbase user?