Explain what happens when Hadoop spawns 50 tasks for a job and one of the tasks fails.
Is Spark used for machine learning?
What are the benefits of lazy evaluation?
What do you mean by metadata in HDFS? Where is it stored in Hadoop?
Define streaming.
What is the Presto Verifier?
How do I know if a Flume agent is running?
Can I do transforms or add new functionality?
What are the different input sources for Spark Streaming?
What is Big Data or Hadoop?
Can you override the Hadoop MapReduce configuration in Hive?
What is Spark ML?
Explain the term paired RDD in Apache Spark.
Define compaction.
Explain some of the basic commands used for the Apache Ambari server.