What is the physical plan in Pig architecture?
Explain what happens if rack 2 and a DataNode fail.
What problem does Apache Pig solve?
Define SparkContext in Apache Spark.
What are the benefits of Spark over MapReduce?
What is setMaster in Spark?
What is pseudo-distributed mode?
Explain the core components of Hadoop.
State the usage of the 'filters', 'group', 'orderBy', and 'distinct' keywords in Pig scripts.
What is an offset?
Explain Apache Kafka.
What is the procedure to recover a NameNode when it is slow?
Explain the CLI in ZooKeeper.
What are the different components available in Kafka?
Why do we need Hadoop for big data analytics?