Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Define speculative execution?
Why aggregation cannot be done in Mapper?
What is distinct clause in apache tajo?
What does the high availability of a name-node means?
is it posible to join multiple fields in pig scripts?
What are the difference between of the “HDFS Block” and “Input Split”?
What is spark checkpointing?
What happens to existing data in my cluster when I add new nodes?
What are the four basic parameters of a mapper?
What makes Apache Spark good at low-latency workloads like graph processing and machine learning?
What are possible types of Channel Selectors?
Does spark use yarn?
Is hadoop required for data science?
What is a commodity hardware? Does commodity hardware include RAM?
What is driver memory and executor memory in spark?