Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
When to avoid secondary indexes?
Are Namenode and job tracker on the same host?
What is NameNode? How NameNode tackle Datanode failures in Hadoop?
Is impala intended to handle real time queries in low-latency applications or is it for ad hoc queries for the purpose of data exploration?
How Spark uses Hadoop?
What is heartbeat in hdfs?
What is closing out ledgers?
How do you set up a spark?
What are producer-consumer queues?
What are the tools that are used in ambari monitoring?
What is big data in dbms?
What is Apache Spark Machine learning library?
How to keep HDFS cluster balanced?
List the advantage of Parquet files?
Explain what happens when hadoop spawned 50 tasks for a job and one of the task failed?