Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain the process for starting a kafka server?
How can we remove a znode?
What are the disadvantages of hadoop serialization?
did you maintain the hadoop cluster in-house or used hadoop in the cloud?
How do you list all databases whose name starts with p?
How do I start a spark master?
What is Data Locality in Hadoop?
When do you have to avoid secondary indexes?
What are ‘maps’ and ‘reduces’?
Explain various level of persistence in Apache Spark?
Why does my select statement fail?
List some use cases where classification machine learning algorithms can be used.
What are the ways in which Apache Spark handles accumulated Metadata?
How Spark uses Hadoop?
What is a commodity hardware? Does commodity hardware include RAM?