Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is hadoop the future?
Is ambari python clients can be used to make the good use of ambari api’s?
What is the difference between rdd and dataframe?
How to create Users in hadoop HDFS?
What do you understand by worker node?
What does the Spark Engine do?
Does Spark provide the storage layer too?
Clarify how hive de-serialize and serialize the information?
How many types of NoSQL databases are there?
Define Cassandra?
Which code do we use to open the connection in Hbase?
How do ‘map’ and ‘reduce’ work?
What are the fundamental key structures of HBase?
Does the HDFS go wrong? If so, how?
what is Cassandra- CQL collections?