Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can I install the Cloudera VM on my system?
Can I do transforms or add new functionality?
Explain the process to access subdirectories recursively in Hive queries.
What do you understand by an inner bag and outer bag in Pig?
What is the role of ZooKeeper in a Kafka cluster?
Explain the advantages of HBase.
What are the network requirements for using Hadoop?
Who are ‘Data Scientists’?
What is Mahout in Hadoop?
How many types of NoSQL databases are there?
Explain the concept of RDD (Resilient Distributed Dataset). Also, state how you can create RDDs in Apache Spark.
What is the main difference between HBase and Hive?
What is RDD?
What is CONCATENATE command in Hive?
What is Spark?