Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Why is Spark faster than Hive?
Can you explain Hadoop Streaming?
What is a Spark table?
What is the primary purpose of Flume in the Hadoop architecture?
Why is Hive not suitable for OLTP systems?
Can you explain the lookup() operation in Spark?
What is the full form of RDD?
What are executor cores in Spark?
Can you explain Spark Core?
What is a Kafka cluster?
What are the main configuration parameters in a MapReduce program?
What is the use of the "cqlsh --version" command?
Are there any special requirements for the NameNode?
What is the method to create a data frame?
Can multiple clients write to an HDFS file simultaneously?