How does the client communicate with HDFS?
What does a Hadoop application look like, and what are its basic components?
If the number of DataNodes increases, do we need to upgrade the NameNode?
Is Spark an ETL tool?
What is the use of "cqlsh --version" command?
Explain various Apache Spark ecosystem components. In which scenarios can we use these components?
How many types of RDD are there in Spark?
How do I try out Impala?
How does Apache Spark work?
What is the role of the Connector API?
What is unstructured data?
What are the basic commands available in Sqoop?
How is Hadoop different from Spark?
What is SparkSession in Apache Spark?
What is the Sqoop command to see the contents of the job named myjob?