Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the main purpose of HDFS fsck command?
Name a few import control commands. How can Sqoop handle large objects?
Can you define inputsplit in hadoop?
What do you mean by metadata in Hadoop?
how Cassandra writes changed data into commitlog?
What is dag spark?
Apache Spark is a good fit for which type of machine learning techniques?
How to load data into table created in hive ?
Define Actions.
Explain various Apache Spark ecosystem components. In which scenarios can we use these components?
Which files are used by the startup and shutdown commands?
What is aws spark?
What is Balancer in Hadoop?
Explain the hadoop-core configuration?
What are the different types of data model?