Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
List the network requirements for using Hadoop ?
Explain about the scalar datatypes in Apache Pig?
Which command is used to show the current hbase user?
Explain about the partitioning, shuffle and sort phase
Can impala be used for complex event processing?
What do you understand by cluster in cassandra?
What are Actions?
Compare hadoop & spark?
Explain how you can reduce churn in isr? When does broker leave the isr?
What is the data storage component used by Hadoop?
Explain the level of parallelism in spark streaming?
How Cassandra provide High availability feature?
What is a bloom filter and how does it help in searching rows?
What are the different execution mode available in Pig?
Why do we need hadoop for big data analytics?