Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
what does the shell commands “Capture” and “Consistency” determines?
How is reporting controlled in hadoop?
List out the different stream grouping in apache storm?
What is Data Log in Kafka?
Can multiple clients write into a Hadoop HDFS file concurrently?
What is the point of apache spark?
What are the benefits of block transfer?
How much Metadata will be created on NameNode in Hadoop?
How does Mappers run method works?
What is kafka Producer?
What is Clustring in Hive?
Mention some important features of spm in cassandra?
How hdfa differs with nfs?
Is impala intended to handle real time queries in low-latency applications or is it for ad hoc queries for the purpose of data exploration?
Explain about the scalar datatypes in Apache Pig?