Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is it possible to leverage real time analysis on the big data collected by flume directly? If yes, then explain how?
What is Cassandra Database Software ?
List the languages supported by Apache Spark?
Is it possible to search for files using wildcards?
Explain the functionalities of ganglia in ambari?
What is Sqoop?
Is spark secure?
What is pig properties?
Which command is used to SHOW PARTITIONS lists in HCatalog?
Explain about the execution plans of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
Which channel type is faster in Flume?
Which operating system(s) are supported for production hadoop deployment?
What is the use of mysql connector?
Highlight the key differences between MapReduce and Apache Pig?
What is the difference between sort by and order by in hive?