Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How can you debug a Pig script?
Is it possible to run Spark and Mesos along with Hadoop?
Why HCatalog?
What is a bucket in Hive?
Explain the PigStorage function.
How many datanodes can run on a single Hadoop cluster?
Is Spark difficult to learn?
What is Nagios used for in Ambari?
Is it possible to join on multiple fields in Pig scripts?
What are the different data formats supported by Apache Tajo?
Explain how the JobTracker schedules a task.
Explain the various Apache Spark ecosystem components. In which scenarios can we use these components?
What is the Parquet file format? How do you convert data to Parquet format?
Explain the HCatStorer APIs.
What are barriers?