Does Pig give any warning when there is a type mismatch or missing field?
What is a Parquet file?
How can we check whether Sqoop is installed on a system?
What is the use of the foreach operation in Pig scripts?
What is Apache Cassandra?
Can you explain the term Cassandra?
What is Cassandra data modelling?
What tasks can you perform to manage services using the Ambari service tab?
Who developed Apache Avro?
How is data transferred from HDFS to Hive?
What is a combiner, and where should you use it?
What are the major libraries that constitute the Spark ecosystem?
What is a sequence file in Hadoop?
What do we mean by Parquet?
What are the input format, input split, and record reader, and what do they do?