Can you explain the common input formats in Hadoop?
How can Hive improve performance with ORC-format tables?
What relational operators are available for grouping and joining in Pig Latin?
What is Apache Cassandra?
Which database is used in Hadoop?
How does Spark run on Hadoop?
Define the consistency levels for read operations in Cassandra.
What are the disadvantages of Spark SQL?
What are the four V's of big data?
Explain the Catalyst query optimizer in Apache Spark.
What is the Pregel API?
What is a Dataset? What are its advantages over DataFrame and RDD?
What is a local repository and when will you use it?
What is the use of the "cqlsh --version" command?
Explain the reduce() operation in Spark.