Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) While writing evaluate UDF, which method has to be overridden?
State syntax of the command that is used to drop a partition?
List out the commands that are used to start, check the progress and stop the ambari server?
What is the difference between spark and python?
what is ODBC and JDBC connectivity in Hive?
explain apache hbase?
How can you achieve high availability in Apache Spark?
Difference Between Hadoop and HDFS?
What is struct and explain its purpose?
What is the difference between piglatin and hiveql?
Define durable writes?
What alternate way does HDFS provides to recover data in case a Namenode, without backup, fails and cannot be recovered?
What do you understand by the partitions in spark?
How can you stop a partition form being queried?
State the difference between Spark SQL and Hql