Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain about the execution pl of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
Is impala intended to handle real time queries in low-latency applications or is it for ad hoc queries for the purpose of data exploration?
43
What is a udf?
How hdfa differs with nfs?
Is it possible to change the default location of a managed table?
How does apache spark work?
Explain the top() and takeordered() operation?
What do you use spark for?
What is the use of tools command?
What is Federation?
How will you make changes to the default configuration files?
What does the mapred.job.tracker command do?
Explain what is a keyspace in Cassandra?
What are file permissions in HDFS and how HDFS check permissions for files or directory?
What are the data manipulation commands of hbase?
What do you know about schemardd?
What is spark in big data?