What is a block in HDFS? what is the default size in Hadoop 1 and Hadoop 2? Can we change the block size?1
How do I stop flume agent?
Explain about trformations and actions in the context of rdds?
What is a spark shuffle?
Explain what happens if, during the PUT operation, HDFS block is assigned a replication factor 1 instead of the default value 3?
Explain about the core components of a distributed Spark application?
Why is Hive not suitable for OLTP systems?
what is ODBC and JDBC connectivity in Hive?
Explain Apache Ambari architecture?
What is big data concept?
Describe the distnct(),union(),intersection() and substract() transformation in Apache Spark RDD?
What Platforms Cassandra runs on?
What is faster than apache spark?
Is it possible to leverage real time analysis on the big data collected by flume directly? If yes, then explain how?
What is the best hardware configuration to run Hadoop?
Explain Clustering in Hive?