How do you create an RDD?
What is an RDD?
Explain the top() and takeOrdered() operations.
What are the major features/characteristics of RDDs (Resilient Distributed Datasets)?
What is the difference between map and flatMap?
How do you identify whether a given operation is a transformation or an action?
Why do we need compression, and which compression formats are supported?
Can the Ambari Python client be used to make good use of the Ambari APIs?
Mention what the shell commands “capture” and “consistency” determine.
Explain what a composite type is in Cassandra.
Mention the main components of the Cassandra data model.
Explain what a cluster is in Cassandra.
Name any two features of Flume.
What will be the result when you do cast('abc' as int)?
If you omit the OVERWRITE clause while creating a Hive table, what happens to files that are new and files that already exist?