explain the use of blinkdb?
How do you parse data in xml? Which kind of class do you use with java to parse data?
Explain parquet file?
What is lazy evaluation and how is it useful?
How is transformation on rdd different from action?
What is a dataset? What are its advantages over dataframe and rdd?
What is pagerank?
What is dag – directed acyclic graph?
Explain schemardd?
Describe coalesce() operation. When can you coalesce to a larger number of partitions? Explain.
When we create an rdd, does it bring the data and load it into the memory?
What does reduce action do?
how can you identify whether a given operation is transformation or action?
Explain the use of broadcast variables
How do you parse data in xml? Which kind of class do you use with java to pass data?