Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) How will you calculate the number of executors required to do real-time processing using Apache Spark? What factors need to be considered for deciding on the number of nodes for real-time processing?
289In a given spark program, how will you identify whether a given operation is Transformation or Action ?
343
How is spark different from hadoop?
What is Big Data?
Does spark need yarn?
To use Spark on an existing Hadoop Cluster, do we need to install Spark on all nodes of Hadoop?
What are the types of transformation in RDD in Apache Spark?
What is difference between client and cluster mode in spark?
How Pig programming gets converted into MapReduce jobs?
what happens when Hadoop spawned 50 tasks for a job and one of the task failed?
What is the significance of ‘IF EXISTS” clause while dropping a table?
Mention the salient features of apache tajo ?
What is the role of JDBC driver in Sqoop?
What is the use of hbase hmaster?
How do I get better performance with spark?
What are some typical functions of job tracker in hadoop?
Is spark sql a database?