Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How to keep files in HDFS?
Elaborate on Identifiers?
What do you think about the speculative execution?
Explain a scenario where you will be using spark streaming.
How many daemon processes run on a hadoop cluster?
Can you define a udf?
What is Directed Acyclic Graph(DAG)?
What is spark code?
Explain the RDD properties?
Is it possible to change the default location of Managed Tables in Hive, if so how?
What is spark used for?
What is configured in /etc/hosts and what is its role in setting Hadoop cluster?
What are the relational operators available related to Grouping and joining in Pig language?
Explain first() operation in Spark?
What do you mean by cassandra-cqlsh?