Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Can you list down the limitations of using Apache Spark?
What will you do when NameNode is down?
Can you define fsck?
What is spark etl?
what do you mean by compaction?
Does spark use zookeeper?
What is sharding in big data?
When do you have to avoid secondary indexes?
Why does my select statement fail?
Define data integrity?
Are sparks dangerous?
Map reduce jobs take too long. What can be done to improve the performance of the cluster?
What are the barriers?
Describe impala shell (impala-shell command)?
What is the difference between Internal Table and External Table in Hive?