What are some features of Apache Cassandra?
Is it possible to do an incremental import using Sqoop?
When is it suggested to use a combiner in a MapReduce job?
What is KeyValueTextInputFormat in Hadoop MapReduce?
After the Map phase finishes, the Hadoop framework performs partitioning, shuffle, and sort. What happens in this phase?
What does Spark do during speculative execution?
Which command do we use to show the version?
What is an RDD?
What are the fundamental configuration parameters specified in MapReduce?
Can Spark work without Hadoop?
If the number of DataNodes increases, do we need to upgrade the NameNode in Hadoop?
Is Spark Streaming real-time?
Can I do transforms or add new functionality?
What are the different types of NoSQL databases?
What does /etc/init.d do?