What is the maximum recommended cell size?
Can we change settings within a Hive session? If yes, how?
Why is replication required in Kafka?
What is the difference between HBase and Hadoop/HDFS?
Explain the need for MapReduce while programming in Apache Pig.
What are the features of Apache Mahout?
What are the execution modes of Apache Pig?
What is the sequence of execution of Mapper, Combiner, and Partitioner in MapReduce?
What are InputSplit and RecordReader?
What features from relational databases or Hive are not available in Impala?
If the source data is updated from time to time, how will you synchronize the data in HDFS that was imported by Sqoop?
How much metadata will be created on the NameNode in Hadoop?
What is SparkContext?
Can you explain what a recommendation engine is?
What is the most widely used API to write data to Cassandra?