What are buckets in Hive?
Ideally, what should the replication factor be in Hadoop?
Does Kafka use HDFS?
What are the advantages of Parquet files?
Explain the functionality of ObjectInspector.
When should you choose an internal table in Hive?
What language is Apache Spark written in?
Is Avro supported?
Can you explain indexing?
What are the primary phases of a Reducer?
What is the default partitioner in Hadoop?
How can one write a custom record reader?
Explain accumulators in Apache Spark.
What is the use of Spark in big data?
Explain the foreach() operation in Apache Spark.