Can we rename the output file?
Illustrate some demerits of using Spark.
Explain InputSplit in Hadoop.
What happens to existing data in my cluster when I add new nodes?
What does the Consumer API do in Kafka?
What is Geo-Replication in Kafka?
What is the maximum size of the string data type supported by Hive? Mention the binary formats that Hive supports.
Explain the execution plans of a Pig script.
or
Differentiate between the logical and physical plans of an Apache Pig script.
Define fsck.
What are the different commands available in the Hive and Beeline CLIs?
Explain the fold() operation in Spark.
How do you categorize big data?
Differentiate between the GROUP and COGROUP operators.
What is Spark SQL?
How do you start HBase services?