Why HDFS?
Explain the small file problem in Hadoop.
What are the benefits of Apache Tajo?
Does Impala support user-defined functions (UDFs)?
Explain transformations in RDDs. How does lazy evaluation help reduce the complexity of the system?
What is the advantage of a Parquet file?
What is the purpose of the retention period in a Kafka cluster?
Is Hive a NoSQL database?
What daemons run on master nodes?
Knox and Hadoop Development Tools?
Compare Apache Pig and SQL.
What does a Hadoop application look like, and what are its basic components?
What is stored in HDFS?
Is the HDFS block size reduced to achieve faster query results?
What are the responsibilities of a data analyst?