Big Data Interview Questions
Questions Answers Views Company eMail

Explain a common use case for Flume?

60

Explain about the different channel types in Flume.

71

What are core components of Flume?

74

Which channel type is faster in Flume?

97

What are Flume events?

72

How many Reducers should be configured?

86

Can we change the body of the flume event?

62

Does Flume provide 100% reliability to the data flow?

113

Explain about the core components of Flume?

66

How can Flume be used with HBase?

89

Does Apache Flume provide support for third party plug-ins?

66

How is Flume-NG different from Flume 0.9?

65

Which is the reliable channel in Flume to ensure that there is no data loss?

98

What problem does Apache Flume solve?

73

How to write data in Hbase using flume?

72


Un-Answered Questions { Big Data }

What is a ledger in bookkeeper?

1


What is decorating filters?

234


Explain write ahead log(journaling) in spark?

188


Name some internal daemons used in spark?

231


explaine wal in hbase?

121






Mention some instances where zookeeper is using?

5


What is KeyValueTextInputFormat in Hadoop?

264


What do you mean by block scanner in hdfs?

26


What is IdentityMapper?

641


Explain the use of broadcast variables

225


What is a speculative execution in Apache Hadoop MapReduce?

438


What do you mean by cassandra-cqlsh?

52


if you run Hive as a server?

638


How can we create children / sub-znode?

5


Suppose there is file of size 514 mb stored in hdfs (hadoop 2.x) using default block size configuration and default replication factor. Then, how many blocks will be created in total and what will be the size of each block?

95