How do you perform inter-cluster data copying in HDFS?




Answer / Arvind Kumar Sinha

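Inter-cluster data copying in HDFS is normally done with DistCp (distributed copy), a utility that ships with Hadoop. DistCp expands the source paths into a list of files and runs a MapReduce job in which each map task copies a share of that list, so large amounts of data move between clusters in parallel. Note that the Data Transfer Protocol in HDFS is the streaming protocol used between clients and DataNodes to read and write blocks; it is not a separate inter-cluster copy tool.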
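
As a rough sketch of how a DistCp copy between two clusters might be launched, the Python snippet below only assembles the `hadoop distcp` command line and, optionally, hands it to the shell. The NameNode host names, the port 8020, the paths, and the helper names `build_distcp_command` and `run_distcp` are assumptions made for illustration; `-update` and `-m` are standard DistCp options.

```python
import subprocess
from typing import List, Optional


def build_distcp_command(
    source_uri: str,
    target_uri: str,
    update: bool = False,
    max_maps: Optional[int] = None,
) -> List[str]:
    """Build the argument list for a `hadoop distcp` invocation."""
    command = ["hadoop", "distcp"]
    if update:
        # -update copies only files that are missing or differ at the target.
        command.append("-update")
    if max_maps is not None:
        # -m caps the number of simultaneous map tasks used for the copy.
        command += ["-m", str(max_maps)]
    command += [source_uri, target_uri]
    return command


def run_distcp(command: List[str]) -> int:
    """Run the command; needs a `hadoop` client on PATH that can reach both clusters."""
    return subprocess.run(command, check=False).returncode


if __name__ == "__main__":
    # Hypothetical NameNode hosts, port, and paths; replace with real ones.
    cmd = build_distcp_command(
        "hdfs://nn1.example.com:8020/data/logs",
        "hdfs://nn2.example.com:8020/backup/logs",
        update=True,
        max_maps=20,
    )
    print(" ".join(cmd))
```

Building the command separately from running it makes it easy to inspect or log the exact invocation before any data is moved. Calling `run_distcp(cmd)` would start the actual copy job.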


More Apache HDFS (Hadoop Distributed File System) Interview Questions

Does HDFS allow a client to read a file that is already open for writing?

What are the problems with small files in HDFS?

What do you mean by metadata in HDFS?

What are the NameNode and DataNode in HDFS?

Since data is replicated three times in HDFS, does it mean that any calculation done on one node will also be replicated on the other two?

Suppose a file of size 514 MB is stored in HDFS (Hadoop 2.x) using the default block size and default replication factor. How many blocks will be created in total, and what will be the size of each block?

Explain how file systems are checked in HDFS.

List the files associated with metadata in HDFS.

Can multiple clients write into an HDFS file concurrently?

Explain the NameNode and DataNode in HDFS.

Distinguish between HDFS Block and Input Unit.

Why is HDFS only suitable for large data sets and not the correct tool to use for many small files?