Module vector::sinks::webhdfs

webhdfs sink.

The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. An HDFS cluster consists of a namenode and one or more datanodes. A client sends an RPC to the namenode to find out which datanode it should send data to or receive data from. HDFS also rebalances data across the cluster so that each file has enough redundancy.

                    ┌───────────────┐
                    │  Data Node 2  │
                    └───────────────┘
                            ▲
┌───────────────┐           │            ┌───────────────┐
│  Data Node 1  │◄──────────┼───────────►│  Data Node 3  │
└───────────────┘           │            └───────────────┘
                    ┌───────┴───────┐
                    │   Name Node   │
                    └───────────────┘
                            ▲
                            │
                     ┌──────┴─────┐
                     │   Vector   │
                     └────────────┘

The WebHDFS sink connects to HDFS through its HTTP REST API.
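
To make the REST access concrete, the sketch below assembles a request URL in the `<endpoint>/webhdfs/v1/<path>?op=<OP>` shape that the WebHDFS REST API uses. The helper name `webhdfs_url` and its parameters are hypothetical and are not part of this module; the sketch only illustrates the URL layout and does no percent-encoding or networking.

```rust
/// Hypothetical helper (not part of this module): builds a WebHDFS REST URL
/// of the form `<endpoint>/webhdfs/v1/<root>/<path>?op=<OP>`.
///
/// Assumption: `path` needs no percent-encoding; a real client would encode it.
fn webhdfs_url(endpoint: &str, root: &str, path: &str, op: &str) -> String {
    // Normalise slashes so the pieces join with exactly one `/` between them.
    let endpoint = endpoint.trim_end_matches('/');
    let root = root.trim_matches('/');
    let path = path.trim_start_matches('/');

    // An empty root means the path is resolved from the file system root.
    let full_path = if root.is_empty() {
        path.to_string()
    } else {
        format!("{}/{}", root, path)
    };

    format!("{}/webhdfs/v1/{}?op={}", endpoint, full_path, op)
}
```

For example, calling `webhdfs_url("http://127.0.0.1:9870", "/tmp", "logs/a.log", "CREATE")` yields a URL under `/webhdfs/v1/tmp/logs/a.log`. The endpoint shown is only an example address for a namenode's HTTP port.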

For more information, please refer to:

webhdfs is an OpenDAL-based service. This module only provides the configuration needed to build a crate::sinks::opendal_common::OpenDalSink; the actual implementation is provided by crate::sinks::opendal_common::OpenDalSink.
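
The split described above (a thin, service-specific config that builds a shared sink) can be sketched as follows. This is a standalone illustration using only the standard library: `WebHdfsConfigSketch`, `SharedSinkSketch`, and their fields are hypothetical stand-ins, not the real config type or `OpenDalSink`, whose actual fields and build signature may differ.

```rust
/// Hypothetical stand-in for the service-specific configuration.
/// The field names are illustrative assumptions.
#[derive(Debug, Clone)]
struct WebHdfsConfigSketch {
    endpoint: String, // HTTP address of the namenode
    root: String,     // directory under which files are written
    prefix: String,   // prefix applied to generated object keys
}

/// Hypothetical stand-in for the shared sink: it knows nothing about
/// WebHDFS beyond the values the config hands over.
#[derive(Debug, PartialEq)]
struct SharedSinkSketch {
    service: &'static str,
    endpoint: String,
    root: String,
    key_prefix: String,
}

impl WebHdfsConfigSketch {
    /// Validates the config and hands everything to the shared sink.
    /// All write logic would live in the shared sink, not in this type.
    fn build(&self) -> Result<SharedSinkSketch, String> {
        if self.endpoint.is_empty() {
            return Err("endpoint must not be empty".to_string());
        }
        Ok(SharedSinkSketch {
            service: "webhdfs",
            endpoint: self.endpoint.clone(),
            root: self.root.clone(),
            key_prefix: self.prefix.clone(),
        })
    }
}
```

Keeping the config type this thin means a new storage backend only needs its own config and `build` step, while batching and writing stay in one shared place.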

Structs