MySQL Applier overview
MySQL Applier provides real-time data processing between MySQL and Hadoop. This is quite an efficient way to load data because of its performance and real-time processing. This will load the data which has been changed rather than loading whole data. Hence, No More Bulk Transfer needed!
MySQL Applier replicates rows inserted into MySQL to the HDFS with help of MySQL binlog. This will use binary log and insert data in real-time; based on events in MySQL. All the events occurred in MySQL Server is available in binlog and Applier takes these changes from the events and applies same in Hadoop. This way we can rapidly acquire new data from the MySQL.
MySQL Applier uses API provided by libhdfs
which is C
library. This library is pre-compiled with Hadoop distributions for connecting MySQL master or read a binlog
file of MySQL. It is responsible for various operations as listed in the following points:
- Get the events (insert, update, delete) occurred on the MySQL Server.
- Decodes...