Please help me to choose a right tool. I have the following task: there are N data sources (N < 20), it could be either relational DB (MySQL, PostgreSQL) or REST API. I need to load all the data from N data sources into a single relational database (only once). So the final goal seems to be a simple ETL:
- extract data from data source
- transform the data (map data to fit into a target DB schema)
- load to DB
(Note: each source DB contain 10-15 coupled tables with 100 000 - 1 000 000 rows)
I'm currently trying to discover the proper instrument and I believe that Apache NiFi is exactly what I need: love the idea to just configure everything via friendly UI instead of coding and reinventing the wheel.
A couple of questions:
- Does Apache NiFi look suitable for my task or it would be an overkill?
- Will I have any benefit configuring Apache NiFi with zero knowledge of that instrument vs writing a custom script using some programming language I'm comfortable with (Python, for example)
Thanks!