Apache DolphinScheduler: "Apache Open-source Projects in Modern Data Stacks". Editor: Detong, github.com/mischaZhang. Nov 23, 2022.
Ravishankar Nair: "Building Modern Data Lakes with MinIO: Part 3". Along with Minio and Presto, we have a new guest: Apache Iceberg. May 5, 2022.
Raj Samuel: "Design patterns every data engineer should know". Jan 22, 2022.
Hashmap, an NTT DATA Company (in Hashmap, an NTT DATA Company): "NiFi NAR Files Explained". An Introduction to Processor and Controller Service Packaging. Jun 7, 2017.
Victor Seifert (in TDS Archive): "How to build a data lake from scratch - Part 1: The setup". The complete tutorial of how to make use of popular technology to build a data engineering sandbox. Nov 18, 2021.
Hashmap, an NTT DATA Company (in Hashmap, an NTT DATA Company): "Creating Custom Processors and Controllers in Apache NiFi". 3 Ways to Get Started with Apache NiFi Data Flows. May 17, 2018.
Aakash Pydi (in TDS Archive): "Building a Production-Level ETL Pipeline Platform Using Apache Airflow". Using Apache Airflow to Manage Data Workflows in CernerWorks. Oct 26, 2019.
John Lafleur (in TDS Archive): "Why the Future of ETL Is Not ELT, But EL+T". How we store and manage data has completely changed over the last decade. We moved from an ETL world to an ELT world, with companies like… Nov 3, 2020.
Netflix Technology Blog (in Netflix TechBlog): "Data pipeline asset management with Dataflow". By Sam Redai, Jai Balani, Olek Gorajek. Oct 15, 2021.