From fe0649fbdc5ba6801cbdab140ff1de8849a9e9c0 Mon Sep 17 00:00:00 2001 From: "tina-cloud-app[bot]" <58178390+tina-cloud-app[bot]@users.noreply.github.com> Date: Fri, 19 Jan 2024 04:08:51 +0000 Subject: [PATCH] TinaCMS content update --- posts/data-engineering-as-a-product.md | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/posts/data-engineering-as-a-product.md b/posts/data-engineering-as-a-product.md index b704599..d58d23f 100644 --- a/posts/data-engineering-as-a-product.md +++ b/posts/data-engineering-as-a-product.md @@ -1,7 +1,7 @@ --- -title: Data Engineering as a Product date: '2024-01-15' -author: 'Chris Forno' +author: Chris Forno +title: Data Engineering as a Product --- Data engineering is essential for all value-added data work, but it’s hard to do well and ripe for disruption. @@ -14,9 +14,9 @@ Data Engineers are paid more than Data Scientists ($127K vs $123K in the US acco According to various surveys, “data engineering” (collection, copying, cleaning, etc.) is the most hated part of data scientists’ jobs. Data engineering consists of: -* Copying data from one place to another (often with [ETL pipelines](dont-etl-elt-or-etlt)) -* Converting data between different formats (such as [CSV](why-you-need-automatic-inference)) -* Aligning data from different sources +* Copying data from one place to another (often with [ETL pipelines](dont-etl-elt-or-etlt)). +* Converting data between different formats (such as [CSV](why-you-need-automatic-inference)). +* Aligning data from different sources. These are all repetitive, error-prone tasks: exactly the kinds of tasks that are well-suited to automation.