Below is a mini data engineering crash course that I quickly put together and this list includes subjects that aspiring data engineers may want to become familiar with -- they include:
πΏ R / RStudio : data.frame
πΏ Spark or DataBricks or EMR : DataFrame / DataSet
πΏ Python : Pandas
πΏ Java: JPA / JPQL / Hibernate / HQL
πΏ Data modeling
πΏ Relational databases & SQL
πΏ NOSQL / MapReduce / Hadoop
πΏ MongoDB
πΏ CSV files
What else would you include?
Top comments (0)