DEV Community

James Briggs

Posted on Jul 5, 2021

Building an MLM Training Input Pipeline

#python #deeplearning #machinelearning #datascience

The input pipeline is the most complex part of the entire transformer build. We take our raw OSCAR training data, transform it, and prepare it for masked-language modeling (MLM). Finally, we load the data into a DataLoader, ready for training!
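
To make the masking step concrete, here is a minimal pure-Python sketch of how one tokenized sequence might be turned into an MLM training example. It is not the code from the build itself: the special-token ids, the `mask_tokens` and `batches` helpers, and the 15% masking rate (the conventional BERT default) are all assumptions for illustration, and the sketch only swaps tokens for the mask id rather than reproducing any more elaborate replacement scheme. The `batches` generator merely stands in for what a DataLoader would do.

```python
import random

# Hypothetical special-token ids, chosen for illustration only;
# a real tokenizer defines its own values.
CLS_ID, PAD_ID, SEP_ID, MASK_ID = 0, 1, 2, 4
SPECIAL_IDS = {CLS_ID, PAD_ID, SEP_ID}


def mask_tokens(input_ids, mask_prob=0.15, rng=None):
    """Build one MLM example (inputs, attention mask, labels) from token ids."""
    rng = rng or random.Random()
    labels = list(input_ids)   # targets are the original, unmasked ids
    masked = list(input_ids)   # inputs are a copy that we partially mask
    for i, tok in enumerate(input_ids):
        # Special tokens (start, end, padding) are never masked.
        if tok not in SPECIAL_IDS and rng.random() < mask_prob:
            masked[i] = MASK_ID
    # Attend to every position except padding.
    attention_mask = [0 if tok == PAD_ID else 1 for tok in input_ids]
    return {
        "input_ids": masked,
        "attention_mask": attention_mask,
        "labels": labels,
    }


def batches(examples, batch_size):
    """Yield lists of examples of at most batch_size, in order."""
    for start in range(0, len(examples), batch_size):
        yield examples[start:start + batch_size]
```

Calling `mask_tokens` on each tokenized sample and passing the results through `batches` gives the same shape of data a training loop expects. In practice the three lists would typically be converted to tensors and wrapped in a dataset object before being handed to a real DataLoader, which also handles shuffling.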
