Accelerate Transformer training with AWS Trainium

Julien Simon
Oct 13, 2022

--

In this video, I show you how to accelerate Transformer training with AWS Trainium, a new custom chip designed by AWS.

First, I walk you through the setup of an Amazon EC2 trn1.32xlarge instance, equipped with 16 Trainium chips. Then, I run a natural language processing job where I adapt existing Transformer training code for Trainium, accelerating a BERT model to classify the Yelp restaurant review datatset. Finally, I run the job on 1, 8, and 32 Neuron cores.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

Julien Simon
Julien Simon

No responses yet

Write a response