Video: Accelerate Transformer inference with AWS Inferentia 2

Julien Simon
Apr 14, 2023

AWS Inferentia2 is now generally available, and I couldn’t resist testing it with BERT models and comparing results with Inferentia1.

This thing is FAST and looks very cost-effective. Check it out!

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Julien Simon
Julien Simon

No responses yet

What are your thoughts?