SageMaker Fridays Season 3 Episode 2 — Easy data preparation with SageMaker Data Wrangler

Julien Simon
Mar 7, 2021

--

In this episode, we start from the popular Titanic survivor dataset. We import it into SageMaker Data Wrangler, where we build visualizations and apply built-in transforms (column operations, imputing missing values, one-hot encoding, normalization). Then, we export these transforms to a Jupyter notebook that runs a SageMaker Processing job. We run the notebook and take a look at the processed dataset, before training a model with XGBoost. We also take a quick look at other export options (Python code, SageMaker Pipelines, SageMaker Feature Store). As usual, 100% live, no slides :)
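To give a feel for what the built-in transforms mentioned above do to the data, here is a minimal sketch in plain Python. It is not the code that Data Wrangler generates or exports: the rows are made-up values shaped like a few Titanic columns, and the helper names (`impute_median`, `one_hot`, `min_max_scale`) are hypothetical. It also assumes median imputation and min-max scaling, which are only one possible choice for each step.

```python
from statistics import median

# Hypothetical rows shaped like a few Titanic columns (values are made up for illustration).
rows = [
    {"Age": 22.0, "Sex": "male", "Fare": 7.25},
    {"Age": None, "Sex": "female", "Fare": 71.28},
    {"Age": 26.0, "Sex": "female", "Fare": 7.92},
    {"Age": 35.0, "Sex": "male", "Fare": 53.10},
]


def impute_median(rows, column):
    """Replace missing (None) values in a numeric column with the column median."""
    fill = median(r[column] for r in rows if r[column] is not None)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]


def one_hot(rows, column):
    """Replace a categorical column with one 0/1 column per distinct value."""
    categories = sorted({r[column] for r in rows})
    encoded = []
    for r in rows:
        new_row = {k: v for k, v in r.items() if k != column}
        for c in categories:
            new_row[f"{column}_{c}"] = 1 if r[column] == c else 0
        encoded.append(new_row)
    return encoded


def min_max_scale(rows, column):
    """Rescale a numeric column to the [0, 1] range."""
    values = [r[column] for r in rows]
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0  # avoid dividing by zero on constant columns
    return [{**r, column: (r[column] - lo) / span} for r in rows]


# Chain the three steps: impute Age, encode Sex, normalize Fare.
processed = min_max_scale(one_hot(impute_median(rows, "Age"), "Sex"), "Fare")
for row in processed:
    print(row)
```

In the episode, these steps are configured in the Data Wrangler UI and run at scale by a SageMaker Processing job, so a sketch like this is only useful for reasoning about the shape of the processed dataset.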

Join us for more episodes at https://amazonsagemakerfridays.splashthat.com/
