SageMaker Fridays Season 3 Episode 2 — Easy data preparation with SageMaker Data Wrangler

Julien Simon
Mar 7, 2021

In this episode, we start from the popular Titanic survivor dataset. We import it in SageMaker Data Wrangler, where we build visualizations and apply built-in transforms (column operations, imputing missing values, one hot encoding, normalization). Then, we export these transforms to a Jupyter notebook running a SageMaker Processing job. We run the notebook and take a look at the processed dataset, before training a model with XGBoost. We also take a quick look at other export options (Python code, SageMaker Pipelines, SageMaker Feature Store). As usual, 100% live, no slides :)

Join us for future episodes at Join us for more episodes at https://amazonsagemakerfridays.splashthat.com/

--

--