In this tutorial, we will learn how to set up and run a pre-trained deep learning model and use it creatively. In particular, we will work with a semantic segmentation network that recognizes different elements of an image or video and generates a mask from them.

Get the model

The model used in this project performs semantic segmentation, i.e. it splits an image into regions, each associated with a semantic category such as sky, tree, or person. It was developed by researchers at the MIT Computer Science & Artificial Intelligence Lab using PyTorch and trained on the ADE20K dataset.
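To make the idea of "splitting an image into regions" more concrete, here is a minimal sketch of how per-pixel category scores could be turned into a mask. It does not use the actual model: the three-entry `CATEGORIES` list, the toy scores, and the helper names `label_map` and `binary_mask` are all made up for illustration, and plain Python lists stand in for the tensors a real network would output.

```python
# Hypothetical category list for illustration only; the real ADE20K
# label set is much larger and uses its own index order.
CATEGORIES = ["sky", "tree", "person"]


def label_map(scores):
    """Pick the highest-scoring category index for every pixel.

    scores[c][y][x] is the score of category c at pixel (y, x).
    """
    height = len(scores[0])
    width = len(scores[0][0])
    return [
        [max(range(len(scores)), key=lambda c: scores[c][y][x]) for x in range(width)]
        for y in range(height)
    ]


def binary_mask(labels, category):
    """Return a 0/255 mask that is white wherever `category` was predicted."""
    index = CATEGORIES.index(category)
    return [[255 if label == index else 0 for label in row] for row in labels]


# Toy 2x3 "image" with made-up scores, one 2D grid per category.
scores = [
    [[0.90, 0.80, 0.10], [0.20, 0.10, 0.10]],  # sky
    [[0.05, 0.10, 0.70], [0.30, 0.20, 0.60]],  # tree
    [[0.05, 0.10, 0.20], [0.50, 0.70, 0.30]],  # person
]

labels = label_map(scores)
person_mask = binary_mask(labels, "person")

for row in person_mask:
    print(row)
```

With real model output, the same two steps (an argmax over the category axis, then a comparison against one category index) would typically be done on tensors or arrays rather than nested lists.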


Riccardo Miccini

