Skip to content

Rituraj-commits/Semantic-Segmentation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Semantic Segmentation on GTAV-to-Cityscapes Labels

The GTAV dataset consists of 24966 synthetic images with dense pixel level semantic annotations. The images have been rendered using the open-world video game Grand Theft Auto 5 and are all from the car perspective in the streets of American-style virtual cities. There are 35 semantic classes which are compatible with the ones of CamVid and Cityscapes dataset. We use Unet++ as our baseline in this scenario. UNet++ is an extension over UNet which uses dense skip connections to extract better feature information.

Getting Started

Dependencies

  • PyTorch 1.8.0
  • CUDA 10.2
  • TensorBoard 2.5.0

Installing

pip install segmentation_models_pytorch

Execution

 python train.py

train.py contains code for training the model and saving the weights.

loader.py contains code for dataloading and train-test split.

utils.py contains utility functions.

Experimental Details

Loss functions

We use Cross Entropy as the objective function to train the model.




Training

We use Adam optimizer for optimizing the objective function. The learning rate is initially set to 0.001 and is halved if the validation loss plateaus over 10 epochs. We train the network until 100 epochs and the best weights are saved accordingly. We use NVIDIA Tesla P100 with 16 GB of VRAM to train the model.

Quantative Results

We evaluate the model on the basis of Dice Score Coefficient(DSC) and Intersection over Union(IoU) over 35 classes.




Qualitative Results



Fig 1: Semantic Segmentation on GTAV dataset

Acknowledgments

[1] Aerial Segmentation using UNet

[2] UNet Multi Class PyTorch

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages