142

GitHub - MSiam/TFSegmentation: RTSeg: Real-time Semantic Segmentation Comparativ...

 6 years ago
source link: https://github.com/MSiam/TFSegmentation
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

README.md

Real-time Semantic Segmentation Comparative Study

By: Mennatullah Siam, Mostafa Gamal, Moemen AbdelRazek, Senthil Yogamani, Martin Jagersand

The repository contains the official TensorFlow code used in the our paper RTSEG: REAL-TIME SEMANTIC SEGMENTATION COMPARATIVE STUDY for comparing different realtime semantic segmentation architectures.

Description

Semantic segmentation benefits robotics related applications especially autonomous driving. Most of the research on semantic segmentation is only on increasing the accuracy of segmentation models with little attention to computationally efficient solutions. The few work conducted in this direction does not provide principled methods to evaluate the different design choices for segmentation. In this paper, we address this gap by presenting a real-time semantic segmentation benchmarking framework with a decoupled design for feature extraction and decoding methods. The code and the experimental results are presented on the CityScapes dataset for urban scenes.

fig.png

Models

Encoder Skip U-Net DilationV1 DilationV2 VGG-16 Yes Yes Yes No ResNet-18 Yes Yes Yes No MobileNet Yes Yes Yes Yes ShuffleNet Yes Yes Yes Yes

NOTE: The pretrained weights for all the implemented models will be released soon. Stay in touch for the updates.

Reported Results

Test Set

Model GFLOPs** Class IoU Class iIoU Category IoU Category iIoU SegNet 286.03 56.1 34.2 79.8 66.4 ENet 3.83 58.3 24.4 80.4 64.0 DeepLab - 70.4 42.6 86.4 67.7 SkipNet-VGG16 - 65.3 41.7 85.7 70.1 SkipNet-ShuffleNet 2.0 58.3 32.4 80.2 62.2 SkipNet-MobileNet 6.2 61.5 35.2 82.0 63.0

Validation Set

Encoder Decoder Coarse mIoU MobileNet SkipNet No 61.3 ShuffleNet SkipNet No 55.5 ResNet-18 UNet No 57.9 MobileNet UNet No 61.0 ShuffleNet UNet No 57.0 MobileNet Dilation No 57.8 ShuffleNet Dilation No 53.9 MobileNet SkipNet Yes 62.4 ShuffleNet SkipNet Yes 59.3

** GFLOPs is computed on image resolution 360x640. However, the mIOU(s) are computed on the official image resolution required by CityScapes evaluation script 1024x2048.

Usage

Run

The file named run.sh provide a good example for running different architectures. Have a look at this file.

Example to the running command written in the file:

python3 main.py --load_config=[config_file_name].yaml [train/test] [Trainer Class Name] [Model Class Name]

Main Dependencies

Python 3 and above
tensorflow 1.3.0/1.4.0
numpy 1.13.1
tqdm 4.15.0
matplotlib 2.0.2
pillow 4.2.1
PyYAML 3.12

All Dependencies

pip install -r [requirements_gpu.txt] or [requirements.txt]  

Citation

If you find RTSeg useful in your research, please consider citing our work:

@ARTICLE{2018arXiv180302758S,   
   author = {{Siam}, M. and {Gamal}, M. and {Abdel-Razek}, M. and {Yogamani}, S. and    
	{Jagersand}, M.},   
    title = "{RTSeg: Real-time Semantic Segmentation Comparative Study}",   
  journal = {ArXiv e-prints},   
archivePrefix = "arXiv",   
   eprint = {1803.02758},   
 primaryClass = "cs.CV",   
 keywords = {Computer Science - Computer Vision and Pattern Recognition},   
     year = 2018,   
    month = mar,   
   adsurl = {http://adsabs.harvard.edu/abs/2018arXiv180302758S},   
  adsnote = {Provided by the SAO/NASA Astrophysics Data System}   
}

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.


About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK