README.md

A Light and Fast Face Detector for Edge Devices

This repo is still under construction. The formal release is coming soon!

Recent Update

2019.07.25 This repos is first online. Face detection code and trained models are released.

Introduction

This repo releases the source code of paper "LFFD: A Light and Fast Face Detector for Edge Devices" . Our paper presents a light and fast face detector (LFFD) for edge devices. LFFD considerably balances both accuracy and latency, resulting in small model size, fast inference speed while achieving excellent accuracy. In practical, we have deployed it in cloud and edge devices (like NVIDIA Jetson series and ARM-based embedding system). The comprehensive performance of LFFD is robust enough to support our applications.

In fact, our method is a general detection framework that applicable to one class detection, such as face detection, pedestrian detection, head detection, vehicle detection and so on. In general, an object class, whose average ratio of the longer side and the shorter side is less than 5, is appropriate to apply our framework for detection.

Several practical advantages:

large scale coverage, and easy to extend to larger scales by adding more layers without much latency gain.
detect small objects (as small as 10 pixels) in images with extremely large resolution (8K or even larger) in only one inference.

Accuracy and Latency

We train LFFD on train set of WIDER FACE benchmark. All methods are evaluated on val/test sets under the SIO schema (please refer to the paper for details).

Accuracy on val set of WIDER FACE (The values in () are results from the original papers):

Method Easy Set Medium Set Hard Set DSFD 0.949(0.966) 0.936(0.957) 0.850(0.904) PyramidBox 0.937(0.961) 0.927(0.950) 0.867(0.889) S3FD 0.923(0.937) 0.907(0.924) 0.822(0.852) SSH 0.921(0.931) 0.907(0.921) 0.702(0.845) FaceBoxes 0.840 0.766 0.395 FaceBoxes3.2× 0.798 0.802 0.715 LFFD 0.910 0.881 0.780

Accuracy on test set of WIDER FACE (The values in () are results from the original papers):

Method Easy Set Medium Set Hard Set DSFD 0.947(0.960) 0.934(0.953) 0.845(0.900) PyramidBox 0.926(0.956) 0.920(0.946) 0.862(0.887) S3FD 0.917(0.928) 0.904(0.913) 0.821(0.840) SSH 0.919(0.927) 0.903(0.915) 0.705(0.844) FaceBoxes 0.839 0.763 0.396 FaceBoxes3.2× 0.791 0.794 0.715 LFFD 0.896 0.865 0.770

Accuracy on FDDB:

Method Disc ROC curves score DFSD 0.984 PyramidBox 0.982 S3FD 0.981 SSH 0.977 FaceBoxes3.2× 0.905 FaceBoxes 0.960 LFFD 0.973

In the paper, three hardware platforms are used for latency evaluation: NVIDIA GTX TITAN Xp, NVIDIA TX2 and Rasberry Pi 3 Model B+ (ARM A53).

We report the latency of inference only (for NVIDIA hardwares, data transfer is included), excluding pre-processing and post-processing. The batchsize is set to 1 for all evaluations.

Latency on NVIDIA GTX TITAN Xp (MXNet+CUDA 9.0+CUDNN7.1):

Resolution-> 640×480 1280×720 1920×1080 3840×2160 DSFD 78.08ms(12.81 FPS) 187.78ms(5.33 FPS) 392.82ms(2.55 FPS) 1562.50ms(0.64 FPS) PyramidBox 50.51ms(19.08 FPS) 143.34ms(6.98 FPS) 331.93ms(3.01 FPS) 1344.07ms(0.74 FPS) S3FD 21.75ms(45.95 FPS) 55.73ms(17.94 FPS) 119.53ms(8.37 FPS) 471.31ms(2.21 FPS) SSH 22.44ms(44.47 FPS) 55.29ms(18.09 FPS) 118.43ms(8.44 FPS) 463.10ms(2.16 FPS) FaceBoxes3.2× 6.80ms(147.00 FPS) 12.96ms(77.19 FPS) 25.37ms(39.41 FPS) 111.98ms(8.93 FPS) LFFD 7.60ms(131.40 FPS) 16.37ms(61.07 FPS) 31.27ms(31.98 FPS) 87.79ms(11.39 FPS)

Latency on NVIDIA TX2 (MXNet+CUDA 9.0+CUDNN7.1) presented in the paper:

Resolution-> 160×120 320×240 640×480 FaceBoxes3.2× 11.20ms(89.29 FPS) 19.62ms(50.97 FPS) 72.74ms(13.75 FPS) LFFD 7.30ms(136.99 FPS) 19.64ms(50.92 FPS) 64.70ms(15.46 FPS)

Latency on Respberry Pi 3 Model B+ (ncnn) presented in the paper:

Resolution-> 160×120 320×240 640×480 FaceBoxes3.2× 167.20ms(5.98 FPS) 686.19ms(1.46 FPS) 3232.26ms(0.31 FPS) LFFD 118.45ms(8.44 FPS) 409.19ms(2.44 FPS) 4114.15ms(0.24 FPS)

On NVIDIA platform, TensorRT is the best choice for inference. So we conduct additional latency evaluations using TensorRT (the latency is dramatically decreased!!!). As for ARM based platform, we plan to use MNN and Tengine for latency evaluation. Details can be found in the sub-project face_detection

Getting Started

We implement the proposed method using MXNet Module API.

Prerequirements (global)

Python>=3.5
numpy>=1.16 (lower versions should work as well, but not tested)
MXNet>=1.4.1 (install guide)
cv2=3.x (pip3 install opencv-python==3.4.5.20, other version should work as well, but not tested)

Tips:

use MXNet with cudnn.
build numpy from source with OpenBLAS. This will improve the training efficiency.
make sure cv2 links to libjpeg-turbo, not libjpeg. This will improve the jpeg decode efficiency.

Sub-directory description

face_detection contains the code of training, evaluation and inference for LFFD, the main content of this repo. The trained models of different versions are provided for off-the-shelf deployment.
head_detection contains the trained models for head detection. The models are obtained by the proposed general one class detection framework.
pedestrian_detection contains the trained models for pedestrian detection. The models are obtained by the proposed general one class detection framework.
vehicle_detection contains the trained models for vehicle detection. The models are obtained by the proposed general one class detection framework.
ChasingTrainFramework_GeneralOneClassDetection is a simple wrapper based on MXNet Module API for general one class detection.

Installation

Download the repo:

git clone https://github.com/YonghaoHe/A-Light-and-Fast-Face-Detector-for-Edge-Devices.git

Refer to the corresponding sub-project for detailed usage.

Citation

If you benefit from our work in your research and product, please kindly cite the paper

@inproceedings{LFFD,
title={LFFD: A Light and Fast Face Detector for Edge Devices},
author={He, Yonghao and Xu, Dezhong and Wu, Lifang and Jian, Meng and Xiang, Shiming and Pan, Chunhong},
booktitle={arXiv:1904.10633},
year={2019}
}

To Do List

face detection
pedestrian detection
head detection
vehicle detection

Contact

Yonghao He

E-mails: [email protected] / [email protected]

If you are interested in this work, any innovative contributions are welcome!!!

Internship is open at NLPR, CASIA all the time. Send me your resumes!

GitHub - YonghaoHe/A-Light-and-Fast-Face-Detector-for-Edge-Devices: The repo is...

README.md

A Light and Fast Face Detector for Edge Devices

Recent Update

Introduction

Accuracy and Latency

Getting Started

Prerequirements (global)

Sub-directory description

Installation

Citation

To Do List

Contact

Recommend

GitHub - milesial/Pytorch-UNet: Pytorch implementation of the U-Net for image se...

GitHub - hugsy/defcon_27_windbg_workshop: DEFCON 27 workshop - Modern Debugging...

GitHub - ChrisTheCoolHut/Firmware_Slap: Discovering vulnerabilities in firmware...

GitHub - bazelbuild/rules_nodejs: JavaScript and NodeJS rules for Bazel

GitHub - TheSph1nx/AbsoluteZero: Python APT Backdoor

有多少人上 YouTube 是因为听歌呢？

没人讨论奶飞新剧全裸导演？

Project Euphonia’s Personalized Speech Recognition for Non-Standard Speech

干货 | 从零开始配置前端开发环境

Windows有一个你从来不用的自带软件，其实是超好用的神器

About Joyk