Face Attention Network

PyTorch implementation of the Face Attention Network as described in "Face Attention Network: An Effective Face Detector for the Occluded Faces". The baseline is RetinaNet, following this repo.

Requirements

Python 3
PyTorch 0.4
torchvision
tensorboardX

Installation

Install packages.

sudo apt-get install tk-dev python-tk

pip install cffi

pip install cython

pip install pandas

pip install tensorboardX

Build NMS.

cd Face_Attention_Network/lib
sh build.sh

Create folders.

cd Face_Attention_Network/
mkdir ckpt mAP_txt summary weight

Datasets

Prepare three CSV or TXT files: a training annotations file, a validation annotations file, and a label encoding file.

Annotations format

Two examples are as follows:

$image_path/img_1.jpg x1 y1 x2 y2 label
$image_path/img_2.jpg . . . . .

Images with more than one bounding box should use one row per box. When an image does not contain any bounding box, each of the five fields after the image path is written as '.', separated by spaces.
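The annotation format above can be read with a few lines of Python. This is a minimal sketch, not the loader used by this repo: the function name parse_annotations is hypothetical, and it assumes whitespace-separated fields, image paths without spaces, and numeric coordinates.

```python
def parse_annotations(lines):
    """Group bounding boxes by image path.

    Assumed row format (whitespace-separated):
        image_path x1 y1 x2 y2 label
    A row whose five trailing fields are all '.' marks an image with no boxes.
    """
    boxes = {}
    for line_no, line in enumerate(lines, start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != 6:
            raise ValueError(
                f"line {line_no}: expected 6 fields, got {len(fields)}"
            )
        path, *rest = fields
        # Register the image even when it has no boxes.
        image_boxes = boxes.setdefault(path, [])
        if all(field == "." for field in rest):
            continue
        x1, y1, x2, y2 = (float(value) for value in rest[:4])
        image_boxes.append((x1, y1, x2, y2, rest[4]))
    return boxes
```

Calling parse_annotations(open(path)) on an annotations file returns a dict that maps each image path to its list of (x1, y1, x2, y2, label) tuples. An image without faces maps to an empty list.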

Label encoding file

A TXT file is needed to map each label to an ID. Each line contains one label name and its ID. One example is as follows:

face 0
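A label encoding file in this form can be checked before training with a short parser. This is a sketch under assumptions: parse_classes is a hypothetical helper, not part of this repo, and it assumes one whitespace-separated "name id" pair per line with integer IDs.

```python
def parse_classes(lines):
    """Map each label name to its integer ID.

    Assumed row format (whitespace-separated): label_name id
    """
    classes = {}
    for line_no, line in enumerate(lines, start=1):
        parts = line.split()
        if not parts:
            continue  # skip blank lines
        if len(parts) != 2:
            raise ValueError(f"line {line_no}: expected 'label id'")
        name, class_id = parts[0], int(parts[1])
        if name in classes:
            raise ValueError(f"line {line_no}: duplicate label '{name}'")
        classes[name] = class_id
    return classes
```

For the one-line example above, parse_classes(["face 0"]) gives a dict with the single entry "face" mapped to 0.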

Pretrained Model

The backbone can be ResNet-18, 34, 50, 101, or 152. Download the pretrained weights you need from the links below and put them in '/weight'.

resnet18: https://download.pytorch.org/models/resnet18-5c106cde.pth
resnet34: https://download.pytorch.org/models/resnet34-333f7ec4.pth
resnet50: https://download.pytorch.org/models/resnet50-19c8e357.pth
resnet101: https://download.pytorch.org/models/resnet101-5d3b4d8f.pth
resnet152: https://download.pytorch.org/models/resnet152-b121ed2d.pth
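The download step can also be scripted. The sketch below uses only the Python standard library and the URLs listed above. The helper names weight_path and download_weight are hypothetical, and the default directory "weight" assumes the script runs from the folder where the weight directory was created.

```python
import os
import urllib.request

# URLs copied from the list above, keyed by ResNet depth.
RESNET_URLS = {
    18: "https://download.pytorch.org/models/resnet18-5c106cde.pth",
    34: "https://download.pytorch.org/models/resnet34-333f7ec4.pth",
    50: "https://download.pytorch.org/models/resnet50-19c8e357.pth",
    101: "https://download.pytorch.org/models/resnet101-5d3b4d8f.pth",
    152: "https://download.pytorch.org/models/resnet152-b121ed2d.pth",
}


def weight_path(depth, weight_dir="weight"):
    """Return the local path where the weights for this depth are stored."""
    if depth not in RESNET_URLS:
        raise ValueError(f"unsupported depth: {depth}")
    filename = os.path.basename(RESNET_URLS[depth])
    return os.path.join(weight_dir, filename)


def download_weight(depth, weight_dir="weight"):
    """Download the weights for this depth unless the file already exists."""
    destination = weight_path(depth, weight_dir)
    if not os.path.exists(destination):
        os.makedirs(weight_dir, exist_ok=True)
        urllib.request.urlretrieve(RESNET_URLS[depth], destination)
    return destination
```

For example, download_weight(50) fetches resnet50-19c8e357.pth into the weight folder on the first call and returns the existing path on later calls.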

Training

python train.py --csv_train <$path/train.txt> --csv_val <$path/val.txt> --csv_classes <$path/classes.txt> --depth <50> --pretrained resnet50-19c8e357.pth --model_name <model name to save>
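To make the placeholders concrete, the command can be assembled from Python. This is only a sketch: build_train_command is a hypothetical helper, the flags are the ones shown in the command above, and the file paths and model name in the usage note are made-up examples.

```python
import sys


def build_train_command(csv_train, csv_val, csv_classes, depth,
                        pretrained, model_name):
    """Build the argument list for train.py using the flags shown above."""
    return [
        sys.executable, "train.py",
        "--csv_train", csv_train,
        "--csv_val", csv_val,
        "--csv_classes", csv_classes,
        "--depth", str(depth),
        "--pretrained", pretrained,
        "--model_name", model_name,
    ]
```

A call such as build_train_command("data/train.txt", "data/val.txt", "data/classes.txt", 50, "resnet50-19c8e357.pth", "fan_resnet50") returns a list that can be passed to subprocess.run from the repository root. The depth should match the downloaded backbone file.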

Visualization Result

Detection result

Attention maps at different levels (P3~P7)
