GitHub - facebookresearch/wav2letter: Facebook AI Research Automatic Speech Reco...

 6 years ago
source link: https://github.com/facebookresearch/wav2letter

wav2letter++

Important Note:

wav2letter has been moved and consolidated into Flashlight in the ASR application.

Future wav2letter development will occur in Flashlight.

To build the old, pre-consolidation version of wav2letter, check out the wav2letter v0.2 release, which depends on the old Flashlight v0.2 release. The wav2letter-lua project can be found on the wav2letter-lua branch.

For more information on wav2letter++, see or cite this arXiv paper.

Recipes

This repository includes recipes to reproduce the following research papers, as well as pre-trained models. Exact reproduction of results requires Flashlight <= 0.3.2. Papers contained here include:

Data preparation for training and evaluation can be found in the data directory.

Building the Recipes

First, install Flashlight with the ASR application; the 0.3 branch is required.

mkdir build && cd build
cmake .. && make -j8

If Flashlight or ArrayFire are installed in nonstandard paths via a custom CMAKE_INSTALL_PREFIX, they can be found by passing

-Dflashlight_DIR=[PREFIX]/usr/share/flashlight/cmake/ -DArrayFire_DIR=[PREFIX]/usr/share/ArrayFire/cmake

when running cmake.
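As a rough sketch of how the two flags fit into the build commands above, the snippet below assembles the cmake invocation from a single install prefix and prints it as a dry run. The `PREFIX` default of `$HOME/opt` and the variable names `FLASHLIGHT_DIR`, `ARRAYFIRE_DIR`, and `CMAKE_ARGS` are illustrative assumptions, not part of the project; substitute whatever CMAKE_INSTALL_PREFIX was used when installing Flashlight and ArrayFire.

```shell
# Hypothetical install prefix (assumption): replace with your own CMAKE_INSTALL_PREFIX.
PREFIX="${PREFIX:-$HOME/opt}"

# Directories holding the CMake package config files, following the layout given above.
FLASHLIGHT_DIR="$PREFIX/usr/share/flashlight/cmake"
ARRAYFIRE_DIR="$PREFIX/usr/share/ArrayFire/cmake"

# Extra arguments that let cmake locate both packages.
CMAKE_ARGS="-Dflashlight_DIR=$FLASHLIGHT_DIR -DArrayFire_DIR=$ARRAYFIRE_DIR"

# Dry run: print the configure command instead of executing it.
echo "cmake .. $CMAKE_ARGS"

# To actually build, run from the repository root:
#   mkdir -p build && cd build
#   cmake .. $CMAKE_ARGS && make -j8
```

Printing the command first makes it easy to confirm that both paths exist before starting a long build.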

Join the wav2letter community

License

wav2letter++ is MIT-licensed, as found in the LICENSE file.

