GitHub - urduhack/urduhack: Natural Language Processing library for ( ??)Urdu la...
source link: https://github.com/urduhack/urduhack
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
README.md
Urduhack: NLP library for ( ?? ) Urdu language
Feature Support
- Normalization
- Arabic and Urdu Unicode Redundancy Problem
- Character Normalization
- Combined Characters Normalization
- Diacritics Removal
- Spaces Before & After Digits
- Spaces After Punctuations
- Joined Words Fix
- Tokenization
- Sentence Tokenization
- Words Tokenization
Roadmap
- Classification
- Sentimental Analysis
- Sentence Classification
- Documents Classification
- Name Entity Recognition
- Image to Text
- Speak to Text
Installation
Urduhack officially supports Python 3.6–3.7, and runs great on PyPy.
To install Requests, simply use pip
$ pip install urduhack
Documentation
Fantastic documentation is available at https://urduhack.readthedocs.io/
How to Contribute
- Check for open issues or open a fresh issue to start a discussion around a feature idea or a bug. There is a Contributor Friendly tag for issues that should be ideal for people who are not very familiar with the codebase yet.
- Write a test which shows that the bug was fixed or that the feature works as expected.
- Send a pull request and bug the maintainer until it gets merged and published. :)
Community
Get updates on UrduHack nlp development and chat with the project maintainers and community members.
Contributors
Special thanks to everyone who contributed to getting the UrduHack to the current state.
Sponsors
Support this project by becoming a sponsor. Your logo will show up here with a link to your website. [Become a sponsor]
Copyright and license
Code released under the MIT License.
Recommend
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK