
A guide to working with character data in R

 5 years ago
source link: https://www.tuicool.com/articles/hit/VNbUFjZ

R is primarily a language for working with numbers, but we often need to work with text as well. Whether it's formatting text for reports or analyzing natural language data, R provides a number of facilities for working with character data. Handling Strings with R, a free (CC-BY-NC-SA) e-book by UC Berkeley's Gaston Sanchez, provides an overview of the ways you can manipulate characters and strings with R.
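As a rough illustration of the kind of facilities meant here, the sketch below uses a handful of base R functions (`sprintf`, `nchar`, `toupper`, `substr`, `gsub`, `paste`) to format and manipulate a string. The variable names and values are made up for this example and are not taken from the book.

```r
# Illustrative values only; these are assumptions, not examples from the book.
name  <- "Ada"
score <- 91.256

# Format text for a report: %s inserts a string, %.1f rounds to one decimal.
line <- sprintf("%s scored %.1f points", name, score)

# Inspect and manipulate the resulting string.
n_chars  <- nchar(line)                                # number of characters
shout    <- toupper(name)                              # upper-case conversion
prefix   <- substr(line, 1, 3)                         # characters 1 through 3
replaced <- gsub("points", "pts", line, fixed = TRUE)  # literal replacement
joined   <- paste("R", "strings", sep = "-")           # concatenate with a separator

print(line)
print(replaced)
```

All of these functions are vectorised, so the same calls work unchanged on a character vector with many elements.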

There are many useful sections in the book, but a few selections include:

Note that the book does not cover analysis of natural language data, for which you might want to check out the CRAN Task View on Natural Language Processing or the book Text Mining with R: A Tidy Approach. It is also, sadly, silent on character encoding in R, a topic that often causes problems when dealing with text data, especially from international sources. Nonetheless, the book is a really useful overview of working with text in R, and has been updated extensively since it was last published in 2014. You can read Handling Strings with R at the link below.

Gaston Sanchez: Handling Strings with R

