corpus 0.10.2
  • Get started
  • Reference
  • Articles
    • Chinese text handling
    • Finding gendered words
    • Stemming Words
    • Text data in Corpus and other packages
  • Changelog

Reference

All functions

abbreviations_de abbreviations_en abbreviations_es abbreviations_fr abbreviations_it abbreviations_pt abbreviations_ru

Abbreviations

affect_wordnet

WordNet-Affect Lexicon

corpus-package

The Corpus Package

corpus_frame() as_corpus_frame() is_corpus_frame()

Corpus Data Frame

as_corpus_text() is_corpus_text()

Text Objects

federalist

The Federalist Papers

gutenberg_corpus()

Project Gutenberg Corpora

new_stemmer()

Stemmer Construction

print(<corpus_frame>) format(<corpus_frame>)

Corpus Data Frame Printing

read_ndjson()

JSON Data Input

sentiment_afinn

AFINN Sentiment Lexicon

stem_snowball()

Snowball Stemmer

stopwords_da stopwords_de stopwords_en stopwords_es stopwords_fi stopwords_fr stopwords_hu stopwords_it stopwords_nl stopwords_no stopwords_pt stopwords_ru stopwords_sv

Stop Words

term_matrix() term_counts()

Term Frequency Tabulation

term_stats()

Term Statistics

text_filter() `text_filter<-`()

Text Filters

text_locate() text_count() text_detect() text_match() text_sample() text_subset()

Searching for Terms

text_split() text_nsentence()

Segmenting Text

text_stats()

Text Statistics

text_sub()

Text Subsequences

text_tokens() text_ntoken()

Text Tokenization

text_types() text_ntype()

Text Type Sets

Contents

Developed by Leslie Huang, Patrick O. Perry.

Site built with pkgdown 1.6.1.