remove_corrupt_utf8.Rd
Removes corrupt UTF-8 characters that might cause issues. Running this on your text is recommended.
remove_corrupt_utf8(text)

# S3 method for corpus
remove_corrupt_utf8(text)

# S3 method for documents
remove_corrupt_utf8(text)

# S3 method for document
remove_corrupt_utf8(text)
text | An object inheriting from class |
---|
# NOT RUN {
init_textanalysis()

# build document
doc <- string_document("this document is clean")

# replaces in place!
remove_corrupt_utf8(doc)
# }
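To illustrate what "corrupt UTF-8" means, the sketch below uses only base R (`validUTF8()` and `iconv()`) on a plain character string. It is not how `remove_corrupt_utf8()` is implemented and does not touch document or corpus objects; the variable names `bad` and `clean` are made up for the example, and the exact behaviour of `iconv()` with `sub = ""` can vary by platform.

```r
# Illustration only: base R, no textanalysis objects involved.
# Build a string containing the byte 0xFF, which never occurs in valid UTF-8.
bad <- rawToChar(as.raw(c(0x63, 0x6c, 0x65, 0xff, 0x61, 0x6e)))

# Check whether the string is valid UTF-8.
validUTF8(bad)  # FALSE

# Re-encode UTF-8 to UTF-8, substituting nothing for non-convertible bytes.
clean <- iconv(bad, from = "UTF-8", to = "UTF-8", sub = "")

# The cleaned string should now pass the validity check.
validUTF8(clean)
```

Checking with `validUTF8()` before and after is a cheap way to confirm whether a text needed cleaning in the first place.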