grepNonStandardCharacters {Ecdat} | R Documentation |
Return the indices of elements of x
containing characters that
are not in standardCharacters
.
grepNonStandardCharacters(x, value=FALSE, standardCharacters=c(letters, LETTERS, ' ','.', ',', 0:9, '\"', "\'", '-', '_', '(', ')', '[', ']', '\n'), ... )
x |
character vector in which it is desired to identfy elements
containing characters not in |
value |
logical: |
standardCharacters |
Characters to overlook in |
... |
optional arguments for |
1. x. <- strsplit(x, ”): convert the input character vector to a
list of vectors of character vectors with nchar(x.[i])
== 1
for i in 1:length(x).
2. sapply(x., ...) to identify all elements for which any element of
x[[i]] is not in standardCharacters
.
an integer vector identifying all elements of x
containing a
character not in standardCharacters
.
Spencer Graves
grep
,
regexpr
,
subNonStandardCharacters
,
showNonASCII
Names <- c('Raul', 'Ra`l', 'Torres,Raul', 'Torres, Raul') # confusion in character sets can create # names like Names[2] chk <- grepNonStandardCharacters(Names) all.equal(chk, 2) chkv <- grepNonStandardCharacters(Names, TRUE) all.equal(chkv, 'Ra`l')