Cosine similarity for text documents
COS_TEXT( text_vector1 = NULL, text_vector2 = NULL, threads = 1, separator = " " )
text_vector1 | a character string vector representing text documents (it should have the same length as the text_vector2) |
---|---|
text_vector2 | a character string vector representing text documents (it should have the same length as the text_vector1) |
threads | a numeric value specifying the number of cores to run in parallel |
separator | specifies the separator used between words of each character string in the text vectors |
a numeric vector
The function calculates the cosine distance between pairs of text sequences of two character string vectors
library(textTinyR) vec1 = c('use this', 'function to compute the') vec2 = c('cosine distance', 'between text sequences') out = COS_TEXT(text_vector1 = vec1, text_vector2 = vec2, separator = " ")