embed_terms.Rd
Generate embeddings of terms from descriptions of visits using the GloVe algorithm. By default, the order of the terms is ignored (all weights in the term co-occurrence matrix are equal to 1) and only terms occurring at least 5 times are embedded.
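The order-free co-occurrence counting and the minimum-count filter described above can be sketched in base R. This is only an illustration, not the package's implementation: the `visits` data, the `", "` separator, the lowered `term_count_min` and the choice to count each term once per visit are all assumptions made for the example.

```r
# Hypothetical input: three visit descriptions with terms separated by ", "
# (both the data and the separator are assumptions made for this sketch).
visits <- c("fever, cough", "cough, rhinitis, fever", "cough, eye")

# Split each description into its set of unique terms.
term_sets <- lapply(strsplit(visits, ", ", fixed = TRUE), unique)

# Count term occurrences and keep only terms that reach the minimum count.
term_count_min <- 2L
term_counts <- table(unlist(term_sets))
kept_terms <- names(term_counts)[as.vector(term_counts) >= term_count_min]

# Build a visit-by-term indicator matrix for the kept terms.
indicator <- t(vapply(
  term_sets,
  function(terms) as.integer(kept_terms %in% terms),
  integer(length(kept_terms))
))
colnames(indicator) <- kept_terms

# Every pair of terms in the same visit contributes weight 1, regardless of
# position, so the co-occurrence matrix is a cross-product with a zeroed diagonal.
cooccurrence <- crossprod(indicator)
diag(cooccurrence) <- 0
```

Here `cooccurrence["cough", "fever"]` counts the visits that mention both terms, while `eye` and `rhinitis` are dropped because they occur only once.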
embed_terms(merged_terms, embedding_size = 20L, term_count_min = 5L, x_max = 10L, n_iter = 15L)
merged_terms | A character vector of visits' descriptions, each containing delimiter-separated terms |
---|---|
embedding_size | An integer giving the length of each term's embedding vector (default: 20) |
term_count_min | The minimum number of occurrences a term needs in order to be embedded (default: 5) |
x_max | The x_max parameter of GloVe (default: 10) |
n_iter | The number of GloVe training epochs (default: 15) |
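For orientation, `x_max` in GloVe caps the co-occurrence count in the weighting function of the loss. A minimal sketch of that function follows, assuming the standard GloVe form and the exponent `alpha = 0.75` from the original GloVe paper; `glove_weight` is a hypothetical helper, and neither the name nor the exponent comes from this page.

```r
# GloVe weighting function: f(x) = (x / x_max)^alpha for x < x_max, else 1.
# alpha = 0.75 is assumed here; it is not stated on this page.
glove_weight <- function(x, x_max = 10, alpha = 0.75) {
  ifelse(x < x_max, (x / x_max)^alpha, 1)
}

# Pairs seen fewer than x_max times are down-weighted; the rest get weight 1.
weights <- glove_weight(c(1, 5, 10, 50))
```

Raising `x_max` extends the range of counts that are down-weighted, so it may matter most when many term pairs co-occur frequently.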
A matrix of embeddings of the terms.
inter_term_vectors <- embed_terms(interviews, term_count_min = 1L)
#> Error in .subset2(public_bind_env, "initialize")(...): unused arguments (word_vectors_size = 20, vocabulary = list(c("fever", "rhinitis", "cough", "eye", "thyroid"), c(3, 3, 4, 4, 6), c(3, 3, 4, 4, 6)))
inter_term_vectors
#> Error in eval(expr, envir, enclos): object 'inter_term_vectors' not found
inter_term_vectors <- embed_terms(interviews, term_count_min = 1L, embedding_size = 10L)
#> Error in .subset2(public_bind_env, "initialize")(...): unused arguments (word_vectors_size = 10, vocabulary = list(c("fever", "rhinitis", "cough", "eye", "thyroid"), c(3, 3, 4, 4, 6), c(3, 3, 4, 4, 6)))
inter_term_vectors
#> Error in eval(expr, envir, enclos): object 'inter_term_vectors' not found
inter_term_vectors <- embed_terms(interviews, embedding_size = 10L, term_count_min = 1, n_iter = 50, x_max = 20)
#> Error in .subset2(public_bind_env, "initialize")(...): unused arguments (word_vectors_size = 10, vocabulary = list(c("fever", "rhinitis", "cough", "eye", "thyroid"), c(3, 3, 4, 4, 6), c(3, 3, 4, 4, 6)))
inter_term_vectors
#> Error in eval(expr, envir, enclos): object 'inter_term_vectors' not found