No datasets match
- doc2vec::be_parliament_2020Corpus with Questions asked in the Belgium Federal Parliament in 2020
- nametagger::europeananewsTagged news paper articles from Europeana
- textplot::example_btmExample Biterm Topic Model
- textplot::example_embeddingExample word embedding matrix
- textplot::example_embedding_clustersExample words emitted in a ETM text clustering model
- textplot::example_udpipeExample annotation of text using udpipe
- textrank::jobofferThe text of a job offer, annotated with the package udpipe
- tokenizers.bpe::belgium_parliamentDataset from 2017 with Questions asked in the Belgium Federal Parliament
- topicmodels.etm::ng20Bag of words sample of the 20 newsgroups dataset
- udpipe::brussels_listingsBrussels AirBnB address locations available at www.insideairbnb.com
- udpipe::brussels_reviewsReviews of AirBnB customers on Brussels address locations available at www.insideairbnb.com
- udpipe::brussels_reviews_annoReviews of the AirBnB customers which are tokenised, POS tagged and lemmatised
- udpipe::brussels_reviews_w2v_embeddings_lemma_nlAn example matrix of word embeddings
- udpipe::udpipe_annotation_paramsList with training options set by the UDPipe community when building models based on the Universal Dependencies data
