NEWS

doc2vec 0.2.2 (2025-11-27)

Fix DOI in DESCRIPTION
Remove C++11 from Makevars

doc2vec 0.2.1

Make sure words are limited to at most 100 characters when getting embeddings of documents (issue #20)
Limit documents to 1000 words by explicitly keeping only the first 1000 words per document
Provide a warning if doc_id contains spaces

doc2vec 0.2.0 (2021-03-27)

Add top2vec semantic clustering algorithm
Allow transfer learning in paragraph2vec by passing on a pretrained set of word vectors to initialise the word embeddings with (no initialisation of the document embeddings)
In paragraph2vec: close opened files directly after training instead of waiting for R garbage collection to kick in
Added dataset 'be_parliament_2020' with questions asked by members of the Belgian Federal Parliament in 2020

doc2vec 0.1.1 (2021-01-21)

Fixes for valgrind R CMD checks
- Fixes for destructors of Vocabulary
- Remove WMD
Added txt_count_words and removed Suggests dependency of udpipe

doc2vec 0.1.0 (2020-12-10)

Initial package based on https://github.com/hiyijian/doc2vec commit dec123e891f17ea664053ee7575b0e5e7dae4fca