doc2vec - Distributed Representations of Sentences, Documents and Topics

Learn vector representations of sentences, paragraphs or documents by using the 'Paragraph Vector' algorithms, namely the distributed bag of words ('PV-DBOW') and the distributed memory ('PV-DM') model. The techniques in the package are detailed in the paper "Distributed Representations of Sentences and Documents" by Mikolov et al. (2014), available at <doi:10.48550/arXiv.1405.4053>. The package also provides an implementation to cluster documents based on these embedding using a technique called top2vec. Top2vec finds clusters in text documents by combining techniques to embed documents and words and density-based clustering. It does this by embedding documents in the semantic space as defined by the 'doc2vec' algorithm. Next it maps these document embeddings to a lower-dimensional space using the 'Uniform Manifold Approximation and Projection' (UMAP) clustering algorithm and finds dense areas in that space using a 'Hierarchical Density-Based Clustering' technique (HDBSCAN). These dense areas are the topic clusters which can be represented by the corresponding topic vector which is an aggregate of the document embeddings of the documents which are part of that topic cluster. In the same semantic space similar words can be found which are representative of the topic. More details can be found in the paper 'Top2Vec: Distributed Representations of Topics' by D. Angelov available at <doi:10.48550/arXiv.2008.09470>.

Last updated

doc2vecembeddingsnatural-language-processingparagraph2vecword2veccpp

6.00 score 51 stars 39 scripts 338 downloads

image.libfacedetection - Convolutional Neural Network for Face Detection

An open source library for face detection in images. Provides a pretrained convolutional neural network based on <https://github.com/ShiqiYu/libfacedetection> which can be used to detect faces which have size greater than 10x10 pixels.

Last updated

canny-edge-detectioncomputer-visioncontoursdarknetdlibf9harris-cornersharris-interest-point-detectorhog-featuresimage-algorithmsimage-recognitionopenpanootsusurfcppopenmp

5.33 score 285 stars 15 scripts 246 downloads

image.CannyEdges - Implementation of the Canny Edge Detector for Images

An implementation of the Canny Edge Detector for detecting edges in images. The package provides an interface to the algorithm available at <https://github.com/Neseb/canny>.

Last updated

canny-edge-detectioncomputer-visioncontoursdarknetdlibf9harris-cornersharris-interest-point-detectorhog-featuresimage-algorithmsimage-recognitionopenpanootsusurffftw3cpp

5.15 score 285 stars 6 scripts 201 downloads

image.Otsu - Otsu's Image Segmentation Method

An implementation of the Otsu's Image Segmentation Method described in the paper: "A C++ Implementation of Otsu's Image Segmentation Method". The algorithm is explained at <doi:10.5201/ipol.2016.158>.

Last updated

canny-edge-detectioncomputer-visioncontoursdarknetdlibf9harris-cornersharris-interest-point-detectorhog-featuresimage-algorithmsimage-recognitionopenpanootsusurfcpp

5.15 score 285 stars 225 downloads

image.LineSegmentDetector - Detect Line Segments in Images

An implementation of the Line Segment Detector on digital images described in the paper: "LSD: A Fast Line Segment Detector with a False Detection Control" by Rafael Grompone von Gioi et al (2012). The algorithm is explained at <doi:10.5201/ipol.2012.gjmr-lsd>.

Last updated

canny-edge-detectioncomputer-visioncontoursdarknetdlibf9harris-cornersharris-interest-point-detectorhog-featuresimage-algorithmsimage-recognitionopenpanootsusurfcpp

5.15 score 285 stars 7 scripts 250 downloads