largeVis | largeVis algorithm | Machine Learning library
kandi X-RAY | largeVis Summary
kandi X-RAY | largeVis Summary
An implementation of the largeVis algorithm for visualizing large, high-dimensional datasets, for R
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of largeVis
largeVis Key Features
largeVis Examples and Code Snippets
Community Discussions
Trending Discussions on largeVis
QUESTION
I am training a doc2vec model with multiple tags, so it includes the typical doc "ID" tag and then it also contains a label tag "Category 1." I'm trying to graph the results such that I get the doc distribution in a 2d (using LargeVis) but am able to color different tags. My problem is that the vectors the model returns exceed the number of training observations by 5 making difficult to align the original tags with the vectors:
...ANSWER
Answered 2018-Oct-08 at 18:37The number of doc-vectors learned will be equal to the number of unique tags you've supplied. It looks like perhaps you've supplied 17,717 unique-IDs and then 5 extra repeating category-tags. Thus, there are 17,722 total known doc-tags (and thus corresponding learned doc-vectors). So, this is expected behavior.
If you need to pass just the 17,717 per-doc vectors to some other process (like a dimensionality-reduction to 2-d), you'll have to pull them out of the model. You could pull them out 1-by-1 – model.docvecs[doc_id]
– and put them into whatever form the next step needs.
If your doc-IDs happen to have been plain ints, from 0 to 17,716, then they will in fact be the first 17,716 entries in the model.docvecs.doctag_syn0
array, which might make things easier - you may just be able to use a view into that array. (The last five rows will be the string tags.)
I would suggest doing all your steps first without the extra complication of adding the secondary category string tags. Such extra tags may help or hurt vector-usefulness for downstream tasks in different situations, but definitely (as you've seen) make things a bit more complicated. So getting baseline results and outputs, without that complication, may be helpful.
QUESTION
I am trying to run HDBSCAN algortihm in R via largeVis package. For visualization of clusters. I am using gplot function in largeVis. Is it possible to change the labels of my data points in the plot from integers to string? I am using Iris dataset with little modification in "class" column and using "class" column as row headers. Is it possible to visualize my current row headers in the plot instead of node numbers?
...ANSWER
Answered 2017-Apr-05 at 20:53The function itself doesn't have an easy option to plot the rownames, bit it does return a ggplot
object and you can add additional layers to that. Here's how you can plot with the rownames
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install largeVis
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page