pycluster | Python Clustering - Including PAM and Clara algorithms

by daveti | Python | Version: Current | License: No License


kandi X-RAY | pycluster Summary

pycluster is a Python library. It has no reported bugs or vulnerabilities, and it has low support. However, no build file is available for pycluster. You can download it from GitHub.

Includes the PAM and Clara algorithms, a parallel PAM implementation, and performance comparison logging (set initMedoidsFixed = True). Example invocations:

python pam_parallel.py euroTry.txt 4 4 > euroTry_parallel_4_4.log
python pam.py euroTry.txt 4 > euroTry_4.log

Support

pycluster has a low active ecosystem.

It has 9 star(s) with 16 fork(s). There are 3 watchers for this library.

It had no major release in the last 6 months.

There is 1 open issue and 1 has been closed. On average, issues are closed in 6 days. There are no pull requests.

It has a neutral sentiment in the developer community.

The latest version of pycluster is current.

Quality

pycluster has no bugs reported.

Security

pycluster has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

License

pycluster does not have a standard license declared.

Check the repository for any license declaration and review the terms closely.

Without a license, all rights are reserved, and you cannot use the library in your applications.

Reuse

pycluster releases are not available. You will need to build from source code and install.

pycluster has no build file. You will need to create the build yourself to build the component from source.

Top functions reviewed by kandi - BETA

kandi has reviewed pycluster and identified the functions below as its top functions. The list is intended to give you a quick view of the functionality pycluster implements and to help you decide whether it suits your requirements.

Run the kmedoids algorithm
Calculate the total cost for a given cost function
Compute the Pearson distance between two vectors
Compute the euclidean distance between two vectors
Return the manhattan distance between two vectors
Calculates the kmedoids algorithm for a given cluster
Imports data from a file
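
To make the functions listed above concrete, here is a minimal pure-Python sketch of the same ideas. It is not the code from the pycluster repository: the function names, the greedy swap search, and the definition of Pearson distance as 1 minus the correlation coefficient are all assumptions made for illustration.

```python
import math
import random


def euclidean(a, b):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def pearson_distance(a, b):
    """1 - Pearson correlation (assumed definition, not taken from pycluster)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 1.0  # correlation undefined; treated as uncorrelated here
    return 1.0 - cov / math.sqrt(var_a * var_b)


def total_cost(data, medoids, distance):
    """Sum of each point's distance to its nearest medoid (medoids are indices)."""
    return sum(min(distance(p, data[m]) for m in medoids) for p in data)


def k_medoids(data, k, distance=euclidean, seed=0):
    """Greedy PAM-style search: accept any medoid swap that lowers the total cost."""
    rng = random.Random(seed)
    medoids = rng.sample(range(len(data)), k)
    best = total_cost(data, medoids, distance)
    improved = True
    while improved:
        improved = False
        for i in range(k):
            for candidate in range(len(data)):
                if candidate in medoids:
                    continue
                trial = medoids[:i] + [candidate] + medoids[i + 1:]
                cost = total_cost(data, trial, distance)
                if cost < best:
                    medoids, best, improved = trial, cost, True
    # Assign every point index to its nearest medoid.
    clusters = {m: [] for m in medoids}
    for index, point in enumerate(data):
        nearest = min(medoids, key=lambda m: distance(point, data[m]))
        clusters[nearest].append(index)
    return medoids, clusters, best
```

Passing `distance=manhattan` or `distance=pearson_distance` to `k_medoids` swaps the metric without changing the search itself.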

pycluster Key Features

No Key Features are available at this moment for pycluster.

pycluster Examples and Code Snippets

No Code Snippets are available at this moment for pycluster.

Community Discussions

Trending Discussions on pycluster

pyclustering visualising xmeans when the matrix has more than three dimensions

pyclustering clarans.get_clusters() returns empty list

How to calculate Silhouette coefficient for k-mediod clustering using pyclustering lib?

Can't get result from the pyclustering object with reticulate

pyclustering: intended method of initializing kmeans

QUESTION

pyclustering visualising xmeans when the matrix has more than three dimensions

Asked 2020-Nov-26 at 09:58

I'm trying to cluster and visualise some data with xmeans from the pyclustering lib. I copied the code directly from the example in the documentation,

...

ANSWER

Answered 2020-Nov-22 at 18:01

In line with the error that you got:

Only objects with size dimension 1 (1D plot), 2 (2D plot) or 3 (3D plot) can be displayed. For multi-dimensional data use 'cluster_visualizer_multidim'.

As the message says, you have to use cluster_visualizer_multidim. The pyclustering 0.10.1 documentation includes an example: https://pyclustering.github.io/docs/0.10.1/html/dc/d6b/classpyclustering_1_1cluster_1_1cluster__visualizer__multidim.html

For example, if your data has more than three dimensions (D > 3) and you want to display the pairs (x0, x1) and (x0, x2), you can do it in the following way:
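
The original answer's code is not reproduced on this page. As a rough sketch of the idea, assuming pyclustering is installed and that `clusters` and `data` come from an earlier clustering run, the pair selection might look like this. The helper names `needs_multidim_visualizer` and `show_pairs` are hypothetical; `cluster_visualizer_multidim`, `append_clusters`, and the `pair_filter` argument of `show` are used as described in the linked documentation.

```python
def needs_multidim_visualizer(data):
    """True when the points have more than three coordinates,
    the case the error message above refers to."""
    return len(data[0]) > 3


def show_pairs(clusters, data, pairs=((0, 1), (0, 2))):
    """Plot only the selected coordinate pairs, e.g. (x0, x1) and (x0, x2)."""
    # Imported lazily so the helpers can be defined without pyclustering present.
    from pyclustering.cluster import cluster_visualizer_multidim

    visualizer = cluster_visualizer_multidim()
    visualizer.append_clusters(clusters, data)
    visualizer.show(pair_filter=[list(pair) for pair in pairs])
```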

Source https://stackoverflow.com/questions/64776260

QUESTION

pyclustering clarans.get_clusters() returns empty list

Asked 2020-Jul-05 at 10:52

I am trying CLARANS clustering with the pyclustering module, but for all the data I've tried with clarans(data, number_clusters, numlocal, maxneighbor), every method returns [] (an empty list), no matter what the parameter values are. I generated random data to test the method, but the result is the same; the only data that works is the iris dataset, datasets.load_iris() from the sklearn module. Am I doing something wrong?

This is the testing data:

...

ANSWER

Answered 2020-Jul-05 at 10:52

You forgot to perform the processing with the process() function before printing out the clusters.
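
A minimal sketch of that call order, assuming pyclustering is installed. The wrapper name `clarans_clusters` is hypothetical, and the constructor arguments are the ones quoted in the question.

```python
def clarans_clusters(data, number_clusters, numlocal, maxneighbor):
    """Run CLARANS and return its clusters as lists of point indices."""
    # Imported lazily so the wrapper can be defined without pyclustering present.
    from pyclustering.cluster.clarans import clarans

    instance = clarans(data, number_clusters, numlocal, maxneighbor)
    instance.process()              # without this call, no clusters are computed
    return instance.get_clusters()  # read the result only after process()
```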

This statement

Source https://stackoverflow.com/questions/62733352

QUESTION

How to calculate Silhouette coefficient for k-mediod clustering using pyclustering lib?

Asked 2018-Nov-20 at 10:04

I would like to try the k-medoid clustering method (PAM) on the dataset https://archive.ics.uci.edu/ml/datasets/seeds

I don't know whether there are libraries other than pyclustering for this purpose. Anyway, how can I compute the Silhouette coefficient for the clustering using this library? It doesn't provide such a method the way sklearn does for k-means.

...

ANSWER

Answered 2018-Jul-27 at 11:36

From the documentation, you can use sklearn.metrics.silhouette_score(X, labels, metric='euclidean', sample_size=None, random_state=None, **kwds). This function returns the mean Silhouette Coefficient over all samples. To obtain the values for each sample, use silhouette_samples. I also recommend looking at this vignette. There is a nice example in there for you to test too.
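
One practical detail when combining the two libraries: pyclustering returns clusters as lists of point indices, while silhouette_score expects one label per sample. The sketch below shows one way to bridge that gap, assuming scikit-learn is installed. The helper names `clusters_to_labels` and `mean_silhouette` are hypothetical.

```python
def clusters_to_labels(clusters, n_samples):
    """Convert [[indices of cluster 0], [indices of cluster 1], ...]
    into a flat list with one cluster id per sample."""
    labels = [-1] * n_samples  # -1 marks an index that appears in no cluster
    for cluster_id, members in enumerate(clusters):
        for index in members:
            labels[index] = cluster_id
    return labels


def mean_silhouette(data, clusters):
    """Mean Silhouette Coefficient for an index-list clustering."""
    # Imported lazily so clusters_to_labels works without scikit-learn present.
    from sklearn.metrics import silhouette_score

    labels = clusters_to_labels(clusters, len(data))
    return silhouette_score(data, labels, metric="euclidean")
```

If any label is still -1 after conversion, the clustering did not assign every sample, and the score should be treated with caution.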

Source https://stackoverflow.com/questions/51540214

QUESTION

Can't get result from the pyclustering object with reticulate

Asked 2018-May-18 at 10:09

I want to cluster some data in R with a Python library (pyclustering). I am using the reticulate package to do this:

...

ANSWER

Answered 2018-May-18 at 10:09

I think the issue is with the use of the pyclustering library and not with reticulate or R. As indicated in the README example, you need to run the process() function on the object:

Source https://stackoverflow.com/questions/50407012

QUESTION

pyclustering: intended method of initializing kmeans

Asked 2017-Aug-16 at 08:20

On Wikipedia, there is a description of how to initialize the kmeans cluster locations according to a random method.

In pyclustering, a Python clustering library, the various clusters are implemented with a high-performance C core. This core is faster than numpy/sklearn, so I want to avoid implementing anything in sklearn/numpy (or else I might lose the speedy feel of the code right now).

However, the kmeans class requires an initial cluster location list to get going. What is the intended method of initializing these cluster locations in pyclustering?

...

ANSWER

Answered 2017-Aug-16 at 08:20

There is automatically generated pyclustering documentation where the API of the kmeans algorithm is described.

For example, if you have 2D data from which two clusters should be extracted, you need to specify the initial centers (pyclustering doesn't generate initial centers; they should be provided by the user):
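
The answer's own code is not reproduced on this page. As a sketch of one possible approach, the random method mentioned in the question can be approximated by picking k distinct data points as starting centers and handing them to kmeans. The helper names `random_initial_centers` and `run_kmeans` are hypothetical, and the second function assumes pyclustering is installed.

```python
import random


def random_initial_centers(data, k, seed=None):
    """Pick k distinct observations from the data as starting centers."""
    rng = random.Random(seed)
    return [list(point) for point in rng.sample(data, k)]


def run_kmeans(data, initial_centers):
    """Cluster the data with pyclustering's kmeans from the given centers."""
    # Imported lazily so random_initial_centers works without pyclustering present.
    from pyclustering.cluster.kmeans import kmeans

    instance = kmeans(data, initial_centers)
    instance.process()  # results are only available after processing
    return instance.get_clusters(), instance.get_centers()
```

Fixing `seed` makes the starting centers reproducible, which helps when comparing runs.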

Source https://stackoverflow.com/questions/45698986

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install pycluster

You can download it from GitHub.
You can use pycluster like any standard Python library. You will need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, and git installed. Make sure that your pip, setuptools, and wheel are up to date. When using pip it is generally recommended to install packages in a virtual environment to avoid changes to the system.

Support

For new features, suggestions, and bugs, create an issue on GitHub. If you have questions, check and ask on the Stack Overflow community page.

Find more information at: