pycluster | Python Clustering - Including PAM and Clara algorithms

 by   daveti Python Version: Current License: No License

kandi X-RAY | pycluster Summary

kandi X-RAY | pycluster Summary

pycluster is a Python library. pycluster has no bugs, it has no vulnerabilities and it has low support. However pycluster build file is not available. You can download it from GitHub.

Including PAM and Clara algorithms. Added parallel PAM implementation. Added performance comparison logging (set initMedoidsFixed = True). python pam_parallel.py euroTry.txt 4 4 > euroTry_parallel_4_4.log. python pam.py euroTry.txt 4 > euroTry_4.log.
Support
    Quality
      Security
        License
          Reuse

            kandi-support Support

              pycluster has a low active ecosystem.
              It has 9 star(s) with 16 fork(s). There are 3 watchers for this library.
              OutlinedDot
              It had no major release in the last 6 months.
              There are 1 open issues and 1 have been closed. On average issues are closed in 6 days. There are no pull requests.
              It has a neutral sentiment in the developer community.
              The latest version of pycluster is current.

            kandi-Quality Quality

              pycluster has no bugs reported.

            kandi-Security Security

              pycluster has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

            kandi-License License

              pycluster does not have a standard license declared.
              Check the repository for any license declaration and review the terms closely.
              OutlinedDot
              Without a license, all rights are reserved, and you cannot use the library in your applications.

            kandi-Reuse Reuse

              pycluster releases are not available. You will need to build from source code and install.
              pycluster has no build file. You will be need to create the build yourself to build the component from source.

            Top functions reviewed by kandi - BETA

            kandi has reviewed pycluster and discovered the below as its top functions. This is intended to give you an instant insight into pycluster implemented functionality, and help decide if they suit your requirements.
            • Run the kmedoids algorithm
            • Calculate the total cost for a given cost function
            • Compute the Pearson distance between two vectors
            • Compute the euclidean distance between two vectors
            • Return the manhattan distance between two vectors
            • Calculates the kmedoids algorithm for a given cluster
            • Imports data from a file
            Get all kandi verified functions for this library.

            pycluster Key Features

            No Key Features are available at this moment for pycluster.

            pycluster Examples and Code Snippets

            No Code Snippets are available at this moment for pycluster.

            Community Discussions

            QUESTION

            pyclustering visualising xmeans when the matrix has more than three dimensions
            Asked 2020-Nov-26 at 09:58

            I'm trying to cluster and visualise some data with xmeans from the pyclustering lib. I copied the code directly from the example in the documentation,

            ...

            ANSWER

            Answered 2020-Nov-22 at 18:01

            In line with the error that you got:

            Only objects with size dimension 1 (1D plot), 2 (2D plot) or 3 (3D plot) can be displayed. For multi-dimensional data use 'cluster_visualizer_multidim'.

            You have to use cluster_visualizer_multidim as it was mentioned. There is a documentation (pyclustering 0.10.1) with an example: https://pyclustering.github.io/docs/0.10.1/html/dc/d6b/classpyclustering_1_1cluster_1_1cluster__visualizer__multidim.html

            For example, if you have a data (D > 3) and you want to display (x0, x1) and (x0, x2) then you can display it in the following way:

            Source https://stackoverflow.com/questions/64776260

            QUESTION

            pyclustering clarans.get_clusters() returns empty list
            Asked 2020-Jul-05 at 10:52

            I am trying CLARANS clustering with pyclustering module, but for all the data I've tried clarans(data, number_clusters, numlocal, maxneighbor) all the methods returns [] (an empty list) no matter what the parameter values are. I made a random data to test the method, but the answer is the same, the only data that works is the iris data datasets.load_iris() form sklearn module. Am I doing something wrong?

            This is the testing data:

            ...

            ANSWER

            Answered 2020-Jul-05 at 10:52

            You forgot to peform the processing using process() function before printing out the clusters.

            This statement

            Source https://stackoverflow.com/questions/62733352

            QUESTION

            How to calculate Silhouette coefficient for k-mediod clustering using pyclustering lib?
            Asked 2018-Nov-20 at 10:04

            I like try the k-mediod clustering method (PAM) over the dataset https://archive.ics.uci.edu/ml/datasets/seeds

            I don't know whether there exists other libraries other than pyclustering for this purpose. Anyway, how can I compute Silhouette coefficient for the clustering using this library? It don't provide such a method as an k-means with sklearn.

            ...

            ANSWER

            Answered 2018-Jul-27 at 11:36

            From the documentation, you can use sklearn.metrics.silhouette_score(X, labels, metric=’euclidean’, sample_size=None, random_state=None, **kwds). This function returns the mean Silhouette Coefficient over all samples. To obtain the values for each sample, use silhouette_samples. I also recommend to see this vignette. There is a nice example in there for you to test too.

            Source https://stackoverflow.com/questions/51540214

            QUESTION

            Can't get result from the pyclustering object with reticulate
            Asked 2018-May-18 at 10:09

            I want to cluster some data in R with library from Python (pyclustering). I am using reticulate package to do this:

            ...

            ANSWER

            Answered 2018-May-18 at 10:09

            I think the issue is with use of pyclustering library and not with reticulate or R. As indicated in the README example, you need to run process() function on object:

            Source https://stackoverflow.com/questions/50407012

            QUESTION

            pyclustering: intended method of initializing kmeans
            Asked 2017-Aug-16 at 08:20

            On wikipedia, there is a description of how to initialize the kmeans cluster locations according to a random method.

            In pyclustering, a python clustering library, the various clusters are implemented with a high performance c-core. This core is faster than numpy/sklearn, so I want to avoid implementing anything in sklearn/numpy (or else I might lose the speedy feel of the code right now).

            However, the kmeans class requires an initial cluster location list to get going. What is the intended method of initializing these cluster locations in pyclustering?

            ...

            ANSWER

            Answered 2017-Aug-16 at 08:20

            There is automatically generated pyclustering documentation where API of kmeans algorithm is described.

            For example, you have a 2D-data where two clusters should extracted, then you need to specify initial centers (pyclustering doesn't generate initial centers they should be provided by user):

            Source https://stackoverflow.com/questions/45698986

            Community Discussions, Code Snippets contain sources that include Stack Exchange Network

            Vulnerabilities

            No vulnerabilities reported

            Install pycluster

            You can download it from GitHub.
            You can use pycluster like any standard Python library. You will need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, and git installed. Make sure that your pip, setuptools, and wheel are up to date. When using pip it is generally recommended to install packages in a virtual environment to avoid changes to the system.

            Support

            For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .
            Find more information at:

            Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items

            Find more libraries
            CLONE
          • HTTPS

            https://github.com/daveti/pycluster.git

          • CLI

            gh repo clone daveti/pycluster

          • sshUrl

            git@github.com:daveti/pycluster.git

          • Stay Updated

            Subscribe to our newsletter for trending solutions and developer bootcamps

            Agree to Sign up and Terms & Conditions

            Share this Page

            share link