sweetviz | compare datasets , target values | Data Visualization library

 by   fbdesignpro Python Version: 2.3.1 License: MIT

kandi X-RAY | sweetviz Summary

kandi X-RAY | sweetviz Summary

sweetviz is a Python library typically used in Institutions, Learning, Education, Analytics, Data Visualization, Pandas applications. sweetviz has no bugs, it has no vulnerabilities, it has build file available, it has a Permissive License and it has medium support. You can install using 'pip install sweetviz' or download it from GitHub, PyPI.

Sweetviz is an open-source Python library that generates beautiful, high-density visualizations to kickstart EDA (Exploratory Data Analysis) with just two lines of code. Output is a fully self-contained HTML application. The system is built around quickly visualizing target values and comparing datasets. Its goal is to help quick analysis of target characteristics, training vs testing data, and other such data characterization tasks.

            kandi-support Support

              sweetviz has a medium active ecosystem.
              It has 2413 star(s) with 245 fork(s). There are 48 watchers for this library.
              There were 3 major release(s) in the last 12 months.
              There are 42 open issues and 72 have been closed. On average issues are closed in 92 days. There are 8 open pull requests and 0 closed requests.
              It has a neutral sentiment in the developer community.
              The latest version of sweetviz is 2.3.1

            kandi-Quality Quality

              sweetviz has 0 bugs and 0 code smells.

            kandi-Security Security

              sweetviz has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
              sweetviz code analysis shows 0 unresolved vulnerabilities.
              There are 0 security hotspots that need review.

            kandi-License License

              sweetviz is licensed under the MIT License. This license is Permissive.
              Permissive licenses have the least restrictions, and you can use them in most projects.

            kandi-Reuse Reuse

              sweetviz releases are available to install and integrate.
              Deployable package is available in PyPI.
              Build file is available. You can build the component from source.
              Installation instructions, examples and code snippets are available.
              sweetviz saves you 5073 person hours of effort in developing the same functionality from scratch.
              It has 10964 lines of code, 122 functions and 40 files.
              It has high code complexity. Code complexity directly impacts maintainability of the code.

            Top functions reviewed by kandi - BETA

            kandi has reviewed sweetviz and discovered the below as its top functions. This is intended to give you an instant insight into sweetviz implemented functionality, and help decide if they suit your requirements.
            • Analyze a feature to a dictionary
            • Add base stats to a dictionary
            • Fill out missing counts in the counts dict
            • Get counts of unique values
            • Plot correlation data
            • Set the font style
            • Plot a heatmap
            • Wrap a custom separator
            • Display a notebook
            • Log HTML content
            • End the experiment
            • Generate the html report
            • Calculate the correlation ratio
            • Replace NaN values with value
            • Remove incomplete samples
            • Convert data to a Pandas DataFrame
            • Generates a html summary for a feature summary
            • Format a number
            • Creates the numeric group data for each score
            • Compute the ILSU U
            • Compute conditional entropy
            • Show report
            • Generate HTML for a feature summary target
            • Recursively generate HTML detail
            • Analyze a feature
            • Generate HTML report
            Get all kandi verified functions for this library.

            sweetviz Key Features

            No Key Features are available at this moment for sweetviz.

            sweetviz Examples and Code Snippets

            Showing integer columns as categorical and throwing error in sweetviz compare
            Pythondot img1Lines of Code : 3dot img1License : Strong Copyleft (CC BY-SA 4.0)
            copy iconCopy
            feature_config = sweetviz.FeatureConfig(force_cat = ["Web Desktop Interaction"])
            report = sweetviz.compare(pdf, pdf_2, None, feature_config)
            Tkinter Window Closing Automatically after one run
            Pythondot img2Lines of Code : 19dot img2License : Strong Copyleft (CC BY-SA 4.0)
            copy iconCopy
                advert_report = sv.analyze(X_all,pairwise_analysis="off",target_feat=col)
                advert_report = sv.analyze(X_all,pairwise_analysis="on",target_feat=col)

            Community Discussions


            Showing integer columns as categorical and throwing error in sweetviz compare
            Asked 2021-May-27 at 17:51

            If I analyze these two datasets individually, I don't get any error and the I also get the viz of all the integer columns. But when I try to compare these dataframe, I get the below error.

            Cannot convert series 'Web Visit' in COMPARED from its TYPE_CATEGORICAL to the desired type TYPE_BOOL.

            I also tried the FeatureConfig to skip it, but no avail.

            pid_compare = sweetviz.compare([pdf,"234_7551009"],[pdf_2,"215_220941058"])



            Answered 2021-May-27 at 17:51

            Maintainer of the lib here; this question was asked in the git also, but it will be useful to detail the answer here.

            After looking at your data provided in the link above, it looks like the first dataframe (pdf) only contains 0 & 1, so it is classified as boolean so it cannot be compared against the second one which is categorical (that one has 0,1,2,3 as you probably know!).

            The system will be able to handle it if you use FeatureConfig to force the first dataframe to be considered CATEGORICAL.

            I just tried the following and it seems to work, let me know if it helps!

            Source https://stackoverflow.com/questions/67622175


            Github page's not showing enumeration properly
            Asked 2021-Apr-06 at 01:03

            I'm working on my portfolio using a Github Page: https://diegoguisasola.github.io/

            I have the following problem:

            This is a link to my README.md: https://github.com/DiegoGuisasola/DiegoGuisasola.github.io/blob/main/README.md

            As you can see, in the section called Data Science I have 2 projects. I'm enumerating them as follows:

            1- [EDA y algoritmos de ML para la predicción de precios de propiedades inmobiliarias (Spanish)]

            2- Pokemon EDA (English)

            But when I go the my portfolio page, it is shown as follows:

            1- [EDA y algoritmos de ML para la predicción de precios de propiedades inmobiliarias (Spanish)]

            1- Pokemon EDA (English)

            This is my html code:



            Answered 2021-Apr-05 at 22:57

            Indent the child block of your list items.

            You can not have any non-list item blocks between list items. However, you can have child blocks of a list item. All child blocks must be indented at least one level.

            Source https://stackoverflow.com/questions/66927148


            Tkinter Window Closing Automatically after one run
            Asked 2020-Sep-08 at 18:14

            So I am writing an interface that will do some steps Data Analysis automatically. The code involves multiple screens that will be used for different functions. The first one I wrote is the EDA (exploratory data analysis) screen. The code works but will generate the report and then just close the Tk window.

            In specific, when I generate the report, the code will generate the reports and then shut down. I haven't put the code generation in there because that doesn't use Tkinter. Also left out the other screens because they aren't problematic and I didn't want to dump the whole project here.



            Answered 2020-Aug-31 at 01:37

            I did experience some problems right after an initial, partial generation of the report, but the Tk window, in my case, did not shutdown.

            The error had to do with the following lines in your HTMLCreator file:

            Source https://stackoverflow.com/questions/63547876

            Community Discussions, Code Snippets contain sources that include Stack Exchange Network


            No vulnerabilities reported

            Install sweetviz

            Sweetviz currently supports Python 3.6+ and Pandas 0.25.3+. Reports are output using the base "os" module, so custom environments such as Google Colab which require custom file operations are not yet supported, although I am looking into a solution.


            Please see the "Installation issues & fixes" section at the top of this document. See section above regarding CJK characters support. If you find the need for additional character types, definitely post a request in the issue tracking system. Development is ongoing so absolutely feel free to report any issues and/or suggestions in the issue tracking system here or in our forum (you should be able to log in with your Github account!).
            Find more information at:

            Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items

            Find more libraries