
impyla | Python DB API 2.0 client for Impala and Hive

 by   cloudera Python Version: Current License: Apache-2.0

kandi X-RAY | impyla Summary

impyla is a Python library typically used in Big Data applications. It has no reported bugs or vulnerabilities, a build file, a permissive license, and high support. You can install it with 'pip install impyla' or download it from GitHub or PyPI.
Python client for HiveServer2 implementations (e.g., Impala, Hive) for distributed query engines. For higher-level Impala functionality, including a Pandas-like interface over distributed data sets, see the Ibis project.

Support

  • impyla has a highly active ecosystem.
  • It has 657 stars and 224 forks. There are 56 watchers for this library.
  • It had no major release in the last 12 months.
  • There are 160 open issues and 151 have been closed. On average issues are closed in 111 days. There are 7 open pull requests and 0 closed requests.
  • It has a negative sentiment in the developer community.
  • The latest version of impyla is current.

Quality

  • impyla has 0 bugs and 0 code smells.

Security

  • impyla has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
  • impyla code analysis shows 0 unresolved vulnerabilities.
  • There are 0 security hotspots that need review.

License

  • impyla is licensed under the Apache-2.0 License. This license is Permissive.
  • Permissive licenses have the least restrictions, and you can use them in most projects.

Reuse

  • impyla releases are not available. You will need to build from source code and install.
  • Deployable package is available in PyPI.
  • Build file is available. You can build the component from source.
  • Installation instructions, examples and code snippets are available.
  • impyla saves you 56525 person hours of effort in developing the same functionality from scratch.
  • It has 63010 lines of code, 6187 functions and 74 files.
  • It has high code complexity. Code complexity directly impacts maintainability of the code.
Top functions reviewed by kandi - BETA

kandi has reviewed impyla and identified the functions below as its top functions. This is intended to give you instant insight into the functionality impyla implements and help you decide whether it suits your requirements.

  • Return a dict of the cmdclass to use.
  • Connect to Impala.
  • Build a summary table.
  • Get http transport.
  • Return a list of git pieces from the git repository.
  • Initialize the connection.
  • Create the versioneer config file.
  • Merge a PR number.
  • Return a connection to the Impala server.
  • Execute a Thrift request.

impyla Key Features

HiveServer2 compliant; works with Impala and Hive, including nested data

Fully DB API 2.0 (PEP 249)-compliant Python client (similar to sqlite or MySQL clients) supporting Python 2.6+ and Python 3.3+.

Works with Kerberos, LDAP, SSL

SQLAlchemy connector

Converter to pandas DataFrame, allowing easy integration into the Python data stack (including scikit-learn and matplotlib); but see the Ibis project for a richer experience
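
Because the client follows DB API 2.0, query code can be written against the generic connection/cursor interface. The sketch below is illustrative only: `fetch_rows` is a hypothetical helper, and an in-memory sqlite3 database stands in for a cluster so the example runs anywhere. With impyla the connection would instead come from `impala.dbapi.connect`, as noted in the comment (the host name there is a placeholder).

```python
import sqlite3


def fetch_rows(conn, sql):
    """Run a query on any PEP 249 connection and return all rows."""
    cursor = conn.cursor()
    try:
        cursor.execute(sql)
        return cursor.fetchall()
    finally:
        cursor.close()


# Stand-in connection so the sketch runs without a cluster.
# With impyla this would instead be, for example:
#   from impala.dbapi import connect
#   conn = connect(host="my.impala.host", port=21050)  # placeholder host
conn = sqlite3.connect(":memory:")
setup = conn.cursor()
setup.execute("CREATE TABLE t (id INTEGER, name TEXT)")
setup.executemany("INSERT INTO t VALUES (?, ?)", [(1, "a"), (2, "b")])
conn.commit()
setup.close()

rows = fetch_rows(conn, "SELECT id, name FROM t ORDER BY id")
print(rows)
```

The helper only uses `cursor()`, `execute()`, `fetchall()` and `close()`, which PEP 249 requires of every compliant driver, so the same function should work unchanged with an impyla connection.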

Community Discussions

Trending Discussions on impyla
  • Error while running query on Impala with Superset
  • Ibis create impala table with pandas dataframe and get [Error 61] Connection refused
QUESTION

Error while running query on Impala with Superset

Asked 2020-May-14 at 16:56

I'm trying to connect Impala to Superset. When I test the connection it prints "Seems OK!", and when I browse the Impala databases with the SQL Editor on the left side, it shows all databases without problems.

Preview of Databases/Tables

But when I write a query and click on "Run Query", it gives the error: "Could not start SASL: b'Error in sasl_client_start (-1) SASL(-1): generic failure: GSSAPI Error: Unspecified GSS failure. Minor code may provide more information (Ticket expired)'"

Error running query

I'm running Superset with SSL in production mode (with Gunicorn) and Impala with SSL in a Kerberized Hadoop cluster. My Impala database config is:

Impala Config

And in the extras I put:

{
    "metadata_params": {},
    "engine_params": {
        "connect_args": {
            "port": 21050,
            "use_ssl": "True",
            "ca_cert": "path/to/my/ca_cert.pem",
            "auth_mechanism": "GSSAPI"
        }
    },
    "metadata_cache_timeout": {},
    "schemas_allowed_for_csv_upload": []
}
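
As a side note on the extras above: assuming Superset hands `engine_params` to SQLAlchemy and SQLAlchemy forwards `connect_args` as keyword arguments to the driver's `connect()` call, the sketch below shows what the driver would receive. `connect_kwargs` is a hypothetical helper, not part of Superset or impyla. It also normalizes `"use_ssl": "True"`, which is a JSON string rather than a boolean; a non-empty string is truthy in Python, so it probably behaves as intended, but a real boolean is less ambiguous.

```python
import json

# The relevant part of the extras from the question.
EXTRA = """
{
    "metadata_params": {},
    "engine_params": {
        "connect_args": {
            "port": 21050,
            "use_ssl": "True",
            "ca_cert": "path/to/my/ca_cert.pem",
            "auth_mechanism": "GSSAPI"
        }
    }
}
"""


def connect_kwargs(extra_json):
    """Extract connect_args and turn a string-valued use_ssl into a bool."""
    args = dict(json.loads(extra_json)["engine_params"]["connect_args"])
    value = args.get("use_ssl")
    if isinstance(value, str):
        # "True"/"true" become True; any other string becomes False.
        args["use_ssl"] = value.strip().lower() == "true"
    return args


kwargs = connect_kwargs(EXTRA)
print(kwargs)
```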

How can I solve this error? In my superset log it only shows:

Triggering query_id: 65
INFO:superset.views.core:Triggering query_id: 65
Query 65: Running query on a Celery worker
INFO:superset.views.core:Query 65: Running query on a Celery worker

Versions: Superset 0.36.0, Impyla 0.16.2

ANSWER

Answered 2020-May-14 at 16:56

I was able to fix this error with the following steps:

1 - Created a service user for the celery-worker, obtained a Kerberos ticket for that user, and set up a crontab entry to renew the ticket.

2 - Ran the celery worker as this service user instead of as root.

3 - Killed a celery-worker that was running on another machine in my cluster.

4 - Restarted Impala and Superset.

I think this error occurred because some queries, instead of using the celery worker on my Superset machine, were using the celery worker on another machine that had no valid Kerberos ticket. I found this out by reading the celery-worker log, which showed that a connection to the celery worker on the other machine had failed while a query was running.

Source https://stackoverflow.com/questions/61639889
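
The ticket handling described in the answer can be sketched as follows. This is an illustration, not the answerer's actual setup: it assumes the MIT Kerberos `klist` and `kinit` tools are on the PATH, and the helper names, principal and keytab path are hypothetical. `klist -s` produces no output and exits with status 0 only when the credentials cache is readable and unexpired.

```python
import subprocess


def has_valid_ticket(run=subprocess.run):
    """Return True if `klist -s` reports a readable, unexpired ticket cache."""
    try:
        result = run(["klist", "-s"], check=False)
    except FileNotFoundError:
        # klist is not installed on this machine.
        return False
    return result.returncode == 0


def renew_command(principal, keytab):
    """Build the kinit call a cron job could run to obtain a fresh ticket."""
    return ["kinit", "-kt", keytab, principal]


if __name__ == "__main__":
    # Placeholder principal and keytab path; substitute the service user's own.
    cmd = renew_command("celery@EXAMPLE.COM", "/path/to/celery.keytab")
    if not has_valid_ticket():
        print("No valid ticket; a renewal job would run:", " ".join(cmd))
```

Running such a check as the same user that runs the celery worker matters, because the credentials cache is per user; a valid ticket for root says nothing about the worker's account.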

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install impyla

Install the latest release with pip: 'pip install impyla'.
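
To confirm the install, a small check like the one below can help. `installed_version` is a hypothetical helper built on the standard library's `importlib.metadata` (Python 3.8+). Note that the distribution is named `impyla` while the importable package is `impala`.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# Prints a version string if impyla is installed, otherwise None.
print(installed_version("impyla"))
```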

Support

You need to first sign and return an ICLA and CCLA before we can accept and redistribute your contribution. Once these are submitted you are free to start contributing to impyla. Submit these to CLA@cloudera.com.


  • © 2022 Open Weaver Inc.