hawq | Apache HAWQ is a Hadoop-native SQL query engine
kandi X-RAY | hawq Summary
Apache HAWQ is a Hadoop-native SQL query engine that combines the key technological advantages of an MPP database with the scalability and convenience of Hadoop. HAWQ reads data from and writes data to HDFS natively, delivers industry-leading performance and linear scalability, and gives users the tools to confidently and successfully interact with petabyte-range data sets. HAWQ provides users with a complete, standards-compliant SQL interface.
Community Discussions
Trending Discussions on hawq
QUESTION
I need to create a table2 from this table1; I am trying to update the table below:
...ANSWER
Answered 2018-Aug-15 at 19:28
You can try using max() as a window function, as in the sketch below.
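A minimal sketch of that approach, assuming hypothetical table1 columns (id, grp, val), since the original table definition is not shown:

-- Build table2 from table1, attaching to every row the maximum val in its group.
-- max() used as a window function keeps all rows instead of collapsing them.
CREATE TABLE table2 AS
SELECT id,
       grp,
       val,
       max(val) OVER (PARTITION BY grp) AS max_val
FROM table1;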
QUESTION
I want to do a POC on OpenTSDB. I have installed OpenTSDB as per the installation instructions, but I am having a tough time starting it. I am using an HDP environment with Kerberos enabled, and I am integrating OpenTSDB with a Kerberized HBase but facing the exception below. If anybody has integrated OpenTSDB with a Kerberized HBase, please guide me.
Exception:
...ANSWER
Answered 2018-Jun-22 at 10:26
You might find the following steps useful. Both databases connect to HBase from a Java client, although the Java client in OpenTSDB might be different.
QUESTION
I am trying to install Apache HAWQ on my node. I referenced the Apache HAWQ wiki page (https://cwiki.apache.org/confluence/display/HAWQ/Build+and+Install) and successfully built all the required dependencies, including Hadoop, boost, thrift, etc.
The next step is to install Apache HAWQ itself; the commands are below.
...ANSWER
Answered 2017-Feb-18 at 10:30
Maybe you can try installing thrift using yum.
QUESTION
We have a small Hadoop cluster with the Hadoop HDP distribution installed. Environment: VMs running CentOS 7.
We are facing a compatibility issue: HAWQ is not yet supported on CentOS 7. Constraint: we have already installed the Hadoop cluster on CentOS 7.
Any help on it would be much appreciated.
...ANSWER
Answered 2017-Feb-07 at 19:51
HAWQ is not, as of yet, supported on CentOS 7. It is in the backlog of items and should hopefully be done quickly, but if you're looking to test its capabilities in the near term, I suggest you reinstall with a release earlier than 7.
QUESTION
Nice to meet you all. I'm Anqing, a trainee working in China. I'm trying to connect Spark to HAWQ via a JDBC driver. I know there is a question that looks similar to mine, but I have not solved my issue. Can you help me figure out how to deal with it? Please explain in detail. Thanks.
Zheng Anqing
...ANSWER
Answered 2017-Nov-03 at 17:22
Assuming that you are trying to connect to the HAWQ database from Spark, you may use the Postgres 8.4 JDBC driver, since HAWQ is based on PostgreSQL; the connection URL typically follows the standard PostgreSQL form, e.g. jdbc:postgresql://<hawq-master-host>:5432/<database>.
QUESTION
I am using Apache HAWQ and trying to handle some data. I have one master node and two HAWQ slave (segment) nodes.
I created a table, inserted data, and verified the inserted data using PostgreSQL. I assumed the data would be mostly distributed across the slaves.
When executing the command below, multiple gp_segment_id values appeared, giving the impression that multiple slaves were being used.
...ANSWER
Answered 2017-Mar-11 at 16:44
Your table, retail_demo.order_lineitems_hawq, must be hash-distributed. When you do this in HAWQ, the number of buckets is determined by default_hash_table_bucket_number, which is set when the database is initialized. There will be one file in HDFS for each bucket, because hash-distributed tables use a fixed number of virtual segments, or vsegs.
You can specify the number of buckets in two ways. One is to do it when you create the table, as in the sketch below.
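A minimal sketch of the CREATE TABLE approach; only the table name retail_demo.order_lineitems_hawq comes from the answer above, while the columns and bucket count are hypothetical:

-- bucketnum fixes the number of hash buckets (and HDFS files) for this table,
-- overriding the cluster-wide default_hash_table_bucket_number setting.
CREATE TABLE retail_demo.order_lineitems_hawq (
    order_id bigint,
    item_id  bigint,
    quantity int
)
WITH (bucketnum = 8)
DISTRIBUTED BY (order_id);

-- Inspect how the inserted rows map onto virtual segments.
SELECT gp_segment_id, count(*)
FROM retail_demo.order_lineitems_hawq
GROUP BY gp_segment_id
ORDER BY gp_segment_id;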
QUESTION
I am using HAWQ to handle a column-based file. While reading the Pivotal documentation, I saw that it suggests using gpfdist to read and write readable external tables in order to process data quickly and in parallel.
I created a table as recommended in the documentation and confirmed my data with the SQL statement below.
...ANSWER
Answered 2017-Mar-09 at 16:03
An external table that uses gpfdist:
- Data lives in a POSIX filesystem, not HDFS
- No statistics are collected on it
- Files can sit on ETL nodes that aren't part of the cluster
- You can also have multiple files spread across many servers
This makes it the ideal way to load data in parallel into an internal table, as in the sketch below.
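A minimal load-pattern sketch under those assumptions; the gpfdist host, port, file pattern, and columns below are all illustrative:

-- Readable external table: rows are streamed by a gpfdist process on an ETL host.
CREATE EXTERNAL TABLE ext_sales (
    sale_id   bigint,
    sale_date date,
    amount    numeric
)
LOCATION ('gpfdist://etl-host:8081/sales_*.txt')
FORMAT 'TEXT' (DELIMITER '|');

-- Internal, HDFS-backed table that stores the data permanently.
CREATE TABLE sales (
    sale_id   bigint,
    sale_date date,
    amount    numeric
)
DISTRIBUTED RANDOMLY;

-- The segments pull rows from gpfdist in parallel, so the load itself is parallel.
INSERT INTO sales SELECT * FROM ext_sales;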
QUESTION
I would like to install Apache HAWQ on top of Hadoop.
Before installing HAWQ, I need to install Hadoop and configure all my nodes.
I have four nodes as listed below, and my question is as follows.
Should I install a Hadoop distribution on the hawq-master node?
ANSWER
Answered 2017-Feb-16 at 03:10
Honestly, there are no strict constraints on how Hadoop and HAWQ are installed, as long as they are configured correctly.
Regarding your concern, "I think the hawq-master should be built on top of hadoop, but there are no connection with hadoop-master": IMO, it should read "HAWQ should be built on top of Hadoop", and it is the hawq-master configuration file (hawq-site.xml), for example the hawq_dfs_url property pointing at the HDFS NameNode, that gives HAWQ its connection to Hadoop.
Usually, the HAWQ master and Hadoop master components can each run on a node of their own, but some of them can be co-located on one node to save machines. HDFS DataNodes and HAWQ segments, however, are usually installed together. Taking the workload of each machine into account, you could lay them out as below:
QUESTION
I'm not sure where else to ask this question, so I'll ask it here, as I think this might serve as a nice reference for future users who might have a similar question.
Are there any known production usages of Apache HAWQ (http://hawq.incubator.apache.org/)? I would like to compare this service with others such as Presto, Spark, Impala, etc. But I haven't come across any real-world usages of it other than nice-looking benchmarks. And finally, if you have used this personally, what have been your experiences with it?
...ANSWER
Answered 2017-Feb-01 at 02:50
Pivotal HDB (the commercial offering of HAWQ) is deployed at various clients. HAWQ is a true, 100% SQL-compliant engine with an MPP heritage. It is a unique product with a state-of-the-art query optimizer, dynamic partition elimination, and very robust HDFS data federation features covering HBase, Hive, JSON, ORC (beta), and the native Hadoop file system. HAWQ uses the Parquet storage format, so tables created in HAWQ can be used elsewhere in the Hadoop ecosystem. HAWQ can also collect statistics on external tables for faster data access, and it supports ACID transactions (insert). On top of all this, the most compelling feature is doing data science through language extensions right in SQL, with support for R, Python, Java, and Perl. I have seen implementations of HAWQ in the automotive, oil and gas, IoT, and healthcare industries. The typical use cases I have experienced are BI on top of Hadoop, data science model training and execution, and interactive SQL on structured data. Since HAWQ was born out of the Greenplum heritage, some of its features are hard to find in competing products. HAWQ perfectly complements the Hadoop ecosystem.
QUESTION
I am trying to back up a Pivotal HAWQ database using a shell script.
I am getting this error:
...ANSWER
Answered 2017-Jan-18 at 04:10
It looks like a PATH issue: the crontab environment cannot find the pg_dump binary. Try running the script with the absolute path of pg_dump (/usr/local/hawq/bin/pg_dump).
You can also source /usr/local/hawq/greenplum_path.sh before calling pg_dump.
Community Discussions and Code Snippets contain sources that include the Stack Exchange Network.
Vulnerabilities
No vulnerabilities reported