parquet-metadata | Dump metadata about a Parquet file
kandi X-RAY | parquet-metadata Summary
Dump metadata about a Parquet file.
Top functions reviewed by kandi - BETA
- Dump data to file
- Dump a single column
- Dump a row group
- Dump metadata to file
parquet-metadata Key Features
parquet-metadata Examples and Code Snippets
sudo pip install pyarrow parquet-metadata
parquet-metadata parquet.file
file created_by parquet-cpp version 1.4.1-SNAPSHOT
file columns 9
file row_groups 1
file rows 2
row_group 0 size 634
row_group 0 rows 2
row_group 0 columns 9
row_group 0 bool
Community Discussions
Trending Discussions on parquet-metadata
QUESTION
I use Apache Parquet to create Parquet tables with process information of a machine, and I need to store file-wide metadata (Machine ID and Machine Name).
It is stated that Parquet files are capable of storing file-wide metadata; however, I couldn't find anything in the documentation about it.
There is another Stack Overflow post that explains how it is done with pyarrow. As far as I understand that post, I need some kind of key-value pair (maybe a map) and have to add it to the schema somehow.
I found a class in the Parquet source code called parquet::FileMetaData that may be usable for this purpose; however, there is nothing in the docs about it.
Is it possible to store file-wide metadata with C++?
Currently I am using the stream_reader_writer example for writing Parquet files.
ANSWER
Answered 2021-Dec-03 at 15:05
You can pass the file-level metadata when calling parquet::ParquetFileWriter::Open (see the source code).
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install parquet-metadata
You can use parquet-metadata like any standard Python library. You need a development environment with a Python distribution that includes header files, a compiler, pip, and git. Make sure that pip, setuptools, and wheel are up to date. When using pip, it is generally recommended to install packages in a virtual environment to avoid changing the system installation.