pyexcel | Single API for reading , manipulating and writing data | CSV Processing library

by pyexcel Python Version: 0.7.0 License: Non-SPDX

X-Ray Key Features Code Snippets(10)Community Discussions(9)Vulnerabilities Install Support

kandi X-RAY | pyexcel Summary

pyexcel is a Python library typically used in Utilities, CSV Processing applications. pyexcel has no bugs, it has no vulnerabilities, it has build file available and it has high support. However pyexcel has a Non-SPDX License. You can install using 'pip install pyexcel' or download it from GitHub, PyPI.

Single API for reading, manipulating and writing data in csv, ods, xls, xlsx and xlsm files

Support

Quality

Security

License

Reuse

Support

pyexcel has a highly active ecosystem.

It has 1121 star(s) with 162 fork(s). There are 46 watchers for this library.

It had no major release in the last 12 months.

There are 16 open issues and 194 have been closed. On average issues are closed in 53 days. There are 1 open pull requests and 0 closed requests.

It has a positive sentiment in the developer community.

The latest version of pyexcel is 0.7.0

Quality

pyexcel has 0 bugs and 0 code smells.

Security

pyexcel has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

pyexcel code analysis shows 0 unresolved vulnerabilities.

There are 0 security hotspots that need review.

License

pyexcel has a Non-SPDX License.

Non-SPDX licenses can be open source with a non SPDX compliant license, or non open source licenses, and you need to review them closely before use.

Reuse

pyexcel releases are available to install and integrate.

Deployable package is available in PyPI.

Build file is available. You can build the component from source.

pyexcel saves you 4546 person hours of effort in developing the same functionality from scratch.

It has 9637 lines of code, 1093 functions and 117 files.

It has medium code complexity. Code complexity directly impacts maintainability of the code.

Top functions reviewed by kandi - BETA

kandi has reviewed pyexcel and discovered the below as its top functions. This is intended to give you an instant insight into pyexcel implemented functionality, and help decide if they suit your requirements.

Select columns
Convert names to indices
Append an item
Return the range of columns
Select rows from the table
Group rows by column index
Iterate over the rows
Returns the list at the given index
Set column names for a row
Render a list of models to stream
Extend self columns
Save the table to a database
Apply a function to each cell
Render a table to stream
Returns a generator of all rows in the grid
Update the columns of the given infiles
Returns all the rows in the table
Returns an iterator over the rows
Returns an iterator of the columns
Return an iterator over the cells
Vertices of the array
Runs the build process
Save worksheet to database
Decorator for importer
Updates row_dict with row_dicts
Return a DataFrame reader for column series

Get all kandi verified functions for this library.

pyexcel Key Features

No Key Features are available at this moment for pyexcel.

pyexcel Examples and Code Snippets

How to update an existing excel .ods file?

Python

Lines of Code : 4

License : Strong Copyleft (CC BY-SA 4.0)

Copy

from pyexcel_ods3 import get_data

data = get_data("your_file.ods")

Pyexcel random.choice() not quite working how I need it to

Python

Lines of Code : 30

License : Strong Copyleft (CC BY-SA 4.0)

Copy

def another_one(sequence):
    item = random.choice(sequence)
    if item[0] == fighter_one:
        return another_one(sequence)
    elif item[1] < 20:
        return another_one(sequence)
    else:
        new_guy = item
        retur

Can't find file - FileNotFoundError: [Errno 2] No such file or directory

Python

Lines of Code : 7

License : Strong Copyleft (CC BY-SA 4.0)

Copy

for root, dirs, files in os.walk(rootdir):
    for name in files:
        if name.endswith('.xls'):
            fname = os.path.join(root, name)
            new_file = p.save_book_as(file_name=fname, dest_file_name=os.path.join(root, fname

How to write the data to an excel using python

Python

Lines of Code : 16

License : Strong Copyleft (CC BY-SA 4.0)

Copy

df = pd.DataFrame(my_dict.items(), columns=['Summary','Count']).assign(my_list=my_list)

df = pd.DataFrame(my_dict.items(), columns=['Summary','Count'])  # Your existing statement unchanged

df['my_list'] = my_list

How can i export the header together with the .csv

Python

Lines of Code : 2

License : Strong Copyleft (CC BY-SA 4.0)

Copy

df.to_excel(f'{nome}.xlsx')

How to get a sheet name in an ODS file?

Python

Lines of Code : 14

License : Strong Copyleft (CC BY-SA 4.0)

Copy

pip install pyexcel pyexcel-ods

from pyexcel import get_book

sheet = get_book(file_name="file_example_ODS_10.ods")
print(sheet)

from pyexcel import get_book

sheet = get_book(file_name="fi

How to add two decimal zeroes while exporting a dataframe as an excel and add percentage on only values not dash

Python

Lines of Code : 13

License : Strong Copyleft (CC BY-SA 4.0)

Copy

def to_percent_format(p):
    if str(p).strip() != "-":
        return "{:.2%}".format(p/100)
    else:
        return p.strip()

>>> to_percent_format(3)
'3.00%'
>>> to_percent_format("-")
'-'

Run .py from oracle (plsql)PythonLines of Code : 19License : Strong Copyleft (CC BY-SA 4.0)

Copy

begin
dbms_scheduler.create_program
(
program_name => 'PYEXCEL',
program_type => 'EXECUTABLE',
program_action => '/the_path/the_py_script_wrapper.ks',
enabled => TRUE,
comments => 'Call Python stuff'
);
end;
/

Pyexcel, how to save into columns and how to disable to auto sort after saving?

Python

Lines of Code : 11

License : Strong Copyleft (CC BY-SA 4.0)

Copy

keys = []
values = []
for key, value in s_words.items():
    keys.append(key)
    values.append(value)

o_words = OrderedDict({'words': keys, 'frequency': values})

sheet = pe.get_sheet(adict

Pyexcel, how to save a dict to csv file with pyexcel?

Python

Lines of Code : 2

License : Strong Copyleft (CC BY-SA 4.0)

Copy

pe.save_as(adict = myDict, dest_file_name = "dest_file.xls")

Community Discussions

Trending Discussions on pyexcel

How to update an existing excel .ods file?

How to suppress PyExcel Empty Row Warning during file read to SQLAlclemy model

How to merge multiple .xls files with hyperlinks in python?

django-excel pyexcel show unknown parameters for django heroku

Solve python ValueError: max_workers must be <= 61 when running pre-commit?

Can't find file - FileNotFoundError: [Errno 2] No such file or directory

How can i export the header together with the .csv

Usage of LSTM/GRU and Flatten throws dimensional incompatibility error

Writing pandas data to Excel with efficient memory usage

QUESTION

How to update an existing excel .ods file?

Asked 2022-Mar-09 at 15:54

Once I have read data from a sheet in a .xls file. I want to put that data in a sheet in an existing excel file with .ods extension without changing anything else in the excel file. I tried openpyxl but it doesn't support .ods files so I tried using pyexcel-ods3 but I am still not sure how to use pyexcel-ods3 to update an existing .ods file.

This is the code I tried but its for writing a new .ods excel file and not for updating an existing one

...

ANSWER

Answered 2022-Mar-09 at 15:25

You aren't loading the original file in order for it to be changed.

The library doesn't detect that you are looking to change an existing file, if you save a file with the name of a file that exists it will overwrite the existing file.

In order to amend a file, you need to load it, potentially recreate the structure in memory, and make you changes, and then save it back to the filesystem.

The relevant piece of code from the link you posted is:

Source https://stackoverflow.com/questions/71411614

QUESTION

How to suppress PyExcel Empty Row Warning during file read to SQLAlclemy model

Asked 2022-Feb-06 at 15:39

I am saving the contents of an excel file to a SQLAlchemy DB table using pyexcel as following: pyexcel.save_as(file_name='my_excel_file.xlsx',
name_columns_by_row=0,
dest_session=db.session,
dest_table=models.MyModel)

This is working well and saving the data but it's leaving several log messages that One empty row is found in the terminal. How do I suppress these messages to avoid polluting the logs? I know about the empty rows and they have no impact to the data loading

...

ANSWER

Answered 2022-Feb-06 at 15:39

So turns out pyexcel does a bit more on abstractions, the heavy lifting is done by pyexcel-io which does the basic input and output. With this understanding, in the pyexcel docs there is a parameter skip_empty_rows=True for when you reading from a source file which if you pass to the pyexcel.save_as function and this silences the logged messages, so it was exactly what I needed in this use-case. See PyExcel Docs Ref for pyexcel.save_as

Source https://stackoverflow.com/questions/70842859

QUESTION

How to merge multiple .xls files with hyperlinks in python?

Asked 2021-Dec-29 at 08:37

I am trying to merge multiple .xls files that have many columns, but 1 column with hyperlinks. I try to do this with Python but keep running into unsolvable errors.

Just to be concise, the hyperlinks are hidden under a text section. The following ctrl-click hyperlink is an example of what I encounter in the .xls files: ES2866911 (T3).

In order to improve reproducibility, I have added .xls1 and .xls2 samples below.

xls1:

Title Publication_Number P_A ES2866911 (T3) P_B EP3887362 (A1)

.xls2:

Title Publication_Number P_C AR118706 (A2) P_D ES2867600 (T3)

Desired outcome:

Title Publication_Number P_A ES2866911 (T3) P_B EP3887362 (A1) P_C AR118706 (A2) P_D ES2867600 (T3)

I am unable to get .xls file into Python without losing formatting or losing hyperlinks. In addition I am unable to convert .xls files to .xlsx. I have no possibility to acquire the .xls files in .xlsx format. Below I briefly summarize what I have tried:

1.) Reading with pandas was my first attempt. Easy to do, but all hyperlinks are lost in PD, furthermore all formatting from original file is lost.

2.) Reading .xls files with openpyxl.load

...

ANSWER

Answered 2021-Dec-24 at 15:29

Without a clear reproducible example, the problem is not clear. Assume I have two files called tmp.xls and tmp2.xls containing dummy data as in the two screenshots below.

Then pandas can easily, load, concatenate, and convert to .xlsx format without loss of hyperlinks. Here is some demo code and the resulting file:

Source https://stackoverflow.com/questions/70472885

QUESTION

django-excel pyexcel show unknown parameters for django heroku

Asked 2021-Oct-31 at 02:27

My django app deployed in heroku managed to show upload file form. However once I try uploading Excel xlsx file, it shows

...

ANSWER

Answered 2021-Oct-31 at 02:27

I recommend hosting your django app in pythonanywhere.com

With a little bit of search I found that people is having problem with the library you are using for excel files when deploying in heroku, maybe heroku can't support pyexcel.

Here is a tutorial of how to deploy your app in pythonanywhere https://www.youtube.com/watch?v=Y4c4ickks2A

Source https://stackoverflow.com/questions/69783420

QUESTION

Solve python ValueError: max_workers must be <= 61 when running pre-commit?

Asked 2021-Jun-28 at 02:39

I am using Django to develop an ERP and I want to use pre-commit with my project.

I have installed pre-commit, black, flake8, flake8-black. and this is my

...

ANSWER

Answered 2021-Jun-27 at 07:04

This is a known issue with cpython on windows. The error occurs when black tries to run multiple workers on >60 core machines because the default number of process workers given by os.cpu_count() breaks some other windows limit (number of waiting processes? I'm not quite sure). Black >=19.10b0 has a fix for this, so try updating the version of black in your pre-commit config if you can?

Python bug report: https://bugs.python.org/issue26903
Fix applied by this PR in black: https://github.com/psf/black/pull/838

Source https://stackoverflow.com/questions/68148458

QUESTION

Can't find file - FileNotFoundError: [Errno 2] No such file or directory

Asked 2021-Jun-09 at 11:51

my code looks like this:

...

ANSWER

Answered 2021-Jun-09 at 11:51

Try below code. I guess the problem is that you try to delete files in sub_directory that are different from your Python root.

Source https://stackoverflow.com/questions/67903355

QUESTION

How can i export the header together with the .csv

Asked 2021-Mar-26 at 11:45

i wrote a python code to automate a conversion, firstly i get a .pdf, then i covert o a .csv, get the tables i want, and then i convert it to .xlsx. Does anyone know if i can export the headers i have in .csv to the .xlsx file?

here's the .csv file: .csv table

here's the .xlsx export: .xlsx table

I'm adding the headers using pandas:

...

ANSWER

Answered 2021-Mar-26 at 11:45

Instead of using the saved csv try using the df you already created.

Source https://stackoverflow.com/questions/66816282

QUESTION

Usage of LSTM/GRU and Flatten throws dimensional incompatibility error

Asked 2020-Sep-15 at 20:26

I want to make use of a promising NN I found at towardsdatascience for my case study.

The data shapes I have are:

...

ANSWER

Answered 2020-Aug-17 at 18:14

I cannot reproduce your error, check if the following code works for you:

Source https://stackoverflow.com/questions/63455257

QUESTION

Writing pandas data to Excel with efficient memory usage

Asked 2020-Jul-21 at 01:14

I am successfully writing dataframes to Excel using df.to_excel(). Unfortunately, this is slow and consumes gobs of memory. The larger the dataset, the more memory it consumes, until (with the largest datasets I need to deal with) the server starves for resources.

I found that using the df.to_csv() method instead offers the convenient chunksize=nnnn argument. This is far faster and consumes almost no extra memory. Wonderful! I'll just write initially to .csv, then convert the .csv to .xlsx in another process. I tried doing it with openpyxl, but every solution I found iterated through the csv data one row at a time, appending to a Workbook sheet, e.g.

...

ANSWER

Answered 2020-Jul-18 at 01:06

Probably you could use the library pyexcelerate - https://github.com/kz26/PyExcelerate. They have posted the benchmarks on their github repo

Source https://stackoverflow.com/questions/62963532

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install pyexcel

You can install using 'pip install pyexcel' or download it from GitHub, PyPI.
You can use pyexcel like any standard Python library. You will need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, and git installed. Make sure that your pip, setuptools, and wheel are up to date. When using pip it is generally recommended to install packages in a virtual environment to avoid changes to the system.

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: