dataframe
Here are 535 public repositories matching this topic...
-
Updated
Nov 10, 2021 - Python
-
Updated
Oct 20, 2021 - Java
Describe your feature request
Support projection (reading only specific columns) when reading an IPC file in lazy mode:
Support for projection in lazy mode can be adapted from:
https://github.com/pola-rs/polars/blob/master/polars/polars-io/src/parquet.rs
Specifying colum
-
Updated
Nov 3, 2021 - Java
Describe the bug
Failed to execute Series.drop_duplicates.
In [75]: a = md.DataFrame(np.random.rand(10, 2), columns=['a', 'b'], chunk_size=2)
In [76]: a['a'].drop_duplicates().execute() -
Updated
Apr 20, 2021 - Rust
Indexes in DataFrame are not saved when exporting to csv using to_csv.
const df = new DataFrame(input, { index: ['A', 'B', 'C'] } );
df.to_csv(filename);alexhallam / tv
Example:
In the image below the word starships should begin on a new line to avoid being split.
Terminal width is provided to determine how many columns to print. The terminal width or the total width of the column headers may be used to wrap the text in the footer.
-
Updated
Nov 10, 2021 - C++
-
Updated
Jan 29, 2021 - C#
Hi ,
I am using some basic functions from pyjanitor such as - clean_names() , collapse_levels() in one of my code which I want to productionise.
And there are limitations on the size of the production code base.
Currently ,if I just look at the requirements.txt for just "pyjanitor" , its huge .
I don't think I require all the dependencies in my code.
How can I remove the unnecessary ones ?
-
Updated
Oct 25, 2021 - Go
Thanks for this library, it made my pandas workflow clean & easy to maintain.
Feature Request:
pd.merge is a common operation when dealing with 2 or more data frames & pipeline stage for this is missing.
Currently, I am implementing a custom pipeline stage pdp.Merge from PdPipelineStage & would like to contribute.
-
Updated
Jan 6, 2019 - Python
-
Updated
Jun 4, 2021 - Python
-
Updated
Aug 15, 2021 - Python
-
Updated
Aug 1, 2021 - JavaScript
-
Updated
Nov 8, 2021 - Clojure
Improve this page
Add a description, image, and links to the dataframe topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the dataframe topic, visit your repo's landing page and select "manage topics."

Hello,
Considering your amazing efficiency on pandas, numpy, and more, it would seem to make sense for your module to work with even bigger data, such as Audio (for example .mp3 and .wav). This is something that would help a lot considering the nature audio (ie. where one of the lowest and most common sampling rates is still 44,100 samples/sec). For a use case, I would consider vaex.open('Hu