1,167 contributions in the last year
Contribution activity
June 2021
Created 93 commits in 12 repositories
Created a pull request in vaexio/vaex that received 3 comments
✨ limit rows in groupby using early stop.
This can be useful if you only want to do a groupby when the result is small and otherwise don't care about the result. cc @nicolaskruchten See usage:
+70 −21 • 3 comments
Opened 15 other pull requests in 5 repositories
vaexio/vaex (1 open, 7 merged)
- 🐎 astype('str') can be faster dispatching to to_string
- ➖ we do not really use psutil, and it can be a heavy dependency
- 🐛 fingerprints using df.copy and projecting were not deterministic
- 🐎 sparse groupby passed over original dataframe, this is faster
- ✨ datetime comparison allows string (ISO 8601 format)
- 🐛 column names equal to function names in selections/filters
- ✨ allow caching of tasks outputs
- 🐎 perform groupby in a sparse way for lower memory usage and better performance (~250 times faster)