Highlights

  • Arctic Code Vault Contributor
  • Pro

Organizations

@apache @tokio-rs
andygrove/README.md

Hi there 👋

I'm Andy Grove, a software engineer specializing in distributed systems and query engines. I am the author of the book How Query Engines Work.

Work

I work for NVIDIA, where I contribute to the RAPIDS Accelerator for Apache Spark, an open source plugin that provides GPU acceleration for ETL workloads by leveraging the RAPIDS cuDF GPU DataFrame library.

Open Source

  • I am a PMC member of Apache Arrow, where I donated the initial Rust implementation and later donated the DataFusion query engine.
  • I believe that Rust and Apache Arrow together provide a great foundation for distributed systems, and I created the Ballista distributed compute platform to demonstrate the possibilities.

Hobbies

I occasionally work on hobby projects involving digital electronics, microcontrollers, woodworking, and similar skills. Some of my projects have been featured on popular "maker" websites.

Pinned

  1. Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for effic…

    C++ · 7.2k stars · 1.8k forks

  2. Distributed compute platform implemented in Rust, and powered by Apache Arrow.

    Rust · 2k stars · 122 forks

  3. Spark RAPIDS plugin - accelerate Apache Spark with GPUs

    Scala · 261 stars · 69 forks

  4. Extensible SQL Lexer and Parser for Rust

    Rust · 592 stars · 114 forks

  5. cuDF - GPU DataFrame Library

    C++ · 3.7k stars · 489 forks

  6. Rust DataBase Connectivity (RDBC) :: Common Rust API for database drivers

    Rust · 321 stars · 16 forks

2,434 contributions in the last year

Activity overview
Contributed to ballista-compute/ballista, NVIDIA/spark-rapids, apache/arrow and 5 other repositories

Contribution activity

February 2021

Created 1 repository

Created a pull request in NVIDIA/spark-rapids that received 10 comments

Detect task failures in benchmarks

This is a breaking change to the JSON reporting of query times. Previously, the queryTime field would contain -1 if the query failed and >= 0 if th…

+70 −60 10 comments
Opened 68 other pull requests in 5 repositories
ballista-compute/ballista
3 open 51 merged 4 closed
NVIDIA/spark-rapids
4 merged
apache/arrow
4 closed
discord/discord-open-source
1 closed
apache/arrow-site
1 closed
Reviewed 45 pull requests in 6 repositories
ballista-compute/ballista 30 pull requests
apache/arrow 5 pull requests
NVIDIA/spark-rapids 4 pull requests
ballista-compute/sqlparser-rs 4 pull requests
andygrove/ballista 1 pull request
apache/arrow-site 1 pull request

Created an issue in NVIDIA/spark-rapids that received 7 comments

[BUG] Some TPC-DS queries fail with AQE when decimal types enabled

Describe the bug I just tried running q4 and q11 with AQE and decimal types enabled, and they fail with: Caused by: java.lang.IllegalStateException…

7 comments
Opened 53 other issues in 2 repositories
ballista-compute/ballista
27 closed 22 open
NVIDIA/spark-rapids
2 open 2 closed
