TFX is an end-to-end platform for deploying production ML pipelines
Updated Mar 28, 2023 - Python
Google-provided Cloud Dataflow template pipelines for solving simple in-Cloud data tasks
[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
Yet Another UserAgent Analyzer
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSV files containing images or structured data.
Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
Apache Beam pipelines to make weather data accessible and useful.
Clojure API for a more dynamic Google Dataflow
Collection of transforms for the Apache Beam Python SDK.
Repository to quickly get you started with new Machine Learning projects on Google Cloud Platform. More info (slides):
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Asgarde simplifies error handling in Apache Beam Java, with less code that is more concise and expressive.
A collection of tools for extracting FHIR resources and analytics services on top of that data.
Some class materials for a data processing course using PySpark
Opinionated serverless event analytics pipeline
Microservices in Post-Kubernetes Era. A polyglot monorepo
Blockchain ETL Architecture