Here are
42 public repositories
matching this topic...
Production PostgreSQL for Kubernetes, from high availability Postgres clusters to full-scale database-as-a-service.
Highly available elephant herd: HA PostgreSQL cluster using Docker
Updated
Apr 22, 2022
Python
A distributed event bus that implements a RESTful API abstraction on top of Kafka-like queues
Updated
Apr 27, 2022
Java
A battle-tested, flexible & comprehensive monitoring solution for your PostgreSQL databases
Updated
Jun 25, 2020
Python
Python function to stream unzip all the files in a ZIP archive - without loading the entire ZIP file or any of its files into memory at once
Updated
Mar 30, 2022
Python
JSON schema parser for Apache Spark
Updated
Feb 10, 2021
Scala
Continuously sync folder to S3, using inotify under the hood
Updated
May 15, 2021
Python
The Data Engineering Book - หนังสือวิศวกรรมข้อมูล ของคนไทย เพื่อคนไทย
Spawns JupyterHub single user servers in Docker containers running in AWS Fargate
Updated
Jun 28, 2020
Python
A DNS proxy server that conditionally rewrites and filters A record requests
Updated
Oct 31, 2020
Python
Updated
Apr 29, 2020
Python
Allows users to access data and launch applications to analyse it
Updated
Apr 27, 2022
Python
Kanadi is a Nakadi client for Scala
Updated
Dec 30, 2021
Scala
Convert JSON to a set of tidy CSV files
Updated
Nov 3, 2020
Python
Jupyter Notebook Contents Manager for AWS S3
Updated
Mar 23, 2020
Python
A generic data pipeline which will map Elasticsearch documents to Bigquery table rows
Updated
Sep 29, 2019
Kotlin
Python utility function to communicate with a subprocess using iterables: for when data is too big to fit in memory and has to be streamed
Updated
Jun 21, 2021
Python
Collections of POC/dev data infrastructure. | #SE
Updated
Mar 31, 2022
Python
Lightweight API service for querying for companies that have expressed interest in exporting to specific countries
Updated
Apr 3, 2022
Python
A command line report on a GitHub organisation's repositories, ordered by priority, and including time-to-SLA for each severity level
Updated
Feb 24, 2022
Python
Stateless JWT authentication in front of PostgreSQL
Updated
Jan 13, 2021
Python
Join iterables in code without loading them all in memory: similar to a SQL left join
Updated
Dec 19, 2020
Python
Updated
Mar 30, 2022
Python
Information relating to topics on Data Engineering, Data Infrastructure, Data Storing, Data Warehouses and Business Analysis. For those interested in both conceptual theory and use case examples for database design and development.
The source for the Department for International Trade's Public Data API
Updated
Apr 24, 2022
HTML
Updated
Jun 24, 2021
Python
Legal Basis for Consent Service API Server
Updated
Apr 25, 2022
Python
Utility function to perform the server-side of Hawk authentication
Updated
Jun 13, 2020
Python
Improve this page
Add a description, image, and links to the
data-infrastructure
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
data-infrastructure
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.
Hi There, noticed there is an environment variable to inject "etcd_host" but there are none in the repo for usernames and password. Could create a PR for that to add them to the operator configmap ?
Could enable etcd3 as well since there are no env variables for v3 (supported by Patroni).
Let me know what you guys think.