@code-kern-ai

Kern AI

Building data-centric open-source tools for NLP

Hi there 👋

We are Kern AI, a team of ambitious data engineers and scientists aiming to make your life as a developer a bit easier. Our libraries and tools are built to improve the data-centric AI lifecycle.

We mainly maintain and publish refinery, the data scientist's open-source choice to scale, assess and maintain natural language data. We also maintain bricks, a collection of open-source modular NLP enrichments.

🪢 Community and contact

Feel free to join our community spaces, where we discuss recent findings in data-centric AI.

We send out a (mostly) weekly newsletter about recent findings in data-centric AI, product highlights in development and more. You can subscribe to the newsletter here.

You can also follow us on Twitter and LinkedIn.

GitHub Discussions | Discord | Twitter | LinkedIn | YouTube | Kern AI Docs | Website

Pinned

  1. refinery Public

    The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

    Python · 1.1k stars · 43 forks

  2. bricks Public

    Open-source natural language enrichments at your fingertips.

    Python · 337 stars · 8 forks

  3. Official Python SDK for Kern AI refinery.

    Python · 16 stars · 3 forks

  4. Contains example projects you can use to test refinery. Select the use case from the branches.

    16 stars · 4 forks

  5. CLI-based tool that automatically builds ML models from training data into a servable Docker container.

    Python · 48 stars · 4 forks

Repositories

  • bricks Public

    Open-source natural language enrichments at your fingertips.

    Python · 337 stars · Apache-2.0 · 8 forks · 38 open issues (15 issues need help) · 2 pull requests · Updated Mar 3, 2023
  • refinery-ui Public

    UI for refinery. Used to interact with the whole system; to find out how to best work with the UI, check out our docs.

    TypeScript · 3 stars · Apache-2.0 · 2 forks · 0 open issues · 0 pull requests · Updated Mar 3, 2023
  • refinery-updater Public

    Updater for refinery. Manages migration logic to new versions if required.

    Python · 0 stars · Apache-2.0 · 1 fork · 0 open issues · 0 pull requests · Updated Mar 3, 2023
  • refinery-config Public

    Configuration of refinery. Manages, among other things, endpoints and the available spaCy language models.

    Python · 1 star · Apache-2.0 · 1 fork · 1 open issue · 0 pull requests · Updated Mar 3, 2023
  • refinery-gateway Public

    Gateway for refinery. Manages incoming requests and holds the workflow logic. To interact with the gateway, the UI or Python SDK can be used.

    Python · 0 stars · Apache-2.0 · 3 forks · 2 open issues · 1 pull request · Updated Mar 3, 2023
  • refinery-submodule-model Public

    Data model for refinery. Manages entities and their access for multiple services, e.g. the gateway.

    Python · 2 stars · Apache-2.0 · 1 fork · 0 open issues · 1 pull request · Updated Mar 3, 2023
  • refinery-zero-shot Public

    Zero-shot module for refinery. Enables the integration of 🤗 Hugging Face zero-shot classifiers as an off-the-shelf no-code heuristic.

    Python · 0 stars · Apache-2.0 · 1 fork · 2 open issues · 0 pull requests · Updated Feb 28, 2023
  • refinery-ml-exec-env Public

    Execution environment for the active learning module in refinery. A containerized function-as-a-service for building active learning models with scikit-learn and sequence-learn.

    Python · 0 stars · Apache-2.0 · 1 fork · 1 open issue · 0 pull requests · Updated Feb 28, 2023
  • refinery-embedder Public

    Embedder for refinery. Manages the creation of document- and token-level embeddings using the embedders library.

    Python · 2 stars · Apache-2.0 · 1 fork · 0 open issues · 0 pull requests · Updated Feb 28, 2023
  • refinery-torch-cpu-parent-image Public

    Defines the parent image for the Docker images of the refinery services that require torch (CPU).

    Shell · 0 stars · Apache-2.0 · 0 forks · 0 open issues · 0 pull requests · Updated Feb 28, 2023