data-engineering

Here are 973 public repositories matching this topic...

superset
marvin-robot commented Sep 2, 2021

Opened from the Prefect Public Slack Community

michael.ball: Hey there. I’ve been playing around with Docker storage today, trying to get all source code packaged together with the flows each time they are registered, and am using the files and env_vars attributes as outlined in the Docs. But it seems that my .dockerignore file (in the directory from whic

Aylr commented Dec 28, 2020

Describe the bug
Data Docs columns shrink to a width of one character when the batch comes from a long query.

To Reproduce
Steps to reproduce the behavior:

  1. make a batch from a long query string
  2. run validation
  3. render result to data docs
  4. See screenshot ("Data documentation compiled by Great Expectations").
cgardens commented Oct 23, 2021

Tell us about the problem you're trying to solve

We have historically used apache.commons.cli. Because of heavy use, we have started to add some light utility functions around it. Instead of entrenching ourselves in this tool, we should investigate using a more modern one. picocli comes highly recommended by @sherifnada. In addition, its support for argument groups ([docs](https://picocli.i

amommendes commented Oct 12, 2021

Expected Behavior

Feature views should have a creation time (i.e., created_timestamp) set at the first feast apply.

Current Behavior

Feature views do not have a creation time when they are first created.

Steps to reproduce

feast init fs
cd fs
feast apply
feast registry-dump
{
  "spec": {
    "name": "driver_id",
    "valueType": "INT64",
    "description": "driver 
lakeFS
johnnyaug commented Oct 12, 2021

Steps to reproduce:

  1. From the UI, create a repository.
  2. Upload a file.
  3. From the uncommitted tab, commit the change.
  4. From the Objects tab, click the "branch: main" drop down.
  5. Click the arrow on the right.
  6. Select the first commit with the "Repository created" message.

Result: the "get started" screen appears.
Expected: screen should be empty, because this is a past commit.

A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.

  • Updated Oct 24, 2021
anks7190 commented Jan 27, 2021

Hi,

I am using some basic pyjanitor functions, such as clean_names() and collapse_levels(), in code that I want to productionise.
There are limits on the size of the production code base.
Currently, if I look at the requirements.txt for just "pyjanitor", it is huge.
I don't think I need all of those dependencies in my code.
How can I remove the unnecessary ones?
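
One possible direction, sketched here under loud assumptions: if only simple column-name cleanup is needed, small standard-library helpers can stand in for the two calls mentioned above. The names `simple_clean_names` and `simple_collapse_levels` are hypothetical and are not pyjanitor's API; the normalisation rules (lowercase, underscores, ASCII only) are an assumed approximation, not a copy of pyjanitor's behaviour.

```python
import re
import unicodedata


def simple_clean_names(columns):
    """Return lowercase snake_case versions of the given column labels.

    Hypothetical stand-in for illustration; not pyjanitor's implementation.
    """
    cleaned = []
    for label in columns:
        # Decompose accented characters, then drop anything non-ASCII.
        text = unicodedata.normalize("NFKD", str(label))
        text = text.encode("ascii", "ignore").decode("ascii")
        # Collapse each run of non-alphanumeric characters to one underscore.
        text = re.sub(r"[^0-9a-zA-Z]+", "_", text)
        cleaned.append(text.strip("_").lower())
    return cleaned


def simple_collapse_levels(columns, sep="_"):
    """Join tuple (multi-level) labels into single strings, skipping empty parts.

    Hypothetical stand-in for illustration; not pyjanitor's implementation.
    """
    flat = []
    for label in columns:
        parts = label if isinstance(label, tuple) else (label,)
        flat.append(sep.join(str(p) for p in parts if str(p) != ""))
    return flat
```

With pandas already installed, these could be applied as `df.columns = simple_clean_names(df.columns)`. Whether the results match pyjanitor closely enough for a given data set would need to be checked against the real functions.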

edublancas commented Oct 26, 2021

Some of our code snippets display as a terminal (i.e., the line starts with a $), or an IPython prompt (i.e., line starts with In []:). See this: https://ploomber.readthedocs.io/en/latest/user-guide/scaffold.html

The corresponding CSS is here: https://github.com/ploomber/ploomber/blob/fd9b4c7a2e787c0206f841928d1be90ac142c7a8/doc/_static/css/custom-theme.css#L83

However, this only works

planetf1 commented Apr 27, 2021

When using Ubuntu out of the box ('ootb'), both natively and within Windows WSL2, the asset consumer FVT tends to fail with:

[INFO] --- maven-compiler-plugin:3.8.1:compile (default-compile) @ asset-consumer-fvt ---
[INFO] Changes detected - recompiling the module!
[INFO] Compiling 7 source files to /home/nigel/src/egeria/open-metadata-test/open-metadata-fvt/access-services-fvt/asset-consumer-fvt/tar
