Skip to content
#

data-engineering

Here are 801 public repositories matching this topic...

zangell44
zangell44 commented Apr 27, 2021

Current behavior

Right now, the connection string to Azure can be passed as a string at initialization or read AZURE_STORAGE_CONNECTION_STRING from the environment.

The connection string property is not serialized with the storage object. The only way to get this to work is to have AZURE_STORAGE_CONNECTION_STRING available when the flow is retrieved from storage. For most agent types, t

gardnerdev
gardnerdev commented Jan 16, 2021

Describe the bug
When trying to run scaffolding (profiling) command, it fails because of commas in columns.

To Reproduce
Steps to reproduce the behavior:

  1. Run great_expectations suite scaffold scaffold-name on datasource where commas are in column
  2. Bug pandas.errors.ParserError: Error tokenizing data. C error: Expected 1 fields in line 5323 saw 2

Expected behavior
D

anks7190
anks7190 commented Jan 27, 2021

Hi ,

I am using some basic functions from pyjanitor such as - clean_names() , collapse_levels() in one of my code which I want to productionise.
And there are limitations on the size of the production code base.
Currently ,if I just look at the requirements.txt for just "pyjanitor" , its huge .
I don't think I require all the dependencies in my code.
How can I remove the unnecessary ones ?

Improve this page

Add a description, image, and links to the data-engineering topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-engineering topic, visit your repo's landing page and select "manage topics."

Learn more