Skip to content
#

data-processing

Here are 500 public repositories matching this topic...

jeblad
jeblad commented Feb 16, 2021

🚨🚨 Feature Request

  • Related to an existing Issue
  • A new implementation (Improvement, Extension)

Move the docker setup to a separate repository. Moving this outside core makes cleaner code.

Keeping docker-setup inside the core code, then adding a number of other similar systems like Vagrant, will over time create a mess. Better move them out. It also mak

quancore
quancore commented May 4, 2021

Describe the bug
Trying to validate a MultiIndex data frame with a schema that Index has been set to single Index.

  • I have checked that this issue has not already been reported.
  • I have confirmed this bug exists on the latest version of pandera.
  • (optional) I have confirmed this bug exists on the master branch of pandera.

Code Sample, a copy-pastable example

`

pysparkling
svaningelgem
svaningelgem commented Jan 27, 2021

The exception in subject is thrown by the following code:

from datetime import date
from pysparkling.sql.session import SparkSession
from pysparkling.sql.functions import collect_set

spark = SparkSession.Builder().getOrCreate()

dataset_usage = [
    ('steven', 'UUID1', date(2019, 7, 22)),
]
dataset_usage_schema = 'id: string, datauid: string, access_date: date'

df = spa

Improve this page

Add a description, image, and links to the data-processing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-processing topic, visit your repo's landing page and select "manage topics."

Learn more