apache-spark

MLflow seems to have a length limit of 5000 when setting tags (see below).

[...]
  File "/home/smay/miniconda3/envs/py38/lib/python3.8/site-packages/mlflow/utils/validation.py", line 136, in _validate_length_limit
    raise MlflowException(
mlflow.exceptions.MlflowException: Tag value '[0.8562690322984875, 0.8544098885636596, 0.8544098885636596, 0.8544098885636596, 0.85440988856365

APIs

SparkSession

python def getActiveSession(cls)
scala def executeCommand(runner: String, command: String, options: Map[String, String]): DataFrame

DataFrame

python def transform(self, func)
python def tail(self, num)
scala def tail(n: Int): Array[T]
scala def printSchema(level: Int): Unit
scala def explain(mode: String): U

apache-spark

Here are 930 public repositories matching this topic...

mlflow / mlflow

[FR] Check and truncate string length

[BUG] Run name not set when using start_run inside a MLproject execution

[BUG] uploading artifacts to FTP server doesn't work

lw-lin / CoolplaySpark

spark-notebook / spark-notebook

intel-analytics / analytics-zoo

OryxProject / oryx

dotnet / spark

[FEATURE REQUEST]: Spark 3.0 Readiness

APIs

SparkSession

DataFrame

[FEATURE REQUEST]: Expose metadata

[FEATURE REQUEST]: Implement ML Features

big-data-europe / docker-spark

GoogleCloudPlatform / spark-on-k8s-operator

lensacom / sparkit-learn

databricks / spark-sklearn

awesome-spark / awesome-spark

japila-books / apache-spark-internals

ironmussa / Optimus

microsoft / Mobius

sparklyr / sparklyr

apache / incubator-sedona

miguno / kafka-storm-starter

san089 / goodreads_etl_pipeline

cerndb / dist-keras

apache-spark-on-k8s / spark

nchammas / flintrock

openscoring / openscoring

lw-lin / streaming-readings

tweag / sparkle

rjurney / Agile_Data_Code_2

infoslack / awesome-kafka

jaceklaskowski / spark-structured-streaming-book

miguno / wirbelsturm

LucaCanali / sparkMeasure

Hydrospheredata / mist

Improve this page

Add this topic to your repo