Skip to content
#

spark-sql

Here are 352 public repositories matching this topic...

suhsteve
suhsteve commented Aug 19, 2020

APIs

SparkSession

  • python def getActiveSession(cls)
  • scala def executeCommand(runner: String, command: String, options: Map[String, String]): DataFrame

DataFrame

  • python def transform(self, func)
  • python def tail(self, num)
    scala def tail(n: Int): Array[T]
  • scala def printSchema(level: Int): Unit
  • scala def explain(mode: String): U

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

  • Updated Feb 1, 2019
  • TypeScript

Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.This project will have sample programs for Spark in Scala language .

  • Updated Jul 1, 2020
  • Scala

Improve this page

Add a description, image, and links to the spark-sql topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the spark-sql topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.