#

spark-sql

Here are 352 public repositories matching this topic...

getredash / redash

Star

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

visualization javascript mysql python bigquery bi spark dashboard athena analytics postgresql business-intelligence redash redshift databricks spark-sql

Updated Sep 3, 2020
JavaScript

dotnet / spark

Star

Open

[FEATURE REQUEST]: Spark 3.0 Readiness

8

suhsteve commented Aug 19, 2020

APIs

SparkSession

python def getActiveSession(cls)
scala def executeCommand(runner: String, command: String, options: Map[String, String]): DataFrame

DataFrame

python def transform(self, func)
python def tail(self, num)
scala def tail(n: Int): Array[T]
scala def printSchema(level: Int): Unit
scala def explain(mode: String): U

Read more

enhancement good first issue help wanted

Open

[FEATURE REQUEST]: Expose metadata

3

Open

[FEATURE REQUEST]: Implement ML Features

15

Find more good first issues →

almond-sh / almond

Star

A scala kernel for Jupyter

scala spark jupyter repl jupyter-notebook jupyter-kernels spark-sql

Updated Sep 4, 2020
Scala

oeljeklaus-you / UserActionAnalyzePlatform

Star

电商用户行为分析大数据平台

java spark hadoop sparkjava accumulator spark-sql kyro

Updated Jul 1, 2020
Java

qubole / sparklens

Star

Qubole Sparklens tool for performance tuning Apache Spark

performance scala spark simulation cluster scheduler scheduling performance-metrics performance-tuning performance-visualization performance-analysis sparkjava spark-job spark-applications spark-sql spark-mllib spark-ml

Updated Jun 29, 2020
Scala

yaooqinn / kyuubi

Star

Kyuubi is an enhanced editon of Apache Spark's primordial Thrift JDBC/ODBC Server.

multi-tenant sql spark yarn hive jdbc odbc thrift sql-query hiveserver2 spark-sql kyuubi kyuubi-server thrift-jdbc odbc-server

Updated Aug 21, 2020
Scala

microsoft / data-accelerator

Star

Open

Web: Improve readability for screen readers

carlbrochu commented Apr 18, 2019

Is your feature request related to a problem? Please describe.
Some areas of the web portal have issues with screen readers. Here are a few examples

Describe the solution you'd like
Improve readability for screen readers across the web portal

Read more

bug good first issue help wanted

Open

Web: Enable uploading jar files and csv from the web portal

Open

Web: Handle saving automatically and avoid navigating away with changes

Find more good first issues →

jaceklaskowski / spark-workshop

Star

Apache Spark™ and Scala Workshops

workshop spark apache-spark spark-sql spark-mllib spark-structured-streaming spark-workshops

Updated Feb 15, 2020
HTML

jaceklaskowski / mastering-spark-sql-book

Star

The Internals of Spark SQL

apache-spark mkdocs internals spark-sql

Updated Sep 3, 2020

Chabane / bigdata-playground

Star

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

Updated Feb 1, 2019
TypeScript

polomarcus / Spark-Structured-Streaming-Examples

Star

Spark Structured Streaming / Kafka / Cassandra / Elastic

kafka spark cassandra structured-streaming spark-sql

Updated Oct 1, 2018
Scala

microsoft / MCW-Big-data-and-visualization

Star

MCW Big data and visualization

machine-learning power-bi spark-sql hdinsight azure-data-factory database-administrator

Updated Sep 2, 2020
JavaScript

mc2-project / opaque

Star

An encrypted data analytics platform

security machine-learning privacy spark analytics enclave spark-sql

Updated Aug 22, 2020
C++

databricks / LearningSparkV2

Star

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

spark apache-spark mllib structured-streaming spark-sql spark-mllib mlflow delta-lake

Updated Sep 3, 2020
Scala

kevinschaich / pyspark-cheatsheet

Star

🐍 Quick reference guide to common patterns & functions in PySpark.

documentation data-science data docs spark reference guide pyspark cheatsheet cheat quickstart references guides cheatsheets spark-sql pyspark-tutorial

Updated Mar 9, 2020

wangj1106 / recommendMoteur

Star

电影推荐系统、电影推荐引擎、使用Spark完成的电影推荐引擎

movies kafka spark spark-streaming recommendation-engine recommender-system flume als recommendation spark-sql

Updated Jun 25, 2018
Scala

minio / spark-select

Star

A library for Spark DataFrame using MinIO Select API

select spark sbt bigdata pyspark minio parquet-files spark-sql amazon-s3

Updated Sep 27, 2019
Scala

Re1tReddy / Spark

Star

Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.This project will have sample programs for Spark in Scala language .

streaming consumer parquet kafka-producer spark-sql spark-kafka-integration spark-streaming-data spark-transformations spark-to-cassandra-connection spark-dataframes spark-joins spark-hive-context spark-jdbc-connection spark-with-mangodb spark-aggregations-using-dataframe spark-use-cases cassandra-installation spark-datadog spark-mangodb spark-catalog-api

Updated Jul 1, 2020
Scala

streamnative / awesome-pulsar

Star

A curated list of Pulsar tools, integrations and resources.

spark apache-spark messaging prometheus apache-storm apache-flink apache-kafka pub-sub grafana-dashboard spark-sql elastic-beats spark-structured-streaming apache-bookkeeper apache-pulsar

Updated Dec 25, 2019

huangyueranbbc / SparkDemo

Star

spark全示例代码(java、scala) Spark most full instance code DEMO (java、scala)

spark hadoop bigdata spark-streaming operator sparkline sparkjava spark-sql sparkfun-products sparkp

Updated May 9, 2020
Java

streamnative / pulsar-spark

Star

When Apache Pulsar meets Apache Spark

data-science spark apache-spark stream-processing data-processing batch-processing structured-streaming spark-sql apache-pulsar

Updated Feb 27, 2020
Scala

harryprince / geospark

Star

bring sf to spark in production

r apache-spark gis spatial-analysis spark-sql spatial-queries sparklyr-extension large-scale-spatial-analysis

Updated Mar 2, 2020
R

dbiir / paraflow

Star

A real-time analytical system for ID-associated data

kafka presto hadoop parquet orc spark-sql

Updated Jul 1, 2020
Java

kaantas / spark-twitter-sentiment-analysis

Star

Sentiment Analysis of a Twitter Topic with Spark Structured Streaming

python twitter kafka spark apache-spark sentiment-analysis twitter-api pyspark apache-kafka afinn twitter-sentiment-analysis spark-sql spark-structured-streaming pykafka twitter-topic

Updated Dec 12, 2018
Python

airbnb / airbnb-spark-thrift

Star

A library for loadling Thrift data into Spark SQL

spark thrift spark-streaming spark-sql

Updated Sep 7, 2018
Scala

xiaogp / recsys_spark

Star

Spark SQL 实现 ItemCF，UserCF，Swing，推荐系统，推荐算法，协同过滤

collaborative-filtering recommender-system spark-sql

Updated Dec 19, 2019
Scala

sjyttkl / spark_learning

Star

尚硅谷大数据Spark-2019版最新 Spark 学习

spark spark-sql spark-core

Updated Aug 11, 2020
Scala

hablapps / sparkOptics

Star

Optics for Spark DataFrames

scala spark optics dataframe dataframes spark-sql

Updated Mar 20, 2020
Scala

mayur2810 / sope

Star

Apache Spark ETL Utilities

yaml framework scala spark etl dsl transformer spark-sql

Updated Aug 28, 2020
Scala

harryprince / awesome-sparklyr

Star

An awesome sparklyr related package collection

machine-learning awesome r big-data apache-spark dbi sparklyr spark-sql r-stats

Updated Mar 17, 2020

Improve this page

Add a description, image, and links to the spark-sql topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the spark-sql topic, visit your repo's landing page and select "manage topics."

You can’t perform that action at this time.