#

big-data

Here are 1,888 public repositories matching this topic...

donnemartin commented Jul 1, 2015

"Bokeh is a Python interactive visualization library that targets modern web browsers for presentation. Its goal is to provide elegant, concise construction of novel graphics in the style of D3.js, but also deliver this capability with high-performance interactivity over very large or streaming datasets. Bokeh can help anyone who would like to quickly and easi

ines commented Sep 29, 2019

I was going through the existing enhancement issues again and thought it'd be nice to collect ideas for spaCy plugins and related projects. There are always people in the community who are looking for new things to build, so here's some inspiration. For existing plugins and projects, check out the spaCy universe.

If you have questions about the projects I suggested,

bionicles commented Jan 3, 2020

There's no published benchmark for IOPS on S3 storage

Would it be possible to post this alongside the other benchmarks?

S3 storage would be a super cheap way to get started because it's serverless (thus more folks would potentially use gun.js).

Thank you for the useful service. I would like to see more Auth/ABAC for startup usage; right now I'm using a centralized database because it's uncle

Open Source Fast Scalable Machine Learning Platform For Smarter Applications: Deep Learning, Gradient Boosting & XGBoost, Random Forest, Generalized Linear Modeling (Logistic Regression, Elastic Net), K-Means, PCA, Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

  • Updated Feb 19, 2020
  • Java
mentax007 commented Dec 22, 2019

Description

I installed a new 6-node cluster, enabled authentication, and added 5 nodes through Fauxton.
When I run Verify CouchDB Installation from Fauxton, I see an error in the Replication check:

Error: unauthorized to access or create database http://0.0.0.0:5984/verifytestdb_replicate/
And on one of the nodes I see an error:
[error] 2019-12-22T16:05:37.312700Z couchdb@s2dfw.domain.net <0.26254.18

awick commented Jan 30, 2020

Can't search fields that can be in both request and response. For example, adding content-type to both request and response headers creates a single http.content-type expression, and which one it actually searches is unknown. We should probably create http.request.content-type and http.response.content-type or something similar.
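The naming collision described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the project's actual code: `register_fields` and its `qualify` flag are hypothetical names introduced here, and the only names taken from the issue are the expression names themselves.

```python
def register_fields(headers_by_direction, qualify):
    """Map each expression name to the (direction, header) pairs it covers.

    Hypothetical helper for illustration only. With qualify=False the
    direction is dropped from the name, so request and response headers
    with the same name end up behind one expression.
    """
    expressions = {}
    for direction, headers in headers_by_direction.items():
        for header in headers:
            if qualify:
                name = f"http.{direction}.{header}"
            else:
                name = f"http.{header}"
            expressions.setdefault(name, []).append((direction, header))
    return expressions


headers = {"request": ["content-type"], "response": ["content-type"]}

# One shared name: a search cannot tell which side it is looking at.
shared = register_fields(headers, qualify=False)

# Direction-qualified names: each expression covers exactly one side.
split = register_fields(headers, qualify=True)

print(shared)
print(split)
```

With the shared name, a single expression stands for two different header sources, which is the ambiguity the issue reports. Qualifying the name by direction gives each source its own expression.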

The workaround for now is

[custom-fields]
http.request.content-type=db:http.reque
yiheng commented Jul 11, 2018

Spark 2.3 officially supports running on Kubernetes, but our "Run on Kubernetes" guide is still based on a special version of Spark 2.2, which is out of date. We need to:

  1. update that document to Spark 2.3
  2. release the corresponding docker images.
ramkumarkb commented Feb 5, 2020

I have noticed a small error in the documentation around S3 configurations:
https://docs.delta.io/latest/delta-storage.html#amazon-s3

On the read part, it should be load and not save:
spark.read.format("delta").load("s3a://<your-s3-bucket>/<path>/<to>/<delta-table>")
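To make the load/save distinction concrete, here is a minimal Python sketch. It assumes a Spark session already configured for Delta Lake and S3 is passed in; the helper names `delta_uri`, `read_delta`, and `write_delta` are hypothetical, and the bucket and path are placeholders as in the line above.

```python
def delta_uri(bucket, path):
    """Build an s3a:// URI for a Delta table (bucket and path are placeholders)."""
    return f"s3a://{bucket}/{path.strip('/')}"


def read_delta(spark, bucket, path):
    """Read a Delta table: the reader side ends in load(), not save()."""
    # `spark` is assumed to be a SparkSession with Delta and S3 configured.
    return spark.read.format("delta").load(delta_uri(bucket, path))


def write_delta(df, bucket, path):
    """Write a DataFrame as a Delta table: the writer side ends in save()."""
    df.write.format("delta").save(delta_uri(bucket, path))
```

The two functions differ only in which side of the API they use: `spark.read` returns a reader whose terminal call is `load`, while `df.write` returns a writer whose terminal call is `save`.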

Also, I have successfully tested Delta 0.5.0 with on-premise S3 - https://min.io
There were some quirks around the
