DataOps
DataOps is an automated, process-oriented methodology, used by analytic and data teams, to improve the quality and reduce the cycle time of data analytics. While DataOps began as a set of best practices, it has now matured to become a new and independent approach to data analytics. DataOps applies to the entire data lifecycle from data preparation to reporting, and recognizes the interconnected nature of the data analytics team and information technology operations.
Here are 93 public repositories matching this topic...
Describe the issue
Currently we run the Linter CI for golang repos using the golang-ci-linter binary. But according to the documentation it is faster and better to use the github action.
https://golangci-lint.run/usage/install/
The UX of all the error highlighting is also better.
What if we do not do this?
Finding linter errors is troublesome as users have to parse through the cons
-
Updated
Feb 15, 2022 - Shell
-
Updated
Feb 18, 2022 - Scala
We're using marshmallow to parse whylogs config from YAML
However, Pydantic is much more powerful as it allows users to set config via various mechanims, from YAML, JSON to Environment settings.
We should consider moving to pydantic
Sending a rest call to delete a job specification throws 404 where as grpc call works fine. Steps to reproduce
curl -X DELETE "http://localhost:9100/v1/project/my-project/namespace/kush/helloworld" -H "accept: application/json"-
Updated
Feb 17, 2022 - Python
Support copy into queries
-
Updated
Mar 7, 2022
-
Updated
Mar 7, 2022 - Shell
-
Updated
Mar 10, 2022 - Java
Currently, both Kafka and Influx sink logs only the data(Row) that is being sent.
Add support for logging column names as well along with data points similar to the implementation in log sink.
This will enable users to correlate the data points with column names.
Zap configurations should be pushed to grpc middleware here: cmd/setup.go#L47
-
Updated
Feb 16, 2022 - Shell
-
Updated
Mar 16, 2022 - Python
-
Updated
Feb 25, 2022 - Go
In golang client, consumers get dynamic message instance after parsing. Add an example in the docs on how to use dynamic message instance to get values of different types in consumer code.
List of protobuf types to cover
- timestamp
- duration
- bytes
- message type
- struct
- map
Describe the bug
"client not configured." error is printed even after the config init step.
❯ ./guardian config init
config created: /Users/sushmithbhatta/.config/odpf/guardian.yml
client not configured. try running `guardian config init`To Reproduce
Steps to reproduce the behavior:
- Remove
~/.config/odpf/guardian.ymlfile if exists in local - Run `guard
-
Updated
Feb 27, 2022 - Python
-
Updated
Aug 2, 2020 - Smarty
-
Updated
Feb 18, 2022 - Go
-
Updated
Jan 31, 2022
Siren creates alertmanager config and sycns with alertmanger. The alert manager config can change for the same subscriptions if their order changes. We should follow some sorting conventions and stick to those conventions to create an alert manager config.
-
Updated
Feb 1, 2022 - Python
We are using the protobuf-git configuration as described at https://cloudhut.dev/docs/features/protobuf#git-repository
In our repository the proto-files live within a
protodirectory, which seems to be very common, and contains 5 levels of nested folders.Currently KOWL searches only the first 5 levels of the checkout for
.protofiles, so our last level is not considered.Please