pachyderm

所属分类:大数据
开发工具:GO
文件大小:21366KB
下载次数:0
上传日期:2023-06-09 23:00:13
上 传 者sh-1993
说明:  以数据为中心的管道和数据版本控制
(Data-Centric Pipelines and Data Versioning)

文件列表:
.circleci (0, 2023-09-13)
.circleci\.prettierrc.yml (28, 2023-09-13)
.circleci\config.yml (8809, 2023-09-13)
.circleci\main.yml (84115, 2023-09-13)
.dockerignore (26, 2023-09-13)
.drone.yml (95, 2023-09-13)
.golangci.yml (3945, 2023-09-13)
.ignore (6, 2023-09-13)
.spelling (2273, 2023-09-13)
.vscode (0, 2023-09-13)
.vscode\settings.json (299, 2023-09-13)
.vscode\tasks.json (867, 2023-09-13)
CHANGELOG-1.x.md (73554, 2023-09-13)
CHANGELOG.md (52299, 2023-09-13)
CONTRIBUTING.md (930, 2023-09-13)
Dockerfile.etcd (416, 2023-09-13)
Dockerfile.mount-server (189, 2023-09-13)
Dockerfile.pachctl (156, 2023-09-13)
Dockerfile.pachd (501, 2023-09-13)
Dockerfile.pachdoc (367, 2023-09-13)
Dockerfile.pachdoc.dockerignore (19, 2023-09-13)
Dockerfile.pgbouncer (1419, 2023-09-13)
Dockerfile.worker (679, 2023-09-13)
LICENSE (10771, 2023-09-13)
Makefile (19093, 2023-09-13)
Pachyderm_Icon-01.svg (24501, 2023-09-13)
dex-assets (0, 2023-09-13)
... ...

[![GitHub release](https://img.shields.io/github/release/pachyderm/pachyderm.svg?style=flat-square)](https://github.com/pachyderm/pachyderm/releases) [![GitHub license](https://img.shields.io/badge/license-Pachyderm-blue)](https://github.com/pachyderm/pachyderm/blob/master/LICENSE) [![GoDoc](https://godoc.org/github.com/pachyderm/pachyderm?status.svg)](https://pkg.go.dev/github.com/pachyderm/pachyderm/v2/src/client) [![Go Report Card](https://goreportcard.com/badge/github.com/pachyderm/pachyderm)](https://goreportcard.com/report/github.com/pachyderm/pachyderm) [![Slack Status](https://badge.slack.pachyderm.io/badge.svg)](https://slack.pachyderm.io) [![CLA assistant](https://cla-assistant.io/readme/badge/pachyderm/pachyderm)](https://cla-assistant.io/pachyderm/pachyderm) # Pachyderm “ Automate data transformations with data versioning and lineage Pachyderm is cost-effective at scale, enabling data engineering teams to automate complex pipelines with sophisticated data transformations across any type of data. Our unique approach provides parallelized processing of multi-stage, language-agnostic pipelines with data versioning and data lineage tracking. Pachyderm delivers the ultimate CI/CD engine for data. ## Features - Data-driven pipelines automatically trigger based on detecting data changes. - Immutable data lineage with data versioning of any data type. - Autoscaling and parallel processing built on Kubernetes for resource orchestration. - Uses standard object stores for data storage with automatic deduplication. - Runs across all major cloud providers and on-premises installations. ## Getting Started To start deploying your end-to-end version-controlled data pipelines, run Pachyderm [locally](https://docs.pachyderm.com/latest/set-up/local-deploy/) or you can also [deploy on AWS/GCE/Azure](https://docs.pachyderm.com/latest/set-up/cloud-deploy) in about 5 minutes. You can also refer to our complete [documentation](https://docs.pachyderm.com) to see tutorials, check out example projects, and learn about advanced features of Pachyderm. If you'd like to see some examples and learn about core use cases for Pachyderm: - [Examples](https://github.com/pachyderm/examples) - [Use Cases](https://www.pachyderm.com/use-cases/) - [Case Studies](https://www.pachyderm.com/case-studies/) ## Documentation [Official Documentation](https://docs.pachyderm.com/) ## Community Keep up to date and get Pachyderm support via: - [![Twitter](https://img.shields.io/twitter/follow/pachyderminc?style=social)](https://twitter.com/pachyderminc) Follow us on Twitter. - [![Slack Status](https://badge.slack.pachyderm.io/badge.svg)](https://slack.pachyderm.io) Join our community [Slack Channel](https://slack.pachyderm.io) to get help from the Pachyderm team and other users. ## Contributing To get started, sign the [Contributor License Agreement](https://cla-assistant.io/pachyderm/pachyderm). You should also check out our [contributing guide](https://docs.pachyderm.com/latest/contributing/setup/). Send us PRs, we would love to see what you do! You can also check our GH issues for things labeled "help-wanted" as a good place to start. We're sometimes bad about keeping that label up-to-date, so if you don't see any, just let us know. ## Usage Metrics Pachyderm automatically reports anonymized usage metrics. These metrics help us understand how people are using Pachyderm and make it better. They can be disabled by setting the env variable `METRICS` to `false` in the pachd container.

近期下载者

相关文件


收藏者