Skip to content
forked from numaproj/numaflow

Kubernetes-native platform to run massively parallel data/streaming jobs

License

Notifications You must be signed in to change notification settings

xdevxy/numaflow

This branch is 2 commits ahead of, 861 commits behind numaproj/numaflow:main.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Mar 30, 2023
4fda89f · Mar 30, 2023
Mar 30, 2023
Mar 30, 2023
Feb 28, 2023
Mar 30, 2023
Mar 30, 2023
Mar 23, 2023
Nov 3, 2022
Mar 30, 2023
Mar 29, 2023
Mar 30, 2023
Mar 29, 2023
May 20, 2022
Mar 13, 2023
May 20, 2022
May 20, 2022
Jan 16, 2023
May 20, 2022
Mar 30, 2023
Jan 13, 2023
Oct 30, 2022
Oct 14, 2022
Mar 16, 2023
Mar 16, 2023
Mar 27, 2023
Nov 3, 2022

Repository files navigation

Numaflow

Go Report Card slack GoDoc License Release Version CII Best Practices

Summary

Numaflow is a Kubernetes-native tool for running massively parallel stream processing. A Numaflow Pipeline is implemented as a Kubernetes custom resource and consists of one or more source, data processing, and sink vertices.

Numaflow installs in a few minutes and is easier and cheaper to use for simple data processing applications than a full-featured stream processing platforms.

Use Cases

  • Real-time data analytics applications.
  • Event driven applications such as anomaly detection, monitoring and alerting.
  • Streaming applications such as data instrumentation and data movement.
  • Workflows running in a streaming manner.

Key Features

  • Kubernetes-native: If you know Kubernetes, you already know how to use Numaflow.
  • Language agnostic: Use your favorite programming language.
  • Exactly-Once semantics: No input element is duplicated or lost even as pods are rescheduled or restarted.
  • Auto-scaling with back-pressure: Each vertex automatically scales from zero to whatever is needed.

Data Integrity Guarantees:

  • Minimally provide at-least-once semantics
  • Provide exactly-once semantics for unbounded and near real-time data sources
  • Preserving order is not required

Roadmap

  • User Defined Transformer at Source for better deserialization and filter for cost reduction (v0.8)
  • Multi partitioned edges for higher throughput (v0.9)

Resources

About

Kubernetes-native platform to run massively parallel data/streaming jobs

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Go 85.6%
  • TypeScript 11.7%
  • Shell 1.4%
  • Makefile 0.6%
  • CSS 0.2%
  • Smarty 0.2%
  • Other 0.3%