Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Operational Awareness] HA support for KIM to improve availability and become scalable #519

Open
6 tasks
tobiscr opened this issue Nov 21, 2024 · 0 comments
Open
6 tasks
Labels
area/control-plane Related to all activities around Kyma Control Plane kind/feature Categorizes issue or PR as related to a new feature.

Comments

@tobiscr
Copy link
Contributor

tobiscr commented Nov 21, 2024

Description

KIM is currently not running in HA mode as multiple replicas are not supported. This is blocking scalabiilty and can also have impacts on rolling updates.

To get rid of this restriction, KIM has to be adjusted to run in HA mode (multiple replicas) and it has to distribute the load proportionally over the available KIM instances.

AC:

  • Implement a POC confirming that KIM is fully stateless and can run with multiple replicas.
    • Verify how load can be distributed over multiple instances
      • e.g. by sharding RuntimeCRs
      • using a shared queue like RabbitMQ
      • use leader election to ensure just on instance is processing the queue dispach events via Kubernetes events etc.
    • Propose an architectural solution for HA of KIM to the team which covers
      • pros / cons of the different options for HA (see list above)
      • how scalability will be supported by the chosen HA approach
  • Depending on the architectural conclusion, adjust KIM to support HA

Reasons

HA is required to become scalable and improve availability of KIM.

Attachments

@tobiscr tobiscr added kind/feature Categorizes issue or PR as related to a new feature. area/control-plane Related to all activities around Kyma Control Plane labels Nov 21, 2024
@tobiscr tobiscr changed the title HA support for KIM to improve availability and become scalable [Operational Awareness] HA support for KIM to improve availability and become scalable Nov 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/control-plane Related to all activities around Kyma Control Plane kind/feature Categorizes issue or PR as related to a new feature.
Projects
None yet
Development

No branches or pull requests

1 participant