Kubernetes

Memgraph can be deployed on Kubernetes. The easiest way to do that is with Helm, the package manager for Kubernetes. Helm uses a packaging format called charts. A chart is a collection of files that describe a related set of Kubernetes resources.

Currently, we prepared and released the following charts:

The Helm charts are published on Artifact Hub. For details on the implementation of the Helm charts, check Memgraph Helm charts repository.

Due to numerous possible use cases and deployment setups via Kubernetes, the provided Helm charts are a starting point you can modify according to your needs. This page will highlight some of the specific parts of the Helm charts that you might want to adjust.

Helm chart for standalone Memgraph

Memgraph is a stateful application (database), hence the Helm chart for standalone Memgraph is configured to deploy Memgraph as a Kubernetes StatefulSet workload.

It will deploy a single Memgraph instance in a single pod.

Typically, when deploying a stateful application like Memgraph, a StatefulSet workload is used to ensure that each pod has a unique identity and stable network identity. When deploying Memgraph, it is also necessary to define a PersistentVolumeClaims to store the data directory (/var/lib/memgraph). This enables the data to be persisted even if the pod is restarted or deleted.

Storage configuration

By default, the Helm chart will create a PersistentVolumeClaim (PVC) for storage and logs. If the storage class for PVC is not defined, PVC will use the default one available in the cluster. The storage class can be configured in the values.yaml file. To avoid losing your data, make sure you have Retain reclaim policy. If you delete PersistentVolumeClaim without having Retain reclaim policy, you will lose your data because PersistentVolume will be deleted.

An example of a storage class for AWS EBS volumes:

storageClass:
  name: "gp2"
  provisioner: "kubernetes.io/aws-ebs"
  storageType: "gp2"
  fsType: "ext4"
  reclaimPolicy: "Retain"
  volumeBindingMode: "Immediate"

Default template for a storage class is part of the Helm chart and can be found in the repository.

More details on the configuration options can be found in the configuration section.

Secrets

The Helm chart allows you to use Kubernetes secrets to store Memgraph credentials. By default, the secrets are disabled. If you want to use secrets, you can enable them in the values.yaml file.

The secrets are prepared to work for environment variables MEMGRAPH_USER and MEMGRAPH_PASSWORD.

System configuration

The Helm chart will set the linux kernel vm.max_map_count parameter to 262144 by default to ensure Memgraph won’t run into issues with memory mapping.

The vm.max_map_count parameter is a kernel parameter that specifies the maximum number of memory map areas a process may have. This change will be applied to all nodes in the cluster. If you want to disable this feature, you can set sysctlInitContainer.enabled to false in the values.yaml file.

Installing Memgraph standalone Helm chart

To include a standalone Memgraph into your Kubernetes cluster, you need to add the repository and install Memgraph.

The steps below will work in the Minikube environment, but you can also use them in other Kubernetes environments with minor adjustments.

Add the repository

Add the Memgraph Helm chart repository to your local Helm setup by running the following command:

helm repo add memgraph https://memgraph.github.io/helm-charts

Make sure to update the repository to fetch the latest Helm charts available:

helm repo update

Install Memgraph

To install Memgraph Helm chart, run the following command:

helm install <release-name> memgraph/memgraph

Replace <release-name> with the name of the release you chose.

Access Memgraph

Once Memgraph is installed, you can access it using the provided services and endpoints, such as various client libraries, command-line interface mgconsole or visual user interface Memgraph Lab.

Configuration options

The following table lists the configurable parameters of the Memgraph chart and their default values.

ParameterDescriptionDefault
image.repositoryMemgraph Docker image repositorymemgraph/memgraph
image.tagSpecific tag for the Memgraph Docker image. Overrides the image tag whose default is chart version."" (Defaults to chart’s app version)
image.pullPolicyImage pull policyIfNotPresent
useImagePullSecretsOverride the default imagePullSecretsfalse
imagePullSecretsSpecify image pull secrets- name: regcred
replicaCountNumber of Memgraph instances to run. Note: no replication or HA support.1
affinity.nodeKeyKey for node affinity (Preferred)""
affinity.nodeValueValue for node affinity (Preferred)""
nodeSelectorConstrain which nodes your Memgraph pod is eligible to be scheduled on, based on the labels on the nodes. Left empty by default.{}
service.typeKubernetes service typeClusterIP
service.enableBoltEnable Bolt protocoltrue
service.boltPortBolt protocol port7687
service.boltProtocolProtocol used by BoltTCP
service.enableWebsocketMonitoringEnable WebSocket monitoringfalse
service.websocketPortMonitoringWebSocket monitoring port7444
service.websocketPortMonitoringProtocolProtocol used by WebSocket monitoringTCP
service.enableHttpMonitoringEnable HTTP monitoringfalse
service.httpPortMonitoringHTTP monitoring port9091
service.httpPortMonitoringProtocolProtocol used by HTTP monitoringhttp
service.annotationsAnnotations to add to the service{}
persistentVolumeClaim.createStorageClaimEnable creation of a Persistent Volume Claim for storagetrue
persistentVolumeClaim.storageClassNameStorage class name for the persistent volume claim""
persistentVolumeClaim.storageSizeSize of the persistent volume claim for storage10Gi
persistentVolumeClaim.existingClaimUse an existing Persistent Volume Claimmemgraph-0
persistentVolumeClaim.storageVolumeNameName of an existing Volume to create a PVC for""
persistentVolumeClaim.createLogStorageEnable creation of a Persistent Volume Claim for logstrue
persistentVolumeClaim.logStorageClassNameStorage class name for the persistent volume claim for logs""
persistentVolumeClaim.logStorageSizeSize of the persistent volume claim for logs1Gi
memgraphConfigList of strings defining Memgraph configuration settings["--also-log-to-stderr=true"]
secrets.enabledEnable the use of Kubernetes secrets for Memgraph credentialsfalse
secrets.nameThe name of the Kubernetes secret containing Memgraph credentialsmemgraph-secrets
secrets.userKeyThe key in the Kubernetes secret for the Memgraph user, the value is passed to the MEMGRAPH_USER envUSER
secrets.passwordKeyThe key in the Kubernetes secret for the Memgraph password, the value is passed to the MEMGRAPH_PASSWORDPASSWORD
memgraphEnterpriseLicenseMemgraph Enterprise License""
memgraphOrganizationNameOrganization name for Memgraph Enterprise License""
statefulSetAnnotationsAnnotations to add to the stateful set{}
podAnnotationsAnnotations to add to the pod{}
resourcesCPU/Memory resource requests/limits. Left empty by default.{}
tolerationsA toleration is applied to a pod and allows the pod to be scheduled on nodes with matching taints. Left empty by default.[]
serviceAccount.createSpecifies whether a service account should be createdtrue
serviceAccount.annotationsAnnotations to add to the service account{}
serviceAccount.nameThe name of the service account to use. If not set and create is true, a name is generated.""
container.terminationGracePeriodSecondsGrace period for pod termination1800
probes.liveliness.initialDelaySecondsInitial delay for liveliness probe10
probes.liveliness.periodSecondsPeriod seconds for liveliness probe60
probes.liveliness.failureThresholdFailure threshold for liveliness probe3
probes.readiness.initialDelaySecondsInitial delay for readiness probe10
probes.readiness.periodSecondsPeriod seconds for readiness probe30
probes.readiness.failureThresholdFailure threshold for readiness probe3
probes.startup.initialDelaySecondsInitial delay for startup probe10
probes.startup.periodSecondsPeriod seconds for startup probe10
probes.startup.failureThresholdFailure threshold for startup probe30
nodeSelectorsNode selectors for pod. Left empty by default.{}
customQueryModulesList of custom Query modules that should be mounted to Memgraph Pod[]
sysctlInitContainer.enabledEnable the init container to set sysctl parameterstrue
sysctlInitContainer.maxMapCountValue for vm.max_map_count to be set by the init container262144
storageClass.nameName of the StorageClass"memgraph-generic-storage-class"
storageClass.provisionerProvisioner for the StorageClass""
storageClass.storageTypeType of storage for the StorageClass""
storageClass.fsTypeFilesystem type for the StorageClass""
storageClass.reclaimPolicyReclaim policy for the StorageClassRetain
storageClass.volumeBindingModeVolume binding mode for the StorageClassImmediate

To change the default chart values, provide your own values.yaml file during the installation:

helm install <resource-name> memgraph/memgraph -f values.yaml

Default chart values can also be changed by setting the values of appropriate parameters:

helm install <resource-name> memgraph/memgraph --set <flag1>=<value1>,<flag2>=<value2>,...

Memgraph will start with the --also-log-to-stderr=true flag, meaning the logs will also be written to the standard error output and you can access logs using the kubectl logs command. To modify other Memgraph database settings, you should update the memgraphConfig parameter. It should be a list of strings defining the values of Memgraph configuration settings. For example, this is how you can define memgraphConfig parameter in your values.yaml:

memgraphConfig: 
  - "--also-log-to-stderr=true"
  - "--log-level=TRACE"

For all available database settings, refer to the Configuration settings reference guide.

Helm chart for Memgraph high availability cluster (Enterprise)

A Helm chart for deploying Memgraph in high availability setup. This helm chart requires Memgraph Enterprise license.

Memgraph HA cluster includes 3 coordinators and 2 data instances by default. Since multiple instances are deployed, you need to have multiple workers nodes in Kubernetes to deploy the Memgraph HA cluster.

Most of the features and configurations discussed in the Helm chart for Standalone Memgraph are also applicable to the Memgraph HA Helm chart if the configuration is not specific to the standalone setup. Differences can be observed in the configuration options section.

Node affinity

Since HA Memgraph deploys multiple pods, you can control how they are distributed in the cluster.

The Memgraph HA Helm chart provides the following node affinity options:

  • default: Default affinity will try to schedule the data pods and coordinator pods on the nodes where there is no other pod with the same role. If there is no such node, the pods will still be scheduled on the same node, and deployment will not fail.
  • unique: This is achieved with the memgraph.affinity.unique set to true. This option will try to deploy the data pods and coordinator pods on different nodes in the cluster so that each pod is on a unique node. If there are no sufficient nodes, this deployment will fail.
  • parity: This is achieved with the memgraph.affinity.parity set to true. This option will try to deploy the data pods and coordinator pods on the same node with maximum one coordinator and one data pod on the node. If there are no sufficient nodes to deploy the pods, this deployment will fail. Coordinators get scheduled first. After that, data pods are looking for the nodes with coordinators.
  • nodeSelection: This is achieved with the memgraph.affinity.nodeSelection set to true. This option will try to deploy the data pods and coordinator pods on the nodes with specific labels. You can set the labels with the memgraph.affinity.dataNodeLabelValue and memgraph.affinity.coordinatorNodeLabelValue parameters. If all the nodes with labels are occupied by the pods with the same role, the deployment will fail.

During the usage of nodeSelection affinity, make sure that the nodes are properly labeled, the default key for the role label is role, and default values are data-node and coordinator-node. Labels can be added to nodes using the kubectl label nodes <node-name> <key>=<value> command. Here is an example of how to deploy Memgraph HA cluster in AKS.

Installing the Memgraph HA Helm Chart

To include Memgraph HA cluster as a part of your Kubernetes cluster, you need to add the repository and install Memgraph.

The steps below will work in the Minikube environment, but you can also use them in other Kubernetes environments with minor adjustments.

To test affinity, you can use minikube in multi-node mode.

Add the repository

Add the Memgraph Helm chart repository to your local Helm setup by running the following command:

helm repo add memgraph https://memgraph.github.io/helm-charts

Make sure to update the repository to fetch the latest Helm charts available:

helm repo update

Install Memgraph HA

Since Memgraph HA requires an Enterprise license, you need to provide the license and organization name during the installation.

helm install <release-name> memgraph/memgraph-high-availability --set memgraph.env.MEMGRAPH_ENTERPRISE_LICENSE=<your-license>,memgraph.env.MEMGRAPH_ORGANIZATION_NAME=<your-organization-name>

Replace <release-name> with a name of your choice for the release and set the Enterprise license.

Changing the default chart values

To change the default chart values, run the command with the specified set of flags:

helm install <resource-name> memgraph/memgraph-high-availability --set <flag1>=<value1>,<flag2>=<value2>,...

Or you can modify a values.yaml file and override the desired values:

helm install <resource-name> memgraph/memgraph-high-availability -f values.yaml

Configuration options

The following table lists the configurable parameters of the Memgraph chart and their default values.

ParameterDescriptionDefault
memgraph.image.repositoryMemgraph Docker image repositorymemgraph/memgraph
memgraph.image.tagSpecific tag for the Memgraph Docker image. Overrides the image tag whose default is chart version.2.22.0
memgraph.image.pullPolicyImage pull policyIfNotPresent
memgraph.env.MEMGRAPH_ENTERPRISE_LICENSEMemgraph enterprise license<your-license>
memgraph.env.MEMGRAPH_ORGANIZATION_NAMEOrganization name<your-organization-name>
memgraph.probes.startup.failureThresholdStartup probe failure threshold30
memgraph.probes.startup.periodSecondsStartup probe period in seconds10
memgraph.probes.readiness.initialDelaySecondsReadiness probe initial delay in seconds5
memgraph.probes.readiness.periodSecondsReadiness probe period in seconds5
memgraph.probes.liveness.initialDelaySecondsLiveness probe initial delay in seconds30
memgraph.probes.liveness.periodSecondsLiveness probe period in seconds10
memgraph.data.volumeClaim.storagePVCEnable storage PVCtrue
memgraph.data.volumeClaim.storagePVCSizeSize of the storage PVC1Gi
memgraph.data.volumeClaim.logPVCEnable log PVCfalse
memgraph.data.volumeClaim.logPVCSizeSize of the log PVC256Mi
memgraph.coordinators.volumeClaim.storagePVCEnable storage PVC for coordinatorstrue
memgraph.coordinators.volumeClaim.storagePVCSizeSize of the storage PVC for coordinators1Gi
memgraph.coordinators.volumeClaim.logPVCEnable log PVC for coordinatorsfalse
memgraph.coordinators.volumeClaim.logPVCSizeSize of the log PVC for coordinators256Mi
memgraph.externalAccess.coordinator.serviceTypeNodePort, CommonLoadBalancer or LoadBalancer. Use LoadBalancer for Cloud production deployment and NodePort for local testing. ‘CommonLoadBalancer’ will open one load balancer for all coordinators while ‘LoadBalancer’ will open one load balancer for each coordinators.NodePort
memgraph.externalAccess.dataInstance.serviceTypeNodePort or LoadBalancer. Use LoadBalancer for Cloud production deployment and NodePort for local testing.NodePort
memgraph.ports.boltPortBolt port used on coordinator and data instances.7687
memgraph.ports.managementPortManagement port used on coordinator and data instances.10000
memgraph.ports.replicationPortReplication port used on data instances.20000
memgraph.ports.coordinatorPortCoordinator port used on coordinators.12000
memgraph.affinity.uniqueSchedule pods on different nodes in the clusterfalse
memgraph.affinity.paritySchedule pods on the same node with maximum one coordinator and one data nodefalse
memgraph.affinity.nodeSelectionSchedule pods on nodes with specific labelsfalse
memgraph.affinity.roleLabelKeyLabel key for node selectionrole
memgraph.affinity.dataNodeLabelValueLabel value for data nodesdata-node
memgraph.affinity.coordinatorNodeLabelValueLabel value for coordinator nodescoordinator-node
dataConfiguration for data instancesSee data section
coordinatorsConfiguration for coordinator instancesSee coordinators section

For the data and coordinators sections, each item in the list has the following parameters:

ParameterDescriptionDefault
idID of the instance0 for data, 1 for coordinators
argsList of arguments for the instanceSee args section

The args section contains a list of arguments for the instance. The default values are the same for all instances:

- "--also-log-to-stderr"
- "--log-level=TRACE"
- "--replication-restore-state-on-startup=true"

For all available database settings, refer to the configuration settings docs.

Helm chart for Memgraph Lab

A Helm chart for deploying Memgraph Lab on Kubernetes.

Installing the Memgraph Lab Helm chart

To install the Memgraph Lab Helm chart, follow the steps below:

helm install <release-name> memgraph/memgraph-lab

Replace <release-name> with a name of your choice for the release.

Changing the default chart values

To change the default chart values, run the command with the specified set of flags:

helm install <resource-name> memgraph/memgraph-lab --set <flag1>=<value1>,<flag2>=<value2>,...

Or you can modify a values.yaml file and override the desired values:

helm install <resource-name> memgraph/memgraph-lab -f values.yaml

Configuration options

The following table lists the configurable parameters of the Memgraph Lab chart and their default values.

ParameterDescriptionDefault
image.repositoryMemgraph Lab Docker image repositorymemgraph/memgraph-lab
image.tagSpecific tag for the Memgraph Lab Docker image. Overrides the image tag whose default is chart version."" (Defaults to chart’s app version)
image.pullPolicyImage pull policyIfNotPresent
replicaCountNumber of Memgraph Lab instances to run.1
service.typeKubernetes service typeClusterIP
service.portKubernetes service port3000
service.targetPortKubernetes service target port3000
service.protocolProtocol used by the serviceTCP
service.annotationsAnnotations to add to the service{}
podAnnotationsAnnotations to add to the pod{}
resourcesCPU/Memory resource requests/limits. Left empty by default.{} (See note on uncommenting)
serviceAccount.createSpecifies whether a service account should be createdtrue
serviceAccount.annotationsAnnotations to add to the service account{}
serviceAccount.nameThe name of the service account to use. If not set and create is true, a name is generated.""

Memgraph Lab can be further configured with environment variables in your values.yaml file.

env:
  - name: QUICK_CONNECT_MG_HOST
    value: memgraph
  - name: QUICK_CONNECT_MG_PORT
    value: "7687"
  - name: KEEP_ALIVE_TIMEOUT_MS
    value: 65000

In case you added Nginx Ingress service or web server for a reverse proxy, update the following proxy timeout annotations to avoid potential timeouts:

proxy_read_timeout X;
proxy_connect_timeout X;
proxy_send_timeout X;

where X is the number of seconds the connection (request query) can be alive. Additionally, update the Memgraph Lab KEEP_ALIVE_TIMEOUT_MS environment variable to a higher value to ensure that Memgraph Lab stays connected to Memgraph when running queries over 65 seconds.

Refer to the Memgraph Lab documentation for details on how to connect to and interact with Memgraph.