Skip to content

Files

This branch is 2949 commits behind datahub-project/datahub:master.

docker

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
Aug 25, 2023
Sep 25, 2022
Apr 26, 2022
May 1, 2023
Aug 31, 2023
Sep 6, 2023
Sep 12, 2023
Sep 12, 2023
Jun 7, 2023
Jun 7, 2023
Sep 6, 2023
Aug 17, 2023
Feb 1, 2023
Sep 28, 2021
May 1, 2023
Aug 25, 2023
Jan 12, 2021
Aug 6, 2020
Dec 2, 2022
May 8, 2023
Aug 17, 2023
Jun 23, 2023
Dec 14, 2021
Sep 6, 2023
Sep 6, 2023
Aug 17, 2023
May 1, 2023
Jan 24, 2023
Aug 6, 2020
May 23, 2023
Aug 22, 2023
Sep 14, 2022
Feb 1, 2023
Sep 14, 2022
Aug 17, 2023
Jan 24, 2023
Aug 4, 2023
Sep 6, 2023
Sep 6, 2023
May 8, 2023
May 8, 2023
May 8, 2023
Jun 21, 2023
May 8, 2023
Jan 24, 2023
Jun 13, 2023
Jun 13, 2023
Aug 17, 2023
Apr 26, 2021
Sep 14, 2022
title hide_title
Deploying with Docker
true

Docker Images

Prerequisites

You need to install docker and docker-compose (if using Linux; on Windows and Mac compose is included with Docker Desktop).

Make sure to allocate enough hardware resources for Docker engine. Tested & confirmed config: 2 CPUs, 8GB RAM, 2GB Swap area.

Quickstart

The easiest way to bring up and test DataHub is using DataHub Docker images which are continuously deployed to Docker Hub with every commit to repository.

You can easily download and run all these images and their dependencies with our quick start guide.

DataHub Docker Images:

Do not use latest or debug tags for any of the image as those are not supported and present only due to legacy reasons. Please use head or tags specific for versions like v0.8.40. For production we recommend using version specific tags not head.

Dependencies:

Ingesting demo data.

If you want to test ingesting some data once DataHub is up, use the ./docker/ingestion/ingestion.sh script or datahub docker ingest-sample-data. See the quickstart guide for more details.

Using Docker Images During Development

See Using Docker Images During Development.

Building And Deploying Docker Images

We use GitHub Actions to build and continuously deploy our images. There should be no need to do this manually; a successful release on Github will automatically publish the images.

Building images

This is not our recommended development flow and most developers should be following the Using Docker Images During Development guide.

To build the full images (that we are going to publish), you need to run the following:

COMPOSE_DOCKER_CLI_BUILD=1 DOCKER_BUILDKIT=1 docker-compose -p datahub build

This is because we're relying on builtkit for multistage builds. It does not hurt also set DATAHUB_VERSION to something unique.

Community Built Images

As the open source project grows, community members would like to contribute additions to the docker images. Not all contributions to the images can be accepted because those changes are not useful for all community members, it will increase build times, add dependencies and possible security vulns. In those cases this section can be used to point to Dockerfiles hosted by the community which build on top of the images published by the DataHub core team along with any container registry links where the result of those images are maintained.