Skip to main content

TapData Community

Applicable EditionsTapData CommunityTapData Community is an open-source data integration platform that provides basic data synchronization and transformation capabilities. This helps you quickly explore and implement data integration projects. As your project or business grows, you can seamlessly upgrade to TapData Cloud or TapData Enterprise to access more advanced features and service support.

TapData Community is an open-source real-time data platform that facilitates data synchronization and transformation. This guide demonstrates how to quickly install and start TapData Community.

Prerequisites

Before you begin, ensure your environment meets the following requirements:

  • Hardware specifications: 8-core CPU (x86 architecture), 16 GB of memory
  • Storage specifications: 100 GB
  • Operating System: CentOS 7+ or Ubuntu 16.04+

Component Overview

TapData Community includes the following main components:

  • Data Connectors: Allow TapData Community to connect to various data sources, such as databases, data warehouses, and message queues.
  • Data Processing Engine: Responsible for performing tasks such as data transformation, cleaning, and processing.
  • Monitoring and Management Interface: Provides an easy-to-use graphical platform for configuring, managing, and monitoring data flows.

Install TapData Community

  1. Ensure Docker is installed and running.

  2. Open a terminal or command line interface and run the following command to pull the latest TapData Docker image:

    docker pull ghcr.io/tapdata/tapdata:latest
  3. Run the following command to start the TapData container:

    docker run -d -p 3030:3030 --restart always --name tapdata --privileged ghcr.io/tapdata/tapdata:latest

    Explanation of parameters:

    • -d: Run the container in the background.
    • -p 3030:3030: Map port 3030 of the container to port 3030 on the host machine, allowing access to TapData through a browser.
    • --name tapdata: Assign a name to your container, in this case, tapdata.
    • --restart always: Automatically start this container when Docker services restart.
    • --privileged: Grants the container elevated permissions to ensure proper functioning of dependencies (e.g., embedded MongoDB). It is recommended to configure permissions more precisely in production environments as needed.
    tip

    By default, TapData Community uses an internal MongoDB to store metadata, task configurations, etc. If you want to use your own MongoDB, specify the MongoDB URI connection string during container startup with the -e parameter, for example: docker run -d -p 3030:3030 --name tapdata -e MONGO_URI='mongodb://root:Tap123456@192.168.1.18:29917/tapdata_community?authSource=admin' --restart always ghcr.io/tapdata/tapdata:latest.

  4. (Optional) Run docker logs -f tapdata to view container startup logs. Key logs after successful startup should indicate:

    <<< Start Server [SUCCESS]
    All Done, Please Visit http://localhost:3030
  5. Access the TapData platform via a browser at http://localhost:3030. The default login is admin@admin.com with the password admin. Promptly change your password after the first login to ensure security.

    tip

    To access TapData services from other devices on the same network, ensure the network is interconnected.

Next Steps

Connect to a Database