Data Pipelines

Architecture diagrams for data engineering and analytics pipelines.

Real-Time Streaming

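Kafka-based streaming pipeline: web, mobile, and IoT events pass through a collector into Kafka topics, are processed by Flink and ksqlDB, and land in S3, Elasticsearch, and Redis for the dashboard, alerting, and ML consumers.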

@arch
#!ext icons

[styles]
  stream: color:#F59E0B|width:2
  process: color:#3B82F6|width:2
  store: color:#10B981

[nodes]
  [Sources]
    web:Web Events|icon:cloud
    mobile:Mobile Events|icon:cloud
    iot:IoT Sensors|icon:server

  [Collection]
    collector:Event Collector|icon:server

  [Streaming Platform]
    kafka:Kafka Cluster|icon:aws-amazon-managed-streaming-for-apache-kafka

    [Topics]
      raw:raw-events|cyl
      enriched:enriched-events|cyl
      alerts:alerts|cyl

  [Stream Processing]
    flink:Apache Flink|icon:server
    ksql:ksqlDB|icon:server

  [Storage]
    s3:Data Lake (S3)|icon:aws-s3
    elastic:Elasticsearch|icon:database
    redis:Redis|icon:database

  [Consumers]
    dashboard:Real-time Dashboard|icon:cloud
    alerting:Alert System|icon:server
    ml:ML Pipeline|icon:server

[edges]
  web->collector|style:stream
  mobile->collector|style:stream
  iot->collector|style:stream
  collector->raw|style:stream
  raw->flink|style:process
  flink->enriched|style:stream
  flink->alerts|style:stream
  enriched->ksql|style:process
  ksql->s3|style:store
  ksql->elastic|style:store
  enriched->redis|style:store
  redis->dashboard|style:stream
  alerts->alerting|style:stream
  s3->ml|style:store
Rendered diagram
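
To check how data flows through the diagram above, the edge list can be traced programmatically. The following is a minimal Python sketch, not part of the diagram tooling: `EDGES` is copied from the `[edges]` block with the styles dropped, and `build_graph` and `find_path` are hypothetical helper names introduced here.

```python
from collections import deque

# Edge list copied from the [edges] block above (style attributes omitted).
EDGES = [
    ("web", "collector"), ("mobile", "collector"), ("iot", "collector"),
    ("collector", "raw"), ("raw", "flink"),
    ("flink", "enriched"), ("flink", "alerts"),
    ("enriched", "ksql"), ("ksql", "s3"), ("ksql", "elastic"),
    ("enriched", "redis"), ("redis", "dashboard"),
    ("alerts", "alerting"), ("s3", "ml"),
]


def build_graph(edges):
    """Turn (source, target) pairs into an adjacency mapping."""
    graph = {}
    for src, dst in edges:
        graph.setdefault(src, []).append(dst)
        graph.setdefault(dst, [])  # make sure sink nodes are present too
    return graph


def find_path(graph, start, goal):
    """Breadth-first search; returns a list of node ids, or None if unreachable."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        node = path[-1]
        if node == goal:
            return path
        for nxt in graph[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None


GRAPH = build_graph(EDGES)

if __name__ == "__main__":
    # Show the route a web event takes to each consumer.
    for consumer in ("dashboard", "alerting", "ml"):
        print(consumer, "<-", " -> ".join(find_path(GRAPH, "web", consumer)))
```

Tracing the routes this way shows that the dashboard is fed from `enriched` through `redis`, while the ML pipeline is fed through `ksql` and `s3`. The `kafka` node has no edges in the diagram source; the topics carry the flow.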

ETL Pipeline

Classic batch ETL with Airflow orchestration.

Rendered diagram
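
The extract, transform, load ordering that an orchestrator enforces can be sketched in plain Python. This is an illustration under assumptions, not Airflow code: the task names, the sample rows, and the in-memory `warehouse` dict are hypothetical, and the standard-library `graphlib.TopologicalSorter` stands in for the scheduler's dependency resolution.

```python
from graphlib import TopologicalSorter  # Python 3.9+


def extract():
    # Hypothetical source rows; a real job would read from a database or files.
    return [
        {"user": "a", "amount": "10.5"},
        {"user": "b", "amount": "oops"},  # malformed on purpose
        {"user": "a", "amount": "4.5"},
    ]


def transform(rows):
    # Parse amounts and drop rows that cannot be parsed.
    clean = []
    for row in rows:
        try:
            clean.append({"user": row["user"], "amount": float(row["amount"])})
        except ValueError:
            continue
    return clean


def load(rows, warehouse):
    # Accumulate totals per user into the (assumed) warehouse table.
    for row in rows:
        warehouse[row["user"]] = warehouse.get(row["user"], 0.0) + row["amount"]
    return warehouse


def run_batch(warehouse):
    """Run the three tasks in dependency order and return that order."""
    # Each task maps to the set of tasks that must finish before it.
    dependencies = {"extract": set(), "transform": {"extract"}, "load": {"transform"}}
    steps = {
        "extract": lambda done: extract(),
        "transform": lambda done: transform(done["extract"]),
        "load": lambda done: load(done["transform"], warehouse),
    }
    done = {}
    order = list(TopologicalSorter(dependencies).static_order())
    for name in order:
        done[name] = steps[name](done)
    return order


if __name__ == "__main__":
    table = {}
    print(run_batch(table), table)
```

In an Airflow deployment the same dependencies would be declared between tasks in a DAG and the scheduler would decide when each one runs; the sketch only shows the ordering idea.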

ML Feature Pipeline

Feature engineering for machine learning.

Rendered diagram
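
As a rough illustration of feature engineering, the sketch below turns raw events into one feature row per user. The event fields (`user`, `amount`) and the chosen features (event count, total, and mean amount) are assumptions made for the example, not taken from the diagram.

```python
from collections import defaultdict


def build_features(events):
    """Aggregate raw events into per-user feature rows (hypothetical schema)."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for event in events:
        totals[event["user"]] += event["amount"]
        counts[event["user"]] += 1
    return {
        user: {
            "event_count": counts[user],
            "total_amount": totals[user],
            "mean_amount": totals[user] / counts[user],
        }
        for user in counts
    }


if __name__ == "__main__":
    sample = [
        {"user": "a", "amount": 10.0},
        {"user": "a", "amount": 20.0},
        {"user": "b", "amount": 5.0},
    ]
    print(build_features(sample))
```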

CDC Pipeline

Change Data Capture with Debezium.

Rendered diagram
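
On the consuming side, a CDC pipeline replays change events against a target store. The sketch below applies simplified Debezium-style events to an in-memory replica. It relies on the `op`, `before`, and `after` fields of the Debezium change event envelope (`c` create, `u` update, `d` delete, `r` snapshot read); the `id` primary key, the sample rows, and the `apply_change` helper are assumptions, and real events carry additional metadata that is omitted here.

```python
def apply_change(replica, event, key="id"):
    """Apply one simplified change event to a dict keyed by primary key."""
    op = event["op"]
    if op in ("c", "r", "u"):
        # Create, snapshot read, and update all carry the new row in "after".
        row = event["after"]
        replica[row[key]] = row
    elif op == "d":
        # Delete carries the old row in "before"; "after" is None.
        replica.pop(event["before"][key], None)
    else:
        raise ValueError(f"unsupported op: {op!r}")
    return replica


if __name__ == "__main__":
    events = [
        {"op": "c", "before": None, "after": {"id": 1, "email": "a@example.com"}},
        {"op": "c", "before": None, "after": {"id": 2, "email": "b@example.com"}},
        {"op": "u", "before": {"id": 1, "email": "a@example.com"},
         "after": {"id": 1, "email": "a2@example.com"}},
        {"op": "d", "before": {"id": 2, "email": "b@example.com"}, "after": None},
    ]
    replica = {}
    for event in events:
        apply_change(replica, event)
    print(replica)
```

Because each event is applied by key, replaying the same ordered stream into an empty replica reproduces the same final state.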

More Data Pipeline Examples

See the example files in the repository:

  • data-platform.tdl - Complete analytics platform
