Flowfile

An open-source data platform with a visual pipeline builder, data catalog, Delta Lake storage, scheduling, Kafka ingestion, sandboxed Python execution, and a Polars-compatible API — all in a single pip install.

Flowfile Interface

Why Flowfile?

A complete data platform — visual pipeline builder, catalog, scheduling, and more — powered by Polars

🎨

Visual Pipeline Builder

30+ nodes for joins, filters, aggregations, fuzzy matching, pivots, and more. See your data flow in real-time with instant previews at each step.

📚

Data Catalog & Delta Lake

Every table stored as Delta Lake with version history, time travel, and merge/upsert support. Track lineage, runs, and artifacts in one place.

Scheduling & Triggers

Run flows on intervals or trigger them when catalog tables update. Built into the catalog — not a separate orchestration tool.

📡

Kafka & Cloud Storage

Ingest from Kafka/Redpanda as a canvas node. Read and write to S3, Azure Data Lake, and GCS. Connect to PostgreSQL, MySQL, and more.

🐍

Sandboxed Python

Run arbitrary Python code in isolated Docker containers. Use any library — matplotlib, scikit-learn, or your own — output flows back into the pipeline.

🔄

Code Generation & Python API

Export visual flows as Python/Polars scripts. Or build pipelines programmatically with a Polars-compatible API and visualize them on the canvas.

Build Pipelines Your Way

Use the visual designer for exploration or write code for automation - seamlessly switch between both

📊 Visual Approach

  • Drag and drop nodes to build workflows
  • Configure transformations with intuitive forms
  • Preview data at each step instantly
  • Export your pipeline as Python code
  • Perfect for exploration and prototyping

🐍 Code Approach

import flowfile as ff

df = ff.read_csv("data.csv")
result = df.filter(
    ff.col("amount") > 1000
).group_by("region").agg(
    ff.col("amount").sum()
)

# Visualize your pipeline
ff.open_graph_in_editor(result.flow_graph)

See It For Yourself

Start rediscovering how we bridge the gap between business users and technical users.

5 min
to first pipeline
100%
Familiar API
0
vendor lock-in
Try Interactive Tutorial Browse Examples

Free, open-source and customizable