Curated by the Data Stack Community

The database of AI-first tools for data engineers

Search across ingestion, processing, orchestration, observability, and governance platforms to assemble an intelligent data pipeline.

AI data engineering stack

50+ copilots across warehouses, ETL, observability, and cataloging.

60 tools shown

| # | Tool | What it's good for | AI highlights | Category |
|---|------|--------------------|---------------|----------|
| 1 | ❄️ | SQL, ML, document processing | AI SQL, doc processing, vector search, NL querying | Warehouse / AI ETL |
| 2 | 🤖 | Developer experience | Code generation, pipeline suggestions, auto-SQL | Warehouse |
| 3 | 📊 | Analytics and SQL | Natural language to SQL, model-assisted optimizations | Warehouse |
| 4 | 🔥 | ETL, notebooks, SQL | AI SQL, notebook assistants, code generation | Unified analytics |
| 5 | 🦆 | Analytics on DuckDB | AI SQL assistant, insights | Warehouse / Processing |
| 6 | 🚀 | SQL, analytics | Auto SQL explanation and tuning | Warehouse |
| 7 | ☁️ | ETL pipelines, Spark-based jobs | AI code generation, schema inference, test generation | Cloud ETL |
| 8 | | Batch + streaming ETL | AI job explanations and pipeline optimization | Cloud ETL |
| 9 | 🛰️ | ETL pipeline design | Natural language pipeline generation, code suggestions | Cloud ETL |
| 10 | | Traditional ETL + governance | Smart mappings and rule suggestions | ETL |
| 11 | ♟️ | Enterprise ETL | Semantic matching, metadata enrichment | ETL |
| 12 | 🧱 | Cloud ETL UI | AI pipeline generation, smart suggestions | Cloud ETL |
| 13 | 🧩 | No-code ETL | AI mapping plus anomaly detection | No-code ETL |
| 14 | ☁️ | No-code ETL | AI-based transformations and mapping | No-code / ETL |
| 15 | ✨ | SQL transformations | Model generation, test suggestions | Transformation |
| 16 | 🔮 | SQL + notebooks | AI SQL and insight generation | Transformation |
| 17 | 🦆 | DuckDB analytics | NL to SQL and automated analysis | Transformation |
| 18 | 📈 | Notebook/SQL BI | Generate SQL and narrative analysis | Transformation |
| 19 | | Collaborative notebooks | Write SQL/Python with AI | Transformation |
| 20 | | Data modeling | AI model inference and linting | Transformation |
| 21 | 🔢 | Automated ETL/ELT | Generate pipelines from natural language | Transformation |
| 22 | 🛫 | SQL generation | Natural language to SQL via LLMs | Transformation |
| 23 | 🌊 | Ingestion connectors | Generate connectors from plain language | Ingestion |
| 24 | 🔌 | Managed ingestion | AI-mapped schemas and transforms | Ingestion |
| 25 | 🛠️ | Open-source ingestion | AI code generation for connectors | Ingestion |
| 26 | 🎯 | Reverse ETL | NL transform logic and AI audience building | Integration |
| 27 | 🗂️ | Reverse ETL | AI-driven transformation logic | Integration |
| 28 | 🧵 | Lightweight ingestion | Smart mapping suggestions | Ingestion |
| 29 | 🛰️ | Data observability | AI anomaly detection and root-cause | Observability |
| 30 | | Data testing | AI-generated data quality rules | Data quality |
| 31 | 🥤 | Data quality | AI rule generation and NL checks | Data quality |
| 32 | 👁️ | Data observability | AI baselines and anomaly detection | Observability |
| 33 | ⚠️ | AI-native data quality | Automated drift and completeness checks | Data quality |
| 34 | 💓 | Data reliability | AI anomaly detection | Observability |
| 35 | 📡 | Pipeline observability | AI pipeline anomaly detection | Observability |
| 36 | 🛡️ | Data monitoring | Lineage-based anomaly predictions | Observability |
| 37 | 📚 | Cataloging | Auto-tagging and natural language search | Catalog |
| 38 | 🏛️ | Governance + catalog | Semantic classification with NL search | Catalog |
| 39 | | Active metadata catalog | AI lineage and documentation | Catalog |
| 40 | 🧩 | Open-source catalog | AI tag inference | Catalog |
| 41 | 🧭 | Open-source metadata catalog | Semantic suggestions | Catalog |
| 42 | ⭐ | Usage analytics catalog | AI column naming and documentation | Catalog |
| 43 | ⚡ | Orchestration | AI-generated flows | Orchestration |
| 44 | 🕸️ | Orchestration | Code and graph generation | Orchestration |
| 45 | 🪄 | AI-native pipeline tool | Autogenerated ETL pipelines | Orchestration |
| 46 | 🛩️ | Workflow automation | Generate DAGs from natural language | Orchestration |
| 47 | 📄 | Document to structured ETL | AI extraction, OCR, classification | AI extraction |
| 48 | 🧠 | Enterprise extraction | OCR plus entity extraction | AI extraction |
| 49 | 📑 | Document extraction | Entity extraction and summarization | AI extraction |
| 50 | 📃 | OCR + structure extraction | NLP entity extraction | AI extraction |
| 51 | 🦙 | Parsing PDFs for RAG | AI structuring and tables | AI extraction |
| 52 | 📤 | AI-based structuring | Transform unstructured text into structured data | AI extraction |
| 53 | 📚 | Index building + ETL | Auto-chunking and schema extraction | RAG / ETL |
| 54 | 🔗 | ETL for LLM pipelines | AI extraction, structuring, loaders | RAG / ETL |
| 55 | 🧠 | Turn documents into embeddings | Automatic chunking and metadata ETL | RAG / ETL |
| 56 | 🌲 | Vector ingestion | Managed embedding pipelines | RAG / ETL |
| 57 | 🌀 | Vector DB ingestion | Unstructured to vector pipelines | RAG / ETL |
| 58 | 🕸️ | Vector DB ingestion | Auto-schema inference | RAG / ETL |
| 59 | 🟥 | Embedding pipelines | AI-based structuring | RAG / ETL |
| 60 | 💠 | Vector ingestion | Smart metadata extraction | RAG / ETL |