ML Pipeline Orchestrator Agent

Pipeline successfully executed page
Final confirmation of Job page (Successfully completed)
Pipeline running page
Creating a new issue page
Final report page - 1
Initial landing output page (Process has been started)
Issues history page
Agents running page (one after another)
Agents running page ( one after another)
Final report page - 2
Agents running page (one after another)
Official GitLab repo code base
VS code , code deployment/development page
Job description page on executing pipeline

Inspiration

Building ML pipelines manually takes 2-3 days. We wanted to automate the entire process with autonomous agents.

What it does

Transforms a GitLab issue into a production-ready ML pipeline in 2 minutes. File an issue with your dataset, and 5 autonomous agents orchestrate the entire pipeline creation — from data analysis to model training to results reporting.

How we built it

Trigger Agent: Reads GitLab issues and extracts task specifications
Dataset Analyst: Profiles CSV datasets and identifies key characteristics
Strategist Agent: Claude Sonnet 4.6 reasons about optimal ML strategy
Code Generator: Generates 8-file production ML pipeline (preprocess, train, evaluate, etc.)
Reporter Agent: Trains model, evaluates performance, posts results to GitLab

Challenges we ran into

Multi-agent orchestration and state management
Ensuring Claude generates syntactically correct production code
Integrating with GitLab CI/CD pipelines automatically
Handling edge cases in dataset profiling

Accomplishments that we're proud of

✅ 5 autonomous agents working seamlessly together ✅ 131 unit tests, all passing ✅ Real ML models trained (F1=0.7616, ROC-AUC=0.8454) ✅ Production-grade code generation ✅ SHAP feature importance analysis ✅ Automatic CI/CD integration ✅ Completed in 3-4 days

What we learned

Multi-agent systems require careful state management
Claude's reasoning is powerful for ML domain knowledge
GitLab's agent platform enables complex integrations
SHAP provides excellent model interpretability
Production ML requires robust error handling

What's next for ML Pipeline Orchestrator Agent

Support for deep learning (TensorFlow/PyTorch)
Model monitoring and drift detection
A/B testing framework
Automated model registry integration
Multi-class and regression task support

Built With

claude-sonnet-4.6
docker
dotenv
gitlab-ci/cd
gitlab-duo-agent-platform
pytest
python
scikit-learn
shap
xgboost

Submitted to

GitLab AI Hackathon

Created by

I designed and built the entire ML Pipeline Orchestrator Agent from scratch during this hackathon. My contributions include:

1. **Multi-Agent Orchestration** — Architected 5 autonomous agents (Trigger, Dataset Analyst, Strategist, Code Generator, Reporter) that coordinate seamlessly to build complete ML pipelines.

2. **Claude Integration** — Integrated Claude Sonnet 4.6 as the core reasoning engine for ML strategy design and production code generation.

3. **GitLab Duo Platform Integration** — Built the system on GitLab Duo Agent Platform, enabling real-time issue reading, automatic branch creation, MR generation, and CI/CD pipeline triggering.

4. **Full ML Pipeline Generation** — Implemented automatic generation of 8-file production ML pipelines including data preprocessing, model training, evaluation, and SHAP analysis.

5. **Production-Grade Quality** — Wrote and tested 131 unit tests (all passing), ensuring reliability and correctness of generated code.

6. **Real Results** — Achieved F1=0.7616 and ROC-AUC=0.8454 on the churn prediction dataset, proving the system works end-to-end.

This is a complete, working system built in 3-4 days that demonstrates advanced multi-agent orchestration, LLM reasoning, and production ML automation.

Yashas S

Updates

Yashas S posted an update — Mar 21, 2026 03:32 AM EDT

Submission Complete!

The ML Pipeline Orchestrator Agent is ready for judging.

Key achievements: 5 autonomous agents fully functional 131 unit tests passing Real ML models trained (F1=0.7616, ROC-AUC=0.8454) Production-grade code generation Full GitLab + Claude Sonnet 4.6 integration Demo video uploaded

The system autonomously transforms a GitLab issue into a complete, trained ML pipeline in ~2 minutes with zero manual code.

Repository: https://gitlab.com/gitlab-ai-hackathon/participants/35358782 Video: https://youtu.be/k5M0RWr_j5c

Log in or sign up for Devpost to join the conversation.

Yashas S started this project — Mar 21, 2026 03:32 AM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.