Talk

Micromonoliths: designing Python data pipelines that scale with the team

Saturday, May 30

14:40 - 15:25
RoomPassatelli
LanguageEnglish
Audience levelAdvanced
Elevator pitch

How can you quickly get a growing team up to speed when data and AI pipelines become complex? In this talk, I will present the micromonolith architecture: a hybrid architectural approach designed to evolve code and teams together, even in rapidly changing cloud and LLM environments.

Abstract

The debate between monoliths and microservices often frames scalability as a simple matter of performance. In reality, it extends to the challenge of growing code and teams together. In complex data and AI pipelines, orchestration, serverless components, and external foundation models—rapidly changing and subject to strict operational constraints—make it increasingly difficult to maintain a balance between maintainability, performance, and fast project onboarding.

In this talk, I will present an architectural approach that I have refined over time, which I call the micromonolith architecture. It is a model explicitly designed for the Python ecosystem that combines: • the cohesion, maintainability, and ease of onboarding typical of a monolith • the scalability and isolation of microservices

The architecture is built around serverless components common in data pipelines, clear development processes, and a standardized design of the Python modules in the repository, achieved through: • standardized scaffolding • minimal and functional flow documentation • systematic use of the Factory, Strategy, and Singleton patterns • standardized data exchange via a data lake • a workflow orchestrator (such as AWS Step Functions)

This approach enables teams to: • collaborate effectively, reducing conflicts and maximizing parallel work • keep core logic in a single repository per data pipeline, preserving the monolithic development experience • accelerate onboarding and support team growth • facilitate developers moving between different data pipelines • faithfully reproduce cloud behavior locally

This architectural paradigm helps teams grow, accelerates learning, and maintains agility even as pipelines become more complex and the technological landscape evolves rapidly.

TagsScaling, ML and AI, Teams management
Participant

Raffaele Bongo

Oggi ricopro il ruolo di Senior Machine Learning Engineer in Talentware, sviluppando la prima piattaforma di talent management skills based in Italia.

Sono appassionato di framework e metodologie di lavoro volti a integrare il pensiero AI nella cultura organizzativa e a favorire una collaborazione fluida e scalabile tra team AI e le altre funzioni aziendali.

Prima di Talentware ho lavorato per 3 startup in fasi diverse — dal pre-seed al Series A — per un’azienda di consulenza e un laboratorio di ricerca universitario. Negli anni ho insegnato ai droni a volare in modo autonomo, sviluppato agenti cooperativi per vincere partite di poker e aiutato aziende a prendere decisioni operative e strategiche migliori attraverso i dati. Amo giocare a Padel, sono fanatico dei podcast del Post, di serie tv scify e non resisto all’occasione di provare sapori nuovi e “lontani”.