Available for New Missions
GUILLAUME Rakotonjanahary Tsantaniaina

AI/ML Engineer and Cloud Architect with 5+ years engineering production-grade intelligent systems for international clients across healthcare, fintech, and e-commerce. Fluent in the full AI stack — from Speech AI and LLM fine-tuning to GPU cloud deployment and MLOps — delivering measurable business outcomes from Antananarivo to the world.


Engineering Intelligence
at Every Layer

From GPU-powered speech models to autonomous multi-agent systems, I architect and ship AI that generates real business value.

Speech AI & NLP

End-to-end speech pipelines combining ASR fine-tuning, neural machine translation, and voice cloning. Production systems processing 6 languages with sub-2s latency on GPU cloud.

ASR / TTS · Neural MT · LoRA Fine-tuning · CTranslate2

LLM & Multi-Agent Systems

RAG architectures (vector, graph, hybrid, agentic), multi-agent orchestration with LangGraph and Google ADK, and RLHF-optimized pipelines that automate complex enterprise workflows.

RAG · LangGraph · Google ADK · PEFT / LoRA

Cloud Architecture & MLOps

Scalable cloud-native platforms on GCP and AWS. Vertex AI Pipelines, Cloud Run GPU, BigQuery data warehouses, and full CI/CD MLOps lifecycles — budget-optimized and production-hardened.

GCP / AWS · Vertex AI · Kubernetes · Terraform

Full-Stack AI Platforms

SaaS platforms built from scratch with Next.js, NestJS, WebRTC real-time features, payment integrations, and AI-powered onboarding — shipped with 750+ tests and full DevOps pipelines.

Next.js 15 · FastAPI · WebRTC · Docker / GKE

35+ Technologies.
One Engineer.

A Decade of Impact
Compressed into Five Years

From USAID health data systems to GPU-powered speech AI and UK fintech — building consequential software across three continents.

Apr 2026 – Present · Maison du Numerique (MGVaovao) · Antananarivo, Madagascar
AI/ML Engineer — Speech AI & Cloud Architecture
  • Architected and deployed a first-of-its-kind real-time Speech-to-Speech translation system converting 6 international languages into 6 Malagasy dialect outputs, running on Cloud Run GPU (NVIDIA L4) at just $42/month total infrastructure cost.
  • Engineered a 4-stage cascade pipeline (Silero VAD → Whisper INT8 ASR → NLLB-200 LoRA NMT → MMS-TTS VITS) achieving chrF++ improvements of +2 to +8 pts over baseline; VRAM footprint of 6.2 GB leaving capacity for 3–5 concurrent WebRTC sessions.
  • Implemented full Vertex AI MLOps lifecycle: Kubeflow DAG pipelines triggered by Pub/Sub, versioned Model Registry with chrF++ and UTMOS metadata, automated drift monitoring, and blue/green GPU deployment.
  • Fine-tuned TTS across 6 Malagasy dialects (Officiel, Merina, Betsileo, Betsimisaraka, Sakalava, Antandroy) using 80–150 samples per dialect in under 2 hours on L4 Spot GPU at $0.84/dialect.
Whisper INT8 · NLLB-200 LoRA · MMS-TTS VITS · Silero VAD · Cloud Run GPU L4 · Vertex AI Pipelines · WebRTC · CTranslate2
Apr 2025 – Present · Mediwyz.com · Remote · Mauritius / East Africa
Tech Lead & AI Full-Stack Engineer
  • Architected from zero a multi-country SaaS digital health marketplace (Next.js 15, NestJS, Prisma, PostgreSQL) connecting patients with 17 healthcare provider categories — doctors, labs, pharmacies, physiotherapists, and more — across Mauritius and East Africa.
  • Delivered AI-powered provider onboarding using VLM-based OCR to extract data from medical licenses and professional credentials automatically and screen them for fraud; built real-time video consultations via WebRTC with Socket.IO signaling.
  • Engineered a configurable workflow engine powering 33+ consultation types with ~310 status steps, automated notifications, and role-based state transitions; integrated MCB Juice mobile money payment gateway.
  • Shipped 750+ automated tests and 40+ API routes; owned SEO strategy, OG image design, and VPS deployment; mentored junior engineers on architecture and code quality.
Next.js 15 · NestJS · WebRTC · Socket.IO · VLM OCR · RAG (LLaMA) · PostgreSQL · Docker
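
The role-based state transitions mentioned for the workflow engine can be pictured as a lookup table keyed by current status and action. What follows is a minimal sketch under stated assumptions, not the Mediwyz implementation: the status names, the roles, and the `advance` helper are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass

# Hypothetical transition table: (current_status, action) -> (next_status, allowed roles).
# Status names and roles are illustrative assumptions, not the platform's real ones.
TRANSITIONS = {
    ("requested", "accept"): ("confirmed", {"provider"}),
    ("requested", "cancel"): ("cancelled", {"patient", "provider"}),
    ("confirmed", "start"): ("in_progress", {"provider"}),
    ("in_progress", "complete"): ("completed", {"provider"}),
}


@dataclass(frozen=True)
class Consultation:
    status: str = "requested"


def advance(consultation: Consultation, action: str, role: str) -> Consultation:
    """Return a new Consultation in the next status, or raise if the move is not allowed."""
    key = (consultation.status, action)
    if key not in TRANSITIONS:
        raise ValueError(f"no transition for {key}")
    next_status, allowed_roles = TRANSITIONS[key]
    if role not in allowed_roles:
        raise PermissionError(f"role {role!r} may not {action!r} from {consultation.status!r}")
    return Consultation(status=next_status)


c = advance(Consultation(), "accept", role="provider")
c = advance(c, "start", role="provider")
print(c.status)  # in_progress
```

Keeping the rules in data rather than in branching code is one plausible way to support many consultation types: each type would only need its own table.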
Dec 2025 – Present · Wikolabs · Antananarivo, Madagascar
Tech Lead — AI & Automation Studio
  • Leading a multi-agent autonomous B2B sales system (Google ADK, Vertex AI, LangGraph) that automates the entire pipeline from cold lead sourcing to deal closing, with RLHF-based outreach optimization and real-time CRM sync.
  • Built a multimodal local service search platform where users describe a need in natural language or upload a photo; benchmarked Gemini 2 Multimodal Embedding vs CLIP, selected Gemini 2 as production backbone with BigQuery Vector Search geolocation clustering.
  • Delivered an intelligent e-commerce product catalogue search with Mobile Money payment integration (Mvola, Orange Money) for seamless in-app purchases directly from search results.
Google ADK · LangGraph · Gemini 2 Embedding · BigQuery Vector · Vertex AI · RLHF · Mvola API
Jan 2026 – Apr 2026 · Vohitra MG (Exponent) · Antananarivo, Madagascar
Generative AI Engineer
  • Built an AI email classification and routing system using VLM-based multi-format attachment analysis (PDF, images, tables) with the Microsoft Graph API; integrated SharePoint extraction and Calendar-based workload distribution.
  • Developed a multi-agent NL interface for facility management using LangGraph, converting natural language to NoSQL queries with a 6-layer RAG context enhancement pipeline and RAGAS-based quality evaluation.
  • Delivered a production RAG compliance chatbot over 100+ documents with hybrid retrieval (vector similarity + BM25 + Reciprocal Rank Fusion + LLM reranking) and parallel PDF extraction via Docling.
Qwen3 VLM · LangGraph · Docling · Elasticsearch · RAGAS · Microsoft Graph API
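
Reciprocal Rank Fusion, named in the hybrid retrieval bullet above, merges several ranked result lists by summing `1 / (k + rank)` for each document. The sketch below illustrates the general technique only; the document IDs, the two input rankings, and the constant `k=60` (a commonly used default) are assumptions, not details of the production chatbot.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one list.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is an assumed, commonly used default.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


vector_hits = ["doc_a", "doc_b", "doc_c"]  # hypothetical vector-similarity order
bm25_hits = ["doc_b", "doc_d", "doc_a"]    # hypothetical BM25 order
fused = reciprocal_rank_fusion([vector_hits, bm25_hits])
print(fused)  # ['doc_b', 'doc_a', 'doc_d', 'doc_c']
```

Because the fusion uses ranks rather than raw scores, the vector and BM25 results need no score normalization before merging; an LLM reranker could then reorder the top of the fused list.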
Aug 2025 – Jan 2026 · eTech CDI (TASKFORCE AI Program) · Antananarivo, Madagascar
Data Scientist (Permanent Contract)
  • Migrated OLTP PostgreSQL to BigQuery star schema with BigQuery ML ARIMA+ forecasting for biometric kit usage prediction; built Looker Studio real-time KPI dashboards via Airbyte CDC replication.
  • Benchmarked SQLCoder-7B-2, GPT-3.5/4, and Claude on architecture, hallucination mitigation, and BigQuery syntax (Spider benchmark); produced multi-cloud deployment guide for enterprise adoption.
  • Built a LangGraph + RAG text-to-SQL BI chatbot and a Technical Support RAG Agent for biometric kit troubleshooting, both serving production users over BigQuery.
  • Conducted comprehensive AWS vs GCP innovation benchmarking (SageMaker vs Vertex AI, Bedrock vs Model Garden, Glue vs Dataflow) with TCO analysis.
BigQuery ML · LangGraph · Gemini 2.5 · dbt · Airbyte CDC · Looker Studio · AWS SageMaker · Terraform
Feb 2025 – Oct 2025 · FinAlchemy Ltd · Remote · United Kingdom
Fullstack AI Engineer & Tech Lead
  • Engineered an AI document processing engine using Vertex AI Gemini Flash achieving >95% first-pass accuracy on pension provider document extraction — eliminating manual data entry for UK pension administrators.
  • Automated provider phone communication with OpenAI Whisper STT and TTS for AI-driven IVR navigation and automated email sequencing with SLA tracking.
  • Implemented a compliance framework covering FCA COBS 9/19, PROD, and Consumer Duty, with real-time breach alerts, immutable audit trails, and automated suitability report generation.
Gemini Flash · OpenAI Whisper · FastAPI · React.js · MongoDB Atlas · Cloud Run · GitHub Actions
Jan 2025 – Aug 2025 · eTech CDI (TASKFORCE AI Program) · Antananarivo, Madagascar
Data Scientist Junior (Permanent Contract)
  • Ran distributed clustering of 105,000+ e-commerce product images using Apache Spark and Google Dataproc, with automatic categorization via Spark ML; presented at AI Event 2024.
  • Deployed a multilingual airport conversational AI assistant on GKE with WhatsApp Business API, real-time translation, BigQuery query analysis, and multilingual NLP for flight services.
  • Fine-tuned YOLO V8 for retail inventory counting with TensorRT optimization; built multi-class detection REST API with real-time inference pipeline.
Apache Spark · Google Dataproc · YOLO V8 · TensorRT · GKE · Gemini 2.5 Pro · WhatsApp API
Dec 2023 – Dec 2024 · ETech Consultant · Antananarivo, Madagascar
Data Scientist Junior (Consultant)
  • Designed a hybrid RAG architecture combining Neo4j knowledge graph, Pinecone vector search, and Elasticsearch for multi-modal document retrieval — validated as Master's Thesis in AI.
  • Built a 105,000-image vectorization ETL pipeline using Google Dataflow and Apache Beam with K-means clustering and cross-modal vector similarity search.
  • Delivered an AI Email Classification Agent for a Swiss industrial client using Microsoft Graph webhooks and LLM-based routing; built a CV Recommendation Chatbot with NER skills extraction (spaCy) and semantic matching.
  • Conducted architectural study of ASR/TTS for Malagasy as a low-resource language — establishing the methodology later applied in full fine-tuning at Maison du Numerique.
LangGraph · Neo4j · Pinecone · Apache Beam · Dataflow · spaCy · Microsoft Graph
Jan 2024 – Aug 2024 · JSI Research & Training Institute (USAID · CHISU Project) · Madagascar · Remote
Data Engineer & Software Engineer, Health Data Systems
  • Designed technical architecture for malaria bulletin data collection modules, enhancing interoperability between the Ministry of Health and DHIS2 systems for the USAID CHISU program.
  • Built an LLM-based NL-to-SQL chatbot for DHIS2 queries with React.js and Chart.js dashboards; collaborated with a multidisciplinary team including Malaria Advisors, Program Officers, and Health Informatics Advisors.
  • Architected and deployed DHIS2 instances for dev and pre-production environments; developed KPIs including WASH, PSERAN 2022-2026, and PARN indicators.
DHIS2 · Java · PostgreSQL · React.js · LLM NL-to-SQL · Linux
Mar 2023 – Sep 2023 · JSI Research & Training Institute (USAID · Self-employed) · Madagascar
Data & Software Engineer — PMI Measure Malaria Program
  • Built a DHIS2 COVID-19 data anomaly detection module (duplicates, outliers, missing values) using React.js and Node.js, maintaining dataset integrity for national public health assessments.
  • Developed a vaccine stock management and geolocation app for USAID PMM, with real-time inventory visualization and healthcare facility navigation using turn-by-turn directions on interactive maps.
  • Built the Ministry of Health's digital document library, integrated with existing Health Information Systems, to improve accessibility and retrieval of health documents; trained civil servants in KoBoToolbox and QGIS across multiple Malagasy regions.
DHIS2 · React.js · React Native · Node.js · QGIS · KoBoToolbox
Jun 2022 – Mar 2023 · JSI Research & Training Institute (USAID · PMI Measure Malaria) · Madagascar
Data & Fullstack Software Engineer (Internship)
  • Supported management of COVID-19 vaccination data systems; facilitated stakeholder engagement workshops at central and peripheral levels.
  • Contributed to data regularization and supervision activities ensuring integrity and compliance; pursued self-directed learning in AI and data science while starting the Master 2 program.
DHIS2 · React.js · Node.js · PostgreSQL · QGIS

More from the Lab

Text-to-SQL
BI Agent Chatbot (Text-to-SQL)

LangGraph and RAG-powered text-to-SQL solution retrieving SQL patterns, DDL, and metadata for multi-query execution over BigQuery. Fine-tuned SQLCoder-7B-2 on BigQuery dialect.

LangGraph · SQLCoder-7B-2 · BigQuery · FastAPI
RAG Architecture
Hybrid RAG — Vector + Knowledge Graphs

Master's Thesis: hybrid RAG system combining Neo4j knowledge graph, Pinecone vector search, and Elasticsearch. Multi-modal document and image extraction from cloud infrastructure.

Neo4j · Pinecone · LangGraph · CrewAI
Multi-Agent
Lead Generation Multi-Agent System

Google ADK and LLM orchestration with multi-source intelligence gathering from social media and company data. Pattern-based investment signal analysis for B2B prospecting.

Google ADK · Vertex AI · LLM · Python
AI Automation
AI Email Classification & Routing

VLM-based multi-format attachment analysis (PDF, images, tables) for a Swiss industrial client. Microsoft Graph webhooks, SharePoint Excel integration, and LLM-based routing and triage.

Qwen3 VLM · Microsoft Graph · LangGraph · SharePoint
RAG / NLP
CV Recommendation Chatbot

NER-based skills extraction with spaCy and NLP semantic matching for candidate-job pairing. Built with Python FastAPI and React.js, including an advanced recruiter interface with candidate ranking.

spaCy NER · Vertex AI · FastAPI · React.js
Computer Vision
Retail Inventory Counting — YOLO V8

YOLO V8 fine-tuned for real-time multi-class retail inventory detection with instance segmentation and TensorRT optimization. REST API with real-time inference pipeline. Presented at AI Event 2024.

YOLO V8 · TensorRT · OpenCV · FastAPI
Data Engineering
Massive Vectorization ETL (105K Images)

Google Dataflow and Apache Beam distributed ETL pipeline vectorizing 105,000 product images into BigQuery. K-means clustering and cross-modal vector similarity search across images and text.

Apache Beam · Dataflow · BigQuery · K-Means
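
The K-means step in the card above groups image vectors around centroids. Here is a minimal, dependency-free sketch of Lloyd's algorithm on toy data, offered as an illustration only: the 2-D points, `k=2`, the iteration count, and the fixed seed are assumptions, and a pipeline at the 105K-image scale would rely on a distributed or library implementation instead.

```python
import random


def kmeans(points, k, iterations=20, seed=0):
    """Plain Lloyd's algorithm on lists of floats; returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return centroids, labels


# Toy 2-D "embeddings"; real image vectors would have hundreds of dimensions.
embeddings = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.2], [5.0, 5.1], [5.2, 4.9], [4.9, 5.0]]
centroids, labels = kmeans(embeddings, k=2)
print(labels)
```

The first three points end up sharing one label and the last three the other; which cluster gets label 0 depends on the random initialization.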
Data Engineering
Real-Time Biometric KPI Platform

PostgreSQL replicated via Airbyte CDC to BigQuery; dbt SQL transformations; Looker Studio real-time dashboards for mobile biometric kit performance monitoring and BigQuery ML ARIMA+ forecasting.

Airbyte CDC · BigQuery · dbt · Looker Studio
BI / AutoML
Automated BI Assistant

LangGraph with memory persistence, Vertex AI and BigQuery integration, SQLCoder fine-tuning on BigQuery dialect, and automatic visualization dashboard generation for business intelligence.

LangGraph · SQLCoder · Vertex AI · BigQuery
Speech AI
Malagasy ASR/TTS Architectural Analysis

Architectural study of ASR and TTS systems for Malagasy as a low-resource language using Chirp (Vertex AI Studio). Established the transformer adaptation methodology later applied in production at Maison du Numerique.

Chirp · Vertex AI · Whisper · Python
AI + Full-Stack
Multilingual Airport AI Assistant

Deployed on GKE with WhatsApp Business API integration. Multilingual NLP with real-time translation for flight status, navigation, and airport services, with BigQuery used for query analysis.

GKE · Gemini 2.5 · WhatsApp API · BigQuery
Multimodal AI
E-commerce Product Search (Text + Image)

RAG-based, no-code configurable chatbot on GCP with image recognition and selectable search modes (image, text, combined). Google Sheets catalog integration with response times under 3 s.

CLIP · Gemini 2.5 · BigQuery Vector · FastAPI
Health Data
COVID-19 Data Quality Pipeline

DHIS2 platform module automating detection of duplicates, missing values, outliers, and non-applicable entries in national COVID-19 health data using Isolation Forest anomaly detection.

DHIS2 · React.js · Node.js · Isolation Forest
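
The three checks named in the card above (duplicates, missing values, outliers) can be sketched in a few lines. This example deliberately swaps the Isolation Forest for a simple interquartile-range rule so that it runs with the standard library alone; the record layout, the `cases` field, and the `quality_report` helper are hypothetical and do not reflect the actual DHIS2 module.

```python
import statistics


def quality_report(records, value_key="cases"):
    """Flag duplicate rows, missing values, and IQR outliers in a list of dicts."""
    seen, duplicates, missing = set(), [], []
    for i, rec in enumerate(records):
        fingerprint = tuple(sorted(rec.items()))
        if fingerprint in seen:
            duplicates.append(i)
        seen.add(fingerprint)
        if rec.get(value_key) is None:
            missing.append(i)

    # IQR rule (a stand-in for Isolation Forest): flag values beyond 1.5 * IQR.
    values = [r[value_key] for r in records if r.get(value_key) is not None]
    q1, _, q3 = statistics.quantiles(values, n=4)
    low, high = q1 - 1.5 * (q3 - q1), q3 + 1.5 * (q3 - q1)
    outliers = [
        i for i, r in enumerate(records)
        if r.get(value_key) is not None and not low <= r[value_key] <= high
    ]
    return {"duplicates": duplicates, "missing": missing, "outliers": outliers}


# Hypothetical weekly case counts per facility.
records = [
    {"facility": "A", "week": 1, "cases": 12},
    {"facility": "B", "week": 1, "cases": 15},
    {"facility": "B", "week": 1, "cases": 15},    # exact duplicate of the row above
    {"facility": "C", "week": 1, "cases": None},  # missing value
    {"facility": "D", "week": 1, "cases": 14},
    {"facility": "E", "week": 1, "cases": 13},
    {"facility": "F", "week": 1, "cases": 400},   # implausible spike
]
report = quality_report(records)
print(report)
```

The report lists row indices per issue type, so a reviewer can inspect flagged rows before anything is corrected or dropped.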
Mobile + Health
Vaccine & Healthcare Geolocation App

Vaccine stock management and healthcare facility navigation tool for the USAID PMM Program. Interactive maps with turn-by-turn directions, vaccine inventory visualization by type and date.

React Native · FastAPI · PostgreSQL · QGIS
Competition · 3rd Place
IndabaX Madagascar 2023 — 3rd Place

NLP pipeline for pathology prediction from clinical symptoms across 10 disease classes. Achieved 94% accuracy with XGBoost + CatBoost ensemble, TF-IDF, Word2Vec, and spaCy preprocessing.

XGBoost · CatBoost · spaCy · Word2Vec
Pan-African Competition
DataTour 2025 — Credit & Content Models

Two models in the Pan-African data science competition: a content recommendation model for 122M social media interactions; and a credit default risk classifier trained on real bank lending data using XGBoost and LightGBM.

XGBoost · LightGBM · Collaborative Filtering · Feature Engineering
Benchmarking
SQL Generation Model Benchmarking

Evaluated SQLCoder-7B-2, GPT-3.5/4, and Claude on architecture, hallucination mitigation, BigQuery syntax, and Spider benchmark. Produced multi-cloud deployment guide for enterprise adoption.

SQLCoder-7B · GPT-4 · Claude API · BigQuery

Built on Rigorous
Academic Foundations

Master of Science — Big Data & Artificial Intelligence
IT University & ESTIA France
Dec 2022 – Dec 2024
Specializations: Big Data, AI, Machine Learning, Full-Stack Development
Grade: 17.2 / 20
Bachelor's Degree — Computer Science, Software Engineering
IT University · Antananarivo, Madagascar
Dec 2019 – Dec 2022
Specializations: Full-Stack Engineering, Software Engineering, Web Development, Databases, AI
Scientific Baccalaureate — Series C
Sainte Famille Mahamasina · Antananarivo, Madagascar
October 2019
Advanced English C1
EF-SET Certified
Python 3
CodinGame Certified
English for Business
HP Life Certification
IndabaX Madagascar 2023 — 3rd Place
Text classification NLP pipeline for pathology prediction across 10 disease classes. Achieved 94% accuracy using XGBoost, CatBoost, TF-IDF, Word2Vec, and spaCy preprocessing.
DataTour 2025 — Pan-African Data Science Competition
Content recommendation model for ~122M social media interactions (collaborative + content-based filtering; causal feature engineering to resolve temporal data leakage). Default risk classifier on real bank lending data evaluated with XGBoost and LightGBM.
Tech Lead — Wikolabs AI Studio
Founding tech lead of Wikolabs, an AI and automation studio delivering production AI solutions for local and international clients from Madagascar.
USAID Health Data Trainer
Specialized trainer for Madagascar Ministry of Health civil servants in KoBoToolbox and QGIS cartography across multiple Malagasy regions, under the USAID PMI Measure Malaria program.
English
C1 Advanced · EF-SET
French
Native / Fluent
Malagasy
Native

Ready to Bring Your
AI Vision to Life

Whether you're a CEO evaluating an AI strategy, a CTO scaling a platform, or a recruiter building a world-class team — let's talk. I deliver production-grade AI, not prototypes.

atr.guillaume@gmail.com
Antananarivo, Madagascar · Available Worldwide (Remote)