Kassoum Sanogo - AI & Data Engineer

My Skills

Core Skills

Artificial Intelligence Machine Learning Big Data Deep Learning Data Science Software Engineering Computer Vision LangChain Embedded Systems RAG & CAG AI Agent Creation

Certifications (10)

AWS Partner: Generative AI Essentials

AWS Technical Accreditation

LLMOps Certification

AWS Certified Data Engineer - Associate

AWS Developing Machine Learning Solutions

Build LLM Apps with LangChain.js

Building Multimodal Search and RAG

Functions, Tools and Agents with LangChain

Knowledge Graphs for RAG

LangChain for LLM Application Development

Main Programming Languages

Python AI and data science

R Statistics and data science

Java Big data, production

C++ Performance, embedded AI

C Performance computing

HTML/CSS/JavaScript website

ML/DL Frameworks

TensorFlow Deep learning framework

PyTorch Research and production

Keras High-level with TensorFlow

Scikit-learn Traditional ML

XGBoost / LightGBM Gradient boosting

Hugging Face NLP and transformers

Data Processing & Visualization

Pandas / NumPy Data manipulation

Matplotlib / Seaborn Python visualization

Plotly Interactive visualization

Power BI / Tableau Business Intelligence

Jupyter / JupyterLab Development environment

NLP & Language Processing

spaCy Industrial NLP

NLTK NLP toolkit

Transformers State-of-the-art models

LangChain LLM applications

LlamaIndex Agents, RAG

LLMs & RAG

OpenAI API GPT models integration

Anthropic Claude Conversational LLM

Mistral AI French LLM

Ollama Local execution

GroqCloud Fast inference

Pinecone / FAISS Vector search

ChromaDB Vector database

Computer Vision

OpenCV Computer vision library

YOLO Object detection

Detectron2 Meta Vision

MediaPipe Google Vision

SAM Segmentation models

CVAT Data labeling and annotation

MLOps & DevOps

MLflow ML lifecycle

DVC Data Version Control

Apache Airflow Workflow orchestration

Kubeflow ML on Kubernetes

Weights & Biases Experiment tracking

Docker Containerization

Cloud AI & Platforms

AWS Some service mastered like sagemaker

GCP Some service mastered like Vertex AI

Azure Some service mastered like AI Foundry

H2O.ai AutoML

Databricks Data and AI platform

Databases & Big Data

MongoDB NoSQL database

PostgreSQL SQL + JSON

Neo4j Graph database

Apache Kafka Streaming platform

Cassandra Distributed database

DuckDB Local analytics

MySQL-SQLite Data Storage

ChromaDB Vector storage

Generative AI

DALL·E / Midjourney Image generation

Stable Diffusion Image generation

Runway ML / Sora Video generation

ElevenLabs / Coqui Voice synthesis

MusicGen / Suno Music generation

ChatGPT / Gemini Conversational AI

Claude Code Code generation

Hugging Face models Open source models

Some of my Projects (17)

Note: Private and confidential projects developed during consulting missions are not disclosed here to respect client confidentiality agreements and non-disclosure terms.

AI for Historical Documents - UQAC Mitacs(research)

Hisorical document process by creating AI Agent for retranscription

Analysis and process document pipelines
Find metadata for context specialisation
Create AI Platform for this process
LLM + OCR for relevant process

LLM OCR Historical Document AI architecture

Multi-LLM Platform Arlette - BPIFRANCE(company)

Production solution enabling interconnection of multiple LLM models on a single RAG system with shared memory.

LiteLLM to unify model outputs
LangChain for model chaining
GroqCloud for Open Source models
Data science techniques for document processing

LLM RAG LangChain Production Finance

Log Analysis Platform - MMA(company)

Production platform at MMA using advanced techniques for system log analysis and interpretation.

Regex for log synthesis
Machine Learning for classification
Vectorization and LLMs for automatic interpretation

Machine Learning LLM Production Prometheus

AI and Plants - IRD PARIS(research laboratory)

Research project on artificial intelligence application to plant studies.

Python and Java API development
Data visualization through graphics
Multi-platform API for data input
Applied AI research

Research API Visualization AI

Hydrological Analysis - Bagré Dam(thesis-PHD support)

Predictive analysis system for Bagré dam water using 10 year dam data.

Temperature, pressure, and water level data processing
Water level rise and fall prediction
Flood and drought period analysis

Machine Learning Data Processing Predictive Analysis Visualisation Graphs

Specification Analyzer - GROUP COVEA(company)

Automatic specification document analysis platform for development project estimation.

Development time estimation
Developer number and level recommendations
Complete specification analysis
LLM and Machine Learning techniques

LLM Machine Learning Document Analysis and process CAG

Order Logistics Optimization - Carrefour(company)

Automation and optimization system for warehouse order preparation time.

Route optimization algorithms
Preparation process automation
Significant processing time reduction

Optimization Automation Logistics

University Carpooling Platform - Le Mans University(University)

Carpooling application dedicated to university with restricted access to university community members.

Java development
University authentication system
Trip and reservation management

Java Web Application Authentication

Werewolf AI Game - 24 hours Code(hackaton)

Werewolf game adaptation allowing solo players to play with conversational AI agents.

LLM integration for characters
Interactive chat with AI agents
Automated game logic

LLM Conversational AI Interactive Game

Indoor Localization System

Development of a high-precision indoor localization system using Machine Learning techniques.

Improved localization data accuracy
Machine Learning algorithms for triangulation
Real-time performance optimization

Machine Learning Localization Real-time

Voice Recognition

Automatic language recognition system from voice samples using Machine Learning techniques.

Audio signal processing
Voice feature extraction
Multi-language classification

Machine Learning NLP Audio Processing

Clothing Classification

Development of automatic classification models for clothing recognition and categorization.

Clothing image processing
Supervised classification models
Classification performance optimization

Machine Learning Computer Vision Classification

AI for Othello Game

AI development for Othello game with automatic piece ranking, recognized as the best model in class.

Deep Learning algorithms for game strategy
Automatic optimal position classification
Game performance optimization

Deep Learning Game AI Classification

Arduino Autonomous Car

Autonomous vehicle prototype capable of real-time obstacle avoidance.

DC motor and servo motor for propulsion
Ultrasonic sensors for obstacle detection
Real-time avoidance algorithms
Arduino programming

Arduino Embedded Systems Robotics

STM32 Presence Detection

Presence detection system using PIR sensors and STM32 F476 microcontroller.

PIR sensor integration
STM32 F476 programming
Interrupt and signal management

STM32 PIR Sensors Real-time

STM32 Weather Station

Weather forecasting system using DHT11 sensor and STM32 F411RE Nucleo microcontroller.

DHT11 sensor for temperature and humidity
Integrated LCD control with STM32 F476
System interrupt usage

STM32 Weather Sensors LCD

Real-time Weather Application

Weather application using Météo France and Base Adresse France APIs for real-time data.

Météo France API integration
Geolocation with Base Adresse France
Intuitive Java user interface

Java REST API Weather

Some of my Research Areas (16)

PULSAR Multi-Agent Orchestration - Apside France(company) - CIR research project

Dynamic orchestration of specialized AI agents via a Super Agent (PULSAR) with shared memory and intelligent prioritization.

Intelligent coordination pipeline
Shared memory between agents
Dynamic task prioritization
Contextual human interventions

PULSAR Multi-Agents Orchestration

AI Ethics and GDPR Compliance - Le Mans University(Research document for my MBA)

In-depth study on algorithmic bias minimization and privacy protection according to European and Canadian standards.

Explicit consent mechanism implementation
Data anonymization and pseudonymization
Model auditability and decision traceability
International regulation compliance

GDPR AI Ethics Bias

AI Data Governance - Le Mans University(Research document for my MBA)

Data flow traceability, metadata management, and access policy integration based on user profiles.

Traceable and auditable data pipelines
Fine metadata and lineage management
Role-based access control (RBAC)
Retention and archiving policies

Data Governance Metadata Access

AI for Historical Documents - (Research at UQAC - Mitacs)

Hisorical document process by creating AI Agent for retranscription

Analysis and process document pipelines
Find metadata for context specialisation
Create AI Platform for this process
LLM + OCR for relevant process

LLM OCR Historical Document AI architecture

AI and Plants - IRD PARIS(Research Laboratory)

Applied research on artificial intelligence and plants with API development and data visualization.

Python and Java API for data collection
Interactive graphics for visualization
Multi-platform connectivity
Botanical data input interface

IRD Botany API Visualization

Document Vectorization System

Implementation of an advanced vectorization system to efficiently process data from complex documents.

Embedding optimization for different document types
Improved semantic search accuracy
Adaptation to multimodal formats (text, images, tables)

Semantic search Vectorization Embeddings Documents Key Word search

Embedding Model Optimization

Research on optimizing Sentence Transformer embedding models to improve semantic similarity performance.

Adaptive fine-tuning by domain
Dimensionality reduction without quality loss
Model distillation techniques

Sentence Transformer Optimization Fine-tuning

SQLite Memory optimisation for LLM

Development of a persistent memory system for LLMs using SQLite, reducing context loss.

Intelligent storage of previous conversations
Dynamic contextual retrieval
Multi-model adaptation (GPT, Claude, Llama)
Long-task focus management

LLM Memory SQLite Context

LLM Comparison Benchmark

Implementation of a comprehensive benchmark system to compare different LLM model performances.

Standardized performance metrics
Testing across different domains and languages
Consistency and creativity evaluation
Results visualization interface

Benchmark Evaluation Performance

LLM Bias Audit and Detection

Semi-automated system for detecting bias or unethical behavior in LLM-generated responses.

Automatic cultural and social bias detection
Toxicity and discrimination analysis
Fairness and equity metrics
Alert and correction system

AI Audit Bias Detection Fairness

Explainability (XAI) for NLP

Development of local and intrinsic explanation methods for classification model, RAG, or LLM outputs.

LIME and SHAP implementation for NLP
Attention visualization techniques
RAG system explainability
User interface for interpretation

XAI LIME/SHAP Interpretability

AI Security - Adversarial Attacks

Defense and filtering strategies against adversarial attacks and prompt injection in AI systems.

Jailbreak and prompt injection detection
Intelligent filtering of malicious inputs
Robustness against adversarial attacks
Adaptive security mechanisms

AI Security Attacks Defense

Autonomous Specialized Agents

Modular design of AI agents with specific capabilities, collaborating via shared protocol.

Domain-specialized RAG agents
Web scraping and analysis agents
Classification and annotation agents
Modular and extensible architecture

AI Agents Autonomy Specialization

Agent-to-Agent Interoperability

Standardization of agent exchanges via JSON for adaptive message passing protocols.

Standardized communication protocols
Adaptive and asynchronous message passing
Automatic protocol negotiation
Failure management and recovery

Interoperability Protocols Communication

AI Workflow Optimization

Integration of advanced decision mechanisms for complex task execution with intelligent planning.

Tree-of-Thoughts and ReAct implementation
Hierarchical task planning
AutoGen for automatic coordination
Global performance optimization

Workflows Planning AutoGen

Multi AI Agent Performance Evaluation

Benchmark on efficiency, consistency, and scalability of collaborative multi-agent systems.

Collaborative performance metrics
Role distribution evaluation
Latency and throughput measurement
Final result quality and consistency

AI Agent Benchmark Performance Scalability

Education

2024 - 2026

Engineering Degree

Specialization in Data Science and Artificial Intelligence

ESEO - TOP Engineering School (France)

Advanced training in data science and artificial intelligence, with focus on emerging technologies and industrial applications.

2024 - 2025

MBA Master

Management and Business Administration

Le Mans University (France)

Complementary training in management and administration to develop entrepreneurial and leadership skills.

2022 - 2024

Engineering Degree

Specialization in Embedded Systems and Real-time

ENSIM - Engineering School Le Mans University (France)

In-depth training in embedded systems, real-time programming, hardware architecture and IoT solution development.

2020 - 2022

Preparatory Classes for Engineering Schools

Mathematics and Physics (CPGE)

École Polytechnique - Polytechnic School of engineering (Burkina Faso)

Intensive training in mathematics and physics, development of analytical and complex problem-solving capabilities.

Academic Highlights

Academic Excellence

Excellence track with multiple specializations in AI and embedded systems. GPA 5.0/5.0 in engineering school ESEO

Dual Competency

Deep technical training complemented by management skills

Innovation

Focus on emerging technologies and practical enterprise application

Kassoum Sanogo Artificial Intelligence, Machine Learning & Data Engineer

My Skills

Core Skills

Certifications (10)

Main Programming Languages

ML/DL Frameworks

Data Processing & Visualization

NLP & Language Processing

LLMs & RAG

Computer Vision

MLOps & DevOps

Cloud AI & Platforms

Databases & Big Data

Generative AI

Professional Experience

AI Research fellow - MITACS - UQAC

AI & Data Engineer

AI & Data Engineer Apprentice

AI Researcher / CEO Univers AI

Consultant AI Engineer

Consultant AI Engineer

Consultant AI Engineer

AI Researcher Internship

Data Analyst Internship

Some of my Projects (17)

AI for Historical Documents - UQAC Mitacs(research)

Multi-LLM Platform Arlette - BPIFRANCE(company)

Log Analysis Platform - MMA(company)

AI and Plants - IRD PARIS(research laboratory)

Hydrological Analysis - Bagré Dam(thesis-PHD support)

Specification Analyzer - GROUP COVEA(company)

Order Logistics Optimization - Carrefour(company)

University Carpooling Platform - Le Mans University(University)

Werewolf AI Game - 24 hours Code(hackaton)

Indoor Localization System

Voice Recognition

Clothing Classification

AI for Othello Game

Arduino Autonomous Car

STM32 Presence Detection

STM32 Weather Station

Real-time Weather Application

Some of my Research Areas (16)

PULSAR Multi-Agent Orchestration - Apside France(company) - CIR research project

AI Ethics and GDPR Compliance - Le Mans University(Research document for my MBA)

AI Data Governance - Le Mans University(Research document for my MBA)

AI for Historical Documents - (Research at UQAC - Mitacs)

AI and Plants - IRD PARIS(Research Laboratory)

Document Vectorization System

Embedding Model Optimization

SQLite Memory optimisation for LLM

LLM Comparison Benchmark

LLM Bias Audit and Detection

Explainability (XAI) for NLP

AI Security - Adversarial Attacks

Autonomous Specialized Agents

Agent-to-Agent Interoperability

AI Workflow Optimization

Multi AI Agent Performance Evaluation

Education

Engineering Degree

Specialization in Data Science and Artificial Intelligence

MBA Master

Management and Business Administration

Engineering Degree

Specialization in Embedded Systems and Real-time

Preparatory Classes for Engineering Schools

Mathematics and Physics (CPGE)

Academic Highlights

Academic Excellence

Dual Competency

Innovation

Contact