Kassoum Sanogo Artificial Intelligence, Machine Learning & Data Engineer

Engineer, researcher and manager with academic and industrial skills in Artificial Intelligence, machine learning, deep learning, data science and embedded systems.

Kassoum Sanogo

My Skills

Main Programming Languages

Python AI and data science
R Statistics and data science
Java Big data, production
C++ Performance, embedded AI
C Performance computing
HTML/CSS/JavaScript website

ML/DL Frameworks

TensorFlow Deep learning framework
PyTorch Research and production
Keras High-level with TensorFlow
Scikit-learn Traditional ML
XGBoost / LightGBM Gradient boosting
Hugging Face NLP and transformers

Data Processing & Visualization

Pandas / NumPy Data manipulation
Matplotlib / Seaborn Python visualization
Plotly Interactive visualization
Power BI / Tableau Business Intelligence
Jupyter / JupyterLab Development environment

NLP & Language Processing

spaCy Industrial NLP
NLTK NLP toolkit
Transformers State-of-the-art models
LangChain LLM applications
LlamaIndex Agents, RAG

LLMs & RAG

OpenAI API GPT models integration
Anthropic Claude Conversational LLM
Mistral AI French LLM
Ollama Local execution
GroqCloud Fast inference
Pinecone / FAISS Vector search
ChromaDB Vector database

Computer Vision

OpenCV Computer vision library
YOLO Object detection
Detectron2 Meta Vision
MediaPipe Google Vision
SAM Segmentation models
CVAT Data labeling and annotation

MLOps & DevOps

MLflow ML lifecycle
DVC Data Version Control
Apache Airflow Workflow orchestration
Kubeflow ML on Kubernetes
Weights & Biases Experiment tracking
Docker Containerization

Cloud AI & Platforms

AWS Some service mastered like sagemaker
GCP Some service mastered like Vertex AI
Azure Some service mastered like AI Foundry
H2O.ai AutoML
Databricks Data and AI platform

Databases & Big Data

MongoDB NoSQL database
PostgreSQL SQL + JSON
Neo4j Graph database
Apache Kafka Streaming platform
Cassandra Distributed database
DuckDB Local analytics
MySQL-SQLite Data Storage
ChromaDB Vector storage

Generative AI

DALL·E / Midjourney Image generation
Stable Diffusion Image generation
Runway ML / Sora Video generation
ElevenLabs / Coqui Voice synthesis
MusicGen / Suno Music generation
ChatGPT / Gemini Conversational AI
Claude Code Code generation
Hugging Face models Open source models

Professional Experience

AI Research fellow - MITACS - UQAC

2025 : Canada

AI & Data Engineer

2025 : Canada

AI & Data Engineer Apprentice

2024 - 2026 : France

AI Researcher / CEO Univers AI

2025 : France

Consultant AI Engineer

2024 : France

Consultant AI Engineer

2025 : France

Consultant AI Engineer

2024 - 2025 : France

AI Researcher Internship

2023 - 2024 : France

Data Analyst Internship

2023 : France

Some of my Projects (17)

Note: Private and confidential projects developed during consulting missions are not disclosed here to respect client confidentiality agreements and non-disclosure terms.

AI for Historical Documents - UQAC Mitacs(research)

Hisorical document process by creating AI Agent for retranscription

  • Analysis and process document pipelines
  • Find metadata for context specialisation
  • Create AI Platform for this process
  • LLM + OCR for relevant process
LLM OCR Historical Document AI architecture

Multi-LLM Platform Arlette - BPIFRANCE(company)

Production solution enabling interconnection of multiple LLM models on a single RAG system with shared memory.

  • LiteLLM to unify model outputs
  • LangChain for model chaining
  • GroqCloud for Open Source models
  • Data science techniques for document processing
LLM RAG LangChain Production Finance

Log Analysis Platform - MMA(company)

Production platform at MMA using advanced techniques for system log analysis and interpretation.

  • Regex for log synthesis
  • Machine Learning for classification
  • Vectorization and LLMs for automatic interpretation
Machine Learning LLM Production Prometheus

AI and Plants - IRD PARIS(research laboratory)

Research project on artificial intelligence application to plant studies.

  • Python and Java API development
  • Data visualization through graphics
  • Multi-platform API for data input
  • Applied AI research
Research API Visualization AI

Hydrological Analysis - Bagré Dam(thesis-PHD support)

Predictive analysis system for Bagré dam water using 10 year dam data.

  • Temperature, pressure, and water level data processing
  • Water level rise and fall prediction
  • Flood and drought period analysis
Machine Learning Data Processing Predictive Analysis Visualisation Graphs

Specification Analyzer - GROUP COVEA(company)

Automatic specification document analysis platform for development project estimation.

  • Development time estimation
  • Developer number and level recommendations
  • Complete specification analysis
  • LLM and Machine Learning techniques
LLM Machine Learning Document Analysis and process CAG

Order Logistics Optimization - Carrefour(company)

Automation and optimization system for warehouse order preparation time.

  • Route optimization algorithms
  • Preparation process automation
  • Significant processing time reduction
Optimization Automation Logistics

University Carpooling Platform - Le Mans University(University)

Carpooling application dedicated to university with restricted access to university community members.

  • Java development
  • University authentication system
  • Trip and reservation management
Java Web Application Authentication

Werewolf AI Game - 24 hours Code(hackaton)

Werewolf game adaptation allowing solo players to play with conversational AI agents.

  • LLM integration for characters
  • Interactive chat with AI agents
  • Automated game logic
LLM Conversational AI Interactive Game

Indoor Localization System

Development of a high-precision indoor localization system using Machine Learning techniques.

  • Improved localization data accuracy
  • Machine Learning algorithms for triangulation
  • Real-time performance optimization
Machine Learning Localization Real-time

Voice Recognition

Automatic language recognition system from voice samples using Machine Learning techniques.

  • Audio signal processing
  • Voice feature extraction
  • Multi-language classification
Machine Learning NLP Audio Processing

Clothing Classification

Development of automatic classification models for clothing recognition and categorization.

  • Clothing image processing
  • Supervised classification models
  • Classification performance optimization
Machine Learning Computer Vision Classification

AI for Othello Game

AI development for Othello game with automatic piece ranking, recognized as the best model in class.

  • Deep Learning algorithms for game strategy
  • Automatic optimal position classification
  • Game performance optimization
Deep Learning Game AI Classification

Arduino Autonomous Car

Autonomous vehicle prototype capable of real-time obstacle avoidance.

  • DC motor and servo motor for propulsion
  • Ultrasonic sensors for obstacle detection
  • Real-time avoidance algorithms
  • Arduino programming
Arduino Embedded Systems Robotics

STM32 Presence Detection

Presence detection system using PIR sensors and STM32 F476 microcontroller.

  • PIR sensor integration
  • STM32 F476 programming
  • Interrupt and signal management
STM32 PIR Sensors Real-time

STM32 Weather Station

Weather forecasting system using DHT11 sensor and STM32 F411RE Nucleo microcontroller.

  • DHT11 sensor for temperature and humidity
  • Integrated LCD control with STM32 F476
  • System interrupt usage
STM32 Weather Sensors LCD

Real-time Weather Application

Weather application using Météo France and Base Adresse France APIs for real-time data.

  • Météo France API integration
  • Geolocation with Base Adresse France
  • Intuitive Java user interface
Java REST API Weather

Some of my Research Areas (16)

PULSAR Multi-Agent Orchestration - Apside France(company) - CIR research project

Dynamic orchestration of specialized AI agents via a Super Agent (PULSAR) with shared memory and intelligent prioritization.

  • Intelligent coordination pipeline
  • Shared memory between agents
  • Dynamic task prioritization
  • Contextual human interventions
PULSAR Multi-Agents Orchestration

AI Ethics and GDPR Compliance - Le Mans University(Research document for my MBA)

In-depth study on algorithmic bias minimization and privacy protection according to European and Canadian standards.

  • Explicit consent mechanism implementation
  • Data anonymization and pseudonymization
  • Model auditability and decision traceability
  • International regulation compliance
GDPR AI Ethics Bias

AI Data Governance - Le Mans University(Research document for my MBA)

Data flow traceability, metadata management, and access policy integration based on user profiles.

  • Traceable and auditable data pipelines
  • Fine metadata and lineage management
  • Role-based access control (RBAC)
  • Retention and archiving policies
Data Governance Metadata Access

AI for Historical Documents - (Research at UQAC - Mitacs)

Hisorical document process by creating AI Agent for retranscription

  • Analysis and process document pipelines
  • Find metadata for context specialisation
  • Create AI Platform for this process
  • LLM + OCR for relevant process
LLM OCR Historical Document AI architecture

AI and Plants - IRD PARIS(Research Laboratory)

Applied research on artificial intelligence and plants with API development and data visualization.

  • Python and Java API for data collection
  • Interactive graphics for visualization
  • Multi-platform connectivity
  • Botanical data input interface
IRD Botany API Visualization

Document Vectorization System

Implementation of an advanced vectorization system to efficiently process data from complex documents.

  • Embedding optimization for different document types
  • Improved semantic search accuracy
  • Adaptation to multimodal formats (text, images, tables)
Semantic search Vectorization Embeddings Documents Key Word search

Embedding Model Optimization

Research on optimizing Sentence Transformer embedding models to improve semantic similarity performance.

  • Adaptive fine-tuning by domain
  • Dimensionality reduction without quality loss
  • Model distillation techniques
Sentence Transformer Optimization Fine-tuning

SQLite Memory optimisation for LLM

Development of a persistent memory system for LLMs using SQLite, reducing context loss.

  • Intelligent storage of previous conversations
  • Dynamic contextual retrieval
  • Multi-model adaptation (GPT, Claude, Llama)
  • Long-task focus management
LLM Memory SQLite Context

LLM Comparison Benchmark

Implementation of a comprehensive benchmark system to compare different LLM model performances.

  • Standardized performance metrics
  • Testing across different domains and languages
  • Consistency and creativity evaluation
  • Results visualization interface
Benchmark Evaluation Performance

LLM Bias Audit and Detection

Semi-automated system for detecting bias or unethical behavior in LLM-generated responses.

  • Automatic cultural and social bias detection
  • Toxicity and discrimination analysis
  • Fairness and equity metrics
  • Alert and correction system
AI Audit Bias Detection Fairness

Explainability (XAI) for NLP

Development of local and intrinsic explanation methods for classification model, RAG, or LLM outputs.

  • LIME and SHAP implementation for NLP
  • Attention visualization techniques
  • RAG system explainability
  • User interface for interpretation
XAI LIME/SHAP Interpretability

AI Security - Adversarial Attacks

Defense and filtering strategies against adversarial attacks and prompt injection in AI systems.

  • Jailbreak and prompt injection detection
  • Intelligent filtering of malicious inputs
  • Robustness against adversarial attacks
  • Adaptive security mechanisms
AI Security Attacks Defense

Autonomous Specialized Agents

Modular design of AI agents with specific capabilities, collaborating via shared protocol.

  • Domain-specialized RAG agents
  • Web scraping and analysis agents
  • Classification and annotation agents
  • Modular and extensible architecture
AI Agents Autonomy Specialization

Agent-to-Agent Interoperability

Standardization of agent exchanges via JSON for adaptive message passing protocols.

  • Standardized communication protocols
  • Adaptive and asynchronous message passing
  • Automatic protocol negotiation
  • Failure management and recovery
Interoperability Protocols Communication

AI Workflow Optimization

Integration of advanced decision mechanisms for complex task execution with intelligent planning.

  • Tree-of-Thoughts and ReAct implementation
  • Hierarchical task planning
  • AutoGen for automatic coordination
  • Global performance optimization
Workflows Planning AutoGen

Multi AI Agent Performance Evaluation

Benchmark on efficiency, consistency, and scalability of collaborative multi-agent systems.

  • Collaborative performance metrics
  • Role distribution evaluation
  • Latency and throughput measurement
  • Final result quality and consistency
AI Agent Benchmark Performance Scalability

Education

2024 - 2026

Engineering Degree

Specialization in Data Science and Artificial Intelligence

ESEO - TOP Engineering School (France)
Advanced training in data science and artificial intelligence, with focus on emerging technologies and industrial applications.
2024 - 2025

MBA Master

Management and Business Administration

Le Mans University (France)
Complementary training in management and administration to develop entrepreneurial and leadership skills.
2022 - 2024

Engineering Degree

Specialization in Embedded Systems and Real-time

ENSIM - Engineering School Le Mans University (France)
In-depth training in embedded systems, real-time programming, hardware architecture and IoT solution development.
2020 - 2022

Preparatory Classes for Engineering Schools

Mathematics and Physics (CPGE)

École Polytechnique - Polytechnic School of engineering (Burkina Faso)
Intensive training in mathematics and physics, development of analytical and complex problem-solving capabilities.

Academic Highlights

Academic Excellence

Excellence track with multiple specializations in AI and embedded systems. GPA 5.0/5.0 in engineering school ESEO

Dual Competency

Deep technical training complemented by management skills

Innovation

Focus on emerging technologies and practical enterprise application

Contact

kassoum.sanogo@outlook.com
🇫🇷 France & 🇨🇦 Canada
Artificial Intelligence, Machine Learning & Data Engineer @ Apside

Development: This website was entirely developed by myself using modern web technologies. The complete source code is available on my GitHub. Please provide proper attribution if you decide to use or reference this work. Thank you!