SEPTEO (2024)
Migration of in house legacy transformation tool to modern data stack using Airbyte + DBT+ Snowflake.
TOTAL / FINDIT (2020 - 2024)
Machine Learning project : Data Extraction, transformation and serving of data from Oracle Operational DBs using Amazon Glue and Redshift. DataLake with Parquet files on bronze and Iceberg on Silver and DW on Gold Medallion architecture.
AWS Data Catalog, Step Functions, materialized vews.
Data Wiz Project : Development of price tuning tool using opendata,SSIS and PowerBI.
Restructuring of a Price Forecast system on R and its underlying SSIS ETL for AS400.
IOT Backend Project : Design and implementation of IOT streaming architecture on Azure over SIGFOX. Refactoring of Sql Databases.
HYPERSTACK (2024)
Multiple account Klaviyo campaign data integration via Airbyte and dbt. Destination BigQuery.
Xandr daily csv files received by mail integrated through Google Cloud, using gcp cloud functions , buckets + airbyte + Bigquery scheduled queries.
MALAKOFF HUMANIS (2023)
Functional Analysis and Consultancy on PowerBI reports, performance and KPI / metric standardization in a data mesh environment.
Lulu Dans Ma Rue (2022-2023)
Analysis of different SAAS choices and related data integration problems. Roadmap for a modern data stack approach. Integration of Hubspot, Ogustine, Breezy, mssql data sources via Airbyte. Destination Bigquery.
PERNOD RICARD (2021)
Data transformation best practices using Databricks on Snowflake datawarehouses.
RENAULT (2017-2018)
Data analysis and warehousing project to enhance the scoring algorithm used for forecasting the likelihood of future car purchases, enabling the marketing team to target the customer-base more effectively.
Creation of SQL Server data warehouses and SSIS procedures for data integration, data cleansing and transformation and creation of features for logistic and linear regression analysis.
BNP (2016)
Carried out a business analysis and ETL implementation project for BNP Paribas in Morocco. Held several workshops with business and technical stakeholders, to specify the scope and requirements for a data extraction project aimed at improving the end-to-end account opening and closure processes, within both the retail and corporate banking divisions using SSIS.
REPAM (2024)
Data Stack Migration to ELT architecture using Apache Nifi, Airflow and dbt on postgres.
900 CARE (2021-2024)
Shopify, Recharge ,Yotpo, Funnel.io , Google Big Query Datawarehouse, R scripts for API data ingestion, Power BI reporting on commercial activity, churn stats, average order value, Life TIme Value and Cost of acquitions reports.Integration of Shopify, Klaviyo, Gorgias, Recharge.
Modern Data Stack implementation using Airbyte,Fivetran, google cloud components like PUB/Sub + cloud functions and dbt.
(15 Data Sources, 50 entities, 200 models including cumulative tables, sharded tables, snapshots, incremental loads)
LIVEJOURNEY (2020 - 2021)
Big data consulting. Data Cleansing and transformation modules using pyspark on Apache spark , sql on Vertica and aggregation pipelines on mongoDB. Datawarehouse and lakehouse modelling and implementation.
WEBHELP (2020-2021)
Power BI Premium Governance consultancy and implementation of best practices with reporting teams for resolving performance issues on premium capacities. Restructuring of SSAS Data Architecture, Dataset refresh schedules, Workspace Management. Development of Power BI reports for capacity metrics using audit logs and artifacts.
CATALINA Marketing (2019-2020)
Development of data pipelines to transfer 42 Europe biggest retailers’ coupon and sale data over 2 years from Netezza on-premise servers to Snowflake on Azure, via Rundeck scripts for Catalina Marketing. Salesforce campaign data and SAS files transfer to Snowflake . Data Science team’s Databricks scripts integration on Azure Data Factory.
LFB (pharma) (2018-2019)
Led the data migration from existing HR, pay and related SaaS systems to Workday clusters. Development of SSIS packages from all data sources, data quality reporting, stakeholders management of referential data mapping, master data decisions and data cleansing rules. Implementing data versioning and time travel capacity.
SOLIMUT MUTUELLE : (2023)
Data Pipelines & Datawarehouse creation on PowerBI ecosystem.