Développement d’applications data sur Microsoft Azure
• Lead data engineer, projet OME : OME (outil de management de l’énergie) est un outil qui vise à offrir une vue détaillée de la
consommation d’énergie sur les trains SNCF
– Croiser diverses sources de données pour produire de la donnée enrichie sur le trafic et l’énergie des trains
– Participer aux ateliers métier; rédiger des user stories
– Accompagner la recette métier
– Travailler avec les équipes transverses : OPs (flux, usine logicielle, cloud), architecte, visualisation
– Accompagner l’équipe web
• Créer une application pour la gestion de la qualité de données
• Créer une template pour les projets spark
scala, spark, hDInsights, microsoft azure, qlik sense
Build a system for a finer tracking of parts and materials used in the manufacturing of vehicles in order to make significant costs avoidances
• Process data sources related to vehicles manufacturing processes and output relevant tracking information
• Expose data through REST Apis querying NoSQL storage
• Handle continuous integration and delivery (CI/CD) with DevOps team
• Work in a Scrum team
spark (sql), scala, hive, hbase, elasticsearch, play framework, zeppelin, aws, hdp, gitlab, scrum
Groupe La Poste · Colissimo (parcel service) Issy-Les-Moulineaux, France
Build software for parcel tracking in an event-driven architecture
• Worked for several projects through three teams (4 to 9 members), following the Scrum methodology
• Built event processing pipelines using Kafka, Spark Streaming and Cassandra
• Developed web services for data manipulation using the Play Framework
scala, kafka, avro, spark (streaming), cassandra, elasticsearch, play framework, ansible, lxc, jenkins
EDF R&D (electric utility) Clamart, France
Provide EDF’s sales division with an efficient, operational tool, to discover customers’ (power consumption) profiles
• Implemented features on an in house customized version (for time series) of the Spark MLlib Decision Trees
• Built generic, scalable algorithms to compute approximate quantiles on massive time series
• Evaluated scalability and approximation accuracy through exhaustive testing
• Designed a simple UI for users
spark, scala, hdp, statistics, data mining, scalability
Evaluated native dictionary mobile apps development with Apache Cordova
• Performed competitor analysis
• Reviewed testing tools and set up a CI environment
• Designed and built a bilingual dictionary app prototype for the CUP; created a template engine
cordova, javascript, jenkins, jasmine, appium