scraping of Excel files, stored in AWS S3, using AWS Lambda, triggered by cron
- preprocessing script triggered by a file upload to AWS S3
- visualization app using Streamlit (deployed on AWS EC2)
- GitLab CI/CD with the Serverless Framework to deploy AWS services
Streamlit app creation to visualize OCR performance on invoices
- REST API conversion to gRPC
- web scraping with Selenium, deployed on AWS Lambda
- GitLab CI/CD with the Serverless Framework to deploy AWS services: Lambda, SQS
Use of Streamlit to create a user interface that data scientists use to inspect an OCR pipeline.
Refactoring code to split a script into gRPC microservices.
Disney (06-09/2021): Python refactoring, cloud, Python developer
+33 6 99 81 66 25 | ******** | 75013
- fixing and refactoring Python scripts that parse Excel files on S3 (AWS), transform
them, and insert them into a SQL Server database through SQLAlchemy. Increased API
script speed with multithreading
- Use of Batch (AWS) to orchestrate script execution.
- updating a containerized (Docker) FastAPI API
- Google Sheets API used to generate JSON files for the ******** website configuration
- editing QuickSight (AWS) dashboards
- conversion of a Flask API to FastAPI, containerized with Docker
- fine-tuning a BERT deep learning model to perform sentiment analysis on 16 languages
(Hugging Face, TensorFlow 2, PyTorch)
- model metrics tracked and stored in neptune.ai
- use of AWS Lambda and API Gateway to expose APIs
- benchmark of Tesseract versions and hyperparameters to perform OCR on forms
Re-mind: ChatGPT, API, application, MongoDB
- API that calls OpenAI's API
- Streamlit app to edit the prompt and pre-prompt
- questions/answers stored in MongoDB for further analysis