As a highly skilled data scientist with a PhD in bioinformatics and over ten years of working on relevant projects, I developed a strong data science and analytics foundation. I have spent the last few years working at world-renowned companies, developing end-to-end machine learning applications. Additionally, driven by my passion for knowledge, I’ve taught specialized courses in various data science topics.
MBA in Data Science & Analytics, 2023
Universidade de São Paulo
PhD in Science, 2021
Universidade de São Paulo
MSc in Science, 2016
Universidade de São Paulo
Pandas, Numpy, Scikit-learn, PyCaret, Matplotlib, Seaborn, Plotly, Folium, BeautifulSoup, Selenium, etc.
Create, modify and retrieve data from relational database manage systems (e.g. MySQL, Postgres)
R base, data.table, tidyverse (e.g. dplyr, ggplot2), plotly, Rmarkdown, Bioconductor packages, and more…
All essential commands in bash, plus some knowledge with awk and sed.
Familiarity with git and GitHub (using git for all projects)
Worked with AWS and GCP (e.g., data storage, BigQuery, and Vertex AI.)
API requests and API development using either Flask or FastAPI
Create images for model and applications
Process big data with SparkSQL and build ML models with SparkMLlib
Good problem-solver for data wrangling challenges. Familiar with data cleaning and feature-engineering for ML tasks.
Advanced experience building data visualizations using Python and R (specially static figures, but I’m also familiar with interactive approaches).
Descriptive and inferential statistics, incorporating theory into practical applications and in the AI projects I worked on.
Build ML models for regression, classification, and clustering problems, as well as recommender systems, and time series models. I’m also familiar with autoML.
Currently studying and applying tensorflow (keras) and pytorch
I’ve deployed models using Databricks, Dataiku, MLflow, Docker, FastAPI/Flask, alongside the version control and best coding practices.
Main activities:
Deliveries:
I work developing end-to-end AI SaaS products to our internal customers.
Main activities:
Deliveries:
Quick facts:
Tools: Dataiku, GCP, SQL, Python, Dash, Streamlit, machine learning libraries (e.g., scikit-learn), data visualization libraries.
I have worked in multiple roles: facilitator, mentor, and consultant/instructor.
As a consultant/instructor, I prepared the course modules and recorded classes. I recorded four modules: statistics, data cleaning/wrangling, clustering, and model deployment. I have also devised data science activities about descriptive and inferential statistics, unsupervised machine learning models, MLFlow, big data with PySpark, among others.
As a mentor, I assessed student reports and addressed their questions through Q&A sessions, aiding in academic and real-world projects.
As a facilitator, I participated in data science activities from exploratory analysis to model deployment. I also prepared some of those activities.
Squad: Revenue Management
Activities / Deliverables:
Tools: • Python • PySpark • MLFlow • Scikit-learn • Pycaret • Statsmodels • Forecasting frameworks (Prophet, NeuralProphet, StatsForecast)
Activities:
Deliverables:
Tools: • Python • PySpark • MLFlow • Scikit-learn • Pycaret • Catboost • Prophet • AWS • GitHub • Data visualization libraries (Plotly, Seaborn, Matplotlib)
Activities:
Grade: 10
Thesis: Identifying natural selection in Native American populations. Supported by: CAPES (2016 - 2018) and FAPESP (2018 - 2020)
Activities:
Deliverables:
I presented our preliminar work in international conferences, including USA. I also took a internship of 3 months in Barcelona - Spain.
Dissertation: Role of cellular prion protein and its ligand, stip1, in the adult neurogenesis. Supported by: CNPq (2014 - 2016)
Main techniques: primary cell culture, immunofluorescence, and hypothesis testing.
Monography: Role of the interaction between the cellular prion protein and its ligand, STI1, in the biology of neural precursors from the murine adult brain. Supported by: University scholarship (2011 - 2012) and FAPESP (2012 - 2013)
Honor & Awards:
Feel free to contact me through this form: