Data Analyst Nanodegree @Udacity:

  • Projects:
    • Project 6: Data Visualization (NEW)

      Data visualization: D3.js graph showing the relationship between health expenditure and life expectancy at birth for countries in each continent.

      Skills: D3.js, JavaScript, HTML, CSS, storytelling

    • Project 5: Intro to Machine Learning

      Machine learning: detect people involved in the ENRON scandal from e-mail data.

      Skills: python, sklearn, Gaussian Naive Bayes

    • Project 4: Explore and Summarize Data

      Statistical analysis: Analysis of features influencing Red Wine quality

      Skills: ggplot, Ggally, gridExtra, data.table, R, rmarkdown

    • Project 3: OpenStreetMap Data Wrangling with SQL

      Relational Databases: Cleaning of OpenStreetMap data for the Málaga area using data munging techniques, such as assessing the quality of the data for validity, accuracy, completeness, consistency and uniformity using a SQL data schema.

      Skills: SQL, xml, OpenStreetMap

    • Project 2: Investigate a dataset

      Statistical analysis: Investigate a dataset using in Python and perform chi-squared test for independence to draw conclusions using the module stats from scipy.

      Skills: chi-squared test, pandas, numpy, stats, scipy, Jupyter-notebook

    • Project 1: Test a Perceptual Phenomenon

      Statistical analysis: Use descriptive statistics and a statistical test to analyze the Stroop effect, a classic result of experimental psychology.

      Skills: t-test, ggplot, R, rmarkdown

Machine Learning Engineer Nanodegree @Udacity:

  • Capstone project: Right Whale call recognition using Convolutional Neural Networks

    Recognition of Right Whale up-call using Convolutional Neural Networks (ConvNets) models implemented on Tensorflow.

  • Projects:

    • Project 4 : Smartcab train

      Reinforcement Learning: Training of a smartcab to drive following traffic rules in a populated environment.

    • Project 3 : Creating customer segments

      Unsupervised Learning: Analysis of a dataset containing data on various customers’ annual spending amounts (reported in monetary units) of diverse product categories for internal structure.

      Description of the different types of customers that a wholesale distributor interacts with to give the distributor an insight into how to best structure their delivery service to meet the needs of each customer.

      Dataset from the UCI Machine Learning Repository (https://archive.ics.uci.edu/ml/datasets/Wholesale+customers).

      Skills: pandas, matplotlib, numpy, scikit-learn

    • Project 2 : Building a Student Intervention System

      Supervised Learning: Analysis of a dataset of students at high school to determine whether they will or will not pass the course in order to have an intervention of the student and prevent from failing the course.

      Selection and optimization of supervised learning algorithms in Python.

      Skills: pandas, matplotlib, numpy, scikit-learn, Jupyter-notebook

    • Project 1 : Predicting Boston Housing Prices

      Model Evaluation & Validation: Development of model for the estimation of house prices in Boston.

      Skills: matplotlib, numpy, scikit-learn, Jupyter-notebook

    • Project 0 : Titanic Survival explorations

      Exploration of the survival rate of the Titanic’s passengers according to their characteristics.

      Skills: pandas, numpy, stats, csv, Jupyter-notebook