Data Scientist Career Path after Formal Education and Training

In earlier posts, written over five years ago, I detailed my journey into data science. It started with formal education and training, when I enrolled in a Master of Science (MSc) program to obtain a graduate degree in Data Science. During my time as an MSc student, I struggled in the computer science courses, as my…

Enough Talk of Fake Data Scientists: Time for a Recognized Professional Certification

I’m a newly minted data scientist, having earned my MSc in computing and data analytics a couple of days ago. Yet I often hear from senior data scientists that anyone who lacks their credentials and years of experience should be classified as a “fake” data scientist. Yet these same senior data scientists…

Retrieving, Processing, and Visualizing Enormous Email Data

In this project, I will retrieve a large email data set from the web and place it into a database. I will then analyze and clean up the email data so that it can be visualized. I will run the Python script gmane.py to retrieve the email data, then will place the…

Google Geocoding API with a Database and Visualization on Google Maps

Today I will use the Google Geocoding API with a database, and visualize the resulting location data on Google Maps. In this project, I have a list of university names with no location information. I will add location coordinates to the universities using the Google Geocoding API, load those university names with location…

Webcrawl and Pagerank of a Website

Today, I will demonstrate a web crawl and PageRank analysis of a website. For the parser, I’m using a Python script, spider.py, which incorporates BeautifulSoup, a Python library for pulling data out of HTML and XML files. I’ll limit the number of pages to crawl to 100, and will crawl the website AnxietyBoss.com, a leading website for anxiety,…

A Tour of Data Science Educational Programs

In my quest to become a data scientist, I have embarked on a series of educational journeys, which have included both formal, in-school educational forums and self-paced MOOCs (massive open online courses). Let me start at the beginning of my path to getting an education and training in data science. Business Intelligence Analytics Advanced…

Why Coding is Important for Data Scientists

As a data scientist in training, much of my orientation to the field has been about what skills are needed to become one. From my research and exposure to the field, the knowledge, experience, and skill sets that data scientists need span domain expertise, computer science, and statistics. It appears that the most successful…

Analysis of Lottery Draws Between 2009 and 2017

This project analyzes a dataset of historical lottery draws between 2009 and 2017 inclusive, scraped from a lottery's website by my colleague, Gregory Horne. We wanted to know whether the winning numbers could be predicted from past draws, but first needed to know if the winning numbers clustered, or were…