Exploding a list into a nested list array, and then into a multidimensional array in PHP

Today, I show you how to convert (explode) a list, in this case a list of authors, first and last names separated by a comma, into a nested list array, and then iterate through the nested list array and convert (explode) into a multidimensional array:     Exploding a list   <?php $AUTHOR_NAMES = “Carlo…

Retrieving, Processing, and Visualizing Enormous Email Data

In this project, I will retrieve a large email data set from the web, and then I will place it into a database. I will then analyze and clean up the email data so that it can be visualized. I will run the Python script gmane.py to retrieve the email data, then will place the…

Google Geocoding API with a Database and Visualization on Google Maps

Today I will use the Google Geocoding API with a database, and visualize the mapping data on Google Maps. In this project, I have a list of university names with no location information, and I will proceed to add location coordinates to the universities using the Google Geocoding API, load those university names with location…

Webcrawl and Pagerank of a Website

Today, I will demonstrate a webcrawl and pagerank of a website. For the parser, I’m using a python code, spider.py, which incorporates BeautifulSoup, a Python library for pulling data out of HTML and XML files. I’ll limit the amount of pages to crawl to 100, and will crawl the website AnxietyBoss.com, a leading website for anxiety,…

A Tour of Data Science Educational Programs

In my quest to become a data scientist, I have embarked on a series of educational journeys, which have included both formal, in-school, educational forums and self-learning, self-paced MOOCS (massive open online courses). Let me start at the beginning of my path to getting an education and training in data science. Business Intelligence Analytics Advanced…

Why Coding is Important for Data Scientists

As a Data Scientist in training, much of my orientation to the field has been about what skills are needed to become one. In my research and exposure to the field of data science, the knowledge, experience, and skillsets that data scientists have are domain expertise, computer science, and statistics. It appears that the most successful…

Analysis of Lottery Draws Between 2009 and 2017

This project entails the analysis of a dataset of historical lottery draws between 2009 and 2017 inclusive, scraped from the website of a lottery by my colleague, Gregory Horne. We had a question whether the winning numbers could be predicted, based on past draws, but needed to know if the winning numbers clustered, or were…