Posts

What Separates the Top PGA Tour Players from the Rest of the Field

Image
This article uses explainable machine learning techniques to identify what separates the top PGA Tour players from the rest of the competition using H2O and LIME. The article and visualizations were produced using Tableau Public and the predictions and explainable machine learning techniques were performed in R. Click here for the full article which was published on Tableau Public.

Python v R Series: Part 09: Aggregations

Image
  Introduction Aggregating data in R and Python is fairly similar.   The examples were inspired by Matt Dancho and his incredible library of training videos on  Business Science University .  In the R examples, we will be using the RStudio IDE.  It is assumed that you have a project set up in the folder where the data files exist. For more information about getting a project set up in R Studio and the Python environment set up in VS Code, click the links below. Setting Up Projects in R Setting Up Python VS Code Environment R: Aggregating Data In the previous articles, we detailed how to perform the following functions listed below.  For more information, click on the links below. 01: Data Imports 02: Join Data 03: Change Data Types 04: String Splits 05: Calculate New Columns 06: Organize Columns  07: Rename Columns 08: Saving Data library(tidyverse) library(readxl) bikes <- read_excel( "00_Data_Files/bikes.xlsx" ) bikeshops <- r...

MySQL Part 9: Joins

Image
Introduction In this article, we discuss joining data in MySQL.  In the last article, we discussed logical operators.  Click the link below to check out that article. MySQL Part 8: Logical Operations To begin your MySQL session in the goormIDE, please be sure to type this code into the terminal. mysql - ctl cli Please ensure that the bike_shop.sql file is selected and available.  If not, double-click it on the left-hand side.  This is where we will type most of our code. If there is information saved in there, go ahead and clear out that information. Joins Joins in databases are very powerful and help combine two or more tables.  If you are familiar with VLOOKUPs in Excel, they somewhat behave in similar fashions. There are many different ways to join data.  This article will focus specifically on left joins, but the images for the other join types are listed below.   Source: https://www.dofactory.com/sql/join Data Before we get started on th...

Accessing Spotify's Track Info Using R

Image
Introduction Spotify has an API that allows you to build applications from its platform.  This article shows you how to gather the data necessary to construct the Tableau Public application shown above using R.  This article also assumes that you already have downloaded the Spotify application. To view the live version of the Tableau Public application, please click here . Some additional resources I used along the way are listed below.  The first article goes into greater detail regarding retrieving the appropriate tokens and keys necessary to ping Spotify's API. https://medium.com/swlh/accessing-spotifys-api-using-r-1a8eef0507c http://rcharlie.net/sentify/ https://www.rcharlie.com/spotifyr/ https://developer.spotify.com/ Tokens, IDs, and Secrets As stated earlier, you will need several items to access the Spotify API.  Many items are available to download additional information, but this article requires only the 5 items listed below. profileID clientID secret auth...

Predicting Employee Attrition

Image
  Project Description Reducing turnover in an organization is vital. This project utilized advanced machine learning methods that accurately predicted which employees have a higher or lesser likelihood to leave using a fabricated data set. This project also displayed which key features influenced Employee Attrition at both the macro level and the individual employee level. Acknowledgment I want to thank and give credit to Matt Dancho, CEO and Founder of Business Science University . The advanced machine learning methods taught in his courses were heavily implemented in this application. Business Science University is truly an end-to-end journey that gently walks you through how professional data scientists set up, build, and deploy bleeding-edge machine learning models. Contents Data Descriptions, Correlations, and Variable Importance H2O’s GBM Model Metrics Top 15 Employees Most Likely to Leave Explain Why Individual Employees Are Likely to Leave Using Shapley Takeaways and Next S...