
Airline Safety
The goal of this project was to collect data from a variety of sources, prepare the data, merge and store the datasets into a database, and create visualizations of the data.
I obtained data from the following sources:
unique identification number
title of the book
name of book authors
average rating on Goodreads
ten digit unique identifier
thirteen digit unique idenifier
primary language
total number of pages
total number of ratings received
total number of reviews written
date of book publication
name of book publisher
book title
name of book authors
language in which the book was originially written
year in which the book was first published
how much the book made in millions of dollars
the genre of the book
I performed data transformations and cleaning techniques on each of these datasets, as seen in the following:
Goodreads CSVOnce each dataset was prepared, I merged and stored them in a SQLite database and created visualization of my findings, as seen in the following:
Code for SQL and VisualsThis project provide many learning opportunities. As a whole, it provided an in-depth explanation of the data wrangling process from start to finish. As a hands-on learner, I feel like it provided me with a solid understanding of data wrangling.