Skip to content

Latest commit

 

History

History
21 lines (14 loc) · 1.54 KB

python_cse250.md

File metadata and controls

21 lines (14 loc) · 1.54 KB

This challenge is a short assignment in our data wrangling and visualization course that is done in R. We would like you to do it the task in Python using Pandas and Altair.

You can read about the baseball data at seanlahman.com. Please use the SQLite database as your data sources.

The Class Task

Over the campfire, you and a friend get into a debate about which college in Utah has had the best MLB success. As an avid BYU fan, you want to prove your point, and you go to data to settle the debate. You need a clear visualization that depicts the performance of BYU players compared to other Utah college players that have played in the major leagues. This data is great for testing out our relational data skills.

Tasks

  • Make a private copy of this repo using the template button.
  • Invite the GitHub user hathawayj to your repo.
  • Use pandas and Altair in your Python code.
  • Find the 4-5 different data sets that you will need to show full college and player names as well as their annual earnings.
  • Make a convincing chart showing how professional baseball player earnings that played baseball at BYU compared to the players from other Utah schools.
  • Save your .py script and image to your repository.
  • Submit a link to your private repository.

Background

Here is the similar task in CSE 250 - https://byuistats.github.io/CSE250-Hathaway/projects/project-3/.