Data management

60 views 8:42 am 0 Comments March 20, 2023

 

Data management contains multiple steps, including data cleaning and exploratory analysis. In this project, you will showcase skill in data management using Pandas.

 

Data

 

You will use publicly available files. The first contains data on causes of death, while the second contains population data. Both files have state-level information for multiple years.

 

 

 

 

 

Requirements

 

 

  1. To demonstrate pandas skills and ability, answer these questions:

 

    1. Are Americans facing increasing, decreasing, or steady likelihood of death?

 

    1. What are the four leading causes of death for Americans?

 

    1. Do individual states show the same four leading causes of death?

 

    1. Are there year-by-year changes in the four leading causes of death nationwide?

 

 

 

  1. Use appropriately constructed and formatted tables to show results. There is no need to use visualization in this project.

 

  1. Use population data appropriately to demonstrate your understanding of how variables are normalized/standardized.

 

  1. Show skill in constructing a formal report using Jupyter.

 

 

Your formal report should contain components such as:

 

 

  • An introduction that discusses the scope of the analysis

 

  • A description of data used in the analysis along with data cleaning procedures

 

  • Code that clearly shows how an algorithm is implemented

 

  • Results

 

  • Discussion of results and generation of insight when appropriate

 

  • Summary when appropriate