Introduction
In the following paper, I aim to clarify the specific risks of having a child at an older age. I examine how increased maternal age affects infant weight, gestation time, and number of miscarriages.
In order to determine how maternal factors influence infant health, I use data from Google Big Query's public dataset natality. To isolate data from the natality dataset that will allow my specific analyses, I use Google Cloud Datalab and SQL query. SQL query allows me to parse the natality dataframe into a pandas dataframe, keeping only the variables needed for the specific analyses. These analyses include maternal age's effect on the following three indicators of infant health: infant weight, gestation time, and number of miscarriages. Within Google Cloud Datalab's Jupyter notebook function, I use matplotlib to create graphs for each respective pandas dataframe. Each graph provides a visual for the relationship between one maternal risk factor and one predictor of infant health.
View the Full Project Below