mjpdatascience - Tumblr blog

mjpdatascience · 7 years ago

Week 4 DAT

Table of Contents:

Discussion

My Week 4 Output

Week 4 Program

1) Discussion

Is crater depth correlated with crater diameter? More energetic impacts are expected to leave wider craters, but do they leave deeper craters? If there *is* a relationship between crater depth and diameter, does that relationship change depending on whether the craters are in the northern or southern hemisphere of Mars? Last week, crater diameter and depth were analyzed. The data were subset to exclude craters with zero or negative depth. A scatterplot of crater depth vs. diameter with a least-squares linear fit shows a positive trend, but the points are widely scattered around the line. That said, the Pearson correlation gives an r value of 0.49 (r^2 = 0.24), with a p value so small that the software returns 0.0.

Thus, crater depth and diameter are significantly correlated. However, only 24% of the variation in depth is explained by variation in diameter.
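Why does the software print a p value of exactly 0.0? Here is a rough sketch of the arithmetic, using the r and the sample size reported in this post. The helper name `pearson_t_statistic` is my own and not part of scipy. The usual significance test for a Pearson correlation converts r into a t statistic with n - 2 degrees of freedom, and with tens of thousands of craters that statistic becomes enormous.

```python
import math


def pearson_t_statistic(r, n):
    """t statistic for testing 'no correlation', with n - 2 degrees of freedom."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))


# Values copied from the output in this post
r = 0.48599045388140061
n = 76804

r_squared = r * r                    # share of depth variation explained by diameter
t_stat = pearson_t_statistic(r, n)   # grows with sqrt(n), so large samples give huge t

print(f"r^2 = {r_squared:.3f}")
print(f"t = {t_stat:.1f} on {n - 2} degrees of freedom")
```

A t statistic this large corresponds to a p value far below the smallest positive number a 64-bit float can represent (roughly 1e-308), so the software shows 0.0 even though the true p value is not literally zero.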

This week, the data were subset for northern (LATITUDE_CIRCLE_IMAGE > 0) and southern (LATITUDE_CIRCLE_IMAGE < 0) craters. The plots for craters in both hemispheres look similar. The Pearson correlations are as follows:

Northern: r = 0.46 (r^2 = 0.21)

Southern: r = 0.50 (r^2 = 0.25)

Both p values are so small (and thus significant) that the software outputs zeroes.

Thus, the relationship between crater depth and diameter is not moderated by whether the crater is in the northern or southern hemisphere: the correlation is positive and of similar strength in both.
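The moderation check boils down to splitting the data by the third variable and computing the correlation separately in each group. A minimal sketch of that idea in plain Python follows. The toy rows and the `pearson_r` helper are made up for illustration and are not values or code from the Mars crater analysis.

```python
import math
from collections import defaultdict


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical toy rows: (latitude, diameter, depth)
craters = [
    (12.0, 1.0, 0.10), (35.5, 2.0, 0.25), (60.1, 3.0, 0.20), (5.2, 4.0, 0.45),
    (-8.3, 1.5, 0.12), (-22.0, 2.5, 0.30), (-47.9, 3.5, 0.28), (-70.4, 5.0, 0.55),
]

# Split by hemisphere; a latitude of exactly 0 lands in neither group,
# mirroring the strict > 0 and < 0 subsets used in the post
groups = defaultdict(lambda: ([], []))
for latitude, diameter, depth in craters:
    if latitude == 0:
        continue
    key = "north" if latitude > 0 else "south"
    groups[key][0].append(diameter)
    groups[key][1].append(depth)

# One correlation per hemisphere
r_by_hemisphere = {key: pearson_r(xs, ys) for key, (xs, ys) in groups.items()}

for key, r in sorted(r_by_hemisphere.items()):
    print(f"{key}: r = {r:.2f}, r^2 = {r * r:.2f}")
```

If the per-group r values have the same sign and similar size, as they do for the two hemispheres here, the grouping variable is not acting as a moderator in the sense used in this course.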

2) My Week 4 Output

length of data: 384343

length of data after subset: 76804

association between crater diameter and depth Out[13]: (0.48599045388140061, 0.0)

length of marsnorth after subset: 28623

length of marssouth after subset: 48180

association between crater diameter and depth for northern hemisphere Out[18]: (0.45861085510523519, 0.0)

association between crater diameter and depth for souther hemisphere Out[19]: (0.49619338821080183, 0.0)

3) Week 4 Program

# -*- coding: utf-8 -*-
"""
Created on Sat Feb 17 18:57:28 2018

@author: MJP
"""

import pandas
import numpy
import seaborn
import scipy.stats  # import the stats submodule explicitly so scipy.stats.pearsonr is available
import matplotlib.pyplot as plt

# Read the Mars Crater Database into memory
marsdata = pandas.read_csv("dab_marscrater_pds.csv", low_memory=False)

# Set pandas to show all columns in DataFrame
pandas.set_option('display.max_columns', None)
# Set pandas to show all rows in DataFrame
pandas.set_option('display.max_rows', None)

# bug fix (that I don't fully understand) "for display formats to avoid run time errors", or so our instructors tell us
pandas.set_option('display.float_format', lambda x: '%f' % x)

# variables of interest are already numeric, so no need to change

# check length of data
print("length of data:")
print(len(marsdata))

# subset data for craters with depth > 0 (i.e. no raised craters or depthless ones)
marssub1 = marsdata[marsdata['DEPTH_RIMFLOOR_TOPOG'] > 0]

# make a copy of my new subsetted data
marssub2 = marssub1.copy()

# check that data are properly subset
print("\nlength of data after subset:")
print(len(marssub2))

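# basic scatterplot: Q->Q
scat1 = seaborn.regplot(x="DIAM_CIRCLE_IMAGE", y="DEPTH_RIMFLOOR_TOPOG", fit_reg=True, data=marssub2)
# set the labels after plotting so regplot does not overwrite them
plt.xlabel('Crater Diameter')
plt.ylabel('Crater Depth')
plt.title('Scatterplot for the Association Between Crater Diameter and Depth')
plt.show()  # finish this figure so the next plot is not drawn on the same axes

# run interactively (IPython) so the (r, p value) tuple is echoed as Out[...]
print('association between crater diameter and depth')
scipy.stats.pearsonr(marssub2['DIAM_CIRCLE_IMAGE'], marssub2['DEPTH_RIMFLOOR_TOPOG'])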

# subset data for craters in northern hemisphere
marsnorth = marssub2[marssub2['LATITUDE_CIRCLE_IMAGE'] > 0]

# subset data for craters in southern hemisphere
marssouth = marssub2[marssub2['LATITUDE_CIRCLE_IMAGE'] < 0]

# check that data are properly subset
print("\nlength of marsnorth after subset:")
print(len(marsnorth))

print("\nlength of marssouth after subset:")
print(len(marssouth))

# basic scatterplot: Q->Q (northern hemisphere)
scat2 = seaborn.regplot(x="DIAM_CIRCLE_IMAGE", y="DEPTH_RIMFLOOR_TOPOG", fit_reg=True, data=marsnorth)
plt.xlabel('Crater Diameter')
plt.ylabel('Crater Depth')
plt.title('Scatterplot for the Northern Hemisphere Association Between Crater Diameter and Depth')
plt.show()  # finish this figure before drawing the southern plot

# basic scatterplot: Q->Q (southern hemisphere)
scat3 = seaborn.regplot(x="DIAM_CIRCLE_IMAGE", y="DEPTH_RIMFLOOR_TOPOG", fit_reg=True, data=marssouth)
plt.xlabel('Crater Diameter')
plt.ylabel('Crater Depth')
plt.title('Scatterplot for the Southern Hemisphere Association Between Crater Diameter and Depth')
plt.show()

print('association between crater diameter and depth for northern hemisphere')
scipy.stats.pearsonr(marsnorth['DIAM_CIRCLE_IMAGE'], marsnorth['DEPTH_RIMFLOOR_TOPOG'])

print('association between crater diameter and depth for souther hemisphere')
scipy.stats.pearsonr(marssouth['DIAM_CIRCLE_IMAGE'], marssouth['DEPTH_RIMFLOOR_TOPOG'])

mjpdatascience · 7 years ago

Week 3 DAT

Table of Contents:

Discussion

My Week 3 Output

Week 3 Program

1) Discussion

Is crater depth correlated with crater diameter? More energetic impacts are expected to leave wider craters, but do they leave deeper craters? Crater diameter and depth were analyzed. The data were subset to exclude craters with zero or negative depth. A scatterplot of crater depth vs. diameter with a least-squares linear fit shows a positive trend, but the points are widely scattered around the line. That said, the Pearson correlation gives an r value of 0.49 (r^2 = 0.24), with a p value so small that the software returns 0.0.

Thus, crater depth and diameter are significantly correlated. However, only 24% of the variation in depth is explained by variation in diameter.
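To see what "variation explained" means, here is a small sketch with made-up numbers (not rows from the Mars crater data). It fits a least-squares line by hand and checks that the squared Pearson r equals the share of the depth variation that the line accounts for. All variable names are my own.

```python
import math

# Hypothetical toy measurements, made up for illustration
diameters = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
depths = [0.10, 0.30, 0.20, 0.50, 0.35, 0.60]

n = len(diameters)
mean_x = sum(diameters) / n
mean_y = sum(depths) / n

# Sums of squares and cross-products around the means
sxx = sum((x - mean_x) ** 2 for x in diameters)
syy = sum((y - mean_y) ** 2 for y in depths)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(diameters, depths))

# Least-squares line: depth = intercept + slope * diameter
slope = sxy / sxx
intercept = mean_y - slope * mean_x

# Pearson correlation coefficient
r = sxy / math.sqrt(sxx * syy)

# Share of the variation in depth that the fitted line accounts for
ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(diameters, depths))
explained_fraction = 1.0 - ss_res / syy

print(f"r = {r:.3f}, r^2 = {r * r:.3f}, explained fraction = {explained_fraction:.3f}")
```

For a straight-line fit with one predictor the two numbers always agree, which is why an r of 0.49 translates to about 24% of the variation in depth being explained by diameter.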

2) My Week 3 Output

length of data: 384343

length of data after subset: 76804

association between crater diameter and depth Out[7]: (0.48599045388140061, 0.0)

3) Week 3 Program

# -*- coding: utf-8 -*- """ Created on Sat Feb 10 18:57:28 2018