Certain line plots, for example the EDA for Women in State Legislature, provide no information. With so many cities in a single line plot, it is very hard for the reader to follow.
EDA Analysis and Description
D
The first horizontal bar plot could be a line graph, since you are working with time series data. For table 4: aisau4 (focusing on regions), the comment that "this could be due to population" is vague and seems inaccurate. By that logic, the South should have the highest number of clinics, since it has the largest share of the United States population. [Reference: https://www.census.gov/popclock/data_tables.php?component=growth] The population difference between the Midwest and the West is roughly 3%. In some places, you state what you plotted but not what you learned; the text should guide the viewer toward what is to be learned.
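One way to check whether population explains the regional differences is to normalize clinic counts by population before comparing regions. The sketch below is only an illustration under stated assumptions: the function name `rate_per_capita`, the `per` parameter, and the example numbers are hypothetical placeholders, not values from the project or from the census.

```python
def rate_per_capita(counts, populations, per=100_000):
    """Return {region: count per `per` residents}.

    Every region in `counts` must also appear in `populations`.
    """
    rates = {}
    for region, count in counts.items():
        population = populations[region]
        if population <= 0:
            raise ValueError(f"population for {region!r} must be positive")
        # Multiply before dividing to keep the arithmetic as exact as possible.
        rates[region] = count * per / population
    return rates


# Placeholder numbers for illustration only; NOT real clinic or census figures.
example_counts = {"Region A": 50, "Region B": 50}
example_populations = {"Region A": 1_000_000, "Region B": 2_000_000}
example_rates = rate_per_capita(example_counts, example_populations)
```

If the per-capita rates still differ widely between regions, population alone is unlikely to account for the raw counts, and the write-up should say so.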
EDA Figures
D
The bar plot for states should be sorted in ascending or descending order; an arbitrary order does not help the reader. The visual quality of the plots needs improvement: add titles and descriptive axis labels. The alignment of x-axis labels is also inconsistent across the bar plots.
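The points above (sorted bars, a title, descriptive axis labels, and consistent x-axis label alignment) could be handled by one shared plotting helper. This is a minimal sketch assuming the project uses matplotlib; the helper names `sort_for_bar_plot` and `plot_sorted_bars` and their parameters are hypothetical, not taken from the submission.

```python
def sort_for_bar_plot(values, descending=True):
    """Return (label, value) pairs ordered by value."""
    return sorted(values.items(), key=lambda item: item[1], reverse=descending)


def plot_sorted_bars(values, title, xlabel, ylabel, descending=True):
    """Draw a bar plot with bars ordered by height and labeled axes."""
    # Imported here so the sorting helper works even without matplotlib installed.
    import matplotlib.pyplot as plt

    pairs = sort_for_bar_plot(values, descending)
    labels = [label for label, _ in pairs]
    heights = [value for _, value in pairs]

    fig, ax = plt.subplots(figsize=(10, 4))
    ax.bar(labels, heights)
    ax.set_title(title)
    ax.set_xlabel(xlabel)
    ax.set_ylabel(ylabel)
    # Apply the same rotation and alignment in every plot so tick labels
    # look consistent across figures.
    plt.setp(ax.get_xticklabels(), rotation=45, ha="right")
    fig.tight_layout()
    return fig, ax
```

Routing every bar plot through a single helper like this keeps ordering and label alignment identical across figures.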
Comments
This checkpoint appears to deviate substantially from the earlier checkpoints. The storytelling in the EDA is not tied together well; even for the correlation matrix, there is no explanation of what is going on. There are no comments on whether the analysis is consistent with the hypothesis. If you are working on the prediction task, that appears to be missing as well.
Regrade Feedback
Rubric
Levels: Unsatisfactory, Developing, Proficient, Excellent

EDA relevance
Unsatisfactory: EDA is mostly neither relevant to the question nor helpful in figuring out how to address the question. Or the EDA does address the question, but many obviously relevant variables / analyses / figures were not included. EDA does not explore distributions of single variables, relationships between variables, or both.
Developing: EDA is partly irrelevant/unhelpful. Or some obviously relevant variables / analyses / figures were not included. EDA omits a few distributions of single variables or relationships between variables.
Proficient: EDA is almost all relevant / helpful in addressing the question. No obviously relevant variables / analyses / figures were omitted.
Excellent: Thorough EDA addressed all aspects that are relevant to the question.

EDA analysis and description
Unsatisfactory: Many of the analyses are poor choices (e.g., using means instead of medians for obviously skewed data), or are poorly described in the text, or do not aid understanding the data.
Developing: Some of the analyses are poor choices, or are poorly described in the text, or do not aid understanding the data.
Proficient: All analyses are correct choices. Only one or two have minor issues in the text descriptions supporting them. Mostly they fit well with other elements of the EDA and support understanding the data.
Excellent: All analyses are correct choices with clear text descriptions supporting them. The figures fit well with the other elements of the EDA, producing a clear understanding of the data.

EDA figures
Unsatisfactory: Many of the figures are poor plot choices (e.g., using a bar plot to represent a time series where it would be better to use a line plot) or have poor aesthetics (including colormap, data point shape/color, axis labels, titles, annotations, text legibility) or do not aid understanding the data.
Developing: Some of the figures are poor plot choices or have poor aesthetics. Some figures do not aid understanding the data.
Proficient: All figures are correct plot choices. Only one or two have minor questionable aesthetic choices. The figures mostly fit well with the other elements of the EDA and support understanding the data.
Excellent: All figures are correct plot choices with beautiful aesthetics. The figures fit well with the other elements of the EDA, producing a clear understanding of the data.
Grading Rules
Scoring: Out of 5 points
Each Developing => -1 pt
Each Unsatisfactory => -2 pts
Deductions apply until the score reaches 0; it cannot go below 0.
If students address the detailed feedback in a future checkpoint, they will earn these points back.
DETAILED FEEDBACK should be left in the data section AND anywhere the student addressed proposal feedback but did not do it to your satisfaction.
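The deduction rules above can be sketched as a small function. This is an illustration only: the name `checkpoint_score` is hypothetical, and treating Proficient and Excellent as zero-point deductions is an assumption, since the rules list deductions only for Developing and Unsatisfactory.

```python
def checkpoint_score(ratings, max_points=5):
    """Compute a checkpoint score from one rating per rubric category."""
    # Assumption: Proficient and Excellent carry no deduction.
    deductions = {
        "Unsatisfactory": 2,
        "Developing": 1,
        "Proficient": 0,
        "Excellent": 0,
    }
    score = max_points - sum(deductions[rating] for rating in ratings)
    # The score is floored at 0, per the grading rules.
    return max(score, 0)
```

For example, `checkpoint_score(["Developing", "Developing", "Proficient"])` deducts one point for each Developing rating from the 5-point maximum.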
EDA Checkpoint Feedback
Score (out of 5 pts)
Score = 5