Analysis Prompts¶
Loading Data¶
Python |
Process |
Access |
Summarize |
Visualize |
Prepare |
Construct |
---|---|---|---|---|---|---|
No |
No |
Yes |
No |
No |
No |
No |
Look at all of the different ways pandas can load data. Consider some of the questions below and add a notebook that’s styled like a report that answers a few of them with text and code.
Which seem like good idea? are any dangerous?
Which seem more or less common?
Can you compare them on speed? Is it ever worth transforming a dataset before loading?
How much can you repair a dataset using the parameters of the load functions?
CheatSheet¶
Python |
Process |
Access |
Summarize |
Visualize |
Prepare |
Construct |
Classification |
Clustering |
Regression |
Evaluation |
---|---|---|---|---|---|---|---|---|---|---|
No |
No |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
Make a cheatsheet with examples of the several different parameter settings for common operations for one topic.
This cheatsheet is an example, it’s too broad, but it’s the same idea. Yours should
Deeper Analysis¶
Python |
Process |
Access |
Summarize |
Visualize |
Prepare |
Construct |
Classification |
Clustering |
Regression |
Evaluation |
---|---|---|---|---|---|---|---|---|---|---|
Yes |
No |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
For one of the assignments, if there was something you were curious about. Try it out and investigate how to answer it. Vary parameters and document your investigation.
New Analysis¶
Python |
Process |
Access |
Summarize |
Visualize |
Prepare |
Construct |
Classification |
Clustering |
Regression |
Evaluation |
---|---|---|---|---|---|---|---|---|---|---|
Yes |
No |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
For one a topic of interest, clean, explore and model the data. Work with messy data or data provided in multiple files to earn prepare and construct achievements or use clean data to earn only EDA and modeling achievements.