-
-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft episode/lesson content #15
base: gh-pages
Are you sure you want to change the base?
Conversation
This PR is ready for review. I chose to develop the bones of a lesson in draft form to present more specific content that we can work from to continue to focus and develop the lesson structure. I incorporated feedback from #12 into the episode content without major revisions to the structure of the lesson. More detailed feedback may motivate such revisions. |
|
||
### Audience | ||
|
||
Have you ever seen a figure that tried to convey way too much? Or a figure that just didn't seem to give you any useful information? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Add "Have you ever worried that you've created such figures yourself, and wondered if there was a reliable way to do better?"
This lesson is for scientists that are getting started using R to analyze their | ||
data and want to know more about data visualization using `ggplot2` and want to | ||
practice choosing the best figure to present their data and data analysis. The | ||
lesson should prepare you to choose and generate the best publication quality chart to answer your research question. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
does the chart answer the research question?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
address (?)
@@ -9,119 +9,148 @@ objectives: | |||
- "To read Gapminder data to R" | |||
- "To evaluate Gapminder data structure" | |||
keypoints: | |||
- "Be sure to `setwd()` to point to your data file before importing it." | |||
- "Be sure to setup an RStudio project or `setwd()` to point to your data file | |||
before importing it." |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"Be sure to setup" => "Set up"
@@ -149,3 +178,5 @@ names(data) | |||
> > B. 38 | |||
> {: .solution} | |||
{: .challenge} | |||
|
|||
[data]: {{ site.baseurl }}/data/gapminder_all.csv |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 to putting all the URLs at the bottom like this - we should do that for all our lessons.
|
||
The shape of your data ought to be rectangular and can be described by the number of rows and columns. | ||
|
||
Column in your data can also be called __variables__ or __fields__. However, the general use of the term __variable__ to describe a column can also be used more specifically to describe numerically continuous data. As a complement to this specific use of variables, __categories__ describe discrete or categorical data |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No change now, but you can also use [variables]({{site.baseurl}}/reference/#variable)
to refer to a glossary entry in reference.md
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Numerical values should be recognized as __absolute__ or __relative__. Absolute | ||
values receive context by their units, while relative values are standardized in | ||
some fashion (i.e., proportion, per unit) and most often used for comparison | ||
among categories. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good summary.
2. No duplicate rows | ||
3. Every cell contains one value | ||
4. One column per type of information | ||
5. No redundant information |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Include link to Wickham's "Tidy Data" paper http://vita.had.co.nz/papers/tidy-data.pdf
@@ -13,6 +13,61 @@ keypoints: | |||
- "A well detailed research question can be answered with a good chart." | |||
--- | |||
|
|||
## Gapminder Ignorance Project |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why "ignorance"?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think that is its name. The foundation name is Gapminder. The project name is Ignorance. I got these general questions from the about section of the Ignorance Project - https://www.gapminder.org/ignorance/.
world view from a deeper understanding of reliable statistics. From their data, | ||
we can begin to formulate some research questions to explore: | ||
|
||
What are the major trends in global populations and economies? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Turn this into a bulleted list?
|
||
## Defining your research question for visualization | ||
|
||
Dr. Andrew Abela from Catholic University of America and 'Extreme Presentation' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Link to Abela's home page?
- I think "Extreme Presentation" is Abela's company, so probably more accurate to say "Dr. Andrew Abela from the Catholic University of America put together..."
2. Determine the broad category that describes the goal of your research | ||
question from [Choosing a Good Chart][good-chart]. | ||
- *You may need to redefine | ||
or focus your research question to better suit visualization.* |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why make this a sub-bullet? Why not inline in the main bullet (in italics)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think you're right that conceptually the note is part of the main bullet. I just don't like how it looks. It extends the line across the page (unlike the other points) and has an awkward wrap. If you feel strongly about it, I would be happy to make the change or consider a different option.
All visualization elements are clearly labeled and/or included in a legend and | ||
are described in the accompanying caption for the graphic. These elements | ||
include the axis, data shape and color, and any statistical elements giving | ||
analytical context to the data. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Q: is it worth having a callout somewhere saying that not all figures need to be publication quality? I.e., it's OK to cut corners and do the "wrong" thing sometimes, and explain when?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
potentially relevant recommendation to reference: Rule 9 from Ten Simple Rules for Better Figures: Message Trumps Beauty
Figure 8. Message trumps beauty.
This figure is an extreme case where the message is particularly clear even if the aesthetic of the figure is questionable. The uncanny valley is a well-known hypothesis in the field of robotics that correlates our comfort level with the human-likeness of a robot. To express this hypothetical nature, hypothetical data were used () and the figure was given a sketched look (xkcd filter on matplotlib) associated with a cartoonish font that enhances the overall effect. Tick labels were also removed since only the overall shape of the curve matters. Using a sketch style conveys to the viewer that the data is approximate, and that it is the higher-level concepts rather than low-level details that are important [10].
~~~ | ||
{: .r} | ||
|
||
> ## Density Plot Fill-in Challenge Continued |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice work.
across the world per country in 2007?" | ||
- Relationship with three variables (*at least two are continuous*) | ||
- Bubble Chart | ||
> ## What is it's category? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"its" (possessive) rather than "it's" (contraction of "it is")
2. Determine the broad category that describes the goal of your research | ||
question from [Choosing a Good Chart][good-chart]. | ||
- *You may need to redefine | ||
or focus your research question to better suit visualization.* |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Again, why a sub-bullet?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for all of this - the lesson is looking really good.
The PR closes #13 with a
rebase
to include #14 in the history