Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft episode/lesson content #15

Open
wants to merge 20 commits into
base: gh-pages
Choose a base branch
from

Conversation

brymz
Copy link
Collaborator

@brymz brymz commented Dec 27, 2016

The PR closes #13 with a rebase to include #14 in the history

@brymz brymz mentioned this pull request Dec 27, 2016
3 tasks
@brymz brymz changed the title WIP: Draft episode/lesson content Draft episode/lesson content Dec 28, 2016
@brymz
Copy link
Collaborator Author

brymz commented Dec 28, 2016

This PR is ready for review.

I chose to develop the bones of a lesson in draft form to present more specific content that we can work from to continue to focus and develop the lesson structure. I incorporated feedback from #12 into the episode content without major revisions to the structure of the lesson. More detailed feedback may motivate such revisions.


### Audience

Have you ever seen a figure that tried to convey way too much? Or a figure that just didn't seem to give you any useful information?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add "Have you ever worried that you've created such figures yourself, and wondered if there was a reliable way to do better?"

This lesson is for scientists that are getting started using R to analyze their
data and want to know more about data visualization using `ggplot2` and want to
practice choosing the best figure to present their data and data analysis. The
lesson should prepare you to choose and generate the best publication quality chart to answer your research question.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

does the chart answer the research question?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

address (?)

@@ -9,119 +9,148 @@ objectives:
- "To read Gapminder data to R"
- "To evaluate Gapminder data structure"
keypoints:
- "Be sure to `setwd()` to point to your data file before importing it."
- "Be sure to setup an RStudio project or `setwd()` to point to your data file
before importing it."
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"Be sure to setup" => "Set up"

@@ -149,3 +178,5 @@ names(data)
> > B. 38
> {: .solution}
{: .challenge}

[data]: {{ site.baseurl }}/data/gapminder_all.csv
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1 to putting all the URLs at the bottom like this - we should do that for all our lessons.


The shape of your data ought to be rectangular and can be described by the number of rows and columns.

Column in your data can also be called __variables__ or __fields__. However, the general use of the term __variable__ to describe a column can also be used more specifically to describe numerically continuous data. As a complement to this specific use of variables, __categories__ describe discrete or categorical data
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No change now, but you can also use [variables]({{site.baseurl}}/reference/#variable) to refer to a glossary entry in reference.md.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

#17

Numerical values should be recognized as __absolute__ or __relative__. Absolute
values receive context by their units, while relative values are standardized in
some fashion (i.e., proportion, per unit) and most often used for comparison
among categories.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good summary.

2. No duplicate rows
3. Every cell contains one value
4. One column per type of information
5. No redundant information
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Include link to Wickham's "Tidy Data" paper http://vita.had.co.nz/papers/tidy-data.pdf

@@ -13,6 +13,61 @@ keypoints:
- "A well detailed research question can be answered with a good chart."
---

## Gapminder Ignorance Project
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why "ignorance"?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think that is its name. The foundation name is Gapminder. The project name is Ignorance. I got these general questions from the about section of the Ignorance Project - https://www.gapminder.org/ignorance/.

world view from a deeper understanding of reliable statistics. From their data,
we can begin to formulate some research questions to explore:

What are the major trends in global populations and economies?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Turn this into a bulleted list?


## Defining your research question for visualization

Dr. Andrew Abela from Catholic University of America and 'Extreme Presentation'
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. Link to Abela's home page?
  2. I think "Extreme Presentation" is Abela's company, so probably more accurate to say "Dr. Andrew Abela from the Catholic University of America put together..."

2. Determine the broad category that describes the goal of your research
question from [Choosing a Good Chart][good-chart].
- *You may need to redefine
or focus your research question to better suit visualization.*
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why make this a sub-bullet? Why not inline in the main bullet (in italics)?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think you're right that conceptually the note is part of the main bullet. I just don't like how it looks. It extends the line across the page (unlike the other points) and has an awkward wrap. If you feel strongly about it, I would be happy to make the change or consider a different option.

All visualization elements are clearly labeled and/or included in a legend and
are described in the accompanying caption for the graphic. These elements
include the axis, data shape and color, and any statistical elements giving
analytical context to the data.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Q: is it worth having a callout somewhere saying that not all figures need to be publication quality? I.e., it's OK to cut corners and do the "wrong" thing sometimes, and explain when?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

potentially relevant recommendation to reference: Rule 9 from Ten Simple Rules for Better Figures: Message Trumps Beauty

image

Figure 8. Message trumps beauty.
This figure is an extreme case where the message is particularly clear even if the aesthetic of the figure is questionable. The uncanny valley is a well-known hypothesis in the field of robotics that correlates our comfort level with the human-likeness of a robot. To express this hypothetical nature, hypothetical data were used () and the figure was given a sketched look (xkcd filter on matplotlib) associated with a cartoonish font that enhances the overall effect. Tick labels were also removed since only the overall shape of the curve matters. Using a sketch style conveys to the viewer that the data is approximate, and that it is the higher-level concepts rather than low-level details that are important [10].

~~~
{: .r}

> ## Density Plot Fill-in Challenge Continued
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work.

across the world per country in 2007?"
- Relationship with three variables (*at least two are continuous*)
- Bubble Chart
> ## What is it's category?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"its" (possessive) rather than "it's" (contraction of "it is")

2. Determine the broad category that describes the goal of your research
question from [Choosing a Good Chart][good-chart].
- *You may need to redefine
or focus your research question to better suit visualization.*
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Again, why a sub-bullet?

Copy link
Contributor

@gvwilson gvwilson left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for all of this - the lesson is looking really good.

@brymz
Copy link
Collaborator Author

brymz commented Jan 4, 2017

Thanks for the thoughtful feedback, @gvwilson & @aflaxman. I have made revisions or added comments as per the review. This is ready for another look.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants