Chapter 2 Get the data
In this section you retrieve some example data from the Internet. This data is used later in the guide.
The data set for this guide is available online as a git hub repository.
- Go the file, by clicking on the following git hub address:
(https://raw.githubusercontent.com/tercen/starters-guide2/master/data/crabs_longformat.csv)
- Once it opens, right click and select
Save as...
Save your data on your desktop. Use the default name (it should be crabs_longformat.csv
).
Crab data description:
The data set is called “crabs” and it is in a long format. It has a total of 1000 measurements of crab physical traits. It is composed of four groups (two sexes and two species) of 50 measurements for five traits variables, FL
(frontal lobe size in mm), RW
(rear width mm), CL
(carapace length mm), CW
(carapace width mm) and BD
(body depth mm). In summary it is:
Factor | values |
---|---|
sp |
species, B or O for Blue or Orange. |
sex |
M or F |
index |
index 1 to 50 within each of the four groups, 2 sex and 2 species |
variable |
one of five variables: FL , RW , CL , CW and BD |
measurement |
the value of one of the five variable |
observation |
index 1 to 200 observations for each variable |