Skip to content

Commit

Permalink
Rescrape of Galiano data 2024-10-29
Browse files Browse the repository at this point in the history
  • Loading branch information
amb26 committed Oct 29, 2024
1 parent d776430 commit 381b86c
Show file tree
Hide file tree
Showing 10 changed files with 190,211 additions and 8 deletions.
6,869 changes: 6,869 additions & 0 deletions data/Galiano 2024/Galiano_Coll_Not_Trad_2024_10_29.csv

Large diffs are not rendered by default.

778 changes: 778 additions & 0 deletions data/Galiano 2024/Galiano_Trad_Not_Coll_2024_10_29.csv

Large diffs are not rendered by default.

46,413 changes: 46,413 additions & 0 deletions data/Galiano 2024/Galiano_Union_Catalogue_2024_10_29.csv

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion data/Galiano 2024/combinedOutMap.json
Original file line number Diff line number Diff line change
Expand Up @@ -57,7 +57,7 @@
"datasets": {
"iNat": {
"name": "iNaturalist (2005-2021)",
"input": "E:/source/gits/bagatelle/data/Galiano 2024/Galiano_Union_Catalogue_2024_07_31.csv",
"input": "E:/source/gits/bagatelle/data/Galiano 2024/Galiano_Union_Catalogue_2024_10_29.csv",
"map": "E:/source/gits/bagatelle/data/iNaturalist/iNaturalist-obs-map-new-cap.json",
"outMap": "E:/source/gits/bagatelle/data/iNaturalist/iNaturalist-obs-out-map-cap.json",
"colour": "#2C8C99",
Expand Down
4 changes: 2 additions & 2 deletions data/Galiano 2024/fusion.json5
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
datasets: {
iNat: {
name: "iNaturalist (2005-2021)",
input: "%imerss-bioinfo/data/Galiano 2024/Galiano_Union_Catalogue_2024_07_31.csv",
input: "%imerss-bioinfo/data/Galiano 2024/Galiano_Union_Catalogue_2024_10_29.csv",
map: "%imerss-bioinfo/data/iNaturalist/iNaturalist-obs-map-new-cap.json",
outMap: "%imerss-bioinfo/data/iNaturalist/iNaturalist-obs-out-map-cap.json",
colour: "#2C8C99",
Expand Down Expand Up @@ -38,7 +38,7 @@
},

summarise: true,
output: "%imerss-bioinfo/data/Galiano 2024/reintegrated.csv",
output: "%imerss-bioinfo/data/Galiano 2024/reintegrated-2024-10-29.csv",
combinedOutMap: "%imerss-bioinfo/data/Galiano 2024/combinedOutMap.json",
filters: {
}
Expand Down
45,767 changes: 45,767 additions & 0 deletions data/Galiano 2024/reintegrated-2024-10-29-obs.csv

Large diffs are not rendered by default.

5,193 changes: 5,193 additions & 0 deletions data/Galiano 2024/reintegrated-2024-10-29.csv

Large diffs are not rendered by default.

12 changes: 7 additions & 5 deletions data/Galiano 2024/synthesizeTradColl.R
Original file line number Diff line number Diff line change
@@ -1,9 +1,11 @@
library(dplyr)

setwd(dirname(rstudioapi::getActiveDocumentContext()$path))

source("utils.R")

trad <- timedRead("../iNaturalist/Galiano_Trad_Catalogue_2024_07_31.csv")
coll <- timedRead("../iNaturalist/Galiano_Coll_Catalogue_2024_07_31.csv")
trad <- timedRead("../iNaturalist/Galiano_Trad_Catalogue_2024_10_29.csv")
coll <- timedRead("../iNaturalist/Galiano_Coll_Catalogue_2024_10_29.csv")

tradNotColl <- dplyr::anti_join(trad, coll, by=c("id"))
collNotTrad <- dplyr::anti_join(coll, trad, by=c("id"))
Expand All @@ -13,6 +15,6 @@ collTradUnion <- merge(trad, coll, all=TRUE)
# If some fields, e.g. commonName contain discrepant values we may end up with two rows
collTradUnion <- collTradUnion[!duplicated(collTradUnion$id), ]

timedWrite(tradNotColl, "Galiano_Trad_Not_Coll_2024_07_31.csv")
timedWrite(collNotTrad, "Galiano_Coll_Not_Trad_2024_07_31.csv")
timedWrite(collTradUnion, "Galiano_Union_Catalogue_2024_07_31.csv")
timedWrite(tradNotColl, "Galiano_Trad_Not_Coll_2024_10_29.csv")
timedWrite(collNotTrad, "Galiano_Coll_Not_Trad_2024_10_29.csv")
timedWrite(collTradUnion, "Galiano_Union_Catalogue_2024_10_29.csv")
45,636 changes: 45,636 additions & 0 deletions data/iNaturalist/Galiano_Coll_Catalogue_2024_10_29.csv

Large diffs are not rendered by default.

39,545 changes: 39,545 additions & 0 deletions data/iNaturalist/Galiano_Trad_Catalogue_2024_10_29.csv

Large diffs are not rendered by default.

0 comments on commit 381b86c

Please sign in to comment.