Standardising Taxonomic Names in Australian Plants
- Source:R/APCalign-package.R
+ Source: R/APCalign-package.R
APCalign.Rd
Author<
Align Australian plant scientific names to the APC or APNI
- Source: R/align_taxa.R
+ Source: R/align_taxa.R
align_taxa.Rd
@@ -322,7 +322,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/create_species_state_origin_matrix.html b/reference/create_species_state_origin_matrix.html
index 08a277b..3200806 100644
--- a/reference/create_species_state_origin_matrix.html
+++ b/reference/create_species_state_origin_matrix.html
@@ -42,7 +42,7 @@
State level native and introduced origin status
- Source: R/create_species_state_origin_matrix.R
+ Source: R/create_species_state_origin_matrix.R
create_species_state_origin_matrix.Rd
@@ -88,7 +88,7 @@ Examples#> Loading resources into memory...
#>
===========================
=====================================================
================================================================================
#> ...done
-#> # A tibble: 26,507 × 19
+#> # A tibble: 26,509 × 19
#> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not …
@@ -101,7 +101,7 @@ Examples#> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not …
#> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not …
#> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not …
-#> # ℹ 26,497 more rows
+#> # ℹ 26,499 more rows
#> # ℹ 7 more variables: ACT <chr>, CSI <chr>, AR <chr>, MI <chr>, HI <chr>,
#> # MDI <chr>, CaI <chr>
@@ -118,7 +118,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/create_taxonomic_update_lookup.html b/reference/create_taxonomic_update_lookup.html
index eb0167e..f1ffb7e 100644
--- a/reference/create_taxonomic_update_lookup.html
+++ b/reference/create_taxonomic_update_lookup.html
@@ -44,7 +44,7 @@
Create a table with the best-possible scientific name match for Australian plant names
- Source: R/create_taxonomic_update_lookup.R
+ Source: R/create_taxonomic_update_lookup.R
create_taxonomic_update_lookup.Rd
@@ -342,7 +342,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/default_version.html b/reference/default_version.html
index c9744f1..3f79030 100644
--- a/reference/default_version.html
+++ b/reference/default_version.html
@@ -38,7 +38,7 @@
Get the default version for stable data
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
default_version.Rd
@@ -60,7 +60,7 @@ Value
Examples
default_version()
-#> [1] "2024-07+31"
+#> [1] "2024-09-25"
@@ -73,7 +73,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/gbif_lite.html b/reference/gbif_lite.html
index fb37c2f..8f376c2 100644
--- a/reference/gbif_lite.html
+++ b/reference/gbif_lite.html
@@ -36,7 +36,7 @@
@@ -89,7 +89,7 @@ Source<
Lookup Family by Genus from APC
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
get_apc_genus_family_lookup.Rd
@@ -100,7 +100,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/index.html b/reference/index.html
index 9974f33..12f6ce0 100644
--- a/reference/index.html
+++ b/reference/index.html
@@ -176,7 +176,7 @@ Data
diff --git a/reference/load_taxonomic_resources.html b/reference/load_taxonomic_resources.html
index aa3f3c7..5152356 100644
--- a/reference/load_taxonomic_resources.html
+++ b/reference/load_taxonomic_resources.html
@@ -40,7 +40,7 @@
Load taxonomic reference lists, APC & APNI
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
load_taxonomic_resources.Rd
@@ -335,7 +335,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/native_anywhere_in_australia.html b/reference/native_anywhere_in_australia.html
index d6b2d65..1fb56fb 100644
--- a/reference/native_anywhere_in_australia.html
+++ b/reference/native_anywhere_in_australia.html
@@ -38,7 +38,7 @@
Native anywhere in Australia
- Source: R/native_anywhere_in_australia.R
+ Source: R/native_anywhere_in_australia.R
native_anywhere_in_australia.Rd
@@ -115,7 +115,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/reexports.html b/reference/reexports.html
index 8828b56..039308d 100644
--- a/reference/reexports.html
+++ b/reference/reexports.html
@@ -50,7 +50,7 @@
@@ -73,7 +73,7 @@ Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html
index c2948e1..f86649b 100644
--- a/reference/standardise_names.html
+++ b/reference/standardise_names.html
@@ -44,7 +44,7 @@
Standardise taxon names
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
@@ -108,7 +108,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
diff --git a/reference/get_apc_genus_family_lookup.html b/reference/get_apc_genus_family_lookup.html
index f54601e..4cb3c8a 100644
--- a/reference/get_apc_genus_family_lookup.html
+++ b/reference/get_apc_genus_family_lookup.html
@@ -38,7 +38,7 @@
diff --git a/reference/align_taxa.html b/reference/align_taxa.html
index 6e05183..05dbd64 100644
--- a/reference/align_taxa.html
+++ b/reference/align_taxa.html
@@ -52,7 +52,7 @@
Align Australian plant scientific names to the APC or APNI
- Source:R/align_taxa.R
+ Source: R/align_taxa.R
align_taxa.Rd
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/create_species_state_origin_matrix.html b/reference/create_species_state_origin_matrix.html
index 08a277b..3200806 100644
--- a/reference/create_species_state_origin_matrix.html
+++ b/reference/create_species_state_origin_matrix.html
@@ -42,7 +42,7 @@
State level native and introduced origin status
- Source: R/create_species_state_origin_matrix.R
+ Source: R/create_species_state_origin_matrix.R
create_species_state_origin_matrix.Rd
@@ -88,7 +88,7 @@ Examples#> Loading resources into memory...
#>
===========================
=====================================================
================================================================================
#> ...done
-#> # A tibble: 26,507 × 19
+#> # A tibble: 26,509 × 19
#> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not …
@@ -101,7 +101,7 @@ Examples#> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not …
#> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not …
#> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not …
-#> # ℹ 26,497 more rows
+#> # ℹ 26,499 more rows
#> # ℹ 7 more variables: ACT <chr>, CSI <chr>, AR <chr>, MI <chr>, HI <chr>,
#> # MDI <chr>, CaI <chr>
@@ -118,7 +118,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/create_taxonomic_update_lookup.html b/reference/create_taxonomic_update_lookup.html
index eb0167e..f1ffb7e 100644
--- a/reference/create_taxonomic_update_lookup.html
+++ b/reference/create_taxonomic_update_lookup.html
@@ -44,7 +44,7 @@
Create a table with the best-possible scientific name match for Australian plant names
- Source: R/create_taxonomic_update_lookup.R
+ Source: R/create_taxonomic_update_lookup.R
create_taxonomic_update_lookup.Rd
@@ -342,7 +342,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/default_version.html b/reference/default_version.html
index c9744f1..3f79030 100644
--- a/reference/default_version.html
+++ b/reference/default_version.html
@@ -38,7 +38,7 @@
Get the default version for stable data
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
default_version.Rd
@@ -60,7 +60,7 @@ Value
Examples
default_version()
-#> [1] "2024-07+31"
+#> [1] "2024-09-25"
@@ -73,7 +73,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/gbif_lite.html b/reference/gbif_lite.html
index fb37c2f..8f376c2 100644
--- a/reference/gbif_lite.html
+++ b/reference/gbif_lite.html
@@ -36,7 +36,7 @@
@@ -89,7 +89,7 @@ Source<
Lookup Family by Genus from APC
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
get_apc_genus_family_lookup.Rd
@@ -100,7 +100,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/index.html b/reference/index.html
index 9974f33..12f6ce0 100644
--- a/reference/index.html
+++ b/reference/index.html
@@ -176,7 +176,7 @@ Data
diff --git a/reference/load_taxonomic_resources.html b/reference/load_taxonomic_resources.html
index aa3f3c7..5152356 100644
--- a/reference/load_taxonomic_resources.html
+++ b/reference/load_taxonomic_resources.html
@@ -40,7 +40,7 @@
Load taxonomic reference lists, APC & APNI
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
load_taxonomic_resources.Rd
@@ -335,7 +335,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/native_anywhere_in_australia.html b/reference/native_anywhere_in_australia.html
index d6b2d65..1fb56fb 100644
--- a/reference/native_anywhere_in_australia.html
+++ b/reference/native_anywhere_in_australia.html
@@ -38,7 +38,7 @@
Native anywhere in Australia
- Source: R/native_anywhere_in_australia.R
+ Source: R/native_anywhere_in_australia.R
native_anywhere_in_australia.Rd
@@ -115,7 +115,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/reexports.html b/reference/reexports.html
index 8828b56..039308d 100644
--- a/reference/reexports.html
+++ b/reference/reexports.html
@@ -50,7 +50,7 @@
@@ -73,7 +73,7 @@ Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html
index c2948e1..f86649b 100644
--- a/reference/standardise_names.html
+++ b/reference/standardise_names.html
@@ -44,7 +44,7 @@
Standardise taxon names
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
@@ -108,7 +108,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
diff --git a/reference/get_apc_genus_family_lookup.html b/reference/get_apc_genus_family_lookup.html
index f54601e..4cb3c8a 100644
--- a/reference/get_apc_genus_family_lookup.html
+++ b/reference/get_apc_genus_family_lookup.html
@@ -38,7 +38,7 @@
State level native and introduced origin status
- Source:R/create_species_state_origin_matrix.R
+ Source: R/create_species_state_origin_matrix.R
create_species_state_origin_matrix.Rd
Examples#> Loading resources into memory...
#>
===========================
=====================================================
================================================================================
#> ...done
-#> # A tibble: 26,507 × 19
+#> # A tibble: 26,509 × 19
#> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not …
@@ -101,7 +101,7 @@ Examples#> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not …
#> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not …
#> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not …
-#> # ℹ 26,497 more rows
+#> # ℹ 26,499 more rows
#> # ℹ 7 more variables: ACT <chr>, CSI <chr>, AR <chr>, MI <chr>, HI <chr>,
#> # MDI <chr>, CaI <chr>
@@ -118,7 +118,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/create_taxonomic_update_lookup.html b/reference/create_taxonomic_update_lookup.html
index eb0167e..f1ffb7e 100644
--- a/reference/create_taxonomic_update_lookup.html
+++ b/reference/create_taxonomic_update_lookup.html
@@ -44,7 +44,7 @@
Create a table with the best-possible scientific name match for Australian plant names
- Source: R/create_taxonomic_update_lookup.R
+ Source: R/create_taxonomic_update_lookup.R
create_taxonomic_update_lookup.Rd
@@ -342,7 +342,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/default_version.html b/reference/default_version.html
index c9744f1..3f79030 100644
--- a/reference/default_version.html
+++ b/reference/default_version.html
@@ -38,7 +38,7 @@
Get the default version for stable data
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
default_version.Rd
@@ -60,7 +60,7 @@ Value
Examples
default_version()
-#> [1] "2024-07+31"
+#> [1] "2024-09-25"
@@ -73,7 +73,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/gbif_lite.html b/reference/gbif_lite.html
index fb37c2f..8f376c2 100644
--- a/reference/gbif_lite.html
+++ b/reference/gbif_lite.html
@@ -36,7 +36,7 @@
@@ -89,7 +89,7 @@ Source<
Lookup Family by Genus from APC
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
get_apc_genus_family_lookup.Rd
@@ -100,7 +100,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/index.html b/reference/index.html
index 9974f33..12f6ce0 100644
--- a/reference/index.html
+++ b/reference/index.html
@@ -176,7 +176,7 @@ Data
diff --git a/reference/load_taxonomic_resources.html b/reference/load_taxonomic_resources.html
index aa3f3c7..5152356 100644
--- a/reference/load_taxonomic_resources.html
+++ b/reference/load_taxonomic_resources.html
@@ -40,7 +40,7 @@
Load taxonomic reference lists, APC & APNI
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
load_taxonomic_resources.Rd
@@ -335,7 +335,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/native_anywhere_in_australia.html b/reference/native_anywhere_in_australia.html
index d6b2d65..1fb56fb 100644
--- a/reference/native_anywhere_in_australia.html
+++ b/reference/native_anywhere_in_australia.html
@@ -38,7 +38,7 @@
Native anywhere in Australia
- Source: R/native_anywhere_in_australia.R
+ Source: R/native_anywhere_in_australia.R
native_anywhere_in_australia.Rd
@@ -115,7 +115,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/reexports.html b/reference/reexports.html
index 8828b56..039308d 100644
--- a/reference/reexports.html
+++ b/reference/reexports.html
@@ -50,7 +50,7 @@
@@ -73,7 +73,7 @@ Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html
index c2948e1..f86649b 100644
--- a/reference/standardise_names.html
+++ b/reference/standardise_names.html
@@ -44,7 +44,7 @@
Standardise taxon names
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
@@ -108,7 +108,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
diff --git a/reference/get_apc_genus_family_lookup.html b/reference/get_apc_genus_family_lookup.html
index f54601e..4cb3c8a 100644
--- a/reference/get_apc_genus_family_lookup.html
+++ b/reference/get_apc_genus_family_lookup.html
@@ -38,7 +38,7 @@
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/create_taxonomic_update_lookup.html b/reference/create_taxonomic_update_lookup.html
index eb0167e..f1ffb7e 100644
--- a/reference/create_taxonomic_update_lookup.html
+++ b/reference/create_taxonomic_update_lookup.html
@@ -44,7 +44,7 @@
Create a table with the best-possible scientific name match for Australian plant names
- Source: R/create_taxonomic_update_lookup.R
+ Source: R/create_taxonomic_update_lookup.R
create_taxonomic_update_lookup.Rd
@@ -342,7 +342,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/default_version.html b/reference/default_version.html
index c9744f1..3f79030 100644
--- a/reference/default_version.html
+++ b/reference/default_version.html
@@ -38,7 +38,7 @@
Get the default version for stable data
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
default_version.Rd
@@ -60,7 +60,7 @@ Value
Examples
default_version()
-#> [1] "2024-07+31"
+#> [1] "2024-09-25"
@@ -73,7 +73,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/gbif_lite.html b/reference/gbif_lite.html
index fb37c2f..8f376c2 100644
--- a/reference/gbif_lite.html
+++ b/reference/gbif_lite.html
@@ -36,7 +36,7 @@
@@ -89,7 +89,7 @@ Source<
Lookup Family by Genus from APC
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
get_apc_genus_family_lookup.Rd
@@ -100,7 +100,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/index.html b/reference/index.html
index 9974f33..12f6ce0 100644
--- a/reference/index.html
+++ b/reference/index.html
@@ -176,7 +176,7 @@ Data
diff --git a/reference/load_taxonomic_resources.html b/reference/load_taxonomic_resources.html
index aa3f3c7..5152356 100644
--- a/reference/load_taxonomic_resources.html
+++ b/reference/load_taxonomic_resources.html
@@ -40,7 +40,7 @@
Load taxonomic reference lists, APC & APNI
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
load_taxonomic_resources.Rd
@@ -335,7 +335,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/native_anywhere_in_australia.html b/reference/native_anywhere_in_australia.html
index d6b2d65..1fb56fb 100644
--- a/reference/native_anywhere_in_australia.html
+++ b/reference/native_anywhere_in_australia.html
@@ -38,7 +38,7 @@
Native anywhere in Australia
- Source: R/native_anywhere_in_australia.R
+ Source: R/native_anywhere_in_australia.R
native_anywhere_in_australia.Rd
@@ -115,7 +115,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/reexports.html b/reference/reexports.html
index 8828b56..039308d 100644
--- a/reference/reexports.html
+++ b/reference/reexports.html
@@ -50,7 +50,7 @@
@@ -73,7 +73,7 @@ Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html
index c2948e1..f86649b 100644
--- a/reference/standardise_names.html
+++ b/reference/standardise_names.html
@@ -44,7 +44,7 @@
Standardise taxon names
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
@@ -108,7 +108,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
diff --git a/reference/get_apc_genus_family_lookup.html b/reference/get_apc_genus_family_lookup.html
index f54601e..4cb3c8a 100644
--- a/reference/get_apc_genus_family_lookup.html
+++ b/reference/get_apc_genus_family_lookup.html
@@ -38,7 +38,7 @@
Create a table with the best-possible scientific name match for Australian plant names
- Source:R/create_taxonomic_update_lookup.R
+ Source: R/create_taxonomic_update_lookup.R
create_taxonomic_update_lookup.Rd
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/default_version.html b/reference/default_version.html
index c9744f1..3f79030 100644
--- a/reference/default_version.html
+++ b/reference/default_version.html
@@ -38,7 +38,7 @@
Get the default version for stable data
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
default_version.Rd
@@ -60,7 +60,7 @@ Value
Examples
default_version()
-#> [1] "2024-07+31"
+#> [1] "2024-09-25"
@@ -73,7 +73,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/gbif_lite.html b/reference/gbif_lite.html
index fb37c2f..8f376c2 100644
--- a/reference/gbif_lite.html
+++ b/reference/gbif_lite.html
@@ -36,7 +36,7 @@
@@ -89,7 +89,7 @@ Source<
Lookup Family by Genus from APC
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
get_apc_genus_family_lookup.Rd
@@ -100,7 +100,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/index.html b/reference/index.html
index 9974f33..12f6ce0 100644
--- a/reference/index.html
+++ b/reference/index.html
@@ -176,7 +176,7 @@ Data
diff --git a/reference/load_taxonomic_resources.html b/reference/load_taxonomic_resources.html
index aa3f3c7..5152356 100644
--- a/reference/load_taxonomic_resources.html
+++ b/reference/load_taxonomic_resources.html
@@ -40,7 +40,7 @@
Load taxonomic reference lists, APC & APNI
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
load_taxonomic_resources.Rd
@@ -335,7 +335,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/native_anywhere_in_australia.html b/reference/native_anywhere_in_australia.html
index d6b2d65..1fb56fb 100644
--- a/reference/native_anywhere_in_australia.html
+++ b/reference/native_anywhere_in_australia.html
@@ -38,7 +38,7 @@
Native anywhere in Australia
- Source: R/native_anywhere_in_australia.R
+ Source: R/native_anywhere_in_australia.R
native_anywhere_in_australia.Rd
@@ -115,7 +115,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/reexports.html b/reference/reexports.html
index 8828b56..039308d 100644
--- a/reference/reexports.html
+++ b/reference/reexports.html
@@ -50,7 +50,7 @@
@@ -73,7 +73,7 @@ Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html
index c2948e1..f86649b 100644
--- a/reference/standardise_names.html
+++ b/reference/standardise_names.html
@@ -44,7 +44,7 @@
Standardise taxon names
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
@@ -108,7 +108,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
diff --git a/reference/get_apc_genus_family_lookup.html b/reference/get_apc_genus_family_lookup.html
index f54601e..4cb3c8a 100644
--- a/reference/get_apc_genus_family_lookup.html
+++ b/reference/get_apc_genus_family_lookup.html
@@ -38,7 +38,7 @@
Get the default version for stable data
- Source:R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
default_version.Rd
Value
Examples
default_version()
-#> [1] "2024-07+31"
+#> [1] "2024-09-25"
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/gbif_lite.html b/reference/gbif_lite.html
index fb37c2f..8f376c2 100644
--- a/reference/gbif_lite.html
+++ b/reference/gbif_lite.html
@@ -36,7 +36,7 @@
@@ -89,7 +89,7 @@ Source<
Lookup Family by Genus from APC
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
get_apc_genus_family_lookup.Rd
@@ -100,7 +100,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/index.html b/reference/index.html
index 9974f33..12f6ce0 100644
--- a/reference/index.html
+++ b/reference/index.html
@@ -176,7 +176,7 @@ Data
diff --git a/reference/load_taxonomic_resources.html b/reference/load_taxonomic_resources.html
index aa3f3c7..5152356 100644
--- a/reference/load_taxonomic_resources.html
+++ b/reference/load_taxonomic_resources.html
@@ -40,7 +40,7 @@
Load taxonomic reference lists, APC & APNI
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
load_taxonomic_resources.Rd
@@ -335,7 +335,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/native_anywhere_in_australia.html b/reference/native_anywhere_in_australia.html
index d6b2d65..1fb56fb 100644
--- a/reference/native_anywhere_in_australia.html
+++ b/reference/native_anywhere_in_australia.html
@@ -38,7 +38,7 @@
Native anywhere in Australia
- Source: R/native_anywhere_in_australia.R
+ Source: R/native_anywhere_in_australia.R
native_anywhere_in_australia.Rd
@@ -115,7 +115,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/reexports.html b/reference/reexports.html
index 8828b56..039308d 100644
--- a/reference/reexports.html
+++ b/reference/reexports.html
@@ -50,7 +50,7 @@
@@ -73,7 +73,7 @@ Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html
index c2948e1..f86649b 100644
--- a/reference/standardise_names.html
+++ b/reference/standardise_names.html
@@ -44,7 +44,7 @@
Standardise taxon names
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
@@ -108,7 +108,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
diff --git a/reference/get_apc_genus_family_lookup.html b/reference/get_apc_genus_family_lookup.html
index f54601e..4cb3c8a 100644
--- a/reference/get_apc_genus_family_lookup.html
+++ b/reference/get_apc_genus_family_lookup.html
@@ -38,7 +38,7 @@
Source<
Lookup Family by Genus from APC
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
get_apc_genus_family_lookup.Rd
@@ -100,7 +100,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/index.html b/reference/index.html
index 9974f33..12f6ce0 100644
--- a/reference/index.html
+++ b/reference/index.html
@@ -176,7 +176,7 @@ Data
diff --git a/reference/load_taxonomic_resources.html b/reference/load_taxonomic_resources.html
index aa3f3c7..5152356 100644
--- a/reference/load_taxonomic_resources.html
+++ b/reference/load_taxonomic_resources.html
@@ -40,7 +40,7 @@
Load taxonomic reference lists, APC & APNI
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
load_taxonomic_resources.Rd
@@ -335,7 +335,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/native_anywhere_in_australia.html b/reference/native_anywhere_in_australia.html
index d6b2d65..1fb56fb 100644
--- a/reference/native_anywhere_in_australia.html
+++ b/reference/native_anywhere_in_australia.html
@@ -38,7 +38,7 @@
Native anywhere in Australia
- Source: R/native_anywhere_in_australia.R
+ Source: R/native_anywhere_in_australia.R
native_anywhere_in_australia.Rd
@@ -115,7 +115,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/reexports.html b/reference/reexports.html
index 8828b56..039308d 100644
--- a/reference/reexports.html
+++ b/reference/reexports.html
@@ -50,7 +50,7 @@
@@ -73,7 +73,7 @@ Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html
index c2948e1..f86649b 100644
--- a/reference/standardise_names.html
+++ b/reference/standardise_names.html
@@ -44,7 +44,7 @@
Standardise taxon names
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
@@ -108,7 +108,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
diff --git a/reference/get_apc_genus_family_lookup.html b/reference/get_apc_genus_family_lookup.html
index f54601e..4cb3c8a 100644
--- a/reference/get_apc_genus_family_lookup.html
+++ b/reference/get_apc_genus_family_lookup.html
@@ -38,7 +38,7 @@
Lookup Family by Genus from APC
- Source:R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
get_apc_genus_family_lookup.Rd
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/index.html b/reference/index.html
index 9974f33..12f6ce0 100644
--- a/reference/index.html
+++ b/reference/index.html
@@ -176,7 +176,7 @@ Data
diff --git a/reference/load_taxonomic_resources.html b/reference/load_taxonomic_resources.html
index aa3f3c7..5152356 100644
--- a/reference/load_taxonomic_resources.html
+++ b/reference/load_taxonomic_resources.html
@@ -40,7 +40,7 @@
Load taxonomic reference lists, APC & APNI
- Source: R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
load_taxonomic_resources.Rd
@@ -335,7 +335,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/native_anywhere_in_australia.html b/reference/native_anywhere_in_australia.html
index d6b2d65..1fb56fb 100644
--- a/reference/native_anywhere_in_australia.html
+++ b/reference/native_anywhere_in_australia.html
@@ -38,7 +38,7 @@
Native anywhere in Australia
- Source: R/native_anywhere_in_australia.R
+ Source: R/native_anywhere_in_australia.R
native_anywhere_in_australia.Rd
@@ -115,7 +115,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/reexports.html b/reference/reexports.html
index 8828b56..039308d 100644
--- a/reference/reexports.html
+++ b/reference/reexports.html
@@ -50,7 +50,7 @@
@@ -73,7 +73,7 @@ Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html
index c2948e1..f86649b 100644
--- a/reference/standardise_names.html
+++ b/reference/standardise_names.html
@@ -44,7 +44,7 @@
Standardise taxon names
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
@@ -108,7 +108,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
Load taxonomic reference lists, APC & APNI
- Source:R/load_taxonomic_resources.R
+ Source: R/load_taxonomic_resources.R
load_taxonomic_resources.Rd
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/native_anywhere_in_australia.html b/reference/native_anywhere_in_australia.html
index d6b2d65..1fb56fb 100644
--- a/reference/native_anywhere_in_australia.html
+++ b/reference/native_anywhere_in_australia.html
@@ -38,7 +38,7 @@
Native anywhere in Australia
- Source: R/native_anywhere_in_australia.R
+ Source: R/native_anywhere_in_australia.R
native_anywhere_in_australia.Rd
@@ -115,7 +115,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/reexports.html b/reference/reexports.html
index 8828b56..039308d 100644
--- a/reference/reexports.html
+++ b/reference/reexports.html
@@ -50,7 +50,7 @@
@@ -73,7 +73,7 @@ Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html
index c2948e1..f86649b 100644
--- a/reference/standardise_names.html
+++ b/reference/standardise_names.html
@@ -44,7 +44,7 @@
Standardise taxon names
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
@@ -108,7 +108,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
Native anywhere in Australia
- Source:R/native_anywhere_in_australia.R
+ Source: R/native_anywhere_in_australia.R
native_anywhere_in_australia.Rd
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/reexports.html b/reference/reexports.html
index 8828b56..039308d 100644
--- a/reference/reexports.html
+++ b/reference/reexports.html
@@ -50,7 +50,7 @@
@@ -73,7 +73,7 @@ Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html
index c2948e1..f86649b 100644
--- a/reference/standardise_names.html
+++ b/reference/standardise_names.html
@@ -44,7 +44,7 @@
Standardise taxon names
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
@@ -108,7 +108,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
Objects exported from other packages
diff --git a/reference/standardise_names.html b/reference/standardise_names.html index c2948e1..f86649b 100644 --- a/reference/standardise_names.html +++ b/reference/standardise_names.html @@ -44,7 +44,7 @@Standardise taxon names
- Source:R/standardise_names.R
+ Source: R/standardise_names.R
standardise_names.Rd
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/standardise_taxon_rank.html b/reference/standardise_taxon_rank.html
index e195304..7466b30 100644
--- a/reference/standardise_taxon_rank.html
+++ b/reference/standardise_taxon_rank.html
@@ -36,7 +36,7 @@
Standardise taxon ranks
- Source: R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
@@ -82,7 +82,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
Standardise taxon ranks
- Source:R/standardise_names.R
+ Source: R/standardise_names.R
standardise_taxon_rank.Rd
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/state_diversity_counts.html b/reference/state_diversity_counts.html
index 677c24c..5298c45 100644
--- a/reference/state_diversity_counts.html
+++ b/reference/state_diversity_counts.html
@@ -40,7 +40,7 @@
State- and territory-level diversity
- Source: R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
@@ -116,7 +116,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
State- and territory-level diversity
- Source:R/state_diversity_counts.R
+ Source: R/state_diversity_counts.R
state_diversity_counts.Rd
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names.html b/reference/strip_names.html
index 5cbc4e3..1535f60 100644
--- a/reference/strip_names.html
+++ b/reference/strip_names.html
@@ -38,7 +38,7 @@
@@ -93,7 +93,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/strip_names_extra.html b/reference/strip_names_extra.html
index 4fcdd56..8eeeef2 100644
--- a/reference/strip_names_extra.html
+++ b/reference/strip_names_extra.html
@@ -38,7 +38,7 @@
@@ -96,7 +96,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/reference/update_taxonomy.html b/reference/update_taxonomy.html
index ddb147e..9b88e67 100644
--- a/reference/update_taxonomy.html
+++ b/reference/update_taxonomy.html
@@ -40,7 +40,7 @@
Update to currently accepted APC name and add APC/APNI name metadata
- Source: R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd
@@ -212,7 +212,7 @@ Examples
- Site built with pkgdown 2.1.0.
+ Site built with pkgdown 2.1.1.
diff --git a/search.json b/search.json
index f63d2c0..b021ca8 100644
--- a/search.json
+++ b/search.json
@@ -1 +1 @@
-[{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"installation","dir":"Articles","previous_headings":"","what":"Installation","title":"APCalign","text":"demonstrate use APCalign, use example dataset gbif_lite documented ?gbif_lite","code":"install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\") library(APCalign) dim(gbif_lite) #> [1] 129 7 gbif_lite |> print(n = 6) #> # A tibble: 129 × 7 #> species infraspecificepithet taxonrank decimalLongitude decimalLatitude scientificname #> #> 1 Tetratheca… SPECIES 145. -37.4 Tetratheca ci… #> 2 Peganum ha… SPECIES 139. -33.3 Peganum harma… #> 3 Calotis mu… SPECIES 115. -24.3 Calotis multi… #> 4 Leptosperm… SPECIES 151. -34.0 Leptospermum … #> 5 Lepidosper… SPECIES 142. -37.3 Lepidosperma … #> 6 Enneapogon… SPECIES 129. -17.8 Enneapogon po… #> # ℹ 123 more rows #> # ℹ 1 more variable: verbatimscientificname "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"retrieve-taxonomic-resources","dir":"Articles","previous_headings":"","what":"Retrieve taxonomic resources","title":"APCalign","text":"first step retrieve entire APC APNI name databases store locally taxonomic resources. achieve using load_taxonomic_resources(). resources compressed parquet files speed download local loading. two versions databases can retrieve stable_or_current_data argument. Calling: stable retrieve recent, archived version databases GitHub releases. set default option. current retrieve --date databases directly APC APNI website. Note databases reasonably large initial retrieval core data take minutes. taxonomic resources stored locally, subsequent retrievals take less time. Retrieving current resources always take longer since accessing latest information website uncompressed format. reproducible workflow, recommend specifying exact stable version want use.","code":"# Benchmarking the retrieval of `stable` or `current` resources stable_start_time <- Sys.time() stable_resources <- load_taxonomic_resources(stable_or_current_data = \"stable\") #> Loading resources......done stable_end_time <- Sys.time() current_start_time <- Sys.time() current_resources <- load_taxonomic_resources(stable_or_current_data = \"current\") #> Loading resources......done current_end_time <- Sys.time() # Compare times stable_end_time - stable_start_time #> Time difference of 16.48976 secs resources <- load_taxonomic_resources(stable_or_current_data = \"stable\", version = \"0.0.2.9000\") #> Loading resources......done"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"align-and-update-plant-taxon-names","dir":"Articles","previous_headings":"","what":"Align and update plant taxon names","title":"APCalign","text":"Now can query taxonomic names taxonomic resources just retrieved using create_taxonomic_update_lookup(). --one function : Align taxonomic names APC APNI using matching algorithms Update names APC-accepted species infraspecific name whenever possible. Return suggested name names, defaulting accepted_name available, otherwise providing APNI name name genus-level alignment possible. like learn step, take look section Closer look name alignment updating ‘APCalign’ original_name taxon name used original data. aligned_name taxon name used link APC identify synonyms. accepted_name currently, accepted taxon name used Australian Plant Census. suggested_name best possible name option original_name.","code":"library(dplyr) updated_gbif_names <- gbif_lite |> pull(species) |> create_taxonomic_update_lookup(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked updated_gbif_names |> print(n = 6) #> # A tibble: 129 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank taxonomic_dataset #> #> 1 Tetratheca c… Tetratheca … Tetratheca c… Tetratheca ci… Tetr… species APC #> 2 Peganum harm… Peganum har… Peganum harm… Peganum harma… Pega… species APC #> 3 Calotis mult… Calotis mul… Calotis mult… Calotis multi… Calo… species APC #> 4 Leptospermum… Leptospermu… Leptospermum… Leptospermum … Lept… species APC #> 5 Lepidosperma… Lepidosperm… Lepidosperma… Lepidosperma … Lepi… species APC #> 6 Enneapogon p… Enneapogon … Enneapogon p… Enneapogon po… Enne… species APC #> # ℹ 123 more rows #> # ℹ 5 more variables: taxonomic_status , scientific_name_authorship , #> # aligned_reason , update_reason , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"plant-established-status-across-statesterritories","dir":"Articles","previous_headings":"","what":"Plant established status across states/territories","title":"APCalign","text":"‘APCalign’ can also provide state/territory distribution established status (native/introduced) APC. can access established status data state/territory using create_species_state_origin_matrix() breakdown possible values origin can also obtain breakdown species established status particular state/territory using state_diversity_counts() Using established status data state/territory information, can check plant taxa native using native_anywhere_in_australia()","code":"# Retrieve status data by state/territory status_matrix <- create_species_state_origin_matrix(resources = resources) library(purrr) library(janitor) # Obtain unique values status_matrix |> select(-species) |> flatten_chr() |> tabyl() #> flatten_chr(select(status_matrix, -species)) n percent #> doubtfully naturalised 1120 2.371003e-03 #> formerly naturalised 277 5.863998e-04 #> native 40336 8.538997e-02 #> native and doubtfully naturalised 9 1.905270e-05 #> native and naturalised 136 2.879075e-04 #> native and uncertain origin 2 4.233933e-06 #> naturalised 8765 1.855521e-02 #> not present 421606 8.925258e-01 #> presumed extinct 101 2.138136e-04 #> uncertain origin 22 4.657327e-05 state_diversity_counts(\"NSW\", resources = resources) #> # A tibble: 7 × 3 #> origin state num_species #> #> 1 doubtfully naturalised NSW 93 #> 2 formerly naturalised NSW 8 #> 3 native NSW 5958 #> 4 native and doubtfully naturalised NSW 2 #> 5 native and naturalised NSW 34 #> 6 naturalised NSW 1580 #> 7 presumed extinct NSW 8 library(dplyr) updated_gbif_names |> sample_n(1) |> # Choosing a random species pull(suggested_name) |> # Extracting this APC accepted name native_anywhere_in_australia(resources = resources) #> # A tibble: 1 × 2 #> species native_anywhere_in_aus #> #> 1 Solanum prinophyllum considered native to Australia by APC"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"closer-look","dir":"Articles","previous_headings":"","what":"Closer look at name standardisation with ‘APCalign’","title":"APCalign","text":"create_taxonomic_update_lookup simple, wrapper, function novice users want quickly check standardise taxon names. experienced users, can take look sub functions match_taxa(), align_taxa() update_taxonomy() see taxon names processed, aligned updated.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"aligning-names-to-apc-and-apni","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Aligning names to APC and APNI","title":"APCalign","text":"function align_taxa : functions standardise_names, strip_names strip_names_extra standardise infraspecific taxon designations clean punctuation whitespaces taxonomic name flows progression 50 match algorithms able aligned name either APC APNI list. include exact fuzzy matches. Fuzzy matches designed capture small spelling mistakes syntax errors phrase names. include matches entire name string matches just select words sequence. sequence matches carefully curated align names fewest mistakes. names can resolved genus, reformats name offer standardised genus sp. name, additional information/notes provided part original name square brackets, Acacia sp. [skinny leaves] Acacia sp. [Broken Hill] Determine taxonomic_reference (APC APNI) name-alignment. Note align_taxa seek update outdated taxonomy. process occurs update_taxonomy process. align_taxa instead aligns name input closest match amongst names documented APC APNI. every aligned_name, align_taxa() provide aligned_reason can review table counts:","code":"library(dplyr) aligned_gbif_taxa <- gbif_lite |> pull(species) |> align_taxa(resources = resources) #> Checking alignments of 121 taxa #> -> 0 names already matched; 0 names checked but without a match; 121 taxa yet to be checked aligned_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank aligned_reason #> #> 1 Tetratheca ciliata Tetratheca … Tetratheca … APC species Exact match o… #> 2 Peganum harmala Peganum har… Peganum har… APC species Exact match o… #> 3 Calotis multicaulis Calotis mul… Calotis mul… APC species Exact match o… #> 4 Leptospermum triner… Leptospermu… Leptospermu… APC species Exact match o… #> 5 Lepidosperma latera… Lepidosperm… Lepidosperm… APC species Exact match o… #> 6 Enneapogon polyphyl… Enneapogon … Enneapogon … APC species Exact match o… #> # ℹ 123 more rows #> # ℹ 1 more variable: alignment_code library(janitor) aligned_gbif_taxa |> pull(aligned_reason) |> tabyl() |> tibble() #> # A tibble: 6 × 4 #> `pull(aligned_gbif_taxa, aligned_reason)` n percent valid_percent #> #> 1 Exact match of taxon name to an APC-accepted canonical name o… 118 0.915 0.929 #> 2 Exact match of taxon name to an APC-known canonical name once… 6 0.0465 0.0472 #> 3 Exact match of taxon name to an APNI-listed canonical name on… 1 0.00775 0.00787 #> 4 Exact match of the first two words of the taxon name to an AP… 1 0.00775 0.00787 #> 5 Exact match of the first word of the taxon name to an APC-acc… 1 0.00775 0.00787 #> 6 2 0.0155 NA"},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"fuzzy-match","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Aligning names to APC and APNI","what":"Configuring matching precision and aligned output","title":"APCalign","text":"arguments align_taxa allows select 50 matching algorithms activated/deactivated degree fuzziness fuzzy matching function fuzzy_matches turns fuzzy matching / (defaults TRUE). fuzzy_abs_dist fuzzy_rel_dist control degree fuzzy matching (default fuzzy_abs_dist = 3 & fuzzy_rel_dist = 0.2). imprecise_fuzzy_matches turns imprecise fuzzy matching / (defaults FALSE; true set fuzzy_abs_dist = 5 & fuzzy_rel_dist = 0.25). APNI_matches turns matches APNI list /(defaults TRUE). identifier allows specify text string added genus-level matches, indicating site, study, etc e.g. Acacia sp. [Blue Mountains]","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"update","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’","what":"Updating to APC-accepted names","title":"APCalign","text":"update_taxonomy() uses information generated align_taxa() , whenever possible, update names APC-accepted names.","code":"updated_gbif_taxa <- aligned_gbif_taxa |> update_taxonomy(resources = resources) updated_gbif_taxa |> print(n = 6) #> # A tibble: 129 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Tetratheca ciliata Tetratheca c… Tetratheca c… Tetratheca ci… Tetr… Elaeo… species #> 2 Peganum harmala Peganum harm… Peganum harm… Peganum harma… Pega… Nitra… species #> 3 Calotis multicaulis Calotis mult… Calotis mult… Calotis multi… Calo… Aster… species #> 4 Leptospermum trinervium Leptospermum… Leptospermum… Leptospermum … Lept… Myrta… species #> 5 Lepidosperma laterale Lepidosperma… Lepidosperma… Lepidosperma … Lepi… Cyper… species #> 6 Enneapogon polyphyllus Enneapogon p… Enneapogon p… Enneapogon po… Enne… Poace… species #> # ℹ 123 more rows #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-resources-used-for-updating-names","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic resources used for updating names","title":"APCalign","text":"APC includes previously recorded taxonomic names current taxon concept, designating currently-accepted name taxonomic_status: accepted, previously used inappropriately used names taxon concept alternative taxonomic statuses documented (e.g. taxonomic synonym, orthographic variant, misapplied). APC includes column acceptedNameUsageID links taxon name alternative taxonomic status current taxon name, allowing outdated/inappropriately used names synced current name. Note: Names listed APNI absent APC designated taxonomic_dataset: APNI APCalign. names currently unknown APC. time, list shrinks, taxonomists link ever occasionally used name variants APC-accepted taxon. However, now, names listed APNI updated","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"name-updates-at-different-taxonomic-levels","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Name updates at different taxonomic levels","title":"APCalign","text":"update_taxonomy() divides names lists based taxon_rank taxonomic_dataset assigned align_taxa, list requires different updating algorithms. taxonomic names designated taxon_rank = species/infraspecific taxonomic_dataset = APC can updated APC-accepted name. taxa, may possible align genus-name APC-accepted genus. taxa, suggested_name provided, selecting accepted_name available, otherwise aligned_name, , possible, updated, APC-accepted genus name.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/APCalign.html","id":"taxonomic-splits","dir":"Articles","previous_headings":"Closer look at name standardisation with ‘APCalign’ > Updating to APC-accepted names","what":"Taxonomic splits","title":"APCalign","text":"Taxonomic splits refers instances single taxon concept subsequently split multiple taxon concepts. taxa, aligned_name “old” taxon concept name, impossible know currently accepted taxon concepts name represents. function update_taxonomy includes argument taxonomic_splits, offering three alternative outputs taxon concepts split. most_likely_species default value, returns accepted_name original taxon_concept; alternative names documented square brackets part suggested name (Acacia aneura [alternative possible names: Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied) | Acacia quadrimarginea (misapplied)). return_all returns currently accepted names split original taxon_concept; leads increase number rows output table. (Acacia aneura, Acacia minyura Acacia paraneura output separate row, unique taxon_ID) collapse_to_higher_taxon declares split names, way certain accepted name appropriate therefore best possible match genus level; accepted_name returned, taxon_rank demoted genus suggested name documents possible species-level names square brackets (Acacia sp. [collapsed names: Acacia aneura (accepted) | Acacia minyura (pro parte misapplied) | Acacia paraneura (pro parte misapplied)])","code":"library(dplyr) aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"most_likely_species\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura [alternat… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"return_all\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 3 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia aneura Acacia aneura Acacia Fabaceae species #> 2 Acacia aneura Acacia aneura Acacia minyura Acacia minyura Acacia Fabaceae species #> 3 Acacia aneura Acacia aneura Acacia paraneura Acacia paraneura Acacia Fabaceae species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa aligned_gbif_taxa |> update_taxonomy(taxonomic_splits = \"collapse_to_higher_taxon\", resources = resources) |> filter(original_name == \"Acacia aneura\") # Subsetting Acacia aneura as an example #> # A tibble: 1 × 21 #> original_name aligned_name accepted_name suggested_name genus family taxon_rank #> #> 1 Acacia aneura Acacia aneura Acacia sp. Acacia sp. [collapsed n… Acac… Fabac… species #> # ℹ 14 more variables: taxonomic_dataset , taxonomic_status , #> # taxonomic_status_aligned , aligned_reason , update_reason , #> # subclass , taxon_distribution , scientific_name_authorship , #> # taxon_ID , taxon_ID_genus , scientific_name_ID , canonical_name , #> # row_number , number_of_collapsed_taxa "},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-census-apc","dir":"Articles","previous_headings":"","what":"Australian Plant Census (APC)","title":"Data Providers","text":"Australian Plant Census (APC) national taxonomic database accepted names Australian vascular plants. APC includes information synonyms, misapplications , well established status (native/introduced) distribution across states territories. ‘APCalign’ first attempt align plant names scientific names exist APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"australian-plant-names-index-apni","dir":"Articles","previous_headings":"","what":"Australian Plant Names Index (APNI)","title":"Data Providers","text":"Australian Plant Names Index (APNI) database containing names used Australian plants scientific literature. primarily used botanical community standardising synonyms. Importantly, APNI provide recommendations taxonomy nomenclature; APC indicates taxonomy considered accepted. ‘APCalign’ uses APNI alignment found APC.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/data-providers.html","id":"data-standards-and-meta-data","dir":"Articles","previous_headings":"","what":"Data standards and meta-data","title":"Data Providers","text":"Data APNI APC formatted according Darwin Core standard, widely used data standard biodiversity data. can find meta-data APC APNI : Meta-data APC output Meta-data APNI output details APNI APC, recommend taking read extensive documentation.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"apcalign-r-package-version","dir":"Articles","previous_headings":"","what":"APCalign R package version","title":"How to be more reproducible with APCalign","text":"determine version APCalign package :","code":"packageVersion(\"APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"taxonomic-resources","dir":"Articles","previous_headings":"","what":"Taxonomic Resources","title":"How to be more reproducible with APCalign","text":"APCalign allows users load static downloads taxonomic resources APC APNI latest version National Species List website. functionality specified using stable_or_current_data argument load_taxonomic_resources(). want taxonomic alignment update reproducible, recommend always use stable_or_current_data = \"stable\". default value stable_or_current_data = \"stable\". static downloads version controlled stored repository releases. default, load_taxonomic_resources() load latest version static downloads. order transparent, recommend check latest default_version alignment copying pasting output load_taxonomic_resources() directly. way makes version taxonomic resources explicit code. ensure specific version taxonomic resources available subsequent functions make sure assign object: alignment update, make sure supply version taxonomic resources using resources argument:","code":"load_taxonomic_resources(stable_or_current_data = \"stable\") load_taxonomic_resources( stable_or_current_data = \"stable\", version = default_version() ) default_version() #> [1] \"2024-07+31\" resources_0.0.4.9000 <- load_taxonomic_resources( stable_or_current_data = \"stable\", version = \"0.0.4.9000\" ) # Align taxa aligned_taxa <- align_taxa(gbif_lite$species, resources = resources_0.0.4.9000) # Update taxonomy updated_taxa <- update_taxonomy(aligned_taxa, resources = resources_0.0.4.9000) # Align and update all-in-one aligned_updated_taxa <- create_taxonomic_update_lookup(gbif_lite$species, resources = resources_0.0.4.9000)"},{"path":"https://traitecoevo.github.io/APCalign/articles/reproducibility.html","id":"citing-the-r-package","dir":"Articles","previous_headings":"","what":"Citing the R package","title":"How to be more reproducible with APCalign","text":"completion, can also cite R package calling citation(). also research article introducing APCalign, share details citation press.","code":"citation(\"APCalign\") #> To cite package 'APCalign' in publications use: #> #> Wenk E, Cornwell W, Fuchs A, Kar F, Monro A, Sauquet H, Stephens R, #> Falster D (2024). \"APCalign: an R package workflow and app for #> aligning and updating flora names to the Australian Plant Census.\" #> _Australian Journal of Botany_. R package version: 1.0.1, #> . #> #> A BibTeX entry for LaTeX users is #> #> @Article{, #> title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, #> journal = {Australian Journal of Botany}, #> author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, #> year = {2024}, #> note = {R package version: 1.0.1}, #> url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, #> }"},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"aligning-taxon-names-with-taxon-conceptsnames-in-apc-and-apni","dir":"Articles","previous_headings":"","what":"Aligning taxon names with taxon concepts/names in APC and APNI","title":"Methods for updating taxon names in APCalign","text":"following table indicates rules 51 separate algorithms sequentially applied attempt align submitted name taxon concept APC scientific names APNI. Note, table truncated screen, use horizontal scroll view entire table.","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"updating-taxonomy","dir":"Articles","previous_headings":"","what":"Updating taxonomy","title":"Methods for updating taxon names in APCalign","text":"following table indicates separate functions used : update aligned names accepted names APC add best-practice suggested names submitted names add identifiers taxon concepts (APC) scientific names (APC APNI) Different functions used depending taxon rank aligned name taxonomic dataset name aligned (APC vs APNI). -* genus updated APC accepted genus possible; ** species infraspecific taxon name","code":""},{"path":"https://traitecoevo.github.io/APCalign/articles/updating-taxon-names.html","id":"outputs-of-apcalign","dir":"Articles","previous_headings":"","what":"Outputs of APCalign","title":"Methods for updating taxon names in APCalign","text":"following columns output core function create_taxonomic_update_lookup two component functions align_taxa update_taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":null,"dir":"","previous_headings":"","what":"Authors","title":"Authors and Citation","text":"Daniel Falster. Author, maintainer, copyright holder. Elizabeth Wenk. Author, contributor. Cornwell. Author, contributor. Fonti Kar. Author, contributor. Carl Boettiger. Contributor.","code":""},{"path":"https://traitecoevo.github.io/APCalign/authors.html","id":"citation","dir":"","previous_headings":"","what":"Citation","title":"Authors and Citation","text":"Wenk E, Cornwell W, Fuchs , Kar F, Monro , Sauquet H, Stephens R, Falster D (2024). “APCalign: R package workflow app aligning updating flora names Australian Plant Census.” Australian Journal Botany. R package version: 1.0.1, https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1.","code":"@Article{, title = {APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census}, journal = {Australian Journal of Botany}, author = {Elizabeth Wenk and Will Cornwell and Ann Fuchs and Fonti Kar and Anna Monro and Herve Sauquet and Ruby Stephens and Daniel Falster}, year = {2024}, note = {R package version: 1.0.1}, url = {https://www.biorxiv.org/content/10.1101/2024.02.02.578715v1}, }"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"apcalign-","dir":"","previous_headings":"","what":"Resolving Plant Taxon Names Using the Australian Plant Census","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"APCalign uses Australian Plant Census (APC) Australian Plant Name Index align update Australian plant taxon name strings. ‘APCalign’ also supplies information established status (native/introduced) plant taxa across different states/territories. ’s useful updating species list intersecting APC consensus understanding established status (native/introduced).","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"installation","dir":"","previous_headings":"","what":"Installation","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"CRAN: github version:","code":"install.packages(\"APCalign\") install.packages(\"remotes\") remotes::install_github(\"traitecoevo/APCalign\")"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"a-quick-demo","dir":"","previous_headings":"","what":"A quick demo","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Generating look-table can done just one function: ’re going use APCalign , save time load taxonomic resources memory first: Checking list species see classified Australian natives: Getting family lookup table genera specified taxonomy:","code":"library(APCalign) create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\" ) ) #> ================================================================================================================================================================ #> # A tibble: 3 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa tax_resources <- load_taxonomic_resources() #> ================================================================================================================================================================ create_taxonomic_update_lookup( taxa = c( \"Banksia integrifolia\", \"Acacia longifolia\", \"Commersonia rosea\", \"not a species\" ), resources = tax_resources ) #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 2 Acacia longifolia Acacia long… Acacia longi… Acacia longif… Acac… species #> 3 Commersonia rosea Commersonia… Androcalva r… Androcalva ro… Andr… species #> 4 not a species #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason , #> # number_of_collapsed_taxa native_anywhere_in_australia(c(\"Eucalyptus globulus\",\"Pinus radiata\"), resources = tax_resources) #> # A tibble: 2 × 2 #> species native_anywhere_in_aus #> #> 1 Eucalyptus globulus native #> 2 Pinus radiata introduced get_apc_genus_family_lookup(c(\"Eucalyptus\", \"Pinus\", \"Actinotus\", \"Banksia\", \"Acacia\", \"Triodia\"), resources = tax_resources) #> # A tibble: 6 × 2 #> genus family #> #> 1 Eucalyptus Myrtaceae #> 2 Pinus Pinaceae #> 3 Actinotus Apiaceae #> 4 Banksia Proteaceae #> 5 Acacia Fabaceae #> 6 Triodia Poaceae"},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"shiny-application","dir":"","previous_headings":"","what":"Shiny application","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"also developed shiny application non-R users update align taxonomic names. can find application : https://unsw.shinyapps.io/APCalign-app","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"learn-more","dir":"","previous_headings":"","what":"Learn more","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"Highly recommend looking Getting Started vignette learn use APCalign. can also learn taxa matching algorithm.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"found-a-bug","dir":"","previous_headings":"","what":"Found a bug?","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"come across unexpected taxon name change? Elusive error can’t debug - submit issue try best help.","code":""},{"path":"https://traitecoevo.github.io/APCalign/index.html","id":"comments-and-contributions","dir":"","previous_headings":"","what":"Comments and contributions","title":"Resolving Plant Taxon Names Using the Australian Plant Census","text":"welcome comments contributions package, start submit issue can take !","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":null,"dir":"Reference","previous_headings":"","what":"Standardising Taxonomic Names in Australian Plants — APCalign","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"process standardising taxon names necessary working biodiversity data. 'APCalign' uses Australian Plant Name Index (APNI) Australian Plant Census (APC) align update plant taxon names current, accepted standards. 'APCalign' can also supply information established status plant taxa across different states/territories.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"functions","dir":"Reference","previous_headings":"","what":"Functions","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Standarise taxon names load_taxonomic_resources create_taxonomic_update_lookup align_taxa update_taxonomy Established status region state_diversity_counts create_species_state_origin_matrix native_anywhere_in_australia","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"references","dir":"Reference","previous_headings":"","what":"References","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"questions, comments suggestions, please submit issue GitHub repository","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/APCalign.html","id":"author","dir":"Reference","previous_headings":"","what":"Author","title":"Standardising Taxonomic Names in Australian Plants — APCalign","text":"Maintainer: Daniel Falster daniel.falster@unsw.edu.au (ORCID) [copyright holder] Authors: Elizabeth Wenk e.wenk@unsw.edu.au (ORCID) [contributor] Cornwell w.cornwell@unsw.edu.au (ORCID) [contributor] Fonti Kar f.kar@unsw.edu.au (ORCID) [contributor] contributors: Carl Boettiger cboettig@gmail.com (ORCID) [contributor]","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":null,"dir":"Reference","previous_headings":"","what":"Align Australian plant scientific names to the APC or APNI — align_taxa","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"list Australian plant names, find taxonomic scientific name alignments APC APNI standardizing formatting fixing spelling errors. Usage case: Users run function wish see details matching algorithms, many output columns matching function compares seeks best alignment. may also select function want adjust “fuzziness” level fuzzy matches, options allowed create_taxonomic_update_lookup. function first half create_taxonomic_update_lookup.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"align_taxa( original_name, output = NULL, full = FALSE, resources = load_taxonomic_resources(), quiet = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, imprecise_fuzzy_matches = FALSE, APNI_matches = TRUE, identifier = NA_character_ )"},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"original_name list names query taxonomic alignments. output (optional) name file save results . full Parameter determine many columns output resources taxonomic resources used align taxa names. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources . quiet Logical indicate whether display messages aligning taxa. fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. identifier dataset, location identifier, defaults NA.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"tibble columns include original_name, aligned_name, taxonomic_dataset, taxon_rank, aligned_reason, alignment_code. original_name: original plant name input. aligned_name: original plant name function standardise_names standardised syntax infraspecific taxon designations. taxonomic_dataset: source aligned names (APC APNI). taxon_rank: taxonomic rank aligned name. aligned_reason: explanation specific taxon name alignment (original name aligned name). alignment_code: code accompanies aligned_reason, indicating relative sequence match alignment process. cleaned_name: original name punctuation infraspecific taxon designation terms standardised function standardise_names; streamlines exact matches. stripped_name: cleaned name punctuation infraspecific taxon designation terms removed function strip_names; improves fuzzy matches. stripped_name2: cleaned name punctuation, infraspecific taxon designation terms, filler words removed function strip_names_extra; required matches first two word first three words. trinomial: first three words stripped_name2, required matches ignore text original_name; improves phrase name matches. binomial: first two words stripped_name2, required matches ignore text original_name; improves phrase name matches. genus: first two words cleaned_name; required genus-rank matches reprocessing genus-rank names. fuzzy_match_genus: fuzzy match genus column best match among APC-accepted names; required fuzzy matches genus-rank names. fuzzy_match_genus_synonym: fuzzy match genus column best match among APC-synonymous names, considering different matches documented APC-accepted genera; required fuzzy matches genus-rank names. fuzzy_match_genus_APNI: fuzzy match genus column best match among APNI names, considering different matches documented APC-accepted APC-known genera; required fuzzy matches genus-rank names. fuzzy_match_family: fuzzy match genus column best match among APC-accepted family names; required fuzzy matches family-rank names. fuzzy_match_family_synonym: fuzzy match genus column best match among APC-synonymous family names; required fuzzy matches family-rank names. fuzzy_match_cleaned_APC: fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 05a function match_taxa. fuzzy_match_cleaned_APC_synonym: fuzzy match stripped_name APC-synonymous names; created yet---aligned names match step 05b function match_taxa. fuzzy_match_cleaned_APC_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07a function match_taxa. fuzzy_match_cleaned_APC_synonym_imprecise: imprecise fuzzy match stripped_name APC-accepted names; created yet---aligned names match step 07b function match_taxa. fuzzy_match_binomial: fuzzy match binomial column best match among APC-accepted names; created yet---aligned names match step 10c function match_taxa. fuzzy_match_binomial_APC_synonym: fuzzy match binomial column best match among APC-synonymous names; created yet---aligned names match step 10d function match_taxa. fuzzy_match_trinomial: fuzzy match trinomial column best match among APC-accepted names; created yet---aligned names match step 09c function match_taxa. fuzzy_match_trinomial_synonym: fuzzy match trinomial column best match among APC-synonymous names; created yet---aligned names match step 09d function match_taxa. fuzzy_match_cleaned_APNI: fuzzy match stripped_name APNI names; created yet---aligned names match step 11a function match_taxa. fuzzy_match_cleaned_APNI_imprecise: imprecise fuzzy match stripped_name APNI names; created yet---aligned names match step 11b function match_taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"function finds taxonomic alignments APC scientific name alignments APNI. uses internal function match_taxa attempt match input strings taxon names APC/APNI. sequentially searches matches 20 different string patterns, prioritising exact matches (accepted names well synonyms, orthographic variants) fuzzy matches. prioritises matches taxa APC names APNI. identifies string patterns input names suggest name can aligned genus (hybrids APC/APNI; graded species; taxa identified species), indicates names genus-rank match. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. function option changing fuzzy matching parameters. defaults, fuzzy matches allowing changes 3 (fewer) characters 20% (less) characters carefully calibrated catch just typos, , rarely mis-align name. wish introduce less conservative fuzzy matching recommended manually check aligned names. recommended begin imprecise_fuzzy_matches = FALSE (default), quite less precise fuzzy matches likely erroneous. argument turned plan check alignments manually. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return \"Acacia sp. [Royal NP]\".","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/align_taxa.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Align Australian plant scientific names to the APC or APNI — align_taxa","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> Downloading... #> File downloaded successfully. #> File downloaded successfully. #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 align_taxa(c(\"Poa annua\", \"Abies alba\"), resources=resources) #> Checking alignments of 2 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 2 × 7 #> original_name cleaned_name aligned_name taxonomic_dataset taxon_rank #> #> 1 Poa annua Poa annua Poa annua APC species #> 2 Abies alba Abies alba NA NA NA #> # ℹ 2 more variables: aligned_reason , alignment_code # example 2 input <- c(\"Banksia serrata\", \"Banksia serrate\", \"Banksia cerrata\", \"Banksia serrrrata\", \"Dryandra sp.\", \"Banksia big red flowers\") aligned_taxa <- APCalign::align_taxa( original_name = input, identifier = \"APCalign test\", full = TRUE, resources=resources ) #> Checking alignments of 6 taxa #> -> of these 1 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. # }"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":null,"dir":"Reference","previous_headings":"","what":"State level native and introduced origin status — create_species_state_origin_matrix","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"function uses taxon distribution data APC determine state level native introduced origin status. function processes geographic data available APC returns state level native, introduced complicated origins status taxa.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix(resources = load_taxonomic_resources())"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"resources taxonomic resources required make summary statistics. Loading can slow, call load_taxonomic_resources separately greatly speed function pass resources .","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"tibble columns representing state rows representing species. values cell represent origin species state.","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_species_state_origin_matrix.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"State level native and introduced origin status — create_species_state_origin_matrix","text":"","code":"create_species_state_origin_matrix() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done #> # A tibble: 26,507 × 19 #> species WA NT Qld SA NSW Vic Tas ChI LHI CoI NI #> #> 1 Acanthus e… nati… nati… nati… not … not … not … not … not … not … not … not … #> 2 Acanthus i… not … nati… nati… not … not … not … not … not … not … not … not … #> 3 Acanthus m… natu… not … doub… natu… natu… natu… natu… not … not … not … not … #> 4 Andrograph… not … natu… doub… not … not … not … not … nati… not … not … not … #> 5 Asystasia … not … not … not … not … not … not … not … nati… not … not … not … #> 6 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 7 Asystasia … natu… natu… natu… not … natu… not … not … natu… not … not … not … #> 8 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> 9 Asystasia … not … not … nati… not … not … not … not … not … not … not … not … #> 10 Asystasia … not … not … not … not … not … not … not … natu… not … not … not … #> # ℹ 26,497 more rows #> # ℹ 7 more variables: ACT , CSI , AR , MI , HI , #> # MDI , CaI "},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":null,"dir":"Reference","previous_headings":"","what":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"function takes list Australian plant names need reconciled current taxonomy generates lookup table best-possible scientific name match input name. Usage case: APCalign’s core function, merging together alignment updating taxonomy.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-usage","dir":"Reference","previous_headings":"","what":"Usage","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"create_taxonomic_update_lookup( taxa, stable_or_current_data = \"stable\", version = default_version(), taxonomic_splits = \"most_likely_species\", full = FALSE, fuzzy_abs_dist = 3, fuzzy_rel_dist = 0.2, fuzzy_matches = TRUE, APNI_matches = TRUE, imprecise_fuzzy_matches = FALSE, identifier = NA_character_, resources = load_taxonomic_resources(), quiet = FALSE, output = NULL )"},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"arguments","dir":"Reference","previous_headings":"","what":"Arguments","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"taxa list Australian plant species needs reconciled current taxonomy. stable_or_current_data either \"stable\" consistent version, \"current\" leading edge version. version version number dataset use. taxonomic_splits handle one_to_many taxonomic matches. Default \"return_all\". options \"collapse_to_higher_taxon\" \"most_likely_species\". most_likely_species defaults original_name name accepted APC; right certain species subsets, make errors cases, use caution. full logical whether full lookup table returned just key columns fuzzy_abs_dist number characters allowed different fuzzy match. fuzzy_rel_dist proportion characters allowed different fuzzy match. fuzzy_matches Fuzzy matches turned default. relative absolute distances allowed fuzzy matches species infraspecific taxon names defined parameters fuzzy_abs_dist fuzzy_rel_dist. APNI_matches Name matches APNI (Australian Plant Names Index) turned default. imprecise_fuzzy_matches Imprecise fuzzy matches uses fuzzy matching function lenient levels set (absolute distance 5 characters; relative distance = 0.25). offers way get wider range possible names, possibly corresponding distant spelling mistakes. FALSE default outputs checked often makes erroneous matches. identifier dataset, location identifier, defaults NA. resources taxonomic resources used cleaning, default loading local place computer. called repeatedly, much faster load resources using load_taxonomic_resources separately pass data . quiet Logical indicate whether display messages aligning taxa. output file path save output. file already exists, function check subset species passed try add file. can useful large growing projects.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"value","dir":"Reference","previous_headings":"","what":"Value","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"lookup table containing accepted suggested names original name input, additional taxonomic information taxon rank, taxonomic status, taxon IDs genera. original_name: original plant name. aligned_name: input plant name aligned taxon name APC APNI align_taxa function. accepted_name: APC-accepted plant name, available. suggested_name: suggested plant name use. Identical accepted_name, accepted_name exists; otherwise suggested_name aligned_name. genus: genus accepted (suggested) name; APC-accepted genus names filled . family: family accepted (suggested) name; APC-accepted family names filled . taxon_rank: taxonomic rank suggested (accepted) name. taxonomic_dataset: source suggested (accepted) names (APC APNI). taxonomic_status: taxonomic status suggested (accepted) name. taxonomic_status_aligned: taxonomic status aligned name, taxonomic updates applied. aligned_reason: explanation specific taxon name alignment (original name aligned name). update_reason: explanation specific taxon name update (aligned name accepted suggested name). subclass: subclass accepted name. taxon_distribution: distribution accepted name; filled APC accepted_name available. scientific_name_authorship: authorship information accepted (synonymous) name; available APC APNI names. taxon_ID: unique taxon concept identifier accepted_name; filled APC accepted_name available. taxon_ID_genus: identifier genus; filled APC-accepted genus name available. scientific_name_ID: identifier nomenclatural (taxonomic) details scientific name; available APC APNI names. row_number: row number specific original_name input. number_of_collapsed_taxa: taxonomic_splits == \"collapse_to_higher_taxon\", number possible taxon names collapsed.","code":""},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"details","dir":"Reference","previous_headings":"","what":"Details","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"uses first function align_taxa, function update_taxonomy achieve output. aligned name plant name aligned taxon name APC APNI align_taxa function. Notes: running function APCalign::create_taxonomic_update_lookup many times, best load taxonomic resources separately using resources <- load_taxonomic_resources(), add argument resources = resources name Banksia cerrata align fuzzy matching algorithm allow first letter genus species epithet change. argument taxonomic_splits allows choose outcome updating names taxa ambiguous taxonomic histories; applies scientific names attached broadly circumscribed taxon concept, split several narrowly circumscribed taxon concepts, one retains original name. three options: most_likely_species returns name retained, alternative names documented square brackets; return_all adds additional rows output, one possible taxon concept; collapse_to_higher_taxon returns genus possible names square brackets. argument identifier allows add fix text string genus- family- level names, identifier = \"Royal NP\" return Acacia sp. \\[Royal NP].","code":""},{"path":[]},{"path":"https://traitecoevo.github.io/APCalign/reference/create_taxonomic_update_lookup.html","id":"ref-examples","dir":"Reference","previous_headings":"","what":"Examples","title":"Create a table with the best-possible scientific name match for Australian plant names — create_taxonomic_update_lookup","text":"","code":"# \\donttest{ resources <- load_taxonomic_resources() #> #> Loading resources into memory... #> =========================== ===================================================== ================================================================================ #> ...done # example 1 create_taxonomic_update_lookup(c(\"Eucalyptus regnans\", \"Acacia melanoxylon\", \"Banksia integrifolia\", \"Not a species\"), resources = resources) #> Checking alignments of 4 taxa #> -> of these 3 names have a perfect match to a scientific name in the APC. #> Alignments being sought for remaining names. #> # A tibble: 4 × 12 #> original_name aligned_name accepted_name suggested_name genus taxon_rank #> #> 1 Eucalyptus regnans Eucalyptus … Eucalyptus r… Eucalyptus re… Euca… species #> 2 Acacia melanoxylon Acacia mela… Acacia melan… Acacia melano… Acac… species #> 3 Banksia integrifol… Banksia int… Banksia inte… Banksia integ… Bank… species #> 4 Not a species NA NA NA NA NA #> # ℹ 6 more variables: taxonomic_dataset , taxonomic_status , #> # scientific_name , aligned_reason , update_reason
Update to currently accepted APC name and add APC/APNI name metadata
- Source:R/update_taxonomy.R
+ Source: R/update_taxonomy.R
update_taxonomy.Rd