v.in.pygbif

Search and import GBIF species distribution data

Command linePython (grass.script)Python (grass.tools)

v.in.pygbif [-brpigotns] output=name taxa=string [mask=name] [date_from=string] [date_to=string] [basisofrecord=string] rank=string [recordedby=string] [institutioncode=string] [country=string] [continent=string] [--overwrite] [--verbose] [--quiet] [--qq] [--ui]

Example:

v.in.pygbif output=name taxa=string rank=species

grass.script.run_command("v.in.pygbif", output, taxa, mask=None, date_from=None, date_to=None, basisofrecord="ALL", rank="species", recordedby=None, institutioncode=None, country=None, continent=None, flags=None, overwrite=None, verbose=None, quiet=None, superquiet=None)

Example:

gs.run_command("v.in.pygbif", output="name", taxa="string", rank="species")

grass.tools.Tools.v_in_pygbif(output, taxa, mask=None, date_from=None, date_to=None, basisofrecord="ALL", rank="species", recordedby=None, institutioncode=None, country=None, continent=None, flags=None, overwrite=None, verbose=None, quiet=None, superquiet=None)

Example:

tools = Tools()
tools.v_in_pygbif(output="name", taxa="string", rank="species")

This grass.tools API is experimental in version 8.5 and expected to be stable in version 8.6.

Parameters

Command linePython (grass.script)Python (grass.tools)

output=name [required]
    Name of resulting vector map with occurrences
taxa=string [required]
    Comma separated list of taxon names or keys to fetch data for
mask=name
    Name of input vector map
    Vector map that delimits region of interest
date_from=string
    Lower bound of acceptable dates (format: yyyy, yyyy-MM, yyyy-MM-dd, or MM-dd)
date_to=string
    Upper bound of acceptable dates (format: yyyy, yyyy-MM, yyyy-MM-dd, or MM-dd)
basisofrecord=string
    Accepted basis of records
    Allowed values: ALL, FOSSIL_SPECIMEN, HUMAN_OBSERVATION, LITERATURE, LIVING_SPECIMEN, MACHINE_OBSERVATION, OBSERVATION, PRESERVED_SPECIMEN, UNKNOWN
    Default: ALL
rank=string [required]
    Rank of the taxon to search for
    Allowed values: class, cultivar, cultivar_group, domain, family, form, genus, informal, infrageneric_name, infraorder, infraspecific_name, infrasubspecific_name, kingdom, order, phylum, section, series, species, strain, subclass, subfamily, subform, subgenus, subkingdom, suborder, subphylum, subsection, subseries, subspecies, subtribe, subvariety, superclass, superfamily, superorder, superphylum, suprageneric_name, tribe, unranked, variety
    Default: species
recordedby=string
    The person who recorded the occurrence.
institutioncode=string
    An identifier of any form assigned by the source to identify the institution the record belongs to.
country=string
    The 2-letter country code (as per ISO-3166-1) of the country in which the occurrence was recorded
continent=string
    The continent in which the occurrence was recorded
    Allowed values: africa, antarctica, asia, europe, north_america, oceania, south_america
-b
    Do not build topology
-r
    Do not limit import to current region (works only in lat/lon)
-p
    Print result from matching taxa names and exit
-i
    Produce individual map for each taxon
-g
    Print result from matching taxon names in shell script style and exit
-o
    Print number of matching occurrences per taxon and exit
-t
    Print result of taxon matching in table format and exit
-n
    Do not limit search to records with coordinates
-s
    Do also import occurrences with spatial issues
--overwrite
    Allow output files to overwrite existing files
--help
    Print usage summary
--verbose
    Verbose module output
--quiet
    Quiet module output
--qq
    Very quiet module output
--ui
    Force launching GUI dialog

output : str, required
    Name of resulting vector map with occurrences
    Used as: output, vector, name
taxa : str, required
    Comma separated list of taxon names or keys to fetch data for
mask : str, optional
    Name of input vector map
    Vector map that delimits region of interest
    Used as: input, vector, name
date_from : str, optional
    Lower bound of acceptable dates (format: yyyy, yyyy-MM, yyyy-MM-dd, or MM-dd)
date_to : str, optional
    Upper bound of acceptable dates (format: yyyy, yyyy-MM, yyyy-MM-dd, or MM-dd)
basisofrecord : str, optional
    Accepted basis of records
    Allowed values: ALL, FOSSIL_SPECIMEN, HUMAN_OBSERVATION, LITERATURE, LIVING_SPECIMEN, MACHINE_OBSERVATION, OBSERVATION, PRESERVED_SPECIMEN, UNKNOWN
    Default: ALL
rank : str, required
    Rank of the taxon to search for
    Allowed values: class, cultivar, cultivar_group, domain, family, form, genus, informal, infrageneric_name, infraorder, infraspecific_name, infrasubspecific_name, kingdom, order, phylum, section, series, species, strain, subclass, subfamily, subform, subgenus, subkingdom, suborder, subphylum, subsection, subseries, subspecies, subtribe, subvariety, superclass, superfamily, superorder, superphylum, suprageneric_name, tribe, unranked, variety
    Default: species
recordedby : str, optional
    The person who recorded the occurrence.
institutioncode : str, optional
    An identifier of any form assigned by the source to identify the institution the record belongs to.
country : str, optional
    The 2-letter country code (as per ISO-3166-1) of the country in which the occurrence was recorded
continent : str, optional
    The continent in which the occurrence was recorded
    Allowed values: africa, antarctica, asia, europe, north_america, oceania, south_america
flags : str, optional
    Allowed values: b, r, p, i, g, o, t, n, s
    b
        Do not build topology
    r
        Do not limit import to current region (works only in lat/lon)
    p
        Print result from matching taxa names and exit
    i
        Produce individual map for each taxon
    g
        Print result from matching taxon names in shell script style and exit
    o
        Print number of matching occurrences per taxon and exit
    t
        Print result of taxon matching in table format and exit
    n
        Do not limit search to records with coordinates
    s
        Do also import occurrences with spatial issues
overwrite : bool, optional
    Allow output files to overwrite existing files
    Default: None
verbose : bool, optional
    Verbose module output
    Default: None
quiet : bool, optional
    Quiet module output
    Default: None
superquiet : bool, optional
    Very quiet module output
    Default: None

Returns:

result : grass.tools.support.ToolResult | None
If the tool produces text as standard output, a ToolResult object will be returned. Otherwise, None will be returned.

Raises:

grass.tools.ToolError: When the tool ended with an error.

DESCRIPTION

The module v.in.pygbif is a wrapper around the pygbif package.
Thus, pygbif is a dependency of v.in.pygbif. pygbif can be installed like this:

pip install pygbif [--user]

Through pygbif, the module allows to download data from the Global Biodiversity Information Facility (GBIF) using different search/filter criteria.

Since some of the Darwin Core attribute columns represent SQL key-words, the prefix "g_" was added to all attribute columns. The names of taxa provided at input to the search are written to the column "g_search".

The point data is downloaded and projected into the current location. By default import is limited to the current computational region in order to avoid possible projection errors, e.g. when projecting global data into UTM locations. However, in lat⁄lon location this limitation can be skiped using the -r flag.
Providing a mask automatically overrides the limitation of the search to the current computational region.

Terminology in v.in.pygbif is oriented on the Darwin Core standard: https://rs.tdwg.org/dwc.

Please note that the GBIF Search API has a hard limit of 200,000 occurrences per request. If you want to fetch more records, either subivide your area of interest or split up your search by using different search criteria.
When a list of taxa is given as input, v.in.pygbif issues a search for each taxon individually. Thus, in order to split up a search it is recommended to either use different filters on time or space.

EXAMPLES

# Check matching taxon names and alternatives in GBIF:
v.in.pygbif taxa="Poa,Plantago" rank=genus  -p

# Check matching taxon names and alternatives in GBIF and print output in table:
v.in.pygbif taxa="Poa pratensis,Plantago media,Acer negundo" rank=species -t

# Get number of occurrences for two geni:
v.in.pygbif taxa="Poa,Plantago" rank=genus  -o

# Get number of occurrences for two species:
v.in.pygbif taxa="Poa pratensis,Plantago media" rank=species  -o

# Fetch occurrences for two species into a map for each species:
v.in.pygbif taxa="Poa pratensis,Plantago media" rank=species output=gbif -i

REFERENCES

https://pygbif.readthedocs.io/en/latest/index.html
https://www.gbif.org
https://techdocs.gbif.org/en/openapi

AUTHORS

Stefan Blumentrath, Norwegian Institute for Nature Research, Oslo, Norway
Helmut Kudrnovsky

SOURCE CODE

Available at: v.in.pygbif source code (history)
Latest change: Friday Feb 21 12:27:42 2025 in commit 8fce680

v.in.pygbif

Parameters

DESCRIPTION

EXAMPLES

SEE ALSO

REFERENCES

AUTHORS

SOURCE CODE