Biodiversity

Species Occurrence Records

A billion-record archive of where and when species have been observed worldwide.

What this dataset is

A harmonised archive of more than one billion species-occurrence records — every record being an observation of a species at a place and time. Records are aggregated from public biodiversity networks, deduplicated, and cleaned against an authoritative taxonomic backbone.

Each record carries: the canonical species name, a stable taxon identifier, latitude and longitude, observation date, basis of record (human observation, specimen, machine observation, etc.), and a link back to the original source.

What you get

A single global Parquet archive, partitioned by phylum and year for fast filtering.
A CSV mirror for users who prefer flat files.
A taxonomy lookup table mapping every record to kingdom, phylum, class, order, family, genus, species.

Typical uses

Species-distribution and range modelling.
Biodiversity hotspot mapping.
Invasive species and biosecurity monitoring.
Citizen-science engagement and education.

Data quality

Many records carry uncertainty — about taxonomy, location, or date. Each record has a quality flag indicating whether it passed our standard quality filters. We recommend the standard quality subset for most analyses.

Suggested attribution

Biodiversity: OpenData.Earth species occurrences — CC BY 4.0