Download Weather Data from Environment and Climate Change Canada
Provides means for downloading historical weather data from the Environment and Climate Change Canada website (https://climate.weather.gc.ca/historical_data/search_historic_data_e.html). Data can be downloaded from multiple stations and over large date ranges and automatically processed into a single dataset. Tools are also provided to identify stations either by name or proximity to a location.
Scientific use casesDownload Time Series Data from Waterinfo.be
wateRinfo facilitates access to waterinfo.be (https://www.waterinfo.be), a website managed by the Flanders Environment Agency (VMM) and Flanders Hydraulics Research. The website provides access to real-time water and weather related environmental variables for Flanders (Belgium), such as rainfall, air pressure, discharge, and water level. The package provides functions to search for stations and variables, and download time series.
View DocumentationCatalogue of Life Plus Client
Client for the Catalogue of Life Plus (CoL+) webservice (https://github.com/CatalogueOfLife/general). The CoL+ webservice is a new interface to the Catalogue of Life. Includes functions for each of the API methods, including searching for names, and more.
View DocumentationEasily Download and Visualise Climate Data from CliFlo
CliFlo is a web portal to the New Zealand National Climate Database and provides public access (via subscription) to around 6,500 various climate stations (see https://cliflo.niwa.co.nz/ for more information). Collating and manipulating data from CliFlo (hence clifro) and importing into R for further analysis, exploration and visualisation is now straightforward and coherent. The user is required to have an internet connection, and a current CliFlo subscription (free) if data from stations, other than the public Reefton electronic weather station, is sought.
Scientific use casesIUCN Red List Client
IUCN Red List (http://apiv3.iucnredlist.org/api/v3/docs) client. The IUCN Red List is a global list of threatened and endangered species. Functions cover all of the Red List API routes. An API key is required.
Scientific use casesNOAA Weather Data from R
Client for many NOAA data sources including the NCDC climate API at https://www.ncdc.noaa.gov/cdo-web/webservices/v2, with functions for each of the API endpoints: data, data categories, data sets, data types, locations, location categories, and stations. In addition, we have an interface for NOAA sea ice data, the NOAA severe weather inventory, NOAA Historical Observing Metadata Repository (HOMR) data, NOAA storm data via IBTrACS, tornado data via the NOAA storm prediction center, and more.
Scientific use caseseBird Data Extraction and Processing in R
Extract and process bird sightings records from eBird (http://ebird.org), an online tool for recording bird observations. Public access to the full eBird database is via the eBird Basic Dataset (EBD; see http://ebird.org/ebird/data/download for access), a downloadable text file. This package is an interface to AWK for extracting data from the EBD based on taxonomic, spatial, or temporal filters, to produce a manageable file size that can be imported into R.
View DocumentationA Tool for Automating Download and Preprocessing of MODIS Land Products Data
Allows automating the creation of time series of rasters derived from MODIS Satellite Land Products data. It performs several typical preprocessing steps such as download, mosaicking, reprojection and resize of data acquired on a specified time period. All processing parameters can be set using a user-friendly GUI. Users can select which layers of the original MODIS HDF files they want to process, which additional Quality Indicators should be extracted from aggregated MODIS Quality Assurance layers and, in the case of Surface Reflectance products , which Spectral Indexes should be computed from the original reflectance bands. For each output layer, outputs are saved as single-band raster files corresponding to each available acquisition date. Virtual files allowing access to the entire time series as a single file are also created. Command-line execution exploiting a previously saved processing options file is also possible, allowing to automatically update time series related to a MODIS product whenever a new image is available.
Scientific use casesInterface to the Global Biodiversity Information Facility API
A programmatic interface to the Web Service methods provided by the Global Biodiversity Information Facility (GBIF; https://www.gbif.org/developer/summary). GBIF is a database of species occurrence records from sources all over the globe. rgbif includes functions for searching for taxonomic names, retrieving information on data providers, getting species occurrence records, getting counts of occurrence records, and using the GBIF tile map service to make rasters summarizing huge amounts of data.
Scientific use casesAccess iNaturalist Data Through APIs
A programmatic interface to the API provided by the iNaturalist website https://www.inaturalist.org/ to download species occurrence data submitted by citizen scientists.
View DocumentationBielefeld Academic Search Engine (BASE) Client
Interface to the API for the Bielefeld Academic Search Engine (BASE) (https://www.base-search.net/). BASE is a search engine for more than 150 million scholarly documents from more than 7000 sources. Methods are provided for searching for documents, as well as getting information on higher level groupings of documents: collections and repositories within collections. Search includes faceting, so you can get a high level overview of number of documents across a given variable (e.g., year). BASE asks users to respect a rate limit, but does not enforce it themselves; we enforce that rate limit.
View DocumentationWeb scraper for Atlantic and east Pacific hurricanes and tropical storms
Get archived data of past and current hurricanes and tropical
storms for the Atlantic and eastern Pacific oceans. Data is available for
storms since 1998. Datasets are updated via the rrricanesdata package.
Currently, this package is about 6MB of datasets. See the README or view
vignette("drat")
for more information.
Mangal Client
An interface to the Mangal database - a collection of ecological networks. This package includes functions to work with the Mangal RESTful API methods (https://mangal.io/doc/api/).
View DocumentationAccesses Weather Data from the Iowa Environment Mesonet
Allows to get weather data from Automated Surface Observing System (ASOS) stations (airports) in the whole world thanks to the Iowa Environment Mesonet website.
Scientific use casesAustralian Government Bureau of Meteorology (BOM) Data Client
Provides functions to interface with Australian Government Bureau of Meteorology (BOM) data, fetching data and returning a data frame of precis forecasts, historical and current weather data from stations, agriculture bulletin data, BOM 0900 or 1500 weather bulletins and downloading and importing radar and satellite imagery files. Data (c) Australian Government Bureau of Meteorology Creative Commons (CC) Attribution 3.0 licence or Public Access Licence (PAL) as appropriate. See http://www.bom.gov.au/other/copyright.shtml for further details.
Scientific use casesNASA POWER API Client
Client for NASA POWER global meteorology, surface solar energy and climatology data API. POWER (Prediction Of Worldwide Energy Resource) data are freely available global meteorology and surface solar energy climatology data for download with a resolution of 1/2 by 1/2 arc degree longitude and latitude and are funded through the NASA Earth Science Directorate Applied Science Program. For more on the data themselves, a web-based data viewer and web access, please see https://power.larc.nasa.gov/.
Scientific use casesGlobal Surface Summary of the Day (GSOD) Weather Data Client
Provides automated downloading, parsing, cleaning, unit conversion and formatting of Global Surface Summary of the Day (GSOD) weather data from the from the USA National Centers for Environmental Information (NCEI). Units are converted from from United States Customary System (USCS) units to International System of Units (SI). Stations may be individually checked for number of missing days defined by the user, where stations with too many missing observations are omitted. Only stations with valid reported latitude and longitude values are permitted in the final data. Additional useful elements, saturation vapour pressure (es), actual vapour pressure (ea) and relative humidity (RH) are calculated from the original data using the improved August-Roche-Magnus approximation (Alduchov & Eskridge 1996) and included in the final data set. The resulting metadata include station identification information, country, state, latitude, longitude, elevation, weather observations and associated flags. For information on the GSOD data from NCEI, please see the GSOD readme.txt file available from, https://www1.ncdc.noaa.gov/pub/data/gsod/readme.txt.
Scientific use casesAPI Client for CHIRPS
API Client for the Climate Hazards Group InfraRed Precipitation with Station Data CHIRPS. The CHIRPS data is a 35+ year quasi-global rainfall data set, which incorporates 0.05 arc-degrees resolution satellite imagery, and in-situ station data to create gridded rainfall time series for trend analysis and seasonal drought monitoring. For more details on CHIRPS data please visit its official home page https://www.chc.ucsb.edu/data/chirps. Requests from large time series (> 10 years) and large geographic coverage (global scale) may take several minutes.
View DocumentationCRU CL v. 2.0 Climatology Client
Provides functions that automate downloading and importing University of East Anglia Climate Research Unit (CRU) CL v. 2.0 climatology data, facilitates the calculation of minimum temperature and maximum temperature and formats the data into a tidy data frame as a tibble or a list of raster stack objects for use. CRU CL v. 2.0 data are a gridded climatology of 1961-1990 monthly means released in 2002 and cover all land areas (excluding Antarctica) at 10 arcminutes (0.1666667 degree) resolution. For more information see the description of the data provided by the University of East Anglia Climate Research Unit, https://crudata.uea.ac.uk/cru/data/hrg/tmc/readme.txt.
View DocumentationExtract and Tidy Canadian Hydrometric Data
Provides functions to access historical and real-time national hydrometric data from Water Survey of Canada data sources (https://dd.weather.gc.ca/hydrometric/csv/ and https://collaboration.cmc.ec.gc.ca/cmc/hydrometrics/www/) and then applies tidy data principles.
Scientific use casesInterface to the Pleiades Archeological Database
Provides a set of functions for interacting with the Pleiades (https://pleiades.stoa.org/) API, including getting status data, places data, and creating a GeoJSON based map on GitHub gists.
View DocumentationFingertips Data for Public Health
Fingertips (http://fingertips.phe.org.uk/) contains data for many indicators of public health in England. The underlying data is now more easily accessible by making use of the API.
Scientific use casesDownloading Supplementary Data from Published Manuscripts
Downloads data supplementary materials from manuscripts, using papers’ DOIs as references. Facilitates open, reproducible research workflows: scientists re-analyzing published datasets can work with them as easily as if they were stored on their own computer, and others can track their analysis workflow painlessly. The main function suppdata() returns a (temporary) location on the user’s computer where the file is stored, making it simple to use suppdata() with standard functions like read.csv().
Scientific use casesSustainable Transport Planning
Tools for transport planning with an emphasis on spatial transport data and non-motorized modes. Enables common transport planning tasks including: downloading and cleaning transport datasets; creating geographic “desire lines” from origin-destination (OD) data; route assignment, locally and via interfaces to routing services such as https://cyclestreets.net/; calculation of route segment attributes such as bearing and aggregate flow; and travel watershed analysis. See Lovelace and Ellison (2018) doi:10.32614/RJ-2018-053 and vignettes for details.
Scientific use casesGenerates Networks from BTS Data
A flexible tool that allows generating bespoke air transport statistics for urban studies based on publicly available data from the Bureau of Transport Statistics (BTS) in the United States https://www.transtats.bts.gov/databases.asp?Mode_ID=1&Mode_Desc=Aviation&Subject_ID2=0.
Scientific use casesChemical Information from the Web
Chemical information from around the web. This package interacts with a suite of web services for chemical information. Sources include: Alan Wood’s Compendium of Pesticide Common Names, Chemical Identifier Resolver, ChEBI, Chemical Translation Service, ChemIDplus, ChemSpider, ETOX, Flavornet, NIST Chemistry WebBook, OPSIN, PAN Pesticide Database, PubChem, SRS, Wikidata.
Scientific use casesDownload and Explore Datasets from UCSC Xena Data Hubs
Download and explore datasets from UCSC Xena data hubs, which are a collection of UCSC-hosted public databases such as TCGA, ICGC, TARGET, GTEx, CCLE, and others. Databases are normalized so they can be combined, linked, filtered, explored and downloaded.
Scientific use casesR Interface to FishBase
A programmatic interface to http://www.fishbase.org, re-written based on an accompanying RESTful API. Access tables describing over 30,000 species of fish, their biology, ecology, morphology, and more. This package also supports experimental access to http://www.sealifebase.org data, which contains nearly 200,000 species records for all types of aquatic life not covered by FishBase.
Scientific use casesInterface to the MODIS Land Products Subsets Web Services
Programmatic interface to the Oak Ridge National Laboratories MODIS Land Products Subsets web services (https://modis.ornl.gov/data/modis_webservice.html). Allows for easy downloads of MODIS time series directly to your R workspace or your computer.
Scientific use casesWork with Open Road Traffic Casualty Data from Great Britain
Tools to help download, process and analyse the UK road collision data collected using the STATS19 form. The data are provided as CSV files with detailed road safety data about the circumstances of car crashes and other incidents on the roads resulting in casualties in Great Britain from 1979, the types (including make and model) of vehicles involved and the consequential casualties. The statistics relate only to personal casualties on public roads that are reported to the police, and subsequently recorded, using the STATS19 accident reporting form. See the Department for Transport website https://data.gov.uk/dataset/cb7ae6f0-4be6-4935-9277-47e5ce24a11f/road-safety-data for more information on these data.
View DocumentationA High-Performance Database of Shipment-Level CITES Trade Data
Provides convenient access to over 40 years and 20 million records of endangered wildlife trade data from the Convention on International Trade in Endangered Species of Wild Fauna and Flora, stored on a local on-disk, out-of memory DuckDB database for bulk analysis.
Scientific use casesR Interface to the Data Retriever
Provides an R interface to the Data Retriever https://retriever.readthedocs.io/en/latest/ via the Data Retriever’s command line interface. The Data Retriever automates the tasks of finding, downloading, and cleaning public datasets, and then stores them in a local database.
View DocumentationDownload and Process Public Domain Works from Project Gutenberg
Download and process public domain works in the Project Gutenberg collection http://www.gutenberg.org/. Includes metadata for all Project Gutenberg works, so that they can be searched and retrieved.
View DocumentationAccess to the Neotoma Paleoecological Database Through R
Access paleoecological datasets from the Neotoma Paleoecological Database using the published API (http://wnapi.neotomadb.org/). The functions in this package access various pre-built API functions and attempt to return the results from Neotoma in a usable format for researchers and the public.
Scientific use casesAccess the Global Plant Phenology Data Portal
An R interface to the Global Plant Phenology Data Portal, which is accessible online at https://www.plantphenology.org/.
View DocumentationFunctions to Automate Downloading Geospatial Data Available from Several Federated Data Sources
Functions to automate downloading geospatial data available from several federated data sources (mainly sources maintained by the US Federal government). Currently, the package enables extraction from seven datasets: The National Elevation Dataset digital elevation models (1 and 1/3 arc-second; USGS); The National Hydrography Dataset (USGS); The Soil Survey Geographic (SSURGO) database from the National Cooperative Soil Survey (NCSS), which is led by the Natural Resources Conservation Service (NRCS) under the USDA; the Global Historical Climatology Network (GHCN), coordinated by National Climatic Data Center at NOAA; the Daymet gridded estimates of daily weather parameters for North America, version 3, available from the Oak Ridge National Laboratory’s Distributed Active Archive Center (DAAC); the International Tree Ring Data Bank; and the National Land Cover Database (NLCD).
Scientific use casesOpen Trade Statistics API Wrapper and Utility Program
Access Open Trade Statistics API from R to download international trade data.
View DocumentationDownload and Prepare C14 Dates from Different Source Databases
Query different C14 date databases and apply basic data cleaning, merging and calibration steps. Currently available databases: 14cpalaeolithic, 14sea, adrac, austarch, calpal, context, emedyd, eubar, euroevol, irdd, jomon, katsianis, kiteeastafrica, medafricarbon, mesorad, pacea, palmisano, radon, radonb.
View DocumentationGet SNP (Single-Nucleotide Polymorphism) Data on the Web
A programmatic interface to various SNP datasets on the web: OpenSNP (https://opensnp.org), and NBCIs dbSNP database (https://www.ncbi.nlm.nih.gov/projects/SNP/). Functions are included for searching for NCBI. For OpenSNP, functions are included for getting SNPs, and data for genotypes, phenotypes, annotations, and bulk downloads of data by user.
Scientific use casesSpecies Trait Data from Around the Web
Species trait data from many different sources, including sequence data from NCBI (https://www.ncbi.nlm.nih.gov/), plant trait data from BETYdb, data from EOL Traitbank, Birdlife International, and more.
Scientific use casesInterface to the CAVD DataSpace
Provides a convenient API interface to access immunological data within the CAVD DataSpace(https://dataspace.cavd.org), a data sharing and discovery tool that facilitates exploration of HIV immunological data from pre-clinical and clinical HIV vaccine studies.
View DocumentationAccess Data from the Oregon State Prism Climate Project
Allows users to access the Oregon State Prism climate data (http://www.prism.oregonstate.edu/). Using the web service API data can easily downloaded in bulk and loaded into R for spatial analysis. Some user friendly visualizations are also provided.
View DocumentationClient for CCAFS GCM Data
Client for Climate Change, Agriculture, and Food Security (CCAFS) General Circulation Models (GCM) data. Data is stored in Amazon S3, from which we provide functions to fetch data.
View DocumentationDrugBank Database XML Parser
This tool is for parsing the DrugBank XML database https://www.drugbank.ca/. The parsed data are then returned in a proper R dataframe with the ability to save them in a given database.
View DocumentationSearch Vertnet, a Database of Vertebrate Specimen Records
Retrieve, map and summarize data from the VertNet.org archives (http://vertnet.org/). Functions allow searching by many parameters, including taxonomic names, places, and dates. In addition, there is an interface for conducting spatially delimited searches, and another for requesting large datasets via email.
Scientific use casesGenomic Data Retrieval
Perform large scale genomic data retrieval and functional annotation retrieval. This package aims to provide users with a standardized way to automate genome, proteome, RNA, coding sequence (CDS), GFF, and metagenome retrieval from NCBI RefSeq, NCBI Genbank, ENSEMBL, and UniProt databases. Furthermore, an interface to the BioMart database (Smedley et al. (2009) doi:10.1186/1471-2164-10-22) allows users to retrieve functional annotation for genomic loci. In addition, users can download entire databases such as NCBI RefSeq (Pruitt et al. (2007) doi:10.1093/nar/gkl842), NCBI nr, NCBI nt, NCBI Genbank (Benson et al. (2013) doi:10.1093/nar/gks1195), etc. with only one command.
Scientific use casesInterface to Species Occurrence Data Sources
A programmatic interface to many species occurrence data sources, including Global Biodiversity Information Facility (GBIF), USGSs Biodiversity Information Serving Our Nation (BISON), iNaturalist, Berkeley Ecoinformatics Engine, eBird, Integrated Digitized Biocollections (iDigBio), VertNet, Ocean Biogeographic Information System (OBIS), and Atlas of Living Australia (ALA). Includes functionality for retrieving species occurrence data, and combining those data.
Scientific use casesAccesses Air Quality Data from the Open Data Platform OpenAQ
Allows access to air quality data from the API of the OpenAQ platform https://docs.openaq.org/, with the different services the API offers (getting measurements for a given query, getting latest measurements, getting lists of available countries/cities/locations).
Scientific use casesDownload and Process Data from the Paleobiology Database
Includes 19 functions to wrap each endpoint of the PaleobioDB API, plus 8 functions to visualize and process the fossil data. The API documentation for the Paleobiology Database can be found in http://paleobiodb.org/data1.1/.
Scientific use casesFetch Phylogenies from Many Sources
Includes methods for fetching phylogenies from a variety of sources, including the Phylomatic web service (http://phylodiversity.net/phylomatic), and Phylocom (https://github.com/phylocom/phylocom/).
Scientific use casesInternational Cricket Data
Data on all international cricket matches is provided by ESPNCricinfo. This package provides some scraper functions to download the data into tibbles ready for analysis. Some innings-level data sourced from Howzstat is also included in the package.
View DocumentationClient for Neuroscience Information Framework APIs
Client for Neuroscience Information Framework (NIF) APIs (https://neuinfo.org/; https://neuinfo.org/about/webservices). Package includes functions for each API route, and gives back data in tidy data.frame format.
View DocumentationInterface to the USGS BISON API
Interface to the USGS BISON (https://bison.usgs.gov/) API, a database for species occurrence data. Data comes from species in the United States from participating data providers. You can get data via taxonomic and location based queries. A simple function is provided to help visualize data.
Scientific use casesFunctions to mine endoscopic and associated pathology datasets
This script comprises the functions that are used to clean up endoscopic
reports and pathology reports as well as many of the scripts used for analysis.
The scripts assume the endoscopy and histopathology data set is merged already but it can
also be used of course with the unmerged datasets.
General Purpose Client for ERDDAP Servers
General purpose R client for ERDDAP servers. Includes functions to search for datasets, get summary information on datasets, and fetch datasets, in either csv or netCDF format. ERDDAP information: https://upwell.pfeg.noaa.gov/erddap/information.html.
Scientific use casesRetrieve Data from the 1000 Plants Initiative (1KP)
The 1000 Plants Initiative (www.onekp.com) has sequenced the transcriptomes
of over 1000 plant species. This package allows these sequences and
metadata to be retrieved and filtered by code, species or recursively by
clade. Scientific names and NCBI taxonomy IDs are both supported.
View Documentation
Interface to the Open Tree of Life API
An interface to the Open Tree of Life API to retrieve phylogenetic trees, information about studies used to assemble the synthetic tree, and utilities to match taxonomic names to ‘Open Tree identifiers. The Open Tree of Life’ aims at assembling a comprehensive phylogenetic tree for all named species.
Scientific use casesPredict Gender from Names Using Historical Data
Infers state-recorded gender categories from first names and dates of birth using historical datasets. By using these datasets instead of lists of male and female names, this package is able to more accurately infer the gender of a name, and it is able to report the probability that a name was male or female. GUIDELINES: This method must be used cautiously and responsibly. Please be sure to see the guidelines and warnings about usage in the README or the package documentation. See Blevins and Mullen (2015) http://www.digitalhumanities.org/dhq/vol/9/3/000223/000223.html.
View DocumentationDownload Qualtrics Survey Data
Provides functions to access survey results directly into R using the Qualtrics API. Qualtrics https://www.qualtrics.com/about/ is an online survey and data collection software platform. See https://api.qualtrics.com/ for more information about the Qualtrics API. This package is community-maintained and is not officially supported by Qualtrics.
View DocumentationClient for the CORE API
Client for the CORE API (https://core.ac.uk/docs/). CORE (https://core.ac.uk) aggregates open access research outputs from repositories and journals worldwide and make them available to the public.
View DocumentationDirectory of Open Access Journals Client
Client for the Directory of Open Access Journals (DOAJ) (https://doaj.org/). API documentation at https://doaj.org/api/v1/docs. Methods included for working with all DOAJ API routes: fetch article information by identifier, search for articles, fetch journal information by identifier, and search for journals.
View DocumentationR Interface to the Species+ Database
A programmatic interface to the Species+ https://speciesplus.net/ database via the Species+/CITES Checklist API https://api.speciesplus.net/.
Scientific use casesAccess for Dryad Web Services
Interface to the Dryad “Solr” API, their “OAI-PMH” service, and fetch datasets. Dryad (https://datadryad.org/) is a curated host of data underlying scientific publications.
Scientific use casesInterface to Bold Systems API
A programmatic interface to the Web Service methods provided by Bold Systems (http://www.boldsystems.org/) for genetic barcode data. Functions include methods for searching by sequences by taxonomic names, ids, collectors, and institutions; as well as a function for searching for specimens, and downloading trace files.
Scientific use casesAccess Nomis UK Labour Market Data
Access UK official statistics from the Nomis database. Nomis includes data from the Census, the Labour Force Survey, DWP benefit statistics and other economic and demographic data from the Office for National Statistics, based around statistical geographies. See https://www.nomisweb.co.uk/api/v01/help for full API documentation.
View DocumentationAutomated Phylogenetic Sequence Cluster Identification from GenBank
A pipeline for the identification, within taxonomic groups, of orthologous sequence clusters from GenBank https://www.ncbi.nlm.nih.gov/genbank/ as the first step in a phylogenetic analysis. The pipeline depends on a local alignment search tool and is, therefore, not dependent on differences in gene naming conventions and naming errors.
Scientific use casesFetch Species Origin Data from the Web
Get species origin data (whether species is native/invasive) from the following sources on the web: Encyclopedia of Life (http://eol.org), Flora Europaea (http://rbg-web2.rbge.org.uk/FE/fe.html), Global Invasive Species Database (http://www.iucngisd.org/gisd), the Native Species Resolver (https://bien.nceas.ucsb.edu/bien/tools/nsr/), Integrated Taxonomic Information Service (https://www.itis.gov/), and Global Register of Introduced and Invasive Species (http://www.griis.org/).
View DocumentationInterface to the Biodiversity Heritage Library
Interface to Biodiversity Heritage Library (BHL) (https://www.biodiversitylibrary.org/) API (https://www.biodiversitylibrary.org/docs/api3.html). BHL is a repository of digitized literature on biodiversity studies, including floras, research papers, and more.
Scientific use casesHydrological Data Discovery Tools
Tools to discover hydrological data, accessing catalogues and databases from various data providers.
View DocumentationAcquisition and Processing of NASA Soil Moisture Active-Passive (SMAP) Data
Facilitates programmatic access to NASA Soil Moisture Active
Passive (SMAP) data with R. It includes functions to search for, acquire,
and extract SMAP data.
View Documentation
Download Data from the European Social Survey on the Fly
Download data from the European Social Survey directly from their website http://www.europeansocialsurvey.org/. There are two families of functions that allow you to download and interactively check all countries and rounds available.
View DocumentationInterface to USDA Databases
An interface to the web service methods provided by the United States Department of Agriculture (USDA). The Agricultural Research Service (ARS) provides a large set of databases. The current version of the package holds interfaces to the Systematic Mycology and Microbiology Laboratory (SMML), which consists of four databases: Fungus-Host Distributions, Specimens, Literature and the Nomenclature database. It provides functions for querying these databases. The main function is \code{associations}, which allows searching for fungus-host combinations.
Scientific use casesAn API Client for the Internet Archive
Search the Internet Archive (https://archive.org), retrieve metadata, and download files.
View DocumentationR Interface to the Global Population Dynamics Database
R Interface to the Global Population Dynamics Database (https://ecologicaldata.org/wiki/global-population-dynamics-database)
View DocumentationNatureServe Interface
Interface to NatureServe (https://www.natureserve.org/). Includes methods to get data, image metadata, search taxonomic names, and make maps.
View DocumentationInterface with the United Nations Comtrade API
Interface with and extract data from the United Nations Comtrade API https://comtrade.un.org/data/. Comtrade provides country level shipping data for a variety of commodities, these functions allow for easy API query and data returned as a tidy data frame.
View DocumentationR Client for the eBird Database of Bird Observations
A programmatic client for the eBird database (https://ebird.org/home), including functions for searching for bird observations by geographic location (latitude, longitude), eBird hotspots, location identifiers, by notable sightings, by region, and by taxonomic name.
Scientific use casesAPI Client for the Open Context Archeological Database
Search, browse, and download data from Open Context (https://opencontext.org)
View DocumentationAn R client for HathiTrust API
An R client for HathiTrust API (https://www.hathitrust.org). Only for the bibliographic API for now.
View DocumentationR Interface to Global Biotic Interactions
A programmatic interface to the web service methods provided by Global Biotic Interactions (GloBI) (https://www.globalbioticinteractions.org/). GloBI provides access to spatial-temporal species interaction records from sources all over the world. rglobi provides methods to search species interactions by location, interaction type, and taxonomic name. In addition, it supports Cypher, a graph query language, to allow for executing custom queries on the GloBI aggregate species interaction data set.
Scientific use casesAPI Wrapper for US Energy Information Administration Open Data
Provides API access to data from the US Energy Information Administration (EIA) https://www.eia.gov/. Use of the API requires a free API key obtainable at https://www.eia.gov/opendata/register.php. The package includes functions for searching EIA data categories and importing time series and geoset time series datasets. Datasets returned by these functions are provided in a tidy format or alternatively in more raw form. It also offers helper functions for working with EIA date strings and time formats and for inspecting different summaries of series metadata. The package also provides control over API key storage and caching of API request results.
View DocumentationPhoto Searcher
Queries the Flick API (https://www.flickr.com/services/api/) to return photograph metadata as well as the ability to download the images as jpegs.
View DocumentationA package for accessing World Bank climate data
This package will download model predictions from 15 different global circulation models in 20 year intervals from the world bank. Users can also access historical data, and create maps at 2 different spatial scales.
Scientific use casesGeneral Purpose R Interface to Solr
Provides a set of functions for querying and parsing data from Solr (https://lucene.apache.org/solr) endpoints (local and remote), including search, faceting, highlighting, stats, and more like this. In addition, some functionality is included for creating, deleting, and updating documents in a Solr database.
View DocumentationProvides some helper functions for using the GitHub V4 API
Uses the ghql package and jqr to get some common data from Github V4 API.
View DocumentationInteract with the UK AIR Pollution Database from DEFRA
Get data from DEFRA’s UK-AIR website https://uk-air.defra.gov.uk/. It basically scrapes the HTML content.
Scientific use casesObtain and Visualize Regulome-Gene Expression Correlations in Cancer
Builds a SQLite database file of pre-calculated transcription factor/microRNA-gene correlations (co-expression) in cancer from the Cistrome Cancer Liu et al. (2011) doi:10.1186/gb-2011-12-8-r83 and miRCancerdb databases (in press). Provides custom classes and functions to query, tidy and plot the correlation data.
Scientific use casesAPI Client and Dataset Management for the Demographic and Health Survey (DHS) Data
Provides a client for (1) querying the DHS API for survey indicators and metadata (https://api.dhsprogram.com/#/index.html), (2) identifying surveys and datasets for analysis, (3) downloading survey datasets from the DHS website, (4) loading datasets and associate metadata into R, and (5) extracting variables and combining datasets for pooled analysis.
Scientific use casesInterface to the Libraries.io API
Interface to the Libraries.io API (https://libraries.io/api). Libraries.io indexes data from 36 different package managers for programming languages.
View DocumentationR Package Client for the Netherlands Biodiversity API
Access to the digitised Natural History collection at the Naturalis Biodiversity Center. This is the official client to the Netherlands Biodiversity API (NBA, http://api.biodiversitydata.nl) for the R programming language. More information on the NBA can be found at http://docs.biodiversitydata.nl.
View DocumentationInterface to the Greek National Data Bank for Hydrometeorological Information
R interface to the Greek National Data Bank for Hydrological and Meteorological Information http://www.hydroscope.gr/. It covers Hydroscope’s data sources and provides functions to transliterate, translate and download them into tidy dataframes.
Scientific use casesProgrammatic Interface to the Web Service Methods Provided by UC Berkeley's Natural History Data
The ecoengine (ecoengine; https://ecoengine.berkeley.edu/). provides access to more than 5 million georeferenced specimen records from the University of California, Berkeley’s Natural History Museums.
View DocumentationDownload and Aggregate Data from Public Hire Bicycle Systems
Download and aggregate data from all public hire bicycle systems which provide open data, currently including Santander Cycles in London, U.K.; from the U.S.A., Ford GoBike in San Francisco CA, citibike in New York City NY, Divvy in Chicago IL, Capital Bikeshare in Washington DC, Hubway in Boston MA, Metro in Los Angeles LA, Indego in Philadelphia PA, and Nice Ride in Minnesota; Bixi from Montreal, Canada; and mibici from Guadalajara, Mexico.
Scientific use casesDownload Data from the Catchment Data Explorer Website
Facilitates searching, download and plotting of Water Framework Directive (WFD) reporting data for all waterbodies within the UK Environment Agency area. The types of data that can be downloaded are: WFD status classification data, Reasons for Not Achieving Good (RNAG) status, objectives set for waterbodies, measures put in place to improve water quality and details of associated protected areas. The site accessed is https://environment.data.gov.uk/catchment-planning/. The data are made available under the Open Government Licence v3.0 https://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/.
View DocumentationParse NOAA Integrated Surface Data Files
Tools for parsing NOAA Integrated Surface Data (ISD) files, described at https://www.ncdc.noaa.gov/isd. Data includes for example, wind speed and direction, temperature, cloud data, sea level pressure, and more. Includes data from approximately 35,000 stations worldwide, though best coverage is in North America/Europe/Australia. Data is stored as variable length ASCII character strings, with most fields optional. Included are tools for parsing entire files, or individual lines of data.
View DocumentationClient for the Pangaea Database
Tools to interact with the Pangaea Database (https://www.pangaea.de), including functions for searching for data, fetching datasets by dataset ID, and working with the Pangaea OAI-PMH service.
Scientific use casesCollecting Twitter Data
An implementation of calls designed to collect and organize Twitter data via Twitter’s REST and stream Application Program Interfaces (API), which can be found at the following URL: https://developer.twitter.com/en/docs. This package has been peer-reviewed by rOpenSci (v. 0.6.9).
Scientific use casesGet Australian Flight Data, 1985-2016
A package to obtain Australian aviation data from BITRE. This incudes airport traffic data between 1985-2016 covering international freight data, and both international and domestic data on number of passengers, and flight movements - for both regional and metropolitan airports. The Package also includes distances of flight originating in or ending in Australia, and the location of all relevant airports.
View DocumentationProgrammatic Interface to the openfisheries.org API
A programmatic interface to openfisheries.org. This package is part of the rOpenSci suite (https://ropensci.org).
Scientific use casesAccess London Natural History Museum Host-Helminth Record Database
Access to large host-parasite data is often hampered by the availability of data and difficulty in obtaining it in a programmatic way to encourage analyses. helminthR provides a programmatic interface to the London Natural History Museum’s host-parasite database, one of the largest host-parasite databases existing currently http://www.nhm.ac.uk/research-curation/scientific-resources/taxonomy-systematics/host-parasites/. The package allows the user to query by host species, parasite species, and geographic location.
Scientific use casesA DoOR to the Complete Olfactome
This is a function package providing functions to perform data manipulations and visualizations for DoOR.data. See the URLs for the original and the DoOR 2.0 publication.
View DocumentationA DoOR to the Complete Olfactome
This is a data package providing Drosophila odorant response data for DoOR.functions. See URLs for the original and the DoOR 2.0 publications.
View DocumentationDiscovery, Access and Manipulation of TreeBASE Phylogenies
Interface to the API for TreeBASE http://treebase.org from R. TreeBASE is a repository of user-submitted phylogenetic trees (of species, population, or genes) and the data used to create them.
View DocumentationData for Atlantic and east Pacific tropical cyclones since 1998
Includes storm discussions, forecast/advisories, public advisories, wind speed probabilities, strike probabilities and more. This package can be used along with rrricanes (>= 0.2.0-6). Data is considered public domain via the National Hurricane Center.
View DocumentationGet Texts from the Perseus Digital Library
The Perseus Digital Library is a collection of classical texts. This package helps you get them. The available works can also be viewed here: http://cts.perseids.org/.
View DocumentationHigh Resolution World Vector Map Data from Natural Earth used in rnaturalearth
Facilitates mapping by making natural earth map data from http:// www.naturalearthdata.com/ more easily available to R users. Focuses on vector data.
View DocumentationInterface to the Bird-Watching Dataset Proyecto AVIS
Interface to http://proyectoavis.com database. It provides means to download data filtered by species, order, family, and several other criteria. Provides also basic functionality to plot exploratory maps of the datasets.
View DocumentationDownload and Read RAM Legacy Stock Assessment Database
Contains functions to download, cache and read in Excel version of the RAM Legacy Stock Assessment Data Base, an online compilation of stock assessment results for commercially exploited marine populations from around the world. The database is named after Dr. Ransom A. Myers whose original stock-recruitment database, is no longer being updated. More information about the database can be found at https://ramlegacy.org/. Ricard, D., Minto, C., Jensen, O.P. and Baum, J.K. (2012) doi:10.1111/j.1467-2979.2011.00435.x.
View DocumentationClient for Various Ocean Time Series Datasets
Interact with various ocean time series datasets, including BATS, HOT, and more. Package focuses on data retrieval only. All functions return a data.frame for easy downstream use for plots, vizualization, analysis.
View DocumentationDatasets for Historians
These sample data sets are intended for historians learning R. They include population, institutional, religious, military, and prosopographical data suitable for mapping, quantitative analysis, and network analysis.
View DocumentationGet Landsat 8 Data from Amazon Public Data Sets
Get Landsat 8 Data from Amazon Web Services (AWS) public data sets (https://registry.opendata.aws/landsat-8/). Includes functions for listing images and fetching them, and handles caching to prevent unnecessary additional requests.
View DocumentationHistorical Datasets for Predicting Gender from Names
The historical datasets in this package are used in the gender package to predict gender from first names and birth years.
View DocumentationClient for CAMS Radiation Service
Copernicus Atmosphere Monitoring Service (CAMS) Radiation Service provides time series of global, direct, and diffuse irradiations on horizontal surface, and direct irradiation on normal plane for the actual weather conditions as well as for clear-sky conditions. The geographical coverage is the field-of-view of the Meteosat satellite, roughly speaking Europe, Africa, Atlantic Ocean, Middle East. The time coverage of data is from 2004-02-01 up to 2 days ago. Data are available with a time step ranging from 15 min to 1 month. For license terms and to create an account, please see http://www.soda-pro.com/web-services/radiation/cams-radiation-service.
Scientific use casesClient for the Bittrex Exchange
A client for the Bittrex crypto-currency exchange https://bittrex.com including the ability to query trade data, manage account balances, and place orders.
View Documentationprogrammatic interface to the AntWeb
A complete programmatic interface to the AntWeb database from the California Academy of Sciences.
Scientific use casesAntarctic Geographic Place Names
Antarctic geographic names from the Composite Gazetteer of Antarctica, and functions for working with those place names.
View DocumentationInterface to the National Phenology Network API
Programmatic interface to the Web Service methods provided by the National Phenology Network (https://usanpn.org/), which includes data on various life history events that occur at specific times.
View DocumentationAccesses the Monkeylearn API for Text Classifiers and Extractors
Allows using some services of Monkeylearn http://monkeylearn.com/ which is a Machine Learning platform on the cloud for text analysis (classification and extraction).
View DocumentationOpenBIS API Access to the InfectX Data Repository
The Open Source Biology Information System (openBIS) is a general purpose framework for management, annotation and publication of large data sets that arise from biological experiments. By making the JSON-RPC based openBIS API available to R, image-based high throughput screening data as generated by the InfectX/TargetInfectX projects can be browsed, searched and downloaded directly from R. Currently, several kinome-wide RNA interference screens performed on HeLa cells in presence of a selection of bacterial and viral pathogens and using oligo libraries form multiple vendors are available. Further genome-wide screens are forthcoming. The full data obtained from these experiments is accessible, including raw microscopy images, object segmentation masks, single cell feature data generated by CellProfiler and infection scoring data, alongside rich meta data and quality control data.
View DocumentationWorking with GTFS (General Transit Feed Specification) feeds in R
Provides API wrappers for popular public GTFS feed sharing sites, reads feed data into a gtfs data object, validates data quality, provides convenience functions for common tasks.
View DocumentationInterface to Chromosome Counts Database API
A programmatic interface to the Chromosome Counts Database (http://ccdb.tau.ac.il/). This package is part of the rOpenSci suite (https://ropensci.org).
Scientific use casesRead EPUB File Metadata and Text
Provides functions supporting the reading and parsing of internal e-book content from EPUB files. The epubr package provides functions supporting the reading and parsing of internal e-book content from EPUB files. E-book metadata and text content are parsed separately and joined together in a tidy, nested tibble data frame. E-book formatting is not completely standardized across all literature. It can be challenging to curate parsed e-book content across an arbitrary collection of e-books perfectly and in completely general form, to yield a singular, consistently formatted output. Many EPUB files do not even contain all the same pieces of information in their respective metadata. EPUB file parsing functionality in this package is intended for relatively general application to arbitrary EPUB e-books. However, poorly formatted e-books or e-books with highly uncommon formatting may not work with this package. There may even be cases where an EPUB file has DRM or some other property that makes it impossible to read with epubr. Text is read as is for the most part. The only nominal changes are minor substitutions, for example curly quotes changed to straight quotes. Substantive changes are expected to be performed subsequently by the user as part of their text analysis. Additional text cleaning can be performed at the users discretion, such as with functions from packages like tm or qdap’.
View DocumentationPopler R Package
Browse and query the popler database.
View DocumentationEntrez in R
Provides an R interface to the NCBIs EUtils’ API, allowing users to search databases like GenBank https://www.ncbi.nlm.nih.gov/genbank/ and PubMed https://www.ncbi.nlm.nih.gov/pubmed/, process the results of those searches and pull data into their R sessions.
Scientific use casesDBHYDRO Hydrologic and Water Quality Data
Client for programmatic access to the South Florida Water Management Districts DBHYDRO’ database at https://www.sfwmd.gov/science-data/dbhydro, with functions for accessing hydrologic and water quality data.
View DocumentationHistorical and Contemporary Boundaries of the United States of America
The boundaries for geographical units in the United States of America contained in this package include state, county, congressional district, and zip code tabulation area. Contemporary boundaries are provided by the U.S. Census Bureau (public domain). Historical boundaries for the years from 1629 to 2000 are provided form the Newberry Librarys Atlas of Historical County Boundaries’ (licensed CC BY-NC-SA). Additional data is provided in the USAboundariesData package; this package provides an interface to access that data.
View DocumentationAustralian Popular Baby Names
Data on the most popular baby names in Australia.
View DocumentationR client to Joint Research Centre's DOPA REST API
R client for REST web services of DOPA (Digital Observatory for protected Areas) by the European Union Joint Research Centre.
View DocumentationDatasets for the USAboundaries package
Contains datasets, including higher resolution boundary data, for use in the USAboundaries package. These datasets come from the U.S. Census Bureau, the Newberry Librarys Historical Atlas of U.S. County Boundaries, and Erik Steiners ‘United States Historical City Populations, 1790-2010’.
View DocumentationClient for the Index Database of Remote Sensing Indices
Index Database (http://www.indexdatabase.de/) of remote sensing indices.
View DocumentationGet data related to transportation and cultural places from Rio de Janeiro, Brazil.
Get data related to transportation and cultural places from Rio de Janeiro, Brazil.
View Documentation