visdat - Preliminary Visualisation of Data

Create preliminary exploratory data visualisations of an entire dataset to identify problems or unexpected features using 'ggplot2'.

Last updated 4 months ago

exploratory-data-analysismissingnesspeer-reviewedropenscivisualisation

12.77 score 453 stars 8 packages 1.8k scripts 21k downloads

stplanr - Sustainable Transport Planning

Tools for transport planning with an emphasis on spatial transport data and non-motorized modes. The package was originally developed to support the 'Propensity to Cycle Tool', a publicly available strategic cycle network planning tool (Lovelace et al. 2017) <doi:10.5198/jtlu.2016.862>, but has since been extended to support public transport routing and accessibility analysis (Moreno-Monroy et al. 2017) <doi:10.1016/j.jtrangeo.2017.08.012> and routing with locally hosted routing engines such as 'OSRM' (Lowans et al. 2023) <doi:10.1016/j.enconman.2023.117337>. The main functions are for creating and manipulating geographic "desire lines" from origin-destination (OD) data (building on the 'od' package); calculating routes on the transport network locally and via interfaces to routing services such as <https://cyclestreets.net/> (Desjardins et al. 2021) <doi:10.1007/s11116-021-10197-1>; and calculating route segment attributes such as bearing. The package implements the 'travel flow aggregration' method described in Morgan and Lovelace (2020) <doi:10.1177/2399808320942779> and the 'OD jittering' method described in Lovelace et al. (2022) <doi:10.32866/001c.33873>. Further information on the package's aim and scope can be found in the vignettes and in a paper in the R Journal (Lovelace and Ellison 2018) <doi:10.32614/RJ-2018-053>, and in a paper outlining the landscape of open source software for geographic methods in transport planning (Lovelace, 2021) <doi:10.1007/s10109-020-00342-2>.

Last updated 3 months ago

cyclecyclingdesire-linesorigin-destinationpeer-reviewedpubic-transportroute-networkroutesroutingspatialtransporttransport-planningtransportationwalking

12.17 score 421 stars 2 packages 660 scripts 1.7k downloads

openalexR - Getting Bibliographic Records from 'OpenAlex' Database Using 'DSL' API

A set of tools to extract bibliographic content from 'OpenAlex' database using API <https://docs.openalex.org>.

Last updated 2 days ago

bibliographic-databibliographic-databasebibliometricsbibliometrixscience-mapping

10.13 score 102 stars 5 packages 168 scripts 9.9k downloads

rredlist - 'IUCN' Red List Client

'IUCN' Red List (<https://api.iucnredlist.org/>) client. The 'IUCN' Red List is a global list of threatened and endangered species. Functions cover all of the Red List 'API' routes. An 'API' key is required.

Last updated 4 days ago

iucnbiodiversityapiweb-servicestraitshabitatspeciesconservationapi-wrapperiucn-red-listtaxize

10.09 score 49 stars 25 packages 212 scripts 1.7k downloads

bib2df - Parse a BibTeX File to a Data Frame

Parse a BibTeX file to a data.frame to make it accessible for further analysis and visualization.

Last updated 3 months ago

bibtexpeer-reviewed

9.76 score 99 stars 6 packages 211 scripts 1.1k downloads

bibtex - Bibtex Parser

Utility to parse a bibtex file.

Last updated 12 months ago

bibtexparser

9.69 score 36 stars 16 packages 552 scripts 7.3k downloads

iheatmapr - Interactive, Complex Heatmaps

Make complex, interactive heatmaps. 'iheatmapr' includes a modular system for iteratively building up complex heatmaps, as well as the iheatmap() function for making relatively standard heatmaps.

Last updated 3 months ago

heatmapplotlyinteractive-visualizationsdata-visualizationhtmlwidgetspeer-reviewed

9.07 score 267 stars 1 packages 97 scripts 625 downloads

weatherOz - An API Client for Australian Weather and Climate Data Resources

Provides automated downloading, parsing and formatting of weather data for Australia through API endpoints provided by the Department of Primary Industries and Regional Development ('DPIRD') of Western Australia and by the Science and Technology Division of the Queensland Government's Department of Environment and Science ('DES'). As well as the Bureau of Meteorology ('BOM') of the Australian government precis and coastal forecasts, agriculture bulletin data, and downloading and importing radar and satellite imagery files. 'DPIRD' weather data are accessed through public 'APIs' provided by 'DPIRD', <https://www.agric.wa.gov.au/weather-api-20>, providing access to weather station data from the 'DPIRD' weather station network. Australia-wide weather data are based on data from the Australian Bureau of Meteorology ('BOM') data and accessed through 'SILO' (Scientific Information for Land Owners) Jeffrey et al. (2001) <doi:10.1016/S1364-8152(01)00008-1>. 'DPIRD' data are made available under a Creative Commons Attribution 3.0 Licence (CC BY 3.0 AU) license <https://creativecommons.org/licenses/by/3.0/au/deed.en>. SILO data are released under a Creative Commons Attribution 4.0 International licence (CC BY 4.0) <https://creativecommons.org/licenses/by/4.0/>. 'BOM' data are (c) Australian Government Bureau of Meteorology and released under a Creative Commons (CC) Attribution 3.0 licence or Public Access Licence ('PAL') as appropriate, see <http://www.bom.gov.au/other/copyright.shtml> for further details.

Last updated 11 days ago

dpirdbommeteorological-dataweather-forecastaustraliaweatherweather-datameteorologywestern-australiaaustralia-bureau-of-meteorologywestern-australia-agricultureaustralia-agricultureaustralia-climateaustralia-weatherapi-clientclimatedatarainfallweather-api

8.49 score 26 stars 37 scripts 263 downloads

GSODR - Global Surface Summary of the Day ('GSOD') Weather Data Client

Provides automated downloading, parsing, cleaning, unit conversion and formatting of Global Surface Summary of the Day ('GSOD') weather data from the from the USA National Centers for Environmental Information ('NCEI'). Units are converted from from United States Customary System ('USCS') units to International System of Units ('SI'). Stations may be individually checked for number of missing days defined by the user, where stations with too many missing observations are omitted. Only stations with valid reported latitude and longitude values are permitted in the final data. Additional useful elements, saturation vapour pressure ('es'), actual vapour pressure ('ea') and relative humidity ('RH') are calculated from the original data using the improved August-Roche-Magnus approximation (Alduchov & Eskridge 1996) and included in the final data set. The resulting metadata include station identification information, country, state, latitude, longitude, elevation, weather observations and associated flags. For information on the 'GSOD' data from 'NCEI', please see the 'GSOD' 'readme.txt' file available from, <https://www1.ncdc.noaa.gov/pub/data/gsod/readme.txt>.

Last updated 30 days ago

us-nceimeteorological-dataglobal-weatherweatherweather-datameteorologystation-datasurface-weatherdata-accessus-ncdcdaily-datadaily-weatherglobal-datagsodhistorical-datahistorical-weatherncdcnceiweather-informationweather-stations

8.41 score 90 stars 100 scripts 1.5k downloads

fingertipsR - Fingertips Data for Public Health

Fingertips (<http://fingertips.phe.org.uk/>) contains data for many indicators of public health in England. The underlying data is now more easily accessible by making use of the API.

Last updated 9 months ago

api-wrapperfingertipshealthopen-datapeer-reviewedpublic-healthpublic-health-england

8.15 score 92 stars 1 packages 255 scripts 316 downloads

osmplotr - Bespoke Images of 'OpenStreetMap' Data

Bespoke images of 'OpenStreetMap' ('OSM') data and data visualisation using 'OSM' objects.

Last updated 26 days ago

data-visualisationhighlighting-clustersopenstreetmaposmoverpassoverpass-apipeer-reviewed

7.88 score 135 stars 78 scripts 172 downloads

osmapiR - 'OpenStreetMap' API

Interface to 'OpenStreetMap API' for fetching and saving data from/to the 'OpenStreetMap' database (<https://wiki.openstreetmap.org/wiki/API_v0.6>).

Last updated 3 days ago

open street mapopenstreetmaposmopenstreetmap-apiosmapiapi

7.41 score 18 stars 6 scripts 219 downloads

jstor - Read Data from JSTOR/DfR

Functions and helpers to import metadata, ngrams and full-texts delivered by Data for Research by JSTOR.

Last updated 4 months ago

jstorpeer-reviewedtext-analysistext-mining

7.29 score 47 stars 55 scripts 318 downloads

occCite - Querying and Managing Large Biodiversity Occurrence Datasets

Facilitates the gathering of biodiversity occurrence data from disparate sources. Metadata is managed throughout the process to facilitate reporting and enhanced ability to repeat analyses.

Last updated 18 days ago

biodiversity-databiodiversity-informaticsbiodiversity-standardscitationsmuseum-collection-specimensmuseum-collectionsmuseum-metadata

7.23 score 21 stars 40 scripts 312 downloads

FedData - Download Geospatial Data Available from Several Federated Data Sources

Download geospatial data available from several federated data sources (mainly sources maintained by the US Federal government). Currently, the package enables extraction from nine datasets: The National Elevation Dataset digital elevation models (<https://www.usgs.gov/3d-elevation-program> 1 and 1/3 arc-second; USGS); The National Hydrography Dataset (<https://www.usgs.gov/national-hydrography/national-hydrography-dataset>; USGS); The Soil Survey Geographic (SSURGO) database from the National Cooperative Soil Survey (<https://websoilsurvey.sc.egov.usda.gov/>; NCSS), which is led by the Natural Resources Conservation Service (NRCS) under the USDA; the Global Historical Climatology Network (<https://www.ncei.noaa.gov/products/land-based-station/global-historical-climatology-network-daily>; GHCN), coordinated by National Climatic Data Center at NOAA; the Daymet gridded estimates of daily weather parameters for North America, version 4, available from the Oak Ridge National Laboratory's Distributed Active Archive Center (<https://daymet.ornl.gov/>; DAAC); the International Tree Ring Data Bank; the National Land Cover Database (<https://www.mrlc.gov/>; NLCD); the Cropland Data Layer from the National Agricultural Statistics Service (<https://www.nass.usda.gov/Research_and_Science/Cropland/SARS1a.php>; NASS); and the PAD-US dataset of protected area boundaries (<https://www.usgs.gov/programs/gap-analysis-project/science/pad-us-data-overview>; USGS).

Last updated 8 days ago

peer-reviewed

7.21 score 97 stars 299 scripts 403 downloads

tradestatistics - Open Trade Statistics API Wrapper and Utility Program

Access 'Open Trade Statistics' API from R to download international trade data.

Last updated 3 months ago

api-wrapperdata-tableinternational-tradejsonliteopen-trade-statistics

7.11 score 76 stars 84 scripts 378 downloads

bowerbird - Keep a Collection of Sparkly Data Resources

Tools to get and maintain a data repository from third-party data providers.

Last updated 2 months ago

ropensciantarcticsouthern oceandataenvironmentalsatelliteclimatepeer-reviewed

7.10 score 47 stars 1 packages 15 scripts

riem - Accesses Weather Data from the Iowa Environment Mesonet

Allows to get weather data from Automated Surface Observing System (ASOS) stations (airports) in the whole world thanks to the Iowa Environment Mesonet website.

Last updated 2 months ago

airportsasosiowa-environment-mesonetmetarpeer-reviewedtemperatureweatherweather-api

6.78 score 43 stars 186 scripts 295 downloads

natserv - 'NatureServe' Interface

Interface to 'NatureServe' (<https://www.natureserve.org/>). Includes methods to get data, image metadata, search taxonomic names, and make maps.

Last updated 10 months ago

taxonomyspeciesapiweb-servicesnatureservemetadatamapstaxize

6.59 score 10 stars 27 packages 18 scripts 1.8k downloads

waywiser - Ergonomic Methods for Assessing Spatial Models

Assessing predictive models of spatial data can be challenging, both because these models are typically built for extrapolating outside the original region represented by training data and due to potential spatially structured errors, with "hot spots" of higher than expected error clustered geographically due to spatial structure in the underlying data. Methods are provided for assessing models fit to spatial data, including approaches for measuring the spatial structure of model errors, assessing model predictions at multiple spatial scales, and evaluating where predictions can be made safely. Methods are particularly useful for models fit using the 'tidymodels' framework. Methods include Moran's I ('Moran' (1950) <doi:10.2307/2332142>), Geary's C ('Geary' (1954) <doi:10.2307/2986645>), Getis-Ord's G ('Ord' and 'Getis' (1995) <doi:10.1111/j.1538-4632.1995.tb00912.x>), agreement coefficients from 'Ji' and Gallo (2006) (<doi: 10.14358/PERS.72.7.823>), agreement metrics from 'Willmott' (1981) (<doi: 10.1080/02723646.1981.10642213>) and 'Willmott' 'et' 'al'. (2012) (<doi: 10.1002/joc.2419>), an implementation of the area of applicability methodology from 'Meyer' and 'Pebesma' (2021) (<doi:10.1111/2041-210X.13650>), and an implementation of multi-scale assessment as described in 'Riemann' 'et' 'al'. (2010) (<doi:10.1016/j.rse.2010.05.010>).

Last updated 4 months ago

spatialspatial-analysistidymodelstidyverse

6.59 score 38 stars 17 scripts 279 downloads

epubr - Read EPUB File Metadata and Text

Provides functions supporting the reading and parsing of internal e-book content from EPUB files. The 'epubr' package provides functions supporting the reading and parsing of internal e-book content from EPUB files. E-book metadata and text content are parsed separately and joined together in a tidy, nested tibble data frame. E-book formatting is not completely standardized across all literature. It can be challenging to curate parsed e-book content across an arbitrary collection of e-books perfectly and in completely general form, to yield a singular, consistently formatted output. Many EPUB files do not even contain all the same pieces of information in their respective metadata. EPUB file parsing functionality in this package is intended for relatively general application to arbitrary EPUB e-books. However, poorly formatted e-books or e-books with highly uncommon formatting may not work with this package. There may even be cases where an EPUB file has DRM or some other property that makes it impossible to read with 'epubr'. Text is read 'as is' for the most part. The only nominal changes are minor substitutions, for example curly quotes changed to straight quotes. Substantive changes are expected to be performed subsequently by the user as part of their text analysis. Additional text cleaning can be performed at the user's discretion, such as with functions from packages like 'tm' or 'qdap'.

Last updated 2 months ago

epubepub-filesepub-formatpeer-reviewed

6.35 score 24 stars 47 scripts 399 downloads

Rpolyhedra - Polyhedra Database

A polyhedra database scraped from various sources as R6 objects and 'rgl' visualizing capabilities.

Last updated 15 days ago

geometrypolyhedra-databasergl

6.21 score 12 stars 30 scripts 481 downloads

pangaear - Client for the 'Pangaea' Database

Tools to interact with the 'Pangaea' Database (<https://www.pangaea.de>), including functions for searching for data, fetching 'datasets' by 'dataset' 'ID', and working with the 'Pangaea' 'OAI-PMH' service.

Last updated 2 years ago

pangaeaenvironmental scienceearth sciencearchivepaleontologyecologychemistryatmosphereapi-clientdatapaleobiologyscientificwebservice-client

6.18 score 21 stars 24 scripts 784 downloads

rb3 - Download and Parse Public Data Released by B3 Exchange

Download and parse public files released by B3 and convert them into useful formats and data structures common to data analysis practitioners.

Last updated 3 months ago

brazilexchange-datafinancefinancial-datafinancial-servicesmarket-data

6.03 score 71 stars 50 scripts 174 downloads

rtika - R Interface to 'Apache Tika'

Extract text or metadata from over a thousand file types, using Apache Tika <https://tika.apache.org/>. Get either plain text or structured XHTML content.

Last updated 2 years ago

extract-metadataextract-textjavaparsepdf-filespeer-reviewedtesseracttika

5.99 score 54 stars 12 scripts 225 downloads

workloopR - Analysis of Work Loops and Other Data from Muscle Physiology Experiments

Functions for the import, transformation, and analysis of data from muscle physiology experiments. The work loop technique is used to evaluate the mechanical work and power output of muscle. Josephson (1985) <doi:10.1242/jeb.114.1.493> modernized the technique for application in comparative biomechanics. Although our initial motivation was to provide functions to analyze work loop experiment data, as we developed the package we incorporated the ability to analyze data from experiments that are often complementary to work loops. There are currently three supported experiment types: work loops, simple twitches, and tetanus trials. Data can be imported directly from .ddf files or via an object constructor function. Through either method, data can then be cleaned or transformed via methods typically used in studies of muscle physiology. Data can then be analyzed to determine the timing and magnitude of force development and relaxation (for isometric trials) or the magnitude of work, net power, and instantaneous power among other things (for work loops). Although we do not provide plotting functions, all resultant objects are designed to be friendly to visualization via either base-R plotting or 'tidyverse' functions. This package has been peer-reviewed by rOpenSci (v. 1.1.0).

Last updated 4 months ago

ddfmuscle-forcemuscle-physiology-experimentstetanuswork-loopworkloop

5.92 score 3 stars 46 scripts 147 downloads

virtuoso - Interface to 'Virtuoso' using 'ODBC'

Provides users with a simple and convenient mechanism to manage and query a 'Virtuoso' database using the 'DBI' (Data-Base Interface) compatible 'ODBC' (Open Database Connectivity) interface. 'Virtuoso' is a high-performance "universal server," which can act as both a relational database, supporting standard Structured Query Language ('SQL') queries, while also supporting data following the Resource Description Framework ('RDF') model for Linked Data. 'RDF' data can be queried using 'SPARQL' ('SPARQL' Protocol and 'RDF' Query Language) queries, a graph-based query that supports semantic reasoning. This allows users to leverage the performance of local or remote 'Virtuoso' servers using popular 'R' packages such as 'DBI' and 'dplyr', while also providing a high-performance solution for working with large 'RDF' 'triplestores' from 'R.' The package also provides helper routines to install, launch, and manage a 'Virtuoso' server locally on 'Mac', 'Windows' and 'Linux' platforms using the standard interactive installers from the 'R' command-line. By automatically handling these setup steps, the package can make using 'Virtuoso' considerably faster and easier for a most users to deploy in a local environment. Managing the bulk import of triples from common serializations with a single intuitive command is another key feature of this package. Bulk import performance can be tens to hundreds of times faster than the comparable imports using existing 'R' tools, including 'rdflib' and 'redland' packages.

Last updated 7 months ago

5.61 score 9 stars 15 scripts 176 downloads

concstats - Market Structure, Concentration and Inequality Measures

Based on individual market shares of all participants in a market or space, the package offers a set of different structural and concentration measures frequently - and not so frequently - used in research and in practice. Measures can be calculated in groups or individually. The calculated measure or the resulting vector in table format should help practitioners make more informed decisions. Methods used in this package are from: 1. Chang, E. J., Guerra, S. M., de Souza Penaloza, R. A. & Tabak, B. M. (2005) "Banking concentration: the Brazilian case". 2. Cobham, A. and A. Summer (2013). "Is It All About the Tails? The Palma Measure of Income Inequality". 3. Garcia Alba Idunate, P. (1994). "Un Indice de dominancia para el analisis de la estructura de los mercados". 4. Ginevicius, R. and S. Cirba (2009). "Additive measurement of market concentration" <doi:10.3846/1611-1699.2009.10.191-198>. 5. Herfindahl, O. C. (1950), "Concentration in the steel industry" (PhD thesis). 6. Hirschmann, A. O. (1945), "National power and structure of foreign trade". 7. Melnik, A., O. Shy, and R. Stenbacka (2008), "Assessing market dominance" <doi:10.1016/j.jebo.2008.03.010>. 8. Palma, J. G. (2006). "Globalizing Inequality: 'Centrifugal' and 'Centripetal' Forces at Work". 9. Shannon, C. E. (1948). "A Mathematical Theory of Communication". 10. Simpson, E. H. (1949). "Measurement of Diversity" <doi:10.1038/163688a0>.

Last updated 8 months ago

business-analyticscompetitionconcentrationdiversityinequalitypackage-development

5.32 score 7 stars 15 scripts 193 downloads

hydroscoper - Interface to the Greek National Data Bank for Hydrometeorological Information

R interface to the Greek National Data Bank for Hydrological and Meteorological Information. It covers Hydroscope's data sources and provides functions to transliterate, translate and download them into tidy dataframes.

Last updated 4 months ago

climategreecehydrologyhydrometeorologyhydroscopemeteorological-datameteorological-stationspeer-reviewedtidy-datatime-serieswater-resources

5.23 score 13 stars 33 scripts 136 downloads

smapr - Acquisition and Processing of NASA Soil Moisture Active-Passive (SMAP) Data

Facilitates programmatic access to NASA Soil Moisture Active Passive (SMAP) data with R. It includes functions to search for, acquire, and extract SMAP data.

Last updated 2 years ago

acquisitionextract-datanasapeer-reviewedrastersmap-datasoil-mappingsoil-moisturesoil-moisture-sensor

4.90 score 79 stars 20 scripts 83 downloads

rperseus - Get Texts from the Perseus Digital Library

The Perseus Digital Library is a collection of classical texts. This package helps you get them. The available works can also be viewed here: <http://cts.perseids.org/>.

Last updated 1 years ago

classicsgreekgreek-biblegreek-new-testamentlatinpeer-reviewedperseusperseus-digital-librarytranslation

4.70 score 18 stars 28 scripts

antanym - Antarctic Geographic Place Names

Antarctic geographic names from the Composite Gazetteer of Antarctica, and functions for working with those place names.

Last updated 2 years ago

antarcticsouthern oceanplace namesgazetteerpeer-reviewed

3.89 score 7 stars 22 scripts

popler - Popler R Package

Browse and query the popler database.

Last updated 5 years ago

3.88 score 8 stars 47 scripts

ReLTER - An Interface for the eLTER Community

ReLTER provides access to DEIMS-SDR (https://deims.org/), and allows interaction with data and software implemented by eLTER Research Infrastructure (RI) thus improving data sharing among European LTER projects. ReLTER uses the R language to access and interact with the DEIMS-SDR archive of information shared by the Long Term Ecological Research (LTER) network. This package grew within eLTER H2020 as a major project that will help advance the development of European Long-Term Ecosystem Research Infrastructures (eLTER RI - https://elter-ri.eu). The ReLTER package functions in particular allow to: - retrieve the information about entities (e.g. sites, datasets, and activities) shared by DEIMS-SDR (see e.g. get_site_info function); - interact with the [ODSEurope](maps.opendatascience.eu) starting with the dataset shared by [DEIMS-SDR](https://deims.org/) (see e.g. [get_site_ODS](https://docs.ropensci.org/ReLTER/reference/get_site_ODS.html) function); - use the eLTER site informations to download and crop geospatial data from other platforms (see e.g. get_site_ODS function); - improve the quality of the dataset (see e.g. get_id_worms). Functions currently implemented are derived from discussions of the needs among the eLTER users community. The ReLTER package will continue to follow the progress of eLTER-RI and evolve, adding new tools and improvements as required.

Last updated 11 months ago

biodiversity-informaticsdata-scienceecologyelterresearch-infrastructure

3.68 score 12 stars 4 scripts