visdat - Preliminary Visualisation of Data

Create preliminary exploratory data visualisations of an entire dataset to identify problems or unexpected features using 'ggplot2'.

Last updated 9 days ago

exploratory-data-analysismissingnesspeer-reviewedropenscivisualisation

449 stars 7.34 score 47 dependencies 8 dependents

stplanr - Sustainable Transport Planning

Tools for transport planning with an emphasis on spatial transport data and non-motorized modes. The package was originally developed to support the 'Propensity to Cycle Tool', a publicly available strategic cycle network planning tool (Lovelace et al. 2017) <doi:10.5198/jtlu.2016.862>, but has since been extended to support public transport routing and accessibility analysis (Moreno-Monroy et al. 2017) <doi:10.1016/j.jtrangeo.2017.08.012> and routing with locally hosted routing engines such as 'OSRM' (Lowans et al. 2023) <doi:10.1016/j.enconman.2023.117337>. The main functions are for creating and manipulating geographic "desire lines" from origin-destination (OD) data (building on the 'od' package); calculating routes on the transport network locally and via interfaces to routing services such as <https://cyclestreets.net/> (Desjardins et al. 2021) <doi:10.1007/s11116-021-10197-1>; and calculating route segment attributes such as bearing. The package implements the 'travel flow aggregration' method described in Morgan and Lovelace (2020) <doi:10.1177/2399808320942779> and the 'OD jittering' method described in Lovelace et al. (2022) <doi:10.32866/001c.33873>. Further information on the package's aim and scope can be found in the vignettes and in a paper in the R Journal (Lovelace and Ellison 2018) <doi:10.32614/RJ-2018-053>, and in a paper outlining the landscape of open source software for geographic methods in transport planning (Lovelace, 2021) <doi:10.1007/s10109-020-00342-2>.

Last updated 3 months ago

cyclecyclingdesire-linesorigin-destinationpeer-reviewedpubic-transportroute-networkroutesroutingspatialtransporttransport-planningtransportationwalking

414 stars 7.11 score 47 dependencies 2 dependents

openalexR - Getting Bibliographic Records from 'OpenAlex' Database Using 'DSL' API

A set of tools to extract bibliographic content from 'OpenAlex' database using API <https://docs.openalex.org>.

Last updated 22 days ago

bibliographic-databibliographic-databasebibliometricsbibliometrixscience-mapping

89 stars 4.74 score 23 dependencies 5 dependents

osmplotr - Bespoke Images of 'OpenStreetMap' Data

Bespoke images of 'OpenStreetMap' ('OSM') data and data visualisation using 'OSM' objects.

Last updated 10 days ago

data-visualisationhighlighting-clustersopenstreetmaposmoverpassoverpass-apipeer-reviewed

132 stars 4.68 score 86 dependencies

natserv - 'NatureServe' Interface

Interface to 'NatureServe' (<https://www.natureserve.org/>). Includes methods to get data, image metadata, search taxonomic names, and make maps.

Last updated 6 months ago

taxonomyspeciesapiweb-servicesnatureservemetadatamapstaxize

10 stars 4.21 score 20 dependencies 30 dependents

fingertipsR - Fingertips Data for Public Health

Fingertips (<http://fingertips.phe.org.uk/>) contains data for many indicators of public health in England. The underlying data is now more easily accessible by making use of the API.

Last updated 5 months ago

api-wrapperfingertipshealthopen-datapeer-reviewedpublic-healthpublic-health-england

91 stars 4.13 score 67 dependencies 1 dependents

GSODR - Global Surface Summary of the Day ('GSOD') Weather Data Client

Provides automated downloading, parsing, cleaning, unit conversion and formatting of Global Surface Summary of the Day ('GSOD') weather data from the from the USA National Centers for Environmental Information ('NCEI'). Units are converted from from United States Customary System ('USCS') units to International System of Units ('SI'). Stations may be individually checked for number of missing days defined by the user, where stations with too many missing observations are omitted. Only stations with valid reported latitude and longitude values are permitted in the final data. Additional useful elements, saturation vapour pressure ('es'), actual vapour pressure ('ea') and relative humidity ('RH') are calculated from the original data using the improved August-Roche-Magnus approximation (Alduchov & Eskridge 1996) and included in the final data set. The resulting metadata include station identification information, country, state, latitude, longitude, elevation, weather observations and associated flags. For information on the 'GSOD' data from 'NCEI', please see the 'GSOD' 'readme.txt' file available from, <https://www1.ncdc.noaa.gov/pub/data/gsod/readme.txt>.

Last updated 2 days ago

us-nceimeteorological-dataglobal-weatherweatherweather-datameteorologystation-datasurface-weatherdata-accessus-ncdcdaily-datadaily-weatherglobal-datagsodhistorical-datahistorical-weatherncdcnceiweather-informationweather-stations

88 stars 4.10 score 6 dependencies

bibtex - Bibtex Parser

Utility to parse a bibtex file.

Last updated 8 months ago

bibtexparser

35 stars 4.10 score 1 dependencies 15 dependents

tradestatistics - Open Trade Statistics API Wrapper and Utility Program

Access 'Open Trade Statistics' API from R to download international trade data.

Last updated 1 years ago

api-wrapperdata-tableinternational-tradejsonliteopen-trade-statistics

75 stars 3.73 score 15 dependencies

smapr - Acquisition and Processing of NASA Soil Moisture Active-Passive (SMAP) Data

Facilitates programmatic access to NASA Soil Moisture Active Passive (SMAP) data with R. It includes functions to search for, acquire, and extract SMAP data.

Last updated 1 years ago

acquisitionextract-datanasapeer-reviewedrastersmap-datasoil-mappingsoil-moisturesoil-moisture-sensor

79 stars 3.64 score 30 dependencies

rtika - R Interface to 'Apache Tika'

Extract text or metadata from over a thousand file types, using Apache Tika <https://tika.apache.org/>. Get either plain text or structured XHTML content.

Last updated 1 years ago

extract-metadataextract-textjavaparsepdf-filespeer-reviewedtesseracttika

54 stars 3.24 score 4 dependencies

bowerbird - Keep a Collection of Sparkly Data Resources

Tools to get and maintain a data repository from third-party data providers.

Last updated 1 days ago

ropensciantarcticsouthern oceandataenvironmentalsatelliteclimatepeer-reviewed

46 stars 3.24 score 100 dependencies 1 dependents

jstor - Read Data from JSTOR/DfR

Functions and helpers to import metadata, ngrams and full-texts delivered by Data for Research by JSTOR.

Last updated 10 days ago

jstorpeer-reviewedtext-analysistext-mining

46 stars 3.14 score 42 dependencies

weatherOz - An API Client for Australian Weather and Climate Data Resources

Provides automated downloading, parsing and formatting of weather data for Australia through API endpoints provided by the Department of Primary Industries and Regional Development ('DPIRD') of Western Australia and by the Science and Technology Division of the Queensland Government's Department of Environment and Science ('DES'). As well as the Bureau of Meteorology ('BOM') of the Australian government precis and coastal forecasts, agriculture bulletin data, and downloading and importing radar and satellite imagery files. 'DPIRD' weather data are accessed through public 'APIs' provided by 'DPIRD', <https://www.agric.wa.gov.au/weather-api-20>, providing access to weather station data from the 'DPIRD' weather station network. Australia-wide weather data are based on data from the Australian Bureau of Meteorology ('BOM') data and accessed through 'SILO' (Scientific Information for Land Owners) Jeffrey et al. (2001) <doi:10.1016/S1364-8152(01)00008-1>. 'DPIRD' data are made available under a Creative Commons Attribution 3.0 Licence (CC BY 3.0 AU) license <https://creativecommons.org/licenses/by/3.0/au/deed.en>. SILO data are released under a Creative Commons Attribution 4.0 International licence (CC BY 4.0) <https://creativecommons.org/licenses/by/4.0/>. 'BOM' data are (c) Australian Government Bureau of Meteorology and released under a Creative Commons (CC) Attribution 3.0 licence or Public Access Licence ('PAL') as appropriate, see <http://www.bom.gov.au/other/copyright.shtml> for further details.

Last updated 11 days ago

dpirdbommeteorological-dataweather-forecastaustraliaweatherweather-datameteorologywestern-australiaaustralia-bureau-of-meteorologywestern-australia-agricultureaustralia-agricultureaustralia-climateaustralia-weatherapi-clientclimatedatarainfallweather-api

21 stars 3.08 score 55 dependencies

waywiser - Ergonomic Methods for Assessing Spatial Models

Assessing predictive models of spatial data can be challenging, both because these models are typically built for extrapolating outside the original region represented by training data and due to potential spatially structured errors, with "hot spots" of higher than expected error clustered geographically due to spatial structure in the underlying data. Methods are provided for assessing models fit to spatial data, including approaches for measuring the spatial structure of model errors, assessing model predictions at multiple spatial scales, and evaluating where predictions can be made safely. Methods are particularly useful for models fit using the 'tidymodels' framework. Methods include Moran's I ('Moran' (1950) <doi:10.2307/2332142>), Geary's C ('Geary' (1954) <doi:10.2307/2986645>), Getis-Ord's G ('Ord' and 'Getis' (1995) <doi:10.1111/j.1538-4632.1995.tb00912.x>), agreement coefficients from 'Ji' and Gallo (2006) (<doi: 10.14358/PERS.72.7.823>), agreement metrics from 'Willmott' (1981) (<doi: 10.1080/02723646.1981.10642213>) and 'Willmott' 'et' 'al'. (2012) (<doi: 10.1002/joc.2419>), an implementation of the area of applicability methodology from 'Meyer' and 'Pebesma' (2021) (<doi:10.1111/2041-210X.13650>), and an implementation of multi-scale assessment as described in 'Riemann' 'et' 'al'. (2010) (<doi:10.1016/j.rse.2010.05.010>).

Last updated 18 days ago

spatialspatial-analysistidymodelstidyverse

35 stars 2.90 score 44 dependencies

virtuoso - Interface to 'Virtuoso' using 'ODBC'

Provides users with a simple and convenient mechanism to manage and query a 'Virtuoso' database using the 'DBI' (Data-Base Interface) compatible 'ODBC' (Open Database Connectivity) interface. 'Virtuoso' is a high-performance "universal server," which can act as both a relational database, supporting standard Structured Query Language ('SQL') queries, while also supporting data following the Resource Description Framework ('RDF') model for Linked Data. 'RDF' data can be queried using 'SPARQL' ('SPARQL' Protocol and 'RDF' Query Language) queries, a graph-based query that supports semantic reasoning. This allows users to leverage the performance of local or remote 'Virtuoso' servers using popular 'R' packages such as 'DBI' and 'dplyr', while also providing a high-performance solution for working with large 'RDF' 'triplestores' from 'R.' The package also provides helper routines to install, launch, and manage a 'Virtuoso' server locally on 'Mac', 'Windows' and 'Linux' platforms using the standard interactive installers from the 'R' command-line. By automatically handling these setup steps, the package can make using 'Virtuoso' considerably faster and easier for a most users to deploy in a local environment. Managing the bulk import of triples from common serializations with a single intuitive command is another key feature of this package. Bulk import performance can be tens to hundreds of times faster than the comparable imports using existing 'R' tools, including 'rdflib' and 'redland' packages.

Last updated 3 months ago

9 stars 2.43 score 21 dependencies

pangaear - Client for the 'Pangaea' Database

Tools to interact with the 'Pangaea' Database (<https://www.pangaea.de>), including functions for searching for data, fetching 'datasets' by 'dataset' 'ID', and working with the 'Pangaea' 'OAI-PMH' service.

Last updated 2 years ago

pangaeaenvironmental scienceearth sciencearchivepaleontologyecologychemistryatmosphereapi-clientdatapaleobiologyscientificwebservice-client

21 stars 2.37 score 33 dependencies

epubr - Read EPUB File Metadata and Text

Provides functions supporting the reading and parsing of internal e-book content from EPUB files. The 'epubr' package provides functions supporting the reading and parsing of internal e-book content from EPUB files. E-book metadata and text content are parsed separately and joined together in a tidy, nested tibble data frame. E-book formatting is not completely standardized across all literature. It can be challenging to curate parsed e-book content across an arbitrary collection of e-books perfectly and in completely general form, to yield a singular, consistently formatted output. Many EPUB files do not even contain all the same pieces of information in their respective metadata. EPUB file parsing functionality in this package is intended for relatively general application to arbitrary EPUB e-books. However, poorly formatted e-books or e-books with highly uncommon formatting may not work with this package. There may even be cases where an EPUB file has DRM or some other property that makes it impossible to read with 'epubr'. Text is read 'as is' for the most part. The only nominal changes are minor substitutions, for example curly quotes changed to straight quotes. Substantive changes are expected to be performed subsequently by the user as part of their text analysis. Additional text cleaning can be performed at the user's discretion, such as with functions from packages like 'tm' or 'qdap'.

Last updated 8 days ago

epubepub-filesepub-formatpeer-reviewed

24 stars 2.36 score 24 dependencies

occCite - Querying and Managing Large Biodiversity Occurrence Datasets

Facilitates the gathering of biodiversity occurrence data from disparate sources. Metadata is managed throughout the process to facilitate reporting and enhanced ability to repeat analyses.

Last updated 1 months ago

biodiversity-databiodiversity-informaticsbiodiversity-standardscitationsmuseum-collection-specimensmuseum-collectionsmuseum-metadata

21 stars 2.32 score 159 dependencies

Rpolyhedra - Polyhedra Database

A polyhedra database scraped from various sources as R6 objects and 'rgl' visualizing capabilities.

Last updated 2 years ago

geometrypolyhedra-databasergl

12 stars 1.76 score 42 dependencies

srr - 'rOpenSci' Review Roclets

Companion package to 'rOpenSci' statistical software review project.

Last updated 2 months ago

4 stars 1.75 score 41 dependencies 2 dependents

concstats - Market Structure, Concentration and Inequality Measures

Based on individual market shares of all participants in a market or space, the package offers a set of different structural and concentration measures frequently - and not so frequently - used in research and in practice. Measures can be calculated in groups or individually. The calculated measure or the resulting vector in table format should help practitioners make more informed decisions. Methods used in this package are from: 1. Chang, E. J., Guerra, S. M., de Souza Penaloza, R. A. & Tabak, B. M. (2005) "Banking concentration: the Brazilian case". 2. Cobham, A. and A. Summer (2013). "Is It All About the Tails? The Palma Measure of Income Inequality". 3. Garcia Alba Idunate, P. (1994). "Un Indice de dominancia para el analisis de la estructura de los mercados". 4. Ginevicius, R. and S. Cirba (2009). "Additive measurement of market concentration" <doi:10.3846/1611-1699.2009.10.191-198>. 5. Herfindahl, O. C. (1950), "Concentration in the steel industry" (PhD thesis). 6. Hirschmann, A. O. (1945), "National power and structure of foreign trade". 7. Melnik, A., O. Shy, and R. Stenbacka (2008), "Assessing market dominance" <doi:10.1016/j.jebo.2008.03.010>. 8. Palma, J. G. (2006). "Globalizing Inequality: 'Centrifugal' and 'Centripetal' Forces at Work". 9. Shannon, C. E. (1948). "A Mathematical Theory of Communication". 10. Simpson, E. H. (1949). "Measurement of Diversity" <doi:10.1038/163688a0>.

Last updated 5 months ago

business-analyticscompetitionconcentrationdiversityinequalitypackage-development

8 stars 1.70 score 0 dependencies

ReLTER - An Interface for the eLTER Community

ReLTER provides access to DEIMS-SDR (https://deims.org/), and allows interaction with data and software implemented by eLTER Research Infrastructure (RI) thus improving data sharing among European LTER projects. ReLTER uses the R language to access and interact with the DEIMS-SDR archive of information shared by the Long Term Ecological Research (LTER) network. This package grew within eLTER H2020 as a major project that will help advance the development of European Long-Term Ecosystem Research Infrastructures (eLTER RI - https://elter-ri.eu). The ReLTER package functions in particular allow to: - retrieve the information about entities (e.g. sites, datasets, and activities) shared by DEIMS-SDR (see e.g. get_site_info function); - interact with the [ODSEurope](maps.opendatascience.eu) starting with the dataset shared by [DEIMS-SDR](https://deims.org/) (see e.g. [get_site_ODS](https://docs.ropensci.org/ReLTER/reference/get_site_ODS.html) function); - use the eLTER site informations to download and crop geospatial data from other platforms (see e.g. get_site_ODS function); - improve the quality of the dataset (see e.g. get_id_worms). Functions currently implemented are derived from discussions of the needs among the eLTER users community. The ReLTER package will continue to follow the progress of eLTER-RI and evolve, adding new tools and improvements as required.

Last updated 7 months ago

biodiversity-informaticsdata-scienceecologyelterresearch-infrastructure

12 stars 1.64 score 220 dependencies

rperseus - Get Texts from the Perseus Digital Library

The Perseus Digital Library is a collection of classical texts. This package helps you get them. The available works can also be viewed here: <http://cts.perseids.org/>.

Last updated 1 years ago

classicsgreekgreek-biblegreek-new-testamentlatinpeer-reviewedperseusperseus-digital-librarytranslation

16 stars 1.64 score 44 dependencies

popler - Popler R Package

Browse and query the popler database.

Last updated 5 years ago

8 stars 1.24 score 67 dependencies

workloopR - Analysis of Work Loops and Other Data from Muscle Physiology Experiments

Functions for the import, transformation, and analysis of data from muscle physiology experiments. The work loop technique is used to evaluate the mechanical work and power output of muscle. Josephson (1985) <doi:10.1242/jeb.114.1.493> modernized the technique for application in comparative biomechanics. Although our initial motivation was to provide functions to analyze work loop experiment data, as we developed the package we incorporated the ability to analyze data from experiments that are often complementary to work loops. There are currently three supported experiment types: work loops, simple twitches, and tetanus trials. Data can be imported directly from .ddf files or via an object constructor function. Through either method, data can then be cleaned or transformed via methods typically used in studies of muscle physiology. Data can then be analyzed to determine the timing and magnitude of force development and relaxation (for isometric trials) or the magnitude of work, net power, and instantaneous power among other things (for work loops). Although we do not provide plotting functions, all resultant objects are designed to be friendly to visualization via either base-R plotting or 'tidyverse' functions. This package has been peer-reviewed by rOpenSci (v. 1.1.0).

Last updated 10 days ago

ddfmuscle-forcemuscle-physiology-experimentstetanuswork-loopworkloop

3 stars 1.18 score 3 dependencies

dataaimsr - AIMS Data Platform API Client

AIMS Data Platform API Client which provides easy access to AIMS Data Platform scientific data and information.

Last updated 1 years ago

aimsaustraliadatamarinemonitoringsstweather

4 stars 0.91 score 58 dependencies

antanym - Antarctic Geographic Place Names

Antarctic geographic names from the Composite Gazetteer of Antarctica, and functions for working with those place names.

Last updated 2 years ago

antarcticsouthern oceanplace namesgazetteerpeer-reviewed

7 stars 0.91 score 54 dependencies