March 2025 Top 40 New CRAN Packages

R-bloggers 2025-05-01

[This article was first published on R Works, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

In March, one hundred eighty-two new packages made it to CRAN. Here are my Top 40 picks in sixteen categories: Agriculture, Archaeology, Biology, Climate Modeling, Computational Methods, Data, Ecology, Epidemiology, Genomics, Machine Learning, Medicine, Risk Forecasting, Statistics, Time Series, Utilities, and Visualization.

Agriculture

STCCGEV v1.0.0: Provides functions to model and forecast crop yields using a spatial temporal conditional copula approach. The package incorporates extreme weather covariates and Bayesian Structural Time Series models to analyze crop yield dependencies across multiple regions. Includes tools for fitting, simulating, and visualizing results. See the vignette.

Copula Forecast for Wellington

Archaeology

clayringsmiletus v1.0.1: Provides tools for analyzing the clay rings of the ancient city of Miletus, Turkey, including functions for data visualization and statistical analysis of archaeological data. See Steinmann (2020) for background and the vignette for examples.

Distributions for Milesian City ring variables

Biology

MicrobialGrowth v1.0.0: Provides a framework for modeling microbial growth curves. See Zwietering et al. (1990), Rosso et al. (1993), Baranyi and Roberts (1994) and Dantigny (2005) for background and the vignette for examples.

Sample growth curve

Climate Modeling

CMIP6VisR v1.0.0: Provides tools to manipulate Coupled Model Intercomparison Project, Phase-6 (CMIP6) hydroclimatic data, which are archived in the Federated Research Data Repository (FRDR). See Abdelmoaty et al. (2025) for a description of the data set and the vignette for examples.

rIACI v1.0.0: Provides functions to calculate the Iberian Actuarial Climate Index and its components to support climate change analysis and risk assessment. Calculations include temperature, precipitation, wind power, and sea level data—. See “Zhou et al.” (2023) for further details and the vignette to get started.

Computational Methods

globpso v1.3.0: Implements a general purpose particle swarm optimization and differential evolution algorithm for solving optimization problems with nonlinear, non-differentiable, and multi-modal objective functions. See Kennedy and Eberhart (1995), Sun et al. (2004), and Storn & Price (1997) for the theory and README to get started.

MetabolSSMF v0.1.0: Provides a framework for performing soft clustering using simplex-structured matrix factorization (SSMF), and includes functions for determining the optimal number of prototypes, the optimal algorithmic parameters, the estimation confidence intervals and the diversity of clusters. See Abdolali & Nicolas (2020) for the theory and the vignette for examples.

Visulation of clusters of metabolites

RcppPlanc v2.0.5: Provides Rcpp bindings for PLANC, a highly parallel and extensible Non-negative Matrix/Tensor Factorization library described in Kannan et al. (2018) and Eswar et al. (2021). See the vignette.

Data

autodb v2.3.1: Provides tools to automatically normalize a data frame to third normal form in order to facilitate data cleaning. Includes functions to discover approximate dependencies and plot the resulting “database” via Graphviz. See README for documentation.

Table schema

maths.genealogy v0.1.2: Provides functions to query, extract, and plot genealogical data from The Mathematics Genealogy Project. Data is gathered from the WebSocket server run by the geneagrapher-core project. See the vignette to get started.

Genealogy from Fejėr to Georgiou and Aslett

RcensusPkg v0.1.5: Provides a structured way to access US Census Bureau survey and geographic data. Functions access and layer displayable geometries (states, counties, blocks, tracts, roads, landmarks, places, bodies of water) and return a data frame. See README for an example.

uisapi v0.1.0: Implements a wrapper for the UNESCO Institute for Statistics API providing public access to more than 4,000 indicators focusing on education, science and technology, culture, and communication . See README for documentation.

Ecology

ecotrends v1.0: Provides tools to compute a time series of ecological niche models, using species occurrence data and environmental variables, and then map the existence and direction of linear temporal trends in environmental suitability. See Arenas-Castro & Sillero (2021) for background and look here for an example. This package is part of the MontObEO project.

Package workflow

hatchR v0.3.2: Provides access to established phenological models to predict hatch and emergence timing for a wide range of wild fishes using the effective value framework of Sparks et al. (2019). There are seven vignettes, including an Introduction and Parameterize Models.

Plot annotated with emergence and hatch times

Epidemiology

facilityepimath v0.1.0: Provides functions to calculate useful quantities for a user-defined differential equation model of infectious disease transmission among individuals in a healthcare facility, including functions to calculate the model equilibrium and the basic facility reproduction number, as described in Toth et al. (2025). See the vignette Equilibrium and R0.

pcpr v1.0.0: Implements the pattern recognition technique Principal Component Pursuit tailored to environmental health data, as described in Gibson et al. (2022). There is a Quickstart guide, a Theory crash course, and a vignette on AEr pollution.

𝐷 = sum of ground-truth matrices:

Genomics

CSeQTL v1.0.0: Implements a novel method for bulk and cell type-specific expression quantitative trail loci mapping. See Little et al. (2023) for details and the vignette for examples.

Simulated SNP data

ROKET v1.0.0: Provides functions to perform optimal transport on somatic point mutations and kernel regression hypothesis testing by integrating pathway level similarities at the gene level. See Little et al. (2023) for background and the vignette for examples.

Plot showing mutated genes by sample

SerolyzeR v1.1.0: Designed for inexperienced R users, the package facilitates the process of loading raw data from MBA (Multiplex Bead Assay) examinations, performs quality control checks, and automatically normalizes the data, preparing it for more advanced, downstream tasks. See the vignette.

Violin plot of test sample coverage

Machine Learning

SLmetrics v0.1.0: Provides a collection of metrics for evaluating the performance of machine learning models, including classification, regression, and clustering metrics. There are three vignettes: ROC-PRC, Classification Metrics, and Regression Metrics.

ROC curve and PR curve for a classification model

Medicine

clinify v 0.1.2: Builds on flextable and officer to simplify the formatting of clinical tables for regulatory use. See the vignette.

MariNET v1.0.0: Provides tools for analyzing longitudinal clinical data from Electronic Health Records (EHRs) using linear mixed models (LMM) and visualizing the results as networks. It includes functions for fitting LMM, normalizing adjacency matrices, and comparing networks. See the vignette.

Network of symptoms

Risk Forecasting

EQRN v0.1.0: Provides a framework for forecasting and extrapolating measures of conditional risk (e.g., of extreme or unprecedented events), including quantiles and exceedance probabilities, using extreme value statistics and flexible neural network architectures. See Pasche and Engelke (2024) and Pasche and Engelke (2022) for the theory and the README for an example.

Statistics

deltatest v0.1.0: Implements the Delta method as proposed by Deng et al. (2018) for hypothesis testing. This method replaces the standard variance estimation formula in the Z-test with an approximate formula which can account for within-user correlation. See README for examples.

P-values from delta method

gmwmx2 v0.0.2: Implements the Generalized Method of Wavelet Moments with Exogenous Inputs estimator (GMWMX) presented in Voirol et al. (2024) for estimating functional and stochastic parameters of linear models with correlated residuals in the presence of missing data. There are four vignettes including Estimate a model and Plot a large network of GNSS data.

Plot of North/East velocities over US.

stors v1.0.1: Implements a rejection sampling approach for sampling from general univariate probability density functions. Many standard densities are implemented in C for high performance. There are three vignettes, including an Introduction and Sampling from User-Defined Distributions.

tinyVAST v1.0.1: Fits a wide variety of multivariate spatio-temporal models with simultaneous and lagged interactions among variables, including vector autoregressive spatio-temporal dynamics for areal, continuous, or network spatial domains. See Thorson et al. (2024) for more details. There are five vignettes, including Model Description and Spatial Modeling.

Response surface

vecmatch v1.0.0: Implements the Vector Matching algorithm to match multiple treatment groups based on previously estimated generalized propensity scores. See Lopez and Gutman (2017) for details and the vignette for examples.

Conditional distribution of age across groups

Time Series

actfts v0.3.0: Provides tools for performing autocorrelation analysis of time series data, including functions to compute and visualize the autocorrelation function (ACF) and the partial autocorrelation function (PACF), Dickey-Fuller, KPSS, and Phillips-Perron unit root tests. See the vignette.

Autocorrelation plot

cpam v0.1.3:Provides a framework for time series omics analysis, integrating changepoint detection, smooth and shape-constrained trends, and uncertainty quantification and includes an interactive shiny interface. The methods are described in Yates et al. (2024). See the vignette for examples.

Changepoint at first time point: the gene responds immediately

jumps v1.0: Provides a set of functions to compute the Hodrick-Prescott (HP) filter with automatically selected jumps. The original HP filter extracts a smooth trend from a time series, and our version allows for a small number of automatically identified jumps. See Maranzano and Pelagatti (2024) for details. There is a package Introduction and a vignette on Formulae.

Time serieswith jump

tsaux v1.0.0: Provides a suite of auxiliary functions that enhance time series estimation and forecasting, including a robust anomaly detection routine based on Chen and Liu (1993), utilities for managing calendar and time conversions, performance metrics to assess both point forecasts and distributional predictions, and advanced simulation by allowing the generation of time series components. See the vignette.

Plot decomposing a time series

Utilities

DataSimilarity v0.1.1: A collection of methods for quantifying the similarity of two or more datasets, many of which can be used for two- or k-sample testing. See the vignette.

sixtyfour v0.2.0: Implements an opinionated interface to Amazon Web Services, with functions for interacting with IAM (Identity and Access Management), S3 (Simple Storage Service), RDS (Relational Data Service), Redshift, and Billing. There are eight vignettes including a Getting Started guide and S3.

tsbtibble v0.0.1: Provides functions to simplify reporting many tables by creating tibbles of tables, create a tibble of tables with captions and automatically print. See the vignette for an example.

xlr v1.0.3: Implements a high-level interface for creating and exporting summary tables to Excel. It provides tools for generating one-way to n-way tables, and summarizing multiple response questions and question blocks. Tables are exported with native Excel formatting, including titles, footnotes, and basic styling options. See the vignette.

Visualization

ggvfields v1.0.0: Extends ggplot2, enabling visualizing vector fields in two-dimensional space by creating vector and stream field layers, visualizing gradients and potential fields, and smoothing vector and scalar data to estimate underlying patterns. Look here for examples.

Plot of 2D vector field with streamlines and arrows

kitesquare v0.0.2: Implements kite-square plots for contingency tables using ggplot2 to display marginal, conditional, expected, observed, chi-squared values in a single frame. The plot resembles a flying kite inside a square if the variables are independent and deviates from this the more dependence exists. See the vignette and the package GitHub page for documentation and examples.

kitesquare plot

mooplot v0.1.0: Provides functions to visualize multi-dimensional data arising in multi-objective optimization, including plots of the empirical attainment function (EAF) (López-Ibáñez et al. (2010)) and symmetric Vorob’ev expectation and deviation (Binois et al. (2015).

Empirical attainment function (EAF) plot

quadkeyr v0.1.0: Provides a toolkit for generating raster images from Quadkey-Identified data within Microsoft’s Bing Maps Tile System in order to integrate Quadkey-Identified data into R workflows. Functions facilitate the creation of QuadKey grids and raster images and process Meta Mobility data. See D’Andrea et al. (2024) for details and look here for package documentation.

Hierarchical coding of spatial position

To leave a comment for the author, please follow the link and comment on their blog: R Works.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Continue reading: March 2025 Top 40 New CRAN Packages