March 2025 Top 40 New CRAN Packages
R-bloggers 2025-05-01
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
In March, one hundred eighty-two new packages made it to CRAN. Here are my Top 40 picks in sixteen categories: Agriculture, Archaeology, Biology, Climate Modeling, Computational Methods, Data, Ecology, Epidemiology, Genomics, Machine Learning, Medicine, Risk Forecasting, Statistics, Time Series, Utilities, and Visualization.
Agriculture
STCCGEV v1.0.0: Provides functions to model and forecast crop yields using a spatial temporal conditional copula approach. The package incorporates extreme weather covariates and Bayesian Structural Time Series models to analyze crop yield dependencies across multiple regions. Includes tools for fitting, simulating, and visualizing results. See the vignette.
Archaeology
clayringsmiletus v1.0.1: Provides tools for analyzing the clay rings of the ancient city of Miletus, Turkey, including functions for data visualization and statistical analysis of archaeological data. See Steinmann (2020) for background and the vignette for examples.
Biology
MicrobialGrowth v1.0.0: Provides a framework for modeling microbial growth curves. See Zwietering et al. (1990), Rosso et al. (1993), Baranyi and Roberts (1994) and Dantigny (2005) for background and the vignette for examples.
Climate Modeling
CMIP6VisR v1.0.0: Provides tools to manipulate Coupled Model Intercomparison Project, Phase-6 (CMIP6) hydroclimatic data, which are archived in the Federated Research Data Repository (FRDR). See Abdelmoaty et al. (2025) for a description of the data set and the vignette for examples.
rIACI v1.0.0: Provides functions to calculate the Iberian Actuarial Climate Index and its components to support climate change analysis and risk assessment. Calculations include temperature, precipitation, wind power, and sea level data—. See “Zhou et al.” (2023) for further details and the vignette to get started.
Computational Methods
globpso v1.3.0: Implements a general purpose particle swarm optimization and differential evolution algorithm for solving optimization problems with nonlinear, non-differentiable, and multi-modal objective functions. See Kennedy and Eberhart (1995), Sun et al. (2004), and Storn & Price (1997) for the theory and README to get started.
MetabolSSMF v0.1.0: Provides a framework for performing soft clustering using simplex-structured matrix factorization (SSMF), and includes functions for determining the optimal number of prototypes, the optimal algorithmic parameters, the estimation confidence intervals and the diversity of clusters. See Abdolali & Nicolas (2020) for the theory and the vignette for examples.
RcppPlanc v2.0.5: Provides Rcpp
bindings for PLANC
, a highly parallel and extensible Non-negative Matrix/Tensor Factorization library described in Kannan et al. (2018) and Eswar et al. (2021). See the vignette.
Data
autodb v2.3.1: Provides tools to automatically normalize a data frame to third normal form in order to facilitate data cleaning. Includes functions to discover approximate dependencies and plot the resulting “database” via Graphviz. See README for documentation.
maths.genealogy v0.1.2: Provides functions to query, extract, and plot genealogical data from The Mathematics Genealogy Project. Data is gathered from the WebSocket server run by the geneagrapher-core project. See the vignette to get started.
RcensusPkg v0.1.5: Provides a structured way to access US Census Bureau survey and geographic data. Functions access and layer displayable geometries (states, counties, blocks, tracts, roads, landmarks, places, bodies of water) and return a data frame. See README for an example.
uisapi v0.1.0: Implements a wrapper for the UNESCO Institute for Statistics API providing public access to more than 4,000 indicators focusing on education, science and technology, culture, and communication . See README for documentation.
Ecology
ecotrends v1.0: Provides tools to compute a time series of ecological niche models, using species occurrence data and environmental variables, and then map the existence and direction of linear temporal trends in environmental suitability. See Arenas-Castro & Sillero (2021) for background and look here for an example. This package is part of the MontObEO project.
hatchR v0.3.2: Provides access to established phenological models to predict hatch and emergence timing for a wide range of wild fishes using the effective value framework of Sparks et al. (2019). There are seven vignettes, including an Introduction and Parameterize Models.
Epidemiology
facilityepimath v0.1.0: Provides functions to calculate useful quantities for a user-defined differential equation model of infectious disease transmission among individuals in a healthcare facility, including functions to calculate the model equilibrium and the basic facility reproduction number, as described in Toth et al. (2025). See the vignette Equilibrium and R0.
pcpr v1.0.0: Implements the pattern recognition technique Principal Component Pursuit tailored to environmental health data, as described in Gibson et al. (2022). There is a Quickstart guide, a Theory crash course, and a vignette on AEr pollution.
Genomics
CSeQTL v1.0.0: Implements a novel method for bulk and cell type-specific expression quantitative trail loci mapping. See Little et al. (2023) for details and the vignette for examples.
ROKET v1.0.0: Provides functions to perform optimal transport on somatic point mutations and kernel regression hypothesis testing by integrating pathway level similarities at the gene level. See Little et al. (2023) for background and the vignette for examples.
SerolyzeR v1.1.0: Designed for inexperienced R users, the package facilitates the process of loading raw data from MBA (Multiplex Bead Assay) examinations, performs quality control checks, and automatically normalizes the data, preparing it for more advanced, downstream tasks. See the vignette.
Machine Learning
SLmetrics v0.1.0: Provides a collection of metrics for evaluating the performance of machine learning models, including classification, regression, and clustering metrics. There are three vignettes: ROC-PRC, Classification Metrics, and Regression Metrics.
Medicine
clinify v 0.1.2: Builds on flextable
and officer
to simplify the formatting of clinical tables for regulatory use. See the vignette.
MariNET v1.0.0: Provides tools for analyzing longitudinal clinical data from Electronic Health Records (EHRs) using linear mixed models (LMM) and visualizing the results as networks. It includes functions for fitting LMM, normalizing adjacency matrices, and comparing networks. See the vignette.
Risk Forecasting
EQRN v0.1.0: Provides a framework for forecasting and extrapolating measures of conditional risk (e.g., of extreme or unprecedented events), including quantiles and exceedance probabilities, using extreme value statistics and flexible neural network architectures. See Pasche and Engelke (2024) and Pasche and Engelke (2022) for the theory and the README for an example.
Statistics
deltatest v0.1.0: Implements the Delta method as proposed by Deng et al. (2018) for hypothesis testing. This method replaces the standard variance estimation formula in the Z-test with an approximate formula which can account for within-user correlation. See README for examples.
gmwmx2 v0.0.2: Implements the Generalized Method of Wavelet Moments with Exogenous Inputs estimator (GMWMX) presented in Voirol et al. (2024) for estimating functional and stochastic parameters of linear models with correlated residuals in the presence of missing data. There are four vignettes including Estimate a model and Plot a large network of GNSS data.
stors v1.0.1: Implements a rejection sampling approach for sampling from general univariate probability density functions. Many standard densities are implemented in C
for high performance. There are three vignettes, including an Introduction and Sampling from User-Defined Distributions.
tinyVAST v1.0.1: Fits a wide variety of multivariate spatio-temporal models with simultaneous and lagged interactions among variables, including vector autoregressive spatio-temporal dynamics for areal, continuous, or network spatial domains. See Thorson et al. (2024) for more details. There are five vignettes, including Model Description and Spatial Modeling.
vecmatch v1.0.0: Implements the Vector Matching algorithm to match multiple treatment groups based on previously estimated generalized propensity scores. See Lopez and Gutman (2017) for details and the vignette for examples.
Time Series
actfts v0.3.0: Provides tools for performing autocorrelation analysis of time series data, including functions to compute and visualize the autocorrelation function (ACF) and the partial autocorrelation function (PACF), Dickey-Fuller, KPSS, and Phillips-Perron unit root tests. See the vignette.
cpam v0.1.3:Provides a framework for time series omics analysis, integrating changepoint detection, smooth and shape-constrained trends, and uncertainty quantification and includes an interactive shiny
interface. The methods are described in Yates et al. (2024). See the vignette for examples.
jumps v1.0: Provides a set of functions to compute the Hodrick-Prescott (HP) filter with automatically selected jumps. The original HP filter extracts a smooth trend from a time series, and our version allows for a small number of automatically identified jumps. See Maranzano and Pelagatti (2024) for details. There is a package Introduction and a vignette on Formulae.
tsaux v1.0.0: Provides a suite of auxiliary functions that enhance time series estimation and forecasting, including a robust anomaly detection routine based on Chen and Liu (1993), utilities for managing calendar and time conversions, performance metrics to assess both point forecasts and distributional predictions, and advanced simulation by allowing the generation of time series components. See the vignette.
Utilities
DataSimilarity v0.1.1: A collection of methods for quantifying the similarity of two or more datasets, many of which can be used for two- or k-sample testing. See the vignette.
sixtyfour v0.2.0: Implements an opinionated interface to Amazon Web Services, with functions for interacting with IAM (Identity and Access Management), S3 (Simple Storage Service), RDS (Relational Data Service), Redshift, and Billing. There are eight vignettes including a Getting Started guide and S3.
tsbtibble v0.0.1: Provides functions to simplify reporting many tables by creating tibbles of tables, create a tibble of tables with captions and automatically print. See the vignette for an example.
xlr v1.0.3: Implements a high-level interface for creating and exporting summary tables to Excel
. It provides tools for generating one-way to n-way tables, and summarizing multiple response questions and question blocks. Tables are exported with native Excel
formatting, including titles, footnotes, and basic styling options. See the vignette.
Visualization
ggvfields v1.0.0: Extends ggplot2
, enabling visualizing vector fields in two-dimensional space by creating vector and stream field layers, visualizing gradients and potential fields, and smoothing vector and scalar data to estimate underlying patterns. Look here for examples.
kitesquare v0.0.2: Implements kite-square plots for contingency tables using ggplot2
to display marginal, conditional, expected, observed, chi-squared values in a single frame. The plot resembles a flying kite inside a square if the variables are independent and deviates from this the more dependence exists. See the vignette and the package GitHub page for documentation and examples.
mooplot v0.1.0: Provides functions to visualize multi-dimensional data arising in multi-objective optimization, including plots of the empirical attainment function (EAF) (López-Ibáñez et al. (2010)) and symmetric Vorob’ev expectation and deviation (Binois et al. (2015).
quadkeyr v0.1.0: Provides a toolkit for generating raster images from Quadkey-Identified data within Microsoft’s Bing Maps Tile System in order to integrate Quadkey-Identified data into R
workflows. Functions facilitate the creation of QuadKey grids and raster images and process Meta Mobility data. See D’Andrea et al. (2024) for details and look here for package documentation.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.