Addition of cbind2() to append multiple spectra variables to the spectra data #343

Merged
merged 5 commits into from
Dec 9, 2024
Changes from 3 commits
2 changes: 1 addition & 1 deletion DESCRIPTION
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
Package: Spectra
Title: Spectra Infrastructure for Mass Spectrometry Data
Version: 1.17.1
Version: 1.17.2
Description: The Spectra package defines an efficient infrastructure
for storing and handling mass spectrometry spectra and functionality to
subset, process, visualize and compare spectra data. It provides different
2 changes: 2 additions & 0 deletions NAMESPACE
Original file line number Diff line number Diff line change
Expand Up @@ -75,6 +75,7 @@ exportMethods(backendParallelFactor)
exportMethods(backendRequiredSpectraVariables)
exportMethods(bin)
exportMethods(c)
exportMethods(cbind2)
exportMethods(centroided)
exportMethods(collisionEnergy)
exportMethods(combinePeaks)
Expand Down Expand Up @@ -309,4 +310,5 @@ importMethodsFrom(S4Vectors,extractROWS)
importMethodsFrom(S4Vectors,isEmpty)
importMethodsFrom(S4Vectors,lapply)
importMethodsFrom(S4Vectors,split)
importMethodsFrom(methods,cbind2)
importMethodsFrom(methods,show)
6 changes: 6 additions & 0 deletions NEWS.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,11 @@
# Spectra 1.17

## Changes in 1.17.2

- Add `cbind2()` method to easily add multiple `spectraVariables` and their
content to the `spectraData` of a `Spectra` object.
See also [issue #342](https://github.com/rformassspectrometry/Spectra/issues/342)

## Changes in 1.17.1

- Refactor `containsMz()` to support chunk-wise processing.
31 changes: 30 additions & 1 deletion R/MsBackend.R
Original file line number Diff line number Diff line change
Expand Up @@ -182,10 +182,14 @@
#' @param value replacement value for `<-` methods. See individual
#' method description or expected data type.
#'
#' @param values for `filterValues()`: A `numeric` vector that define the
#' @param values For `filterValues()`: A `numeric` vector that define the
#' values to filter the `object`. `values` needs to be of same length than
#' parameter `spectraVariables` and in the same order.
#'
#' @param y For `cbind2()`: A `data.frame`, `DataFrame` or `matrix` with the
#' spectra variables to be added to the backend. Its number of rows must
#' match the number of spectra in the backend.
#'
#' @param x Object extending `MsBackend`.
#'
#' @param ... Additional arguments.
Expand Down Expand Up @@ -313,6 +317,11 @@
#' `dropNaSpectraVariables()` might still show columns containing `NA` values
#' for *core* spectra variables.
#'
#' - `cbind2()`: appends multiple spectra variables to the backend at once.
#' It does so *blindly* (e.g. without checking row names), so its use is at
#' the user's own risk. For a more controlled way of adding spectra
#' variables, `joinSpectraData()` should be used.
#'
#' - `centroided()`, `centroided<-`: gets or sets the centroiding
#' information of the spectra. `centroided()` returns a `logical`
#' vector of length equal to the number of spectra with `TRUE` if a
Expand Down Expand Up @@ -1022,6 +1031,26 @@ setMethod("peaksVariables", "MsBackend", function(object) {
c("mz", "intensity")
})


setClassUnion("dataframeOrDataFrameOrmatrix", c("data.frame", "DataFrame", "matrix"))
#' @exportMethod cbind2
#'
#' @importMethodsFrom methods cbind2
#'
#' @rdname MsBackend
setMethod("cbind2", signature = c("MsBackend", "dataframeOrDataFrameOrmatrix"),
          function(x, y = data.frame(), ...) {
              if (is(y, "matrix"))
                  y <- as.data.frame(y)
              if (nrow(y) != length(x))
                  stop("Length of 'y' does not match the number of spectra in 'x'")
              for (i in colnames(y)) {
                  x[[i]] <- y[, i]
              }
              x
          })
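
The dispatch pattern used by this default method (a class union plus a `cbind2()` method that checks the row count) can be tried in isolation. The following is a minimal sketch that uses only the `methods` package; `ToyBackend` and `dataframeOrMatrix` are hypothetical names invented for illustration and are not part of Spectra, and `DataFrame` is left out of the union to avoid a dependency on S4Vectors.

```r
library(methods)

## Hypothetical toy class standing in for a backend; not part of Spectra.
setClass("ToyBackend", representation(spectraData = "data.frame"))
setMethod("length", "ToyBackend", function(x) nrow(x@spectraData))

## Class union so that a single method covers both input types.
setClassUnion("dataframeOrMatrix", c("data.frame", "matrix"))

setMethod("cbind2", signature("ToyBackend", "dataframeOrMatrix"),
          function(x, y, ...) {
              ## Matrices are converted so that columns can be bound by name.
              if (is.matrix(y))
                  y <- as.data.frame(y)
              ## Only the number of rows is checked, not their order.
              if (nrow(y) != length(x))
                  stop("Length of 'y' does not match the number of spectra in 'x'")
              x@spectraData <- cbind(x@spectraData, y)
              x
          })

be <- new("ToyBackend", spectraData = data.frame(msLevel = c(1L, 2L, 2L)))
res <- cbind2(be, data.frame(cola = 1:3, colb = "b"))
colnames(res@spectraData)

## A row-count mismatch is rejected; the error message is captured here.
err <- tryCatch(cbind2(be, data.frame(cola = 1:2)),
                error = function(e) conditionMessage(e))
```

Because the method only compares `nrow(y)` with `length(x)`, any reordering of rows in `y` goes unnoticed, which is the sense in which the documentation calls the operation *blind*.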


#' @exportMethod centroided
#'
#' @aliases centroided<-,MsBackend-method
17 changes: 17 additions & 0 deletions R/MsBackendDataFrame.R
Original file line number Diff line number Diff line change
Expand Up @@ -567,6 +567,23 @@ setMethod("[", "MsBackendDataFrame", function(x, i, j, ..., drop = FALSE) {
.subset_backend_data_frame(x, i)
})

setClassUnion("dataframeOrDataFrameOrmatrix",
c("data.frame", "DataFrame", "matrix"))
#' @importMethodsFrom methods cbind2
#'
#' @rdname hidden_aliases
setMethod("cbind2", signature = c("MsBackendDataFrame",
                                  "dataframeOrDataFrameOrmatrix"),
          function(x, y = data.frame(), ...) {
              if (is(y, "matrix"))
                  y <- as.data.frame(y)
              if (nrow(y) != length(x))
                  stop("Length of 'y' does not match the number of spectra in 'x'")
              x@spectraData <- cbind(x@spectraData, y)
              validObject(x)
              x
          })

#' @rdname hidden_aliases
setMethod("split", "MsBackendDataFrame", function(x, f, drop = FALSE, ...) {
if (!is.factor(f))
17 changes: 17 additions & 0 deletions R/MsBackendMemory.R
Original file line number Diff line number Diff line change
Expand Up @@ -670,6 +670,23 @@ setMethod("[", "MsBackendMemory", function(x, i, j, ..., drop = FALSE) {
.df_subset(x, i)
})

setClassUnion("dataframeOrDataFrameOrmatrix",
c("data.frame", "DataFrame", "matrix"))
#' @importMethodsFrom methods cbind2
#'
#' @rdname hidden_aliases
setMethod("cbind2", signature = c("MsBackendMemory",
                                  "dataframeOrDataFrameOrmatrix"),
          function(x, y = data.frame(), ...) {
              if (is(y, "matrix"))
                  y <- as.data.frame(y)
              if (nrow(y) != length(x))
                  stop("Length of 'y' does not match the number of spectra in 'x'")
              x@spectraData <- cbind(x@spectraData, y)
              validObject(x)
              x
          })

#' @rdname hidden_aliases
setMethod("split", "MsBackendMemory", function(x, f, drop = FALSE, ...) {
if (!is.factor(f))
30 changes: 29 additions & 1 deletion R/Spectra.R
Original file line number Diff line number Diff line change
Expand Up @@ -1447,6 +1447,7 @@ setReplaceMethod("[[", "Spectra", function(x, i, j, ..., value) {
#' @aliases combineSpectra
#' @aliases split
#' @aliases joinSpectraData
#' @aliases cbind2
#'
#' @description
#'
Expand All @@ -1463,6 +1464,15 @@ setReplaceMethod("[[", "Spectra", function(x, i, j, ..., value) {
#' function and to eventually (if needed) apply the processing queue using
#' the [applyProcessing()] function.
#'
#' - `cbind2()`: Appends multiple spectra variables from a `data.frame`,
#' `DataFrame` or `matrix` to the `Spectra` object at once and returns a
#' `Spectra` object with the appended spectra variables. It does so
#' *blindly* (e.g. it does not check that row names are compatible), so its
#' use is at the user's own risk. `cbind2()` does, however, check that the
#' number of rows of `y` matches the number of spectra in the `Spectra`
#' object. For a more controlled way of adding spectra variables,
#' `joinSpectraData()` should be used.
#'
#' - `combineSpectra()`: combines sets of spectra (defined with parameter `f`)
#' into a single spectrum per set aggregating their MS data (i.e. their
#' *peaks data* matrices with the *m/z* and intensity values of their
Expand Down Expand Up @@ -1507,6 +1517,8 @@ setReplaceMethod("[[", "Spectra", function(x, i, j, ..., value) {
#' should be explored and ideally be removed using, for example,
#' `QFeatures::reduceDataFrame()`, `PSMatch::reducePSMs()` or similar
#' functions.
#' For a more general function that appends a `data.frame`, `DataFrame`
#' or `matrix`, see `cbind2()`.
#'
#' - `split()`: splits the `Spectra` object based on parameter `f` into a `list`
#' of `Spectra` objects.
Expand Down Expand Up @@ -1543,7 +1555,9 @@ setReplaceMethod("[[", "Spectra", function(x, i, j, ..., value) {
#'
#' @param x A `Spectra` object.
#'
#' @param y A `DataFrame` with the spectra variables to join/add.
#' @param y For `joinSpectraData()`: `DataFrame` with the spectra variables
#' to join/add. For `cbind2()`: a `data.frame`, `DataFrame` or
#' `matrix`.
#'
#' @param ... Additional arguments.
#'
Expand Down Expand Up @@ -1660,6 +1674,10 @@ setReplaceMethod("[[", "Spectra", function(x, i, j, ..., value) {
#'
#' spectraVariables(sciex2)
#' spectraData(sciex2)[1:13, c("spectrumId", "var1", "var2")]
#'
#' ## Append new spectra variables with cbind2()
#' df <- data.frame(cola = seq_along(sciex1), colb = "b")
#' data_append <- cbind2(sciex1, df)
#' spectraVariables(data_append)
NULL
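
For readers who want to try the new method without the `sciex` example files, the following is a small sketch on an in-memory `Spectra` object. It assumes that Spectra (version 1.17.2 or later, i.e. with this change) and S4Vectors are installed; the variable names `spd`, `sps`, `ann` and `sps2` and all values are made up for illustration.

```r
library(Spectra)
library(S4Vectors)

## Two small spectra defined in memory (illustrative values only).
spd <- DataFrame(msLevel = c(1L, 2L), rtime = c(1.1, 1.2))
spd$mz <- list(c(100, 103.2), c(45.6, 120.4))
spd$intensity <- list(c(200, 400), c(12, 8))
sps <- Spectra(spd)

## One row of annotations per spectrum, in the same order as 'sps'.
ann <- data.frame(cola = seq_along(sps), colb = "b")
sps2 <- cbind2(sps, ann)

## The new columns are available as spectra variables.
c("cola", "colb") %in% spectraVariables(sps2)
sps2$cola
```

Rows are paired by position, so `ann` has to be in the same order as the spectra in `sps`.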

#' @rdname combineSpectra
Expand All @@ -1669,6 +1687,16 @@ setMethod("c", "Spectra", function(x, ...) {
.concatenate_spectra(unname(list(unname(x), ...)))
})

#' @rdname combineSpectra
#'
#' @export
setMethod("cbind2", signature(x = "Spectra",
                              y = "dataframeOrDataFrameOrmatrix"),
          function(x, y, ...) {
              x@backend <- cbind2(x@backend, y, ...)
              x
          })
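
The difference between *blind* column binding and a key-based join, which the documentation uses to contrast `cbind2()` with `joinSpectraData()`, can be illustrated with plain data frames. This is only an analogy in base R: `cbind()` and `merge()` stand in for the two approaches and are not the Spectra implementations, and the tables `spectra` and `annot` are invented for the example.

```r
## Toy spectra table and annotations listed in a different order.
spectra <- data.frame(spectrumId = c("s1", "s2", "s3"),
                      msLevel = c(1L, 2L, 2L))
annot <- data.frame(spectrumId = c("s3", "s1", "s2"),
                    score = c(0.3, 0.1, 0.2))

## Positional binding: rows are paired by position, so the scores end up
## on the wrong spectra without any warning.
blind <- cbind(spectra, score = annot$score)

## Key-based join: rows are matched on 'spectrumId', then put back into
## the original order of 'spectra'.
joined <- merge(spectra, annot, by = "spectrumId", sort = FALSE)
joined <- joined[match(spectra$spectrumId, joined$spectrumId), ]

blind$score
joined$score
```

Both results have three rows, so a row-count check alone cannot tell them apart; only the key-based version assigns each score to the spectrum it belongs to.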

#' @rdname combineSpectra
setMethod("split", "Spectra", function(x, f, drop = FALSE, ...) {
bcknds <- split(x@backend, f, ...)
19 changes: 19 additions & 0 deletions inst/test_backends/test_MsBackend/test_spectra_subsetting.R
Original file line number Diff line number Diff line change
Expand Up @@ -59,6 +59,25 @@ test_that("[", {
expect_equal(res, be[which(l)])
})

test_that("cbind2 works", {
    seql <- length(be)
    df <- data.frame(cola = seq_len(seql), colb = "b", colz = "z")
    res <- cbind2(be, df)
    expect_true(validObject(res))
    expect_equal(ncol(spectraData(res)), length(spectraVariables(be)) + 3)
    expect_equal(res$cola, seq_len(seql))
    expect_equal(res$colb, rep("b", seql))
    expect_equal(res$colz, rep("z", seql))
    df2 <- data.frame(cola = 3:6, colb = "b", colz = "z")
    expect_error(cbind2(be, df2), "does not match")
    ## with matrix
    m <- matrix(1:seql, ncol = 1, dimnames = list(NULL, "m"))
    res <- cbind2(be, m)
    expect_true(validObject(res))
    expect_equal(ncol(spectraData(res)), length(spectraVariables(be)) + 1)
    expect_equal(res$m, 1:seql)
})

#' extractByIndex. Uses [ if not implemented
test_that("extractByIndex", {
i <- sample(seq_along(be), floor(length(be) / 2))
17 changes: 14 additions & 3 deletions man/MsBackend.Rd
22 changes: 21 additions & 1 deletion man/combineSpectra.Rd
6 changes: 6 additions & 0 deletions man/hidden_aliases.Rd