NeEDS4BigData

The R package “NeEDS4BigData” implements subsampling methods for analysing big data.

What is “NeEDS4BigData” an abbreviation for?

New Experimental Design based Subsampling methods for Big Data.

How to engage with “NeEDS4BigData” for the first time?

## Installing the package from GitHub
devtools::install_github("Amalan-ConStat/NeEDS4BigData")

## Installing the package from CRAN
install.packages("NeEDS4BigData")

Subsampling Methods

A- and L-optimality based subsampling for GLMs.
A-optimality based subsampling for Gaussian Linear Models.
Leverage sampling for GLMs.
Local case control sampling for logistic regression.
A-optimality based subsampling under measurement constraints for GLMs.
Model robust subsampling method for GLMs.
Subsampling method for GLMs when the model is potentially misspecified.

These seven methods are described in the following articles, under the topics:

1) Introduction - explains the need for subsampling methods.
2) Model based subsampling
3) Model robust and misspecification
4) Benchmarking Functions

For 2) we assume that the main effects model can describe the data. For 3) we first consider the case where several models can describe the big data, and then the case where the given main effects model is misspecified. Under the conditions of 2) and 3) we explore subsampling for four given big data sets. Further, to assess computation time, we ran simulations for scenarios 2) and 3) that compare our subsampling functions against full data modelling.
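As a rough illustration of the idea behind the methods listed under Subsampling Methods, the sketch below performs leverage sampling for a Gaussian linear model using base R only. It does not call any NeEDS4BigData function; the simulated data and all object names (`X`, `y`, `lev`, `prob`, `idx`) are assumptions made for this example, and the package's own implementation may differ.

```r
set.seed(1)

# Simulated "big" data (illustrative only): intercept plus two covariates
N <- 10000
X <- cbind(1, matrix(rnorm(N * 2), ncol = 2))
beta <- c(1, 2, -1)
y <- as.vector(X %*% beta + rnorm(N))

# Leverage scores: diagonal of the hat matrix X (X'X)^{-1} X',
# obtained from the thin QR decomposition of X
lev <- rowSums(qr.Q(qr(X))^2)

# Subsampling probabilities proportional to the leverage scores
prob <- lev / sum(lev)

# Draw r rows with replacement according to these probabilities
r <- 500
idx <- sample.int(N, size = r, replace = TRUE, prob = prob)

# Weighted least squares on the subsample, with inverse-probability weights
fit_sub <- lm.wfit(x = X[idx, ], y = y[idx], w = 1 / prob[idx])
beta_sub <- fit_sub$coefficients

# Full data fit, for comparison
beta_full <- lm.fit(x = X, y = y)$coefficients

print(rbind(subsample = beta_sub, full = beta_full))
```

The inverse-probability weights compensate for the unequal sampling probabilities, so the subsample estimate targets the same coefficients as the full data fit while using only `r` of the `N` rows.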

