Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
This article provides a list of the functions provided by the RevoScaleR package and lists comparable functions included in the base distribution of R.
Data Input and Output
rx function | Description | Nearest base R function |
---|---|---|
rxGetInfo | Retrieves header information from an .XDF file or summary information from a data frame | str() names() colNames() |
rxGetVarInfo | Retrieves variable information from an .XDF file or data frame | names() str() nrow() min() max() |
RxSasData | Creates a SAS data source object | foreign::read.ssd() |
RxSpssData | Creates an SPSS data source object | foreign::read.ssps() |
rxOpen | Opens a data source for reading | read.table() etc. |
rxReadNext | Reads data from a data source | read.table() , etc. |
Data Manipulation and Chunking
rx function | Description | Nearest base R function |
---|---|---|
rxDataStep | Transforms and subsets data in .XDF files or data frames | transform() with() within() subset() |
rxFactors | Recodes a factor variable, or converts a non-factor variable into a factor | factor() |
rxSort | Performs multi-key sorting of the variables in an .XDF file or data frame | sort() order() |
rxMerge | Merges two .XDF files or two data frames using a variety of merge types | merge() rbind() cbind() |
rxSplit | Splits an .XDF file or a data frame into multiple .XDF files or data frames | split() |
Descriptive Statistics and Cross-Tabulation
rx function | Description | Nearest base R function |
---|---|---|
rxSummary | Generates summary statistics for a data frame, including computations by group | summary() lapply(x, …) |
rxQuantile | Computes approximate quantiles for an .XDF file or data frame without sorting | quantile() |
rxCrossTabs | Creates a cross-tabulation of data based on a formula provided as parameter | xtabs() |
rxCube | Creates a cross-tabulation of data based on formula provided as parameter This function is an alternative to rxCrossTabs and is designed for efficient representation. |
xtabs() |
rxMarginals | Creates a marginal summary for an xtab object | addmargins() colSums() rowSums() |
as.crosstabs | Converts cross tabulation results to an xtab object | xtabs() |
rxChiSquaredTest | Performs a chi-squared test on an xtab object | chisq.test() |
rxFisherTest | Performs Fisher's Exact Test on an xtab object | fisher.test() |
rxKendallCor | Computes Kendall's Tau Rank Correlation Coefficient using an xtab object | cor(…, method="kendall") |
Statistical Modeling
rx function | Description | Nearest base R function |
---|---|---|
rxLinMod | Fits a linear model to data | lm() |
rxCovCor | Calculates the covariance, correlation, or sum of squares (cross-product) matrix for a set of variables | cor() cov() crossprod() |
rxCov | Calculates the covariance matrix for a set of variables | cov() |
rxCor | Calculates the correlation matrix for a set of variables | cov() |
rxLogit | Fits a logistic regression model to data | glm(…, family="binomial") |
rxGlm | Fits a generalized linear model to data | glm() |
rxDTree | Fits a classification or regression tree to data | tree::tree() rpart::rpart() |
rxPredict | Calculates predictions for fitted models | predict() |
rxKmeans | Performs K-means clustering | cluster::kmeans() |
Basic Graphing
rx function | Description | Nearest base R function |
---|---|---|
rxHistogram | Creates a histogram from data | hist() |
rxLinePlot | Creates a line plot from data | plot() lines() |