DataFrameStat
Defined in: data-frame-stat.ts:17
Statistical and approximate-query operations on a DataFrame.
Obtained via DataFrame.stat. Mirrors DataFrameStatFunctions in
the JVM Spark API.
Example
Section titled “Example”const c = await df.stat.corr("height", "weight");const quantiles = await df.stat.approxQuantile("latency", [0.5, 0.95, 0.99], 0.01);Spark source: DataFrameStatFunctions.scala
Methods
Section titled “Methods”approxQuantile()
Section titled “approxQuantile()”approxQuantile( cols, probabilities, relativeError): DataFrame;Defined in: data-frame-stat.ts:67
Compute approximate quantiles for numerical columns.
Parameters
Section titled “Parameters”| Parameter | Type |
|---|---|
cols | string[] |
probabilities | number[] |
relativeError | number |
Returns
Section titled “Returns”corr()
Section titled “corr()”corr( col1, col2, method?): DataFrame;Defined in: data-frame-stat.ts:26
Compute Pearson correlation between two columns.
Parameters
Section titled “Parameters”| Parameter | Type | Default value |
|---|---|---|
col1 | string | undefined |
col2 | string | undefined |
method | string | "pearson" |
Returns
Section titled “Returns”cov(col1, col2): DataFrame;Defined in: data-frame-stat.ts:37
Compute sample covariance between two columns.
Parameters
Section titled “Parameters”| Parameter | Type |
|---|---|
col1 | string |
col2 | string |
Returns
Section titled “Returns”crosstab()
Section titled “crosstab()”crosstab(col1, col2): DataFrame;Defined in: data-frame-stat.ts:47
Compute a pair-wise frequency table (contingency table).
Parameters
Section titled “Parameters”| Parameter | Type |
|---|---|
col1 | string |
col2 | string |
Returns
Section titled “Returns”freqItems()
Section titled “freqItems()”freqItems(cols, support?): DataFrame;Defined in: data-frame-stat.ts:57
Find frequent items in the given columns.
Parameters
Section titled “Parameters”| Parameter | Type |
|---|---|
cols | string[] |
support? | number |