statsmodels.stats.descriptivestats.describe¶

statsmodels.stats.descriptivestats.describe(data, stats=None, *, numeric=True, categorical=True, alpha=0.05, use_t=False, percentiles=(1, 5, 10, 25, 50, 75, 90, 95, 99), ntop=5)[source]¶

Extended descriptive statistics for data

Parameters:¶

data : array_like¶: Data to describe. Must be convertible to a pandas DataFrame.
stats : Sequence[str], optional¶: Statistics to include. If not provided the full set of statistics is computed. This list may evolve across versions to reflect best practices. Supported options are: “nobs”, “missing”, “mean”, “std_err”, “ci”, “ci”, “std”, “iqr”, “iqr_normal”, “mad”, “mad_normal”, “coef_var”, “range”, “max”, “min”, “skew”, “kurtosis”, “jarque_bera”, “mode”, “freq”, “median”, “percentiles”, “distinct”, “top”, and “freq”. See Notes for details.
numeric : bool, default True¶: Whether to include numeric columns in the descriptive statistics.
categorical : bool, default True¶: Whether to include categorical columns in the descriptive statistics.
alpha : float, default 0.05¶: A number between 0 and 1 representing the size used to compute the confidence interval, which has coverage 1 - alpha.
use_t : bool, default False¶: Use the Student’s t distribution to construct confidence intervals.
percentiles : sequence[float]¶: A distinct sequence of floating point values all between 0 and 100. The default percentiles are 1, 5, 10, 25, 50, 75, 90, 95, 99.
ntop : int, default 5¶: The number of top categorical labels to report. Default is

Returns:¶

Descriptive statistics

Return type:¶

DataFrame