Pre 0.5.0 Release History¶

0.5.0¶

Main Changes and Additions * Add patsy dependency

Compatibility and Deprecation

cleanup of import paths (lowess)

Bug Fixes

input shapes of tools.isestimable

Enhancements and Additions

formula integration based on patsy (new dependency)
Time series analysis - ARIMA modeling - enhanced forecasting based on pandas datetime handling
expanded margins for discrete models
OLS outlier test
empirical likelihood - Google Summer of Code 2012 project - inference for descriptive statistics - inference for regression models - accelerated failure time models
expanded probability plots
improved graphics - plotcorr - acf and pacf
new datasets
new and improved tools - numdiff numerical differentiation

0.4.3¶

The only change compared to 0.4.2 is for compatibility with python 3.2.3 (changed behavior of 2to3)

0.4.2¶

This is a bug-fix release, that affects mainly Big-Endian machines.

Bug Fixes

discrete_model.MNLogit fix summary method
tsa.filters.hp_filter don’t use umfpack on Big-Endian machine (scipy bug)
the remaining fixes are in the test suite, either precision problems on some machines or incorrect testing on Big-Endian machines.

0.4.1¶

This is a backwards compatible (according to our test suite) release with bug fixes and code cleanup.

Bug Fixes

build and distribution fixes
lowess correct distance calculation
genmod correction CDFlink derivative
adfuller _autolag correct calculation of optimal lag
het_arch, het_lm : fix autolag and store options
GLSAR: incorrect whitening for lag>1

Other Changes

add lowess and other functions to api and documentation
rename lowess module (old import path will be removed at next release)
new robust sandwich covariance estimators, moved out of sandbox
compatibility with pandas 0.8
new plots in statsmodels.graphics - ABLine plot - interaction plot

0.4.0¶

Main Changes and Additions

Added pandas dependency.
Cython source is built automatically if cython and compiler are present
Support use of dates in timeseries models
Improved plots - Violin plots - Bean Plots - QQ Plots
Added lowess function
Support for pandas Series and DataFrame objects. Results instances return pandas objects if the models are fit using pandas objects.
Full Python 3 compatibility
Fix bugs in genfromdta. Convert Stata .dta format to structured array preserving all types. Conversion is much faster now.
Improved documentation
Models and results are pickleable via save/load, optionally saving the model data.
Kernel Density Estimation now uses Cython and is considerably faster.
Diagnostics for outlier and influence statistics in OLS
Added El Nino Sea Surface Temperatures dataset
Numerous bug fixes
Internal code refactoring
Improved documentation including examples as part of HTML

Changes that break backwards compatibility

Deprecated scikits namespace. The recommended import is now:
```
import statsmodels.api as sm
```
model.predict methods signature is now (params, exog, ...) where before it assumed that the model had been fit and omitted the params argument.
For consistency with other multi-equation models, the parameters of MNLogit are now transposed.
tools.tools.ECDF -> distributions.ECDF
tools.tools.monotone_fn_inverter -> distributions.monotone_fn_inverter
tools.tools.StepFunction -> distributions.StepFunction

0.3.1¶

Removed academic-only WFS dataset.
Fix easy_install issue on Windows.

0.3.0¶

Changes that break backwards compatibility

Added api.py for importing. So the new convention for importing is:

import statsmodels.api as sm

Importing from modules directly now avoids unnecessary imports and increases the import speed if a library or user only needs specific functions.

sandbox/output.py -> iolib/table.py
lib/io.py -> iolib/foreign.py (Now contains Stata .dta format reader)
family -> families
families.links.inverse -> families.links.inverse_power
Datasets’ Load class is now load function.
regression.py -> regression/linear_model.py
discretemod.py -> discrete/discrete_model.py
rlm.py -> robust/robust_linear_model.py
glm.py -> genmod/generalized_linear_model.py
model.py -> base/model.py
t() method -> tvalues attribute (t() still exists but raises a warning)

Main changes and additions

Numerous bugfixes.
Time Series Analysis model (tsa)
- Vector Autoregression Models VAR (tsa.VAR)
- Autogressive Models AR (tsa.AR)
- Autoregressive Moving Average Models ARMA (tsa.ARMA) optionally uses Cython for Kalman Filtering use setup.py install with option –with-cython
- Baxter-King band-pass filter (tsa.filters.bkfilter)
- Hodrick-Prescott filter (tsa.filters.hpfilter)
- Christiano-Fitzgerald filter (tsa.filters.cffilter)
Improved maximum likelihood framework uses all available scipy.optimize solvers
Refactor of the datasets sub-package.
Added more datasets for examples.
Removed RPy dependency for running the test suite.
Refactored the test suite.
Refactored codebase/directory structure.
Support for offset and exposure in GLM.
Removed data_weights argument to GLM.fit for Binomial models.
New statistical tests, especially diagnostic and specification tests
Multiple test correction
General Method of Moment framework in sandbox
Improved documentation
and other additions

0.2.0¶

Main changes

renames for more consistency RLM.fitted_values -> RLM.fittedvalues GLMResults.resid_dev -> GLMResults.resid_deviance

GLMResults, RegressionResults: lazy calculations, convert attributes to properties with _cache

fix tests to run without rpy

expanded examples in examples directory

add PyDTA to lib.io – functions for reading Stata .dta binary files and converting them to numpy arrays

made tools.categorical much more robust

add_constant now takes a prepend argument

fix GLS to work with only a one column design

New

add four new datasets

A dataset from the American National Election Studies (1996)

Grunfeld (1950) investment data

Spector and Mazzeo (1980) program effectiveness data

A US macroeconomic dataset

add four new Maximum Likelihood Estimators for models with a discrete dependent variables with examples

Logit

Probit

MNLogit (multinomial logit)

Poisson

Sandbox

add qqplot in sandbox.graphics

add sandbox.tsa (time series analysis) and sandbox.regression (anova)

add principal component analysis in sandbox.tools

add Seemingly Unrelated Regression (SUR) and Two-Stage Least Squares for systems of equations in sandbox.sysreg.Sem2SLS

add restricted least squares (RLS)

0.1.0b1¶

initial release

Pre 0.5.0 Release History¶

0.5.0¶

0.4.3¶

0.4.2¶

0.4.1¶

0.4.0¶

0.3.1¶

0.3.0¶

0.2.0¶

0.1.0b1¶

Table Of Contents

This Page