:hero: statistical models, hypothesis tests, and data exploration statsmodels documentation ========================= .. container:: sm-landing-meta **Date:** |today| **Version:** |version| **Install:** ``python -m pip install statsmodels`` **Previous versions:** Documentation of previous statsmodels versions is available at `statsmodels.org `__. **Useful links:** `Binary Installers `__ | `Source Repository `__ | `Issues & Ideas `__ | `Q&A Support `__ | `Mailing List `__ | `DOI `__ .. container:: sm-landing-summary ``statsmodels`` provides classes and functions for estimating statistical models, running hypothesis tests, and exploring data in Python. .. raw:: html

Start here Getting started Install statsmodels, fit a first model, and learn the core workflow. To the getting started guide Learn User Guide Explore statistical models, tools, diagnostics, and workflows by topic. To the user guide Practice Examples Browse applied notebooks and recipes using real datasets and model outputs. To the examples Reference API Reference Find classes, functions, result objects, and module-level documentation. To the API reference

.. container:: sm-index-logo .. image:: images/statsmodels-logo-v2-horizontal.svg :alt: statsmodels :class: sm-index-logo__image sm-index-logo__image--light only-light .. image:: images/statsmodels-logo-v3-horizontal.svg :alt: statsmodels :class: sm-index-logo__image sm-index-logo__image--dark only-dark :ref:`statsmodels ` is a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests, and statistical data exploration. An extensive list of result statistics are available for each estimator. The results are tested against existing statistical packages to ensure that they are correct. The package is released under the open source Modified BSD (3-clause) license. The online documentation is hosted at `statsmodels.org `__. Introduction ============ ``statsmodels`` supports specifying models using R-style formulas and ``pandas`` DataFrames. Here is a simple example using ordinary least squares: .. ipython:: python import numpy as np import statsmodels.api as sm import statsmodels.formula.api as smf # Load data dat = sm.datasets.get_rdataset("Guerry", "HistData").data # Fit regression model (using the natural log of one of the regressors) results = smf.ols('Lottery ~ Literacy + np.log(Pop1831)', data=dat).fit() # Inspect the results print(results.summary()) You can also use ``numpy`` arrays instead of formulas: .. ipython:: python import numpy as np import statsmodels.api as sm # Generate artificial data (2 regressors + constant) nobs = 100 X = np.random.random((nobs, 2)) X = sm.add_constant(X) beta = [1, .1, .5] e = np.random.random(nobs) y = np.dot(X, beta) + e # Fit regression model results = sm.OLS(y, X).fit() # Inspect the results print(results.summary()) Have a look at `dir(results)` to see available results. Attributes are described in `results.__doc__` and results methods have their own docstrings. Citation ======== Please use following citation to cite statsmodels in scientific publications: Seabold, Skipper, and Josef Perktold. "`statsmodels: Econometric and statistical modeling with python. `_" *Proceedings of the 9th Python in Science Conference.* 2010. Bibtex entry:: @inproceedings{seabold2010statsmodels, title={statsmodels: Econometric and statistical modeling with python}, author={Seabold, Skipper and Perktold, Josef}, booktitle={9th Python in Science Conference}, year={2010}, } .. toctree:: :maxdepth: 1 install gettingstarted user-guide examples/index api about dev/index release/index Index ===== :ref:`genindex` :ref:`modindex`