statsmodels.genmod.generalized_linear_model.GLM.fit#

GLM.fit(start_params=None, maxiter=100, method='IRLS', tol=1e-08, scale=None, cov_type='nonrobust', cov_kwds=None, use_t=None, full_output=True, disp=False, max_start_irls=3, **kwargs)[source]#

Fits a generalized linear model for a given family.

Parameters:

start_paramsarray_like, optional: Initial guess of the solution for the log-likelihood maximization. The default is family-specific and is given by the family.starting_mu(endog). If start_params is given then the initial mean will be calculated as np.dot(exog, start_params).
maxiterint, optional: Default is 100.
methodstr: Default is ‘IRLS’ for iteratively reweighted least squares. Otherwise gradient optimization is used.
tolfloat: Convergence tolerance. Default is 1e-8.
scalestr or float, optional: scale can be ‘X2’, ‘dev’, or a float The default value is None, which uses X2 for Gamma, Gaussian, and Inverse Gaussian. X2 is Pearson’s chi-square divided by df_resid. The default is 1 for the Binomial and Poisson families. dev is the deviance divided by df_resid
cov_typestr: The type of parameter estimate covariance matrix to compute.
cov_kwdsdict-like: Extra arguments for calculating the covariance of the parameter estimates.
use_tbool: If True, the Student t-distribution is used for inference.
full_outputbool, optional: Set to True to have all available output in the Results object’s mle_retvals attribute. The output is dependent on the solver. See LikelihoodModelResults notes section for more information. Not used if method is IRLS.
dispbool, optional: Set to True to print convergence messages. Not used if method is IRLS.
max_start_irlsint: The number of IRLS iterations used to obtain starting values for gradient optimization. Only relevant if method is set to something other than ‘IRLS’.
atolfloat, optional: (available with IRLS fits) The absolute tolerance criterion that must be satisfied. Defaults to tol. Convergence is attained when: \(rtol * prior + atol > abs(current - prior)\)
rtolfloat, optional: (available with IRLS fits) The relative tolerance criterion that must be satisfied. Defaults to 0 which means rtol is not used. Convergence is attained when: \(rtol * prior + atol > abs(current - prior)\)
tol_criterionstr, optional: (available with IRLS fits) Defaults to 'deviance'. Can optionally be 'params'.
wls_methodstr, optional: (available with IRLS fits) options are ‘lstsq’, ‘pinv’ and ‘qr’ specifies which linear algebra function to use for the irls optimization. Default is lstsq which uses the same underlying svd based approach as ‘pinv’, but is faster during iterations. ‘lstsq’ and ‘pinv’ regularize the estimate in singular and near-singular cases by truncating small singular values based on rcond of the respective numpy.linalg function. ‘qr’ is only valid for cases that are not singular nor near-singular.
optim_hessian{‘eim’, ‘oim’}, optional: (available with scipy optimizer fits) When ‘oim’–the default–the observed Hessian is used in fitting. ‘eim’ is the expected Hessian. This may provide more stable fits, but adds assumption that the Hessian is correctly specified.

Notes

If method is ‘IRLS’, then an additional keyword ‘attach_wls’ is available. This is currently for internal use only and might change in future versions. If attach_wls’ is true, then the final WLS instance of the IRLS iteration is attached to the results instance as results_wls attribute.