## Maximum Likelihood with Non-normal Distributions Gives Non-OLS Estimates

The ordinary least squares (OLS) estimates are the maximum likelihood estimates under the classical, normally distributed model. But just as linearity is never precisely true, normality is never precisely true either. Asymmetry, discreteness, outlier potential, and boundedness make every real data-generating process non-normal to some degree. Can you still use OLS, then? The answer is yes: as with any statistical procedure based on the assumption of normality, you can still use it with non-normal distributions. The procedure will be reasonably good if the distributions that produced the data are reasonably close to normal distributions. But if the distributions are far from normal, other methods may be better.

An interesting alternative to the normal distribution is the Laplace distribution, for which
$$p(y)=\frac{1}{\sqrt{2} \sigma} \exp \left[-\sqrt{2} \frac{|y-\mu|}{\sigma}\right]$$
The mathematical form of the Laplace distribution looks similar to that of the normal distribution, but because the exponent involves absolute deviations from the mean rather than squared deviations, the density decays more slowly in the tails. In other words, the Laplace distribution gives a higher probability to an extreme observation, commonly called an outlier. The excess kurtosis of the Laplace distribution is 3 (that of the normal distribution is 0), which also indicates that the Laplace distribution is more outlier-prone than the normal distribution.
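To put numbers on the outlier claim, the tail probability $P(|Y-\mu|>k\sigma)$ can be computed in closed form for both distributions. The sketch below is an illustration and not part of the original text; the function names `normal_tail` and `laplace_tail` are made up for this example. For the Laplace density above, integrating both tails gives $\exp(-\sqrt{2}\,k)$.

```python
import math


def normal_tail(k):
    """P(|Y - mu| > k * sigma) when Y is normal with standard deviation sigma."""
    return math.erfc(k / math.sqrt(2))


def laplace_tail(k):
    """P(|Y - mu| > k * sigma) for the Laplace density above (standard deviation sigma)."""
    return math.exp(-math.sqrt(2) * k)


# Compare the two-sided tail probabilities at several distances from the mean.
for k in (2, 3, 4, 5):
    print(f"k = {k}: normal {normal_tail(k):.2e}   Laplace {laplace_tail(k):.2e}")
```

At $k=3$ the normal probability is about 0.0027 and the Laplace probability is about 0.0144, roughly five times larger, and the ratio keeps growing as $k$ increases.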

Figure 2.2 compares the normal distribution with $\mu=0, \sigma=1$ to the corresponding Laplace distribution. Notice that the Laplace distribution extends farther into the tails, even though both distributions have the same standard deviation.
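The claim in this section's title can be illustrated in the simplest possible case, a model with a location parameter $\mu$ only and no predictor. This simplification is an assumption made for the example, not the setup used in the text. Under the normal model the likelihood is maximized at the sample mean (the OLS answer); under the Laplace model it is maximized at the sample median. The sketch below finds both maximizers by a crude grid search; the data, the seed, and the helper names are hypothetical.

```python
import math
import random
import statistics


def normal_loglik(mu, ys, sigma=1.0):
    """Log-likelihood of mu under the normal model with known sigma."""
    c = -0.5 * math.log(2 * math.pi * sigma ** 2)
    return sum(c - (y - mu) ** 2 / (2 * sigma ** 2) for y in ys)


def laplace_loglik(mu, ys, sigma=1.0):
    """Log-likelihood of mu under the Laplace density above with known sigma."""
    c = -math.log(math.sqrt(2) * sigma)
    return sum(c - math.sqrt(2) * abs(y - mu) / sigma for y in ys)


# Hypothetical data: 50 standard normal draws plus one gross outlier (51 values).
rng = random.Random(1)
ys = [rng.gauss(0.0, 1.0) for _ in range(50)] + [15.0]

# Grid search over candidate values of mu in [-2, 2] with step 0.001.
grid = [i / 1000 for i in range(-2000, 2001)]
mu_normal = max(grid, key=lambda m: normal_loglik(m, ys))
mu_laplace = max(grid, key=lambda m: laplace_loglik(m, ys))

print("normal ML estimate: ", mu_normal, " sample mean:  ", statistics.mean(ys))
print("Laplace ML estimate:", mu_laplace, " sample median:", statistics.median(ys))
```

Up to the grid resolution, the normal maximizer matches the sample mean, which the outlier at 15 pulls upward, while the Laplace maximizer matches the sample median, which barely responds to the outlier.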

## The Classical Model and Its Consequences

The classical regression model assumes normality, independence, constant variance, and linearity of the conditional mean function, and is (once again) stated as follows:
$$Y_i \mid X_i=x_i \quad \sim_{\text {independent }} \mathrm{N}\left(\beta_0+\beta_1 x_i, \sigma^2\right) \text {, for } i=1,2, \ldots, n .$$
Whether you like it or not, this model is also what your computer assumes when you ask it to analyze your data via standard regression methods. The parameter estimates you get from the computer are best under this model, and the inferences ($p$-values and confidence intervals) are exactly correct under this model. If the assumptions of the model are not true, then the estimates are no longer best, and the inferences are no longer exactly correct. You might think we are saying that assumptions must be true in order to use statistical methods that make such assumptions, but we are not. As we noted in Chapter 1, it is not necessarily a problem if some or all of the model's assumptions are wrong; what matters is how badly each assumption is violated. And the easiest way to understand whether an assumption is violated “too badly” is to use simulation.
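As a minimal sketch of such a simulation, the code below generates data repeatedly from a known line, once with normal errors and once with Laplace errors of the same standard deviation, and records how often the usual 95% confidence interval for $\beta_1$ contains the true slope. Everything specific here is an assumption for illustration: the parameter values, the sample size of 20, the $x$ values, and the function names. The constant 2.100922 is the 0.975 quantile of Student's $t$ with 18 degrees of freedom, hard-coded because the Python standard library has no $t$ quantile function.

```python
import math
import random

N = 20                                # sample size (hypothetical)
T_CRIT = 2.100922                     # 0.975 quantile of t with N - 2 = 18 df
BETA0, BETA1, SIGMA = 1.0, 0.5, 2.0   # hypothetical true parameter values


def normal_error(rng):
    """Error term that satisfies the classical model."""
    return rng.gauss(0.0, SIGMA)


def laplace_error(rng):
    """Laplace error with standard deviation SIGMA (violates normality).

    The difference of two unit exponentials is Laplace with variance 2,
    so it is rescaled by SIGMA / sqrt(2).
    """
    return SIGMA / math.sqrt(2) * (rng.expovariate(1.0) - rng.expovariate(1.0))


def coverage(draw_error, reps=2000, seed=0):
    """Fraction of simulated 95% intervals for the slope that contain BETA1."""
    rng = random.Random(seed)
    xs = [float(i) for i in range(1, N + 1)]
    xbar = sum(xs) / N
    sxx = sum((x - xbar) ** 2 for x in xs)
    hits = 0
    for _ in range(reps):
        ys = [BETA0 + BETA1 * x + draw_error(rng) for x in xs]
        ybar = sum(ys) / N
        b1 = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) / sxx
        b0 = ybar - b1 * xbar
        sse = sum((y - b0 - b1 * x) ** 2 for x, y in zip(xs, ys))
        se_b1 = math.sqrt(sse / (N - 2) / sxx)   # estimated standard error of b1
        if abs(b1 - BETA1) <= T_CRIT * se_b1:
            hits += 1
    return hits / reps


print("coverage with normal errors: ", coverage(normal_error))
print("coverage with Laplace errors:", coverage(laplace_error))
```

With normal errors the coverage should be close to 0.95, apart from simulation noise. How far the Laplace figure departs from 0.95 is one concrete way to judge whether that violation of normality is "too bad" for this sample size.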

We have found that students in statistics classes often resist learning simulation. After all, the data that researchers use is usually real, and not simulated, so the students wonder, what is the point of using simulation? Here are some answers:

• Simulation shows you, clearly and concretely, how to interpret the regression analysis of your real (not simulated) data.
• Simulation helps you to understand how a regression model can be useful even when the model is wrong.
• Simulation models help you to understand the meaning of the regression model parameters.
• Simulation models help you to understand the meaning of the regression model assumptions.
• Simulation models help you to understand the meaning of a “research hypothesis.”
• Simulation helps you to understand how to interpret your data in the presence of chance effects.
• Simulation helps you to understand all the commonly misunderstood concepts in statistics, like “unbiasedness,” “standard error,” “p-value,” and “confidence interval.”
• Simulation methods are commonly used in the analysis of real data; examples include the bootstrap and Markov chain Monte Carlo.
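To make two of these terms concrete, the sketch below simulates many data sets from the classical model with the same $x$ values and looks at the resulting OLS slope estimates. "Unbiasedness" then means that the estimates average out to the true $\beta_1$, and the "standard error" is the standard deviation of the estimates across data sets, which under this model is $\sigma / \sqrt{\sum_i (x_i-\bar{x})^2}$. The parameter values, the $x$ values, and the names are hypothetical choices for this illustration.

```python
import math
import random
import statistics

BETA0, BETA1, SIGMA = 1.0, 0.5, 2.0         # hypothetical true parameter values
xs = [float(i) for i in range(1, 11)]       # fixed x values, reused in every data set
xbar = statistics.fmean(xs)
sxx = sum((x - xbar) ** 2 for x in xs)


def ols_slope(ys):
    """OLS estimate of the slope for responses ys observed at the fixed xs."""
    ybar = statistics.fmean(ys)
    return sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) / sxx


rng = random.Random(42)
slopes = []
for _ in range(5000):
    # One simulated data set: Y_i ~ N(BETA0 + BETA1 * x_i, SIGMA^2), independent.
    ys = [rng.gauss(BETA0 + BETA1 * x, SIGMA) for x in xs]
    slopes.append(ols_slope(ys))

mean_slope = statistics.fmean(slopes)       # should be close to BETA1
sd_slope = statistics.stdev(slopes)         # simulation-based standard error
theory_se = SIGMA / math.sqrt(sxx)          # standard error from the formula

print(f"true slope {BETA1}, average estimate {mean_slope:.4f}")
print(f"sd of estimates {sd_slope:.4f}, theoretical standard error {theory_se:.4f}")
```

No single estimate equals 0.5, but the estimates are centered on it, and their spread agrees with the formula up to simulation noise.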

An alternative to using simulation is to use advanced mathematics, typically involving multidimensional calculus. But this is much, much harder than simulation.


