Skip to main content

Null and Alternative Hypotheses

Fine... Constructing a statistical hypothesis is mainly to define what's called the null and the alternative hypotheses. Mostly in academic life, students are given the hypothesis to test. But in research or real experiment, constructing the hypotheses correctly is such vital step toward inference or statistical decision.

Let's focus on hypotheses for population mean, for simplicity goals...

Null hypothesis is mainly our initial information or primary belief about the population. Let's consider the production of 500 ml bottles. The 500 ml is the mean capacity of bottles capacity. Since this is the information we know from previous knowledge, it will be our null hypothesis.

We write:
H0: Mu = 500

Note: null hypothesis always has equality sign!



Test for a hypothesis is usually done when we are worried that some factors affected the population, or population has changed for any possible reasons.
OK, so the null hypothesis is the information we know primarily before the suspected change. So, here come the alternative hypothesis to be defined. The alternative hypothesis is usually the region where we suspect or worry that population has changed into.

For bottles capacity example: the change in 500 ml bottle capacity (for any reason) is a bad issue to encounter. For production: it's bad that our bottles be either less than or greater than 500 ml. Lower capacity means unmatched regulations and higher means extra content added.

This is called two sided hypothesis because either change up/down is undesired. Thus, we define our alternative hypothesis as:


H1: Mu != 500 (!= here means not equal)

The other type of hypothesis is to be one sided, when we interested only in (> or <) in alternative hypothesis. In such cases, we suspect/only worried that some factors changed our population toward one direction (up only, down only).
____

Comments

Popular posts from this blog

The "Sample"

Anytime you aim to perform a study on the entire population, you will surely find that this task will be: Much time and/or efforts consuming as populations are normally huge . Impossible if the population is infinite (such as products). Here comes the role of taking samples. Yes! we just take a sample from the whole population, perform the study on the chosen sample, apply the results back to our population. This is the core of  inferential statistics because what we do is to infer parameters/properties of the population using information from a small sample. Well, this does not mean we will obtain 100% exact accurate estimations or inferences. But to be as close as possible, sample elements should be taken randomly ! At least, being random in sample selection will mostly include the diversity of information/facts within our population.

Conclusions of Hypothesis Testing

A general hypothesis is defined as following (eg a hypothesis on the population mean): H0: Mu = Mu0 H1: Mu !=  Mu0 OK, apart from we have a two or one sided hypothesis, after performing the checking and statistical tests: our conclusion should be one of the following: Rejecting the null hypothesis (H0). Failing to reject the null hypothesis (H0). The following statements for conclusions are not accurate : Accepting the null hypothesis (H0). Accepting the alternative hypothesis (H1). But why? When we fail to reject H0, it does not mean we accept H0 as a fact because we still could not prove it as a fact. But what happened is that we failed to prove it to be false. This goes like following: we have suspected new factors may affected the population mean, then we have taken all possible evidences and checking, but all checking failed to prove our suspects. As well, rejecting H0 does not mean accepting H1 as a fact. What happens in this case is we p

Confidence Level and Confidence Interval

Being confident make one's self more reassured. Briefly, explanations below are for two sided confidence levels/intervals in order to simplify the idea. Saying " two sided " gives initial impression that there is something like two limits, yeah they are: upper and lower limits where the confidence interval lies in between. Example: Let's look at the population of a specific mobile phone model. Suppose we are now interested in the ' weight ' property. We found that weight property follows a normal distribution with mean value of 120 grams and a standard deviation of 1.4 grams. Weight ~ Normal (Mu, Sigma) = Normal (120, 1.4) This understanding means that majority of mobiles tested will weigh very closely to 120 grams. Yes, there should be fluctuations above and below the mean value but surely that still relatively close to mean value. Suppose a question: do you expect weights like: 121, 119.5, 122.1, 118.9? Answer: Yes , I surely expect such