expected value | The Chemical Statistician

Video Tutorial – Obtaining the Expected Value of the Exponential Distribution Using the Moment Generating Function

March 28, 2018 2 Comments

In this video tutorial on YouTube, I use the exponential distribution’s moment generating function (MGF) to obtain the expected value of this distribution. Visit my YouTube channel to watch more video tutorials!

Filed under Mathematical Statistics, Mathematics, Probability, Statistics, Tutorials, Video Tagged with expectation, expected value, exponential distribution, moment generating function, probability, probability density function, probability distribution, statistics

Eric’s Enlightenment for Friday, May 22, 2015

May 22, 2015 Leave a comment

John Urschel (academically published mathematician and NFL football player) uses logistic regression, expected value and variance to anticipate that the new farther distance for the extra-point conversion will not reduce its use in the NFL.
John Ioannidis is widely known for his 2005 paper “Why most published research findings are false“. In 2014, he wrote another paper on the same topic called “How to Make More Published Research True“.
Yoshitaka Fujii holds the record for the number of retractions of academic publications for a single author: 183 papers, or “roughly 7 percent of all retracted papers between 1980 and 2011”.
The chemistry of why bread stales, and how to slow retrogradation.

Filed under Eric's Enlightenment Tagged with academia, bread, chemistry, expected value, extra point, football, John Ioannidis, john urschel, logistic regression, nfl, publication, retraction, retrogradation, statistics, variance, Yoshitaka Fujii

Mathematics and Mathematical Statistics Lesson of the Day – Convex Functions and Jensen’s Inequality

September 10, 2014 Leave a comment

Consider a real-valued function $f(x)$ that is continuous on the interval $[x_1, x_2]$ , where $x_1$ and $x_2$ are any 2 points in the domain of $f(x)$ . Let

$x_m = 0.5x_1 + 0.5x_2$

be the midpoint of $x_1$ and $x_2$ . Then, if

$f(x_m) \leq 0.5f(x_1) + 0.5f(x_2),$

then $f(x)$ is defined to be midpoint convex.

More generally, let’s consider any point within the interval $[x_1, x_2]$ . We can denote this arbitrary point as

$x_\lambda = \lambda x_1 + (1 - \lambda)x_2,$ where $0 < \lambda < 1$ .

Then, if

$f(x_\lambda) \leq \lambda f(x_1) + (1 - \lambda) f(x_2),$

then $f(x)$ is defined to be convex. If

$f(x_\lambda) < \lambda f(x_1) + (1 - \lambda) f(x_2),$

then $f(x)$ is defined to be strictly convex.

There is a very elegant and powerful relationship about convex functions in mathematics and in mathematical statistics called Jensen’s inequality. It states that, for any random variable $Y$ with a finite expected value and for any convex function $g(y)$ ,

$E[g(Y)] \geq g[E(Y)]$ .

A function $f(x)$ is defined to be concave if $-f(x)$ is convex. Thus, Jensen’s inequality can also be stated for concave functions. For any random variable $Z$ with a finite expected value and for any concave function $h(z)$ ,

$E[h(Z)] \leq h[E(Z)]$ .

In future Statistics Lessons of the Day, I will prove Jensen’s inequality and discuss some of its implications in mathematical statistics.

Filed under Mathematical Statistics, Mathematics, Probability, Statistics, Statistics Lesson of the Day Tagged with concave function, convex function, expectation, expected value, Jensen's inequality, math, mathematical statistics, mathematics, midpoint convex, probability, random variable, statistics

Mathematical and Applied Statistics Lesson of the Day – The Motivation and Intuition Behind Chebyshev’s Inequality

September 5, 2014 Leave a comment

In 2 recent Statistics Lessons of the Day, I

introduced Markov’s inequality.
explained the motivation and intuition behind Markov’s inequality.

Chebyshev’s inequality is just a special version of Markov’s inequality; thus, their motivations and intuitions are similar.

$P[|X - \mu| \geq k \sigma] \leq 1 \div k^2$

Markov’s inequality roughly says that a random variable $X$ is most frequently observed near its expected value, $\mu$ . Remarkably, it quantifies just how often $X$ is far away from $\mu$ . Chebyshev’s inequality goes one step further and quantifies that distance between $X$ and $\mu$ in terms of the number of standard deviations away from $\mu$ . It roughly says that the probability of $X$ being $k$ standard deviations away from $\mu$ is at most $k^{-2}$ . Notice that this upper bound decreases as $k$ increases – confirming our intuition that it is highly improbable for $X$ to be far away from $\mu$ .

As with Markov’s inequality, Chebyshev’s inequality applies to any random variable $X$ , as long as $E(X)$ and $V(X)$ are finite. (Markov’s inequality requires only $E(X)$ to be finite.) This is quite a marvelous result!

Filed under Applied Statistics, Mathematical Statistics, Mathematics, Probability, Statistics, Statistics Lesson of the Day Tagged with applied statistics, Chebyshev's inequality, expectation, expected value, finite expected value, finite variance, Markov's inequality, math, mathematical statistics, probability, standard deviation, statistics, variance

Mathematical Statistics Lesson of the Day – Chebyshev’s Inequality

September 3, 2014 Leave a comment

The variance of a random variable $X$ is just an expected value of a function of $X$ . Specifically,

$V(X) = E[(X - \mu)^2], \ \text{where} \ \mu = E(X)$ .

Let’s substitute $(X - \mu)^2$ into Markov’s inequality and see what happens. For convenience and without loss of generality, I will replace the constant $c$ with another constant, $b^2$ .

$\text{Let} \ b^2 = c, \ b > 0. \ \ \text{Then,}$

$P[(X - \mu)^2 \geq b^2] \leq E[(X - \mu)^2] \div b^2$

$P[ (X - \mu) \leq -b \ \ \text{or} \ \ (X - \mu) \geq b] \leq V(X) \div b^2$

$P[|X - \mu| \geq b] \leq V(X) \div b^2$

Now, let’s substitute $b$ with $k \sigma$ , where $\sigma$ is the standard deviation of $X$ . (I can make this substitution, because $\sigma$ is just another constant.)

$\text{Let} \ k \sigma = b. \ \ \text{Then,}$

$P[|X - \mu| \geq k \sigma] \leq V(X) \div k^2 \sigma^2$

$P[|X - \mu| \geq k \sigma] \leq 1 \div k^2$

This last inequality is known as Chebyshev’s inequality, and it is just a special version of Markov’s inequality. In a later Statistics Lesson of the Day, I will discuss the motivation and intuition behind it. (Hint: Read my earlier lesson on the motivation and intuition behind Markov’s inequality.)

Filed under Mathematical Statistics, Mathematics, Probability, Statistics, Statistics Lesson of the Day Tagged with Chebyshev's inequality, expectation, expected value, Markov's inequality, math, mathematical statistics, mathematics, probability, random variable, statistics, variance

Mathematical and Applied Statistics Lesson of the Day – The Motivation and Intuition Behind Markov’s Inequality

August 27, 2014 4 Comments

Markov’s inequality may seem like a rather arbitrary pair of mathematical expressions that are coincidentally related to each other by an inequality sign:

$P(X \geq c) \leq E(X) \div c,$ where $c > 0$ .

However, there is a practical motivation behind Markov’s inequality, and it can be posed in the form of a simple question: How often is the random variable $X$ “far” away from its “centre” or “central value”?

Intuitively, the “central value” of $X$ is the value that of $X$ that is most commonly (or most frequently) observed. Thus, as $X$ deviates farther and farther from its “central value”, we would expect those distant-from-the-centre values to be less frequently observed.

Recall that the expected value, $E(X)$ , is a measure of the “centre” of $X$ . Thus, we would expect that the probability of $X$ being very far away from $E(X)$ is very low. Indeed, Markov’s inequality rigorously confirms this intuition; here is its rough translation:

As $c$ becomes really far away from $E(X)$ , the event $X \geq c$ becomes less probable.

You can confirm this by substituting several key values of $c$ .

If $c = E(X)$ , then $P[X \geq E(X)] \leq 1$ ; this is the highest upper bound that $P(X \geq c)$ can get. This makes intuitive sense; $X$ is going to be frequently observed near its own expected value.

If $c \rightarrow \infty$ , then $P(X \geq \infty) \leq 0$ . By Kolmogorov’s axioms of probability, any probability must be inclusively between $0$ and $1$ , so $P(X \geq \infty) = 0$ . This makes intuitive sense; there is no possible way that $X$ can be bigger than positive infinity.

Filed under Applied Statistics, Mathematical Statistics, Mathematics, Probability, Statistics, Statistics Lesson of the Day Tagged with continuous random variable, discrete random variable, expected value, finite expected value, Markov's inequality, math, mathematical statistics, probability, random variable, statistics

Mathematical Statistics Lesson of the Day – Markov’s Inequality

August 14, 2014 1 Comment

Markov’s inequality is an elegant and very useful inequality that relates the probability of an event concerning a non-negative random variable, $X$ , with the expected value of $X$ . It states that

$P(X \geq c) \leq E(X) \div c,$

where $c > 0$ .

I find Markov’s inequality to be beautiful for 2 reasons:

It applies to both continuous and discrete random variables.
It applies to any non-negative random variable from any distribution with a finite expected value.

In a later lesson, I will discuss the motivation and intuition behind Markov’s inequality, which has useful implications for understanding a data set.

Filed under Mathematical Statistics, Mathematics, Probability, Statistics, Statistics Lesson of the Day Tagged with continuous random variable, discrete random variable, expected value, finite expected value, Markov's inequality, math, mathematical statistics, mathematics, probability, random variable

Applied Statistics Lesson of the Day – The Coefficient of Variation

August 12, 2014 3 Comments

In my statistics classes, I learned to use the variance or the standard deviation to measure the variability or dispersion of a data set. However, consider the following 2 hypothetical cases:

the standard deviation for the incomes of households in Canada is $2,000
the standard deviation for the incomes of the 5 major banks in Canada is $2,000

Even though this measure of dispersion has the same value for both sets of income data, $2,000 is a significant amount for a household, whereas $2,000 is not a lot of money for one of the “Big Five” banks. Thus, the standard deviation alone does not give a fully accurate sense of the relative variability between the 2 data sets. One way to overcome this limitation is to take the mean of the data sets into account.

A useful statistic for measuring the variability of a data set while scaling by the mean is the sample coefficient of variation:

$\text{Sample Coefficient of Variation (} \bar{c_v} \text{)} \ = \ s \ \div \ \bar{x},$

where $s$ is the sample standard deviation and $\bar{x}$ is the sample mean.

Analogously, the coefficient of variation for a random variable is

$\text{Coefficient of Variation} \ (c_v) \ = \ \sigma \div \ \mu,$

where $\sigma$ is the random variable’s standard deviation and $\mu$ is the random variable’s expected value.

The coefficient of variation is a very useful statistic that I, unfortunately, never learned in my introductory statistics classes. I hope that all new statistics students get to learn this alternative measure of dispersion.

Filed under Applied Statistics, Descriptive Statistics, Statistics, Statistics Lesson of the Day Tagged with applied statistics, coefficient of variation, data, descriptive statistics, expected value, relative variability, sample mean, sample standard deviation, standard deviation, statistics

Exploratory Data Analysis: Combining Histograms and Density Plots to Examine the Distribution of the Ozone Pollution Data from New York in R

July 29, 2013 9 Comments

Introduction

This is a follow-up post to my recent introduction of histograms. Previously, I presented the conceptual foundations of histograms and used a histogram to approximate the distribution of the “Ozone” data from the built-in data set “airquality” in R. Today, I will examine this distribution in more detail by overlaying the histogram with parametric and non-parametric kernel density plots. I will finally answer the question that I have asked (and hinted to answer) several times: Are the “Ozone” data normally distributed, or is another distribution more suitable?

Read the rest of this post to learn how to combine histograms with density curves like this above plot!

This is another post in my continuing series on exploratory data analysis (EDA). Previous posts in this series on EDA include

Exploratory Data Analysis: Conceptual Foundations of Empirical Cumulative Distribution Functions

June 24, 2013 13 Comments

Introduction

Continuing my recent series on exploratory data analysis (EDA), this post focuses on the conceptual foundations of empirical cumulative distribution functions (CDFs); in a separate post, I will show how to plot them in R. (Previous posts in this series include descriptive statistics, box plots, kernel density estimation, and violin plots.)

To give you a sense of what an empirical CDF looks like, here is an example created from 100 randomly generated numbers from the standard normal distribution. The ecdf() function in R was used to generate this plot; the entire code is provided at the end of this post, but read my next post for more detail on how to generate plots of empirical CDFs in R.

Read to rest of this post to learn what an empirical CDF is and how to produce the above plot!

	Eric Cai - The Chemi… on Convert multiple variables bet…
	Jack on Convert multiple variables bet…
	Eric Cai - The Chemi… on Getting the names, types, form…
	Emily V on Getting the names, types, form…
	Lauren McClain on Convert multiple variables bet…
	Eric Cai - The Chemi… on Convert multiple variables bet…
	Lauren McClain on Convert multiple variables bet…
	Eric Cai - The Chemi… on Exploratory Data Analysis: Com…
	CK on Exploratory Data Analysis: Com…
	Eric Cai - The Chemi… on Video Tutorial: Breaking Down…

The Chemical Statistician

Video Tutorial – Obtaining the Expected Value of the Exponential Distribution Using the Moment Generating Function

Eric’s Enlightenment for Friday, May 22, 2015

Mathematics and Mathematical Statistics Lesson of the Day – Convex Functions and Jensen’s Inequality

Mathematical and Applied Statistics Lesson of the Day – The Motivation and Intuition Behind Chebyshev’s Inequality

Mathematical Statistics Lesson of the Day – Chebyshev’s Inequality

Mathematical and Applied Statistics Lesson of the Day – The Motivation and Intuition Behind Markov’s Inequality

Mathematical Statistics Lesson of the Day – Markov’s Inequality

Applied Statistics Lesson of the Day – The Coefficient of Variation

Exploratory Data Analysis: Combining Histograms and Density Plots to Examine the Distribution of the Ozone Pollution Data from New York in R

Introduction

Exploratory Data Analysis: Conceptual Foundations of Empirical Cumulative Distribution Functions

Introduction

Eric’s Twitter Feed (@chemstateric)

Recent Comments

Popular Topics

Recent Posts

About Eric

Blogs and Web Sites That I Like to Read

Archives

Categories