 # Python statistics variance()

## Python statistics | variance()

Statistics module provides very powerful tools, which can be used to compute anything related to Statistics. variance() is one such function. This function helps to calculate the variance from a sample of data (sample is a subset of populated data).
variance() function should only be used when variance of a sample needs to be calculated. There’s another function known as pvariance(), which is used to calculate the variance of an entire population.
In pure statistics, variance is the squared deviation of a variable from its mean. Basically, it measures the spread of random data in a set from its mean or median value. A low value for variance indicates that the data are clustered together and are not spread apart widely, whereas a high value would indicate that the data in the given set are much more spread apart from the average value.
Variance is an important tool in the sciences, where statistical analysis of data is common. It is the square of standard deviation of the given data-set and is also known as second central moment of a distribution. It is usually represented by in pure Statistics.
Variance is calculated by the following formula :

It’s calculated by mean of square minus square of mean Syntax : variance( [data], xbar )
Parameters :
[data] : An iterable with real valued numbers.
xbar (Optional) : Takes actual mean of data-set as value.
Returnype : Returns the actual variance of the values passed as parameter.
Exceptions :
StatisticsError is raised for data-set less than 2-values passed as parameter.
Throws impossible values when the value provided as xbar doesn’t match actual mean of the data-set.

Code #1 :

 # Python code to demonstrate the working of # variance() function of Statistics Module # Importing Statistics module import statistics # Creating a sample of data sample = [2.74, 1.23, 2.63, 2.22, 3, 1.98] # Prints variance of the sample set # Function will automatically calculate # it's mean and set it as xbar print("Variance of sample set is % s"       %(statistics.variance(sample)))

Output :

Variance of sample set is 0.40924

Code #2 : Demonstrates variance() on a range of data-types

 # Python code to demonstrate variance() # function on varying range of data-types # importing statistics module from statistics import variance # importing fractions as parameter values from fractions import Fraction as fr # tuple of a set of positive integers # numbers are spread apart but not very much sample1 = (1, 2, 5, 4, 8, 9, 12) # tuple of a set of negative integers sample2 = (-2, -4, -3, -1, -5, -6) # tuple of a set of positive and negative numbers # data-points are spread apart considerably sample3 = (-9, -1, -0, 2, 1, 3, 4, 19) # tuple of a set of fractional numbers sample4 = (fr(1, 2), fr(2, 3), fr(3, 4),                      fr(5, 6), fr(7, 8)) # tuple of a set of floating point values sample5 = (1.23, 1.45, 2.1, 2.2, 1.9) # Print the variance of each samples print("Variance of Sample1 is % s " %(variance(sample1))) print("Variance of Sample2 is % s " %(variance(sample2))) print("Variance of Sample3 is % s " %(variance(sample3))) print("Variance of Sample4 is % s " %(variance(sample4))) print("Variance of Sample5 is % s " %(variance(sample5)))

Output :

Variance of Sample 1 is 15.80952380952381
Variance of Sample 2 is 3.5
Variance of Sample 3 is 61.125
Variance of Sample 4 is 1/45
Variance of Sample 5 is 0.17613000000000006

Code #3 : Demonstrates the use of xbar parameter

 # Python code to demonstrate # the use of xbar parameter # Importing statistics module import statistics # creating a sample list sample = (1, 1.3, 1.2, 1.9, 2.5, 2.2) # calculating the mean of sample set m = statistics.mean(sample) # calculating the variance of sample set print("Variance of Sample set is % s"     %(statistics.variance(sample, xbar = m)))

Output :

Variance of Sample set is 0.3656666666666667

Code #4 : Demonstrates the Error when value of xbar is not same as the mean/average value

 # Python code to demonstrate the error caused # when garbage value of xbar is entered # Importing statistics module import statistics # creating a sample list sample = (1, 1.3, 1.2, 1.9, 2.5, 2.2) # calculating the mean of sample set m = statistics.mean(sample) # Actual value of mean after calculation # comes out to 1.6833333333333333 # But to demonstrate xbar error let's enter # -100 as the value for xbar parameter print(statistics.variance(sample, xbar = -100))

Output :

0.3656666666663053

Note : It is different in precision from the output in Code #3

Code #4 : Demonstrates StatisticsError

 # Python code to demonstrate StatisticsError # importing Statistics module import statistics # creating an empty data-srt sample = [] # will raise Statistics Error print(statistics.variance(sample))

Output :

Traceback (most recent call last):
File "/home/64bf6d80f158b65d2b75c894d03a7779.py", line 10, in
print(statistics.variance(sample))
File "/usr/lib/python3.5/statistics.py", line 555, in variance
raise StatisticsError('variance requires at least two data points')
statistics.StatisticsError: variance requires at least two data points

Applications :
Variance is a very important tool in Statistics and handling huge amounts of data. Like, when the omniscient mean is unknown (sample mean) then variance is used as biased estimator. Real world observations like the value of increase and decrease of all shares of a company throughout the day cannot be all sets of possible observations. As such, variance is calculated from a finite set of data, although it won’t match when calculated taking the whole population into consideration, but still it will give the user an estimate which is enough to chalk out other calculations.

Last Updated on March 1, 2022 by admin

## Three-dimensional Plotting in Python using MatplotlibThree-dimensional Plotting in Python using Matplotlib

Three-dimensional Plotting in Python using Matplotlib Matplotlib was introduced keeping in mind, only two-dimensional plotting. But at the time when the release of 1.0 occurred, the 3d utilities were developed upon the 2d and thus, we have 3d implementation of data

## struct module in Pythonstruct module in Python

struct module in Python This module performs conversions between Python values and C structs represented as Python bytes objects. Format strings are the mechanism used to specify the expected layout when packing and unpacking data. Module struct is available in

## How to Convert Int to Bytes in Python?How to Convert Int to Bytes in Python?

How to Convert Int to Bytes in Python? An int object can be used to represent the same value in the format of the byte. The integer represents a byte, is stored as an array with its most significant digit

## How to get file size in Python?How to get file size in Python?

How to get file size in Python? We can follow different approaches to get the file size in Python. It’s important to get the file size in Python to monitor file size or in case of ordering files in the

## Python program to get all subsets of given size of a setPython program to get all subsets of given size of a set

Python program to get all subsets of given size of a set Given a set, write a Python program to generate all possible subset of size n of given set within a list. Examples:   Input : {1, 2, 3}, n =

## How to save a Python Dictionary to a CSV File?How to save a Python Dictionary to a CSV File?

How to save a Python Dictionary to a CSV File? CSV (comma-separated values) files are one of the easiest ways to transfer data in form of string especially to any spreadsheet program like Microsoft Excel or Google spreadsheet. In this article,

## Pandas – How to shuffle a DataFrame rowsPandas – How to shuffle a DataFrame rows

Pandas – How to shuffle a DataFrame rows Let us see how to shuffle the rows of a DataFrame. We will be using the sample() method of the pandas module to to randomly shuffle DataFrame rows in Pandas. Algorithm : Import the pandas and numpy modules.

## Python – Largest, Smallest, Second Largest, Second Smallest in a ListPython – Largest, Smallest, Second Largest, Second Smallest in a List

Python | Largest, Smallest, Second Largest, Second Smallest in a List Since, unlike other programming languages, Python does not have arrays, instead, it has list. Using lists is more easy and comfortable to work with in comparison to arrays. Moreover,