What method is used to compute average or central value of collected data brainly?

Want to join the conversation?

  • What method is used to compute average or central value of collected data brainly?

    What's the point of squaring the difference at

    5:49

    just to make it positive when we could have taken the absolute value?

    • What method is used to compute average or central value of collected data brainly?

      What method is used to compute average or central value of collected data brainly?

      The main reason to square the values is so they are all positive. You could take the absolute value instead, but squaring means that more variable points have a higher weighting. Squaring rather than taking the absolute value also means that taking the derivative of the function is easier.
      Finally you can also view the equal as being the Euclidean distance between all the points and the mean of the points (in the same way that the distance between two points (x1, y1) and (x2, y2) is the square root of (x1-x2)^2 + (y1-y2)^2).

      Credits to Peter Collingridge

  • Why is it for the variance we square the deviations for data sets to make them positive? Doesn't it make more sense to simplely take the sum of their absolute values, then divide that by the number of data points?

  • In what case will either Variance or Standard Deviation be preferred over each other ?
    At around

    9:55

    Sal says that Variance has an odd set of units so is standard deviation better as it has the same units as the data itself ?

    • What method is used to compute average or central value of collected data brainly?

      In practical settings, the standard deviation is probably almost always preferred. the variance is more often used in the background, deriving this or that, or used in the theory of something.

  • What is the difference between the standard deviation and the variance? Why is the variance in units squared and not represented by the units in the measurement? If squaring the numbers is just to make it positive (@

    5:48

    ), why not use the average absolute deviation?

    • What method is used to compute average or central value of collected data brainly?

      Q1) The Standard Deviation is the "mean of mean". Basically, it is the square-root of the Variance (the mean of the differences between the data points and the average). Standard Deviation is the measure of how far a typical value in the set is from the average. The smaller the Standard Deviation, the closely grouped the data point are. The standard deviation of {1,2,3} would be less than the Standard Deviation of {0,4,7,10}. You can see clearly that the data-points are grouped closely together more in the first set than the second set of data-points. And of course, you will see the same when you have endured the boring process of calculating the Variance and then the Standard Deviation.

      Therefore, the difference between Variance and the Standard Difference is that the Variance is "The average of the squared differences from the Mean" and the Standard Deviation is it's square-root.

      Q2) I think we could use the absolute value,but for the official definition, you have to square the differences. Thanks to Lura Ercolano for clearing my misconception about using absolute value to get the variance. But remember, by squaring the differences, we get a wider spread between the differences, which is called higher weighting.

  • I'm still kind of confused as to what exactly variance measures.

    • Variance simply tells you how spread your data is. On its own, it does not mean much, but it is particularly helpful when you compare two different samples:
      Sample 1: 1,2,3,4,3,1,2,3. Variance: small
      Sample 2: 1,2000,-23,500. Variance: much larger

  • There are many questioners here (including myself) wondering why squaring is used in the definition of variance instead of the more sensible absolute value. I've done a quick Web search on this question, and I believe I understand this better.

    First, almost all the the reasons given have to do with ease of computation. This is largely irrelevant, since we have computers to aid us. For example, the first derivative of the abs val func has a discontinuity at zero. But computer numerical analysis can handle discontinuities, so calculations using the abs val definition should be easy using an advanced computer calculator program.

    Next, some people like Euclidean distance over Manhattan (rectangular) distance. There is not much justification for this, as variance is not obviously an unconstrained 2-D distance problem. In fact, it is an n-dimensional problem, where n is the number of data measurements.

    No, the real reason is historical: Gauss used the square variance definition to introduce his concept of normal distributions, where it is a perfect and natural fit. We might say, a least-squares fit, since one of the motivations is fitting 2nd-order polynomials to the error data.

    The point is that the use of the mean and squaring in the definition of variance works great only for normal (Gaussian) distributions. As soon as you have data derived from two or more normal distributions, or a gamma distribution, or a Poisson distribution, or anything else, using abs val works better. In fact, the mean itself only works when there IS a mean,which is when the data is normally distributed. For general data, the mean is not defined, and other, more robust statistical measures must be found. (See https://en.wikipedia.org/wiki/Robust_statistics)

    In summary, the definition of variance given here by Khan only applies to Gaussian distributions, which frequently arise in nature and in human behavior. But non-Gaussian distributions also frequently arise (such as when making many measurements with a ruler). For non-Gaussian data, this def of variance is erroneous. Instead, a measure of variance that shows the spreading of the data from each other should be used rather than the standard one, which shows the order-2 spreading from the mean.

    Note: for more detailed information on the advantages of the related Absolute Mean Deviation, see http://www.leeds.ac.uk/educol/documents/00003759.htm .

  • So, by reading some of the questions and answers for this video, I have concluded the following: variance and standard deviation are artificial measures of dispersion, designed to be most useful in statistical calculations. The average of the absolute value of the difference of each data point from the mean COULD be used but the square method (variance) is generally adopted by statisticians and mathematicians for various reasons (eg derivatives are easier). Is this conclusion correct? Or is/are there other reasons that more variable points are given more weight (by use of squares not absolute values)?
    Thanks.

    • To some extent, I would say yes. Using squares (or the method of "least squares") certainly does often make derivations easier. Though it's not entirely the only reason. The Normal distribution goes hand-in-hand with the notion of squaring deviations, and scientists centuries ago noticed that the Normal distribution worked quite well to model their astronomical data.

      The method of least squares also results in the sample mean - a very intuitive and common measure of central tendency - being the "best" measure of center. And even better, the sampling distribution of the sample mean converges to the Normal distribution, so a lot of methods can be built on top of the Normal distribution, and as long as the sample mean is the starting point, everything should work out.

      I have a longer, more detailed answer here (I really wish the KA links were shorter):

      https://www.khanacademy.org/math/probability/descriptive-statistics/variance_std_deviation/v/range-variance-and-standard-deviation-as-measures-of-dispersion?qa_expand_key=kaencrypted_230b1942c74c60023e9fc29baaa3fd3e_2a53d64f8e4e84ddb90698dfc32164a7370b55e38674387fc44a77a9aa93675a7db92954f954bc475d07d0df2d6df8531fabb52aa5578a3086284bbcd09a14cf0f4833f1c1a9097f7763c5a7fecf9f46f9b173b3083ad409f140a4db31622998a483418a29c48e7b7cb59bef4f004d6edc1c381a3ac93f6e473241f6164026726e30926987a145fdab3c7817889565b99532a6d38f12105af2dcd93070dfb0dc

  • You lost me at "Standard Deviation"

    • i know.. watch the video twice and if you still dont get it, try to find additional sites online that could help you.. or just ask your teacher for help

  • I thought that when you calculate variance you divide by the number of terms minus 1? (n-1) Is this correct or was I told wrong?

    • Great question. We use (n-1) when we are estimating the variance of a population based on a sample which is much smaller than the population. We almost always use the variable "s²" for that. But in this video, we are saying that this dataset is the entire population, not just a sample. If you actually have the entire population, then you divide by n, and since that's the situation here, we use "σ²" to signal that this is an exact population variance, not just from a sample. The (n-1) is a correction factor that improves (on average) an estimated variance based on a sample.

What method is used to compute the average of central value of collected data?

The mean is the arithmetic average, and it is probably the measure of central tendency that you are most familiar. Calculating the mean is very simple. You just add up all of the values and divide by the number of observations in your dataset.

What method is used to compute the average?

Average This is the arithmetic mean, and is calculated by adding a group of numbers and then dividing by the count of those numbers. For example, the average of 2, 3, 3, 5, 7, and 10 is 30 divided by 6, which is 5.

Which measure of central tendency is derived from the most common values brainly?

The most frequently used measure of central tendency is the mean.

Which measure of central tendency is obtained using the middle score when all scores are organized numerical order?

Median = "middle" data point (or average of two middle data points) when the data points are arranged in numerical order.