Business Statistics презентация

Содержание

Слайд 2

About Applied Statistics

About Applied Statistics

Слайд 3

Слайд 4

Data Structures Classifying the Various Types of Data Sets Data

Data Structures Classifying the Various Types of Data Sets

Data can come to

you in several different forms, and it will be useful to have a basic catalog of the different kinds of data so that you can recognize them and use appropriate techniques for each. A data set consists of observations on items, typically with the same information being recorded for each item.
A piece of information recorded for every item (eg, its cost) is called a variable. The number of variables (pieces of information) recorded for each item indicates the complexity of the data set and will guide you toward the proper kinds of analyses. Depending on whether one, two, or many variables are present, you have univariate, bivariate, or multivariate data, respectively.
Слайд 5

Univariate Data Univariate (one-variable) data sets have just one piece

Univariate Data

Univariate (one-variable) data sets have just one piece of information

recorded for each item. Statistical methods are used to summarize the basic properties of this single piece of information, answering such questions as:
1. What is a typical (summary) value?
2. How diverse are these items?
3. Do any individuals or groups require special attention?
Слайд 6

Bivariate Data Bivariate (two-variable) data sets have exactly two pieces

Bivariate Data

Bivariate (two-variable) data sets have exactly two pieces of information

recorded for each item. In addition to summarizing each of these two variables separately (each as its own univariate data set), statistical methods would also be used to explore the relationship between the two factors being measured in the following ways:
1. Is there a simple relationship between the two?
2. How strongly are they related?
3. Can you predict one from the other? If so, with what degree of reliability?
4. Do any individuals or groups require special attention?
Слайд 7

Multivariate Data Multivariate (many-variable) data sets have three or more

Multivariate Data

Multivariate (many-variable) data sets have three or more pieces of

information recorded for each item. In addition to summarizing each of these variables separately (as a univariate data set), and in addition to looking at the relationship between any two variables (as a bivariate data set), statistical methods would also be used to look at the interrelationships among all the items, addressing the following questions:
1. Is there a simple relationship among them?
2. How strongly are they related?
3. Can you predict one (a “special variable”) from the others? With what degree of reliability?
4. Do any individuals or groups require special attention?
Слайд 8

Слайд 9

QUANTITATIVE DATA: NUMBERS Meaningful numbers are numbers that directly represent

QUANTITATIVE DATA: NUMBERS
Meaningful numbers are numbers that directly represent the measured

or observed amount of some characteristic or quality of the elementary units, as the result of an observation of a variable. Meaningful numbers include, for example, dollar amounts, counts, sizes, numbers of employees.
If the data for a variable comes to you as meaningful numbers, then you have quantitative data (ie, they represent quantities). With quantitative data, you can do all of the usual number-crunching tasks, such as finding the average and measuring the variability.
It is straightforward to compute directly with numerical data.
There are two kinds of quantitative data, discrete and continuous, depending on the values potentially observable.
Слайд 10

Discrete data is a count that can't be made more

Discrete data is a count that can't be made more precise.

Typically it involves integers. For instance, the number of children (or adults, or pets) in your family is discrete data, because you are counting whole, indivisible entities: you can't have 2.5 kids, or 1.3 pets.
Continuous data, on the other hand, could be divided and reduced to finer and finer levels. For example, you can measure the height of your kids at progressively more precise scales—meters, centimeters, millimeters, and beyond—so height is continuous data.
Слайд 11

QUALITATIVE DATA: CATEGORIES Qualitative data is defined as the data

QUALITATIVE DATA: CATEGORIES

Qualitative data is defined as the data that approximates

and characterizes.
This data type is non-numerical in nature. This type of data is collected through methods of observations, one-to-one interview, conducting focus groups and similar methods.
Qualitative data in statistics is also known as categorical data.
There are two kinds of qualitative data: ordinal (for which there is a meaningful ordering but no meaningful numerical assignment) and nominal (for which there is no meaningful order).
Слайд 12

Charts, Histograms, Graphing

Charts, Histograms, Graphing

Слайд 13

Слайд 14

USING A HISTOGRAM TO DISPLAY THE FREQUENCIES The histogram displays

USING A HISTOGRAM TO DISPLAY THE FREQUENCIES

The histogram displays the frequencies as

a bar chart rising above the number line, indicating how often the various values occur in the data set. The horizontal axis represents the measurements of the data set (eg, in dollars, number of people, miles per gallon, etc.), and the vertical axis represents how often these values occur. An especially high bar indicates that many data values were found at this position on the horizontal number line, while a shorter bar indicates a less common value.
A histogram is a bar chart of the frequencies, not of the data.
Слайд 15

Creating Frequency Distributions and Histograms in EXCEL

Creating Frequency Distributions and Histograms in EXCEL

Слайд 16

Consider the interest rate for 25-year fixed-rate home mortgages charged by mortgage companies in Seattle

Consider the interest rate for 25-year fixed-rate home mortgages charged by

mortgage companies in Seattle
Слайд 17

Слайд 18

Слайд 19

Слайд 20

NORMAL DISTRIBUTIONS A normal distribution is an idealized, smooth, bell-shaped

NORMAL DISTRIBUTIONS

A normal distribution is an idealized, smooth, bell-shaped histogram with

all of the randomness removed. It represents an ideal data set that has lots of numbers concentrated in the middle of the range, with the remaining numbers trailing off symmetrically on both sides. This degree of smoothness is not attainable by real data.
Слайд 21

Слайд 22

There are actually many different normal distributions, all symmetrically bell-shaped.

There are actually many different normal distributions, all symmetrically bell-shaped. They

differ in that the center can be anywhere, and the scale (the width of the bell) can have any size.
Слайд 23

Слайд 24

BIMODAL DISTRIBUTIONS WITH TWO GROUPS It is important to be

BIMODAL DISTRIBUTIONS WITH TWO GROUPS

It is important to be able to recognize

when a data set consists of two or more distinct groups so that they may be analyzed separately, if appropriate. This can be seen in a histogram as a distinct gap between two cohesive groups of bars. When two clearly separate groups are visible in a histogram, you have a bimodal distribution. Literally, a bimodal distribution has two modes, or two distinct clusters of data.
Слайд 25

Task 1. Using the table below build different types of charts and graphing

Task 1. Using the table below build different types of charts

and graphing
Слайд 26

TASK 2 Consider the yields (as an interest rate, in

TASK 2 Consider the yields (as an interest rate, in percent

per year) of municipal bonds, as shown in Table 3.8.1.
a. Construct a histogram of this data set.
b. Describe the shape of the distribution.
Слайд 27

Landmark Summaries Interpreting Typical Values and Percentiles WHAT IS THE

Landmark Summaries Interpreting Typical Values and Percentiles

WHAT IS THE MOST TYPICAL VALUE?
There

are three different ways to obtain such a summary measure:
1. The average or mean, which can be computed only for meaningful numbers (quantitative data).
2. The median, or halfway point, which can be computed either for ordered categories (ordinal data) or for numbers.
3. The mode, or most common category, which can be computed for unordered categories (nominal data), ordered categories, or numbers.
Слайд 28

The Average: A Typical Value for Quantitative Data The average

The Average: A Typical Value for Quantitative Data

The average (also called the

mean) is the most common method for finding a typical value for a list of numbers, found by adding up all the values and then dividing by the number of items. The sample average expressed as a formula is:
Слайд 29

Слайд 30

Excel’s Average function

Excel’s Average function

Слайд 31

Слайд 32

Слайд 33

The Weighted Average: Adjusting for Importance The weighted average is

The Weighted Average: Adjusting for Importance

The weighted average is like the average,

except that it allows you to give a different importance, or “weight,” to each data item. The weighted average gives you the flexibility to define your own system of importance when it is not appropriate to treat each item equally.
Слайд 34

Слайд 35

The Median: A Typical Value for Quantitative and Ordinal Data

The Median: A Typical Value for Quantitative and Ordinal Data

The median is

the middle value; half of the items in the set are larger and half are smaller. Thus, itmust be in the center of the data and provide an effective summary of the list of data. You find it by first putting the data in order and then locating the middle value. To be precise, you have to pay attention to some details; for example, you might have to average the two middle values if there is no single value in the middle.
One way to define the median is in terms of ranks. Ranks associate the numbers 1, 2, 3,…,n with the data values so that the smallest has rank 1, the next smallest has rank 2, and so forth up to the largest, which has rank n.
Слайд 36

Слайд 37

Слайд 38

The Mode: A Typical Value Even for Nominal Data The

The Mode: A Typical Value Even for Nominal Data

The mode is the

most common category, the one listed most often in the data set. It is the only summary measure available for nominal qualitative data because unordered categories cannot be summed (as for the average) and cannot be ranked (as for the median). The mode is easily found for ordinal data by ignoring the ordering of the categories and proceeding as if you had a nominal data set with unordered categories.
Mode is the value which occurs most frequently. The mode may not exist, and even if it does, it may not be unique.
Слайд 39

Example 1 The moda of the ungrouped data: 20, 18,

Example 1
The moda of the ungrouped data: 20, 18, 15, 15,

14, 12, 11, 9, 7, 6, 4, 1 is 15
Example 2
{2, 2, 2, 4, 5, 6, 7, 7, 7}
Mode = 2 or 7 (Bimodal)
Слайд 40

Слайд 41

Task To calculate: The Average and The Weighted Average and

Task

To calculate:
The Average and The Weighted Average and Rang
1) 24,

26, 26, 29, 33, 36, 37
2) 24, 26, 26, 29, 33, 36, 37, 45
Median:
1) 37, 26, 29, 33, 24, 36, 26
2) 45, 26, 36, 29, 33, 37, 24, 26,
Moda
1) 24, 26, 26, 29, 33, 36, 37
2) 36, 26, 26, 29, 33, 36, 37, 45
Слайд 42

Consider the profits of health care companies in the Fortune

Consider the profits of health care companies in the Fortune 500,

as shown in Table 4.3.3
a. Draw a histogram of this data set, and briefly describe the shape of the distribution.
b. Find the profit of the average firm.
c. Find the median profit level.
d. Compare the average and median; in particular, which is larger? Is this what you would expect for a distribution with this shape?
Слайд 43

Task Consider the loan fees charged for granting home mortgages,

Task

Consider the loan fees charged for granting home mortgages,
as shown in

Table 4.3.8, for Dallas TX, 30-year fixed rate for home purchase, with credit Score 740+, and with 20% down payment. These are given as a percentage of the loan amount and are one-time fees paid when the loan is closed.
a. Find the average loan fee.
b. Find the median loan fee.
c. Find the mode.
d. Which summary is most useful as a description of the “typical” loan fee, the average, median, or mode? Why?
Слайд 44

Слайд 45

Probability Understanding Random Situations Probability is a measure quantifying the

Probability Understanding Random Situations

Probability is a measure quantifying the likelihood that events will

occur. 
Probability. How likely something is to happen. Many events can't be predicted with total certainty. 
If events are mutually incompatible (they can’t occure at the same time) and they constitute all possibilities for a given situation, the sum of their individual probabilities is one. For example, a fair cubical die has a probability
pi = 1/6
of landing with the top face showing one dot (i=1) facing up; indeed pi = 1/6 for all i. The probability that a fair die will land showing a top face of either a 1 or a 2 or a 3 or a 4 or a 5 or a 6 is
Σpi = 1
Слайд 46

A probability tree is a picture indicating probabilities and conditional

A probability tree is a picture indicating probabilities and conditional probabilities

for combinations of two or more events. We will begin with an example of a completed tree and follow up with the details of how to construct the tree.
Probability trees are closely related to decision trees, which are used in finance and other fields in business.
Имя файла: Business-Statistics.pptx
Количество просмотров: 86
Количество скачиваний: 0