Descriptive Statistics, Covariance, and Correlation Quiz
Question 1
Which of the following best describes the role of statistics in data science?
It helps in cleaning and visualizing data
It enables predictive modeling
It allows inference from sample data to the population
All of the above
Question 2
Which level of measurement allows all mathematical operations including multiplication and division?
Nominal
Ordinal
Interval
Ratio
Question 3
If a dataset contains the values [2, 3, 3, 5, 7], what is the mode?
2
3
5
7
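To check a mode calculation like the one in this question, a minimal sketch using Python's standard `statistics` module may help. The variable names are illustrative only; `multimode` is included because a dataset can have more than one mode.

```python
from statistics import mode, multimode

# Dataset taken from the question above.
values = [2, 3, 3, 5, 7]

# mode() returns the single most frequent value.
most_common = mode(values)

# multimode() returns every value tied for the highest frequency.
all_modes = multimode(values)

print(most_common, all_modes)
```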
Question 4
What is the correct formula for variance?
σ² = ∑(x - μ)² / N
σ = ∑(x - μ)² / N
σ = ∑(x - μ) / N
σ² = ∑(x - μ) / N
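As a rough way to test a candidate variance formula, the sketch below computes the population variance step by step and compares it with `statistics.pvariance`. The sample data is an assumption borrowed from Question 3, not part of this question.

```python
from statistics import pvariance

# Illustrative data (assumed; reused from Question 3).
values = [2, 3, 3, 5, 7]

n = len(values)
mu = sum(values) / n  # population mean

# Average of the squared deviations from the mean.
var_manual = sum((x - mu) ** 2 for x in values) / n

# Sum of the raw (unsquared) deviations, for comparison:
# positive and negative deviations cancel each other out.
raw_deviation_sum = sum(x - mu for x in values)

print(var_manual, pvariance(values), raw_deviation_sum)
```

The raw deviations summing to zero is why the deviations have to be squared before averaging.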
Question 5
Which measure of central tendency is most affected by outliers?
Median
Mode
Mean
None of these
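One way to explore sensitivity to outliers is to append a single extreme value and see how each measure moves. The data and the outlier value of 100 below are assumptions chosen for illustration.

```python
from statistics import mean, median, mode

# Illustrative data (assumed) and the same data with one extreme value added.
base = [2, 3, 3, 5, 7]
with_outlier = base + [100]

# Compare each measure of central tendency before and after the outlier.
mean_shift = mean(with_outlier) - mean(base)
median_shift = median(with_outlier) - median(base)
mode_unchanged = mode(with_outlier) == mode(base)

print(mean_shift, median_shift, mode_unchanged)
```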
Question 6
What does standard deviation measure?
The difference between the largest and smallest values
The spread of data around the mean
The middle value of a dataset
The most frequently occurring value
Question 7
What is the purpose of a frequency distribution table?
To summarize and categorize data
To calculate the mean of a dataset
To measure variability in data
To determine correlation between variables
Question 8
What is the main difference between covariance and correlation?
Covariance measures both direction and strength, while correlation only measures direction
Covariance depends on the scale of variables, whereas correlation is standardized
Covariance is always positive, while correlation can be negative
Covariance is used only for categorical data, while correlation is used for numerical data
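To see how covariance and correlation respond to a change of units, the sketch below rescales one variable from hours to minutes. The helper functions and the sample data are hypothetical, written for this illustration, and use the population (divide by N) convention.

```python
from math import sqrt

def covariance(xs, ys):
    """Population covariance: mean product of deviations from each mean."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n

def correlation(xs, ys):
    """Pearson correlation: covariance divided by both standard deviations."""
    return covariance(xs, ys) / sqrt(covariance(xs, xs) * covariance(ys, ys))

# Assumed sample data: study time and a test score.
hours = [1, 2, 3, 4, 5]
score = [2, 4, 5, 4, 5]
minutes = [h * 60 for h in hours]  # same quantity, different unit

cov_hours, cov_minutes = covariance(hours, score), covariance(minutes, score)
corr_hours, corr_minutes = correlation(hours, score), correlation(minutes, score)

print(cov_hours, cov_minutes)
print(corr_hours, corr_minutes)
```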
Question 9
Which of the following is true about a covariance matrix?
It only measures relationships between two variables
The diagonal elements represent the covariance between different variables
It is a square matrix where diagonal elements represent variance and off-diagonal elements represent covariance
It is used to find non-linear relationships in a dataset
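The structure of a covariance matrix can be inspected with a small pure-Python sketch. The `covariance_matrix` helper and the three data columns are assumptions made for illustration; in practice a library routine would usually be used instead.

```python
from statistics import pvariance

def covariance(xs, ys):
    """Population covariance of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n

def covariance_matrix(columns):
    """Entry [i][j] is the covariance between column i and column j."""
    return [[covariance(a, b) for b in columns] for a in columns]

# Three assumed variables measured on the same five observations.
columns = [
    [1, 2, 3, 4, 5],
    [2, 4, 5, 4, 5],
    [9, 7, 6, 4, 1],
]

matrix = covariance_matrix(columns)
for row in matrix:
    print(row)
```

With three variables the result has three rows and three columns, so the matrix is not limited to a single pair of variables.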
Question 10
When should Spearman’s Rank Correlation be used instead of Pearson’s correlation?
When the relationship between variables is strictly linear
When the data contains outliers or follows a non-normal distribution
When the data is already standardized
When analyzing categorical data
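To compare the two coefficients on a relationship that is monotonic but not linear, the sketch below computes Spearman's coefficient as the Pearson correlation of the ranks. The helpers are hypothetical, and the simple `ranks` function assumes there are no tied values.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def ranks(values):
    """Rank values from 1 (smallest) upward; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        result[i] = rank
    return result

def spearman(xs, ys):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))

# Assumed data: y always increases with x, but along a curve.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

pearson_r = pearson(x, y)
spearman_rho = spearman(x, y)
print(pearson_r, spearman_rho)
```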