Contents
Multiple Choice Questions (MCQs)
1. Which Python library is primarily used to load the IPL dataset?
a. TensorFlow
b. Pandas
c. OpenCV
d. Keras
Answer: b. Pandas
Pandas provides the read_csv() function, which is commonly used to load CSV datasets into DataFrames.
2. What is the purpose of Exploratory Data Analysis (EDA)?
a. To deploy a machine learning model
b. To understand the structure and characteristics of a dataset
c. To create a database
d. To replace missing values automatically
Answer: b. To understand the structure and characteristics of a dataset
EDA helps identify trends, patterns, missing values, and potential issues before detailed analysis.
3. Which visualization is best suited for comparing the number of matches won by different teams?
a. Scatter Plot
b. Histogram
c. Bar Chart
d. Heatmap
Answer: c. Bar Chart
Bar charts provide a clear comparison between different categories.
4. Which column is commonly used to analyze the best-performing players in the IPL dataset?
a. venue
b. toss_winner
c. player_of_match
d. date
Answer: c. player_of_match
The player_of_match column records the player who received the Player of the Match award.
5. Why is data cleaning performed before analysis?
a. To reduce the number of Python libraries
b. To improve dataset quality and ensure accurate analysis
c. To increase the dataset size
d. To improve internet speed
Answer: b. To improve dataset quality and ensure accurate analysis
Cleaning removes inconsistencies and handles missing values, making the analysis more reliable.










