Data Science Quiz For Humanities

R-bloggers 2025-11-22

[This article was first published on coding-the-past, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Test your skills with this interactive data science quiz covering statistics, Python, R, and data analysis.

.quiz-container { font-family: Inter, system-ui, -apple-system, "Segoe UI", Roboto, "Helvetica Neue", Arial; max-width: 900px; margin: 2rem auto; padding: 1.25rem; } .meta { text-align: center; color: #555; margin-bottom: 1.25rem; } .progress-wrap { background:#eee; border-radius:999px; overflow:hidden; height:14px; margin-bottom:1rem; box-shadow: inset 0 1px 2px rgba(0,0,0,0.03); } .progress-bar { height:100%; width:0%; transition: width 450ms cubic-bezier(.2,.8,.2,1); background: linear-gradient(90deg,#4f46e5,#06b6d4); } .question { background:#fbfdff; border:1px solid #eef2ff; padding:14px; border-radius:12px; margin-bottom:14px; box-shadow: 0 1px 2px rgba(13,17,25,0.03); } .q-head { display:flex; justify-content:space-between; align-items:center; gap:12px; } .q-num { background:#eef2ff; color:#3730a3; padding:6px 10px; border-radius:999px; font-weight:600; font-size:0.9rem; } .options label { display:block; margin:8px 0; padding:8px 10px; border-radius:8px; cursor:pointer; transition: background 180ms, transform 120ms; } .options input { margin-right:8px; } .options label:hover { transform: translateY(-2px); } .correct { background: #ecfdf5; border:1px solid #bbf7d0; } .incorrect { background: #ffefef; border:1px solid #fca5a5; } .muted { color:#666; font-size:0.9rem; } .controls { display:flex; gap:12px; justify-content:flex-end; align-items:center; margin-top:12px; } button.primary { background:#4f46e5; color:white; border:none; padding:10px 16px; border-radius:10px; cursor:pointer; font-weight:600; } button.ghost { background:transparent; border:1px solid #e5e7eb; padding:8px 12px; border-radius:10px; cursor:pointer; } #result { margin-top:16px; font-size:1.05rem; font-weight:700; text-align:center; } .explanation { margin-top:8px; font-size:0.95rem; color:#0f172a; } .fade-in { animation: fadeIn 380ms ease both; } @keyframes fadeIn { from { opacity:0; transform: translateY(6px);} to {opacity:1; transform:none;} } Progress

Answered 0 of 15

Which of the following best describes a z-score?

A measure of central tendency The number of standard deviations a value is from the mean The square of the correlation coefficient A type of probability distribution

What is the main advantage of using tidy data principles in R?

Increased computation speed Easier visualization and consistent analysis Reduced memory usage Automatically removes missing values

In Python, which library is most commonly used for data manipulation?

matplotlib numpy pandas statsmodels

Which metric is best for evaluating a classification model on imbalanced data?

Accuracy Recall Variance R-squared

In a linear regression, what does R² represent?

Slope of the regression line Variance explained by the model Covariance between variables Degree of overfitting

In historical or humanities datasets, which challenge occurs most frequently?

Excessively large sample sizes Perfectly standardized variable names Missing or incomplete records Highly structured relational databases

What does the groupby() function do in pandas?

Sorts values by category Applies aggregate operations to subsets of data Removes duplicates Normalizes columns

What is the primary purpose of cross-validation?

Increase training accuracy Test different loss functions Evaluate a model on unseen data to reduce overfitting Speed up model training

Feature engineering refers to:

Training a model with more iterations Preparing input variables to improve model performance Removing outliers Selecting the best model

Which visualization is most appropriate for the distribution of a continuous variable?

Bar chart Histogram Pie chart Line plot

A z-score of +2.5 means:

The value is below the mean The value is 2.5 SD above the mean The value is an outlier The standard deviation is 2.5

Which is an advantage of using R for statistical analysis?

Native GPU acceleration Strong statistical libraries and ggplot2 Automatic machine learning Faster than Python

Normalization in data preprocessing means:

Converting categorical data to numeric Rescaling values to a standard range like 0–1 Detecting outliers Filling missing values

Why may historical datasets be biased?

They always include all records Selective or incomplete record-keeping Automatic modern data collection Perfect measurement systems

Which Python function can compute a z-score?

pandas.normalize() scipy.stats.zscore() numpy.z() matplotlib.stats()

Submit Quiz Try again

To leave a comment for the author, please follow the link and comment on their blog: coding-the-past.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.

Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Continue reading: Data Science Quiz For Humanities