Introduction I am a current student in Eastern University’s MS in Data Science program. The program started in August, and is now starting its 6th term. I’m only entering my third term though. This program provides comprehensive education in all aspects of data science, data visualization and machine learning. I’m really enjoying it! There’s not…
Category: Math and Statistics
Using Benford’s Law to Detect Fraud
Introduction I’ve been working my way through Fraud and Fraud Detection (Gee, 2014), a 2014 book by Sunder Gee intended for accountants and auditors (but valuable to the rest of us!) The book goes through statistical tests that can be used on sales, purchases, expense reports, payroll, and other data sources in order to detect…
Review of UpLevel Data Science 2020
Introduction Data Science and data analysis have been an interest of mine for a while. Combining statistics, computer programming and domain knowledge into one applied field ticks a lot of boxes of interest for me. I learn best in a structured environment though, and I’d been looking for something to help me build my skills….
Scales of Measurement
Introduction This post is part of a series I’ve been chipping away at, where I teach basic statistics and probability. The other posts in the series include: Measures of Central Tendency Frequency Distributions Dispersion and Variability (Standard Deviation) Z-Scores Z-Test Hypothesis Testing Correlation (Calculating Pearson’s r) Least-Squares Regression Statistical Coding and Classification Variables are the…
Z-Test Hypothesis Testing
Introduction The Z-test is a simple tool for hypothesis testing that can be used to identify whether a mean result, when compared to a larger set is statistically significant when the larger set is a normal distribution. Many datasets (for instance population height, test scores, etc.) have normal distributions. If you’re unsure whether your dataset has…