Chris Wen's Blog
Exploratory Data Analysis
Created: 2025-11-10 | Updated: 2025-11-27 | DA, EDA | Word Count: 0 | Reading Time: 1 min
Author: Chris Wen
Link: https://wenyupeng.github.io/2025/11/10/big_data/eda/01-concept/
Copyright Notice: All articles on this blog are licensed under CC BY-NC-SA 4.0 unless otherwise stated.
EDA