Overview

Dataset statistics

Number of variables6
Number of observations865
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory190.2 KiB
Average record size in memory225.2 B

Variable types

NUM3
CAT3

Warnings

Hex has a high cardinality: 765 distinct values High cardinality
Hex is uniformly distributed Uniform
Code has unique values Unique
Name has unique values Unique
R has 81 (9.4%) zeros Zeros
G has 58 (6.7%) zeros Zeros
B has 80 (9.2%) zeros Zeros

Reproduction

Analysis started2020-10-25 20:12:01.642222
Analysis finished2020-10-25 20:12:05.507834
Duration3.87 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

Code
Categorical

UNIQUE

Distinct865
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
deep_carmine
 
1
tango_pink
 
1
pale_magenta
 
1
lavender_gray
 
1
deep_carmine_pink
 
1
Other values (860)
860 
ValueCountFrequency (%) 
deep_carmine10.1%
 
tango_pink10.1%
 
pale_magenta10.1%
 
lavender_gray10.1%
 
deep_carmine_pink10.1%
 
licorice10.1%
 
plum_traditional10.1%
 
bright_cerulean10.1%
 
amethyst10.1%
 
harlequin10.1%
 
rose_madder10.1%
 
navy_blue10.1%
 
orange_color_wheel10.1%
 
green_color_wheel_x11_green10.1%
 
fandango10.1%
 
sand_dune10.1%
 
peach_crayola10.1%
 
beige10.1%
 
blue_ncs10.1%
 
otter_brown10.1%
 
orchid10.1%
 
peach_puff10.1%
 
amber10.1%
 
ufo_green10.1%
 
bright_turquoise10.1%
 
Other values (840)84097.1%
 
2020-10-25T20:12:05.628240image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique865 ?
Unique (%)100.0%
2020-10-25T20:12:05.861737image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length39
Median length11
Mean length11.37572254
Min length3

Overview of Unicode Properties

Unique unicode characters31
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e120112.2%
 
_7998.1%
 
r7968.1%
 
a7888.0%
 
l6957.1%
 
n6266.4%
 
i5585.7%
 
o5195.3%
 
t3964.0%
 
u3733.8%
 
s3433.5%
 
d3413.5%
 
c3233.3%
 
p3063.1%
 
g3023.1%
 
b2923.0%
 
m2532.6%
 
h1841.9%
 
y1811.8%
 
k1471.5%
 
w1271.3%
 
f920.9%
 
v900.9%
 
z410.4%
 
q210.2%
 
Other values (6)460.5%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter902591.7%
 
Connector Punctuation7998.1%
 
Decimal Number160.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e120113.3%
 
r7968.8%
 
a7888.7%
 
l6957.7%
 
n6266.9%
 
i5586.2%
 
o5195.8%
 
t3964.4%
 
u3734.1%
 
s3433.8%
 
d3413.8%
 
c3233.6%
 
p3063.4%
 
g3023.3%
 
b2923.2%
 
m2532.8%
 
h1842.0%
 
y1812.0%
 
k1471.6%
 
w1271.4%
 
f921.0%
 
v901.0%
 
z410.5%
 
q210.2%
 
j170.2%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_799100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
11381.2%
 
916.2%
 
716.2%
 
316.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin902591.7%
 
Common8158.3%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e120113.3%
 
r7968.8%
 
a7888.7%
 
l6957.7%
 
n6266.9%
 
i5586.2%
 
o5195.8%
 
t3964.4%
 
u3734.1%
 
s3433.8%
 
d3413.8%
 
c3233.6%
 
p3063.4%
 
g3023.3%
 
b2923.2%
 
m2532.8%
 
h1842.0%
 
y1812.0%
 
k1471.6%
 
w1271.4%
 
f921.0%
 
v901.0%
 
z410.5%
 
q210.2%
 
j170.2%
 

Most frequent Common characters

ValueCountFrequency (%) 
_79998.0%
 
1131.6%
 
910.1%
 
710.1%
 
310.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII9840100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e120112.2%
 
_7998.1%
 
r7968.1%
 
a7888.0%
 
l6957.1%
 
n6266.4%
 
i5585.7%
 
o5195.3%
 
t3964.0%
 
u3733.8%
 
s3433.5%
 
d3413.5%
 
c3233.3%
 
p3063.1%
 
g3023.1%
 
b2923.0%
 
m2532.6%
 
h1841.9%
 
y1811.8%
 
k1471.5%
 
w1271.3%
 
f920.9%
 
v900.9%
 
z410.4%
 
q210.2%
 
Other values (6)460.5%
 

Name
Categorical

UNIQUE

Distinct865
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
Scarlet
 
1
Telemagenta
 
1
Fluorescent Orange
 
1
Dark Byzantium
 
1
Honeydew
 
1
Other values (860)
860 
ValueCountFrequency (%) 
Scarlet10.1%
 
Telemagenta10.1%
 
Fluorescent Orange10.1%
 
Dark Byzantium10.1%
 
Honeydew10.1%
 
Sandstorm10.1%
 
Air Force Blue (Raf)10.1%
 
Light Green10.1%
 
Lion10.1%
 
Blue (Ryb)10.1%
 
Medium Purple10.1%
 
Columbia Blue10.1%
 
Redwood10.1%
 
Pale Taupe10.1%
 
Shocking Pink10.1%
 
Light Cyan10.1%
 
Aqua10.1%
 
Usc Gold10.1%
 
Feldgrau10.1%
 
Dark Orchid10.1%
 
Dark Pastel Red10.1%
 
Fuchsia Pink10.1%
 
Burgundy10.1%
 
Dark Jungle Green10.1%
 
International Orange (Golden Gate Bridge)10.1%
 
Other values (840)84097.1%
 
2020-10-25T20:12:06.121238image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique865 ?
Unique (%)100.0%
2020-10-25T20:12:06.345917image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length41
Median length11
Mean length11.59190751
Min length3

Overview of Unicode Properties

Unique unicode characters69
Unique unicode categories9 ?
Unique unicode scripts2 ?
Unique unicode blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e116811.6%
 
7657.6%
 
a7377.4%
 
r6616.6%
 
l6116.1%
 
n6096.1%
 
i5365.3%
 
o4634.6%
 
u3453.4%
 
t3283.3%
 
d2512.5%
 
s2502.5%
 
B2062.1%
 
P1741.7%
 
c1651.6%
 
g1621.6%
 
h1621.6%
 
m1581.6%
 
C1581.6%
 
y1441.4%
 
G1401.4%
 
k1371.4%
 
R1351.3%
 
p1321.3%
 
M950.9%
 
Other values (44)133513.3%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter736973.5%
 
Uppercase Letter166116.6%
 
Space Separator7657.6%
 
Open Punctuation890.9%
 
Close Punctuation890.9%
 
Dash Punctuation200.2%
 
Other Punctuation170.2%
 
Decimal Number160.2%
 
Final Punctuation1< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
B20612.4%
 
P17410.5%
 
C1589.5%
 
G1408.4%
 
R1358.1%
 
M955.7%
 
S935.6%
 
D905.4%
 
L845.1%
 
T684.1%
 
O563.4%
 
A513.1%
 
F482.9%
 
W402.4%
 
Y372.2%
 
E332.0%
 
V301.8%
 
U281.7%
 
I221.3%
 
H221.3%
 
N171.0%
 
J130.8%
 
K100.6%
 
X70.4%
 
Q20.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e116815.9%
 
a73710.0%
 
r6619.0%
 
l6118.3%
 
n6098.3%
 
i5367.3%
 
o4636.3%
 
u3454.7%
 
t3284.5%
 
d2513.4%
 
s2503.4%
 
c1652.2%
 
g1622.2%
 
h1622.2%
 
m1582.1%
 
y1442.0%
 
k1371.9%
 
p1321.8%
 
w871.2%
 
b861.2%
 
v600.8%
 
f440.6%
 
z390.5%
 
q190.3%
 
x60.1%
 
Other values (4)90.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
765100.0%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(89100.0%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)89100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
/741.2%
 
'635.3%
 
#211.8%
 
&15.9%
 
.15.9%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-20100.0%
 

Most frequent Final Punctuation characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
11381.2%
 
916.2%
 
716.2%
 
316.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin903090.1%
 
Common9979.9%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e116812.9%
 
a7378.2%
 
r6617.3%
 
l6116.8%
 
n6096.7%
 
i5365.9%
 
o4635.1%
 
u3453.8%
 
t3283.6%
 
d2512.8%
 
s2502.8%
 
B2062.3%
 
P1741.9%
 
c1651.8%
 
g1621.8%
 
h1621.8%
 
m1581.7%
 
C1581.7%
 
y1441.6%
 
G1401.6%
 
k1371.5%
 
R1351.5%
 
p1321.5%
 
M951.1%
 
S931.0%
 
Other values (30)101011.2%
 

Most frequent Common characters

ValueCountFrequency (%) 
76576.7%
 
(898.9%
 
)898.9%
 
-202.0%
 
1131.3%
 
/70.7%
 
'60.6%
 
#20.2%
 
10.1%
 
&10.1%
 
910.1%
 
710.1%
 
310.1%
 
.10.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1002199.9%
 
None5< 0.1%
 
Punctuation1< 0.1%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e116811.7%
 
7657.6%
 
a7377.4%
 
r6616.6%
 
l6116.1%
 
n6096.1%
 
i5365.3%
 
o4634.6%
 
u3453.4%
 
t3283.3%
 
d2512.5%
 
s2502.5%
 
B2062.1%
 
P1741.7%
 
c1651.6%
 
g1621.6%
 
h1621.6%
 
m1581.6%
 
C1581.6%
 
y1441.4%
 
G1401.4%
 
k1371.4%
 
R1351.3%
 
p1321.3%
 
M950.9%
 
Other values (40)132913.3%
 

Most frequent Punctuation characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent None characters

ValueCountFrequency (%) 
é360.0%
 
à120.0%
 
ú120.0%
 

Hex
Categorical

HIGH CARDINALITY
UNIFORM

Distinct765
Distinct (%)88.4%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
#c19a6b
 
5
#967117
 
4
#fada5e
 
4
#008000
 
3
#f88379
 
3
Other values (760)
846 
ValueCountFrequency (%) 
#c19a6b50.6%
 
#96711740.5%
 
#fada5e40.5%
 
#00800030.3%
 
#f8837930.3%
 
#d2691e30.3%
 
#fad6a530.3%
 
#0f030.3%
 
#dda0dd30.3%
 
#90030.3%
 
#0ff30.3%
 
#80808030.3%
 
#a52a2a30.3%
 
#cf030.3%
 
#483c3230.3%
 
#f3e5ab20.2%
 
#e9745120.2%
 
#c2b28020.2%
 
#65432120.2%
 
#00ff7f20.2%
 
#50c87820.2%
 
#6f00ff20.2%
 
#e9d66b20.2%
 
#f0e68c20.2%
 
#80008020.2%
 
Other values (740)79692.0%
 
2020-10-25T20:12:06.606268image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique684 ?
Unique (%)79.1%
2020-10-25T20:12:06.971269image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length7
Median length7
Mean length6.798843931
Min length4

Overview of Unicode Properties

Unique unicode characters17
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
#86514.7%
 
066511.3%
 
f62510.6%
 
83175.4%
 
c3005.1%
 
a2925.0%
 
e2694.6%
 
42684.6%
 
b2684.6%
 
32674.5%
 
d2654.5%
 
62654.5%
 
72524.3%
 
92504.3%
 
52484.2%
 
22434.1%
 
12223.8%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number299751.0%
 
Lowercase Letter201934.3%
 
Other Punctuation86514.7%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
#865100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
066522.2%
 
831710.6%
 
42688.9%
 
32678.9%
 
62658.8%
 
72528.4%
 
92508.3%
 
52488.3%
 
22438.1%
 
12227.4%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
f62531.0%
 
c30014.9%
 
a29214.5%
 
e26913.3%
 
b26813.3%
 
d26513.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Common386265.7%
 
Latin201934.3%
 

Most frequent Common characters

ValueCountFrequency (%) 
#86522.4%
 
066517.2%
 
83178.2%
 
42686.9%
 
32676.9%
 
62656.9%
 
72526.5%
 
92506.5%
 
52486.4%
 
22436.3%
 
12225.7%
 

Most frequent Latin characters

ValueCountFrequency (%) 
f62531.0%
 
c30014.9%
 
a29214.5%
 
e26913.3%
 
b26813.3%
 
d26513.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII5881100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
#86514.7%
 
066511.3%
 
f62510.6%
 
83175.4%
 
c3005.1%
 
a2925.0%
 
e2694.6%
 
42684.6%
 
b2684.6%
 
32674.5%
 
d2654.5%
 
62654.5%
 
72524.3%
 
92504.3%
 
52484.2%
 
22434.1%
 
12223.8%
 

R
Real number (ℝ≥0)

ZEROS

Distinct221
Distinct (%)25.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean158.5988439
Minimum0
Maximum255
Zeros81
Zeros (%)9.4%
Memory size6.9 KiB
2020-10-25T20:12:07.166206image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1101
median178
Q3236
95-th percentile255
Maximum255
Range255
Interquartile range (IQR)135

Descriptive statistics

Standard deviation85.33843164
Coefficient of variation (CV)0.5380772617
Kurtosis-0.9264508707
Mean158.5988439
Median Absolute Deviation (MAD)66
Skewness-0.5936792074
Sum137188
Variance7282.647915
MonotocityNot monotonic
2020-10-25T20:12:07.388018image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
25511012.7%
 
0819.4%
 
250151.7%
 
204131.5%
 
128111.3%
 
150111.3%
 
227101.2%
 
153101.2%
 
244101.2%
 
24091.0%
 
25191.0%
 
23091.0%
 
25380.9%
 
21880.9%
 
22280.9%
 
24880.9%
 
20570.8%
 
25470.8%
 
19370.8%
 
10270.8%
 
7260.7%
 
15260.7%
 
23360.7%
 
21560.7%
 
25260.7%
 
Other values (196)47755.1%
 
ValueCountFrequency (%) 
0819.4%
 
140.5%
 
210.1%
 
320.2%
 
510.1%
 
610.1%
 
840.5%
 
1010.1%
 
1110.1%
 
1310.1%
 
ValueCountFrequency (%) 
25511012.7%
 
25470.8%
 
25380.9%
 
25260.7%
 
25191.0%
 
250151.7%
 
24940.5%
 
24880.9%
 
24730.3%
 
24620.2%
 

G
Real number (ℝ≥0)

ZEROS

Distinct234
Distinct (%)27.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean124.683237
Minimum0
Maximum255
Zeros58
Zeros (%)6.7%
Memory size6.9 KiB
2020-10-25T20:12:07.613197image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q164
median123
Q3190
95-th percentile250
Maximum255
Range255
Interquartile range (IQR)126

Descriptive statistics

Standard deviation76.27022506
Coefficient of variation (CV)0.6117119422
Kurtosis-1.097846721
Mean124.683237
Median Absolute Deviation (MAD)63
Skewness0.0522334723
Sum107851
Variance5817.14723
MonotocityNot monotonic
2020-10-25T20:12:07.835215image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0586.7%
 
255354.0%
 
128131.5%
 
105121.4%
 
51111.3%
 
204111.3%
 
6691.0%
 
10291.0%
 
21891.0%
 
16091.0%
 
11380.9%
 
6480.9%
 
13080.9%
 
15480.9%
 
12770.8%
 
15370.8%
 
22170.8%
 
11170.8%
 
10470.8%
 
21460.7%
 
22260.7%
 
6960.7%
 
7960.7%
 
13260.7%
 
24060.7%
 
Other values (209)58667.7%
 
ValueCountFrequency (%) 
0586.7%
 
120.2%
 
220.2%
 
320.2%
 
620.2%
 
820.2%
 
1030.3%
 
1120.2%
 
1230.3%
 
1420.2%
 
ValueCountFrequency (%) 
255354.0%
 
25430.3%
 
25320.2%
 
25220.2%
 
25110.1%
 
25050.6%
 
24910.1%
 
24840.5%
 
24720.2%
 
24610.1%
 

B
Real number (ℝ≥0)

ZEROS

Distinct230
Distinct (%)26.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean119.0878613
Minimum0
Maximum255
Zeros80
Zeros (%)9.2%
Memory size6.9 KiB
2020-10-25T20:12:08.050504image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q153
median119
Q3186
95-th percentile253.6
Maximum255
Range255
Interquartile range (IQR)133

Descriptive statistics

Standard deviation78.34386249
Coefficient of variation (CV)0.6578660634
Kurtosis-1.13796004
Mean119.0878613
Median Absolute Deviation (MAD)66
Skewness0.1072876893
Sum103011
Variance6137.76079
MonotocityNot monotonic
2020-10-25T20:12:08.276173image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0809.2%
 
255414.7%
 
107151.7%
 
128141.6%
 
204101.2%
 
12091.0%
 
9491.0%
 
5180.9%
 
3380.9%
 
5980.9%
 
15380.9%
 
9680.9%
 
5080.9%
 
22070.8%
 
6070.8%
 
12270.8%
 
8170.8%
 
20770.8%
 
7170.8%
 
23070.8%
 
3470.8%
 
3270.8%
 
3070.8%
 
10270.8%
 
12770.8%
 
Other values (205)55564.2%
 
ValueCountFrequency (%) 
0809.2%
 
230.3%
 
310.1%
 
520.2%
 
720.2%
 
830.3%
 
910.1%
 
1020.2%
 
1130.3%
 
1230.3%
 
ValueCountFrequency (%) 
255414.7%
 
25430.3%
 
25210.1%
 
25110.1%
 
25070.8%
 
24910.1%
 
24530.3%
 
24420.2%
 
24110.1%
 
24060.7%
 

Interactions

2020-10-25T20:12:03.439980image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2020-10-25T20:12:03.624100image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2020-10-25T20:12:03.796224image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2020-10-25T20:12:03.966143image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2020-10-25T20:12:04.143099image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/