Overview

Dataset info

Number of variables: 14
Number of observations: 45726
Missing cells: 29703 (4.6%)
Duplicate rows: 0 (0.0%)
Total size in memory: 4.6 MiB
Average record size in memory: 105.0 B
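
The missing-cell percentage is the number of missing cells divided by the total number of cells (observations times variables). A minimal sketch of that arithmetic, using only the figures reported here; the function name `missing_cell_share` is hypothetical:

```python
def missing_cell_share(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Fraction of all cells (rows x columns) that are missing."""
    return n_missing / (n_rows * n_cols)


# Figures taken from the dataset summary in this report.
share = missing_cell_share(n_missing=29703, n_rows=45726, n_cols=14)
print(f"{share:.1%}")
```

The same ratio can be recomputed after dropping columns or rows to see how much of the missingness a cleaning step removes.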

Variable types

Numeric: 4
Categorical: 5
Boolean: 1
Date: 1
URL: 0
Text (Unique): 1
Rejected: 2
Unsupported: 0

Warnings

[Warning] GeoLocation has a high cardinality: 17101 distinct values
[Missing] GeoLocation has 7315 (16.0%) missing values
[Skewed] mass_(g) is highly skewed (γ1 = 76.91847245)
[Warning] recclass has a high cardinality: 466 distinct values
[Zeros] reclat has 6438 (14.1%) zeros
[Missing] reclat has 7315 (16.0%) missing values
[Rejected] reclat_city is highly correlated with reclat (ρ = 0.9942518712)
[Zeros] reclong has 6214 (13.6%) zeros
[Missing] reclong has 7315 (16.0%) missing values
[Rejected] source has constant value "NASA"
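
Warnings of this kind can be approximated with a few simple per-column checks. The following is a minimal stdlib-only sketch, not the report generator's actual implementation; the function name `column_warnings`, the use of `None` for missing values, and the cardinality threshold of 50 are assumptions made for illustration:

```python
from typing import Any, List, Optional, Sequence


def column_warnings(name: str, values: Sequence[Optional[Any]],
                    cardinality_threshold: int = 50) -> List[str]:
    """Return report-style warning strings for one column.

    `None` marks a missing value; the threshold is an assumed default.
    """
    n = len(values)
    present = [v for v in values if v is not None]
    distinct = set(present)
    warnings = []

    n_missing = n - len(present)
    if n_missing:
        warnings.append(f"{name} has {n_missing} ({n_missing / n:.1%}) missing values")

    # Count exact numeric zeros (bools are excluded on purpose).
    n_zeros = sum(1 for v in present
                  if isinstance(v, (int, float)) and not isinstance(v, bool) and v == 0)
    if n_zeros:
        warnings.append(f"{name} has {n_zeros} ({n_zeros / n:.1%}) zeros")

    if len(distinct) == 1 and n_missing == 0:
        warnings.append(f'{name} has constant value "{next(iter(distinct))}"')
    elif all(isinstance(v, str) for v in present) and len(distinct) > cardinality_threshold:
        warnings.append(f"{name} has a high cardinality: {len(distinct)} distinct values")

    return warnings


# Tiny illustrative columns, not the real dataset.
print(column_warnings("source", ["NASA"] * 4))
print(column_warnings("reclat", [0.0, -71.5, None, 0.0]))
```

Skewness and correlation checks are left out here because they need numeric moments rather than simple counts.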

Variables

boolean
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0

Value  Count  Frequency (%)
True   23002  50.3%
False  22724  49.7%

fall
Categorical

Distinct count: 2
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0

Value  Count  Frequency (%)
Found  44609  97.6%
Fell   1117   2.4%

Max length: 5
Mean length: 4.975571885
Min length: 4
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False

GeoLocation
Categorical

Distinct count: 17101
Unique (%): 37.4%
Missing (%): 16.0%
Missing (n): 7315

Value                   Count  Frequency (%)
(0.0, 0.0)              6214   13.6%
(-71.5, 35.66667)       4761   10.4%
(-84.0, 168.0)          3040   6.6%
(-72.0, 26.0)           1505   3.3%
(-79.68333, 159.75)     657    1.4%
(-76.71667, 159.66667)  637    1.4%
(-76.18333, 157.16667)  539    1.2%
(-79.68333, 155.75)     473    1.0%
(-84.21667, 160.5)      263    0.6%
(-86.36667, -70.0)      226    0.5%
Other values (17090)    20096  43.9%
(Missing)               7315   16.0%

Max length: 24
Mean length: 15.01640205
Min length: 3
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True
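
GeoLocation is stored as text, with the first number matching reclat and the second matching reclong in the most frequent values. A minimal sketch of turning such strings into numeric pairs, assuming every value has the "(lat, lon)" form shown in the table; the helper names `parse_geolocation` and `is_zero_placeholder` are hypothetical, and treating (0.0, 0.0) as a placeholder is an assumption, not something the report states:

```python
from typing import Optional, Tuple


def parse_geolocation(text: Optional[str]) -> Optional[Tuple[float, float]]:
    """Parse a string such as "(-71.5, 35.66667)" into a (lat, lon) pair of floats.

    Returns None for missing or malformed input.
    """
    if text is None:
        return None
    parts = text.strip().strip("()").split(",")
    if len(parts) != 2:
        return None
    try:
        return float(parts[0]), float(parts[1])
    except ValueError:
        return None


def is_zero_placeholder(point: Optional[Tuple[float, float]]) -> bool:
    """True for the exact (0.0, 0.0) pair, which may stand in for an unknown location."""
    return point == (0.0, 0.0)


print(parse_geolocation("(-71.5, 35.66667)"))
print(is_zero_placeholder(parse_geolocation("(0.0, 0.0)")))
```

Parsing once into numeric columns also makes the high-cardinality warning moot, since the text column no longer needs to be treated as a category.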

id
Numeric

Distinct count: 45716
Unique (%): > 99.9%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 26883.9062
Minimum: 1
Maximum: 57458
Zeros (%): 0.0%

Quantile statistics

Minimum: 1
5-th percentile: 2388.75
Q1: 12681.25
Median: 24256.5
Q3: 40653.5
95-th percentile: 54890.75
Maximum: 57458
Range: 57457
Interquartile range: 27972.25

Descriptive statistics

Standard deviation: 16863.44557
Coef of variation: 0.6272691713
Kurtosis: -1.160130804
Mean: 26883.9062
MAD: 14489.93531
Skewness: 0.2665300704
Sum: 1229293495
Variance: 284375796.4
Memory size: 357.4 KiB
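
Several of the id statistics are derived from others. As a quick consistency check, here is a sketch that assumes the report uses the standard definitions (interquartile range = Q3 - Q1, range = maximum - minimum, coefficient of variation = standard deviation / mean, variance = standard deviation squared); the variable names are illustrative:

```python
# Values copied from the id statistics in this report.
q1, q3 = 12681.25, 40653.5
minimum, maximum = 1, 57458
mean, std = 26883.9062, 16863.44557

iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range
cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance from the standard deviation

print(iqr, value_range, round(cv, 4), f"{variance:.4g}")
```

The same relations hold for the other numeric variables, so they can be used to spot transcription errors in any of the statistics blocks.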
[Histogram: fixed size bins (bins=50)]
[Histogram: variable size bins (bins=[1.00000e+00 1.16250e+03 1.24350e+03 2.35650e+03 2.43150e+03 ... 5.49015e+04 5.49255e+04 5.72245e+04 5.72885e+04 5.74580e+04], "bayesian blocks" binning strategy used)]

Value                 Count  Frequency (%)
417                   2      < 0.1%
398                   2      < 0.1%
1                     2      < 0.1%
6                     2      < 0.1%
392                   2      < 0.1%
370                   2      < 0.1%
379                   2      < 0.1%
2                     2      < 0.1%
390                   2      < 0.1%
10                    2      < 0.1%
Other values (45706)  45706  > 99.9%

Minimum 5 values

Value  Count  Frequency (%)
1      2      < 0.1%
2      2      < 0.1%
4      1      < 0.1%
5      1      < 0.1%
6      2      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
57458  1      < 0.1%
57457  1      < 0.1%
57456  1      < 0.1%
57455  1      < 0.1%
57454  1      < 0.1%

mass_(g)
Numeric

Distinct count: 12577
Unique (%): 27.5%
Missing (%): 0.3%
Missing (n): 131
Infinite (%): 0.0%
Infinite (n): 0
Mean: 13278.42646
Minimum: 0
Maximum: 60000000
Zeros (%): < 0.1%

Quantile statistics

Minimum: 0
5-th percentile: 1.1
Q1: 7.2
Median: 32.61
Q3: 202.9
95-th percentile: 4000
Maximum: 60000000
Range: 60000000
Interquartile range: 195.7

Descriptive statistics

Standard deviation: 574926.0121
Coef of variation: 43.2977517
Kurtosis: 6798.398388
Mean: 13278.42646
MAD: 25112.89201
Skewness: 76.91847245
Sum: 605429854.6
Variance: 3.305399193e+11
Memory size: 357.4 KiB
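
The very large skewness of mass_(g) reflects a few extremely heavy records against a median of about 33 g. A common way to tame such a distribution is a log transform. The sketch below is illustrative only: it uses the population skewness formula (the report may use a bias-corrected estimator), and the sample is just the quantile values listed in this section, not the full column:

```python
import math
from typing import Sequence


def skewness(values: Sequence[float]) -> float:
    """Population skewness: third central moment divided by sigma cubed."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Illustrative sample only: quantile values from this section, not the real column.
sample = [1.1, 7.2, 32.61, 202.9, 4000.0, 60000000.0]

# log10 is undefined at 0, so zero masses are dropped before transforming.
logged = [math.log10(v) for v in sample if v > 0]

print(f"raw skewness: {skewness(sample):.3f}")
print(f"log10 skewness: {skewness(logged):.3f}")
```

Because the column contains zeros, any log-based transform needs an explicit rule for them; dropping them, as done here, is only one option.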
[Histogram: fixed size bins (bins=50)]

Value                 Count  Frequency (%)
1.3                   171    0.4%
1.2                   140    0.3%
1.4                   138    0.3%
2.1                   130    0.3%
2.4                   126    0.3%
1.6                   120    0.3%
0.5                   119    0.3%
1.1                   116    0.3%
3.8                   114    0.2%
0.7                   111    0.2%
Other values (12566)  44310  96.9%
(Missing)             131    0.3%

Minimum 5 values

Value  Count  Frequency (%)
0      19     < 0.1%
0.01   2      < 0.1%
0.013  1      < 0.1%
0.02   1      < 0.1%
0.03   1      < 0.1%

Maximum 5 values

Value     Count  Frequency (%)
60000000  1      < 0.1%
58200000  1      < 0.1%
50000000  1      < 0.1%
30000000  1      < 0.1%
28000000  1      < 0.1%

mixed
Categorical

Distinct count: 2
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0

Value  Count  Frequency (%)
1      22935  50.2%
A      22791  49.8%

Max length: 1
Mean length: 1
Min length: 1
Contains chars: True
Contains digits: True
Contains spaces: False
Contains non-words: False

name
Categorical, Unique

First 5 values

Value        Count  Frequency (%)
Aachen       1      < 0.1%
Aachen copy  1      < 0.1%
Aarhus       1      < 0.1%
Aarhus copy  1      < 0.1%
Abajo        1      < 0.1%

Last 5 values

Value           Count  Frequency (%)
Święcany        1      < 0.1%
Łowicz          1      < 0.1%
Österplana 064  1      < 0.1%
Österplana 063  1      < 0.1%
Österplana 062  1      < 0.1%

nametype
Categorical

Distinct count: 2
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0

Value   Count  Frequency (%)
Valid   45651  99.8%
Relict  75     0.2%

Max length: 6
Mean length: 5.001640205
Min length: 5
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False

recclass
Categorical

Distinct count: 466
Unique (%): 1.0%
Missing (%): 0.0%
Missing (n): 0

Value               Count  Frequency (%)
L6                  8287   18.1%
H5                  7143   15.6%
L5                  4797   10.5%
H6                  4529   9.9%
H4                  4211   9.2%
LL5                 2766   6.0%
LL6                 2043   4.5%
L4                  1253   2.7%
H4/5                428    0.9%
CM2                 416    0.9%
Other values (456)  9853   21.5%

Max length: 26
Mean length: 3.052530289
Min length: 1
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

reclat
Numeric

Distinct count: 12739
Unique (%): 27.9%
Missing (%): 16.0%
Missing (n): 7315
Infinite (%): 0.0%
Infinite (n): 0
Mean: -39.10709514
Minimum: -87.36667
Maximum: 81.16667
Zeros (%): 14.1%

Quantile statistics

Minimum: -87.36667
5-th percentile: -84.35476
Q1: -76.71377
Median: -71.5
Q3: 0
95-th percentile: 34.494325
Maximum: 81.16667
Range: 168.53334
Interquartile range: 76.71377

Descriptive statistics

Standard deviation: 46.38601095
Coef of variation: -1.186127755
Kurtosis: -1.476865084
Mean: -39.10709514
MAD: 43.93747025
Skewness: 0.4913157316
Sum: -1502142.632
Variance: 2151.662012
Memory size: 357.4 KiB
[Histogram: fixed size bins (bins=50)]

Value                 Count  Frequency (%)
0                     6438   14.1%
-71.5                 4761   10.4%
-84                   3040   6.6%
-72                   1506   3.3%
-79.68333             1130   2.5%
-76.71667             680    1.5%
-76.18333             539    1.2%
-84.21667             263    0.6%
-86.36667             226    0.5%
-86.71667             217    0.5%
Other values (12728)  19611  42.9%
(Missing)             7315   16.0%

Minimum 5 values

Value      Count  Frequency (%)
-87.36667  4      < 0.1%
-87.03333  3      < 0.1%
-86.93333  3      < 0.1%
-86.71667  217    0.5%
-86.56667  17     < 0.1%

Maximum 5 values

Value     Count  Frequency (%)
81.16667  1      < 0.1%
76.53333  1      < 0.1%
76.13333  1      < 0.1%
72.88333  1      < 0.1%
72.68333  1      < 0.1%

reclat_city
Highly correlated

This variable is highly correlated with reclat and should be ignored for analysis

Correlation: 0.9942518712
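
The rejection rests on the Pearson correlation coefficient between the two columns. A minimal stdlib-only sketch of that check follows; the 0.9 threshold and the helper names `pearson` and `should_reject` are assumptions for illustration, and the sample values are made up rather than taken from the dataset:

```python
import math
from typing import Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def should_reject(rho: float, threshold: float = 0.9) -> bool:
    """Flag a column as redundant when |rho| exceeds the (assumed) threshold."""
    return abs(rho) > threshold


# Made-up values standing in for reclat and a near-duplicate column.
reclat = [-71.5, -84.0, 0.0, 35.2, 50.8]
reclat_city = [-71.4, -84.1, 0.2, 35.0, 51.0]

rho = pearson(reclat, reclat_city)
print(should_reject(rho))
```

Rows with a missing value in either column would need to be dropped before calling `pearson`, since the sketch does not handle gaps.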

reclong
Numeric

Distinct count: 14641
Unique (%): 32.0%
Missing (%): 16.0%
Missing (n): 7315
Infinite (%): 0.0%
Infinite (n): 0
Mean: 61.05259359
Minimum: -165.43333
Maximum: 354.47333
Zeros (%): 13.6%

Quantile statistics

Minimum: -165.43333
5-th percentile: -90.427
Q1: 0
Median: 35.66667
Q3: 157.16667
95-th percentile: 168
Maximum: 354.47333
Range: 519.90666
Interquartile range: 157.16667

Descriptive statistics

Standard deviation: 80.65525774
Coef of variation: 1.321078319
Kurtosis: -0.7313935567
Mean: 61.05259359
MAD: 67.60562132
Skewness: -0.1743813291
Sum: 2345091.172
Variance: 6505.2706
Memory size: 357.4 KiB
[Histogram: fixed size bins (bins=50)]

Value                 Count  Frequency (%)
0                     6214   13.6%
35.66667              4985   10.9%
168                   3040   6.6%
26                    1506   3.3%
159.75                657    1.4%
159.66667             637    1.4%
157.16667             542    1.2%
155.75                473    1.0%
160.5                 263    0.6%
-70                   228    0.5%
Other values (14630)  19866  43.4%
(Missing)             7315   16.0%

Minimum 5 values

Value       Count  Frequency (%)
-165.43333  9      < 0.1%
-165.11667  17     < 0.1%
-163.16667  1      < 0.1%
-162.55     1      < 0.1%
-157.86667  1      < 0.1%

Maximum 5 values

Value      Count  Frequency (%)
354.47333  1      < 0.1%
178.2      1      < 0.1%
178.08333  1      < 0.1%
175.73028  1      < 0.1%
175.13333  1      < 0.1%
 

source
Constant

This variable is constant and should be ignored for analysis

Constant value: NASA

year
Date

Distinct count: 246
Unique (%): 0.5%
Missing (%): 0.7%
Missing (n): 312
Infinite (%): 0.0%
Infinite (n): 0
Minimum: 1688-01-01 00:00:00
Maximum: 2101-01-01 00:00:00

[Histogram of 'year' (bins=N)]

Correlations