Overview

Dataset info

Number of variables: 14
Number of observations: 220056
Missing cells: 41688 (1.4%)
Duplicate rows: 0 (0.0%)
Total size in memory: 23.5 MiB
Average record size in memory: 112.0 B
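
The derived figures in this block can be reproduced from the raw counts. A minimal sketch, assuming the missing-cell percentage is missing cells divided by rows times columns, and that total memory is the average record size times the number of observations. The function names are illustrative and not part of any profiling library.

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Share of missing cells among all cells, in percent."""
    return 100.0 * missing_cells / (n_rows * n_cols)


def total_size_mib(n_rows: int, avg_record_bytes: float) -> float:
    """Total in-memory size in MiB, derived from the average record size."""
    return n_rows * avg_record_bytes / 2**20


# Figures taken from the Dataset info block.
pct = missing_cell_pct(41688, 220056, 14)
size = total_size_mib(220056, 112.0)
print(f"{pct:.1f}% missing, {size:.1f} MiB")  # 1.4% missing, 23.5 MiB
```

The 41688 missing cells also match the two Missing warnings added together (39256 + 2432).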

Variable types

Numeric: 5
Categorical: 2
Boolean: 0
Date: 1
URL: 0
Text (Unique): 0
Rejected: 6
Unsupported: 0

Warnings

AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG (ρ = 0.9538974473) [Rejected]
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC (ρ = 0.937317187) [Rejected]
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD (ρ = 0.9295724875) [Rejected]
DATUM_BESTAND has constant value "2019-07-10" [Rejected]
GEMIDDELDE_VERKOOPPRIJS has 39256 (17.8%) missing values [Missing]
PEILDATUM has constant value "2019-07-01" [Rejected]
TYPERENDE_DIAGNOSE_CD has a high cardinality: 1772 distinct values [Warning]
VERSIE has constant value "1.0" [Rejected]
ZORGPRODUCT_CD has a high cardinality: 5872 distinct values [Warning]
ZORGPRODUCT_CD has 2432 (1.1%) missing values [Missing]
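
The three correlation warnings reject a variable because its Pearson ρ against another variable is very high. A minimal sketch of that kind of check follows. The 0.9 cutoff, the function names, and the toy data are assumptions for illustration; the report itself does not state the threshold it used.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def is_rejected(xs, ys, threshold=0.9):
    """Flag the pair when |rho| exceeds the (assumed) threshold."""
    return abs(pearson(xs, ys)) > threshold


# Toy data, not taken from the dataset: subtraject counts roughly track patient counts.
patients = [10, 50, 200, 1000, 4000]
subtrajects = [12, 61, 230, 1300, 4500]
print(is_rejected(patients, subtrajects))
```

With a rule like this, only one variable of each correlated pair needs to be kept for modelling, which is why the AANTAL_SUBTRAJECT_* columns are marked Rejected while the AANTAL_PAT_* columns are profiled in full.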

Variables

AANTAL_PAT_PER_DIAG
Numeric

Distinct count: 6901
Unique (%): 3.1%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 7389.779034
Minimum: 1
Maximum: 205513
Zeros (%): 0.0%

Quantile statistics

Minimum: 1
5-th percentile: 32
Q1: 354
Median: 1570
Q3: 6061
95-th percentile: 35984
Maximum: 205513
Range: 205512
Interquartile range: 5707
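
The Range and Interquartile range rows follow from the other quantiles: the range is maximum minus minimum, and the interquartile range is Q3 minus Q1 (here Q1 = 354 and Q3 = 6061). A small sketch of that arithmetic, with an illustrative helper name:

```python
def spread(minimum, q1, q3, maximum):
    """Return (range, interquartile range) from four quantile values."""
    return maximum - minimum, q3 - q1


# Values from the AANTAL_PAT_PER_DIAG quantile block.
value_range, iqr = spread(minimum=1, q1=354, q3=6061, maximum=205513)
print(value_range, iqr)  # 205512 5707
```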

Descriptive statistics

Standard deviation: 17269.46408
Coef of variation: 2.336939169
Kurtosis: 31.30153635
Mean: 7389.779034
MAD: 9038.062863
Skewness: 4.920357863
Sum: 1626165215
Variance: 298234389.4
Memory size: 1.7 MiB
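
Several of these rows are derived from one another, which gives a quick consistency check: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. A sketch of that check, using the values reported for AANTAL_PAT_PER_DIAG (the variable names are illustrative):

```python
from math import isclose

mean = 7389.779034
std = 17269.46408
n = 220056  # number of observations; this column has no missing values

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation
total = mean * n       # sum recovered from mean and row count

# Compare against the reported values, allowing for rounding in the report.
print(isclose(cv, 2.336939169, rel_tol=1e-6))
print(isclose(variance, 298234389.4, rel_tol=1e-6))
print(isclose(total, 1626165215, rel_tol=1e-6))
```

A coefficient of variation above 2, together with a mean (7389.8) far above the median (1570), points to a strongly right-skewed distribution, in line with the reported skewness of 4.92.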
[Histogram: fixed size bins (bins=50)]
[Histogram: variable size bins, "bayesian blocks" binning strategy, bin edges 1.000000e+00, 1.500000e+00, 2.350000e+01, 3.450000e+01, 3.550000e+01, ..., 1.383325e+05, 1.520900e+05, 1.572990e+05, 1.738185e+05, 2.055130e+05]
Value                 Count    Frequency (%)
6                     420      0.2%
32                    398      0.2%
4                     391      0.2%
12                    388      0.2%
21                    386      0.2%
19                    386      0.2%
8                     385      0.2%
23                    385      0.2%
5                     379      0.2%
17                    378      0.2%
Other values (6891)   216160   98.2%
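
A table like this lists the ten most frequent values and folds the remainder into a single "Other values" row: 6901 distinct values minus the 10 shown leaves 6891, and 220056 rows minus the 3896 shown leaves 216160. A minimal sketch of how such a table can be built, assuming a plain list of values; the function name and toy data are illustrative only.

```python
from collections import Counter


def frequency_table(values, top=10):
    """Return (label, count, percent) rows for the most common values,
    followed by one aggregate row for everything else."""
    counts = Counter(values)
    n = len(values)
    rows = [(str(v), c, 100.0 * c / n) for v, c in counts.most_common(top)]
    shown = sum(c for _, c, _ in rows)
    rest = len(counts) - len(rows)  # distinct values not listed individually
    if rest > 0:
        rows.append((f"Other values ({rest})", n - shown, 100.0 * (n - shown) / n))
    return rows


# Toy data, not the real column.
for label, count, pct in frequency_table([6, 6, 6, 32, 32, 4, 12], top=2):
    print(f"{label:<20} {count:>5} {pct:5.1f}%")
```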
 

Minimum 5 values

Value   Count   Frequency (%)
1       340     0.2%
2       374     0.2%
3       355     0.2%
4       391     0.2%
5       379     0.2%

Maximum 5 values

Value    Count   Frequency (%)
205513   19      < 0.1%
200182   17      < 0.1%
199981   16      < 0.1%
197742   20      < 0.1%
189114   19      < 0.1%

AANTAL_PAT_PER_SPC
Numeric

Distinct count: 217
Unique (%): 0.1%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 642319.0376
Minimum: 83
Maximum: 1489568
Zeros (%): 0.0%

Quantile statistics

Minimum: 83
5-th percentile: 30777
Q1: 242846
Median: 713937
Q3: 977237
95-th percentile: 1328494
Maximum: 1489568
Range: 1489485
Interquartile range: 734391

Descriptive statistics

Standard deviation: 426698.7108
Coef of variation: 0.6643096122
Kurtosis: -1.108280086
Mean: 642319.0376
MAD: 373675.5045
Skewness: 0.06739837272
Sum: 1.413461581e+11
Variance: 1.820717898e+11
Memory size: 1.7 MiB
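
The negative kurtosis suggests the report uses excess kurtosis, where a normal distribution scores 0; this is an assumption, since the report does not say which convention it follows. The sketch below computes skewness and excess kurtosis from population moments. Library estimators often apply a small-sample bias correction, so their results can differ slightly; the function name and toy data are illustrative.

```python
def skew_and_excess_kurtosis(values):
    """Population skewness and excess kurtosis (a normal distribution gives 0 and 0)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    m4 = sum((v - mean) ** 4 for v in values) / n  # fourth central moment
    return m3 / m2 ** 1.5, m4 / m2 ** 2 - 3.0


# Toy data: evenly spread values are symmetric and flat-topped,
# so skewness is 0 and excess kurtosis is negative (about -1.2).
skew, kurt = skew_and_excess_kurtosis(list(range(1, 101)))
print(round(skew, 3), round(kurt, 3))
```

Under that reading, the values for AANTAL_PAT_PER_SPC (skewness 0.067, kurtosis -1.108) sit close to those of an evenly spread distribution, unlike the heavy right tails of the other two counts.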
[Histogram: fixed size bins (bins=50)]
[Histogram: variable size bins, "bayesian blocks" binning strategy, bin edges 8.300000e+01, 9.250000e+01, 1.170000e+02, 2.470000e+02, 4.725000e+02, ..., 1.302159e+06, 1.318038e+06, 1.436341e+06, 1.470131e+06, 1.489568e+06]
Value                Count    Frequency (%)
881250               5107     2.3%
870026               4401     2.0%
871428               4372     2.0%
841375               4367     2.0%
1061055              3974     1.8%
1058528              3972     1.8%
693207               3970     1.8%
977237               3872     1.8%
1040393              3858     1.8%
995598               3724     1.7%
Other values (207)   178439   81.1%

Minimum 5 values

Value   Count   Frequency (%)
83      13      < 0.1%
102     6       < 0.1%
132     3       < 0.1%
362     57      < 0.1%
583     102     < 0.1%

Maximum 5 values

Value     Count   Frequency (%)
1489568   2981    1.4%
1450694   3057    1.4%
1421988   3588    1.6%
1328494   3616    1.6%
1307582   3590    1.6%

AANTAL_PAT_PER_ZPD
Numeric

Distinct count: 8019
Unique (%): 3.6%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 484.0502236
Minimum: 1
Maximum: 150256
Zeros (%): 0.0%

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
Median: 12
Q3: 92
95-th percentile: 1592
Maximum: 150256
Range: 150255
Interquartile range: 90

Descriptive statistics

Standard deviation: 3048.750513
Coef of variation: 6.298417736
Kurtosis: 369.6471736
Mean: 484.0502236
MAD: 776.5827494
Skewness: 16.23883553
Sum: 106518156
Variance: 9294879.691
Memory size: 1.7 MiB
[Histogram: fixed size bins (bins=50)]