diff --git a/stayvers2019/all_users_eda.html b/stayvers2019/all_users_eda.html new file mode 100644 index 0000000..be28617 --- /dev/null +++ b/stayvers2019/all_users_eda.html @@ -0,0 +1,15462 @@ +Lumocity All Users EDA Report

Overview

Dataset statistics

Number of variables30
Number of observations2737585
Missing cells238072
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory626.6 MiB
Average record size in memory240.0 B

Variable types

Numeric18
Boolean1
Categorical11

Warnings

runlengthprev has 238072 (8.7%) missing values Missing

Reproduction

Analysis started2021-02-25 23:46:00.622032
Analysis finished2021-02-25 23:47:27.632007
Duration1 minute and 27.01 seconds
Software versionpandas-profiling v2.10.0
Download configurationconfig.yaml

Variables

df_index
Real number (ℝ≥0)

Distinct750114
Distinct (%)27.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean349316.8219
Minimum0
Maximum750113
Zeros4
Zeros (%)< 0.1%
Memory size20.9 MiB
2021-02-26T00:47:28.477006image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile34219.2
Q1171099
median342198
Q3513297
95-th percentile695667.8
Maximum750113
Range750113
Interquartile range (IQR)342198

Descriptive statistics

Standard deviation208132.9105
Coefficient of variation (CV)0.5958284785
Kurtosis-1.101560579
Mean349316.8219
Median Absolute Deviation (MAD)171099
Skewness0.1226952489
Sum9.562844919 × 1011
Variance4.331930844 × 1010
MonotocityNot monotonic
2021-02-26T00:47:28.758862image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20474
 
< 0.1%
908864
 
< 0.1%
949804
 
< 0.1%
847394
 
< 0.1%
826904
 
< 0.1%
888334
 
< 0.1%
867844
 
< 0.1%
4392954
 
< 0.1%
4413424
 
< 0.1%
4351974
 
< 0.1%
Other values (750104)2737545
> 99.9%
ValueCountFrequency (%)
04
< 0.1%
14
< 0.1%
24
< 0.1%
34
< 0.1%
44
< 0.1%
ValueCountFrequency (%)
7501131
< 0.1%
7501121
< 0.1%
7501111
< 0.1%
7501101
< 0.1%
7501091
< 0.1%

correct
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
True
2620419 
False
 
117166
ValueCountFrequency (%)
True2620419
95.7%
False117166
 
4.3%

game_result_id
Real number (ℝ≥0)

Distinct46470
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31632.99275
Minimum1
Maximum76473
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:28.902801image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2712
Q113005
median28517
Q349148
95-th percentile69293
Maximum76473
Range76472
Interquartile range (IQR)36143

Descriptive statistics

Standard deviation21377.80202
Coefficient of variation (CV)0.6758071292
Kurtosis-1.061939459
Mean31632.99275
Median Absolute Deviation (MAD)17561
Skewness0.3422095459
Sum8.659800647 × 1010
Variance457010419.2
MonotocityNot monotonic
2021-02-26T00:47:29.023391image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1212279
 
< 0.1%
1207272
 
< 0.1%
1210268
 
< 0.1%
1209257
 
< 0.1%
1222249
 
< 0.1%
1257249
 
< 0.1%
1223245
 
< 0.1%
1250243
 
< 0.1%
1249242
 
< 0.1%
1233221
 
< 0.1%
Other values (46460)2735060
99.9%
ValueCountFrequency (%)
1121
< 0.1%
2114
< 0.1%
397
< 0.1%
493
< 0.1%
5110
< 0.1%
ValueCountFrequency (%)
7647371
< 0.1%
7647265
< 0.1%
7647063
< 0.1%
7645564
< 0.1%
7645164
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
R
685479 
D
684674 
U
684179 
L
683253 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R685479
25.0%
D684674
25.0%
U684179
25.0%
L683253
25.0%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
R
685485 
D
684766 
U
683900 
L
683434 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R685485
25.0%
D684766
25.0%
U683900
25.0%
L683434
25.0%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
L
692256 
R
688978 
U
679642 
D
676709 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
L692256
25.3%
R688978
25.2%
U679642
24.8%
D676709
24.7%

response_time
Real number (ℝ≥0)

Distinct4443
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean958.9477788
Minimum200
Maximum5000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:29.443458image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile642
Q1757
median870
Q31054
95-th percentile1575
Maximum5000
Range4800
Interquartile range (IQR)297

Descriptive statistics

Standard deviation340.5888668
Coefficient of variation (CV)0.3551693578
Kurtosis15.06450001
Mean958.9477788
Median Absolute Deviation (MAD)135
Skewness2.912185805
Sum2625201055
Variance116000.7762
MonotocityNot monotonic
2021-02-26T00:47:29.564760image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
75018501
 
0.7%
78015313
 
0.6%
79013160
 
0.5%
82012968
 
0.5%
81012879
 
0.5%
86012591
 
0.5%
85012439
 
0.5%
89012351
 
0.5%
84011955
 
0.4%
77011917
 
0.4%
Other values (4433)2603511
95.1%
ValueCountFrequency (%)
20058
< 0.1%
20113
 
< 0.1%
20212
 
< 0.1%
20346
< 0.1%
20419
 
< 0.1%
ValueCountFrequency (%)
50001
 
< 0.1%
49981
 
< 0.1%
49951
 
< 0.1%
49934
< 0.1%
49911
 
< 0.1%

trial_num
Real number (ℝ≥0)

Distinct107
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.44657682
Minimum1
Maximum107
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:29.686306image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q116
median31
Q348
95-th percentile67
Maximum107
Range106
Interquartile range (IQR)32

Descriptive statistics

Standard deviation19.85221324
Coefficient of variation (CV)0.6118430722
Kurtosis-0.8569413615
Mean32.44657682
Median Absolute Deviation (MAD)16
Skewness0.2920395952
Sum88825262
Variance394.1103707
MonotocityNot monotonic
2021-02-26T00:47:29.808398image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
645141
 
1.6%
1345133
 
1.6%
1445127
 
1.6%
945124
 
1.6%
1545124
 
1.6%
1245114
 
1.6%
745108
 
1.6%
1945102
 
1.6%
145099
 
1.6%
845093
 
1.6%
Other values (97)2286420
83.5%
ValueCountFrequency (%)
145099
1.6%
245001
1.6%
345089
1.6%
445070
1.6%
545047
1.6%
ValueCountFrequency (%)
1071
< 0.1%
1061
< 0.1%
1051
< 0.1%
1041
< 0.1%
1031
< 0.1%

trial_type
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
P
1429403 
M
1308182 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowP
2nd rowP
3rd rowP
4th rowP
5th rowP
ValueCountFrequency (%)
P1429403
52.2%
M1308182
47.8%

user_id
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean114134.7336
Minimum183
Maximum226123
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:30.023628image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum183
5-th percentile9383
Q157105
median117998
Q3170462
95-th percentile216742
Maximum226123
Range225940
Interquartile range (IQR)113357

Descriptive statistics

Standard deviation66193.64372
Coefficient of variation (CV)0.5799605573
Kurtosis-1.198677487
Mean114134.7336
Median Absolute Deviation (MAD)56492
Skewness-0.03730791676
Sum3.124535348 × 1011
Variance4381598470
MonotocityIncreasing
2021-02-26T00:47:30.134610image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2059075227
 
0.2%
1308805157
 
0.2%
393254912
 
0.2%
1873344737
 
0.2%
680954623
 
0.2%
415634509
 
0.2%
75084495
 
0.2%
587954471
 
0.2%
1662844352
 
0.2%
673604337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
1833424
0.1%
7201208
 
< 0.1%
9431985
0.1%
11712851
0.1%
14093731
0.1%
ValueCountFrequency (%)
2261232964
0.1%
2258782507
0.1%
2255592785
0.1%
2254572247
0.1%
2253552345
0.1%

accuracy
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
2620419 
0
 
117166

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
12620419
95.7%
0117166
 
4.3%

uid
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean503.3857889
Minimum1
Maximum1000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:30.341874image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48
Q1258
median505
Q3752
95-th percentile951
Maximum1000
Range999
Interquartile range (IQR)494

Descriptive statistics

Standard deviation287.8709563
Coefficient of variation (CV)0.5718694542
Kurtosis-1.191228998
Mean503.3857889
Median Absolute Deviation (MAD)247
Skewness-0.01357732368
Sum1378061385
Variance82869.68751
MonotocityIncreasing
2021-02-26T00:47:30.461416image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9005227
 
0.2%
5735157
 
0.2%
1854912
 
0.2%
8224737
 
0.2%
3034623
 
0.2%
1994509
 
0.2%
364495
 
0.2%
2674471
 
0.2%
7384352
 
0.2%
3004337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
13424
0.1%
21208
 
< 0.1%
31985
0.1%
42851
0.1%
53731
0.1%
ValueCountFrequency (%)
10002964
0.1%
9992507
0.1%
9982785
0.1%
9972247
0.1%
9962345
0.1%

compatible
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
1389795 
0
1347790 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
11389795
50.8%
01347790
49.2%

gamecount
Real number (ℝ≥0)

Distinct60
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.6504262
Minimum1
Maximum60
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.343040image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q117
median32
Q346
95-th percentile58
Maximum60
Range59
Interquartile range (IQR)29

Descriptive statistics

Standard deviation17.01935919
Coefficient of variation (CV)0.5377292263
Kurtosis-1.174470311
Mean31.6504262
Median Absolute Deviation (MAD)15
Skewness-0.06156084448
Sum86645732
Variance289.6585873
MonotocityNot monotonic
2021-02-26T00:47:31.457216image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5249656
 
1.8%
5349601
 
1.8%
5749574
 
1.8%
5649368
 
1.8%
4648970
 
1.8%
5148902
 
1.8%
3348600
 
1.8%
5548575
 
1.8%
5848566
 
1.8%
4248463
 
1.8%
Other values (50)2247310
82.1%
ValueCountFrequency (%)
131446
1.1%
233462
1.2%
337013
1.4%
438437
1.4%
539990
1.5%
ValueCountFrequency (%)
6048172
1.8%
5948289
1.8%
5848566
1.8%
5749574
1.8%
5649368
1.8%

totalcount
Real number (ℝ≥0)

Distinct304
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean215.4372357
Minimum100
Maximum2976
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.576919image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum100
5-th percentile104
Q1123
median159
Q3235
95-th percentile470
Maximum2976
Range2876
Interquartile range (IQR)112

Descriptive statistics

Standard deviation190.3430738
Coefficient of variation (CV)0.8835198483
Kurtosis65.73041639
Mean215.4372357
Median Absolute Deviation (MAD)44
Skewness6.364027397
Sum589777745
Variance36230.48576
MonotocityNot monotonic
2021-02-26T00:47:31.692428image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10345147
 
1.6%
11443535
 
1.6%
11641625
 
1.5%
10740849
 
1.5%
10639642
 
1.4%
12338340
 
1.4%
13035801
 
1.3%
12635543
 
1.3%
10433174
 
1.2%
10932868
 
1.2%
Other values (294)2351061
85.9%
ValueCountFrequency (%)
10024692
0.9%
10130732
1.1%
10223750
0.9%
10345147
1.6%
10433174
1.2%
ValueCountFrequency (%)
29762214
0.1%
24372148
0.1%
22952335
0.1%
13032704
0.1%
11433027
0.1%

agebin
Real number (ℝ≥0)

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.291678615
Minimum2
Maximum7
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.792415image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q13
median4
Q36
95-th percentile7
Maximum7
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.683548655
Coefficient of variation (CV)0.3922820896
Kurtosis-1.218957282
Mean4.291678615
Median Absolute Deviation (MAD)1
Skewness0.1484660005
Sum11748835
Variance2.834336073
MonotocityNot monotonic
2021-02-26T00:47:31.873626image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2530345
19.4%
3504662
18.4%
4477557
17.4%
5448493
16.4%
6414230
15.1%
7362298
13.2%
ValueCountFrequency (%)
2530345
19.4%
3504662
18.4%
4477557
17.4%
5448493
16.4%
6414230
15.1%
ValueCountFrequency (%)
7362298
13.2%
6414230
15.1%
5448493
16.4%
4477557
17.4%
3504662
18.4%

trialtypecount
Real number (ℝ≥0)

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.684821111
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.955642image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum18
Range17
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.34778765
Coefficient of variation (CV)0.6371510527
Kurtosis0.5219900274
Mean3.684821111
Median Absolute Deviation (MAD)2
Skewness0.9242221618
Sum10087511
Variance5.512106848
MonotocityNot monotonic
2021-02-26T00:47:32.038607image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
6217435
7.9%
7149572
 
5.5%
895438
 
3.5%
956438
 
2.1%
1030498
 
1.1%
Other values (8)25150
 
0.9%
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
ValueCountFrequency (%)
182
 
< 0.1%
1714
 
< 0.1%
1662
 
< 0.1%
15236
 
< 0.1%
14856
< 0.1%

rtsum
Real number (ℝ≥0)

Distinct18605
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3559.939113
Minimum200
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.151188image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile822
Q11681
median2997
Q34808
95-th percentile8265
Maximum36940
Range36740
Interquartile range (IQR)3127

Descriptive statistics

Standard deviation2482.743047
Coefficient of variation (CV)0.6974116603
Kurtosis4.057938333
Mean3559.939113
Median Absolute Deviation (MAD)1474
Skewness1.530360499
Sum9745635916
Variance6164013.038
MonotocityNot monotonic
2021-02-26T00:47:32.264292image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10002451
 
0.1%
8902263
 
0.1%
8602153
 
0.1%
8802109
 
0.1%
9102100
 
0.1%
9202093
 
0.1%
9501997
 
0.1%
8501964
 
0.1%
7501925
 
0.1%
9801923
 
0.1%
Other values (18595)2716607
99.2%
ValueCountFrequency (%)
20058
< 0.1%
20113
 
< 0.1%
20212
 
< 0.1%
20346
< 0.1%
20419
 
< 0.1%
ValueCountFrequency (%)
369401
< 0.1%
365632
< 0.1%
351652
< 0.1%
339701
< 0.1%
338761
< 0.1%

runlength
Real number (ℝ≥0)

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.369214472
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.367379image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q14
median6
Q38
95-th percentile11
Maximum18
Range17
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.577218232
Coefficient of variation (CV)0.4046367482
Kurtosis-0.1976977492
Mean6.369214472
Median Absolute Deviation (MAD)2
Skewness0.3229929995
Sum17436266
Variance6.642053817
MonotocityNot monotonic
2021-02-26T00:47:32.454938image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
6406575
14.9%
5387264
14.1%
7379647
13.9%
4320426
11.7%
8311379
11.4%
9233452
8.5%
3218517
8.0%
10156953
 
5.7%
2115100
 
4.2%
1189827
 
3.3%
Other values (8)118445
 
4.3%
ValueCountFrequency (%)
135205
 
1.3%
2115100
 
4.2%
3218517
8.0%
4320426
11.7%
5387264
14.1%
ValueCountFrequency (%)
1836
 
< 0.1%
17194
 
< 0.1%
16742
 
< 0.1%
152602
 
0.1%
148825
0.3%

maxrtblock
Real number (ℝ≥0)

Distinct16981
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6062.76754
Minimum200
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.571465image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile2140
Q13991
median5640
Q37596
95-th percentile11420
Maximum36940
Range36740
Interquartile range (IQR)3605

Descriptive statistics

Standard deviation2967.265276
Coefficient of variation (CV)0.4894242203
Kurtosis3.422175822
Mean6062.76754
Median Absolute Deviation (MAD)1778
Skewness1.243493494
Sum1.659734148 × 1010
Variance8804663.219
MonotocityNot monotonic
2021-02-26T00:47:32.686684image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4860938
 
< 0.1%
5010902
 
< 0.1%
4110891
 
< 0.1%
4500884
 
< 0.1%
5250875
 
< 0.1%
4720852
 
< 0.1%
4640827
 
< 0.1%
5030820
 
< 0.1%
4470820
 
< 0.1%
4890819
 
< 0.1%
Other values (16971)2728957
99.7%
ValueCountFrequency (%)
2004
< 0.1%
2011
 
< 0.1%
2021
 
< 0.1%
2033
< 0.1%
2051
 
< 0.1%
ValueCountFrequency (%)
3694014
< 0.1%
3656324
< 0.1%
3397014
< 0.1%
3387614
< 0.1%
3386114
< 0.1%

runlengthprev
Real number (ℝ≥0)

MISSING

Distinct18
Distinct (%)< 0.1%
Missing238072
Missing (%)8.7%
Infinite0
Infinite (%)0.0%
Mean5.174957682
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.793060image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q37
95-th percentile10
Maximum18
Range17
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.553763578
Coefficient of variation (CV)0.49348492
Kurtosis-0.1032082987
Mean5.174957682
Median Absolute Deviation (MAD)2
Skewness0.5046729018
Sum12934874
Variance6.52170841
MonotocityNot monotonic
2021-02-26T00:47:32.879531image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
4372882
13.6%
5365340
13.3%
3331518
12.1%
6322462
11.8%
7260088
9.5%
2251188
9.2%
8187111
6.8%
1135396
 
4.9%
9125513
 
4.6%
1075827
 
2.8%
Other values (8)72188
 
2.6%
(Missing)238072
8.7%
ValueCountFrequency (%)
1135396
 
4.9%
2251188
9.2%
3331518
12.1%
4372882
13.6%
5365340
13.3%
ValueCountFrequency (%)
186
 
< 0.1%
1755
 
< 0.1%
16234
 
< 0.1%
15808
 
< 0.1%
143070
0.1%

rtsumprev
Real number (ℝ≥0)

Distinct16779
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4493.714501
Minimum20
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.998234image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile20
Q12426
median4180
Q36169
95-th percentile9794
Maximum36940
Range36920
Interquartile range (IQR)3743

Descriptive statistics

Standard deviation2994.103889
Coefficient of variation (CV)0.6662870747
Kurtosis2.241926806
Mean4493.714501
Median Absolute Deviation (MAD)1855
Skewness0.9604178463
Sum1.230192541 × 1010
Variance8964658.099
MonotocityNot monotonic
2021-02-26T00:47:33.116630image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20238072
 
8.7%
2640923
 
< 0.1%
3610887
 
< 0.1%
4110874
 
< 0.1%
3860863
 
< 0.1%
3250844
 
< 0.1%
3940839
 
< 0.1%
2990834
 
< 0.1%
3830833
 
< 0.1%
4860828
 
< 0.1%
Other values (16769)2491788
91.0%
ValueCountFrequency (%)
20238072
8.7%
20021
 
< 0.1%
2015
 
< 0.1%
2025
 
< 0.1%
20315
 
< 0.1%
ValueCountFrequency (%)
369407
< 0.1%
365636
< 0.1%
339701
 
< 0.1%
338766
< 0.1%
338612
 
< 0.1%

isswitch
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
0
2242195 
1
495390 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
02242195
81.9%
1495390
 
18.1%

movementd
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
685479 
4
684674 
3
684179 
1
683253 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2685479
25.0%
4684674
25.0%
3684179
25.0%
1683253
25.0%

pointingd
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
685485 
4
684766 
3
683900 
1
683434 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2685485
25.0%
4684766
25.0%
3683900
25.0%
1683434
25.0%

task
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
1429403 
1
1308182 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2
ValueCountFrequency (%)
21429403
52.2%
11308182
47.8%

choice
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
692256 
2
688978 
3
679642 
4
676709 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
1692256
25.3%
2688978
25.2%
3679642
24.8%
4676709
24.7%

rlprev
Real number (ℝ≥0)

Distinct19
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.464206226
Minimum1
Maximum20
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:33.879759image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q13
median5
Q38
95-th percentile20
Maximum20
Range19
Interquartile range (IQR)5

Descriptive statistics

Standard deviation4.837929472
Coefficient of variation (CV)0.748418182
Kurtosis2.575020147
Mean6.464206226
Median Absolute Deviation (MAD)2
Skewness1.751748867
Sum17696314
Variance23.40556157
MonotocityNot monotonic
2021-02-26T00:47:33.972148image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
4372882
13.6%
5365340
13.3%
3331518
12.1%
6322462
11.8%
7260088
9.5%
2251188
9.2%
20238072
8.7%
8187111
6.8%
1135396
 
4.9%
9125513
 
4.6%
Other values (9)148015
 
5.4%
ValueCountFrequency (%)
1135396
 
4.9%
2251188
9.2%
3331518
12.1%
4372882
13.6%
5365340
13.3%
ValueCountFrequency (%)
20238072
8.7%
186
 
< 0.1%
1755
 
< 0.1%
16234
 
< 0.1%
15808
 
< 0.1%

newuid
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean503.3857889
Minimum1
Maximum1000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:34.088637image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48
Q1258
median505
Q3752
95-th percentile951
Maximum1000
Range999
Interquartile range (IQR)494

Descriptive statistics

Standard deviation287.8709563
Coefficient of variation (CV)0.5718694542
Kurtosis-1.191228998
Mean503.3857889
Median Absolute Deviation (MAD)247
Skewness-0.01357732368
Sum1378061385
Variance82869.68751
MonotocityIncreasing
2021-02-26T00:47:34.208259image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9005227
 
0.2%
5735157
 
0.2%
1854912
 
0.2%
8224737
 
0.2%
3034623
 
0.2%
1994509
 
0.2%
364495
 
0.2%
2674471
 
0.2%
7384352
 
0.2%
3004337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
13424
0.1%
21208
 
< 0.1%
31985
0.1%
42851
0.1%
53731
0.1%
ValueCountFrequency (%)
10002964
0.1%
9992507
0.1%
9982785
0.1%
9972247
0.1%
9962345
0.1%

trialtypecount2
Real number (ℝ≥0)

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.608637175
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:34.308116image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum8
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.157646622
Coefficient of variation (CV)0.597911765
Kurtosis-0.7473274996
Mean3.608637175
Median Absolute Deviation (MAD)2
Skewness0.5600049944
Sum9878951
Variance4.655438947
MonotocityNot monotonic
2021-02-26T00:47:34.399523image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
6217435
7.9%
8207524
 
7.6%
7149572
 
5.5%
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
ValueCountFrequency (%)
8207524
7.6%
7149572
 
5.5%
6217435
7.9%
5294700
10.8%
4374701
13.7%
\ No newline at end of file diff --git a/stayvers2019/all_users_eda.html b/stayvers2019/all_users_eda.html new file mode 100644 index 0000000..be28617 --- /dev/null +++ b/stayvers2019/all_users_eda.html @@ -0,0 +1,15462 @@ +Lumocity All Users EDA Report

Overview

Dataset statistics

Number of variables30
Number of observations2737585
Missing cells238072
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory626.6 MiB
Average record size in memory240.0 B

Variable types

Numeric18
Boolean1
Categorical11

Warnings

runlengthprev has 238072 (8.7%) missing values Missing

Reproduction

Analysis started2021-02-25 23:46:00.622032
Analysis finished2021-02-25 23:47:27.632007
Duration1 minute and 27.01 seconds
Software versionpandas-profiling v2.10.0
Download configurationconfig.yaml

Variables

df_index
Real number (ℝ≥0)

Distinct750114
Distinct (%)27.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean349316.8219
Minimum0
Maximum750113
Zeros4
Zeros (%)< 0.1%
Memory size20.9 MiB
2021-02-26T00:47:28.477006image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile34219.2
Q1171099
median342198
Q3513297
95-th percentile695667.8
Maximum750113
Range750113
Interquartile range (IQR)342198

Descriptive statistics

Standard deviation208132.9105
Coefficient of variation (CV)0.5958284785
Kurtosis-1.101560579
Mean349316.8219
Median Absolute Deviation (MAD)171099
Skewness0.1226952489
Sum9.562844919 × 1011
Variance4.331930844 × 1010
MonotocityNot monotonic
2021-02-26T00:47:28.758862image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20474
 
< 0.1%
908864
 
< 0.1%
949804
 
< 0.1%
847394
 
< 0.1%
826904
 
< 0.1%
888334
 
< 0.1%
867844
 
< 0.1%
4392954
 
< 0.1%
4413424
 
< 0.1%
4351974
 
< 0.1%
Other values (750104)2737545
> 99.9%
ValueCountFrequency (%)
04
< 0.1%
14
< 0.1%
24
< 0.1%
34
< 0.1%
44
< 0.1%
ValueCountFrequency (%)
7501131
< 0.1%
7501121
< 0.1%
7501111
< 0.1%
7501101
< 0.1%
7501091
< 0.1%

correct
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
True
2620419 
False
 
117166
ValueCountFrequency (%)
True2620419
95.7%
False117166
 
4.3%

game_result_id
Real number (ℝ≥0)

Distinct46470
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31632.99275
Minimum1
Maximum76473
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:28.902801image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2712
Q113005
median28517
Q349148
95-th percentile69293
Maximum76473
Range76472
Interquartile range (IQR)36143

Descriptive statistics

Standard deviation21377.80202
Coefficient of variation (CV)0.6758071292
Kurtosis-1.061939459
Mean31632.99275
Median Absolute Deviation (MAD)17561
Skewness0.3422095459
Sum8.659800647 × 1010
Variance457010419.2
MonotocityNot monotonic
2021-02-26T00:47:29.023391image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1212279
 
< 0.1%
1207272
 
< 0.1%
1210268
 
< 0.1%
1209257
 
< 0.1%
1222249
 
< 0.1%
1257249
 
< 0.1%
1223245
 
< 0.1%
1250243
 
< 0.1%
1249242
 
< 0.1%
1233221
 
< 0.1%
Other values (46460)2735060
99.9%
ValueCountFrequency (%)
1121
< 0.1%
2114
< 0.1%
397
< 0.1%
493
< 0.1%
5110
< 0.1%
ValueCountFrequency (%)
7647371
< 0.1%
7647265
< 0.1%
7647063
< 0.1%
7645564
< 0.1%
7645164
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
R
685479 
D
684674 
U
684179 
L
683253 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R685479
25.0%
D684674
25.0%
U684179
25.0%
L683253
25.0%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
R
685485 
D
684766 
U
683900 
L
683434 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R685485
25.0%
D684766
25.0%
U683900
25.0%
L683434
25.0%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
L
692256 
R
688978 
U
679642 
D
676709 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
L692256
25.3%
R688978
25.2%
U679642
24.8%
D676709
24.7%

response_time
Real number (ℝ≥0)

Distinct4443
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean958.9477788
Minimum200
Maximum5000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:29.443458image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile642
Q1757
median870
Q31054
95-th percentile1575
Maximum5000
Range4800
Interquartile range (IQR)297

Descriptive statistics

Standard deviation340.5888668
Coefficient of variation (CV)0.3551693578
Kurtosis15.06450001
Mean958.9477788
Median Absolute Deviation (MAD)135
Skewness2.912185805
Sum2625201055
Variance116000.7762
MonotocityNot monotonic
2021-02-26T00:47:29.564760image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
75018501
 
0.7%
78015313
 
0.6%
79013160
 
0.5%
82012968
 
0.5%
81012879
 
0.5%
86012591
 
0.5%
85012439
 
0.5%
89012351
 
0.5%
84011955
 
0.4%
77011917
 
0.4%
Other values (4433)2603511
95.1%
ValueCountFrequency (%)
20058
< 0.1%
20113
 
< 0.1%
20212
 
< 0.1%
20346
< 0.1%
20419
 
< 0.1%
ValueCountFrequency (%)
50001
 
< 0.1%
49981
 
< 0.1%
49951
 
< 0.1%
49934
< 0.1%
49911
 
< 0.1%

trial_num
Real number (ℝ≥0)

Distinct107
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.44657682
Minimum1
Maximum107
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:29.686306image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q116
median31
Q348
95-th percentile67
Maximum107
Range106
Interquartile range (IQR)32

Descriptive statistics

Standard deviation19.85221324
Coefficient of variation (CV)0.6118430722
Kurtosis-0.8569413615
Mean32.44657682
Median Absolute Deviation (MAD)16
Skewness0.2920395952
Sum88825262
Variance394.1103707
MonotocityNot monotonic
2021-02-26T00:47:29.808398image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
645141
 
1.6%
1345133
 
1.6%
1445127
 
1.6%
945124
 
1.6%
1545124
 
1.6%
1245114
 
1.6%
745108
 
1.6%
1945102
 
1.6%
145099
 
1.6%
845093
 
1.6%
Other values (97)2286420
83.5%
ValueCountFrequency (%)
145099
1.6%
245001
1.6%
345089
1.6%
445070
1.6%
545047
1.6%
ValueCountFrequency (%)
1071
< 0.1%
1061
< 0.1%
1051
< 0.1%
1041
< 0.1%
1031
< 0.1%

trial_type
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
P
1429403 
M
1308182 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowP
2nd rowP
3rd rowP
4th rowP
5th rowP
ValueCountFrequency (%)
P1429403
52.2%
M1308182
47.8%

user_id
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean114134.7336
Minimum183
Maximum226123
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:30.023628image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum183
5-th percentile9383
Q157105
median117998
Q3170462
95-th percentile216742
Maximum226123
Range225940
Interquartile range (IQR)113357

Descriptive statistics

Standard deviation66193.64372
Coefficient of variation (CV)0.5799605573
Kurtosis-1.198677487
Mean114134.7336
Median Absolute Deviation (MAD)56492
Skewness-0.03730791676
Sum3.124535348 × 1011
Variance4381598470
MonotocityIncreasing
2021-02-26T00:47:30.134610image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2059075227
 
0.2%
1308805157
 
0.2%
393254912
 
0.2%
1873344737
 
0.2%
680954623
 
0.2%
415634509
 
0.2%
75084495
 
0.2%
587954471
 
0.2%
1662844352
 
0.2%
673604337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
1833424
0.1%
7201208
 
< 0.1%
9431985
0.1%
11712851
0.1%
14093731
0.1%
ValueCountFrequency (%)
2261232964
0.1%
2258782507
0.1%
2255592785
0.1%
2254572247
0.1%
2253552345
0.1%

accuracy
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
2620419 
0
 
117166

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
12620419
95.7%
0117166
 
4.3%

uid
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean503.3857889
Minimum1
Maximum1000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:30.341874image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48
Q1258
median505
Q3752
95-th percentile951
Maximum1000
Range999
Interquartile range (IQR)494

Descriptive statistics

Standard deviation287.8709563
Coefficient of variation (CV)0.5718694542
Kurtosis-1.191228998
Mean503.3857889
Median Absolute Deviation (MAD)247
Skewness-0.01357732368
Sum1378061385
Variance82869.68751
MonotocityIncreasing
2021-02-26T00:47:30.461416image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9005227
 
0.2%
5735157
 
0.2%
1854912
 
0.2%
8224737
 
0.2%
3034623
 
0.2%
1994509
 
0.2%
364495
 
0.2%
2674471
 
0.2%
7384352
 
0.2%
3004337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
13424
0.1%
21208
 
< 0.1%
31985
0.1%
42851
0.1%
53731
0.1%
ValueCountFrequency (%)
10002964
0.1%
9992507
0.1%
9982785
0.1%
9972247
0.1%
9962345
0.1%

compatible
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
1389795 
0
1347790 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
11389795
50.8%
01347790
49.2%

gamecount
Real number (ℝ≥0)

Distinct60
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.6504262
Minimum1
Maximum60
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.343040image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q117
median32
Q346
95-th percentile58
Maximum60
Range59
Interquartile range (IQR)29

Descriptive statistics

Standard deviation17.01935919
Coefficient of variation (CV)0.5377292263
Kurtosis-1.174470311
Mean31.6504262
Median Absolute Deviation (MAD)15
Skewness-0.06156084448
Sum86645732
Variance289.6585873
MonotocityNot monotonic
2021-02-26T00:47:31.457216image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5249656
 
1.8%
5349601
 
1.8%
5749574
 
1.8%
5649368
 
1.8%
4648970
 
1.8%
5148902
 
1.8%
3348600
 
1.8%
5548575
 
1.8%
5848566
 
1.8%
4248463
 
1.8%
Other values (50)2247310
82.1%
ValueCountFrequency (%)
131446
1.1%
233462
1.2%
337013
1.4%
438437
1.4%
539990
1.5%
ValueCountFrequency (%)
6048172
1.8%
5948289
1.8%
5848566
1.8%
5749574
1.8%
5649368
1.8%

totalcount
Real number (ℝ≥0)

Distinct304
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean215.4372357
Minimum100
Maximum2976
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.576919image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum100
5-th percentile104
Q1123
median159
Q3235
95-th percentile470
Maximum2976
Range2876
Interquartile range (IQR)112

Descriptive statistics

Standard deviation190.3430738
Coefficient of variation (CV)0.8835198483
Kurtosis65.73041639
Mean215.4372357
Median Absolute Deviation (MAD)44
Skewness6.364027397
Sum589777745
Variance36230.48576
MonotocityNot monotonic
2021-02-26T00:47:31.692428image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10345147
 
1.6%
11443535
 
1.6%
11641625
 
1.5%
10740849
 
1.5%
10639642
 
1.4%
12338340
 
1.4%
13035801
 
1.3%
12635543
 
1.3%
10433174
 
1.2%
10932868
 
1.2%
Other values (294)2351061
85.9%
ValueCountFrequency (%)
10024692
0.9%
10130732
1.1%
10223750
0.9%
10345147
1.6%
10433174
1.2%
ValueCountFrequency (%)
29762214
0.1%
24372148
0.1%
22952335
0.1%
13032704
0.1%
11433027
0.1%

agebin
Real number (ℝ≥0)

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.291678615
Minimum2
Maximum7
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.792415image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q13
median4
Q36
95-th percentile7
Maximum7
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.683548655
Coefficient of variation (CV)0.3922820896
Kurtosis-1.218957282
Mean4.291678615
Median Absolute Deviation (MAD)1
Skewness0.1484660005
Sum11748835
Variance2.834336073
MonotocityNot monotonic
2021-02-26T00:47:31.873626image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2530345
19.4%
3504662
18.4%
4477557
17.4%
5448493
16.4%
6414230
15.1%
7362298
13.2%
ValueCountFrequency (%)
2530345
19.4%
3504662
18.4%
4477557
17.4%
5448493
16.4%
6414230
15.1%
ValueCountFrequency (%)
7362298
13.2%
6414230
15.1%
5448493
16.4%
4477557
17.4%
3504662
18.4%

trialtypecount
Real number (ℝ≥0)

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.684821111
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.955642image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum18
Range17
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.34778765
Coefficient of variation (CV)0.6371510527
Kurtosis0.5219900274
Mean3.684821111
Median Absolute Deviation (MAD)2
Skewness0.9242221618
Sum10087511
Variance5.512106848
MonotocityNot monotonic
2021-02-26T00:47:32.038607image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
6217435
7.9%
7149572
 
5.5%
895438
 
3.5%
956438
 
2.1%
1030498
 
1.1%
Other values (8)25150
 
0.9%
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
ValueCountFrequency (%)
182
 
< 0.1%
1714
 
< 0.1%
1662
 
< 0.1%
15236
 
< 0.1%
14856
< 0.1%

rtsum
Real number (ℝ≥0)

Distinct18605
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3559.939113
Minimum200
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.151188image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile822
Q11681
median2997
Q34808
95-th percentile8265
Maximum36940
Range36740
Interquartile range (IQR)3127

Descriptive statistics

Standard deviation2482.743047
Coefficient of variation (CV)0.6974116603
Kurtosis4.057938333
Mean3559.939113
Median Absolute Deviation (MAD)1474
Skewness1.530360499
Sum9745635916
Variance6164013.038
MonotocityNot monotonic
2021-02-26T00:47:32.264292image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10002451
 
0.1%
8902263
 
0.1%
8602153
 
0.1%
8802109
 
0.1%
9102100
 
0.1%
9202093
 
0.1%
9501997
 
0.1%
8501964
 
0.1%
7501925
 
0.1%
9801923
 
0.1%
Other values (18595)2716607
99.2%
ValueCountFrequency (%)
20058
< 0.1%
20113
 
< 0.1%
20212
 
< 0.1%
20346
< 0.1%
20419
 
< 0.1%
ValueCountFrequency (%)
369401
< 0.1%
365632
< 0.1%
351652
< 0.1%
339701
< 0.1%
338761
< 0.1%

runlength
Real number (ℝ≥0)

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.369214472
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.367379image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q14
median6
Q38
95-th percentile11
Maximum18
Range17
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.577218232
Coefficient of variation (CV)0.4046367482
Kurtosis-0.1976977492
Mean6.369214472
Median Absolute Deviation (MAD)2
Skewness0.3229929995
Sum17436266
Variance6.642053817
MonotocityNot monotonic
2021-02-26T00:47:32.454938image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
6406575
14.9%
5387264
14.1%
7379647
13.9%
4320426
11.7%
8311379
11.4%
9233452
8.5%
3218517
8.0%
10156953
 
5.7%
2115100
 
4.2%
1189827
 
3.3%
Other values (8)118445
 
4.3%
ValueCountFrequency (%)
135205
 
1.3%
2115100
 
4.2%
3218517
8.0%
4320426
11.7%
5387264
14.1%
ValueCountFrequency (%)
1836
 
< 0.1%
17194
 
< 0.1%
16742
 
< 0.1%
152602
 
0.1%
148825
0.3%

maxrtblock
Real number (ℝ≥0)

Distinct16981
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6062.76754
Minimum200
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.571465image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile2140
Q13991
median5640
Q37596
95-th percentile11420
Maximum36940
Range36740
Interquartile range (IQR)3605

Descriptive statistics

Standard deviation2967.265276
Coefficient of variation (CV)0.4894242203
Kurtosis3.422175822
Mean6062.76754
Median Absolute Deviation (MAD)1778
Skewness1.243493494
Sum1.659734148 × 1010
Variance8804663.219
MonotocityNot monotonic
2021-02-26T00:47:32.686684image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4860938
 
< 0.1%
5010902
 
< 0.1%
4110891
 
< 0.1%
4500884
 
< 0.1%
5250875
 
< 0.1%
4720852
 
< 0.1%
4640827
 
< 0.1%
5030820
 
< 0.1%
4470820
 
< 0.1%
4890819
 
< 0.1%
Other values (16971)2728957
99.7%
ValueCountFrequency (%)
2004
< 0.1%
2011
 
< 0.1%
2021
 
< 0.1%
2033
< 0.1%
2051
 
< 0.1%
ValueCountFrequency (%)
3694014
< 0.1%
3656324
< 0.1%
3397014
< 0.1%
3387614
< 0.1%
3386114
< 0.1%

runlengthprev
Real number (ℝ≥0)

MISSING

Distinct18
Distinct (%)< 0.1%
Missing238072
Missing (%)8.7%
Infinite0
Infinite (%)0.0%
Mean5.174957682
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.793060image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q37
95-th percentile10
Maximum18
Range17
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.553763578
Coefficient of variation (CV)0.49348492
Kurtosis-0.1032082987
Mean5.174957682
Median Absolute Deviation (MAD)2
Skewness0.5046729018
Sum12934874
Variance6.52170841
MonotocityNot monotonic
2021-02-26T00:47:32.879531image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
4372882
13.6%
5365340
13.3%
3331518
12.1%
6322462
11.8%
7260088
9.5%
2251188
9.2%
8187111
6.8%
1135396
 
4.9%
9125513
 
4.6%
1075827
 
2.8%
Other values (8)72188
 
2.6%
(Missing)238072
8.7%
ValueCountFrequency (%)
1135396
 
4.9%
2251188
9.2%
3331518
12.1%
4372882
13.6%
5365340
13.3%
ValueCountFrequency (%)
186
 
< 0.1%
1755
 
< 0.1%
16234
 
< 0.1%
15808
 
< 0.1%
143070
0.1%

rtsumprev
Real number (ℝ≥0)

Distinct16779
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4493.714501
Minimum20
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.998234image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile20
Q12426
median4180
Q36169
95-th percentile9794
Maximum36940
Range36920
Interquartile range (IQR)3743

Descriptive statistics

Standard deviation2994.103889
Coefficient of variation (CV)0.6662870747
Kurtosis2.241926806
Mean4493.714501
Median Absolute Deviation (MAD)1855
Skewness0.9604178463
Sum1.230192541 × 1010
Variance8964658.099
MonotocityNot monotonic
2021-02-26T00:47:33.116630image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20238072
 
8.7%
2640923
 
< 0.1%
3610887
 
< 0.1%
4110874
 
< 0.1%
3860863
 
< 0.1%
3250844
 
< 0.1%
3940839
 
< 0.1%
2990834
 
< 0.1%
3830833
 
< 0.1%
4860828
 
< 0.1%
Other values (16769)2491788
91.0%
ValueCountFrequency (%)
20238072
8.7%
20021
 
< 0.1%
2015
 
< 0.1%
2025
 
< 0.1%
20315
 
< 0.1%
ValueCountFrequency (%)
369407
< 0.1%
365636
< 0.1%
339701
 
< 0.1%
338766
< 0.1%
338612
 
< 0.1%

isswitch
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
0
2242195 
1
495390 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
02242195
81.9%
1495390
 
18.1%

movementd
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
685479 
4
684674 
3
684179 
1
683253 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2685479
25.0%
4684674
25.0%
3684179
25.0%
1683253
25.0%

pointingd
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
685485 
4
684766 
3
683900 
1
683434 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2685485
25.0%
4684766
25.0%
3683900
25.0%
1683434
25.0%

task
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
1429403 
1
1308182 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2
ValueCountFrequency (%)
21429403
52.2%
11308182
47.8%

choice
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
692256 
2
688978 
3
679642 
4
676709 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
1692256
25.3%
2688978
25.2%
3679642
24.8%
4676709
24.7%

rlprev
Real number (ℝ≥0)

Distinct19
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.464206226
Minimum1
Maximum20
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:33.879759image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q13
median5
Q38
95-th percentile20
Maximum20
Range19
Interquartile range (IQR)5

Descriptive statistics

Standard deviation4.837929472
Coefficient of variation (CV)0.748418182
Kurtosis2.575020147
Mean6.464206226
Median Absolute Deviation (MAD)2
Skewness1.751748867
Sum17696314
Variance23.40556157
MonotocityNot monotonic
2021-02-26T00:47:33.972148image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
4372882
13.6%
5365340
13.3%
3331518
12.1%
6322462
11.8%
7260088
9.5%
2251188
9.2%
20238072
8.7%
8187111
6.8%
1135396
 
4.9%
9125513
 
4.6%
Other values (9)148015
 
5.4%
ValueCountFrequency (%)
1135396
 
4.9%
2251188
9.2%
3331518
12.1%
4372882
13.6%
5365340
13.3%
ValueCountFrequency (%)
20238072
8.7%
186
 
< 0.1%
1755
 
< 0.1%
16234
 
< 0.1%
15808
 
< 0.1%

newuid
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean503.3857889
Minimum1
Maximum1000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:34.088637image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48
Q1258
median505
Q3752
95-th percentile951
Maximum1000
Range999
Interquartile range (IQR)494

Descriptive statistics

Standard deviation287.8709563
Coefficient of variation (CV)0.5718694542
Kurtosis-1.191228998
Mean503.3857889
Median Absolute Deviation (MAD)247
Skewness-0.01357732368
Sum1378061385
Variance82869.68751
MonotocityIncreasing
2021-02-26T00:47:34.208259image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9005227
 
0.2%
5735157
 
0.2%
1854912
 
0.2%
8224737
 
0.2%
3034623
 
0.2%
1994509
 
0.2%
364495
 
0.2%
2674471
 
0.2%
7384352
 
0.2%
3004337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
13424
0.1%
21208
 
< 0.1%
31985
0.1%
42851
0.1%
53731
0.1%
ValueCountFrequency (%)
10002964
0.1%
9992507
0.1%
9982785
0.1%
9972247
0.1%
9962345
0.1%

trialtypecount2
Real number (ℝ≥0)

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.608637175
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:34.308116image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum8
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.157646622
Coefficient of variation (CV)0.597911765
Kurtosis-0.7473274996
Mean3.608637175
Median Absolute Deviation (MAD)2
Skewness0.5600049944
Sum9878951
Variance4.655438947
MonotocityNot monotonic
2021-02-26T00:47:34.399523image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
6217435
7.9%
8207524
 
7.6%
7149572
 
5.5%
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
ValueCountFrequency (%)
8207524
7.6%
7149572
 
5.5%
6217435
7.9%
5294700
10.8%
4374701
13.7%
\ No newline at end of file diff --git a/stayvers2019/eda.py b/stayvers2019/eda.py new file mode 100644 index 0000000..fe5d954 --- /dev/null +++ b/stayvers2019/eda.py @@ -0,0 +1,39 @@ +# %% + +import dask.dataframe as dd +import pandas as pd + +# gamedata_path = '~/Downloads/gamedata_preprocessed.csv' +# gamedata_path = '~/Downloads/gamedata_original-v1.csv' +gamedata_path = '/Users/morteza/Downloads/data/Sample 1000 individuals/data/data_sim6007.csv' + +# lazy load gamedata +DATA = dd.read_csv(gamedata_path) + +# df.npartitions +DATA.columns +#%% +# number of unique subjects +DATA['user_id'].value_counts().compute() +# 1000 users + +DATA['task'].value_counts().compute() +# task "1" or "2" +#%% + +# dask runs queries in multiple thread; we only want one user_id though +sample_user = DATA.loc[0,'user_id'].compute().iloc[0] + +SAMPLE_USER_DATA = DATA.query('user_id == @sample_user', local_dict={'sample_user': sample_user}).compute() + +#%% +# generate EDA reports + +from pandas_profiling import ProfileReport + +profile = ProfileReport(DATA.compute(), title='Lumocity All Users EDA Report', minimal=True) +profile.to_file("all_users_eda.html") + + +profile = ProfileReport(SAMPLE_USER_DATA, title='Lumocity Sample User EDA Report', explorative=True) +profile.to_file("sample_user_eda.html") diff --git a/stayvers2019/all_users_eda.html b/stayvers2019/all_users_eda.html new file mode 100644 index 0000000..be28617 --- /dev/null +++ b/stayvers2019/all_users_eda.html @@ -0,0 +1,15462 @@ +Lumocity All Users EDA Report

Overview

Dataset statistics

Number of variables30
Number of observations2737585
Missing cells238072
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory626.6 MiB
Average record size in memory240.0 B

Variable types

Numeric18
Boolean1
Categorical11

Warnings

runlengthprev has 238072 (8.7%) missing values Missing

Reproduction

Analysis started2021-02-25 23:46:00.622032
Analysis finished2021-02-25 23:47:27.632007
Duration1 minute and 27.01 seconds
Software versionpandas-profiling v2.10.0
Download configurationconfig.yaml

Variables

df_index
Real number (ℝ≥0)

Distinct750114
Distinct (%)27.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean349316.8219
Minimum0
Maximum750113
Zeros4
Zeros (%)< 0.1%
Memory size20.9 MiB
2021-02-26T00:47:28.477006image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile34219.2
Q1171099
median342198
Q3513297
95-th percentile695667.8
Maximum750113
Range750113
Interquartile range (IQR)342198

Descriptive statistics

Standard deviation208132.9105
Coefficient of variation (CV)0.5958284785
Kurtosis-1.101560579
Mean349316.8219
Median Absolute Deviation (MAD)171099
Skewness0.1226952489
Sum9.562844919 × 1011
Variance4.331930844 × 1010
MonotocityNot monotonic
2021-02-26T00:47:28.758862image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20474
 
< 0.1%
908864
 
< 0.1%
949804
 
< 0.1%
847394
 
< 0.1%
826904
 
< 0.1%
888334
 
< 0.1%
867844
 
< 0.1%
4392954
 
< 0.1%
4413424
 
< 0.1%
4351974
 
< 0.1%
Other values (750104)2737545
> 99.9%
ValueCountFrequency (%)
04
< 0.1%
14
< 0.1%
24
< 0.1%
34
< 0.1%
44
< 0.1%
ValueCountFrequency (%)
7501131
< 0.1%
7501121
< 0.1%
7501111
< 0.1%
7501101
< 0.1%
7501091
< 0.1%

correct
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
True
2620419 
False
 
117166
ValueCountFrequency (%)
True2620419
95.7%
False117166
 
4.3%

game_result_id
Real number (ℝ≥0)

Distinct46470
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31632.99275
Minimum1
Maximum76473
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:28.902801image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2712
Q113005
median28517
Q349148
95-th percentile69293
Maximum76473
Range76472
Interquartile range (IQR)36143

Descriptive statistics

Standard deviation21377.80202
Coefficient of variation (CV)0.6758071292
Kurtosis-1.061939459
Mean31632.99275
Median Absolute Deviation (MAD)17561
Skewness0.3422095459
Sum8.659800647 × 1010
Variance457010419.2
MonotocityNot monotonic
2021-02-26T00:47:29.023391image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1212279
 
< 0.1%
1207272
 
< 0.1%
1210268
 
< 0.1%
1209257
 
< 0.1%
1222249
 
< 0.1%
1257249
 
< 0.1%
1223245
 
< 0.1%
1250243
 
< 0.1%
1249242
 
< 0.1%
1233221
 
< 0.1%
Other values (46460)2735060
99.9%
ValueCountFrequency (%)
1121
< 0.1%
2114
< 0.1%
397
< 0.1%
493
< 0.1%
5110
< 0.1%
ValueCountFrequency (%)
7647371
< 0.1%
7647265
< 0.1%
7647063
< 0.1%
7645564
< 0.1%
7645164
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
R
685479 
D
684674 
U
684179 
L
683253 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R685479
25.0%
D684674
25.0%
U684179
25.0%
L683253
25.0%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
R
685485 
D
684766 
U
683900 
L
683434 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R685485
25.0%
D684766
25.0%
U683900
25.0%
L683434
25.0%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
L
692256 
R
688978 
U
679642 
D
676709 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
L692256
25.3%
R688978
25.2%
U679642
24.8%
D676709
24.7%

response_time
Real number (ℝ≥0)

Distinct4443
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean958.9477788
Minimum200
Maximum5000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:29.443458image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile642
Q1757
median870
Q31054
95-th percentile1575
Maximum5000
Range4800
Interquartile range (IQR)297

Descriptive statistics

Standard deviation340.5888668
Coefficient of variation (CV)0.3551693578
Kurtosis15.06450001
Mean958.9477788
Median Absolute Deviation (MAD)135
Skewness2.912185805
Sum2625201055
Variance116000.7762
MonotocityNot monotonic
2021-02-26T00:47:29.564760image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
75018501
 
0.7%
78015313
 
0.6%
79013160
 
0.5%
82012968
 
0.5%
81012879
 
0.5%
86012591
 
0.5%
85012439
 
0.5%
89012351
 
0.5%
84011955
 
0.4%
77011917
 
0.4%
Other values (4433)2603511
95.1%
ValueCountFrequency (%)
20058
< 0.1%
20113
 
< 0.1%
20212
 
< 0.1%
20346
< 0.1%
20419
 
< 0.1%
ValueCountFrequency (%)
50001
 
< 0.1%
49981
 
< 0.1%
49951
 
< 0.1%
49934
< 0.1%
49911
 
< 0.1%

trial_num
Real number (ℝ≥0)

Distinct107
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.44657682
Minimum1
Maximum107
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:29.686306image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q116
median31
Q348
95-th percentile67
Maximum107
Range106
Interquartile range (IQR)32

Descriptive statistics

Standard deviation19.85221324
Coefficient of variation (CV)0.6118430722
Kurtosis-0.8569413615
Mean32.44657682
Median Absolute Deviation (MAD)16
Skewness0.2920395952
Sum88825262
Variance394.1103707
MonotocityNot monotonic
2021-02-26T00:47:29.808398image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
645141
 
1.6%
1345133
 
1.6%
1445127
 
1.6%
945124
 
1.6%
1545124
 
1.6%
1245114
 
1.6%
745108
 
1.6%
1945102
 
1.6%
145099
 
1.6%
845093
 
1.6%
Other values (97)2286420
83.5%
ValueCountFrequency (%)
145099
1.6%
245001
1.6%
345089
1.6%
445070
1.6%
545047
1.6%
ValueCountFrequency (%)
1071
< 0.1%
1061
< 0.1%
1051
< 0.1%
1041
< 0.1%
1031
< 0.1%

trial_type
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
P
1429403 
M
1308182 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowP
2nd rowP
3rd rowP
4th rowP
5th rowP
ValueCountFrequency (%)
P1429403
52.2%
M1308182
47.8%

user_id
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean114134.7336
Minimum183
Maximum226123
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:30.023628image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum183
5-th percentile9383
Q157105
median117998
Q3170462
95-th percentile216742
Maximum226123
Range225940
Interquartile range (IQR)113357

Descriptive statistics

Standard deviation66193.64372
Coefficient of variation (CV)0.5799605573
Kurtosis-1.198677487
Mean114134.7336
Median Absolute Deviation (MAD)56492
Skewness-0.03730791676
Sum3.124535348 × 1011
Variance4381598470
MonotocityIncreasing
2021-02-26T00:47:30.134610image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2059075227
 
0.2%
1308805157
 
0.2%
393254912
 
0.2%
1873344737
 
0.2%
680954623
 
0.2%
415634509
 
0.2%
75084495
 
0.2%
587954471
 
0.2%
1662844352
 
0.2%
673604337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
1833424
0.1%
7201208
 
< 0.1%
9431985
0.1%
11712851
0.1%
14093731
0.1%
ValueCountFrequency (%)
2261232964
0.1%
2258782507
0.1%
2255592785
0.1%
2254572247
0.1%
2253552345
0.1%

accuracy
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
2620419 
0
 
117166

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
12620419
95.7%
0117166
 
4.3%

uid
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean503.3857889
Minimum1
Maximum1000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:30.341874image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48
Q1258
median505
Q3752
95-th percentile951
Maximum1000
Range999
Interquartile range (IQR)494

Descriptive statistics

Standard deviation287.8709563
Coefficient of variation (CV)0.5718694542
Kurtosis-1.191228998
Mean503.3857889
Median Absolute Deviation (MAD)247
Skewness-0.01357732368
Sum1378061385
Variance82869.68751
MonotocityIncreasing
2021-02-26T00:47:30.461416image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9005227
 
0.2%
5735157
 
0.2%
1854912
 
0.2%
8224737
 
0.2%
3034623
 
0.2%
1994509
 
0.2%
364495
 
0.2%
2674471
 
0.2%
7384352
 
0.2%
3004337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
13424
0.1%
21208
 
< 0.1%
31985
0.1%
42851
0.1%
53731
0.1%
ValueCountFrequency (%)
10002964
0.1%
9992507
0.1%
9982785
0.1%
9972247
0.1%
9962345
0.1%

compatible
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
1389795 
0
1347790 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
11389795
50.8%
01347790
49.2%

gamecount
Real number (ℝ≥0)

Distinct60
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.6504262
Minimum1
Maximum60
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.343040image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q117
median32
Q346
95-th percentile58
Maximum60
Range59
Interquartile range (IQR)29

Descriptive statistics

Standard deviation17.01935919
Coefficient of variation (CV)0.5377292263
Kurtosis-1.174470311
Mean31.6504262
Median Absolute Deviation (MAD)15
Skewness-0.06156084448
Sum86645732
Variance289.6585873
MonotocityNot monotonic
2021-02-26T00:47:31.457216image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5249656
 
1.8%
5349601
 
1.8%
5749574
 
1.8%
5649368
 
1.8%
4648970
 
1.8%
5148902
 
1.8%
3348600
 
1.8%
5548575
 
1.8%
5848566
 
1.8%
4248463
 
1.8%
Other values (50)2247310
82.1%
ValueCountFrequency (%)
131446
1.1%
233462
1.2%
337013
1.4%
438437
1.4%
539990
1.5%
ValueCountFrequency (%)
6048172
1.8%
5948289
1.8%
5848566
1.8%
5749574
1.8%
5649368
1.8%

totalcount
Real number (ℝ≥0)

Distinct304
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean215.4372357
Minimum100
Maximum2976
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.576919image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum100
5-th percentile104
Q1123
median159
Q3235
95-th percentile470
Maximum2976
Range2876
Interquartile range (IQR)112

Descriptive statistics

Standard deviation190.3430738
Coefficient of variation (CV)0.8835198483
Kurtosis65.73041639
Mean215.4372357
Median Absolute Deviation (MAD)44
Skewness6.364027397
Sum589777745
Variance36230.48576
MonotocityNot monotonic
2021-02-26T00:47:31.692428image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10345147
 
1.6%
11443535
 
1.6%
11641625
 
1.5%
10740849
 
1.5%
10639642
 
1.4%
12338340
 
1.4%
13035801
 
1.3%
12635543
 
1.3%
10433174
 
1.2%
10932868
 
1.2%
Other values (294)2351061
85.9%
ValueCountFrequency (%)
10024692
0.9%
10130732
1.1%
10223750
0.9%
10345147
1.6%
10433174
1.2%
ValueCountFrequency (%)
29762214
0.1%
24372148
0.1%
22952335
0.1%
13032704
0.1%
11433027
0.1%

agebin
Real number (ℝ≥0)

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.291678615
Minimum2
Maximum7
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.792415image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q13
median4
Q36
95-th percentile7
Maximum7
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.683548655
Coefficient of variation (CV)0.3922820896
Kurtosis-1.218957282
Mean4.291678615
Median Absolute Deviation (MAD)1
Skewness0.1484660005
Sum11748835
Variance2.834336073
MonotocityNot monotonic
2021-02-26T00:47:31.873626image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2530345
19.4%
3504662
18.4%
4477557
17.4%
5448493
16.4%
6414230
15.1%
7362298
13.2%
ValueCountFrequency (%)
2530345
19.4%
3504662
18.4%
4477557
17.4%
5448493
16.4%
6414230
15.1%
ValueCountFrequency (%)
7362298
13.2%
6414230
15.1%
5448493
16.4%
4477557
17.4%
3504662
18.4%

trialtypecount
Real number (ℝ≥0)

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.684821111
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.955642image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum18
Range17
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.34778765
Coefficient of variation (CV)0.6371510527
Kurtosis0.5219900274
Mean3.684821111
Median Absolute Deviation (MAD)2
Skewness0.9242221618
Sum10087511
Variance5.512106848
MonotocityNot monotonic
2021-02-26T00:47:32.038607image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
6217435
7.9%
7149572
 
5.5%
895438
 
3.5%
956438
 
2.1%
1030498
 
1.1%
Other values (8)25150
 
0.9%
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
ValueCountFrequency (%)
182
 
< 0.1%
1714
 
< 0.1%
1662
 
< 0.1%
15236
 
< 0.1%
14856
< 0.1%

rtsum
Real number (ℝ≥0)

Distinct18605
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3559.939113
Minimum200
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.151188image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile822
Q11681
median2997
Q34808
95-th percentile8265
Maximum36940
Range36740
Interquartile range (IQR)3127

Descriptive statistics

Standard deviation2482.743047
Coefficient of variation (CV)0.6974116603
Kurtosis4.057938333
Mean3559.939113
Median Absolute Deviation (MAD)1474
Skewness1.530360499
Sum9745635916
Variance6164013.038
MonotocityNot monotonic
2021-02-26T00:47:32.264292image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10002451
 
0.1%
8902263
 
0.1%
8602153
 
0.1%
8802109
 
0.1%
9102100
 
0.1%
9202093
 
0.1%
9501997
 
0.1%
8501964
 
0.1%
7501925
 
0.1%
9801923
 
0.1%
Other values (18595)2716607
99.2%
ValueCountFrequency (%)
20058
< 0.1%
20113
 
< 0.1%
20212
 
< 0.1%
20346
< 0.1%
20419
 
< 0.1%
ValueCountFrequency (%)
369401
< 0.1%
365632
< 0.1%
351652
< 0.1%
339701
< 0.1%
338761
< 0.1%

runlength
Real number (ℝ≥0)

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.369214472
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.367379image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q14
median6
Q38
95-th percentile11
Maximum18
Range17
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.577218232
Coefficient of variation (CV)0.4046367482
Kurtosis-0.1976977492
Mean6.369214472
Median Absolute Deviation (MAD)2
Skewness0.3229929995
Sum17436266
Variance6.642053817
MonotocityNot monotonic
2021-02-26T00:47:32.454938image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
6406575
14.9%
5387264
14.1%
7379647
13.9%
4320426
11.7%
8311379
11.4%
9233452
8.5%
3218517
8.0%
10156953
 
5.7%
2115100
 
4.2%
1189827
 
3.3%
Other values (8)118445
 
4.3%
ValueCountFrequency (%)
135205
 
1.3%
2115100
 
4.2%
3218517
8.0%
4320426
11.7%
5387264
14.1%
ValueCountFrequency (%)
1836
 
< 0.1%
17194
 
< 0.1%
16742
 
< 0.1%
152602
 
0.1%
148825
0.3%

maxrtblock
Real number (ℝ≥0)

Distinct16981
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6062.76754
Minimum200
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.571465image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile2140
Q13991
median5640
Q37596
95-th percentile11420
Maximum36940
Range36740
Interquartile range (IQR)3605

Descriptive statistics

Standard deviation2967.265276
Coefficient of variation (CV)0.4894242203
Kurtosis3.422175822
Mean6062.76754
Median Absolute Deviation (MAD)1778
Skewness1.243493494
Sum1.659734148 × 1010
Variance8804663.219
MonotocityNot monotonic
2021-02-26T00:47:32.686684image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4860938
 
< 0.1%
5010902
 
< 0.1%
4110891
 
< 0.1%
4500884
 
< 0.1%
5250875
 
< 0.1%
4720852
 
< 0.1%
4640827
 
< 0.1%
5030820
 
< 0.1%
4470820
 
< 0.1%
4890819
 
< 0.1%
Other values (16971)2728957
99.7%
ValueCountFrequency (%)
2004
< 0.1%
2011
 
< 0.1%
2021
 
< 0.1%
2033
< 0.1%
2051
 
< 0.1%
ValueCountFrequency (%)
3694014
< 0.1%
3656324
< 0.1%
3397014
< 0.1%
3387614
< 0.1%
3386114
< 0.1%

runlengthprev
Real number (ℝ≥0)

MISSING

Distinct18
Distinct (%)< 0.1%
Missing238072
Missing (%)8.7%
Infinite0
Infinite (%)0.0%
Mean5.174957682
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.793060image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q37
95-th percentile10
Maximum18
Range17
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.553763578
Coefficient of variation (CV)0.49348492
Kurtosis-0.1032082987
Mean5.174957682
Median Absolute Deviation (MAD)2
Skewness0.5046729018
Sum12934874
Variance6.52170841
MonotocityNot monotonic
2021-02-26T00:47:32.879531image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
4372882
13.6%
5365340
13.3%
3331518
12.1%
6322462
11.8%
7260088
9.5%
2251188
9.2%
8187111
6.8%
1135396
 
4.9%
9125513
 
4.6%
1075827
 
2.8%
Other values (8)72188
 
2.6%
(Missing)238072
8.7%
ValueCountFrequency (%)
1135396
 
4.9%
2251188
9.2%
3331518
12.1%
4372882
13.6%
5365340
13.3%
ValueCountFrequency (%)
186
 
< 0.1%
1755
 
< 0.1%
16234
 
< 0.1%
15808
 
< 0.1%
143070
0.1%

rtsumprev
Real number (ℝ≥0)

Distinct16779
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4493.714501
Minimum20
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.998234image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile20
Q12426
median4180
Q36169
95-th percentile9794
Maximum36940
Range36920
Interquartile range (IQR)3743

Descriptive statistics

Standard deviation2994.103889
Coefficient of variation (CV)0.6662870747
Kurtosis2.241926806
Mean4493.714501
Median Absolute Deviation (MAD)1855
Skewness0.9604178463
Sum1.230192541 × 1010
Variance8964658.099
MonotocityNot monotonic
2021-02-26T00:47:33.116630image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20238072
 
8.7%
2640923
 
< 0.1%
3610887
 
< 0.1%
4110874
 
< 0.1%
3860863
 
< 0.1%
3250844
 
< 0.1%
3940839
 
< 0.1%
2990834
 
< 0.1%
3830833
 
< 0.1%
4860828
 
< 0.1%
Other values (16769)2491788
91.0%
ValueCountFrequency (%)
20238072
8.7%
20021
 
< 0.1%
2015
 
< 0.1%
2025
 
< 0.1%
20315
 
< 0.1%
ValueCountFrequency (%)
369407
< 0.1%
365636
< 0.1%
339701
 
< 0.1%
338766
< 0.1%
338612
 
< 0.1%

isswitch
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
0
2242195 
1
495390 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
02242195
81.9%
1495390
 
18.1%

movementd
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
685479 
4
684674 
3
684179 
1
683253 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2685479
25.0%
4684674
25.0%
3684179
25.0%
1683253
25.0%

pointingd
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
685485 
4
684766 
3
683900 
1
683434 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2685485
25.0%
4684766
25.0%
3683900
25.0%
1683434
25.0%

task
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
1429403 
1
1308182 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2
ValueCountFrequency (%)
21429403
52.2%
11308182
47.8%

choice
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
692256 
2
688978 
3
679642 
4
676709 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
1692256
25.3%
2688978
25.2%
3679642
24.8%
4676709
24.7%

rlprev
Real number (ℝ≥0)

Distinct19
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.464206226
Minimum1
Maximum20
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:33.879759image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q13
median5
Q38
95-th percentile20
Maximum20
Range19
Interquartile range (IQR)5

Descriptive statistics

Standard deviation4.837929472
Coefficient of variation (CV)0.748418182
Kurtosis2.575020147
Mean6.464206226
Median Absolute Deviation (MAD)2
Skewness1.751748867
Sum17696314
Variance23.40556157
MonotocityNot monotonic
2021-02-26T00:47:33.972148image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
4372882
13.6%
5365340
13.3%
3331518
12.1%
6322462
11.8%
7260088
9.5%
2251188
9.2%
20238072
8.7%
8187111
6.8%
1135396
 
4.9%
9125513
 
4.6%
Other values (9)148015
 
5.4%
ValueCountFrequency (%)
1135396
 
4.9%
2251188
9.2%
3331518
12.1%
4372882
13.6%
5365340
13.3%
ValueCountFrequency (%)
20238072
8.7%
186
 
< 0.1%
1755
 
< 0.1%
16234
 
< 0.1%
15808
 
< 0.1%

newuid
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean503.3857889
Minimum1
Maximum1000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:34.088637image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48
Q1258
median505
Q3752
95-th percentile951
Maximum1000
Range999
Interquartile range (IQR)494

Descriptive statistics

Standard deviation287.8709563
Coefficient of variation (CV)0.5718694542
Kurtosis-1.191228998
Mean503.3857889
Median Absolute Deviation (MAD)247
Skewness-0.01357732368
Sum1378061385
Variance82869.68751
MonotocityIncreasing
2021-02-26T00:47:34.208259image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9005227
 
0.2%
5735157
 
0.2%
1854912
 
0.2%
8224737
 
0.2%
3034623
 
0.2%
1994509
 
0.2%
364495
 
0.2%
2674471
 
0.2%
7384352
 
0.2%
3004337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
13424
0.1%
21208
 
< 0.1%
31985
0.1%
42851
0.1%
53731
0.1%
ValueCountFrequency (%)
10002964
0.1%
9992507
0.1%
9982785
0.1%
9972247
0.1%
9962345
0.1%

trialtypecount2
Real number (ℝ≥0)

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.608637175
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:34.308116image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum8
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.157646622
Coefficient of variation (CV)0.597911765
Kurtosis-0.7473274996
Mean3.608637175
Median Absolute Deviation (MAD)2
Skewness0.5600049944
Sum9878951
Variance4.655438947
MonotocityNot monotonic
2021-02-26T00:47:34.399523image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
6217435
7.9%
8207524
 
7.6%
7149572
 
5.5%
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
ValueCountFrequency (%)
8207524
7.6%
7149572
 
5.5%
6217435
7.9%
5294700
10.8%
4374701
13.7%
\ No newline at end of file diff --git a/stayvers2019/eda.py b/stayvers2019/eda.py new file mode 100644 index 0000000..fe5d954 --- /dev/null +++ b/stayvers2019/eda.py @@ -0,0 +1,39 @@ +# %% + +import dask.dataframe as dd +import pandas as pd + +# gamedata_path = '~/Downloads/gamedata_preprocessed.csv' +# gamedata_path = '~/Downloads/gamedata_original-v1.csv' +gamedata_path = '/Users/morteza/Downloads/data/Sample 1000 individuals/data/data_sim6007.csv' + +# lazy load gamedata +DATA = dd.read_csv(gamedata_path) + +# df.npartitions +DATA.columns +#%% +# number of unique subjects +DATA['user_id'].value_counts().compute() +# 1000 users + +DATA['task'].value_counts().compute() +# task "1" or "2" +#%% + +# dask runs queries in multiple thread; we only want one user_id though +sample_user = DATA.loc[0,'user_id'].compute().iloc[0] + +SAMPLE_USER_DATA = DATA.query('user_id == @sample_user', local_dict={'sample_user': sample_user}).compute() + +#%% +# generate EDA reports + +from pandas_profiling import ProfileReport + +profile = ProfileReport(DATA.compute(), title='Lumocity All Users EDA Report', minimal=True) +profile.to_file("all_users_eda.html") + + +profile = ProfileReport(SAMPLE_USER_DATA, title='Lumocity Sample User EDA Report', explorative=True) +profile.to_file("sample_user_eda.html") diff --git a/stayvers2019/requirements.txt b/stayvers2019/requirements.txt new file mode 100644 index 0000000..98f53cd --- /dev/null +++ b/stayvers2019/requirements.txt @@ -0,0 +1,3 @@ +dask +dask[dataframe] +pandas diff --git a/stayvers2019/all_users_eda.html b/stayvers2019/all_users_eda.html new file mode 100644 index 0000000..be28617 --- /dev/null +++ b/stayvers2019/all_users_eda.html @@ -0,0 +1,15462 @@ +Lumocity All Users EDA Report

Overview

Dataset statistics

Number of variables30
Number of observations2737585
Missing cells238072
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory626.6 MiB
Average record size in memory240.0 B

Variable types

Numeric18
Boolean1
Categorical11

Warnings

runlengthprev has 238072 (8.7%) missing values Missing

Reproduction

Analysis started2021-02-25 23:46:00.622032
Analysis finished2021-02-25 23:47:27.632007
Duration1 minute and 27.01 seconds
Software versionpandas-profiling v2.10.0
Download configurationconfig.yaml

Variables

df_index
Real number (ℝ≥0)

Distinct750114
Distinct (%)27.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean349316.8219
Minimum0
Maximum750113
Zeros4
Zeros (%)< 0.1%
Memory size20.9 MiB
2021-02-26T00:47:28.477006image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile34219.2
Q1171099
median342198
Q3513297
95-th percentile695667.8
Maximum750113
Range750113
Interquartile range (IQR)342198

Descriptive statistics

Standard deviation208132.9105
Coefficient of variation (CV)0.5958284785
Kurtosis-1.101560579
Mean349316.8219
Median Absolute Deviation (MAD)171099
Skewness0.1226952489
Sum9.562844919 × 1011
Variance4.331930844 × 1010
MonotocityNot monotonic
2021-02-26T00:47:28.758862image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20474
 
< 0.1%
908864
 
< 0.1%
949804
 
< 0.1%
847394
 
< 0.1%
826904
 
< 0.1%
888334
 
< 0.1%
867844
 
< 0.1%
4392954
 
< 0.1%
4413424
 
< 0.1%
4351974
 
< 0.1%
Other values (750104)2737545
> 99.9%
ValueCountFrequency (%)
04
< 0.1%
14
< 0.1%
24
< 0.1%
34
< 0.1%
44
< 0.1%
ValueCountFrequency (%)
7501131
< 0.1%
7501121
< 0.1%
7501111
< 0.1%
7501101
< 0.1%
7501091
< 0.1%

correct
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
True
2620419 
False
 
117166
ValueCountFrequency (%)
True2620419
95.7%
False117166
 
4.3%

game_result_id
Real number (ℝ≥0)

Distinct46470
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31632.99275
Minimum1
Maximum76473
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:28.902801image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2712
Q113005
median28517
Q349148
95-th percentile69293
Maximum76473
Range76472
Interquartile range (IQR)36143

Descriptive statistics

Standard deviation21377.80202
Coefficient of variation (CV)0.6758071292
Kurtosis-1.061939459
Mean31632.99275
Median Absolute Deviation (MAD)17561
Skewness0.3422095459
Sum8.659800647 × 1010
Variance457010419.2
MonotocityNot monotonic
2021-02-26T00:47:29.023391image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1212279
 
< 0.1%
1207272
 
< 0.1%
1210268
 
< 0.1%
1209257
 
< 0.1%
1222249
 
< 0.1%
1257249
 
< 0.1%
1223245
 
< 0.1%
1250243
 
< 0.1%
1249242
 
< 0.1%
1233221
 
< 0.1%
Other values (46460)2735060
99.9%
ValueCountFrequency (%)
1121
< 0.1%
2114
< 0.1%
397
< 0.1%
493
< 0.1%
5110
< 0.1%
ValueCountFrequency (%)
7647371
< 0.1%
7647265
< 0.1%
7647063
< 0.1%
7645564
< 0.1%
7645164
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
R
685479 
D
684674 
U
684179 
L
683253 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R685479
25.0%
D684674
25.0%
U684179
25.0%
L683253
25.0%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
R
685485 
D
684766 
U
683900 
L
683434 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R685485
25.0%
D684766
25.0%
U683900
25.0%
L683434
25.0%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
L
692256 
R
688978 
U
679642 
D
676709 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
L692256
25.3%
R688978
25.2%
U679642
24.8%
D676709
24.7%

response_time
Real number (ℝ≥0)

Distinct4443
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean958.9477788
Minimum200
Maximum5000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:29.443458image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile642
Q1757
median870
Q31054
95-th percentile1575
Maximum5000
Range4800
Interquartile range (IQR)297

Descriptive statistics

Standard deviation340.5888668
Coefficient of variation (CV)0.3551693578
Kurtosis15.06450001
Mean958.9477788
Median Absolute Deviation (MAD)135
Skewness2.912185805
Sum2625201055
Variance116000.7762
MonotocityNot monotonic
2021-02-26T00:47:29.564760image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
75018501
 
0.7%
78015313
 
0.6%
79013160
 
0.5%
82012968
 
0.5%
81012879
 
0.5%
86012591
 
0.5%
85012439
 
0.5%
89012351
 
0.5%
84011955
 
0.4%
77011917
 
0.4%
Other values (4433)2603511
95.1%
ValueCountFrequency (%)
20058
< 0.1%
20113
 
< 0.1%
20212
 
< 0.1%
20346
< 0.1%
20419
 
< 0.1%
ValueCountFrequency (%)
50001
 
< 0.1%
49981
 
< 0.1%
49951
 
< 0.1%
49934
< 0.1%
49911
 
< 0.1%

trial_num
Real number (ℝ≥0)

Distinct107
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.44657682
Minimum1
Maximum107
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:29.686306image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q116
median31
Q348
95-th percentile67
Maximum107
Range106
Interquartile range (IQR)32

Descriptive statistics

Standard deviation19.85221324
Coefficient of variation (CV)0.6118430722
Kurtosis-0.8569413615
Mean32.44657682
Median Absolute Deviation (MAD)16
Skewness0.2920395952
Sum88825262
Variance394.1103707
MonotocityNot monotonic
2021-02-26T00:47:29.808398image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
645141
 
1.6%
1345133
 
1.6%
1445127
 
1.6%
945124
 
1.6%
1545124
 
1.6%
1245114
 
1.6%
745108
 
1.6%
1945102
 
1.6%
145099
 
1.6%
845093
 
1.6%
Other values (97)2286420
83.5%
ValueCountFrequency (%)
145099
1.6%
245001
1.6%
345089
1.6%
445070
1.6%
545047
1.6%
ValueCountFrequency (%)
1071
< 0.1%
1061
< 0.1%
1051
< 0.1%
1041
< 0.1%
1031
< 0.1%

trial_type
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
P
1429403 
M
1308182 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowP
2nd rowP
3rd rowP
4th rowP
5th rowP
ValueCountFrequency (%)
P1429403
52.2%
M1308182
47.8%

user_id
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean114134.7336
Minimum183
Maximum226123
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:30.023628image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum183
5-th percentile9383
Q157105
median117998
Q3170462
95-th percentile216742
Maximum226123
Range225940
Interquartile range (IQR)113357

Descriptive statistics

Standard deviation66193.64372
Coefficient of variation (CV)0.5799605573
Kurtosis-1.198677487
Mean114134.7336
Median Absolute Deviation (MAD)56492
Skewness-0.03730791676
Sum3.124535348 × 1011
Variance4381598470
MonotocityIncreasing
2021-02-26T00:47:30.134610image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2059075227
 
0.2%
1308805157
 
0.2%
393254912
 
0.2%
1873344737
 
0.2%
680954623
 
0.2%
415634509
 
0.2%
75084495
 
0.2%
587954471
 
0.2%
1662844352
 
0.2%
673604337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
1833424
0.1%
7201208
 
< 0.1%
9431985
0.1%
11712851
0.1%
14093731
0.1%
ValueCountFrequency (%)
2261232964
0.1%
2258782507
0.1%
2255592785
0.1%
2254572247
0.1%
2253552345
0.1%

accuracy
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
2620419 
0
 
117166

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
12620419
95.7%
0117166
 
4.3%

uid
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean503.3857889
Minimum1
Maximum1000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:30.341874image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48
Q1258
median505
Q3752
95-th percentile951
Maximum1000
Range999
Interquartile range (IQR)494

Descriptive statistics

Standard deviation287.8709563
Coefficient of variation (CV)0.5718694542
Kurtosis-1.191228998
Mean503.3857889
Median Absolute Deviation (MAD)247
Skewness-0.01357732368
Sum1378061385
Variance82869.68751
MonotocityIncreasing
2021-02-26T00:47:30.461416image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9005227
 
0.2%
5735157
 
0.2%
1854912
 
0.2%
8224737
 
0.2%
3034623
 
0.2%
1994509
 
0.2%
364495
 
0.2%
2674471
 
0.2%
7384352
 
0.2%
3004337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
13424
0.1%
21208
 
< 0.1%
31985
0.1%
42851
0.1%
53731
0.1%
ValueCountFrequency (%)
10002964
0.1%
9992507
0.1%
9982785
0.1%
9972247
0.1%
9962345
0.1%

compatible
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
1389795 
0
1347790 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
11389795
50.8%
01347790
49.2%

gamecount
Real number (ℝ≥0)

Distinct60
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.6504262
Minimum1
Maximum60
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.343040image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q117
median32
Q346
95-th percentile58
Maximum60
Range59
Interquartile range (IQR)29

Descriptive statistics

Standard deviation17.01935919
Coefficient of variation (CV)0.5377292263
Kurtosis-1.174470311
Mean31.6504262
Median Absolute Deviation (MAD)15
Skewness-0.06156084448
Sum86645732
Variance289.6585873
MonotocityNot monotonic
2021-02-26T00:47:31.457216image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5249656
 
1.8%
5349601
 
1.8%
5749574
 
1.8%
5649368
 
1.8%
4648970
 
1.8%
5148902
 
1.8%
3348600
 
1.8%
5548575
 
1.8%
5848566
 
1.8%
4248463
 
1.8%
Other values (50)2247310
82.1%
ValueCountFrequency (%)
131446
1.1%
233462
1.2%
337013
1.4%
438437
1.4%
539990
1.5%
ValueCountFrequency (%)
6048172
1.8%
5948289
1.8%
5848566
1.8%
5749574
1.8%
5649368
1.8%

totalcount
Real number (ℝ≥0)

Distinct304
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean215.4372357
Minimum100
Maximum2976
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.576919image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum100
5-th percentile104
Q1123
median159
Q3235
95-th percentile470
Maximum2976
Range2876
Interquartile range (IQR)112

Descriptive statistics

Standard deviation190.3430738
Coefficient of variation (CV)0.8835198483
Kurtosis65.73041639
Mean215.4372357
Median Absolute Deviation (MAD)44
Skewness6.364027397
Sum589777745
Variance36230.48576
MonotocityNot monotonic
2021-02-26T00:47:31.692428image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10345147
 
1.6%
11443535
 
1.6%
11641625
 
1.5%
10740849
 
1.5%
10639642
 
1.4%
12338340
 
1.4%
13035801
 
1.3%
12635543
 
1.3%
10433174
 
1.2%
10932868
 
1.2%
Other values (294)2351061
85.9%
ValueCountFrequency (%)
10024692
0.9%
10130732
1.1%
10223750
0.9%
10345147
1.6%
10433174
1.2%
ValueCountFrequency (%)
29762214
0.1%
24372148
0.1%
22952335
0.1%
13032704
0.1%
11433027
0.1%

agebin
Real number (ℝ≥0)

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.291678615
Minimum2
Maximum7
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.792415image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q13
median4
Q36
95-th percentile7
Maximum7
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.683548655
Coefficient of variation (CV)0.3922820896
Kurtosis-1.218957282
Mean4.291678615
Median Absolute Deviation (MAD)1
Skewness0.1484660005
Sum11748835
Variance2.834336073
MonotocityNot monotonic
2021-02-26T00:47:31.873626image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2530345
19.4%
3504662
18.4%
4477557
17.4%
5448493
16.4%
6414230
15.1%
7362298
13.2%
ValueCountFrequency (%)
2530345
19.4%
3504662
18.4%
4477557
17.4%
5448493
16.4%
6414230
15.1%
ValueCountFrequency (%)
7362298
13.2%
6414230
15.1%
5448493
16.4%
4477557
17.4%
3504662
18.4%

trialtypecount
Real number (ℝ≥0)

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.684821111
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:31.955642image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum18
Range17
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.34778765
Coefficient of variation (CV)0.6371510527
Kurtosis0.5219900274
Mean3.684821111
Median Absolute Deviation (MAD)2
Skewness0.9242221618
Sum10087511
Variance5.512106848
MonotocityNot monotonic
2021-02-26T00:47:32.038607image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
6217435
7.9%
7149572
 
5.5%
895438
 
3.5%
956438
 
2.1%
1030498
 
1.1%
Other values (8)25150
 
0.9%
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
ValueCountFrequency (%)
182
 
< 0.1%
1714
 
< 0.1%
1662
 
< 0.1%
15236
 
< 0.1%
14856
< 0.1%

rtsum
Real number (ℝ≥0)

Distinct18605
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3559.939113
Minimum200
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.151188image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile822
Q11681
median2997
Q34808
95-th percentile8265
Maximum36940
Range36740
Interquartile range (IQR)3127

Descriptive statistics

Standard deviation2482.743047
Coefficient of variation (CV)0.6974116603
Kurtosis4.057938333
Mean3559.939113
Median Absolute Deviation (MAD)1474
Skewness1.530360499
Sum9745635916
Variance6164013.038
MonotocityNot monotonic
2021-02-26T00:47:32.264292image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10002451
 
0.1%
8902263
 
0.1%
8602153
 
0.1%
8802109
 
0.1%
9102100
 
0.1%
9202093
 
0.1%
9501997
 
0.1%
8501964
 
0.1%
7501925
 
0.1%
9801923
 
0.1%
Other values (18595)2716607
99.2%
ValueCountFrequency (%)
20058
< 0.1%
20113
 
< 0.1%
20212
 
< 0.1%
20346
< 0.1%
20419
 
< 0.1%
ValueCountFrequency (%)
369401
< 0.1%
365632
< 0.1%
351652
< 0.1%
339701
< 0.1%
338761
< 0.1%

runlength
Real number (ℝ≥0)

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.369214472
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.367379image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q14
median6
Q38
95-th percentile11
Maximum18
Range17
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.577218232
Coefficient of variation (CV)0.4046367482
Kurtosis-0.1976977492
Mean6.369214472
Median Absolute Deviation (MAD)2
Skewness0.3229929995
Sum17436266
Variance6.642053817
MonotocityNot monotonic
2021-02-26T00:47:32.454938image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
6406575
14.9%
5387264
14.1%
7379647
13.9%
4320426
11.7%
8311379
11.4%
9233452
8.5%
3218517
8.0%
10156953
 
5.7%
2115100
 
4.2%
1189827
 
3.3%
Other values (8)118445
 
4.3%
ValueCountFrequency (%)
135205
 
1.3%
2115100
 
4.2%
3218517
8.0%
4320426
11.7%
5387264
14.1%
ValueCountFrequency (%)
1836
 
< 0.1%
17194
 
< 0.1%
16742
 
< 0.1%
152602
 
0.1%
148825
0.3%

maxrtblock
Real number (ℝ≥0)

Distinct16981
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6062.76754
Minimum200
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.571465image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile2140
Q13991
median5640
Q37596
95-th percentile11420
Maximum36940
Range36740
Interquartile range (IQR)3605

Descriptive statistics

Standard deviation2967.265276
Coefficient of variation (CV)0.4894242203
Kurtosis3.422175822
Mean6062.76754
Median Absolute Deviation (MAD)1778
Skewness1.243493494
Sum1.659734148 × 1010
Variance8804663.219
MonotocityNot monotonic
2021-02-26T00:47:32.686684image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4860938
 
< 0.1%
5010902
 
< 0.1%
4110891
 
< 0.1%
4500884
 
< 0.1%
5250875
 
< 0.1%
4720852
 
< 0.1%
4640827
 
< 0.1%
5030820
 
< 0.1%
4470820
 
< 0.1%
4890819
 
< 0.1%
Other values (16971)2728957
99.7%
ValueCountFrequency (%)
2004
< 0.1%
2011
 
< 0.1%
2021
 
< 0.1%
2033
< 0.1%
2051
 
< 0.1%
ValueCountFrequency (%)
3694014
< 0.1%
3656324
< 0.1%
3397014
< 0.1%
3387614
< 0.1%
3386114
< 0.1%

runlengthprev
Real number (ℝ≥0)

MISSING

Distinct18
Distinct (%)< 0.1%
Missing238072
Missing (%)8.7%
Infinite0
Infinite (%)0.0%
Mean5.174957682
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.793060image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q37
95-th percentile10
Maximum18
Range17
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.553763578
Coefficient of variation (CV)0.49348492
Kurtosis-0.1032082987
Mean5.174957682
Median Absolute Deviation (MAD)2
Skewness0.5046729018
Sum12934874
Variance6.52170841
MonotocityNot monotonic
2021-02-26T00:47:32.879531image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
4372882
13.6%
5365340
13.3%
3331518
12.1%
6322462
11.8%
7260088
9.5%
2251188
9.2%
8187111
6.8%
1135396
 
4.9%
9125513
 
4.6%
1075827
 
2.8%
Other values (8)72188
 
2.6%
(Missing)238072
8.7%
ValueCountFrequency (%)
1135396
 
4.9%
2251188
9.2%
3331518
12.1%
4372882
13.6%
5365340
13.3%
ValueCountFrequency (%)
186
 
< 0.1%
1755
 
< 0.1%
16234
 
< 0.1%
15808
 
< 0.1%
143070
0.1%

rtsumprev
Real number (ℝ≥0)

Distinct16779
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4493.714501
Minimum20
Maximum36940
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:32.998234image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile20
Q12426
median4180
Q36169
95-th percentile9794
Maximum36940
Range36920
Interquartile range (IQR)3743

Descriptive statistics

Standard deviation2994.103889
Coefficient of variation (CV)0.6662870747
Kurtosis2.241926806
Mean4493.714501
Median Absolute Deviation (MAD)1855
Skewness0.9604178463
Sum1.230192541 × 1010
Variance8964658.099
MonotocityNot monotonic
2021-02-26T00:47:33.116630image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20238072
 
8.7%
2640923
 
< 0.1%
3610887
 
< 0.1%
4110874
 
< 0.1%
3860863
 
< 0.1%
3250844
 
< 0.1%
3940839
 
< 0.1%
2990834
 
< 0.1%
3830833
 
< 0.1%
4860828
 
< 0.1%
Other values (16769)2491788
91.0%
ValueCountFrequency (%)
20238072
8.7%
20021
 
< 0.1%
2015
 
< 0.1%
2025
 
< 0.1%
20315
 
< 0.1%
ValueCountFrequency (%)
369407
< 0.1%
365636
< 0.1%
339701
 
< 0.1%
338766
< 0.1%
338612
 
< 0.1%

isswitch
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
0
2242195 
1
495390 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
02242195
81.9%
1495390
 
18.1%

movementd
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
685479 
4
684674 
3
684179 
1
683253 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2685479
25.0%
4684674
25.0%
3684179
25.0%
1683253
25.0%

pointingd
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
685485 
4
684766 
3
683900 
1
683434 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2685485
25.0%
4684766
25.0%
3683900
25.0%
1683434
25.0%

task
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
2
1429403 
1
1308182 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2
ValueCountFrequency (%)
21429403
52.2%
11308182
47.8%

choice
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.9 MiB
1
692256 
2
688978 
3
679642 
4
676709 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
1692256
25.3%
2688978
25.2%
3679642
24.8%
4676709
24.7%

rlprev
Real number (ℝ≥0)

Distinct19
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.464206226
Minimum1
Maximum20
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:33.879759image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q13
median5
Q38
95-th percentile20
Maximum20
Range19
Interquartile range (IQR)5

Descriptive statistics

Standard deviation4.837929472
Coefficient of variation (CV)0.748418182
Kurtosis2.575020147
Mean6.464206226
Median Absolute Deviation (MAD)2
Skewness1.751748867
Sum17696314
Variance23.40556157
MonotocityNot monotonic
2021-02-26T00:47:33.972148image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
4372882
13.6%
5365340
13.3%
3331518
12.1%
6322462
11.8%
7260088
9.5%
2251188
9.2%
20238072
8.7%
8187111
6.8%
1135396
 
4.9%
9125513
 
4.6%
Other values (9)148015
 
5.4%
ValueCountFrequency (%)
1135396
 
4.9%
2251188
9.2%
3331518
12.1%
4372882
13.6%
5365340
13.3%
ValueCountFrequency (%)
20238072
8.7%
186
 
< 0.1%
1755
 
< 0.1%
16234
 
< 0.1%
15808
 
< 0.1%

newuid
Real number (ℝ≥0)

Distinct1000
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean503.3857889
Minimum1
Maximum1000
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:34.088637image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48
Q1258
median505
Q3752
95-th percentile951
Maximum1000
Range999
Interquartile range (IQR)494

Descriptive statistics

Standard deviation287.8709563
Coefficient of variation (CV)0.5718694542
Kurtosis-1.191228998
Mean503.3857889
Median Absolute Deviation (MAD)247
Skewness-0.01357732368
Sum1378061385
Variance82869.68751
MonotocityIncreasing
2021-02-26T00:47:34.208259image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9005227
 
0.2%
5735157
 
0.2%
1854912
 
0.2%
8224737
 
0.2%
3034623
 
0.2%
1994509
 
0.2%
364495
 
0.2%
2674471
 
0.2%
7384352
 
0.2%
3004337
 
0.2%
Other values (990)2690765
98.3%
ValueCountFrequency (%)
13424
0.1%
21208
 
< 0.1%
31985
0.1%
42851
0.1%
53731
0.1%
ValueCountFrequency (%)
10002964
0.1%
9992507
0.1%
9982785
0.1%
9972247
0.1%
9962345
0.1%

trialtypecount2
Real number (ℝ≥0)

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.608637175
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Memory size20.9 MiB
2021-02-26T00:47:34.308116image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum8
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.157646622
Coefficient of variation (CV)0.597911765
Kurtosis-0.7473274996
Mean3.608637175
Median Absolute Deviation (MAD)2
Skewness0.5600049944
Sum9878951
Variance4.655438947
MonotocityNot monotonic
2021-02-26T00:47:34.399523image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
6217435
7.9%
8207524
 
7.6%
7149572
 
5.5%
ValueCountFrequency (%)
1540489
19.7%
2505140
18.5%
3448024
16.4%
4374701
13.7%
5294700
10.8%
ValueCountFrequency (%)
8207524
7.6%
7149572
 
5.5%
6217435
7.9%
5294700
10.8%
4374701
13.7%
\ No newline at end of file diff --git a/stayvers2019/eda.py b/stayvers2019/eda.py new file mode 100644 index 0000000..fe5d954 --- /dev/null +++ b/stayvers2019/eda.py @@ -0,0 +1,39 @@ +# %% + +import dask.dataframe as dd +import pandas as pd + +# gamedata_path = '~/Downloads/gamedata_preprocessed.csv' +# gamedata_path = '~/Downloads/gamedata_original-v1.csv' +gamedata_path = '/Users/morteza/Downloads/data/Sample 1000 individuals/data/data_sim6007.csv' + +# lazy load gamedata +DATA = dd.read_csv(gamedata_path) + +# df.npartitions +DATA.columns +#%% +# number of unique subjects +DATA['user_id'].value_counts().compute() +# 1000 users + +DATA['task'].value_counts().compute() +# task "1" or "2" +#%% + +# dask runs queries in multiple thread; we only want one user_id though +sample_user = DATA.loc[0,'user_id'].compute().iloc[0] + +SAMPLE_USER_DATA = DATA.query('user_id == @sample_user', local_dict={'sample_user': sample_user}).compute() + +#%% +# generate EDA reports + +from pandas_profiling import ProfileReport + +profile = ProfileReport(DATA.compute(), title='Lumocity All Users EDA Report', minimal=True) +profile.to_file("all_users_eda.html") + + +profile = ProfileReport(SAMPLE_USER_DATA, title='Lumocity Sample User EDA Report', explorative=True) +profile.to_file("sample_user_eda.html") diff --git a/stayvers2019/requirements.txt b/stayvers2019/requirements.txt new file mode 100644 index 0000000..98f53cd --- /dev/null +++ b/stayvers2019/requirements.txt @@ -0,0 +1,3 @@ +dask +dask[dataframe] +pandas diff --git a/stayvers2019/sample_user_eda.html b/stayvers2019/sample_user_eda.html new file mode 100644 index 0000000..5f862b5 --- /dev/null +++ b/stayvers2019/sample_user_eda.html @@ -0,0 +1,10308 @@ +Lumocity Sample User EDA Report

Overview

Dataset statistics

Number of variables29
Number of observations3424
Missing cells242
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 MiB
Average record size in memory490.0 B

Variable types

Boolean1
Numeric12
Categorical16

Warnings

user_id has constant value "183" Constant
uid has constant value "1" Constant
totalcount has constant value "125" Constant
agebin has constant value "2" Constant
newuid has constant value "1" Constant
runlengthprev has 242 (7.1%) missing values Missing

Reproduction

Analysis started2021-02-25 23:47:35.405206
Analysis finished2021-02-25 23:47:36.161678
Duration0.76 seconds
Software versionpandas-profiling v2.10.0
Download configurationconfig.yaml

Variables

correct
Boolean

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size30.1 KiB
True
3271 
False
 
153
ValueCountFrequency (%)
True3271
95.5%
False153
 
4.5%

game_result_id
Real number (ℝ≥0)

Distinct49
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50029.30637
Minimum47986
Maximum51985
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:36.286515image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum47986
5-th percentile48256
Q149008
median49802
Q351069
95-th percentile51906
Maximum51985
Range3999
Interquartile range (IQR)2061

Descriptive statistics

Standard deviation1260.955837
Coefficient of variation (CV)0.02520434379
Kurtosis-1.41843207
Mean50029.30637
Median Absolute Deviation (MAD)1036
Skewness0.09574972514
Sum171300345
Variance1590009.624
MonotocityNot monotonic
2021-02-26T00:47:36.424038image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
5190680
 
2.3%
5190380
 
2.3%
5190580
 
2.3%
5139579
 
2.3%
4907378
 
2.3%
4906878
 
2.3%
4907078
 
2.3%
5101078
 
2.3%
5198577
 
2.2%
4907677
 
2.2%
Other values (39)2639
77.1%
ValueCountFrequency (%)
4798640
1.2%
4798753
1.5%
4798947
1.4%
4825655
1.6%
4825765
1.9%
4825865
1.9%
4826064
1.9%
4890549
1.4%
4891559
1.7%
4891763
1.8%
ValueCountFrequency (%)
5198577
2.2%
5190776
2.2%
5190680
2.3%
5190580
2.3%
5190476
2.2%
5190380
2.3%
5139579
2.3%
5139472
2.1%
5137171
2.1%
5129974
2.2%
Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
R
897 
U
871 
D
840 
L
816 

Characters and Unicode

Total characters3424
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R897
26.2%
U871
25.4%
D840
24.5%
L816
23.8%
ValueCountFrequency (%)
r897
26.2%
u871
25.4%
d840
24.5%
l816
23.8%

Most occurring characters

ValueCountFrequency (%)
R897
26.2%
U871
25.4%
D840
24.5%
L816
23.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
R897
26.2%
U871
25.4%
D840
24.5%
L816
23.8%

Most occurring scripts

ValueCountFrequency (%)
Latin3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
R897
26.2%
U871
25.4%
D840
24.5%
L816
23.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
R897
26.2%
U871
25.4%
D840
24.5%
L816
23.8%
Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
R
872 
D
853 
U
851 
L
848 

Characters and Unicode

Total characters3424
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R872
25.5%
D853
24.9%
U851
24.9%
L848
24.8%
ValueCountFrequency (%)
r872
25.5%
d853
24.9%
u851
24.9%
l848
24.8%

Most occurring characters

ValueCountFrequency (%)
R872
25.5%
D853
24.9%
U851
24.9%
L848
24.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
R872
25.5%
D853
24.9%
U851
24.9%
L848
24.8%

Most occurring scripts

ValueCountFrequency (%)
Latin3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
R872
25.5%
D853
24.9%
U851
24.9%
L848
24.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
R872
25.5%
D853
24.9%
U851
24.9%
L848
24.8%
Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
R
880 
D
863 
L
851 
U
830 

Characters and Unicode

Total characters3424
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR
2nd rowU
3rd rowD
4th rowD
5th rowR
ValueCountFrequency (%)
R880
25.7%
D863
25.2%
L851
24.9%
U830
24.2%
ValueCountFrequency (%)
r880
25.7%
d863
25.2%
l851
24.9%
u830
24.2%

Most occurring characters

ValueCountFrequency (%)
R880
25.7%
D863
25.2%
L851
24.9%
U830
24.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
R880
25.7%
D863
25.2%
L851
24.9%
U830
24.2%

Most occurring scripts

ValueCountFrequency (%)
Latin3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
R880
25.7%
D863
25.2%
L851
24.9%
U830
24.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
R880
25.7%
D863
25.2%
L851
24.9%
U830
24.2%

response_time
Real number (ℝ≥0)

Distinct703
Distinct (%)20.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean800.3717874
Minimum212
Maximum2921
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:36.884116image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum212
5-th percentile621
Q1692
median752
Q3858
95-th percentile1125
Maximum2921
Range2709
Interquartile range (IQR)166

Descriptive statistics

Standard deviation187.9602643
Coefficient of variation (CV)0.2348411917
Kurtosis16.28080182
Mean800.3717874
Median Absolute Deviation (MAD)73
Skewness2.891247563
Sum2740473
Variance35329.06097
MonotocityNot monotonic
2021-02-26T00:47:37.013882image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
71432
 
0.9%
72631
 
0.9%
72531
 
0.9%
69229
 
0.8%
70329
 
0.8%
70228
 
0.8%
68026
 
0.8%
68125
 
0.7%
73725
 
0.7%
73624
 
0.7%
Other values (693)3144
91.8%
ValueCountFrequency (%)
2121
< 0.1%
2181
< 0.1%
2531
< 0.1%
2851
< 0.1%
3261
< 0.1%
3401
< 0.1%
3441
< 0.1%
3791
< 0.1%
3841
< 0.1%
3871
< 0.1%
ValueCountFrequency (%)
29211
< 0.1%
24341
< 0.1%
24161
< 0.1%
22701
< 0.1%
21261
< 0.1%
21071
< 0.1%
20601
< 0.1%
20461
< 0.1%
20382
0.1%
19881
< 0.1%

trial_num
Real number (ℝ≥0)

Distinct84
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.80403037
Minimum1
Maximum84
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:37.150527image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q119
median37
Q356
95-th percentile74
Maximum84
Range83
Interquartile range (IQR)37

Descriptive statistics

Standard deviation22.05120793
Coefficient of variation (CV)0.5833030952
Kurtosis-1.113330421
Mean37.80403037
Median Absolute Deviation (MAD)19
Skewness0.0978722568
Sum129441
Variance486.2557711
MonotocityNot monotonic
2021-02-26T00:47:37.276279image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1249
 
1.4%
1449
 
1.4%
1649
 
1.4%
2449
 
1.4%
4248
 
1.4%
4448
 
1.4%
2548
 
1.4%
1548
 
1.4%
2948
 
1.4%
3348
 
1.4%
Other values (74)2940
85.9%
ValueCountFrequency (%)
142
1.2%
246
1.3%
347
1.4%
448
1.4%
548
1.4%
647
1.4%
746
1.3%
847
1.4%
945
1.3%
1048
1.4%
ValueCountFrequency (%)
842
 
0.1%
832
 
0.1%
827
 
0.2%
8111
 
0.3%
8012
0.4%
7913
0.4%
7818
0.5%
7723
0.7%
7629
0.8%
7529
0.8%

trial_type
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
P
1777 
M
1647 

Characters and Unicode

Total characters3424
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowP
2nd rowP
3rd rowP
4th rowP
5th rowP
ValueCountFrequency (%)
P1777
51.9%
M1647
48.1%
ValueCountFrequency (%)
p1777
51.9%
m1647
48.1%

Most occurring characters

ValueCountFrequency (%)
P1777
51.9%
M1647
48.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
P1777
51.9%
M1647
48.1%

Most occurring scripts

ValueCountFrequency (%)
Latin3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
P1777
51.9%
M1647
48.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
P1777
51.9%
M1647
48.1%

user_id
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size227.4 KiB
183
3424 

Characters and Unicode

Total characters10272
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row183
2nd row183
3rd row183
4th row183
5th row183
ValueCountFrequency (%)
1833424
100.0%
ValueCountFrequency (%)
1833424
100.0%

Most occurring characters

ValueCountFrequency (%)
13424
33.3%
83424
33.3%
33424
33.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number10272
100.0%

Most frequent character per category

ValueCountFrequency (%)
13424
33.3%
83424
33.3%
33424
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common10272
100.0%

Most frequent character per script

ValueCountFrequency (%)
13424
33.3%
83424
33.3%
33424
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII10272
100.0%

Most frequent character per block

ValueCountFrequency (%)
13424
33.3%
83424
33.3%
33424
33.3%

accuracy
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
1
3271 
0
 
153

Characters and Unicode

Total characters3424
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
13271
95.5%
0153
 
4.5%
ValueCountFrequency (%)
13271
95.5%
0153
 
4.5%

Most occurring characters

ValueCountFrequency (%)
13271
95.5%
0153
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
13271
95.5%
0153
 
4.5%

Most occurring scripts

ValueCountFrequency (%)
Common3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
13271
95.5%
0153
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
13271
95.5%
0153
 
4.5%

uid
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
1
3424 

Characters and Unicode

Total characters3424
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
13424
100.0%
ValueCountFrequency (%)
13424
100.0%

Most occurring characters

ValueCountFrequency (%)
13424
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
13424
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
13424
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
13424
100.0%

compatible
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
1
1740 
0
1684 

Characters and Unicode

Total characters3424
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
11740
50.8%
01684
49.2%
ValueCountFrequency (%)
11740
50.8%
01684
49.2%

Most occurring characters

ValueCountFrequency (%)
11740
50.8%
01684
49.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
11740
50.8%
01684
49.2%

Most occurring scripts

ValueCountFrequency (%)
Common3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
11740
50.8%
01684
49.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
11740
50.8%
01684
49.2%

gamecount
Real number (ℝ≥0)

Distinct49
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29.18545561
Minimum1
Maximum60
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:37.910589image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q115
median29
Q343
95-th percentile55
Maximum60
Range59
Interquartile range (IQR)28

Descriptive statistics

Standard deviation16.32928073
Coefficient of variation (CV)0.5595006277
Kurtosis-1.092027542
Mean29.18545561
Median Absolute Deviation (MAD)14
Skewness0.0729068468
Sum99931
Variance266.6454092
MonotocityNot monotonic
2021-02-26T00:47:38.031912image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
5580
 
2.3%
5480
 
2.3%
5180
 
2.3%
4979
 
2.3%
4178
 
2.3%
2478
 
2.3%
2278
 
2.3%
1978
 
2.3%
4377
 
2.2%
6077
 
2.2%
Other values (39)2639
77.1%
ValueCountFrequency (%)
140
1.2%
253
1.5%
347
1.4%
455
1.6%
565
1.9%
665
1.9%
764
1.9%
849
1.4%
959
1.7%
1063
1.8%
ValueCountFrequency (%)
6077
2.2%
5676
2.2%
5580
2.3%
5480
2.3%
5276
2.2%
5180
2.3%
4979
2.3%
4772
2.1%
4671
2.1%
4574
2.2%

totalcount
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size227.4 KiB
125
3424 

Characters and Unicode

Total characters10272
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row125
2nd row125
3rd row125
4th row125
5th row125
ValueCountFrequency (%)
1253424
100.0%
ValueCountFrequency (%)
1253424
100.0%

Most occurring characters

ValueCountFrequency (%)
13424
33.3%
23424
33.3%
53424
33.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number10272
100.0%

Most frequent character per category

ValueCountFrequency (%)
13424
33.3%
23424
33.3%
53424
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common10272
100.0%

Most frequent character per script

ValueCountFrequency (%)
13424
33.3%
23424
33.3%
53424
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII10272
100.0%

Most frequent character per block

ValueCountFrequency (%)
13424
33.3%
23424
33.3%
53424
33.3%

agebin
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
2
3424 

Characters and Unicode

Total characters3424
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2
ValueCountFrequency (%)
23424
100.0%
ValueCountFrequency (%)
23424
100.0%

Most occurring characters

ValueCountFrequency (%)
23424
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
23424
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
23424
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
23424
100.0%

trialtypecount
Real number (ℝ≥0)

Distinct13
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.653621495
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:38.555639image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum13
Range12
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.33188439
Coefficient of variation (CV)0.6382391807
Kurtosis0.5117298265
Mean3.653621495
Median Absolute Deviation (MAD)2
Skewness0.9322119801
Sum12510
Variance5.437684807
MonotocityNot monotonic
2021-02-26T00:47:38.650201image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1684
20.0%
2635
18.5%
3571
16.7%
4472
13.8%
5360
10.5%
6262
 
7.7%
7185
 
5.4%
8122
 
3.6%
966
 
1.9%
1037
 
1.1%
Other values (3)30
 
0.9%
ValueCountFrequency (%)
1684
20.0%
2635
18.5%
3571
16.7%
4472
13.8%
5360
10.5%
6262
 
7.7%
7185
 
5.4%
8122
 
3.6%
966
 
1.9%
1037
 
1.1%
ValueCountFrequency (%)
135
 
0.1%
128
 
0.2%
1117
 
0.5%
1037
 
1.1%
966
 
1.9%
8122
 
3.6%
7185
 
5.4%
6262
7.7%
5360
10.5%
4472
13.8%

rtsum
Real number (ℝ≥0)

Distinct2478
Distinct (%)72.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2942.023364
Minimum212
Maximum14052
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:38.757355image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum212
5-th percentile747
Q11472.75
median2501.5
Q34017.75
95-th percentile6589.25
Maximum14052
Range13840
Interquartile range (IQR)2545

Descriptive statistics

Standard deviation1925.73271
Coefficient of variation (CV)0.6545606446
Kurtosis1.708100294
Mean2942.023364
Median Absolute Deviation (MAD)1193
Skewness1.17350725
Sum10073488
Variance3708446.47
MonotocityNot monotonic
2021-02-26T00:47:38.877807image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8157
 
0.2%
8226
 
0.2%
8956
 
0.2%
7786
 
0.2%
14856
 
0.2%
7255
 
0.1%
7145
 
0.1%
9275
 
0.1%
8345
 
0.1%
7925
 
0.1%
Other values (2468)3368
98.4%
ValueCountFrequency (%)
2121
< 0.1%
2181
< 0.1%
2531
< 0.1%
2851
< 0.1%
3261
< 0.1%
3401
< 0.1%
3441
< 0.1%
3791
< 0.1%
3841
< 0.1%
3871
< 0.1%
ValueCountFrequency (%)
140521
< 0.1%
130511
< 0.1%
121641
< 0.1%
119921
< 0.1%
114291
< 0.1%
114111
< 0.1%
111931
< 0.1%
109691
< 0.1%
106581
< 0.1%
105741
< 0.1%

runlength
Real number (ℝ≥0)

Distinct13
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.277161215
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:38.982672image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q14
median6
Q38
95-th percentile11
Maximum13
Range12
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.568374026
Coefficient of variation (CV)0.4091617115
Kurtosis-0.2973719233
Mean6.277161215
Median Absolute Deviation (MAD)2
Skewness0.3020572337
Sum21493
Variance6.596545138
MonotocityNot monotonic
2021-02-26T00:47:39.069148image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
5527
15.4%
7492
14.4%
6426
12.4%
8420
12.3%
4405
11.8%
3300
8.8%
9266
7.8%
10200
 
5.8%
2148
 
4.3%
1193
 
2.7%
Other values (3)147
 
4.3%
ValueCountFrequency (%)
151
 
1.5%
2148
 
4.3%
3300
8.8%
4405
11.8%
5527
15.4%
6426
12.4%
7492
14.4%
8420
12.3%
9266
7.8%
10200
 
5.8%
ValueCountFrequency (%)
1362
 
1.8%
1234
 
1.0%
1193
 
2.7%
10200
 
5.8%
9266
7.8%
8420
12.3%
7492
14.4%
6426
12.4%
5527
15.4%
4405
11.8%

maxrtblock
Real number (ℝ≥0)

Distinct680
Distinct (%)19.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5005.564836
Minimum682
Maximum14052
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:39.167700image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum682
5-th percentile1735.1
Q13414
median4833
Q36292.75
95-th percentile9050
Maximum14052
Range13370
Interquartile range (IQR)2878.75

Descriptive statistics

Standard deviation2205.264083
Coefficient of variation (CV)0.4405624849
Kurtosis0.838521816
Mean5005.564836
Median Absolute Deviation (MAD)1437
Skewness0.7053398591
Sum17139054
Variance4863189.674
MonotocityNot monotonic
2021-02-26T00:47:39.291279image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
723520
 
0.6%
452117
 
0.5%
673317
 
0.5%
607717
 
0.5%
565715
 
0.4%
798915
 
0.4%
691715
 
0.4%
509714
 
0.4%
486713
 
0.4%
482613
 
0.4%
Other values (670)3268
95.4%
ValueCountFrequency (%)
6821
< 0.1%
6841
< 0.1%
6851
< 0.1%
6951
< 0.1%
7031
< 0.1%
7121
< 0.1%
7131
< 0.1%
7152
0.1%
7251
< 0.1%
7311
< 0.1%
ValueCountFrequency (%)
1405212
0.4%
1216410
0.3%
1142912
0.4%
114118
0.2%
106589
0.3%
1055113
0.4%
1042612
0.4%
103349
0.3%
1031812
0.4%
98408
0.2%

runlengthprev
Real number (ℝ≥0)

MISSING

Distinct13
Distinct (%)0.4%
Missing242
Missing (%)7.1%
Infinite0
Infinite (%)0.0%
Mean5.05185418
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:39.392368image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q37
95-th percentile10
Maximum13
Range12
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.517617526
Coefficient of variation (CV)0.4983551457
Kurtosis-0.07733253587
Mean5.05185418
Median Absolute Deviation (MAD)2
Skewness0.5286485988
Sum16075
Variance6.338398007
MonotocityNot monotonic
2021-02-26T00:47:39.478639image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
5522
15.2%
4482
14.1%
3446
13.0%
7365
10.7%
2324
9.5%
6316
9.2%
8225
6.6%
1190
 
5.5%
9147
 
4.3%
1085
 
2.5%
Other values (3)80
 
2.3%
(Missing)242
7.1%
ValueCountFrequency (%)
1190
 
5.5%
2324
9.5%
3446
13.0%
4482
14.1%
5522
15.2%
6316
9.2%
7365
10.7%
8225
6.6%
9147
 
4.3%
1085
 
2.5%
ValueCountFrequency (%)
1325
 
0.7%
127
 
0.2%
1148
 
1.4%
1085
 
2.5%
9147
 
4.3%
8225
6.6%
7365
10.7%
6316
9.2%
5522
15.2%
4482
14.1%

rtsumprev
Real number (ℝ≥0)

Distinct639
Distinct (%)18.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3744.828855
Minimum20
Maximum14052
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:39.578384image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile20
Q12217.25
median3612
Q35096
95-th percentile7660.6
Maximum14052
Range14032
Interquartile range (IQR)2878.75

Descriptive statistics

Standard deviation2273.316357
Coefficient of variation (CV)0.6070548069
Kurtosis0.6372573805
Mean3744.828855
Median Absolute Deviation (MAD)1442
Skewness0.6225977513
Sum12822294
Variance5167967.261
MonotocityNot monotonic
2021-02-26T00:47:39.690330image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20242
 
7.1%
173316
 
0.5%
338015
 
0.4%
228314
 
0.4%
493414
 
0.4%
582313
 
0.4%
365713
 
0.4%
282613
 
0.4%
351113
 
0.4%
243712
 
0.4%
Other values (629)3059
89.3%
ValueCountFrequency (%)
20242
7.1%
6826
 
0.2%
6841
 
< 0.1%
6855
 
0.1%
6952
 
0.1%
7032
 
0.1%
7123
 
0.1%
7136
 
0.2%
7152
 
0.1%
7313
 
0.1%
ValueCountFrequency (%)
140523
 
0.1%
121643
 
0.1%
114296
0.2%
114119
0.3%
106586
0.2%
105513
 
0.1%
104266
0.2%
103342
 
0.1%
103186
0.2%
98402
 
0.1%

isswitch
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
0
2782 
1
642 

Characters and Unicode

Total characters3424
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
02782
81.2%
1642
 
18.8%
ValueCountFrequency (%)
02782
81.2%
1642
 
18.8%

Most occurring characters

ValueCountFrequency (%)
02782
81.2%
1642
 
18.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
02782
81.2%
1642
 
18.8%

Most occurring scripts

ValueCountFrequency (%)
Common3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
02782
81.2%
1642
 
18.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
02782
81.2%
1642
 
18.8%

movementd
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
2
897 
3
871 
4
840 
1
816 

Characters and Unicode

Total characters3424
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2897
26.2%
3871
25.4%
4840
24.5%
1816
23.8%
ValueCountFrequency (%)
2897
26.2%
3871
25.4%
4840
24.5%
1816
23.8%

Most occurring characters

ValueCountFrequency (%)
2897
26.2%
3871
25.4%
4840
24.5%
1816
23.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
2897
26.2%
3871
25.4%
4840
24.5%
1816
23.8%

Most occurring scripts

ValueCountFrequency (%)
Common3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
2897
26.2%
3871
25.4%
4840
24.5%
1816
23.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
2897
26.2%
3871
25.4%
4840
24.5%
1816
23.8%

pointingd
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
2
872 
4
853 
3
851 
1
848 

Characters and Unicode

Total characters3424
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2872
25.5%
4853
24.9%
3851
24.9%
1848
24.8%
ValueCountFrequency (%)
2872
25.5%
4853
24.9%
3851
24.9%
1848
24.8%

Most occurring characters

ValueCountFrequency (%)
2872
25.5%
4853
24.9%
3851
24.9%
1848
24.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
2872
25.5%
4853
24.9%
3851
24.9%
1848
24.8%

Most occurring scripts

ValueCountFrequency (%)
Common3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
2872
25.5%
4853
24.9%
3851
24.9%
1848
24.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
2872
25.5%
4853
24.9%
3851
24.9%
1848
24.8%

task
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
2
1777 
1
1647 

Characters and Unicode

Total characters3424
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2
ValueCountFrequency (%)
21777
51.9%
11647
48.1%
ValueCountFrequency (%)
21777
51.9%
11647
48.1%

Most occurring characters

ValueCountFrequency (%)
21777
51.9%
11647
48.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
21777
51.9%
11647
48.1%

Most occurring scripts

ValueCountFrequency (%)
Common3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
21777
51.9%
11647
48.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
21777
51.9%
11647
48.1%

choice
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
2
880 
4
863 
1
851 
3
830 

Characters and Unicode

Total characters3424
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row2
ValueCountFrequency (%)
2880
25.7%
4863
25.2%
1851
24.9%
3830
24.2%
ValueCountFrequency (%)
2880
25.7%
4863
25.2%
1851
24.9%
3830
24.2%

Most occurring characters

ValueCountFrequency (%)
2880
25.7%
4863
25.2%
1851
24.9%
3830
24.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
2880
25.7%
4863
25.2%
1851
24.9%
3830
24.2%

Most occurring scripts

ValueCountFrequency (%)
Common3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
2880
25.7%
4863
25.2%
1851
24.9%
3830
24.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
2880
25.7%
4863
25.2%
1851
24.9%
3830
24.2%

rlprev
Real number (ℝ≥0)

Distinct14
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.108352804
Minimum1
Maximum20
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:40.259368image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q37
95-th percentile20
Maximum20
Range19
Interquartile range (IQR)4

Descriptive statistics

Standard deviation4.535535065
Coefficient of variation (CV)0.7425136057
Kurtosis3.505809002
Mean6.108352804
Median Absolute Deviation (MAD)2
Skewness1.904584971
Sum20915
Variance20.57107833
MonotocityNot monotonic
2021-02-26T00:47:40.343079image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
5522
15.2%
4482
14.1%
3446
13.0%
7365
10.7%
2324
9.5%
6316
9.2%
20242
7.1%
8225
6.6%
1190
 
5.5%
9147
 
4.3%
Other values (4)165
 
4.8%
ValueCountFrequency (%)
1190
 
5.5%
2324
9.5%
3446
13.0%
4482
14.1%
5522
15.2%
6316
9.2%
7365
10.7%
8225
6.6%
9147
 
4.3%
1085
 
2.5%
ValueCountFrequency (%)
20242
7.1%
1325
 
0.7%
127
 
0.2%
1148
 
1.4%
1085
 
2.5%
9147
 
4.3%
8225
6.6%
7365
10.7%
6316
9.2%
5522
15.2%

newuid
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size220.7 KiB
1
3424 

Characters and Unicode

Total characters3424
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
13424
100.0%
ValueCountFrequency (%)
13424
100.0%

Most occurring characters

ValueCountFrequency (%)
13424
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3424
100.0%

Most frequent character per category

ValueCountFrequency (%)
13424
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common3424
100.0%

Most frequent character per script

ValueCountFrequency (%)
13424
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII3424
100.0%

Most frequent character per block

ValueCountFrequency (%)
13424
100.0%

trialtypecount2
Real number (ℝ≥0)

Distinct8
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.581191589
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Memory size53.5 KiB
2021-02-26T00:47:40.517721image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum8
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.149769756
Coefficient of variation (CV)0.6002945395
Kurtosis-0.70981439
Mean3.581191589
Median Absolute Deviation (MAD)2
Skewness0.5819494996
Sum12262
Variance4.621510002
MonotocityNot monotonic
2021-02-26T00:47:40.597997image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1684
20.0%
2635
18.5%
3571
16.7%
4472
13.8%
5360
10.5%
6262
 
7.7%
8255
 
7.4%
7185
 
5.4%
ValueCountFrequency (%)
1684
20.0%
2635
18.5%
3571
16.7%
4472
13.8%
5360
10.5%
6262
 
7.7%
7185
 
5.4%
8255
 
7.4%
ValueCountFrequency (%)
8255
 
7.4%
7185
 
5.4%
6262
 
7.7%
5360
10.5%
4472
13.8%
3571
16.7%
2635
18.5%
1684
20.0%
\ No newline at end of file