Example_autoclean

[4]:
import datacleanbot.dataclean as dc
import openml as oml
import numpy as np
[5]:
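# Fetch OpenML dataset 51 and split it into features X and target y
data = oml.datasets.get_dataset(51)
X, y, categorical_indicator, features = data.get_data(
    target=data.default_target_attribute, dataset_format='array'
)
# Append the target as the last column so features and target are cleaned together
Xy = np.concatenate((X, y.reshape((y.shape[0], 1))), axis=1)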
[6]:
# run the interactive cleaning pipeline on the combined feature/target array
Xy = dc.autoclean(Xy, data.name, features)

Important Features

_images/Example_autoclean_3_1.png

Statistical Information

                0    1           2           3           4           5           6           7           8           9          10          11          12          13
count  294.000000  3.0  294.000000  271.000000  293.000000  286.000000  294.000000  293.000000  294.000000  104.000000   28.000000  293.000000  293.000000  294.000000
mean    47.826531  0.0    1.867347  250.848708    0.303754    0.930070    0.586054    1.156997    0.724490    1.105769    1.035714  139.129693  132.583618    0.360544
std      7.811812  0.0    0.956077   67.657711    0.460665    0.255476    0.908648    0.417011    0.447533    0.338995    0.881167   23.589749   17.626568    0.480977
min     28.000000  0.0    0.000000   85.000000    0.000000    0.000000    0.000000    0.000000    0.000000    0.000000    0.000000   82.000000   92.000000    0.000000
25%     42.000000  0.0    1.000000  209.000000    0.000000    1.000000    0.000000    1.000000    0.000000    1.000000    0.000000  122.000000  120.000000    0.000000
50%     49.000000  0.0    2.000000  243.000000    0.000000    1.000000    0.000000    1.000000    1.000000    1.000000    1.000000  140.000000  130.000000    0.000000
75%     54.000000  0.0    3.000000  282.500000    1.000000    1.000000    1.000000    1.000000    1.000000    1.000000    2.000000  155.000000  140.000000    1.000000
max     66.000000  0.0    3.000000  603.000000    1.000000    1.000000    5.000000    2.000000    1.000000    2.000000    2.000000  190.000000  200.000000    1.000000

Discover Data Types

Simple Data Types

['int64', 'int64', 'int64', 'int64', 'int64', 'int64', 'int64', 'int64', 'bool', 'float64', 'float64', 'int64', 'int64', 'bool']

Statistical Data Types

['Type.POSITIVE', 'Type.CATEGORICAL', 'Type.CATEGORICAL', 'Type.POSITIVE', 'Type.COUNT', 'Type.CATEGORICAL', 'Type.POSITIVE', 'Type.COUNT', 'Type.COUNT', 'Type.CATEGORICAL', 'Type.CATEGORICAL', 'Type.POSITIVE', 'Type.POSITIVE', 'Type.CATEGORICAL']

Duplicated Rows

Identifying Duplicated Rows ...

Duplicated rows are detected.

       0   1    2   3    4    5    6    7    8   9   10     11     12   13
101  49.0 NaN  3.0 NaN  0.0  1.0  0.0  1.0  0.0 NaN NaN  160.0  110.0  0.0
102  49.0 NaN  3.0 NaN  0.0  1.0  0.0  1.0  0.0 NaN NaN  160.0  110.0  0.0

Do you want to drop the duplicated rows? [y/n]y

Duplicated rows are dropped.
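
The duplicate check above can be approximated outside `autoclean` with plain pandas. The sketch below is an illustration, not datacleanbot's own implementation; the toy frame `df` and its values are made up. pandas treats NaN values in the same position as equal when comparing rows, which is consistent with rows 101 and 102 above being reported as duplicates despite their NaN entries.

```python
import numpy as np
import pandas as pd

# Toy data (hypothetical values): rows 1 and 2 are identical, including their NaNs
df = pd.DataFrame(
    [
        [48.0, np.nan, 3.0],
        [49.0, np.nan, 3.0],
        [49.0, np.nan, 3.0],
    ]
)

# keep=False marks every member of a duplicate group, so both copies are listed
duplicates = df[df.duplicated(keep=False)]
print(duplicates)

# drop_duplicates keeps the first occurrence and removes the later copies
deduplicated = df.drop_duplicates()
print(len(df), "->", len(deduplicated))
```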

Inconsistent Column Names


Column names
============
['age', 'sex', 'chest_pain', 'trestbps', 'chol', 'fbs', 'restecg', 'thalach', 'exang', 'oldpeak', 'slope', 'ca', 'thal']

Column names are consistent

Missing values

Identify Missing Data ...

The default setting of missing characters is ['n/a', 'na', '--', '?']
Do you want to add extra character? [y/n]n

Missing values detected!

Number of missing in each feature
0       0
1     290
2       0
3      22
4       1
5       8
6       0
7       1
8       0
9     189
10    265
11      1
12      1
13      0
dtype: int64
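
A per-feature count like the one above can be sketched with pandas by treating both NaN and the listed placeholder strings as missing. This is an assumption about the general technique, not a copy of what `autoclean` does internally; the toy frame and the names `missing_markers` and `is_missing` are hypothetical.

```python
import numpy as np
import pandas as pd

# Default markers reported in the output above
missing_markers = ['n/a', 'na', '--', '?']

# Toy data (hypothetical values) mixing NaN with textual placeholders
df = pd.DataFrame({
    0: [28.0, 29.0, 30.0, 31.0],
    1: ['?', 'na', 1.0, '--'],
    2: [132.0, np.nan, 237.0, 'n/a'],
})

# Treat a cell as missing if it is NaN or equals one of the markers
is_missing = df.isna() | df.isin(missing_markers)

# Number of missing values in each feature
missing_per_feature = is_missing.sum()
print(missing_per_feature)
```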

Records containing missing values:
      0    1    2      3    4    5    6    7    8    9   10     11     12   13
0  28.0  NaN  3.0  132.0  0.0  1.0  0.0  0.0  1.0  NaN  NaN  185.0  130.0  0.0
1  29.0  NaN  3.0  243.0  0.0  1.0  0.0  1.0  1.0  NaN  NaN  160.0  120.0  0.0
2  29.0  NaN  3.0    NaN  0.0  1.0  0.0  1.0  1.0  NaN  NaN  170.0  140.0  0.0
3  30.0  NaN  0.0  237.0  0.0  1.0  0.0  2.0  0.0  NaN  0.0  170.0  170.0  0.0
4  31.0  NaN  3.0  219.0  0.0  1.0  0.0  2.0  0.0  NaN  NaN  150.0  100.0  0.0

Missing correlation between features containing missing values and other features
            1          3          4          5          7          9         10         11         12
0   -0.054393   0.019737   0.001330  -0.014961   0.053771  -0.234171   0.001532   0.001330   0.001330
1    1.000000  -0.099671   0.005952   0.017041   0.005952  -0.004595   0.082259   0.005952   0.005952
2    0.020988   0.067940   0.069733   0.023981  -0.114337   0.342524   0.075190   0.069733   0.069733
3   -0.099671   1.000000  -0.016674  -0.047736  -0.016674   0.076025   0.048563  -0.016674  -0.016674
4    0.005952  -0.016674   1.000000  -0.009805  -0.003425  -0.078890   0.019022   1.000000   1.000000
5    0.017041  -0.047736  -0.009805   1.000000  -0.009805  -0.007021  -0.088011  -0.009805  -0.009805
6    0.009863  -0.077552   0.091000  -0.039312  -0.037900  -0.841642   0.005952   0.091000   0.091000
7    0.005952  -0.016674  -0.003425  -0.009805   1.000000   0.043410   0.019022  -0.003425  -0.003425
8   -0.062333   0.000198  -0.095489  -0.085351   0.035864  -0.038358  -0.016808  -0.095489  -0.095489
9   -0.004595   0.076025  -0.078890  -0.007021   0.043410   1.000000   0.025752  -0.078890  -0.078890
10   0.082259   0.048563   0.019022  -0.088011   0.019022   0.025752   1.000000   0.019022   0.019022
11   0.005952  -0.016674   1.000000  -0.009805  -0.003425  -0.078890   0.019022   1.000000   1.000000
12   0.005952  -0.016674   1.000000  -0.009805  -0.003425  -0.078890   0.019022   1.000000   1.000000
Missing mechanism is probably missing at random
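
One common way to probe the missing mechanism is to correlate a missingness indicator with the other features: a correlation far from zero suggests that missingness depends on observed data (MAR) rather than being completely random (MCAR). The sketch below shows that idea on made-up data; the column names are hypothetical, and the exact computation behind the table above is not shown in this example, so it may differ.

```python
import numpy as np
import pandas as pd

# Toy data (hypothetical): 'chol' tends to be missing for older patients
df = pd.DataFrame({
    'age':  [30.0, 35.0, 40.0, 45.0, 60.0, 65.0],
    'chol': [210.0, 190.0, 230.0, 250.0, np.nan, np.nan],
})

# 1.0 where 'chol' is missing, 0.0 where it is observed
chol_missing = df['chol'].isna().astype(float)

# Pearson correlation between the missingness indicator and another feature
corr = df['age'].corr(chol_missing)
print(round(corr, 3))
```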

Visualize Missing Data ...


_images/Example_autoclean_3_27.png
_images/Example_autoclean_3_28.png
_images/Example_autoclean_3_29.png

Clean Missing Data ...

Feature [1, 10] has extreme large proportion of missing data
Do you want to delete the above features? [y/n]y

Choose the missing mechanism [a/b/c/d]:
a.MCAR b.MAR c.MNAR d.Skip
b
Imputation score of knn is 0.7567397233586597
Imputation score of matrix factorization is 0.7567397233586597
Imputation score of multiple imputation is 0.8122681667640756
Imputation method with the highest score is multiple imputation

Recommended Approach!
The recommended approach is multiple imputation
Do you want to apply the recommended approach? [y/n]y

Applying multiple imputation ...
Missing values cleaned!
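
The output above ranks imputation methods by a score, but it does not show how that score is computed. A plausible way to compare imputers, sketched below with scikit-learn, is to measure the cross-validated accuracy of a downstream classifier after each imputation. Everything here is an assumption for illustration: the synthetic data, the choice of imputers (scikit-learn's `SimpleImputer`, `KNNImputer` and `IterativeImputer`, not the methods `autoclean` runs), and the classifier.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.experimental import enable_iterative_imputer  # noqa: F401 (enables IterativeImputer)
from sklearn.impute import IterativeImputer, KNNImputer, SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

# Synthetic classification data with about 10% of the cells removed at random
X, y = make_classification(n_samples=200, n_features=6, random_state=0)
rng = np.random.default_rng(0)
X[rng.random(X.shape) < 0.1] = np.nan

# Candidate imputers (hypothetical selection for this sketch)
imputers = {
    'mean': SimpleImputer(strategy='mean'),
    'knn': KNNImputer(n_neighbors=5),
    'iterative': IterativeImputer(random_state=0),
}

# Score each imputer by the cross-validated accuracy of a downstream classifier
scores = {}
for name, imputer in imputers.items():
    model = make_pipeline(imputer, LogisticRegression(max_iter=1000))
    scores[name] = cross_val_score(model, X, y, cv=5).mean()

# The recommended method is simply the one with the highest score
best = max(scores, key=scores.get)
print(scores, best)
```

Putting the imputer inside the pipeline means it is refit on each training fold, so the held-out fold never influences the imputed values.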

Outliers

Recommend Algorithm ...

The recommended approach is isolation forest.
Do you want to apply the recommended outlier detection approach? [y/n]y

Visualize Outliers ...

_images/Example_autoclean_3_38.png
0 1 2 3 4 5 6 7 8 9 10 11 anomaly_score
232 48 1 275 1 0 2 2 1 0 150 122 1 -0.119937
254 46 0 272 0 0 2 1 1 1 175 140 1 -0.0797964
275 59 1 264 1 0 0 0 1 0.944132 119 140 1 -0.0689509
90 48 3 308 0.141883 1 2 2 0 2 147.257 139.446 0 -0.068584
220 59 1 338 1 0 1.5 2 0 1 130 130 1 -0.0657597
291 58 3 393 1 1 1 1 0 1 110 180 1 -0.0580726
248 58 2 211 0 0 0 2 1 1.15535 92 160 1 -0.0565477
3 30 0 237 0 1 0 2 0 1.32164 170 170 0 -0.0521636
117 51 2 220 1 1 2 1 0 2 160 130 0 -0.0464553
223 65 1 306 1 0 1.5 1 1 1 87 140 1 -0.0440714
94 48 1 163 0 1 2 1 0 2 175 108 0 -0.0405723
268 55 3 292 1 0 2 1 1 1 143 160 1 -0.0397377
171 57 1 347 1 1 0.8 2 0 1 126 180 0 -0.0360816
242 54 1 603 1 0 1 1 1 1 125 130 1 -0.0355075
146 54 0 171 0 1 2 1 1 2 137 120 0 -0.0348489
276 65 1 263 1 0 2 1 1 1 112 170 1 -0.0329883
273 58 3 164 1 1 2 2 1 1 99 136 1 -0.0309242
12 35 0 160 0 1 0 2 0 1.31709 185 120 0 -0.0297764
154 54 1 365 0 1 1 2 1 2 134 150 0 -0.0297346
157 55 3 394 0 1 0 0 0 1.43866 150 130 0 -0.0280174
289 54 2 294 1 1 0 2 0 1 100 130 1 -0.026562
0 28 3 132 0 1 0 0 1 1.42619 185 130 0 -0.0263437
31 39 3 224.323 0 1 2 2 1 2 146 120 0 -0.0256593
263 52 1 246 1 1 4 2 1 1 82 160 1 -0.0254428
185 62 0 193 0 1 0 1 0 1.36576 116 160 0 -0.0246483
170 57 0 308 0 1 1 1 0 1 98 130 0 -0.0235655
227 40 1 392 0 1 2 1 0 1 130 150 1 -0.0210837
290 56 1 342 1 0 3 1 1 1 150 155 1 -0.0182647
130 53 3 468 0 0.864092 0 1 0 1.37019 127 113 0 -0.0181947
91 48 3 256.72 0 0 0 2 0 1.3819 148 120 0 -0.0151125
285 50 1 231 1 1 5 2 1 1 140 140 1 -0.0115976
183 61 1 294 1 1 1 2 0 1 120 130 0 -0.0100509
255 47 2 248 0 0 0 1 0 1.16329 170 135 1 -0.00722342
37 39 2 147 0 0 0 1 1 1.36338 160 160 0 -0.00674078
89 47 1 276 1 0 0 1 1 1.11308 125 140 0 -0.00241484
195 38 1 117 1 1 2.5 1 1 1 134 92 1 -0.00238595
168 56 2 276 1 1 1 1 1 2 128 130 0 -0.00109016
150 54 3 195 0 1 1 2 1 2 130 160 0 -0.000929664
205 48 1 263 0 0 0 1 1 1.00107 110 106 1 0.00114704
250 41 1 172 0 1 2 2 1 1 130 130 1 0.00212158
196 40 1 466 1 0.86451 1 1 1 1 152 120 1 0.00309686
188 33 1 246 1 1 1 1 0 1 150 100 1 0.00348012
118 51 2 200 0 1 0.5 1 0 2 120 150 0 0.0043416
131 53 3 216 1 1 2 1 0 1 142 140 0 0.00484593
224 32 1 529 0 1 0 1 1 0.99009 130 118 1 0.00797082
282 47 1 291 1 1 3 2 1 1 158 160 1 0.010827
246 56 1 213 1 0 1 1 1 1 125 150 1 0.011012
74 45 3 224 0 0 0 1 1 1.35054 122 140 0 0.0124018
281 47 1 205 1 1 2 1 0 1 98 120 1 0.013517
140 54 3 230 0 0 0 1 0 1.38866 140 120 0 0.013915
252 44 3 288 1 1 3 1 1 1 150 150 1 0.0143166
59 43 0 223 0 1 0 1 0 1.24638 142 100 0 0.0148435
228 43 0 291 0 1 0 2 1 1.07937 155 120 1 0.0149449
84 46 1 280 0 1 0 2 1 1.36972 120 180 0 0.0158468
22 37 1 173 0 1 0 2 0 1.38258 184 130 0 0.0158852
14 35 3 308 0 1 0 0 1 1.39788 180 120 0 0.0173538
4 31 3 219 0 1 0 2 0 1.3727 150 100 0 0.0187209
272 56 1 388 1 1 2 2 1 1 122 170 1 0.0187329
184 61 1 292 1 1 0 2 1 1.21704 115 125 0 0.0191592
218 57 3 265 1 1 1 2 1 1 145 140 1 0.0224086
186 62 3 271 0 1 1 1 1 2 152 140 0 0.0224566
265 53 1 285 1 1 1.5 2 1 1 120 180 1 0.0235483
158 55 3 256 0 0 0 1 1 1.37734 137 120 0 0.0238808
35 39 3 241 0 1 0 1 1 1.43285 106 190 0 0.0244046
172 57 3 260 0 0 0 1 1 1.41142 140 140 0 0.0244428
17 36 2 340 0 1 1 1 1 1 184 112 0 0.0256382
260 52 1 342 1 1 1 2 1 1 96 112 1 0.0256705
213 51 1 303 1 1 1 1 0 1 150 160 1 0.0263497
191 36 3 267 0 1 3 1 1 1 160 120 1 0.0272168
244 54 1 198 1 1 2 1 1 1 142 200 1 0.0278133
165 56 2 219 0 0.904268 0 2 0 1.46569 164 130 0 0.0278579
112 50 3 209 0 1 0 2 1 1.48383 116 170 0 0.0279521
109 50 1 328 1 1 1 1 0 1 110 120 0 0.0285039
125 52 1 180 1 1 1.5 1 0 1 140 130 0 0.0286704
143 54 3 309 0 0.889135 0 2 0 1.47009 140 140 0 0.0291233
23 37 3 283 0 1 0 2 1 1.34845 98 130 0 0.0291719
127 52 3 100 1 1 0 1 1 1.356 138 140 0 0.0303691
85 47 3 257 0 1 1 1 0 2 135 140 0 0.030758
72 45 3 244.979 0 1 0 1 0 1.53949 180 180 0 0.0310886
67 44 1 218 0 1 0 2 0 1.30464 115 120 0 0.0312676
141 54 3 273 0 1 1.5 1 0 1 150 120 0 0.0313078
95 48 1 254 0 1 0 2 0 1.30643 110 120 0 0.0314005
155 55 3 344 0 1 0 2 0 1.46327 160 110 0 0.0316206
189 34 0 156 0 1 0 1 1 1.1145 180 140 1 0.0317022
256 48 1 214 1 1 1.5 1 0 1 108 138 1 0.0319747
264 53 2 518 0 1 0 1 1 1.15593 130 145 1 0.0321085
288 52 1 331 1 1 2.5 1 1 0.96057 94 160 1 0.0322196
30 39 2 182 0 1 0 2 0 1.41003 180 110 0 0.0322735
211 50 2 288 1 1 0 1 0 1.06802 140 140 1 0.0325019
292 65 1 275 1 1 1 2 1 1 115 130 1 0.032828
136 53 1 260 1 1 3 2 1 1 112 124 0 0.0339107
247 57 1 255 1 1 3 1 1 1 92 150 1 0.0339626
139 54 3 221 0 1 1 1 0 2 138 120 0 0.0344146
32 39 3 200 1 1 1 1 1 1 160 120 0 0.0366313
177 59 3 188 0 1 1 1 0 1 124 130 0 0.0368223
86 47 2 241.057 0 1 2 1 0 1 145 130 0 0.0369351
208 49 2 180 0 1 1 1 0 1 156 160 1 0.0377091
78 46 1 238 0 1 0 1 0 1.2769 90 130 0 0.0380135
278 41 1 336 1 1 3 1 1 1 118 120 1 0.0391171
180 59 2 213 0 1 0 1 1 1.44776 100 180 0 0.0392088
182 60 2 246 0 1 0 0 1 1.40395 135 120 0 0.0408451
79 46 3 275 1 1 0 1 1 1.32789 165 140 0 0.0412702
233 48 1 193 1 1 3 1 1 1 102 160 1 0.04237
96 48 1 227 1 1 1 1 0 1 130 150 0 0.0457815
286 50 1 341 1 1 2.5 2 1 1 125 140 1 0.0464844
277 66 1 276.836 1 1 1 1 1 1 94 140 1 0.0468952
267 55 0 295 0 1 0 1.11432 1 1.1145 136 140 1 0.0469683
61 43 3 215 0 1 0 2 0 1.47417 175 120 0 0.0490634
9 34 3 161 0 1 0 1 0 1.46804 190 130 0 0.0499133
234 48 1 329 1 1 1.5 1 1 1 92 160 1 0.049914
10 34 3 214 0 1 0 2 1 1.46008 168 150 0 0.050185
82 46 1 238 1 1 1 2 1 1 140 110 0 0.0513488
270 56 3 279 0 1 1 1 0 1 150 120 1 0.0514012
221 60 1 248 0 1 1 1 1 1 125 100 1 0.0528763
280 44 1 491 0 1 0 1 1 1.07103 135 135 1 0.0529694
235 48 1 355 1 1 2 1 1 1 99 160 1 0.0532494
271 56 1 230 1 1 1.5 2 1 1 124 150 1 0.053556
162 55 2 220 0 1 0 0 1 1.38884 134 120 0 0.0541677
62 43 3 249 0 1 0 2 0 1.46809 176 120 0 0.0545522
103 49 2 207 0 1 0 2 0 1.41247 135 130 0 0.0554622
229 45 1 219 1 1 1 2 1 1 130 130 1 0.057132
52 42 2 211 0 1 0 2 0 1.36915 137 115 0 0.05732
45 41 3 250 0 1 0 2 0 1.40704 142 110 0 0.0591437
106 49 1 297 0 0.93087 1 1 1 1 132 120 0 0.0593337
70 44 1 412 0 1 0 1 1 1.34646 170 150 0 0.0608182
187 31 1 270 1 1 1.5 1 1 1 153 120 1 0.0609306
284 49 1 222 0 1 2 1 1 1 122 150 1 0.0609557
142 54 3 253 0 1 0 2 0 1.49633 155 130 0 0.0612764
266 54 1 216 0 1 1.5 1 1 1 105 140 1 0.0616474
145 54 3 312 0 1 0 1 0 1.47594 130 160 0 0.0620174
115 51 3 194 0 1 0 1 0 1.53814 170 160 0 0.0643856
219 58 2 213 0 1 0 2 1 1.24797 140 130 1 0.0646421
259 51 2 160 0 1 2 1 1 1 150 135 1 0.0650465
56 42 2 228 1 1 1.5 1 1 1 152 120 0 0.0666529
217 55 1 201 1 1 3 1 1 1 130 140 1 0.0689589
151 54 3 305 0 1 0 1 1 1.5261 175 160 0 0.0689688
13 35 1 167 0 1 0 1 0 1.33391 150 140 0 0.0695931
97 48 3 240.484 0 1 0 1 1 1.35496 100 100 0 0.0698431
179 59 2 318 1 1 1 1 1 1 120 130 0 0.0702111
240 54 2 237 1 1 1.5 1 1 1.06656 150 120 1 0.070729
241 54 1 242 1 1 1 1 1 1 91 130 1 0.0725249
129 52 2 259 0 1 0 2 1 1.46127 170 140 0 0.0725429
198 41 1 237 1 0.939522 1 1 1 1 138 120 1 0.0725942
283 49 1 212 1 1 0 1 1 0.956925 96 128 1 0.0733429
48 41 3 291 0 1 0 2 1 1.42586 160 120 0 0.073357
206 48 1 260 0 1 2 1 1 1 115 120 1 0.0735847
251 43 1 175 1 1 1 1 1 1 120 120 1 0.0736604
216 54 1 224 0 1 2 1 1 1 122 125 1 0.0743512
27 38 3 275 0 1.00804 0 1 0 1.37389 129 120 0 0.0744339
83 46 1 240 0 1 0 2 1 1.32033 140 110 0 0.0748799
253 44 1 290 1 1 2 1 1 1 100 130 1 0.0750852
65 43 2 240.056 0 1 0 1 0 1.44147 175 150 0 0.0752259
5 32 3 198 0 1 0 1 0 1.39257 165 105 0 0.0754935
64 43 3 186 0 1 0 1 0 1.47751 154 150 0 0.0760975
116 51 2 190 0 1 0 1 0 1.36956 120 110 0 0.0768654
68 44 3 184 0 1 1 1 1 1 142 120 0 0.077039
269 55 1 248 1 1 2 1 1 1 96 145 1 0.0784453
222 63 1 223 0 1 0 1 1 1.19586 115 150 1 0.0786586
144 54 3 230 0 1 0 1 0 1.48177 130 150 0 0.0788217
204 47 1 226 1 1 1.5 1 1 1 98 150 1 0.078888
262 52 1 404 1 1 2 1 1 1 124 140 1 0.0790452
46 41 3 184 0 1 0 1 0 1.47235 180 125 0 0.0791359
73 45 1 297 0 1 0 1 0 1.32829 144 132 0 0.0798444
238 52 1 273.523 1 1 1.5 1 1 1 126 170 1 0.0799551
58 42 1 358 0 1 0 1 1 1.3385 170 140 0 0.0804819
114 50 1 215 1 1 0 1 1 1.23782 140 150 0 0.0805382
192 37 1 207 1 1 1.5 1 1 1 130 140 1 0.0807155
169 56 1 85 0 1 0 1 1 1.39164 140 120 0 0.0808652
203 47 2 193 1 1 1 1 1 1 145 140 1 0.0824245
207 48 1 268 1 1 1 1 1 1 103 160 1 0.0824746
8 33 2 298 0 1 0 1 1 1.36098 185 120 0 0.0830886
6 32 3 225 0 1 0 1 1 1.40973 184 110 0 0.0831834
156 55 3 320 0 1 0 1 0 1.46382 155 122 0 0.0851178
87 47 0 249 0 1 0 1 1 1.27185 150 110 0 0.0854966
63 43 3 266 0 1 0 1 0 1.38138 118 120 0 0.0855935
11 34 3 220 0 1 0 1 1 1.3632 150 98 0 0.0861795
199 43 1 247 1 1 2 1 1 1 130 150 1 0.0869482
230 46 1 231 1 1 0 1 1 0.954839 115 120 1 0.0870761
200 46 1 202 1 1 0 1 1 0.991815 150 110 1 0.0879442
132 53 2 274 0 1 0 1 0 1.38323 130 120 0 0.0884486
93 48 2 195 0 1 0 1 0 1.37463 125 120 0 0.0886647
81 46 2 163 0 0.995578 0 1 1 1.39172 116 150 0 0.0887371
88 47 3 263 0 1 0 1 1 1.50662 174 160 0 0.0891336
225 38 1 258.901 1 1 1 1 1 1 150 110 1 0.0905199
166 56 3 184 0 1 0 1 1 1.43349 100 130 0 0.090794
16 36 3 166 0 1 0 1 1 1.44486 180 120 0 0.0917647
104 49 3 253 0 1 0 1 1 1.44605 174 100 0 0.0918137
128 52 3 196 0 1 0 1 1 1.52954 165 160 0 0.0920144
239 53 1 246 1 1 0 1 1 0.980103 116 120 1 0.0920738
124 52 2 272 0 1 0 1 0 1.39657 139 125 0 0.0924291
249 58 1 263 1 1 2 1 1 1 140 130 1 0.0925944
164 55 1 229 1 1 0.5 1 1 1 110 140 0 0.0933395
92 48 3 284 0 1 0 1 0 1.39942 120 120 0 0.0933896
176 58 1 222 0 1 0 1 1 1.3391 100 135 0 0.0943583
279 43 1 288 1 1 2 1 1 1 135 140 1 0.0953903
19 36 2 160 0 1 0 1 1 1.42173 172 150 0 0.0962968
121 51 1 179 0 1 0 1 1 1.31518 100 130 0 0.0964982
160 55 3 326 0 1 0 1 1 1.48357 155 145 0 0.0965331
20 37 3 260 0 1 0 1 0 1.37387 130 120 0 0.0966762
174 58 3 251 0 1 0 1 1 1.43906 110 130 0 0.0969323
243 54 1 274.224 1 1 0 1 1 1.00388 118 140 1 0.0969578
44 40 2 233.377 0 1 0 1 1 1.42926 188 140 0 0.0970642
36 39 2 339 0 1 0 1 1 1.35734 170 120 0 0.097233
257 49 1 341 1 1 1 1 1 1 120 130 1 0.0981915
102 49 3 201 0 1 0 1 0 1.47926 164 124 0 0.0998013
274 59 1 263.489 0 1 0 1 1 1.16023 125 130 1 0.100086
123 52 3 245.244 0 1 0 1 0 1.47111 140 140 0 0.100332
21 37 2 211 0 1 0 1 0 1.36075 142 130 0 0.101185
201 46 1 186 0 1 0 1 1 1.1109 124 118 1 0.101386
261 52 1 298 1 1 1 1 1 1 110 130 1 0.101688
26 37 1 315 0 1 0 1 1 1.30192 158 130 0 0.1017
202 46 1 277 1 1 1 1 1 1 125 120 1 0.101707
190 35 3 257 0 1 0 1 1 1.16276 140 110 1 0.102072
60 43 3 201 0 1 0 1 0 1.4524 165 120 0 0.10245
245 55 1 268 1 1 1.5 1 1 1 128 140 1 0.102462
71 45 3 237 0 1 0 1 0 1.47029 170 130 0 0.102634
236 50 1 233 1 1 2 1 1 1 121 130 1 0.102873
113 50 1 129 0 1 0 1 1 1.37627 135 140 0 0.103391
29 38 2 292 0 1 0 1 1 1.34433 130 145 0 0.103463
101 49 3 237.575 0 1 0 1 0 1.4501 160 110 0 0.103521
214 52 1 225 1 1 2 1 1 1 120 130 1 0.103535
209 49 2 265 0 1 0 1 1 1.21401 175 115 1 0.103773
193 38 1 196 0 1 0 1 1 1.1192 166 110 1 0.103921
57 42 2 147 0 1 0 1 1 1.42807 146 160 0 0.103971
2 29 3 234.165 0 1 0 1 1 1.41433 170 140 0 0.103982
134 53 3 320 0 1 0 1 1 1.47969 162 140 0 0.104802
197 41 1 289 0 1 0 1 1 1.1158 170 110 1 0.104835
38 39 1 273 0 1 0 1 1 1.26364 132 110 0 0.104857
175 58 2 179 0 1 0 1 1 1.47702 160 140 0 0.104863
108 50 3 202 0 1 0 1 0 1.44341 145 110 0 0.104991
122 52 3 210 0 1 0 1 0 1.46488 148 120 0 0.105036
15 35 3 264 0 1 0 1 1 1.44063 168 150 0 0.105069
287 52 1 266 1 1 2 1 1 1 134 140 1 0.10621
178 59 3 287 0 1 0 1 1 1.49556 150 140 0 0.106245
39 39 1 307 0 1 0 1 1 1.28957 140 130 0 0.106678
194 38 1 282 0 1 0 1 1 1.11737 170 120 1 0.107229
110 50 3 168 0 1 0 1 1 1.47467 160 120 0 0.107247
149 54 3 246 0 1 0 1 1 1.4128 110 120 0 0.108111
258 49 1 234 1 1 1 1 1 1 140 140 1 0.108172
1 29 3 243 0 1 0 1 1 1.37679 160 120 0 0.108396
231 46 1 222 0 1 0 1 1 1.1027 112 130 1 0.108427
47 41 3 245 0 1 0 1 0 1.42871 150 130 0 0.108429
18 36 2 209 0 1 0 1 1 1.39501 178 130 0 0.108539
226 39 1 280 0 1 0 1 1 1.08565 150 110 1 0.10873
34 39 3 240.837 0 1 0 1 1 1.37937 120 130 0 0.110684
75 45 2 243.4 0 1 0 1 1 1.34597 110 135 0 0.110873
105 49 2 187 0 1 0 1 1 1.45483 172 140 0 0.11151
153 54 2 245.877 0 1 0 1 1 1.4127 122 150 0 0.111691
126 52 3 284 0 1 0 1 1 1.40657 118 120 0 0.111722
41 40 3 289 0 1 0 1 1 1.44785 172 140 0 0.112222
181 59 1 242.428 0 1 0 1 1 1.39307 140 140 0 0.112398
167 56 2 244.212 0 1 0 1 1 1.38763 114 130 0 0.112951
111 50 3 216 0 1 0 1 1 1.50003 170 140 0 0.112987
55 42 3 268 0 1 0 1 1 1.42817 136 150 0 0.114312
99 48 3 238 0 1 0 1 1 1.42436 118 140 0 0.114949
51 41 1 250 0 1 0 1 1 1.29086 142 112 0 0.115035
210 49 1 206 0 1 0 1 1 1.18827 170 130 1 0.115295
25 37 1 223 0 1 0 1 1 1.32205 168 120 0 0.115366
173 58 3 230 0 1 0 1 1 1.49214 150 130 0 0.11884
49 41 3 295 0 1 0 1 1 1.42452 170 120 0 0.119412
212 50 1 264 0 1 0 1 1 1.17306 150 145 1 0.120193
159 55 3 196 0 1 0 1 1 1.4995 150 140 0 0.120452
237 52 1 182 0 1 0 1 1 1.16905 150 120 1 0.120626
215 54 1 216 0 1 0 1 1 1.16328 140 125 1 0.121607
66 43 3 207 0 1 0 1 1 1.43818 138 142 0 0.121936
161 55 2 277 0 1 0 1 1 1.40906 160 110 0 0.122253
28 38 3 297 0 1 0 1 1 1.41162 150 140 0 0.123202
7 32 3 254 0 1 0 1 1 1.38592 155 125 0 0.124091
163 55 1 270 0 1 0 1 1 1.34807 140 120 0 0.124804
43 40 2 281 0 1 0 1 1 1.38178 167 130 0 0.125651
147 54 3 208 0 1 0 1 1 1.44806 142 110 0 0.126332
107 49 1 241.799 0 1 0 1 1 1.34211 130 140 0 0.126964
138 53 1 243 0 1 0 1 1 1.3878 155 140 0 0.127054
148 54 3 238 0 1 0 1 1 1.46795 154 120 0 0.127242
100 48 2 211 0 1 0 1 1 1.36923 138 110 0 0.127324
137 53 1 182 0 1 0 1 1 1.38062 148 130 0 0.128076
50 41 3 269 0 1 0 1 1 1.4044 144 125 0 0.128387
133 53 3 240.445 0 1 0 1 1 1.43681 132 120 0 0.12858
42 40 2 215 0 1 0 1 1 1.36072 138 130 0 0.128797
119 51 3 188 0 1 0 1 1 1.46193 145 125 0 0.129163
54 42 3 198 0 1 0 1 1 1.431 155 120 0 0.130047
135 53 2 195 0 1 0 1 1 1.40632 140 120 0 0.130304
77 45 1 224 0 1 0 1 1 1.34735 144 140 0 0.130991
33 39 3 204 0 1 0 1 1 1.40589 145 120 0 0.131109
24 37 2 194 0 1 0 1 1 1.36811 150 130 0 0.13112
40 40 3 275 0 1 0 1 1 1.41238 150 130 0 0.131227
152 54 2 217 0 1 0 1 1 1.40185 137 120 0 0.131401
76 45 1 225 0 1 0 1 1 1.31877 140 120 0 0.131506
120 51 3 224 0 1 0 1 1 1.46616 150 130 0 0.131625
69 44 3 215 0 1 0 1 1 1.42261 135 130 0 0.131691
53 42 3 196 0 1 0 1 1 1.42536 150 120 0 0.132573
98 48 3 245 0 1 0 1 1 1.46212 160 130 0 0.133118
80 46 2 230 0 1 0 1 1 1.38369 150 120 0 0.136404
_images/Example_autoclean_3_40.png
_images/Example_autoclean_3_41.png

Drop Outliers ...

Do you want to drop outliers? [y/n]y
Outliers are dropped.
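
For readers who want to reproduce the outlier step by hand, the sketch below uses scikit-learn's `IsolationForest`. Under its convention, `decision_function` returns lower values for more anomalous rows and negative values for rows flagged as outliers, which resembles the `anomaly_score` column sorted in ascending order above. The threshold that `autoclean` applies is not shown in this example, so the cut at zero is an assumption, and the data is synthetic.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

# Synthetic data: a tight cluster plus one obviously distant point (last row)
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, size=(200, 2)), [[100.0, 100.0]]])

forest = IsolationForest(random_state=0).fit(X)

# decision_function: the lower the score, the more anomalous; negative means outlier
anomaly_score = forest.decision_function(X)

# Keep only the rows the model does not flag as outliers
inliers = X[anomaly_score >= 0]
print(anomaly_score[-1], len(X), '->', len(inliers))
```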