Dataset info
Number of variables | 14 |
---|---|
Number of observations | 156 |
Missing cells | 156 (7.1%) |
Duplicate rows | 0 (0.0%) |
Total size in memory | 17.2 KiB |
Average record size in memory | 112.8 B |
Variables types
Numeric | 5 |
---|---|
Categorical | 5 |
Boolean | 1 |
Date | 0 |
URL | 0 |
Text (Unique) | 2 |
Rejected | 1 |
Unsupported | 0 |
Age
Numeric
Distinct count | 57 |
---|---|
Unique (%) | 36.5% |
Missing (%) | 19.2% |
Missing (n) | 30 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 28.14150794 |
---|---|
Minimum | 0.83 |
Maximum | 71 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 0.83 |
---|---|
5-th percentile | 4.25 |
Q1 | 19 |
Median | 26 |
Q3 | 35 |
95-th percentile | 55.375 |
Maximum | 71 |
Range | 70.17 |
Interquartile range | 16 |
Descriptive statistics
Standard deviation | 14.61387993 |
---|---|
Coef of variation | 0.5192998172 |
Kurtosis | 0.6137465312 |
Mean | 28.14150794 |
MAD | 11.1428584 |
Skewness | 0.7003666519 |
Sum | 3545.83 |
Variance | 213.5654865 |
Memory size | 1.3 KiB |
Value | Count | Frequency (%) | |
21 | 8 | 5.1% | |
29 | 6 | 3.8% | |
19 | 6 | 3.8% | |
22 | 6 | 3.8% | |
24 | 5 | 3.2% | |
28 | 5 | 3.2% | |
20 | 4 | 2.6% | |
38 | 4 | 2.6% | |
26 | 4 | 2.6% | |
17 | 3 | 1.9% | |
Other values (46) | 75 | 48.1% | |
(Missing) | 30 | 19.2% |
Minimum 5 values
Value | Count | Frequency (%) | |
0.83 | 1 | 0.6% | |
2 | 3 | 1.9% | |
3 | 1 | 0.6% | |
4 | 2 | 1.3% | |
5 | 1 | 0.6% |
Maximum 5 values
Value | Count | Frequency (%) | |
71 | 1 | 0.6% | |
70.5 | 1 | 0.6% | |
66 | 1 | 0.6% | |
65 | 1 | 0.6% | |
59 | 1 | 0.6% |
Cabin
Categorical
Distinct count | 29 |
---|---|
Unique (%) | 18.6% |
Missing (%) | 80.1% |
Missing (n) | 125 |
D26 | 2 |
---|---|
C23 C25 C27 | 2 |
C123 | 2 |
Other values (25) | 25 |
(Missing) |
Value | Count | Frequency (%) | |
D26 | 2 | 1.3% | |
C23 C25 C27 | 2 | 1.3% | |
C123 | 2 | 1.3% | |
E31 | 1 | 0.6% | |
A6 | 1 | 0.6% | |
F G73 | 1 | 0.6% | |
C110 | 1 | 0.6% | |
B86 | 1 | 0.6% | |
C85 | 1 | 0.6% | |
F33 | 1 | 0.6% | |
Other values (18) | 18 | 11.5% | |
(Missing) | 125 | 80.1% |
Max length | 11 |
---|---|
Mean length | 3.179487179 |
Min length | 2 |
Contains chars | True |
Contains digits | True |
Contains spaces | True |
Contains non-words | True |
Embarked
Categorical
Distinct count | 4 |
---|---|
Unique (%) | 2.6% |
Missing (%) | 0.6% |
Missing (n) | 1 |
S | |
---|---|
C | |
Q | 13 |
(Missing) | 1 |
Value | Count | Frequency (%) | |
S | 110 | 70.5% | |
C | 32 | 20.5% | |
Q | 13 | 8.3% | |
(Missing) | 1 | 0.6% |
Max length | 3 |
---|---|
Mean length | 1.012820513 |
Min length | 1 |
Contains chars | True |
Contains digits | False |
Contains spaces | False |
Contains non-words | False |
Fare
Numeric
Distinct count | 93 |
---|---|
Unique (%) | 59.6% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 28.10958718 |
---|---|
Minimum | 6.75 |
Maximum | 263 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 6.75 |
---|---|
5-th percentile | 7.2292 |
Q1 | 8.00315 |
Median | 14.4542 |
Q3 | 30.37185 |
95-th percentile | 77.765625 |
Maximum | 263 |
Range | 256.25 |
Interquartile range | 22.3687 |
Descriptive statistics
Standard deviation | 39.4010467 |
---|---|
Coef of variation | 1.401694249 |
Kurtosis | 21.10998543 |
Mean | 28.10958718 |
MAD | 22.61971348 |
Skewness | 4.17640798 |
Sum | 4385.0956 |
Variance | 1552.442481 |
Memory size | 1.3 KiB |
Value | Count | Frequency (%) | |
8.05 | 12 | 7.7% | |
7.8958 | 8 | 5.1% | |
10.5 | 5 | 3.2% | |
13 | 5 | 3.2% | |
26 | 5 | 3.2% | |
7.75 | 5 | 3.2% | |
7.925 | 4 | 2.6% | |
7.2292 | 3 | 1.9% | |
77.2875 | 2 | 1.3% | |
7.225 | 2 | 1.3% | |
Other values (83) | 105 | 67.3% |
Minimum 5 values
Value | Count | Frequency (%) | |
6.75 | 1 | 0.6% | |
6.975 | 1 | 0.6% | |
7.05 | 1 | 0.6% | |
7.1417 | 1 | 0.6% | |
7.225 | 2 | 1.3% |
Maximum 5 values
Value | Count | Frequency (%) | |
263 | 2 | 1.3% | |
247.5208 | 1 | 0.6% | |
146.5208 | 1 | 0.6% | |
83.475 | 1 | 0.6% | |
82.1708 | 1 | 0.6% |
Name
Categorical, Unique
First 5 values |
---|
Ahlin, Mrs. Johan (Johanna Persdotter Larsson) |
Allen, Mr. William Henry |
Andersson, Miss. Ellis Anna Maria |
Andersson, Miss. Erna Alexandra |
Andersson, Mr. Anders Johan |
Last 5 values |
---|
Williams, Mr. Charles Duane |
Williams, Mr. Charles Eugene |
Woolner, Mr. Hugh |
Zabour, Miss. Hileni |
van Billiard, Mr. Austin Blyler |
First 5 values
Value | Count | Frequency (%) | |
Ahlin, Mrs. Johan (Johanna Persdotter Larsson) | 1 | 0.6% | |
Allen, Mr. William Henry | 1 | 0.6% | |
Andersson, Miss. Ellis Anna Maria | 1 | 0.6% | |
Andersson, Miss. Erna Alexandra | 1 | 0.6% | |
Andersson, Mr. Anders Johan | 1 | 0.6% |
Last 5 values
Value | Count | Frequency (%) | |
van Billiard, Mr. Austin Blyler | 1 | 0.6% | |
Zabour, Miss. Hileni | 1 | 0.6% | |
Woolner, Mr. Hugh | 1 | 0.6% | |
Williams, Mr. Charles Eugene | 1 | 0.6% | |
Williams, Mr. Charles Duane | 1 | 0.6% |
Parch
Numeric
Distinct count | 5 |
---|---|
Unique (%) | 3.2% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.3974358974 |
---|---|
Minimum | 0 |
Maximum | 5 |
Zeros (%) | 77.6% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 0 |
Q3 | 0 |
95-th percentile | 2 |
Maximum | 5 |
Range | 5 |
Interquartile range | 0 |
Descriptive statistics
Standard deviation | 0.8701463516 |
---|---|
Coef of variation | 2.189400498 |
Kurtosis | 9.346661872 |
Mean | 0.3974358974 |
MAD | 0.6165351742 |
Skewness | 2.760421073 |
Sum | 62 |
Variance | 0.7571546733 |
Memory size | 1.3 KiB |
Value | Count | Frequency (%) | |
0 | 121 | 77.6% | |
2 | 17 | 10.9% | |
1 | 15 | 9.6% | |
5 | 2 | 1.3% | |
3 | 1 | 0.6% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 121 | 77.6% | |
1 | 15 | 9.6% | |
2 | 17 | 10.9% | |
3 | 1 | 0.6% | |
5 | 2 | 1.3% |
Maximum 5 values
Value | Count | Frequency (%) | |
5 | 2 | 1.3% | |
3 | 1 | 0.6% | |
2 | 17 | 10.9% | |
1 | 15 | 9.6% | |
0 | 121 | 77.6% |
PassengerId
Numeric
Distinct count | 156 |
---|---|
Unique (%) | 100.0% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 78.5 |
---|---|
Minimum | 1 |
Maximum | 156 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 8.75 |
Q1 | 39.75 |
Median | 78.5 |
Q3 | 117.25 |
95-th percentile | 148.25 |
Maximum | 156 |
Range | 155 |
Interquartile range | 77.5 |
Descriptive statistics
Standard deviation | 45.17742799 |
---|---|
Coef of variation | 0.5755086368 |
Kurtosis | -1.2 |
Mean | 78.5 |
MAD | 39 |
Skewness | 0 |
Sum | 12246 |
Variance | 2041 |
Memory size | 1.3 KiB |
Value | Count | Frequency (%) | |
156 | 1 | 0.6% | |
49 | 1 | 0.6% | |
56 | 1 | 0.6% | |
55 | 1 | 0.6% | |
54 | 1 | 0.6% | |
53 | 1 | 0.6% | |
52 | 1 | 0.6% | |
51 | 1 | 0.6% | |
50 | 1 | 0.6% | |
48 | 1 | 0.6% | |
Other values (146) | 146 | 93.6% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 1 | 0.6% | |
2 | 1 | 0.6% | |
3 | 1 | 0.6% | |
4 | 1 | 0.6% | |
5 | 1 | 0.6% |
Maximum 5 values
Value | Count | Frequency (%) | |
156 | 1 | 0.6% | |
155 | 1 | 0.6% | |
154 | 1 | 0.6% | |
153 | 1 | 0.6% | |
152 | 1 | 0.6% |
Pclass
Categorical
Distinct count | 3 |
---|---|
Unique (%) | 1.9% |
Missing (%) | 0.0% |
Missing (n) | 0 |
3 | |
---|---|
2 | |
1 |
Value | Count | Frequency (%) | |
3 | 96 | 61.5% | |
2 | 30 | 19.2% | |
1 | 30 | 19.2% |
Max length | 1 |
---|---|
Mean length | 1 |
Min length | 1 |
Contains chars | False |
Contains digits | True |
Contains spaces | False |
Contains non-words | False |
Sex
Categorical
Distinct count | 2 |
---|---|
Unique (%) | 1.3% |
Missing (%) | 0.0% |
Missing (n) | 0 |
male | |
---|---|
female |
Value | Count | Frequency (%) | |
male | 100 | 64.1% | |
female | 56 | 35.9% |
Max length | 6 |
---|---|
Mean length | 4.717948718 |
Min length | 4 |
Contains chars | True |
Contains digits | False |
Contains spaces | False |
Contains non-words | False |
SibSp
Numeric
Distinct count | 6 |
---|---|
Unique (%) | 3.8% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 0.6153846154 |
---|---|
Minimum | 0 |
Maximum | 5 |
Zeros (%) | 62.8% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
Median | 0 |
Q3 | 1 |
95-th percentile | 3 |
Maximum | 5 |
Range | 5 |
Interquartile range | 1 |
Descriptive statistics
Standard deviation | 1.056235179 |
---|---|
Coef of variation | 1.716382167 |
Kurtosis | 4.996881448 |
Mean | 0.6153846154 |
MAD | 0.7731755424 |
Skewness | 2.220684855 |
Sum | 96 |
Variance | 1.115632754 |
Memory size | 1.3 KiB |
Value | Count | Frequency (%) | |
0 | 98 | 62.8% | |
1 | 40 | 25.6% | |
3 | 6 | 3.8% | |
2 | 6 | 3.8% | |
4 | 4 | 2.6% | |
5 | 2 | 1.3% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 98 | 62.8% | |
1 | 40 | 25.6% | |
2 | 6 | 3.8% | |
3 | 6 | 3.8% | |
4 | 4 | 2.6% |
Maximum 5 values
Value | Count | Frequency (%) | |
5 | 2 | 1.3% | |
4 | 4 | 2.6% | |
3 | 6 | 3.8% | |
2 | 6 | 3.8% | |
1 | 40 | 25.6% |
Survived
Boolean
Distinct count | 2 |
---|---|
Unique (%) | 1.3% |
Missing (%) | 0.0% |
Missing (n) | 0 |
0 | |
---|---|
1 |
Value | Count | Frequency (%) | |
0 | 102 | 65.4% | |
1 | 54 | 34.6% |
Ticket
Categorical
Distinct count | 145 |
---|---|
Unique (%) | 92.9% |
Missing (%) | 0.0% |
Missing (n) | 0 |
CA 2144 | 2 |
---|---|
19950 | 2 |
W./C. 6608 | 2 |
Other values (142) |
Value | Count | Frequency (%) | |
CA 2144 | 2 | 1.3% | |
19950 | 2 | 1.3% | |
W./C. 6608 | 2 | 1.3% | |
2651 | 2 | 1.3% | |
35281 | 2 | 1.3% | |
11668 | 2 | 1.3% | |
113803 | 2 | 1.3% | |
S.O.C. 14879 | 2 | 1.3% | |
349909 | 2 | 1.3% | |
237736 | 2 | 1.3% | |
Other values (135) | 136 | 87.2% |
Max length | 18 |
---|---|
Mean length | 6.961538462 |
Min length | 4 |
Contains chars | True |
Contains digits | True |
Contains spaces | True |
Contains non-words | True |
try
Categorical, Unique
First 5 values |
---|
Ahlin, Mrs. Johan (Johanna Persdotter Larsson)female |
Allen, Mr. William Henrymale |
Andersson, Miss. Ellis Anna Mariafemale |
Andersson, Miss. Erna Alexandrafemale |
Andersson, Mr. Anders Johanmale |
Last 5 values |
---|
Williams, Mr. Charles Duanemale |
Williams, Mr. Charles Eugenemale |
Woolner, Mr. Hughmale |
Zabour, Miss. Hilenifemale |
van Billiard, Mr. Austin Blylermale |
First 5 values
Value | Count | Frequency (%) | |
Ahlin, Mrs. Johan (Johanna Persdotter Larsson)female | 1 | 0.6% | |
Allen, Mr. William Henrymale | 1 | 0.6% | |
Andersson, Miss. Ellis Anna Mariafemale | 1 | 0.6% | |
Andersson, Miss. Erna Alexandrafemale | 1 | 0.6% | |
Andersson, Mr. Anders Johanmale | 1 | 0.6% |
Last 5 values
Value | Count | Frequency (%) | |
van Billiard, Mr. Austin Blylermale | 1 | 0.6% | |
Zabour, Miss. Hilenifemale | 1 | 0.6% | |
Woolner, Mr. Hughmale | 1 | 0.6% | |
Williams, Mr. Charles Eugenemale | 1 | 0.6% | |
Williams, Mr. Charles Duanemale | 1 | 0.6% |
Unnamed_0
Highly correlated
This variable is highly correlated with PassengerId
and should be ignored for analysis
Correlation | 1 |
---|
First rows
Age | Cabin | Embarked | Fare | Name | Parch | PassengerId | Pclass | Sex | SibSp | Survived | Ticket | try | Unnamed_0 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 22.0 | NaN | S | 7.2500 | Braund, Mr. Owen Harris | 0 | 1 | 3 | male | 1 | 0 | A/5 21171 | Braund, Mr. Owen Harrismale | 0 |
1 | 38.0 | C85 | C | 71.2833 | Cumings, Mrs. John Bradley (Florence Briggs Thayer) | 0 | 2 | 1 | female | 1 | 1 | PC 17599 | Cumings, Mrs. John Bradley (Florence Briggs Thayer)female | 1 |
2 | 26.0 | NaN | S | 7.9250 | Heikkinen, Miss. Laina | 0 | 3 | 3 | female | 0 | 1 | STON/O2. 3101282 | Heikkinen, Miss. Lainafemale | 2 |
3 | 35.0 | C123 | S | 53.1000 | Futrelle, Mrs. Jacques Heath (Lily May Peel) | 0 | 4 | 1 | female | 1 | 1 | 113803 | Futrelle, Mrs. Jacques Heath (Lily May Peel)female | 3 |
4 | 35.0 | NaN | S | 8.0500 | Allen, Mr. William Henry | 0 | 5 | 3 | male | 0 | 0 | 373450 | Allen, Mr. William Henrymale | 4 |
5 | NaN | NaN | Q | 8.4583 | Moran, Mr. James | 0 | 6 | 3 | male | 0 | 0 | 330877 | Moran, Mr. Jamesmale | 5 |
6 | 54.0 | E46 | S | 51.8625 | McCarthy, Mr. Timothy J | 0 | 7 | 1 | male | 0 | 0 | 17463 | McCarthy, Mr. Timothy Jmale | 6 |
7 | 2.0 | NaN | S | 21.0750 | Palsson, Master. Gosta Leonard | 1 | 8 | 3 | male | 3 | 0 | 349909 | Palsson, Master. Gosta Leonardmale | 7 |
8 | 27.0 | NaN | S | 11.1333 | Johnson, Mrs. Oscar W (Elisabeth Vilhelmina Berg) | 2 | 9 | 3 | female | 0 | 1 | 347742 | Johnson, Mrs. Oscar W (Elisabeth Vilhelmina Berg)female | 8 |
9 | 14.0 | NaN | C | 30.0708 | Nasser, Mrs. Nicholas (Adele Achem) | 0 | 10 | 2 | female | 1 | 1 | 237736 | Nasser, Mrs. Nicholas (Adele Achem)female | 9 |
Last rows
Age | Cabin | Embarked | Fare | Name | Parch | PassengerId | Pclass | Sex | SibSp | Survived | Ticket | try | Unnamed_0 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
146 | 27.0 | NaN | S | 7.7958 | Andersson, Mr. August Edvard ("Wennerstrom") | 0 | 147 | 3 | male | 0 | 1 | 350043 | Andersson, Mr. August Edvard ("Wennerstrom")male | 146 |
147 | 9.0 | NaN | S | 34.3750 | Ford, Miss. Robina Maggie "Ruby" | 2 | 148 | 3 | female | 2 | 0 | W./C. 6608 | Ford, Miss. Robina Maggie "Ruby"female | 147 |
148 | 36.5 | F2 | S | 26.0000 | Navratil, Mr. Michel ("Louis M Hoffman") | 2 | 149 | 2 | male | 0 | 0 | 230080 | Navratil, Mr. Michel ("Louis M Hoffman")male | 148 |
149 | 42.0 | NaN | S | 13.0000 | Byles, Rev. Thomas Roussel Davids | 0 | 150 | 2 | male | 0 | 0 | 244310 | Byles, Rev. Thomas Roussel Davidsmale | 149 |
150 | 51.0 | NaN | S | 12.5250 | Bateman, Rev. Robert James | 0 | 151 | 2 | male | 0 | 0 | S.O.P. 1166 | Bateman, Rev. Robert Jamesmale | 150 |
151 | 22.0 | C2 | S | 66.6000 | Pears, Mrs. Thomas (Edith Wearne) | 0 | 152 | 1 | female | 1 | 1 | 113776 | Pears, Mrs. Thomas (Edith Wearne)female | 151 |
152 | 55.5 | NaN | S | 8.0500 | Meo, Mr. Alfonzo | 0 | 153 | 3 | male | 0 | 0 | A.5. 11206 | Meo, Mr. Alfonzomale | 152 |
153 | 40.5 | NaN | S | 14.5000 | van Billiard, Mr. Austin Blyler | 2 | 154 | 3 | male | 0 | 0 | A/5. 851 | van Billiard, Mr. Austin Blylermale | 153 |
154 | NaN | NaN | S | 7.3125 | Olsen, Mr. Ole Martin | 0 | 155 | 3 | male | 0 | 0 | Fa 265302 | Olsen, Mr. Ole Martinmale | 154 |
155 | 51.0 | NaN | C | 61.3792 | Williams, Mr. Charles Duane | 1 | 156 | 1 | male | 0 | 0 | PC 17597 | Williams, Mr. Charles Duanemale | 155 |