The data was collected from 299 patients (105 women and 194 men) who were admitted to Institute of Cardiology and Allied hospital Faisalabad-Pakistan in 2015. All the patients have left ventricular systolic dysfunction and belong to the New York Heart Association (NYHA) class III and IV. The average follow-up time was 130 days.
The dataset can be downloaded from the Kaggle website.
Data items
age
: Age of the patientanaemia
: Whether the patient had anaemia or not determined by the haematocrit level.creatinine_phosphokinase
: The level of the creatinine phosphokinase (CPK) enzyme in the blood.diabetes
: Whether the patient had diabetes.ejection_fraction
: The proportion of blood pumped out of the heart during a single contraction, given as a percentage.high_blood_pressure
: Whether the patient had high blood pressure, though the definition of high blood pressure is not clear in this dataset.platelets
: The number of platelets in the blood.serum_creatinine
: The level of creatinine in the blood.serum_sodium
: The level of sodium in the blood.sex
: Whether the patient is male (1
) or female (0
).smoking
: Whether the patient had a smoking habbit.time
: The follow-up period in days.DEATH_EVENT
: Whether the patient was died during the follow-up period.
import pandas as pd
df = pd.read_csv('heart_failure_clinical_records_dataset.csv')
df.head()
age | anaemia | creatinine_phosphokinase | diabetes | ejection_fraction | high_blood_pressure | platelets | serum_creatinine | serum_sodium | sex | smoking | time | DEATH_EVENT | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 75.0 | 0 | 582 | 0 | 20 | 1 | 265000.00 | 1.9 | 130 | 1 | 0 | 4 | 1 |
1 | 55.0 | 0 | 7861 | 0 | 38 | 0 | 263358.03 | 1.1 | 136 | 1 | 0 | 6 | 1 |
2 | 65.0 | 0 | 146 | 0 | 20 | 0 | 162000.00 | 1.3 | 129 | 1 | 1 | 7 | 1 |
3 | 50.0 | 1 | 111 | 0 | 20 | 0 | 210000.00 | 1.9 | 137 | 1 | 0 | 7 | 1 |
4 | 65.0 | 1 | 160 | 1 | 20 | 0 | 327000.00 | 2.7 | 116 | 0 | 0 | 8 | 1 |
Comments