# data **Repository Path**: hiyky/data ## Basic Information - **Project Name**: data - **Description**: No description available - **Primary Language**: Unknown - **License**: MIT - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2020-08-14 - **Last Updated**: 2020-12-19 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Data Sets for Machine Learning Practice Name | Description | Source ---|---|--- [iris](https://github.com/dataprofessor/data/blob/master/iris.csv) | 150 flowers (rows) belonging to the 3 species (*setosa*is , *versicolor* and *virginica*) of the *Iris* genus. The dataset consists of 4 input variables (sepal length, sepal width, petal length and petal width) and 1 output variable (the class label of the Iris species as being *setosa*is , *versicolor* and *virginica*. | [1](https://archive.ics.uci.edu/ml/datasets/iris) [dhfr](https://github.com/dataprofessor/data/blob/master/dhfr.csv) | 325 molecules (rows) with biological activity against the DHFR enzyme (an anti-marial drug target). The dataset consists of 228 input variables (molecular descriptors describing the physicochemical properties of the molecule) and 1 output variable (the biological activity as being either *active* or *inactive*). | [2](https://www.rdocumentation.org/packages/caret/versions/6.0-84/topics/dhfr) [heart-disease-cleveland](https://github.com/dataprofessor/data/blob/master/heart-disease-cleveland.csv) | 303 patients (rows) who have been diagnosed as having (diagnosis score of 1, 2, 3, or 4) or not having (diagnosis score of 0) heart disease. The dataset consists of 13 input variables (the health parameters) and 1 output variable (diagnosis). | [3](https://archive.ics.uci.edu/ml/datasets/Heart+Disease)