Show simple item record

dc.contributor.authorYonasi, Safari
dc.contributor.authorNakasi, Rose
dc.contributor.authorSingh, Yashik
dc.date.accessioned2022-06-22T08:38:30Z
dc.date.available2022-06-22T08:38:30Z
dc.date.issued2018-08
dc.identifier.citationYonasi, S., Nakasi, R., & Singh, Y. (2018), Predicting Cellular Protein localization Sites on Ecoli’s Minimal Dataset using a Comparison of Machine Learning Techniques. International Journal of Computer Applications, 975, 8887.en_US
dc.identifier.urihttp://ir.must.ac.ug/xmlui/handle/123456789/2153
dc.description.abstractSeveral Machine Learning Classification Techniques have been applied in predicting Protein Localization sites of E. coli using a number of techniques. However, research done is limited to no prediction of Localization sites of Proteins on Ecoli0s minimal dataset with the most informative features obtained using different feature selection techniques. This study investigated several Machine Learning Classification and Feature Selection Techniques as applied on Ecoli0s minimal dataset. The implementation of classifiers aided in predicting localization sites of E. coli0s minimal subset using its informative features obtained by feature selection techniques. Results were achieved in four parts including; (Data Collection, Cleaning and Preprocessing), Feature selection where the most informative features are selected, Classification where prediction of the localization of proteins is done and then Evaluation of the Classifiers to assess their performance using a number of measures including Accuracy from Cross-validation, and AUROCC to enable in recommending the best Classifier at the end. Among the Classifiers used, Extra Tree Classifier and Gradient Boosting are seen to be the best at performance followed by Random forest as seen from Precision, Recall and F-measure scores. Ada Boost is the worst at 83%.en_US
dc.language.isoen_USen_US
dc.publisherInternational Journal of Computer Applicationsen_US
dc.subjectPredictingen_US
dc.subjectEnsemble and Non Ensemble Classifiersen_US
dc.subjectMachine Learning Techniquesen_US
dc.titlePredicting Cellular Protein localization Sites on Ecoli’s Minimal Dataset using a Comparison of Machine Learning Techniquesen_US
dc.typeArticleen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record