Showing 131 - 140 of 3,804
Feature selection is essential to improve the classification effectiveness. This paper presents a new adaptive algorithm called FS-PeSOA (feature selection penguins search optimization algorithm) which is a meta-heuristic feature selection method based on “Penguins Search Optimization...
Persistent link: https://www.econbiz.de/10012046774
In this article, the authors present a new malicious code detection model. The detection model improves typical n-gram feature extraction algorithms that are easy to be obfuscated. Specifically, the proposed model can dynamically determine obfuscation features and then adjust the selection of...
Persistent link: https://www.econbiz.de/10012046977
Software Defect Prediction (SDP) models are used to predict, whether software is clean or buggy using the historical data collected from various software repositories. The data collected from such repositories may contain some missing values. In order to estimate missing values, imputation...
Persistent link: https://www.econbiz.de/10012046987
This article presents a novel approach for fraud detection in automobile insurance claims by applying various data mining techniques. Initially, the most relevant attributes are chosen from the original dataset by using an evolutionary algorithm based feature selection method. A test set is then...
Persistent link: https://www.econbiz.de/10012047413
Dimensionality reduction of feature vector size plays a vital role in enhancing the text processing capabilities; it aims in reducing the size of the feature vector used in the mining tasks (classification, clustering, etc.). This paper proposes an efficient approach to be used in reducing the...
Persistent link: https://www.econbiz.de/10012047801
Building classification models from real-world datasets became a difficult task, especially in datasets with high dimensional features. Unfortunately, these datasets may include irrelevant or redundant features which have a negative effect on the classification performance. Selecting the...
Persistent link: https://www.econbiz.de/10012047807
This paper proposes the application of a swarm intelligence algorithm called Artificial Bee Colony (ABC) for the feature selection to feed a Random Forest (RF) classifier aiming to recognise Traffic Signs. In this paper, the authors define and assess several fitness functions for the feature...
Persistent link: https://www.econbiz.de/10012047848
Instance selection and feature selection are important steps in the data mining process. They help reduce the excessive number of instances and features. The purpose of this reduction is to eliminate the noisy and redundant instances and features in order to improve the classifiers performance....
Persistent link: https://www.econbiz.de/10012047913
A significant amount of microarray gene expression data is available on the Internet, and researchers are allowed to analyze such data freely. However, microarray data includes thousands of genes, and analysis using conventional techniques is too difficult. Therefore, selecting informative...
Persistent link: https://www.econbiz.de/10012048041
A DNA microarray can measure the expression of thousands of genes simultaneously, and this enables us to study the molecular pathways underlying Age-related Macular Degeneration. Previous studies have not determined which genes are responsible for the process of AMD. The authors address this...
Persistent link: https://www.econbiz.de/10012048138