Author: Roy, Puloma; Sharmin, Sadia; Ali, Amin Ahsan; Shoyaib, Mohammad
                    Title: Discretization and Feature Selection Based on Bias Corrected Mutual Information Considering High-Order Dependencies  Cord-id: 725qw4ge  Document date: 2020_4_17
                    ID: 725qw4ge
                    
                    Snippet: Mutual Information (MI) based feature selection methods are popular due to their ability to capture the nonlinear relationship among variables. However, existing works rarely address the error (bias) that occurs due to the use of finite samples during the estimation of MI. To the best of our knowledge, none of the existing methods address the bias issue for the high-order interaction term which is essential for better approximation of joint MI. In this paper, we first calculate the amount of bia
                    
                    
                    
                     
                    
                    
                    
                    
                        
                            
                                Document: Mutual Information (MI) based feature selection methods are popular due to their ability to capture the nonlinear relationship among variables. However, existing works rarely address the error (bias) that occurs due to the use of finite samples during the estimation of MI. To the best of our knowledge, none of the existing methods address the bias issue for the high-order interaction term which is essential for better approximation of joint MI. In this paper, we first calculate the amount of bias of this term. Moreover, to select features using [Formula: see text] based search, we also show that this term follows [Formula: see text] distribution. Based on these two theoretical results, we propose Discretization and feature Selection based on bias corrected Mutual information (DSbM). DSbM is extended by adding simultaneous forward selection and backward elimination (DSbM[Formula: see text]). We demonstrate the superiority of DSbM over four state-of-the-art methods in terms of accuracy and the number of selected features on twenty benchmark datasets. Experimental results also demonstrate that DSbM outperforms the existing methods in terms of accuracy, Pareto Optimality and Friedman test. We also observe that compared to DSbM, in some dataset DSbM[Formula: see text] selects fewer features and increases accuracy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this chapter (10.1007/978-3-030-47426-3_64) contains supplementary material, which is available to authorized users.
 
  Search related documents: 
                                Co phrase  search for related documents- Try single phrases listed below for: 1
 
                                Co phrase  search for related documents, hyperlinks ordered by date