Skip to main content

2024 | OriginalPaper | Buchkapitel

Intelligent Feature Engineering and Feature Selection Techniques for Machine Learning Evaluation

verfasst von : Janjhyam Venkata Naga Ramesh, Ajay kushwaha, Tripti Sharma, A. Aranganathan, Ankur Gupta, Sanjiv Kumar Jain

Erschienen in: Mobile Radio Communications and 5G Networks

Verlag: Springer Nature Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Manual feature engineering can take a long time and be ineffective at capturing complicated patterns, while choosing the wrong features can produce less-than-ideal outcomes. As a result, effective feature engineering and selection strategies are crucial for enhancing machine learning evaluation. To improve the assessment of machine learning algorithms, we suggest intelligent feature engineering and feature selection strategies in this study. The two key phases of our strategy are feature engineering and feature selection. We use cutting-edge techniques like deep learning and autoencoders for feature engineering to automatically extract pertinent representations from raw data. High-level characteristics that represent intricate linkages and buried patterns can be extracted using these techniques. We use sophisticated algorithms, such as statistical methods, evolutionary algorithms, and correlation analysis, during the feature selection step to find the most informative features while minimizing dimensionality. Results from experiments show that models created with our intelligent feature engineering and selection strategies perform better than models created using more conventional methods, with an MSE value of 0.0202920. We intend to investigate novel deep learning architectures created expressly for feature engineering tasks in upcoming research. We also intend to research dynamic feature set adaptation techniques for feature selection based on reinforcement learning. For further improvement, ensemble methods integrating various feature engineering and selection strategies will be investigated. Additionally, we stress the necessity of standardized benchmarks and evaluation procedures to enable fair comparisons between various feature engineering and selection methods and to promote improvements in the field.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Anand R, Singh B, Sindhwani N (2009) Speech perception & analysis of fluent digits’ strings using level-by-level time alignment. Int J Inf Technol Knowl Manag 2(1):65–68 Anand R, Singh B, Sindhwani N (2009) Speech perception & analysis of fluent digits’ strings using level-by-level time alignment. Int J Inf Technol Knowl Manag 2(1):65–68
2.
Zurück zum Zitat Bakshi G, Shukla R, Yadav V, Dahiya A, Anand R, Sindhwani N, Singh H (2021) An optimized approach for feature extraction in multi-relational statistical learning. J Sci Ind Res 80(6):537–542 Bakshi G, Shukla R, Yadav V, Dahiya A, Anand R, Sindhwani N, Singh H (2021) An optimized approach for feature extraction in multi-relational statistical learning. J Sci Ind Res 80(6):537–542
3.
Zurück zum Zitat Anand R, Sindhwani N, Juneja S (2022) Cognitive internet of things, its applications, and its challenges: a survey. Harnessing Internet Things (IoT) Hyper-Connect Smart World, 91–113 Anand R, Sindhwani N, Juneja S (2022) Cognitive internet of things, its applications, and its challenges: a survey. Harnessing Internet Things (IoT) Hyper-Connect Smart World, 91–113
4.
Zurück zum Zitat Sindhwani N, Anand R, Meivel S, Shukla R, Yadav MP, Yadav V (2021) Performance analysis of deep neural networks using computer vision. EAI Endorsed Trans Ind Netw Intell Syst 8(29):e3–e3 Sindhwani N, Anand R, Meivel S, Shukla R, Yadav MP, Yadav V (2021) Performance analysis of deep neural networks using computer vision. EAI Endorsed Trans Ind Netw Intell Syst 8(29):e3–e3
5.
Zurück zum Zitat Anand R, Arora S, Sindhwani N (Jan 2022) A miniaturized UWB antenna for high speed applications. In: 2022 international conference on computing, communication and power technology (IC3P). IEEE, pp 264–267 Anand R, Arora S, Sindhwani N (Jan 2022) A miniaturized UWB antenna for high speed applications. In: 2022 international conference on computing, communication and power technology (IC3P). IEEE, pp 264–267
6.
Zurück zum Zitat Anand R, Sindhwani N, Dahiya A (Mar 2022) Design of a high directivity slotted fractal antenna for C-band, X-band and Ku-band applications. In: 2022 9th international conference on computing for sustainable global development (INDIACom). IEEE, pp 727–730 Anand R, Sindhwani N, Dahiya A (Mar 2022) Design of a high directivity slotted fractal antenna for C-band, X-band and Ku-band applications. In: 2022 9th international conference on computing for sustainable global development (INDIACom). IEEE, pp 727–730
7.
Zurück zum Zitat Arora S, Sharma S, Anand R, Shrivastva G (2023) Miniaturized pentagon-shaped planar monopole antenna for ultra-wideband applications. Prog Electromagn Res C 133:195–208CrossRef Arora S, Sharma S, Anand R, Shrivastva G (2023) Miniaturized pentagon-shaped planar monopole antenna for ultra-wideband applications. Prog Electromagn Res C 133:195–208CrossRef
8.
Zurück zum Zitat Baek S, Kim J, Yu H, Yang G, Sohn I, Cho Y, Park C (2023) Intelligent feature selection for ECG-based personal authentication using deep reinforcement learning. Sensors 23(3):1230CrossRef Baek S, Kim J, Yu H, Yang G, Sohn I, Cho Y, Park C (2023) Intelligent feature selection for ECG-based personal authentication using deep reinforcement learning. Sensors 23(3):1230CrossRef
9.
Zurück zum Zitat Rani P, Sharma R (2023) Intelligent transportation system for internet of vehicles based vehicular networks for smart cities. Comput Electr Eng 105:108543CrossRef Rani P, Sharma R (2023) Intelligent transportation system for internet of vehicles based vehicular networks for smart cities. Comput Electr Eng 105:108543CrossRef
10.
Zurück zum Zitat Subramani S, Selvi M (2023) Multi-objective PSO based feature selection for intrusion detection in IoT based wireless sensor networks. Optik 273:170419CrossRef Subramani S, Selvi M (2023) Multi-objective PSO based feature selection for intrusion detection in IoT based wireless sensor networks. Optik 273:170419CrossRef
11.
Zurück zum Zitat Damaneh MM, Mohanna F, Jafari P (2023) Static hand gesture recognition in sign language based on convolutional neural network with feature extraction method using ORB descriptor and Gabor filter. Expert Syst Appl 211:118559CrossRef Damaneh MM, Mohanna F, Jafari P (2023) Static hand gesture recognition in sign language based on convolutional neural network with feature extraction method using ORB descriptor and Gabor filter. Expert Syst Appl 211:118559CrossRef
12.
Zurück zum Zitat Lao Z, He D, Wei Z, Shang H, Jin Z, Miao J, Ren C (2023) Intelligent fault diagnosis for rail transit switch machine based on adaptive feature selection and improved LightGBM. Eng Fail Anal 148:107219CrossRef Lao Z, He D, Wei Z, Shang H, Jin Z, Miao J, Ren C (2023) Intelligent fault diagnosis for rail transit switch machine based on adaptive feature selection and improved LightGBM. Eng Fail Anal 148:107219CrossRef
13.
Zurück zum Zitat Gupta N, Janani S, Dilip R, Hosur R, Chaturvedi A, Gupta A (2022) Wearable sensors for evaluation over smart home using sequential minimization optimization-based random forest. Int J Commun Netw Inf Secur 14(2):179–188 Gupta N, Janani S, Dilip R, Hosur R, Chaturvedi A, Gupta A (2022) Wearable sensors for evaluation over smart home using sequential minimization optimization-based random forest. Int J Commun Netw Inf Secur 14(2):179–188
14.
Zurück zum Zitat Maqsood S, Damaševičius R (2023) Multiclass skin lesion localization and classification using deep learning based features fusion and selection framework for smart healthcare. Neural Netw 160:238–258CrossRef Maqsood S, Damaševičius R (2023) Multiclass skin lesion localization and classification using deep learning based features fusion and selection framework for smart healthcare. Neural Netw 160:238–258CrossRef
15.
Zurück zum Zitat Lee Y, Park B, Jo M, Lee J, Lee C (2023) A quantitative diagnostic method of feature coordination for machine learning model with massive data from rotary machine. Expert Syst Appl 214:119117CrossRef Lee Y, Park B, Jo M, Lee J, Lee C (2023) A quantitative diagnostic method of feature coordination for machine learning model with massive data from rotary machine. Expert Syst Appl 214:119117CrossRef
16.
Zurück zum Zitat Ahakonye LAC, Nwakanma CI, Lee JM, Kim DS (2023) SCADA intrusion detection scheme exploiting the fusion of modified decision tree and Chi-square feature selection. Internet Things 21:100676CrossRef Ahakonye LAC, Nwakanma CI, Lee JM, Kim DS (2023) SCADA intrusion detection scheme exploiting the fusion of modified decision tree and Chi-square feature selection. Internet Things 21:100676CrossRef
17.
Zurück zum Zitat Amin SA, Al Shanabari H, Iqbal R, Karyotis C (2023) An intelligent framework for automatic breast cancer classification using novel feature extraction and machine learning techniques. J Signal Process Syst 95(2–3):293–303CrossRef Amin SA, Al Shanabari H, Iqbal R, Karyotis C (2023) An intelligent framework for automatic breast cancer classification using novel feature extraction and machine learning techniques. J Signal Process Syst 95(2–3):293–303CrossRef
18.
Zurück zum Zitat Güney H (2023) Preprocessing impact analysis for machine learning-based network intrusion detection. Sak Univ J Comput Inf Sci 6(1):67–79 Güney H (2023) Preprocessing impact analysis for machine learning-based network intrusion detection. Sak Univ J Comput Inf Sci 6(1):67–79
19.
Zurück zum Zitat Rani P, Karambelkar VH (2023) Automated detection of diabetic retinopathy using machine learning in ophthalmology. Int J Intell Syst Appl Eng 11(7s):58–64 Rani P, Karambelkar VH (2023) Automated detection of diabetic retinopathy using machine learning in ophthalmology. Int J Intell Syst Appl Eng 11(7s):58–64
20.
Zurück zum Zitat Jeyaselvi M, Dhanaraj RK, Sathya M, Memon FH, Krishnasamy L, Dev K, Qureshi NMF et al (2023) A highly secured intrusion detection system for IoT using EXPSO-STFA feature selection for LAANN to detect attacks. Cluster Computing 26(1):559–574 Jeyaselvi M, Dhanaraj RK, Sathya M, Memon FH, Krishnasamy L, Dev K, Qureshi NMF et al (2023) A highly secured intrusion detection system for IoT using EXPSO-STFA feature selection for LAANN to detect attacks. Cluster Computing 26(1):559–574
21.
22.
Zurück zum Zitat Thakkar A, Lohiya R (2023) Fusion of statistical importance for feature selection in deep neural network-based intrusion detection system. Inf Fusion 90:353–363CrossRef Thakkar A, Lohiya R (2023) Fusion of statistical importance for feature selection in deep neural network-based intrusion detection system. Inf Fusion 90:353–363CrossRef
23.
Zurück zum Zitat Surya S, Muthukumaravel A (2023) Efficient feature extraction on mammogram images using enhanced grey level co-occurrence matrix. Int J Intell Eng Inform 11(1):35–53 Surya S, Muthukumaravel A (2023) Efficient feature extraction on mammogram images using enhanced grey level co-occurrence matrix. Int J Intell Eng Inform 11(1):35–53
24.
Zurück zum Zitat Prasad M, Tripathi S, Dahal K (2023) An intelligent intrusion detection and performance reliability evaluation mechanism in mobile ad-hoc networks. Eng Appl Artif Intell 119:105760CrossRef Prasad M, Tripathi S, Dahal K (2023) An intelligent intrusion detection and performance reliability evaluation mechanism in mobile ad-hoc networks. Eng Appl Artif Intell 119:105760CrossRef
25.
Zurück zum Zitat Kaushik D, Garg M, Gupta A, Pramanik S (2021) Application of machine learning and deep learning in cyber security: an innovative approach. In: Ghonge M, Pramanik S, Mangrulkar R, Le DN (eds) Cybersecurity and digital forensics: challenges and future trends. Wiley Kaushik D, Garg M, Gupta A, Pramanik S (2021) Application of machine learning and deep learning in cyber security: an innovative approach. In: Ghonge M, Pramanik S, Mangrulkar R, Le DN (eds) Cybersecurity and digital forensics: challenges and future trends. Wiley
26.
Zurück zum Zitat Pandey BK et al (2022) Effective and secure transmission of health information using advanced morphological component analysis and image hiding. In: Gupta M, Ghatak S, Gupta A, Mukherjee AL (eds) Artificial intelligence on medical data. Lecture notes in computational vision and biomechanics, vol 37. Springer, Singapore. https://doi.org/10.1007/978-981-19-0151-5_19 Pandey BK et al (2022) Effective and secure transmission of health information using advanced morphological component analysis and image hiding. In: Gupta M, Ghatak S, Gupta A, Mukherjee AL (eds) Artificial intelligence on medical data. Lecture notes in computational vision and biomechanics, vol 37. Springer, Singapore. https://​doi.​org/​10.​1007/​978-981-19-0151-5_​19
27.
Zurück zum Zitat Kosuru VSR, Kavasseri Venkitaraman A (2023) A smart battery management system for electric vehicles using deep learning-based sensor fault detection. World Electr Veh J 14(4):101 Kosuru VSR, Kavasseri Venkitaraman A (2023) A smart battery management system for electric vehicles using deep learning-based sensor fault detection. World Electr Veh J 14(4):101
28.
Zurück zum Zitat Htun HH, Biehl M, Petkov N (2023) Survey of feature selection and extraction techniques for stock market prediction. Financ Innov 9(1):26CrossRef Htun HH, Biehl M, Petkov N (2023) Survey of feature selection and extraction techniques for stock market prediction. Financ Innov 9(1):26CrossRef
29.
Zurück zum Zitat Dushyant K, Muskan G, Gupta A, Pramanik S (2022) Utilizing machine learning and deep learning in cybesecurity: an innovative approach. Cyber Secur Digit Forensics, 271–293 Dushyant K, Muskan G, Gupta A, Pramanik S (2022) Utilizing machine learning and deep learning in cybesecurity: an innovative approach. Cyber Secur Digit Forensics, 271–293
30.
Zurück zum Zitat Bharadiya J (2023) Machine learning in cybersecurity: techniques and challenges. Eur J Technol 7(2):1–14CrossRef Bharadiya J (2023) Machine learning in cybersecurity: techniques and challenges. Eur J Technol 7(2):1–14CrossRef
31.
Zurück zum Zitat Bhoj N, Bhadoria RS (2022) Time-series based prediction for energy consumption of smart home data using hybrid convolution-recurrent neural network. Telemat Inf 75:101907CrossRef Bhoj N, Bhadoria RS (2022) Time-series based prediction for energy consumption of smart home data using hybrid convolution-recurrent neural network. Telemat Inf 75:101907CrossRef
32.
Zurück zum Zitat Amini M, Rahmani A (2023) Machine learning process evaluating damage classification of composites. Int J Sci Adv Technol 9(2023):240–250 Amini M, Rahmani A (2023) Machine learning process evaluating damage classification of composites. Int J Sci Adv Technol 9(2023):240–250
33.
Zurück zum Zitat Ben Jabeur S, Stef N, Carmona P (2023) Bankruptcy prediction using the XGBoost algorithm and variable importance feature engineering. Comput Econ 61(2):715–741CrossRef Ben Jabeur S, Stef N, Carmona P (2023) Bankruptcy prediction using the XGBoost algorithm and variable importance feature engineering. Comput Econ 61(2):715–741CrossRef
34.
Zurück zum Zitat Dahou A, Chelloug SA, Alduailij M, Elaziz MA (2023) Improved feature selection based on chaos game optimization for social internet of things with a novel deep learning model. Mathematics 11(4):1032CrossRef Dahou A, Chelloug SA, Alduailij M, Elaziz MA (2023) Improved feature selection based on chaos game optimization for social internet of things with a novel deep learning model. Mathematics 11(4):1032CrossRef
35.
Zurück zum Zitat Bansal R, Gupta A, Singh R, Nassa VK (2021) Role and impact of digital technologies in E-learning amidst COVID-19 pandemic. In: 2021 fourth international conference on computational intelligence and communication technologies (CCICT). IEEE, pp 194–202 Bansal R, Gupta A, Singh R, Nassa VK (2021) Role and impact of digital technologies in E-learning amidst COVID-19 pandemic. In: 2021 fourth international conference on computational intelligence and communication technologies (CCICT). IEEE, pp 194–202
36.
Zurück zum Zitat Juneja A, Bajaj S, Anand R, Sindhwani N (2020) Improvising green computing using multi-criterion decision making. J Adv Res Dyn Control Syst 12(3):10–5373 Juneja A, Bajaj S, Anand R, Sindhwani N (2020) Improvising green computing using multi-criterion decision making. J Adv Res Dyn Control Syst 12(3):10–5373
37.
Zurück zum Zitat Meivel S, Sindhwani N, Valarmathi S, Dhivya G, Atchaya M, Anand R, Maurya S (2022) Design and method of 16.24 GHz microstrip network antenna using underwater wireless communication algorithm. In: Cyber technologies and emerging sciences: ICCTES 2021. Springer Nature Singapore, Singapore, pp 363–371 Meivel S, Sindhwani N, Valarmathi S, Dhivya G, Atchaya M, Anand R, Maurya S (2022) Design and method of 16.24 GHz microstrip network antenna using underwater wireless communication algorithm. In: Cyber technologies and emerging sciences: ICCTES 2021. Springer Nature Singapore, Singapore, pp 363–371
38.
Zurück zum Zitat Sindhwani N, Anand R, Nageswara Rao G, Chauhan S, Chaudhary A, Gupta A, Pandey D (2023) Comparative analysis of optimization algorithms for antenna selection in MIMO systems. In: Advances in signal processing, embedded systems and IoT: proceedings of seventh ICMEET-2022. Springer Nature Singapore, Singapore, pp 607–617 Sindhwani N, Anand R, Nageswara Rao G, Chauhan S, Chaudhary A, Gupta A, Pandey D (2023) Comparative analysis of optimization algorithms for antenna selection in MIMO systems. In: Advances in signal processing, embedded systems and IoT: proceedings of seventh ICMEET-2022. Springer Nature Singapore, Singapore, pp 607–617
39.
Zurück zum Zitat Kaura C, Sindhwani N, Chaudhary A (Mar 2022) Analysing the impact of cyber-threat to ICS and SCADA systems. In: 2022 international mobile and embedded technology conference (MECON). IEEE, pp 466–470 Kaura C, Sindhwani N, Chaudhary A (Mar 2022) Analysing the impact of cyber-threat to ICS and SCADA systems. In: 2022 international mobile and embedded technology conference (MECON). IEEE, pp 466–470
Metadaten
Titel
Intelligent Feature Engineering and Feature Selection Techniques for Machine Learning Evaluation
verfasst von
Janjhyam Venkata Naga Ramesh
Ajay kushwaha
Tripti Sharma
A. Aranganathan
Ankur Gupta
Sanjiv Kumar Jain
Copyright-Jahr
2024
Verlag
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-0700-3_56