A Cloud-Based Enhanced Discretized Support Vector Classifier for Scalable Big Data Prediction

Authors

  • Alaa Abdullhussain Hussain Sumer University, college of Management and Economics, Iraq

Keywords:

EEDSV-CP, High-performance clusters, classification, hyperplanes

Abstract

Big data is a huge amount of data that is such a large amount that it is difficult to process using conventional methods of database and software. When using big data-related applications technical barriers are encountered when moving data between different locations that is costly and requires massive main memory for processing. Big data is a term used to describe interactions and transactions of data in relation to their magnitude and complexity that go beyond the normal technical capability of the capture, organization and processing of data within the cloud. It features real-time processing of data which runs in high-performance clusters. Applications that use big data are designed to share structured and unstructured information. They collect the data in a way that allows for speedier response and reduce the time for classification. Similarly, in this paper, a Discretized Support Vector Classification and Prediction (EEDSV-CP) model is suggested to provide effectual computation upon huge data apps and sharing in a cloud computing environment. Originally, pre-processing was carried out in the EEDSV-CP model using interval equivalence discretization, which aids in the removal of noise and erratic data obtained out of various sources. The computation temporal and spatial complexity are mitigated out by denoising and inconsistizing the data. Furthermore, the EEDSV-CP model employs a supportive vector prediction classifier to categorize data centered upon user query request by employing parallel hyperplanes, with the aim of increasing classification accuracy of customer data requesting on big data. The proposed EEDSV-CP precisely predicts the customer data requesting on big data with the classified data.

References

V. Casola, A. De Benedictis, J. Modic, M. Rak and U. Villano, "Perservice Security SLA: a New Model for Security Management in Clouds," 2016 IEEE 25th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE), 2016, pp. 83-88.

Centre of Protection of National Infrastructure, Information Security Briefing: Cloud Computing, [Online]. Available: https://www.cpni.gov.uk/system/files/documents/1f/8d/cloud-computing-briefing.pdf

C. Yang, Q. Huang, Z. Li, K. Liu and F. Hu, "Big Data and Cloud Computing: Innovation Opportunities and Challenges," International Journal of Digital Earth, vol. 10, no. 1, pp. 15-53, 2017.

J. Cao, H. Cui, H. Shi and L. Jiao, "Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce," PLoS ONE, vol. 11, no. 6, pp. e0157551, 2016.

I. Ha, B. Back and B. Ahn, "MapReduce Functions to Analyze Sentiment Information from Social Big Data," International Journal of Distributed Sensor Networks, vol. 11, no. 6, pp. 1-11, 2015.

D. Agrawal and P. Kulurkar, "A cloud-based system for enhancing security of android devices using modern encryption standard-II algorithm," International Journal of Innovations and Advancement in Computer Science, vol. 5, no. 4, pp. 60-69, 2016.

E. Ezhilarsan and M. Dinakaran, "Secure Big Data Storage Using Training Dataset Filtering-K Nearest Neighbour Classification with Elliptic Curve Cryptography," Journal of Computational and Theoretical Nanoscience, vol. 15, no. 6-7, pp. 2437-2442, 2018.

J. Cao and Z. Lin, "Extreme Learning Machines on High Dimensional and Large Data Applications: A Survey," Mathematical Problems in Engineering, vol. 2015, pp. 1-21, 2015.

J. Chase, D. Niyato, P. Wang, S. Chaisiri and R. Ko, "A Scalable Approach to Joint Cyber Insurance and Security-as-a-Service Provisioning in Cloud Computing," IEEE Transactions on Dependable and Secure Computing, 2017.

P. D. Diamantoulakis, V. M. Kapinas and G. K. Karagiannidis, "Big Data Analytics for Dynamic Energy Management in Smart Grids," Big Data Research, vol. 2, no. 3, pp. 94-101, 2015.

I. D. Dinov et al., "Predictive Big Data Analytics: A Study of Parkinson’s Disease Using Large, Complex, Heterogeneous, Incongruent, Multi-Source and Incomplete Observations," PLoS ONE, vol. 11, no. 8, pp. e0157077, 2016.

B. Kalyani and Y. V. Reddy, "Big data and cloud-based health care records monitoring using deep learning technology," Journal of Critical Reviews, vol. 7, no. 12, pp. 5192-5201, 2020.

G. Gao, R. Li, H. He and Z. Xu, "Distributed caching in unstructured peer-to-peer file sharing networks," Computers and Electrical Engineering, vol. 40, no. 2, pp. 688-703, 2014.

S. Garcia, J. Luengo, J. A. Sáez, V. Lopez and F. Herrera, "A survey of discretization techniques: Taxonomy and empirical analysis in supervised learning," IEEE Transactions on Knowledge and Data Engineering, vol. 25, no. 4, pp. 734-750, 2013.

C. S. Dule and H. A. Girijamma, "Content an Insight to Security Paradigm for Big Data on Cloud: Current Trend and Research," International Journal of Electrical and Computer Engineering, vol. 7, no. 5, pp. 2873-2882, 2017.

M. H. U. Rehman and A. Batool, "The Concept of Pattern based Data Sharing in Big Data Environments," International Journal of Database Theory and Application, vol. 8, no. 4, pp. 11-18, 2015.

S. Ramírez-Gallego et al., "Data discretization: taxonomy and big data challenge," Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol. 6, no. 1, pp. 5–21, 2016, doi: 10.1002/widm.1173.

Downloads

Published

2025-10-01

How to Cite

Hussain, A. A. (2025). A Cloud-Based Enhanced Discretized Support Vector Classifier for Scalable Big Data Prediction. Vital Annex: International Journal of Novel Research in Advanced Sciences (2751-756X), 4(9), 378–388. Retrieved from https://journals.innoscie.com/index.php/ijnras/article/view/119

Issue

Section

Articles