Dynamic Real-time Classification of Data Streams

Mohini Patil, Dnyandev Bagul, Mayur Indrekar, Shraddha Pawar

Abstract


To develop real time classification from high throughput of data stream (dynamic data) is one of the most challenging areas of big data analysis. In this proposed system we are using concept drift. (Changes of the pattern encoded in the stream over time). And imposes unique challenges in comparison with real time classification data mining from dynamic data. Several real-time classifications of data stream algorithms exist. The proposed system highlights the Fixed Width clustering, variable width calculation and global width clustering for data stream classifier. The result of these algorithms provides high accuracy, less time & high speed.

Full Text:

PDF

References


M. Ebbers, A. Abdel-Gayed, V. Budhi, F. Dolot, Addressing Data Volume, Velocity, and Variety with IBM InfoSphere Streams V3.0, 2013.

B. Babcock, S. Babu, M. Datar, R. Motwani, J. Widom, Models and issues in data stream systems, in: PODS, 2002, pp. 1–16.

M.M. Gaber, A. Zaslavsky, S. Krishnaswamy, Mining data streams: a review, ACM SIGMOD Rec. 34 (2005) 18–26.

M. Gaber, A. Zaslavsky, S. Krishnaswamy, A survey of classification methods in data streams, Data Streams (2007) 39–59.

J. Gama, Knowledge Discovery from Data Streams, Chapman and Hall / CRC, 2010.

T. Bujlow, T. Riaz, J. Pedersen, A method for classification of network traffic based on c5.0 machine learning algorithm, in: 2012 International Conference on Computing, Networking and Communications, ICNC, 2012, pp. 237–241.

A. Jadhav, A. Jadhav, P. Jadhav, P. Kulkarni, A novel approach for the

design of network intrusion detection system (NIDS), in: 2013 International Conference on Sensor Network Security Technology and Privacy Communication System, SNS PCS, 2013, pp. 22–27.

A. Salazar, G. Safont, A. Soriano, L. Vergara, Automatic credit card fraud detection based on non-linear signal processing, in: 2012 IEEE International Carnahan Conference on Security Technology, ICCST, 2012, pp. 207–212.

P. Domingos, G. Hulten, Mining high-speed data streams, in: Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’00, ACM, New York, NY, USA, 2000, pp. 71–80.

T. Le, F. Stahl, J.B. Gomes, M.M. Gaber, G.D. Fatta, Computationally efficient rule-based classification for continuous streaming data, in: Research and Development in Intelligent Systems XXXI, Springer International Publishing, 2014, pp. 21–34. M. Tennant et al. / Future Generation Computer Systems 75 (2017) 187–199 199


Refbacks

  • There are currently no refbacks.