ارائه یک روش مبتنی بر هستان شناسی برای کاوش قواعد هم آیی در محیط های توزیع‌شده چندعاملی

صابری, حسین; کنگاوری, محمدرضا; حسنی آهنگر, محمد رضا

ارائه یک روش مبتنی بر هستان شناسی برای کاوش قواعد هم آیی در محیط های توزیع‌شده چندعاملی

نوع مقاله : مقاله پژوهشی

نویسندگان

¹ دانشگاه جامع امام حسین(ع)

² دانشیار دانشگاه علم و صنعت ایران

چکیده

کاوش قواعد همآیی توزیع‌شده یکی از روشهای مهم داده‌کاوی است که وابستگی بین اقلام داده‌ای را از منابع داده‌ای غیرمتمرکز، بدون توجه به مکان فیزیکی آن‌ها و بر مبنای فرآیند استخراج اقلام مکرر استخراج می‌کند. هنگامیکه الگوریتمهای کاوش روی دادههای بزرگ‌مقیاس اجرا میشوند، مقدار زیادی اقلام مکرر تولید میگردد که بسیاری از آن‌ها غیرمرتبط، مبهم و غیرقابل استفاده برای کسب و کار است و سبب بروز چالشی به نام " انفجار ترکیبی" خواهد شد. در این مقاله یک روش ائتلافی جدید مبتنی بر دادهکاوی توزیع شده و هستانشناسی دامنه که به‌اختصار DARMASO نامیده میشود برای برخورد با این چالش پیشنهاد شده است. این روش از سه الگوریتم به نام ARMASOMAIN جهت هدایت و کنترل فرآیند کاوش و تجمیع قواعد همآیی، DARMASOPRU برای کاهش و هرس داده‌ها و الگوریتم DARMASOINT برای کاوش و تجمیع قواعد همآیی تولیدشده از منابع دادهای توزیع‌شده استفاده میکند. DARMASO از یک الگوی محاسباتی توزیع‌شده مبتنی بر چارچوب نگاشت-کاهش در محیط توزیع‌شده چندعاملی استفاده میکند. همچنین یک روش عملی را برای کاوش معنایی از مجموعه دادههای بزرگ‌مقیاس فراهم میکند. این روش، قواعد همآیی را مبتنی بر اهداف دادهکاوی و نیاز کاربر فیلتر کرده و فقط قواعد مفید را تولید و نگهداری میکند.کاهش فضای کاوش و فیلترسازی قواعد، با فرآیند هرس معنایی در قالب حذف نامزدهای نامناسب از مجموعه اقلام مکرر و تولید قواعد همآیی سودمند حاصل میشود. پیاده‌سازی با استفاده از یک مجموعه دادهای از دامنه حوادث طبیعی و کلاس زمین‌لرزه انجام شده است. نتایج ارزیابی با استفاده از معیارهای کمی و کیفی نشان میدهد، الگوریتمهای ارائه‌شده در DARMASO ، فضای کاوش را به میزان قابل توجهی کاهش میدهد. همچنین سرعت و کیفیتِ استخراج قواعد را بهبود بخشیده و قواعدِ کاربردی، مطمئن، منطقی، با کیفیت و ارزشمندی را برای پشتیبانی از تصمیم‌گیری از میان انبوه دادهها تولید میکند.

کلیدواژه‌ها

DOR: 20.1001.1.23224347.1400.9.1.1.8

عنوان مقاله [English]

Providing an Ontology-Based Method for Exploring the Association Rules in Multi-Agent Distributed Environments

نویسندگان [English]

H. Saberi ¹
M. R. Kangaavari ²
M. R. Hasani Ahangar ¹

¹ ihu

² iust.ac.ir

چکیده [English]

Distributed association rules mining is one of the most important data mining methods that extracts the inter dependence of data items from decentralized data sources, regardless of their physical location and is based on the process of extracting repeated items. When exploration algorithms are implemented on large-scale data, a large number of recurring items are produced, many of which are irrelevant, ambiguous, and unusable for the business, thus causing a challenge called "combination explosion ". In this paper, a new coalition method based on distributed data mining and domain archeology, abbreviated to DARMASO, is proposed to address this challenge. This method uses three algorithms: the DARMASOMAIN algorithm to guide and control the process of exploration and aggregation of universal rules, the DARMASOPRU algorithm to reduce and prune the data and the DARMASOINT algorithm to explore and aggregate the rules of all the generated data sources. DARMASO uses a map-reduce-based distributed computational model in a multi-agent distributed environment. It also provides a practical way for semantic mining of large-scale data sets. This method filters out the association rules of generality based on the purposes of data mining as well as the needs of the user and only produces and maintains useful rules. Reducing the scope of exploration and filtration of rules is achieved through the process of semantic pruning in the form of removing inappropriate candidates from the set of frequent items and producing association rules of utility. The implementation is performed using a data set from the scope of natural disasters and the earthquake class. It also improves the speed and quality of rule extraction and generates practical, reliable, logical, quality and valuable rules to support decision-making amid the masses of data.

کلیدواژه‌ها [English]

Association Rules
Ontology
Multi Agent Systems
Mapping-Reduction

مراجع

[1] D. Patel and J. Shah, “Jade Agent Framework for Distributed Data Mining and Pattern Analysis,” International Journal of Computer Applications, vol. 975, p. 8887, 2017.##

[2] H. M. Safhi, B. Frikh, and B. Ouhbi, “Assessing reliability of Big Data Knowledge Discovery process,” Procedia computer science, vol. 148, pp. 30-36, 2019.##

[3] R. M. Gahar, O. Arfaoui, M. S. Hidri, and N. B. Hadj-Alouane, “An Ontology-driven MapReduce Framework for Association Rules Mining in Massive Data,” Procedia Computer Science, vol. 126, pp. 224-233, 2018.##

[4] B. Eine, M. Jurisch, and W. Quint, “Ontology-based big data management,” Systems, vol. 5, no. 3, p. 45, 2017.##

[5] P. V. Bhagat, P. M. J. I. J. O. R. Gourshettiwar, I. T. i. Computing, and Communication, “A survey paper on ontology-based approaches for semantic data mining,” vol. 3, no. 4, pp. 2137-2141, 2015.##

[6] M. R. Chikhale, “Study of Distributed Data Mining Algorithm and Trends,” IOSR Journal of Computer Engineering (IOSR-JCE), pp. 41-47, 2016.##

[7] D. Dou, H. Wang, and H. Liu, “Semantic data mining: A survey of ontology-based approaches,” In Proceedings of the 2015 IEEE 9th international conference on semantic computing (IEEE ICSC 2015), IEEE, pp. 244-251, 2015.##

[8] V. S. Ms and K. J. P. C. S. Shah, “Performance evaluation of distributed association rule mining algorithms,” vol. 79, pp. 127-134, 2016.##

[9] R. Agrawal, T. Imieliński, and A. Swami, “Mining association rules between sets of items in large databases,” In Proceedings of the 1993 ACM SIGMOD international conference on Management of data, 1993, pp. 207-216.##

[10] Z. Farzanyar, “Development of Algorithms for Detecting Frequent Items Set to Large-Scale Peer-to-Peer Environments with Flow Data Attitudes,” Iran University of Science and Technology, 1391. (In Persian)##

[11] D. Apiletti, E. Baralis, T. Cerquitelli, P. Garza, F. Pulvirenti, and L. Venturini, “Frequent itemsets mining for big data: a comparative analysis,” Big data research, vol. 9, pp. 67-83, 2017.##

[12] D. Xia, X. Lu, H. Li, W. Wang, Y. Li, and Z. Zhang, “A MapReduce-based parallel frequent pattern growth algorithm for spatiotemporal association analysis of mobile trajectory big data,” Complexity, 2018.##

[13] D. C. Anastasiu, J. Iverson, S. Smith, and G. Karypis, “Big data frequent pattern mining,” In Frequent pattern mining: Springer, pp. 225-259, 2014.##

[14] A. G. Touzi, H. B. Massoud, and A. Ayadi, “Automatic ontology generation for data mining using fca and clustering,” arXiv preprint arXiv:1311.1764, 2013.##

[15] D. A. Koutsomitropoulos and A. K. Kalou, “A standards-based ontology and support for Big Data Analytics in the insurance industry,” ICT Express, vol. 3, no. 2, pp. 57-61, 2017.##

[16] A. Soylu et al., “Ontology-based Visual Querying with OptiqueVQS: Statoil and Siemens Cases,” 2016.##

[17] S. Urmela and M. Nandhini, “Approaches and Techniques of Distributed Data Mining: A Comprehensive Study,” International Journal of Engineering and Technology (IJET), 2017.##

[18] S. Patil, S. Karnik, and V. Sawant, “A Review on Multi-Agent Data Mining Systems,” International Journal of Computer Science and Information Technologies, vol. 6, no. 6, pp. 4888-4893, 2015.##

[19] F. Jiang, “Efficient frequent pattern mining from big data and its applications,” Ph.D. Thesis, Department of Computer ScienceThe University of Manitoba Winnipeg, Manitoba, Canada, 2014.##

[20] Y. Lin, P.-C. Huang, D. Liu, and L. Liang, “Scalable frequent-pattern mining on nonvolatile memories,” In 2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC), IEEE, pp. 578-583, 2017.##

[21] Bhamra, Gurpreet S Verma, Anil K Patel, Ram B, “Agent Based Frameworks for Distributed Association Rule Mining: An Analysis,” International Journal in Foundations of Computer Science & Technology (IJFCST), 2015.##

[22] H. Saberi, M. R. Kangavari, and M. R. H. Ahangar, “Provide an agent-oriented architecture for semantic exploration of large-scale data in distributed environments,” Scientific-Research Journal of Electronic and Cyber Defense, 1398. [Online]. Available: https://ecdj.ihu.ac.ir, In Persian.##

[23] H. M. Basir, H. Saberi, and M. A. Javadzadeh, “Provide a method for selecting data based on the ontology of the organization's goals,” presented at the Fifth National Conference on Defense Science and Engineering, Tehran, 1398. (In Persian)##

[24] A. H. Yazdi, “Extract association rules from semantic data streams,” Computer Engineering, Ferdowsi University of Mashhad, 1392. [Online]. Available: http://www.um.ac.ir, In Persian##