Leveraging Semantic Technologies in ETL Processes for Data Integration in Heterogeneous Environments
Main Article Content
Abstract
In an era where data is the cornerstone of decision-making, organizations face the challenge of integrating diverse data sources from heterogeneous environments. Traditional Extract, Transform, and Load (ETL) processes focus primarily on syntactic transformations, ensuring compatibility between disparate datasets. However, these approaches fall short in addressing semantic inconsistencies that arise from differences in meaning, context, and relationships among data [6][7]. This paper proposes a semantic-aware ETL framework that integrates ontology-based reasoning and dynamic rule generation to bridge this gap. The framework enhances data coherence and alignment by leveraging cutting-edge semantic technologies, including Apache Jena for ontology management, SPARQL for semantic querying, and Protégé for ontology design. Detailed analysis, design, and evaluation of the framework showcase its ability to revolutionize ETL processes in data warehousing, enabling robust integration of heterogeneous data sources.
Downloads
Download data is not yet available.
Article Details
How to Cite
Waseem Jeelani Bakshi, Dr. Shahzad Aasim, Dr. Muheet Ahmed Butt, & Dr. Majid Hussain Qadri. (2021). Leveraging Semantic Technologies in ETL Processes for Data Integration in Heterogeneous Environments. Educational Administration: Theory and Practice, 27(4), 1324–1328. https://doi.org/10.53555/kuey.v27i4.8982
Issue
Section
Articles