Abstract
![CDATA[RDF models are widely used in the web of data due to their flexibility and similarity to graph patterns. Because of the growing use of RDFs, their volumes and contents are increasing. Therefore, processing of such massive amount of data on a single machine is not efficient enough, because of the response time and limited hardware resources. A common approach to overcome this limitation is cluster processing and huge datasets could benefit distributed cluster processing on Apache Hadoop. Because of using too much of hard disks, the processing time is usually inadequate. In this paper, we propose a partitiong approach based on Apache Spark for rapid processing of RDF data models. A key feature of Apache Spark is using main memory instead of hard disk, so the speed of data processing in our method is improved. We have evaluated the proposed method by runing SQL queris on RDF data which partitioned on the cluster and demonstrates improved performance.]]
Original language | English |
---|---|
Title of host publication | Proceedings of the 3rd International Conference on Web Research (ICWR), Tehran, Iran, 19-20 April, 2017 |
Publisher | IEEE |
Pages | 73-77 |
Number of pages | 5 |
ISBN (Print) | 9781538604205 |
DOIs | |
Publication status | Published - 2017 |
Event | International Conference on Web Research - Duration: 19 Apr 2017 → … |
Conference
Conference | International Conference on Web Research |
---|---|
Period | 19/04/17 → … |