Scalable Data Integration System using Representational State Transfer
Keywords:Data integration, distributed computing, architectural approach, REST-based
Data integration from remote data sources is complicated, due to a large number of heterogeneous information sources that are usually stored in different data schema. This topic is challenging and requires the management of information system enterprises to be able to integrate distributed data and applications effectively. Numerous methodologies and systems have been proposed and developed to address the related issues from different aspects. However, there is still a lack of methodological support and appropriate data integration development in an internet environment that requires good performance and scalability on large-scale data execution. The concept of centralized design for the integration of distributed data repositories as server-based processing solves the traditional problem of tight coupling, but does not address the integrated logic task. To solve this problem, this paper describes a data Integrated Client-Rich approach (ICR) and implementation of Thesis Core System (TCS) data integration system that follows the Representational State Transfer (REST) architectural principle for system implementation. REST is a new principle for system development that has good support for development tools to meet the needs of data integration, and obtain the advantages of simplicity, scalability, and performance. We design a testing scheme to evaluate and analyze the scalability of traditional data integration with server-based integrated processing and TCS data integration. The results show that our approach is more suitable, scalable, and practical for Internet-scale distributed data integration.
P Budny, S Govindharaj and K Schwan. Worldtravel: A testbed for service-oriented applications. In: Proceedings of the 6th International Conference Sydney on Service-Oriented Computing 2008. Springer-Verlag, Sydney, Australia, 2008, p. 438-52.
S Finkelstein. Common expression analysis in database applications. In: Proceedings of the ACM SIGMOD International Conference on Management of Data. ACM Press, Orlando, Florida, USA, 1982, p. 235-45.
JA Blakeley, PA Larson and FW Tompa. Efficiently updating materialized views. In: Proceedings of the ACM SIGMOD Conference on Management of Data. ACM Press, Washington DC, USA, 1986, p. 61-71.
JJ Lu, G Moerkotte, J Schue and VS Subrahmanian. Efficient maintenance of materialized mediated views. SIGMOD Record. 1995; 24, 340-51.
B Ludäscher, Y Papakonstantinou and P Velikhov. Navigation-driven evaluation of virtual mediated views. In: Proceedings of 7th the International Conference on Extending Database Technology 2000. Springer-Verlag, London, UK, 2000, p. 150-65.
A Silberschatz, M Stonebraker and JD Ullman. Database systems: Achievements and opportunities. Commun. ACM 1995; 34, 110-20.
L Zamboulis, N Martin and A Poulovassilis. Query performance evaluation of an architecture for fine-grained integration of heterogeneous grid data sources. Future Gener. Comput. Syst. 2010; 26, 1073-91.
O Rihawi, Y Secq and P Mathieu. Load-balancing for large scale situated agent-based simulations. Proc. Comput. Sci. 2015; 51, 90-9.
B Yildiz and GC Fox. Toward a modular and efficient distribution for Web service handlers. Concurr. Comput. Pract. E 2013; 25, 410-26.
J Kampars and J Grabis. An approach to parallelization of remote data integration tasks. Sci. J. Tech. Univ. 2011; 45, 24-30.
G Chen, Y Wu, J Liu, G Yang and W Zheng. Optimization of sub-query processing in distributed data integration systems. J. Netw. Comput. Appl. 2011; 34, 1035-42.
RT Fielding. 2000, Architectural Styles and the Design of Network-Based Software Architectures. Ph. D. Dissertation. University of California, Irvine.
M Jian, M Shujun and Y Zhao. RESTful web services: A solution for distributed data integration. In: Proceedings of the International Conference on Computational Intelligence and Software Engineering. Wuhan, China, 2009, p. 1-4.
S Zhang, F Zhang and B Wang. Data integration based on REST and IEC61970 for smart grid. Electric Power Autom. Equip. 2012; 32, 124-9.
X Feng, J Shen and Y Fan. REST: An alternative to RPC for web services architecture. In: Proceedings of the 1st International Conference on Future Information Networks 2009. Beijing China, p. 7-10.
F Belqasmi, R Glitho and C Fu. RESTful web services for service provisioning in next-generation networks: A survey. IEEE Comm. Mag. 2011; 49, 66-73.
H Hamad, M Saad and R Abed. Performance evaluation of RESTful web services for mobile devices. Int. Arab J. e-Technol. 2010; 1, 72-8.
G Alonso, F Casati, H Kuno and V Machiraju. Web Services - Concepts, Architectures and Applications. Data-Centric Systems and Applications. 1st ed. Springer Publishing Company, New York, 2004.
M Frehner and M Brandli. Virtual database: Spatial analysis in a Web-based data management system for distributed ecological data. Environ. Model. Softw. 2006; 21, 1544-54.
WW Eckerson. Three Tier Client/Server architecture: Achieving scalability, performance, and efficiency in client server applications. Open Inf. Syst. J. 1995; 3, 10.
A Tarhini. Concept of Three Tier Architecture. Available at: https://alitarhini. wordpress.com/ 2011/01/22/concepts-of-three-tier-architecture, accessed April 2014.
S Ramani. A modern architecture for open source cloud computing. Int. J. Adv. Res. Comput. Sci. 2011; 2, 95-8.
H Zhou, Q Duan and Z Liang. A multiple-tier distributed data integration architecture. In: Proceedings of the 7th World Conference on Integrated Design and Process Technology. Texas, 2003, p. 153-9.
P Vongsumedh and A Sukstrienwong. A framework for developing the Web-based data integration tool for Web-Oriented data warehousing. In: Proceedings of the 2013 International Conference on Systems, Control and Informatics. Greece, 2013, p. 130-5.
C Cimen, Y Kavurucu and H Aydin. Usage of thin-client/server architecture in computer aided education. Turk. Online J. Educ. Tech. 2014; 13, 181-5.
A Mesbah and AV Deursen. A component- and push-based architectural style for Ajax applications. J. Syst. Softw. 2008; 81, 2194-209.
D Chen, R Chirkova, F Sadri and TJ Salo. Query optimization in information integration. Acta Inform. 2013; 50, 257-87.
M Karnstedt, KU Sattler and M Hauswirth. Scalable distributed indexing and query processing over Linked Data. Web Semant. 2012; 10, 3-32.
TW Kang and CH Hong. A study on software architecture for effective BIM/GIS-based facility management data integration. Autom. Constr. 2015; 54, 25-38.
A Datta and H Thomas. A conceptual model and algebra for on-line analytical processing in data warehouses. In: Proceedings of the 7th Workshop for Information technology and Systems. Citeseer, 1997, p. 91-100.
S Lynden, A Mukherjee, AC Hume, AAA Fernandes, NW Paton, R Sakellariou and P Watson. The design and implementation of OGSA-DQP: A service-based distributed query processor. Future Gener. Comp. Syst. 2009; 25, 224-36.
C Pahl and ZHU Yaoling. Data integration in mediated service compositions. Comput. Inform. 2012; 31, 1129-49.
M Lablans, A Borg and F Uckert. A RESTful interface to pseudonymization services in modern web applications. BMC Med. Inform. Decis. 2015; 15, 2.
H Lee and MR Mehta. Defense against REST-based web service attacks for enterprise systems. Comm. Int. Inform. Manag. Assoc. 2013; 13, 57-68.
J Parra, MA Hossain, A Uribarren and E Jacob. RESTful discovery and eventing for service provisioning in assisted living environments. Sensors 2014; 14, 9227-46.
C Pautasso. RESTful Web service composition with BPEL for REST. Data Knowl. Eng. 2009; 68, 851-66.
A Dey, A Fekete and U Rohm. REST+T: Scalable transactions over HTTP. In: Proceedings of the 2015 IEEE International Conference on Cloud Engineering. Tempe Arizona, USA, 2015, p. 36-41.
OF Luis. 2010, Design and Development of a REST based Web Service Platform for Applications Integration. Master in Artificial Intelligence. Universitat Politecnica de Catalunya, Barcelona, Spain.
M Hourani, Q Shambour, A Al-Zubidy and A Al-Smadi. Proposed design and implementation for RESTful web server. J. Softw. 2014; 9, 1071-80.
JC Delgado. Architectural styles for distributed interoperability. Inform. Resour. Manag. J. 2013; 26, 40-65.
D Fuentes-Lorenzo, L Sanchez, A Cuadra and M Cutanda. A RESTful and semantic framework for data integration. Softw. Pract. Ex. 2015; 45, 1161-88.
Y Yuan, Y Wu, X Feng, J Li, G Yang and W Zheng. VDB-MR: MapReduce-based distributed data integration using virtual database. Future Gener. Comput. Syst. 2010; 26, 1418-25.
E Platen and D Heath. A Benchmark Approach to Quantitative Finance. 2nd ed. Springer-Verlag, Berlin, Germany, 2010, p. 1-720.
How to Cite
Copyright (c) 2016 Walailak Journal of Science and Technology (WJST)
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.