> Publications > Basic Research Reports
Studies on Developing Transport Big Data Platform and its Application
||Sangjin HAN․ Kyunguk PARK․ Taijin SONG
November 30, 2017
||The Korea Transport Institute
Transport big data platform is the physical and virtual environment in which one user could share transport and transport-related big data with other users. In the platform, big data from one source are collected, processed, analyzed and integrated with big data from another source. Given that transport big data platform facilitates the utilization of public and private data and improve the efficiency of data analysis, it could be the powerful platform for data ecosystem.
To begin with, transport big data platform has an advantage over existing data platforms in that it could induce the integration of public data in transport and transport-related sectors. In Korea, it is true to say that there are many data platforms which are separately operated and not connected with other ones. If those platforms are connected and integrated into one system in which big data could be collected, processed, analyzed and utilized, it is expected that data sharing will be easier and quicker. And this would make data fusion and convergence more effective.
In addition, transport big data platform is needed to make sure of data in private sectors. The data of navigation, smart phones, and transit cards are the most frequently mentioned and utilized ones which are largely produced and collected in private sectors. As there is a cognizance that those data are looked upon as the property or asset of a private company, it is not easy to persuade private companies to share data managed by them. Furthermore, because some data are traded expensively, general public who have low purchasing power are usually restricted to access private data. To handle this obstacle, public sectors including government agencies in an agreement with private companies could purchase private data and share them within the transport big data platform.
Last but not least, the true and valuable insights from big data are as a result of data convergence, and data standardization is a prerequisite for this convergence. Data which are produced by different agencies inevitably have a variety of features in terms of type and content. The inconsistency from those varieties could lead to increasing the time and cost of data processing. If data standardization would be considered and executed in the stage of data collection and storing, the time and cost of data processing might be rapidly decreased that results in improve the efficiency of data utilization. It is expected that data standardization is naturally achieved within transport big data platform in which data are treated jointly.