Hello I'm

Julián Rojas

A Colombian living in Belgium .

I'm a data interoperability and Semantic Web researcher at IDLab (which makes me part of both IMEC and Ghent University). My research has focused mainly on the design of cost-efficient system architectures for scalable and interoperable data exchange at Web scale. I've had the opportunity of applying the results of my research mainly on the transportation domain, including public transport, logistics and infrastructure. However I've always aimed at desgining generic solutions that can be also applied and facilitate data exchange in various domains.

julianandres.rojasmelendez@ugent.be

+3293314959

Publications

Academic and peer-reviewed publications I have contributed to.

  • 2023

    Bringing IDE Support to JSON-LD with the Language Server Protocol

    Arthur Vercruysse , Julián Rojas , Pieter Colpaert ,

    ISWC 2023

    Abstract - JSON-LD is a popular data format used to describe and share semantic data on the web. However, creating and editing JSON-LD documents can be a challenging task, especially when dealing with complex contexts that include many properties. The existing JSON editing functionality may not suffice for developers, and a JSON-LD editor could greatly enhance their experience. In this paper, we introduce a JSON-LD Language Server based on the Language Server Protocol (LSP) that empowers text editors compatible with the LSP (e.g., Visual Studio Code and NeoVim) with IDE functionality, including autocompletion suggestions based on the defined context, semantic highlighting and renaming identifiers inside the document. We believe that a JSON-LD LSP will enhance developer ergonomics and promote its adoption. Moreover, we see high potential for additional features that can be added such as hovering, go-to-definition and code actions like flattening or structuring of JSON-LD documents. read less

    Abstract - read more

  • 2023

    YARRRML + LDES: Simultaneously Lowering Complexity from Knowledge Graph Generation and Publication

    Gerald Haesendonck , Ben De Meester , Julián Rojas , Dylan Van Assche , Pieter Colpaert ,

    ISWC 2023

    Abstract - Linked Data Event Streams (LDES) is an advanced Knowledge Graph (KG) publication specification aimed at continuous data source replication and synchronization with benefits such as data entities versioning and history retention while providing a self-descriptive API. However, building an LDES requires a high level of expertise in the Semantic Web ecosystem. In this demo paper, we show how we lower the complexity and need for expertise when using a more advanced KG publication method such as LDES by providing an extension point to YARRRML, a human-friendly way to configure KG generation via RML. Integrated in Matey, an online YARRRML editor, we show how little effort (adding five characters to the YARRRML syntax in the simplest case) allows (re)generating multiple versions of a KG as an Event Stream. As such, this extension provides an easy-to-use starting point for anyone wanting to create an LDES from non-semantic data. read less

    Abstract - read more

  • 2023

    A Rule-Based Software Agent on Top of Personal Data Stores

    Wout Slabbinck , Ruben Dedecker , Julián Rojas , Ruben Verborgh ,

    ISWC 2023

    Abstract - As a response to centralised platforms hoarding user data on the web, Personal Data Stores (PDSs) are becoming more prominent. In this emerging space of decentralized PDS platforms, we see a growing need for automated agents to take over tasks that are normally handled by those data platforms. These intelligent agents already have their place in our daily lives, however, they are currently largely missing from the domain of Personal Data Stores. To address this need, we created a demonstrator software web agent that acts on the real world and personal data via condition–action rules. We applied this demonstrator to a smart home environment use case, where smart home appliances can be actuated and monitored via a personal data store. The generic architecture used by the implementation leads to maximal re-use of the agent, enabling multi-agent systems and researching more complex use cases. read less

    Abstract - read more

  • 2022

    Publishing planned, live and historical public transport data on the Web with the Linked Connections framework

    Julián Rojas , Harm Delva , Pieter Colpaert , Ruben Verborgh ,

    Semantic Web Journal

    Abstract - Publishing transport data on the Web for consumption by others poses several challenges for data publishers. In addition to planned schedules, access to live schedule updates (e.g. delays or cancellations) and historical data is fundamental to enable reliable applications and to support machine learning use cases. However publishing such dynamic data further increases the computational burden for data publishers, resulting in often unavailable historical data and live schedule updates for most public transport networks. In this paper we apply and extend the current Linked Connections approach for static data to also support cost-efficient live and historical public transport data publishing on the Web. Our contributions include (i) a reference specification and system architecture to support cost-efficient publishing of dynamic public transport schedules and historical data; (ii) empirical evaluations on route planning query performance based on data fragmentation size, publishing costs and a comparison with a traditional route planning engine such as OpenTripPlanner; (iii) an analysis of potential correlations of query performance with particular public transport network characteristics such as size, average degree, density, clustering coefficient and average connection duration. Results confirm that fragmentation size influences route planning query performance and converges on an optimal fragment size per network. Size (stops), density and connection duration also show correlation with route planning query performance. Our approach proves to be more cost-efficient and in some cases outperforms OpenTripPlanner when supporting the earliest arrival time route planning use case. Moreover, the cost of publishing live and historical schedules remains in the same order of magnitude for server-side resources compared to publishing planned schedules only. Yet, further optimizations are needed for larger networks (> 1000 stops) to be useful in practice. Additional dataset fragmentation strategies (e.g. geospatial) may be studied for designing more scalable and performant Web API s that adapt to particular use cases, not only limited to the public transport domain. read less

    Abstract - read more

  • 2022

    Continuous generation of versioned collections' members with RML and LDES

    Dylan Van Assche , Sitt Min Oo , Julián Rojas , Pieter Colpaert ,

    ESWC 2022 - 3rd Knowledge Graph Construction Workshop

    Abstract - When evolving datasets are used to generate a knowledge graph, it is usually challenging to keep the graph synchronized in a timely manner when changes occur in the source data. Current approaches fully regenerate a knowledge graph in such cases, which may be time consuming depending on the data type, size, and update frequency. We propose a continuous knowledge graph generation approach that can be applied on different types of data sources. We describe continuously updating knowledge graph versions represented as a Linked Data Events Stream, and use an rml processor for rdf generation. In this paper, we present our approach and demonstrate it on different types of data such as bike-sharing, public transport timetables, and weather data. By describing entities with unique, immutable, and reproducible iris, we were able to identify changes in the original data collection, reducing the number of materialized triples and generation time. Our use-cases show the importance of mechanisms to derive unique and stable IRI strategies of data source updates, to enable efficient knowledge graph generation pipelines. In the future, we will extend our approach to handle deletions in data collections, and conduct an extensive performance evaluation. read less

    Abstract - read more

  • 2021

    Leveraging semantic technologies for digital interoperability in the European railway domain

    Julián Rojas , Marina Aguado , Polymnia Vasilopoulou , Ivo Velitchkov , Dylan Van Assche , Pieter Colpaert , Ruben Verborgh ,

    ISWC 2021

    Abstract - The European Union Agency for Railways is an European authority, tasked with the provision of a legal and technical framework to support harmonized and safe cross-border railway operations throughout the EU. So far, the agency relied on traditional application-centric approaches to support the data exchange among multiple actors interacting within the railway domain. This lead however, to isolated digital environments that consequently added barriers to digital interoperability while increasing the cost of maintenance and innovation. In this work, we show how Semantic Web technologies are leveraged to create a semantic layer for data integration across the base registries maintained by the agency. We validate the usefulness of this approach by supporting route compatibility checks, a highly demanded use case in this domain, which was not available over the agency’s registries before. Our contributions include (i) an official ontology for the railway infrastructure and authorized vehicle types, including 28 reference datasets; (ii) a reusable Knowledge Graph describing the European railway infrastructure; (iii) a cost-efficient system architecture that enables high-flexibility for use case development; and (iv) an open source and RDF native Web application to support route compatibility checks. This work demonstrates how data-centric system design, powered by Semantic Web technologies and Linked Data principles, provides a framework to achieve data interoperability and unlock new and innovative use cases and applications. Based on the results obtained during this work, ERA officially decided to make Semantic Web and Linked Data-based approaches, the default setting for any future development of the data, registers and specifications under the agency’s remit for data exchange mandated by the EU legal framework. The next steps, which are already underway, include further developing and bringing these solutions to a production-ready state. read less

    Abstract - read more

  • 2021

    A sustainable method for publishing interoperable open data on the web

    Raf Buyle , Brecht Van de Vyvere , Julián Rojas , Dwight Van Lancker , Eveline Vlassenroot , Mathias Van Compernolle , Stefan Lefever , Pieter Colpaert , Peter Mechant , Erik Mannens ,

    Data

    Abstract - Smart cities need (sensor) data for better decision-making. However, while there are vast amounts of data available about and from cities, an intermediary is needed that connects and interprets (sensor) data on a Web-scale. Today, governments in Europe are struggling to publish open data in a sustainable, predictable and cost-effective way. Our research question considers what methods for publishing Linked Open Data time series, in particular air quality data, are suitable in a sustainable and cost-effective way. Furthermore, we demonstrate the cross-domain applicability of our data publishing approach through a different use case on railway infrastructure-Linked Open Data. Based on scenarios co-created with various governmental stakeholders, we researched methods to promote data interoperability, scalability and flexibility. The results show that applying a Linked Data Fragments-based approach on public endpoints for air quality and railway infrastructure data, lowers the cost of publishing and increases availability due to better Web caching strategies. read less

    Abstract - read more

  • 2021

    Geospatially partitioning public transit networks for open data publishing

    Harm Delva , Julián Rojas , Pieter Colpaert , Ruben Verborgh ,

    Journal of Web Engineering

    Abstract - Public transit operators often publish their open data in a data dump, but developers with limited computational resources may not have the means to process all this data efficiently. In our prior work we have shown that geospatially partitioning an operator's network can improve query times for client-side route planning applications by a factor of 2.4. However, it remains unclear whether this works for all network types, or other kinds of applications. To answer these questions, we must evaluate the same method on more networks and analyze the effect of geospatial partitioning on each network separately. In this paper we process three networks in Belgium: (i) the national railways, (ii) the regional operator in Flanders, and (iii) the network of the city of Brussels, using both real and artificially generated query sets. Our findings show that on the regional network, we can make query processing 4 times more efficient, but we could not improve the performance over the city network by more than 12%. Both the network's topography, and to a lesser extent how users interact with the network, determine how suitable the network is for partitioning. Thus, we come to a negative answer to our question: our method does not work equally well for all networks. Moreover, since the network's topography is the main determining factor, we expect this finding to apply to other graph-based geospatial data, as well as other Link Traversal-based applications. read less

    Abstract - read more

  • 2021

    Publishing base registries as linked data event streams

    Dwight Van Lancker , Pieter Colpaert , Harm Delva , Brecht Van de Vyvere , Julián Rojas , Ruben Dedecker , Philippe Michiels , Raf Buyle , Annelies De Craene , Ruben Verborgh ,

    ICWE 2021

    Abstract - Fostering interoperability, Public Sector Bodies (PSBs) maintain datasets that should become queryable as an integrated Knowledge Graph (KG). While some PSBs allow to query a part of the KG on their servers, others favor publishing data dumps allowing the querying to happen on third party servers. As the budget of a PSB to publish their dataset on the Web is finite, PSBs need guidance on what interface to offer first. A core API can be designed that covers the core tasks of Base Registries, which is a well-defined term in Flanders for the management of authoritative datasets. This core API should be the basis on which an ecosystem of data services can be built. In this paper, we introduce the concept of a Linked Data Event Stream (LDES) for datasets like air quality sensors and observations or a registry of officially registered addresses. We show that extra ecosystem requirements can be built on top of the LDES using a generic fragmenter. By using hypermedia for describing the LDES as well as the derived datasets, agents can dynamically discover their best way through the KG, and server administrators can dynamically add or remove functionality based on costs and needs. This way, we allow PSBs to prioritize API functionality based on three tiers: (i) the LDES, (ii) intermediary indexes and (iii) querying interfaces. While the ecosystem will never be feature-complete, based on the market needs, PSBs as well as market players can fill in gaps as requirements evolve. read less

    Abstract - read more

  • 2020

    Decentralised open data publishing for the public transport route planning ecosystem

    Julián Rojas , Bert Marcelis , Eveline Vlassenroot , Mathias Van Compernolle , Pieter Colpaert , Ruben Verborgh ,

    African Minds and International Development Research Centre

    Abstract - Open data initiatives have created a revolution in the route planning ecosystem for the public transport sector. The creation of a large amount of route planning services like Google Maps, CityMapper or Navitia, has only been possible thanks to the availability of public transport data as open data. Ever since the disclosure of the London public transport data sources as open data (Hogge 2016) more public transport companies are following their lead around the world. The benefits obtained by disclosing public transport datasets as open data are diverse and influence the different actors present in the route planning ecosystem: public transport organisations in the role of data publishers for instance may increase their revenue streams as new and better information channels attract more travellers (UK Department for Transport et al. 2018). Also, new analysis and improvements to their operations become possible through feedback received from data reusers on areas where they do not collect data by themselves (e.g. crowdsourced data). read less

    Abstract - read more

  • 2020

    Geospatial partitioning of open transit data

    Harm Delva , Julián Rojas , Pieter-Jan Vandenberghe , Pieter Colpaert , Ruben Verborgh ,

    ICWE 2020

    Abstract - Public transit operators often publish their open data as a single data dump, but developers with limited computational resources may not be able to process all this data. Existing work has already focused on fragmenting the data by departure time, so that data consumers can be more selective in the data they process. However, each fragment still contains data from the entire operator’s service area. We build upon this idea by fragmenting geospatially as well as by departure time. Our method is robust to changes in the original data, such as the deletion or the addition of stops, which is crucial in scenarios where data publishers do not control the data itself. In this paper we explore popular clustering methods such as k-means and METIS, alongside two simple domain-specific methods of our own. We compare the effectiveness of each for the use case of client-side route planning, focusing on the ease of use of the data and the cacheability of the data fragments. Our results show that simply clustering stops by their proximity to 8 transport hubs yields the most promising results: queries are 2.4 times faster and download 4 times less data. More than anything though, our results show that the difference between clustering methods is small, and that engineers can safely choose practical and simple solutions. We expect that this insight also holds true for publishing other geospatial data such as road networks, sensor data, or points of interest. read less

    Abstract - read more

  • 2020

    Efficient live public transport data sharing for route planning on the Web

    Julián Rojas , Dylan Van Assche , Harm Delva , Pieter Colpaert , Ruben Verborgh ,

    ICWE 2020

    Abstract - Web-based information services transformed how we interact with public transport. Discovering alternatives to reach destinations and obtaining live updates about them is necessary to optimize journeys and improve the quality of travellers’ experience. However, keeping travellers updated with opportune information is demanding. Traditional Web APIs for live public transport data follow a polling approach and allocate all data processing on either data providers, lowering data accessibility, or data consumers, increasing the costs of innovative solutions. Moreover, data processing load increases further because previously obtained route plans are fully recalculated when live updates occur. In between solutions sharing processing load between clients and servers, and alternative Web API architectures were not thoroughly investigated yet. We study performance trade-offs of polling and push-based Web architectures to efficiently publish and consume live public transport data. We implement (i) alternative architectures that allow sharing data processing load between clients and servers, and evaluate their performance following polling- and push-based approaches; (ii) a rollback mechanism that extends the Connection Scan Algorithm to avoid unnecessary full route plan recalculations upon live updates. Evaluations show polling as a more efficient alternative on CPU and RAM but hint towards push-based alternatives when bandwidth is a concern. Clients update route plan results 8–10 times faster with our rollback approach. Smarter API design combining polling and push-based Web interfaces for live public transport data reduces the intrinsic costs of data sharing by equitably distributing the processing load between clients and servers. Future work can investigate more complex multimodal transport scenarios. read less

    Abstract - read more

  • 2020

    Velopark: A linked open data platform for bicycle parkings

    Julián Rojas , Pieter Morlion , Han Tambuyzer , Wout Baert , Pieter Colpaert , Ruben Verborgh ,

    ICWE 2020 - 2nd Semantics for Transport Workshop

    Abstract - Cycling as a mean of urban transportation is positively correlated with cleaner, healthier and happier cities. By providing more infrastructure, such as secure parking facilities, cities aim on attracting more cyclists. However, authoritative information about parking facilities is heavily decentralized and heterogeneous, which makes secure parking facilities harder to be discovered by cyclists. Can an open dataset about bike parkings be managed decentrally? In this paper, we present the results of the Velopark project, carried out in Belgium by different actors that include local public authorities, public transport operators and pro-cycling organizations. During the project execution we (i) introduced the Open Velopark Vocabulary as a common semantic data model; and (ii) implemented the Velopark platform, an open data publishing environment for both static and live authoritative parking data. So far, 1599 parking facilities were published through the Velopark platform, 31 different Belgian municipalities and 4 parking related organizations use the platform to describe, publish and manage their parking facilities. A common data publishing environment supports organizations for providing access to their information, while guaranteeing data reliability for cyclists. In future work we will further extend our data model to cover other kinds of infrastructure and bicycle-related services. read less

    Abstract - read more

  • 2020

    How to Prototype a Client-Side Route Planner for Helsinki with Routable Tiles and Linked Connections

    Julián Rojas , Harm Delva , Pieter Colpaert , Ruben Verborgh ,

    ICWE 2020 - 2nd Semantics for Transport Workshop

    Abstract - Route planning is key in application domains such as delivery services, tourism advice and ride sharing. Today’s route planning as a service solutions do not cover all requirements of each use case, forcing application developers to build their own self-hosted route planners. This quickly becomes expensive to develop and maintain, especially when it requires integrating data from different sources. We demo a configurable route planner that takes advantage of strategically designed data publishing approaches and performs data integration and query execution on the client. For this demonstrator, we (i) publish a Linked Connections interface for the public transit data in Helsinki, including live updates; (ii) integrate Routable Tiles, a tiled Linked Data version of OpenStreetMap road network and (iii) implement a graphical user interface, on top of the Planner.js SDK we have built, to display the query results. By moving the data integration to the client, we provide higher flexibility for application developers to customize their solutions according to their needs. While the querying might be slow today, these preliminary results already hint at different data publishing strategies that may increase query evaluation performance on the client-side. read less

    Abstract - read more

  • 2019

    Decentralized route planning across the web of data

    Julián Rojas ,

    ISWC 2019 - PhD Symposium

    Abstract - Wheelchair users looking for accessible public transport routes, tourists discovering attractive routes to go around a new city, or bicycle users trying to avoid highly polluted routes are some examples where highly individualized route planning is needed. Current route planning applications lack query flexibility. The types of queries supported by a route planner are only determined at design time and heavily depend on centralized pre-selected data sources. Integrating a new data source such as another transport mode, a different road network or wheelchair accessibility is not straightforward as it generally requires human intervention to extend the subjacent data model and route planning algorithm implementation. I investigate how relevant data sources available on the Web can be dynamically reused for answering custom queries, and thus allow creating more flexible and personalized route planning applications. Semantic Web and Linked Data technologies provide a common framework for data integration. Yet it is still unclear how relevant data can be automatically reused while remaining independent from specific route planning algorithm implementations. Preliminary work (i) tests the feasibility of solving route planning queries over live and static public transport data sources on the Web, (ii) explores the trade-offs of different Web APIs for publishing and consuming live data streams on the Web and (iii) introduces a Linked Data based approach for publishing road networks data. read less

    Abstract - read more

  • 2019

    Client-side route planning : preprocessing the OpenStreetMap road network for Routable Tiles

    Harm Delva , Julián Rojas , Ben Abelshausen , Pieter Colpaert , Ruben Verborgh ,

    State of the Map 2019

    Abstract - Travelers have higher expectations than current route planning providers can fulfill, yet new solutions struggle to break through. Matching user experience from existing applications is already challenging without the large-scale infrastructure most of them have at their disposal; additionally integrating datasets such as the road network, public transportation schedules, or even real time air quality data is an even more laborious endeavour. read less

    Abstract - read more

  • 2019

    Decentralized publication and consumption of transfer footpaths

    Harm Delva , Julián Rojas , Pieter Colpaert , Ruben Verborgh ,

    SEMANTiCS 2019 - 1st Semantics for Transport Workshop

    Abstract - Users expect route planners that combine all modes of transportation to propose good journeys to their destination. These route planners use data from several sources such as road networks and schedule-based public transit. We focus on the link between the two; specifically, the walking distances between stops. Research in this field so far has found that computing these paths dynamically is too slow, but that computing all of them results in a quadratically scaling number of edges which is prohibitively expensive in practice. The common solution is to cluster the stops into small unconnected graphs, but this restricts the amount of walking and has a significant impact on the travel times. Moreover, clustering operates on a closed-world assumption, which makes it impractical to add additional public transit services. A decentralized publishing strategy that fixes these issues should thus (i) scale gracefully with the number of stops; (ii) support unrestricted walking; (iii) make it easy to add new services and (iv) support splitting the work among several actors. We introduce a publishing strategy that is based on the Delaunay triangulation of public transit stops, where every triangle edge corresponds to a single footpath that is precomputed. This guarantees that all stops are reachable from one another, while the number of precomputed paths increases linearly with the number of stops. Each public transit service can be processed separately, and combining several operators' can be done with a minimal amount of work. Approximating the walking distance with a path along the triangle edges overestimates the actual distance by 20% on average. Our results show that our approach is a middle-ground between completeness and practicality. It consistently overestimates the walking distances, but this seems workable since overestimating the time needed to catch a connection is arguably better than recommending an impossible journey. The estimates could still be improved by combining the great-circle distance with our approximations. Alternatively, different triangulations could be combined to create a more complete graph. read less

    Abstract - read more

  • 2019

    Republishing OpenStreetMap's roads as linked routable tiles

    Pieter Colpaert , Ben Abelshausen , Julián Rojas , Harm Delva , Ruben Verborgh ,

    ESWC 2019 - Poster and Demos

    Abstract - Route planning providers manually integrate different geo-spatial datasets before offering a Web service to developers, thus creating a closed world view. In contrast, combining open datasets at runtime can provide more information for user-specific route planning needs. For example, an extra dataset of bike sharing availabilities may provide more relevant information to the occasional cyclist. A strategy for automating the adoption of open geo-spatial datasets is needed to allow an ecosystem of route planners able to answer more specific and complex queries. This raises new challenges such as (i) how open geo-spatial datasets should be published on the Web to raise interoperability, and (ii) how route planners can discover and integrate relevant data for a certain query on the fly. We republished OpenStreetMap’s road network as “Routable Tiles” to facilitate its integration into open route planners. To achieve this, we use a Linked Data strategy and follow an approach similar to vector tiles. In a demo, we show how client-side code can automatically discover tiles and perform a shortest path algorithm. We provide four contributions: (i) we launched an open geo-spatial dataset that is available for everyone to reuse at no cost, (ii) we published a Linked Data version of the OpenStreetMap ontology, (iii) we introduced a hypermedia specification for vector tiles that extends the Hydra ontology, and (iv) we released the mapping scripts, demo and routing scripts as open source software. read less

    Abstract - read more

  • 2019

    Open data sectors and communities : transport

    Pieter Colpaert , Julián Rojas ,

    African Minds and International Development Research Centre

    Abstract - Public transport has been a poster-child of the open data movement with a variety of route planning applications used by millions of people every day. Transport data can also be used to analyse policy and advocate for service improvements. Tensions exist between centralised route planning services and distributed, open data-driven approaches to transport data. Only a fraction of the data used to drive mobility apps is truly open, and current technical architectures risk holding back a next wave of innovation. Data-driven transport tools have been developed worldwide; however, established standards need to be more flexible in order to accommodate semi-structured and informal transport networks in the developing world. The future success of “Mobility as a Service” will depend on a much greater range of open transport data and application programming interfaces (APIs) read less

    Abstract - read more

  • 2018

    Decentralized Open Data Publishing for The Public Transport Route Planning Ecosystem

    Julián Rojas , Bert Marcelis , Eveline Vlassenroot , Pieter Colpaert , Mathias Van Compernolle , Ruben Verborgh ,

    3rd Open Data Research Symposium

    Abstract - Open data became a fundamental requirement for the route planning application ecosystem in the public transport sector. The way this data is published has a direct influence in the architectural design of route planning applications, which today tends towards centralized solutions (e.g. Google Maps, CityMapper). Linked Connections emerged as a more scalable and cost-efficient decentralized alternative for public transport open data publishing, compared to centralized approaches . However, how this impacts the different actors that belong to the route planning ecosystem is still unknown. In this work, we study and discuss the potential impact of Linked Connections through a set of evaluations that measure the technical and user-perceived performance of route planning applications that run on the client-side, compared to a traditional approach performing the route planning logic on the server-side. Results showed that (i) for some use cases, a Linked Connections based approach outperforms centralized approaches but it heavily depends on the underlying hardware. (ii) More than half of the travellers that participated on the tests preferred the Linked Connections client application due to additional features such as offline querying and privacy safeguarding regardless of the slower performance in some cases. This work provided insights and an initial assessment of the potential effects of implementing a decentralized open data publishing strategy on the public transport route planning ecosystem. The potential benefits of such an approach are aligned with the ideals of open data of fostering innovation, boosting economic growth and providing solutions for more specific necessities. read less

    Abstract - read more

  • 2018

    Supporting sustainable publishing and consuming of live Linked Time Series Streams

    Julián Rojas , Gayane Sedrakyan , Pieter Colpaert , Miel Vander Sande , Ruben Verborgh ,

    ESWC 2018 - Poster and Demos

    Abstract - The road to publishing public streaming data on the Web is paved with trade-offs that determine its viability. The cost of unrestricted query answering on top of data streams, may not be affordable for all data publishers. Therefore, public streams need to be funded in a sustainable fashion to remain online. In this paper we present an overview of possible query answering features for live time series in the form of multidimensional interfaces. For example, from a live parking availability data stream, pre-calculated time constrained statistical indicators or geographically classified data can be provided to clients on demand. Furthermore, we demonstrate the initial developments of a Linked Time Series server that supports such features through an extensible modular architecture. Benchmarking the costs associated to each of these features allows to weigh the trade-offs inherent to publishing live time series and establishes the foundations to create a decentralized and sustainable ecosystem for live data streams on the Web. read less

    Abstract - read more

  • 2018

    A Preliminary Open Data Publishing Strategy for Live Data in Flanders

    Julián Rojas , Brecht Van de Vyvere , Arne Gevaert , Ruben Taelman , Pieter Colpaert , Ruben Verborgh ,

    The Web Conference 2018 - Web Stream Processing Workshop

    Abstract - For smart decision making, user agents need live and historic access to open data from sensors installed in the public domain. In contrast to a closed environment, for Open Data and federated query processing algorithms, the data publisher cannot anticipate in advance on specific questions, nor can it deal with a bad cost-efficiency of the server interface when data consumers increase. When publishing observations from sensors, different fragmentation strategies can be thought of depending on how the historic data needs to be queried. Furthermore, both publish/subscribe and polling strategies exist to publish live updates. Each of these strategies come with their own trade-offs regarding cost-efficiency of the server-interface, user-perceived performance and cpu use. A polling strategy where multiple observations are published in a paged collection was tested in a proof of concept for parking spaces availability. In order to understand the different resource trade-offs presented by publish/subscribe and polling publication strategies, we devised an experiment on two machines, for a scalability test. The preliminary results were inconclusive and suggest more large scale tests are needed in order to see a trend. While the large-scale tests will be performed in future work, the proof of concept helped to identify the technical Open Data principles for the 13 biggest cities in Flanders. read less

    Abstract - read more

  • 2017

    Providing Reliable Access to Real-Time and Historic Public Transport Data Using Linked Connections

    Julián Rojas , David Chaves Fraga , Pieter Colpaert , Ruben Verborgh , Erik Mannens ,

    ISWC 2017 - Posters and Demos

    Abstract - Using Linked Data based approaches, public transport companies are able to share their time tables and its updates in an affordable way while allowing user agents to perform multimodal route planning algorithms. Providing time table updates, usually published as data streams, means that data is being constantly modified and if there is a large analytical query, its response might be affected due to the changing data. In this demo we introduce a mechanism to tackle this problem by guaranteeing that a user agent will always receive version based responses, therefore ensuring data consistency. Such mechanism also enables access to historical data that could be used for deep analysis of transport systems. However, how this data shall be archived, in order to keep this ap- proach scalable and inexpensive is still a matter of study. In a demonstrator, we published and query data from the Belgium national train system (SNCB) and Madrid Regional Transport Consortium (CRTM). This paper represents the first step towards establishing an affordable framework to publish reliable transport data. read less

    Abstract - read more

  • 2017

    The tripscore Linked Data client: calculating specific summaries over large time series

    David Chaves Fraga , Julián Rojas , Pieter-Jan Vandenberghe , Pieter Colpaert , Oscar Corcho ,

    ISWC 2017 - Decentralizing the Semantic Web Workshop

    Abstract - Time series – such as public transport time schedules and their actual departure times – may deliver insights about the public transport network to third parties. Today, however, public transport data is published in a way in which analytical processing is too expensive. In previous work, the Linked Connections(LC) framework was introduced as a cost-efficient publishing alternative to the de-facto GTFS standard and route planning APIs. We study whether this server interface can also be used by Linked Data agents to solve analytical queries over longer periods of time. In this work, we created a serverless Linked Data client in Javascript for the analysis of time series on top of public transport data sources, called tripscore.eu. In this example, it calculates the quality of experience for your journey for the last 5 weeks using the public transport agencies it can discover. We have made the code to this proof of concept available as open source in different reusable components. As the user-perceived performance is quite slow, we formulate opportunities to achieve better response times. We could, on the one hand, suggest the data publisher to publish summaries over longer periods of time. On the other hand, we could also, as reusers, create a private summary of the data on our server and expose this to our user agents. Still an open issue is how this client would discover new public transport agencies reliably, for which we started working on a metadata profile for transport datasets. read less

    Abstract - read more

Projects

Current and past projects I have been involved in.

Data Models LDES Marine Data

MareGraph

Jan 2023 - ongoing

Funded by Digital Europe Programme

MareGraph will build a data architecture for cost-efficient, flexible and sustainable data sharing for a Marine Knowledge Graph. In this project we collaborate with the Flemish Marine Institute (VLIZ), Digital Flanders and the Italian National Research Council (CNR). read less

read more

Serendipity Recommender Systems Web Querying

Serendipity Engine

Oct 2022 - ongoing

Funded by SBO

We increasingly rely on algorithmically generated recommendations to navigate in both online and offline contexts: listening to music on streaming platforms, reading news online, or following recommendations about activities and events in your favorite city. These recommender systems help us dealing with the abundance of available information, but at the same time raise questions about their impact on individual citizens and society. Many advocate for designs for serendipity in recommenders, but what does this mean in practice? While serendipity is generally understood as a beneficial design principle ought to deliver societal value, putting it into practice still presents major challenges. The Serendipity Engine project sets out to address these challenges and support societal stakeholders in designing recommender systems to foster serendipity in public contexts. read less

read more

LDES Data Space Logistics

SYTADEL

Apr 2022 - ongoing

Funded by cSBO

Companies are certainly not immune to the increasing demand for sustainable logistics, and yet they almost automatically turn to road transportation when planning their supply chain. Trains and inland waterways are disregarded, because they require much more organization - and besides: there's fear of the unknown. Fortunately, technology that will allow companies to plan their transportation more consciously and sustainably – without compromising on efficiency – is gradually becoming more available. The trick is to break through the silos of various transportation systems and combine their data into one high-quality planning system. That is the aim of SYTADEL. read less

read more

Data Models Mobility LDES

GreenMov

Sep 2021 - Sep 2023

Funded by Connecting Europe Facility

We're not nearly exploiting all the opportunities open data have to offer. For example, wouldn't it be convenient if you could check before your departure if there will be a shared bike available at your last stop? This kind of application is accomplished by complex models that combine several sources of data. Like the up-to-date train traffic information with a prediction of the number of shared bikes at the station – based on realtime and historic data, and personal preferences. A significant obstacle soon pops up: individual data sources are often developed from a specific viewpoint, which means they can't be used for multiple applications and processes. We need to better describe these data sources and make them more accessible. Only then can we find a user-friendly way to integrate them into different applications. Within the GreenMov project, we realize this through concrete examples. OSLO provides a semantic description of the exact meaning of a dataset. We demonstrate that the OSLO schematics are compatible with others. And Linked Data Event Streams (LDES) and NGSI-LD are two variants of Linked Data that GreenMov brings together. read less

read more

Data Models LDES Data Space

SEMIC

Jan 2021 - ongoing

Funded by Interoperable Europe

SEMIC sets up pilots to showcase the value of new approaches and ecosystems (e.g. Linked Data Event Streams, SOLID and Wikibase/Wikidata), which can be leveraged across public administrations to scale up their interoperability maturity (e.g. catalogue of services, base registries). Pilots usually involve participants from several Member States and sector-specific DGs co-creating solutions with SEMIC's support. read less

read more

Route Planning Railways Knowledge Graph

ERA-KG

Oct 2020 - ongoing

Funded by European Agency for Railways

Bilateral collaboration with the European Agency for Railways (ERA) to create and evolve a semantic interoperability layer that integrates the different base registries maintained by the agency and support innovative services such as the Route Compatibility Check and Infrastructure Change Alert use cases. read less

read more

Route Planning Geocoding TREE

BAS-X

Jan 2020 - Jan 2021

Funded by Victoris

BAS-X aims to allow as many people (45+) as possible to spend a basic number of active hours per week outdoors through the 3 most popular activities: running, cycling and walking. It offers you a suitable route at the starting point of your choice. Whether you’re a cyclist, walker or runner, whether you’re in excellent shape or getting started at improving your physical condition… at any location. There’s no need to install an app or buy an extra device (GPS, wearable…). BAS-X will also “nudge” you in order to convince you to go outside and follow a route on a regular basis. read less

read more

Route Planning Water Ways GIS

SmartWaterWay

Dec 2019 - Feb 2022

Funded by imec.icon

SmartWaterWay will offer a cost-efficient manner to automate small vessels used for urban logistics in cities with a dense network of small waterways, by distributing the intelligence between low-cost onboard sensors and onshore infrastructure. To drive the shift from roadway to waterway-based logistics, the SmartWaterWay project will make autonomous pallet shuttle barges cost effective in urban settings by combining low-cost onboard sensors with onshore sensors equipped at bridges, locks and bays. read less

read more

Linked Data Data Models Mobility

Velopark

Dec 2018 - Mar 2021

Funded by Smart Mobility Belgium

Velopark is a digital platform that makes information about bike parks in Belgium accessible to all. All useful user information is gathered at one central location. This makes bicycle parking data immediately accessible as open data for websites (public transport, trains, towns and municipalities), apps and online applications (digital maps, journey planners, etc.). The information is always up to date, any time, anywhere. The Velopark project is an initiative of Fietsberaad Vlaanderen, in partnership with imec, Nazka and MORE LION. The project is supported by the federal Belgian government through the Smart Mobility Belgium fund. read less

read more

Linked Data Transport Route Planning

Open Planner

Jul 2018 - Jun 2019

Funded by Smart Mobility Belgium

Countless projects have tried to create a perfect intermodal route planning experience, and all of them have been investing over and over again in integrating the same data. When the funding from these projects disappears, also the data integration disappeared. The Open Planner Team wants to empower anyone to build any kind of route planner, without having to invest in data integration. Your route planner should work out of the box. Therefore, we are first and foremost working on the Open Data itself, by helping data owners publish their data. By building open source tools that can consume these open data sources automatically, we hope to build a level playing field for Mobility as a Service players. read less

read more

Linked Data Transport Data Models

OASIS

Oct 2016 - Sep 2018

Funded by INEA

Together, Ghent and the region of Madrid have initiated and innovative action that will increase the accessibility of public services and public transport. To do this, they will collaborate to publish linked open data. Both cities are experienced publishers of open data, and together they will prove that new technologies (such as the Semantic Web) can lead to economies of scale, such as the creation of cross-country applications. The project called OASIS (Open Applications for Semantically Interoperable Services) has received funding from the European Commission (INEA) and is a partnership between the Universities of Ghent and Madrid (imec and UPM), the city of Ghent, the Consorcio Regional de Transported de Madrid, supported by the Flemish ICT organization (V-ICT-OR) and coordinated by Open Knowledge Belgium. read less

read more

Linked Data Benchmarking RDF Streams

HOBBIT

Dec 2015 - Dec 2018

Funded by HORIZON 2020

HOBBIT aims at abolishing the barriers in the adoption and deployment of Big Linked Data by European companies, by means of open benchmarking reports that allow them to assess the fitness of existing solutions for their purposes. These benchmarks are based on data that reflects reality and measures industry-relevant Key Performance Indicators (KPIs) with comparable results using standardized hardware. read less

read more

Work Experience

Places and organizations I have worked for.

Education

My academic titles.