Slovenia 404 - On the e-Readiness of Modern Public Administration

Alois Paulin, 2010

International conference on Information Society and Information Technologies - ISIT, Dolenjske Toplice,

Abstract

This document reveals the shocking incompetence of modern state-administration when challenged to deal with unusual request for public data. The challenge was stated in form of a simple query that (under optimized conditions) could be accomplished within minutes, but instead hundreds of hours were wasted. This paper further describes methods and technologies that – if applied properly – would contribute to an agile, efficient and sustainable public administration.

Key words

public administration, e-government, linked data

Introduction - the request

Between the 11th and 12th June 2010 more than 350 Slovenian public agencies and authorities (ministries, municipalities, local administrative units etc.) received a request for public data with the demand to provide information about the name and surname, as well as the cumulative gross expense for each employed public servant or official from the addressed organization, for the year 2009. Additionally, a full work report for each person in the specified year, as well as the date the person started her affiliation with the addressee were requested.

The requests were conducted in an automated manner, using contact data from Slovenia’s central catalogue of public information (CKIJZ). To evaluate the answers, supervised data mining techniques were applied, which proved useful in the process of tracking the evolution of threads during negotiations with the addressees.

With the exception of the work reports (which the organizations in general were not able to provide as they have no evidence of their employees’ activities), all of the requested data is stored in structured form either within electronic databases of accounting systems, or in the central Slovenian financial database of the Ministry of finance, the MFeRAC system. Regarding this situation, to retrieve the requested data from the database would take a skilled professional in informatics not more than a few minutes of work. However, as this article reveals, the requests were handled in an inappropriate manner, resulting in huge overhead in cost for both the data-provider as well as the consumer of the data.

How to measure e-Government maturity?

When it comes to measuring the maturity of e-government implementations, many models are available (Howard 2001, Chandler and Emanuels 2002, Layne and Lee 2001, UN-DPEPA 2002) that generally range from the passive governmental info-page (“informational stage”) to the fully transactional one-stop-shop that allows a consumption of governmental services without interaction with public officials (“transactional stage”). Most of the models were developed and published at the fin-de-millénaire, at the advent of the “web 2.0”, when it became obvious that a sophisticated web presence can improve governmental public relations. However, as primarily researchers from sociological fields have developed the published models with only shallow insight into the bits and bytes of available technologies, the proposed visions lack a clear technical agenda on which they would base their assertions.

Almost a decade after the initial “e-Gov” hype, Coursey and Norris (2008) observed that the evolution on this field has slowed considerably or even came to a halt. Based on empirical data from US-wide surveys on e-government they conclude that although authorities have reached the informational stage and do publish downloadable data and documents, their services are “not highly interactive or transactional as the models predict” (ibid).

A common flaw in e-Gov theories is the absence of a systematic approach and the misleading focus on governmental “services” as the objects of transaction. Such unclear goals and definitions encourage the development of botched web portals that can be quickly turned into scores at diverse e-Gov surveys, but lack sustainability in both technical terms as well as regarding the content.

The responses - a failure of e-Government

According to several surveys, when it comes to e-readiness of the government, Slovenia is a top-developed country and constantly among the top five in Europe (Capgemini 2009; The Economist 2009). However, those formal statistics focus on parameters like the quantity of electronically accessible services, the provided legislation and other aspects of the façade, while they do not observe the continuity and quality of available services, nor the practical issues like the education of the supporting personnel.

Capgemini (2009) for example highly prices Slovenia for the one-stop-shop “e-uprava.gov.si”, which offers a handful of transactional services and some downloadable forms. However, most of the published content falls under the informational category with (often dead) links to other governmental websites.

The failure of CKIJZ

A paradigm for failure of the Slovene e-government is the already mentioned CKIJZ. As a centralized source of public information catalogues, the CKIJZ should conceptually be a most valuable database containing all existing public legal persons, which would allow both efficient management and supervision for the government, as well as provide good insight into the structure of the state for the public – if maintained properly. However, this database is obsolete and broken and according to the Ministry of public administration, neither is information regarding its completeness available, nor does the Ministry know when the database was last seen up-to-date.

When accessing CKIJZ in order to collect the contact data of all public organizations, it was observed that this database listed only 863 legal persons, including 165 out of 210 municipalities. From the listed organizations, 152 exposed dead links, 190 did not provide any contact information and 15 provided dead e-mail addresses.

Although the CKIJZ provides means to publish the public-information catalogue within the central system itself, only 122 public legal persons (14.6% of all listed) used this possibility. Among them are all administrative units (58), 9 out of 15 ministries and 23 municipalities.

The "passion" of the requests

A request to a public authority is basically a client-server transaction: The client creates the request using any querying language the addressee recognizes as valid (in Slovenia, this is typically Slovenian, encoded in the Roman alphabet) and sends it to any valid and recognized endpoint URI of the server (either post address, e-mail address, fax number). The server validates the request and if it passes the validation, it is processed, else the server rejects it by issuing a formal decision which must contain a “stack trace” including the description of the error to facilitate client-side “debugging”. This process is regulated by formal protocols (laws), which define both the protocol of communication as well as the “schema” for well-formed requests.

Figure 1: Modern public administration is a paradigm for highly inefficient management of transactions. Instead of providing APIs to the public so that the requesting subject himself would manage the workload of accessing information, subjects are forced to communicate via “human interfaces”, i.e. public officials.

The requests were sent as plain e-mail messages and were not digitally signed. According to Slovenian legislation, personal data must not be provided based on an unsigned request. However, less than 40 (of more than 350) addressed corporations demanded a proper identification.

From the received answers, less than 100 were sent in a machine-readable format like PDF (45) or MS Excel (33) and no dataset was sent in a plaintext-format like CSV. Most answers revealed a huge ignorance of modern ICT, as the provided datasets were first transformed into nice-looking tables, then printed, signed, scanned and sent via unsigned/-encrypted e-mails without reference to the UID (The Unique IDentifier is mandatory data for any sent e-mail via SMTP) of the requesting message.

This procedure, as illustrated in Figure 1, is not only exorbitantly time-consuming, but also contributes to a vast overhead in the size of exchanged communication: If measured in bytes, the exchanged communication needed to receive less than 70% of all requested data, would amount to between 500 MB – 1 GB, if taken into consideration also the data sent by post and exchanged communication via phone. The size of attachments sent via e-mail amounted to 219 MB, while the post had to deliver almost 10kg of mail. Name, surname and gross earning of one single public servant would amount to max. 100 bytes of data if stored in UTF-8. The Slovene public sector has approx. 160.000 employees, which would yield in dataset of totally 16MB. The received data encompassed approx. 20.000 names and if 2MB would suffice, then a measurable overhead of 24,900% has been created by the state.

Only sparse evidence is provided about the time that was needed by the addressed organizations to create the answer. The Employment Service of Slovenia, an employer of 998 public servants, estimated that it would take them at least 20 hours to collect the requested information (72 sec/employee), while an employer of 226 public servants from the health sector tried to charge 12 hours of work (191 sec/employee). This information leads to the conclusion, that somebody from the accounting department would click trough a list of all employees and copy-paste their data into a spreadsheet. However, a skilled professional would manage this task by applying an SQL-query over the database itself and resolve the puzzle within minutes.

How Linked Data can save e-Gov

Instead of focusing on the façade, governmental e-Gov infrastructure must provide a layered stack of interfaces for an indiscriminative access to core data. Any governmental service is basically a composition of many actions in which data is read or written according to a predefined procedure or protocol.

A popular transactional governmental service in Slovenia is an application, which allows renewing the car registration online – a service, which effectively triggers the change of a date at some database entry and - based on this change - initiates the process of issuing a new license. Another such service is an online-cadastre that interactively provides data about the owner of the searched land-parcel, as well as geo-spatial information about it. While it is good that such services come bundled with the appropriate governmental user interface, a much greater added value could be achieved if the government would allow users to browse the data trough machine-readable application programming interfaces (API), as in this case, commercial providers could generate creative new solutions and by this contribute to both a more competitive environment as well as a cheaper government.

The Linked Data initiative

Two key figures of the modern web sphere, the inventor of the core web standards, Tim Berners-Lee and the marketing-father of the “web 2.0”, Tim O’Reilly, are currently focusing on the rich and fruitful fields of yet unexploited governmental data. Berners-Lee calls his quest “Linked Data” and appeals to all citizens on the WWW to publish their structured data in a machine-consumable format – preferably RDF to be queried with SPARQL. Linked Data is a subset of the “Semantic Web” mission, which is considered by many as having failed (Bussler 2008).

Noteworthy projects and organizations followed this appeal, like the Linked Movie Database (Hassanzadeh and Consens 2009), the BBC and New York Times. Even the UK Government allowed Berners-Lee to set-up and to supervise a one-stop-shop for governmental data (http://data.gov.uk), but the published RDF triples are cumbersome to consume due to the lack of popular acceptance of the standard as well as the SPARQL query language; besides, RDF is frequently implemented the wrong way (Bergman 2009, Halpin and Hayes 2010). As Sheridan and Tennison (2009) observed, the vast majority of developers would prefer the data to be consumable through RESTful APIs like the ones that are provided by Google &co.

When it comes to the public’s demand for APIs, O’Reilly provides the proper ideology – his vision of the future is the government as a platform. By “platform” he means the iPhone way of providing access to the underlying infrastructure to allow independent developers to enable a KaDeWe experience for the consumer (O’Reilly 2010). During the US presidential elections, O’Reilly openly supported Obama, who promised the connected and transparent government (O’Reilly 2008).

As a part of the political bargain, the US web-community received a one-stop-shop for governmental data, which – contrary to the British version – focuses on APIs, although a wiki-based approach to transform the published datasets to RDF was added later (Bergman 2009). To fulfil his promise, Obama issued an order in which he demands from federal agencies to publish at least three of their high-value sets of statistics or other information in a downloadable format, however, the published material leaned towards the least controversial matters (Kirkpatrick 2009) and was partly ridiculous: the Transportation Department provided information about child-seat safety, while the Justice Department made available the population counts for wild horses and burros (O’Keefe 2010). As noticed by non-partisan organizations, the government “went for the low-hanging fruit for things that are already out there and not terribly controversial” (ibid.).

Linked Data as a gateway to core data

A major problem of the political part of the Linked Data initiative is the fact that the allegedly participating governments provide data that is of not much value. The technical part however, is capable of providing a solid foundation for a global interlinked network of structured data, if only the public authorities would willingly publish their core data in form of publicly accessible RDF streams.

Making a study as the one described in this article, could be made very easy, if every public organization would provide a live representation of its accounting database to allow public browsing of that data. Eventual privacy and legal concerns could be eliminated by a smart server, which would demand from every requester to identify himself using a digital certificate so that access could be granted or rejected based on the digital identity.

Conclusion

Governing a society by means of ICT is the natural way of evolution in the “information society”. The current stage of development in the technical field is sufficient to allow a truly transactional society in which public officials would not be needed any more for routing citizens’ requests and demands from one desk to another, but could take the role of friendly facilitators in bureaucratic processes or highly skilled technical advisors from the back office.

Ever since, bureaucracy was the driving force behind the evolution of ICT, as only a competitive advantage could keep the hegemony of the respective holders stable. Paradoxically, today we can observe – probably the first time in human history – a bureaucratic colossus that desperately tries to conquer the flood of data and communications by increasing the numbers of physically working bureaucrats instead of educating them in new skills. What had been more than 10,000 years ago the Mesopotamian clay tokens, are today languages for the communication with computers, like C#, SQL and XSL.

As “internet” generations are emerging, new governmental paradigms will need to emerge, but until then bureaucratic thinking and ways of processing data will still follow the path as illustrated in Figure 1, in which a request traverses multiple joints until the needed data is collected and again multiple joints until a response is returned.

References