THOR’s last hurrah

Project THOR is coming to a close. Our final event was held in Italy on 15 November at ‘La Sapienza’ – the University of Rome – just a stone’s throw from Michelangelo’s impressive sculpture of Moses in the church of San Pietro in Vincoli. The day combined a retrospective review of THOR’s achievements and impact with a forward-looking perspective on the wider persistent identifier (PID) landscape.

With help and advice from CINECA’s Paola Gargiulo the event attracted an audience of Italian researchers from a wide range of disciplines, to join delegates from THOR partner organisations. Adam Farquhar of the British Library followed Paola’s welcome with an overview of THOR’s mission to embed PIDs within the heart of scholarly communications.

Key notes

Herbert van de Sompel, one of our two keynote speakers and Digital Library Research & Prototyping Team lead at Los Alamos National University, presented important research on maintaining the integrity of hyperlinks to managed collections such as journal repositories. ‘Link rot’ – broken links – and ‘content drift’ – where a link works but takes you to something other than the original content – are both problems, albeit less so than for the web at large. Solutions suggested for the scholarly web included using metadata that describes the relationships between links using resources such as signposting.org.

Photo 1

Photo: Herbert van de Sompel giving keynote on achieving link integrity

Our second keynote talk was given by Fiona Murphy, an independent research data and publishing consultant and excellent THOR Ambassador. She has been considering the question of what scholarship would look like had it been digital from inception, with PIDs integral to the process. The Matrix of the Commons – found at www.scholarlycommons.org – is the result of work on how to get us from here to there, via a guiding set of decision trees on how to treat a range of digital entities appropriately and consistently. Important take home messages were that we should all assume 1) that our tools will be used in conjunction with other systems and 2) that they should be built for drivers, not mechanics.

THOR round-up

Between keynote talks, THOR partners presented highlights from the project deliverables over the course of two sessions. In the first, Maaike Duine (ORCID) summarised THOR’s communications activities and Ambassador programme. Elizabeth Hull presented DRYAD’s perspective on linking data and publications. Martin Fenner (DataCite) looked at claiming workflows to ORCID. Robin Dasler spoke both about analysis preservation at CERN and THOR’s analyses of PID service adoption. Robert Petryszak (EMBL-EBI) talked on how to realise the full potential of PIDs in the biosciences, and the session closed with a demo of dynamic data identification in the earth and environmental sciences from Markus Stocker (PANGAEA).

Pic 2

Image: PID service adoption study

After lunch, Tom Demeranville (ORCID) gave updates on developing tools for federated identity management and better use of organisation identifiers. Martin Fenner returned to share outcomes from a joint THOR-OpenAIRE workshop on article-data linking. Angela Dappert (British Library) considered the challenges of embedding persistent identifier services within the humanities. Sünje Dallmeier-Tiessen (CERN) fed back from a high-energy physics community workshop. Lastly, Maaike Duine reported on insights gathered from a focus group meeting to envision the ‘ideal PID world’ for publishing workflows.

Points for discussion

It wouldn’t have been a THOR event without the opportunity for everyone to participate, and the mid-afternoon panel session provided a dedicated forum. Panellists Hannah Hope (Wellcome Trust), Clifford Tatum (CWTS), Andres Mori (Digital Sciences), Erika Bilicsi (Library and Information Centre of the Hungarian Academy of Sciences) and chair Adam Farquhar addressed comments and questions from the room.

Photo 3

Photo: Discussion panel on PID use in different communities

Discussion points included the relative underuse of funder IDs, identifying a current difficulty in capturing full information for works involving multiple funders as a possible stumbling block. A question on how best to approach differences of opinion, e.g. between co-authors faced with article- or data-retraction, provided useful food for thought. Changing our general view was suggested as a potential way forward given that not all retractions are equal and the reasons may well be mundane, rather than issues of capability. Also covered were the concepts of PIDs as a compromise between the readability needs of human and machine, and of the importance of working out which features of an object are the most critical to identify.

Important take-homes from the day were that PID space is developing at an active and encouraging pace and that we should stay alert to idea that, in terms of policy, one size may not fit all.

Finally, Simon Lambert (STFC) looked ahead to the FREYA project, which will follow on from THOR, starting on 1 December 2017. One of its key strands is to achieve long-term sustainability for the outstanding progress and development that began with ODIN – THOR’s predecessor – and advanced significantly during the 30 months of THOR.

Want to know more?

Presentation slides from the day are available for download: