Things you can do dumping your Invenio database into a flat file
Jorba, Ferran (Universitat Autònoma de Barcelona)

Date: 2017
Abstract: Invenio's database design and interfaces are optimized for fast end-user search and retrieval. As administrators, we can add indexes at will and use them via the web or the API. However, many maintenance tasks are not well covered by those indexes. For most of those cases, reading the records sequentially is the optimal solution. Yet if the database is large enough, reading them via the Invenio API may take hours, during which the system slows down and may become unresponsive. In this presentation I'll show a small Python tool that uses the Invenio API and a SQLite database as a cache to keep an up-to-date flat file with your bibliographic records. We'll see how, with this flat file, it is much faster and easier to do tasks such as generating specialised statistics, quality control, automatic record enrichment or cleaning, or even creating exotic indexes or counters.
Rights: This document is subject to a Creative Commons licence that permits any use of the work, including for commercial purposes, as well as the creation of derivative works, whose distribution is likewise permitted without restriction, as stipulated in the Creative Commons licence.
Language: English.
Document: conferenceObject ; research ; publishedVersion
Subject: Dipòsit Digital de Documents de la UAB ; DDD ; Invenio software
Published in: Invenio user group workshop. Garching, Germany, 2017

Alternative address: https://indico.cern.ch/event/557956/contributions/2486181/
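
The caching approach described in the abstract might be sketched roughly as follows. This is not the tool shown in the presentation, only a minimal illustration under stated assumptions: the table layout, the function names, and the tab-separated one-record-per-line file format are all hypothetical, and `fetch_changed` is a placeholder for whatever Invenio API call returns the records modified since a given timestamp.

```python
import sqlite3


def open_cache(path=":memory:"):
    """Open (or create) the SQLite cache. The schema is an assumption."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS records ("
        "recid INTEGER PRIMARY KEY, "
        "modified TEXT NOT NULL, "
        "body TEXT NOT NULL)"
    )
    return conn


def last_sync(conn):
    """Return the newest modification timestamp in the cache, or ''."""
    row = conn.execute("SELECT MAX(modified) FROM records").fetchone()
    return row[0] or ""


def sync(conn, fetch_changed):
    """Pull only the records changed since the last sync into the cache.

    fetch_changed(since) is a hypothetical callable standing in for the
    Invenio API; it must yield (recid, modified, body) tuples.
    """
    changed = list(fetch_changed(last_sync(conn)))
    with conn:  # commit all upserts in a single transaction
        conn.executemany(
            "INSERT OR REPLACE INTO records (recid, modified, body) "
            "VALUES (?, ?, ?)",
            changed,
        )
    return len(changed)


def dump_flat_file(conn, path):
    """Write the cache as a flat file: one 'recid<TAB>body' line per record."""
    count = 0
    query = "SELECT recid, body FROM records ORDER BY recid"
    with open(path, "w", encoding="utf-8") as out:
        for recid, body in conn.execute(query):
            one_line = body.replace("\n", " ")  # keep each record on one line
            out.write(f"{recid}\t{one_line}\n")
            count += 1
    return count
```

With such a cache, each run touches the live system only for records that changed, and maintenance scripts can then read the flat file sequentially instead of querying Invenio. The sketch compares timestamps as strings, which assumes a sortable format such as `YYYY-MM-DD HH:MM:SS`, and it does not handle deleted records.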


10 p, 113.2 KB

The record appears in these collections:
Contributions to meetings and congresses > Presentations

 Record created 2017-03-29, last modified 2018-10-21


