The default export is XML-based and follows the Dublin Core format. The Dublin Core data is wrapped in an OAI-PMH envelope. Every XML file corresponds to one article. The fields are described below; the following sections cover the most important tags.
This element contains a generated unique identifier (<identifier>) as well as a datestamp (<datestamp>) recording when the data was exported. It is not recommended to work with this identifier. Instead, use the value in <dc:identifier>.
This is a unique and persistent identifier using ARK. The BnL is in the process of transitioning to ARK, which is why PID-based identifiers are still provided in other fields.
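As a minimal sketch of how the two identifiers could be read from one export file, the following uses Python's standard library. The namespace URIs are the ones defined by the OAI-PMH and Dublin Core specifications. The sample record, its <record>/<header>/<metadata> nesting, and all identifier values are assumptions made for illustration; compare them with an actual export file before relying on them.

```python
import xml.etree.ElementTree as ET

# Namespace URIs as defined by the OAI-PMH and Dublin Core specifications.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical, heavily shortened record. The nesting and all values are
# placeholders, not taken from a real export.
SAMPLE = """<record xmlns="http://www.openarchives.org/OAI/2.0/">
  <header>
    <identifier>generated-export-id</identifier>
    <datestamp>2020-01-01T00:00:00Z</datestamp>
  </header>
  <metadata>
    <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
               xmlns:dc="http://purl.org/dc/elements/1.1/">
      <dc:identifier>ark-placeholder</dc:identifier>
    </oai_dc:dc>
  </metadata>
</record>"""


def read_identifiers(xml_text):
    """Return the envelope identifier, the datestamp and <dc:identifier>."""
    root = ET.fromstring(xml_text)
    return {
        # Generated identifier from the envelope; not recommended for use.
        "header_identifier": root.findtext(
            ".//oai:header/oai:identifier", namespaces=NS
        ),
        "datestamp": root.findtext(".//oai:header/oai:datestamp", namespaces=NS),
        # Persistent identifier; the value to work with.
        "persistent_identifier": root.findtext(".//dc:identifier", namespaces=NS),
    }


ids = read_identifiers(SAMPLE)
print(ids["persistent_identifier"])
```

Keying downstream data on the <dc:identifier> value rather than on the envelope identifier follows the recommendation above.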
Describes the source of the document. For example
means that this article comes from the newspaper “luxwort” (ID for Luxemburger Wort) issued on 15.12.1848.
The complete title of the source document, e.g. “Luxemburger Wort”.
Another generated string that uniquely identifies the exported resource.
The publishing date of the document, e.g. “1848-12-15”.
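Since the example date is in ISO 8601 form (YYYY-MM-DD), it can be turned into a date object with the standard library. This is only a sketch: it assumes every record uses this exact form, and the function name parse_publishing_date is hypothetical.

```python
from datetime import date


def parse_publishing_date(value):
    """Parse a YYYY-MM-DD publishing date; raises ValueError otherwise."""
    return date.fromisoformat(value.strip())


issued = parse_publishing_date("1848-12-15")
# Reformat as day.month.year, the notation used in the source example above.
print(issued.strftime("%d.%m.%Y"))  # prints 15.12.1848
```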
The publisher of the document, e.g. “Verl. der St-Paulus-Druckerei”.
The unique identifier of the parent document (e.g. newspaper issue), also referred to as PID.
The link to the BnLViewer on eluxemburgensia.lu to view the resource online.
The main title of the article, section, advertisement, etc.
The full text of the entire article, section, advertisement, etc., including any titles and subtitles. The content does not contain layout information, such as headings, paragraphs or lines.
The type of the exported data, e.g. ARTICLE, SECTION, ADVERTISEMENT, …
The detected language of the text.
The number of words in the <dc:description> field.
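To sanity-check the word count against the text, one could recount the words in <dc:description>. The sketch below splits on whitespace, which is an assumption: the tokenisation rule used by the export is not stated here, so small differences from the stored value are possible. The sample XML and the function name are hypothetical.

```python
import xml.etree.ElementTree as ET

# Dublin Core element namespace, as defined by the Dublin Core specification.
DC = {"dc": "http://purl.org/dc/elements/1.1/"}

# Hypothetical fragment; a real export wraps this in the OAI-PMH envelope.
SAMPLE = """<metadata xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:description>Ein kurzer Beispieltext mit genau sieben Worten</dc:description>
</metadata>"""


def count_description_words(xml_text):
    """Count whitespace-separated tokens in the first <dc:description>."""
    root = ET.fromstring(xml_text)
    text = root.findtext(".//dc:description", default="", namespaces=DC)
    return len(text.split())


print(count_description_words(SAMPLE))  # prints 7
```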