Harvesting

Warning

Version 1 of the OAI-PMH repository is DEPRECIATED. Please refer to the warning on the home page.

The openedition repository follows the version 2.0 of the OAI-PMH protocol available on the Open Archives Initiative website: http://www.openarchives.org/OAI/openarchivesprotocol.html

Selective Harvesting

The repository allows selective harvesting, by set and by date.

Selective harvesting by set

More info about available sets: Sets description

Selective harvesting by date

The repository allows harvesting by date, i.e. harvesting of records added or updated before or after a specified date

The parameters to use are from and until. Allowed date formats are dd-mm-yyyy and dd-mm-yyyyThh:mm:ssZ.

Example

https://oai.openedition.org/?verb=ListRecords&metadataPrefix=oai_dc&from=2017-03-13T16:47:48Z will retrieve a list of records added or updated since the 13 march 2017 at 4.47 pm.

resumptionToken

The repository uses the resumptionToken system. Therefore, it is not possible to retrieve all documents with a single ListRecords, ListIdentifiers or ListSets request.

For instance, for retrieving the metadata of all documents from the journal Revista Crítica de Ciências Sociais, you will use the query:

https://oai.openedition.org/?verb=ListRecords&set=journals:rccs&metadataPrefix=oai_dc

The repository will return a list of the first 10 documents + a resumptionToken element at the end of the response.

 1<?xml version="1.0" encoding="UTF-8"?>
 2<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
 3  <responseDate>2018-10-12T07:11:17Z</responseDate>
 4  <request verb="ListRecords" metadataPrefix="oai_dc" set="journals:rccs">https://oai.openedition.org/</request>
 5  <ListRecords xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:dc="http://purl.org/dc/elements/1.1/">
 6    <record>
 7      [...]
 8    </record>
 9    <record>
10      [...]
11    </record>
12    [...]
13
14    <resumptionToken completeListSize="830">metadataPrefix%3Doai_dc%26set%3Djournals%253Arccs%26until%3D2018-10-12T07%253A11%253A17Z%26cursor%3D10</resumptionToken>
15  </ListRecords>
16</OAI-PMH>

For retrieving the next 10 documents, you will pass the content of the resumptionToken element as an argument of a new URL request:

https://oai.openedition.org/?verb=ListRecords&resumptionToken=metadataPrefix%3Doai_dc%26set%3Djournals%253Arccs%26until%3D2018-10-12T07%253A11%253A17Z%26cursor%3D10

and so on.

The OAI-PMH documentation available at http://www.openarchives.org/OAI/openarchivesprotocol.html gives a more detailed insight of the resumptionToken parameter.

Deleted Records

The repository does not provide information about deleted records.