Add submittedDate and lastUpdatedDate to the Wiki description of arXiv API #180

@sedimentation-fault

Please add the following to

https://github.com/ContentMine/getpapers/wiki/arxiv-query-format

Dates and ranges of dates

The terms submittedDate (date/time of first submission) and lastUpdatedDate (date/time of last revision) are particularly useful when a search returns many results: you can run several searches, each limited to a window of time, and combine them to get the full results.

The date/time information is of the form YYYYMMDDHHMMSS, for example 20071018122534 for 2007-10-18 12:25:34. You can use * as a wildcard for the time part. For example, to get all manuscripts with initial submission on 2007-10-18:

getpapers --api 'arxiv' --query "cat:math.AG AND submittedDate:20071018*"

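But you can't use the wildcard within the date part itself. For example, a query like the following, which cuts the date off after the year, does not work: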

getpapers --api 'arxiv' --query "cat:math.AG AND submittedDate:2007*"
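The YYYYMMDDHHMMSS form described above can be produced from a human-readable timestamp by simply dropping every non-digit character. The following is a minimal sketch in POSIX shell; the helper names `to_arxiv_stamp` and `day_term` are made up for illustration and are not part of getpapers or the arXiv API.

```shell
# Hypothetical helper (not part of getpapers): turn "YYYY-MM-DD HH:MM:SS"
# into the 14-digit YYYYMMDDHHMMSS form by deleting every non-digit.
to_arxiv_stamp() {
  printf '%s' "$1" | tr -cd '0-9'
}

# Hypothetical helper: build a whole-day search term, i.e. the full
# 8-digit date followed by a wildcard for the time part.
#   $1 = field name (submittedDate or lastUpdatedDate)
#   $2 = date written as YYYY-MM-DD
day_term() {
  printf '%s:%s*' "$1" "$(printf '%s' "$2" | tr -cd '0-9')"
}

to_arxiv_stamp "2007-10-18 12:25:34"; echo   # prints 20071018122534
day_term submittedDate 2007-10-18; echo      # prints submittedDate:20071018*
```

The second helper always emits the complete date before the wildcard, which keeps the wildcard in the time part only.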

To get all manuscripts that were updated in a date range, use [from_date TO to_date]. (If you give a partial date, it's treated as the earliest date/time that matches, and the range appears to be up to but not including the second date/time.)

getpapers --api 'arxiv' --query "cat:math.AG AND lastUpdatedDate:[20170101* TO 20190827*]"
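To split a large search into several time windows, as suggested at the top of this section, the range queries can be generated in a loop. This is only a sketch: `yearly_queries` is a hypothetical helper, the one-year window size is an arbitrary choice, and it relies on the observation above that the end of a range appears to be exclusive, so that adjacent windows do not overlap.

```shell
# Hypothetical helper: print one query string per calendar year.
#   $1 = base query, $2 = first year, $3 = last year (inclusive)
# Each window runs from 1 January of one year to 1 January of the next,
# written in the same [from* TO to*] form as the example above.
yearly_queries() {
  base=$1
  year=$2
  last=$3
  while [ "$year" -le "$last" ]; do
    next=$((year + 1))
    printf '%s AND lastUpdatedDate:[%s0101* TO %s0101*]\n' "$base" "$year" "$next"
    year=$next
  done
}

yearly_queries "cat:math.AG" 2017 2018
# prints:
# cat:math.AG AND lastUpdatedDate:[20170101* TO 20180101*]
# cat:math.AG AND lastUpdatedDate:[20180101* TO 20190101*]
```

Each printed line can then be passed as the --query argument of a separate getpapers run, and the results combined afterwards.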

NOTE: To update a local collection, use lastUpdatedDate with a date range as above. submittedDate will not catch all updates, whereas lastUpdatedDate catches new submissions as well as revisions.
(adapted from: https://cran.r-project.org/web/packages/aRxiv/vignettes/aRxiv.html)
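For the update case described in the note, one possible approach is to remember the timestamp of the previous run and query from that point up to the current time. The sketch below only builds the query string. `update_query` is a hypothetical helper, the timestamps are example values, and the use of UTC for "now" is an assumption about how arXiv interprets these times.

```shell
# Hypothetical helper: build a query for everything submitted or revised
# between a saved timestamp and now.
#   $1 = base query
#   $2 = timestamp of the previous run (YYYYMMDDHHMMSS)
#   $3 = optional end timestamp; defaults to the current time
update_query() {
  base=$1
  since=$2
  # Assumption: arXiv date/times are in UTC (GMT), hence "date -u".
  now=${3:-$(date -u +%Y%m%d%H%M%S)}
  printf '%s AND lastUpdatedDate:[%s TO %s]\n' "$base" "$since" "$now"
}

update_query "cat:math.AG" 20190827000000 20190901000000
# prints: cat:math.AG AND lastUpdatedDate:[20190827000000 TO 20190901000000]
```

After a successful run, the end timestamp would be stored and used as the start of the next update.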
