Different amount of hits between API and search in Mediestream #18

Open
maxodsbjerg opened this issue Jun 10, 2021 · 0 comments
Labels
question Further information is requested

Comments

@maxodsbjerg
Contributor

I am using the following query to load data for a text mining task:
støj* AND lplace:København AND py:[1850 TO 1881]

This produces the following request URL: http://labs.statsbiblioteket.dk/labsapi/api/aviser/export/fields?query=st%C3%B8j%2A%20AND%20lplace%3AK%C3%B8benhavn%20AND%20py%3A%5B1850%20TO%201881%5D&fields=link&fields=recordID&fields=timestamp&fields=pwa&fields=cer&fields=fulltext_org&fields=pageUUID&fields=editionUUID&fields=titleUUID&fields=editionId&fields=familyId&fields=newspaper_page&fields=newspaper_edition&fields=lplace&fields=location_name&fields=location_coordinates&max=3000&structure=header&structure=content&format=CSV
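
For anyone who wants to reproduce the request, here is a minimal sketch of how the same URL can be assembled from the plain query string. The endpoint and the parameter names (`query`, `fields`, `max`, `structure`, `format`) are taken from the URL above; the function name `build_export_url` and the constant `BASE_URL` are only illustrative and not part of the API.

```python
from urllib.parse import quote, urlencode

# Endpoint copied from the request URL in this issue.
BASE_URL = "http://labs.statsbiblioteket.dk/labsapi/api/aviser/export/fields"


def build_export_url(query, fields, max_records=3000):
    """Assemble an export URL with repeated `fields` and `structure` parameters.

    Hypothetical helper: it only mirrors the parameters visible in the
    URL quoted in this issue.
    """
    params = [("query", query)]
    params += [("fields", name) for name in fields]
    params += [
        ("max", str(max_records)),
        ("structure", "header"),
        ("structure", "content"),
        ("format", "CSV"),
    ]
    # quote (not quote_plus) encodes spaces as %20; safe="" also encodes
    # characters such as ':' and '[' in the query.
    return BASE_URL + "?" + urlencode(params, quote_via=quote, safe="")


if __name__ == "__main__":
    print(build_export_url(
        "støj* AND lplace:København AND py:[1850 TO 1881]",
        ["link", "recordID", "timestamp"],
    ))
```

Building the URL this way makes it easy to confirm that the query sent to the API is character for character the same as the one typed into Mediestream.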

The response is 1933 hits.

As an explanatory example for my audience, I would like to say that the API returns the same result as running the query in Mediestream, just presented in a different way. But here is the problem: it does not!

The exact same query in Mediestream results in 2184 hits. https://www2.statsbiblioteket.dk/mediestream/avis/search/støj*%20AND%20lplace%3AKøbenhavn%20AND%20py%3A%5B1850%20TO%201881%5D

Why is this?
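
To make the API-side number easier to verify, here is a minimal sketch for counting the records in the exported CSV. It assumes the export starts with a header row containing a `recordID` column (as requested by `structure=header` and `fields=recordID` in the URL); the function name `summarize_export` is illustrative. It uses a CSV parser rather than counting lines, because a field such as `fulltext_org` may contain line breaks inside quotes. This does not explain the difference from Mediestream; it only helps rule out a miscount or duplicated records on the API side.

```python
import csv
import io
from collections import Counter


def summarize_export(csv_text, id_column="recordID"):
    """Count rows and distinct IDs in an exported CSV string.

    Assumes the first row is a header that contains `id_column`.
    """
    # newline="" lets the csv module handle line endings itself,
    # including line breaks inside quoted fields.
    reader = csv.DictReader(io.StringIO(csv_text, newline=""))
    ids = [row[id_column] for row in reader]
    counts = Counter(ids)
    return {
        "rows": len(ids),
        "distinct_ids": len(counts),
        "duplicated_ids": sorted(i for i, n in counts.items() if n > 1),
    }


if __name__ == "__main__":
    # Made-up sample data, only to show the shape of the result.
    sample = 'recordID,fulltext_org\r\na1,"line one\nline two"\r\na2,text\r\n'
    print(summarize_export(sample))
```

If `rows` and `distinct_ids` both equal 1933, the export itself is consistent and the difference has to come from how the two systems count hits.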

@maxodsbjerg added the "question" label (Further information is requested) on Jun 10, 2021