AIP - Copyright statement only in PDF #85

cameronneylon · 2014-04-16T08:55:18Z

Noting for the future. There is an OA tag on the landing page but nothing that gives license information until you hit the PDF itself.

Something for further down the track.

An example: http://scitation.aip.org/content/aip/journal/jap/114/5/10.1063/1.4817422

emanuil-tolev · 2014-04-16T09:26:16Z

Another note for the future:
URL to pdf: http://scitation.aip.org/deliver/fulltext/aip/journal/jap/114/5/1.4817422.pdf?itemId=/content/aip/journal/jap/114/5/10.1063/1.4817422&mimeType=pdf&containerItemId=content/aip/journal/jap

Nothing out of the ordinary, can be generated by a plugin specific to AIP.

Downloading the whole PDF could be problematic - we do have a lot of bandwidth now, but memory consumption could also be a problem. Still, it should supposedly work. if the license string is present in there, but it will be very brittle. For the size, we could chunk up incoming files (regardless of whether they're PDF-s or not) and run all the needed comparisons on the chunks (e.g. of 1 MB). Then if nothing found, next chunk, and so on.

cameronneylon · 2014-04-17T14:38:43Z

MDPI is another publisher that does this: http://www.mdpi.com/2071-1050/5/7/3095 and the relevant pdf is: http://www.mdpi.com/2071-1050/5/7/3095/pdf

emanuil-tolev · 2015-01-06T16:04:05Z

Note for future readers here: we don't download PDF-s anymore, so in order to eventually support statements in PDFs this would have to change. We use the robus python-magic library to check the file header, so it's pretty unlikely a PDF will slip by.

cameronneylon added enhancement labels Apr 16, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AIP - Copyright statement only in PDF #85

AIP - Copyright statement only in PDF #85

cameronneylon commented Apr 16, 2014

emanuil-tolev commented Apr 16, 2014

cameronneylon commented Apr 17, 2014

emanuil-tolev commented Jan 6, 2015

AIP - Copyright statement only in PDF #85

AIP - Copyright statement only in PDF #85

Comments

cameronneylon commented Apr 16, 2014

emanuil-tolev commented Apr 16, 2014

cameronneylon commented Apr 17, 2014

emanuil-tolev commented Jan 6, 2015