parsePreprints #3

Open
bluetyson opened this issue Aug 5, 2020 · 2 comments

Comments

@bluetyson
Contributor

I got a 'links' error when attempting to download all preprints, at around the 102nd paper (on more than one paper, I think), starting from the default range of 2017-01-01.

For now I've just added this to bypass it:

	# The Relationships data has links to more information.
	# Use our helper function to extract those links and put them in our preprint object.
	try:
		preprint.parseRelData( rel )
	except Exception as prd:
		# Report which preprint failed and why, then skip it
		print("error: on preprint", len(preprints)+1, prd)
		return

I could open a pull request along those lines. I haven't added an error log yet to record which entries fail.
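One way the missing error log could look is sketched below. This is only an illustration, not the project's code: `parse_rel_data_safely`, the `failures` list, and `_StubPreprint` are hypothetical names, and the stub just imitates a record with no 'links' key. Only `parseRelData` comes from the snippet above.

```python
import logging

logger = logging.getLogger("parsePreprints")


def parse_rel_data_safely(preprint, rel, index, failures):
    """Call preprint.parseRelData(rel); on failure, record the entry and return False."""
    try:
        preprint.parseRelData(rel)
    except Exception as exc:
        # Keep a record of which entry failed so it can be inspected later
        failures.append({"index": index, "error": repr(exc)})
        logger.warning("skipping preprint %d: %r", index, exc)
        return False
    return True


class _StubPreprint:
    """Hypothetical stand-in for the real preprint object (assumption)."""

    def parseRelData(self, rel):
        # Raises KeyError when the record has no 'links' entry
        self.links = rel["links"]


failures = []
records = [{"links": {"self": "a"}}, {}, {"links": {"self": "c"}}]
parsed = [
    parse_rel_data_safely(_StubPreprint(), rel, i + 1, failures)
    for i, rel in enumerate(records)
]
```

After a run, `failures` lists the 1-based position and the exception for each skipped entry, which could be written to a file for later review.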

@narock
Contributor

narock commented Aug 6, 2020

@bluetyson thanks for the testing/debugging. This is very helpful!

I believe this is due to the COS API simply returning all preprint links in the database and not checking the "status" field. You're likely getting links to withdrawn preprints, which no longer have links.
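If that diagnosis is right, the records could be filtered before parsing rather than caught afterwards. The sketch below assumes a flat record with a `status` key whose value is `"withdrawn"` for withdrawn preprints, and a `links` key. Those names and the sample records are assumptions for illustration; the actual COS API response layout may differ.

```python
def is_usable(record, status_key="status", withdrawn_value="withdrawn"):
    """Return True if the record is not withdrawn and carries link data.

    ASSUMPTION: the key names and the 'withdrawn' value are hypothetical;
    adjust them to whatever the API response really contains.
    """
    if record.get(status_key) == withdrawn_value:
        return False
    # An absent or empty 'links' entry is treated as unusable as well
    return bool(record.get("links"))


# Made-up sample records, for demonstration only
records = [
    {"id": "a", "status": "accepted", "links": {"self": "https://example.org/a"}},
    {"id": "b", "status": "withdrawn"},
    {"id": "c", "status": "accepted", "links": {}},
]
usable_ids = [r["id"] for r in records if is_usable(r)]
```

Checking for the links as well as the status means a record is skipped quietly even when the status value is not the one expected.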

@bluetyson
Contributor Author

Thanks, I thought it might be something like that, so I'm just skipping them for the time being.
