-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding Web of Science harvest #204
Conversation
1062202
to
becde5b
Compare
b3b97ca
to
3620115
Compare
@@ -28,7 +28,8 @@ dependencies = [ | |||
[tool.pytest.ini_options] | |||
pythonpath = ["."] | |||
markers = "mais_tests: Tests requiring MAIS access" | |||
addopts = "-v --cov --cov-report=html --cov-report=term" | |||
addopts = "-v --cov --cov-report=html --cov-report=term --log-level INFO --log-file test.log" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It was useful for me to see the log messages. But I can remove if this is noisy.
@@ -150,6 +159,9 @@ def publish(pubs_to_contribs, merge_publications): | |||
|
|||
openalex_jsonl = openalex_harvest(snapshot) | |||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Eventually we'll want to tie the WoS harvesting into the workflow, but for now nothing is dependent on it.
|
||
|
||
def check_status(resp): | ||
# see https://github.com/sul-dlss/rialto-airflow/issues/208 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks for the links to these issues
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approved pending minor comments
This adds a harvester for Web of Science publications by ORCID. I've tested with
AIRFLOW_VAR_DEV_LIMIT=50000
which helped discover some unexpected responses from the WoS API that I needed to handle see #207 and #208.The API call was determined by looking at sul_pub's querying by ORCID and how the API KEY is passed.
Closes #165