Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve tidy_warcs command #88

Open
1 of 3 tasks
anjackson opened this issue Dec 17, 2021 · 1 comment
Open
1 of 3 tasks

Improve tidy_warcs command #88

anjackson opened this issue Dec 17, 2021 · 1 comment
Assignees

Comments

@anjackson
Copy link
Contributor

anjackson commented Dec 17, 2021

As well as moving properly-named WARCPROX WARCs into place, tidy_warcs should:

  • Move default-named WARCPROX-### WARCs into the closest matching job folder.
  • Find any older WARCs that are still .open and close them
  • Count totals for WARCs and push those numbers to Prometheus too, so we can monitor any move-to-hdfs backlog more reliably.
@anjackson anjackson self-assigned this Jan 4, 2022
@anjackson
Copy link
Contributor Author

Added support for ukwa_files_count and ukwa_files_moved_count metrics to the tidy_warcs command.6238ae4

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant