Navigating the Application

Andy Jackson edited this page Aug 10, 2015 · 2 revisions
  1. Login with your personal credentials
    1. Email address
    2. Password: DDHAPT now uses the W3ACT production database, so if you have a W3ACT account, log in to DDHAPT with the same credentials.
      1. NOTE: If you do not have a W3ACT account, your default password may be “secret”. If that does not work either, contact the W3ACT SysAdmin.
    3. Click on the “W3ACT” logo in the top left corner to reach the home screen
      1. NOTE: This is a known issue since the W3ACT merge – not all users will have the same home page.
    4. You can change your password by selecting the “Users & Agencies” drop-down menu item “Change Password”
    5. If your home page (where you start when you log in or when you click on the W3ACT logo in the upper-left corner) is not your personal “Document Harvesting” home page, ask an Archivist or SysAdmin to select “show DDHAPT start page” in your user profile.
  2. User Home Page
    1. NOTE: you can always return to your home page by clicking on the W3ACT logo in the upper-left corner of the application
    2. My Watched Targets Tab
      1. NOTE: This list automatically includes any child Watched Targets
      2. Lists all of the Watched Targets owned by you (the user who is logged in)
      3. Click on the “Title” column header to sort by title
      4. Click on the “Crawl Date” column header to sort by Crawl Date
        1. NOTE: Live Crawl Documents do not have a Crawl Date (because Crawl Date refers to the date crawled by Heritrix)
      5. Click on the link in the title column to view the Watched Target object
      6. Click on the link in the URL column to view the target URL on the live web in a new tab
      7. Click on the link in the Curator column to view the Curator object
      8. Click on the number link in the Documents column to see the Documents associated with that Watched Target
      9. Click on the “Live Crawl” button in the Action column to perform an instant live web crawl of the selected Watched Target (will return at most three PDFs)
    3. My New Documents Tab
      1. Lists all of the Documents owned by you (the user who is logged in) that have not been submitted or ignored
      2. Click on the “Title” column header to sort by title
      3. Click on the “Landing Page” column header to sort by Landing Page URL
      4. Click on the “Document” column header to sort by Document URL
      5. Click on the “Crawl Date” column header to sort by Crawl Date
      6. Click on the link in the title column to edit the Document object
      7. Click on the link in the Landing page column to view the landing page on the live web in a new tab
      8. Click on the link in the Document column to view the Document PDF on the live web in a new tab
      9. Click on the “Ignore” button in the Action column to ignore the Document.
    4. Alerts Tab
      1. Lists all of the Alerts associated with your Watched Targets and Documents
        1. Unread Alerts are displayed in boldface type
        2. Read Alerts are displayed in normal type
      2. Click on links within the message body in order to navigate to the Watched Target or Document referred to by the link.
        1. NEW: in the case of duplicate or near-duplicate Documents (versions), a compare link is offered, where the two Documents will be displayed side by side.
      3. Click on the “Message” column header to sort by message text
      4. Click on the “Date” column header to sort by Alert Date
      5. Toggle the checkbox in the final column header in order to select/de-select all Alerts in the list
      6. Click on the “Mark As Read” button in order to mark all selected Alerts as read
      7. Click on the “Delete” button in order to delete all of the selected Alerts
    5. New Target Field (available only from the “My Watched Targets” tab)
      1. If you wish to create a new Watched Target, enter the URL in this field and click on the “Add Target URL” button
  3. Document Harvesting: Watched Targets
    1. Search
      1. Enter text into the search field and click on “Search”
      2. The results list shows all matches on the Watched Target URL
    2. Curator Filter
      1. Select the owner of the Watched Target in question from the drop-down list
        1. You can select “All” to see all Watched Targets in the system
          1. The “Child Targets” filter is removed in this case
        2. Select a specific Curator to see all of their Watched Targets
    3. Include Child Targets Filter
      1. Check the filter “Include Child Targets” to see related Watched Targets
    4. Results List
      1. Click on the “Title” column header to sort by title
      2. Click on the “Curator” column header to sort by Curator
      3. Click on the “Crawl Date” column header to sort by Crawl Date
      4. Click on the link in the title column to view the Watched Target object
      5. Click on the link in the URL column to view the live web target in a new tab
      6. Click on the link in the Curator column to view the Curator object
      7. Click on the number link in the Documents column to see the Documents associated with that Watched Target
      8. Click on the “Live Crawl” button to perform an instant live web crawl of the selected Watched Target (will return at most three PDFs)
  4. Document Harvesting: Documents
    1. Tab Selection
      1. New
        1. Discovered Documents that have not yet been submitted or ignored
      2. Submitted
        1. Documents that have been edited and submitted to the ingestion queue
      3. Ignored
        1. Documents that have been tagged as ignored by a Selector
    2. Search
      1. Enter text into the search field and click on “Search”
      2. The results list shows all matches on the Document Title
    3. Curator Filter
      1. Select the owner of the Document in question from the drop-down list
        1. You can select “All” to see all Documents in the system
          1. The “Watched Target” filter is removed in this case
        2. Select a specific Curator to see all of their Documents
    4. Watched Target Filter
      1. Select a Watched Target from the drop-down list to see the Documents related to that specific Watched Target
    5. Service Filter [User Story 54]
      1. Select one of the Services from the drop-down menu in order to restrict the results list to Documents that should be published to that Service
    6. Subject Filter [User Story 33, 34]
      1. Select one or more of the FAST Subjects from the Subject scroll box in order to restrict the results list to Documents that match that Subject
        1. NOTE: Selecting multiple Subjects will lead to results that match one or more of these (technically speaking: it is an OR rather than an AND operation).
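The OR behaviour noted above can be illustrated with a minimal sketch. This is not DDHAPT's actual code: the `Document` class, the `filter_by_subjects` function, and the sample subject names are all hypothetical, chosen only to show that a Document matches when it carries at least one of the selected Subjects.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    # Hypothetical stand-in for a DDHAPT Document record.
    title: str
    subjects: set = field(default_factory=set)


def filter_by_subjects(documents, selected):
    """Keep documents that carry at least one selected subject (OR, not AND)."""
    selected = set(selected)
    if not selected:
        # Assumption: with nothing selected, the filter is inactive.
        return list(documents)
    # A non-empty intersection means at least one subject matches.
    return [d for d in documents if d.subjects & selected]


docs = [
    Document("Annual report", {"Finance"}),
    Document("Flood guidance", {"Environment", "Planning"}),
    Document("Press notice", {"Media"}),
]

matches = filter_by_subjects(docs, ["Finance", "Planning"])
print([d.title for d in matches])  # ['Annual report', 'Flood guidance']
```

An AND filter would instead require `selected <= d.subjects`, which would return no Documents for the selection in this example.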
    7. Crawl Date Filter
      1. Define a date range in order to restrict the results list to Documents that were crawled on specific dates
        1. Clicking on a field brings up a Calendar widget
        2. To find all Documents crawled after a specific date, select this date in the first field (leaving the second field blank)
        3. To find all Documents crawled before a specific date, select this date in the second field (leaving the first field blank)
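The open-ended date range described above can be sketched as follows. The function name `in_crawl_range` is hypothetical, and treating both bounds as inclusive is an assumption; the page does not say whether the selected dates themselves are included.

```python
from datetime import date
from typing import Optional


def in_crawl_range(crawl_date: date,
                   start: Optional[date] = None,
                   end: Optional[date] = None) -> bool:
    """Return True if crawl_date falls within the range.

    A bound left as None (a blank field) is not applied.
    Assumption: both bounds are inclusive.
    """
    if start is not None and crawl_date < start:
        return False
    if end is not None and crawl_date > end:
        return False
    return True


# Only the first field filled in: everything from 1 Aug 2015 onwards.
print(in_crawl_range(date(2015, 8, 10), start=date(2015, 8, 1)))  # True
# Only the second field filled in: everything up to 1 Aug 2015.
print(in_crawl_range(date(2015, 8, 10), end=date(2015, 8, 1)))    # False
```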
    8. Export Button (Reporting) [User Story 46]
      1. Click on the export button in order to download the existing results list as a CSV file. Note that all results (not just the ones seen on the screen) will be exported.
      2. Choose the option “Save File.” Do not choose the option “Open with Microsoft Excel”, as this does not properly import the CSV file.
      3. Refer to Section 6 below for instructions on opening the CSV file in Excel.
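As an alternative way to inspect a saved export outside Excel, the CSV file can be read with Python's standard `csv` module. This is only a sketch: the column names and the sample row below are hypothetical, and the real export's columns and encoding may differ.

```python
import csv
import io


def read_export(handle):
    """Return the exported rows as a list of dicts keyed by the header row."""
    return list(csv.DictReader(handle))


# Hypothetical sample standing in for a saved export file.
sample = io.StringIO(
    'Title,Landing Page,Document\r\n'
    '"Report, 2015",http://example.org/reports,http://example.org/reports/2015.pdf\r\n'
)

rows = read_export(sample)
print(len(rows), rows[0]["Title"])  # 1 Report, 2015
```

For a file on disk, pass an open handle instead, for example `open(path, newline="", encoding="utf-8")`, where the UTF-8 encoding is an assumption about the export.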
    9. Results List
      1. Click on the “Title” column header to sort by title
      2. Click on the “Landing Page” column header to sort by Landing Page URL
      3. Click on the “Document” column header to sort by Document URL
      4. Click on the link in the title column to edit the Document object
        1. This is not possible if the Document is ignored
      5. Click on the link in the Landing page column to view the landing page on the live web in a new tab
      6. Click on the link in the Document column to view the Document PDF on the live web in a new tab
      7. Click on the “Ignore” button in the Action column in order to ignore the document.
        1. OR Click on the “Restore” button in the Action column to return the document to the New list.