Anchors are being stripped out (using `sitemaps`, `linkExtractor` and `externalData`)
#1831
Comments
Hey, thanks for opening the issue. #1823 seems related. I'll investigate whether there's a way for us to differentiate hash-routed pages from anchored sections.
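For readers following along, here is a minimal sketch of what telling hash-routed pages apart from anchored sections could look like. It is not how the Crawler works internally. The heuristic (a fragment starting with `#/` or `#!/` is a route), the function names `isHashRoute` and `normalizeUrl`, and the `example.com` URLs are all assumptions made for illustration.

```javascript
// Hypothetical heuristic: fragments that start with "#/" or "#!/" look like
// client-side routes; anything else is treated as an in-page section anchor.
function isHashRoute(hash) {
  return /^#!?\//.test(hash);
}

// Normalize a URL for deduplication: drop section anchors, keep hash routes.
// Uses the standard WHATWG URL API (available in browsers and Node.js).
function normalizeUrl(raw) {
  const url = new URL(raw);
  if (!isHashRoute(url.hash)) {
    url.hash = ''; // removes the fragment entirely
  }
  return url.href;
}

// Made-up URLs: two hash-routed pages and one section anchor.
const urls = [
  'https://example.com/app#/settings',
  'https://example.com/app#/profile',
  'https://example.com/docs#installation',
];

console.log(urls.map(normalizeUrl));
```

With this heuristic the two `#/…` URLs stay distinct, while the `#installation` anchor collapses to its page URL. Stripping every fragment unconditionally would instead merge both hash-routed pages into a single `https://example.com/app` entry, which matches the symptom reported in this issue.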
Thank you for a quick response!
Hey @shortcuts, any news on this one? Somewhat related: I tried to provide anchored URLs to the Crawler with …
Example CSV:
Single URL under Crawler admin > External Data:
I would expect the same issue to appear with your API client (JS), but I've just successfully created 2 objects containing URLs with anchors in our demo app (free plan, app ID …).
So, technically, my wild guess is that your system supports anchored URLs and they are just not supported by the crawler?
Hey @shortcuts, any news about this one?
Description
We are using the Algolia Crawler UI to parse our mixed static HTML and SPA website (which uses a hash router). All URLs are provided in the `sitemaps` Crawler config.
Steps to reproduce
Use a sitemap with the following content:
... or using the static `linkExtractor`:
Then run the URL Tester.
Result:
Expected behavior
Expected result:
Note that those are not section anchors. Those are actual pages, correctly parsed in the URL Tester with the `renderJavaScript: true` option when passing the full URL with the anchor.
Environment
Similar issues: