You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Sitemaps are automatically detected in the robots.txt but not checked for cross-submits. From time to time this leads to spam-like injections of URLs not matching the news genre. Recently, via one of their periodicals a publishing company "injects" their entire publishing program including landing pages for books and other media. This also happened for real estate ads before.
Note that the sitemaps must follow the news sitemap format which is the barrier for most cross-submits but not always.
The text was updated successfully, but these errors were encountered:
Further scenario: a news site redirects on of their news articles to a page on another site as kind of an advertisement. We need to check the robots.txt of the target site, of course. But we should ignore the sitemap directives.
Sitemaps are automatically detected in the robots.txt but not checked for cross-submits. From time to time this leads to spam-like injections of URLs not matching the news genre. Recently, via one of their periodicals a publishing company "injects" their entire publishing program including landing pages for books and other media. This also happened for real estate ads before.
Note that the sitemaps must follow the news sitemap format which is the barrier for most cross-submits but not always.
The text was updated successfully, but these errors were encountered: