Mailing List Archive

Spam and history
How to keep google out of the history copies of pages?
(would adding noindex metatag work? how do i do that for non-current revisions
of a page?)

All the spam gets indexed although the pages are reverted.


Selva.
Re: Spam and history [ In reply to ]
Yes, I'm pretty sure Google respects at least robots.txt, and probably
the meta tags as well.

See: http://www.robotstxt.org/wc/exclusion.html


On Tue, 12 Oct 2004 21:01:08 +0000, selva@thescian.com
<selva@thescian.com> wrote:
> How to keep google out of the history copies of pages?
> (would adding noindex metatag work? how do i do that for non-current revisions
> of a page?)
>
> All the spam gets indexed although the pages are reverted.
>
> Selva.
> _______________________________________________
> MediaWiki-l mailing list
> MediaWiki-l@Wikimedia.org
> http://mail.wikipedia.org/mailman/listinfo/mediawiki-l
>


--
Rowan Collins BSc
[IMSoP]
Re: Spam and history [ In reply to ]
On Oct 12, 2004, at 2:01 PM, selva@thescian.com wrote:
> How to keep google out of the history copies of pages?
> (would adding noindex metatag work? how do i do that for non-current
> revisions
> of a page?)

It should, and it's already included.

> All the spam gets indexed although the pages are reverted.

Can you clarify what exactly is being indexed? Note that google doesn't
update instantly when your pages change; if it happened to last visit a
page while it was vandalised, the google cache will contain the
vandalised version until googlebot next visits.

-- brion vibber (brion @ pobox.com)