Make one of the *buntu wiki's the "canonical" source to search engines
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ubuntu Website - OBSOLETE |
Won't Fix
|
Undecided
|
Unassigned |
Bug Description
The kubuntu, ubuntu and edubuntu wikis each point to the same data but with different themes. This confuses search engines so sometimes we see one search result directing to the edubuntu wiki, another to kubuntu and another to ubuntu. Generally this appears to be quite arbitrary.
The original report:
I've noticed for a while that when Ubuntu wiki pages are returned in Google
results, it's very often a wiki.edubuntu.org URL rather than a
wiki.ubuntu.com one. This is surprising, since it seems to happen even when
"ubuntu" is one of the search terms.
This indicates to me that there may be a problem with how wiki.ubuntu.com is
indexed by Google.
Is anyone else seeing this, and do you have an idea as to what might be
causing it?
A follow up:
Yes, I encountered this yesterday with
<http://
Simpler examples (wiki.edubuntu.org version ranked first):
<http://
<http://
Counterexamples (wiki.ubuntu.com version ranked first):
<http://
<http://
...
Though the text on these four wikis is similar, it's apparently not
similar enough for Google to consider the pages as variants (and thereby
trigger its "In order to show you the most relevant results, we have
omitted some entries very similar" message). So, each result gets ranked
independently based on its own incoming links.
...
I can think of three ways to fix this:
1. Retire wiki.edubuntu.org, redirecting it to wiki.ubuntu.com.
2. Link from <http://
thereby boosting the ranking of wiki.ubuntu.com pages in general.
This might not be appropriate in itself, though, unless
<https:/
Center" or similar.
3. Use the coincidentally-
<http://
in the templates for wiki.edubuntu.org and wiki.kubuntu.org, to
specify that wiki.ubuntu.
wiki.
be undesirable for pages that really were Edubuntu- or
Kubuntu-
The email thread where this was discussed is at https:/
Changed in ubuntu-website: | |
status: | New → Confirmed |
yeah, this can be done http:// googlewebmaster central. blogspot. com/2009/ 12/handling- legitimate- cross-domain. html edubuntu) >> somewhere and it would sort it all out. Some pages we probably don't care which one gets indexed and would normally end up with the wiki.ubuntu.com branding.
and I think could be implemented nicely with a moin macro so on pages where we always want a particular variant to be indexed we could include <<Canonical(