CZ:Proposals/Disambiguation mechanics

From Citizendium
< CZ:Proposals
Revision as of 21:16, 15 May 2008 by imported>J. Noel Chiappa (→‎Discussion: typo)
Jump to navigation Jump to search
[edit intro]

This proposal has not yet been assigned to any decisionmaking group or decisionmaker(s).
The Proposals Manager will do so soon if and when the proposal or issue is "well formed" (including having a driver).
For now, the proposal record can be found in the new proposals queue.


Driver: J. Noel Chiappa

Note: This proposal is not about when we should disambiguate, which names/articles should be disambiguated, what a disambiguation page should look like (e.g. how should it be organized, and what should it contain), etc. It's only about the low-level mechanics of how disambiguation should work.

Complete explanation

All articles/names which have multiple potential meanings (i.e. need disambiguation) will be handled as follows:

  • The disambiguation page (i.e. the page that lists all the potential meanings, and provides links to the articles, for those for which we have articles) should be at "{DisambiguatedName} (disambiguation)" (where {DisambiguatedName} is the name in question).
  • A redirect should always be placed at the main "{DisambiguatedName}" location; i.e. with no article actually at "{DisambiguatedName}", not even the main meaning. That redirect will normally point at the disambiguation page (see discussion below for possible exceptions). It will also be placed in a category, to allow all such disambiguation redirects to be easily found.
    • The talk page of the redirect should always be redirected to the talk page of the disambiguation page; that way, discussion of any issues related to the redirect will all be in only one location.
  • All "DisambiguatedName" articles should be at pages of the form "{DisambiguatedName} (song)", "{DisambiguatedName} (automobile)", etc; i.e. disambiguated by a modifer enclosed in ()'s.

Reasoning

Always having a redirect at "{DisambiguatedName}" enables us to quickly check for pages which have been linked to "{DisambiguatedName}" without the writer of those pages having checked to make sure they have linked to the correct page for whichever meaning of "{DisambiguatedName}" they wanted.

If the disambiguation page is always at "{DisambiguatedName} (disambiguation)", and there is always a redirect at "{DisambiguatedName}", then all links to "{DisambiguatedName}" are automatically 'wrong' (although they might accidentally wind up at the right page - see below), therefore making it totally trivial to find the pages that need to be fixed.

On a periodic basis, the 'What links here' of all such redirects should be checked, and all articles linking to it updated to link to the correct page. With this scheme, there is no 'build up' of 'legitimate' entries in the 'What links here' to wade through, since there are no such 'legitimate' entries.


The reason why we don't want the main content at "{DisambiguatedName}" is that with a popular page like tree, it's impossibly painstaking to go click on every entry in itsWhat links here, and look through all the text of each page in that list to find all the references to tree, to make sure they are all to the arboreal "tree", as opposed to someone who wanted, say, a 'tree data structure'.

Even worse, even were such a painstaking sweep performed, after some time had passed, the list might again contain erroneous links - with no way to sort them out from the mass of previously checked links (since there is no 'History' for 'What links here').

Although the redirect at the 'main' name (e.g. tree) would generally point to the disambiguation page (so that for readers of articles which link to the ambiguous title, they are at most one click away from the article they want), we might make some exceptions. In some cases, where one meaning is much more common than others, we could set that redirect to point directly to the article on the primary meaning; that article would contain a link to the disambiguation page at the top of the article (e.g. For other meanings, see tree (disambiguation)). This would still provide most of the benefits of this proposal (i.e. making it very easy to find articles which have linked to ambiguous article titles), but be a little more user-friendly in the case of erroneous links to an ambiguous term, in that in most such cases, the extra click would not be needed.


The reason why we shouldn't have the disambiguation at "{DisambiguatedName}" is that for many disambiguation pages, there are some meanings of "{DisambiguatedName}" which don't have articles, and linking to the disambig page is the right thing (since the meaning is defined there). E.g. for "hack", some of the meanings don't have pages (e.g. party hack), and so some pages might legitimately link to "hack (disambiguation)", e.g. an article on Soviet art.

So even a disambig page can have legitimate links to it, and if we had the disambig page at "{DisambiguatedName}", when 'What links here' for the page was examined, there would still be a mix of legitimate links, and bogus ones (where someone was lazy, and linked to "{DisambiguatedName}", without checking to see what they got).

However, if the disambiguation page is always at "{DisambiguatedName} (disambiguation)", then all links to "{DisambiguatedName} (disambiguation)" are automatically good.

Background

This proposal is based on a great deal of practical experience (principally at Wikipedia), and was originally proposed there some time ago; time has not changed those conclusions.

So many instances of the kind of problems with the Wikipedia style of disambiguation pages have been seen that it's amazing that Wikipedia still uses their existing system.

Some of those with Wikipedia experience have either regularly 'cleaned' disambig pages they created, or tried to clean up other ones, and it's always a fair amount of work. The most annoying thing is that one can go fix them all - and go back some months later and they are more erroneous links, and one has to go check them all, all over again, because one usually doesn't remember any more which ones were legitimate, and which ones are not. And there's no history on "What links here" one can use, to call out only the ones that have been added since the last time it was checked!

Implementation

Unlike Wikipedia, where a jillion pages already use the old way, we still have a manageable problem on our hands. Since this is not a technical change, merely a change to usage, there is no need to have a 'flag day' where we fix all the old pages. People can fix them as they run across them, and have the time and energy to do so.

There is, however, no reason to keep making more of them, so if we adopt this, we should spread the word about the new policy, to prevent making unnecessary work for us (by creating things we will later have to fix up).

Using the same reasoning ('don't make the hole any deeper'), we should try and fairly quickly move all amniguously-named article pages to their new, disambiguated names (that's all that's absolutely needed - the redirect will maintain all the existing links to that name, until such time as someone gets around to fixing them). That way, new links to those articles go to the new name, and won't need to be corrected later.

Implementation details

  • All the redirects to the disambiguation page will be tagged with a template, which adds them to a category, so it's easy to find them all to check them. (The reason it's via a template, as opposed to directly, is that that allows us to change what we do with these redirects without editing every last redirect.) The redirects will look like this:
#Redirect [[DisambiguatedName (dismabiguation)]] {{dabredir}}
  • All pages which are the 'main' meanings, and have the redirect at the base term pointing to them, will have a template (name not yet picked) at the head of them, which says something like:
For other meanings of DisambiguatedName, see DisambiguatedName (disambiguation)
  • All such 'main' meaning redirects will be tagged with a similar template, {{mainredir}}, which tags them with a (different) category.
  • All disambiguation pages should be tagged with the {{disambig}} template, which in addition to a standard header, tags them with a category.

Implementation issues

There aren't very many implementation issues.

  • What to call the category all the redirects to disambiguation pages are placed in? I would suggest "Disambiguation Redirects".
  • What to call the category all the redirects to main pages are placed in? I would suggest "Main-meaning Redirects".
  • What should the header template placed on such main articles be called? Should it add the articles to a category, and if so, what should it be called? (This decision can easily be changed later, simply by changing the template, so it's not super-urgent that it be decided, or decided correctly.)

Open issues

There aren't really very many open issues in this.

  • Probably the biggest one is 'when do we grant an exception to allow the redirect to point to the main meaning, instead of the disambiguation page'? I would suggest that this is probably something to be decided on a case-by-case basis (the issue to be discussed, and settled, on the talk page of the disambiguation page), with the general guideline that one particular meaning should be preferred only when it is overwhelmingly the most popular meaning. Or is this likely to lead to too many interminable debates? If so, we could simply say that it always has to point to the disambiguation page, or say that a Constable gets to make a decision which is binding, saving an appeal to e.g. the Editorial Council.
  • Do we want to bother having all the articles in a 'disambiguation group' have a header on them that says something of the form 'For other meanings of DisambiguatedName, see DisambiguatedName (disambiguation)'? I would say no, because people shouldn't be on such a page unless they deliberately went there, or were sent there, not as a result of confusion. However, I don't have a strong bias against doing so; if people want to do so, that would be fine with me. (If so, however, it should be via a different template, so we can separate out the 'main' meaning articles from the 'subsidiary' meaning ones.)
  • Some people will no doubt be offended at having to title the article on trees at "Tree (plant)". I had proposed on the Forums that we separate page-names from article titles, but a few people strongly disliked this idea. If we had that technical capability, we could of course avoid this issue entirely. Without it, however, we are forced to chose one of either i) ugly article titles, or ii) widespread links to the wrong page (as on Wikipedia), and to me the former is the lesser of two evils - especially as we are already living with that evil in article titles like "Charles I (Spain)".
  • What to do with "{DisambiguatedName}/Definition"? Should it then always (perhaps with the same exceptions as above) read "Can have several meanings, summarized at DisambiguatedName (disambiguation)"? Or should it not exist at all?

Discussion

  • YES! I am coming out strongly in favor of this proposal. Despite the nuisance of having articles with names like "Tree (plant)", it is necessary to do something for topics which have important articles in completely different disciplines. "Tree" is a good example, because a "tree" is an important concept in computer science and mathematics. Without such a policy, there will inevitably be disagreements about which article gets top billing, i.e., sits at "Tree".Pat Palmer 15:23, 13 May 2008 (CDT)
    • Tree is also important in phylogenetics. We must learn from wikipedia and become better, this is a prime example. Chris Day 09:31, 15 May 2008 (CDT)
  • I too support this proposal. I especially agree with the redirect from {foo} to {foo} (disambig). It helps separate good links from "bad" links, which could help prevent confusion in many cases. My only question is this: Who decides (or rather, what will the standard be) on how to distinguish articles? I know this is a bad example, but would the article Victoria be changed to Victoria (Queen), Victoria (British queen), or Victoria (Royalty)? And why? I think there should be clear guidelines and standards to prevent confusion before this gets put into effect. John Dvorak 15:39, 14 May 2008 (CDT)
    • In the biology workgroup there was a long discussion on naming conventions. Each species has a scientific name and many have a common name too, some even have multiple common names. In summary, this has to be on a case by case basis. In that discussion we learned that it is hard to find one rule that fits all situations (at least, one rule that everyone is happy with). I would add, I think this is a tangential topic to the need being discussed here. Chris Day 09:37, 15 May 2008 (CDT)
    • That whole Victoria thing is a wholly separate issue; it's an issue of how we name articles (something we've had some long debates about, and tacitly put on the back burner while we get busy an create content). So, can we not dive down that rathole here? Thanks! J. Noel Chiappa 09:46, 15 May 2008 (CDT)
  • YES too! The {foo}/definition could exist but it is not critical. The {{R}} template could be coded such that if Foo (disambiguation) exists then the definition is automatically given as: "There are several meanings; see [[{{{1}}} (disambiguation)]]". This has the advantage of overiding any definition that might already exist at foo/definition as well as alerting the author of the specific related articles page to clarify the link that gives the correct definition in the context of that RA subpage. For example, {{R|Cell}} can be fixed to {{R|Cell (biology)|Cell}}, or <nowiki>{{R|Cell (electric)|Cell}}. Chris Day 09:52, 15 May 2008 (CDT)
  • I join in supporting this initiative and plead for a rather strict (but easy to navigate) implementation (along the lines of Chris' demonstration of {{R}}), as any discussion about priorities will result in the dissipation of much-needed energy, for generally very little gain. I expect a compulsory disambig to also increase interaction between the writers of the different Foo articles, especially if the disambig page provides some background as to the aspects in which the trees in mathematics, computer science, phylogenetics and elsewhere are related to the more ancient concept of the plant. And it makes tracking false links easier, too. -- Daniel Mietchen 10:32, 15 May 2008 (CDT)
  • No. I'm coming out against the proposal, logical though it is. I note that the first ones to weigh in are scientists, I think that's because it is well-reasoned, thoughtful and seems sensible. However, it is also counterintuitive, and I think that's the rub. It seems to me that the appeal to the scientific, as opposed to the artistic, mind is that it will ostensibly stop arguments, unify the format, and so ultimately save time and energy, things we waste entirely too much of, I'd be the first to agree.
Pat, I think you capture the essence of the problem I have with this, to wit, tree (plant) will be a nuisance. It will be more than a nuisance, it will be a major headache. Tree was a tree before it was a flowchart or a thing you stretched shoes on, and I suggest that most people's instinct is to conceive first of a tree, unless they were specifically thinking (writing) in some other context. To that end, we're going to have endless problems.
Ah, but, I hear you say--what about the constant arguments about which article name is the primary one? Well, look, so far we've gone with Standard English and Common Sense; I see no reason to abandon either. Who in their right mind is going to argue about the primacy of Tree? The difficulty arises, of course, if or when the matter is less clear. But in so many of these cases, there is a primary meaning, and when there isn't, we should of course use diambiguation.
You guys think differently. Or *people like me* think differently, as you prefer. For example, in a universal proposal you do not write article name or common noun, you write foo, and you're dealing with people who don't know what foo is. I'm learning, but it's a process.
What you're suggesting is that using universal disambiguation will make arguments go away. As far as I can tell, all that will happen here is that the place and level at which arguments take place will shift. The need to redirect will not change, what will change is where the redirects are to. In fact, it will likely make things even worse, because people will have to redirect common nouns all over the place. All the clusters. All the subpages. All the definitions. All because we're going to attempt to avoid defining a baker as someone who cooks things in the oven and a ball as a round spherical thing. Seems to me it'll be a lot easier in the long run to have ball and redirect the odd ball to "ball (dance)" when you run into a needed redirect than it will be to run around the wiki changing every blasted instance of ball to ball (round spherical thing)
I also note that the proposal already attempts to provide for exceptions. My point would be that there will be more exceptions that you have anticipated. Many nouns involved here. Many, many nouns. 26 letters of the alphabet. A gazillion plants and animals. The colours of the rainbow. Chris harked back to biology naming arguments; Noel doesn't want to go there, and I do agree, but I think that that argument helps illustrate how complicated matters can get.
People's brains do not instinctively disambiguate, and that's where the trouble is. People generally know what a primary use is. When I recently wrote the last great Moose--that's Moose, I placed the article at Moose (dog actor), not Moose, which is Rocky-n-Bullwinkle (and currently does not exist). It does not matter one wit that I am not a zoologist, a dog called Moose, no matter how famous, is not a Moose, and I know it. Now unless you can show me that it makes sense to have, instead of Moose, Moose (disambiguation): Moose (elk-like creature), Moose (dog actor), I cannot support the argument.
Having said this much, I think the *idea* is rooted in a good, solid premise, but some ammending is needed. Aleta Curry 19:23, 15 May 2008 (CDT)
  • The thing is that the old saying "There Ain't No Such Thing As A Free Lunch" applies here. Yes, typing "[[Tree (plant)]]" is a pain, but so is sorting through all the links to find the possibly erroneous ones. Your assertion that it will be "a lot easier in the long run to have [[ball]] and redirect the odd [[ball]] to [[ball (dance)]]" is not correct.
If you think it's not going to be a huge problem, trundle on over to Wikipedia and click on 'What-Links-Here' for "Tree", here, and go through that list and find (and fix) all the links to the wrong kind of tree. Hint: here is an incorrect one I found in a few seconds of looking - but how many others are in that list? How long will it take you to look through all the articles on that lengthy list and find all the mistakes? And how many new ones will have been added when you go back in 3 months, and have to look through the same long list?
Trust me, if we don't do this, we're going to have just what Wikipedia already has, which is a lot of articles that have links that link to the wrong thing (because what you're arguing for is basically the Wikipedia system, and we have evidence of what happens with that). I don't know about you, but that's not the quality level I'm hoping for from CZ.
And we won't have to "run around the wiki" changing anything; the redirects will keep things properly linked until someone changes things to link directly to the disambiguated article. The number of articles is still small enough that there isn't a giant backlog that will take forever to clear.
Also, this proposal is not in any way intended to get rid of arguments (I don't know where you got that from, because there's nothing suggesting that in the proposal, I thought - perhaps you could point it out?) But it won't make things any worse than it is now. If you have a name that has to be disambiguated, even without this proposal, you're still going to have the argument of 'which variant, if any, gets the 'base' page'. Been there, done that too, on Wikipedia (which, again, is basically the system you're arguing for).
Still, if you think it can be amended to be better, I'd more than willing to hear suggestions for improvements. Did you have any specific modifications to suggest? J. Noel Chiappa 20:36, 15 May 2008 (CDT)
  • Actually, now that I think about it, there might actually be less arguments with this system. As long as one meaning can 'claim' the title {DisambiguratedPage}, that increases the reward (i.e. incentive) for arguing over which one gets it. But when no article is going to get that article title, and the most one can hope for is to be the default link (which is only ever of temporary use anyway, since all links will be modified fairly quickly to not use it), there's a lot less incentive to drag it out.
On another topic, I expect eventually we might even have a human-assisted bot to fix those links to the 'base' name; it would e.g. pull up the text around the link, and look at the dab page to find the options, and then it could offer to link to the main meaning if one hits the space bar, or to pick one of the other ones, type a number, or something like that. J. Noel Chiappa 21:16, 15 May 2008 (CDT)


Proposals System Navigation (advanced users only)

Proposal lists (some planned pages are still blank):