CZ:Proposals/Disambiguation mechanics: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>J. Noel Chiappa
(→‎Complete explanation: Change to allow minimal effort in article writing)
imported>Pat Palmer
(this page is no longer needed; it will direct people to go to Disambiguation)
 
(58 intermediate revisions by 9 users not shown)
Line 1: Line 1:
{{editintro}}
{{editintro}}
{{proposal assignment}}


'''Driver:''' [[User:J. Noel Chiappa|J. Noel Chiappa]]
[[CZ:Proposals/Disambiguation_mechanics/Archive 1|Archive 1]]


''Note:'' This proposal is not about ''when'' we should disambiguate, ''which'' names/articles should be disambiguated, ''what'' a disambiguation page should look like (e.g. how should it be organized, and what should it contain), etc. It's only about the low-level mechanics of ''how'' disambiguation should work.
The disambiguation policy is explained at [[CZ:Disambiguation]].
 
== Complete explanation ==
 
'''All''' articles/names which have multiple potential meanings (i.e. need disambiguation) will be handled as follows:
 
* The disambiguation page (i.e. the page that <!-- lists all the potential meanings, and --> provides links to the articles, for those meanings for which we have articles) should be at "{DisambiguatedName} (disambiguation)" (where {DisambiguatedName} is the name in question).
 
* A redirect should '''always''' be placed at the base "{DisambiguatedName}" location; i.e. with '''no''' article actually at "{DisambiguatedName}", not even the main meaning. That redirect will point at either the disambiguation page, or the most common meaning of the disambuated term (see discussion below for possible exceptions). It will also be placed in a category, to allow all such disambiguation redirects to be easily found.
** The talk page of the redirect should '''always''' be redirected to the talk page of the disambiguation page; that way, discussion of any issues related to the redirect will all be in only one location.
** When the 'base' redirect points to one particular meaning, the talk: page for the disambiguation page will have a link to that meaning's talk page at the head, for ease of navigation.
 
* All "DisambiguatedName" articles should be at pages of the form "{DisambiguatedName} (song)", "{DisambiguatedName} (automobile)", etc; i.e. disambiguated by a modifer enclosed in ()'s.
 
=== Reasoning ===
 
'''''Always''''' having a redirect at "{DisambiguatedName}" enables us to quickly check for pages which have been linked to "{DisambiguatedName}" without the writer of those pages having checked to make sure they have linked to the correct page for whichever meaning of "{DisambiguatedName}" they wanted.
 
If the disambiguation page is ''always'' at "{DisambiguatedName} (disambiguation)", and there is always a redirect at "{DisambiguatedName}", then '''all''' links to "{DisambiguatedName}" are '''automatically''' 'wrong' (although they might accidentally wind up at the right page - see below), therefore making it totally trivial to find the pages that need to be fixed.
 
On a periodic basis, the 'What links here' of all such redirects should be checked, and all articles linking to it updated to link to the correct page. With this scheme, there is no 'build up' of 'legitimate' entries in the 'What links here' to wade through, since there ''are'' no such 'legitimate' entries.
 
 
The reason why we don't want the ''main content'' at "{DisambiguatedName}" is that with a popular page like [[tree]], it's impossibly painstaking to go click on ''every'' entry in its[[Special:Whatlinkshere/Tree|What links here]], and look through all the text of each page in that list to find ''all'' the references to [[tree]], to make sure they are all to the arboreal "tree", as opposed to someone who wanted, say, a 'tree data structure'.
 
Even worse, even were such a painstaking sweep performed, after some time had passed, the list might again contain erroneous links - with no way to sort them out from the mass of previously checked links (since there is no 'History' for 'What links here').
 
 
Although the redirect at the 'base' name (e.g. [[tree]]) will often point to the disambiguation page (so that for readers of articles which link to the ambiguous title, they are at most one click away from the article they want), we will often point it to a particular meaning.
 
Where one meaning is ''much'' more common than others, we could set the 'base' redirect to point directly to the article on the primary meaning; that article would contain a link to the disambiguation page at the top of the article (e.g. ''For other meanings, see <font color=blue>tree (disambiguation)</font>''). This would still provide most of the benefits of this proposal (i.e. making it very easy to find articles which have linked to ambiguous article titles), but be a little more user-friendly in the case of erroneous links to an ambiguous term, in that in ''most'' such cases, the extra click would not be needed.
 
In addition, it will allow the work of creating articles to be ''exactly'' the same as the old system. If we redirect from "tree" to "tree (plant)", writing <nowiki>[[tree]]</nowiki> in an article will get you ''exactly'' the same result as under the old system; authors will neither have to look up the correct name to link to, nor write the extra characters (e.g. <nowiki>[[tree (plant)|tree]]</nowiki>. Someone else (likely aided by a bot, to make the work easy) can come along later and fix the link to go directly to the appropriate target.
 
 
The reason why we shouldn't have the ''disambiguation'' at "{DisambiguatedName}" is that for many disambiguation pages, there are some meanings of "{DisambiguatedName}" which ''don't have'' articles, and linking to the disambig page ''is'' the right thing (since the meaning is defined there). E.g. for "hack", some of the meanings don't have pages (e.g. party hack), and so some pages might legitimately link to "hack (disambiguation)", e.g. an article on Soviet art.
 
So even a disambig page can have legitimate links to it, and if we had the disambig page at "{DisambiguatedName}", when 'What links here' for the page was examined, there would still be a mix of legitimate links, and bogus ones (where someone was lazy, and linked to "{DisambiguatedName}", without checking to see what they got).
 
However, if the disambiguation page is ''always'' at "{DisambiguatedName} (disambiguation)", then '''all''' links to "{DisambiguatedName} (disambiguation)" are automatically good.
 
===Background===
 
This proposal is based on a great deal of ''practical'' experience (principally at Wikipedia), and was originally proposed {{WP|User:Jnc/Disambiguation|there}} some time ago; time has not changed those conclusions.
 
So many instances of the kind of problems with the Wikipedia style of disambiguation pages have been seen that it's amazing that Wikipedia still uses their existing system.
 
Some of those with Wikipedia experience have either regularly 'cleaned' disambig pages they created, or tried to clean up other ones, and it's always a fair amount of work. The most annoying thing is that one can go fix them all - and go back some months later and they are more erroneous links, and one has to go check them '''all''', all over again, because one usually doesn't remember any more which ones were legitimate, and which ones are not. And there's no history on "What links here" one can use, to call out only the ones that have been added since the last time it was checked!
 
== Implementation ==
 
Unlike Wikipedia, where a jillion pages already use the old way, we still have a manageable problem on our hands. Since this is not a technical change, merely a change to usage, there is no need to have a 'flag day' where we fix all the old pages. People can fix them as they run across them, and have the time and energy to do so.
 
There is, however, no reason to ''keep'' making more of them, so if we adopt this, we should spread the word about the new policy, to prevent making unnecessary work for us (by creating things we will later have to fix up).
 
Using the same reasoning ('don't make the hole any deeper'), we should try and fairly quickly move all amniguously-named article pages to their new, disambiguated names (that's all that's absolutely needed - the redirect will maintain all the existing links to that name, until such time as someone gets around to fixing them). That way, new links to those articles go to the new name, and won't need to be corrected later.
 
=== Implementation details ===
 
* All the redirects to the disambiguation page will be tagged with a template, which adds them to a category, so it's easy to find them all to check them. (The reason it's via a template, as opposed to directly, is that that allows us to change what we do with these redirects without editing every last redirect.) The redirects will look like this:
 
:<code><nowiki>#Redirect [[DisambiguatedName (dismabiguation)]] {{dabredir}}</nowiki></code>
 
* All pages which are the 'main' meanings, and have the redirect at the base term pointing to them, will have a template (name not yet picked) at the head of them, which says something like:
 
: ''For other meanings of '''DisambiguatedName''', see <font color=blue>DisambiguatedName (disambiguation)</font>''
 
* All such 'main' meaning redirects will be tagged with a similar template, <nowiki>{{mainredir}}</nowiki>, which tags them with a (different) category.
 
* All disambiguation pages should be tagged with the {{tl|disambig}} template, which in addition to a standard header, tags them with a category.
 
=== Implementation issues ===
 
There aren't very many implementation issues.
 
* What to call the category all the redirects to disambiguation pages are placed in? I would suggest "Disambiguation Redirects".
 
* What to call the category all the redirects to main pages are placed in? I would suggest "Main-meaning Redirects".
 
* What should the header template placed on such main articles be called? Should it add the articles to a category, and if so, what should it be called? (This decision can easily be changed later, simply by changing the template, so it's not super-urgent that it be decided, or decided correctly.)
 
==Open issues==
 
There aren't really very many open issues in this.
 
* Probably the biggest one is 'when do we grant an exception to allow the redirect to point to the main meaning, instead of the disambiguation page'? I would suggest that this is probably something to be decided on a case-by-case basis (the issue to be discussed, and settled, on the talk page of the disambiguation page), with the general guideline that one particular meaning should be preferred only when it is overwhelmingly the most popular meaning. Or is this likely to lead to too many interminable debates? If so, we could simply say that it ''always'' has to point to the disambiguation page, or say that a Constable gets to make a decision which is binding, saving an appeal to e.g. the Editorial Council.
 
* Do we want to bother having ''all'' the articles in a 'disambiguation group' have a header on them that says something of the form 'For other meanings of '''DisambiguatedName''', see <font color=blue>DisambiguatedName (disambiguation)</font>'? I would say no, because people shouldn't be on such a page unless they deliberately went there, or were sent there, not as a result of confusion. However, I don't have a strong bias against doing so; if people want to do so, that would be fine with me. (If so, however, it should be via a ''different'' template, so we can separate out the 'main' meaning articles from the 'subsidiary' meaning ones.)
 
* Some people will no doubt be offended at having to title the article on trees at "Tree (plant)". I had proposed [http://forum.citizendium.org/index.php/topic,1654.0.html on the Forums] that we separate page-names from article titles, but a few people strongly disliked this idea. If we had that technical capability, we could of course avoid this issue entirely. Without it, however, we are forced to chose one of either i) ugly article titles, or ii) widespread links to the wrong page (as on Wikipedia), and to me the former is the lesser of two evils - especially as we are ''already'' living with that evil in article titles like "Charles I (Spain)".
 
* What to do with "{DisambiguatedName}/Definition"? Should it then always (perhaps with the same exceptions as above) read "Can have several meanings, summarized at <font color=blue>DisambiguatedName (disambiguation)</font>"? Or should it not exist at all?
 
== Discussion ==
 
* YES! I am coming out strongly in favor of this proposal.  Despite the nuisance of having articles with names like "Tree (plant)", it is necessary to do something for topics which have important articles in completely different disciplines.  "Tree" is a good example, because a "tree" is an important concept in computer science and mathematics.  Without such a policy, there will inevitably be disagreements about which article gets top billing, i.e., sits at "Tree".[[User:Pat Palmer|Pat Palmer]] 15:23, 13 May 2008 (CDT)
**Tree is also important in phylogenetics. We must learn from wikipedia and become better, this is a prime example. [[User:Chris Day|Chris Day]] 09:31, 15 May 2008 (CDT)
 
* I too support this proposal. I especially agree with the redirect from {foo} to {foo} (disambig). It helps separate good links from "bad" links, which could help prevent confusion in many cases. My only question is this: Who decides (or rather, what will the standard be) on how to distinguish articles? I know this is a bad example, but would the article [[Victoria]] be changed to [[Victoria (Queen)]], [[Victoria (British queen)]], or [[Victoria (Royalty)]]? And why? I think there should be clear guidelines and standards to prevent confusion before this gets put into effect. [[User:John Dvorak|John Dvorak]] 15:39, 14 May 2008 (CDT)
**In the biology workgroup there was a long discussion on naming conventions.  Each species has a scientific name and many have a common name too, some even have multiple common names. In summary, this has to be on a case by case basis. In that discussion we learned that it is hard to find one rule that fits all situations (at least, one rule that everyone is happy with). I would add, I think this is a tangential topic to the ''need'' being discussed here. [[User:Chris Day|Chris Day]] 09:37, 15 May 2008 (CDT)
** That whole Victoria thing is a wholly separate issue; it's an issue of how we name articles (something we've had some long debates about, and tacitly put on the back burner while we get busy an create content). So, can we not dive down that rathole here? Thanks! [[User:J. Noel Chiappa|J. Noel Chiappa]] 09:46, 15 May 2008 (CDT)
 
*YES too! The {foo}/definition could exist but it is not critical.  The {{tl|R}} template could be coded such that if ''Foo (disambiguation)'' exists then the definition is automatically given as: "''There are several meanings; see <nowiki>[[{{{1}}} (disambiguation)]]</nowiki>''".  This has the advantage of overiding any definition that might already exist at ''foo/definition'' as well as alerting the author of the specific related articles page to clarify the link that gives the correct definition in the context of that RA subpage.  For example, <nowiki>{{R|Cell}}</nowiki> can be fixed to <nowiki>{{R|Cell (biology)|Cell}}, or <nowiki>{{R|Cell (electric)|Cell}}</nowiki>. [[User:Chris Day|Chris Day]] 09:52, 15 May 2008 (CDT)
 
*I join in supporting this initiative and plead for a rather strict (but easy to navigate) implementation (along the lines of Chris' demonstration of {{tl|R}}), as any discussion about priorities will result in the dissipation of much-needed [[Energy (psychology)|energy]], for generally very little gain. I expect a compulsory disambig to also increase interaction between the writers of the different Foo articles, especially if the disambig page provides some background as to the aspects in which the trees in mathematics, computer science, phylogenetics  and elsewhere are related to the more ancient concept of the plant. And it makes tracking false links easier, too. -- [[User:Daniel Mietchen|Daniel Mietchen]] 10:32, 15 May 2008 (CDT)
 
*No.  I'm coming out ''against'' the proposal, logical though it is.  I note that the first ones to weigh in are scientists, I think that's because it is well-reasoned, thoughtful and seems sensible.  However, ''it is also counterintuitive'', and I think that's the rub.  It seems to me that the appeal to the scientific, as opposed to the artistic, mind is that it will ostensibly stop arguments, unify the format, and so ultimately save time and energy, things we waste entirely too much of, I'd be the first to agree.
:Pat, I think you capture the essence of the problem I have with this, to wit,  [[tree (plant)]] will be a nuisance.  It will be more than a nuisance, it will be a major headache.  Tree was a tree before it was a flowchart or a thing you stretched shoes on, and I suggest that most people's instinct is to conceive ''first'' of a tree, unless they were specifically thinking (writing) in some other context.  To that end, we're going to have endless problems.
:Ah, but, I hear you say--what about the constant arguments about which article name is the primary one?  Well, look, so far we've gone with Standard English and Common Sense; I see no reason to abandon either.  Who in their right mind is going to argue about the primacy of Tree?  The difficulty arises, of course, if or when the matter is less clear.  But in so many of these cases, there is a primary meaning, and when there isn't, we should of course use diambiguation.
:You guys think differently.  Or *people like me* think differently, as you prefer.  For example, in a universal proposal you do not write [[article name]] or [[common noun]], you write [[foo]], and you're dealing with people who don't know what foo is.  I'm learning, but it's a process.
:What you're suggesting is that using universal disambiguation will make arguments go away.  As far as I can tell, all that will happen here is that the place and level at which arguments take place will shift.  The need to redirect will not change, what will change is ''where'' the redirects are ''to''.  In fact, it will likely make things even worse, because people will have to redirect common nouns all over the place.  All the clusters.  All the subpages.  All the definitions.  All because we're going to attempt to avoid defining a baker as someone who cooks things in the oven and a ball as a round spherical thing.  Seems to me it'll be a lot easier in the long run to have [[ball]] and redirect the odd [[ball]] to "[[ball (dance)]]" when you run into a needed redirect than it will be to run around the wiki changing every blasted instance of [[ball]] to [[ball (round spherical thing)]]
:I also note that the proposal already attempts to provide for exceptions.  My point would be that there will be more exceptions that you have anticipated.  Many nouns involved here.  Many, many nouns.  26 letters of the alphabet.  A gazillion plants and animals.  The colours of the rainbow.  Chris harked back to biology naming arguments; Noel doesn't want to go there, and I do agree, but I think that that argument helps illustrate how complicated matters can get.
:'''People's brains do not instinctively disambiguate''', and that's where the trouble is. People generally know what a primary use is.  When I recently wrote the last great Moose--that's [[Moose (dog actor)|Moose]], I placed the article at [[Moose (dog actor)]], not [[Moose]], which is Rocky-n-Bullwinkle (and currently does not exist).  It does not matter one wit that I am not a zoologist, a dog called Moose, no matter how famous, is not a Moose, and I know it.  Now unless you can show me that it makes sense to have, instead of Moose, [[Moose (disambiguation)]]: Moose (elk-like creature), Moose (dog actor), I cannot support the argument.
:Having said this much, I think the *idea* is rooted in a good, solid premise, but some ammending is needed. [[User:Aleta Curry|Aleta Curry]] 19:23, 15 May 2008 (CDT)
::*Wouldn't [[Moose (Alces alces)]] be the elk-like creature?  But, what if I'm writing in British English? Then I claim [[Elk (Alces alces)]] as opposed to [[Elk (Cervus canadensis)]].  That's when the scientisit in me says "Why not just go with the unambiguous ''[[Alces alces]]'' and ''[[Cervus canadensis]]''?"  As opposed to the encyclopedia britanica's computer scientists that name their articles [http://www.britannica.com/eb/article-9053639/moose article-9053639] for moose and [http://www.britannica.com/eb/article-9032422/elk article-9032422] for elk. It's enough to make one dizzy and I think might be a red herring for this proposal.
:::::Thinking about the red herring a bit more. I can see that readers might be perplexed to find themselves on an esoterically named page, Aleta cites [[Tree (plant)]] as an example. But would this really be a problem?  In the text this would be pipelinked so it would still be [[Tree (plant)|tree]] (<nowiki>[[Tree (plant)|tree]]</nowiki>). Do people even read the titles when they click onto a new page from a hyper link? For me, i tend to jump right to the content.
:::::If it is just the title of the page that is a problem is there not a way to mask the page name with an alternative more user friendly version, possibly one coded in the metadata. This gets back to another proposal from Noel where he suggested there is no reason why the title at the top has to be exactly the same as the page name.  Similarly the articles that have numbers in EB do have sensible titles for show (figure head titles?). [[User:Chris Day|Chris Day]] 10:58, 16 May 2008 (CDT)
::::To me this proposal is not about which name makes more sense or which name has priority, and both are valid discussions.  This is about how can we be sure that a reader always finds the article they are looking for. In reality the names are not that important as long as the information is relevant.  At worst a reader is surprised to find themselves following a  link to a disambiguation page. This is a small price to pay, IMO, and may actually be educational. [[User:Chris Day|Chris Day]] 22:14, 15 May 2008 (CDT)
 
:* The thing is that the old saying "There Ain't No Such Thing As A Free Lunch" applies here. Yes, typing "<nowiki>[[Tree (plant)]]"</nowiki> is a pain, but so is sorting through all the links to find the possibly erroneous ones. Your assertion that it will be "a lot easier in the long run to have <nowiki>[[ball]]</nowiki> and redirect the odd <nowiki>[[ball]]</nowiki> to <nowiki>[[ball (dance)]]</nowiki>" is not correct.
:: If you think it's ''not'' going to be a '''huge''' problem, trundle on over to Wikipedia and click on 'What-Links-Here' for "Tree", {{WP|Special:Whatlinkshere/Tree|here}}, and go through that list and find (and fix) all the links to the wrong kind of tree. Hint: {{WP|Data Web|here}} is an incorrect one I found in a few seconds of looking - but how many others are in that list? How long will it take you to look through all the articles on that lengthy list and find all the mistakes? And how many new ones will have been added when you go back in 3 months, and have to look through the same long list?
:: Trust me, if we ''don't'' do this, we're going to have just what Wikipedia ''already'' has, which is a lot of articles that have links that link to the '''wrong thing''' (because what you're arguing for is basically the Wikipedia system, and we have ''evidence'' of what happens with that). I don't know about you, but that's not the quality level I'm hoping for from CZ.
:: And we won't have to "run around the wiki" changing anything; the redirects will keep things properly linked until someone changes things to link directly to the disambiguated article. The number of articles is still small enough that there isn't a giant backlog that will take forever to clear.
:: Also, this proposal is not in any way intended to get rid of arguments (I don't know where you got that from, because there's nothing suggesting that in the proposal, I thought - perhaps you could point it out?) But it won't make things any worse than it is now. If you have a name that has to be disambiguated, even without this proposal, you're still going to have the argument of 'which variant, if any, gets the 'base' page'. Been there, done that too, on Wikipedia (which, again, is basically the system you're arguing for).
:: Still, if you think it can be amended to be better, I'd more than willing to hear suggestions for improvements. Did you have any specific modifications to suggest? [[User:J. Noel Chiappa|J. Noel Chiappa]] 20:36, 15 May 2008 (CDT)
:* Actually, now that I think about it, there might actually be ''less'' arguments with this system. As long as one meaning can 'claim' the title {DisambiguratedPage}, that increases the reward (i.e. incentive) for arguing over which one gets it. But when ''no'' article is going to get that article title, and the most one can hope for is to be the default link (which is only ever of temporary use anyway, since all links will be modified fairly quickly to not use it), there's a lot less incentive to drag it out.
:: On another topic, I expect eventually we might even have a human-assisted bot to fix those links to the 'base' name; it would e.g. pull up the text around the link, and look at the dab page to find the options, and then it could offer to link to the main meaning if one hits the space bar, or to pick one of the other ones, type a number, or something like that. [[User:J. Noel Chiappa|J. Noel Chiappa]] 21:16, 15 May 2008 (CDT)
 
* I think Aleta makes a good point in saying that [[tree (plant)]] looks weird.  One looks at that and says, "Err...does that mean a ''tree,'' or doesn't it?"  That conceded, I think the simplicity and general usefulness of the proposed policy outweigh such considerations.  (Of course, I know, I'm just being logical.)
: The way to address Aleta's concern is by giving it a ''different'' disambiguating word or phrase that makes sense.  (I have sheer faith that there is such a word or phrase, so I am not entirely logical.)
: One critical comment about the current wording of the proposal: "The disambiguation page (i.e. the page that lists all the potential meanings, and provides links to the articles, for those for which we have articles..."  This seems incorrect.  Disambiguations do not and should not list all ''potential meanings,'' but only all ''potential articles that people might look for under this title.''  Disambiguation pages should not look like definitions. --[[User:Larry Sanger|Larry Sanger]] 10:41, 16 May 2008 (CDT)
 
:* I think Aleta was also concerned at the amount of extra typing in having to type <nowiki>[[tree (plant)|tree]]</nowiki>, and also the time/energy/delay in having to go look, when you just wanted to link to "tree", to find out that the thing you actually need to link to is actually "tree (plant)".
:: To answer both of those, Aleta, if we redirect from "tree" to "tree (plant)", writing nowiki>[[tree]]</nowiki> in an article will get you ''exactly'' the same thing as if you wrote that in the Wikipedia system - and someone else can come along later and fix the link to go directly. So I don't think that's an issue - feel free to link to "tree" (and "dog" :-) in everything you write! (I will update the proposal text to point this out.)
:: Now, about the "all the potential meanings ... those for which we have articles" stuff. The ''content'' of dab pages is still on my (lengthy :-) ToDo list... Whether dab pages should ''only'' contain entries for which we have articles, or other things too, is an interesting question, but one which I would prefer to defer (oooh, I love that euphony :-), to keep this focused on the mechanics issues, so I will reword that section to omit any mention. (That text came from a page on Wikipedia, which does list non-article meanings in dab pages, is all - nothing sinister! :-) [[User:J. Noel Chiappa|J. Noel Chiappa]] 12:55, 16 May 2008 (CDT)
 
 
{{Proposals navigation}}

Latest revision as of 18:49, 29 September 2020

[edit intro]

Archive 1

The disambiguation policy is explained at CZ:Disambiguation.