用户:TongcyDai/沙盒/0D
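This is the main page of the Wiktionary Thesaurus, a subproject of Wiktionary and a wiki namespace whose goal is to create a thesaurus: a dictionary of synonyms, antonyms and other semantically related terms such as hyponyms, hypernyms, meronyms and holonyms. The project was formerly known as Wikisaurus.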
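New to the thesaurus?
- View a random page in the thesaurus
- Browse all thesaurus entries
- Search the thesaurus using the input box below:
You are welcome to contribute new thesaurus entries or to expand existing ones.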
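Purpose
- Purposes:
  - Help people find words that they
    - cannot recall or
    - do not know
  - Help people explore the network of words
- Semantic relations:
  - Synonymy – same or similar meaning
  - Antonymy – opposite meaning
  - Hyponymy – narrower meaning, subclass
  - Hypernymy – broader meaning, superclass
  - Instance relation – an example of a class, a member of a set
  - Meronymy – part, such as a wheel in relation to a car
  - Holonymy – whole, such as a car in relation to a wheel
- Users:
  - Writers
  - Managers
  - Wikimedia project contributors
  - Bloggers
  - Love-letter writers
  - Journal contributors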
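The purpose of the Wiktionary Thesaurus is to serve as an electronic thesaurus: a dictionary of synonyms, near-synonyms, antonyms and near-antonyms, as well as other semantically related terms such as hyponyms, hypernyms, meronyms and holonyms.
In general, the purpose of such a thesaurus is mainly to help anyone who writes for a living or for pleasure (writers, managers, Wikimedia project contributors, bloggers and love-letter writers) find words that they cannot recall or do not even know, starting from words that are semantically related to the one they are looking for. More broadly, anyone who cares about word choice can benefit from a thesaurus, especially one that links to a dictionary providing definitions.
The added value of the Wiktionary Thesaurus lies in its integration with Wiktionary: the thesaurus links to Wiktionary, and Wiktionary links back to the thesaurus.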
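Browsing
To start browsing the thesaurus, you can begin at the root, Thesaurus:entity, and then move through the network of hyponyms to high-level classes such as Thesaurus:person, Thesaurus:organism, Thesaurus:animal, Thesaurus:plant and Thesaurus:artifact. You can also browse by topical category, for example Category:Thesaurus:地理 (geography; see Thesaurus:forest), Category:Thesaurus:人格 (personality; see Thesaurus:humble) or Category:Thesaurus:外貌 (appearance; see Thesaurus:beautiful).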
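As a rough illustration of browsing by hyponym links, the following Python sketch walks a tiny hand-made hyponym map breadth-first, starting from Thesaurus:entity. The nesting shown in `HYPONYMS` and the function name `walk_hyponyms` are assumptions made for this example; the actual hierarchy is defined by the thesaurus pages themselves.

```python
from collections import deque

# Hypothetical excerpt of the hyponym network; the nesting is assumed
# for illustration and is not copied from the real pages.
HYPONYMS = {
    "Thesaurus:entity": ["Thesaurus:organism", "Thesaurus:artifact"],
    "Thesaurus:organism": ["Thesaurus:animal", "Thesaurus:plant", "Thesaurus:person"],
}

def walk_hyponyms(root, hyponyms=HYPONYMS):
    """Return (depth, title) pairs in breadth-first order, starting at root."""
    seen = {root}
    queue = deque([(0, root)])
    order = []
    while queue:
        depth, title = queue.popleft()
        order.append((depth, title))
        for child in hyponyms.get(title, []):
            if child not in seen:  # guard against cycles and repeated links
                seen.add(child)
                queue.append((depth + 1, child))
    return order

for depth, title in walk_hyponyms("Thesaurus:entity"):
    print("  " * depth + title)
```

The `seen` set matters in practice because a term may be reachable through more than one hypernym.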
Model
The thesaurus is organized primarily on the model of WordNet. That is, the key organizing principles are the relations of hyponymy (subclass) and hypernymy (superclass), and to a lesser extent meronymy (part of a whole) and holonymy (whole of the part). See also Wiktionary:Semantic relations. The design of Roget's 1911 thesaurus is somewhat similar in that it does not restrict the entries to lists of synonyms and antonyms; however, Roget's thesaurus does not use WordNet relations. The design of the Oxford English Dictionary thesaurus is somewhat similar in that it is hierarchically organized. However, its subordination relation is not strict hyponymy but is in part thematic. By contrast, the Merriam-Webster thesaurus has synonyms, antonyms and "related" words. Editors who want to create thesaurus entries that are primarily lists of synonyms can do so without worrying about the other relations, but keep in mind that there should be only a single thesaurus entry for a synonym set, thereby avoiding duplication.
One sense per entry
Each entry should ideally have a single sense. Nonetheless, the format supports multiple senses for the cases where this seems to be the best option. It is usually possible to pick different headwords for different senses. Each entry should ideally stand for a semantic object; the headword should be in part an accident. The point is not to list all senses of the headword. Thus, there can be a single sense in Thesaurus:sound, and another sense of "sound" is covered at Thesaurus:inlet. If it becomes impossible to keep finding dedicated headwords, we may resort to disambiguating naming like "rich (wealthy)" or "rich, wealthy". Sometimes, the headword becomes less ambiguous by using a phrase: there is Thesaurus:English language. WordNet seems to do fine mostly by the comma convention.
To use the entry headword as a basis for covering all the senses of the headword would lead to a duplication of synonym rings covered in other headwords, at odds with the duplication-avoidance rationale for the thesaurus. Thus, there is no point in duplicating Thesaurus:spicy in Thesaurus:hot. By contrast, having a sense for an adjective and a sense for a noun in Thesaurus:German is a different use case and makes a little bit more sense, being caused by a lack of suitable English disambiguating headword unless one opts for "German person" and "German language" as headwords.
Multilingualism
The English Wiktionary Thesaurus shall contain entries for languages other than English.
Category:Thesaurus entries by language lists these entries. Categorization is done by {{ws sense}}.
Historically, there was no agreed-upon naming scheme for non-English thesaurus entries. The following conventions existed:
- Convention A: no language code, native headword. For example: Thesaurus:صار (Arabic), Thesaurus:yaxşı (Azerbaijani), Thesaurus:chat (French), Thesaurus:god (English and Danish). Entries in different languages with identically spelled titles are placed on the same Thesaurus page, exactly as we do for ordinary dictionary entries.
- Convention B: language code, native headword. For example: Thesaurus:fr:embêter (French), Thesaurus:sga:ar (Old Irish), Thesaurus:non:sverð (Old Norse).
- Convention C: language code, English headword. For example: Thesaurus:da:beautiful (Danish), Thesaurus:ar:become happy (Arabic), Thesaurus:sound/fi (Finnish).
By October 2022, convention A (the first above) was followed by 98.8% of non-English thesaurus entries. In November 2023, all remaining thesaurus entries using conventions B and C (the second and third above) were standardised to convention A.
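A minimal sketch of how titles under these schemes could be told apart mechanically. It looks only at the shape of the title: a `code:` prefix or `/code` suffix after the namespace is assumed to mark convention B or C. Telling B from C would also require knowing whether the headword is English, which this sketch does not attempt. The function name and the regular expressions are illustrative assumptions, not part of any Wiktionary tool.

```python
import re

NAMESPACE = "Thesaurus:"
# Assumed shape of a language code: 2-3 lowercase ASCII letters (fr, sga, non).
CODE_PREFIX = re.compile(r"^[a-z]{2,3}:")
CODE_SUFFIX = re.compile(r"/[a-z]{2,3}$")

def uses_language_code(title):
    """True if the title looks like convention B or C, False for convention A."""
    if not title.startswith(NAMESPACE):
        raise ValueError(f"not a thesaurus title: {title!r}")
    headword = title[len(NAMESPACE):]
    return bool(CODE_PREFIX.match(headword) or CODE_SUFFIX.search(headword))

for t in ["Thesaurus:chat", "Thesaurus:fr:embêter", "Thesaurus:sound/fi"]:
    print(t, "->", "B or C" if uses_language_code(t) else "A")
```

This is a heuristic: a native headword that happens to end in a slash followed by two or three lowercase letters would be misclassified.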
Convention A carries certain disadvantages. One is that the automatic display of the [⇒ thesaurus] link next to synonyms and other terms can be triggered in situations where it is not relevant. For example, Thesaurus:yes lists "ar" as a synonym, but the [⇒ thesaurus] link next to "ar" points to a page containing synonyms for an unrelated term in a different language. Another disadvantage is that the scheme is unique to English Wiktionary. Having Thesaurus entries for multiple languages on a single page causes problems for interwiki linking to other-language Wiktionary editions, such as French Wiktionnaire.
Topical categorization is an unsolved problem: there is Category:Thesaurus:Geography, but no language-specific one. We could create "Category:Thesaurus:en:Geography", "Category:Thesaurus:es:Geography", etc., on the model of mainspace topical categories.
Discussions:
Formatting
Formatting is specified and discussed at:
Example entries:
- Thesaurus:error – mostly synonyms
- Thesaurus:aircraft – mostly hyponyms
- Thesaurus:food – a complex entry
- Thesaurus:word – hyponyms are grouped semantically using {{ws ----}} as separator. However, multiple people indicated that they prefer text labels for clarity, so labels are probably the better choice going forward.
- Thesaurus:animal sound – hyponyms grouped semantically with text labels
Inclusion
As for entry headwords, they must be attested. Not all mainspace entries should have their own thesaurus entry: the point of the thesaurus is in part to prevent duplication of lists. Headwords can sometimes be sum-of-parts phrases if deemed preferable, as in Thesaurus:beautiful person.
As for list items, all items in lists of synonyms, antonyms, hyponyms, etc. on Thesaurus pages are required to be attested, using the same attestation criteria as the mainspace. There is no requirement that they must be more than sum of parts. Roget's Thesaurus did include many sum-of-parts phrases.
Semantic relations
See also Wiktionary:Semantic relations.
If you want to create synonym-only entries, you do not need to worry about the other relationships all that much. This is especially true of adjectives. For nouns, it often pays off to figure out a good node in the hyponymic (subclass/superclass) network.
Synonyms and antonyms
Synonyms are terms with the same or very similar meaning. Register (informal, vulgar, etc.) does not impact synonymy. Examples: Thesaurus:wise, Thesaurus:drunk. Some putative synonyms are better classified as hyponyms.
Antonyms are terms with opposite meaning. Antonyms are sometimes concentrated in an opposite thesaurus entry. Example: Thesaurus:drunk.
Hypernyms and hyponyms
Hypernyms are terms with broader meaning, capturing a superclass relationship: X is a hypernym of Y if each Y is also an instance of X. Example: Thesaurus:bird.
Hyponyms are terms with narrower meaning, capturing a subclass relationship: X is a hyponym of Y if each X is an instance of Y. Examples: Thesaurus:drunk, Thesaurus:bird. In many entries, hyponyms can be listed only up to a point, to some nesting level. For instance, it makes no sense to list all hyponyms in Thesaurus:person; by contrast, listing all hyponyms in Thesaurus:relative or Thesaurus:musician seems fine.
Holonyms and meronyms
Holonyms are terms for wholes containing parts: X is a holonym of Y if Y is part of X. Example: Thesaurus:relative.
Meronyms are terms for parts of wholes: X is a meronym of Y if X is part of Y. Example: Thesaurus:aircraft.
Classes and instances
Classes differ from hypernyms: X is a class of Y if Y is an instance of X. Example: Thesaurus:Ecuador.
Instances are the opposite of classes and differ from hyponyms. Example: Thesaurus:country.
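One way to keep hypernymy apart from the class-instance relation is to treat each class term as a set of instances. The Python sketch below does this with small made-up extensions; the data in `EXTENSION` and the helper names are assumptions for illustration, not thesaurus content.

```python
# Hypothetical extensions: each class term maps to the set of its instances.
EXTENSION = {
    "country": {"Ecuador", "Finland"},
    "place": {"Ecuador", "Finland", "Lake Geneva"},
}

def is_instance(item, cls):
    """item is an instance of cls if it belongs to the extension of cls."""
    return item in EXTENSION[cls]

def is_hypernym(x, y):
    """x is a hypernym of y if each instance of y is also an instance of x."""
    return x != y and EXTENSION[y] <= EXTENSION[x]

def is_hyponym(x, y):
    """x is a hyponym of y if each instance of x is an instance of y."""
    return is_hypernym(y, x)

print(is_instance("Ecuador", "country"))   # membership: instance of a class
print(is_hypernym("place", "country"))     # subset test: class to class
print(is_hypernym("country", "place"))
```

In this model an instance relation is a membership test on one set, while hypernymy and hyponymy compare two sets, which is why "Ecuador" is an instance of "country" rather than a hyponym of it.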
Coordinate terms and troponyms
Coordinate terms, also known as cohyponyms, are mostly unused in the thesaurus since they duplicate hyponymic structures from other entries.
Troponyms are unused: use hyponyms and hypernyms for verbs as well.
Various
The section "Various" is intended to capture other interesting relations, to broaden the navigation network beyond specifically defined relations. It supports creativity, but may lead to disagreements between editors since there is no set of specific rules governing the section.
Example entries:
- Thesaurus:number: has all sorts of terms relating to numbers that are not hyponyms or instances.
- Thesaurus:size: has adjectives for size and these do not fit hyponymy or instance-of relationships.
- Thesaurus:aircraft: has people on board, who are strictly speaking not meronyms.
Minimum item count
A putative thesaurus entry with 2–5 items can probably be comfortably handled by the mainspace synonym lists, and may not be worth an entry. However, there is no agreed-upon rigid rule for this. The thesaurus pays off most when the item counts are larger.
There is usually no need to create "leaf node" entries for 1 or 2 synonyms and 1 hypernym. Such items are sufficiently covered in the hypernym entries and in the mainspace. Thus, there is Thesaurus:lake but no Thesaurus:pond.
Templates
Lists of templates:
Templates:
Template | Example | Note
---|---|---
 | | The second entry is a lazy one. The third shows no tooltip, but also does not explicitly indicate that the tooltip is missing.
 | | To be put around a list of {{ws}} entries. Currently formats the list as a 3-column one.
 | | Entered at the very top of the entry. When without parameter, determines the headword automatically.
{{R:Roget 1911|beauty}} | Template:R:Roget 1911 |
{{ws sense|en|glad; in a good mood}} | Sense: glad; in a good mood | Used after the third level heading for the part of speech.
Mainspace
Linking from mainspace to Thesaurus entries:
- Links to thesaurus entries can be added to the "Synonyms" section (or "Hyponyms", "Antonyms", etc. where appropriate) using the template {{seeSynonyms}} (which displays something like: See also Thesaurus:error), or using conventional wikitext syntax (''See also'' [[Thesaurus:error]]).
- The {{synonyms}} template, used to render per-sense synonyms directly beneath definitions, accepts Thesaurus: links, which should be placed after any specific synonyms (e.g. {{synonyms|en|goof|blunder|Thesaurus:error}}).
- Especially for Thesaurus entries featuring mostly synonyms, it is good to add a link to the Thesaurus entry from all the mainspace entries for the synonyms, so that the user knows that there is a Thesaurus entry when visiting the mainspace.
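The ordering rule for {{synonyms}} can be sketched as a small Python helper that builds the template call and moves any Thesaurus: links after the specific synonyms. The helper name `synonyms_template` is hypothetical; only the shape of the template call is taken from the example on this page.

```python
def synonyms_template(lang, terms):
    """Build a {{synonyms}} call with Thesaurus: links placed last."""
    specific = [t for t in terms if not t.startswith("Thesaurus:")]
    links = [t for t in terms if t.startswith("Thesaurus:")]
    # Relative order within each group is preserved.
    return "{{" + "|".join(["synonyms", lang, *specific, *links]) + "}}"

print(synonyms_template("en", ["Thesaurus:error", "goof", "blunder"]))
# prints {{synonyms|en|goof|blunder|Thesaurus:error}}
```

The sketch does no escaping, so it assumes the terms contain no pipe or brace characters.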
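Wikidata
Wikidata, with its subclass-of and instance-of relations, does some of the work of a thesaurus and is more complete. However, it is not suited to extensive lists of synonyms, and it does not make the hyponymic network easy to browse: it only supports easy navigation from an item to its superclasses. Some of its subclass structures appear overly complicated and overengineered.
Roget-MICRA Thesaurus
Roget's 1911 thesaurus (with MICRA supplements) is available here:
The appendix has a search box and conveniently provides links to the mainspace.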
A search box for convenience:
Moby Thesaurus II
Moby Thesaurus II is available here:
The appendix has a search box and conveniently provides links to the mainspace.
A search box for convenience:
Identity
The current titles of the project are "Thesaurus" and "Wiktionary Thesaurus". Before mid-2017, it was "Wikisaurus". Alternatives considered include "Wikithesaurus". The spelling "WikiSaurus", with a capital 'S', must also have existed at some point.
Online thesauri
Public domain
- 1911 version of Roget's Thesaurus hosted by Project Gutenberg
- Moby Thesaurus II by Grady Ward - public domain
- Dictionary at datasegment.com - includes Moby thesaurus in its search results
Free as in "freedom"
- http://wordnet.princeton.edu/ - licensed under [1]; see also Wiktionary:Princeton WordNet
- https://en-word.net/ - licensed under the Creative Commons Attribution (CC-BY) 4.0 License; see also GitHub [2]
Proprietary
- http://thesaurus.reference.com
- http://www.merriam-webster.com/thesaurus
- http://www.bartleby.com/62/
- http://www.visualthesaurus.com/
- http://encarta.msn.com/thesaurus__/thesaurus.html
- http://www.fao.org/agrovoc/
- http://www.smartdefine.org
- http://www.powerthesaurus.org
Other
- None listed.
Statistics
Statistics about the thesaurus entries, as of Oct 2022:
- Entries: 4,833
- English entries: 2,487
- Chinese entries: 1,900
- Other-language entries: 446
- Entries containing colon (:) in title: 29
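As a quick consistency check on the figures above, the per-language counts add up to the stated total. The Python sketch below recomputes the sum and each group's share of all entries; the variable names are illustrative.

```python
# Counts copied from the October 2022 statistics listed above.
counts = {"English": 2487, "Chinese": 1900, "Other": 446}
total = 4833

# The three language groups should account for every entry.
assert sum(counts.values()) == total

# Share of each group, as a percentage rounded to one decimal place.
shares = {lang: round(100 * n / total, 1) for lang, n in counts.items()}
for lang, pct in shares.items():
    print(f"{lang}: {pct}%")
```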
Page views
Anatomy entries get a fair amount of page views, as is expected. But they are not alone; other entries with non-trivial page views include Thesaurus:pros and cons and Thesaurus:child.[3]
Recent changes
Shortcuts
- WT:WSI - a Thesaurus index.
- WT:WS - to this page.
- See also Wiktionary:Shortcut
See also
- Wiktionary:Semantic relations
- Synonym (P5973) property talk of Wikidata:Lexicographical data
Subpages
Highlighted subpages:
Project subpages:
- /Format - how to format a Thesaurus entry
- /Purpose - on the purpose of the Thesaurus
- Wiktionary:Thesaurus considerations - Original discussion about the project.
- /Improvements 1 - Its talkpage has a discussion from July 2008.
- /Improvements 2 - Discussion about the direction and overall project.
- /Requested entries - A lot of words with candidate lists of synonyms that can be used as a starting point for creation of entries. The size of the page: 700 words.
To do
Things to do:
- /Requested entries - add requested entries
- Requests for cleanup - clean up entries with formatting and other problems
- Appendix:Roget's thesaurus classification - add entries using Roget's thesaurus as a checklist and model
All entries
Lists of all Thesaurus entries:
Discussion
Discussions about the Thesaurus are scattered across various pages. In the future, they should preferably take place in the Beer parlour, a general policy discussion room.
Pages with discussions:
- Wiktionary:Thesaurus considerations -- starting in 2002 and 2003, getting more traffic in 2004, with most discussion ended by the end of 2006
- Wiktionary:Wikisaurus/Improvements 1 -- created in February 2005, and stopped immediately; a surge of activity appeared in July 2008
- Wiktionary:Wikisaurus/Improvements 2 -- created in April 2006, active in May 2006 and then stopped; a surge of activity appeared in May 2008
For more discussions, see #Beer parlour.
Index
An index to this page:
- All entries - see #All entries, Wiktionary:All Thesaurus pages, and Category:Thesaurus
- Example entries - see #Formatting
- Layout - see #Formatting
- Logo - see #Identity
- Monitoring - see #Recent changes
- Recent changes - see #Recent changes
- Requested entries - see /Requested entries and #Subpages
- Spelling - see #Identity
- Title - see #Identity