小鸟吃什么| 心度高血压是什么意思| 墓志铭是什么意思| 缓缓是什么意思| 遗传物质的载体是什么| 申遗是什么意思| 七月三号是什么日子| 补血吃什么| 什么动物不喝水| 维生素c什么时候吃| npv是什么意思| 干事是什么意思| 牛字旁与什么有关| 什么病不能吃牛肉| 什么是周围神经病| 好运是什么意思| 嗜血综合症是什么病| 测怀孕的试纸叫什么| 心率快是什么原因| 吃什么药能让月经马上来| 酒后手麻什么原因| 唐僧的袈裟叫什么| 胶水用什么能洗掉| 女性下面水少是什么原因| 六味地黄丸有什么副作用吗| 三进宫是什么意思| 痴汉是什么意思| 蛋皮痒痒是什么病| 颖五行属什么| 脚臭用什么洗效果最好| 紫癜是什么症状| 新生儿干呕是什么原因| 蛇为什么怕雄黄| 食物中毒呕吐吃什么药| 出库是什么意思| 阳是什么意思| 怀孕十天左右有什么反应| 牛肉不能跟什么一起吃| 朋友生日送什么礼物| 滑丝是什么意思| 富翁是什么意思| 什么匆匆| 丝鸟读什么| 方巾是干什么用的| 12月10号什么星座| 阑尾粪石是什么意思| 避孕药什么时候吃有效| 什么人不能吃黄芪| 空调管滴水是什么原因| 仕途是什么意思| cfp是什么证书| 毛囊长什么样子| 为什么不爱我| 尿隐血弱阳性是什么意思| 四月初十是什么星座| 抖m什么意思| 咳嗽有黄痰吃什么消炎药| 12月4日是什么日子| 栀子花黄叶是什么原因| 工科和理科有什么区别| 甲母痣挂什么科| 高大上是什么意思| 肚子疼去医院挂什么科| 石蛋是什么| bred是什么意思| 三岁看小七岁看老是什么意思| 飞机联程票是什么意思| 中心性肥胖什么意思| 肠胃不好吃什么水果比较好| 芬太尼是什么药| 出水痘能吃什么食物| goldlion是什么牌子| 什么叫膳食纤维| 角是什么生肖| 剑桥英语和新概念英语有什么区别| 月经褐色是什么原因| 肠胃炎吃什么药| 右乳钙化灶是什么意思| 珍惜当下是什么意思| 省委书记什么级别| 吃东西容易呛到是什么原因| 减胎对另一个胎儿有什么影响| 猩红热是什么病| 脚背肿是什么原因| 什么清什么楚| 糖尿病人吃什么水果好| 1217是什么星座| 属虎的五行属什么| 情人节送什么花| 龟头瘙痒用什么药膏| 子宫附件是什么| 春捂秋冻指的是什么意思| 好记性不如烂笔头是什么意思| 吓得什么填空| bpo是什么意思啊| 胸闷气短看什么科| 外阴裂口用什么药| 盲盒是什么意思| 百香果是什么季节的| 尿检潜血是什么意思| 胃黏膜受损是什么症状| 气血不足什么原因引起的| 量贩式ktv什么意思| 财神在什么方位| 中单是什么| 为什么左手会发麻| 贤者模式是什么意思| 天秤座的幸运色是什么| 神助攻什么意思| 润字五行属什么| 血常规检查什么项目| 人为什么会衰老| 舌吻有什么好处| 心有不甘是什么意思| 什么是社恐| 蚊子喜欢咬什么人| 茯苓和茯神有什么区别| 过期的洗面奶可以用来做什么| 芊芊学子是什么意思| 宽宏大度是什么生肖| 泌尿科挂什么科| 副处级干部是什么级别| 鸡全蛋粉是什么东西| 冯巩什么军衔| 腹泻吃什么药| 太平公主叫什么名字| 多金是什么意思| 怀孕需要注意什么| 散光看东西是什么样的| 试孕纸什么时候测最准| 静脉曲张吃什么中成药| fast什么意思| 现在适合做什么生意| 抬举征阳性是什么意思| 狐仙一般找什么人上身| 弥漫性肝病是什么意思| 长孙皇后叫什么名字| 甲状腺癌有什么症状| 糖尿病人能吃什么| ucs是什么意思| 高密度脂蛋白低是什么原因| 遮挡车牌属于什么行为| kpa什么意思| 为什么晚上不能剪指甲| 儿童铅超标有什么症状| 吃什么补黄体酮最快| body是什么意思| 鸽子拉绿稀便是什么病| 她将是你的新娘是什么歌| spa是什么服务| 慢性荨麻疹是什么原因引起的| 葛根是什么东西| 开业送什么礼物好| 哂是什么意思| 36周检查什么项目| 什么店可以买到老鼠药| 草字头加弓念什么| 耳石症看什么科| 维生素c对身体有什么好处| 声优是什么意思| 扁桃体肿大吃什么药| 77代表什么意思| 专科考研需要什么条件| 女人消瘦应该检查什么| 夜晚咳嗽是什么原因| 高血脂不能吃什么| 一月二十五号是什么星座| 做梦梦见火是什么征兆| 什么的草坪| 口球是什么| 上夜班吃什么对身体好| 小孩晚上睡觉出汗是什么原因| 为什么手会掉皮| 子宫增厚是什么原因| 屁多不臭是什么原因| 疫情是什么| 鼎字五行属什么| 手足口病是什么病毒| 二院是什么医院| 卦不走空是什么意思| 童五行属什么| pin什么意思| 低压高是什么引起的| 威化是什么意思| 盆腔炎是什么症状| joeone是什么牌子| 第一次表白送什么花| 圣贤是什么意思| 天下乌鸦一般黑是什么意思| 杏仁有什么功效和作用| 空唠唠的意思是什么| 指甲上有白点是什么原因| 四月十一日是什么星座| 长疖子是什么原因| 血压低是什么原因| 为什么叫八路军| 一什么宝石| 做梦梦到踩到屎是什么意思| 残联是什么性质的单位| h皮带是什么牌子| 什么是木薯粉| 唇炎抹什么药膏最有效| a1代表什么意思| 表情包什么意思| 尿蛋白两个加号是什么意思| 白细胞高是什么意思| 安徽属于什么地区| 党参长什么样子| 脚心发热是什么原因| 戾气重是什么意思| 白蜡金是什么金| 食字五行属什么| 男同叫什么| 热感冒吃什么药好| 直击是什么意思| 什么是对冲| 1226是什么星座| 共济会是什么组织| 失眠吃什么食物最有效| 肛瘘是什么原因引起的| 小孩经常肚子疼是什么原因| 记忆力减退吃什么药效果好| 乖戾是什么意思| 入幕之宾是什么意思| 咕噜是什么意思| 宝宝拉肚子吃什么药好| 冷暖自知是什么意思| 屁股上有痣代表什么| 茔是什么意思| 白癜风是什么样子的| 5年生存率是什么意思| 八月十八号是什么星座| 阅读有什么好处| 澄粉是什么粉| 女以念什么| 小孩经常口腔溃疡是什么原因| 裘皮是什么皮| 中老年人喝什么奶粉好| 眼干眼涩眼疲劳用什么眼药水| 一什么凤冠| 长期吸烟容易引起什么疾病| 前列腺钙化吃什么药| 粉尘螨是什么东西| 梦见两只狗是什么征兆| 喝盐水有什么作用和功效| 白细胞低吃什么补| 家有一老如有一宝是什么意思| 蓝灰色配什么颜色好看| 叶酸什么时间吃最好| 脑梗应该挂什么科| 绿得什么| 腰椎间盘突出吃什么药好| 洽谈是什么意思| 肝不好的人有什么症状| 工字可以加什么偏旁| 咳嗽买什么药| 炖鸭汤放什么食材最好| 容易淤青的体质叫什么| 什么是直径| vip是什么意思| 色盲色弱是什么意思| 红薯用什么繁殖| 小便带血什么原因| 桃花是什么季节开的| 百度Jump to content

宋氏三姐妹年轻照片曝光 宋氏三姐妹谁最漂亮?

From Wikipedia, the free encyclopedia
百度 杨伟表示,通过歼-20、运-20、歼-15、歼-16等一大批大国重器的研制,我国已建立了数字化飞机研发体系。

Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar. As with Soundex, similar-sounding words should share the same keys. Metaphone is available as a built-in operator in a number of systems.

Philips later produced a new version of the algorithm, which he named Double Metaphone. Contrary to the original algorithm whose application is limited to English only, this version takes into account spelling peculiarities of a number of other languages. In 2009 Philips released a third version, called Metaphone 3, which achieves an accuracy of approximately 99% for English words, non-English words familiar to Americans, and first names and family names commonly found in the United States, having been developed according to modern engineering standards against a test harness of prepared correct encodings.

Procedure

[edit]

Original Metaphone codes use the 16 consonant symbols 0BFHJKLMNPRSTWXY.[2] The '0' represents "th" (as an ASCII approximation of Θ), 'X' represents "sh" or "ch", and the others represent their usual English pronunciations. The vowels AEIOU are also used, but only at the beginning of the code.[3] This table summarizes most of the rules in the original implementation:

  1. Drop duplicate adjacent letters, except for C.
  2. If the word begins with 'KN', 'GN', 'PN', 'AE', 'WR', drop the first letter.
  3. Drop 'B' if after 'M' at the end of the word.
  4. 'C' transforms to 'X' if followed by 'IA' or 'H' (unless in latter case, it is part of '-SCH-', in which case it transforms to 'K'). 'C' transforms to 'S' if followed by 'I', 'E', or 'Y'. Otherwise, 'C' transforms to 'K'.
  5. 'D' transforms to 'J' if followed by 'GE', 'GY', or 'GI'. Otherwise, 'D' transforms to 'T'.
  6. Drop 'G' if followed by 'H' and 'H' is not at the end or before a vowel. Drop 'G' if followed by 'N' or 'NED' and is at the end.
  7. 'G' transforms to 'J' if before 'I', 'E', or 'Y', and it is not in 'GG'. Otherwise, 'G' transforms to 'K'.
  8. Drop 'H' if after vowel and not before a vowel.
  9. 'CK' transforms to 'K'.
  10. 'PH' transforms to 'F'.
  11. 'Q' transforms to 'K'.
  12. 'S' transforms to 'X' if followed by 'H', 'IO', or 'IA'.
  13. 'T' transforms to 'X' if followed by 'IA' or 'IO'. 'TH' transforms to '0'. Drop 'T' if followed by 'CH'.
  14. 'V' transforms to 'F'.
  15. 'WH' transforms to 'W' if at the beginning. Drop 'W' if not followed by a vowel.
  16. 'X' transforms to 'S' if at the beginning. Otherwise, 'X' transforms to 'KS'.
  17. Drop 'Y' if not followed by a vowel.
  18. 'Z' transforms to 'S'.
  19. Drop all vowels unless it is the beginning.

This table does not constitute a complete description of the original Metaphone algorithm, and the algorithm cannot be coded correctly from it. Original Metaphone contained many errors and was superseded by Double Metaphone, and in turn Double Metaphone and original Metaphone were superseded by Metaphone 3, which corrects thousands of miscodings that will be produced by the first two versions.

To implement Metaphone without purchasing a (source code) copy of Metaphone 3, the reference implementation of Double Metaphone can be used.[4] Alternatively, version 2.1.3 of Metaphone 3, an earlier 2009 version without a number of encoding corrections made in the current version, version 2.5.4, has been made available under the terms of the BSD License via the OpenRefine project.[5]

Double Metaphone

[edit]

The Double Metaphone phonetic encoding algorithm is the second generation of this algorithm. Its implementation was described in the June 2000 issue of C/C++ Users Journal.[6] It makes a number of fundamental design improvements over the original Metaphone algorithm.

It is called "Double" because it can return both a primary and a secondary code for a string; this accounts for some ambiguous cases as well as for multiple variants of surnames with common ancestry. For example, encoding the name "Smith" yields a primary code of SM0 and a secondary code of XMT, while the name "Schmidt" yields a primary code of XMT and a secondary code of SMT—both have XMT in common.

Double Metaphone tries to account for myriad irregularities in English of Slavic, Germanic, Celtic, Greek, French, Italian, Spanish, Chinese, and other origins. Thus it uses a much more complex ruleset for coding than its predecessor; for example, it tests for approximately 100 different contexts of the use of the letter C alone.

Metaphone 3

[edit]

A professional version was released in October 2009, developed by the same author, Lawrence Philips. It is a commercial product sold as source code. Metaphone 3 further improves phonetic encoding of words in the English language, non-English words familiar to Americans, and first names and family names commonly found in the United States. It improves encoding for proper names in particular to a considerable extent.[7] The author claims that in general it improves accuracy for all words from the approximately 89% of Double Metaphone to 98%. Developers can also now set switches in code to cause the algorithm to encode Metaphone keys 1) taking non-initial vowels into account, as well as 2) encoding voiced and unvoiced consonants differently. This allows the result set to be more closely focused if the developer finds that the search results include too many words that don't resemble the search term closely enough.[8] Metaphone 3 is sold as C++, Java, C#, PHP, Perl, and PL/SQL source, Ruby and Python wrappers accessing a Java jar, and also Metaphone 3 for Spanish and German pronunciation available as Java and C# source.[9] The latest revision of the Metaphone 3 algorithm is v2.5.4, released March 2015. The Metaphone3 Java source code for an earlier version, 2.1.3, lacking a large number of encoding corrections made in the current version, version 2.5.4, was included as part of the OpenRefine project and is publicly viewable.[10]

Common misconceptions

[edit]

There are some misconceptions about the Metaphone algorithms that should be addressed. The following statements are true:

  1. All of them are designed to address regular, "dictionary" words, not just names, and
  2. Metaphone algorithms do not produce phonetic representations of the input words and names; rather, the output is an intentionally approximate phonetic representation, according to this standard:
  • words that start with a vowel sound will have an 'A', representing any vowel, as the first character of the encoding (in Double Metaphone and Metaphone 3 - original Metaphone just preserves the actual vowel),
  • vowels after an initial vowel sound will be disregarded and not encoded, and
  • voiced/unvoiced consonant pairs will be mapped to the same encoding. (Examples of voiced/unvoiced consonant pairs are D/T, B/P, Z/S, G/K, etc.).

This approximate encoding is necessary to account for the way English speakers vary their pronunciations and misspell or otherwise vary words and names they are trying to spell. Vowels, of course, are notoriously highly variable. British speakers often complain that Americans seem to pronounce 'T's the same as 'D'. Consider, also, that all English speakers often pronounce 'Z' where 'S' is spelled, almost always when a noun ending in a voiced consonant or a liquid is pluralized, for example "seasons", "beams", "examples", etc. Not encoding vowels after an initial vowel sound will help to group words where a vowel and a consonant may be transposed in the misspelling or alternative pronunciation.

Metaphone of other languages

[edit]

Metaphone is useful for English variants and other languages, having been preferred to Soundex in several Indo-European languages. On the other hand, rough phonetic encoding causes language dependency — or, in a language variant, average language-speaker dependency — mainly for non-English variants.

Perhaps the first example of stable adaptation of non-English metaphone was Brazilian Portuguese: it originated in ~2008 as a database solution in Várzea Paulista municipality of Brazil, and it evolved to the current metaphone-ptbr algorithm.

See also

[edit]

References

[edit]
  1. ^ Hanging on the Metaphone, Lawrence Philips. Computer Language, Vol. 7, No. 12 (December), 1990.
  2. ^ "Alternative to Soundex". www.sound-ex.com. Archived from the original on 6 March 2014. Retrieved 16 May 2018.
  3. ^ "Morfoedro - Technology". www.morfoedro.it. Retrieved 16 May 2018.
  4. ^ Philips, Lawrence (1999) [1998]. "Double Metaphone" (CPP). GNU Aspell. Retrieved February 23, 2024.
  5. ^ "OpenRefine". GitHub. 19 May 2022.
  6. ^ Philips, Lawrence (June 2000). "The double metaphone search algorithm". C/C++ Users Journal. 18 (6): 38–43.
  7. ^ Guy, Ido; Ur, Sigalit; Ronen, Inbal; Weber, Sara; Oral, Tolga (2012). "Best Faces Forward: A Large-scale Study of People Search in the Enterprise" (PDF). Archived from the original (PDF) on December 1, 2023. Retrieved February 23, 2024.
  8. ^ Atkinson, Kevin. "Lawrence Philips' Metaphone Algorithm". aspell.net. Retrieved 16 May 2018.
  9. ^ "Anthropomorphic Software". www.amorphics.com. Retrieved 16 May 2018.
  10. ^ "OpenRefine source for Metaphone3". github.com. Retrieved 2 Nov 2020.
[edit]

Metaphone algorithms for other languages

[edit]
么么哒什么意思 宫腔镜检查后需要注意什么 哈西奈德溶液治什么病 简单是什么意思 祭日是什么意思
什么样的人做什么样的事 云想衣裳花想容是什么意思 legrand是什么牌子 原始鳞状上皮成熟是什么意思 咖啡与什么食物相克
头疼做什么检查 衣字旁的字和什么有关 什么是呆账 空调病是什么症状 迪奥什么意思
人造奶油是什么做的 痤疮是什么东西 唐僧肉是什么意思 记忆力减退吃什么药 子宫收缩是什么感觉
黄痰咳嗽吃什么药fenrenren.com 天天流鼻血是什么原因hcv9jop5ns9r.cn gem是什么意思hcv7jop5ns3r.cn 为什么会全身酸痛hcv8jop9ns6r.cn 包皮炎挂什么科hcv9jop6ns6r.cn
大圣归来2什么时候上映shenchushe.com 嘴唇起白皮是什么原因hcv8jop9ns5r.cn NF什么意思hcv8jop2ns3r.cn 九眼天珠是什么做的hcv9jop6ns8r.cn 牌匾是什么意思hcv9jop0ns9r.cn
松果体囊肿是什么病hcv8jop0ns5r.cn 肋骨突出是什么原因yanzhenzixun.com 肾有结晶是什么意思fenrenren.com 戾气太重是什么意思hcv8jop2ns8r.cn 脑梗吃什么东西好hcv7jop9ns2r.cn
pedro是什么牌子hcv8jop6ns7r.cn 资金流入股价下跌为什么hcv9jop6ns4r.cn 乌龟为什么会叫hcv8jop9ns1r.cn 儿童感冒咳嗽吃什么药hcv8jop9ns2r.cn 蛇缠腰用什么药hcv7jop5ns5r.cn
百度