什么时候是排卵期| 椎间盘轻度膨出是什么意思| 偏头疼是什么原因引起| 或是什么意思| 什么姿势睡觉最好| 下午六点是什么时辰| 刷题是什么意思| 印第安纹是什么| 缠腰蛇是什么原因引起的| 宫颈肥大有什么危害| 西红柿不能跟什么一起吃| 轻微脑梗吃什么药| 寒是什么生肖| 灰色配什么颜色好看| 北京属于什么气候| 谷丙转氨酶偏高吃什么药| 吃瓜群众是什么意思| 盘古是一个什么样的人| 什么品牌蓝牙耳机好| 冰心原名叫什么| 什么叫全日制本科| 吃什么有奶水| 相思成疾是什么意思| 诸事不宜是什么意思| 螃蟹和什么食物相克| 咽喉炎吃什么消炎药| 检查是否怀孕要做什么检查| 癫痫病是什么症状| 分泌物呈褐色是什么原因| 北京都有什么大学| 壶嘴为什么不能对着人| pink是什么颜色| 例假是什么意思| 汴去掉三点水念什么| 敬谢不敏是什么意思| 木舌是什么字| 陕西有什么烟| 憋屈是什么意思| 梦见母亲去世预示什么| 12点半是什么时辰| 抹布是什么意思| 后背疼应该挂什么科| ira是什么品牌| 痰湿吃什么中成药| 尿微量白蛋白是什么意思| qa和qc有什么区别| 老当益壮是什么意思| 什么属相不能戴貔貅| 自相矛盾什么意思| 胃烧心吃什么药效果好| 纸上谈兵是什么生肖| 元气是什么| 覃读什么| 茉字五行属什么| 脸颊两边长痘痘是什么原因引起的| 什么是钓鱼执法| 车仔面为什么叫车仔面| 什么洗衣液是中性的| 梦见头发长长了是什么意思| 弱冠之年是什么意思| 经常按摩头皮有什么好处| 火影忍者什么时候出的| 二氧化钛是什么东西| 补肾吃什么东西效果最好| 蛀牙是什么原因引起的| 潜伏是什么意思| 甲减要多吃什么食物好| 耳屎多是什么原因| 封神是什么意思| 2月15号是什么星座| cta是什么意思| 贤淑是什么意思| 医师是什么意思| 甘油三脂高是什么意思| 男性左下腹疼痛是什么原因| 旧历是什么意思| 血脂高吃什么药效果好| 代言是什么意思| 什么是脂溢性脱发| 液基薄层细胞学检查是什么| 皮肤病是什么原因造成的| 什么是视同缴费| cob是什么意思| 洛阳以前叫什么名字| 天上的彩虹像什么| 啤酒花是什么东西| 小麦淀粉是什么| 亲子鉴定需要什么样本| 属猪的护身佛是什么佛| 但愿人长久的下一句是什么| 鼻子里面痒是什么原因| 3月18是什么星座| 锻炼pc肌有什么好处| 领域是什么意思| 乙肝检查挂什么科| 外阴是指什么部位| 什么是钾肥| 什么条什么理| 广西北海有什么好玩的地方| 什么是阴历| 小孩子长白头发是什么原因| 自强是什么意思| 男生被口是什么感觉| 什么的水流| 吃什么补肝| 5月6日什么星座| 美特斯邦威是什么档次| 十二生肖分别是什么| 亦字五行属什么| 为什么会得卵巢肿瘤| 油菜花什么颜色| 口腔异味挂什么科| 雾化治疗的作用是什么| 乳房里面有硬块是什么原因| 什么是宫颈纳囊| 康斯坦丁是什么意思| 涵字五行属什么| 夏枯草有什么功效| 为什么不建议开眼角| 广州地铁什么时候停运| 西米是什么东西做的| 血栓吃什么药化得快| 唏嘘不已的意思是什么| 打饱嗝是什么病的前兆| 刚怀孕吃什么最好最营养| 焦亚硫酸钠是什么| 上火有什么症状| 石榴花是什么季节开的| 什么是虫草| 一什么水井| 拉谷谷女装什么档次的| 女人血虚吃什么补最快| 活珠子是什么| 拿铁和美式有什么区别| 白子画什么时候爱上花千骨的| 省委组织部部长什么级别| 暂住证和居住证有什么区别| 父亲b型血母亲o型血孩子什么血型| 一见倾心什么意思| 晚上左眼皮跳预示什么| 男人吃什么增大增长| 焦亚硫酸钠是什么| 可乐加味精女人喝了什么效果| 什么叫体位性低血压| 仪表堂堂是什么生肖| 颈动脉斑块吃什么药效果最好| 坐骨神经吃什么药| 丘疹性荨麻疹吃什么药| 青蛙吃什么| 眼睛经常充血是什么原因引起的| fbi相当于中国的什么| 看日历是什么生肖| 排卵期出血吃什么药| 眉目比喻什么| 外阴痒用什么药膏| 梦见买床是什么意思| 为什么总是放屁| 什么面不能吃| 去香港澳门旅游需要准备什么| 生小孩需要准备什么| 右眼一直跳是什么预兆| 心肝火旺吃什么中成药| 腊月初八是什么星座| 族谱是什么意思| 窦性心动过缓是什么意思| 杨桃是什么季节的水果| 刘邦为什么怕吕后| 下嘴唇跳动是什么原因| 什么叫人| 小燕子吃什么食物| 梦见鸡啄我是什么意思| 吃什么补精养肾| 晚上看见蛇预示着什么| 屌丝男是什么意思| 张伦硕为什么娶钟丽缇| 吃鱼肝油有什么好处| 91是什么东西| 爱恨就在一瞬间是什么歌| 朝阳是什么意思| 一视同仁什么意思| 男性补肾壮阳吃什么药效果比较好| 白鳍豚用什么呼吸| 文化底蕴是什么意思| 官鬼是什么意思| 眼睛发炎用什么眼药水| 三点水加盆读什么| 但愿人长久的下一句是什么| 单核细胞偏高是什么原因| 健康的舌苔是什么样的| 农历六月是什么生肖| 痔疮吃什么药好得快| 蓝颜知己是什么关系| 战狼三什么时候上映| 什么水果消炎| 干碟是什么| 雪中送炭是什么意思| 梦见石榴是什么意思| 海底椰是什么| 总维生素d偏低会导致什么| 佝偻病是什么病| 走麦城是什么意思| 尿常规白细胞偏高是什么原因| 小个子适合什么发型| 5月24日是什么星座| 文盲是什么意思| 终结者是什么意思| 阴道是什么味道| 什么环境唱什么歌原唱| tpp是什么意思| 火疖子挂什么科| 盐酸舍曲林片治疗什么程度的抑郁| 吃什么东西会长胖| 孕妇吃什么水果好| 以免是什么意思| 嗓子苦是什么原因引起的| 冬的部首是什么| 杀鸡取卵是什么生肖| 蓝玫瑰代表什么| 蓝莓什么时候成熟| 什么是扬州瘦马| 大便拉不干净是什么原因| 棉条是什么| 扬长避短什么意思| 夕火念什么| 童子尿能治什么病| 荔枝什么品种最贵| 捡肥皂是什么意思| 16岁可以做什么工作| 七月份是什么季节| 三色线分别代表什么| 乌鸦飞进家里什么征兆| 港澳通行证办理需要什么材料| 本心是什么意思| 小孩经常口腔溃疡是什么原因| 为什么会牙龈出血| 放行是什么意思| bp是什么| 视力矫正是什么意思| 淋巴结是什么病严重吗| 皮肤瘙痒症用什么药| 四个又念什么| 前列腺回声欠均匀什么意思| 丝瓜烧什么好吃| 1964年什么命| 有所作为的意思是什么| 极端是什么意思| 天体是什么| 遗精什么意思| 舒字五行属什么的| 坐骨神经吃什么药效果最好| 教师节送什么礼物呢| 肾虚腰疼吃什么药最有效| 路痴是什么原因造成的| 垣什么意思| 咖啡是什么| 滞后是什么意思| 小便分叉是什么原因男| 稻谷是什么| 文殊菩萨是管什么的| 车厘子什么季节吃| 浮白是什么意思| 补充公积金是什么意思| 月经来头疼是什么原因引起的| 失眠吃什么药效果好| 百度Jump to content

研究表明相同成长背景下黑人男孩更难实现“美国梦”

From Wikipedia, the free encyclopedia
百度 笑话虽小,但是足以折射退伍军人重新适应社会之难。

A numeric character reference (NCR) is a common markup construct used in SGML and SGML-derived markup languages such as HTML and XML. It consists of a short sequence of characters that, in turn, represents a single character. Since WebSgml, XML and HTML 4, the code points of the Universal Character Set (UCS) of Unicode are used. NCRs are typically used in order to represent characters that are not directly encodable in a particular document (for example, because they are international characters that do not fit in the 8-bit character set being used, or because they have special syntactic meaning in the language). When the document is interpreted by a markup-aware reader, each NCR is treated as if it were the character it represents.

Examples

[edit]

In SGML, HTML, and XML, the following are all valid numeric character references for the Greek capital letter Sigma

Numerical character reference of U+03A3 Σ GREEK CAPITAL LETTER SIGMA
(3A316 = 93110)
Unicode character Numerical base Numerical reference in markup Effect
U+03A3 Decimal Σ Σ
U+03A3 Decimal Σ Σ
U+03A3 Hexadecimal Σ Σ
U+03A3 Hexadecimal Σ Σ
U+03A3 Hexadecimal Σ Σ

In SGML, HTML, and XML, the following are all valid numeric character references for the Latin capital letter AE

Numerical character reference of U+00C6 Æ LATIN CAPITAL LETTER AE
Unicode character Numerical base Numerical reference in markup Effect
U+00C6 Decimal Æ ?
U+00C6 Hexadecimal Æ ?

In SGML, HTML, and XML, the following are all valid numeric character references for the Latin small letter sharp s ?

Numerical character reference of U+00DF ß LATIN SMALL LETTER SHARP S
Unicode character Numerical base Numerical reference in markup Effect
U+00DF Decimal ß ?
U+00DF Hexadecimal ß ?

List of numeric character references for the printable ASCII characters:

Unicode character Character
Reference
(decimal)
Character
Reference
(hexadecimal)
Effect
U+0020     (space)
U+0021 ! ! !
U+0022 " " "
U+0023 # # #
U+0024 $ $ $
U+0025 % % %
U+0026 & & &
U+0027 ' ' '
U+0028 ( ( (
U+0029 ) ) )
U+002A * * *
U+002B + + +
U+002C , , ,
U+002D - - -
U+002E . . .
U+002F / / /
U+0030 0 0 0
U+0031 1 1 1
U+0032 2 2 2
U+0033 3 3 3
U+0034 4 4 4
U+0035 5 5 5
U+0036 6 6 6
U+0037 7 7 7
U+0038 8 8 8
U+0039 9 9 9
U+003A : : :
U+003B &#59; &#x3B; ;
U+003C &#60; &#x3C; <
U+003D &#61; &#x3D; =
U+003E &#62; &#x3E; >
U+003F &#63; &#x3F; ?
U+0040 &#64; &#x40; @
U+0041 &#65; &#x41; A
U+0042 &#66; &#x42; B
U+0043 &#67; &#x43; C
U+0044 &#68; &#x44; D
U+0045 &#69; &#x45; E
U+0046 &#70; &#x46; F
U+0047 &#71; &#x47; G
U+0048 &#72; &#x48; H
U+0049 &#73; &#x49; I
U+004A &#74; &#x4A; J
U+004B &#75; &#x4B; K
U+004C &#76; &#x4C; L
U+004D &#77; &#x4D; M
U+004E &#78; &#x4E; N
U+004F &#79; &#x4F; O
U+0050 &#80; &#x50; P
U+0051 &#81; &#x51; Q
U+0052 &#82; &#x52; R
U+0053 &#83; &#x53; S
U+0054 &#84; &#x54; T
U+0055 &#85; &#x55; U
U+0056 &#86; &#x56; V
U+0057 &#87; &#x57; W
U+0058 &#88; &#x58; X
U+0059 &#89; &#x59; Y
U+005A &#90; &#x5A; Z
U+005B &#91; &#x5B; [
U+005C &#92; &#x5C; \
U+005D &#93; &#x5D; ]
U+005E &#94; &#x5E; ^
U+005F &#95; &#x5F; _
U+0060 &#96; &#x60; '
U+0061 &#97; &#x61; a
U+0062 &#98; &#x62; b
U+0063 &#99; &#x63; c
U+0064 &#100; &#x64; d
U+0065 &#101; &#x65; e
U+0066 &#102; &#x66; f
U+0067 &#103; &#x67; g
U+0068 &#104; &#x68; h
U+0069 &#105; &#x69; i
U+006A &#106; &#x6A; j
U+006B &#107; &#x6B; k
U+006C &#108; &#x6C; l
U+006D &#109; &#x6D; m
U+006E &#110; &#x6E; n
U+006F &#111; &#x6F; o
U+0070 &#112; &#x70; p
U+0071 &#113; &#x71; q
U+0072 &#114; &#x72; r
U+0073 &#115; &#x73; s
U+0074 &#116; &#x74; t
U+0075 &#117; &#x75; u
U+0076 &#118; &#x76; v
U+0077 &#119; &#x77; w
U+0078 &#120; &#x78; x
U+0079 &#121; &#x79; y
U+007A &#122; &#x7A; z
U+007B &#123; &#x7B; {
U+007C &#124; &#x7C; |
U+007D &#125; &#x7D; }
U+007E &#126; &#x7E; ~

Discussion

[edit]

Markup languages are typically defined in terms of UCS or Unicode characters. That is, a document consists, at its most fundamental level of abstraction, of a sequence of characters, which are abstract units that exist independently of any encoding.

Ideally, when the characters of a document utilizing a markup language are encoded for storage or transmission over a network as a sequence of bits, the encoding that is used will be one that supports representing each and every character in the document, if not in the whole of Unicode, directly as a particular bit sequence.

Sometimes, though, for reasons of convenience or due to technical limitations, documents are encoded with an encoding that cannot represent some characters directly. For example, the widely used encodings based on ISO 8859 can only represent, at most, 256 unique characters as one 8-bit byte each.

Documents are rarely, in practice, ever allowed to use more than one encoding internally, so the onus is usually on the markup language to provide a means for document authors to express unencodable characters in terms of encodable ones. This is generally done through some kind of "escaping" mechanism.

The SGML-based markup languages allow document authors to use special sequences of characters from the ASCII range (the first 128 code points of Unicode) to represent, or reference, any Unicode character, regardless of whether the character being represented is directly available in the document's encoding. These special sequences are character references.

Character references that are based on the referenced character's UCS or Unicode code point are called numeric character references. In HTML 4 and in all versions of XHTML and XML, the code point can be expressed either as a decimal (base 10) number or as a hexadecimal (base 16) number. The syntax is as follows:

Character U+0026 (ampersand), followed by character U+0023 (number sign), followed by one of the following choices:

  • one or more decimal digits zero (U+0030) through nine (U+0039); or
  • character U+0078 ("x") followed by one or more hexadecimal digits, which are zero (U+0030) through nine (U+0039), Latin capital letter A (U+0041) through F (U+0046), and Latin small letter a (U+0061) through f (U+0066);

all followed by character U+003B (semicolon). Older versions of HTML disallowed the hexadecimal syntax.

The characters that comprise a numeric character reference can be represented in every character encoding used in computing and telecommunications today, so there is no risk of the reference itself being unencodable.

There is another kind of character reference called a character entity reference, which allows a character to be referred to by a name instead of a number. (Naming a character creates a character entity.) HTML defines some character entities, but not many; all other characters can only be included by direct encoding or using NCRs.

Restrictions

[edit]

The Universal Character Set defined by ISO 10646 is the "document character set" of SGML, HTML 4, so by default, any character in such a document, and any character referenced in such a document, must be in the UCS.

While the syntax of SGML does not prohibit references to invalid or unassigned code points, such as &#xFFFF;, SGML-derived markup languages such as HTML and XML can, and often do, restrict numeric character references to only those code points that are assigned to characters.

Restrictions may also apply for other reasons. For example, in HTML 4, &#12;, which is a reference to a non-printing "form feed" control character, is allowed because a form feed character is allowed. But in XML, the form feed character cannot be used, not even by reference.[1][citation needed] As another example, &#128;, which is a reference to another control character, is not allowed to be used or referenced in either HTML or XML, but when used in HTML, it is usually not flagged as an error by web browsers – some of which interpret it as a reference to the character represented by code value 128 in the Windows-1252 encoding for compatibility reasons. This character, "€", has to be represented as &#8364; in a standard-compliant HTML code. As a further example, prior to the publication of XML 1.0 Second Edition on October 6, 2000, XML 1.0 was based on an older version of ISO 10646 and prohibited using characters above U+FFFD, except in character data, thus making a reference like &#65536; (U+10000) illegal. In XML 1.1 and newer editions of XML 1.0, such a reference is allowed, because the available character repertoire was explicitly extended.

Markup languages also place restrictions on where character references can occur.

Compatibility issues

[edit]

In the initial versions of SGML and HTML, numeric character references were interpreted in relationship to the document character encoding, rather than Unicode. For Latin-script documents, numeric character references to characters between x80 and x9F in those documents will not be correct against Unicode, and must be recoded. HTML standards prior to HTML 4 supported only Western Latin script documents: the treatment of character references above #7F may vary between applications and national conventions.

For example, as mentioned above, the correct numeric character reference for the Euro sign "€" U+20AC when using Unicode is decimal &#8364; and hexadecimal &#x20AC;. However, if using tools supporting obsolete implementations of HTML, the reference &#128; (Euro sign in the CP-1252 code page) or &#164; (Euro sign in ISO/IEC 8859-15) may work.

As another example, if some text was created originally using the MacRoman character set, the left double quotation mark " will be represented with code point xD2. This will not display properly in a system expecting a document encoded as UTF-8, ISO 8859-1, or CP-1252, where this code point is occupied by the letter ò. The correct numeric character reference for " in HTML 4 and newer is &#x201C;, because U+201C is its UCS code. In some systems, the named character reference &ldquo; may also be available.

See also

[edit]

References

[edit]
  1. ^ "HTML 5.2: 8. The HTML syntax". www.w3.org.
为什么一热脸就特别红 面瘫是什么引起的 厚黑学什么意思 工作单位是什么 梦见芝麻是什么意思
缢死是什么意思 海棠花什么季节开花 肛门出血什么原因 人流后吃什么补身体 养肝要吃什么
螺蛳粉为什么那么臭 鱼油是什么鱼提炼的 排卵试纸阴性是什么意思 pc肌是什么 阿司匹林什么时间吃最好
山梨酸是什么 女性尿血是什么原因引起的 spank是什么意思 头响脑鸣是什么原因引起的 蝴蝶的翅膀像什么
射进去是什么感觉hcv8jop3ns3r.cn 金牛座是什么星象hcv8jop0ns1r.cn 牛逼是什么意思hcv8jop3ns6r.cn 晚上9点是什么时辰jinxinzhichuang.com 副词是什么adwl56.com
人老放屁是什么原因hcv7jop6ns6r.cn 白带黄用什么药hcv9jop5ns4r.cn 外籍是什么意思hcv8jop2ns0r.cn 言字旁的字和什么有关hcv8jop3ns9r.cn 车暴晒有什么影响wuhaiwuya.com
熊猫血是什么血型hcv8jop5ns7r.cn 可人是什么意思hcv7jop9ns5r.cn 偶发性房性早搏是什么意思hcv7jop6ns0r.cn 嫦娥奔月是什么节日hcv9jop0ns2r.cn 高考报名号是什么hcv9jop4ns3r.cn
为什么会起鸡皮疙瘩hcv8jop6ns3r.cn 辟邪是什么意思hcv7jop4ns5r.cn 分泌多巴胺是什么意思hcv8jop1ns6r.cn 心电图是什么hcv8jop9ns0r.cn 血红蛋白什么意思hcv8jop8ns9r.cn
百度