舌苔厚是什么原因引起的| 庚什么意思| ube手术是什么意思| 叶绿素主要吸收什么光| 破财消灾什么意思| 为什么要做羊水穿刺检查| 刘姥姥和贾府什么关系| 脑梗不能吃什么| 万圣节应该送什么礼物| 白猫进家有什么预兆| 宫寒是什么| 女人梦见自己掉牙齿是什么征兆| 宫颈小有什么影响| 为什么来月经肚子疼| 1月13日是什么星座| 一什么招牌| 把你的心我的心串一串是什么歌| 大人睡觉流口水是什么原因引起的| 八哥鸟吃什么| 酥油是什么油| 蜈蚣代表什么生肖| 宇五行属性是什么| 小便有点黄是什么原因| 费玉清为什么不结婚| 大家闺秀是什么生肖| 套话是什么意思| 什么是巨细胞病毒| 脚气用什么药最好| 安分守己什么意思| 苦瓜煮水喝有什么功效| 谷丙转氨酶偏高吃什么药| 屁股长痘痘用什么药膏| 护士节送什么鲜花| au999是什么金| 过敏性鼻炎有什么症状| 什么米叫粳米| 摩羯座是什么象| 庸人自扰之是什么意思| pd是什么金属| 梦见很多人是什么意思| 被和谐了是什么意思| 少字五行属什么| 竹棉和纯棉有什么区别| 颈椎病去医院挂什么科| 小便有点刺痛是什么原因引起的| 女人梦见棺材代表什么| 下架是什么意思| 梅花在什么季节开放| 为什么会长花斑癣| 楚国是现在的什么地方| 属蛇的人适合佩戴什么| 国际是什么意思| 哺乳期可以吃什么消炎药| 为什么会得人乳头瘤病毒| 9.25是什么星座| 湿度大对人体有什么影响| 有肝病的人吃什么好| 吃什么降脂肪最快| 白蛋白偏高是什么原因| 给医生送锦旗写什么| 起风疹了用什么快速方法能解决| 夫妻肺片里面都有什么| 重字五行属什么| 什么降血糖| 做梦来月经了什么预兆| 妙手回春是什么意思| 晕车贴什么时候贴| 处暑是什么季节| 君王是什么生肖| 过敏源挂什么科| 花漾是什么意思| 狮子座与什么星座最配| 老蒯是什么意思| 125是什么意思| 跳空缺口是什么意思| 吃万艾可有什么副作用| 为什么要抽烟| 蛇形分班是什么意思| 小孩腿抽筋是什么原因引起的| 冠心病喝什么茶最好| 西瓜可以做什么饮料| alpha是什么意思| 焗是什么意思| 劲酒加红牛有什么功能| 禄代表什么生肖| 弓山文念什么| 助听器什么牌子最好| 商鞅变法是什么时期| 婚检挂什么科| 彩超低回声是什么意思| 脑子萎缩是什么原因造成的| 李子吃多了有什么坏处| 特殊门诊是什么意思| 吃什么能提高代谢| 耳堵是什么| 七月七是什么节| 治前列腺炎吃什么药效果最好| 两女 一杯是什么| 象是什么结构的字| 棉花代表什么生肖| smt是什么| 婴儿游泳有什么好处和坏处| 总胆红素是什么| 胃胀不消化吃什么药| 乘字五行属什么| 大便干燥是什么原因| 蜘蛛吃什么食物| 奶粉罐可以做什么手工| 肾病可以吃什么水果| 做梦梦见兔子是什么意思| 过敏性鼻炎吃什么| 蓁是什么意思| 看望老人买什么礼物好| jessica是什么意思| 脂溢性脱发是什么意思| 阑尾炎手术后吃什么好| 金骏眉属于什么茶类| 灵媒是什么意思| 为什么清真不吃猪肉| 男人脖子后面有痣代表什么| 高铁上不能带什么东西| 八月二十五是什么星座| 马到成功是什么生肖| 十二生肖排第一是什么生肖| 吃什么尿酸降得快| 备孕期间不能吃什么| 12月10日是什么星座| 胃疼肚子疼是什么原因| 叶倩文属什么生肖| 女人长期喝西洋参有什么好处| 什么是龙抬头| 目瞪口呆是什么生肖| creative是什么意思| 七七事变是什么生肖| 阿耨多罗三藐三菩提是什么意思| 是什么日子| 相濡以沫不如相忘于江湖是什么意思| 井代表什么数字| 长期肚子疼是什么原因| 白醋加盐洗脸有什么好处| 惨不忍睹是什么意思| 宝珀手表属于什么档次| 梦见自己被绑架了是什么意思| 诛仙讲的是什么故事| 大姨妈来了能吃什么水果| 什么是心理学| 嘴唇薄的男人面相代表什么意味| 什么原因引起高血压| 盆腔积液吃什么消炎药| 什么中药补气血效果最好| 红斑狼疮是什么引起的| 新疆人是什么人种| 天肖是什么生肖| 上海的市花是什么花| 孕妇喝咖啡有什么危害| 白癜风是什么病| 主观意识是什么意思| 大小脸是什么原因造成的| 鬼冢虎什么档次| 猫在风水上代表什么| 什么是前列腺| 感情里什么叫偏爱| 努力的意义是什么| 焦虑症什么症状| 黄加蓝色是什么颜色| 花痴是什么意思| 跳大神什么意思| 2026年是什么生肖年| 2021年是什么生肖| 火代表什么数字| 药流是吃什么药| 肚脐眼上面疼是什么原因| 入驻是什么意思| 骨髓增生活跃是什么意思| 人乳头病毒是什么意思| 28岁今年属什么| 甲状腺回声不均匀什么意思| 大伽是什么意思| 心悸是什么病| 脸上长癣是什么原因| 肠镜前一天可以吃什么| 更年期什么症状| c反应蛋白是什么意思| 2023年五行属什么| 世界第一大河是什么河| 戌时是什么时候| 阴虚火旺有什么症状| g6pd是检查什么的| 天秤座男生和什么星座最配| 点蜡烛什么意思| 高冷什么意思| 血气方刚什么意思| 什么名字最好听| maxrieny是什么品牌| 脾大是什么原因造成的| 画蛇添足什么意思| 恨天高是什么意思| o型血和o型血生的孩子是什么血型| 牛磺酸是什么东西| 吃月饼是什么节日| 囊肿是什么东西| 包袱什么意思| 肠道问题挂什么科| 绿色裤子配什么上衣| 甘油三酯高有什么症状| 中气下陷是什么意思| 脖子大是什么原因| 冰妹什么意思| 01年属什么生肖| edp是什么意思| 黑枸杞和红枸杞有什么区别| 女人梦见桃子预示什么| 吃人嘴短拿人手软什么意思| 二尖瓣微量反流什么意思| 猥琐什么意思| 徒手是什么意思| 精满自溢是什么意思| 属牛的跟什么属相最配| 老人适合吃什么水果| pdi是什么| ade是什么意思| 左脸长痘是什么原因| 当归是什么| 舟五行属什么| 什么时间量血压最准| 4月23日是什么星座| 伊朗说什么语言| 比肩劫财是什么意思| 梨状肌综合征吃什么药| 运动后喝什么饮料最好| 舅舅和外甥女是什么关系| 三顾茅庐是什么意思| 螺蛳粉为什么那么臭| 贵族是什么意思啊| 舌头尖有小红点这是什么症状| 梦到女鬼是什么意思| 9月是什么星座| 众望所归是什么意思| 成吉思汗是什么意思| 绣球花什么时候开花| 拔智齿后吃什么消炎药| 为什么会呼吸性碱中毒| 孕妇贫血吃什么好| 58年属什么| 8月6号什么星座| 6月24日什么星座| 头皮痒用什么洗发水效果好| 想怀孕需要检查什么项目| 予字五行属什么| 干预是什么意思| 梦见小白蛇是什么预兆| 化妆品属于什么行业| 什么样的夜晚| s是什么m是什么| 乳糖不耐受什么意思| 白茶和绿茶有什么区别| 口腔溃疡反复发作是什么原因| 女性肝阳上亢吃什么药| 秦五行属什么| 内什么外什么| 尿检粘液丝高什么意思| 平均血小板体积偏低是什么意思| 籍贯一般写什么| 梦见牛顶我是什么意思| 百度Jump to content

从棉衣到短袖就一天 西安昨26.7℃创今年最高温

From Wikipedia, the free encyclopedia
(Redirected from Gawk (GNU package))
AWK
Usage of AWK in shell to check matching fields in two files
ParadigmScripting, procedural, data-driven[1]
Designed byAlfred Aho, Peter Weinberger, and Brian Kernighan
First appeared1977; 48 years ago (1977)
Stable release
IEEE Std 1003.1-2008 (POSIX) / 1985
Typing disciplinenone; can handle strings, integers and floating-point numbers; regular expressions
OSCross-platform
Major implementations
awk, GNU Awk, mawk, nawk, MKS AWK, Thompson AWK (compiler), Awka (compiler)
Dialects
old awk oawk 1977, new awk nawk 1985, GNU Awk gawk
Influenced by
C, sed, SNOBOL[2][3]
Influenced
Tcl, AMPL, Perl, Korn Shell (ksh93, dtksh, tksh), Lua
百度 此外,金融城三期还规划了成都交易所大厦项目,预计2020年竣工并投用,计划重点引入10家以上要素交易类机构,5家以上总部经济类机构入驻。

AWK (/??k/[4]) is a domain-specific language designed for text processing and typically used as a data extraction and reporting tool. Like sed and grep, it is a filter,[4] and it is a standard feature of most Unix-like operating systems.

The AWK language is a data-driven scripting language consisting of a set of actions to be taken against streams of textual data – either run directly on files or used as part of a pipeline – for purposes of extracting or transforming text, such as producing formatted reports. The language extensively uses the string datatype, associative arrays (that is, arrays indexed by key strings), and regular expressions. While AWK has a limited intended application domain and was especially designed to support one-liner programs, the language is Turing-complete, and even the early Bell Labs users of AWK often wrote well-structured large AWK programs.[5]

AWK was created at Bell Labs in the 1970s,[6] and its name is derived from the surnames of its authors: Alfred Aho (author of egrep), Peter Weinberger (who worked on tiny relational databases), and Brian Kernighan. The acronym is pronounced the same as the name of the bird species auk, which is illustrated on the cover of The AWK Programming Language.[7] When written in all lowercase letters, as awk, it refers to the Unix or Plan 9 program that runs scripts written in the AWK programming language.

History

[edit]

According to Brian Kernighan, one of the goals of AWK was to have a tool that would easily manipulate both numbers and strings. AWK was also inspired by Marc Rochkind's programming language that was used to search for patterns in input data, and was implemented using yacc.[8]

As one of the early tools to appear in Version 7 Unix, AWK added computational features to a Unix pipeline besides the Bourne shell, the only scripting language available in a standard Unix environment. It is one of the mandatory utilities of the Single UNIX Specification,[9] and is required by the Linux Standard Base specification.[10]

In 1983, AWK was one of several UNIX tools available for Charles River Data Systems' UNOS operating system under Bell Laboratories license.[11]

AWK was significantly revised and expanded in 1985–88, resulting in the GNU AWK implementation written by Paul Rubin, Jay Fenlason, and Richard Stallman, released in 1988.[12] GNU AWK may be the most widely deployed version[13] because it is included with GNU-based Linux packages. GNU AWK has been maintained solely by Arnold Robbins since 1994.[12] Brian Kernighan's nawk (New AWK) source was first released in 1993 unpublicized, and publicly since the late 1990s; many BSD systems use it to avoid the GPL license.[12]

AWK was preceded by sed (1974). Both were designed for text processing. They share the line-oriented, data-driven paradigm, and are particularly suited to writing one-liner programs, due to the implicit main loop and current line variables. The power and terseness of early AWK programs – notably the powerful regular expression handling and conciseness due to implicit variables, which facilitate one-liners – together with the limitations of AWK at the time, were important inspirations for the Perl language (1987). In the 1990s, Perl became very popular, competing with AWK in the niche of Unix text-processing languages.

Structure of AWK programs

[edit]

AWK reads the input a line at a time. A line is scanned for each pattern in the program, and for each pattern that matches, the associated action is executed.

—?Alfred V. Aho[14]

An AWK program is a series of pattern action pairs, written as:

condition { action }
condition { action }
...

where condition is typically an expression and action is a series of commands. The input is split into records, where by default records are separated by newline characters so that the input is split into lines. The program tests each record against each of the conditions in turn, and executes the action for each expression that is true. Either the condition or the action may be omitted. The condition defaults to matching every record. The default action is to print the record. This is the same pattern-action structure as sed.

In addition to a simple AWK expression, such as foo == 1 or /^foo/, the condition can be BEGIN or END causing the action to be executed before or after all records have been read, or pattern1, pattern2 which matches the range of records starting with a record that matches pattern1 up to and including the record that matches pattern2 before again trying to match against pattern1 on subsequent lines.

In addition to normal arithmetic and logical operators, AWK expressions include the tilde operator, ~, which matches a regular expression against a string. As handy syntactic sugar, /regexp/ without using the tilde operator matches against the current record; this syntax derives from sed, which in turn inherited it from the ed editor, where / is used for searching. This syntax of using slashes as delimiters for regular expressions was subsequently adopted by Perl and ECMAScript, and is now common. The tilde operator was also adopted by Perl.

Commands

[edit]

AWK commands are the statements that are substituted for action in the examples above. AWK commands can include function calls, variable assignments, calculations, or any combination thereof. AWK contains built-in support for many functions; many more are provided by the various flavors of AWK. Also, some flavors support the inclusion of dynamically linked libraries, which can also provide more functions.

The print command

[edit]

The print command is used to output text. The output text is always terminated with a predefined string called the output record separator (ORS) whose default value is a newline. The simplest form of this command is:

print
This displays the contents of the current record. In AWK, records are broken down into fields, and these can be displayed separately:
print $1
Displays the first field of the current record
print $1, $3
Displays the first and third fields of the current record, separated by a predefined string called the output field separator (OFS) whose default value is a single space character

Although these fields ($X) may bear resemblance to variables (the $ symbol indicates variables in the usual Unix shells and in Perl), they actually refer to the fields of the current record. A special case, $0, refers to the entire record. In fact, the commands "print" and "print $0" are identical in functionality.

The print command can also display the results of calculations and/or function calls:

/regex_pattern/ {
    # Actions to perform in the event the record (line) matches the above regex_pattern
    print 3+2
    print foobar(3)
    print foobar(variable)
    print sin(3-2)
}

Output may be sent to a file:

/regex_pattern/ {
    # Actions to perform in the event the record (line) matches the above regex_pattern
    print "expression" > "file name"
}

or through a pipe:

/regex_pattern/ {
    # Actions to perform in the event the record (line) matches the above regex_pattern
    print "expression" | "command"
}

Built-in variables

[edit]

AWK's built-in variables include the field variables: $1, $2, $3, and so on ($0 represents the entire record). They hold the text or values in the individual text-fields in a record.

Other variables include:

  • NR: Number of Records. Keeps a current count of the number of input records read so far from all data files. It starts at zero, but is never automatically reset to zero.[15]
  • FNR: File Number of Records. Keeps a current count of the number of input records read so far in the current file. This variable is automatically reset to zero each time a new file is started.[15]
  • NF: Number of Fields. Contains the number of fields in the current input record. The last field in the input record can be designated by $NF, the 2nd-to-last field by $(NF-1), the 3rd-to-last field by $(NF-2), etc.
  • FILENAME: Contains the name of the current input-file.
  • FS: Field Separator. Contains the "field separator" used to divide fields in the input record. The default, "white space", allows any sequence of space and tab characters. FS can be reassigned with another character or character sequence to change the field separator.
  • RS: Record Separator. Stores the current "record separator" character. Since, by default, an input line is the input record, the default record separator character is a "newline".
  • OFS: Output Field Separator. Stores the "output field separator", which separates the fields when awk prints them. The default is a "space" character.
  • ORS: Output Record Separator. Stores the "output record separator", which separates the output records when awk prints them. The default is a "newline" character.
  • OFMT: Output Format. Stores the format for numeric output. The default format is "%.6g".

Variables and syntax

[edit]

Variable names can use any of the characters [A-Za-z0-9_], with the exception of language keywords, and cannot begin with a numeric digit. The operators + - * / represent addition, subtraction, multiplication, and division, respectively. For string concatenation, simply place two variables (or string constants) next to each other. It is optional to use a space in between if string constants are involved, but two variable names placed adjacent to each other require a space in between. Double quotes delimit string constants. Statements need not end with semicolons. Finally, comments can be added to programs by using # as the first character on a line, or behind a command or sequence of commands.

User-defined functions

[edit]

In a format similar to C, function definitions consist of the keyword function, the function name, argument names and the function body. Here is an example of a function.

function add_three(number) {
    return number + 3
}

This statement can be invoked as follows:

(pattern) {
   print add_three(36)     # Outputs '''39'''
}

Functions can have variables that are in the local scope. The names of these are added to the end of the argument list, though values for these should be omitted when calling the function. It is convention to add some whitespace in the argument list before the local variables, to indicate where the parameters end and the local variables begin.

Examples

[edit]

Hello, World!

[edit]

Here is the customary "Hello, World!" program written in AWK:

BEGIN {
        print "Hello, world!"
        exit
}
[edit]

Print all lines longer than 80 characters. The default action is to print the current line.

length($0) > 80

Count words

[edit]

Count words in the input and print the number of lines, words, and characters (like wc):

{
    words += NF
    chars += length + 1 # add one to account for the newline character at the end of each record (line)
}
END { print NR, words, chars }

As there is no pattern for the first line of the program, every line of input matches by default, so the increment actions are executed for every line. words += NF is shorthand for words = words + NF.

Sum last word

[edit]
{ s += $NF }
END { print s + 0 }

s is incremented by the numeric value of $NF, which is the last word on the line as defined by AWK's field separator (by default, white-space). NF is the number of fields in the current line, e.g. 4. Since $4 is the value of the fourth field, $NF is the value of the last field in the line regardless of how many fields this line has, or whether it has more or fewer fields than surrounding lines. $ is actually a unary operator with the highest operator precedence. (If the line has no fields, then NF is 0, $0 is the whole line, which in this case is empty apart from possible white-space, and so has the numeric value 0.)

At the end of the input, the END pattern matches, so s is printed. However, since there may have been no lines of input at all, in which case no value has ever been assigned to s, s will be an empty string by default. Adding zero to a variable is an AWK idiom for coercing it from a string to a numeric value. This results from AWK's arithmetic operators, like addition, implicitly casting their operands to numbers before computation as required. (Similarly, concatenating a variable with an empty string coerces from a number to a string, e.g., s "". Note, there is no operator to concatenate strings, they are just placed adjacently.) On an empty input, the coercion in { print s + 0 } causes the program to print 0, whereas with just the action { print s }, an empty line would be printed.

Match a range of input lines

[edit]
NR % 4 == 1, NR % 4 == 3 { printf "%6d  %s\n", NR, $0 }

The action statement prints each line numbered. The printf function emulates the standard C printf and works similarly to the print command described above. The pattern to match, however, works as follows: NR is the number of records, typically lines of input, AWK has so far read, i.e. the current line number, starting at 1 for the first line of input. % is the modulo operator. NR % 4 == 1 is true for the 1st, 5th, 9th, etc., lines of input. Likewise, NR % 4 == 3 is true for the 3rd, 7th, 11th, etc., lines of input. The range pattern is false until the first part matches, on line 1, and then remains true up to and including when the second part matches, on line 3. It then stays false until the first part matches again on line 5.

Thus, the program prints lines 1,2,3, skips line 4, and then 5,6,7, and so on. For each line, it prints the line number (on a 6 character-wide field) and then the line contents. For example, when executed on this input:

Rome
Florence
Milan
Naples
Turin
Venice

The previous program prints:

     1 Rome
     2 Florence
     3 Milan
     5 Turin
     6 Venice

Printing the initial or the final part of a file

[edit]

As a special case, when the first part of a range pattern is constantly true, e.g. 1, the range will start at the beginning of the input. Similarly, if the second part is constantly false, e.g. 0, the range will continue until the end of input. For example,

 /^--cut here--$/, 0

prints lines of input from the first line matching the regular expression ^--cut here--$, that is, a line containing only the phrase "--cut here--", to the end.

Calculate word frequencies

[edit]

Word frequency using associative arrays:

BEGIN {
    FS="[^a-zA-Z]+"
}
{
    for (i=1; i<=NF; i++)
        words[tolower($i)]++
}
END {
    for (i in words)
        print i, words[i]
}

The BEGIN block sets the field separator to any sequence of non-alphabetic characters. Separators can be regular expressions. After that, we get to a bare action, which performs the action on every input line. In this case, for every field on the line, we add one to the number of times that word, first converted to lowercase, appears. Finally, in the END block, we print the words with their frequencies. The line

for (i in words)

creates a loop that goes through the array words, setting i to each subscript of the array. This is different from most languages, where such a loop goes through each value in the array. The loop thus prints out each word followed by its frequency count. tolower was an addition to the One True awk (see below) made after the book was published.

Match pattern from command line

[edit]

This program can be represented in several ways. The first one uses the Bourne shell to make a shell script that does everything. It is the shortest of these methods:

#!/bin/sh

pattern="$1"
shift
awk '/'"$pattern"'/ { print FILENAME ":" $0 }' "$@"

The $pattern in the awk command is not protected by single quotes so that the shell does expand the variable but it needs to be put in double quotes to properly handle patterns containing spaces. A pattern by itself in the usual way checks to see if the whole line ($0) matches. FILENAME contains the current filename. awk has no explicit concatenation operator; two adjacent strings concatenate them. $0 expands to the original unchanged input line.

There are alternate ways of writing this. This shell script accesses the environment directly from within awk:

#!/bin/sh

export pattern="$1"
shift
awk '$0 ~ ENVIRON["pattern"] { print FILENAME ":" $0 }' "$@"

This is a shell script that uses ENVIRON, an array introduced in a newer version of the One True awk after the book was published. The subscript of ENVIRON is the name of an environment variable; its result is the variable's value. This is like the getenv function in various standard libraries and POSIX. The shell script makes an environment variable pattern containing the first argument, then drops that argument and has awk look for the pattern in each file.

~ checks to see if its left operand matches its right operand; !~ is its inverse. A regular expression is just a string and can be stored in variables.

The next way uses command-line variable assignment, in which an argument to awk can be seen as an assignment to a variable:

#!/bin/sh

pattern="$1"
shift
awk '$0 ~ pattern { print FILENAME ":" $0 }' pattern="$pattern" "$@"

Or You can use the -v var=value command line option (e.g. awk -v pattern="$pattern" ...).

Finally, this is written in pure awk, without help from a shell or without the need to know too much about the implementation of the awk script (as the variable assignment on command line one does), but is a bit lengthy:

BEGIN {
    pattern = ARGV[1]
    for (i = 1; i < ARGC; i++) # remove first argument
        ARGV[i] = ARGV[i + 1]
    ARGC--
    if (ARGC == 1) { # the pattern was the only thing, so force read from standard input (used by book)
        ARGC = 2
        ARGV[1] = "-"
    }
}
$0 ~ pattern { print FILENAME ":" $0 }

The BEGIN is necessary not only to extract the first argument, but also to prevent it from being interpreted as a filename after the BEGIN block ends. ARGC, the number of arguments, is always guaranteed to be ≥1, as ARGV[0] is the name of the command that executed the script, most often the string "awk". ARGV[ARGC] is the empty string, "". # initiates a comment that expands to the end of the line.

Note the if block. awk only checks to see if it should read from standard input before it runs the command. This means that

awk 'prog'

only works because the fact that there are no filenames is only checked before prog is run! If you explicitly set ARGC to 1 so that there are no arguments, awk will simply quit because it feels there are no more input files. Therefore, you need to explicitly say to read from standard input with the special filename -.

Self-contained AWK scripts

[edit]

On Unix-like operating systems self-contained AWK scripts can be constructed using the shebang syntax.

For example, a script that sends the content of a given file to standard output may be built by creating a file named print.awk with the following content:

#!/usr/bin/awk -f
{ print $0 }

It can be invoked with: ./print.awk <filename>

The -f tells awk that the argument that follows is the file to read the AWK program from, which is the same flag that is used in sed. Since they are often used for one-liners, both these programs default to executing a program given as a command-line argument, rather than a separate file.

Versions and implementations

[edit]

AWK was originally written in 1977 and distributed with Version 7 Unix.

In 1985 its authors started expanding the language, most significantly by adding user-defined functions. The language is described in the book The AWK Programming Language, published 1988, and its implementation was made available in releases of UNIX System V. To avoid confusion with the incompatible older version, this version was sometimes called "new awk" or nawk. This implementation was released under a free software license in 1996 and is still maintained by Brian Kernighan (see external links below).[citation needed]

Old versions of Unix, such as UNIX/32V, included awkcc, which converted AWK to C. Kernighan wrote a program to turn awk into C++; its state is not known.[16]

  • BWK awk, also known as nawk, refers to the version by Brian Kernighan. It has been dubbed the "One True AWK" because of the use of the term in association with the book that originally described the language and the fact that Kernighan was one of the original authors of AWK.[7] FreeBSD refers to this version as one-true-awk.[17] This version also has features not in the book, such as tolower and ENVIRON that are explained above; see the FIXES file in the source archive for details. This version is used by, for example, Android, FreeBSD, NetBSD, OpenBSD, macOS, and illumos. Brian Kernighan and Arnold Robbins are the main contributors to a source repository for nawk: github.com/onetrueawk/awk.
  • gawk (GNU awk) is another free-software implementation and the only implementation that makes serious progress implementing internationalization and localization and TCP/IP networking. It was written before the original implementation became freely available. It includes its own debugger, and its profiler enables the user to make measured performance enhancements to a script. It also enables the user to extend functionality with shared libraries. Some Linux distributions include gawk as their default AWK implementation.[citation needed] As of version 5.2 (September 2022) gawk includes a persistent memory feature that can remember script-defined variables and functions from one invocation of a script to the next and pass data between unrelated scripts, as described in the Persistent-Memory gawk User Manual: www.gnu.org/software/gawk/manual/pm-gawk/.
    • gawk-csv. The CSV extension of gawk provides facilities for inputting and outputting CSV formatted data.[18]
  • mawk is a very fast AWK implementation by Mike Brennan based on a bytecode interpreter.
  • libmawk is a fork of mawk, allowing applications to embed multiple parallel instances of awk interpreters.
  • awka (whose front end is written atop the mawk program) is another translator of AWK scripts into C code. When compiled, statically including the author's libawka.a, the resulting executables are considerably sped up and, according to the author's tests, compare very well with other versions of AWK, Perl, or Tcl. Small scripts will turn into programs of 160–170 kB.
  • tawk (Thompson AWK) is an AWK compiler for Solaris, DOS, OS/2, and Windows, previously sold by Thompson Automation Software (which has ceased its activities).[19]
  • Jawk is a project to implement AWK in Java, hosted on SourceForge.[20] Extensions to the language are added to provide access to Java features within AWK scripts (i.e., Java threads, sockets, collections, etc.).
  • xgawk is a fork of gawk[21] that extends gawk with dynamically loadable libraries. The XMLgawk extension was integrated into the official GNU Awk release 4.1.0.
  • QSEAWK is an embedded AWK interpreter implementation included in the QSE library that provides embedding application programming interface (API) for C and C++.[22]
  • libfawk is a very small, function-only, reentrant, embeddable interpreter written in C
  • BusyBox includes an AWK implementation written by Dmitry Zakharov. This is a very small implementation suitable for embedded systems.
  • CLAWK by Michael Parker provides an AWK implementation in Common Lisp, based upon the regular expression library of the same author.[23]
  • goawk is an AWK implementation in Go with a few convenience extensions by Ben Hoyt, hosted on Github.

The gawk manual has a list of more AWK implementations.[24]

Books

[edit]
  • Aho, Alfred V.; Kernighan, Brian W.; Weinberger, Peter J. (2025-08-05). The AWK Programming Language. New York, NY: Addison-Wesley. ISBN 0-201-07981-X. Retrieved 2025-08-05.
  • Aho, Alfred V.; Kernighan, Brian W.; Weinberger, Peter J. (2025-08-05). The AWK Programming Language, Second Edition. Hoboken, New Jersey: Addison-Wesley Professional. ISBN 978-0-13-826972-2. Archived from the original on 2025-08-05. Retrieved 2025-08-05.
  • Dougherty, Dale; Robbins, Arnold (2025-08-05). sed & awk (2nd ed.). Sebastopol, CA: O'Reilly Media. ISBN 1-56592-225-5. Retrieved 2025-08-05.
  • Robbins, Arnold (2025-08-05). Effective awk Programming (3rd ed.). Sebastopol, CA: O'Reilly Media. ISBN 0-596-00070-7. Retrieved 2025-08-05.
  • Robbins, Arnold (2000). Effective Awk Programming: A User's Guide for Gnu Awk (1.0.3 ed.). Bloomington, IN: iUniverse. ISBN 0-595-10034-1. Archived from the original on 12 April 2009. Retrieved 2025-08-05.

See also

[edit]

References

[edit]
  1. ^ Stutz, Michael (September 19, 2006). "Get started with GAWK: AWK language fundamentals" (PDF). developerWorks. IBM. Archived (PDF) from the original on 2025-08-05. Retrieved 2025-08-05. [AWK is] often called a data-driven language -- the program statements describe the input data to match and process rather than a sequence of program steps
  2. ^ Andreas J. Pilavakis (1989). UNIX Workshop. Macmillan International Higher Education. p. 196.
  3. ^ Arnold Robbins (2015). Effective Awk Programming: Universal Text Processing and Pattern Matching (4th ed.). O'Reilly Media. p. 560.
  4. ^ a b James W. Livingston (May 2, 1988). "The Great awk Program is No Birdbrain". Digital Review. p. 91.
  5. ^ Raymond, Eric S. "Applying Minilanguages". The Art of Unix Programming. Case Study: awk. Archived from the original on July 30, 2008. Retrieved May 11, 2010. The awk action language is Turing-complete, and can read and write files.
  6. ^ Aho, Alfred V.; Kernighan, Brian W.; Weinberger, Peter J. (September 1, 1978). Awk — A Pattern Scanning and Processing Language (Second Edition) (Technical report). Unix Seventh Edition Manual, Volume 2. Bell Telephone Laboratories, Inc. Retrieved February 1, 2020.
  7. ^ a b Aho, Alfred V.; Kernighan, Brian W.; Weinberger, Peter J. (1988). The AWK Programming Language. Addison-Wesley Publishing Company. ISBN 9780201079814. Retrieved 16 May 2015.
  8. ^ "UNIX Special: Profs Kernighan & Brailsford". Computerphile. September 30, 2015. Archived from the original on 2025-08-05.
  9. ^ "The Single UNIX Specification, Version 3, Utilities Interface Table". Archived from the original on 2025-08-05. Retrieved 2025-08-05.
  10. ^ "Chapter 15. Commands and Utilities". Linux Standard Base Core Specification 4.0 (Technical report). Linux Foundation. 2008. Archived from the original on 2025-08-05. Retrieved 2025-08-05.
  11. ^ The Insider's Guide To The Universe (PDF). Charles River Data Systems, Inc. 1983. p. 13.
  12. ^ a b c Robbins, Arnold (March 2014). "The GNU Project and Me: 27 Years with GNU AWK" (PDF). skeeve.com. Archived (PDF) from the original on October 6, 2014. Retrieved October 4, 2014.
  13. ^ Dougherty, Dale; Robbins, Arnold (1997). sed & awk (2nd ed.). Sebastopol, CA: O'Reilly. p. 221. ISBN 1-565-92225-5.
  14. ^ Hamilton, Naomi (May 30, 2008). "The A-Z of Programming Languages: AWK". Computerworld. Archived from the original on 2025-08-05. Retrieved 2025-08-05.
  15. ^ a b "Records". GAWK: Effective AWK Programming: A User’s Guide for GNU Awk (5.3 ed.). September 2024. Retrieved 2025-08-05.
  16. ^ Kernighan, Brian W. (April 24–25, 1991). An AWK to C++ Translator (PDF). Usenix C++ Conference. Washington, DC. pp. 217–228. Archived (PDF) from the original on 2025-08-05. Retrieved 2025-08-05.
  17. ^ "FreeBSD's work log for importing BWK awk into FreeBSD's core". May 16, 2005. Archived from the original on September 8, 2013. Retrieved September 20, 2006.
  18. ^ "CSV Processing With gawk (using the gawk-csv extension)". gawkextlib. 2018. Archived from the original on 2025-08-05.
  19. ^ James K. Lawless (May 1, 1997). "Examining the TAWK Compiler". Dr. Dobb's Journal. Archived from the original on February 21, 2020. Retrieved February 21, 2020.
  20. ^ "Jawk at SourceForge". Archived from the original on 2025-08-05. Retrieved 2025-08-05.
  21. ^ "xgawk Home Page". Archived from the original on 2025-08-05. Retrieved 2025-08-05.
  22. ^ "QSEAWK at GitHub". GitHub. Archived from the original on 2025-08-05. Retrieved 2025-08-05.
  23. ^ "CLAWK at GitHub". GitHub. Archived from the original on 2025-08-05. Retrieved 2025-08-05.
  24. ^ "B.5 Other Freely Available awk Implementations". GAWK: Effective AWK Programming: A User's Guide for GNU Awk (5.3 ed.). September 2024. Retrieved 2025-08-05.

Further reading

[edit]
[edit]
香皂和肥皂有什么区别 肛门疼痛是什么原因引起的 什么是川崎病是什么病 胃疼是什么症状 六月二号什么星座
世界八大奇迹分别是什么 一直打哈欠是什么原因 心动过缓吃什么药最好 大便带绿色是什么原因 降钙素原检测是什么
食物中毒吃什么药 24号来月经什么时候是排卵期 指甲黑是什么原因 什么知什么明 老人吃什么水果对身体好
mrcp是什么检查 太平天国失败的根本原因是什么 图谱是什么意思 甜叶菊有什么功效 昆布是什么
金樱子配什么才壮阳hcv9jop3ns2r.cn 什么人容易得多囊卵巢hcv7jop6ns9r.cn 渗湿是什么意思hcv8jop5ns1r.cn 为情所困是什么意思hkuteam.com 身体缺钾会有什么症状hcv8jop9ns1r.cn
拉稀吃什么药好xianpinbao.com 月经推迟什么原因youbangsi.com 人授和试管有什么区别hcv8jop2ns1r.cn 长生是什么意思hanqikai.com 红细胞分布宽度偏低是什么意思hcv9jop6ns2r.cn
请佛容易送佛难什么意思hcv7jop6ns0r.cn 什么是三有保护动物hcv8jop5ns1r.cn 小产是什么意思hcv9jop2ns0r.cn 什么是黑茶hcv9jop7ns4r.cn 梦见芹菜是什么意思youbangsi.com
二十七岁属什么生肖hcv8jop8ns5r.cn 夏天手脱皮是什么原因hcv9jop6ns2r.cn 祸害是什么意思wzqsfys.com 三个犬念什么字hcv8jop1ns9r.cn 黑糖和红糖有什么区别hcv8jop0ns8r.cn
百度