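grep Summary

Grep

grep searches the named input files line by line for lines that match the given pattern list. When a line matches, grep by default copies it to standard output, or produces whatever other kind of output you request through options. (The name stands for "global search regular expression (RE) and print out the line": search everywhere for a regular expression and print the matching lines.)

Syntax: grep [OPTION]... PATTERN [FILE]...

Grep command-line options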

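-NUM: also show NUM lines of context above and below each matching line. For example, grep -2 pattern filename shows the 2 lines before and after each match.
-b, --byte-offset: print the byte offset within the input file before each line of output.
-c, --count: print only the number of matching lines, not the lines themselves.
-f FILE, --file=FILE: read the patterns from FILE. An empty file contains zero patterns and therefore matches nothing.
-h, --no-filename: when searching multiple files, do not prefix output lines with the file name.
-i, --ignore-case: ignore differences between upper and lower case.
-q, --quiet, --silent: suppress all normal output and only return an exit status. An exit status of 0 means a matching line was found.
-l, --files-with-matches: print the names of the files that contain a match.
-L, --files-without-match: print the names of the files that contain no match.
-n, --line-number: print the line number before each matching line.
-s, --no-messages: suppress error messages about nonexistent or unreadable files.
-v, --invert-match: invert the search and show only the lines that do not match.
-w, --word-regexp: match the pattern only as a whole word, as if it were enclosed in \< and \>.
-V, --version: show version information.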
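
A few of the options above can be tried on a small throwaway file. This is only a sketch: the file contents are invented for illustration, and the variable names are arbitrary.

```shell
#!/bin/sh
# Hypothetical sample file, created only for this demonstration.
tmp=$(mktemp)
printf '%s\n' 'Apple pie' 'banana split' 'apple tart' 'cherry cake' > "$tmp"

# -c: print only the number of matching lines.
count=$(grep -c 'apple' "$tmp")
# -i combined with -c: ignore case, so "Apple" is counted too.
count_ci=$(grep -ci 'apple' "$tmp")
# -n: prefix each matching line with its line number.
numbered=$(grep -n 'apple' "$tmp")
# -v: select the lines that do NOT match (counted here with -c).
not_apple=$(grep -vic 'apple' "$tmp")
# -q: print nothing; the exit status alone reports whether a match exists.
if grep -q 'cherry' "$tmp"; then found=yes; else found=no; fi

echo "count=$count count_ci=$count_ci not_apple=$not_apple found=$found"
echo "$numbered"
rm -f "$tmp"
```

With this input the first echo prints count=1 count_ci=2 not_apple=2 found=yes, and the second prints 3:apple tart.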

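Special characters

^: start of line. For example, '^grep' matches all lines that begin with grep.
$: end of line. For example, 'grep$' matches all lines that end with grep.
.: matches any single character other than a newline. For example, 'gr.p' matches gr followed by any one character and then p.
*: matches zero or more of the preceding character. For example, ' *grep' matches lines containing zero or more spaces followed by grep. Used together, .* matches any sequence of characters.
[]: matches one character from the given set. For example, '[Gg]rep' matches Grep and grep.
[^]: matches one character that is not in the given set. For example, '[^A-FH-Z]rep' matches any character other than A-F and H-Z, followed by rep.
\(..\): marks (captures) the matched text. For example, in '\(love\)', love is saved as group 1 and can be referenced later as \1.
\<: start of a word. For example, '\<grep' matches lines containing a word that begins with grep.
\>: end of a word. For example, 'grep\>' matches lines containing a word that ends with grep.
x\{m\}: repeats the character x exactly m times. For example, 'o\{5\}' matches lines containing 5 consecutive o's.
x\{m,\}: repeats the character x at least m times. For example, 'o\{5,\}' matches lines with at least 5 consecutive o's.
x\{m,n\}: repeats the character x at least m and at most n times. For example, 'o\{5,10\}' matches lines with 5 to 10 consecutive o's.
\w: matches one word character, that is, a letter, digit, or underscore ([A-Za-z0-9_]). For example, 'G\w*p' matches G followed by zero or more word characters and then p.
\W: the inverse of \w; matches one non-word character, such as a period or other punctuation.
\b: word boundary. For example, '\bgrep\b' matches grep only as a whole word.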
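
The following sketch exercises several of these special characters. The sample lines are made up for illustration. The patterns use basic regular expression syntax only, and the final command relies on the -w option, which common implementations such as GNU grep provide.

```shell
#!/bin/sh
# Hypothetical sample lines, one per case we want to show.
tmp=$(mktemp)
printf '%s\n' 'grep is handy' 'I like grep' 'Grep and egrep' 'wooooow' 'love is love' > "$tmp"

starts=$(grep -c '^grep' "$tmp")           # lines beginning with grep
ends=$(grep -c 'grep$' "$tmp")             # lines ending with grep
either_case=$(grep -c '[Gg]rep' "$tmp")    # Grep or grep anywhere in the line
five_o=$(grep -c 'o\{5\}' "$tmp")          # five consecutive o's
repeated=$(grep -c '\(love\).*\1' "$tmp")  # love, then love again via \1
whole_word=$(grep -wc 'grep' "$tmp")       # grep as a whole word, so not egrep

echo "starts=$starts ends=$ends either_case=$either_case"
echo "five_o=$five_o repeated=$repeated whole_word=$whole_word"
rm -f "$tmp"
```

Counting matches with -c keeps the output compact; dropping the c prints the matching lines themselves.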

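Note in particular that the special characters of regular expressions are not the same as the wildcards typed on the command line. In a wildcard, * stands for zero or more arbitrary characters, whereas in a regular expression * repeats the preceding character zero or more times. The meanings differ, so do not confuse the two!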
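
The difference can be seen side by side in a scratch directory. This is a sketch only; the file names data.txt, demo.txt, and other.txt are invented for the demonstration.

```shell
#!/bin/sh
# Work in a scratch directory so the wildcard only sees our hypothetical files.
dir=$(mktemp -d)
cd "$dir" || exit 1
printf 'test line\n' > data.txt
printf 'test line\n' > demo.txt
printf 'test line\n' > other.txt

# Unquoted d* is a shell wildcard: the shell expands it to "data.txt demo.txt"
# before grep runs, so grep never searches other.txt.
glob_files=$(grep -l 'test' d* | sort | tr '\n' ' ')

# Quoted 'd*' is a regular expression: zero or more "d" characters.
# Zero occurrences is allowed, so it matches a line with no "d" at all.
regex_hits=$(printf 'abc\n' | grep -c 'd*')

echo "wildcard expanded to: $glob_files"
echo "lines matched by the regex d*: $regex_hits"
cd / && rm -rf "$dir"
```

Quoting the pattern is what keeps the shell from treating it as a wildcard, which is why the examples here put patterns in single quotes.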

Grep examples

For ease of explanation, we use two files, cwot.conf and cwot.log, as examples. cwot.conf is a typical server configuration file with the following contents:

# CWOT Configuration File: See the cwot(8) manpage for details
# ========================== Connection =============================
# What ports, IPs and protocols we listen for
Port 1009
# Use these options to restrict which interfaces/protocols cwot will bind to
#ListenAddress 0.0.0.0
# Timeout: The number of seconds before receives and sends time out.
Timeout 300
# ============================ File =================================
# The LockFile directive sets the path to the lockfile used by cwot
# This directive should normally be left at its default value except
# the directory is NFS mounted.
LockFile /var/lock/cwot/cwot.lock
# PidFile: The file in which the server should record its process
# identification number when it starts.
PidFile /var/run/cwot.pid

cwot.log is a log file that contains entries such as the following:

1997/06/30 23:03:34 +0800 cwot: successful login f854 from f854@presenter
1997/06/30 23:30:13 +0800 cwot: fail to login jack from f891@presenter
1997/06/30 23:42:30 +0800 cwot: user f854 logout
1997/06/30 23:46:27 +0800 cwot: fail to login f823 from f823@previewer
1997/06/30 23:52:54 +0800 cwot: successful login f823 from f823@previewer
1997/06/30 23:54:34 +0800 cwot: fail to login jack from f891@presenter
1997/06/30 23:58:23 +0800 cwot: successful login fred from fred@presenter
1997/06/30 23:48:29 +0800 cwot: fail to login jack from f891@presenter
1997/06/30 23:58:48 +0800 cwot: fail to login jack from f891@presenter
1997/06/30 23:59:14 +0800 cwot: fail to login jack from f891@presenter
1997/06/30 23:59:30 +0800 cwot: user f821 logout
1997/06/30 23:59:31 +0800 cwot: successful login jack from f891@presenter
1997/06/30 23:59:50 +0800 cwot: user f826 logout

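1. Searching for a string

To search for a specific string with a regular expression, it is generally enough to type the string itself. For example, to search for "File" in the file cwot.conf, run grep -e File cwot.conf. The output is: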

# grep -e File cwot.conf
# CWOT Configuration File: See the cwot(8) manpage for details
# ============================ File =================================
# The LockFile directive sets the path to the lockfile used by cwot
LockFile /var/lock/cwot/cwot.lock
# PidFile: The file in which the server should record its process
PidFile /var/run/cwot.pid
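
The -e option used here can also be given more than once, in which case a line is selected if it matches any of the patterns. A small sketch, using a handful of lines copied from the cwot.conf listing (the temporary file and variable names are arbitrary):

```shell
#!/bin/sh
# A few non-comment lines copied from the cwot.conf listing.
conf=$(mktemp)
cat > "$conf" <<'EOF'
Port 1009
#ListenAddress 0.0.0.0
Timeout 300
LockFile /var/lock/cwot/cwot.lock
PidFile /var/run/cwot.pid
EOF

# Each -e adds one pattern; a line is printed if it matches any of them.
either=$(grep -e 'Port' -e 'Timeout' "$conf")
# A single -e behaves like a plain pattern argument; -c counts the matches.
file_count=$(grep -c -e 'File' "$conf")

echo "$either"
echo "lines containing File: $file_count"
rm -f "$conf"
```

On this excerpt the first command prints the Port and Timeout lines, and the count of lines containing File is 2.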

2. Any character (.)

In a regular expression, a dot (.) matches any single character. For example, the command grep f821@ cwot.log searches for the string "f821@". The output is:

# grep f821@ cwot.log
1997/06/30 23:57:14 +0800 cwot: successful login jack from f821@previewer
1997/07/01 00:00:49 +0800 cwot: successful login jack from f821@previewer

Using grep f82.@ cwot.log, you can search for strings that have exactly one character between "f82" and "@":

# grep f82.@ cwot.log
1997/06/30 23:46:27 +0800 cwot: fail to login f823 from f823@previewer
1997/06/30 23:52:54 +0800 cwot: successful login f823 from f823@previewer
1997/06/30 23:55:44 +0800 cwot: successful login f826 from f826@operator
1997/06/30 23:57:14 +0800 cwot: successful login jack from f821@previewer
1997/07/01 00:00:49 +0800 cwot: successful login jack from f821@previewer
1997/07/01 00:13:38 +0800 cwot: successful login f822 from f822@previewer
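
Because the dot matches any character, it also matches characters you may not have intended; a backslash makes it literal. The sketch below uses three lines taken from the cwot.log listing, and the date patterns are chosen only to show the contrast.

```shell
#!/bin/sh
# Three entries copied from the cwot.log listing.
log=$(mktemp)
cat > "$log" <<'EOF'
1997/06/30 23:03:34 +0800 cwot: successful login f854 from f854@presenter
1997/06/30 23:46:27 +0800 cwot: fail to login f823 from f823@previewer
1997/06/30 23:59:30 +0800 cwot: user f821 logout
EOF

# f82, then any one character, then @: only the f823@ entry qualifies.
any_char=$(grep -c 'f82.@' "$log")
# The dot matches the "/" in 06/30, so every entry matches.
dot_any=$(grep -c '06.30' "$log")
# An escaped dot matches only a literal ".", which no entry contains.
dot_literal=$(grep -c '06\.30' "$log")

echo "any_char=$any_char dot_any=$dot_any dot_literal=$dot_literal"
rm -f "$log"
```

This prints any_char=1 dot_any=3 dot_literal=0.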

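3. Character lists ([...])

The dot (.) matches any character, but sometimes we want to match only a few specific characters. Square brackets ([...]) handle this: just put the characters to match between "[" and "]". For example, in the command grep 'f82[012]@' cwot.log, the pattern "f82[012]@" finds strings that have a "0", "1", or "2" between "f82" and "@". The output is:

# grep 'f82[012]@' cwot.log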
1997/06/30 23:57:14 +0800 cwot: successful login jack from f821@previewer
1997/07/01 00:00:49 +0800 cwot: successful login jack from f821@previewer
1997/07/01 00:13:38 +0800 cwot: successful login f822 from f822@previewer
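
Bracket expressions also accept ranges and negation. A brief sketch, using a made-up list of user@host strings in the style of the log entries:

```shell
#!/bin/sh
# Hypothetical user@host strings modelled on the log entries.
users=$(mktemp)
printf '%s\n' 'f821@previewer' 'f822@previewer' 'f823@previewer' \
    'f826@operator' 'f854@presenter' > "$users"

# Explicit list: 0, 1, or 2 after f82.
listed=$(grep -c 'f82[012]@' "$users")
# Range: any digit from 0 through 5 after f82.
ranged=$(grep -c 'f82[0-5]@' "$users")
# Negation: any single character except 1 or 2 after f82.
negated=$(grep -c 'f82[^12]@' "$users")

echo "listed=$listed ranged=$ranged negated=$negated"
rm -f "$users"
```

This prints listed=2 ranged=3 negated=2. The single quotes matter here, since an unquoted [...] is also a shell wildcard.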

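4. Repeating any number of times (*)

The asterisk (*) lets the preceding element match any number of times, including zero. For example, in the command grep 'f82*@' cwot.log, the pattern "f82*@" finds strings that have any number of "2" characters between "f8" and "@". The output is: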

1997/07/01 00:13:38 +0800 cwot: successful login f822 from f822@previewer

In another example, the command grep '=* File' cwot.conf produces:

# CWOT Configuration File: See the cwot(8) manpage for details
# ============================ File =================================
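
It helps to compare * applied to a single character with .* applied to any character. The sketch below uses invented user@host strings.

```shell
#!/bin/sh
# Hypothetical inputs: zero, one, and two "2" characters, plus a different digit.
tmp=$(mktemp)
printf '%s\n' 'f8@host' 'f82@host' 'f822@host' 'f821@host' > "$tmp"

# 2* repeats only the "2": f8@, f82@ and f822@ match, f821@ does not.
star=$(grep 'f82*@' "$tmp" | tr '\n' ' ')
# .* repeats "any character": anything may sit between f82 and @.
dot_star=$(grep 'f82.*@' "$tmp" | tr '\n' ' ')

echo "f82*@  matches: $star"
echo "f82.*@ matches: $dot_star"
rm -f "$tmp"
```

The first pattern accepts f8@host because zero repetitions are allowed, while the second requires the literal 2 and therefore rejects it.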

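5. Basic operations

$ ls -l | grep '^a'
    Filters the output of ls -l through a pipe and shows only the lines that begin with a.
$ grep 'test' d*
    Shows the lines containing test in all files whose names begin with d.
$ grep 'test' aa bb cc
    Shows the lines matching test in the files aa, bb, and cc.
$ grep '[a-z]\{5\}' aa
    Shows all lines that contain a run of at least 5 consecutive lowercase letters.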
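$ grep 'w\(es\)t.*\1' aa
    If west matches, es is saved in memory and labeled as group 1. grep then looks for any number of characters (.*) followed by another es (\1), and prints the line if one is found. With egrep or grep -E, the parentheses need no backslash escaping, so the pattern can be written directly as 'w(es)t.*\1'.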
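
The two spellings of the back-reference pattern can be checked against each other. The sample sentences below are made up; note that back-references with -E are an extension offered by common implementations such as GNU grep, not something every grep is guaranteed to support.

```shell
#!/bin/sh
# Hypothetical sample sentences.
tmp=$(mktemp)
printf '%s\n' 'the west coast has the best views' 'west wind' \
    'northwest and southwest' > "$tmp"

# Basic syntax: the group is written \( \) and recalled with \1.
bre=$(grep -c 'w\(es\)t.*\1' "$tmp")
# Extended syntax (-E): plain parentheses form the group.
ere=$(grep -Ec 'w(es)t.*\1' "$tmp")

echo "basic=$bre extended=$ere"
rm -f "$tmp"
```

Both counts come out as 2: the first and third lines contain west followed later by another es, while "west wind" has no second es.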

Examples taken from: http://book.51cto.com/art/200903/113280.htm