Regular Expressions
wushuai1346
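Regular Expression Study Guide (1) ---- Introduction
Before we begin: there are many introductory articles on regular expressions, such as deerchao's "30-Minute Introduction to Regular Expressions", Microsoft's own MSDN documentation, various translated articles, and the regex e-books you can find online. This article is a translation of the RegexBuddy help documentation. You may have read "Regular Expressions Made Simple" before, but you may not know that it translates only the Quick Start part of the RegexBuddy help. If you want a deeper …
Translation · 2012-01-17 10:36:31 · 806 views · 0 comments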
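Regular Expression Study Guide (2) ---- Tutorial Table of Contents
Regular Expression Tutorial Table of Contents. This tutorial teaches you every aspect of regular expressions. Each topic assumes you have read and understood the previous ones, so if you are new to regular expressions, I recommend reading the articles in order. Introduction: outlines the scope of this tutorial and the regex flavors we will discuss, and introduces some basic terminology. Literal Characters and Metacharacters: the simplest regular expressions consist only of literal characters. Some characters have a special meaning in a regex, …
Translation · 2012-01-17 11:29:34 · 639 views · 0 comments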
Regular Expression Study Guide (25) ---- Free-Spacing Regexes
Free-Spacing Regular Expressions. The JGsoft engine, .NET, Java, Perl, PCRE, Python, Ruby and XPath support a variant of the regular expression syntax called free-spacing mode. You can turn on thi…
Translation · 2012-01-18 13:05:23 · 551 views · 0 comments
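As a small illustration of free-spacing mode, here is a sketch in Python, where the mode is switched on with the `re.VERBOSE` flag. The date pattern and the sample text are made up for this example and are not taken from the original article.

```python
import re

# In free-spacing mode, whitespace inside the pattern is ignored
# and "#" starts a comment that runs to the end of the line.
DATE = re.compile(r"""
    (\d{4})  # year
    -
    (\d{2})  # month
    -
    (\d{2})  # day
""", re.VERBOSE)

match = DATE.search("posted 2012-01-18 13:05:23")
parts = match.groups()
```

The same pattern written without the flag would be `(\d{4})-(\d{2})-(\d{2})`; the layout changes, the matching does not.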
Regular Expression Study Guide (21) ---- If-Then-Else Conditionals
If-Then-Else Conditionals in Regular Expressions. A special construct (?ifthen|else) allows you to create conditional regular expressions. If the if part evaluates to true, then the regex engine will a…
Translation · 2012-01-18 13:01:07 · 2091 views · 0 comments
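A minimal sketch of a conditional, assuming Python's `re` module, which supports the form `(?(1)then|else)` where the "if" part is a reference to a capturing group. The pattern and the helper name `is_balanced` are hypothetical examples, not taken from the article.

```python
import re

# Group 1 optionally captures "(". The conditional (?(1)\)) requires a
# closing ")" only when group 1 took part in the match; the else part is empty.
BALANCED = re.compile(r"^(\()?\d+(?(1)\))$")

def is_balanced(text):
    """Return True for digits that are either bare or fully parenthesized."""
    return BALANCED.match(text) is not None
```

With this pattern, `is_balanced("(123)")` and `is_balanced("123")` succeed, while a lone opening or closing parenthesis makes the match fail.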
Regular Expression Study Guide (22) ---- XML Character Classes
XML Schema Character Classes. XML Schema Regular Expressions support the usual six shorthand character classes, plus four more. These four aren't supported by any other regular expression flavor. \i ma…
Translation · 2012-01-18 13:01:36 · 704 views · 0 comments
Regular Expression Study Guide (23) ---- POSIX Bracket Expressions
POSIX Bracket Expressions. POSIX bracket expressions are a special kind of character classes. POSIX bracket expressions match one character out of a set of characters, just like regular character clas…
Translation · 2012-01-18 13:03:25 · 953 views · 0 comments
Regular Expression Study Guide (24) ---- Adding Comments
Adding Comments to Regular Expressions. If you have worked through the entire tutorial, I guess you will agree that regular expressions can quickly become rather cryptic. Therefore, many modern regex…
Translation · 2012-01-18 13:04:32 · 411 views · 0 comments
30-Minute Introduction to Regular Expressions
Repost · 2012-01-06 13:29:49 · 903 views · 0 comments
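Regular Expression Language Elements
Regular Expression Language Elements (reposted from MSDN). .NET Framework 4. A regular expression is a pattern that the regular expression engine attempts to match in input text. A pattern consists of one or more character literals, operators, or constructs. For a brief introduction, see .NET Framework Regular Expressions. Each section in this quick reference lists what can be used to define regular expressions: characters, oper…
Repost · 2012-01-18 13:06:48 · 628 views · 0 comments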
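Regular Expression Study Guide (3) ---- Characters
Literal Characters. The most basic regular expression consists of a single literal character, e.g. "a". It matches the first occurrence of that character in the string. If the string is "Jack is a boy", it matches the "a" after the "J". Whether this "a" sits in the middle of a word does not matter to the regex engine. If it matters to you, you need to tell the engine so by using word boundaries; we will get to that later. This regex can match the second "a" too, but you have to tell the engine so: after the first match has ended, …
Translation · 2012-01-17 11:30:53 · 587 views · 0 comments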
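The "Jack is a boy" example can be tried out directly. This is a minimal sketch assuming Python's `re` module: `re.search` stops at the first match, and `re.finditer` is one way of telling the engine to continue after each match.

```python
import re

text = "Jack is a boy"

# The literal regex "a" first matches the "a" right after the "J".
first = re.search("a", text)

# Asking the engine to keep going finds the second "a" as well.
positions = [m.start() for m in re.finditer("a", text)]
```

Here `first.start()` is 1 (the "a" in "Jack"), and `positions` holds the offsets of both matches.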
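The List.Exists Method and Several Ways to Remove Duplicate Characters from a String
Someone in the chat group raised a small question today: given the string "abcabc12345612ab", is there a way to remove the duplicate characters and output "abc123456"? My idea was to build a List first and store each character in it once it had been printed: if List.Exists finds the character, skip it; otherwise add it to the List and print it. I was only a moment away from finishing when I found that List.Exists does not work the way I expected: what it takes is not a string, but …
Original · 2012-02-03 15:56:18 · 3972 views · 0 comments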
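The deduplication task itself can be sketched in a few lines. This is a Python illustration, not the author's C# `List.Exists` approach; the function name `dedupe_chars` is made up for the example. It relies on `dict` preserving insertion order, so only the first occurrence of each character is kept.

```python
def dedupe_chars(text):
    """Drop repeated characters, keeping each first occurrence in order."""
    # dict.fromkeys builds one key per distinct character, in first-seen order.
    return "".join(dict.fromkeys(text))

result = dedupe_chars("abcabc12345612ab")
```

For the string from the question, `result` is "abc123456".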
Regular Expression Study Guide (20) ---- Continuing from The Previous Match
Continuing at The End of The Previous Match. The anchor \G matches at the position where the previous match ended. During the first match attempt, \G matches at the start of the string in the way \A…
Translation · 2012-01-17 12:16:48 · 1109 views · 0 comments
Regular Expression Study Guide (19) ---- Testing The Same Part of a String for More Than One
Testing The Same Part of a String for More Than One Requirement. Lookaround, which I introduced in detail in the previous topic, is a very powerful concept. Unfortunately, it is often underused by peo…
Translation · 2012-01-17 12:14:50 · 632 views · 0 comments
Regular Expression Study Guide (17) ---- Atomic Grouping
Atomic Grouping. An atomic group is a group that, when the regex engine exits from it, automatically throws away all backtracking positions remembered by any tokens inside the group. Atomic groups are…
Translation · 2012-01-17 12:10:17 · 1009 views · 0 comments
Regular Expression Study Guide (4) ---- How a Regex Engine Works Internally
First Look at How a Regex Engine Works Internally. Knowing how the regex engine works will enable you to craft better regexes more easily. It will help you understand quickly why a particular regex do…
Translation · 2012-01-17 11:31:34 · 621 views · 0 comments
Regular Expression Study Guide (5) ---- Character Classes
Character Classes or Character Sets. With a "character class", also called "character set", you can tell the regex engine to match only one out of several characters. Simply place the characters you w…
Translation · 2012-01-17 11:35:02 · 4098 views · 1 comment
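A short sketch of a character class, assuming Python's `re` module. The pattern `gr[ae]y` and the sample text are illustrative choices, not taken from the excerpt above.

```python
import re

# [ae] matches exactly one character: either "a" or "e".
words = re.findall(r"gr[ae]y", "gray grey groy graey")
```

Only "gray" and "grey" are found: "groy" has a character outside the class, and "graey" has two characters where the class allows one.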
Regular Expression Study Guide (6) ---- Dot (Any Character)
The Dot Matches (Almost) Any Character. In regular expressions, the dot or period is one of the most commonly used metacharacters. Unfortunately, it is also the most commonly misused metacharacter.
Translation · 2012-01-17 11:36:21 · 868 views · 0 comments
Regular Expression Study Guide (7) ---- Start of String and End of String Anchors
Start of String and End of String Anchors. Thus far, I have explained literal characters and character classes. In both cases, putting one in a regex will cause the regex engine to try to match a sing…
Translation · 2012-01-17 11:38:42 · 1088 views · 0 comments
Regular Expression Study Guide (8) ---- Word Boundaries
Word Boundaries. The metacharacter \b is an anchor like the caret and the dollar sign. It matches at a position that is called a "word boundary". This match is zero-length. There are three different…
Translation · 2012-01-17 11:41:28 · 2209 views · 0 comments
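To make the zero-length nature of `\b` concrete, here is a minimal sketch assuming Python's `re` module; the sample sentence is made up for the example.

```python
import re

text = "This island is beautiful"

# Without boundaries, "is" also matches inside "This" and "island".
loose = [m.start() for m in re.finditer(r"is", text)]

# \b consumes no characters; it only asserts a word boundary at that position.
strict = [m.start() for m in re.finditer(r"\bis\b", text)]
```

`loose` contains three offsets, while `strict` contains only the offset of the standalone word "is".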
Regular Expression Study Guide (9) ---- Alternation
I already explained how you can use character classes to match a single character out of several possible characters. Alternation is similar. You can use alternation to match a single regular expressi…
Translation · 2012-01-17 11:42:04 · 1575 views · 0 comments
Regular Expression Study Guide (10) ---- Making a Token Optional
Optional Items. The question mark makes the preceding token in the regular expression optional. E.g.: colou?r matches both colour and color. You can make several tokens optional by grouping them tog…
Translation · 2012-01-17 12:01:00 · 444 views · 0 comments
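The `colou?r` example can be checked with a few lines of Python (a sketch assuming the `re` module; the sample text is made up):

```python
import re

# "u?" makes the single preceding token, the letter "u", optional.
spellings = re.findall(r"colou?r", "color colour colr")
```

Both spellings are found, while "colr" is not, because only the "u" is optional.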
Regular Expression Study Guide (11) ---- Quantifiers (Repetition)
Repetition with Star and Plus. I already introduced one repetition operator or quantifier: the question mark. It tells the engine to attempt to match the preceding token zero times or once, in effect mak…
Translation · 2012-01-17 12:02:04 · 969 views · 0 comments
Regular Expression Study Guide (12) ---- Grouping and Backreferences
Use Round Brackets for Grouping. By placing part of a regular expression inside round brackets or parentheses, you can group that part of the regular expression together. This allows you to apply a re…
Translation · 2012-01-17 12:04:30 · 1687 views · 0 comments
Regular Expression Study Guide (13) ---- Named Capturing Groups
Named Capturing Groups. All modern regular expression engines support capturing groups, which are numbered from left to right, starting with one. The numbers can then be used in backreferences to mat…
Translation · 2012-01-17 12:05:27 · 1749 views · 0 comments
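A brief sketch of numbered versus named groups, assuming Python's `re` module, whose syntax is `(?P<name>...)` for a named group and `(?P=name)` for a named backreference. The group names and sample strings are invented for the example.

```python
import re

# Named groups are still numbered left to right, starting with one.
stamp = re.search(r"(?P<year>\d{4})-(?P<month>\d{2})", "2012-01-17 12:05:27")
year_by_name = stamp.group("year")
year_by_number = stamp.group(1)

# A named backreference matches the same text the group captured earlier.
doubled = re.search(r"\b(?P<word>\w+) (?P=word)\b", "the the cat")
```

`year_by_name` and `year_by_number` refer to the same captured text, and `doubled` matches the repeated word.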
Regular Expression Study Guide (14) ---- Unicode
Unicode Regular Expressions. Unicode is a character set that aims to define all characters and glyphs from all human languages, living and dead. With more and more software being required to support m…
Translation · 2012-01-17 12:06:50 · 3631 views · 0 comments
Regular Expression Study Guide (15) ---- Mode Modifiers
Regex Matching Modes. Most regular expression engines discussed in this tutorial support the following four matching modes: /i makes the regex match case insensitive. /s enables "single-line mode".
Translation · 2012-01-17 12:08:28 · 577 views · 0 comments
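The two modes named in the excerpt can be sketched as follows. Python does not use the `/i` and `/s` suffix notation; this example assumes its equivalents, the `re.IGNORECASE` and `re.DOTALL` flags. The sample strings are made up.

```python
import re

# /i: case-insensitive matching.
case_hit = re.search(r"regex", "RegEx tutorial", re.IGNORECASE)

# By default the dot does not match a line break ...
dot_default = re.search(r"a.b", "a\nb")

# ... but in "single-line mode" (/s) it does.
dot_single_line = re.search(r"a.b", "a\nb", re.DOTALL)
```

`dot_default` is `None`, while the other two searches succeed.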
Regular Expression Study Guide (16) ---- Possessive Quantifiers
Possessive Quantifiers. When discussing the repetition operators or quantifiers, I explained the difference between greedy and lazy repetition. Greediness and laziness determine the order in which the…
Translation · 2012-01-17 12:09:22 · 1573 views · 0 comments
Regular Expression Study Guide (18) ---- Lookahead and Lookbehind
Lookahead and Lookbehind Zero-Width Assertions. Perl 5 introduced two very powerful constructs: "lookahead" and "lookbehind". Collectively, these are called "lookaround". They are also called "zero-wi…
Translation · 2012-01-17 12:11:01 · 2483 views · 0 comments
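A minimal sketch of both constructs, assuming Python's `re` module, which supports `(?=...)` for lookahead and `(?<=...)` for fixed-width lookbehind. The sample sentence is invented for the example.

```python
import re

text = "cost: $45 and 30 items"

# Lookbehind: digits that directly follow a "$"; the "$" is not part of the match.
prices = re.findall(r"(?<=\$)\d+", text)

# Lookahead: digits that are directly followed by " items"; that text is not consumed.
counts = re.findall(r"\d+(?= items)", text)
```

In both cases the assertion only checks the surrounding text, so the results contain the digits alone.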
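Scraping Information from Web Pages
Seeing that many people in the community need to scrape information from web pages, I am posting a demo I made myself for reference. Feedback is welcome, so that we can learn together and improve this small program. Tools: Dreamweaver, RegexBuddy. Scraping takes three steps: 1) download the page data; 2) process the page to extract the data you want; 3) save the data. 1. Downloading the page source. There are many ways to implement this step, but the goal is always the same: given a page URL, obtain …
Original · 2011-12-28 13:00:55 · 5083 views · 0 comments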
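The three steps (download, process, save) can be sketched roughly as below. This is a Python illustration using only the standard library, not the author's demo; the function names, the `<title>` pattern, and the UTF-8 assumption are all hypothetical choices for the example.

```python
import re
import urllib.request

# Hypothetical extraction rule: the text of every <title> element.
TITLE = re.compile(r"<title>(.*?)</title>", re.IGNORECASE | re.DOTALL)

def download(url):
    """Step 1: given a page URL, fetch its source (assumes UTF-8 text)."""
    with urllib.request.urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")

def extract_titles(html):
    """Step 2: process the page source and pull out the wanted data."""
    return [title.strip() for title in TITLE.findall(html)]

def save(lines, path):
    """Step 3: save the extracted data, one item per line."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write("\n".join(lines))
```

A run would chain the three calls, for example `save(extract_titles(download(url)), "titles.txt")` with a URL of your choice. For anything beyond simple, regular markup, an HTML parser is usually more robust than a regex.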