1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
|
1
00:00:00,000 --> 00:00:04,034
Now we're gonna talk about text
processing. The most basic and fundamental
2
00:00:04,034 --> 00:00:09,016
tool we have for text processing is the
regular expression. And regular expression
3
00:00:09,016 --> 00:00:13,068
is a formal language for specifying text
strings. So let's suppose that we're
4
00:00:13,068 --> 00:00:18,069
looking for woodchucks in a text document,
Woodchucks can be expressed in a number of
5
00:00:18,069 --> 00:00:23,014
ways. We could have a singular woodchuck,
we could have the plural S at the end. We
6
00:00:23,014 --> 00:00:26,093
could have a capital letter at the
beginning, or a lower case, and any |
Partager