正则表达式中的其他构造

项目
05/09/2023

.NET 中的正则表达式包括三个其他语言构造。其中一个使你可以在正则表达式模式中间启用或禁用特定匹配选项。其余两个使你可以在正则表达式中包含注释。

内联选项

可以使用语法为正则表达式的一部分设置或禁用特定模式匹配选项

(?imnsx-imnsx)

在问号后列出要启用的选项，在负号后列出要禁用的选项。下表对每个选项进行了描述。有关每个选项的更多信息，请参见正则表达式选项。

选项	描述
`i`	不区分大小写的匹配。
`m`	多行模式。
`n`	仅显式捕获。（圆括号不充当捕获组。）
`s`	单行模式。
`x`	忽略未转义空格，并允许 x 模式注释。

如果 (?imnsx-imnsx) 构造定义的正则表达式选项有任何更改，更改在封闭组结束前一直有效。

注意

(?imnsx-imnsx:subexpression) 分组构造为子表达式提供了完全相同的功能。有关详细信息，请参阅分组构造。

下面的示例使用 i、n 和 x 选项，启用不区分大小写和显式捕获，并在正则表达式中间忽略正则表达式模式中的空格。

using System;
using System.Text.RegularExpressions;

public class Example
{
   public static void Main()
   {
      string pattern;
      string input = "double dare double Double a Drooling dog The Dreaded Deep";

      pattern = @"\b(D\w+)\s(d\w+)\b";
      // Match pattern using default options.
      foreach (Match match in Regex.Matches(input, pattern))
      {
         Console.WriteLine(match.Value);
         if (match.Groups.Count > 1)
            for (int ctr = 1; ctr < match.Groups.Count; ctr++)
               Console.WriteLine("   Group {0}: {1}", ctr, match.Groups[ctr].Value);
      }
      Console.WriteLine();

      // Change regular expression pattern to include options.
      pattern = @"\b(D\w+)(?ixn) \s (d\w+) \b";
      // Match new pattern with options.
      foreach (Match match in Regex.Matches(input, pattern))
      {
         Console.WriteLine(match.Value);
         if (match.Groups.Count > 1)
            for (int ctr = 1; ctr < match.Groups.Count; ctr++)
               Console.WriteLine("   Group {0}: '{1}'", ctr, match.Groups[ctr].Value);
      }
   }
}
// The example displays the following output:
//       Drooling dog
//          Group 1: Drooling
//          Group 2: dog
//
//       Drooling dog
//          Group 1: 'Drooling'
//       Dreaded Deep
//          Group 1: 'Dreaded'

Imports System.Text.RegularExpressions

Module Example
    Public Sub Main()
        Dim pattern As String
        Dim input As String = "double dare double Double a Drooling dog The Dreaded Deep"

        pattern = "\b(D\w+)\s(d\w+)\b"
        ' Match pattern using default options.
        For Each match As Match In Regex.Matches(input, pattern)
            Console.WriteLine(match.Value)
            If match.Groups.Count > 1 Then
                For ctr As Integer = 1 To match.Groups.Count - 1
                    Console.WriteLine("   Group {0}: {1}", ctr, match.Groups(ctr).Value)
                Next
            End If
        Next
        Console.WriteLine()

        ' Change regular expression pattern to include options.
        pattern = "\b(D\w+)(?ixn) \s (d\w+) \b"
        ' Match new pattern with options. 
        For Each match As Match In Regex.Matches(input, pattern)
            Console.WriteLine(match.Value)
            If match.Groups.Count > 1 Then
                For ctr As Integer = 1 To match.Groups.Count - 1
                    Console.WriteLine("   Group {0}: '{1}'", ctr, match.Groups(ctr).Value)
                Next
            End If
        Next
    End Sub
End Module
' The example displays the following output:
'       Drooling dog
'          Group 1: Drooling
'          Group 2: dog
'       
'       Drooling dog
'          Group 1: 'Drooling'
'       Dreaded Deep
'          Group 1: 'Dreaded'

该示例定义两个正则表达式。第一个 \b(D\w+)\s(d\w+)\b 匹配以一个大写“D”和一个小写“d”开头的两个连续单词。第二个正则表达式 \b(D\w+)(?ixn) \s (d\w+) \b 使用内联选项修改此模式，如下表所述。结果的比较会确认 (?ixn) 构造的效果。

模式	描述
`\b`	在单词边界处开始。
`(D\w+)`	匹配后跟一个或多个单词字符的大写“D”。这是第一个捕获组。
`(?ixn)`	从此处起，使比较不区分大小写，仅进行显式捕获，以及忽略正则表达式模式中的空格。
`\s`	与空白字符匹配。
`(d\w+)`	匹配后跟一个或多个单词字符的大写或小写“d”。因为 `n`（显式捕获）选项已启用，所以不会捕获此组。
`\b`	与字边界匹配。

内联注释

(?# comment) 构造可用于在正则表达式中添加内联注释。正则表达式引擎在模式匹配中不使用注释的任何部分，尽管注释仍包含在 Regex.ToString 方法返回的字符串中。该注释在第一个右括号处终止。

下面的示例重复了上一部分的示例中的第一个正则表达式模式。它将两个内联注释添加到该正则表达式，以指示比较是否区分大小写。正则表达式模式 \b((?# case-sensitive comparison)D\w+)\s(?ixn)((?#case-insensitive comparison)d\w+)\b 按以下方式定义。

模式	描述
`\b`	在单词边界处开始。
`(?# case-sensitive comparison)`	注释。它不影响模式匹配行为。
`(D\w+)`	匹配后跟一个或多个单词字符的大写“D”。这是第一个捕获组。
`\s`	与空白字符匹配。
`(?ixn)`	从此处起，使比较不区分大小写，仅进行显式捕获，以及忽略正则表达式模式中的空格。
`(?#case-insensitive comparison)`	注释。它不影响模式匹配行为。
`(d\w+)`	匹配后跟一个或多个单词字符的大写或小写“d”。这是第二个捕获组。
`\b`	与字边界匹配。

using System;
using System.Text.RegularExpressions;

public class Example
{
   public static void Main()
   {
      string pattern = @"\b((?# case-sensitive comparison)D\w+)\s(?ixn)((?#case-insensitive comparison)d\w+)\b";
      Regex rgx = new Regex(pattern);
      string input = "double dare double Double a Drooling dog The Dreaded Deep";

      Console.WriteLine("Pattern: " + pattern.ToString());
      // Match pattern using default options.
      foreach (Match match in rgx.Matches(input))
      {
         Console.WriteLine(match.Value);
         if (match.Groups.Count > 1)
         {
            for (int ctr = 1; ctr <match.Groups.Count; ctr++)
               Console.WriteLine("   Group {0}: {1}", ctr, match.Groups[ctr].Value);
         }
      }
   }
}
// The example displays the following output:
//    Pattern: \b((?# case-sensitive comparison)D\w+)\s(?ixn)((?#case-insensitive comp
//    arison)d\w+)\b
//    Drooling dog
//       Group 1: Drooling
//    Dreaded Deep
//       Group 1: Dreaded

Imports System.Text.RegularExpressions

Module Example
    Public Sub Main()
        Dim pattern As String = "\b((?# case-sensitive comparison)D\w+)\s(?ixn)((?#case-insensitive comparison)d\w+)\b"
        Dim rgx As New Regex(pattern)
        Dim input As String = "double dare double Double a Drooling dog The Dreaded Deep"

        Console.WriteLine("Pattern: " + pattern.ToString())
        ' Match pattern using default options.
        For Each match As Match In rgx.Matches(input)
            Console.WriteLine(match.Value)
            If match.Groups.Count > 1 Then
                For ctr As Integer = 1 To match.Groups.Count - 1
                    Console.WriteLine("   Group {0}: {1}", ctr, match.Groups(ctr).Value)
                Next
            End If
        Next
    End Sub
End Module
' The example displays the following output:
'    Pattern: \b((?# case-sensitive comparison)D\w+)\s(?ixn)((?#case-insensitive comp
'    arison)d\w+)\b
'    Drooling dog
'       Group 1: Drooling
'    Dreaded Deep
'       Group 1: Dreaded

行尾注释

数字符号 (#) 会标记 x 模式注释，该模式从正则表达式模式末尾的未转义 # 字符开始，并继续到行尾。若要使用此构造，必须启用 x 选项（通过内联选项），或在实例化 Regex 对象或调用静态 Regex 方法时向 option 参数提供 RegexOptions.IgnorePatternWhitespace 值。

下面的示例说明行尾注释构造。它确定字符串是否为包含至少一个格式项的复合格式字符串。下表描述了正则表达式模式中的构造：

\{\d+(,-*\d+)*(\:\w{1,4}?)*\}(?x) # Looks for a composite format item.

模式	描述
`\{`	匹配左大括号。
`\d+`	匹配一个或多个十进制数字。
`(,-\d+)`	与零个或一个后跟一个可选负号、再后跟一个或多个十进制数字的逗号匹配。
`(\:\w{1,4}?)*`	与零个或一个后跟一到四个（但尽可能少）空白字符的冒号匹配。
`\}`	匹配右大括号。
`(?x)`	启用忽略模式空格选项，以便识别行尾注释。
`# Looks for a composite format item.`	行尾注释。

using System;
using System.Text.RegularExpressions;

public class Example
{
   public static void Main()
   {
      string pattern = @"\{\d+(,-*\d+)*(\:\w{1,4}?)*\}(?x) # Looks for a composite format item.";
      string input = "{0,-3:F}";
      Console.WriteLine("'{0}':", input);
      if (Regex.IsMatch(input, pattern))
         Console.WriteLine("   contains a composite format item.");
      else
         Console.WriteLine("   does not contain a composite format item.");
   }
}
// The example displays the following output:
//       '{0,-3:F}':
//          contains a composite format item.

Imports System.Text.RegularExpressions

Module Example
    Public Sub Main()
        Dim pattern As String = "\{\d+(,-*\d+)*(\:\w{1,4}?)*\}(?x) # Looks for a composite format item."
        Dim input As String = "{0,-3:F}"
        Console.WriteLine("'{0}':", input)
        If Regex.IsMatch(input, pattern) Then
            Console.WriteLine("   contains a composite format item.")
        Else
            Console.WriteLine("   does not contain a composite format item.")
        End If
    End Sub
End Module
' The example displays the following output:
'       '{0,-3:F}':
'          contains a composite format item.

请注意，还可以调用 Regex.IsMatch(String, String, RegexOptions) 方法并向它传递 RegexOptions.IgnorePatternWhitespace 枚举值，从而识别注释，而不用在正则表达式中提供 (?x) 构造。

请参阅

正则表达式语言 - 快速参考

正则表达式中的其他构造

内联选项

内联注释

行尾注释

请参阅

反馈

其他资源