Introduction to Pattern Matching and Search with Grep

Welcome! In this lesson, we will explore the powerful tool grep in Bash for pattern matching and searching within files. grep stands for "Global Regular Expression Print," and it is widely used for filtering and searching through text data. Think of grep like automating Ctrl+F within your scripts. Mastering grep will make your text processing tasks easier and more efficient. Let's get started!

Basic Search

Let's start with a basic search using grep. The basic command syntax is:

This command searches for the specified pattern within the file. Here's an example:

Output:

This query outputs only the lines that contain is anywhere in the line. Notice that "This line will match" is included because "This" contains the substring "is".

Search for Whole Words Only

To search for a whole word rather than a substring, use the -w option. Let's take a look:

Output:

Even though "This" contains the substring "is", only the lines with "is" as a whole word are matched.

Case-Insensitive Search

Sometimes, you need to perform a search without considering the case (uppercase or lowercase). The -i option allows for case-insensitive searches. For example:

Output:

In this example, "Hello" and "HELLO" are both matched because -i ignores case differences.

Search for Lines Beginning with a Pattern

To find lines that begin with a specific pattern, you can use the caret (^) symbol. For example, ^Hello will match any lines that start with Hello. This anchor signifies the start of a line in regular expressions. Let's look at an example:

Output:

Only the lines that begin with Hello are matched. The caret ^ ensures that the pattern must appear at the start of the line.

Show Line Numbers

If you want to know the line numbers of matching patterns, you can use the -n option. Let's look at an example:

Output

Both lines 2 and 4 contain "Grep". Notice that line numbers start at 1, not 0.

Invert Match

To find lines that do not match a given pattern, use the -v option. Suppose we have the following script:

Output:

Using -v, only lines that do not contain "Hello" are printed

Count Matches

To count the number of lines matching a pattern, use the -c option. For example:

Output:

Only 2 of the 3 lines contain the word "file". -c only counts matching lines. "This file is a test file" contains "file" twice, but the line is only counted once.

Search for Multiple Patterns (OR Logic)

You can simulate OR logic using the -e option. This allows you to search for lines that contain any of the provided patterns. Here's an example:

Output:

Only lines that contain grep and/or Hello are matched.

Show Lines Matching Multiple Patterns (AND Logic)

You can simulate AND logic by pipeing together multiple patterns. For example:

Output:

grep -i 'grep' file.txt matches the 3 lines that contain grep (case-insensitive). These 3 lines are then sent as input to grep -i 'hello' which matches only the lines that contain hello (case-insensitive.)

Summary and Next Steps

Great job on learning various techniques to search and match patterns using grep in Bash. In this lesson you learned how to:

  1. Perform basic searches using grep
  2. Use -w to search for whole words
  3. Use -i to conduct case-insensitive searches
  4. Use -n to show line numbers of matching patterns
  5. Use -v to invert matches
  6. Use -c to count the number of matching lines
  7. Use -e to implement OR logic to search for multiple patterns
  8. Use | to implement AND logic by piping multiple commands together

These skills will enhance your ability to manipulate and search through text files efficiently. Now it’s time to practice what you’ve learned. Head over to the practice section and start applying these grep commands in real-world scenarios. Happy scripting!

Sign up
Join the 1M+ learners on CodeSignal
Be a part of our community of 1M+ users who develop and demonstrate their skills on CodeSignal