When we use grep or sed in our business, we often use word matching to search and replace.
In this case, it is not uncommon that the exact match is not found.
In this article, we will record the regular expression notation for doing so with a high probability of an exact match.
Result
A description of the results is given first.
※It is designed to work with BSD-style grep and sed on Mac.
# Data Example$ cat data
10,30,50,100,150,250,300,500
# grep only 50 of the data examples$ cat data | grep-o'[[:<:]]50[[:>:]]'
50
# Replace only 50 of the example data with "match".$ cat data.txt | sed's;[[:<:]]50[[:>:]];match;g'
10,30,match,100,150,250,300,500
Description
The above example shows the process of search and replace for arbitrary data.
The method used here is to enclose the word like [[:<:]]word[[:>:]].
grep
In the first place, grep extracts not only words but also lines.
To extract only words, use option and use grep -o word.
However, in the above data example, there is a partial match as shown below, and four are extracted.
$ cat data | grep-o 50
50
50
50
50
From these things, we use the [[:<:]]word[[:>:]] which has the following meanings.
Since it matches the beginning and end of a word, it does not recognize the character strings before and after.
sed
The same is true for sed. If you replace it with the string as it is, it will be replaced 4 times.
We built pgai Vectorizer to simplify embedding management for AI applications—without needing a separate database or complex infrastructure. Since launch, developers have created over 3,000 vectorizers on Timescale Cloud, with many more self-hosted.
Coherence is the first Platform-as-a-Service you can control. Unlike "black-box" platforms that are opinionated about the infra you can deploy, Coherence is powered by CNC, the open-source IaC framework, which offers limitless customization.
Top comments (0)