Skip to main content

regex: Using notepad++

Goal: Learn and practice regex

. : zero or one character

[wildcards]
. : any character (t.p matches tip, top, tap...)
* : zero or more times (.* matches all letter in 'tip')
? : zero or one time (tips? matches 'tip' or 'tips' - with s or without s)
+ : one or more times (tip.+ matches tips but not tip)
x|y : x or y (tip|top match both tip or top)
\ : escape any special character (where\? matches where?
[Anchors]
^ : start of a string (^T matches 'This is an example.')
$ : end of a string ($s matches 'There are many users')
[Groups]
[x|X] : matches either x or X ([C|c]at matches Cat or cat)
{n} : matches n number of times (.{3} matches The in There - 3 characters)

Real life example: Each line, remove everything except e-mail address


Line 1: era@gmail.com
Line 22: zeo0326@gmail.com
Line 39: ro97xu@gmail.com
Line 95: zeroday224@gmail.com
Line 1002: dhs@gmail.com
Line 50044: aint@gmail.com
Line 61345: se7en@gmail.com
Line 7893465: tn@gmail.com

Use notepad++ freeware.

Dùng tính năng tìm kiếm và thay thế (Ctrl + H)
PHẢI chọn Regular expression trong phần Search Mode


Find what: "^.*(\<[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z][A-Za-z][A-Za-z]?[A-Za-z]?\>).*$" (không nhập dấu nháy kép)
Replace with: \1

Thử nghiệm trước để kiểm tra:
Nhấn FindFind Next vài lần để xem có tìm đúng nội dung cần xóa không.

Nhấn Replace All


Commonly used "patterns" in search/replace


- Email address: [A-Z0-9._%+-]+@[A-Z0-9.-]{3,65}\.[A-Z]{2,4}
- Search for emails ending with .vn or .com: [^\.vn][^\.com]$




Example: "${Keep everything inside}"

Search: "\$\{(.*[^{"\}])\}\"

Replace: \1

provider "oci" {
  region           = "${var.region}"
  tenancy_ocid     = "${var.tenancy_ocid}"
  user_ocid        = "${var.user_ocid}"
  fingerprint      = "${var.fingerprint}"
  private_key_path = "${var.private_key_path}"
}

Regex functions in Google Sheet

REGEXMATCH

REGEXREPLACE

REGEXEXTRACT

Regex functions in Microsoft Excel

New Regular expression (Regex) functions in Excel

REGEXTEST: Checks if any part of supplied text matches a regex pattern.

REGEXEXTRACT: Extracts one or more parts of supplied text that match a regex pattern.

REGEXREPLACE: Searches for a regex pattern within supplied text and replaces it with different text.

Resources

>>> https://regex101.com/

>>> https://regexr.com

Popular posts from this blog

PHP: Notes

" PHP was originally an abbreviation of Personal Home Page, but it now stands for the recursive initialism PHP: Hypertext Preprocessor. " — Wikipedia

Hydra: Notes

Hydra v9.1 (c) 2020 by van Hauser/THC & David Maciejak - Please do not use in military or secret service organizations, or for illegal purposes (this is non-binding, these *** ignore laws and ethics anyway).

Check and install php7.0-mbstring extention on Ubuntu 16.04 and Microsoft Windows

While learning Facebook SDK for PHP I overlooked one of system requirements preventing my code from running on my host. That is The mbstring extension. Facebook SDK for PHP's system requirements Check if mbstring is enabled root@ubuntu001:~# php -a Interactive mode enabled php > echo extension_loaded('mbstring'); php >exit root@ubuntu001:~# apt-cache search php7 | grep mbstring php7.0-mbstring - MBSTRING module for PHP root@ubuntu001:~# Install php7.0-mbstring root@ubuntu001:~# apt-get install php7.0-mbstring Reading package lists... Done Building dependency tree Reading state information... Done The following packages were automatically installed and are no longer required:   fontconfig-config fonts-dejavu-core libfontconfig1 libgd3 libjbig0   libjpeg-turbo8 libjpeg8 libmcrypt4 libtiff5 libvpx3 libxpm4 Use 'apt autoremove' to remove them. The following additional packages will be installed:   libapache2-mod-php7.0 php7.0-cli ph...