Skip to main content

regex: Using notepad++

Goal: Learn and practice regex

. : zero or one character

[wildcards]
. : any character (t.p matches tip, top, tap...)
* : zero or more times (.* matches all letter in 'tip')
? : zero or one time (tips? matches 'tip' or 'tips' - with s or without s)
+ : one or more times (tip.+ matches tips but not tip)
x|y : x or y (tip|top match both tip or top)
\ : escape any special character (where\? matches where?
[Anchors]
^ : start of a string (^T matches 'This is an example.')
$ : end of a string ($s matches 'There are many users')
[Groups]
[x|X] : matches either x or X ([C|c]at matches Cat or cat)
{n} : matches n number of times (.{3} matches The in There - 3 characters)

Real life example: Each line, remove everything except e-mail address


Line 1: era@gmail.com
Line 22: zeo0326@gmail.com
Line 39: ro97xu@gmail.com
Line 95: zeroday224@gmail.com
Line 1002: dhs@gmail.com
Line 50044: aint@gmail.com
Line 61345: se7en@gmail.com
Line 7893465: tn@gmail.com

Use notepad++ freeware.

Dùng tính năng tìm kiếm và thay thế (Ctrl + H)
PHẢI chọn Regular expression trong phần Search Mode


Find what: "^.*(\<[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z][A-Za-z][A-Za-z]?[A-Za-z]?\>).*$" (không nhập dấu nháy kép)
Replace with: \1

Thử nghiệm trước để kiểm tra:
Nhấn FindFind Next vài lần để xem có tìm đúng nội dung cần xóa không.

Nhấn Replace All


Commonly used "patterns" in search/replace


- Email address: [A-Z0-9._%+-]+@[A-Z0-9.-]{3,65}\.[A-Z]{2,4}
- Search for emails ending with .vn or .com: [^\.vn][^\.com]$




Example: "${Keep everything inside}"

Search: "\$\{(.*[^{"\}])\}\"

Replace: \1

provider "oci" {
  region           = "${var.region}"
  tenancy_ocid     = "${var.tenancy_ocid}"
  user_ocid        = "${var.user_ocid}"
  fingerprint      = "${var.fingerprint}"
  private_key_path = "${var.private_key_path}"
}

Regex functions in Google Sheet

REGEXMATCH

REGEXREPLACE

REGEXEXTRACT

Regex functions in Microsoft Excel

New Regular expression (Regex) functions in Excel

REGEXTEST: Checks if any part of supplied text matches a regex pattern.

REGEXEXTRACT: Extracts one or more parts of supplied text that match a regex pattern.

REGEXREPLACE: Searches for a regex pattern within supplied text and replaces it with different text.

Resources

>>> https://regex101.com/

>>> https://regexr.com

Popular posts from this blog

hmailserver: Notes from the field

hmailserver is one of free open source mail servers running on Microsoft Windows operating system.

stegseek: Notes

StegCracker has been retired following the release of StegSeek, which will blast through the rockyou.txt wordlist within 1.9 second as opposed to StegCracker which takes ~5 hours.

VMware ESXi host loads ipmi_si_drv too long

Workaround: remove ipmi_si_drv esxcli software vib remove --dry-run --vibname ipmi-ipmi-si-drv esxcli software vib remove --vibname ipmi-ipmi-si-drv ~ # esxcli software vib remove --dry-run --vibname ipmi-ipmi-si-drv Removal Result    Message: Dryrun only, host not changed. The following installers will be applied: [BootBankInstaller]    Reboot Required: true    VIBs Installed:    VIBs Removed: VMware_bootbank_ipmi-ipmi-si-drv_39.1-4vmw.550.0.0.1331820    VIBs Skipped: ~ # ~ # esxcli software vib remove --vibname ipmi-ipmi-si-drv Removal Result    Message: The update completed successfully, but the system needs to be rebooted for the changes to be effective.    Reboot Required: true    VIBs Installed:    VIBs Removed: VMware_bootbank_ipmi-ipmi-si-drv_39.1-4vmw.550.0.0.1331820    VIBs Skipped: ~ #