[JavaScript] Understanding the Regular Expression - Explained with Examples.

Understand the Regular Expression with an email input validation.

·

3 min read

[JavaScript] Understanding the Regular Expression - Explained with Examples.

Introduction

A regular expression is a pattern of characters. It is used to match with a certain string and see if it matches a certain condition.

Because it is too complicated, most developers do not memorize it, but you need to be able to understand it and know how to read it.

In this article, we will be using a regular expression to validate the user's email. This should help you to understand a regular expression.

In addition, this article talks about two methods that can be used with a regular expression:

  1. test()

  2. search()

  3. match()


Test(), Search() and Match()

Test() is used to test if the regular expression matches the string.

If it matches, it returns true. If not, it returns false.

Syntax:

let str = "123abc456def";
const regExp = /[0-9]{2}a/g; // regular expression 
console.log(regExp.test(str)); // true

Search() is a method that is used with a string.

It searches the first element that is given in the parameter and returns the index number.

If it cannot find one, it returns -1.

let str = "123abc456def";
const regExp = /[0-9]{2}a/g; // regular expression 
let result = str.search(regExp);
console.log(result); // 1

Match() puts matching values into an array and returns the array.

let str = "123abc456def";
const regExp = /[a-zA-Z]/g;
result = str.match(regExp);
console.log(result); // ['23a']

Regular Expression

We will be writing a regular expression to match or validate the user's email.

Possible email inputs:

The regular expression must match any possible email inputs.

const regexpEmail = 
/^([a-z]+\d*)+(\.?[a-z]+\d*)+@([a-z]+\d*)+(\.[a-z]{2,3})+$/;

Meanings of Characters:

  • ^ character - beginning of input.

  • $ character - ending of input.

  • plus (+) - one or more than one.

  • star(*) - zero or more than zero.

  • question mark(?) - only zero or one.

  • [a-z] - the lowercase letter must be placed.

  • \d - number must be placed.

  • \. - dot(.) must be placed.

    • We use backslash because dot(.) is a special character.

    • For example, if you want to ensure \d is placed, you can do: \\d because \d has a meaning in a regular expression.

I will explain each part:

([a-z]+\d*)+ means more than one lowercase letter and zero or more can be repeated more than one time becuase the whole thing is enclosed by the bracket and the plus(+) sign is at the back.

(\.?[a-z]+\d*)+ means dot(.) can be placed zero or one time only, then everything else is the same as above.

@([a-z]+\d*) means that @ symbol must be placed, then everything else is the same as above.

(\.?[a-z]{2, 3})+ is for the top level domain (TLD) like .com, .org, .net and .gov. It means that a dot(.) must come, then only a string with a length of either 2 or 3 can be placed. As the whole thing is encolsed by the bracket followed by the plus(+), this can be repeated one or more than one time.

Did you find this article valuable?

Support Lim Woojae by becoming a sponsor. Any amount is appreciated!