JS Regex url validation

Question

I tried to validate a URL with or without http, but no matter what I did, the function returned false. I checked my regex on http://regexr.com/ and it matched as I expected.

    function isUrlValid(userInput) {
        var regexQuery = "/(http(s)?://.)?(www\.)?[-a-zA-Z0-9@:%._\+~#=]{2,256}\.[a-z]{2,6}\b([-a-zA-Z0-9@:%_\+.~#?&//=]*)/";
        var url = new RegExp(regexQuery,"g");
        if (url.test(userInput)) {
            alert('Great, you entered an E-Mail-address');
            return true;
        }
        return false;
    }

I fixed the problem by changing .test to .match and leaving the regex as is.

URLs are like emails: using a regex to match them is error-prone; all the answers so far reject a lot of valid URL patterns and accept some strings that aren't valid ones.
– bfontaine, Commented Apr 25, 2024 at 14:21
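
One way around the problem this comment describes is to skip the regex and let the runtime's own parser decide. The sketch below swaps in the built-in URL constructor (available in browsers and Node.js), which none of the answers here use. The function name isHttpUrl and the rule "retry with http:// prepended when the input has no scheme" are assumptions made for illustration. The parser is lenient about host names, so treat this as a starting point rather than a strict validator.

```javascript
// Hypothetical helper: accepts http/https URLs, with or without the scheme.
function isHttpUrl(userInput) {
    // Try the input as typed, then with an assumed "http://" prefix,
    // because new URL() throws on scheme-less input such as "example.com".
    var candidates = [userInput, "http://" + userInput];
    for (var i = 0; i < candidates.length; i++) {
        try {
            var parsed = new URL(candidates[i]);
            if (parsed.protocol === "http:" || parsed.protocol === "https:") {
                return true;
            }
        } catch (e) {
            // Not parseable in this form; fall through to the next candidate.
        }
    }
    return false;
}

console.log(isHttpUrl("https://example.com/foo?bar=1"));
console.log(isHttpUrl("example.com"));
console.log(isHttpUrl("not a url"));
```

Checking the protocol explicitly matters here, because the constructor also parses schemes such as javascript: or mailto: without complaint.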

motis10 · Accepted Answer · 2015-06-21 23:26:26Z

45

I changed the function to use match, and I escaped the slashes in this part of the pattern, (http(s)?://.), which becomes (http(s)?:\/\/.). With both changes it works.

The fixed function:

function isUrlValid(userInput) {
    var res = userInput.match(/(http(s)?:\/\/.)?(www\.)?[-a-zA-Z0-9@:%._\+~#=]{2,256}\.[a-z]{2,6}\b([-a-zA-Z0-9@:%_\+.~#?&//=]*)/g);
    if(res == null)
        return false;
    else
        return true;
}
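
Why the literal works where the string did not can be shown with a smaller pattern. This is a minimal sketch (the shortened pattern and the variable names are made up for the example): when a pattern is passed to new RegExp as a string, the surrounding / characters become part of the pattern, \. collapses to a plain dot, and \b turns into a backspace character before the regex engine ever sees it.

```javascript
// As a string: the "/" delimiters are matched literally, and "\." is just ".".
var fromString = new RegExp("/(www\.)?example\.com/");
// As a regex literal: the delimiters are syntax and the escapes survive.
var fromLiteral = /(www\.)?example\.com/;

console.log(fromString.test("www.example.com"));    // false: no "/" around the input
console.log(fromString.test("/www.example.com/"));  // true: the slashes are required
console.log(fromLiteral.test("www.example.com"));   // true

// "\b" inside a string literal is a backspace character, not a word boundary.
var boundaryFromString = new RegExp("com\b");
var boundaryFromLiteral = /com\b/;

console.log(boundaryFromString.test("example.com"));  // false
console.log(boundaryFromLiteral.test("example.com")); // true
```

If the pattern has to be a string, every backslash needs doubling ("\\." and "\\b") and the leading and trailing slashes have to go.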

answered Jun 21, 2015 at 23:26

4 Comments

saruftw Over a year ago

Should be (http(s)?:\/\/.)?(www\.)?[-a-zA-Z0-9@:%._\+~#=]{2,256}\.[a-z]{2,6}\b([-a-zA-Z0-9@:%_\+.~#?&=]*) . Checking the above regex on regex101.com gives an error.

Abdallah M Yassin Over a year ago

This expression gives a lint warning: Unsafe Regular Expression

Marian07 Over a year ago

I used uibakery.io/regex-library/url because it has two versions: one with https and one without it.

Douglas C Dec 11, 2024 at 16:37

It does not work properly. It should not be marked as an accepted answer.

m69 ''snarky and unwelcoming'' · Answer · 2023-08-24 02:17:32Z

I believe the other answer will reject some valid URLs (like domain names in uppercase or long sub-domains) and allow some invalid ones (like www.-example-.com or www.%@&.com). I tried to take into account a number of additional URL syntax rules (without getting into internationalisation).

function isUrlValid(userInput) {
    var regexQuery = "^(https?:\\/\\/)?((([-a-z0-9]{1,63}\\.)*?[a-z0-9]([-a-z0-9]{0,253}[a-z0-9])?\\.[a-z]{2,63})|((\\d{1,3}\\.){3}\\d{1,3}))(:\\d{1,5})?((\\/|\\?)((%[0-9a-f]{2})|[-\\w\\+\\.\\?\\/@~#&=])*)?$";
    var url = new RegExp(regexQuery,"i");
    return url.test(userInput);
}
var input = ["https://a.long.sub-domain.example.com/foo/bar?foo=bar&boo=far#a%20b",
             "HTTP://EX-AMPLE.COM",
             "example.c",
             "example-.com",
             "www.police.academy",
             "https://x.com/?twitter?",
             "12.34.56.78:9000",
             "http://example.com?a=%bc&d=%ef&g=%H"];
for (var i = 0; i < input.length; i++) document.write(isUrlValid(input[i]) + ": " + input[i] + "<br>");

Here's a breakdown of the regex:

^                                      // start of URL

(                                      // protocol section
    https?                             // http or https
    :\\/\\/                            // colon and double slash
)?                                     // section can be omitted

(                                      // domain or IP address
    (
        (                              // sub-domain section
            [-a-z0-9]{1,63}            // 1 to 63 characters
            \\.                        // followed by dot
        )*?                            // any number of sections (lazy)

        [a-z0-9]                       // no hyphen at start
        (
            [-a-z0-9]{0,253}           // domain name
            [a-z0-9]                   // no hyphen at end
        )?                             // allow 1-letter domains

        \\.                            // dot
        [a-z]{2,63}                    // top-level domain
    )
    |                                  // or ...
    (                                  // IP address
        (
            \\d{1,3}                   // 1 to 3 digits
            \\.                        // followed by dot
        ){3}                           // three times
        \\d{1,3}                       // 1 to 3 digits
    )
)

(                                      // port section
    :                                  // colon
    \\d{1,5}                           // port number
)?                                     // section can be omitted

(                                      // file path and/or query section
    (                                  // section must start with ...
        \\/                            // slash
        |                              // or ...
        \\?                            // question mark
    )
    (
        (                              // escaped character
            %                          // percent
            [0-9a-f]{2}                // hex number
        )
        |                              // or ...
        [                              // literal character
            -                          // hyphen
            \\w                        // letter, digit or underscore
            \\+                        // plus
            \\.                        // dot
            \\?                        // question mark
            \\/                        // slash
            @~#&=                      // at, tilde, hash, ampersand, equal sign
        ]
    )*                                 // any number of characters
)?                                     // section can be omitted

$                                      // end of URL
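
The breakdown above can also be written as code. This is only a sketch: the variable names (protocol, subDomains, and so on) are labels I made up for the sections, and the pieces are copied from the regex string in the function, so concatenating them should give back the same pattern.

```javascript
// Each piece mirrors one commented section of the breakdown.
var protocol    = "(https?:\\/\\/)?";
var subDomains  = "([-a-z0-9]{1,63}\\.)*?";
var domain      = "[a-z0-9]([-a-z0-9]{0,253}[a-z0-9])?";
var topLevel    = "\\.[a-z]{2,63}";
var ipAddress   = "((\\d{1,3}\\.){3}\\d{1,3})";
var port        = "(:\\d{1,5})?";
var pathOrQuery = "((\\/|\\?)((%[0-9a-f]{2})|[-\\w\\+\\.\\?\\/@~#&=])*)?";

// Host is either a domain name (with optional sub-domains) or an IP address.
var host = "((" + subDomains + domain + topLevel + ")|" + ipAddress + ")";

var assembled = "^" + protocol + host + port + pathOrQuery + "$";
var assembledUrl = new RegExp(assembled, "i");

console.log(assembledUrl.test("HTTP://EX-AMPLE.COM"));
console.log(assembledUrl.test("12.34.56.78:9000"));
console.log(assembledUrl.test("example.c"));
```

Keeping the pieces separate makes it easier to swap one section, for example the port or the top-level domain rule, without recounting parentheses in the full string.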

Note that the regex is used in case-insensitive mode, because capital letters are allowed in every part of a URL.

Theoretically, there should always be a slash between the domain and a query, but in the wild you will find a lot of URLs with the domain immediately followed by a question mark, so I've allowed those.

There are also rules on the maximum length of a URL, so you may want to check that separately.
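
A separate length check could look something like this. It is a sketch under assumptions: the 2048 limit is a placeholder I picked (pick whatever your server or target browsers actually enforce), and withLengthLimit and looksLikeUrl are hypothetical names. The stand-in validator is only there so the snippet runs on its own; in practice you would pass isUrlValid instead.

```javascript
// Assumed limit for illustration; the answer does not name a number.
var MAX_URL_LENGTH = 2048;

// Wraps any validator function with a length check that runs first.
function withLengthLimit(validate, maxLength) {
    return function (userInput) {
        return userInput.length <= maxLength && validate(userInput);
    };
}

// Stand-in validator so the sketch is self-contained; swap in isUrlValid.
function looksLikeUrl(userInput) {
    return /^https?:\/\/\S+$/i.test(userInput);
}

var isShortValidUrl = withLengthLimit(looksLikeUrl, MAX_URL_LENGTH);

console.log(isShortValidUrl("https://example.com/foo"));
console.log(isShortValidUrl("https://example.com/" + "a".repeat(3000)));
```

Doing the length check first also means an oversized input never reaches the regex at all.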

(The original answer was written in 2015, but I've updated it because longer top-level domains are now in use, and single-letter domains have become more relevant because of x.com).

AmerllicA · Answer · 2021-12-21 11:42:59Z

8

This question calls for a more robust regex. The following code is not hard to understand (ES6, TypeScript):

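const isValidUrl = (url: string): boolean => {
  // Optional http:// or https://, optional "www.", then a dotted host name ending in a TLD.
  const urlRegex = /^(https?:\/\/)?([wW]{3}\.)?[a-zA-Z0-9\-.]+\.[a-zA-Z]{2,}(\.[a-zA-Z]{2,})?$/;

  // No "g" flag is needed: the pattern is anchored, and a global regex
  // would carry lastIndex state between test() calls.
  return urlRegex.test(url);
};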

edited Dec 21, 2021 at 11:42

answered Jan 25, 2020 at 6:13

Rahul Mahadik · Answer · 2017-12-08 07:10:15Z

1

Try this code.

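// fieldId is the input element itself (not its id string).
function CheckURL(fieldId, alertMessage) {
    var url = fieldId.value;
    // An empty field is rejected silently, without showing the alert.
    if (url === "") {
        return false;
    }
    if (url.match(/(http(s)?:\/\/.)?(www\.)?[-a-zA-Z0-9@:%._\+~#=]{2,256}\.[a-z]{2,6}\b([-a-zA-Z0-9@:%_\+.~#?&//=]*)/g) !== null) {
        return true;
    }
    alert(alertMessage);
    fieldId.focus();
    return false;
}

// This fragment belongs inside a function such as a form's submit handler;
// a bare "return" at the top level of a script is a syntax error.
var website = document.getElementById('Website');
if (!CheckURL(website, "Enter a valid website address")) {
    return false;
}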

answered Dec 8, 2017 at 7:10

Bullsized · Answer · 2024-01-08 09:42:59Z

0

Here's my TypeScript solution and a link to test it:

/**
 * This regex pattern aims to match URLs that start with optional protocols (http://, https://, or ftp://),
 * followed by a domain name, domain extension, and various characters that form the path, query, or fragment part of
 * the URL, as well as allowing `%` as a valid character (for encoded characters).
 *
 * https://regexr.com/7q3qi
 */
const URL_PATTERN: RegExp = /^(?:(?:http|https|ftp):\/\/)?[\w.-]+(?:\.[\w\.-]+)+[\w\-._~:/?#[\]@!$&'()*+,;=%]+$/;
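
For anyone who wants to try the pattern without the regexr link, here is a minimal usage sketch in plain JavaScript. The wrapper name matchesUrlPattern is hypothetical (the answer only defines the constant), and the pattern is copied unchanged apart from dropping the TypeScript type annotation.

```javascript
// Same pattern as above, minus the ": RegExp" annotation.
const URL_PATTERN = /^(?:(?:http|https|ftp):\/\/)?[\w.-]+(?:\.[\w\.-]+)+[\w\-._~:/?#[\]@!$&'()*+,;=%]+$/;

// Hypothetical wrapper: true when the whole input matches the pattern.
function matchesUrlPattern(userInput) {
    return URL_PATTERN.test(userInput);
}

console.log(matchesUrlPattern("https://example.com/path?x=1"));
console.log(matchesUrlPattern("example.com"));
console.log(matchesUrlPattern("not a url"));
```

Because the pattern has no g flag, calling test() repeatedly on the same constant is safe.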

answered Jan 8, 2024 at 9:42

akash gupta · Answer · 2024-04-24 14:06:13Z

0

function isUrlValid(userInput) {
    var regexQuery = "^(https?:\\/\\/)?((([-a-z0-9]{1,63}\\.)*?[a-z0-9]([-a-z0-9]{0,253}[a-z0-9])?\\.[a-z]{2,63})|((\\d{1,3}\\.){3}\\d{1,3}))(:\\d{1,5})?((\\/|\\?)((%[0-9a-f]{2})|[-\\w\\+\\.\\?\\/@~#&=])*)?$";
    var url = new RegExp(regexQuery,"i");
    return url.test(userInput);
}
var input = ["http://localhost/pwc/public/enus/forms/pwc-external-learning-object/NTA3NDA/NTg3",
             "HTTP://EX-AMPLE.COM",
             "example.c",
             "example-.com",
             "www.police.academy",
             "https://x.com/?twitter?",
             "12.34.56.78:9000",
             "http://example.com?a=%bc&d=%ef&g=%H"];
for (var i = 0; i < input.length; i++) document.write(isUrlValid(input[i]) + ": " + input[i] + "<br>");

edited Apr 24, 2024 at 14:06

answered Apr 24, 2024 at 14:03

1 Comment

bfontaine Over a year ago

Please explain your code.
