DEV Community

John Au-Yeung
John Au-Yeung

Posted on

Use the URL Object in JavaScript for Easy URL Manipulation

Subscribe to my email list now at http://jauyeung.net/subscribe/

Follow me on Twitter at https://twitter.com/AuMayeung

Many more articles at https://medium.com/@hohanga

Manipulating and extracting parts from a URL is a pain if we have to write all the code ourselves. Fortunately, most browsers have the URL object built into their standard library. Now, we can pass in the URL, as a string, to the URL constructor and create an instance of it. Then, we can use the handy value properties and methods to manipulate and get the parts of a URL that we want.

To create a URL object with the URL constructor, we use the new keyword like in the following code:

new URL('http://medium.com');

In above code, we create an instance of the URL object with an absolute URL. We can also create a URL object by passing-in a relative URL as the first argument and the base URL as the second argument. For example, we can write:

new URL('/@hohanga', 'http://medium.com');

Note that the second argument must be a base URL. This means that it must be a valid URL and not just part of it. It must start with the protocol like http:// or https://. We can also define them in a chain-like in the following code:

const mozillaUrl = 'https://developer.mozilla.org';  
const mozillaDevUrl = new URL("/", mozillaUrl);  
const mozillaUrl2 = new URL(mozillaUrl);  
const docsUrl = new URL('/en-US/docs', mozillaUrl2);  
const urlUrl = new URL('Web/API/URL', docsUrl);

If we call toString() on the URL objects above, we should get something like:

https://developer.mozilla.org
https://developer.mozilla.org/
https://developer.mozilla.org/
https://developer.mozilla.org/en-US/docs
https://developer.mozilla.org/en-US/Web/API/URL

The second argument is an optional argument and it should only be passed in when the first argument is a relative URL. The strings or URL objects that we pass in are converted to USVString objects which correspond to the set of all possible sequences of Unicode scalar values. In our code, we can treat them the same as regular strings. A TypeError will be thrown if both argument are relative URLs or if the base and the relative URL together aren’t valid. We can pass in the URL object directly to the second argument because the toString method of the URL object will convert a URL object into a full URL string before being manipulated in the constructor.

A URL object can have the following properties. This article will have the first half of the list of properties and Part 2 of this article will have the second half.

Hash Property

The hash property has the part of the URL that comes after the pound sign (# ). The string isn’t percent decoded, so special symbols like the following are still encoded. They’re encoding with the following mapping. The characters on the left side are converted to the ones on the right side during encoding:

  • ‘:’%3A
  • ‘/’%2F
  • ‘?’%3F
  • ‘#’%23
  • ‘[‘%5B
  • ‘]’%5D
  • ‘@’%40
  • ‘!’%21
  • ‘$’%24
  • “‘“%27
  • ‘(‘%28
  • ‘)’%29
  • ‘*’%2A
  • ‘+’%2B
  • ‘,’%2C
  • ‘;’%3B
  • ‘=’%3D
  • ‘%’%25
  • ‘ ‘%20 or +

For example, if we have the URL https://example.com#hash . Then we can extract the hash with the hash property as follows:

const exampleUrl = new URL('https://example.com#hash');  
console.log(exampleUrl.hash);

Then we get '#hash' in the console.log statement. The property is a USVString , but it’s converted to a string when we retrieve it like we did above. This is not a read-only property, so we can also assign to it like in the following code:

exampleUrl.hash = '#newHash';

For example, if we have:

const exampleUrl = new URL('https://example.com#hash');  
exampleUrl.hash = '#newHash';  
console.log(exampleUrl.hash);

Then we get https://example.com/#newHash as the new URL when we get it with the href property.

Host Property

The host property of the URL object is a USVString that contains the host name. If a port is included after the : , then we also get the port number along with the host. For example, if we have:

const exampleUrl = new URL('https://example.com:8888');  
console.log(exampleUrl.host);

Then we get example.com:8888 . Like other USVString properties, it’s converted to a string when we retrieve it. This is not a read only property, so we can also assign to it like in the following code:

exampleUrl.host = 'example.com';

We can also set the host property to change the URL structure. For example, if we have:

const exampleUrl = new URL('https://example.com:8888/en-US/docs/Web/API/URL/href/');

exampleUrl as the new URL when we get it with the href property.

Hostname Property

With the hostname property, we can get the host name from the URL as a USVString. It will not include the port even if it’s included in the URL string that we used to construct the URL object. For example, if we have:

const exampleUrl = new URL('https://example.com:8888'));  
console.log(exampleUrl.hostname);

Then we get example.com from the console.log statement. Like other USVString properties, it’s converted to a string when we retrieve it. This is not a read-only property, so we can also assign to it like in the following code:

exampleUrl.hostname = newExample.com;

Then we get that the new URL is https://newexample.com:8888/ when we get it with the href property.

Href Property

The href property of the URL object is USVString that has the whole URL. For example, if we have:

const exampleUrl = new URL(https://example.com:8888/en-US/docs/Web/API/URL/href/');  
console.log(exampleUrl.href);

Then when we run the code above, we get https://example.com:8888/en-US/docs/Web/API/URL/href/ which is what we passed into the URL constructor. We can also set the href property by assigning our own value to it. For example, we can write:

const exampleUrl = new URL('https://example.com:8888/en-US/docs/Web/API/URL/href/');  
exampleUrl.href = 'https://newExample.com';  
console.log(exampleUrl.href);

Origin Property

The origin property is a read-only property that returns a USVString with the Unicode serialization of the origin of the URL. The structure of the origin will depend on the type of URL that’s used to construct the URL object. For http or https URLs, the origin will be the protocol followed by :// , then followed by the domain, then followed by the colon (: ), then followed by the port. The default port will only be omitted, which is 80 for http and 443 for https , if it’s explicitly specified. For file protocol URLs, the value will be browser dependent. For blob: URLs, the origin will be the part following blob: . For example, "blob:http://example.com" will be returned as "http://example.com" . For example, if we have:

const url = new URL("https://example.org:443")  
console.log(url.origin);

Then we get:

https://example.org

If we have an http URL like the following:

const url = new URL("http://example.org:80")  
console.log(url.origin);

Then we get:

http://example.org

If we have a non-default port for http or https URLs, like in the following:

const url = new URL("http://example.org:8888")  
console.log(url.origin);
const url2 = new URL("http://example.org:8888")  
console.log(url.origin);

Then we get http://example.org:8888

and http://example.org:8888 respectively.

If we have a blob URL, like in the following:

const url = new URL("blob:http://example.org:8888")  
console.log(url.origin);

Then we get http://example.org:8888 . And for file URLs like the following:

const url = new URL("file://c:/documents/document.docx")  
console.log(url.origin);

Then we get file:// in Chrome.

Password Property

The password property is a writable property that extracts the password portion of the URL before the domain name. Like the other properties, the string is a USVString . If it’s set without the username property being set first, then assigning to this property will silently fail. For example, if we have:

const url = new URL('https://username:password@example.com');  
console.log(url.password);

Then we get password in the console.log statement above. We can also set the password property to the password of our choice. For example, if we have:

const url = new URL('https://username:password@example.com');  
url.password = 'newPassword'  
console.log(url);

Then we get https://username:newPassword@example.com as the new URL when we get it with the href property.

Pathname Property

The pathname property is a USVString property which has the part following the first slash (/). It’s writable property. For example, we can get the pathname value from the URL in the following code:

const url = new URL('https://example.com/path1/path2');  
console.log(url.pathname);

Then we get /path1/path2 when we run the console.log statement above. We can also set the pathname property to the relative URL of your choice by setting it to the part after the domain. For example, we can write:

const url = new URL('https://example.com/path1/path2');  
url.pathname = '/newpath1/newpath2';  
console.log(url.href);

Then we get https://example.com/newpath1/newpath2 when we run the console.log statement above.

Manipulating and extract parts from a URL is a pain if we have to write all the code to do it ourselves. Fortunately, most browsers have the URL object built in to their standard library. Now we can pass in the URL as a string to the URL constructor and to create an instance of a URL. Then we can use the handy value properties and methods to manipulate and get the parts of a URL that we want.

To create an URL object with the URL constructor, we use the new keyword like in the following code:

new URL('http://medium.com');

To see more ways to create an URL object, see Part 1 for more details.

A URL object can have the following properties. This article will have the second half of the list of properties and Part 1 of this article will have the first half. All string values are stored as USVString. USVString are objects which correspond to the set of all possible sequences of Unicode scalar values. All USVString values are converted to strings before we get them, so we can treat them all like regular strings.

Value Properties

port Property

The port property is a USVString property that has the port number of the URL. For example, if we have:

const url = new URL('https://example.com:8888');  
console.log(url.port);

Then we get 8888 from the console.log statement. We can also set the port to get a new URL with by assigning the port property with a new value. For example, if we have the following code:

const url = new URL('https://example.com:8888');  
url.port = '3333';  
console.log(url.href);

Then we get https://example.com:3333/ from the console.log statement when we run the code above.

protocol Property

With the protocol property, we can get the the protocol portion of the URL as a USVString , which is the very first part of the URL. Example protocols include http: , https: , ftp: , file: , etc. For instance, if we have the URL https://example.com, then https is the protocol portion of the URL. If we have the following code:

const url = new URL('https://example.com:8888');  
console.log(url.protocol);

Then we get https: from the console.log statement in the code above.

For other protocols like ftp , it also works:

const url = new URL('ftp://example.com:8888');  
console.log(url.protocol);

If we run the code above, then we get ftp: from the console.log statement in the code above.

We can also set the protocol property to get a new URL. For example, if we have:

const url = new URL('ftp://example.com:8888');  
url.protocol = 'http://'
console.log(url.href);

Then we get http://example.com:8888/ from the console.log above. It also works if we omit the slashes or omit both the colon and the slashes. For example, if we have:

const url = new URL('ftp://example.com:8888');  
url.protocol = 'http:' 
console.log(url.href);

Then we also get http://example.com:8888/ from the console.log above. Likewise, if we have:

const url = new URL('ftp://example.com:8888');  
url.protocol = 'http'
console.log(url.href);

We get the same thing.

search Property

To get the search string or query string from a URL, we can use the search property. The query string is anything after the ? in a URL. Like all the other properties, this one is also USVString property. Note that this property get the whole query string. If you want the individual keys and values from the query string, then we can use the searchParams property. We can use the search property like the following code:

const url = new URL('http://example.com:8888?key1=value1&key2=value2');  
console.log(url.search);

Then we get back ?key1=value1&key2=value2 from the console.log statement when we run the code above. We can also change the query string of the URL by assigning it a new value. For example, if we have the following code:

const url = new URL('http://example.com:8888?key1=value1&key2=value2');  
url.search = 'newKey1=newValue1&newKey2=newValue2';  
console.log(url.href);

Then we get http://example.com:8888/?newKey1=newValue1&newKey2=newValue2 from the console.log statement when we run the code above.

searchParams Property

The search property only gets us the whole query string. To parse the query string into keys and values, we can use the searchParams property, which will get us a URLSearchParams object which has the list of keys and values listed on the query string. The keys and values will be decoded so that special characters are all intact. This is a read only property so we can’t set the query string directly by assigning a new value to this property. For example, to get the query string as a list of keys and values, we can write the following code:

const url = new URL('http://example.com:8888/?key1=value1&key2=value2');  
console.log(url.searchParams.get('key1'));  
console.log(url.searchParams.get('key2'));

Then we get value1 from the first console.log statement and value2 from the second console.log statement. The URLSearchParams object has a get method to get the value of the given query string key by the key name.

Username Property

We can get and set the username part of the URL with the username property. The username is a URL comes before the password and domain name. For example, if we have the following code:

const url = new URL('http://username:password@example.com');  
console.log(url.username);

Then when we run the code above, we get username from the console.log statement above. Also, we can use it to set the username in a URL. For instance, if we have the following code:

const url = new URL('http://username:password@example.com');  
url.username = 'newUserName';  
console.log(url.href);

Then we get http://newUserName:password@example.com/ from the console.log statement in the code above.

Instance Methods

URL object instances also have 2 methods. It has a toString() and a toJSON() method. The toString() method returns a USVString containing the whole URL. It’s the same as the href property for getting the whole URL, but we can’t use the toString() method to set the whole URL like we do with the href property. For example, if we have:

const url = new URL('http://username:password@example.com');  
console.log(url.toString());

Then we get http://username:password@example.com/ from the console.log statement above.

The toJSON() method returns a JSON serialized version of the URL object. However, in most cases, it’s actually the same as what toString() will return. For example, if we have:

const url = new URL('http://username:password@example.com');  
console.log(url.toJSON());  
console.log(url.toString());  
console.log(url.toJSON() === url.toString());

Then we get:

http://username:password@example.com/
http://username:password@example.com
true

from the console.log statements when the code above is run. Then url.toJSON() and url.toString() calls returned the same output, and it’s confirmed by the comparison statement url.toJSON() === url.toString() , which evaluated to true .

Static Methods

There are 2 static methods in the URL constructor. It has the createObjectURL() method and the revokeObjectURL() method. The createObjectURL() static method creates a DOMString containing a URL representing the object given in the parameter. A DOMString is a UTF-16 encoded string. Since JavaScript uses UTF-16 strings, DOMStrings are mapped directly to strings. Passing null to a method that accepts a DOMString will convert null to 'null' . The createObbjectURL() method accepts an object, which can be a File, Blob, or MediaSource object to create a URL for.For example, we can call the createObjectURL() method like in the following code:

const objUrl = URL.createObjectURL(new File([""], "filename"));  
console.log(objUrl);

Then we get something like:

blob:https://fiddle.jshell.net/4cfe2ef3-74ea-4145-87a7-0483b117b429

Each time we create an object URL with the createObjectURL() method, we have to release it by calling the revokeObjectURL() method to clear it from memory. However, browser will release object URLs automatically when the document is unloaded, but for the sake for improving performance, we should release it manually. The revokeObjectURL() method takes in an object URL as its argument. For example, if we have the object URL above created, then we can use the revokeObjectURL() method to release the object URL from memory like in the following code:

const objUrl = URL.createObjectURL(new File([""], "filename"));  
console.log(objUrl);  
URL.revokeObjectURL(objUrl);

The revokeObjectURL() method returns undefined .

When any part of the URL has special characters, they’re encoded by the rules found in RFC-3896, which has a long list of rules on how to encode different types of characters. It also includes non-English characters, and the percent encoding standard for encoding URLs. The full list of rules are at https://tools.ietf.org/html/rfc3986.

With the URL object, manipulating and extract parts from a URL is no longer a pain since we don’t have to write all the code to do it ourselves. Most browsers have the URL object built in to their standard library. Now we can pass in the URL as a string to the URL constructor and to create an instance of a URL. Then we can use the handy value properties and methods to manipulate and get the parts of a URL that we want. It can manipulate all parts of the URL except the query string portion, which can be manipulated by parsing it into a URLSeachParams object and then call its methods and set its properties to modify the query string. To toString and toJSON methods both convert a URL object into a full URL string.

Top comments (2)

Collapse
 
maximusmaxwell6 profile image
maximus Maxwell

Thanks. Very useful.

Collapse
 
aumayeung profile image
John Au-Yeung

Thanks very much for reading.