Subscribe

JavaScript removing HTML tags

โœ๏ธ

Learn how to delete all HTML tags using Vanilla JavaScript. See the example code in the Codepen!

28 Nov, 2020 ยท 2 min read

I recently needed to remove all HTML tags from the text content of my application to return clean text.

In this case, it was to share a plain text version for meta descriptions. It can also be used for several other outputs.

Today Iโ€™ll show you how to remove HTML tags in Javascript.

Iโ€™ll show you two ways of removing HTML tags from a string. I want to note that when you accept user input, you should always opt for a more secure server-side check.

1. JS remove HTML tags with innerHTML

One method is to create a temporary HTML element and get the innerText from it.

const original = `<h1>Welcome to my blog</h1>
<p>Some more content here</p><br /><img alt="a > 2" src="img.jpg" />`;

let removeHTML = (input) => {
  let tmp = document.createElement('div');
  tmp.innerHTML = input;
  return tmp.textContent || tmp.innerText || '';
};
console.log(removeHTML(original));

This will result in the following:

'Welcome to my blog
Some more content here'

As you can see, we removed every HTML tag, including an image, and extracted the text content.

2. JS Remove HTML tags with regex

My favorite for my applications is using a regex match. Itโ€™s a cleaner solution, and I trust my inputs to be valid HTML.

How it works:

const original = `<h1>Welcome to my blog</h1>
<p>Some more content here</p><br /><img src="img.jpg" />`;

const regex = original.replace(/<[^>]*>/g, '');
console.log(regex);

This will result in:

'Welcome to my blog
Some more content here'

As you can see, we removed the heading, paragraph, break, and image. This is because we escape all HTML brackets with < > format.

It could be breached by something silly like:

const original = `<h1>Welcome to my blog</h1>
<p>Some more content here</p><br /><img alt="a > 2" src="img.jpg" />`;

I know itโ€™s not valid HTML anyhow, and one should use &gt; for this.

But running this will result in:

'Welcome to my blog
Some more content here 2" src="img.jpg" />'

Itโ€™s just something to be aware of.

Try the example code on Codepen

You can try out the code of both methods in this Codepen:

See the Pen Exyqbqa by Chris Bongers (@rebelchris) on CodePen.

Thank you for reading, and letโ€™s connect!

Thank you for reading my blog. Feel free to subscribe to my email newsletter and connect on Facebook or Twitter

Spread the knowledge with fellow developers on Twitter
Tweet this tip
Powered by Webmentions - Learn more

Read next ๐Ÿ“–

Removing all vowels with JavaScript

3 Dec, 2022 ยท 2 min read

Removing all vowels with JavaScript

10 games to learn JavaScript

2 Dec, 2022 ยท 3 min read

10 games to learn JavaScript

Join 2097 devs and subscribe to my newsletter