PROWAREtech








.NET: Strip/Remove HTML SCRIPT Tags from Text Using Regex
How to remove the SCRIPT tag and its containing code from HTML text using C# and regular expressions.
See related: Find Keywords in Text and strip all HTML tags from text
It is very easy to remove all SCRIPT tags in HTML text using Regex.Replace()
.
Do this is JavaScript, client-side (in the browser).
using System;
using System.Text.RegularExpressions;
namespace ConsoleApp1
{
class Program
{
static void Main(string[] args)
{
var html = @"
<p>This is a test</p>
<script type='text/javascript'>
document.write(123);
</script>
<p>The script tag should have been removed!</p>
";
Console.Write(Regex.Replace(html, @"<script\b[^<]*(?:(?!<\/script>)<[^<]*)*<\/script>", string.Empty, RegexOptions.IgnoreCase));
}
}
}
Comment