Stripping HTML with regular expressions to get plain text

This article, collected by 前端之家, shows a C# method that uses regular expressions to strip HTML from a string and return plain text. It is shared here as a reference.

public static string DelHTML(string Htmlstring) // Strip HTML from the input string

{
#region
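// Remove script blocks (Singleline lets '.' match across line breaks)

Htmlstring = System.Text.RegularExpressions.Regex.Replace(Htmlstring, @"<script[^>]*?>.*?</script>", "", System.Text.RegularExpressions.RegexOptions.IgnoreCase | System.Text.RegularExpressions.RegexOptions.Singleline);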

// Remove HTML


//Htmlstring =System.Text.RegularExpressions. Regex.Replace(Htmlstring,@"<A>.*</A>","");

//Htmlstring =System.Text.RegularExpressions. Regex.Replace(Htmlstring,@"<[a-zA-Z]*=\.[a-zA-Z]*\?[a-zA-Z]+=\d&\w=%[a-zA-Z]*|[A-Z0-9]","");

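// Assumption: the tag-stripping call did not survive in the source; this pattern is a reconstruction.
Htmlstring = System.Text.RegularExpressions.Regex.Replace(Htmlstring, @"<[^>]*>", "");

// Decode the entities that survive in the source.
Htmlstring = System.Text.RegularExpressions.Regex.Replace(Htmlstring, @"&(cent|#162);", "\xa2");
Htmlstring = System.Text.RegularExpressions.Regex.Replace(Htmlstring, @"&(pound|#163);", "\xa3");
Htmlstring = System.Text.RegularExpressions.Regex.Replace(Htmlstring, @"&(copy|#169);", "\xa9");

// Drop any remaining numeric entities (the replacement text was lost in the source; an empty string is assumed).
Htmlstring = System.Text.RegularExpressions.Regex.Replace(Htmlstring, @"&#(\d+);", "");

// string.Replace returns a new string, so the result must be assigned back (empty replacements assumed).
Htmlstring = Htmlstring.Replace("<", "").Replace(">", "").Replace("\r\n", "");

//Htmlstring=HttpContext.Current.Server.HtmlEncode(Htmlstring).Trim();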
#endregion


return Htmlstring;

}
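
The per-entity replacements in the method cover only a handful of entities. As a minimal sketch of the same idea, the example below strips script blocks and tags with regular expressions and then lets System.Net.WebUtility.HtmlDecode handle the entities. The class name HtmlTextSketch and the method ToPlainText are hypothetical names used for illustration and are not part of the original article. Unlike the original method, it does not remove "<" or ">" characters produced by decoding.

```csharp
using System;
using System.Net;
using System.Text.RegularExpressions;

// Hypothetical helper for illustration; not from the original article.
public static class HtmlTextSketch
{
    public static string ToPlainText(string html)
    {
        if (string.IsNullOrEmpty(html))
        {
            return string.Empty;
        }

        // Remove script blocks together with their content.
        // Singleline lets '.' match newlines inside the script body.
        string text = Regex.Replace(
            html,
            @"<script[^>]*>.*?</script>",
            "",
            RegexOptions.IgnoreCase | RegexOptions.Singleline);

        // Remove all remaining tags.
        text = Regex.Replace(text, @"<[^>]*>", "");

        // Decode named and numeric entities such as &cent; or &#169;.
        text = WebUtility.HtmlDecode(text);

        return text.Trim();
    }

    public static void Main()
    {
        string html = "<p>Price: 5&cent; &copy; 2020</p><script>alert(1)</script>";
        Console.WriteLine(ToPlainText(html));
    }
}
```

Regular expressions are a rough tool for this job: malformed markup or a ">" inside an attribute value can defeat the tag pattern, so an HTML parser is the safer choice when the input is not trusted.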

Original link: https://www.f2er.com/regex/360307.html
