C# - Define The Regular Expression To Match The Following Html Code?
Oct 29, 2010
I was doing some web scraping and i was looking for some div elements with particular class names and markup. This is my objective , i have to extract everything within the div having the class s_specs_box s_box_4 Could someone provide the regular expression in .NET terms (i.e., which can be straight away passed into Regex's constructor)to match one such div (given below)
<div class="s_specs_box s_box_4"><h3>Display</h3><ul><li><strong><span class='s_tooltip_anchor'>Display:</span>
<span class='s_tooltip_content'><p class='s_help'><b>Display</b> - Phone's main display</p></span></strong><ul>
<li class='clear clearfix'><strong><span class='s_tooltip_anchor'>Type:</span>
<span class='s_tooltip_content'><p class='s_help'><b>Type</b> - Refers to the type of the display. There are four major display types: Greyscale, Black&White, LCD:STN-color and LCD:TFT-color</p></span></strong><ul><li>Color</li></ul>
</li><li class='clear clearfix'><strong><span class='s_tooltip_anchor'>Technology:</span>
<span class='s_tooltip_content'><p class='s_help'><b>Technology</b> - Refers to the type of the color displays. There are five major types: LCD, TFT, TFD, STN and OLED</p></span></strong><ul><li>Super AMOLED</li></ul>..............................
I'm trying to use a RegularexpressionValidator to match an IP address (with possible wildcards) for an IP filtering system.I'm using the following Regex:
"([0-9]{1,3}\.|\*\.){3}([0-9]{1,3}|\*){1}"
Which works fine when running it in LINQPad with Regex.Matches, but doesn't seem to work when I'm using the validator.Does anyone have a suggestion as to either a better Regex or why it would work in test but not in situ?
I'm testing out a CMS I've built in ASP.NET and I'm having issues validating a form textarea with a regular expression.
I've used a ReguarExpressionValidator control to make sure a maximum of 4000 characters are entered but have realised I'm not including line breaks in the regular expression and can't figure out how to include them?
This is slightly different to most questions regarding HTML tags in strings - I want to add HTML!
I'm using a WYSIWYG editor which produces some eratic results. What im looking to do is to check the string it produces and check whether there are any sentences that aren't wrapped in a <p></p> tag. Typical strings could be:
<div id="mydiv">This is a "div" with quotation marks</div>
I want to use regular expressions to return the following:
<div id='mydiv'>This is a "div" with quotation marks</div>
Notice how the id attribute in the div is now surrounded by apostrophes?
How can I do this with a regular expression?
Edit: I'm not looking for a magic bullet to handle every edge case in every situation. We should all be weary of using regex to parse HTML but, in this particular case and for my particular need, regex IS the solution.
Edit #2: Jens Ameskamp helped to find a solution for me but anyone randomly coming to this page should think long and very hard about using this solution. In my case it works because I am very confident of the type of strings that I'll be dealing with. I know the dangers and the risks and make sure you do to. If you're not sure if you know then it probably indicates that you don't know and shouldn't use this method.
I am trying to match an email, however when put the same expression in the code behind, vs the aspx, I seem to be getting different matches for email address. The aspx regex validator seems to be working correctly, however I need to validate for my webservice as well.
Regex regExEmail = new Regex(@"w+([-+.]w+)*@w+([-.]w+)*.w+([-.]w+)*"); if (!regExEmail.IsMatch(contact.emailAddress)) { //do something }
the text box should validate in such a manner that the special characters like <,>,%,!,? should not appear in the text box.if appears an erroe msg should be displayed representing invalid name.or else it sholud be processen....
can any one tell the code(regular expression) for the above mentioned.
I want to move some parts of the html page containing this function to a new web user control. now "form1" is not defined and it is not working correctly without any error message!
I'm trying to use the validator to work on a email form to ensure that they enter a valid from email address. That part works. I also want to add to the expression the text that I pre-populate in the txtbox ("Enter your email address") so on postback, after sending the message, I can clear the fields and repopulate that box.
How do I add that wording to the current expression: w+([-+.']w+)*@w+([-.]w+)*.w+([-.]w+)*
I am using Regular Expression validator for a text box. The below is working fine.It is not performing validation when I enter values like 0000..001.How can I modify validation expression? But it would allow values like 100,...5000 i.e zeros after a number.
<asp:RegularExpressionValidator ID="reg2" runat="server" ControlToValidate="rng2" ValidationExpression="^[0-9]+" ErrorMessage="*Please Enter a Valid Number for Second Range." ForeColor="Red" Font-Bold="True"></asp:RegularExpressionValidator>
I need some help in regular expression.I am validating the textbox text when updating the records.When i click the update button,the first 5 letters should be equal to CM000 or cm000.How to validate this using regular expression in asp.net. does anyone know validationexpression for this.
Is it possible to set the property of a server tag from a c# expression, i.e. something like <asp:TextBox Width='<%= [some c# expression] %>'/> I though this would be pretty straightforward, but I can't get such an expression to run.