C# - Regular Expression To Replace Quotation Marks In HTML Tags Only?
Mar 15, 2010
I have the following string:
<div id="mydiv">This is a "div" with quotation marks</div>
I want to use regular expressions to return the following:
<div id='mydiv'>This is a "div" with quotation marks</div>
Notice how the id attribute in the div is now surrounded by apostrophes?
How can I do this with a regular expression?
Edit: I'm not looking for a magic bullet to handle every edge case in every situation. We should all be weary of using regex to parse HTML but, in this particular case and for my particular need, regex IS the solution.
Edit #2: Jens Ameskamp helped to find a solution for me but anyone randomly coming to this page should think long and very hard about using this solution. In my case it works because I am very confident of the type of strings that I'll be dealing with. I know the dangers and the risks and make sure you do to. If you're not sure if you know then it probably indicates that you don't know and shouldn't use this method.
I am wanting to concenate a string inserting a quote. How do I do that? i.e. label.text = "quotation mark" & "new" & " " & "name" I want to be able to see "new name" in the end with the actual quotation marks.
i want to find and replace img src path by regular expression.suppose i have the below html and from there i want to replace file path of all the src attribute of image tag.
<P>Hello</P> <P> </P> <P><IMG alt="C:\Documents and Settings\All Users\Documents\My Pictures\Sample Pictures\Sunset.jpg" src="C:\Documents and Settings\All Users\Documents\My Pictures\Sample Pictures\Sunset.jpg"></P>
here the file path is src="C:\Documents and Settings\All Users\Documents\My Pictures\Sample Pictures\Sunset.jpg"
and i want to replace it with path like src="www.mysite.com/images/a.gif" with regular expression.how can i do it with regular expression.
We have a custom set of custom control tags, eg:<ourTag:OurControl runat="server" />Throughout our project we have discovered Visual Studio's marvelous(sarcasm) helper which automatically pastes an ID with the tags name followed by a counter number. I am now trying to remove them globally.
What I'm interested in is a regular expression that will accept HTML input and remove all attributes inside the tag while leaving the tag intact. For example I want this...
<p class="test" id="TestParagraph">This is some test text right here.</p>
I am trying to write a regular expression to replace all special chracters within a string so that I am left with just alphanumeric characters and spaces or full stops.
I am using this expression at the moment
[Code]....
But then I test this with the string "This is a test!", it returns "Thisis a test", which would be fine, except for the space it removes between "This" and "is", can anyone else me where I am going wrong or supply a better expression?
I'm having problem getting the syntax right for an SQL statement for a Progress database. I have the column "Resource" which is a reserved word and according to the Openedge SQL Reference I can only use reserved words if I delimit them with double quotation marks.
So how do I write this line...
Dim comm As New OdbcCommand("SELECT Resource FROM PUB.ActOper WHERE CreDate > TO_DATE ('05/24/2010')", conn)
...with double quotes around Resource? I've tried a bunch of different combinations but I get "Expression expected" or "valid continuation expected" or something similar.
I want a limited amount of html tags for the user to be able to put in my form and by looking at the posts, regular expressions seem to be the way forward. I don't want to use javascript or 3rd party as this is for an assignment at university.
The examples I have found seem to be set out differently than mine with using asp id identifiers whereas my code below is set out differently so I am unsure if I can use these examples.
This is slightly different to most questions regarding HTML tags in strings - I want to add HTML!
I'm using a WYSIWYG editor which produces some eratic results. What im looking to do is to check the string it produces and check whether there are any sentences that aren't wrapped in a <p></p> tag. Typical strings could be:
I was doing some web scraping and i was looking for some div elements with particular class names and markup. This is my objective , i have to extract everything within the div having the class s_specs_box s_box_4 Could someone provide the regular expression in .NET terms (i.e., which can be straight away passed into Regex's constructor)to match one such div (given below)
<div class="s_specs_box s_box_4"><h3>Display</h3><ul><li><strong><span class='s_tooltip_anchor'>Display:</span> <span class='s_tooltip_content'><p class='s_help'><b>Display</b> - Phone's main display</p></span></strong><ul> <li class='clear clearfix'><strong><span class='s_tooltip_anchor'>Type:</span> <span class='s_tooltip_content'><p class='s_help'><b>Type</b> - Refers to the type of the display. There are four major display types: Greyscale, Black&White, LCD:STN-color and LCD:TFT-color</p></span></strong><ul><li>Color</li></ul> </li><li class='clear clearfix'><strong><span class='s_tooltip_anchor'>Technology:</span> <span class='s_tooltip_content'><p class='s_help'><b>Technology</b> - Refers to the type of the color displays. There are five major types: LCD, TFT, TFD, STN and OLED</p></span></strong><ul><li>Super AMOLED</li></ul>..............................
I'm trying to use the validator to work on a email form to ensure that they enter a valid from email address. That part works. I also want to add to the expression the text that I pre-populate in the txtbox ("Enter your email address") so on postback, after sending the message, I can clear the fields and repopulate that box.
How do I add that wording to the current expression: w+([-+.']w+)*@w+([-.]w+)*.w+([-.]w+)*
I am using Regular Expression validator for a text box. The below is working fine.It is not performing validation when I enter values like 0000..001.How can I modify validation expression? But it would allow values like 100,...5000 i.e zeros after a number.
<asp:RegularExpressionValidator ID="reg2" runat="server" ControlToValidate="rng2" ValidationExpression="^[0-9]+" ErrorMessage="*Please Enter a Valid Number for Second Range." ForeColor="Red" Font-Bold="True"></asp:RegularExpressionValidator>
I need some help in regular expression.I am validating the textbox text when updating the records.When i click the update button,the first 5 letters should be equal to CM000 or cm000.How to validate this using regular expression in asp.net. does anyone know validationexpression for this.
i'm doing ajax website using PostWebRequest() function, when i call any .aspx page to the target html element the output is coming fine in IE but in FF(FireFox) is coming in html format(html tags) IE output : [URL] FF output : [URL]
javascript functions : function PostWebRequest(postPage, HTMLTarget, parameter) { displayElement = $get(HTMLTarget); displayElement.innerHTML = "<div style='text-align:center;'><br /><br /><br /><br /><br /><img src='images/ajax-loader.gif' algin='center' /></div>"; var wRequest = new Sys.Net.WebRequest(); wRequest.set_url(postPage); wRequest.set_httpVerb("POST"); var body = parameter; wRequest.set_body(body); wRequest.get_headers()["Content-Length"] = body.length; wRequest.add_completed(onWebRequestCompleted); wRequest.invoke(); } function onWebRequestCompleted(executor, eventArgs) { if (executor.get_responseAvailable()) { if (document.all) { displayElement.innerHTML = executor.get_responseData(); } else { displayElement.textContent = executor.get_responseData(); } } else { if (executor.get_timeOut()) { alert("Timed Out"); } else { if (executor.get_aborted()) alert("Aborted"); } } }
i'm trying to find all the anchor tags and appending the href value with a variable. for example
<a href="/page.aspx">link</a> will become <a href="/page.aspx?id=2"> <A hRef='http://www.google.com'><img src='pic.jpg'></a> will become <A hRef='http://www.google.com?id=2'><img src='pic.jpg'></a>
I'm able to match all the anchor tags and href values using regex, then i manually replace the values using string.replace, however i dont think its the efficient way to do this.Is there a solution where i can use something like regex.replace(html,newurlvalue)
have a web based application (ASP.NET 2.0) on a virtual dedicated Windows Server 2003 SP2 box, that converts HTML forms and converts them to PDF using ABCpdf. It has been around for years, and hasn't been changed for at least the last two of those years.Three days ago it started placing question marks throughout the outputed PDF documents