Remove HTML tags from a text area

Hi, here is my problem:
I have a form with a text area item; this item is set to “Display as Editor HTML standard”, so it is possible to enter formatted text with HTML tags. I then save the text in a table, and the column keeps the HTML tags. Afterwards I can show the text in a report, where the HTML tags are interpreted and the text appears formatted.
But I also need to use that text for other purposes (e.g. sending it in an email) with the HTML tags removed.
Is there any way to remove HTML tags from a text item?
Regards
Dario

From http://asktom.oracle.com/pls/asktom/f?p=100:11:0::::P11_QUESTION_ID:769425837805
   FUNCTION str_html (line IN VARCHAR2)
      RETURN VARCHAR2
   IS
      x         VARCHAR2 (32767) := NULL;
      in_html   BOOLEAN          := FALSE;
      s         VARCHAR2 (1);
   BEGIN
      IF line IS NULL
      THEN
         RETURN line;
      END IF;
      FOR i IN 1 .. LENGTH (line)
      LOOP
         s := SUBSTR (line, i, 1);
         IF in_html
         THEN
            IF s = '>'
            THEN
               in_html := FALSE;
            END IF;
         ELSE
            IF s = '<'
            THEN
               in_html := TRUE;
            END IF;
         END IF;
         IF NOT in_html AND s != '>'
         THEN
            x := x || s;
         END IF;
      END LOOP;
      RETURN x;
   END str_html;
There's also a regular expression approach that I've not tried; see the thread "Remove HTML Tags and parse the text out of it".

Similar Messages

  • How to remove html-tags from a text.

    Hello!
    I have a text field from which I want to remove HTML tags.
    Example:
    "This is a test<br><p> and another test"
    The function must return a similar text, but without the html-
    tags <br> and <p> (in this case).
    Anybody that can help me with this little problem?
    Thanks in advance for any help :-)
    Best regards
    Kjetil Klxve

    You can wait for some kind person to post a complete code
    solution... But if you want to fix this yourself (which is good
    for the soul) here are some hints:
    - You can use SUBSTR to get at chunks of text
    - You can use INSTR to find particular characters.
    - You can use INSTR as an argument of SUBSTR
    Hence:
    bit_of_text := SUBSTR(text, 1, INSTR(text, '<') - 1);
    chopped_text := SUBSTR(text, INSTR(text, '>') + 1);
    bit_of_text := bit_of_text||SUBSTR(chopped_text, 1,
    INSTR(chopped_text, '<') - 1);
    will give you the first bits of text that don't contain any
    angle brackets.
    From this you should be able to work out how to turn this into
    a function (you'll need to store the offsets and use them in a
    loop construct).
    Note that this assumes that the text only contains the '<'
    character when it's part of a HTML tag. If you can't guarantee
    this then you'll have to explicitly search for all the tags e.g.
    bit_of_text := SUBSTR(text, 1, INSTR(lower(text), '<p>') - 1);
    bit_of_text := SUBSTR(text, 1, INSTR(lower(text), '<br>') - 1);
    This will be a bit of a pain. And completely rules out XML!
    rgds APC
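
The "store the offsets and use them in a loop" hint can be sketched roughly as follows. This is only an illustration in Java, where indexOf and substring play the roles of INSTR and SUBSTR; the class and method names are made up for the example, and it keeps the same assumption as the post above, namely that every '<' starts a tag.

```java
// Hypothetical helper: names are illustrative, not from the original thread.
class OffsetTagStripper {
    // Assumes every '<' opens a tag that is closed by a later '>'.
    static String strip(String text) {
        StringBuilder out = new StringBuilder();
        int pos = 0; // offset of the first character not yet processed
        while (true) {
            int open = text.indexOf('<', pos);
            if (open == -1) {
                // No more tags: keep the rest of the text.
                out.append(text.substring(pos));
                break;
            }
            // Keep the chunk between the previous tag and this one.
            out.append(text, pos, open);
            int close = text.indexOf('>', open);
            if (close == -1) {
                // Unterminated tag: drop everything after the '<'.
                break;
            }
            pos = close + 1;
        }
        return out.toString();
    }
}
```

With the example from the question, OffsetTagStripper.strip("This is a test<br><p> and another test") should give "This is a test and another test".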

  • Problem removing html tags from the text retrieved

    Hi there,
    I am using JDBC to connect to the database and retrieve the data. In one of the columns, a few of the records contain HTML tags along with the description. Is there a way to retrieve only the text, ignoring the HTML tags in between? Or can I retrieve the value and then strip the HTML from the text so that only plain text is displayed?
    Here is an example of the retrieved data, which is pipe-separated; one of the columns has HTML tags in it:
    209|The euphoria |187945-2|http://www.abc/lst.jsp?mktgChannel=I86023&sku=18791-2&siteID=qpF0HYnRugA|http://www.abc.com/assets/images/product/medium/18793-2_198.jpg|Rooftop Singers: Walk Right In | abc Music proudly presents THE FOLK YEARS, an unforgettable era in music history!<BR><BR><B>Featuring:</B><BR>
    <LI>The most complete collection of folk and folk-rock songs ever put together -- 132 classics!
    <LI>Original hits by the original artists!
    Now I need to remove the tags before displaying this in the output. Is there a simple way to do this?
    Thanks...

    Did you read the documentation of the trim() method,
    where it describes which whitespace it removes?

    I believe his problem is that
    "Some text here  
    <blah> 
    More text"
    becomes
    "Some text here  
    More text"
    ... and he wants ...
    "Some text here
    More text"
    So, your problem is that your regex isn't matching whitespace as well.
    See the "Trimming Whitespace" section:
    http://www.regular-expressions.info/examples.html
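
One way to get the "wanted" result described above is to strip the tags first and then tidy the whitespace line by line. The following is a rough sketch, not the original poster's code; the class name is hypothetical, and the pattern <[^>]*> assumes that '<' and '>' only ever appear as tag delimiters.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: names are illustrative, not from the original thread.
class TagAndWhitespaceStripper {
    static String strip(String input) {
        // Remove anything that looks like a tag.
        String noTags = input.replaceAll("<[^>]*>", "");
        List<String> kept = new ArrayList<>();
        // \R matches any line break (Java 8 and later).
        for (String line : noTags.split("\\R")) {
            String trimmed = line.trim();
            // Skip lines that held nothing but a tag or whitespace.
            if (!trimmed.isEmpty()) {
                kept.add(trimmed);
            }
        }
        return String.join("\n", kept);
    }
}
```

Trimming each line also removes leading whitespace, which may or may not be wanted for indented text.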

  • How to remove HTML tags from a String ?

    Hello,
    How can I remove all HTML Tags from a String ?
    Could you please give me a simple example?
    Best regards,
    Eric

    Here's some code I cooked up. I have created an object that does the processing so that it can be incorporated directly into a project. There is some redundancy so that it can be used in more than one way. Depending on your situation you might have to make the loop condition a little more sophisticated to catch stray ">" characters.
    I have also included a Tester application.
    // This removes HTML tags from a String, either by submitting the String during construction and then
    // calling getProcessedString(), or
    // by simply calling "stringWithoutTags = removeHtmlTags(stringWithTagsSubmission);"
    // Note: This code assumes that every "<" is followed by a matching ">" in the proper order.
    public class HtmlTagRemover {
         private String stringSubmission, processedString, stringBeingProcessed;
         private int indexOfTagStart, indexOfTagEnd;

         public HtmlTagRemover() {
         }

         public HtmlTagRemover(String s) {
              removeHtmlTags(s);
         }

         public String removeHtmlTags(String s) {
              stringSubmission = s;
              stringBeingProcessed = stringSubmission;
              removeNextTag();
              return processedString;
         }

         // Removes tags one at a time until no well-ordered "<...>" pair is left.
         private void removeNextTag() {
              checkForNextTag();
              while ((!(indexOfTagStart == -1 || indexOfTagEnd == -1)) && indexOfTagStart < indexOfTagEnd) {
                   removeTag();
                   checkForNextTag();
              }
              processedString = stringBeingProcessed;
         }

         private void checkForNextTag() {
              indexOfTagStart = stringBeingProcessed.indexOf("<");
              indexOfTagEnd = stringBeingProcessed.indexOf(">");
         }

         // Deletes everything from the first "<" up to and including the first ">".
         private void removeTag() {
              StringBuffer sb = new StringBuffer("");
              sb.append(stringBeingProcessed);
              sb.delete(indexOfTagStart, indexOfTagEnd + 1);
              stringBeingProcessed = sb.toString();
         }

         public String getProcessedString() {
              return processedString;
         }

         public String getLastStringSubmission() {
              return stringSubmission;
         }
    }

    // Goes in its own file, HtmlRemovalTester.java
    public class HtmlRemovalTester {
         public static void main(String[] args) {
              String output;
              HtmlTagRemover h = new HtmlTagRemover();
              output = "The processed String: " + h.removeHtmlTags("<Html tag>This is a test<another Html tag> string<yet another Html tag>.");
              output = output + "\n" + " The original string:" + h.getLastStringSubmission();
              System.out.print(output);
         }
    }
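
As a possible answer to the stray ">" caveat mentioned above, here is a minimal single-pass sketch. It is an alternative technique (a small state machine over the characters), not a corrected version of HtmlTagRemover, and the class name is made up for illustration. A ">" that is not closing a tag is simply kept as text.

```java
// Hypothetical alternative: names are illustrative, not from the original post.
class SinglePassTagStripper {
    static String strip(String s) {
        StringBuilder out = new StringBuilder(s.length());
        boolean inTag = false; // true while we are between '<' and '>'
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (inTag) {
                // Skip tag content; a '>' ends the tag.
                if (c == '>') {
                    inTag = false;
                }
            } else if (c == '<') {
                inTag = true;
            } else {
                // Ordinary text, including a '>' outside any tag.
                out.append(c);
            }
        }
        return out.toString();
    }
}
```

For an input such as "2 > 1 <b>true</b>", HtmlTagRemover stops without removing anything because the first ">" comes before the first "<", whereas this sketch should return "2 > 1 true". It still treats every "<" as the start of a tag.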

  • Way to remove HTML tags from a page-scoped attribute using JSTL?

    Hi,
    I'm using JSTL 1.2 with Tomcat 6.0.26. Does anyone know of a way to remove HTML tags from a page attribute, "${myExpr}". I would prefer a solution that uses JSTL only, but ultimately whatever gets the job done is fine with me.
    Thanks, - Dave

    I'm sorry, I don't understand your requirement. What do you mean by "remove HTML tags from a page attribute"?
    If you are dealing with a value of an attribute, it is most likely a String, and should be treated as such. The best approach would probably be java coding.

  • How to remove html tags from a column

    Hi
    Problem is this: I get a column with a comma separated list of id's and I can successfully parse these id's and use them elsewhere. BUT, occasionally there are html tags within that id list like this:
    1082471,1237423<br xmlns="http://www.w3.org/1999/xhtml" />
    Is there a way to just automatically remove all tags from a column? Could do this with regex, but since there is no support, I don't know what to do.

    Hi,
    If the HTML can be detected by a starting symbol like „<“, then you could use the following:
    Unfortunately the operation “ReplaceRange” is only available on a Text-level, so you have to invoke a function (at least to my knowledge). You also need an Index-column in your table, so if you don’t have it, you need to create one as well.
    This is your function:
    let
       fnRemoveHTML = (Value, Index) =>
    let
       Source = Excel.CurrentWorkbook(){[Name="Tabelle1"]}[Content],
       IndeNo = Index,
       Value_ = Source{IndeNo-1}[Value],
       length = Text.Length(Text.From(Value_)),
       position = Text.PositionOf(Text.From(Value_), "<"),
       range = length-position,
       new= if Value_ is number then Value_ else Text.ReplaceRange(Value_, position, range, "")
    in
        new
    in
      fnRemoveHTML
    And this is how you invoke it:
    let
        Quelle = Excel.CurrentWorkbook(){[Name="Tabelle1"]}[Content],
        Last = Table.AddColumn(Quelle, "Custom", each fn_RemoveHTML([Value], [Index])),
        ChangedType = Table.TransformColumnTypes(Last,{{"Custom", type number}})
    in
        ChangedType
    Provided your table is called “Tabelle1” & the column with your values to be replaced “Value” & your index-col “Index”
    Imke

  • How can I eliminate HTML tags from Oracle Text Snippet?

    I perform a search on many tables and on many columns of those tables.
    Some of those columns are VARCHAR2 and some CLOB.
    Also, some of the searchable data are HTML and some are plain text.
    My problem is that ctx_doc.snippet fetches the HTML tags.
    For example I get this, as a snippet result in one of my searches: Qual Germany n1 &lt;p&gt;Test Qual Germany n1&lt;/p&gt;
    I want the result to be fetched without the HTML tags.
    In my index configuration I have used NULL FILTER and HTML_SECTION_GROUP. With that configuration I managed to eliminate the HTML tags but not in all cases!
    For example:
    I search table CONTENTS columns TITLE(VARCHAR2) and MAIN_TEXT(CLOB)
    I created the following procedure that concatenates the two columns:
    CREATE OR REPLACE PROCEDURE CONTENTS_PROC( p_id in rowid, p_lob IN OUT clob)
    IS
    BEGIN
    FOR c1 IN (SELECT main_text||' '||title data FROM contents WHERE ROWID = p_id)
    LOOP
    dbms_lob.copy( p_lob, c1.data,
    dbms_lob.getlength( c1.data ));
    END LOOP;
    END;
    I created a user Datastore:
    BEGIN
    ctx_ddl.create_preference( 'content_trans_datastore', 'user_datastore' );
    ctx_ddl.set_attribute( 'content_trans_datastore', 'procedure', 'CONTENTS_PROC' );
    END;
    and finally I create the index:
    CREATE INDEX content_trans_ot_idx ON contents(ORACLE_TEXT_COLUMN)
    INDEXTYPE IS ctxsys.CONTEXT PARAMETERS ('datastore content_trans_datastore SYNC(ON COMMIT) STORAGE INDEX_STORAGE filter ctxsys.null_filter section group ctxsys.html_section_group');
    When I perform the search on those data: &lt;p&gt; &lt;strong&gt;Test Doc-Test &lt;/strong&gt; &lt;/p&gt; the snippet I get is: Test Doc-Test.
    That's fine, the html tags are removed!
    In another case I search table NCP columns NAME(VARCHAR2) and BODY(VARCHAR2)
    I created the following procedure that concatenates the two columns:
    CREATE OR REPLACE PROCEDURE NCP_PROC( p_id in rowid, p_lob IN OUT clob)
    IS
    BEGIN
    FOR c1 IN (SELECT name||' '||body data FROM ncp WHERE ROWID = p_id)
    LOOP
    dbms_lob.copy( p_lob, c1.data,
    dbms_lob.getlength( c1.data ));
    END LOOP;
    END;
    I created a user Datastore:
    BEGIN
    ctx_ddl.create_preference( 'ncp_trans_datastore', 'user_datastore' );
    ctx_ddl.set_attribute( 'ncp_trans_datastore', 'procedure', 'NCP_PROC' );
    END;
    and finally I create the index:
    CREATE INDEX ncp_trans_ot_idx ON ncp(ORACLE_TEXT_COLUMN)
    INDEXTYPE IS ctxsys.CONTEXT PARAMETERS('datastore ncp_trans_datastore SYNC(ON COMMIT) STORAGE INDEX_STORAGE filter ctxsys.null_filter section group ctxsys.html_section_group');
    When I perform the search on those data: test &lt;strong&gt; &lt;/strong&gt;http://deleteme.com the snippet I get is: test &lt;strong&gt; &lt;/strong&gt;http://deleteme.com!!!!!!!!!!
    How is this possible? Why in the first case the HTML tags are eliminated and in the second case they are not?
    Thanks,
    Margarita
    Edited by: user13312701 on 07-Sep-2010 08:51

    Doing various tests I found out that the problem is when I need to search in multiple columns of a table.
    That is when I create a user_datastore that uses a procedure that concatenates the columns.
    And especially when the data with the html tags is in a VARCHAR2 column.
    e.g
    --Create the table
    CREATE TABLE CONTENT_TRANS (content_trans_id NUMBER,
    main_text CLOB,
    title vARCHAR2(2000),
    oracle_text_column VARCHAR2(1));
    alter table "CONTENT_TRANS" add constraint CONTENT_PK primary key("CONTENT_TRANS_ID") ;
    --Insert dummy data
    Insert into CONTENT_TRANS
    (CONTENT_TRANS_ID,MAIN_TEXT,TITLE)
    values
    (1,'lorem','lorem <p>qualification</p> 2.1 ');
    Insert into CONTENT_TRANS
    (CONTENT_TRANS_ID,MAIN_TEXT,TITLE)
    values
    (2,'lorem','lorem <br>qualification</br> 2.1 ');
    --Create the procedure that concatenates main_text(CLOB) and title(VARCHAR2)
    CREATE OR REPLACE PROCEDURE CONTENT_TRANS_PROC( p_id in rowid, p_lob IN OUT clob)
    IS
    BEGIN
    FOR c1 IN (SELECT main_text||' '||title data FROM content_trans WHERE ROWID = p_id)
    LOOP
    dbms_lob.copy( p_lob, c1.data,
    dbms_lob.getlength( c1.data ));
    END LOOP;
    END;
    --Create the user datastore
    BEGIN
    ctx_ddl.create_preference( 'content_trans_datastore', 'user_datastore' );
    ctx_ddl.set_attribute( 'content_trans_datastore', 'procedure', 'CONTENT_TRANS_PROC' );
    END;
    --Create the index
    CREATE INDEX content_trans_ot_idx ON content_trans(ORACLE_TEXT_COLUMN)
    INDEXTYPE IS ctxsys.CONTEXT PARAMETERS ('datastore content_trans_datastore SYNC(ON COMMIT) filter ctxsys.null_filter section group ctxsys.html_section_group');
    exec ctx_doc.set_key_type('PRIMARY_KEY');
    --Perform the query
    SELECT SCORE(1),ct.content_trans_id, ctx_doc.snippet('content_trans_ot_idx', ct.content_trans_id, 'lorem') as snippet
    from content_trans ct
    where contains(ct.ORACLE_TEXT_COLUMN, 'lorem', 1) > 1;
    Results WITH NOT WANTED HTML TAGS:
    6     1     <b>lorem</b> <b>lorem</b> &lt;p&gt;qualification&lt;/p&gt; 2.1
    6     2     <b>lorem</b> <b>lorem</b> &lt;br&gt;qualification&lt;/br&gt; 2.1
    Edited by: user13312701 on 13-Oct-2010 01:18

  • How to remove html tags from the pdf file ?

    Hello,
    Using BI publisher we are generating a pdf file. In the table, we have data which contains html tags. for example " test1<br> 2.test2<br> 3.test3<br> ".
    In the pdf file we need to get the output like this
    test1
    test2
    test3
    But the output is as follows :"test1<br> 2.test<br> 3.test3<br> "
    Any idea, how these html tags can be removed from the pdf file and obtain the required result?
    Thanks in advance!!
    Archana

    Archana,
    Can you wrap your code in <code> tags (use square brackets rather than angled ones), as the forum software is interpreting the HTML tags; in other words, we can't see what you mean ;)
    In any case, there are a few different options (guessing at what your problem is, without seeing the actual data), you could use htf.escape_sc or replace, regexp_replace etc to substitute the values before you output them to your PDF.
    Hope this helps,
    John.
    Blog: http://jes.blogs.shellprompt.net
    Work: http://www.apex-evangelists.com
    Author of Pro Application Express: http://tinyurl.com/3gu7cd
    REWARDS: Please remember to mark helpful or correct posts on the forum, not just for my answers but for everyone!

  • Remove Html tags from export Crystal report Setting i

    Hi,
    I am using CR 2008, BOXI 3.1, Oracle.
    i) Whenever I export to CSV format, I get HTML tags in the output. How do we remove them?
    ii) My second question: I have a Crystal report given to me by a friend, and whenever I open it I get an error saying UFL u2lgmt.dll is missing.
    I am using a Java version. How do I overcome this?

    hi Venkatesh,
    regarding the first question...you need to replace these tags manually in a formula and have that formula on the report instead of the db field.
    e.g.
    stringvar s:= {your field};
    s:= replace(s, '<div>', '');
    s:= replace(s, '</div>', '');
    you'll have to do this for any potential tag in the field unfortunately.
    please post question 2 as a new discussion as per the forum rules.
    cheers,
    jamie

  • Remove HTML tags from a string

    I have a string that contains a couple of HTML or XHTML tag, for example
    lv_my_string = '<p style="something">Hello <strong>World</strong>!</p>'.
    For a special use case, I want to remove all HTML from that string and process only the plain text
    lv_my_new_string = 'Hello World!'.
    Is there any method, function module, XSLT or anything else for that already?

    Hi Daniel,
    I tried using the FM (SWA_STRING_REMOVE_SUBSTRING) but I guess it is expecting a particular pattern which is not so apparent in your case. I've written a small piece of code which you can try using in a FM or a PERFORM and that should do the trick. Please let me know if you have any questions.
    PARAMETER: P_LINE(100).
    TYPES: BEGIN OF TY_LINE,
             LINE(100),
           END OF TY_LINE.
    DATA: T_LINE TYPE STANDARD TABLE OF TY_LINE,
          WA_LINE LIKE LINE OF T_LINE.
    DATA: W_LINE(100),
          W_LEN(100),
          W_COUNT TYPE I,
          W_FLAG,
          W_FLAG1,
          W_I TYPE I.
    W_COUNT = STRLEN( P_LINE ).
    DO W_COUNT TIMES.
      IF P_LINE+W_I(1) = '<'.
        W_FLAG = 1.
        W_I = W_I + 1.
        IF NOT WA_LINE-LINE IS INITIAL.
          APPEND WA_LINE-LINE TO T_LINE.
          CLEAR WA_LINE.
        ENDIF.
        CONTINUE.
      ELSEIF P_LINE+W_I(1) = '>'.
        W_FLAG = 0.
        W_I = W_I + 1.
        CONTINUE.
      ENDIF.
      IF W_FLAG = 1.
        W_I = W_I + 1.
        CONTINUE.
      ELSE.
        CONCATENATE WA_LINE-LINE P_LINE+W_I(1) INTO WA_LINE-LINE.
        W_I = W_I + 1.
      ENDIF.
    ENDDO.
    LOOP AT T_LINE INTO WA_LINE.
      CONCATENATE W_LINE WA_LINE-LINE INTO W_LINE SEPARATED BY SPACE.
    ENDLOOP.
    SHIFT W_LINE LEFT DELETING LEADING SPACE.
    WRITE: W_LINE.
    Input:
    <p style="something">Hello <strong>World</strong>!</p>
    Output:
    HELLO WORLD !
    Regards,
    Pritam

  • Removing index.html from home page / remove .html tags from all other pages

    Hello All,
    I was wondering if there is a way to remove the index.html from my main page, so that www.mypage.com/index.html is simply www.mypage.com/. This also applies to all the other pages in the site, but only for the .html extension: www.mypage.com/contact.html becomes www.mypage.com/contact. Almost all sites I know do this, but I can't figure out the technique beyond working some crazy voodoo with Apache.
    Thanks in advance for your help!

    if 'index.html' is in your HOST server's default filename list, then you can omit its name from any link, e.g.,
    <a href="/">Home</a>
    That link will cause the server to load the default file found in the root folder of the site.
    This is actually the preferred way to link to your home page.

  • Remove HTML tags from String

    It sounded pretty easy..
    I know the first part of my String is <html><center><p> and the last part is </p></center></html>
    but for saving it into the database I want to remove the HTML parts..
    I know how many characters the HTML stuff is.. I know what the HTML is.. how do I remove it from my string?

    If your String always starts with <html><center> and always ends with </center></html> you can use:
         int startLength = "<html><center>".length();
         int endLength = "</center></html>".length();
         String withoutHtml = myString.substring(startLength, myString.length() - endLength);
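
A slightly more defensive variant of the same idea is sketched below. It only cuts when the string really has the expected wrapper, and it takes the wrapper as parameters so that the full prefix and suffix from the question (including the <p> and </p>) can be passed in. The class and method names are hypothetical.

```java
// Hypothetical helper: names are illustrative, not from the original thread.
class WrapperStripper {
    // Returns s without the given prefix and suffix, or s unchanged
    // if it does not both start with prefix and end with suffix.
    static String unwrap(String s, String prefix, String suffix) {
        boolean longEnough = s.length() >= prefix.length() + suffix.length();
        if (longEnough && s.startsWith(prefix) && s.endsWith(suffix)) {
            return s.substring(prefix.length(), s.length() - suffix.length());
        }
        return s;
    }
}
```

For example, WrapperStripper.unwrap(myString, "<html><center><p>", "</p></center></html>") should leave only the text between the wrappers, and returns myString as-is when the wrapper is missing.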

  • Remove HTML tags in text

    Hi,
    I have to read some text from a text editor, that can be formatted for example with Bold, which means that when I execute the function to read its content, it returns something like this:
    Do you know how can I remove these HTML tags from the text?
    Thanks in advance.
    Regards,
    Sónia Gonçalves

    Hi,
    Something like this should do the trick.
    report  ztag.
    data: v_data type char30 value '<H>blablabla</H>'.
    if v_data(1) = '<' and
      v_data cs '>'.
    * Remove the HTML opening header
      shift v_data left up to '>'.
      shift v_data left.
    * Remove the HTML closing header
      shift v_data right up to '<'.
      shift v_data right.
      shift v_data left deleting leading space.
    endif.
    write: / v_data.
    Regards,
    Darren

  • Remove HTML Tags and parse the text out of it

    Hi All -
    I had a text file with all the HTML Tags on it. I want to parse text out of it. Is there any package available to remove all the HTML Tags from the text.
    For example
    <HTML><BODY bgColor=#ffffff> This is the text i want to parse.</BODY></HTML>
    The result would be: This is the text I want to parse.
    The text can be very long and can have many different HTML tags. I cannot use REPLACE because there can be many more tags than I anticipated.
    Please respond as soon as possible..Thanks for all your help!!
    Anuj Sharma

    Thank you all, but my content is HTML only, not XML, and it is another application that saves it in the table:
    <html><head><title>Aprovação de ARC</title></head><body><font face=arial size=2><b>974-17016/ugadiego-2013</b></font><br><br><table border=0><tr><td><b><font face=arial size=1>Data da Abertura</font></b></td>    <td><font face=arial size=1>8/3/2013</font></td><tr><td><b><font face=arial size=1>Quebra Produtividade</font></b></td>    <td><font face=arial size=1>Sim</font></td><tr><td><b><font face=arial size=1>Quantidade</font></b></td>    <td><font face=arial size=1>17,5</font></td><tr><td><b><font face=arial size=1>Valor</font></b></td>    <td><font face=arial size=1>R$ 17496</font></td><tr><td><b><font face=arial size=1>Forma de Indenização</font></b></td>    <td><font face=arial size=1>Nota de Crédito</font></td><tr><td><b><font face=arial size=1>Observação</font></b></td>    <td><font face=arial size=1>Evidenciado a não conformidade do produto em visita a cliente pela assessoria agronômica e qualidade.
    Produto apresenta-se empedrado com desuniformidade de grânulos e por consequência geração de finos e falha de óleo.
    Produto expedido com GDAP.
    Bonificar o cliente em 10% do valor da compra = R$ 17.496,00 ou em toneladas e fertilizantes  que podem ficar em forma de crédito para o cliente retirar em fertilizante para o plantio  da soja. Conforme relatório do Sr. Ademilson Palharin em anexo.</font></td><tr><td><b><font face=arial size=1>Centro de Custo</font></b></td>    <td><font face=arial size=1>CAS1I4671 - MISTURA E ENSAQUE I                     </font></td></table><hr><font face=arial size=2><b>Favor incluir uma Observação (Se necessário) e selecionar o botão desejado para aprovar ou reprovar essa Indenização.</b></font><FORM ACTION='http://10.176.10.123/pgAprovaARCServidor.asp' METHOD='GET' ><font face=arial size=2><div>Observações:</div><textarea name='txtObs' rows='4' cols='60' maxlength='4000'></textarea><br><br><div><input type='submit' value='Aprovar'  name='acao'> <input type='submit' value='Reprovar' name='acao'></div></font><br><hr><font face=arial size=2 >Essa é uma mensagem automática.<br>Favor não responder esse email</font><hr><input type='hidden' name='cdARC' value='17016' ><input type='hidden' name='cdSeq' value='1' ><input type='hidden' name='cdFase' value='Indenizacao' ><input type='hidden' name='dsResp' value='ustrenat' ><input type='hidden' name='dsCargo' value='Vice Presidência' ><input type='hidden' name='dsSolic' value='LESIANE CIESLAK' ><input type='hidden' name='index' value='3' ><input type='hidden' name='rowatu' value='3' ></FORM></body></html>using oracle 9.2.08
    Edited by: muttleychess on Mar 19, 2013 11:36 AM

  • How to exclude HTML Tags from Excel Reports

    Hi Guys
    Within Project Online - OData extract to Excel
    Has anyone found a way to eliminate the HTML tags from Multi Line Text fields within Project Server? I can easily extract the text and generate nice Excel Reports, but the html tag is very annoying in the Excel Reports and it doesn't read easily.
    Any help would be appreciated.
    Marc Soester [MVP] http://marcsoester.blogspot.com

    Marc, 
    What you could do (given that you find the required time and energy to write the lines),
    would be to replace all (!) html characters like here (http://stackoverflow.com/questions/14705605/remove-html-tags-from-cell-strings-excel-formula -
    this is one of the Excel UDF/VB-based solutions, but will not refresh in Excel Services - however there is a good list of what to replace) with PowerQuery.
    That would at least refresh over a PowerBI subscription.
    -Ville
