How to index words containing letters as html entities?

The title says it all.
I am currently replacing known HTML entities with their Unicode counterparts upfront, but I was wondering if some built-in feature in Oracle Text could do the same and save an additional headache.
Indexed html entities anyone?
Thank you
Flavio
http://oraclequirks.blogspot.com
http://www.yocoya.com

You can write your own procedure that does the conversion by whatever method you like, reference that procedure in a PROCEDURE_FILTER preference, and then name that filter in your index parameters.  In the example below, I have borrowed a strip_html function from
http://www.supermanhamuerto.com/doku.php?id=oracle:fixhtml
and called it from the procedure.
SCOTT@orcl12c_11gR2> set scan off
SCOTT@orcl12c_11gR2> -- table, data, and lexer:
SCOTT@orcl12c_11gR2> create table example (t varchar2(4000))
  2  /
Table created.
SCOTT@orcl12c_11gR2> insert all
  2  into example values ('crónicas y relatos')
  3  into example values ('cr&oacute;nicas y relatos')
  4  into example values ('CR&Oacute;nicas y Relatos de M&eacute;xico')
  5  into example values ('Crónicas y relatos de México')
  6  select * from dual
  7  /
4 rows created.
SCOTT@orcl12c_11gR2> begin
  2    ctx_ddl.create_preference ('mylex', 'BASIC_LEXER');
  3    ctx_ddl.set_attribute ( 'mylex', 'base_letter', 'YES');
  4  end;
  5  /
PL/SQL procedure successfully completed.
SCOTT@orcl12c_11gR2> -- function from http://www.supermanhamuerto.com/doku.php?id=oracle:fixhtml
SCOTT@orcl12c_11gR2> CREATE OR REPLACE FUNCTION strip_html(dirty IN clob,
  2                         to_cvs IN NUMBER DEFAULT 0)
  3    RETURN clob IS OUT clob ;
  4 
  5    TYPE arr_string IS varray (200) OF VARCHAR2(64);
  6 
  7    entities_search_for arr_string;
  8    entities_replace arr_string;
  9    cont NUMBER;
10 
11  BEGIN
12 
13 
14  -- to accelerate the issue
15  IF dirty IS NULL THEN
16      RETURN dirty;
17  END IF; -- isnull(dirty)
18 
19  IF LENGTH( dirty ) = 0 THEN
20      RETURN dirty;
21  END IF; -- length(dirty)
22 
23  entities_search_for := arr_string(
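24  '&excl;',
25  '&num;',
26  '&dollar;',
27  '&percnt;',
28  '&amp;',
29  '&quot;',
30  '&lpar;',
31  '&rpar;',
32  '&ast;',
33  '&plus;',
34  '&comma;',
35  '&hyphen;',
36  '&period;',
37  '&sol;',
38  '&colon;',
39  '&semi;',
40  '&lt;',
41  '&equals;',
42  '&gt;',
43  '&quest;',
44  '&commat;',
45  '&lsqb;',
46  '&bsol;',
47  '&rsqb;',
48  '&circ;',
49  '&lowbar;',
50  '&grave;',
51  '&lcub;',
52  '&verbar;',
53  '&rcub;',
54  '&tilde;',
55  '&nbsp;',
56  '&iexcl;',
57  '&cent;',
58  '&pound;',
59  '&curren;',
60  '&yen;',
61  '&brvbar;',
62  '&sect;',
63  '&uml;',
64  '&copy;',
65  '&ordf;',
66  '&laquo;',
67  '&not;',
68  '&shy;',
69  '&reg;',
70  '&macr;',
71  '&deg;',
72  '&plusmn;',
73  '&sup2;',
74  '&sup3;',
75  '&acute;',
76  '&micro;',
77  '&para;',
78  '&middot;',
79  '&cedil;',
80  '&sup1;',
81  '&ordm;',
82  '&raquo;',
83  '&frac14;',
84  '&frac12;',
85  '&frac34;',
86  '&iquest;',
87  '&Agrave;',
88  '&Aacute;',
89  '&Acirc;',
90  '&Atilde;',
91  '&Auml;',
92  '&Aring;',
93  '&AElig;',
94  '&Ccedil;',
95  '&Egrave;',
96  '&Eacute;',
97  '&Ecirc;',
98  '&Euml;',
99  '&Igrave;',
100  '&Iacute;',
101  '&Icirc;',
102  '&Iuml;',
103  '&ETH;',
104  '&Ntilde;',
105  '&Ograve;',
106  '&Oacute;',
107  '&Ocirc;',
108  '&Otilde;',
109  '&Ouml;',
110  '&times;',
111  '&Oslash;',
112  '&Ugrave;',
113  '&Uacute;',
114  '&Ucirc;',
115  '&Uuml;',
116  '&Yacute;',
117  '&THORN;',
118  '&szlig;',
119  '&agrave;',
120  '&aacute;',
121  '&acirc;',
122  '&atilde;',
123  '&auml;',
124  '&egrave;',
125  '&eacute;',
126  '&ecirc;',
127  '&etilde;',
128  '&euml;',
129  '&igrave;',
130  '&iacute;',
131  '&icirc;',
132  '&itilde;',
133  '&iuml;',
134  '&ograve;',
135  '&oacute;',
136  '&ocirc;',
137  '&otilde;',
138  '&ouml;',
139  '&ugrave;',
140  '&uacute;',
141  '&ucirc;',
142  '&utilde;',
143  '&uuml;');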
144 
145  entities_replace := arr_string(
146  '¡',
147  'º',
148  '$',
149  '%',
150  '&',
151  '"',
152  '(',
153  ')',
154  '*',
155  '+',
156  ',',
157  '-',
158  '.',
159  'Sol',
160  'Colon',
161  '*',
162  '<',
163  '=',
164  '>',
165  '?',
166  ',',
167  '*',
168  '*',
169  '*',
170  '*',
171  '_',
172  '''',
173  '*',
174  '*',
175  '*',
176  '''',
177  ' ',
178  '¡',
179  'cent',
180  'L',
181  '*',
182  'Y',
183  '*',
184  '*',
185  '.',
186  '(c)',
187  '*',
188  '*',
189  '!',
190  '*',
191  '(r)',
192  '*',
193  '*',
194  '*',
195  '*',
196  '*',
197  'á',
198  'u',
199  '*',
200  '·',
201  'ç',
202  '*',
203  '*',
204  '*',
205  '*',
206  '*',
207  '*',
208  '¿',
209  'È',
210  'Á',
211  'Ä',
212  'Á',
213  '*',
214  '*',
215  'AE',
216  '*',
217  'È',
218  'É',
219  '*',
220  '*',
221  'Ì',
222  'Í',
223  'Î',
224  '*',
225  '*',
226  'N',
227  'Ò',
228  'Ó',
229  'Ô',
230  'O',
231  '*',
232  '*',
233  'O',
234  'Ù',
235  'Ú',
236  'Û',
237  '*',
238  '*',
239  '*',
240  '*',
241  'à',
242  'á',
243  'â',
244  'a',
245  '*',
246  'è',
247  'é',
248  'ê',
249  'e',
250  '*',
251  'ì',
252  'í',
253  'î',
254  'i',
255  '*',
256  'ò',
257  'ó',
258  'ô',
259  'o',
260  '*',
261  'ù',
262  'ú',
263  'û',
264  'u',
265  '*');
266 
267    OUT := dirty;
268 
269    -- replace what is enclosed between <xml> and </xml>
270    -- *? -> lazy star (catches the minimum possible)
271    OUT := regexp_replace(OUT, '<xml>.*?</xml>', '', 1, 0, 'ni' );
272    -- clean what it is inside the style tags
273    OUT := regexp_replace(OUT, '<style>.*?</style>', '', 1, 0, 'ni' );
274 
275    IF to_cvs = 2 THEN
276        -- sanitize (not clean) the html
277 
278        -- clean the tag <?xml:whatever>
279        OUT := regexp_replace(OUT, '<\?xml:.*?>', '', 1, 0, 'ni');
280        -- clean the tags <img whatever>
281        OUT := regexp_replace(OUT, '<img.*?>', '', 1, 0, 'ni');
282        -- clean comments
283        OUT := regexp_replace(OUT,'<!--.*?-->','', 1, 0, 'ni');
284        -- clean meta
285        OUT := regexp_replace(OUT,'<meta.*?>','', 1, 0, 'ni');
286        -- clean link
287        OUT := regexp_replace(OUT,'<link.*?>','', 1, 0, 'ni');
288        -- clean DIV
289        OUT := regexp_replace(OUT,'</?div.*?>','', 1, 0, 'ni');
290        -- clean SPAN
291        OUT := regexp_replace(OUT,'</?span.*?>','', 1, 0, 'ni');
292        -- clean "class inside tags"
293        OUT := regexp_replace(OUT,'(<.*?)class="?[a-zA-Z0-9-_]*"?(.*?>)', '\1\2', 1, 0, 'ni');
294        -- clean "style" inside the following tags: i b p
295        OUT := regexp_replace(OUT,'(<[ibp] .*?)style=".*?"(.*?>)', '\1\2', 1, 0, 'ni');
296        -- clean namespaces <o:p> </o:p>
297        OUT := regexp_replace(OUT, '(<)[a-zA-Z0-9-_]*:(.*?>)', '\1\2', 1, 0, 'ni');
298        OUT := regexp_replace(OUT, '(</)[a-zA-Z0-9-_]*:(.*?>)', '\1\2', 1, 0, 'ni');
299 
300        -- clean empty opening and closing tags: it has to be
301        -- passed twice or three times to clean things like this:
302        -- <strong><u></u></strong>
303        -- TWEAK: <p></p> must be replaced by <br/>
304        OUT := regexp_replace(OUT,'<p></p>','<br/>', 1, 0, 'ni');
305        OUT := regexp_replace(OUT,'<([a-zA-Z0-9-_]*)></\1>','', 1, 0, 'ni');
306        -- TWEAK: <p></p> must be replaced by <br/>
307        OUT := regexp_replace(OUT,'<p></p>','<br/>', 1, 0, 'ni');
308        OUT := regexp_replace(OUT,'<([a-zA-Z0-9-_]*)></\1>','', 1, 0, 'ni');
309 
310    ELSE
311       -- clean html
312 
313       -- replace all the stuff that is similar to a carriage return
314       OUT := regexp_replace(OUT, '</p[^>]*>',CHR(10)||CHR(13));
315       OUT := regexp_replace(OUT, '</br[^>]*>',CHR(10)||CHR(13));
316       OUT := regexp_replace(OUT, '</tr[^>]*>',CHR(10)||CHR(13));
317 
318       -- replace all the remaining html stuff
319       OUT := regexp_replace(OUT,'<[^>]*>','', 1, 0, 'ni');
320 
321       -- replace all the entities
322       FOR cont IN 1..entities_search_for.COUNT LOOP
323         OUT := REPLACE( OUT, entities_search_for(cont), entities_replace(cont) );
324       END LOOP;
325 
326       -- cleaning for export to cvs
327       IF to_cvs = 1 THEN
328          OUT := REPLACE( OUT, CHR(10), '' );
329          OUT := REPLACE( OUT, CHR(13), '' );
330          OUT := REPLACE( OUT, CHR(9), '' );
331          OUT := REPLACE( OUT, ';', ',' );
332          OUT := REPLACE( OUT, '"', '''' );
333       END IF;
334 
335 
336    END IF;
337 
338 
339    RETURN(OUT);
340  END strip_html;
341  /
Function created.
SCOTT@orcl12c_11gR2> -- procedure that uses function:
SCOTT@orcl12c_11gR2> create or replace procedure normalize
  2    (p_input  in           clob,
  3      p_output in out nocopy clob)
  4  as
  5  begin
  6    p_output := strip_html (p_input);
  7  end normalize;
  8  /
Procedure created.
SCOTT@orcl12c_11gR2> -- filter that uses procedure:
SCOTT@orcl12c_11gR2> begin
  2    ctx_ddl.create_preference ('myfilt', 'procedure_filter');
  3    ctx_ddl.set_attribute ('myfilt', 'procedure', 'normalize');
  4    ctx_ddl.set_attribute ('myfilt', 'input_type', 'clob');
  5    ctx_ddl.set_attribute ('myfilt', 'output_type', 'clob');
  6  end;
  7  /
PL/SQL procedure successfully completed.
SCOTT@orcl12c_11gR2> -- index that uses filter:
SCOTT@orcl12c_11gR2> create index myindex on example(t) indextype is ctxsys.context
  2  parameters ('FILTER myfilt LEXER mylex')
  3  /
Index created.
SCOTT@orcl12c_11gR2> -- tokens indexed:
SCOTT@orcl12c_11gR2> select token_text from dr$myindex$i
  2  /
TOKEN_TEXT
CRONICAS
DE
MEXICO
RELATOS
Y
5 rows selected.
SCOTT@orcl12c_11gR2> -- searches:
SCOTT@orcl12c_11gR2> select * from example where contains (t, 'cronicas') > 0
  2  /
T
crónicas y relatos
cr&oacute;nicas y relatos
CR&Oacute;nicas y Relatos de M&eacute;xico
Crónicas y relatos de México
4 rows selected.
SCOTT@orcl12c_11gR2> select * from example where contains (t, 'Mexico') > 0
  2  /
T
CR&Oacute;nicas y Relatos de M&eacute;xico
Crónicas y relatos de México
2 rows selected.
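
If you keep doing the replacement upfront, outside Oracle Text, as the original question describes, the same normalization can be sketched in a few lines. The example below is Python rather than PL/SQL and is only an illustration under that assumption: the standard-library html.unescape decodes the entities, and Unicode decomposition drops the accents, which is roughly the combined effect of the procedure filter and the base_letter lexer attribute above. The function name normalize_for_index is made up for this sketch.

```python
import html
import unicodedata


def normalize_for_index(text: str) -> str:
    """Decode HTML entities, strip accents, and upper-case the result."""
    # Step 1: turn entities such as '&oacute;' into real characters ('ó').
    decoded = html.unescape(text)
    # Step 2: decompose accented letters into base letter + combining mark.
    decomposed = unicodedata.normalize("NFD", decoded)
    # Step 3: drop the combining marks (Unicode category Mn).
    base = "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")
    return base.upper()


# Hypothetical usage with two of the sample rows from the example table.
for row in ("cr&oacute;nicas y relatos",
            "CR&Oacute;nicas y Relatos de M&eacute;xico"):
    print(normalize_for_index(row))
```

Run against the sample rows, this produces upper-case, accent-free words comparable to the tokens listed above, whether a row was stored with entities or with real accented characters.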

Similar Messages

  • How to display word document as a HTML document in portal

    Hi
    I developed a portal application in which I display Word documents using IFrames. From my application I upload Word documents and save them in KM.
    The requirement is that I need to provide an option to show these Word documents as HTML.
    Thanks
    Rudradev.

    Hi Glenn Mendonca
    I think it is possible.
    Even the SDN search results page has an "HTML version" link for viewing a found document as HTML.
    See the URL below:
    https://www.sdn.sap.com/irj/servlet/prt/portal/prtroot/com.sap.km.cm.highlightedcontent?documenturi=%2fforums%2fsdn_forums%2fthread%7b74687265616449443d39313633%7d.jspa
    I hope that passing the document as a query parameter to com.sap.km.cm.highlightedcontent?DocumentUri=
    might solve this.
    Please give your inputs.
    Thanks

  • How to open a Word document from an HTML container URL? I am getting internal_error while opening the file

    Hi all,
    Using the code below, I am able to download a Word document from an HTML container. But before downloading, I need to edit the document to add a macro.
    I am unable to open the Word document.
    CREATE OBJECT G_HTML_CONTAINER
          EXPORTING
            CONTAINER_NAME = 'PDF'.
       CREATE OBJECT G_HTML_CONTROL
          EXPORTING
            PARENT = G_HTML_CONTAINER.
    * Convert xstring to binary table to pass to the LOAD_DATA method
        CALL FUNCTION 'SCMS_XSTRING_TO_BINARY'
          EXPORTING
            BUFFER     = LV_CONTENT
          TABLES
            BINARY_TAB = LT_DATA.
    data_tab = LT_DATA ).
    * Load the HTML
        CALL METHOD G_HTML_CONTROL->LOAD_DATA(
           EXPORTING
             TYPE         = 'application'
             SUBTYPE      = 'DOC'
           IMPORTING
             ASSIGNED_URL         = LV_URL
           CHANGING
             DATA_TABLE           = LT_DATA
           EXCEPTIONS
             DP_INVALID_PARAMETER = 1
             DP_ERROR_GENERAL     = 2
             CNTL_ERROR           = 3
             OTHERS               = 4 ).
    * Show it
        CALL METHOD G_HTML_CONTROL->SHOW_URL( URL = LV_URL
          IN_PLACE = 'X' ).
    Before the file download I want to edit the file; for that I am using the class below.
    PROXY TYPE REF TO I_OI_DOCUMENT_PROXY.
    I get the proxy by using the get_document_proxy method:
    call method control->get_document_proxy
    exporting
    document_format   = 'x'
    document_type      = 'Word.Document'
    register_container  = 'x'
    importing
    document_proxy     = proxy
    After this:
    call method proxy->open_document
    exporting
    document_url    = lv_url
    open_inplace    = 'x'
    Here I am getting the retcode INTERNAL ERROR.
    Can anyone suggest how to open the document from the HTML container?
    Thanks and regards,
    jogu yadav

    I found the resolution.
    Thanks and regards,
    Jogu yadav

  • How to get index words to open in a "new window"

    I am using RoboHelp 7, WebHelp.
    I want some of my index words to link to www and NOT a topic. I can do this, but when you click on the index word it opens the link in the RoboHelp pane and NOT a new window. I thought I figured out how to fix this by right-clicking on my index word, choosing Properties, Advanced, and clicking on the down arrow next to Frame and choosing New Window. I was so proud of myself for figuring this out. After I published my site, this still won't open the link in a new window. Does someone know if this is a known bug or am I doing something wrong? Thanks for your help.

    Hi there
    I just tested and the settings you mentioned caused my own WebHelp to open the link in a new window (well, it appeared as a new tab in FF, but it worked there). I also tested in IE and it worked in that as well.
    Have you tested from different machines? Perhaps it's a configuration setting on your PC?
    Cheers... Rick

  • How to create a word container

    Hello All,
    Can anyone tell me how to create a Word container?
    Thanks and regards,
    Lisa

    Hi,
    check this program :
    SAPRDEMOWORD97INTEGRATION
    Regards
    Appana

  • How to convert MS Word to XML or Html

    I have a job that requires converting MS Word documents to XML or HTML. Does anyone have good ideas for doing this? Would you be willing to share them with me?
    Thanks in advance.
    Jackal

    Despite the opinions expressed by others, simply "renaming" the file will NOT "convert" it from a MS Word format into an XML format.
    As far as I am aware, the only way to do this is via the APIs that Microsoft provide. Any other mechanism you find will either be doing this, or they will be "guessing" about the construct of the file.
    Unfortunately, the APIs made available by Microsoft are primarily C++ (COM) components. So, you CAN achieve your desired result by creating a C++ "wrapper" around the MS Office COM components and exposing this wrapper to JNI. The exposure to JNI is actually quite simple, however the C++ wrapper itself (that calls the COMponents) will be a bit more challenging.
    I hope I am wrong, because I need to do the same thing and am dreading having to delve into C++ land!

  • How can I make CONTAINS query work for a date range

    In either 9i or 10g (eventual). I have a CONTEXT index that contains multiple columns from multiple tables and using a USER_DATASTORE. E.g., I have names that come from 3 different table locations and dates that come from 4. I can index them fine but how can I perform a single consolidated CONTAINS query against the single CONTEXT index to do the following:
    smith WITHIN lname AND john WITHIN fname AND dob BETWEEN '19870315' and '19970315'
    I know that I can use a mixed query but this is inefficient (esp since I have birth dates in multiple tables). Is there any algorithm for a range operator (>, <, between?) within the CONTAINS operator?
    CTXCAT index is not an option, as I have many text columns I am searching.
    Thanks!

    When you run the cdstore.sql, in addition to creating the ctx_cd package, it also creates the friedman package that contains the algorithm that the ctx_cd package uses. You could use the functions from that friedman package in your procedure for your user_datastore and in the creation of your query string, as demonstrated below.
    SCOTT@orcl_11g> CREATE OR REPLACE PROCEDURE my_proc
      2    (p_rowid IN           ROWID,
      3       p_clob     IN OUT NOCOPY CLOB)
      4  AS
      5  BEGIN
      6    FOR r IN
      7        (SELECT emp.ename, emp.job, emp.hiredate, dept.dname
      8         FROM      emp, dept
      9         WHERE  emp.deptno = dept.deptno
    10         AND      emp.ROWID = p_rowid)
    11    LOOP
    12        DBMS_LOB.WRITEAPPEND (p_clob, 7, '<ename>');
    13        DBMS_LOB.WRITEAPPEND (p_clob, LENGTH (r.ename), r.ename);
    14        DBMS_LOB.WRITEAPPEND (p_clob, 8, '</ename>');
    15        DBMS_LOB.WRITEAPPEND (p_clob, 5, '<job>');
    16        DBMS_LOB.WRITEAPPEND (p_clob, LENGTH (r.job), r.job);
    17        DBMS_LOB.WRITEAPPEND (p_clob, 6, '</job>');
    18        DBMS_LOB.WRITEAPPEND (p_clob, 7, '<dname>');
    19        DBMS_LOB.WRITEAPPEND (p_clob, LENGTH (r.dname), r.dname);
    20        DBMS_LOB.WRITEAPPEND (p_clob, 8, '</dname>');
    21        DBMS_LOB.WRITEAPPEND (p_clob, 10, '<hiredate>');
    22        -- apply friedman algorithm to date column ------------------
    23        friedman.init
    24          (TO_NUMBER (TO_CHAR (TO_DATE (19000101, 'YYYYMMDD'), 'J')),
    25           TO_NUMBER (TO_CHAR (TO_DATE (21001231, 'YYYYMMDD'), 'J')));
    26        DBMS_LOB.WRITEAPPEND
    27          (p_clob,
    28           LENGTH (friedman.encodedate (r.hiredate)),
    29           friedman.encodedate (r.hiredate));
    30        --------------------------------------------------------------
    31        DBMS_LOB.WRITEAPPEND (p_clob, 11, '</hiredate>');
    32    END LOOP;
    33  END my_proc;
    34  /
    Procedure created.
    SCOTT@orcl_11g> SHOW ERRORS
    No errors.
    SCOTT@orcl_11g> BEGIN
      2    CTX_DDL.CREATE_PREFERENCE ('my_datastore', 'USER_DATASTORE');
      3    CTX_DDL.SET_ATTRIBUTE      ('my_datastore', 'PROCEDURE', 'my_proc');
      4  END;
      5  /
    PL/SQL procedure successfully completed.
    SCOTT@orcl_11g> CREATE INDEX my_index ON emp (ename)
      2  INDEXTYPE IS CTXSYS.CONTEXT
      3  PARAMETERS
      4    ('DATASTORE     my_datastore
      5        SECTION GROUP CTXSYS.AUTO_SECTION_GROUP')
      6  /
    Index created.
    SCOTT@orcl_11g> EXEC DBMS_STATS.GATHER_TABLE_STATS (USER, 'DEPT')
    PL/SQL procedure successfully completed.
    SCOTT@orcl_11g> EXEC DBMS_STATS.GATHER_TABLE_STATS (USER, 'EMP')
    PL/SQL procedure successfully completed.
    SCOTT@orcl_11g> VARIABLE cstring VARCHAR2(4000)
    SCOTT@orcl_11g> BEGIN
      2    :cstring := 'smith WITHIN ename';
      3    :cstring := :cstring || ' AND ' || 'clerk WITHIN job';
      4    :cstring := :cstring || ' AND ' || 'research WITHIN dname';
      5    -- apply friedman algorithm to search criteria ---------------------------
      6    friedman.init
      7        (TO_NUMBER (TO_CHAR (TO_DATE (19000101, 'YYYYMMDD'), 'J')),
      8         TO_NUMBER (TO_CHAR (TO_DATE (21001231, 'YYYYMMDD'), 'J')));
      9    :cstring := :cstring || ' AND ((' ||
    10               friedman.integercontainscriteria
    11                 (TO_NUMBER (TO_CHAR (TO_DATE ('19800315', 'YYYYMMDD'), 'J')),
    12                  TO_NUMBER (TO_CHAR (TO_DATE ('19810315', 'YYYYMMDD'), 'J')),
    13                  'B')
    14               || ') WITHIN hiredate)';
    15    ---------------------------------------------------------------------------
    16  END;
    17  /
    PL/SQL procedure successfully completed.
    SCOTT@orcl_11g> SET AUTOTRACE ON EXPLAIN
    SCOTT@orcl_11g> SELECT *
      2  FROM   emp
      3  WHERE  CONTAINS (ename, :cstring) > 0
      4  /
         EMPNO ENAME      JOB              MGR HIREDATE         SAL       COMM     DEPTNO
          7369 SMITH      CLERK           7902 17-DEC-80        800                    20
    Execution Plan
    Plan hash value: 1887222286
    | Id  | Operation                   | Name     | Rows  | Bytes | Cost (%CPU)| Time     |
    |   0 | SELECT STATEMENT            |          |     1 |    37 |     4   (0)| 00:00:01 |
    |   1 |  TABLE ACCESS BY INDEX ROWID| EMP      |     1 |    37 |     4   (0)| 00:00:01 |
    |*  2 |   DOMAIN INDEX              | MY_INDEX |       |       |     4   (0)| 00:00:01 |
    Predicate Information (identified by operation id):
       2 - access("CTXSYS"."CONTAINS"("ENAME",:CSTRING)>0)
    SCOTT@orcl_11g>

  • Why does text I copy from a .pdf to a Word document, not copy correctly words containing the combinations 'tt' or 'ti'?

    When I copy text from a .pdf to a Word document, all words containing the combination 'tt' or 'ti' come up as blank boxes or question marks. How can I correct this?

    To comment a bit further, several letter combinations like tt and ti as well as many combinations with f are given as a single character in some languages and apparently you do not have that special character on your system.

  • Mapping Styles from Word 2010 to RoboHelp HTML 10

    Hello all,
    I have created customized styles for myself and my team to use for authoring content, which I then import to RoboHelp, as I am the only individual who understands RoboHelp and HTML/CSS. I created a CSS style sheet in RoboHelp whose style names match the names of the Word styles. When I go to import the files in RoboHelp, I am running into an issue with paragraph tags and heading tags.
    When I import a file from a Word document, I have not found a way to map a <p> tag that is parsed from a Word document to an <h> tag, as heading tags do not appear as a selectable item in the Conversion Settings  window when importing Word files. I would be able to keep the formatting if I left the headings as <p> tags; however, it is my understanding that RoboHelp 10 search relies heavily on the heading tags for weighting search results. I have about 400 topics, and search is used heavily by those using my documentation. I would prefer to keep the <h> tags in place in the RoboHelp files and retain the search functionality, if at all possible, while finding an easier import option from Word files.
    At this point, I have to manually change the HTML for each <p> tag to match the correct heading style. While Find and Replace works well for this, I would prefer to have a seamless import from Word (which bloats an HTML file horrendously, I know) by mapping from one style to the next. Has anyone found a way to map a <p> tag to an <h> tag so that the search functionality does not suffer? Or am I just approaching this the wrong way?

    You are correct in your assumption that I am referring to paragraph styles when speaking of <p> tags. The paragraph styles come over as <p> tags, and I was just thinking of my style sheet. Sorry for the confusion.
    Speaking of my workflow, I'm attempting to map the styles for one topic at a time when importing a new Word file. I set up a test project to test out my style sheet, as I am upgrading from RoboHelp 7 right now. I need to get one document to map correctly to save the settings for future imports.
    From the Import window, I click the Edit... button in the Word Document section. After RoboHelp scans the Word file, a list of all the styles from the Word document display. When I attempt to select from the available styles I want to map to from my RoboHelp style sheet, I don't have the option to select the heading styles I created in my RoboHelp CSS file. Does this provide you enough information to give you context on how I am attempting to map the styles?
    I found this article that leads me to believe that RoboHelp does not support mapping to a customized heading style from a CSS file. I realize that this is about the printed output, but it states the following:
    If custom heading styles aren’t named in the format Heading <number>, they are not treated as headings.
    If it doesn't work going from HTML to printed output, I'm guessing that there is not support going from Word to HTML. Am I wrong in this assumption? Do I just need to alter my style sheet to only use the standard <h> tags to fix this issue?

  • Word Wrap functionality in HTML Editor

    Hi,
    Is there any configuration to enable the word wrap functionality in the HTML Editor?
    Best Regards,
    Ajay

    Hi Priyaranjan,
    you need to use modules for that. The below link explains step by step a sample for word wrapping in an analysis item.
    http://www.sdn.sap.com/irj/scn/go/portal/prtroot/docs/library/uuid/a0f7d724-c0f0-2d10-828f-be249d59b5c3?QuickLink=index&…
    Regards
    Yasemin...

  • How can I include external javascript/css/html

    Hi there
    I want to create an analyser bar for when music plays in my Edge project. Can anyone let me know how to include an external file within my project? Is it a yepnope function?
    http://www.developphp.com/view.php?tid=1348
    Here is the link I am using as a tutorial for creating the analyser...

    You're welcome, George
    Say you have a symbol whose instance is named slider,
    and the symbol has an element named phrase (an empty
    text element used as a container for the html snippets
    you load in it) :
    var $phrase = sym.getSymbol('slider').$('phrase');
    var filename = buildFileName(); // a function defined elsewhere
    $phrase.load( 'text/' +filename +'.html');
    The HTML file contains a html snippet, that is to say what
    you would normally write inside <body> (no <header>).
    Gil

  • How to remove browser warning message in HTML 5 Cap 8 projects?

    How to remove browser warning message in HTML 5 Cap 8 projects?

    My conclusion after a morning struggling with this, is that there's no point in suppressing the message if your html5 content then goes on to misbehave in Firefox (like mine does).  I've chosen to keep the popup, but change the wording to refer Firefox users to a completely separate Flash version.  The wording of the popup is also in CPM.js.
    Before you all shout at me, no I can't use the multiscreen.html approach, because my Flash version will have to be non-rescaleable, because web objects don't rescale in Flash.

  • Importing Word Document into RH HTML

    I'm trying to import a Word document into RoboHelp HTML and I receive this message:
    "Import Session Failed 0x80004005. "
    What can I do to Import the document properly?

    This is a permissions issue. Click here for information on how to fix this.

  • How to make word files in mac

    How do I make Word files and presentations, or view such files, on a Mac?

    OpenOffice files can be opened in MS Word, and vice versa. It looks nearly the same as Office and Word, but it is FREE.
    Tons of people use it.
    http://www.openoffice.org/download/index.html

  • Querying database with word containing apostrophe

    hi all,
    Hopefully someone can help me with this. I am trying to query an SQL database with a word containing an apostrophe from within my .jsp file. How do I go about replacing the " ' " with " '' " for all queries made?
    any help would be great,
    thanks,
    KJ

    There are quite a few threads on this issue. Go to the Forums home page, search the JDBC forum on "escape characters" and read the results. Here's a sample:
    http://forum.java.sun.com/thread.jsp?forum=48&thread=147408
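
A common answer to this kind of question is to stop escaping by hand and let the database driver bind the value. The sketch below illustrates the idea with Python's built-in sqlite3 module instead of JSP/JDBC, purely because that keeps the example self-contained; the table people and the helpers find_by_name and escape_apostrophes are made up for the illustration. In JDBC the corresponding tool would be a PreparedStatement with a ? placeholder.

```python
import sqlite3


def find_by_name(conn: sqlite3.Connection, name: str) -> list:
    """Look up rows by name, letting the driver bind the value."""
    # The ? placeholder keeps the apostrophe out of the SQL text entirely.
    cur = conn.execute("SELECT name FROM people WHERE name = ?", (name,))
    return [row[0] for row in cur.fetchall()]


def escape_apostrophes(value: str) -> str:
    """Manual alternative: double each apostrophe for use in a SQL literal."""
    return value.replace("'", "''")


# Hypothetical in-memory table used only for this demonstration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE people (name TEXT)")
conn.executemany("INSERT INTO people VALUES (?)", [("O'Brien",), ("Smith",)])

print(find_by_name(conn, "O'Brien"))
print(escape_apostrophes("O'Brien"))
```

The bound version needs no replacement at all, while the manual version shows the ' to '' doubling asked about in the question.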
