Ignoring 'hidden text' during text searching
We are currently using the PDWordFinderAcquireWordList API in order to get / find words from some PDF documents. Inside some of the documents, there is sometimes some hidden text. This text is found and retrieved by the function.
But, we would like to be able to ignore this text because we do not want to expose it to the end user as this is invisible data.
We tried the PDWordFinderAcquireVisibleWordList API as it seems to be the good function that can filter the visible / invisible text. Unfortunatly, we did not succeeded.
Does anybody already used this API to filter visible text ?
Do you have any sample concerning the way to construct the PDOCContent parameter in order to filter the invisible text ?
Thanks for your help,
PT
I call 'hidden text', textual information that is not visible when you browse the PDF using Acrobat Reader, but that is extracted by the API mentioned.
But, when I use Acrobat Professional, and I start the 'Examine Document', the dialog box which appear says that Acrobat found 'Hidden text'. I can click on the right of this information on a link called 'Preview', and it list, for each pages, the strings that where found as 'Hidden' by Acrobat. That is the reason why I use this terminology.
I do not know to which 'concept' this refer in the specified documents.
Similar Messages
-
Is it possible to ignore noise words conditionally in working with Full text search containstable
I have a question on stoplist file. I need to search for exact phrase string("this is the incident") which contains noise words. As part of the FT search engine, during parsing it eliminates noise word and search on remaining string in
the given phrase.
let us say there are 10 rows which contains the term "incident" in the FT table . and 1 row which has the exact phrase.i.e."this is the incident".
if we use containstable() to search for "this is the incident", we are getting 10 rows instead of 1 row.
To resolve the issue, we have 3 solutions
1.either stoplist file needs to be modified to remove the words (this,is,the)
2. set stoplist = OFF.
3.empty stoplist.
Apart from the above solutions, is there any better solution with out touching noise words file list.
If any solution that provides flexibility to ignore noise words conditionally at one time and not to ignore them.
Please provide your suggestion.
kkprasadOne question that I ask is: Why would I want to exclude noise words?
Noise words were created to limit the size of the full text indexes and avoid processing the many 'this', 'is', and 'the' common words. But the disadvantage of doing so is that you cannot find some things as you would like.
My feeling is that computers are more powerful and have more storage and it is often better to just index everything. As long as your search does not include 'the', then the large number of 'the's in the system will pretty much be ignored.
NOTE: If you change the noise words, including SET STOPLIST = OFF, you have to rebuild the index in order for it to implement your decision.
Of course, for very, very large full text indexes you would need to test.
Is your full text search on relation database columns, e.g. Description NVARCHAR(1000) or are you searching Word, Excel, and other more complex data?
If your full text is relational columns, it might be that you could:
1. Select only the fulltextkey into a temp table (e.g. #FTSfulltextkey) from the full text index using noise words. That would give you 10 rows.
2. Then directly query the table to find the string as you define above. (But remember that punctuation and symbols are generally ignored by Full Text Indexing, but would still be there in the string of text.)
SELECT *
FROM MyTextTable T
JOIN #FTSfulltextkey K
ON T.fulltextkey = k.fulltextkey
WHERE T.Description like '%this is the incident%'
Full text search is powerful, but it has limits. And the behaviour changes depending on the Language of the search.
RLF -
OCR and hidden text in PDF scans of historic documents
I need to edit the hidden text behind a scanned PDF image of a document. The image must remain as an “exact” copy of the original scanned document.
I used Acrobat Pro (versions 7 and 9) to make PDF images of old typed documents from the 1940’s. When I open those images and run OCR in version 9, then examine the hidden (invisible) text layer behind the image, there are errors. For example, the word “book” has been picked-up by the OCR as the word “look.” I need to change the “l” to a “b” in order to make the PDF accurate when it is searched at a later date.
I have checked many user forums. Most people imply that hidden text can be viewed, but NOT edited in Acrobat Pro 7 and 9. (Hidden text can be viewed in Version 9 by selecting “Document” “Examine Document” and then clicking on the “+” symbol next to “Hidden Text,” then clicking “Show preview.”) Some say to use Adobe Capture 3.0 to edit hidden text. Others say to use Photoshop or Illustrator to edit hidden text (I think these folks may have been confused, because Photoshop and Illustrator would be used, logically, to edit the image ON TOP OF the hidden text). Yet another person seemed to say that a hidden text editor was added to Acrobat 8, but was taken away in Acrobat 9. (I can’t verify that because I don’t have version 8.)
The closest answer I was able to find involved using the Text Touch Up Tool on top of the image to edit hidden text behind it, but when you do that you are typing “blind.” In other words, you highlight a spot on the image (top layer) where you THINK the error MIGHT be, and you type the correction without being able to see what you are typing over. Then, you go back to the “Examine Document” procedure (described above) to see if you “hit” your mark, and if not, you redo it until you do “hit” your mark. With the number of documents and corrections that we have, that procedure would be too labor intensive and thus a budget breaker.
If we have to buy more software, my preference would be to buy a genuine Adobe product because I have experienced problems in the past switching back and forth between Adobe products and other PDF manipulation software.
Can anyone answer any of these questions:
(1) Is there a way in Acrobat versions 7, 8 or 9 to edit hidden text, and if so, how?
(2) What Adobe software (other than Acrobat) will edit hidden text behind a PDF image?
(3) Assuming no Adobe product will edit hidden text behind a PDF image, is there any non-Adobe products that will do that?
Thank you!Hi,
Unless you use Acrobat 8 Pro's Formatted Text & Graphics" or Acrobat 9 Pro's ClearScan you will find that there is no
practicable means of editing the OCR "hidden text" in a PDF.
The TouchUp text tool (Advanced Editing toolbar) is reliant upon the selected text having an available system font to use during touchup. However, both Searchable Image and Searchable Image (Exact) OCR output is of text rendering mode 3 (invisible text) that is provided from within Acrobat and not any installed system or other application installed font.
With Searchable Image (Exact) you have the untouched image augmented by the invisible text which is provided as a user aid for search or find with Adobe Reader or Acrobat. The invisible text is not intended to support word processor like editing.
To your questions:
#1. There is no practicable way to edit invisible text (text rendering mode 3) with Acrobat (any past or current release).
#2. None.
#3. A good question. Perhaps a specialty program. Keep in mind, many products provide a promise but those those that actually deliver tend to be expensive.
Something to play with. Using Acrobat 9 Pro or Pro Extended, try the Preflight Fixup to embed hidden text.
Then try using the TouchUp Text tool. You may also want to see if you can change the font type of this newly embedded font.
(use copies of the "real" files - just in case <g>).
Be well... -
I
I made this file in indesign, the highlited empty spaces indicates that their is a hidden text and it pop up when searching for some words in pdf file. so how can I save pdf file to keep only the seen text ???Dear lrosenth,
I went through some codes/suggestions in internet and I found that I need to have cmap file and cid font file for the respective font since pdf doesn't support unicode fonts directly.
Can you help me to know where can I get cmap file and cid font file for tamil language font Latha(TrueType) microsoft font.
Regards,
Safiq -
How can I correct "hidden" text in a searchable PDF file?
This seems like a simple question. However, the answers are invariably complex, do not yield the desired result, and often answer a different question entirely. I say all that just to warn people up front that the "problem" is easier than how many people and PDF application developers, including Adobe, typically understand it while the proposed "solutions" are invariably a total...well, botch is a reasonable word if a bit understated.
Here is the actual problem:
I have "searchable" PDF files created by scanning documents and running them through an OCR process. I create "searchable" PDF files in order to archive, index, and eventually enable searching for the documents scanned. A "searchable" PDF satisfies those criteria better than any other commonly used, "portable" archive format -- though I would be happy if someone could point out an obvious alternative I may have overlooked. I do not need perfect OCR results. If I need a document to edit or perhaps feed into a spreadsheet or database, I expect to be able to reprocess the page images in a given "searchable" PDF file to OCR and convert the contents to Word, RTF, Excel, or another file format as necessary with more care for the results than for the archived document itself. Therefore, the "searchable" PDF document is the scanned page images which compose it while the OCR generated "searchable" text is secondary, but still important. Therefore, each file must contain scanned page images of sufficient detail to be efficiently converted by OCR if possible and legible enough for whoever views the images to be able to work out what an OCR process may fail to understand. Once scanned, those pages are the "document" and therefore "immutable." However, OCR is imperfect. For a searchable document archive, it does not have to be, but some errors are significant in that they may prevent the document from being found by a search. Therefore, there must be a way to view and, if necessary, edit the "hidden" text in a "searchable" PDF without altering the visual display of a document or how it is printed. No strike-throughs. No visible "corrections." None of the stuff PDF editors want to insert into a PDF file when editing it. I do not want to edit the document without exporting it to a format appropriate for an editable document. I just want adequately "correct" hidden text in a "searchable" PDF file.
I apologize for the length and redundancy in my description of the problem. However, past attempts to explain my problem and objectives as well as what I have seen in reply to similar queries across the Internet indicate that most people trying to answer this question come at it from the same point of view shared by most, if not all, PDF tool or application vendors. They seem to think that any desire to edit a PDF file is a desire to have a PDF word processor of some sort. Or, they assume that the OCR process employed may need tweaking of the means by which people apply it and then a process like "find suspects" is adequate to deal with any errors. But no, those are not what I am trying to accomplish and answers which address those topics do not answer this question.
In short, which tool or application from any vendor will reveal the "searchable" hidden text in a PDF produced by any OCR or other process and then enable corrections to the hidden text without changing any document display parameters at all? Note, hidden text typically includes bounding box information denoting the portion of the image from which the text was recognized. That information must not be lost or changed when editing the "searchable" text.
So, any tools or applications capable of doing this? If Adobe Acrobat XI Pro can (use of a trial copy demonstrated that the hidden text content can be reviewed, but editing did not work by any straight-forward means I could work out while trying out the application), fine. However, $500.00 list or even a $200.00 possible upgrade from a copy of Adobe Acrobat X Standard which came with my scanner is a lot of money for personal use when review and edit of the OCR generated hidden text in a "searchable" PDF file is the only function I require. Therefore, other suggested tools or applications which do what I need for less would be greatly appreciated.My "claim"? Actually I've made no "claim" such as you've mentioned.
Simply stated your OP has foundational premises that presume as factual what is not.
Here, we're in Adobe's hosted user forum for Acrobat.
Any other application use is not material.
Acrobat XI provides 3 OCR methods.
Searchable Image, Searchable Image (Exact) & ClearScan.
Only the first two provide the "hidden" text output.
(Glyphs have no stroke, no fill)
From back to the Acrobat 3 product family the design functionality of Searchable Image and Searchable Image (Exact) has been to facilitate the use of Find / Search.
The "hidden" text is can be touched up. Acrobat Pro provides the facility to view the hidden text.
So you can see what the OCR output that correlates to the bit-map images of the characters that are present.
With Acrobat XI Pro use Tools - Protection -Remove Hidden Information
In the Remove Hidden Information pane select "Hidden text" then "Show preview".
The default for the preview is "Show Only Hidden Text".
Back in the PDF --
You'd select some of the hidden text and retype what you suspect is the correct string of characters.
Save and return to the preview of the hidden text.
If you got it right, good. Continue.
If not, darn - try again.
Plug 'n chug -- somewhere over the rainbow it'll be done eh.
Full disclosure -- this is something I've done (enquiring minds don't you know).
I've found it to be a rather Sisypean undertaking.
So, "doable" but not practicable.
This is to be expected because such touchups are not the concern / focus of the output from Searchable Image or Searchable Image (Exact) - (the names tell it all).
To have touchup "editablity" of an OCR output using Acrobat make use of ClearScan.
ClearScan replaces recognized character bit-maps with a character from an Acrobat internal font.
The character strings can be selected to change to a generic, system available font.
Something that is good to know when embarking on the "tweak the PDF" journey is that PDF (the file format / technology as defined by its ISO Standard, ISO 32000-1) does not tolerate "editing". PDF is decidely not a word processor file format and "editing" can quickly render a PDF unusable.
Minor touchups can be made and your best "tool" for this is still Acrobat Pro. (Save As often and periodically "bank" the PDF via some file rename scheme.)
Be well... -
How to print hidden text in answer box
How can you print the hidden text in answer boxes on PDF forms?
I am not sure I understand your question. Are you importing a PDF with Hidden Fields into FormsCentral?
If this is the case FormsCentral ignores hidden fields and doesn't create column in the response table for those.
Gen -
Text search stopped working in Adobe Reader X
We create a weekly report that is posted in a shared network directory for multiple user access. These pdf files can be accessed by Adobe reader or full Adobe as needed. The reports are created in the same manner each week by printing an electronic file to the Adobe PDF printer. The reports generated after the 10th of January are no longer searchable. Any report before that date can be opened and when a search is performed the occurrances are highlighted as needed and can be jumped to by clicking on the search list created. Any report created after January 10, yields no results. This despite the visible presence of the search word or words in the visible document. The search mechanism is looking at the file and all of its pages but producing no results. I have looked at the document properties and can find no difference in their properties or behavior. Anyone have any ideas as to what might contribute to this change? As these are public record reports, I can send copies for analysis if needed. Thanks for any help.
Ok,
First -In context of your post above this one it might be a good thing to slick those files sooner rather than later.
While present in the account they are available to the "world" through the links.
The two pre-Jan 10 PDFs have content mastered in a word processing application.
I'd say it was MS Word as the PDFs were produced via printing with Adobe Distiller 9 (installed by Acrobat 9 Standard or Pro).
A File > Print > Adobe Printer selected.
The Distiller job option used had PDF version 1.5 (associated with release of Acrobat 6) selected; so, that's the version of the two PDFs.
These PDFs both have renderable text rather than hidden text associated with OCR output.
Both PDFs, as outputs of Adobe's 'Acrobat' product are ISO 32000-1 (ISO's PDF standard) compliant.
Consequently, Acrobat's Catalog index / Search feature encounters a "well-formed" (Standards compliant) PDF.
The post-Jan10 PDFs were also sourced from a word processing application (again, I'd say MS Word).
However, the free PrimoPDF application developed the PDF output.
The PDFs contain renderable and hidden text. It is as if the initial output PDF was and image and that was sent through an OCR process that replaced recognized characters with a renderable character and left unrecognized characters as hidden text.
Consequence: What Catalog index / Search / Find can 'grab' is not the same as what you 'see'.
In sum, post Jan 10 the way the PDFs were produced changed. That is the crux of it.
Be well... -
Document Management / Full Text Search in Projects & SolMan Directory
Hello experts!
I want to implement Full Text Search in my SolMan and therefore need your help.
I have installed Trex 7.10 and am already able to search within my Solution Database (Problems/Solutions).
Now I want to be able to search within all documents in my SolMan Installation - e.g. Roadmaps, SAP Scenario Descriptions or my own uploaded documents.
In the IMG I already found the section Document Management but I cannot get the needed RFCs (IMSDEFAULT and IMSDEFAULT_REG) working.
There is already a RFC called IMSDEFAULT, which was created during the installation of Trex and runs the
Registered Server-Program: Trex_SOT_20080327090311
I tried to configure my RFCs according to the IMG but, as I said, cannot manage to get them working.
I hope anybody has an idea or an guide on how to implement Full Text Search!
Please be so kind and provide help - thanks in advance!
Best regards
GeroldGerold,
I am running into the same issues because we want to do full text search on project documentation within Solution Manager. The default RFC connections IMSDEFAULT_REG and IMSDEFAULT do not work with the standard settings.
We created one index for IWBSOLAR so maybe I also need to check IWBPROJ in SKPR06 for project documentation and create a new index for this one ?
The indexing for IWBSOLAR now fails although there seem to be over 6000 documents marked but none are put in the index. After indexing there is a list of the documents all with code 6401 but cannot find any where what this code means. It seems to be succesful as all entries are green.
Any help/suggestions on this topice would be greatly appreciated. -
Exception thrown while doing a full text search in JavaHelp
hi,
I am using JavaHelp 2.0_02 in my web application. The Table of contents (TOC ) and the[b] Index work absolutely fine. But when I try to do a text search i get the following exception ..
Exception while creating engine named com.sun.java.help.search.DefaultSearchEngine
for view: javax.help.SearchView@1e20a9a
.. java.lang.reflect.InvocationTargetException
.. at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
.. at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
.. at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
.. at java.lang.reflect.Constructor.newInstance(Constructor.java:274)
.. at javax.help.search.MergingSearchEngine.makeEngine(MergingSearchEngine.java:169)
.. at javax.help.search.MergingSearchEngine.<init>(MergingSearchEngine.java:63)
.. at javax.help.tagext.SearchTOCItemTag.doStartTag(SearchTOCItemTag.java:77)
.. at org.apache.jsp.javax_help_SearchView_jsp._jspService(javax_help_SearchView_jsp.java:101)
.. at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:137)
.. at javax.servlet.http.HttpServlet.service(HttpServlet.java:853)
at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:201)
at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:289)
at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:455)
Caused by: java.security.InvalidParameterException
at com.sun.java.help.search.DefaultSearchEngine.<init>
DefaultSearchEngine.java:75)
... 28 more
My Helpset(.hs) file is located in the Help folder of my application and the entry for search is as follows
<view>
<name>Search</name>
<label>Search</label>
<type>javax.help.SearchView</type>
<data engine="com.sun.java.help.search.DefaultSearchEngine">JavaHelpSearch</data>
</view>
</helpset>and the jh.jar file is located in the applications classpath.
and I have successfully done the indexing using jhindexer
I searched through many sites including the java forum.. but not able to find a solution.
Can anyone help me and point out where I could have made a mistake or how to go about this
thanks in advanceI had approximate the same problem, and I've fixed it. In my jar file the javaHelpSearch Folder was named with a leading lower case "j". But in my helpset.hs it was with an upper Case "J"
<name>Search</name>
<label>Search</label>
<type>javax.help.SearchView</type>
<data engine="com.sun.java.help.search.DefaultSearchEngine">JavaHelpSearch</data>
</view>When the helpset is called from a folder: windows ignore it, but when the helpset is called from into a jar you must type it correct! -
Oracle Text Search Enabling, Content Server does not start
We are trying to enable Oracle Text Search in UCM 11g that uses 11g Database. We followed instructions to add parameter 'SearchIndexEngine=OracleTextSearach' to config.cfg and restart content server. Content server doesn't come back up with the following error showing up:
Are there any extra steps to follow to enable Search Indexer Engine?
<Apr 27, 2015 4:50:29 PM PDT> <Error> <oracle.ucm.idccs> <UCM-CS-050021> <Oracle WebCenter Content failed to initialize inside its servlet environment.>
<Apr 27, 2015 4:50:29 PM PDT> <Error> <oracle.ucm.idccs> <UCM-CS-000001> <general exception
intradoc.common.ServiceException:
at intradoc.server.IdcServerManager.init(IdcServerManager.java:259)
at intradoc.idcwls.IdcServletRequestUtils.initializeServer(IdcServletRequestUtils.java:627)
at intradoc.idcwls.IdcServletRequestUtils.initializeServer(IdcServletRequestUtils.java:457)
at intradoc.idcwls.IdcIntegrateWrapper.initializeServer(IdcIntegrateWrapper.java:104)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at idcservlet.common.IdcMethodHolder.invokeMethod(IdcMethodHolder.java:88)
at idcservlet.common.ClassHelperUtils.executeMethodEx(ClassHelperUtils.java:305)
at idcservlet.common.ClassHelperUtils.executeMethodWithArgs(ClassHelperUtils.java:278)
at idcservlet.ServletUtils.initializeContentServer(ServletUtils.java:1259)
at idcservlet.ServletUtils.startAndConfigureServer(ServletUtils.java:531)
at idcservlet.ServletUtils.initializeAllServers(ServletUtils.java:460)
at idcservlet.IdcFilter.initContentServer(IdcFilter.java:181)
at idcservlet.IdcFilter.init(IdcFilter.java:156)
at weblogic.servlet.internal.FilterManager$FilterInitAction.run(FilterManager.java:343)
at weblogic.security.acl.internal.AuthenticatedSubject.doAs(AuthenticatedSubject.java:321)
at weblogic.security.service.SecurityManager.runAs(SecurityManager.java:120)
at weblogic.servlet.internal.FilterManager.loadFilter(FilterManager.java:96)
at weblogic.servlet.internal.FilterManager.preloadFilters(FilterManager.java:57)
at weblogic.servlet.internal.WebAppServletContext.preloadResources(WebAppServletContext.java:1877)
at weblogic.servlet.internal.WebAppServletContext.start(WebAppServletContext.java:3174)
at weblogic.servlet.internal.WebAppModule.startContexts(WebAppModule.java:1527)
at weblogic.servlet.internal.WebAppModule.start(WebAppModule.java:489)
at weblogic.application.internal.flow.ModuleStateDriver$3.next(ModuleStateDriver.java:427)
at weblogic.application.utils.StateMachineDriver.nextState(StateMachineDriver.java:52)
at weblogic.application.internal.flow.ModuleStateDriver.start(ModuleStateDriver.java:119)
at weblogic.application.internal.flow.ScopedModuleDriver.start(ScopedModuleDriver.java:201)
at weblogic.application.internal.flow.ModuleListenerInvoker.start(ModuleListenerInvoker.java:249)
at weblogic.application.internal.flow.ModuleStateDriver$3.next(ModuleStateDriver.java:427)
at weblogic.application.utils.StateMachineDriver.nextState(StateMachineDriver.java:52)
at weblogic.application.internal.flow.ModuleStateDriver.start(ModuleStateDriver.java:119)
at weblogic.application.internal.flow.StartModulesFlow.activate(StartModulesFlow.java:28)
at weblogic.application.internal.BaseDeployment$2.next(BaseDeployment.java:672)
at weblogic.application.utils.StateMachineDriver.nextState(StateMachineDriver.java:52)
at weblogic.application.internal.BaseDeployment.activate(BaseDeployment.java:212)
at weblogic.application.internal.EarDeployment.activate(EarDeployment.java:59)
at weblogic.application.internal.DeploymentStateChecker.activate(DeploymentStateChecker.java:161)
at weblogic.deploy.internal.targetserver.AppContainerInvoker.activate(AppContainerInvoker.java:80)
at weblogic.deploy.internal.targetserver.BasicDeployment.activate(BasicDeployment.java:187)
at weblogic.deploy.internal.targetserver.BasicDeployment.activateFromServerLifecycle(BasicDeployment.java:379)
at weblogic.management.deploy.internal.DeploymentAdapter$1.doActivate(DeploymentAdapter.java:52)
at weblogic.management.deploy.internal.DeploymentAdapter.activate(DeploymentAdapter.java:200)
at weblogic.management.deploy.internal.AppTransition$2.transitionApp(AppTransition.java:31)
at weblogic.management.deploy.internal.ConfiguredDeployments.transitionApps(ConfiguredDeployments.java:240)
at weblogic.management.deploy.internal.ConfiguredDeployments.activate(ConfiguredDeployments.java:170)
at weblogic.management.deploy.internal.ConfiguredDeployments.deploy(ConfiguredDeployments.java:124)
at weblogic.management.deploy.internal.DeploymentServerService.resume(DeploymentServerService.java:181)
at weblogic.management.deploy.internal.DeploymentServerService.start(DeploymentServerService.java:97)
at weblogic.t3.srvr.SubsystemRequest.run(SubsystemRequest.java:64)
at weblogic.work.ExecuteThread.execute(ExecuteThread.java:256)
at weblogic.work.ExecuteThread.run(ExecuteThread.java:221)
Caused By: intradoc.data.DataException: !csSearchIndexerEngineNotExist,DATABASE.ORACLETEXTSEARCH
at intradoc.shared.CommonSearchConfig.createDataException(CommonSearchConfig.java:2580)
at intradoc.shared.CommonSearchConfig.setCurrentConfig(CommonSearchConfig.java:1041)
at intradoc.shared.CommonSearchConfig.init(CommonSearchConfig.java:178)
at intradoc.server.SearchIndexerUtils.initSearchIndexerConfig(SearchIndexerUtils.java:64)
at intradoc.server.IdcSystemLoader.initSearchIndexerConfig(IdcSystemLoader.java:1338)
at intradoc.server.IdcSystemLoader.loadCaches(IdcSystemLoader.java:1255)
at intradoc.server.IdcServerManager.init(IdcServerManager.java:141)
at intradoc.idcwls.IdcServletRequestUtils.initializeServer(IdcServletRequestUtils.java:627)
at intradoc.idcwls.IdcServletRequestUtils.initializeServer(IdcServletRequestUtils.java:457)
at intradoc.idcwls.IdcIntegrateWrapper.initializeServer(IdcIntegrateWrapper.java:104)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at idcservlet.common.IdcMethodHolder.invokeMethod(IdcMethodHolder.java:88)
at idcservlet.common.ClassHelperUtils.executeMethodEx(ClassHelperUtils.java:305)
at idcservlet.common.ClassHelperUtils.executeMethodWithArgs(ClassHelperUtils.java:278)
at idcservlet.ServletUtils.initializeContentServer(ServletUtils.java:1259)
at idcservlet.ServletUtils.startAndConfigureServer(ServletUtils.java:531)
at idcservlet.ServletUtils.initializeAllServers(ServletUtils.java:460)
at idcservlet.IdcFilter.initContentServer(IdcFilter.java:181)
at idcservlet.IdcFilter.init(IdcFilter.java:156)
at weblogic.servlet.internal.FilterManager$FilterInitAction.run(FilterManager.java:343)
at weblogic.security.acl.internal.AuthenticatedSubject.doAs(AuthenticatedSubject.java:321)
at weblogic.security.service.SecurityManager.runAs(SecurityManager.java:120)
at weblogic.servlet.internal.FilterManager.loadFilter(FilterManager.java:96)
at weblogic.servlet.internal.FilterManager.preloadFilters(FilterManager.java:57)
at weblogic.servlet.internal.WebAppServletContext.preloadResources(WebAppServletContext.java:1877)
at weblogic.servlet.internal.WebAppServletContext.start(WebAppServletContext.java:3174)
at weblogic.servlet.internal.WebAppModule.startContexts(WebAppModule.java:1527)
at weblogic.servlet.internal.WebAppModule.start(WebAppModule.java:489)
at weblogic.application.internal.flow.ModuleStateDriver$3.next(ModuleStateDriver.java:427)
at weblogic.application.utils.StateMachineDriver.nextState(StateMachineDriver.java:52)
at weblogic.application.internal.flow.ModuleStateDriver.start(ModuleStateDriver.java:119)
at weblogic.application.internal.flow.ScopedModuleDriver.start(ScopedModuleDriver.java:201)
at weblogic.application.internal.flow.ModuleListenerInvoker.start(ModuleListenerInvoker.java:249)
at weblogic.application.internal.flow.ModuleStateDriver$3.next(ModuleStateDriver.java:427)
at weblogic.application.utils.StateMachineDriver.nextState(StateMachineDriver.java:52)
at weblogic.application.internal.flow.ModuleStateDriver.start(ModuleStateDriver.java:119)
at weblogic.application.internal.flow.StartModulesFlow.activate(StartModulesFlow.java:28)
at weblogic.application.internal.BaseDeployment$2.next(BaseDeployment.java:672)
at weblogic.application.utils.StateMachineDriver.nextState(StateMachineDriver.java:52)
at weblogic.application.internal.BaseDeployment.activate(BaseDeployment.java:212)
at weblogic.application.internal.EarDeployment.activate(EarDeployment.java:59)
at weblogic.application.internal.DeploymentStateChecker.activate(DeploymentStateChecker.java:161)
at weblogic.deploy.internal.targetserver.AppContainerInvoker.activate(AppContainerInvoker.java:80)
at weblogic.deploy.internal.targetserver.BasicDeployment.activate(BasicDeployment.java:187)
at weblogic.deploy.internal.targetserver.BasicDeployment.activateFromServerLifecycle(BasicDeployment.java:379)
at weblogic.management.deploy.internal.DeploymentAdapter$1.doActivate(DeploymentAdapter.java:52)
at weblogic.management.deploy.internal.DeploymentAdapter.activate(DeploymentAdapter.java:200)
at weblogic.management.deploy.internal.AppTransition$2.transitionApp(AppTransition.java:31)
at weblogic.management.deploy.internal.ConfiguredDeployments.transitionApps(ConfiguredDeployments.java:240)
at weblogic.management.deploy.internal.ConfiguredDeployments.activate(ConfiguredDeployments.java:170)
at weblogic.management.deploy.internal.ConfiguredDeployments.deploy(ConfiguredDeployments.java:124)
at weblogic.management.deploy.internal.DeploymentServerService.resume(DeploymentServerService.java:181)
at weblogic.management.deploy.internal.DeploymentServerService.start(DeploymentServerService.java:97)
at weblogic.t3.srvr.SubsystemRequest.run(SubsystemRequest.java:64)
at weblogic.work.ExecuteThread.execute(ExecuteThread.java:256)
at weblogic.work.ExecuteThread.run(ExecuteThread.java:221)
>
<Apr 27, 2015 4:50:29 PM PDT> <Error> <ServletContext-> <BEA-000000> <There is an initialization failure during start up of UCM.>
<Apr 27, 2015 4:50:29 PM PDT> <Error> <ServletContext-> <BEA-000000> <Could not start server of type 'server' at default relative web root URL 'cs'.
javax.servlet.ServletException: Could not start up IDC server deployment.
at idcservlet.ServletUtils.initializeContentServer(ServletUtils.java:1268)
at idcservlet.ServletUtils.startAndConfigureServer(ServletUtils.java:531)
at idcservlet.ServletUtils.initializeAllServers(ServletUtils.java:460)
at idcservlet.IdcFilter.initContentServer(IdcFilter.java:181)
at idcservlet.IdcFilter.init(IdcFilter.java:156)
Truncated. see log file for complete stacktrace
Caused By: java.io.IOException: Oracle WebCenter Content failed to initialize inside its servlet environment.
at intradoc.idcwls.IdcIntegrateWrapper.initializeServer(IdcIntegrateWrapper.java:139)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
Truncated. see log file for complete stacktrace
Caused By: intradoc.common.ServiceException:
at intradoc.server.IdcServerManager.init(IdcServerManager.java:259)
at intradoc.idcwls.IdcServletRequestUtils.initializeServer(IdcServletRequestUtils.java:627)
at intradoc.idcwls.IdcServletRequestUtils.initializeServer(IdcServletRequestUtils.java:457)
at intradoc.idcwls.IdcIntegrateWrapper.initializeServer(IdcIntegrateWrapper.java:104)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
Truncated. see log file for complete stacktrace
Caused By: intradoc.data.DataException: !csSearchIndexerEngineNotExist,DATABASE.ORACLETEXTSEARCH
at intradoc.shared.CommonSearchConfig.createDataException(CommonSearchConfig.java:2580)
at intradoc.shared.CommonSearchConfig.setCurrentConfig(CommonSearchConfig.java:1041)
at intradoc.shared.CommonSearchConfig.init(CommonSearchConfig.java:178)
at intradoc.server.SearchIndexerUtils.initSearchIndexerConfig(SearchIndexerUtils.java:64)
at intradoc.server.IdcSystemLoader.initSearchIndexerConfig(IdcSystemLoader.java:1338)
Truncated. see log file for complete stacktraceCan you let us know the config.cfg entry in your Environment for SearchIndexerEngineName.
Is it SearchIndexerEngineName= DATABASE.ORACLETEXTSEARCH ? It should be as below
SearchIndexerEngineName=ORACLETEXTSEARCH -
Using Punctuations in Full Text Searches
Hi, I'm having a bit of trouble implementing full text search effectively in regards to punctuations.
In my thesaurus file I have something along the lines of
<expansion>
<sub>intl</sub>
<sub>int'l</sub>
<sub>international</sub>
</expansion>
and when I run the query
select * from sys.dm_fts_parser('Formsof(freetext,"international")',1033,null,0)
it show's that it is included
international's
internationals
internationals'
international
intl
int'l
However when I do end up running the query using Contains, it ignores everything that contains int'l and only returns the results of the other matches.I am moving it to Search.
Kalman Toth Database & OLAP Architect
SQL Server 2014 Design & Programming
New Book / Kindle: Exam 70-461 Bootcamp: Querying Microsoft SQL Server 2012 -
Hidden Text / Conditional Formatting
How do I hide text in a pages document? I use pages to prepare exams for teaching and in MSWord I used to have the answers to the questions included in the document as hidden text. How do I do this in Pages? Is this possible at all? Setting the text color to white is often not feasible because this would result in large white areas (and I would have a hard time to find my text again)
Maybe the "Change Tracking" feature would be a soluce.
If it doesn't, Move back to my proposal.
Save the document with the responses greyed.
open it search all item in the grey style, call my cut_sparse script
print what is the students version.
Back to the complete one, disable the grey style
print what is the teacher version.
Save the script as an Application or an Application Bundle: cut_sparse.app
Move the newly created application into the folder:
<startup Volume>:Users:<yourAccount>:Library:Scripts:Applications:Pages:
Go to a Pages document
select some text, contiguous or sparse
menu Scripts > cut_sparse
The script will put in the clipboard the selected text inserting a return between different chunks.
--(SCRIPT cut_sparse.app]
tell application "Pages"
activate
tell document 1
set sel to (get selection)
set k to count of sel
repeat with i from k to 1 by -1
select (item i of sel)
(* I didn't found an other way to remove the selected text *)
tell application "System Events" to tell application process "pages" to keystroke "x" using {command down}
end repeat
end tell
end tell
--[/SCRIPT]
Yvan KOENIG (from FRANCE jeudi 28 février 2008 17:24:40) -
Hello. I am having problems with hidden text in docx files. If I have document with Heading 2 formatting and color text to blac I can't see it nor on iPhone 4 nor on iPad/iPad 2. Only blank space is shown where text is located. If I select area where text is and copy it into notes text appears. I can send example of document if needed. Anyone has solution?
Thank you.Hi,
I have just installed SqlExpress 2008 r2. I created database, some tables and when i try to write any query in new query editor it does not show my text but the text really exists. I've spend 2 days for searching the issue online but get no help.
thanks
Not so sure why but it may be the case that text font color is set to white. Please press Ctrl + A and see if the queries are highlighted.
Please post screenshot if possible.
Cheers,
Vaibhav Chaudhari
MCP, MCTS, MCSA (SQL Server 2012) -
Disable Full Text Search in Records Management
Hi Everyone,
This is Shiva, we implemented SAP Records management & i am came in middle of this project. Right now client want to do some customizations in Records.
Actually i need to disable Full Text Search under Records. Under Cross Query i disabled the Full text by the following process
(Two sample element types are delivered with the service provider
SRM_SP_QUERY:
SRM_SPS_QUERY
Search for records, documents, cases, record models and file plans
SRM_SPS_QUERY_REC_DOC)
Under(SPRO) Cross SP search based on Property Unification; you can able to disable the FullText search field(this will be effects only for Cross Query), but under Records, when you right click there will be one option called FIND in which i am unable to disable the FULL TEXT search.
I used DMWB T-code under LOIO, i search for Standard/free/instance attributes, but i didn't find any Full Text Search Field in that one.
Can anyone please help me in this. I really appreciate your inputs & help.
Thanks,
Shiva.
Edited by: Shivasap on Aug 2, 2010 6:01 PM
Edited by: Shivasap on Aug 3, 2010 4:01 PMDear Shiva,
Full text search needs to be deactivated at content model level for attributes. You will find a document class in the element type of the record. In transaction DMWB, you will find LOIO & PHIO classes for the document class. For each attribute, you will find a checkbox "index-relevant" which needs to be unticked if you dont want to use TREX. For deactivating full-text search for which a field is given in the FIND popup, TREX needs to be deactivated. There is no option to deactivate just for a record element type in RM. Unfortunately, the field itself cannot be hidden. This is programmed in the logic.
Best Regards,
Pragya -
Is it possible to place hidden text "behind" or within a jpg image?
I am using RH 9 to write and edit a user help manual. All the help topics must be searchable from within the Help contents in the application.
One such topic is Keyboard Shortcuts. The problem is that this topic is a JPG image, so it's not possible to search from it within the Help Contents and Index.
From my work with MS Word, I know how to place hidden text (transparent) in a textbox behind an image, which would enable a search. Basically, I'd type "Keyboard Shortcut" into the textbox, paste it behind the JPG image, so that when a user types that term in the Search field, they are directed to the location of the JPG. Can I accomplish this, or the equivalent, in RH9?
The only other option I can think of is to type the list of shortcuts out, but that's inefficient and time consuming.
If anyone has any other suggestions for making the JPG (or at least the term "Keyboard Shortcuts") searchable, I'd appreciate it.
I know MS OneNote allows users to search for text within images.
Thanks!Hi there
Look at the properties for the topic in question. Specifically, the tab labeled General. On that tab is a field where you are able to add Keywords for just such a purpose.
Cheers... Rick
Maybe you are looking for
-
I am trying to stream via ESPNWatch but I get a message saying I have a popup blocker that does not allow the streaming to happen. I unchecked the pop up blocker box in the Safari preferences to no avail. Any ideas?
-
.swf being able to load contents into it issue. Please help?
Hi there, I have strange situation regarding flash that I can't figure out. I have a flash .swf file that is intended to load .flv video files into them at the local root level. I have been able to do this on my own server, but the .swf and the .flv
-
How can you sort by number of dowloads or rating in iTunes app store?
Is there a way to get iTunes - including the app store to give a view that you can sort by ration or number of downloads. I assume they would have this since they are all about ease-of use. Thanks
-
Hi, I just want to see which table or t.code gives me the details of Stock(Good receipt) and it's value posted in a partciular month against a Material. We have Material Ledger active and Price difference/exchange rate difference are also taking plac
-
Use of 'Free Chars' of a Query in the APD
Hello I have a task of building an APD on a query. My question is: Do I need to have all the chars in the 'ROWS' section and all the KFs available in the 'COLUMNS' section (none hidden) before I start building the APD on top of it. Or can I still use