Need help in text search
We have many folders that contain a lot of PDF documents. We need to write a script that searches all the PDF documents for a particular keyword and prints the path and filename of each document that contains it.
That's it. I am new to Adobe and do not know whether Adobe has an API for this, so I am looking for some sample programs.
Thanks in Advance
Chari
You can use a batch process (http://livedocs.adobe.com/acrobat_sdk/9.1/Acrobat9_1_HTMLHelp/BatchSeq_BatchSequences.96.1.html) to scan through a collection of PDFs. You can also search for words in a PDF using the getPageNthWord JavaScript method: http://livedocs.adobe.com/acrobat_sdk/9.1/Acrobat9_1_HTMLHelp/JS_API_AcroJS.88.486.html
So the batch process would be a JavaScript that loops through the pages of a document, loops through the words on each page, and if a match is found, it could write the file path (http://livedocs.adobe.com/acrobat_sdk/9.1/Acrobat9_1_HTMLHelp/JS_API_AcroJS.88.411.html) to the JavaScript console.
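The loop described above is not specific to Acrobat. As a rough illustration only, here is the same logic in Python, run outside Acrobat. `extract_words` is a hypothetical callable you would have to supply yourself (for example, a wrapper around whatever PDF text-extraction library you use); it is not an Adobe API.

```python
import os


def find_matching_files(root, keyword, extract_words):
    """Yield the path of every PDF under root that contains keyword.

    extract_words is a caller-supplied (hypothetical) callable: it takes a
    file path and returns an iterable of pages, each an iterable of words.
    """
    needle = keyword.lower()
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            if not name.lower().endswith(".pdf"):
                continue  # only PDF files are of interest
            path = os.path.join(dirpath, name)
            # Loop over the pages, then over the words on each page,
            # and stop at the first case-insensitive match.
            if any(word.lower() == needle
                   for page in extract_words(path)
                   for word in page):
                yield path


# Usage, assuming my_extractor is your own word-extraction function:
# for path in find_matching_files("/data/pdfs", "invoice", my_extractor):
#     print(path)
```

Passing the extractor in as an argument keeps the folder walk and the matching independent of any particular PDF library.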
Similar Messages
-
Need help doing text search of Blob
I presently have a simple single-table candidate tracking application that creates records of candidate information; you attach each resume in Word format to a BLOB column named RESUME in the table.
I need to be able to do a full text boolean search of the attached documents, since we're going to use this to allow us to search an internal database for people with particular skill sets.
I'm not a DBA or a developer, I just started working with Oracle Application Express and need a simple way of creating this search feature.
I also need to add the search feature into a region for users to input search terms.
Any help would be greatly appreciated.
Edited by: user10608055 on Nov 25, 2008 11:57 AM
Try
http://tahiti.oracle.com/
In the list of books, look under 'TEX' for Oracle Text.
Interesting you should bring this up. I'm slightly involved with the US Transition team.
Many resumes are coming in.
Tim -
Need help in building search query
Guys ..
Problem Description:
I have a huge table that is indexed using CONTEXT.
I want to write a search query that considers the following:
1. number of keywords match
2. takes care of spelling mistakes, synonyms and acronyms
3. proximity - the keywords should not be too far from each other.
e.g. I have this phrase: "Horizontal Stabilizer Trim Brake"
I was thinking of writing a query like:
SELECT SCORE(1) SCORE,
TEXT text
FROM MY_TABLE
WHERE CONTAINS(TEXT, '(Horz | Horizontal) ACCUM (Stab | Stabilier) ACCUM Trim ACCUM (Brk | Break)', 1) >= 0
ORDER BY SCORE DESC
The results don't look satisfactory. I have not used the NEAR operator because I don't know how to use it.
Please help me, as I am very new to Oracle Text.
-G
Well, I'm not going to write the function for you, but we can at least talk through a general strategy.
A lot depends on how you help your users on the front end -- for example, if they're searching a technical document, you may want to return results that aren't perfect matches, but you do want to make sure the user picks 'mandatory' and 'useful' keywords in a way that lets you figure out which ones are really important. On the other hand, if you're Google and have to handle queries like 'horizontal stabilizer trim brake' and 'were Pete and Jenny in the break room', then you run the risk of spending too much time looking for interesting words, almost doing a full-text search on the query to derive its meaning.
So I'm going to presume that you have some control over what/how the users generate their searches so that finding keywords isn't the issue.
The plan will be to parse the query a bit to find the interesting words, clean them up, and weigh their importance, then use transformed data to build the query template to score various combinations.
So here's some pseudocode for the function:
function parse_query(pQueryWords in clob) returns clob as
begin
generate_token_list (); -- split the query into a set of individual tokens/words
for each token in token_list
if it's a mandatory word then accumtokenlist := accumtokenlist || ' ' || token ||'*10' -- weigh the presence of the token strongly
if it's a useful word then accumtokenlist := accumtokenlist || ' ' || token ||'*5' -- domain-specific words are also important
if it's a stopword or reserved word, then do not add it to the list
if it's not on my lists, then accumtokenlist := accumtokenlist || ' ' || token
and normaltokenlist := normaltokenlist ||' ' || token
end loop;
-- so now we have two lists, one for NEAR and one for ACCUM
-- now build the guts of the template
querytemplate := querytemplate || '<seq>' || normaltokenlist || '</seq>';
querytemplate := querytemplate || '<seq>' || replace (accumtokenlist, ' ',' ACCUM ') || '</seq>';
querytemplate := querytemplate || '<seq>$' || replace(normaltokenlist,' ','$') || '</seq>';
querytemplate := querytemplate || '<seq>?' || replace(replace(accumtokenlist,' ',' ?'),' ', ' accum ') || '</seq>'; -- first fuzzy the words, then accum
querytemplate := querytemplate || '<seq>?' || replace(replace(normaltokenlist,' ',' ?'),' ', ' near ') || '</seq>'; -- first fuzzy the words, then near
return querytemplate;
end;
So, with a 'cooked' query text that is template-friendly, all we need to do is apply a template that is aware of your inputs:
query_Template_string := '
<query>
<textquery lang="ENGLISH" grammar="CONTEXT"> horizontal stabilizer*5 trim brake*10
<progression> '
|| parse_query('horizontal stabilizer trim brake') ||
' </progression>
</textquery>
<score datatype="INTEGER" algorithm="COUNT"/>
</query>';
So that's an example of one approach. -
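To make the strategy above a little more concrete, here is a minimal Python sketch that builds the `<seq>` entries for the progression. The word sets passed in are hypothetical examples, the weights 10 and 5 are taken from the pseudocode, and putting every non-stopword into the plain list is an assumption. The exact operator combinations should be checked against your Oracle Text version before relying on them.

```python
def split_tokens(query, mandatory, useful, stopwords):
    """Return (weighted, plain) token lists for a free-text query."""
    weighted, plain = [], []
    for token in query.lower().split():
        if token in stopwords:
            continue  # stopwords are dropped from both lists
        if token in mandatory:
            weighted.append(f"{token}*10")  # weigh mandatory words strongly
        elif token in useful:
            weighted.append(f"{token}*5")   # domain-specific words matter too
        else:
            weighted.append(token)
        plain.append(token)  # assumption: every kept token is also used unweighted
    return weighted, plain


def build_progression(query, mandatory, useful, stopwords):
    """Build the <seq> entries, from strictest to most relaxed."""
    weighted, plain = split_tokens(query, mandatory, useful, stopwords)
    steps = [
        " ".join(plain),                        # exact phrase
        " ACCUM ".join(weighted),               # weighted accumulate
        " ".join("$" + t for t in plain),       # stemmed phrase
        " NEAR ".join("?" + t for t in plain),  # fuzzy words in proximity
    ]
    return "".join(f"<seq>{step}</seq>" for step in steps)


print(build_progression("horizontal stabilizer trim brake",
                        mandatory={"brake"}, useful={"stabilizer"},
                        stopwords={"the"}))
```

The returned string would go between `<progression>` and `</progression>` in the query template. A query made up only of stopwords produces empty `<seq>` entries, so a real version would need to guard against that.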
HELP! Need help generating TEXT-ONLY portal page...
Text Only Portal Question:
PLATFORM:
=================================================================
Sun Solaris (5.2 if memory serves) for db and mid-tier, running
8.1.7 DB and 3.0.9 (1.0.2.2) portal.
THE NEED:
=================================================================
I need to display text only portal pages. Some of the more
detailed concerns at this point are below. Also, I've had an open
tar on Metalink for about two weeks, and after research from
their end has resulted in no help.
THE ISSUES (so far):
=================================================================
IMAGES:
If an anchor [A HREF=...] tag uses an image as its "text", I
need to strip out the ALT= text to show inside the anchor. If no
ALT text is available, then I would like to show the image name
as a default.
For example:
<img src=home.gif
alt=Home>
should display as:
Home
FORMS:
How do I get the resulting page from a form (which include the
login inputs and submit button, search box, advanced search page,
etc.) to be displayed by the text only page?
For example:
When a form is called, the <FORM> elements are as follows:
METHOD=GET or POST
ACTION=url (relative or absolute) to the script.
In this case, the action value is:
ACTION=/servlet/page?
pageid=6&dad=portal30&_schema=PORTAL30.
This calls the advanced search API.
I would expect that to redirect the browser back to some
text-only version, the ACTION= element would have to be changed
to be something like:
ACTION=[pathscraper]?/servlet/page?
pageid=6&dad=portal30&_schema=PORTAL30
REDIRECTION:
What happens when portal pages redirect internally? How do you
get back to the text-only page?
For example:
The login link on the standard Oracle Portal home page flips
from url to url to get to the actual login page. Our
implementation of Oracle portal goes from
[DOMAIN]/pls/portal30_sso/portal30_sso.wwsso_app_admin.ls_login
to [domain]/pls/portal30_sso/portal30_sso.login_page.
Since this is standard Oracle redirection, how can it be
intercepted so the portal30_sso.login_page can be presented as
text only?
TRIED SO FAR:
=================================================================
I've written a socket/text scraper in Perl, running it from a web
server. The problems mentioned above are really causing problems,
plus the whole cookie thing. Since Oracle Portal tries to push a
cookie to the client, when the client is another UNIX server,
the cookie thing doesn't work.
POSSIBLE OTHER SOLUTIONS:
=================================================================
Something...anything. I've tried to think of some method to
create some sort of PL/SQL procedure to catch the content then
strip out the HTML calls.
An Applet to do the same thing, but on the client side, but
since time is an issue, coding a complete Java applet isn't
really an option.
THE CONCLUSION:
=================================================================
HELP! I need some help. This is for a client that is government
funded and has to meet Section 508 (part of the Americans with
Disabilities Act, which states that web sites and applications must be
made accessible). A text-only page is one of the requirements for
an accessible page.
Thanks,
Ryan Stefani
ps: feel free to contact me via [email protected] or
[email protected]
Use Find/Change and the GREP tab.
Search for .+ and set the Find formatting to find the characteristics you want.
What will you do with this text once found? You'll need something to "change" to, either new text or Change Formatting options... -
Need Help regarding text Output
Dear gurus.
I need help formatting a text.
I want to format an employee subgroup text.
I'm getting the text "Workers (7)" from table t503t, field ptext.
I want to show only "(7)" in the output, not the whole text. How can I do this?
Please help.
Regards,
Saad.Nisar
DATA: BEGIN OF itab_odoe OCCURS 0,
department_text LIKE t527x-orgtx,"Holds the short text for department
department_no LIKE pernr-orgeh,
pernr LIKE pernr-pernr,
ename LIKE pernr-ename,
grade like t503t-ptext, "THIS AREA GET ME TEXT OF EMPLOYEE SUBGROUP"
* department_text LIKE t527x-orgtx,"Holds the short text for department
current_year LIKE sy-datum,
wt0001 LIKE q0008-betrg,"Basic Pay
wt1101 LIKE q0008-betrg," COLA
wt3002 LIKE p0015-betrg,"Overtime
per_basic type p DECIMALS 2,"Overtime percentage on basic
per_basic_sum type p decimals 2,"Overtime Sum Division
overtime_sum LIKE p0015-betrg,"holds sum of overtime
basic_sum like q0008-betrg,"holds sum of basic
END OF itab_odoe.
I'm using this select statement to get the employee subgroup from the table:
select single ptext
from t503t
into itab_odoe-grade
where persk eq pernr-persk
AND SPRSL eq 'EN'.
Now the values that come into itab_odoe-grade are "Workers (7)", "Snr Mgt (M3)".
I want to show only the text in brackets. -
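The extraction being asked for boils down to "keep the first parenthesised group". As a language-neutral illustration (a Python sketch, not ABAP; the function name is made up for this example), the pattern could look like this:

```python
import re


def bracketed_part(text):
    """Return the first '(...)' group in text, or '' if there is none."""
    match = re.search(r"\([^()]*\)", text)
    return match.group(0) if match else ""


for value in ("Workers (7)", "Snr Mgt (M3)"):
    print(bracketed_part(value))
```

In ABAP the same idea could presumably be expressed with the language's own string or regex facilities; the sketch only shows the pattern to match.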
Need help in text field with 2D array
text field with 2D array
Hi
I need help representing (i) in the from field and (j) in the to field.
i and j are the indices of a 2D array.
This code is not complete.
import java.applet.*;
import java.awt.*;
import java.awt.event.*;
//declaring class
public class test3 extends Applet implements ActionListener
{ //declaring the TextField
private TextField fromField ,toField;
//declaring an array
int weight[][];
int m = 99; // m is infinity
int N; // Set of Nodes
int d; // distance
int i; // source Node
int j; // destination Node
//declaring values of text field
private int from = i; // start Node
private int to = j; // end node
public void init()
{
setBackground(Color.white);
setForeground(Color.red);
//giving labels
Label TITLE2,TITLE1;
TITLE1 = new Label("from:");
add(TITLE1);
fromField = new TextField(5);
add(fromField);
// register listener using void add actionListener
fromField.addActionListener(this);
TITLE2 = new Label("to");
add(TITLE2);
toField = new TextField(5);
add(toField);
// register listener using void add actionListener
toField.addActionListener(this);
}
// event handler methods
public void actionPerformed(ActionEvent event) {
//declaring textfield
from=Integer.parseInt(fromField.getText());
to=Integer.parseInt(toField.getText());
weight =new int[7][7];
weight[1][1] = 0; weight[2][1]= 2;
weight[1][2]= 2; weight[2][2]= 0;
weight[1][3]= 5; weight[2][3]= 3;
weight[1][4]= 1; weight[2][4]= 2;
weight[1][5]= 99; weight[2][5]= 99;
weight[1][6]= 99; weight[2][6]= 99;
weight[3][1]= 5;
weight[3][2]= 3;
weight[3][3]= 0;
weight[3][4]= 3;
weight[3][5]= 1;
weight[3][6]= 5;
for (int i=1; i<7; ++i) {
for (int j=1; j<7; ++j)
all your base are belong to us
-
Clarifications needed for full text search
Hi,
I need some clarification regarding full text search.
1) Is japanese part of the standard Oracle full text search?
2) if it is not, how to install the japanese lexer?
3) how oracle is sorting international characters. If a column contains both english, japanese and french, how will be the output?
Thanks
Muneer
Following is the SQL statement and the result I got:
select language, description,lengthb(description) bytes, length(description) length, vsize(description) vsize from t2;
LANGUAGE DESCRIPTION BYTES LENGTH VSIZE
English abcdefghij 10 10 10
English zyxwvutsrq 10 10 10
French désignéess 16 12 16
French réconcilia 13 11 13
German Einfuhrzöl 13 11 13
German müÃtämpfer 19 13 19
Greek δημοÏιογÏα 40 20 40
Greek αÏοκλειÏÏι 42 20 42
Russian пÑеÑÑÑпник 42 20 42
Russian пÑÐ¸Ð²ÐµÐ´ÐµÐ½Ð¸Ñ 41 20 41
Japanese å ¥éå¸ã®ä¼ç¤¾ã®éè¡å£ 65 30 65
Japanese ç¥æ¸å¸ä¸å¤®åºã®æ±éå 62 30 62
Korean ì¶ë°ì ë¶í°ì¶ë°ì ë¶í° 64 30 64
Korean ë³´ì¢ê´ìì¶ë°ì ë¶í°ê²½ 64 30 64
Hindi à¤à¤¤à¤à¤¨à¤¤à¤®à¤¨à¤à¤¤à¤¶à¥à¤° 73 36 73
Hindi नà¥à¤à¥à¤¨à¥à¤à¥à¤¨à¥à¤à¥à¤¨à¥à¤à¥à¤¨à¥à¤à¥ 130 60 130 I think it explains a lot. I am facing another problem in searching blob columns when it contains japanese or korean characters. I tried with multi lexer (adding japanese as sub lexer and making english as default lexer). But it is not searching the column. Do i have to set any other parameters (editing registry, changing enviornment setting etc). I used the following script to set the lexer.
begin
ctx_ddl.create_preference('english_lexer','basic_lexer');
ctx_ddl.set_attribute('english_lexer','index_themes','yes');
ctx_ddl.set_attribute('english_lexer','theme_language','english');
ctx_ddl.create_preference('german_lexer','basic_lexer');
ctx_ddl.set_attribute('german_lexer','composite','german');
ctx_ddl.set_attribute('german_lexer','mixed_case','yes');
ctx_ddl.set_attribute('german_lexer','alternate_spelling','german');
ctx_ddl.create_preference('japanese_lexer','japanese_vgram_lexer');
ctx_ddl.create_preference('korean_lexer','KOREAN_MORPH_LEXER');
ctx_ddl.set_attribute('korean_lexer','COMPOSITE','NGRAM');
ctx_ddl.create_preference('global_lexer', 'multi_lexer');
ctx_ddl.add_sub_lexer('global_lexer','default','english_lexer');
ctx_ddl.add_sub_lexer('global_lexer','german','german_lexer','ger');
ctx_ddl.add_sub_lexer('global_lexer','japanese','japanese_lexer','jpn');
ctx_ddl.add_sub_lexer('global_lexer','korean','Korean_lexer');
end;
Hope I presented enough details. -
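One observation on the table above: the BYTES and LENGTH values for the French rows are consistent with the text having been encoded twice, that is, UTF-8 bytes misread as a single-byte Western encoding and then stored as UTF-8 again. The Python sketch below reproduces those numbers. Latin-1 is an assumption made for this sketch; the database and client character sets actually involved are not known from the post.

```python
def double_encode(text):
    """Simulate UTF-8 bytes being misread as Latin-1 (an assumed encoding)."""
    return text.encode("utf-8").decode("latin-1")


for original in ("d\u00e9sign\u00e9ess", "r\u00e9concilia"):
    stored = double_encode(original)
    # characters as typed, characters as stored, bytes as stored
    print(len(original), len(stored), len(stored.encode("utf-8")))
```

If that is what happened here, the lexers would be indexing the garbled characters rather than real Japanese or Korean text, which might explain why the search finds nothing. It would be worth checking the database character set and the client NLS_LANG setting before changing the lexer preferences.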
Need help with text() processing in XSL
Hello,
I have an xml that contains such text in my xml:
before<a>inside</a>after
and an xsl that transforms it to HTML (a cut for xsl):
<xsl:template match="a">
<xsl:apply-templates/>
</xsl:template>
<xsl:template match="text()">
<xsl:value-of disable-output-escaping="yes" select="."/>
</xsl:template>
The result is: inside before after
but I need: before inside after
It seems it happens 'cause of this: http://www.w3.org/TR/xslt#conflict
but I cannot find a way to solve this problem :(
I had tried to use priority in xsl:template, but it didn't help :(
Thanks a lot.
DrClap,
here are the xml and xsl.
They're not the real xml and xsl, but they should describe the idea and the problem. I hope I haven't missed anything.
P.S. I cannot control xml, that's why I cannot use: <xsl:text> in xml.
Thank you!
xml:
<?xml version="1.0" encoding="UTF-8"?>
<root>
<title>Page title</title>
<page>
Location: <red>http://host</red>
</page>
</root>
xsl:
<?xml version='1.0' encoding='ISO-8859-1'?>
<xsl:stylesheet version="1.0"
xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
xmlns:fo="http://www.w3.org/1999/XSL/Format"
xmlns:fox="http://xml.apache.org/fop/extensions"
exclude-result-prefixes="fo">
<xsl:template match="root">
<html>
<head>
<title>
<xsl:apply-templates select="title"/>
</title>
</head>
<body>
<xsl:apply-templates select="page"/>
</body>
</html>
</xsl:template>
<xsl:template match="page">
<xsl:apply-templates/>
</xsl:template>
<xsl:template match="title">
[Test]: <xsl:apply-templates/>
</xsl:template>
<xsl:template match="red">
<xsl:element name="span"><xsl:attribute name="style">color:red</xsl:attribute><xsl:apply-templates/></xsl:element>
</xsl:template>
<xsl:template match="text()">
<xsl:value-of disable-output-escaping="yes" select="."/>
</xsl:template>
</xsl:stylesheet> -
Need help removing text that is covering the document - Adobe Acrobat Standard X
I am trying to remove "preview only" that is splashed across our document diagonally. I can remove this on my computer which has Adobe Acrobat XI standard with content editing --> edit text and images (a cursor pops up and i can just use the delete button), but not on another computer with Adobe Acrobat X standard. The only thing we could do was add a red line over the "preview only" and delete that red line.
I did try to do my research, but all of the search ideas I was using didn't prove fruitful.
Any help or ideas you could provide, I would certainly appreciate.
Thank you in advance.
Hey courtney evans,
Please let me know how you sent the PDF file to the other computer.
Also, are you viewing the file in a browser, or downloading it and then opening it in Acrobat?
Is the document scanned?
Please specify and let me know.
Regards,
Anubha -
Need Help Adding Text To A Template
I am using this template: http://www.templatemonster.com/flash-templates/21091.html
I would like to add text as it opens, going across the flag, something like this....
Sam Young. Not Uncle Sam, but here to serve you!
Now, obviously, that's not what I want it to say, but it's an example, of how I want it to move across the page, on the flag, before it opens to the first page. I've tried everything I know to do. I'm sure I'm just missing something silly. Can anyone help me?
Thanks!
Wow... am I asking the impossible here? lol
I'm still looking for help with this, if anyone can help me.
I am searching for template help tutorials, and I don't know if I'm just calling it the wrong thing or what.
I want the text to scroll ACROSS the screen, on entry to the site, across one of the stripes on the flag. Every tutorial I have found tells you how to add scrolling text in the box, like paragraphs. I just want one line, a Welcome Message, if you will, that goes across the screen. I was able to do this in DHTML previously, but Flash is a whole new ball game for me.
Ok, I just did a search, and it's a MARQUEE that I want. So, I've searched, and found this: http://www.kirupa.com/forum/archive/index.php/t-3601.html
So, I'll try that and see if it does what I want it to do.
I'm including all of this, in the event that someone else is interested. -
I need some help with PE 12 and adding text...
This forum is for Photoshop Elements, and Photoshop Elements doesn't support editing GIF.
You might want to post to the Photoshop community: Photoshop General Discussion -
Need Help Printing Text Messages From E71
I need to print some saved text messages that are on my E71 but cannot get it to hook up to my Bluetooth printer; it just never finds it. Is there another way to print these messages from the phone? I really need them.
connect to pc and use ovi suite
If i have helped at all a click on the white star below would be nice thanks.
Now using the Lumia 1520 -
Need Help removing text from an image.
I am using illustrator Cs3 version. I have an image of a sun with text in front of it, now I only need the image of the sun to then use in photoshop. How do I remove the text and still have a full-color image of the only the sun? Please Help! (Image below)
Hi,
You have posted your question in the Adobe Illustrator Draw iOS app forum. To get help for your question please post in the Adobe Illustrator desktop forum: https://forums.adobe.com/community/illustrator.
Regards,
Jose -
Using CS4 on Win7 Pro. My client bought a template that has a flash piece and I am a total novice in Flash but I do know that you edit the FLA file and export it as a movie. So I opened the FLA file and I'm able to edit the text (all that I want to do right now) but some of the text I'm replacing is longer than the original so it gets hidden under a replay button and some is shorter and is spaced too far away from the replay button (I also want to change the color of that and I think I can figure that one out). There is plenty of room to the left for the text to move to as the original file does have varying lengths of text and it does adjust for that but I can't figure out how to change it for what I've done (I haven't received the final text yet so this is a practice run)... I changed the text by going to the timeline and selecting text from the icon (Edit Symbols) in the upper right of the screen. But where do I tell it where the text should start or how wide it needs to be? I tried moving them individually in the timeline but that didn't change it.
Here's the file: http://do-rightweb.com/fertility/flash/header_vJT.fla (I only want to leave this up temporarily because it is huge). The test location of the file in action is here: http://do-rightweb.com/fertility/
In the future I may have to swap out some of the images leaving the transitions which I think I can do by adding them as a layer deleting the ones I'm replacing...
So if anyone can lend a hand in helping me figure out how to adjust the text width I would be totally jacked! Any help and advice about swapping images or changing colors would be greatly appreciated too. Or even a link to a video to help me understand how to reverse engineer this would be cool.
Thanks in advance for your help and assistance!
Hi,
Since the text animation in this file is done on the timeline, you have to manually edit the position of each symbol element in every keyframe, i.e.
you have to go into editing mode for the txt_c instance of txt_2 > Layer 8, where the required elements are placed.
1. You have to position the Layer 1 (under Layer 8) items at every keyframe for the display text.
You may have a problem positioning the second text onwards, as you will not see them on stage. For this, you may want to duplicate the symbol (txt_5) at keyframe 37 and edit it to contain only the second text, removing everything else. You can then swap the existing symbols at keyframe 44 and keyframe 54. Repeat this for the rest of the text.
2. Next, you have to position the numbers, which are spread across three layers (txt_3, txt_4 and Layer 6) under Layer 8. They are for the prefix number, the suffix number and the dot respectively.
Thanks! -
Need help entering text into my website
Hello,
You'll have to forgive me as I am new to Dreamweaver, HTML
and such things.
I have created my website interface, as shown here:
http://img88.imageshack.us/img88/1452/mywebsitepm9.jpg
I created this in photoshop, then sliced the image into
slices and exported it into Dreamweaver.
I wish to add text to the large white box in the middle of
the page, what is the best way to do this?
Thanks for any help
> I created this in photoshop, then sliced the image into slices and exported it
> into Dreamweaver
A method that is highly unlikely to produce a worthwhile website. Photoshop is an image editing program, not a website building program. If you are trying to become a bona fide web designer I suggest you start by learning HTML & CSS. This is a good starting point:
http://www.amazon.com/XHTML-Sixth-Visual-Quickstart-Guide/dp/0321430840/sr=1-1/qid=1165172849/ref=pd_bbs_sr_1/102-5389401-2687307?ie=UTF8&s=books
If you are just trying to create a one-time site for your personal use you are probably better off buying a template and then just filling in the blanks. Search this NG for sources of quality templates. Beware, many template sites sell pure junk. Try here:
http://groups.google.com/group/macromedia.dreamweaver
(You may want to bookmark that page.)
Walt