Reading content from PDF to XI

Hi All,
Is XI capable of reading the content of a PDF file? I have heard that this can be achieved with an adapter module. Could you explain how to do that, tell me whether there is any other option, or point me to existing threads or articles?

Hi,
Please find some links on the Conversion Agent below. It converts not only PDF documents but also Word documents, HL7 messages and more.
PDF files:
SAP Network Blog: XI: Read data from PDF file in Sender Adapter
/people/sap.user72/blog/2005/07/31/xi-read-data-from-pdf-file-in-sender-adapter
SAP Network Blog: XI: Generate PDF file out of file adapter
/people/sap.user72/blog/2005/07/27/xi-generate-pdf-file-out-of-file-adapter
http://help.sap.com/saphelp_nw04/helpdata/en/43/6d95e0ac846fcbe10000000a1553f6/CMGetStart.pdf
http://help.sap.com/saphelp_nw04/helpdata/en/43/4c38c4cf105f85e10000000a1553f6/content.htm
Regards,
Phani

Similar Messages

How can I read content from PDF file stored in Oracle 9i XMLDB

Hi Friends:
I don't know how to read a string, for example "Hello", from a PDF file stored in Oracle 9i XMLDB. I have already stored the PDF file in XMLDB. Any suggestions are appreciated. Thank you in advance.

You may be able to do something with Oracle Text. The following shows how to get an HTML rendition of a binary document. I think you can also get plain text instead of HTML.
set echo on
spool xfilesUtilties.log
connect sys/&1 as sysdba
grant ctxapp to &2
/
connect &2/&3
-- Policy that lets Oracle Text filter binary documents (PDF, Word, ...)
begin
  ctxsys.ctx_ddl.create_policy(policy_name => 'XFILES_HTML_GENERATION', filter => 'ctxsys.auto_filter');
end;
/
create or replace package xfiles_internal_11010
authid definer
as
  function renderAsHTML(sourceDoc BLOB) return CLOB;
end;
/
show errors
create or replace package body xfiles_internal_11010
as
  function renderAsHTML(sourceDoc BLOB)
  return CLOB
  as
    html_content CLOB;
  begin
    dbms_lob.createTemporary(html_content, true, DBMS_LOB.SESSION);
    -- plaintext => false returns HTML; true returns plain text
    ctx_doc.policy_filter(policy_name => 'XFILES_HTML_GENERATION',
                          document    => sourceDoc,
                          restab      => html_content,
                          plaintext   => false);
    return html_content;
  end;
end;
/
show errors
create or replace package xfiles_utilities_11010
authid current_user
as
  HOME_FOLDER   constant varchar2(700) := xdb_constants.HOME_FOLDER;
  PUBLIC_FOLDER constant varchar2(700) := xdb_constants.PUBLIC_FOLDER;
  function renderAsHTML(sourceFile VARCHAR2) return CLOB;
  function transformToHTML(xmldoc XMLType, xslPath VARCHAR2) return CLOB;
end;
/
show errors
create or replace package body xfiles_utilities_11010
as
  -- Reads the document at the given XML DB repository path and renders it
  function renderAsHTML(sourceFile VARCHAR2)
  return CLOB
  as
  begin
    return xfiles_internal_11010.renderAsHTML(xdburitype(sourceFile).getBLOB());
  end;
  function transformToHTML(xmldoc XMLType, xslPath VARCHAR2)
  return CLOB
  as
    html clob;
  begin
    select xmldoc.transform(xdburitype(xslPath).getXML()).getClobVal()
      into html
      from dual;
    return html;
  end;
end;
/
show errors
grant execute on xfiles_utilities_11010 to public
/
create or replace public synonym xfiles_utilities for xfiles_utilities_11010
/
quit
Message was edited by:
mdrake

Copy Paste content from PDF shows a missing character

Our website, a4academics, provides a platform for students to share academic projects and seminar reports as PDFs. Students have complained that when they copy and paste content from a PDF, the letter "f" is not copied in some reports. What might be the reason? Is it an issue with the software that created the PDF report? It happens in only a few PDF files.
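One possible explanation, offered here as an assumption rather than a confirmed diagnosis: fonts often draw "fi" as a single ligature glyph. If the PDF maps that glyph to the Unicode ligature character U+FB01, the pasted text contains one character that some editors cannot display. If the PDF carries no Unicode mapping for the glyph at all, the letters are lost at copy time and cannot be recovered afterwards. A minimal Java sketch for the first case, using the standard `java.text.Normalizer`; the class and method names are hypothetical:

```java
import java.text.Normalizer;

final class LigatureFix {
    // NFKC compatibility normalization expands ligature code points
    // such as U+FB01 (fi) and U+FB02 (fl) into their plain letters.
    static String expandLigatures(String pasted) {
        return Normalizer.normalize(pasted, Normalizer.Form.NFKC);
    }

    public static void main(String[] args) {
        // A word containing the single-character "fi" ligature.
        String pasted = "de\uFB01ning";
        System.out.println(expandLigatures(pasted));
    }
}
```

This only helps when the ligature character actually arrives in the pasted text; it cannot restore letters that were never copied.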

Currently a Facebook like is needed to download the PDF, which is why I asked for your e-mail address. If you don't want to give a Facebook like, please send me your e-mail address.
The PDF is attached at the bottom of the page linked below:
http://a4academics.com/final-year-be-project/12-be-ece-electronics-and-communication-project/503-home-based-wireless-health-monitoring-system
I copied and pasted part of the report into Notepad++. It appears as shown below; the "f" is missing in "defining":
"de ning the "Einthoven triangle" - an equilateral triangle with the heart at the centre."

Read contents inside pdf file programmatically in SharePoint

I have a SharePoint document library. My requirement is that when a user adds a PDF file to the document library, an event receiver fires and reads the contents of the PDF file programmatically. A workflow should then start according to the result of the event receiver.

If your question is about handling events in apps for SharePoint, see these links:
http://msdn.microsoft.com/en-us/library/office/jj220048%28v=office.15%29.aspx
http://msdn.microsoft.com/en-us/library/office/jj220051%28v=office.15%29.aspx
If what you need is a way to extract text from the PDF inside the event handler, see this example that uses LEADTOOLS:
http://support.leadtools.com/CS/forums/ShowPost.aspx?PostID=43894
You should use a PDF text extractor in your event handler code.
You can use iTextSharp to read the content:
http://www.codeproject.com/Tips/387327/Convert-PDF-file-content-into-string-using-Csharp

How to read tags from pdf to print in different printers (PCL or PS)

Hello from Spain. I would appreciate some help on how to read tags from a PDF so that I can check the PDF producer tag and send the file to a print job that uses the PCL or PS driver. I have trouble printing PDF files made by PDFCreator (Ghostscript, intermediate PDF from PS) on the PCL printer job. I need to evaluate the tags of the PDF in order to send an OS command line or JavaScript inside Adobe Acrobat that prints to the correct printer job: PCL or PS.
Thanks.

As noted, you cannot access the "Tag" of a PDF, but you can access the metadata of a PDF, and one of the items recorded in the metadata is the "producer". This metadata entry is supposed to contain the name of the program that produced the PDF. One can access it through the "info" property or the "metadata" property of the doc object.
var cProducer = this.info.Producer; // get the producer application for this PDF;
console.show();
console.clear();
console.println("This PDF was created by " + cProducer);
Be aware that the user can easily modify the Producer information with Acrobat, JavaScript, or other application after the PDF has been created.
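Once the producer string is known, the routing decision itself is small. The sketch below is in Java rather than Acrobat JavaScript and rests on an assumption taken from the question: files produced through Ghostscript go to the PS queue and everything else goes to PCL. The class name, method name and queue labels are hypothetical.

```java
import java.util.Locale;

final class PrintQueueChooser {
    // Returns "PS" when the Producer string mentions Ghostscript,
    // otherwise "PCL". A missing Producer falls back to "PCL".
    static String chooseQueue(String producer) {
        if (producer == null) {
            return "PCL";
        }
        String lower = producer.toLowerCase(Locale.ROOT);
        return lower.contains("ghostscript") ? "PS" : "PCL";
    }

    public static void main(String[] args) {
        // Example Producer value; real values vary by tool and version.
        System.out.println(chooseQueue("GPL Ghostscript 8.61"));
    }
}
```

Because the Producer entry can be edited after the fact, a rule like this is a heuristic and should have a safe default.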

How to read HyperLinks from pdf file??

Hi developers,
I am working on PDF processing and have a question.
How can I read hyperlinks from a PDF file?
I am able to set a hyperlink, but I am not able to get the hyperlinks.
The following example program sets hyperlinks in a PDF file using the lowagie (iText) API:
import java.io.FileOutputStream;
import java.io.IOException;
import com.lowagie.text.Anchor;
import com.lowagie.text.Chunk;
import com.lowagie.text.Document;
import com.lowagie.text.DocumentException;
import com.lowagie.text.Paragraph;
import com.lowagie.text.pdf.PdfReader;
import com.lowagie.text.pdf.PdfWriter;
public class Argu1 {
     public static void main(String[] args) {
          Document document = new Document();
          try {
               PdfWriter pdf = PdfWriter.getInstance(document,
                         new FileOutputStream("PageLink.pdf"));
               // PdfReader pdf_read=new   (statement left incomplete in the post)
               document.open();
               document.add(new Paragraph("Hi Everbody....!"));
               // Each Anchor becomes a clickable link in the PDF
               Anchor pdfRef = new Anchor("Click Me");
               pdfRef.setReference("www.java2s.com");
               Anchor rtfRef = new Anchor("Touch Me");
               rtfRef.setReference("www.sun.com");
               System.out.println(rtfRef.reference());
               document.add(pdfRef);
               document.add(Chunk.NEWLINE);
               document.add(rtfRef);
          } catch (DocumentException de) {
               System.err.println(de.getMessage());
          } catch (IOException ioe) {
               System.err.println(ioe.getMessage());
          }
          document.close();
     }
}
Please help me read the hyperlinks from the PDF file using Java.
Thanks in advance,
With Regards,
J.Imran

Instead of cross-posting unformatted code, you could have taken a look at the API, because there you might have come across a method named getLinks. Even though it's not documented, I really suspect that it will return the hyperlinks on a given page.
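As a rough cross-check that needs no PDF library, link targets can sometimes be found by scanning the raw file for `/URI (...)` entries. This is a sketch under strong assumptions: the annotation dictionaries are stored uncompressed, the file is not encrypted, and each URI is a literal string without nested parentheses or escapes. It is not a substitute for a real PDF parser, and the class name `PdfUriScan` is hypothetical.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class PdfUriScan {
    // Matches link actions written as "/URI (http://example.com)".
    private static final Pattern URI_ENTRY =
            Pattern.compile("/URI\\s*\\(([^()]*)\\)");

    static List<String> findUris(byte[] pdfBytes) {
        // ISO-8859-1 maps every byte to exactly one char, so binary
        // sections of the file do not break the decoding step.
        String raw = new String(pdfBytes, StandardCharsets.ISO_8859_1);
        List<String> uris = new ArrayList<>();
        Matcher m = URI_ENTRY.matcher(raw);
        while (m.find()) {
            uris.add(m.group(1));
        }
        return uris;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java PdfUriScan <file.pdf>");
            return;
        }
        for (String uri : findUris(Files.readAllBytes(Paths.get(args[0])))) {
            System.out.println(uri);
        }
    }
}
```

Run against the PageLink.pdf written by the program in the question, this would print the link targets only if the assumptions above hold for that file.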

Read barcode from pdf

Hi all,
Can anyone please tell me how to read a barcode from a PDF in Java? This would be very helpful for me.
Thanks,
Avinash

Avinash_loves_JAVA wrote:
Please can any one tell me how to read barcode from pdf in java. This will be very help full for me.
JMF doesn't support PDFs or barcode reading.
If you need further assistance with this issue, you'll need to repost in a generic Java forum, or perhaps do a Google search for a barcode-reading library.

Read Comments from PDF...

Hello Everyone,
I need to know how to read comments from a PDF file programmatically using C++.
My first idea was to read them from the file itself. I thought the PDF file would contain some text like "StartComments" and "EndComments" (if opened in Notepad), but I didn't find anything like that.
Any suggestions?
Can anyone recommend a free library that does this?
Thanks
...Pankaj

For the API (Application Programming Interface), there is the Acrobat SDK forum:
http://forums.adobe.com/community/acrobat/acrobat_sdk
For coding in Acrobat JavaScript, there is the Acrobat Scripting forum:
http://forums.adobe.com/community/acrobat/acrobat_scripting

Could not get exact contents from pdf using adobe acrobat professional

Hi,
I am using Acrobat Professional to extract the contents of a PDF into HTML. During the extraction, some of the content is rendered as images. With PDF-to-XML extraction I get the exact contents, but I need an HTML file from the PDF. Any suggestions? Thanks in advance.

You might want to see if you can select the background with the object touchup tool. If so, you can then just delete the selected object. It may take a while to go through the document, but if it solves the problem you are ahead. You may be able to select it with JavaScript and repeat the process. My point in using the object touchup tool is that it may not be a background set by Acrobat, but something that is simply labeled as an image from the Acrobat viewpoint.

iTunes not reading contents from iPhone on MacBook

Hi, New Year greetings! New year, new problem with my Apple environment.
This is the sequence of events:
1) I had downloaded a few music podcasts in iTunes on my MacBook.
2) When I tried to sync the podcasts to my iPhone, it failed: nothing was copied, but no error message appeared.
3) Someone told me about transferring podcasts from MacBook to iPhone using third-party software.
4) I downloaded the software and started using it.
5) After 5 minutes I figured out it was no use, so I uninstalled the software.
6) I plugged my iPhone into my Mac immediately afterwards and iTunes popped up this message: "iTunes is unable to read contents of this iPhone, goto summary tab in iPhone preferences and click Restore to restore this phone to factory setting"
This has never happened before; it happened after using the software, which I have already uninstalled.
Please help, what am I to do?
And my initial problem: how do I copy the podcasts to my iPhone?
Please help
Sayeed

Same thing just happened to me. iPhone synced successfully yesterday, but now iTunes wants me to restore the factory settings on my iPhone. Not a cable problem, as I have tried two different USB cables.
What's more, this is not the only iTunes problem of the morning. I was trying to use my iPod (not iPhone) to move my Music library to my son's computer. An error message popped up that my computer (not his) was not authorized to play iTunes purchases, and all purchases would be deleted from my iPod unless I authorized my computer. Even though this computer has been authorized for years, I tried to authorize it as instructed -- only to repeatedly get an error message (-42408), which is apparently a common problem these days.
So I thought, let's see if the iPhone gets the same authorization problem when synced. Well, that didn't work out too well.....
Not too happy with Apple at the moment. What should have been a simple and quick process has now turned into a time-consuming nightmare of multiple problems affecting multiple devices, and apparently these problems are common.

I am trying to Copy paste some content from PDF reader with no success

Hi,
I am new here and hope I will find the answer to my problem.
I am currently working on an ebook about affiliate marketing that I plan to publish.
I am using some resources from other ebooks I purchased and have on hand.
When I copy and paste some content, it does copy, but when I paste it into a Notepad file, the content comes out as gibberish (signs and spaces).
I've tried using Microsoft Word, but there it comes out completely wrong as well.
Has anyone experienced something similar?
Thank you.
Mia.

This is common. In this case it may be done to protect the copyright
content of the ebook from reuse. Since the amount you can use under
fair use is only a few sentences, it shouldn't take too long to
retype...
Aandi Inston

How to read text from PDF and HTML

I have a solution for reading text from a .txt file, but I couldn't find code for PDF and HTML.
I don't want to convert the PDF to .txt.
Please help me.

Reading from a file is always the same: the strategy used for a .txt file will also let you read a .pdf file.
Of course, on its own that will be useless, because PDF files have a special internal structure.
HTML files are plain text, so they can be read exactly like .txt files.
What are you trying to accomplish with the files you are reading?
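To illustrate the point about internal structure: the bytes of a PDF can be read like any other file, but they begin with a `%PDF-` header and continue with PDF syntax rather than readable prose. A minimal Java sketch that reads a file and checks for that header; the class and method names are hypothetical, and a real text extractor would still need a PDF library:

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

final class PdfSniffer {
    private static final byte[] MAGIC =
            "%PDF-".getBytes(StandardCharsets.US_ASCII);

    // True when the data starts with the "%PDF-" header bytes.
    static boolean looksLikePdf(byte[] data) {
        if (data.length < MAGIC.length) {
            return false;
        }
        for (int i = 0; i < MAGIC.length; i++) {
            if (data[i] != MAGIC[i]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java PdfSniffer <file>");
            return;
        }
        // Same read call for .txt, .html and .pdf files.
        byte[] data = Files.readAllBytes(Paths.get(args[0]));
        if (looksLikePdf(data)) {
            System.out.println("PDF header found: a PDF parser is needed to get the text");
        } else {
            System.out.println("No PDF header: the bytes may be readable as plain text");
        }
    }
}
```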

Grabbing content from PDF exported from Pages

Hi,
I have a number of reports that I have created using Pages that include pictures, text and bar charts. For the most part, I used the Graphs function in Pages to create these charts (as opposed to using Numbers and then copying in).
I exported these documents as PDFs for my clients, but they would like to be able to select specific graphs to include in their PowerPoint presentations etc.
I have instructed them to use the Select Tool in Acrobat Reader (these are PC users), however the quality is AWFUL...very very fuzzy to the point of being unreadable.
So I have tried all these things:
- Using Select Tool in Preview to select from a PDF converted in Good quality (fuzzy)
- Using Select Tool in Preview to select from a PDF converted in Best quality (not much better)
- Using Grab utility on document opened in Preview (maybe slightly better but not much)
- pulling the bar chart into Photoshop first then pasting into Word (awful)
- converting the Pages document into Word document (the worst ever)
The charts look crisp when viewed in Preview/Acrobat Reader, so I don't know why they won't copy better. If I select and copy the text boxes and the pictures (from Preview), they seem to be fine, so it is just the charts that are a problem.
Any ideas?

The text and line drawings will be fine, they are generally not rendered, unless they overlap some bitmap material.
You can try simplifying the bar charts to basic line material, none of the fancy rendered versions.
You do not need to unshadow the graphs if they are upscaled, the shadowing is the same resolution as the graph.
I understand the scope of the work.
I just experimented with trying to print the graph to 400% to see if that would get around the issue and just hit an amazing bug. When you do that Pages shrinks the page size!
Apple catches you coming and going!
Peter
PS: I have provided Pages Feedback on this and suggest you do also.

ITEXT protect the content from pdf

Hi all,
When I run this JSP it shows me a PDF, but I have a problem: I can edit that PDF and change its content. Is there a way to protect it?
How?
Thanks
<%@
page import="java.io.*,
             java.awt.Color,
             java.net.MalformedURLException,
             java.net.URL,
             com.lowagie.text.*,
             com.lowagie.text.pdf.*,
             com.lowagie.text.rtf.RtfWriter2,
             com.lowagie.text.pdf.codec.TiffImage,
             com.lowagie.text.pdf.codec.GifImage"
%><%
          // Template JSP file for iText
          // by Tal Liron
          response.setContentType( "application/pdf" );
          // step 1: creation of a document-object
          Document document = new Document();
          ByteArrayOutputStream buffer = new ByteArrayOutputStream();
          PdfWriter writer = PdfWriter.getInstance( document, buffer );
          document.open();
          // Configure the output as a byte array buffer
          // Configure 128-bit encryption of the content
          PdfContentByte cb = writer.getDirectContent();
          // Font type
          BaseFont bf = BaseFont.createFont(BaseFont.HELVETICA_BOLD, BaseFont.CP1252, BaseFont.NOT_EMBEDDED );
          cb.stroke();
          cb.setColorFill(Color.blue);
          cb.stroke();
          cb.beginText();
          cb.setFontAndSize(bf, 12);
          cb.setColorFill(Color.darkGray);
          cb.showTextAligned(PdfContentByte.ALIGN_LEFT, "text 1", 140, 591, 0);
          cb.showTextAligned(PdfContentByte.ALIGN_LEFT, "text 1", 140, 577, 0);
          cb.showTextAligned(PdfContentByte.ALIGN_LEFT, "text 1", 240, 562, 0);
          cb.showTextAligned(PdfContentByte.ALIGN_LEFT, "text 1", 140, 548, 0);
          cb.showTextAligned(PdfContentByte.ALIGN_LEFT, "text 1", 140, 532, 0);
          cb.showTextAligned(PdfContentByte.ALIGN_LEFT, "text 1", 140, 517, 0);
          cb.showTextAligned(PdfContentByte.ALIGN_LEFT, "text 1", 208, 502, 0);
          cb.endText();
          document.close();
          // Send the finished PDF to the browser in one write
          byte[] bytes = buffer.toByteArray();
          response.setContentLength(bytes.length);
          OutputStream output = response.getOutputStream();
          output.write(bytes);
          output.flush();
%>
Edited by: silvaneto on Jun 26, 2009 8:03 AM

Here's the Bullet.fx file
import java.lang.Math;
import javafx.scene.CustomNode;
import javafx.scene.Node;
import javafx.scene.paint.Color;
import javafx.scene.shape.Circle;
public class Bullet extends CustomNode {
    def GRAVITY : Number = 0.0002;
    def WALL_HARDNESS = .90;
    public var x : Number;
    public var y : Number;
    public var radius : Number = 5;
    public var color : Color = Color.RED;
    public var velocity_x : Number;
    public var velocity_y : Number;
    var offScreen: Boolean = false;
    public function collide( invader : Invader ): Boolean {
        var dx : Number = invader.x - x;
        var dy : Number = invader.y - y;
        var distance : Number = Math.sqrt( dx * dx + dy * dy );
        var minDist : Number = invader.radius + radius;
        return ( distance < minDist )
    }
    public function move( delta_time: Number, width : Number, height : Number ): Boolean {
        velocity_y += GRAVITY * delta_time;
        y += velocity_y * delta_time;
        x += velocity_x * delta_time;
        // Flag the bullet once it crosses any edge of the scene
        if( x + radius > width ) {
            offScreen = true;
        } else
        if( x - radius < 0 ) {
            offScreen = true;
        }
        if( y + radius > height ) {
            offScreen = true;
        } else
        if( y - radius < 0 ) {
            offScreen = true;
        }
        return offScreen;
    }
    public override function create(): Node {
        return Circle {
            centerX: bind x,
            centerY: bind y,
            radius: bind radius
            fill: bind color
        }
    }
}

Read metadata from PDF via VBA

I wrote a VBA script that reads the custom properties of a collection of Word docs and writes the results to a file which I can then import to Excel.
Now I have to do the same thing for the PDF files created from the Word docs - that is, read the metadata from a collection of PDF files.
The fields that have to be read are always the same.
How can I read the metadata of a CLOSED PDF file from VBA?
For Word, the solution was to use DSOleFile.PropertyReader (part of the DS: OLE Document Properties 1.4 object library).
TIA

I refer you to some of the vendors that I just posted - perhaps there is a low cost solution you can find. Pound Hill has some scripting interfaces that might help.
Otherwise you will need to take the free SDK and hook it up somehow.
Gunar
