Extracting informations about rectangles, lines, fields etc from PDF

I'm looking for a way, to extract rectangles, lines, fields etc from a PDF, for example think of a blue rectangle which is drawn in a Word document, which is later exported to pdf. I need informations about the position of the rectangle, the color, the size, if it is filled or not, etc. Is there something like an API or an SDK which can help me? I searched for quite a time (so please, no RTFM-answers), but the only references I found where about producing PDF or extracting text, images or graphics (oh, just at the moment, I had the idea, that lines and rectangles could be coded as vector-graphics in PDF, could that be possible?).
I'm thankful for all answers and ideas (even if you just post links with informations),
kind regards,
Pivo

Yes, lines and rects are coded as vector graphics (or specifically path drawing instructions) in a PDF.
Adobe offers an SDK around Adobe Acrobat which would enable you to get this information. If you need a standalone solution, we also offer our full PDFLibrary.

Similar Messages

  • EtreCheck version: 2.1.5 (108) Report generated 4 January 2015 14:29:26 GMT  Click the [Support] links for help with non-Apple products. Click the [Details] links for more information about that line. Click the [Adware] links for help removing adware

    My Mac is very slow and applications take a long time to load, especially Safari and iTunes.  Please help.    I have run the Etrecheck report and these are results.
    Thanks Pat
    EtreCheck version: 2.1.5 (108)
    Report generated 4 January 2015 14:29:26 GMT
    Click the [Support] links for help with non-Apple products.
    Click the [Details] links for more information about that line.
    Click the [Adware] links for help removing adware.
    Hardware Information: ℹ️
      iMac (21.5-inch, Mid 2011) (Verified)
      iMac - model: iMac12,1
      1 2.7 GHz Intel Core i5 CPU: 4-core
      4 GB RAM Upgradeable
      BANK 0/DIMM0
      2 GB DDR3 1333 MHz ok
      BANK 1/DIMM0
      2 GB DDR3 1333 MHz ok
      BANK 0/DIMM1
      empty empty empty empty
      BANK 1/DIMM1
      empty empty empty empty
      Bluetooth: Old - Handoff/Airdrop2 not supported
      Wireless:  en1: 802.11 a/b/g/n
    Video Information: ℹ️
      AMD Radeon HD 6770M - VRAM: 512 MB
      iMac 1920 x 1080
    System Software: ℹ️
      OS X 10.10.1 (14B25) - Uptime: 0:32:50
    Disk Information: ℹ️
      ST31000528AS disk0 : (1 TB)
      EFI (disk0s1) <not mounted> : 210 MB
      Macintosh HD (disk0s2) / : 999.35 GB (717.51 GB free)
      Recovery HD (disk0s3) <not mounted>  [Recovery]: 650 MB
      OPTIARC DVD RW AD-5690H 
    USB Information: ℹ️
      Apple Inc. FaceTime HD Camera (Built-in)
      Seagate Expansion Desk 2 TB
      EFI (disk1s1) <not mounted> : 210 MB
      Seagate Expansion Drive (disk1s2) /Volumes/Seagate Expansion Drive : 2.00 TB (1.66 TB free)
      Apple Inc. BRCM2046 Hub
      Apple Inc. Bluetooth USB Host Controller
      Apple Inc. iPhone
      Apple Internal Memory Card Reader
      Apple Computer, Inc. IR Receiver
    Thunderbolt Information: ℹ️
      Apple Inc. thunderbolt_bus
    Gatekeeper: ℹ️
      Mac App Store and identified developers
    Kernel Extensions: ℹ️
      /Library/Application Support/Avast/components/fileshield/unsigned
      [loaded] com.avast.AvastFileShield (2.1.0 - SDK 10.9) [Support]
      /Library/Application Support/Avast/components/proxy/unsigned
      [loaded] com.avast.PacketForwarder (2.0 - SDK 10.9) [Support]
    Problem System Launch Agents: ℹ️
      [failed] com.apple.syncservices.SyncServer.plist
    Launch Agents: ℹ️
      [loaded] com.avast.userinit.plist [Support]
      [running] com.epson.Epson_Low_Ink_Reminder.launcher.plist [Support]
      [loaded] com.epson.esua.launcher.plist [Support]
      [running] com.epson.eventmanager.agent.plist [Support]
      [loaded] com.oracle.java.Java-Updater.plist [Support]
      [running] com.trusteer.rapport.rapportd.plist [Support]
    Launch Daemons: ℹ️
      [loaded] com.adobe.fpsaud.plist [Support]
      [loaded] com.avast.init.plist [Support]
      [loaded] com.avast.uninstall.plist [Support]
      [failed] com.avast.update.plist [Support]
      [loaded] com.microsoft.office.licensing.helper.plist [Support]
      [loaded] com.oracle.java.Helper-Tool.plist [Support]
      [running] com.trusteer.rooks.rooksd.plist [Support]
    User Launch Agents: ℹ️
      [loaded] com.adobe.ARM.[...].plist [Support]
      [invalid?] com.avast.home.userinit.plist [Support]
      [running] com.microsoft.LaunchAgent.SyncServicesAgent.plist [Support]
    User Login Items: ℹ️
      iTunesHelper ApplicationHidden (/Applications/iTunes.app/Contents/MacOS/iTunesHelper.app)
    Internet Plug-ins: ℹ️
      FlashPlayer-10.6: Version: 16.0.0.235 - SDK 10.6 [Support]
      Default Browser: Version: 600 - SDK 10.10
      AdobePDFViewerNPAPI: Version: 11.0.07 - SDK 10.6 [Support]
      AdobePDFViewer: Version: 11.0.07 - SDK 10.6 [Support]
      DivXBrowserPlugin: Version: 2.2 [Support]
      Flash Player: Version: 16.0.0.235 - SDK 10.6 [Support]
      OVSHelper: Version: 1.1 [Support]
      QuickTime Plugin: Version: 7.7.3
      JavaAppletPlugin: Version: Java 8 Update 25 Check version
    Safari Extensions: ℹ️
      wrc [Installed]
    3rd Party Preference Panes: ℹ️
      DivX  [Support]
      Flash Player  [Support]
      Flip4Mac WMV  [Support]
      GoToMyPC Preferences  [Support]
      Java  [Support]
      Trusteer Endpoint Protection  [Support]
    Time Machine: ℹ️
      Skip System Files: NO
      Auto backup: YES
      Volumes being backed up:
      Macintosh HD: Disk size: 999.35 GB Disk used: 281.84 GB
      Destinations:
      Seagate Expansion Drive [Local]
      Total size: 2.00 TB
      Total number of backups: 78
      Oldest backup: 2013-07-28 18:09:06 +0000
      Last backup: 2015-01-04 14:29:38 +0000
      Size of backup disk: Adequate
      Backup size 2.00 TB > (Disk used 281.84 GB X 3)
    Top Processes by CPU: ℹ️
          2% WindowServer
          1% mds
          0% fontd
          0% mds_stores
          0% com.avast.daemon
    Top Processes by Memory: ℹ️
      120 MB Safari
      112 MB com.avast.daemon
      94 MB com.apple.WebKit.WebContent
      56 MB spindump
      52 MB mds_stores
    Virtual Memory Information: ℹ️
      479 MB Free RAM
      1.56 GB Active RAM
      1.11 GB Inactive RAM
      904 MB Wired RAM
      5.37 GB Page-ins
      75 MB Page-outs
    Diagnostics Information: ℹ️
      Jan 4, 2015, 01:57:18 PM Self test - passed
      Standard users cannot read /Library/Logs/DiagnosticReports.
      Run as an administrator account to see more information.

    patbythesea wrote:
    Can I assume that with my Mac I do not need any additional virus protection software?  If I do, what should I use?
    See my Mac Malware Guide for help on protecting yourself from malware. You generally don't need anti-virus software.
    (Fair disclosure: I may receive compensation from links to my sites, TheSafeMac.com and AdwareMedic.com, in the form of buttons allowing for donations. Donations are not required to use my site or software.)

  • How to read line number text from PDF using plugin?

    Hi, I would like to know how to read line number text from PDF using plugin?
    Thanks in advance.

    Ok, some background reading of the PDF Reference will help you understand why this is so difficult. PDF files are not organised into lines. It is best to think of each word or character on the page as being a graphic with its own position. The human eye sees lines where a series of graphics (words) are roughly in the same horizontal region.
    In the general case it is difficult or even impossible to answer this. You may have columns with different spacing (but the PDF stores no information on what is a column). You may have subscripts and superscripts. You may have text in graphics coinciding with other text. Commonly, there may be titles, headings or page numbers which are just ordinary text and might count as lines.
    That said, what you need to do is extract the text on the page and its positions. The WordFinder APIs are the way to do that. Now, sort all the words out, using the Y coordinates and size to try and guess what makes a "line". Now you are in a position to find the text (divided into words, not strings) and report the "line number" you have estimated.

  • Extract information about Travel Management

    Hi !!
       I need to work on extractor to get information about Travel Management, the user's need to know how much the company spend on travel.
      I read some documentation about  0FI_TV_01 and 0FI_TV_02, But I don´t know where I can get more information about how to activate this extractor's and how work on R/3 and connect with BW (7.0)
      At this time I konw how work 2LIS_02_ITM and how this extractor was configured and activated on R/3 Systems and their connection with BW.
      Please, If you know some documentation or you can send me one tutorial about this extractors, just let me know
      I will assign points to you.
      Thank you.
    Alice

    Alice,
    My working week is Sunday -Thursday.
    It is true that you can see if an extractor is delta-enabled by looking at the table value.  It is also apparent in the info-package.
    However, the value in the table is not the reason why the extractor is delta-enabled.  There is an extractor program which handles the extraction, which, for these extractors have been programmed for full extraction.
    Delta extraction in principle is programmed based on certain fields like a change/creation date.  Sometimes the business process i.e 'management of the travel scenario' may not be readily amenable to programming data-extraction.
    In short, you do not have a quick solution for making these extractors delta-enabled.
    There has been some discussion in this forum on the concept of pseudo-delta - take a look at the link below:
    Pseudo delta
    BR/
    Mathew.
    Edited by: Mathew Muthalaly on Apr 6, 2008 6:59 AM

  • Information about PFAL txn fields

    Hi Forum,
    When we replicate data from R/3 to SRM using transcation PFAL we fill different values.
    Please help me to understand what these fields are ?
    Plan version --> 01
    What is plan version and what is the meaning of selection 01 (Current Plan) for this field?
    Similarly ..
    Reporting period u2013all
    Evaluation path 
    Status vector 1
    Transfer mode u2013 UPDATE
    Receiv . Partner No
    Message type -HRMD_ABA
    I hope this is the right forum.
    Regards,
    Anubhav

    Hello Anubhav,
    Status Vector
    Contains a list of one or more statuses (1 active, 2 planned, and so on) for Relationship infotypes. The list allows you to determine which objects are documented in inquiries/reports, by selecting objects according to the status of their Relationship infotypes.
    Plan Version:Contains an abbreviation or code that represents a specific plan version. A plan version is a designated area where you deposit or store sets of information.
    Object Type
    Contains a code that represents different types of objects.like US,O,P..etc
    Object ID:Contains a unique eight-digit number that represents a specific object, such as an organizational unit, a qualification, a business event, and so on for the Object type
    Evaluation Path
    Contains a code representing an evaluation path. An evaluation path allows you to focus inquiries/reports on objects which are affected by certain relationships.
    Example
    You may want to focus on objects involved in the relationship construct: organizational unit (O) > position (S) > person (P)
    Transfer Mode
    The transfer mode determines how data on the objects (= plan version/ object type/object ID) is imported in the target system.
    Insert (complete transfer: delete in target system, recreate)
    The data records for all of the object types and infotypes from the distribution model are transferred in full to the target system. If one of these objects already exists in the target system, it is replaced in full. In other words, it is deleted in full, and then recreated using the distributed data records.
    You can specify a reporting period (data selection period). The system distributes all of the data records that are valid for at least one day between the 'start date' and 'end date'.
    When a complete transfer is executed for the first time, the insert mode must be used.
    Update (for changes: infotype/subtype records in reporting period)
    You can specify a reporting period (data selection period). The data records of the infotype/subtype to be entered are distributed if they are completely within this period.
    In the target system, data records of the specified infotype/ subtype are first deleted if they are completely within this period (relationships are only deleted if they were created earlier by distribution). The distributed data records are then recreated.
    The change pointers are despatched in update mode.
    Partner Number of Receiver
    This field contains the partner number of the receiver. The partner number and partner type uniquely identify the receiver.
    Infotype
    Contains a code that represents a specific infotype. Infotypes allow you to describe object attributes.
    Arshad

  • Suggest to have more information about ducument line close status

    Version: (Please provide the current version)
    2005B Patch 35
    Description of requirements: (Please provide a detailed description)
         We cant tell line status as close but which type in default report or query report
         Mainly there are 3 type :
         1-1 close by hand because typing mistake
         1-2 close by hand because customer modification
         1-3 close by target document normally
         1-4 close by target document normally but return
    Business needs: (Please describe the impact on your business, if the functionality is not realized)
         For this reason, when boss ask for sales report from sales order, we can't tell the number
         and have some fake number and when the line is closed, we can't ask customer put more reason.
    Valid as of: (Date that this legal requirement is applicable)
         Not a legal issue.
    Examples: (Please describe a typical example, how the functionality should work.)
         For Exemple - Sales Order Report or Query Report for Sales Order
         When the line status is close in Sales Order, there are a few situation.
         (1) Maybe after partial delivery, the customer cancle left quantity.
         (2) Maybe after partial delivery, the sales find out there is typing wrong so that close the line
             for a new line to modify.
         (3) Maybe after partial delivery, the price getting lower for left quantity or ship date modified for left quantity.
         (4) More there maybe some data wrong after partial delivery and need to return to redo again.
    Current Workaround: (Please describe the workarounds you are using at the moment)
         Close whole document and put status reason with user define field on header.
         Rewrite sales report by query and use the user define as flag.
    Proposed solution: (Please suggest how the new functionality should work)
         More line close status information support and sales/purchase report support.

    Version: (Please provide the current version)
    2005B Patch 35
    Description of requirements: (Please provide a detailed description)
         We cant tell line status as close but which type in default report or query report
         Mainly there are 3 type :
         1-1 close by hand because typing mistake
         1-2 close by hand because customer modification
         1-3 close by target document normally
         1-4 close by target document normally but return
    Business needs: (Please describe the impact on your business, if the functionality is not realized)
         For this reason, when boss ask for sales report from sales order, we can't tell the number
         and have some fake number and when the line is closed, we can't ask customer put more reason.
    Valid as of: (Date that this legal requirement is applicable)
         Not a legal issue.
    Examples: (Please describe a typical example, how the functionality should work.)
         For Exemple - Sales Order Report or Query Report for Sales Order
         When the line status is close in Sales Order, there are a few situation.
         (1) Maybe after partial delivery, the customer cancle left quantity.
         (2) Maybe after partial delivery, the sales find out there is typing wrong so that close the line
             for a new line to modify.
         (3) Maybe after partial delivery, the price getting lower for left quantity or ship date modified for left quantity.
         (4) More there maybe some data wrong after partial delivery and need to return to redo again.
    Current Workaround: (Please describe the workarounds you are using at the moment)
         Close whole document and put status reason with user define field on header.
         Rewrite sales report by query and use the user define as flag.
    Proposed solution: (Please suggest how the new functionality should work)
         More line close status information support and sales/purchase report support.

  • Need information about some address fields in DQXI

    I need to know the 'Content type' and 'Data field name' of the following address fields in DQXi
    1. Alley
    2.Building
    3. HouseNumber
    4. HouseSuffix
    5. Lane
    6. Pob Line
    7. PobNumber
    8. Section
    9. PrefectureCode
    10.DistrictName
    11.Mail Stop
    12. Postal Code Extension
    13. GeoCode
    14. Outside CityFlag

    Hello Smitha,
    Please post this query to the [Enterprise Information Management|Data Services and Data Quality; forum.
    That forum is dedicated to topics related to Data Integration (Data Integrator, Data Federator, Text Analysis for Data Integrator, Rapid Marts, and Integration for PeopleSoft Enterprise), Data Quality (Data Quality and Data Insight), and Metadata Management.
    The forum is monitored by qualified technicians and you will get a faster response there.
    Also, all BO Enterprise Information Management queries remain in one place and thus can be easily searched in one place.
    Thanks a lot,
    Falk

  • Can I extract information about objects created in Designer

    Is there a way to extract a list of all the objects used within a form from designer. I know that the objects can be viewed in the Hierarchy palette but I would like to have this information extracted. Is this possible?

    I have made a form with email submit button in Adobe Designer8. Then I opened this form into Acrobat8 and distributed it. Every thing goes fine.
    The problem arises when recipients are also able to distribute my form if they open it in Acrobat8. Its very frustrating. If some one can tell mew how I can put restriction over this option to the recipients.
    I will be thankful for this help.

  • Extracting information about image compression settings

    Hi,
    Is there any way using SDK 7.0.5 to determine what image compression settings were used for color, grayscale, and monochrome images for a given pdf file?
    Also, is it possible to determine which color management policy was used?
    Thanks,
    Daniel Sheiner

    Okay, Distiller was a bad example. The same range of settings are available for exporting PDFs from Word. What if someone converted a Word document to a PDF?
    Supposing some or most of the information I'm looking for is lost, how much can I infer? For example, if a given PDEImage has the PDEColorSpace DeviceRGB or CalRGB, that at least tells me the user didn't select Color Management Policies -> Convert All Colors to CMYK. It would actually be tremendously helpful if you could tell me which other ColorSpace names, if any, would indicate the user didn't select Convert All Colors to CMYK.
    It would also be quite helpful to know if there is some indirect way to determine that a given color or grayscale PDEImage has not been compressed using ZIP, or that a monochrome PDEImage has not been compressed using CCITT Group 4.
    Thanks!

  • Generell Information about Netweaver,BW,SEM,etc.

    Hello
    I read at the moment a book but I'm very confused.
    Then I don't know about SAP environment.
    I look for a overview. There will explain the Netweaver to BW and SEM and Business Analytics.
    (e.g. firstly ist Netweaver install and BW is a part from Netweaver. then comes SEM??? and then Business Analytics??? or there are the same???)
    I'm confused sorry
    Can anybody tell we, where I can find a picture or tell me which is there related. But not details please. Only for unterstanding.

    Hello Oesi,
    NetWevaer is having different components and BI/BW is among one of them. When you talk in terms of BW/SEM/Analytics, i will give u suggestion to read the below linked weblog, it will help you lot.
    /people/kamaljeet.kharbanda/blog/2005/08/26/sap-analytics-applications
    Regards,
    Kamaljeet Singh

  • Information about document condition field in creation

    after creating the sales order the field document condition(knumv) is populated with numeric values . from where this value comes

    Hi,
    I think it comes from the number range object KONV. ...Tcode...TST5
    Thanks
    Naren

  • HT1296 I have a question about transferring files, photos etc, from my ipad air to my PC Windows XP and visa versa

    I want to know how to transfer files between my ipad air and my pc Windows XP.  I know I can't use icloud because my windows is XP, but there must be some easy way, I hope to transfer files back and forth.  If you know of a way or an app, is it user friendly?

    The way that you can transfer apps can vary depending upon the apps themselves. Many apps use iOS file sharing with iTunes. Read about file sharing here.
    iOS: About File Sharing - Support - Apple
    Some people use DropBox to move photos, videos and other files back and forth.
    Dropbox - Dropbox for iPad
    There are apps that will move files back and forth using USB and WiFi.
    File Manager - Folder Plus on the App Store on iTunes
    And of course  files can be transferred via email.

  • Created PDF from MS Access - lines are missing from PDF

    I'm using Acrobat 8 (from Adobe CS3) to create PDF files.  When I "print" reports from Microsoft Access 2003, the thin line borders that outline cells are often missing from the PDF. Typically, 1 or 2 of the sides are missing, with the other sides appearing normally.
    This appears to happen primarily when the line thickness is set to "hairline".
    If I use a non-Adobe PDF writer (PDF Creator), then the lines appear normally.
    Any suggestions about what I can do to correct this?
    Thanks - Jim

    These are the settings.

  • SharePoint 2013 Document Conversion from PDF to Word (docx) format

    Hello all,
    I know that SharePoint has the facility to convert documents from Word to PDF using the Word Conversion Service, and I've worked with it in C# using the object model.
    We have a client request for the opposite process. They have an email-enabled document library that receives PDF documents...they would like these converted to Word (.docx).
    I'm believe that SharePoint 2013 Word Automation Service has no way to do this (although I could be wrong); and I'm not sure of the object model either. However, in Office Web Apps you can open a PDF file in the browser, and the form has a button labeled
    "Edit in Word". When selected, it shows a message: "Word Web App will make a copy of your PDF and convert it to a Word document for you to edit."
    Is there a way to access the SharePoint Office Web Apps object model/API in code, to do the same thing that the "Edit in Word" button does? It would be great to be able to hook into those methods, if possible.
    Thanks for your help.
    Kind Regards,
    Kevin Worthington, Senior Consultant
    Sogeti USA

    Hi,
    According to your post, you want to convert the pdf file to word format.
    For converting the pdf file to word format, a workaround is that you can read the content of PDF file out and then construct a Word document using C#.
    Here are some detailed articles about how to read content from pdf file using C#
    http://jadn.co.uk/w/ReadPdfUsingCsharp.htm
    http://social.msdn.microsoft.com/Forums/en-US/4a9fb479-b48e-4366-ad39-02b2dac674f5/read-pdf-content-into-text-file-using-cnet
    http://www.codeproject.com/Tips/387327/Convert-PDF-file-content-into-string-using-Csharp
    Here are some detailed articles about how to create docx file using C#:
    http://www.codeproject.com/Articles/660478/Csharp-Create-and-Manipulate-Word-Documents-Progra
    Best regards

  • How to delete line-WILL YOU HELP IMPROVE MOZILLA FIREFOX BY SENDING INFORMATION ABOUT ETC.

    The following line appears every time I open up a Firefox website and I want to get rid of it:
    "Will you help improve Mozilla Firefox by sending information about performance, hardware, usage and customization to Monzilla"
    When I answer YES I WANT TO HELP or NO, the line still keeps appearing when i open up another web page.
    Also, how do I stop the message "CHECKING COMPATABILITY OF ADD-ONS" when I open up a web page.
    Hoping you can assist-Thanking you
    Dick Hunt

    First you need to update to Firefox 17.0.1, as the version you are running is out of date.
    Then, follow cor-el's instructions above, and if that doesn't resolve the issue, please read [[How to fix preferences that won't save]]

Maybe you are looking for

  • Dual monitor setup minus FCP GUI

    Just purchased a Macbook Pro for editing, and would like to know if there is a way to set up a second monitor that would give me the timeline video minus the GUI of the Canvas.  Just a clean picture without the editing interface. Many thanks in advan

  • How to create an event marker with durations of upto 10mins

    Hi, I want to record the time of an event, occuring on the acquired data or a user generated keyboard event, since the start of a recording sessions. A typical session of record will take 10-15 mins. The timer need to start as soon as the recording i

  • Need Urgent Help... Essbase service is stopping automatically

    Hello All, In our production box essbase service is stopping automatically. Can you please help me to resolve this issue... OS : Windows Server 2003 Hyperion : 11.1.1.3 SharedServices_Security_Client.log 2013-12-09 01:17:25,609 INFO  [main] CSS is in

  • Exchange (Outlook) meeting acceptances/declines don't sync

    I have an iPhone4 and I sync to my office Outlook OTA. When I receive a calendar invite on my desktop and then accept it, the iPhone does not know that the meeting was accepted. I have to accept it a 2nd time on the iPhone. The same issue works the s

  • Safari 5.0.1 Crashing

    Hi, Safari keeps crashing. Hope you can help. Here's my crash log. Thanks so much! Process: Safari [763] Path: /Applications/Safari.app/Contents/MacOS/Safari Identifier: com.apple.Safari Version: 5.0.1 (6533.17.8) Build Info: WebBrowser-75331708~1 Co