Extracting MS Word properties automatically

Hi all,
I'm looking for an easy way to automatically update a webpage table that simply lists the files in a directory. The catch is that I would like the table to be formatted into columns (i.e. Title, Subject, Author, Description, last modified date).
Since I am working predominantly with MS Word files, I was thinking of a script that could automatically extract these fields from the Properties associated with each Word document in the directory. I've asked a couple of people, and they've said I need to use something like ColdFusion to make it dynamic.
Does anyone know if such a script exists out there?
thanks

Hello,
It is possible if the document contains only alphanumeric characters. But if you use other objects such as tables or text fields in the MS Word document, then it is not possible (I think).
For this you can use a properties file. With a properties file you can use the setProperty() and getProperty() methods to store and retrieve the values.
For example:
If you maintain the MS Word doc file in the following manner:
Title = Java Box
Subject = Exploring Java
Author = KKKG
Description = It's all about your parents
last modified date = 16-09-2002
then you can get the required result.
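
The properties-file idea above could be sketched roughly as follows. This is only a minimal illustration, assuming the fields are kept as plain key = value text (not Word's built-in document properties stored inside a binary .doc file). The class name DocProperties and the key LastModified are hypothetical; the key is written without spaces because java.util.Properties ends a key at the first unescaped whitespace, so a key like "last modified date" would need its spaces escaped.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Hypothetical helper: parses "key = value" text with java.util.Properties.
class DocProperties {

    // Loads key = value lines into a Properties object.
    static Properties parse(String text) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(text));
        } catch (IOException e) {
            // A StringReader should not fail, but load() declares IOException.
            throw new UncheckedIOException(e);
        }
        return props;
    }

    public static void main(String[] args) {
        // Sample content in the layout suggested in the reply above.
        String text = String.join("\n",
                "Title = Java Box",
                "Subject = Exploring Java",
                "Author = KKKG",
                "LastModified = 16-09-2002"); // assumed key name without spaces

        Properties props = parse(text);
        System.out.println("Title:   " + props.getProperty("Title"));
        System.out.println("Subject: " + props.getProperty("Subject"));
        System.out.println("Author:  " + props.getProperty("Author"));
        System.out.println("Date:    " + props.getProperty("LastModified"));
    }
}
```

In a real setup the text would come from a file next to each document (for example via a FileReader) rather than from a string literal.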

Similar Messages

  • Microsoft Word launches automatically at startup

    This is an unusual situation. Most people can't get MS Word to launch or it crashes.
    I have the opposite. When I log into my MacBook Pro, Microsoft Word for Office 2004 launches automatically. This has just started about a month ago and I've had Office 2004 since 2005.
    1. There is no file in "Startup" that would start MS Word
    2. I've deleted the MS Word preferences file several times to combat this, and that hasn't worked
    3. I've removed all temp files from the Microsoft folder on my username/home --> Documents --> Microsoft folder
    And, I get the following in Console when I log in (an example from July 18th):
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: In
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: HrO
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: amA
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: ppl
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: ic
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: atio
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: nGet
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: Com
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: m
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: a
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: ndB
    Jul 18 13:05:23 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: ars
    Jul 18 13:20:13 DavidNovick06-2 [0x0-0x11d11d].com.microsoft.Word[4550]: Ignoring Quickdraw drawing between QDBeginCGContext and QDEndCGContext
    How do I stop MS Word from automatically launching?

    Check System Preferences>Accounts>Login Items. If you don't get it solved here check out the MS forums: http://www.officeformac.com/ProductForums/Word/

  • How do I have Adobe X Pro encrypt Word files automatically when created?

    How do I have Adobe X Pro encrypt Word files automatically when they are created?

    Hi sbuckton,
    Sure, you can encrypt your document while converting it. However, you will have to use Microsoft Office for that.
    Please refer to the above screenshot and follow these steps:
    1) Open your Word file in Microsoft Word.
    2) Click the 'Acrobat' tab at the top.
    3) Choose the Create PDF option below it.
    4) A pop-up window will appear asking you to save the file as PDF; before saving the file, click the 'Restrict Editing' option.
    5) Another pop-up window will appear, where you can choose the encryption method and password protection options to secure the document.
    6) Once done, save the file.
    If you still experience any issues, please let us know.
    Regards,
    Aadesh

  • I don't want Microsoft Word to automatically launch when I restart/reboot

    Hi All,
    Microsoft Word application automatically launches when I restart/reboot the iMac. How can I fix this?
    Smells like a virus...

    Hi ctconnor;
    Since there are no viruses for OS X, I seriously doubt that is what is causing your problem here.
    My suggestion: open System Preferences, then Accounts, followed by Login Items. Once there, deselect Microsoft Word.
    Allan

  • Whenever I want to open a template in Microsoft Word it automatically opens in Pages.  how do I get this to stop?

    Whenever I open a template document in Microsoft Word, it automatically opens in Pages. This has happened since I downloaded Mavericks. How do I get this to stop and allow me to use Microsoft Word templates in Microsoft Word?

    Right click on a template and choose Get Info. Find the "Open with" section, and change it to Word. Click Change All if you want all templates to be opened in Word by default.

  • Extracting two words from the article in english

    I have an English article which is to be classified into a particular category based on its keywords. There are lakhs of keywords stored in a database. What I have to do is obtain the keywords from the article and match them against the database; if a match is found, then the article belongs to that particular category. I did this keyword matching for single words by using split(" "), but now I want to do it for two words from the article, that is, get a pair of words that is repeated many times in the article and then search for it in the DB (here the two words are considered one keyword).
    What should I do to get appropriate two-word keywords from the article without taking a, am, the, is, when, etc. (leaving out the many generic words)?
    Any help will be appreciated.

    hi,
    Thanks for the reply!
    I know it's a bad algorithm to classify an article written in English based only on a few words appearing in the article.
    But what I want to do is first extract the words from the article, leaving out the generic words, then count each single word. Then I sort the words by count and take the five words with the highest count. I have a database where millions of keywords are stored. These keywords each refer to a particular category,
    i.e. if we consider sports as a category, then under this category I have many keywords stored in the database, like cricket, football, worldcup, tennis, etc.
    If I pick an appropriate word from the article, it is considered a keyword and is searched for in the database. If a match is found, it means the article belongs to the sports category.
    The problem is that sometimes an article can contain two words which can be considered one keyword and can be used to classify the article in a much better way.
    The question is how to get such word pairs from the article.
    For example, if "Hero's Journey" is a combined term appearing many times in the article, then this keyword can be used to classify the article much better than a single word.
    Can anybody help me in this regard?
    Any help will be appreciated.
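
One possible way to get such two-word keywords is to count adjacent word pairs (bigrams) while skipping generic words. The sketch below is only an illustration under stated assumptions: the class name BigramCounter, the tiny stop-word list, and the sample text are all hypothetical, and a stop word is treated as breaking a pair (so "journey of the hero" does not produce "journey hero"). The most frequent pairs could then be looked up in the keyword database just like single words.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: counts adjacent word pairs, ignoring generic words.
class BigramCounter {

    // Assumed, deliberately small stop-word list; a real one would be much longer.
    static final Set<String> STOP_WORDS = new HashSet<>(Arrays.asList(
            "a", "am", "an", "and", "the", "is", "in", "of", "to", "when"));

    // Returns how often each pair of adjacent non-stop words occurs in the text.
    static Map<String, Integer> countBigrams(String text) {
        String[] tokens = text.toLowerCase().split("[^a-z0-9']+");
        Map<String, Integer> counts = new HashMap<>();
        String previous = null;
        for (String token : tokens) {
            if (token.isEmpty()) {
                continue; // split() can yield an empty leading token
            }
            if (STOP_WORDS.contains(token)) {
                previous = null; // a stop word breaks the pair
                continue;
            }
            if (previous != null) {
                counts.merge(previous + " " + token, 1, Integer::sum);
            }
            previous = token;
        }
        return counts;
    }

    public static void main(String[] args) {
        String article = "The Hero's Journey is old. A hero's journey is when "
                + "the hero's journey begins.";
        // Print the five most frequent pairs, highest count first.
        countBigrams(article).entrySet().stream()
                .sorted(Map.Entry.<String, Integer>comparingByValue().reversed())
                .limit(5)
                .forEach(e -> System.out.println(e.getKey() + " -> " + e.getValue()));
    }
}
```

Lower-casing everything keeps "Hero's Journey" and "hero's journey" in the same bucket; whether that is desirable depends on how the keywords are stored in the database.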

  • How to use GREP to find one particular word with automatic hyphen?

    I know I have in my text a word with automatic hyphen on the end of line, say Wo-rd.
    How can I find it?
    Thanks.

    I think you might do better to edit the hyphenation in the Dictionary.
    One reason words hyphenate incorrectly is that the wrong language has been applied. Both spell checking and hyphenation rules are governed by the language selected for the text, and language is a character-level attribute, so you can select individual words and assign a different language to them if you like.
    If that's the problem here, you can look for the word using find/change (without worrying about the hyphen) and change the language for all instances.
    In either case you will be far better off fixing the problem on a global scale in the document than you will be finding a particular instance of that word at the end of a line. Any editing you do in the future may cause the problem to reappear somewhere else.

  • Extract a word using Regex.

    Hi
    I am trying to extract a word after the "=".
    Example :
    String str = " Book = Java ";
    Output : Java
    String str2 = "Book = 235Java ";
    Output : 235Java
    I am trying the code
    String criteria = "Book = Java     ";
    regex = "([a-z]+=)(\\w+)";               // a-z is group 1
    pattern = Pattern.compile(regex);
    matcher = pattern.matcher(criteria);
    System.out.println(matcher.group(2));   // Get the word after =
    Any suggestions please? Help appreciated

    Try below code
            String criteria = "     Book     =          Java         ";
            String regexSecondWord = "(=)(\\s*)(\\w+)";
            Pattern patternSecondWord = Pattern.compile(regexSecondWord);
            Matcher matcherSecondWord = patternSecondWord.matcher(criteria);
            if (matcherSecondWord.find()) {           
                System.out.println(matcherSecondWord.group(2) + matcherSecondWord.group(3));
            } else {
                System.out.println("nomatch");
            }
    If you don't need the leading space in " Java ", use matcherSecondWord.group(3) alone.
    Thanks,
    Sabareesh.

  • Microsoft word opens automatically when I start my computer. How do I turn it off so it will not start up when I turn on my computer?

    Microsoft word opens automatically when I start my computer. How do I turn it off so it will not start up when I turn on my computer?
    janer

    Go to System Preferences>Accounts, select your account, then open the startup items, remove Word from the list, and reboot. You should be fine then.

  • How do I prevent word from automatically starting when I turn on my macbook pro?

    I opened up Microsoft Word on my MacBook Pro one time, and now it starts automatically every time I turn on my computer. How do I prevent this from happening?

    CPalluth wrote:
    That's unchecked, as well...
    I had a similar issue last week from a customer.
    It seems that MS Office fiddles a lot with the preferences and plists during installation.
    Performing a permission repair (Disk Utility) does the trick in that case.
    Lupunus

  • How to show Word properties in PDF doc

    I'd like to be able to see the Title from a Word document also show as a Title of a converted PDF file in a separate column when using the detail view of explorer in Vista.  I changed conversion settings to include document properties, but why won't they show up in the converted PDF file?

    Currently, Windows Explorer does not support what you want (for PDF).
    The available alternative: you can display the PDF file's Properties in Windows Explorer and then click the PDF tab.
    If there is a value for the metadata "Title", it will be displayed.
    Be well...

  • EF6: How can I get navigation properties automatically updated?

    I've defined the following entities:
    public abstract class EntityBase {
        public int Id { get; set; }
        public string Name { get; set; }
    }
    public class Person : EntityBase {
        public virtual HashSet<Message> Messages { get; set; }
        public int CountryId { get; set; }
        public virtual Country Country { get; set; }
    }
    public class Message : EntityBase {
        public string Text { get; set; }
        public int PersonId { get; set; }
        public Person Person { get; set; }
    }
    public class Country : EntityBase { }
    internal class TestContext : DbContext {
        public DbSet<Person> Persons { get; set; }
        public DbSet<Country> Countries { get; set; }
    }
    Here's my problem: While the Messages.Person navigation property gets automatically updated according to the Messages.PersonId foreign key property value when my changes get saved to the database, the Person.Country navigation property remains null although the Person.CountryId foreign key property is != 0.
    Why is this happening? Why does the Messages.Person navigation property get aligned with the Messages.PersonId foreign key property automatically while the Person.Country navigation property does not get aligned with the Person.CountryId foreign key property?
    These are the entity values before SaveChanges() is called:
    person.CountryId = 1
    person.Country = null
    message.PersonId = 1
    message.Person = null
    And these are the entity values after SaveChanges() is called:
    person.CountryId = 1
    person.Country = null
    message.PersonId = 1
    message.Person = { Id = 1, Name = "Test" }
    Your help is appreciated.
    Still people out there alive using the keyboard?
    Working with SQL Server/Office/Windows and their poor keyboard support they seem extinct...

    Thank you for replying, Fred.
    Here's the unit test code I'm using (it calls into a repository which more or less simply calls DbSet<Entity>::Add() and SaveChanges()).
    [TestMethod]
    public void MessageDbTest()
    {
        using (Repository<Person> rp = new Repository<Person>(true))
        {
            Person p;
            Message m;
            BusinessLayer.TestContext.CreateNewDB();
            rp.AddOrUpdate(p = new Person("Hello", 1));
            Assert.AreEqual<int>(1, p.Id);
            Assert.AreEqual<string>("Hello", p.Name);
            m = new Message("Hello, too", p.Id, "This is a long test message.");
            p.Messages.Add(m);
            rp.AddOrUpdate(p);
            Assert.AreSame(m, p.Messages.Single());
            Assert.IsNotNull(m.Person);
            Assert.AreEqual<int>(1, m.Id);
            Assert.AreEqual<string>("Hello, too", m.Name);
            Assert.AreEqual<string>("This is a long test message.", m.Text);
            p = rp.GetItem(1);
            p.Messages.Single().Name = "This works, too";
            rp.AddOrUpdate(p);
        }
    }
    [TestMethod]
    public void CountryDbTest()
    {
        using (Repository<Person> rp = new Repository<Person>(true))
        {
            Person p;
            Message m;
            BusinessLayer.TestContext.CreateNewDB();
            rp.AddOrUpdate(p = new Person("Hello", 1));
            Assert.AreEqual<int>(1, p.Id);
            Assert.AreEqual<string>("Hello", p.Name);
            Assert.AreEqual<int>(1, p.CountryId);
            Assert.IsNotNull(p.Country); // ** fails ! ***
        }
    }
    Following is a number of screenshots, depicting the entities' properties.
    Please notice the highlighted navigation property values in the Watch window on the right.
    You will notice that while the Messages.Person navigation property gets aligned with the Messages.PersonId foreign key property, the Person.Country navigation property does not get aligned with the Person.CountryId foreign key property when TestContext::SaveChanges() is called.

  • Read MS Word properties

    Hi every body,
    I need to read the properties of any MS Word document: author, number of pages, number of words, etc. I really don't have any idea where to begin.
    Can somebody help me by giving me a hint?
    Awaiting your response.
    Thanks

    Yes, it's quite complicated but doable using http://danadler.com/jacob. Search for examples in the JACOB mailing list hosted at Yahoo Groups. Here is a snippet:
    /*
     * For MS Word Built-in Properties, see:
     * http://msdn.microsoft.com/library/default.asp?url=/library/en-us/vbaof10/html/ofobjDocumentProperty.asp
     */
    // Getting the doc's properties (aDoc is the already opened Word document)
    Dispatch builtInProperties = Dispatch.get(aDoc, "BuiltInDocumentProperties").toDispatch();
    // Get Title attribute (#1)
    Dispatch aTitle = Dispatch.invoke(builtInProperties, "Item", Dispatch.Get, new Object[] {new Integer(1)}, new int[1]).toDispatch();
    String strTitle = Dispatch.get(aTitle, "Value").toString();
    // Overwrite the title with a new value
    String newTitle = "New Title";
    Dispatch.put(aTitle, "Value", newTitle);
    // Get Subject attribute (#2)
    Dispatch aSubject = Dispatch.invoke(builtInProperties, "Item", Dispatch.Get, new Object[] {new Integer(2)}, new int[2]).toDispatch();
    System.out.println("Subject: " + Dispatch.get(aSubject, "Value"));

  • Extract highlighted words from a pdf (Acrobat SDK, OLE)

    Hello Acrobat gurus ! :-)
    I'm new to the SDK, so please excuse any "stupid" question i might have.
    Here is what i want to do:
    I want to search for a group of words in a PDF document. According to the SDK documentation, once I search for a text using AcroExch.AVDoc.FindText(), the function "Finds the specified text, scrolls so that it is visible, and highlights it."
    I was assuming that after calling this function with my string, once the string is found, I would have access to the coordinates of the rectangle containing the highlighted group of words (I presumed that those words would automatically be contained in an object of type AcroExch.HiliteList) and to the coordinates of those words. But I'm not able to do so; I cannot find any function(s) that give me that kind of access.
    So the question is:
    Is it possible to access the coordinates of the rectangle/words that are highlighted in a PDF after calling the FindText() function? Can someone help me get on the right track?
    Thanks

    OK, let me give you a more elaborate example; maybe I'm not asking the right question.
    Let's say I have a PDF containing the following text on the first page:
    --- arbitrary number of ":"
    Mother's Name: Joanna
    Father's Name: Josh
    other text
    If I call the function like this: FindText("Mother's Name:"), Acrobat is going to find the first occurrence of my string. What I want is to be able to get the coordinates of this WHOLE string OR the coordinates of the last character in the string (in this case ":").
    The problem is that if I go for the coordinates of the colon, I cannot just look for it in the PDF, because I may have an unknown number of colons (":") before the one I'm interested in. The logical solution in this case would be to get the coordinates of the entire string ("Mother's Name:" in this case) and then get the coordinates of the colon I'm interested in.
    Would that be possible ?

  • How to extract unique words in all files

    Hi,
    I am trying to extract all the UNIQUE words from all the files in a directory. My code gives me all the words, but along with them it also returns all the single letters (including repetitions of the same letter), and the output is not unique. I don't want the single letters, only the unique words in the files. [For example, if I encounter file x with the words "cat mat sat" and then file y with the words "cat mat bat pat", my output should be "cat mat sat bat pat".]
    Can you please let me know what to use to get only the unique words, or how I can modify my code to get the desired output?
    //String input = "Input text, with words, punctuation, etc. Well, it's rather short.";
                   Pattern p = Pattern.compile("[\\w']+",Pattern.MULTILINE);
                   FileInputStream fis = new FileInputStream(file);
                 FileChannel fc = fis.getChannel();
                 ByteBuffer bb = fc.map(FileChannel.MapMode.READ_ONLY, 0, (int)fc.size());
                 Charset cs = Charset.forName("8859_1");
                 CharsetDecoder cd = cs.newDecoder();
                 CharBuffer cb = cd.decode(bb);
                 // Run some matches
                 Matcher m = p.matcher(cb);
                   while ( m.find() ) {
                       //System.out.println(cb.substring(m.start(), m.end()));
                       System.out.println(m.group());
                   }
    Thanks

    Can you tell me how to add the contents of a file into a set?
       import java.io.*;
    import java.util.*;
    public class RmDup{
         public static void main(String args[]){
                   FileReader fr = new FileReader("Dups.txt");
                   BufferedReader br = new BufferedReader(fr);
                   String s1[]=br.readLine();
                   String s2;
                   while ((s2=br.readLine())!=null){
                    HashSet ref = new HashSet( s1 ); // create a HashSet
                    Iterator i = ref.iterator(); // get iterator
                    System.out.println( "\nNonduplicates are: " );
                    while ( i.hasNext() )
                         System.out.print( i.next() + " " );
                    System.out.println();
    }
    I am trying to use this, but I am getting errors: 1: incompatible types and 2: cannot find symbol constructor HashSet(java.lang.String[]).
    Thanks
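
A rough sketch of how the unique-word collection could look follows. It rests on a few assumptions: the class name UniqueWords and the sample strings are hypothetical, single-letter tokens are dropped as the question asks, and comparison is case-sensitive. The two reported errors most likely come from assigning readLine() (which returns a String) to a String[] and from passing an array to the HashSet constructor, which expects a Collection; adding words one by one, or wrapping an array with Arrays.asList, avoids both. A LinkedHashSet is used instead of a plain HashSet so that the words come out in first-seen order.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical sketch: collects unique words across several inputs.
class UniqueWords {

    // Splits text into words and adds those longer than one character.
    static void addWords(String text, Set<String> unique) {
        for (String word : text.split("[^\\w']+")) {
            if (word.length() > 1) { // skips empty tokens and single letters
                unique.add(word);    // a Set silently ignores duplicates
            }
        }
    }

    // Adds all words of one file; ISO-8859-1 mirrors the "8859_1" charset in the question.
    static void addFile(Path file, Set<String> unique) throws IOException {
        for (String line : Files.readAllLines(file, StandardCharsets.ISO_8859_1)) {
            addWords(line, unique);
        }
    }

    public static void main(String[] args) {
        Set<String> unique = new LinkedHashSet<>();
        addWords("cat mat sat", unique);     // stands in for file x
        addWords("cat mat bat pat", unique); // stands in for file y
        System.out.println(String.join(" ", unique)); // prints "cat mat sat bat pat"
    }
}
```

For a whole directory, the same set would simply be passed to addFile once per file.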
