Crawler stores metadata or the real content?

Hi,
We were asked this question in a portal demo session and we don't have a definite answer, since we are pretty new to the crawler/search services in ALI.
Do ALI crawlers actually copy the content from the original content repository (e.g. Microsoft Visual SourceSafe) into the portal space, or do they just store a pointer to the real content? E.g. if I need to crawl 30G of files, do I need ~30G on the portal server to store them?
Based on the ALI documents, we can see that Search Service allows real-time access to documents, so I guess Search Service just goes to the original content repository to fetch data in real time.
Any help clarifying this concept is highly appreciated.
Thanks
-Jimmy

ALUI Crawlers walk through content repositories (such as NTFS, Notes, Exchange and the Web, not Visual Source Safe AFAIK) and they store document metadata in the portal's DB. Those metadata can include (but certainly are not limited to) document title, author, creation/modified date, keywords, document URL, etc. That requires a minimal amount of space.
Documents are also full-text indexed into Ripfire, which is ALUI's powerful search engine. I'm sure there is a metric for how much space that takes up. (I think I remember reading somewhere that once all the stop words are removed and the document is indexed, its size will be roughly equal to 30% of the original document size. I'm not sure about that figure, so someone please do correct me if I'm mistaken.)
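As a back-of-the-envelope check on the 30G example from the original question: if the unconfirmed ~30% figure above holds, the full-text index would need on the order of 9G rather than the full 30G. Here is a minimal Python sketch of that arithmetic. The function name and the `index_ratio` default are assumptions for illustration only, not ALUI settings or a documented metric.

```python
def estimate_index_size_gb(source_gb, index_ratio=0.30):
    """Rough full-text index size for a crawled repository.

    index_ratio is the unconfirmed ~30% rule of thumb quoted above,
    not a documented ALUI figure. Metadata stored in the portal DB is
    ignored here because it is described as minimal.
    """
    if source_gb < 0:
        raise ValueError("source_gb must be non-negative")
    return source_gb * index_ratio


# The 30G repository from the question, under the assumed ratio.
print(f"Estimated index size: {estimate_index_size_gb(30):.1f} GB")
```

If you can measure the real ratio on a small sample crawl, pass it in as `index_ratio` instead of relying on the default.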
Once something is crawled, you can use ALUI's built-in search capabilities to search its metadata and its full text. However, the search is not performed in realtime because changes to documents and their associated metadata will only be applied as often as the job that refreshes documents (Card Update Agent or whatever it's called now) is run.
Search Web Services are an entirely different animal. They extend (or federate, if you like) ALUI's search into other search engines. That happens in realtime. (However, it can only be as current as the target search repository's index.) There are no SWSs that come out of the box with ALUI, although there is a nice example for searching Google posted on dev2dev somewhere (try the CodeShare).
HTH,
Chris Bucchere | bdg | [email protected] | http://www.bdg-online.com

Similar Messages

  • What kind of database the SSMA uses to store metadata in the file named "source-metabase.mb" ?

    What kind of database does SSMA use to store metadata in the file named "source-metabase.mb"?
    I'm looking for a way to open the file and add a custom migration script (some automation).

    Hi Poman.Pokrovskij,
    When you generate an SSMA Assessment Report, several files are created and saved into the Report folder under your SSMA project folder, including source-metabase.mb, project-container.mappings, preferences.prefs, and so on.
    . MD files are usually saved in plain text format including inline text symbols, defining how a text is formatted such as the indentations, its table formatting, fonts, and headers.
    SSMA provides a project setting that lets you customize database migration. For example, to customize the data migration SQL statement, go to Tools, choose Project Settings, find the Extended Data Migration Options setting, and change its value to Show. You can then select "use custom select" and modify the SQL statement.
    For more information about SSMA for Oracle, you can review the following articles.
    How To Perform Incremental Data Migration Using SSMA:
    http://blogs.msdn.com/b/ssma/archive/2010/10/04/how-to-perform-incremental-data-migration-using-ssma.aspx
    Using SSMA Project Setting to Customize Database Migration:
    http://blogs.msdn.com/b/ssma/archive/2011/03/16/using-ssma-project-setting-to-customize-database-migration.aspx?Redirected=true
    Regards,
    Sofiya Li
    TechNet Community Support

  • Change default view across multiple page libraries using the same content type/types?

    I am the editor for a large SharePoint publishing site with approximately 130+ subsites. Based on our content, I've created two different default views that will come in handy as we move to a distributed authorship model. (I'm thinking specifically about the Quick Edit/Excel-like feature for making quick changes to page/document metadata.)
    The authors/content owners/editors have NO experience with SharePoint, so while the benefit of using views is that they can be tailored to individual needs, I'd like to start them off with something that is VERY easy to change on the fly. It's either that or, as just a power user in the CMS, go to each individual page and document library and change it manually, which could take days, if not longer. Thanks!
    Ryan D Watters

    Essentially, yes.
    That said, I am using SharePoint 2013, and I just now realized that I posted this in the legacy SP forum. You'll have to excuse me -- this is my first time posting:)
    Basically, I have two different "ideal" views based on two different site templates. These two site templates account for over 100 publishing subsites. The view has 18 different metadata fields that will make editing article pages much easier for content owners, because they will be able to go into the page library > ribbon > Library > Quick Edit and edit the metadata as if it were an Excel spreadsheet. This comes in especially handy for things like rollup order between primary, secondary, and tertiary pages within a subsite. Ideally, it would be nice to be able to add that view as an option within the CMS (similar to the way you can add a view style).
    I hope that makes sense. Also, I'll repost the question to SharePoint 2013. Much appreciated!
    Ryan D Watters

  • Error connecting integrator and record store with fetching the metadata

    I am trying to fetch crawled data from record store in integrator and then upload them in a data domain. I am encountering an error regarding the connection between integrator and record store (error while generating metadata), It says "Exception encountered connecting to the record store and writing the metadata with the following message: Transport error: 404 error: Not Found". Please suggest the required action. 

    I found a installer for Quicktime that was not connected to iTunes and installed quicktime again.
    Everything is now working.

  • I can t record my computer on the i tune store. My goal is to download music from my playlist i tune store (virtual) to my real iTunes play list?

    i can t record my computer on the i tune store. My goal is to download music from my playlist i tune store (virtual) to my real iTunes play list?

    When you say they are displaying in your iTunes library, are you talking about the Audiobooks section of your iTunes library and not the audiobooks section indented under the iPod in the left pane of iTunes under Devices?
    Is the iPod Shuffle configured to sync audiobooks or do you manually manage the iPod's contents?
    B-rock

  • Access is denied. Verify that either the Default Content Access Account has access to this repository, or add a crawl rule to crawl this repository. If the repository being crawled is a SharePoint repository, verify that the account you are using has "Ful

    I am trying to resolve this after setting up my new farm. I have 2 WFEs, 1 sppserver, 1 server dedicated to crawl, and 1 for search and index in my farm. I suspect the dedicated crawl server is the root cause of the issue. I also applied the DisableLoopbackCheck setting but am still facing the same issue. Any solution?
    Please Mark it as answer if this reply helps you in resolving the issue,It will help other users facing similar problem

    Hi Aditya,
    Please refer to the links below and try if they help:
    Add the full read rights to Default Content Access Account of Search Administration via the web application’s user policy.
    http://sharepoint.stackexchange.com/questions/88696/access-is-denied-verify-that-either-the-default-content-access-account-has-acce
    Grant the Default Content Access Account permission in User Profile Service Application
    http://www.sysadminsblog.com/microsoft/sharepoint-search-service-access-is-denied/
    Modify your crawl rule
    http://wingleungchan.blogspot.com/2011/11/access-is-denied-when-crawling-despite.html
    Add the crawl server's IP to the local hosts file
    http://wellytonian.com/2012/04/sharepoint-search-crawl-errors-and-fixing-them/
    Regards,
    Rebecca Tu
    TechNet Community Support

  • Best practice to populate metadata of the content based on the folder

    Hi,
    What is the best practice for automatically populating the metadata of content being checked in, based on the folder it arrives in?
    The folder I have may be a contribution folder or a collab project folder.
    But I would like to populate the metadata of the content automatically when the content is dropped into a folder using the desktop integrator.
    Thanks,
    Leo

    Yes Leo, that's correct, all documents inheriting the metadata of the folder and the option to propagate changes to documents and sub-folders is out-of-the-box functionality.
    Just create a folder and set the metadata fields you want, then add some documents via the desktop integration or simply via WebDAV (you can map UCM as a web folder in Windows Explorer without having to install the UCM desktop integration). All the documents should have the folder's metadata by default.
    Give it a try and let me know how you go.
    Regards,
    Juan

  • Reading the contents of a folder and store them in the database using 6i

    Hi all,
    I'm using Developer 6i and Oracle 8i, and I'm building a personnel database. Every employee has many certificates (graduate certificates, post-graduate certificates, and work experience certificates). I want to scan all these certificates, put them in a folder, and give the employee form the path to the folder, so that the form reads the contents of the folder (3 or 4 image files, for example) dynamically and stores them in the database.
    All the examples I came across explain how to load an image file with a known name into an Oracle database, but what if the image file name is not known in advance (i.e. dynamically generated by the scanner)?
    Hope I explained the case.
    Thanks in advance

    Sorry mhdamer,
    I read the example and thought it could be modified to retrieve a directory listing. There is a way to do what you want, but you will have to check whether you have D2KWUTIL.PLL installed with Forms 6i. If you do not have it installed, you can download the PLL from OTN. This Forms library only works on Windows, so if you want this functionality on a non-Windows machine, it will not work. Read D2KWUTIL.html for details on how to use this library. There are also some good posts here in the forums on how to use it. Just search on D2KWUTIL.PLL.
    Craig...

  • HT4059 Will the iBooks content for the Malaysia store be expanded soon?

    For the Malaysia App Store, will the Books content ever be expanded to include mainstream titles? Currently available are mostly classic novels.

    We are just fellow users on here, we won't know until if/when Apple announce something. But before Apple can sell an item in a particular country's store they need to be granted a license from the content provider e.g. the book publisher.
    Do you have access to other ebook apps/stores e.g. Amazon/Kindle, Nook, Kobo ?

  • Final Cut Pro X after installing Mountain Lion I have lost some of the "additional content" which was available by clicking on "Download Additional Content" in the roll down from "Final cut Pro". Now this feature takes you to store update?

    Final Cut Pro X after installing Mountain Lion I have lost some of the "additional content" which was available by clicking on "Download Additional Content" in the roll down from "Final cut Pro". Now this feature takes you to Apple store update? which produces nothing! How can I download these free sound and music files, what has happened? Please help as there is nothing on the apple site to cover this??


  • HT204053 When I try to update apps from ap store the displayed I'd password is the wrong one I've tried all sorts to change it to the real one to no avail any suggestions it just keeps asking for the wrong one

    When I try to update apps the iTunes id comes up wrong, I've tried to change it to the real one to no avail
    Suggestions please

    The only way to get those apps registered to a new Apple ID is to delete them and download them again. If they are paid apps, you have to pay again.

  • Archiving the Expired content in URM

    Hi Everyone,
    We need to archive an expired document, and URM provides this facility out of the box ...
    Although we specify archiving as the disposition rule, I am clueless as to where the expired content gets archived.
    I tried looking in the Content Server archives but couldn't find any.
    Please Help
    Thanks in Advance.

    I think you should get a working copy of URM and play around a little bit with that.
    To answer your question: URM itself (if you use URM-specific processes) does not change the location of content items. What it does is change the content items' metadata. If you want to change the physical location of a content item (fast disk, slow disk, tape, etc.), you have to combine URM's metadata with another product's features. You have two options:
    - if you store items in the database you can use Automatic Storage Management (ASM) feature of the database
    - if you store items in a filesystem, you can use ZFS (acquired from Sun Microsystems; for more details see http://www.oracle.com/technetwork/server-storage/solaris/overview/zfs-jsp-138393.html)

  • Can time capsule store more than the data on my computer?

    Can time capsule store more than the data on my computer?
    I have a lack of data storage space on my macbook hence I would like to know if Time Capsule is an external hard drive and not just a back up device.

    It is a backup device.. in fact it is a backup device for wireless clients in particular hence it is a wireless router with a hard disk inside.. if you use it for storage.. it has one or two big weaknesses..
    It is slow.. cf a real NAS with raid.. a single slow green drive is plenty for backups.. less than adequate for storage.. fine over wireless of course.. lousy if you are trying to pass big files.. like your iphoto library back and forth.
    But the biggie.. IMHO.. no backup.. the TC has no way to back itself up.. and TM cannot include the content of the TC in a backup to another location.. TM cannot backup any network drive.
    So your files will be stored on one bottom of the market, cheapest possible drive available.. with no backup.. !!
    Time Machine and data files may also not so happily co-exist.. the TC cannot be partitioned (well not internally).
    You can create data images... but is this a good idea??
    Read a bit of pondini.
    http://pondini.org/TM/Time_Capsule.html
    Q3

  • The real difference between iPad Mini & iPad Mini Retina

    I was window-shopping on the Apple-direct store's "refurbished & clearance" web site, and noticed that iPad Mini and iPad Mini with Retina Display are now available for resale there.
    If I want to use an iPad Mini for certain functions, such as:
    1: checking e-mail and address books while on-the-road
    2: using an AppleTV and adaptors to wirelessly give presentations via Airplay to business meetings via HDTVs and projector-screen rigs
    3: giving one-on-one image presentations for business/personal discussions using the tablet directly
    4: consulting maps, weather forecasts and various light web-browsing
    5: dabbling in art tablet apps, note-writing, and math (calculator apps)
    6: using the iPad as an entertainment device (playing music, watching movies/TV)
    ... then what is the real difference, functionally, between the iPad Mini and the iPad Mini with Retina Display?

    iPad mini with Retina Display vs. iPad mini
    http://www.gizmag.com/ipad-mini-vs-ipad-mini-2/29541/
    First iPad Mini vs. Retina: No contest
    http://news.cnet.com/8301-13579_3-57613975-37/first-ipad-mini-vs-retina-no-contest/
     Cheers, Tom

  • How do I set up a new Apple ID and iTunes account for my daughter, but let her keep the current content of my iTunes account?

    How do I set up a new Apple ID and iTunes account for my daughter's MacBook, but let her keep the current content of my iTunes account? (We currently share the same Apple ID and iTunes account). Hope someone can help... Thanks

    Discussions on using purchases from multiple AppleIDs in one iTunes library - https://discussions.apple.com/message/19543804
    As I mentioned earlier, the main time when this becomes an issue is if you need to do something involving associating a computer with a particular AppleID.  Careful management of your collection should minimize this situation.
    iTunes Store: Associating a device or computer to your Apple ID - http://support.apple.com/kb/HT4627 -  In connection with, "When you turn on iTunes Match or Automatic Downloads, or when you download past purchases on an iOS device or computer, that device or computer becomes associated with your Apple ID." "Your Apple ID can have up to 10 devices and computers (combined) associated with it. Each computer must also be authorized using the same Apple ID. Once a device or computer is associated with your Apple ID, you cannot associate that device or computer with another Apple ID for 90 days." - Additionally instructions for "Removing an associated device or computer from an Apple ID"
    So the first account is really "yours" and you are setting her up with her own account?  It helps to know this because if both are "hers" then there isn't an issue with her having full access to both accounts.  If in 10 years she moves 2000 miles away and she is pretty much independent then you may not want her to have full access to your AppleID just so she can authorize a device.

Maybe you are looking for

  • How do I move music from iPod back to my computer w/reformatted hard drive?

    My computer was malfunctioning and required a reinstallation of the OS. Doing so overwrote my iTunes library. Now the only place the library exists is on my iPod. Is there any way for me to transfer the library back to my computer? Thanks.

  • Function Module For F-04

    Hi Is there any function module for F-04 transaction. Please help with answers.

  • Outputting pdf to browser from a db

    Hi , I have a web-app that queries a db for a pdf file (Blob) and then outputs the file to the browser. It seems to be downloaded to the browser, but then nothing is displayed. I've used the same code for other apps, which all work fine. I believe it

  • No Longer Accepting My Login?

    Our Apple ID login/password no longer seems to allow us to administer our account but can be used elsewhere. Who could we contact?

  • Yosemite Clean Install Problems

    Hi, I have recently tried to do a clean install of the Yosemite OS but the install was unsuccessful and I have had nothing but trouble since. If you have any ideas on how to fix my problem, I'd be very grateful. Perhaps you have or have had the same