Error while crawling URL containing diacritic characters

Hi,
I have a content source in SharePoint 2013 that is showing errors while trying to crawl links with diacritic characters (portuguese words). The reason is that the crawler regards the URL as invalid.
The problem still occurs if the link URL is encoded (see example 2).
Examples:
1) Atualização 037 de 16-4-2008.htm
2) Atualiza%E7%E3o%20037%20de%2016-4-2008.htm
Log message:
The item could not be accessed on the remote server because its address has an invalid syntax.
I already tried to save the home page (which contains the links) as UTF-8, UTF-8 without BOM, and ANSI.
Also, I tried to include a meta charset tag:
<meta charset="UTF-8">
in addition to the first line with:
<?xml version="1.0" encoding="UTF-8"?>
All unsuccessful attempts. Has anyone found solution for this problem?

Hi,
Just checking in to see if the information was helpful. Please let us know if you would like further assistance.
Have a great day!
Best Regards,
Lisa Chen
TechNet Community Support
Please remember to mark the replies as answers if they help, and unmark the answers if they provide no help. If you have feedback for TechNet Support, contact
[email protected]

Similar Messages

  • Lotus Notes connector: Error while crawling LOB contents

    Hi all,
    I am trying to configure Lotus Notes connector on SP search. We have successfully followed the Randy Rempel's blog on
    test environment. No problem to search lotus notes content.
    In production environment, we've followed the same procedure but we are facing the following problem. During the full crawl, the below error is thrown
    Error while crawling LOB contents. ( Error caused by exception: Microsoft.BusinessData.Infrastructure.BdcException The shim execution failed unexpectedly - Exception has been thrown by
    the target of an invocation..: System.ServiceModel.FaultException`1[System.ServiceModel.ExceptionDetail] An unexpected error occurred in the Lotus Notes protocol handler while processing the URL
    We have forced Lotus Notes Connector to be more verbose, without success.
    Any help will be appreciated
    David

    Reset the index and re-crawl. That usually claers it
    If you are using NTLM authentication, then make sure that you specified the PassThrough authentication for crawling
    Probably you need to debug the BDC code that underlies the external content types.
    can you Check the permission that you have set for the lotus notes domino databases? Was the content access account added to the local administrator group?
    If this helped you resolve your issue, please mark it Answered

  • Error in crawl log "Error while crawling LOB contents. ( Error caused by exception: Microsoft.BusinessData.Infrastructure.BdcException The shim execution failed unexpectedly - The method or operation is not implemented..; SearchID "

    Hi 
    I get the following error in my crawl logs
    "Error while crawling LOB contents. ( Error caused by exception: Microsoft.BusinessData.Infrastructure.BdcException The shim execution failed unexpectedly - The method or operation is not implemented..; SearchID "
    Because of this i suspect, the search results are not including those aspx pages marked as "Hide physical urls from search".
    This error is not available in the another environment where the aspx pages are coming in the results.
    Thanks
    Joe

    Hi Joe,
    Greetings!
    Reset the index and re-crawl. That usually clears it
    If you are using NTLM authentication, then make sure that you specified the PassThrough authentication for crawling
    Probably you need to debug the BDC code that underlies the external content types.
    http://social.technet.microsoft.com/Forums/sharepoint/en-US/41a86c43-151d-47cd-af73-967a4c940611/lotus-notes-connector-error-while-crawling-lob-contents?forum=sharepointsearch
    Please remember to click 'Mark as Answer' on the answer if it helps you

  • URL contains illegal characters

    hi,
    The following is the error i get when i am click a link in my page
    The URL contains illegal characters. You need to encode these illegal characters: OA.jsp?page=/danka/oracle/apps/bis/supplyTrend/webui/DKProduct&fromMonth=01-Aug-2005&toMonth=01-Apr-20&periodType=MONTH geoType=ordered_item geoMainName=0-20 Copies/Minute&geoName=CANON retainAM=Y addBreadCrumb=Y&_ti=1414051287
    There is a space in my geoMainName value between 0-20 and Copies/Minute.
    I know that we have to do some encoding for this. But i am not knowing how to do it. I saw your previous thread with the same problem, but i was not able to understand.
    The following is the line of code i wrote to make a item as a link
    geoNameBean.setDestination("OA.jsp?page=/danka/oracle/apps/bis/supplyTrend/webui/DKProduct&fromMonth="+fromMonthStartDate+"&toMonth="+toMonthStartDate+"&periodType="+periodType+"&geoType=item_manufacturer&geoName={@GeoName}&retainAM=Y&addBreadCrumb=Y");
    Please guide me to encode this parameter value of geoMainName.
    Thanking you
    ri

    did you go through the javadoc for this class ?
    the usage is given there
    Thanks
    Tapash

  • Error while crawling LOB contents SharePoint 2013

    Error while crawling LOB contents SharePoint 2013
    I have Configured the BDC Service application using SQL external content. The connection was successful and I am able to see the external content in the List "BDC Demo" . But when I search in the BDC Demo site it gives nothing.
    So I checked in the crawl logs and identified that it shows " 1 " under error. to further drill down the issue , I went to click on "1" and see the error message : Error while crawling LOB contents SharePoint 2013 .
    I have created an external DB named BCSDemo_DB for which I have granted my search Service account read& write permission.
    I have added the same account under administrators for both secure store and BCS service applications. 
    I have done index reset , done a full crawl but the error still occurs.
    Can someone please advise if I am missing something.
    Regards

    Hi Aravinda,
    According to your description, my understanding is that you got an error when you crawled SQL database table in SharePoint 2013.
    This error is caused by the fact that the default content access account does not have any rights to access the metadata store in the Business Data Connectivity Service Application.
    Or it is caused by the default content access account has no rights on the SQL database.
    For fixing it, you need to grant the default content access account permission on metadata store in the Business Data Connectivity Service Application and the SQL database. You can refer to the link below:
    http://www.sharepointinspiration.com/Lists/Posts/Post.aspx?ID=5
    After that, do a full crawl for the content source.
    Best Regards,
    Wendy
    Wendy Li
    TechNet Community Support

  • Sharepoint 2013 vs Exchange 2010 SP3 search (Error while crawling LOB contents)

    Hi there:
    We are trying to solve the problem: ERROR CRAWLING LOB CONTENTS when we wish to search Exchange 2010 SP3 public folder content on Sharepoint 2013 Foundation.
    Quick briefing:
    Followed this instructions:
    http://technet.microsoft.com/en-us/library/jj591608(v=office.15).aspx
    * Created CRAWL RULE
    - Used Domain Admin for content access ---> IS THIS WRONG?
    - Domain Admin can access public folder thru Outlook Web Access (checked)
    - Included all items in this path
    PRINTSCREEN 1
    * Added a content source for Exchange Server public folders
    - Logged to Outlook Web Access with domain admin, expanded Public folders and opened 1st subfolder in new window and copied the address
    - Logged to Outlook Web Access with domain admin, expanded Public folders and opened 2nd subfolder in new window and copied the address
    PRINTSCREEN2
    * Did a FULL CRAWL
    PROBLEM:
    - Search results does not throw "correct data". Some items are not being found
    CRAWL LOG is reporting: Error while crawling LOB contents
    Detailed error message:
    https://mail.domain.com/OWA/?ae=Folder&id=PSF.LgAAAAAaRHOQqmYRzZvIAKoAL8RaAwAnt2ed15IATLg8XoXLNj4EAAAAXsN8AAAB&t=IPF.Note
    Error while crawling LOB contents.
    Error caused by exception: Microsoft.BusinessData.Infrastructure.BdcException
    The shim execution failed unexpectedly - Exception has been thrown by the target of an invocation..:
    System.InvalidOperationException An internal server error occurred.
    Try again later.; SearchID = 4E8542D3-48EF-404E-8025-8D9AAEFE777A )
    We thought it's a throttling issue / found possible solution:
    http://powersearching.wordpress.com/2013/07/23/exchange-public-folders-search-fail-error-while-crawling-lob-contents/
    Tried it, still same Error messages, problem not resolved.
    Any hints? Please advise.
    With best regards
    bostjanc

    Hi Bostjan,
    From the error message, the issue might be caused by throttling policy on Exchange side. And the article you posted provides the right solution, some modification to the solution and please try again.
    For throttling policy part
    1.Execute the command for Set-ThrottlingPolicy
    Set-ThrottlingPolicy SharePoint -RCAMaxConcurrency Unlimited -EWSMaxConcurrency Unlimited -EWSMaxSubscriptions Unlimited -CPAMaxConcurrency Unlimited -EwsCutoffBalance Unlimited -EwsMaxBurst Unlimited -EwsRechargeRate Unlimited
    2.Execute the command Get-ThrottlingPolicy SharePoint to double confirm the policy setting has been successfully executed
    For registry key part
    1. Start Registry Editor (regedit).
    2. Navigate to the following registry subkey:
    \\HKEY_LOCAL_MACHINE \SYSTEM\CurrentControlSet\Services\MSExchangeIS\ParametersSystem
    3. Right-click ParametersSystem, point to New, and then click Key.
    A new key is created in the console tree.
    4. Rename the key MaxObjsPerMapiSession, and then press Enter.
    5. Right-click MaxObjsPerMapiSession, point to New, and then click DWORD (32-bit) Value.
    The new value is created in the result pane.
    6. Rename the key to <Object_type>, where <Object_type> is the name of the registry object type that you're modifying. For example, to modify the number of messages that can be opened, use objtMessage. Press Enter.
    7. Right-click the newly created key, and then click Modify.
    8. In the Value data box, type the number of objects that you want to limit this entry to, and then click OK. For example, type 350 to increase the value for the object.
    9. Restart the Microsoft Exchange Information Store service.
    If it still doesn’t help, please check ULS log for related error message.
    Regards,
    Please remember to mark the replies as answers if they help and unmark them if they provide no help. If you have feedback for TechNet Subscriber Support, contact
    [email protected] .
    Rebecca Tu
    TechNet Community Support

  • Error while crawling LOB contents (Error caused by exception: Microsoft.BusinessData.Infrastructure.BdcException)

    Everytime when I perform a Full Crawl on the Central Administraion page, this error will appear and the crawl process will run into an infinite loop and nothing crawl success. I have tried delete and re-create the crawling content sources, start/stop
    the search services but this error still appear.
    Please help. Thanks a lot.
    Error Message:
    topicpages://
    Error while crawling LOB contents.
    ( Error caused by exception: Microsoft.BusinessData.Infrastructure.BdcException
    The shim execution failed unexpectedly - Exception has been thrown by the target of an invocation..:
    System.Net.WebException The request failed with HTTP status 404: Not Found.;
    SearchID = EFE3FAA4-12C3-40AD-B410-51D6189FC720 )

    Tried scheduled an incremental every 10 minutes
    Still getting error on crawling LOB contents. The incremental crawl ran into an infinite loop and nothing crawl success
    Thanks for help.
    Error Message:
    topicpages://(servername)
    Error while crawling LOB contents.
     ( Error caused by exception: Microsoft.BusinessData.Infrastructure.BdcException
    The shim execution failed unexpectedly - Exception has been thrown by the target of an invocation..:
    System.Net.WebException The request failed with HTTP status 404: Not Found.;
    SearchID = B801BC11-8138-453F-AB4D-AA4BE4F5B57A )

  • Error while crawling LOB contents. ( Access denied by Business Data Connectivity. )

    Hi,
    When I execute Incremental or Full Crawl I get 17000 error and only 1 success.
    The error is:
    Error while crawling LOB contents. ( Access denied by Business Data Connectivity. )
    I went to Central Admin > Security > Specify Web application Policy > Search for the service account for crawl and it's "svc_SPSearch"
    I went to Central Admin > Application Management > Manage Service Application > BDC Service > I select all the affected external sources > then Set Object Permission in ribbon > Added "svc_SPSearch" with execute rights > then
     kick incremental crawl...
    After this I still get 17000 error, 1 success.. Still an error.
    I am not sure how the crawler's trust to FAST infrastructure works.
    Your response is appreciated. Thanks!
    -Fran

    I have the same issue. 
    I have Configured the BDC Service application using SQL external content. The connection was successful and I am able to see the external content in the List "BDC Demo" . But when I search in the BDC Demo site it gives nothing.
    So I checked in the crawl logs and identified that it shows " 1 " under error. to further drill down the issue , I went to click on "1" and see the error message : Error while crawling LOB contents SharePoint 2013 .
    I have created an external DB named BCSDemo_DB for which I have granted my search Service account read& write permission.
    I have added the same account under administrators for both secure store and BCS service applications. 
    I have done index reset , done a full crawl but the error still occurs.
    Can someone please advise if I am missing something.
    Regards

  • Error while crawling LOB contents

    Hello
    I am trying to get the crawl to work on SharePoint but the full crawls never stops and when I stop it the crawl log has below.
    topicpages://
    Error while crawling LOB contents. ( Error caused by exception: Microsoft.BusinessData.Infrastructure.BdcException The shim execution failed unexpectedly - Exception has been thrown by the target of an invocation..: System.Net.WebException The request
    failed with HTTP status 404: Not Found. )
    I have been searching the net continuously and made sure the folders have the correct permissions etc
    The longest I let the search go is 100 hours
    Using:
    Sharepoint 2013 Foundation
    Server 2008 R2
    Has anyone seen this before or have any ideas what to try and get this going correctly?
    Cheers
    James

    Also below is what comes up in Event log when I start a full crawl
    'Log Name:      Application
    Source:        Microsoft-SharePoint Products-SharePoint Server Search
    Date:          24/06/2014 10:24:53 a.m.
    Event ID:      1400
    Task Category: Crawler:Gatherer Service
    Level:         Warning
    Keywords:     
    User:          <domain>\SP-Search
    Computer:      <server>
    Description:
    Gatherer\CSS connector failed to connect to CSS net.tcp://<server>/5DA29F/ContentProcessingComponent1/ContentSubmissionServices/content
    Event Xml:
    <Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
      <System>
        <Provider Name="Microsoft-SharePoint Products-SharePoint Server Search" Guid="{C8263AFE-83A5-448C-878C-1E5F5D1C4252}" />
        <EventID>1400</EventID>
        <Version>15</Version>
        <Level>3</Level>
        <Task>3</Task>
        <Opcode>0</Opcode>
        <Keywords>0x4000000000000000</Keywords>
        <TimeCreated SystemTime="2014-06-23T22:24:53.874185700Z" />
        <EventRecordID>67690</EventRecordID>
        <Correlation />
        <Execution ProcessID="1636" ThreadID="3284" />
        <Channel>Application</Channel>
        <Computer>**<server>**</Computer>
        <Security UserID="S-1-5-21-3477241775-3255730575-1957110980-1562" />
      </System>
      <EventData>
        <Data Name="string0">net.tcp://<server>/5DA29F/ContentProcessingComponent1/ContentSubmissionServices/content</Data>
      </EventData>
    </Event>'

  • Error while crawling LOB contents in SP 2013:'Could not find default endpoint element that references contract in the ServiceModel client configuration section

    Hi,
    I created custom BDC Model using Visual Studio. In ReadList method i am getting data using Web Service call. Using this External Content Type (BDC Model) i created one external list and it is populating with data.
    I created one new content source in search service application using this BDC Model and when crawled this content source i am getting the below error.
    Error while crawling LOB contents. ( Error caused by exception: Microsoft.BusinessData.Runtime.RuntimeException MethodInstance with Name 'ReadList' on Entity (External Content Type) with Name 'Entity1' in Namespace 'bcsex.BdcModel1' failed
    unexpectedly. The failure occurred in method 'ReadList' defined in class 'bcsex.BdcModel1.EntityService1' with the message 'Could not find default endpoint element that references contract 'ServiceReference1.Service1Soap' in the ServiceModel client configuration
    section.
    I included the bindings and client end point configuration settings in web.config file located at
    \\Inetpub\wwwroot\wss\VirtualDirectories\Port_Number, but still getting the same error.
    Please help. Thanks a lot.

    The Issue resolved by including the bindings and client end point configuration settings in
    machine.config file located at C:\Windows\Microsoft.NET\Framework64\v4.0.30319\Config\ and also restarted the server after config changes.

  • TREX fatal error while crawling

    Hallo,
    I have problems with the indexing of documents.
    I created an index and started the indexing.
    But every time the indexing stops at a certain point and sets the status to "failed".
    In the error log the following entry can be found:
    ERROR     12 juil. 05 11:44:54     fatal error while crawling     java.lang.ArrayIndexOutOfBoundsException
    Can anybody tell me what might be the reason. Or is there any other logfile with a more detailed descripiton?
    Thanks for helping.
    Birgit.

    Hi Birgit,
    can you please post this as support message to SAP?
    Thanks,
    Karsten

  • Error : The input field contains prohibited characters (already deleted)

    Hi,
    While trying to generate depreciation (Tcode AFAB), after select the Time of Print (either spool or immediate printing) this error will be displayed - "The input field contains prohibited characters (already deleted)", no error number displayed and the character ### appear in Time of Print.  I m not using any special characters and i was able to generate it before. Also the test run is successful.
    Please guide why this error is appear and what is the resolution for it.
    Thank you,
    Amalina

    Hi Amalina,
    Please see note 382285 solution part.
    Regards Bernhard

  • Error while crawling web repository

    Hi Experts,
    System in use - EP 2004s
    We have a web server which has number of documents on that server. I have created a web repository for this web server . Repository is working fine , but while crawling it has indexed about 20000  documents and given some errors for 600 documents.
    Errors are like :
    1. Crawler error
    2. TREX preparation error
    and when I search for the indexed pages it gives search result but when I click on html version it gives an error message 'No index service found'.
    Any suggessions !
    (Points are assured ..)
    Thanks & Regards,
    Amit Kade

    Hi Tamil,
    Thanks a lot for the help ! But I have already set everything correctly as per the sap help .
    TREX is indexing the documents but as I mentioned not indexing all the documents and I cannot view the searched documents in HTML format.
    One observation : It is not indexing large documents (Greater than 10 - 12 KB).
    Any suggessions !
    Thanks & Regards,
    Amit Kade

  • Error: Topic ID "Ầ‾»¿#define" contains invalid characters.

    Hi,
    I have just upgraded to RoboHelp for Word 9. When I compile I get the message:
    Warning - HC3033: File RhubarbRhubarb.hh: The Topic ID "Ầ‾»¿#define" contains invalid characters. The #, =, >, @, !, or \Â˚haracters cannot be used in a WinHelp version 4.0 source file.
    I have 7 files in the project and there is one of these error messages for each file. I checked the .hh files and there is nothing corresponding to that as a Topic ID. I don't use any special characters in my Topic IDs anyway.
    It doesn't seem to be having any effect on my generated project but I'm uneasy about unexplainable error messages. Does anyone have an idea why they're there and how to remove them please?
    regards
    Christina

    To the best of my knowledge (haven't used Rh for Word for a long time) the only solution is sleuthing the elusive illegal character.
    See www.grainge.org for RoboHelp and Authoring tips
    @petergrainge

  • Error while passing URL Dynamically in SOAP AXIS adapter..!!

    Hi ,
    Idoc> XI>SOAP-AXIS
    I am doing a scenario where I need to pass the URL dynamically in SOAP-AXIS adapter by taking the RCVPRN of Idoc.
    If
    RCVPRN = 100 , message has to go to http://10.190.25.16:8210/file/receiver
    RCVPRN = 200 , message has to go to    http://10.190.25.16:8210/file/receiver2
    RCVPRN = 300 , message has to go to    http://10.190.25.16:8210/file/receiver3
    I used the below UDF and it is working correctly and generating the URL dynamically .
    DynamicConfiguration conf = (DynamicConfiguration) container.getTransformationParameters().get(StreamTransformationConstants.DYNAMIC_CONFIGURATION);
    DynamicConfigurationKey keyHeader1 = DynamicConfigurationKey.create( "http://sap.com/xi/XI/System/SOAP", "TServerLocation");
    conf.put(keyHeader1, a);
    return "";
    The value is coming in SOAP document as expected like below.
    <sap:DynamicConfiguration xmlns:sap="http://sap.com/xi/XI/Message/30" SOAP:mustUnderstand="1">
      <sap:Record namespace="http://sap.com/xi/XI/System/SOAP" name="TServerLocation">http://10.190.25.16:8210/file/receiver3</sap:Record>
      </sap:DynamicConfiguration>
    I used the below Configuration modules in receiver SOAP-AXIS adapter as suggested in Note 1028961.
    AF_Adapters/axis/AFAdapterBean ---> afreq
    AF_Adapters/axis/HandlerBean ---> xireq
    AF_Adapters/axis/HandlerBean ---> dc
    AF_Adapters/axis/HandlerBean ---> remover
    AF_Adapters/axis/HandlerBean ---> trp
    AF_Adapters/axis/HandlerBean ---> xires
    AF_Adapters/axis/AFAdapterBean ---> afres
    xireq -> handler.type-> java:com.sap.aii.axis.xi.XI30OutboundHandler
    dc -> handler.type-> javasap.aii.axis.xi.XI30DynamicConfigurationHandler
    dc ---> key.1 ---> write http://sap.com/xi/XI/System/SOAP TServerLocation
    dc ---> location.1 ---> context
    dc ---> value.1 ---> transport.url
    remover ---> handler.type ---> java:com.sap.aii.axis.soap.HeaderRemovalHandler
    remover ---> namespace ---> http://sap.com/xi/XI/Message/30
    trp ---> handler.type ---> java:com.sap.aii.adapter.axis.ra.transport.http.HTTPSender
    trp ---> module.pivot ---> true
    xires ---> handler.type ---> java:com.sap.aii.axis.xi.XI30OutboundHandler
    and I am getting the below error in SOAP-AXIS channel at the point java:com.sap.aii.adapter.axis.ra.transport.http.HTTPSender.
    2009-03-02 15:23:44 Success Axis: getting handler trp of java:com.sap.aii.adapter.axis.ra.transport.http.HTTPSender
    2009-03-02 15:23:45 Error Axis: error in invocation: (500)internal server error
    2009-03-02 15:23:45 Error MP: Exception caught with cause (500)internal server error
    2009-03-02 15:23:45 Error Exception caught by adapter framework: (500)internal server error
    2009-03-02 15:23:46 Error Delivery of the message to the application using connection SOAP_http://sap.com/xi/XI/System failed, due to: com.sap.aii.af.ra.ms.api.RecoverableException: (500)internal server error: (500)internal server error.
    2009-03-02 15:23:46 Error The message status set to NDLV.
    Kindly let me know if anyone has any idea what might be wrong?
    Note : The given URL is correct one because I cross checked by passing them normally by giving directly in adapter and they are all working and the message is going to receiver.
    Thanks
    Deepthi

    Hi Stefan,
    >> dc -> handler.type-> javasap.aii.axis.xi.XI30DynamicConfigurationHandler
    >> dc ---> key.1 ---> write http://sap.com/xi/XI/System/SOAP TServerLocation
    >> dc ---> location.1 ---> context
    >> dc ---> value.1 ---> transport.url
    >> Try read instead of write
    When I tried with this, I am getting the below error "Connection refused ".
    Success Axis: entering HandlerBean
    Success Axis: getting handler trp of java:com.sap.aii.adapter.axis.ra.transport.http.HTTPSender
    Error Axis: error in invocation: java.net.ConnectException: Connection refused
    Error MP: Exception caught with cause java.net.ConnectException: Connection refused
    Error Exception caught by adapter framework: ; nested exception is: java.net.ConnectException: Connection refused
    Error Delivery of the message to the application using connection SOAP_http://sap.com/xi/XI/System failed, due to: com.sap.aii.af.ra.ms.api.RecoverableException: ; nested exception is: java.net.ConnectException: Connection refused: java.net.ConnectException: Connection refused.
    Success The message status set to WAIT.
    It is going successfully when I tried sending directly. Looks like Dynamic Configuration is not working for SOAP-AXIS.
    Any Suggestion?
    Thanks
    Deepthi.

Maybe you are looking for