I want example application in c# based on the duplicate detection alogrithm

duplicate detection alogrithm
Step 1: Consider the Stemmed keywords of the web page.
Step 2: Based on the starting character i.e. A-Z we here by assumed the hash values should start
with1-26.
Step 3: Scan every word from the sample and compare with DB (data base) (initially DB
Contains NO key values. Once the New keyword is found then generate respective hash
value. Store that key value in temporary DB.
Step 4: Repeat the step 3 until all the keywords get completes.
Step 5: Store all Hash values for a given sample in local DB (i.e. here we used array list)
Step 6: Repeat step 1 to step 6 for N no. of samples.
Step 7: Once the selected samples were over then calculate similarity measure on the samples
hash values which we stored in local DB with respective to webpages in repository.
Step 8: From similarity measure, we can generate a report on the samples in the score of %
forms. Pages that are 80% similar are considered to be near duplicates.
this is my duplicate detection alorithm. i want some example windows application in c# based on alogrithm.

Yu may want to use a dictionary which has a built in hash table for the key.  See code below for starting point
List<string> input = new List<string>() { "abc", "abd", "def", "ghi" };
Dictionary<string, List<string>> dict = input.AsEnumerable()
.GroupBy(x => x.Substring(0,1), y => y)
.ToDictionary(x => x.Key, y => y.ToList());
jdweng

Similar Messages

  • Want to generate target structure based on the comparing of records source

    Hi Experts
    i hava an issue here
    I want to generate target sturecture
    My requirement is
    Source                                                   Target
    LPLRecordset                                        PurchaseRecordset
    a                                                             a
    b                                                             b
    c                                                             c
    d                                                             d
    Tax(Subrecordset                                 1
      1                                                            2
      2
    If LPL data is coming it need to populate Purchase Record fields... if tax is coming then it need to populate with LPL with Tax ,
    If tax Recordset is coming twice it need to populate Purchase Recordset ..
    Plz hep me in this
    how to populate the target , If possible with eg

    Source                                                 
    LPLRecordset                                       
    a                                                            
    b                                                           
    c                                                            
    d                                                            
    Tax(Subrecordset                                
      1                                                           
      2
    Target
    PurchaseRecordset
    a
    b
    c
    d
    1
    2
    If LPL data is coming just it should populate a,b,c,d
    If Tax data is coming it need to populate whole purchase record but only 1,2
    if tax data is repeating 2 times it should populate whole purchase record twice with only tax data populate in the target
    How to map from source to target
    Reply me back if you have any queies

  • BSP Extensions Book - example application

    Hi all!
    I have bought the Book "BSP Extensions: How to master Web reporting with HTMLB" and i've tried to run the example application. But on half the way of the programming (the next chapter is checkbox & radioButton) i'm getting a JScript error that says "childNodes.0.tagName is null or not an object" and it occurs every time i want to create the text of a new treenode by focusing the inputfield.
    I've coded everything exactly like it is in the book so i don't know what's wrong. A little while ago i've searched the SDN forums for that case and found out that there is a SAP Note (949105) about it. After following the instructions and refreshing the cache the error still appeares. Does anyone read the book and faced the same problem?
    (I'm Using the Trial Version of Netweaver 2004s ABAP.)
    Regards
    Mark-André

    hi,
    can you tell me ..what kind of example you are trying..
    there are demo programs which are given by sap..first try it and then sort the error..
    program names :
    SBSPEXT_BSP
    SBSPEXT_HTMLB
    or
    choose bsp application from se80 and search as SB*
    you will find more programs:
    thanks
    jaideep
    if helpful reward points...

  • How can I use an application based upon the Siebel High Interactive Framework?

    I ma trying to use an application that is based upon the Siebel High Interactive Framework and can't open the software on my MacBook Pro. IS there any solution to this problem?

    My vote is pick up VirtualBox and download Windows 8 Preview, both are free, and see how that goes. Even though a deveoper preview and has IE10 this is my primary OS and browser. Only issue is getting it into a VM but that should be okay, and at least you can decide from there.
    Windows 7 64-bit SP1 Home Premium System Builder ($95) otherwise and then Parallels 7 or run natively.
    http://news.softpedia.com/news/Download-and-Install-Windows-8-on-a-Mac-with-Para llels-7-0-15050-231759.shtml
    http://msdn.microsoft.com/en-us/windows/apps/br229516
    VirtualBox 4.1.4 is includes a Windows 8 Template for guests.
    http://www.virtualbox.org/wiki/Downloads
    http://skear.hubpages.com/hub/Installing-Windows-8-Developers-Preview-Edition-on -VirtualBox

  • Plot a chart dynamically based on the values selected

    Hi All,
    I have some requirement like this i want to plot a chart . The chart should change dynamically based on target_name and date. What type of UI should I use ? I tried using multiselect or shuttle for choosing the target_name and date picker tool for date once i choose all this the chart should appear. Can I have a custom button called submit so once all the values are entered it plots chart ? Please help me out with our ideas.
    Thanks in Advance

    Hi,
    Using the dependent value sets you can govern the values which can be selected based on a specific value selected in a particular segment. Example if Country name is selected in segment 1 then specific states names as per the country selected can be displayed in segment 2.
    As per the requirement described you want to enable different fields based on the value selected in segment1, currently there is no standard mechanism available to enable/disable fields based on a value and you will have to do an extension/customization to meet this requirement.
    Thanks,
    Sanjay

  • How to fill a Tree in Java based on the BC4J?

    Hi,
    I want to build a tree based on the BC4J and then I want to use it with hGrid, could anyone please give me a example code.
    Thanks
    Mohammad Tahir

    Hi,
    I want to build a tree based on the BC4J and then I want to use it with hGrid, could anyone please give me a example code.
    Thanks
    Mohammad Tahir

  • Display of Time Stamps based on the TimeZones

    Hi experts,
    I am new to Internationalization topic and wants to
    display Time Stamps based on the TimeZones.
    For example, If I choose Asia/Calcutta as my TimeZone
    it is displaying "Mon Aug 28 11:02:47 GMT+5:30 2001" as the TimeStamp.(I am in this TimeZone,which is default to me.)
    Similarly If I select any other TimeZone, it should display the TimeStamp in the same format but the values in the Time Stamp should reflect the selected TimeZone.
    I will be thankful to you if u could help me to get the solution.
    Srinivas.

    Hi Srinivas,
    It comes down to some tinkering in the java.util.* package.
    First off, you'll need to familiarise yourself with the Calendar class. This basically replaces the old Date class from old versions of the JDK, and allows you to hold certain dates and times. You can get one by calling Calendar.getInstance();
    You can then set a TimeZone on this Calendar by calling myCalendar.setTimeZone().
    To create a basic TimeZone, have a look at the SimpleTimeZone class - it'll allow you to set the hours ahead or behind GMT, as well as a start date for this time zone, and even gives the ability to add daylight savings time.
    From there, the time zone you add to the Calendar should produce the correct dates for you. Changing the time zone within a Calendar that already holds a date and time should alter that date and time accordingly.
    If you want to get really clever, have a look at the Locale class - it allows you to specify a default language and country. You can then apply this to your Calendar by calling Calendar.getInstance(Locale myLocale) instead of the version above. If you have the international JDK (i.e. the one with i18n.jar onboard), Calendar should be clever enough to format the dates nicely for you if you use a fairly well-known Locale (such as American English and so on).
    A Locale will also let you format dates in a location-specific way if you want to output them to the screen later on down the track. A quick look at SimpleDateFormat and some of the other DateFormat classes should tell you what you need to know.
    If you can't quite get your head around any of the above, feel free to reply again!
    Martin Hughes

  • Sticky connectios based on the JSessionID

    We are using a CSS11501.
    We want to implement sticky connections based on the JSessionID from the ORACLE application servers?
    How could this be done?
    Thank you in advance.

    Gilles,
    thank you very much for your prompt reply.
    It is very helpful and gives the right answer to my question.
    Just a clarification through my scenario; the CSS load balances across 3-servers at 3 different ports. Each server may handle a request to any of these different ports.
    On top of each server sits an APACHE that also publishes three uri that the client hits. These uri are common for all servers.
    Thus, in order to make sticky connections to my understanding, we should not check the cookie from the session that the client initiates (I mean any jessionid originated by hitting e.g. www.company1.com) but the server IP and port. There are also www.company2.com and www.company3.com.
    Each of them resides on www.server1.com, www.server2.com and www.server3.com
    That means a mixed configuration, if the jsessionid cannot contain a constant. As far I have seen this is dynamic and quite difficult to make it constant. Have you ever come across with this ? Is this possible (I mean a constant in the jsessionid)? It is sufficient to me, if this can be implemented. I will ask the developer to do so, but we donnot know whether this is achievable.
    Any suggestions that you may have are more than welcome.
    It is quite complex and confusing ... :(

  • Need to launch specific document and wave file based on the inbound Route Point

    I am having issues using CDA in terms of creating the appropriate WF to launch both a wave file and specific Word document based on the inbound DNIS.  I have UCCX 8.0(2) SU3 enhanced so I can't use HTTP.  I really need a solution today.  Thanks.

    I did script to play the file to the agent as a whisper then hang up and ring the line again to connect the caller to the agent but the customer didn't like that at all so i just created the WF to launch from the same filepath on all machines to point the the appropriate wave files based on the DNIS.  Because the customer has enhanced I'm not launching the Word docs as PDFs in a integrated browser like I'd prefer but as an external application.  Again, based on the DNIS. 

  • I want to create a Ajax based Chating application using java .

    hi . I want to create an chating Application , and ajax based chatting application using java as my server side language ..i have some idea about it .. but one thing that i m not understanding is that how i will communicate with other users because each user has its own session so what i will do to send one messge to all the users ???? . i m new to JavaEE and i have chosen this as my semester project .. i need some help regarding this .

    First get it working without AJAX.

  • Please help me regarding ServiceDemo example application

    Hi Friends..
    Currently i use Java Card 2.2.1, and i use JCDK 2.2.1..
    There's a ServiceDemo example application that shipped with JCDK 2.2.1..
    There are some files, as follows :
    1. Main class that extends javacard.framework.Applet
    2. TestService class that extends BasicService
    3. PreProcess class
    4. PostProcess class
    The PreProcess class and PostProcess class contains nothing, there's only it's default constructor in it..
        private Dispatcher disp;
        private Service serv;
        public Main()
            disp = new Dispatcher( (short) 1);
            serv = new TestService();
            disp.addService(serv, Dispatcher.PROCESS_COMMAND);
            register();
        }In that code, it seems that the all processes handled by TestService class...
    What is the advantages while we use "Service" based java card?.. Is it more good than RMI based Java Card?..
    Do i've to send the APDU command while i want to do the specific process provided?..
    Is this often used in the kind of Java Card Application?..
    Thanks in advance..

    Thanks Shane for your reply.. :)
    If i rely on SecureRMIPurse directly without know the basic of Pre-process and Post-process, i think its difficult enough for me.. :(
    Could you guide me the very simple Pre-process and Post-process?..
    Assume that i describe this function in the PreProcess and PostProcess class :
    public class PreProcess {
        public PreProcess() {
        public byte addOne(byte add){
             return ++add;
    public class PostProcess {
        public PostProcess() {
        public byte subtractOne(byte subtract){
             return --subtract;
    }And then how to call either PreProcess's addOne method or PostProcess's subtractOne method from TestService class?..
    Is it only through new operator?..
    PreProcess preproc=new PreProcess();
    preproc.addOne(1);
    PostProcess postproc=new PostProcess();
    postproc.subtractOne(5);Is this way a good implementation?..
    Please correct me if i'm wrong..
    Thanks in advance..

  • License Information for Adventure Builder Example Application

    Hello All,
    We wanted to use the example application Adventure Builder in one of our research projects as a case study. (Link to adventure builder: http://java.sun.com/developer/releases/adventure/). However, I am not very sure about the license they use. (http://java.sun.com/developer/releases/adventure/adventure-1_0_1-license.txt). We want to use both the architectural documents available at AdventureBuilder wiki and the source code of the application. For the architectural documents, however, we are not sure, whether, we can use them in our project and publish our results based on them or not?
    Any help will be appreciated.
    Best Regards
    Quratulann

    Ask Oracle's lawyers, or your own. This is not a legal aid service, and any advice you might get here cannot be relied on. Locking this off topic thread.

  • MRP with PD, it doesn't want to show me UNPLREQ based on forecast.

    Hi guru's!
    I work on a new project. I use PD as planification type.
    The target is to take in account in MRP only the PLDORDER proposals (not converted in Production Order), the companie update the stock manually with MIGO to erase PldOrder at each new MRP run. (They don't want to create and manage master data as componants, BOM or routings)
    Material master have forecast view, and historic of consumption are copied from past sales results.
    They want to use a dynamic safety stock, which is calculated thanks to MP38. This lunch calculation of forecast collectively to create UNPLREQ in MRP every month based on the historic.
    I've done the customizing, and my MRP work correctly : UNPLREQ was here and the update of stock with MIGO was good (after a MRP run PLDORDER were erased).
    But after some change in customizing, I don't retreive my first situation, in fact UNPLREQ (neither none requirement) are shown now in my MRP.
    I don't retrieve my mistake in customizing... Please, is somebody know where something can change the fact that you can't see the requirements in MRP??? the problem not comes from my material master data, i'm sure of that!! it's in the customizing.
    I hope you have all need to answer.
    Thanks in advance.
    JC

    Dear R.Brahmankar,
    Your note, even if it's for an old version of SAP, is helpfull for me. I will try to create my own planification type with the same configuration than PD but changing forcast to mandatory. I will test it and we will see after...
    In fact, before my example "crash", I have tested plan. type VV and VM. So, as forcast is mandatory in this case but it's not work too, there is something which is "bizarre"!! Moreover, I've try the two way of enter forcast historic (non planned consumption and total consumption) for my article in forcast view.
    JH,
    My requirement was not replace in independant requirement (MD61).
    Thanks, I will re-post something if I find a my mistake.!!

  • Adobe Story application for iOS-based devices

    This question was posted in response to the following article: http://help.adobe.com/en_US/story/cs/using/WSeffff8bffc802084-10c4f33d12eecb6f873-8000.htm l

    Adobe has a great, different view on screenwriting (and soon, novel writing) applications. I really like the desktop browser-based versions; however, the desktop and iOS versions royally bite. I cannot even use the browser version on my iPad because of the Flash limitation - though, that wouldn't be a problem if Adobe had a really nice iPad Adobe Story app.
    The iPhone version is only good enough to look something up right quick, like when an idea pops in my head and I want to verify where it might go... but not ever execute those changes on the iPhone because it's just not usable... and I probably wouldn't even want to do much on that smaller device.
    Thus, I'm eager to see a great Adobe Story iPad app that does it all. Since I need that type of app NOW, I bought Final Draft Writer which does everything I wanted... and then, I saw the benefits of upgrading my Final Draft Mac desktop software to version 8 so they can interact with each other. These are purchases I would rather not have made, but needed to due to Adobe Story not being quite up to speed yet.
    But, I know Adobe Story will get there. Looking forward to it... and to not endlessly upgrading FD anymore.

  • Anyone have an example of a delimiter-based eText template?

    Would REALLLLY appreciate it. We are trying to get eText out of Bi Publisher and want a delimited format. I have only done fixed position.

    can you please post an example for both - delimiter based and fixed position based. I am not able to process the output. You can mail at [email protected]
    Thanks,
    sbedi

Maybe you are looking for

  • Macbook Pro to Asus monitor best connection?

    I have a late 2013 mbp and just bought an Asus VS248H-P monitor to hook up to it. I've heard of different ways to connect them, for example, HDMI to HDMI, HDMI to thunderbolt, and DVI to thunderbolt. I was wondering which would look the best on the m

  • How can i move photos from computer to iphone

    how can i move photos from computer to my iphone 4 please answer... thanks

  • Patch Policies not Caching on Rebuild

    I'm gone through my second "patch Tuesday" using the new Patch Policy feature in ZCM 11.3 and I'm trying to figure out why my patch criteria does not appear to be caching what I am expecting. From the Rules tab, I've set my criteria for one of my rul

  • JTable Cell Renderer - Using DefaultTableCellRender

    Hi. I am posting this Question again in a simplified manner. How to color complete row by checking the "value" when you are using ur custom made RowRenderer which extends DefaultTableCellRenderer. In this case only one cell gets colored. What I am do

  • Speeding Up The Loading Time of Plug-Ins.....

    Hi. I have Scarbee Vintage Keyboard Collection and it takes about 1/2 mins to load the engine, then 2/3 mins to load a patch. It doesnt do this with other plugins and my computer is certainly not slow. Has anyone got any ideas of what this could be?