NetProwler
07-09-2011, 02:33 AM
A brief introduction to the problem: One of our clients has an in-house search facility in her large portal site - which is also triggered when there is a 404 - to 'improve' the user experience. Some pages use French, Spanish and Latin words/expressions as part of the content. Eg: Caveat Emptor, Carpe Diem,raison d'etre etc
The search script uses a look-up table to discern the meaning of the searched word in case it can't find appropriate content. For other languages, we want to use a language dictionary to populate the look-up table. I know it sounds very convoluted and complicated. But we have to do what we need to do. We downloaded one French-English dictionary for this project. But the dictionary is in the form of an executable file which upon installation reduces to an executable binary file and what looks like a compressed form of word definitions in binary format. I tried disassembler, Hex Editors to extract the word definitions to no avail. I can't find any ASCII string containing words.
I had done this exercise about 7 years ago for another project but I can't remember what Perl script I wrote to extract from a similar file. Any pointers ?
I would really appreciate any help.
The search script uses a look-up table to discern the meaning of the searched word in case it can't find appropriate content. For other languages, we want to use a language dictionary to populate the look-up table. I know it sounds very convoluted and complicated. But we have to do what we need to do. We downloaded one French-English dictionary for this project. But the dictionary is in the form of an executable file which upon installation reduces to an executable binary file and what looks like a compressed form of word definitions in binary format. I tried disassembler, Hex Editors to extract the word definitions to no avail. I can't find any ASCII string containing words.
I had done this exercise about 7 years ago for another project but I can't remember what Perl script I wrote to extract from a similar file. Any pointers ?
I would really appreciate any help.