|
|
|
|
|
|
|
Re: Contains Clause is not working on certain scenario (split from unrelated hijacked thread by bb) [message #495179 is a reply to message #495144] |
Thu, 17 February 2011 15:48 |
|
Barbara Boehmer
Messages: 9100 Registered: November 2002 Location: California, USA
|
Senior Member |
|
|
Since it is loading the file into the blob, and working on a text file, but not on a pdf file or even a doc file, then it sounds like it is not doing any filtering. Ctxhx.exe is the lowest level at which to check the filtering. You mentioned switching from ascii to utf and that might be part of the problem. Please try running something like this from the operating system, substituting your home, paths, and files:
your_oracle_home/bin/ctxhx.exe source_path/source_file.pdf target_path/target_file.html ASCII8 utf8 H NOMETA 120 HEURISTIC FORMAT NOPDFROTATE
After running that, edit your target_path/target_file.html and see if the data in the pdf file has been converted to html or if you have an empty file or no file or what.
If that filters your document, then try adding a column in your table to specify the character set and adding that column as a charset column to your index parameters.
If none of this helps, then I am out of things to check and suggest that you re-post your problem on the OTN Text forum:
http://forums.oracle.com/forums/forum.jspa?forumID=71
The Oracle Text product manager regularly responds there and if he can't figure it out, he will refer you to support. Have you checked metalink to see if there is already an identified bug?
[Updated on: Thu, 17 February 2011 16:04] Report message to a moderator
|
|
|
|