Home » Server Options » Text & interMedia » Multi-lingual index for blob column (Oracle 10g R2 10.2.0.1.0)
Multi-lingual index for blob column [message #513612] Tue, 28 June 2011 06:56 Go to next message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

Hi All,

I have a test table with three columns (id,name,doc) where doc column is of blob type.

I am mentioning the step I have followed to create index and then search the document containing the keyword using the CONTAIN keyword.

I am able to find the documents in English Language but not able to find documents in other langauges.

Please help me out.

SQL> conn sample/sample
Connected.
SQL> create table test(
  2  id number primary key,
  3  name varchar2(2000),
  4  doc blob);

Table created.

SQL> create sequence sample_seq;

SQL> conn sys/oracle as sysdba
Connected.
SQL> create or replace directory documents as 'C:\sample_work';

Directory created.

SQL> grant read,write on directory documents to sample;

SQL> create or replace procedure load_data ( p_file_name IN test.name%type) AS
  2     v_bfile bfile;
  3     v_blob blob;
  4     begin
  5     insert into test (id,name,doc)
  6     values (sample_seq.nextval,p_file_name,empty_blob())
  7     return doc into v_blob;
  8     v_bfile := bfilename('DOCUMENTS',p_file_name);
  9     dbms_lob.fileopen(v_bfile,dbms_lob.file_readonly);
 10     dbms_lob.loadfromfile(v_blob,v_bfile,dbms_lob.getlength(v_bfile));
 11     dbms_lob.fileclose(v_bfile);
 12     commit;
 13     end;
 14  /

Procedure created.

SQL> EXEC load_data ('Clustering.doc');

PL/SQL procedure successfully completed.

SQL> exec load_data('connectivity.doc');

PL/SQL procedure successfully completed.

SQL> select id from test;

        ID
----------
        22
        23
        24

SQL> begin
  2   ctx_ddl.create_preference('est_lexer', 'WORLD_LEXER');
  3  end;
  4  /

PL/SQL procedure successfully completed.

SQL> create index sample_doc_idx on test (doc) indextype IS ctxsys.context parameters(' LEXER EST_LEXER ');

Index created.

SQL> exec load_data('backtrack_oracle_tutorial.pdf');

PL/SQL procedure successfully completed.

SQL> exec load_data('Reading Logs Spanish.pdf');

PL/SQL procedure successfully completed.

SQL> exec load_data('Pan-2.4-fr_FR.pdf');

PL/SQL procedure successfully completed.

SQL> exec load_data('Kitchen-2.4-fr_FR.pdf');

PL/SQL procedure successfully completed.

SQL> exec load_data('dutch.txt');

PL/SQL procedure successfully completed.

SQL> select count(*) from test;

  COUNT(*)
----------
         8


SQL> set autotrace on
SQL> ed
Wrote file afiedt.buf

  1  SELECT SCORE(1) score, id, name
  2  FROM   test
  3  WHERE  CONTAINS(doc, 'Tomcat', 1) > 0
  4* ORDER BY SCORE(1) DESC
SQL>
SQL> /

     SCORE         ID NAME
---------- ---------- --------------------------------------------------------------------------------
       100         23 Clustering.doc


Execution Plan
----------------------------------------------------------
Plan hash value: 2693406471

-----------------------------------------------------------------------------------------------
| Id  | Operation                    | Name           | Rows  | Bytes | Cost (%CPU)| Time     |
-----------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT             |                |     3 |  1632 |     1 (100)| 00:00:01 |
|   1 |  SORT ORDER BY               |                |     3 |  1632 |     1 (100)| 00:00:01 |
|   2 |   TABLE ACCESS BY INDEX ROWID| TEST           |     3 |  1632 |     0   (0)| 00:00:01 |
|*  3 |    DOMAIN INDEX              | SAMPLE_DOC_IDX |       |       |     0   (0)| 00:00:01 |
-----------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   3 - access("CTXSYS"."CONTAINS"("DOC",'Tomcat',1)>0)


Statistics
----------------------------------------------------------
         11  recursive calls
          0  db block gets
         19  consistent gets
          0  physical reads
          0  redo size
        532  bytes sent via SQL*Net to client
        396  bytes received via SQL*Net from client
          2  SQL*Net roundtrips to/from client
          1  sorts (memory)
          0  sorts (disk)
          1  rows processed



but When i try to look for the documents in other languages "no rows selected"

SQL> ed
Wrote file afiedt.buf

  1  SELECT SCORE(1) score, id, name
  2  FROM   test
  3  WHERE  CONTAINS(doc, 'Oracle Application Express', 1) > 0
  4* ORDER BY SCORE(1) DESC
SQL> //

no rows selected


Execution Plan
----------------------------------------------------------
Plan hash value: 2693406471

-----------------------------------------------------------------------------------------------
| Id  | Operation                    | Name           | Rows  | Bytes | Cost (%CPU)| Time     |
-----------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT             |                |     1 |   544 |     1 (100)| 00:00:01 |
|   1 |  SORT ORDER BY               |                |     1 |   544 |     1 (100)| 00:00:01 |
|   2 |   TABLE ACCESS BY INDEX ROWID| TEST           |     1 |   544 |     0   (0)| 00:00:01 |
|*  3 |    DOMAIN INDEX              | SAMPLE_DOC_IDX |       |       |     0   (0)| 00:00:01 |
-----------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   3 - access("CTXSYS"."CONTAINS"("DOC",'Oracle Application Express',1)>0)


Statistics
----------------------------------------------------------
        148  recursive calls
          0  db block gets
        647  consistent gets
          0  physical reads
          0  redo size
        380  bytes sent via SQL*Net to client
        385  bytes received via SQL*Net from client
          1  SQL*Net roundtrips to/from client
          1  sorts (memory)
          0  sorts (disk)
          0  rows processed


Please help me how I can use world_lexer for documents in various languages

Thanks
Deepak
Re: Multi-lingual index for blob column [message #513627 is a reply to message #513612] Tue, 28 June 2011 07:24 Go to previous messageGo to next message
cookiemonster
Messages: 13958
Registered: September 2008
Location: Rainy Manchester
Senior Member
Are you expecting oracle text to translate the documents into english?
Re: Multi-lingual index for blob column [message #513637 is a reply to message #513627] Tue, 28 June 2011 07:55 Go to previous messageGo to next message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

No I just want to display the record in which the search sting exists.

Look at this:

SELECT SCORE(1) score, id, name
FROM   test
WHERE  CONTAINS(doc, '箭头显示在两个方向 ', 1) > 0
ORDER BY SCORE(1) DESC;


no rows selected.

Thanks
Deepak
Re: Multi-lingual index for blob column [message #513640 is a reply to message #513637] Tue, 28 June 2011 08:04 Go to previous messageGo to next message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

Is some related to character set of my windows and oracle database?

Regards
Deepak
Re: Multi-lingual index for blob column [message #513650 is a reply to message #513640] Tue, 28 June 2011 09:06 Go to previous messageGo to next message
Barbara Boehmer
Messages: 9100
Registered: November 2002
Location: California, USA
Senior Member
According to what you posted, you have not synchronized the index after loading the additional files. So, even though you have 8 rows in the table, only the first 3 that you loaded before you created the index have been indexed, so you will not be able to search anything in the other 5. After loading, you need to synchronize.
Re: Multi-lingual index for blob column [message #513655 is a reply to message #513650] Tue, 28 June 2011 09:45 Go to previous messageGo to next message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

Hello Mam,

I used the below mentioned command to synchronize the Index:

begin
ctx_ddl.sync_index('SAMPLE_DOC_IDX');
END;


SELECT SCORE(1) score, id, name
FROM   test
WHERE  CONTAINS(doc, '箭头显示在两个方向', 1) > 0
ORDER BY SCORE(1) DESC;


Still no rows selected.

doesnot seems helping out.

Regards
Deepak
Re: Multi-lingual index for blob column [message #513659 is a reply to message #513655] Tue, 28 June 2011 10:15 Go to previous messageGo to next message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

I have found some procedure to synchronyze the context index and after making the changes accordingly still "no rows returned"

Below is the procedure created:

CREATE OR REPLACE  
Procedure sync_ctx_indexes 
IS  
CURSOR sql1 is select distinct(pnd_index_owner||'.'||pnd_index_name) as index_name from CTXSYS.ctx_pending; 
BEGIN  
FOR SAMPLE IN sql1 LOOP
ctx_ddl.sync_index(SAMPLE.SAMPLE_DOC_IDX); 
END LOOP; 
END; 



Again when I executed the same sql statement for the search , got the same result:

SELECT SCORE(1) score, id, name
FROM   test
WHERE  CONTAINS(doc, 'Datenbankebene ', 1) > 0
ORDER BY SCORE(1) DESC;


Help me out plsss.

Thanks
Deepak
Re: Multi-lingual index for blob column [message #513663 is a reply to message #513659] Tue, 28 June 2011 10:32 Go to previous messageGo to next message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

I have also checked the ctx_pending view but there are no indexes pending for the sample user.

SQL> select * from ctxsys.CTX_PENDING;

PND_INDEX_OWNER                PND_INDEX_NAME                 PND_PARTITION_NAME             PND_ROWID          PND_TIMES
------------------------------ ------------------------------ ------------------------------ ------------------ ---------
SAMPLE_TEST                    SAMPLE_GLOBAL_IDX                                             AAASfuAANAAAADvAAA 28-JUN-11
SAMPLE_TEST                    SAMPLE_GLOBAL_IDX                                             AAASfuAANAAAADvAAB 28-JUN-11
SAMPLE_TEST                    SAMPLE_GLOBAL_IDX                                             AAASfuAANAAAADvAAC 28-JUN-11
SAMPLE_TEST                    SAMPLE_GLOBAL_IDX                                             AAASfuAANAAAADvAAD 28-JUN-11
SAMPLE_TEST                    SAMPLE_GLOBAL_IDX                                             AAASfuAANAAAADvAAE 28-JUN-11


What next to do?

regards
Deepak
Re: Multi-lingual index for blob column [message #513666 is a reply to message #513663] Tue, 28 June 2011 12:32 Go to previous messageGo to next message
Barbara Boehmer
Messages: 9100
Registered: November 2002
Location: California, USA
Senior Member
Try testing just a simple example first, inserting test data directly through SQL, without loading a file, inserting before you index, checking that the data is loaded, checking that the data is indexed, then testing the queries, as shown below. I have provided the test script and a run of it separately. It does not display properly on my system due to character set issues, but it still indexes and finds the data. Please post the results from your system in the same manner. If that does not work, then we need to figure out why. If it does work, then you can add one more thing at a time to it until it does not work, which should show where the problem is. You should re-run the full script and check to make sure that the data is loaded and check to make sure that the data is indexed with each test after adding something. Frequently problems are that the data was not properly loaded or not done indexing or synchronizing.

create table test
  (id    number primary key,
   name  varchar2(2000),
   doc   blob)
/
-- insert data before indexing:
insert into test (id, name, doc)
values (1, 'test1', utl_raw.cast_to_raw ('Tomcat'))
/
insert into test (id, name, doc)
values (2, 'test2', utl_raw.cast_to_raw ('Datenbankebene'))
/
insert into test (id, name, doc)
values (3, 'test3', utl_raw.cast_to_raw ('箭头显示在两个方向'))
/
commit
/
-- make sure data is in the table:
column name format a5
column doc  format a30
select id, name, length (doc), utl_raw.cast_to_varchar2 (doc) doc
from   test
/
begin
  ctx_ddl.create_preference ('est_lexer', 'world_lexer');
end;
/
create index sample_doc_idx on test (doc) 
indextype is ctxsys.context
parameters ('lexer est_lexer')
/
-- make sure data is indexed:
select token_text from dr$sample_doc_idx$i
/
-- test queries:
select score (1) score, id, name, 
       utl_raw.cast_to_varchar2 (doc) doc
from   test
where  contains (doc, 'Tomcat', 1) > 0
order  by score (1) desc
/
select score (1) score, id, name, 
       utl_raw.cast_to_varchar2 (doc) doc
from   test
where  contains (doc, 'Datenbankebene', 1) > 0
order  by score (1) desc
/
select score (1) score, id, name, 
       utl_raw.cast_to_varchar2 (doc) doc
from   test
where  contains (doc, '箭头显示在两个方向', 1) > 0
order  by score (1) desc
/


SCOTT@orcl_11gR2> create table test
  2    (id    number primary key,
  3  	name  varchar2(2000),
  4  	doc   blob)
  5  /

Table created.

SCOTT@orcl_11gR2> -- insert data before indexing:
SCOTT@orcl_11gR2> insert into test (id, name, doc)
  2  values (1, 'test1', utl_raw.cast_to_raw ('Tomcat'))
  3  /

1 row created.

SCOTT@orcl_11gR2> insert into test (id, name, doc)
  2  values (2, 'test2', utl_raw.cast_to_raw ('Datenbankebene'))
  3  /

1 row created.

SCOTT@orcl_11gR2> insert into test (id, name, doc)
  2  values (3, 'test3', utl_raw.cast_to_raw ('箭头显示在两个æ¹å''))
  3  /

1 row created.

SCOTT@orcl_11gR2> commit
  2  /

Commit complete.

SCOTT@orcl_11gR2> -- make sure data is in the table:
SCOTT@orcl_11gR2> column name format a5
SCOTT@orcl_11gR2> column doc  format a30
SCOTT@orcl_11gR2> select id, name, length (doc), utl_raw.cast_to_varchar2 (doc) doc
  2  from   test
  3  /

        ID NAME  LENGTH(DOC) DOC
---------- ----- ----------- ------------------------------
         1 test1           6 Tomcat
         2 test2          14 Datenbankebene
         3 test3          56 箭头显示在两个æ¹å'

3 rows selected.

SCOTT@orcl_11gR2> begin
  2    ctx_ddl.create_preference ('est_lexer', 'world_lexer');
  3  end;
  4  /

PL/SQL procedure successfully completed.

SCOTT@orcl_11gR2> create index sample_doc_idx on test (doc)
  2  indextype is ctxsys.context
  3  parameters ('lexer est_lexer')
  4  /

Index created.

SCOTT@orcl_11gR2> -- make sure data is indexed:
SCOTT@orcl_11gR2> select token_text from dr$sample_doc_idx$i
  2  /

TOKEN_TEXT
----------------------------------------------------------------
DATENBANKEBENE
TOMCAT
¨
´
¹Å
ĸ¤Ä¸ªÆ
Ť
Ƙ¾Ç¤ºÅŒ
Ç®

9 rows selected.

SCOTT@orcl_11gR2> -- test queries:
SCOTT@orcl_11gR2> select score (1) score, id, name,
  2  	    utl_raw.cast_to_varchar2 (doc) doc
  3  from   test
  4  where  contains (doc, 'Tomcat', 1) > 0
  5  order  by score (1) desc
  6  /

     SCORE         ID NAME  DOC
---------- ---------- ----- ------------------------------
         4          1 test1 Tomcat

1 row selected.

SCOTT@orcl_11gR2> select score (1) score, id, name,
  2  	    utl_raw.cast_to_varchar2 (doc) doc
  3  from   test
  4  where  contains (doc, 'Datenbankebene', 1) > 0
  5  order  by score (1) desc
  6  /

     SCORE         ID NAME  DOC
---------- ---------- ----- ------------------------------
         4          2 test2 Datenbankebene

1 row selected.

SCOTT@orcl_11gR2> select score (1) score, id, name,
  2  	    utl_raw.cast_to_varchar2 (doc) doc
  3  from   test
  4  where  contains (doc, '箭头显示在两个æ¹å'', 1) > 0
  5  order  by score (1) desc
  6  /

     SCORE         ID NAME  DOC
---------- ---------- ----- ------------------------------
         4          3 test3 箭头显示在两个æ¹å'

1 row selected.

SCOTT@orcl_11gR2>

Re: Multi-lingual index for blob column [message #513683 is a reply to message #513666] Tue, 28 June 2011 22:46 Go to previous messageGo to next message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

Hello Mam,

I tried your step and gor the below mentioned error in between.

SQL*Plus: Release 11.2.0.1.0 Production on Wed Jun 29 09:21:43 2011

Copyright (c) 1982, 2010, Oracle.  All rights reserved.

Enter user-name: sys/oracle as sysdba

Connected to:
Oracle Database 10g Enterprise Edition Release 10.2.0.1.0 - Production
With the Partitioning, OLAP and Data Mining options

SQL> select name from v$database;

NAME
---------
SOURCE

SQL> conn scott/tiger
Connected.
SQL> conn sys/oracle as sysdba
Connected.
SQL> grant execute on ctx_ddl to scott;

Grant succeeded.

SQL> grant dba to scott;

Grant succeeded.

SQL> conn scott/tiger
Connected.
SQL>
SQL> select * from tab;

TNAME                          TABTYPE  CLUSTERID
------------------------------ ------- ----------
DEPT                           TABLE
EMP                            TABLE
BONUS                          TABLE
SALGRADE                       TABLE
TIME_TEST                      TABLE

SQL> ed
Wrote file afiedt.buf

  1  create table test
  2    (id    number primary key,
  3     name  varchar2(2000),
  4*    doc   blob)
SQL> /

Table created.

SQL> insert into test (id, name, doc)
  2  values (1, 'test1', utl_raw.cast_to_raw ('Tomcat'));

1 row created.

SQL> insert into test (id, name, doc)
  2  values (2, 'test2', utl_raw.cast_to_raw ('Datenbankebene'));

1 row created.

SQL> insert into test (id, name, doc)
  2  values (3, 'test3', utl_raw.cast_to_raw ('?????????'));

1 row created.

SQL> commit;

Commit complete.

SQL> column name format a5
SQL> column doc  format a30
SQL> select id, name, length (doc), utl_raw.cast_to_varchar2 (doc) doc
  2  from   test;

        ID NAME  LENGTH(DOC) DOC
---------- ----- ----------- ------------------------------
         1 test1           6 Tomcat
         2 test2          14 Datenbankebene
         3 test3           9 ?????????

SQL> begin
  2    ctx_ddl.create_preference ('est_lexer', 'world_lexer');
  3  end;
  4  /

PL/SQL procedure successfully completed.

SQL> create index sample_doc_idx on test (doc)
  2  indextype is ctxsys.context
  3  parameters ('lexer est_lexer');

Index created.

SQL> select token_text from dr$sample_doc_idx$i;

TOKEN_TEXT
----------------------------------------------------------------
DATENBANKEBENE
TOMCAT

SQL> select score (1) score, id, name,
  2         utl_raw.cast_to_varchar2 (doc) doc
  3  from   test
  4  where  contains (doc, 'Tomcat', 1) > 0
  5  order  by score (1) desc;

     SCORE         ID NAME  DOC
---------- ---------- ----- ------------------------------
         4          1 test1 Tomcat

SQL> select score (1) score, id, name,
  2         utl_raw.cast_to_varchar2 (doc) doc
  3  from   test
  4  where  contains (doc, 'Datenbankebene', 1) > 0
  5  order  by score (1) desc
  6  /

     SCORE         ID NAME  DOC
---------- ---------- ----- ------------------------------
         4          2 test2 Datenbankebene

SQL> select score (1) score, id, name,
  2         utl_raw.cast_to_varchar2 (doc) doc
  3  from   test
  4  where  contains (doc, '?????????', 1) > 0
  5  order  by score (1) desc
  6  /
select score (1) score, id, name,
*
ERROR at line 1:
ORA-29902: error in executing ODCIIndexStart() routine
ORA-20000: Oracle Text error:
DRG-50901: text query parser syntax error on line 1, column 10


SQL> select score (1) score, id, name,
  2         utl_raw.cast_to_varchar2 (doc) doc
  3  from   test
  4  where  contains (doc, '?????????', 1) > 0
  5  order  by score (1) desc
  6  ;
select score (1) score, id, name,
*
ERROR at line 1:
ORA-29902: error in executing ODCIIndexStart() routine
ORA-20000: Oracle Text error:
DRG-50901: text query parser syntax error on line 1, column 10


SQL>



Should I proceed ignoring it or there is some reason why I got this error.

Thanks n Regards
Deepak
Re: Multi-lingual index for blob column [message #513688 is a reply to message #513683] Tue, 28 June 2011 23:22 Go to previous messageGo to next message
Barbara Boehmer
Messages: 9100
Registered: November 2002
Location: California, USA
Senior Member
Did you insert some data that was just displayed as a string of question marks or did you actually insert a string of questions marks? If you inserted a string of question marks, then they would correctly not be indexed, and a query with just question marks in the search string would correctly cause an error. It looks like that is what happened. When I ran my test case, I put the test case in a sql file, then ran that file, spooling the results. Because of the special characters, I saved the sql file that I ran as utf-8 and included a blank line at the top of the file.

[Updated on: Tue, 28 June 2011 23:23]

Report message to a moderator

Re: Multi-lingual index for blob column [message #513689 is a reply to message #513688] Tue, 28 June 2011 23:37 Go to previous messageGo to next message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

Hello Mam,

I have followed the same that you mentioned in your post and everything seems working now. Check out the result below.

SQL*Plus: Release 11.2.0.1.0 Production on Wed Jun 29 09:32:04 2011

Copyright (c) 1982, 2010, Oracle.  All rights reserved.


Connected to:
Oracle Database 10g Enterprise Edition Release 10.2.0.1.0 - Production
With the Partitioning, OLAP and Data Mining options

SQL> conn hr/hr
ERROR:
ORA-01017: invalid username/password; logon denied


Warning: You are no longer connected to ORACLE.
SQL> conn sys/oracle as sysdba
Connected.
SQL> create user hrtest idenitified by hr;
create user hrtest idenitified by hr
                   *
ERROR at line 1:
ORA-00922: missing or invalid option


SQL> create user hrtest identified by hr;

User created.

SQL> grant dba to hrtest;

Grant succeeded.

SQL> grant execute on ctx_ddl to hrtest;

Grant succeeded.

SQL> conn hrtest/hr
Connected.
SQL> select * from tab;

no rows selected

SQL> create table test
  2    2    (id    number primary key,
  3    3        name  varchar2(2000),
  4    4        doc   blob);
  2    (id    number primary key,
  *
ERROR at line 2:
ORA-00922: missing or invalid option


SQL> ed
Wrote file afiedt.buf

  1  create table test
  2      (id    number primary key,
  3     name  varchar2(2000),
  4*    doc   blob)
SQL> /

Table created.

SQL> ed
Wrote file afiedt.buf

  1  insert into test (id, name, doc)
  2* values (1, 'test1', utl_raw.cast_to_raw ('Tomcat'))
SQL> /

1 row created.

SQL> ed
Wrote file afiedt.buf

  1  insert into test (id, name, doc)
  2*  values (2, 'test2', utl_raw.cast_to_raw ('Datenbankebene'))
SQL> /

1 row created.

SQL> ed
Wrote file afiedt.buf

  1   insert into test (id, name, doc)
  2*  values (3, 'test3', utl_raw.cast_to_raw ('τ«¡σñ┤µÿ╛τñ║σ£¿Σ╕ñΣ╕¬µ╣σÉ'))
SQL> /

1 row created.

SQL> commit;

Commit complete.

SQL> column name format a5
SQL> column doc  format a30
SQL> select id, name, length (doc), utl_raw.cast_to_varchar2 (doc) doc from test;

        ID NAME  LENGTH(DOC) DOC
---------- ----- ----------- ------------------------------
         1 test1           6 Tomcat
         2 test2          14 Datenbankebene
         3 test3          25 τ«¡σñ┤µÿ╛τñ║σ£¿Σ╕ñΣ╕¬µ╣σÉ

SQL> ed
Wrote file afiedt.buf

  1  begin
  2   ctx_ddl.create_preference ('est_lexer', 'world_lexer');
  3*  end;
SQL> /

PL/SQL procedure successfully completed.

SQL> ed
Wrote file afiedt.buf

  1  create index sample_doc_idx on test (doc)
  2   indextype is ctxsys.context
  3*  parameters ('lexer est_lexer')
SQL> ;
  1  create index sample_doc_idx on test (doc)
  2   indextype is ctxsys.context
  3*  parameters ('lexer est_lexer')
SQL> /

Index created.

SQL> select token_text from dr$sample_doc_idx$i;

TOKEN_TEXT
----------------------------------------------------------------
DATENBANKEBENE
TOMCAT
¿
┤
─╕ñ─╕¬╞╣┼É
Ť
╞ÿ╛╟ñ║┼î
Ǯ

8 rows selected.

SQL> ed
Wrote file afiedt.buf

  1  select score (1) score, id, name,
  2    utl_raw.cast_to_varchar2 (doc) doc
  3    from   test
  4    where  contains (doc, 'Tomcat', 1) > 0
  5    order  by score (1) desc
  6*   /
SQL> /
  /
  *
ERROR at line 6:
ORA-00933: SQL command not properly ended


SQL> ed
Wrote file afiedt.buf

  1  select score (1) score, id, name,
  2    utl_raw.cast_to_varchar2 (doc) doc
  3    from   test
  4    where  contains (doc, 'Tomcat', 1) > 0
  5    order  by score (1) desc
  6*   /
SQL> /
  /
  *
ERROR at line 6:
ORA-00933: SQL command not properly ended


SQL> ed
Wrote file afiedt.buf

  1  select score (1) score, id, name,
  2    utl_raw.cast_to_varchar2 (doc) doc
  3    from   test
  4    where  contains (doc, 'Tomcat', 1) > 0
  5*   order  by score (1) desc
SQL> /

     SCORE         ID NAME  DOC
---------- ---------- ----- ------------------------------
         4          1 test1 Tomcat

SQL> ed
Wrote file afiedt.buf

  1  select score (1) score, id, name,
  2    utl_raw.cast_to_varchar2 (doc) doc
  3    from   test
  4    where  contains (doc, 'Datenbankebene', 1) > 0
  5*   order  by score (1) desc
SQL> /

     SCORE         ID NAME  DOC
---------- ---------- ----- ------------------------------
         4          2 test2 Datenbankebene

SQL> ed
Wrote file afiedt.buf

  1  select score (1) score, id, name,
  2    utl_raw.cast_to_varchar2 (doc) doc
  3    from   test
  4    where  contains (doc, 'τ«¡σñ┤µÿ╛τñ║σ£¿Σ╕ñΣ╕¬µ╣σÉ', 1) > 0
  5*   order  by score (1) desc
SQL> /

     SCORE         ID NAME  DOC
---------- ---------- ----- ------------------------------
         4          3 test3 τ«¡σñ┤µÿ╛τñ║σ£¿Σ╕ñΣ╕¬µ╣σÉ

SQL>


Now why I am not able to get the same result when I try to search the srting from the files stored in the table.

Regards
Deepak
Re: Multi-lingual index for blob column [message #513690 is a reply to message #513689] Tue, 28 June 2011 23:44 Go to previous messageGo to next message
Barbara Boehmer
Messages: 9100
Registered: November 2002
Location: California, USA
Senior Member
Start by checking that the documents are loaded properly:

select id, name, length (doc) from test;

Then check that they are indexed properly:

select token_text from dr$sample_doc_idx$i;


Re: Multi-lingual index for blob column [message #513691 is a reply to message #513690] Tue, 28 June 2011 23:54 Go to previous messageGo to next message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

Below ate the results of both the queries..

[code]
SQL> select id, name, length (doc) from test;

ID NAME
---------- ---------------------------------------------------------------
22 ABC.txt
23 Clustering.doc
24 connectivity.doc
41 backtrack_oracle_tutorial.pdf
42 Reading Logs Spanish.pdf
43 Pan-2.4-fr_FR.pdf
44 Kitchen-2.4-fr_FR.pdf
45 dutch.txt
61 german.odt
62 chinese.odt
63 japanese.odt

ID NAME
---------- ---------------------------------------------------------------
64 dutch.odt
81 backtrack_oracle_tutorial.pdf
82 dutch.odt

14 rows selected.

SQL> select token_text from dr$sample_doc_idx$i;

TOKEN_TEXT
----------------------------------------------------------------
0
02
0WNED
1
1
1.1
1.1
1.10A
1.3
1.4
10

TOKEN_TEXT
----------------------------------------------------------------
10
10.2.0.2
10.2.0.2.0
10G
11
11G
12
13
14
15
15,45

TOKEN_TEXT
----------------------------------------------------------------
1521
16
17
18
19
192.168.1.1
192.168.1.2
192.168.2
192.168.2.23
192.168.2.238
192.168.2.30

TOKEN_TEXT
----------------------------------------------------------------
1991
1999
19E
19K
2
2
2.0
2.0.55
2.1
2.2.2
2.4.0

TOKEN_TEXT
----------------------------------------------------------------
20
200
2000
20000
2001
2006
2007
2100
23
2┴V╙F╟
3

TOKEN_TEXT
----------------------------------------------------------------
3
30
31
3Bÿ▄
4
4
45
4âU┘╤┘
4╬
4╬BI6W
4╬C

TOKEN_TEXT
----------------------------------------------------------------
4╬┘
5
5
51
59
5PW
5╧2╩êF6╤╤╩FW▄╤╩F┌
6
6
66ÿ
6A

TOKEN_TEXT
----------------------------------------------------------------
6R
6R¼
6W
6W2

6êF
6ê┴F
6ê┴FV┴B
6ê╩╩V7B
6╤B┴G┴
6╤FF╤

TOKEN_TEXT
----------------------------------------------------------------
6╤FWVW72═
6╤W7B
6ш
6╤╩6R
6┌
6▄V6
6▄ÿV
7
7
77Vÿ
7B

TOKEN_TEXT
----------------------------------------------------------------
7C
7CH01
7F
7F╤
7G
7NMN
7W
7╙7
7╙7FV
8
8

TOKEN_TEXT
----------------------------------------------------------------
8.1.7.4
80
8005
8009
8080
8080
8081
8082
8090
8105
8109

TOKEN_TEXT
----------------------------------------------------------------
8205
8209
8443
8I
8K
9
9
9.2.0.1
9.2.0.6
9.2.0.8
99

TOKEN_TEXT
----------------------------------------------------------------
99N
99P
99PP
99W
99WP╞PW
99WQBA
99Y
9B
9C
9I
9MM

TOKEN_TEXT
----------------------------------------------------------------
9MN
W
W2
W66
W6R
W6VGV¼
W6W
W7V
W9
W9NW
W9W

TOKEN_TEXT
----------------------------------------------------------------
WA
WAAR
WAARDOOR
WAARNAAR
WACC
WACW
WAN
WANT
WANT
WARC
WARRANTIES

TOKEN_TEXT
----------------------------------------------------------------
WARRANTY
WAY
WAYS
WAYS
WA╞MPMP
WA╞MW
WB
WCN
WEAK
WEB
WEB

TOKEN_TEXT
----------------------------------------------------------------
WEBAPPS
WEBBROWSER
WEBSERVER
WEBSITE
WEDNESDAY
WEEKDAY
WEER
WELCOME
WELL
WFV
WFV¼

TOKEN_TEXT
----------------------------------------------------------------
WF╤
WF▄V┴F╤6
WGFW
WGW
WH
WHETH
WHITEPAPER
WHOLE
WI
WID
WIDE

TOKEN_TEXT
----------------------------------------------------------------
WIDELY
WIDEST
WIKÿ
WIN32
WINDO
WINDOWS
WINDOWS
WIS
WISH
WIT
WITHOU

TOKEN_TEXT
----------------------------------------------------------------
WITHOUT
WITHOUT
WM
WMMP
WMN
WMNAN
WN
WN9NW
WN9PNN9PM
WN9PNN9W
WN9WQ

TOKEN_TEXT
----------------------------------------------------------------
WN9╞M
WNOES
WNW
WOR
WORD
WORDEN
WORDT
WORK
WORKER
WORKERS
WORKING

TOKEN_TEXT
----------------------------------------------------------------
WORKS
WP
WPM
WPMWW9
WPN
WPP
WP╞
WP╞PW
WP╞W
WP╞WNW
WP╞WP

TOKEN_TEXT
----------------------------------------------------------------
WP╞WWN
WP╞WW╞
WQBA
WQM
WQN
WQ╞
WRI
WRITE
WRITING
WRITK
WRITTEN

TOKEN_TEXT
----------------------------------------------------------------
WS
WSFPM
WW
WW9
WWAN
WWW
WY
WYD

W┴
W┴FW

TOKEN_TEXT
----------------------------------------------------------------
W╞
W╞BANQNAN9
W╞MW
W╞N
W╞PM
W╞╞
W▄V7WF
W▄V7WFR
W▄V7WFVB
X
X86

TOKEN_TEXT
----------------------------------------------------------------
XML
XML
Y
YAW
YCM
YCTPP
YEAR
YEARS
YES
YI
YOC

TOKEN_TEXT
----------------------------------------------------------------
YOYODYNE
YPP╞
YPW

Yÿ
YúC
YúÿKêD
Y
Z
ZC
ZF

TOKEN_TEXT
----------------------------------------------------------------
ZFWN
ZFWP╞WP
ZIE
ZIEN
ZIJN
ZIP
ZOEKEN
âU┘╤┘
â┘4
â┘╧
ê

TOKEN_TEXT
----------------------------------------------------------------
ê5
ê55
ê6
êB
êD
êNN
êR
êVF╤
êW┴
êê
êÿ

TOKEN_TEXT
----------------------------------------------------------------
êó
ê╩
ê╩VW
ê╙B
ê╙G2êÿ
ÿ
ÿ6╤B╩┴F
ÿ76╤
ÿ7B
ÿ7FVB
ÿB

TOKEN_TEXT
----------------------------------------------------------------
ÿF
ÿFV7FVB
ÿFV7FVB═
ÿV╩B
ÿV╤╩R
ÿW
ÿWB
ÿÿ
ÿ╞ó
ÿ╟
ó

TOKEN_TEXT
----------------------------------------------------------------
ó9
ó9Yê
óA
óD
óI
óK
óê
óó
óú
ú
ú5

TOKEN_TEXT
----------------------------------------------------------------
úBP
úD
úIÿ
úK
ú╞úD
ª

¬
¬C
¬E
¬F

TOKEN_TEXT
----------------------------------------------------------------
¬FP
¬FPW
¬K
¬PMK

¼

┤┤


┴2

TOKEN_TEXT
----------------------------------------------------------------
┴6ê
┴7
┴F
┴V┴
┴V╙F╟

╞AN
╞AN9
╞B
╞C
╞CA╞

TOKEN_TEXT
----------------------------------------------------------------
╞CC
╞CC8
╞CK
╞CN
╞CNC
L
LA
LAGEN
LANCER
LANCEURS
LANCEZ

TOKEN_TEXT
----------------------------------------------------------------
LANGUC╞
LAQUELLE
LARGER
LATEN
LATER
LATEST
LAUNCHES
LAW
LAX
LAYOUTS
LB

TOKEN_TEXT
----------------------------------------------------------------
LBFACTOR
LCBA
LCN
LE
LEAST
LEAVE
LEFT
LENG
LEQUEL
LES
LESS

TOKEN_TEXT
----------------------------------------------------------------
LESSER
LEVE
LEVEL
LEVELCRPMKN
LGPL
LIBC
LIBDó
LIBRAR
LIBRARIES
LIBRARY
LIBRARYK

TOKEN_TEXT
----------------------------------------------------------------
LIBRC╞
LIC
LICECC
LICECW
LICEN
LICENCE
LICENS
LICENSE
LICENSED
LICENSED9YD
LICENSEE

TOKEN_TEXT
----------------------------------------------------------------
LICENSES
LICENSOR
LIDA
LIEU
LIGN
LIGNE
LIGNES
LIKE
LIMCNAMK
LIMCNAMP
LIMITATION

TOKEN_TEXT
----------------------------------------------------------------
LIMITED
LINE
LINES
LINK
LINK
LINKED
LINKING
LINUX
LIS
LIST
LIST

TOKEN_TEXT
----------------------------------------------------------------
LISTDIR
LISTE
LISTENER
LISTENERS
LISTENING
LISTER
LISTJOBS
LISTREP
LISTTRANS
LITTLE
LOAD

TOKEN_TEXT
----------------------------------------------------------------
LOADBALANCER
LOADMODULE
LOCALHOST
LOCATIE
LOG
LOG
LOGIN
LOGOUT
LOGS
LONG
LOSSES

TOKEN_TEXT
----------------------------------------------------------------
LUISTERAAR
LY
M
MA
MAAR
MACHINE
MACHINE
MACHINES
MACOSX
MAD
MADE

TOKEN_TEXT
----------------------------------------------------------------
MAILING
MAIS
MAKE
MAKE
MAKING
MANAGEN
MANAGER
MANDATORY
MANUEL
MANY
MARVEL

TOKEN_TEXT
----------------------------------------------------------------
MAT
MATERIALS
MATION
MATI╚RES
MATT
MAX
MAY
MAY
MC
MC¬
MC¬FPP

TOKEN_TEXT
----------------------------------------------------------------
MEANINGFUL
MEANS
MEANS
MECHANISM
MEDIUM
MEEGELEVERD
MEER
MEET
MEILLEURE
MENTS
MENU

TOKEN_TEXT
----------------------------------------------------------------
MERCHANTABILITK
MERCHANTABILITY
MERE
MERK
MESSAGE
9N
9NP
9NPA
9NPM
9NPMW
9NPW

TOKEN_TEXT
----------------------------------------------------------------
9NP╞WM
9NW
9NWQBA
9P
9PA
9PANP9W
9PM
9PMAN9NPMWW
9PMC
9PMCFP
9PMP

TOKEN_TEXT
----------------------------------------------------------------
9PMPN
9PMW
9PMW╞N
9PN
9PP
9PPN9
9PPP
9PW
9Pó
9P╞
9P╞P

TOKEN_TEXT
----------------------------------------------------------------
9P╞WM
9P╞WNP
9P╞WNW
9Q
9QM
9QN
9QRPMPN
9S
9SFWN
9W
9W9

TOKEN_TEXT
----------------------------------------------------------------
9WB
9WM
9WMMA
9WMN
9WMN╞
9WN
9WN9A╞NP
9WN9╞M
9WQ
9WQBA
9WQN

TOKEN_TEXT
----------------------------------------------------------------
9WóA
9W╞
9W╞╞
9Y
9YA╞WNW
9YC
9YI
9╞
A1
AB
ABORD

TOKEN_TEXT
----------------------------------------------------------------
ABOUT
ABOV
ABOVE
AC
ACCEC
ACCEPT
ACCEPTAN
ACCESSIBLE
ACCESSORS
ACCNCM
ACCOMPANY

TOKEN_TEXT
----------------------------------------------------------------
ACCOMPANYING
ACE
ACHIEVE
ACK
ACTIVITIES
ACWNWM
ADD
ADD
ADDED
ADDITION
ADDRESS

TOKEN_TEXT
----------------------------------------------------------------
ADDRESSED
ADDRESSES
ADMIN
ADRESSE
ADVANTAGE
AF
AFBEELDING
AFFE
AFFICHAGE
AFFICHE
AFFICHER

TOKEN_TEXT
----------------------------------------------------------------
AFIN
AFTER
AGAIN
AGGREGATION
AGREEMENT
AINSI
AJOUT
AJP
AJP13
AK
AL

TOKEN_TEXT
----------------------------------------------------------------
ALEXORA1
ALIAS
ALLEEN
ALLEZ
ALLOWED
ALONG
ALORS
ALREA
ALREADY
ALREC
ALS

TOKEN_TEXT
----------------------------------------------------------------
ALTER
ALUES
AM
AMAP
AMC
AMK
AMKNAN
AMK╞
AMMELY
AMOCK
AMOCP

TOKEN_TEXT
----------------------------------------------------------------
AMP
AMPNAN
AMP╞
AMW9
AMW9N
AMW9NP
AMW9NPMWN
AMW9NWQBA
AMW9W
EXE
EXECUTABLE

TOKEN_TEXT
----------------------------------------------------------------
EXECUTABLES
EXECUTE
EXECUTED
EXECUTION
EXEMPLE
EXEMPLES
EXER
EXERCISE
EXPLAIN
EXPLAINS
EXPLICIT

TOKEN_TEXT
----------------------------------------------------------------
EXPLOIT
EXPLORATE
EXPOR
EXPORT
EXPORTER
EXPREP
EXPRESS
EXPRESSED
EXPRESSLY
EXT
EXTCóD

TOKEN_TEXT
----------------------------------------------------------------
EXTCóK
EXTENSION
EXTPROC
EX╔CUTABLES
EX╔CUTE
EX╔CUTER
EX╔CUTION
EX╔CUT╔
EX╔CUT╔E
EX╔CUT╔ES
EZ

TOKEN_TEXT
----------------------------------------------------------------
EêW┴FV┴6╤ê
Eÿ
F
F6┴6WB
F6▄
FA
FACILE
FACILITIE
FACILITIES
FACILITY
FACTO

TOKEN_TEXT
----------------------------------------------------------------
FAI
FAILURE
FAIR
FAIRE
FAIT
FALL
FALLS
FAUT
FA╟
FA╟O
FC

TOKEN_TEXT
----------------------------------------------------------------
FCM
FEBRUARY
FEE
FFÿ
FI
FIC
FICATEUR
FICHE
FICHI
FICHIE
FICHIER

TOKEN_TEXT
----------------------------------------------------------------
FIFTH
FIGURATION
FIL
FILE
FILE
FILES
FILES
FIN
FINALLY
FIND
FINE

TOKEN_TEXT
----------------------------------------------------------------
FINIR
FINS
FIRST
FITNESS
FITS
FIXING
FLOOR
FO
FOC
FOI
FOIS

TOKEN_TEXT
----------------------------------------------------------------
FOLDER
FOLDERS
FOLLOW
FOLLOWING
FOLLOWING
FOLLOWS
FON
FONC
FORM
FORMATION
FORMATIONS

TOKEN_TEXT
----------------------------------------------------------------
FORME
FORMER
FORMES
FORMING
FORWORD
FOU
FOUCCM
FOUCWM
FOUNDATION
FOUNDATION
FOURNI

TOKEN_TEXT
----------------------------------------------------------------
FP
FPA
FPM
FPMW
FPMWP
FPMWPW
FPN
FPP
FPP╞
FP╞WNP
FP╞WNPN

TOKEN_TEXT
----------------------------------------------------------------
FR
FRAN
FRANKLIN
FRAN╟AISE
FRE
FREE
FREEDOM
FRI
FROB
FTP
FULL

TOKEN_TEXT
----------------------------------------------------------------
FUNC
FUNCTION
FUNCTIONS
FV7
FVB
FV╩W
FW
FWN
FWP╞PW
FWQ
FW╞

TOKEN_TEXT
----------------------------------------------------------------
FY
FYB

FêW2
FÿV╩
FÿV╩B
F┴66
F╟
F╔RENTIEL
F╔RENTIELS
F╔RENTS

TOKEN_TEXT
----------------------------------------------------------------
F╔R╔
F╩76W
F╤
F╤6V╩
F╤6W2
F╤B
F╤FFW
Fш
F╤ê┴2
F╤╩R
F╙

TOKEN_TEXT
----------------------------------------------------------------
F┌
F▄
F▄R
F▄W6R
F▄╙2
G
G2¼
G2┴G┴B
GAIN
GAME
GATEWAY

TOKEN_TEXT
----------------------------------------------------------------
GEBRUIK
GEDISTRIBUEERD
GEEN
GENERAL
GENERALLY
GENERC
GENEROUS
GEOGRAPHICAL
GESCHIKT
GET
GETTING

TOKEN_TEXT
----------------------------------------------------------------
GEVAL
GEWOON
GE╧NSTALLEERD
GE╧NSTALLEERDE
GFW
GIVE
GIVECCC
GIVECWW
GIVEN
GLANCE
GLOGI

TOKEN_TEXT
----------------------------------------------------------------
GLOGIN
GMFLAGS
GNU
GO
GRAFIEK
GRAFISCH
GRALIT╔
GRANDE
GRANT
GRAP
GRATUITEMENT

TOKEN_TEXT
----------------------------------------------------------------
GRC╞C
GRC╞W
GREATER
GUARANTEE
GUESS
GUESSED
GUIDE
GUIDED
GV
GV╩7F╤ê
GW2

TOKEN_TEXT
----------------------------------------------------------------
GWFÿ
GWGW
G╔ND
H
HA
HABITUELLEMENT
HACKER
HACKING
HAPPEN
HARDENING
HELLOWORLD

TOKEN_TEXT
----------------------------------------------------------------
HEREBY
HERZIENING
HET
HEURE
HEURES
HIER
HIGHER
HIQUE
HO
HOLDER
HOME

TOKEN_TEXT
----------------------------------------------------------------
HOME
HOME1
HOME2
HOP
HORIZONTAL
HOST
HOST
HOUR
HTML
HTTP
HTTP

TOKEN_TEXT
----------------------------------------------------------------
HTTPD
I2
IC
ICCC
ICCCóA
ICWW
ICWWóA
ID
ID
IDENTIFIABLE
IDENTIFIED

TOKEN_TEXT
----------------------------------------------------------------
IDENTIFY
IES
IL
ILLUSTRATIE
IMMEDIATE
IMPLEMENT
IMPLEMENTED
IMPLEMENTED
IMPLIED
IMPO
IMPORTANT

TOKEN_TEXT
----------------------------------------------------------------
IMPORTANTES
IMPORTE
IMPOSE
IMPOSED
IMPOSSIBLE
INACCURATE
INATTENDUE
INC
INCLUDE
INCLUDED
INCLUDING

TOKEN_TEXT
----------------------------------------------------------------
INCORPORATE
IND
INDEPENDENT
INDEX
INDICATE
INDOWS
INFORMATIE
INFORMATION
INFORMATIONS
INFRASTRUCTUUR
INFRINGEMENT

TOKEN_TEXT
----------------------------------------------------------------
ING
INITIALISER
INJECT
INSCRIRE
INSTALL
INSTALL
INSTALLATION
INSTALLATION
INSTALLED
INSTALLED
INSTALLEREN

TOKEN_TEXT
----------------------------------------------------------------
INSTALLING
INSTALLS
INSTALL╔
INSTANCE
INSTANCE
INSTANCES
INSTANCES
INSTEAD
INSTRUCTIO
INT
INTEGER

TOKEN_TEXT
----------------------------------------------------------------
INTEGRATION
INTEGRITY
INTEND
INTENDED
INTENT
INTERF
INTERFACE
INTERVALLE
INTERVALLES
INTRODUCED
INT╔

TOKEN_TEXT
----------------------------------------------------------------
INT╔G
INT╔GRAL
INVOKED
ION
IP
ISOLATION
ISTENER
ITALIQUE
ITSELF
Iÿ
Iúÿ╞

TOKEN_TEXT
----------------------------------------------------------------
I╔
JAR
JAVA
JB
JE
JK
JKLOGFILE
JKLOGLEVEL
JKMOUNT
JKWORKERSFILE
JO

TOKEN_TEXT
----------------------------------------------------------------
JOB
JOBS
JOU
JOUR
JOURN
JOURNA
JOURNAL
JOURNALI
JOURNALISATION
JOURNALISE
JOURS

TOKEN_TEXT
----------------------------------------------------------------
JRE
JVMROUTE
K
KA
KANP
KC
KCZ
KENMERKEN
KERNEL
KET
KETTLE

TOKEN_TEXT
----------------------------------------------------------------
KEYS
KIND
KITCHEN
KJ
KJB
KK
KKM
KM
KMANP╞C
KMC
KMCA

TOKEN_TEXT
----------------------------------------------------------------
KMCC
KMCK
KMCKC
KMCK╞
KMCK╞CC
KMCM
KMCMM
KMC╞N
KMK
KMKA
KMT

TOKEN_TEXT
----------------------------------------------------------------
KN
KNB
KNN
KNOW
KNOWLEDGE
KNOWN
KP
KPN
KP╞
KT
KTR

TOKEN_TEXT
----------------------------------------------------------------
KUNNEN
KUNT
Kÿ

KúPú
K╞
K╞C
K╞CC
K╞CM
K╞CN
K╞CNC

TOKEN_TEXT
----------------------------------------------------------------
K╞CNCMN
K╞CNK
K╞CNKN
K╞CNKP╞
K╞CNP7T
K╞P
K╔E
K╔ES
MET
METTRE
MFYCTP

TOKEN_TEXT
----------------------------------------------------------------
MICROSYSTEMS
MID
MILIEU
MINIMAL
MINIMALISEERT
MINIMUM
MINUTE
MINUTES
MISE
MK
MK╞

TOKEN_TEXT
----------------------------------------------------------------
MM
MMAN
MMAN9
MMAN9NP
MMANP
MMC
MMCFPN
MMCFW
MMK
MMK╞
MMP

TOKEN_TEXT
----------------------------------------------------------------
MMP╞
MN
MOD
MOD
MODCMAN
MODCMAN9NP
MODE
MODICAN
MODICAN9N
MODICAN9NW
MODIFCNP

TOKEN_TEXT
----------------------------------------------------------------
MODIFI
MODIFICATION
MODIFICATIONS
MODIFIED
MODIFY
MODIFYING
MODULE
MODULES
MODULES
MOGELIJKHEID
MOIS

TOKEN_TEXT
----------------------------------------------------------------
MONDAY
MONTH
MORE
MOST
MOST
MOT
MOUNT
MP
MPLE
MPORTE
MPS

TOKEN_TEXT
----------------------------------------------------------------
MP╞
MSI
MUST
MW

M¬FYI
M╩ME
N
N9
N9P
N9PA

TOKEN_TEXT
----------------------------------------------------------------
N9PN
N9P╞
N9W
N9╞
NA
NAAR
NAL
NAME
NAME
NBúNÿ
NC

TOKEN_TEXT
----------------------------------------------------------------
NCHE
NCQBA
NCTION
NDOWS
NE
NEARLY
NECESSARILY
NECESSARY
NEED
NEED
NETCAT

TOKEN_TEXT
----------------------------------------------------------------
NEVER
NEW
NEW
NEWBLOCK
NEWER
NEXT
NF
NFPP╞
NFP╞
NFP╞W
NFW

TOKEN_TEXT
----------------------------------------------------------------
NIE
NIET
NIFIE
NIR
NIVEAU
NK
NKA
NKA╞N
NKC
NKM
NKMC

TOKEN_TEXT
----------------------------------------------------------------
NKMCC
NKMCN
NKP╞
NK╞CM
NMAP
NN
NN9
NN9NW
NODIG
NODIGE
NOM

TOKEN_TEXT
----------------------------------------------------------------
NOMBRE
NOREP
NORMAL
NORMALLY
NOTES
NOTHING
NOTICE
NOTICES
NOV
NOW
NOW

TOKEN_TEXT
----------------------------------------------------------------
NOWADAYS
NOYAU
NP
NP9
NPA╞N
NPP
NPPKC
NPPK╞
NPPPW
NPPP╞
NPP╞

TOKEN_TEXT
----------------------------------------------------------------
AN9
ANA
ANCIENNEMENT
ANGLAIS
ANN╔E
ANO
ANON
ANOTHER
ANOTHER
ANP
ANQNC

TOKEN_TEXT
----------------------------------------------------------------
ANQNCFPP
ANT
AP
APACHE
APACHE
APACHE2
APEX
API
APPARAISSE
APPLICATIE
APPLICATIEBEVEILIGING

TOKEN_TEXT
----------------------------------------------------------------
APPLICATION
APPLICATION
APPLIES
APPLY
APPROPRIATE
APPROPRIATE
APPROPRIATELY
APRI
APRIL
APR╚S
ARCHITECTUUR

TOKEN_TEXT
----------------------------------------------------------------
ARCHIVE
ARGU
ARISING
ARROWS
ASE
ASK
ATTACH
ATTACK
ATTECCAN
ATTECWAN
ATTEMPT

TOKEN_TEXT
----------------------------------------------------------------
ATTENTION
AU
AUQUEL
AUSSI
AUTH
AUTHID
AUTHOR
AUTHORIZED
AUTOMATICALL
AUTOMATICALLY
AUTONOMOUS

TOKEN_TEXT
----------------------------------------------------------------
AVAILABLE
AVAILABLE
AVANT
AVE
AVEC
AVEZ
AWARE
A╞C
A╞M
A╞MCCN
A╞MCCQBA

TOKEN_TEXT
----------------------------------------------------------------
A╞MCK
A╞MKMK
A╞MPMP
A╞MWP
A╞MWWN9╞M
A╞MWWQBA
A╞NP
A╞W
B
B7
BA

TOKEN_TEXT
----------------------------------------------------------------
BACKTRACK
BACKTRACK20
BALANCE
BALANCED
BALANCING
BAS
BASED
BASIC
BASICALLY
BAT
BAT

TOKEN_TEXT
----------------------------------------------------------------
BECOME
BEELDEN
BEGIN
BEGINNING
BEHEER
BEHEERT
BEHULP
BEIDE
BELANGRIJKSTE
BELOW
BESOIN

TOKEN_TEXT
----------------------------------------------------------------
BESTAANDE
BESTAAT
BESTAND
BESTANDEN
BESTANDSSYSTEEM
BETWEEN
BEVAT
BIEDT
BIJ
BIJVOORBEELD
BIN

TOKEN_TEXT
----------------------------------------------------------------
BINARY
BLOCK
BO
BODY
BREAK
BROWSER
BROWSER
BRUTEFORCE
BS
BT20
BUGTRAQ

TOKEN_TEXT
----------------------------------------------------------------
BUILDER
B═
C
C
CA
CALL
CALLED
CARTE
CAS
CASE
CASES

TOKEN_TEXT
----------------------------------------------------------------
CATALINA
CATALINE
CAUSE
CA¬
CA╞MC
CA╞MKMK
CB
CC
CCAN
CCZ
CD

TOKEN_TEXT
----------------------------------------------------------------
CE
CECI
CELA
CELLE
CERTAIN
CES
CET
CETTE
CF
CFP
CFPP

TOKEN_TEXT
----------------------------------------------------------------
CFW
CFW9P9
CFWQ
CFWó
CGI
CHA
CHACUNE
CHANC
CHANG
CHANGCMK
CHANGCMP

TOKEN_TEXT
----------------------------------------------------------------
CHANGE
CHANGE
CHANGED
CHANGES
CHANGES
CHANGING
CHAQUE
CHARG
CHARGE
CHARGEMENT
CHARIOT

TOKEN_TEXT
----------------------------------------------------------------
CHECK
CHECK
CHECKPWD
CHECKS
CHEF
CHEMIN
CHMOD
CHOICE
CHOISIR
CHOIX
CHOO

TOKEN_TEXT
----------------------------------------------------------------
CI
CIRCUMSTANCE
CIRCUMSTANCES
CK
CKK
CKMCC
CKN
CKP
CK╞
CK╞C
CK╞CNC

TOKEN_TEXT
----------------------------------------------------------------
CLAI
CLAIMS
CLASH
CLASSNAME
CLI
CLOSE
CLOSE
CLUSTER
CLUSTERING
CLUSTERS
CM

TOKEN_TEXT
----------------------------------------------------------------
CMMA
CMMK
CMN
CMNAN
CMN╞
CN
CNB
CNC
CNOES
CO
COCC

TOKEN_TEXT
----------------------------------------------------------------
COCW
COD
CODE
CODE
CODES
COLLABORATION
COLLECTING
COLLECTION
COLLECTIVE
COM
COMBINAISON

TOKEN_TEXT
----------------------------------------------------------------
COMBINATION
COMBINED
COME
COMING
COMMAN
COMMAND
COMMANDE
COMMANDS
COMME
COMMEN
COMMENCEZ

TOKEN_TEXT
----------------------------------------------------------------
COMMENT
COMMENTS
COMMIT
COMMUNICATES
COMPATIBLE
COMPELLED
COMPETING
COMPILER
COMPL
COMPLET
COMPLETE

TOKEN_TEXT
----------------------------------------------------------------
COMPONENTS
COMPUTE
CON
CONCERNANT
CONDI
CONDITIOCC
CONDITIOCW
CONDITION
CONDITIONS
CONF
CONF

TOKEN_TEXT
----------------------------------------------------------------
CONFIGURATIE
CONNECT
CONNECTE
CONNECTER
CONNECTING
CONNECTIONTIMEOUT
CONNECTOR
CONSEQUENCE
CONSEQUENTIAL
CONSIDERED
CONSISTENT

TOKEN_TEXT
----------------------------------------------------------------
CONSPICUOUSLY
CONSULTEZ
CONTA
CONTAINING
CONTAINS
CONTRADICT
CONTRIBUTIONS
CONTROL
CONVENIENTLY
CONVEY
COON

TOKEN_TEXT
----------------------------------------------------------------
COP
COPCN
COPCN9
COPIES
COPY
COPY
COPYING
COPYRIGHT
COPYRIGHTED
CORPORATCN
CORRECTION

TOKEN_TEXT
----------------------------------------------------------------
CORRESPONDING
COST
COUNTRIES
COUR
COURANTE
COURTE
COVERED
CP
CP8
CPM
CPP

TOKEN_TEXT
----------------------------------------------------------------
CPR
CP╞
CP╞C
CP╞CCN
CP╞CC╞
CP╞CK
CP╞KC
CQ
CQBA
CQM
CQN

TOKEN_TEXT
----------------------------------------------------------------
CQ╞
CREATE
CREATE
CREATED
CREATES
CRE╦REN
CRIT
CRITERIA
CRON
CRONTAB
CR╔ER

TOKEN_TEXT
----------------------------------------------------------------
CS
CTION
CU
CURRENT
CUS
CUSTOMARILY
CUSTOMER
CW

CóA


TOKEN_TEXT
----------------------------------------------------------------

C╞
C╞BANQNAN
C╞CK╞P
C╞MC
C╞N
C╞PM
C╞WP╞P
C╞╞
C═
D9

TOKEN_TEXT
----------------------------------------------------------------
D9YK
DADS
DAMAGES
DANS
DASDF
DAT
DATA
DATABA
DATABAS
DATABASE
DATABASES

TOKEN_TEXT
----------------------------------------------------------------
DATE
DAY
DB
DBA
DBAS
DBMS
DBSNMP
DCN
DE
DEBUG
DEBUGGING

TOKEN_TEXT
----------------------------------------------------------------
DECISION
DECLARE
DEE
DEF
DEFAULT
DEFAULTHOST
DEFINITION
DEFINITIONS
DENY
DEPEND
DEPUIS

TOKEN_TEXT
----------------------------------------------------------------
DERIC
DERIVA
DERIVATCM
DERIVATIVE
DERIVATIVES
DERIVED
DES
DESC
DESIGNATED
DESIGNED
DESSOUS

TOKEN_TEXT
----------------------------------------------------------------
DETAIL
DETAILED
DETAILS
DEVELOP
DEVELOPER
DEVREZ
DEZE
DI
DICTIONARY
DIE
DIF

TOKEN_TEXT
----------------------------------------------------------------
DIFFER
DIFFERENT
DIFFERENT
DIFF╔RENTS
DIMA
DIMECC╞N
DIMENSCN
DIMENSI
DIMENSIOCC¬
DIMENSIONS
DIR

TOKEN_TEXT
----------------------------------------------------------------
DIREC
DIRECTING
DIRECTL
DIRECTORY
DIRECTORY
DIS
DISCLAIMS
DISPLAYS
DISPOSANT
DISPOSE
DISTINCTES

TOKEN_TEXT
----------------------------------------------------------------
DISTINGUISHICK
DISTINGUISHICP
DISTRCBA
DISTRIBUT
DISTRIBUTCN
DISTRIBUTE
DISTRIBUTED
DISTRIBUTING
DISTRIBUTIOCCCQN
DISTRIBUTIOCWWQN
DISTRIBUTION

TOKEN_TEXT
----------------------------------------------------------------
DISTRIBUTOR
DISTRIBUTORS
DISTRIDA
DOCUMENT
DOCUMENTATI
DOE
DOET
DOING
DOIT
DOMAIN
DON

TOKEN_TEXT
----------------------------------------------------------------
DONN╔ES
DONN╔S
DONOR
DONT
DOOR
DOS
DOUBLES
DOUTE
DOWN
DOWNLOAD
DRAAIEN

TOKEN_TEXT
----------------------------------------------------------------
DRAAIT
DRIE
DU
DURANT
DURING
DU╓I44╤T
DU╓┘
DYNAMISCHE
Dÿ

D┬2

TOKEN_TEXT
----------------------------------------------------------------
D╔
D╔COMPRE
D╔COMPRESS
D╔CRIVENT
D╔FAUT
D╔FI
D╔FINIE
D╔FINIR
D╔FINIS
D╔FINIT
D╔ROUL╔E

TOKEN_TEXT
----------------------------------------------------------------
D╔SIRER
D╔SIREZ
D╔TAILL╔
D╔TAILL╔E
E
E
EACH
EASIEST
EC
ECTION
EC╞

TOKEN_TEXT
----------------------------------------------------------------
ED
EDIT
EDITOR
EE
EEN
EERDERE
EFFECTIVELY
EFORCE
ELLE
ELSE
EM

TOKEN_TEXT
----------------------------------------------------------------
EMBEDDED
EMENT
EMPLOYER
EN
ENABLES
ENCOURAGE
END
ENFORCINC
ENGINE
ENGINEERING
ENSUITE

TOKEN_TEXT
----------------------------------------------------------------
ENSURE
ENT
ENTIEL
ENTIRE
ENTIRELY
ENTRANT
ENTREP╘T
ENTREREZ
ENTREZ
ENV
ENVIRONMENT

TOKEN_TEXT
----------------------------------------------------------------
ENVIRONMENT
ENVIRONMENTS
ENVIRONNEMENT
ER
ERFACE
ERREUR
ERREURS
ERROR
ERROR
ERRORS
ERSION

TOKEN_TEXT
----------------------------------------------------------------
ERVER
ES
ESBê
ESCALATE
ESCALATING
ESCALATION
ESPACES
ESPECIALLY
EST
ET
EU

TOKEN_TEXT
----------------------------------------------------------------
EVEN
EVENT
EVERY
EVERYONE
EXAMPLE
EXCEPT
EXCEPTION
EXCLUDED
EXCLUSION
EXCUSE
PER

TOKEN_TEXT
----------------------------------------------------------------
PER
PERFECT
PERFORMING
PERIENCE
PERMET
PERMI
PERMISSCN
PERMISSION
PERMISSIONS
PERMIT
PERMITS

TOKEN_TEXT
----------------------------------------------------------------
PERMITTED
PERTINENT
PEUT
PEUVENT
PHYSI
PILE
PK
PKA
PKC
PKP╞
PL

TOKEN_TEXT
----------------------------------------------------------------
PLAC
PLACE
PLACES
PLACEZ
PLANI
PLANIFI
PLANIFICATION
PLANIFIER
PLANIFI╔ES
PLANNIFIER
PLATE

TOKEN_TEXT
----------------------------------------------------------------
PLATEF
PLATES
PLS
PLSQL
PLUGGIN
PLUS
PM
PMAN
PMANP╞W
PMC
PMK

TOKEN_TEXT
----------------------------------------------------------------
PMPA
PMPN
PMT
PMW
PMWA
PMWM
PMWMM
PMWP╞
PMWP╞WW9
PMWW
PN

TOKEN_TEXT
----------------------------------------------------------------
PN9
PNC
PNK
PNKMK
PNN
PNP
PNP9NW
PNP9W
PNPMP
PNQ
PNQMK

TOKEN_TEXT
----------------------------------------------------------------
PNQMP
PNQNAN
PNQNAN9
PNQNC
PNW
POINTER
POR
PORT
PORT
PORTION
PORTIONS

TOKEN_TEXT
----------------------------------------------------------------
PORTS
PORTS
POSE
POSER
POSITIONN╔
POSSIBILITIES
POSSIBLE
POSSIBLES
POUR
POURREZ
POUVE

TOKEN_TEXT
----------------------------------------------------------------
POUVEZ
PP
PP9NY
PPA
PPC
PPK
PPKC
PPKMCQMC
PPKN
PPM
PPP

TOKEN_TEXT
----------------------------------------------------------------
PPPMWQMW
PPPN
PPPW
PPP╞
PPW
PPW99NW
PPó
PP╞
PP╞WN
PRACTI
PRAGMA

TOKEN_TEXT
----------------------------------------------------------------
PRD
PREAMBLE
PRECISE
PREMIER
PREMI╚RE
PRENDRE
PREPARED
PRES
PRESIDENT
PREVIOUS
PRI

TOKEN_TEXT
----------------------------------------------------------------
PRICE
PRIVI
PRIVIL
PRIVILEGE
PRIVILEGES
PROBLEM
PROBLEMS
PROBL╚ME
PROCEDURE
PROD
PRODUCT

TOKEN_TEXT
----------------------------------------------------------------
PRODUCTIE
PRODUCTION
PROG
PROGRAM
PROGRAM
PROGRAMS
PROHIBITED
PROMINENT
PROMPT
PROPERLY
PROPERLY

TOKEN_TEXT
----------------------------------------------------------------
PROPERTIES
PROPERTY
PROPRIETARY
PROTECT
PROTECTED
PROTECTION
PROTOCOL
PROVCNN
PROVCNN9NP
PROVIDE
PROVIDED

TOKEN_TEXT
----------------------------------------------------------------
PROVIDES
PR╔F╔REZ
PR╔PARER
PR╔SENTES
PS
PSFP
PSFW
PSFWQBA
PSFW╞
PUBLI
PUBLIC

TOKEN_TEXT
----------------------------------------------------------------
PUBLISHED
PUBLI╔
PURPOSE
PW
PW9
PW99N9NP
PWQBA
PZ


Póó

TOKEN_TEXT
----------------------------------------------------------------

P╞
P╞KN
P╞P
P╞PN9
P╞W
P╞WM
P╞WNP
P╞WNPP╞
P╞WNW
P╞WNW9

TOKEN_TEXT
----------------------------------------------------------------
P╞WNWMN
P╞WNYC
P╞WW9
Q
QBA
QM
QM╞C
QM╞W
QN
QN9
QN9N

TOKEN_TEXT
----------------------------------------------------------------
QNMNC
QNMNCFW
QNNC
QNNK
QNNP
QNNW
QNP
QN╞
QRPMKN
QU
QUALITY

TOKEN_TEXT
----------------------------------------------------------------
QUAND
QUANTIT╔
QUE
QUELLE
QUESTIONS
QUI
QUIT
QUITE
QUOTES
Q╞
Q╞K

TOKEN_TEXT
----------------------------------------------------------------
Q╞MKMK
Q╞MPMP
Q╞N
Q╞P
R
RAC
RACINE
RAL
RAMME
RARE
RATHER

TOKEN_TEXT
----------------------------------------------------------------
RAWCM
RC
RC╞K
RC╞P
RE
READABLE
REAL
REASON
REASONABLY
REASONS
RECEIVE

TOKEN_TEXT
----------------------------------------------------------------
RECEIVED
RECICPN
RECIPIECC
RECIPIECW
RECIPIENT
RECIPIENTS
RECOM
RED
REDIRECTION
REDIRECTPORT
REDIRIG

TOKEN_TEXT
----------------------------------------------------------------
REDIRIGER
REDISTRIBUTE
REDISTRIBUTION
REFER
REFERENCES
REFERS
REFRAIN
REGARDLESS
REGISTERED
REL
RELEASE

TOKEN_TEXT
----------------------------------------------------------------
RELEASED
RELIANCE
RELOAD
REMAINS
RENDERED
RENDRE
RENSEIGNEMENTS
RENTIEL
REP
REPLACE
REPOSITORY

TOKEN_TEXT
----------------------------------------------------------------
REPRENAN
REPRODUCING
REPUTATION
REQUEST
REQUIRED
REQUIREMEN
REQUIREMENTS
REQUIREMENTS
╞C╞
╞K
╞KN

TOKEN_TEXT
----------------------------------------------------------------
╞M
╞MCKMCKA
╞MK
╞MP
╞MWPMWPA
╞N
╞NL
╞NN
╞NN9PN
╞P
╞PN

TOKEN_TEXT
----------------------------------------------------------------
╞W
╞WA╞
╞WC5
╞WCFPP
╞WCFW
╞WN
╞WP
╞WW
╞Wó
╞W╞
Ƣ

TOKEN_TEXT
----------------------------------------------------------------

╟AIS
╟ON

╔CRAN
╔CRAS
╔CRASAN
╔E
╔ES
╔GALEMENT
╔GALIT╔

TOKEN_TEXT
----------------------------------------------------------------
╔RENTIEL
╔TAPE
╔TOILE
╔T╔

╩B
╩C
╩G2
╩TRE
╩VB
╩VVB

TOKEN_TEXT
----------------------------------------------------------------
╩W
╩W▄
╩W▄R
╩ÿB
╩ÿR
╩┌


ͬ



TOKEN_TEXT
----------------------------------------------------------------
╤4┘
╤FV┴F╤G═
╤R
╤V╤╩
╤┴2
╤┴7B
╤╩
╤╩Fÿ
╤╩R

╙2

TOKEN_TEXT
----------------------------------------------------------------
╙7FV╩W
╙B
╙FVB═
╙F╟
╙F╤
╙G
╙G═

┘4═
┘╬


TOKEN_TEXT
----------------------------------------------------------------

▄G
▄R
▄R╩R
▄W
܈
▄ÿR
▄╤B
▄╤╩F┌
▄▄W
▀V

TOKEN_TEXT
----------------------------------------------------------------

≈≈
RESPONSIBLE
REST
RESTART
RESTAURER
RESTRICT
RESTRICTIONS
RESTRICTIVE
RESULT
RETOUR

TOKEN_TEXT
----------------------------------------------------------------
RETOURNE
RETURN
RETURNS
REUSE
RHOSTS
RICHTINGEN
RIEN
RIGHT
RIGHTS
RISK
RNI

TOKEN_TEXT
----------------------------------------------------------------
ROLES
ROOT
ROWLEVEL
ROYALTY
RS
RTE
RUN
RUNALL
RUNNING
RUNNING
RUNS

TOKEN_TEXT
----------------------------------------------------------------
RUNTIME
R┴
R╔
R╔ALIS╔ES
R╔F
R╔F╔
R╔F╔R
R╔F╔RENTIEL
R╔F╔RENTIELS
R╔F╔RER
R╔G

TOKEN_TEXT
----------------------------------------------------------------
R╔GULIERS
R╔PERTOIR
R╔PERTOIRE
R╔PERTOIRES
R═
SA
SAFEST
SAM
SAME
SAME
SAMPLE

TOKEN_TEXT
----------------------------------------------------------------
SANS
SATI
SATISFY
SAVE
SAY
SAY
SAYING
SC
SCC
SCHE
SCHEIDING

TOKEN_TEXT
----------------------------------------------------------------
SCHEMA
SCHOOL
SCM
SCOPE
SCOT
SCOTT
SCRIBE
SCRIPT
SCRIPTS
SE
SECTIOCCC

TOKEN_TEXT
----------------------------------------------------------------
SECTIOCCPP
SECTIOCWW
SECTIOCWY
SECTION
SECTIONS
SECURITY
SEE
SEIN
SELECT
SELECTIONNER
SELON

TOKEN_TEXT
----------------------------------------------------------------
SEMAINE
SEPARATE
SERVER
SERVER
SERVICE
SERVICENAME
SERVICING
SESSION
SESSION
SET
SET

TOKEN_TEXT
----------------------------------------------------------------
SEUL
SEULE
SF
SFPMWW
SFPNB
SFW
SFW╞
SH
SHARE
SHARED
SHARING

TOKEN_TEXT
----------------------------------------------------------------
SHELL
SHORT
SHOW
SHUTDOWN
SI
SID
SIDE
SIDGUESS
SIG
SIGN
SIGNATURE

TOKEN_TEXT
----------------------------------------------------------------
SIGNE
SIGNED
SIMILAR
SIMILARLY
SIMPLEMENT
SIMPLES
SIMPLETCPCLUSTER
SIMULTAN
SINGLE
SINON
SLASH

TOKEN_TEXT
----------------------------------------------------------------
SOFTD
SOFTWARE
SOFTWARE
SOFTWC╞
SOLARIS
SOMEONE
SOMEPASSWORD123
SOMETIMES
SON
SONT
SOON

TOKEN_TEXT
----------------------------------------------------------------
SORTIE
SOUR
SOURCE
SOURCE
SOUS
SOUVENT
SPEAK
SPECIAL
SPECIALLY
SPECIFIED
SPECIFIES

TOKEN_TEXT
----------------------------------------------------------------
SPECIFY
SPOON
SP╔CIF
SP╔CIFI
SP╔CIFIE
SP╔CIFIER
SP╔CIFI╔
SP╔CIFI╔S
SQL
SQLPL
SQLPLU

TOKEN_TEXT
----------------------------------------------------------------
SQLPLUS
SSEZ
SSH
SSL
SS╞ZS
SS╞ZSM01
ST
STA
STANDARD
START
START

TOKEN_TEXT
----------------------------------------------------------------
STARTED
STARTUP
STATED
STATES
STATING
STATUS
STD
STEP
STEPIN
STEPS
STICKY

TOKEN_TEXT
----------------------------------------------------------------
STOC
STOCK
STORAGE
STRAIGHTFORWARDLY
STRATEGY
STRING
STRUCTURE
SUBJECT
SUBLI
SUBMITTED
SUBSECTCN

TOKEN_TEXT
----------------------------------------------------------------
SUBSECTION
SUBSEQUENT
SUC
SUCCESSOR
SUGGEST
SUITABLE
SUIVANTE
SUIVANTS
SUN
SUPPLIED
SUPPLY

TOKEN_TEXT
----------------------------------------------------------------
SUP╔RIEUR
SUP╔RIEURE
SUR
SURE
SUSTAIDCó
SUSTAIDCó9
SUVIANTS
SYS
SYSTEEM
SYSTEM
SYSTEM

TOKEN_TEXT
----------------------------------------------------------------
SZ

S╔LECTIONNER
S╔PARANT
S╔PARER
T5E┘D
TABLE
TAG
TAPEZ
TCP
TE

TOKEN_TEXT
----------------------------------------------------------------
TEAM
TEC
TEKST
TELLS
TEMPS
TEN
TERM
TERMINATED
TERMS
TESTEN
TESTER

TOKEN_TEXT
----------------------------------------------------------------
TEXT
TEXTE
TFTP
TH
THANK
THC
THCN
THEMSELVES
THEREFOC
THI
THINGS

TOKEN_TEXT
----------------------------------------------------------------
THIRD
THOC
THREE
THRESHOLD
THROW
TIER
TIGER
TIME
TIME
TION
TLE

TOKEN_TEXT
----------------------------------------------------------------
TMP
TNS
TNSCMD
TNSCMD10G
TOE
TOEPASSINGEN
TOGETH
TOMCAT
TOMCATA
TOMCATB
TOMCATC

TOKEN_TEXT
----------------------------------------------------------------
TOMCATS
TOOLS
TOONT
TOUS
TOUT
TOUTES
TRADUCTION
TRAIT
TRANS
TRANSACTION
TRANSF

TOKEN_TEXT
----------------------------------------------------------------
TRANSFOR
TRANSFORMATION
TRANSFORMATIONS
TRANSLAT
TROUVEREZ
TRUE
TUSSEN
TUTORIAL
TWEA
TWEAD╞MP
TWO

TOKEN_TEXT
----------------------------------------------------------------
TWO
TXT
TY
TYPE
TYPE
TYPICAL
T┬CHE
T┬CHES
T╔L╔CHARG
U
U4U

TOKEN_TEXT
----------------------------------------------------------------
UCC
UCW
UD
UIT
ULIERS
UN
UNCHANGED
UNCOMBINED
UNCOMMENT
UNDEC
UNDER

TOKEN_TEXT
----------------------------------------------------------------
UNE
UNI
UNION
UNIX
UNLESS
UNPROTECTED
UNRESTRICTED
UNZIP
UP
UPDATE
UPDATED

TOKEN_TEXT
----------------------------------------------------------------
UPDATEWARE
UPDATEWAREHOUS
UPDATEWAREHOUSE
UPGRADEN
UPLOAD
your
US
USA
USE
USE
USED

TOKEN_TEXT
----------------------------------------------------------------
USED
USEFUL
USER
USERNAME
USERS
USES
USING
UT
UTILISATEUR
UTILISATION
UTILISER

TOKEN_TEXT
----------------------------------------------------------------
UTILISEZ
UTILIS╔
U╓
V
V7
V7B
V7Fÿ
V7Fш
VALEUR
VALEURS
VALIDES

TOKEN_TEXT
----------------------------------------------------------------
VALUE
VALUE
VAN
VANAF
VANUIT
VARCHAR2
VARCM╞CB
VARCM╞WB
VARIABLE
VARIABLE
VARIABLES

TOKEN_TEXT
----------------------------------------------------------------
VARIABLES
VARIOUS
VB
VEC
VERBATIM
VERBETERT
VERDA
VERHARD
VERIFY
VERSIE
VERSIOCCC

TOKEN_TEXT
----------------------------------------------------------------
VERSIOCWW
VERSION
VERSIONS
VERTICAL
VERWEZEN
VERWIJDEREN
VEZ
VIA
VICE
VINDEN
VIRGULE

TOKEN_TEXT
----------------------------------------------------------------
VNC
VNCSER
VNCSERVER
VNSERVER
VOEGEN
VOETAFDRUK
VOICI
VOID
VOIR
VOLGENDE
VOOR

TOKEN_TEXT
----------------------------------------------------------------
VOORDELEN
VOTRE
VOUS
VR
VR═
VVR
VVW2
VW7G2
VWB
VWGF╤╩R
VêêB

TOKEN_TEXT
----------------------------------------------------------------
Vÿ

V┴B
V┴FW7B
V┴F╤6
V╔RIFIEZ
V╩FVB
V╩╙╟
V╙F╟
V▄W
NPQMK

TOKEN_TEXT
----------------------------------------------------------------
NPQMP
NST
NT
NUMBER
NUM╔RO
NW
NY
NZ
Nÿ
O
OBJ

TOKEN_TEXT
----------------------------------------------------------------
OBJECT
OBLIC
OBTAINED
OC
OCCASIONS
OD
ODCIENV
ODCIINDEXGETMETADATA
OFF
OFFER
OFFERICK

TOKEN_TEXT
----------------------------------------------------------------
OFFERICP
OINDE
OINS
OLDER
OM
OMDAT
OMES
OMGEVING
OMGEVINGEN
OMVAT
ONCE

TOKEN_TEXT
----------------------------------------------------------------
ONDERDELEN
ONDERSTAANDE
ONDERSTEUNT
ONGELUK
ONTWIKKELAAR
ONTWIKKELAARS
OOK
OP
OPEN
OPERAT
OPERATE

TOKEN_TEXT
----------------------------------------------------------------
OPERATIN
OPERATING
OPGESLAGEN
OPLAGEN
OPT
OPTION
OPTIONAL
OPTIONS
ORA92
ORA9201
ORAC

TOKEN_TEXT
----------------------------------------------------------------
ORACLE
ORD
ORDEC
ORDINARY
ORG
ORG
ORIGICK
ORIGICP
ORIGIN
ORIGINAL
ORMATION

TOKEN_TEXT
----------------------------------------------------------------
ORMATIONS
ORME
OS
OT
OTHER
OTHER
OTHERS
OTHERWISE
OTHERWISE
OU
OURER

TOKEN_TEXT
----------------------------------------------------------------
OUT
OUTLN
OUTPUT
OUTSIDE
OVER
OWING
O┘
P
P4
P7D
P9

TOKEN_TEXT
----------------------------------------------------------------
P9P
P9PP
P9W
P9WQM
P9YI
PA
PACKAG
PACKAGE
PAG
PAGE
PAGE

TOKEN_TEXT
----------------------------------------------------------------
PAKKETTEN
PAN
PANNEAU
PANP
PANP9
PANP9W
PAR
PARAMETER
PART
PARTICULAR
PARTIES

TOKEN_TEXT
----------------------------------------------------------------
PARTY
PAS
PASS
PASSE
PASSED
PASSW
PASSWOR
PASSWORD
PASSWORDS
PASS╔E
PASTE

TOKEN_TEXT
----------------------------------------------------------------
PATCH
PATCHED
PATCHSETS
PATENT
PATENTS
PATH
PAY
PC
PCQBA
PCó
PCóó

TOKEN_TEXT
----------------------------------------------------------------
PD
PENDANT
PENTAHO
PEOPLE

2743 rows selected.

SQL>
SQL>

/code]


Everything is there then what could be the Issue?

Thanks
Deepak
Re: Multi-lingual index for blob column [message #513692 is a reply to message #513691] Wed, 29 June 2011 00:05 Go to previous messageGo to next message
Barbara Boehmer
Messages: 9100
Registered: November 2002
Location: California, USA
Senior Member
Your query below does not match the output. Where is the length (doc)? Do you see the value that you are searching for in the token_text of the other query or something similar due to display issues? Try loading just one simple document, perhaps with only one word in the foreign language that is a problem, so that it is easier to test.

SQL> select id, name, length (doc) from test;

ID NAME
---------- ---------------------------------------------------------------
22 ABC.txt
23 Clustering.doc
24 connectivity.doc
41 backtrack_oracle_tutorial.pdf
42 Reading Logs Spanish.pdf
43 Pan-2.4-fr_FR.pdf
44 Kitchen-2.4-fr_FR.pdf
45 dutch.txt
61 german.odt
62 chinese.odt
63 japanese.odt

ID NAME
---------- ---------------------------------------------------------------
64 dutch.odt
81 backtrack_oracle_tutorial.pdf
82 dutch.odt

14 rows selected.
Re: Multi-lingual index for blob column [message #513693 is a reply to message #513691] Wed, 29 June 2011 00:05 Go to previous messageGo to next message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

Check this out Mam..

TOKEN_TEXT
---------------------------------------------------------------
Vÿ
V¼
V-B
V-FW7B
V-F-6
V+RIFIEZ
V-FVB
V-+¦
V+F¦
V_W
NPQMK


All this data is the result of the query to check tokens.

but when I rty to run the serach query for one of the string "V+RIFIEZ" still there is "no row returned".

SQL> ed
Wrote file afiedt.buf

  1  select score (1) score, id, name
  2    from   test
  3    where  contains (doc, 'V+RIFIEZ', 1) > 0
  4*   order  by score (1) desc
SQL> /

no rows selected


More over one of the character of the string changed automatically. When I tried to run the query on the SQL developer the strinf was exact same as it was in the token but still there was no row returned.

Below is the query which I ran in SQL Developer..
select score (1) score, id, name
  from   test
  where  contains (doc, 'V╔RIFIEZ', 1) > 0
  order  by score (1) desc


Now what could be the problem Mam?

Thanks
Deepak

[Updated on: Wed, 29 June 2011 00:06]

Report message to a moderator

Re: Multi-lingual index for blob column [message #513694 is a reply to message #513693] Wed, 29 June 2011 00:10 Go to previous messageGo to next message
Barbara Boehmer
Messages: 9100
Registered: November 2002
Location: California, USA
Senior Member
With the lexer that you posted, it would not index the + symbol, so I think you are doing something different than what you have posted. You need to keep the test case as simple as possible and be consistent.
Re: Multi-lingual index for blob column [message #513695 is a reply to message #513694] Wed, 29 June 2011 00:12 Go to previous messageGo to next message
Barbara Boehmer
Messages: 9100
Registered: November 2002
Location: California, USA
Senior Member
Also, when you run your test queries, make sure that you are running them using the same method that worked previously, storing the query to a sql file, with a blank line at the top, saved as utf-8 and spooling the result when you run it.
Re: Multi-lingual index for blob column [message #513697 is a reply to message #513695] Wed, 29 June 2011 00:34 Go to previous messageGo to next message
Barbara Boehmer
Messages: 9100
Registered: November 2002
Location: California, USA
Senior Member
Also, make sure that you search for the same string that was entered, as it was entered in the document, not how it is displayed as token_text due to distortion when the operating system character set does not match.
Re: Multi-lingual index for blob column [message #513699 is a reply to message #513697] Wed, 29 June 2011 00:43 Go to previous message
dkdms2124
Messages: 369
Registered: April 2010
Location: INDIA
Senior Member

OK Mam, I am doing from the very beginning and will post the result I get.

Thanks
Deepak
Previous Topic: Error trying to index a PDF file
Next Topic: Oracle Text CATSEARCH with empty field and not returning data
Goto Forum:
  


Current Time: Sat Nov 23 06:58:07 CST 2024