Create and Use Custom Language Data
Create and use your own language-specific conditions (such as common abbreviations) when chunking data.
Here, you use the chunker helper function
CREATE_LANG_DATA from the DBMS_VECTOR_CHAIN package to load the data file for Simplified Chinese. This data file contains abbreviation tokens for your chosen language.
After loading the language data, you can now use language-specific chunking by specifying the
LANGUAGE chunking parameter with VECTOR_CHUNKS or UTL_TO_CHUNKS.
Related Topics
Parent topic: Configure Chunking Parameters