Custom Services
Dictionaries
A semantic dictionary is a collection of the words and phrases recognized by our Semantic Signature technology along with weights relating them to the semantic dimensions defining the main concepts covered by a dictionary. TextWise semantic dictionaries can be readily adapted to a specific target when there is adequate sample text representing the semantic categories of interest for training. Dictionaries can be quickly updated when new training data becomes available.
The SemanticHacker API is releasing a NEW general web dictionary - the 1.7k Dimension Dictionary. This dictionary, updated with recent ODP data, has a coarser grain than the previous 20k Dimension Dictionary. The new 1.7k dictionary is well-balanced and emphasizes recall in matching. It is currently the default for the SemanticHacker API and utlizied by our demo on the homepage.
Please note: The 20k Dimension Dictionary should only be used to support legacy applications. It will be deprecated in the near future.
TextWise can create domain-specific dictionaries (legal, pharmaceutical, etc.) for vertical applications. Please contact us if you have a requirement for such a custom dictionary.
Languages
Our underlying technology is based on a language independent semantic representation model, which can be easily transferrable to other languages. All we need is adequate training data and a basic language processing module to identify words and phrases.