Applying Cognition to Your Technology
Companies using Cognition's Semantic NLP win because:
- Their technology products are made smarter, more interactive, and provide a richer user experience.
- They can differentiate their user experience through simultaneously higher precision and recall when searching text and documents.
- They will increase their revenue when a richer user experience retains users, thus increasing the lifetime value of the end user.
- They will be able to lower attorney review costs by nearly 50% and minimize the legal risk of missing relevant data.
- They will significantly reduce customer care call center response times, maximize self-service support, and increase customer satisfaction.
- They will increase the quality of systems that center on ensuring an organization is compliant with regulatory agencies.
- They will nearly double the quality of machine translation.
- They will increase their revenue when Cognition's Semantic NLP is applied to e-commerce applications and monetization strategies.
- They will increase their revenue when Semantic NLP is applied to ad placement, increasing click-through rates by matching ads more often, and matching ads to more relevant queries (or retrieved documents).
Cognition's Semantic NLP™ Component List
CognitionLEXICON™ -- The lexicon is the heart of Cognition's Semantic NLP technology. It includes:
- A Lexical Dictionary which defines the meanings of each word using morphological, syntactic, taxonomic and semantic features, enabling the software to select word meanings, recognize various forms of a given word, and parse phrases. Alternate spellings and common misspellings are also evaluated. This dictionary includes over 506,000 stems, covering all of the lower-case words of English, as well as tens of thousands of proper nouns, and 536,000 concepts. In combination with the morphology algorithm, the software recognizes over 1 million word forms.
- An Ontology, or vast “Tree of English”, which broadens the search through knowledge of the links between general word senses and specific word senses. This technology enables the computer to “reason downward”, so that a search for a general term like ‘money’ also finds information about specific terms like ‘dollars’, ‘euros’, or ‘yen’. While very complete, covering over 536,000 concepts, the Ontology (or taxonomy) is extensible to accommodate a specific customer’s terminology.
- The Meaning Thesaurus groups word senses that are loosely synonymous. For example, ‘column’ (in one meaning), ‘file’ (in one meaning), ‘line’ (in one meaning) and ‘queue’ (in one meaning) are concepts in the same concept group. The thesaurus has over 76,000 such groupings.
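The "reason downward" behavior described for the Ontology can be illustrated with a minimal Python sketch. The toy taxonomy, the extra level under 'dollars', and the function name expand_downward are invented for illustration; they are not Cognition's data or API.

```python
from collections import deque

# Hypothetical toy taxonomy: each general term maps to its more specific terms.
# The 'money' entry follows the example in the text; the 'dollars' entry is invented.
TAXONOMY = {
    "money": ["dollars", "euros", "yen"],
    "dollars": ["US dollars", "Canadian dollars"],
}

def expand_downward(term, taxonomy):
    """Return the term plus every more specific term reachable below it."""
    found = [term]
    queue = deque([term])
    while queue:
        current = queue.popleft()
        for child in taxonomy.get(current, []):
            if child not in found:
                found.append(child)
                queue.append(child)
    return found

# A search for the general term also covers the specific terms beneath it.
print(expand_downward("money", TAXONOMY))
```

A search layer could then query for any of the returned terms, so that a document mentioning only 'yen' is still found by a search for 'money'.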
CognitionINDEXER™ -- The Cognition Indexer creates searchable ‘indices’ of concepts extracted from user source documents. Supported document types include: HTML, XML, plain text, Word, WordPerfect, RTF, PDF, PowerPoint, and many other common document formats.
The CognitionINDEXER™ is one of the most advanced indexers in the world. It reads each sentence, phrase, and word in the dataset. It assigns meaning to each word based upon context, and interprets capital letters, acronyms and ambiguous words. As many as 15 attributes or values can be assigned to each word.
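As a rough illustration of indexing by concept rather than by surface word, the sketch below builds a tiny inverted index in Python. The sense inventory, the cue-word overlap heuristic, and the names choose_sense and build_index are assumptions made for this example; the actual indexer's method is not described here.

```python
from collections import defaultdict

# Hypothetical sense inventory: word -> {sense id: cue words that suggest that sense}.
SENSES = {
    "bank": {
        "bank/finance": {"money", "loan", "account"},
        "bank/river": {"river", "water", "shore"},
    },
}

def choose_sense(word, context):
    """Pick the sense whose cue words overlap most with the context.

    Words without an entry in SENSES are indexed as themselves.
    """
    candidates = SENSES.get(word)
    if not candidates:
        return word
    return max(candidates, key=lambda sense: len(candidates[sense] & context))

def build_index(docs):
    """Map each concept (or plain word) to the set of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        words = text.lower().split()
        context = set(words)
        for word in words:
            index[choose_sense(word, context)].add(doc_id)
    return index

docs = {
    "d1": "the bank approved the loan",
    "d2": "we walked along the river bank",
}
index = build_index(docs)
print(sorted(index["bank/finance"]))
print(sorted(index["bank/river"]))
```

Because the two uses of 'bank' are stored under different concept keys, a query about financial institutions would not retrieve the river document.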
CognitionSEARCHER™ -- Cognition’s Semantic Search engine gives users access to the indexed data using complex natural language queries, standard Boolean, advanced “Linguistic Boolean”, fuzzy, pattern, and Soundex name search queries. It also enables relevance ranking, search term highlighting and hit-to-hit navigation.
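For readers unfamiliar with Soundex name search, the following Python sketch implements the standard American Soundex code, under which names that sound alike receive the same four-character key. It is a generic implementation for illustration; whether the search engine uses this exact variant is an assumption.

```python
# Standard American Soundex letter-to-digit groups.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        CODES[ch] = digit

def soundex(name):
    """Return the four-character Soundex key for a name ('' if it has no letters)."""
    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""
    key = letters[0]
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters that share a code
        code = CODES.get(ch, "")
        if code and code != prev:
            key += code
        prev = code  # vowels reset prev, so a repeated code after a vowel is kept
    return (key + "000")[:4]

# Names that sound alike share a key, so a query for one can match the other.
print(soundex("Robert"), soundex("Rupert"))
```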
CognitionBROKER™ -- The CognitionBROKER™ allows users to optimize the network performance of Cognition’s Semantic Search engine software in a large, multi-server environment. It automatically load balances, provides fault tolerance by automatically compensating in case of server failure, and automatically discovers network layout and any new server. CognitionBROKER™ also allows you to start or stop servers at any time without modifying any configuration files.
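The general idea of load balancing with failover can be sketched in a few lines of Python. This is a simple round-robin illustration under assumed names (Broker, pick, mark_down, add_server); it does not describe how CognitionBROKER is actually implemented.

```python
class Broker:
    """Toy round-robin dispatcher that skips servers marked as down."""

    def __init__(self, servers):
        self.servers = list(servers)
        self.down = set()
        self._next = 0

    def add_server(self, name):
        # A newly discovered server joins the rotation without any config change.
        if name not in self.servers:
            self.servers.append(name)

    def mark_down(self, name):
        self.down.add(name)

    def mark_up(self, name):
        self.down.discard(name)

    def pick(self):
        """Return the next available server, trying each server at most once."""
        for _ in range(len(self.servers)):
            server = self.servers[self._next % len(self.servers)]
            self._next += 1
            if server not in self.down:
                return server
        raise RuntimeError("no servers available")

broker = Broker(["s1", "s2", "s3"])
print([broker.pick() for _ in range(3)])
broker.mark_down("s2")  # simulate a server failure
print([broker.pick() for _ in range(2)])
```

Requests keep flowing to the remaining servers after one is marked down, which is the fault-tolerance behavior the paragraph above describes at a high level.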
CognitionAPI™ -- CognitionAPI™ gives technology products a simple yet advanced way to integrate Cognition’s Semantic NLP software. It is also used by enterprise customers who have an internally developed data repository or content management system. Cognition offers Application Programming Interfaces (APIs) for the central components of the software, including the Search, Index, and Review functionality within the system. Sample scripts and API libraries for C++, Python, Perl, Ruby, Java, VB/ASP/ActiveX, and PHP4 are available.
CognitionAPI comes in four flavors:
- ClientIndexAPI.h and its corresponding dynamic library for C++ programmers (client server)
- DirectIndexAPI.h and its corresponding dynamic library for C++ programmers (direct, non-client server)
- IndexAgent.ocx for Windows users who desire a COM object interface (client server)
- CognitionSolo.ocx for Windows users who desire a COM object interface (direct, non-client server)
The client server APIs function similarly to the direct indexing APIs, but they interact with one or more instances of Cognition applications on servers in your network to generate indices.
CognitionSPIDER™ -- Enterprise customers and technology companies may wish to include external, typically Web-based, data in their internal data repository. CognitionSpider is an application which enables the user to crawl external data and absorb it into their internal repository.
CognitionPARSER™ -- The Cognition Parser assigns grammatical structure to sentences, improving the interpretation of word meaning through grammatical relationships in the sentence. The parser is also used to rate the semantic and syntactic plausibility of the output of hypothesis-generating software such as machine translation or speech recognition software (see Cognition Ranker).
CognitionRANKER™ -- Software that translates foreign languages into English uses statistical algorithms that can only guess at a translation based on statistical similarity to known translations. Such software produces hypothesis translations in rank order. The Cognition Ranker tries to parse the hypotheses and ranks them for semantic and syntactic plausibility, improving the final choice of translation sentence. In like manner, hypotheses produced by statistically-based speech recognition software for English are ranked for semantic and syntactic plausibility, improving the final choice of interpretation for speech.
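The re-ranking step can be sketched in Python as follows. The scoring function here is a deliberately crude stand-in (the share of adjacent word pairs found in a small hand-made list) and is not the parser-based plausibility measure the Ranker uses; PLAUSIBLE_PAIRS, toy_plausibility, and rerank are names invented for this example.

```python
# Hypothetical list of word pairs treated as plausible; illustrative data only.
PLAUSIBLE_PAIRS = {
    ("it", "is"), ("is", "hard"), ("hard", "to"),
    ("to", "recognize"), ("recognize", "speech"),
}

def toy_plausibility(sentence):
    """Fraction of adjacent word pairs that appear in PLAUSIBLE_PAIRS."""
    words = sentence.lower().split()
    pairs = list(zip(words, words[1:]))
    if not pairs:
        return 0.0
    return sum(pair in PLAUSIBLE_PAIRS for pair in pairs) / len(pairs)

def rerank(hypotheses, plausibility):
    """Sort by descending plausibility; ties keep the original rank order."""
    return sorted(hypotheses, key=lambda h: -plausibility(h))

# Hypotheses as a recognizer might emit them, best statistical guess first.
hypotheses = [
    "it is hard to wreck a nice beach",
    "it is hard to recognize speech",
]
print(rerank(hypotheses, toy_plausibility)[0])
```

Because Python's sort is stable, hypotheses with equal plausibility stay in the order the upstream system produced, so the original statistical ranking still acts as a tie-breaker.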
CognitionCUSTOMIZATION™ -- This is the Lexical Customization that includes a) text analysis tools; b) semi-automated lexical acquisition (a professional service); and c) client-controlled lexical customization. If the client has fewer than one hundred words that they would like to add or modify, there are client customization files that enable a client to add words and phrases in a taxonomy, choose the preferred sense of a word in their domain, and prefer a name or non-name interpretation of a given word. If the client has a larger amount of vocabulary to add or modify, they can order a service from Cognition Technologies to analyze their text for words unknown to the Cognition lexicon, and to add those words to the lexicon in a semi-automated way.
CognitionADMATCHER™ -- The Cognition Ad Matcher interprets the meaning of queries, documents, ad phrases and ad copy. Using meaning as the medium, this ad placement technology maximizes the number of ad phrases matched, and minimizes the number of poor matches to ad phrases.
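Matching on meaning rather than on literal words can be illustrated with a short Python sketch. The word-to-concept table and the rule that every concept in the ad phrase must appear in the query are assumptions chosen for this example, not a description of the Ad Matcher's actual logic.

```python
# Hypothetical word-to-concept table; words not listed stand for themselves.
CONCEPT_OF = {
    "sofa": "c:sofa",
    "couch": "c:sofa",
    "cheap": "c:low-price",
    "discount": "c:low-price",
}

def concepts(text):
    """Return the set of concepts expressed by the words in the text."""
    return {CONCEPT_OF.get(word, word) for word in text.lower().split()}

def match_ads(query, ad_phrases):
    """Keep ad phrases whose concepts are all present in the query."""
    query_concepts = concepts(query)
    return [ad for ad in ad_phrases if concepts(ad) <= query_concepts]

ads = ["discount couch", "leather couch", "cheap flights"]
print(match_ads("cheap sofa", ads))
```

The query and the first ad phrase share no literal words, yet they match through shared concepts, while the phrases that only partly overlap in meaning are rejected.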
CognitionINTERFACE™ -- This is a sample web page that serves as an interface to CognitionSearch. It can be used as is, or modified by the client as desired. It includes methods for sending queries to CognitionSearch and for displaying results with highlighting of words and phrases in the retrieved documents. It also includes methods for displaying the senses and definitions of query terms chosen by CognitionSearch, accepting changes in word senses from the user, displaying spell-checking information, and accepting spelling choices from the user.