Machine Learning Extractor
Skill Cartridge®
Applies taxonomies and controlled vocabularies to documents.
By TEMIS

Scope

The MLX Skill Cartridge® is a framework distributed under the Creative Commons Attribution license (CC BY), which allows its users to distribute their own cartridges built with TEMIS Machine Learning technology. This versatile extractor is trained on annotated corpora, in which domain experts have highlighted the entities of interest, and can then extract, subcategorize and/or score entities. Corpus annotation and Skill Cartridge® training require no natural language processing expertise and can be performed with the Annotation Workbench.

Internal

Depending on the learning (training) process to be carried out, another Skill Cartridge® may be required in addition to the MLX Skill Cartridge®. Entities are extracted in relation to the lexicon of the Skill Cartridge® used. At the end of the learning/extraction process, one model is created for each supported language found in the corpus.

Scoring is available in English, Spanish, French, Italian, German and Dutch.
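
The idea of one model per language found in the corpus can be illustrated with a toy sketch. This is not the MLX or Luxid® API: the corpus layout and the function names `train_per_language` and `extract` are hypothetical, and simple annotation counting stands in for the actual TEMIS learning technology.

```python
from collections import Counter, defaultdict

# Hypothetical annotated corpus: (language, text, [(entity, category), ...]).
# The entities are the spans a domain expert would have highlighted.
corpus = [
    ("en", "Aspirin relieves headache.", [("Aspirin", "Drug"), ("headache", "Symptom")]),
    ("en", "Take aspirin daily.", [("aspirin", "Drug")]),
    ("fr", "L'aspirine soulage la migraine.", [("aspirine", "Drug"), ("migraine", "Symptom")]),
]

def train_per_language(docs):
    """Build one toy model per language: entity -> Counter of categories."""
    models = defaultdict(lambda: defaultdict(Counter))
    for lang, _text, annotations in docs:
        for surface, category in annotations:
            models[lang][surface.lower()][category] += 1
    return models

def extract(models, lang, text):
    """Return (entity, category, score) for each known entity found in text."""
    results = []
    for token in text.lower().replace(".", " ").split():
        counts = models.get(lang, {}).get(token)
        if counts:
            category, hits = counts.most_common(1)[0]
            # Score: share of annotations that agree with the chosen category.
            results.append((token, category, hits / sum(counts.values())))
    return results

models = train_per_language(corpus)
```

Calling `extract(models, "en", "Aspirin helps.")` looks up each token in the English model only; a language with no training documents, such as `"de"` here, yields no entities.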

Customization

The MLX Skill Cartridge® can be customized to allow users to build and distribute their own Skill Cartridge® with their own taxonomy, according to their area of interest.

Typical Applications

MLX is typically used to capture corporate expert knowledge and apply it to index documents automatically in any domain.

Skill cartridge
Document management Generic
Language(s): Generic
Compatibility: Luxid® 6.2
Posting date: November 2013
Version: 1.0
Business model: Turnkey
Related links
MLX Documentation