Home > Data Mining Algorithms > Anomaly Detection > Anomaly Detection Viewers a... > Association > AR Model Viewers and Algori... > Decision Tree > Expectation Maximization > EM Model Viewer and Algorit... > Generalized Linear Models > GLM Model Viewers and Algor... > k-Means > Naive Bayes > Nonnegative Matrix Factoriz... > Orthogonal Partitioning Clu... > Singular Value Decompositio... > Support Vector Machine > SVM Model Viewers and Algor... > Explicit Semantic Analysis
(ESA) is a vectorial representation of text (individual words or entire documents) that uses a document corpus as a knowledge base. In ESA, a word and a document are represented as follows:
Word: Represented as a column vector in the tf-idf matrix of the text corpus. Typically, text corpus is Wikipedia.
Document (string of words): Represented as the centroid of the vectors representing its words.
Oracle Data Mining provides a prebuilt ESA model based on Wikipedia, and user can import the model to Oracle Data Miner for data mining purposes.