NASA Logo, National Aeronautics and Space Administration



sequenceMiner was developed to address the problem of detecting and describing anomalies in large sets of high-dimensional symbol sequences. sequenceMiner works by performing unsupervised clustering (grouping) of sequences using the normalized longest common subsequence (LCS) as a similarity measure, followed by a detailed analysis of outliers to detect anomalies.

sequenceMiner utilizes a new hybrid algorithm for computing the LCS that has been shown to outperform existing algorithms by a factor of five. sequenceMiner also includes new algorithms for outlier analysis that provide comprehensible indicators as to why a particular sequence was deemed to be an outlier. This provides analysts with a coherent description of the anomalies identified in the sequence, and why they differ from more “normal” sequences.


This software is released under the terms and conditions of the NASA Open Source Agreement (NOSA) Version 1.1 or later.

sequenceMiner NOSA Software Agreement

You can register your download using this form, all fields are required:
Full Name:
Street Address:
eMail Address:
First Gov logo
NASA Logo -