Explore alternate text comparison methods. Benchmark different methods. Possible optimizations include: - https://en.wikipedia.org/wiki/Overlap_coefficient - Concurrent licence confidence calculation - Strip "bad" characters from source text (License probably doesn't contain runes like `[{;:#+` etc)