Publication: A software system for gene sequence database construction based on fast approximate string matching.