Dear developers,
I am trying to use DOMINO to identify mapping regions that can be used as unique identifiers for several different species of trees. Someone in my group has already generated multiple sequence alignments (MSAs) for each of the genes of interest, so I have been using the custom run option to bypass the initial stages and start at "Mapping/Alignment". However, when I reach the "marker discovery/selection" stage, it displays no selectable taxa and prevents me from going any further. Do you know of a solution to this problem?
I am using MSA files in PHYLIP format, and I have attached some example data below in case the error lies there. Thanks in advance.
3 8807
Taxon1 ---------- -----AGAGA AATGGAGCCA AGAAAAGCAA TAGATGAATT
Taxon2 ---------- ---------- -ATGGA---C GAAAAATCGG TGGACGATAA
Taxon3 AAACAGATAG CCTGA----- -ATGGAGCCA GGGCTATCAA TCGAAGATCA
GTTCTTAAAA TTTCATCCTT GTTTTGATAC AAACACAAAG ATAGGAATAA
ATTCTCGAAG ATGCACCCTT GCTTTCCTCC ACGAACAAGG ATTGCAATAA
ATTCTCAAAG CTGCATCCAT GCTTTCCTCC GGACTCGAGG ATTGCAATCG
TAGGAGGTGG ACCTAGTGGA ACATCAGCTG CTTATGCATT GGTAAAACTT
TCGGAGCGGG CCCCAGTGGA CTCTCGGCTG CTTATGCACT ATGCAAATTA
TAGGAGCTGG CCCGAGTGGA TTATCTGCTG CTTATGCTCT TTGTAAACTC
GGTTATACAA ATATCACTAT TCTTGAGAAG TATCATTCTG TTGGTGGCAT
GGCTACTCCG ATGTAACTGT GTTCGAAAAA CATCATTCAC CTGGTGGAAT
GGTTATTCCG ATGTAACCGT GTTTGAGAAA GAACGATCGC CTGGGGGCAT
GTGTGAATCA GATGATATTG AAGGTATCTA T--------- ----------
GTGTGAATCA GTTCAAATCG AAGGTAATAA TAATCTGCCG GAAAGAAAAA
GTGTGAATCA GTTGATATTG AAGGTACAAA T--------- --------AG
---------- --GCTATTTA C--------- ---TTTTAGT CTGTTCGTAG
AAATATCCAC TGAATATTAG ATTTTTTACA TAGATTTAAT T---------
AAATAT---- --AGTATGAA A------AGA AAGTCTCAAT GTATACTAAG
AAAGATG--- ---------- ---TCTTGCT TTCTTATTTA GAAACAATTT
AAAAAAGTTC GA-------- -ATTAATGTG TTCTAACAGG AAGGATTTAC
AAAAAGGCTC AAGCACGTCC CGTTTATGTC TTCTTACAGG AAGGATATAT
AACTTTAAAA TTCTTATTTT TAATGACATA TTTGTGTATA GCTTGTTTTA
GATTTGGGAG GTCAAGTTCT TGCAGCAAAC AGTGCTCCGT CTATATTTCA
GATTTGGGAG GTCAAGTCCT TGCTGCAAAT AGTGCTCCTA GTGTATTTCA
GACTACAAGT TTGAAT---- ---------- ---------- -------ATT
CTTGGCTAAA GAAGTCGGTG CAGAAACTGA GGAAATGGAC ACACATAAAT
CCTGGCAAGA GAGATTGGAG CCGAAACAGA AGAAATGGAC ACTCATAAAT
TTTTATTCTT GAACTCCGTG CCTGG----- ---------- -----TTATA
TTGCACTTAT TAACAGTTCA AATGGAGCTC TCACCGAAAT GAAGTTAGTC
TTGCACTCAT AGACAACTCG ACTGGAGATC TCAGAGAAAT GAAACTTGTA
-CACGCTCAC ATAT------ ---------- GTTGAATTAT CCCGAGTCGG
-GAAGATTAT GTATCCATGA TCTCTCTTAC ACTCAA--AC TCCAGGTCAG
-GAAGATTAT GTATCTGTGA TCTCTCTCAC ATTGAA--AA TCCAGGTCAG
GG-GCGTGAG GGAACTTGTC TCC------- -CCTGTCCAA TCATCAGCTC
-ACATCTTCA GACACACACA GACAAAAAAA AAAACTGTGA TTTTCAGCT-
TACGCCTTAC AAAGTATATA T--------- -AGTTTTTCA CCGTAAGCTC
---------- ---AGTTAGT GACATTGAGA CGGACCTAAG AT----ATCC
----ATTTGT AAA------- --CTGCAGGA TAAGGCTAAA GAATCGATTG
GACCATTTCT GATAGCTCCT --CTACAGGA TAAGGCAAAG GTGTCTATTG
ATCTTCTTAT CATG------ ---GTATCAG AGTCGTGACT GTCCAATTCT
GAGTACACGC AGTGAGTGAA ATTGCCTCCG ATT--TAACT CCCGATTATC
GGGTACACGC CGTGAGTGAA ATTGCCCCAG ATT--TAACT CCCGAATACC
T--------- ---ATTTCAC TCGAT----- -----GTCGT TTATGGGC--
TAAAAAATCA AGGATTTCCT TCGGTGCCTA AATCAGTAAT ATACGGATAC
TCAAGGGTCA AGGATTTCAA TCTGTTCCTA AATCAGTAAT TTACGGATAT
---GTGTGAG GGTGTTGAAT TGTCCCAAAT AGG------- ------TGCG
ACAGCATCTG GATATGGATA CGTGCAAGAC ATGCCTTACG CATACATACA
ACTGCATCAG GCTATGGATA TGTTCAAGAT ATGCCTTACG CTTACATTCA
TGGAGGCAGT TTGTCCTC-- ---------- ---------- ----------
CGAGTTCACA AGAACATCCA TGGCTGGAAA AATCCAACGG TTCGAAGGCG
TGAGTTCACC AGGACTTCAA TGGCTGGAAA GATTCGCCGG TTTAAAGGAG
-TTATATGAT TCTTT----- ---------- ---------- AATTTTT---
GTTACACGAG CCTATGGAAT AAGCTGAGTC AAAGGCTGCC AAATTCTGTT
GTTACATGAG TGTTTGGAAG AAACTGAGTG AATACATGCC AAGTTTTTTC
---------- --TATTATCA ACATAA---- ----ATTGAG GATGGGTGGC
CCAACACAGA AGTACTATCA GTTAAACGCA ATTCATCCGA AATCAAAGTT
CCAACACCGA AGTACAATCA ATAAAACGCA GTTCATCTAG CATCAGAGTC
GG-------- --GGGTGGGG GGGT----AA TTTATA---- ----------
GAATTCAAAA CTGAGAATGG CGGTGTAAAA TCGAGAGAAT TCGATAAAAT
GATGCTAAAA ATGAAAGGGG TGATGAACAA GTTATGGAGT TCGATAAAAT
---TATGTAG TATGTTGTC- ------ACAC TAAAAGATTT TACGGA----
TATTATCTCC GGTGCTTTTC CGTTTATCAG TGGAAATACT TACAGATCAC
CATTATATCG GGTGCTTTCC CTTTCAACAA CGGAAAATTT TATAGATCAC
---------- ----ATTC-- TGTACACTAA CAGTATAAAG AAT----TTT
CTTCACCGAA CACAT----- ---ACATAGA T-ATATCATA TATGTTAATT
CTTCTTCGAA CATAATTCAT TTCACACTAA T-GTACAGAA AAT----ATT
CCACTAT--C AATTT----- ATTTGAACG- TGTTGTA--- ----------
TTATTATAAC ACAAAAAGTT AGCTAATTGC TTTTCTCTGT GCAGATACTG
CTAATATTTC CGTCTGGATG ATTTGATTG- TTTTCTGTCT GCAGAATTCT
---------- -CATACGTAC ATAA------ ---------- ----------
CGAATAATAA CCGTATTGAC TTAAGCGAGC TGGAAAAAGA GTTGTTCAGT
CGAGCAAT-- -CGTATGGAC ATGAGTGAGC TCGAAATGGA ATTGTTTAGT
CAAGTACAAG TAGTCG---- ------TTCA TGAAG----- -------CTA
AAAGTCAAAA CCATTGACTA TTACACGTCT TGAATAGGGA TTTGAGCATA
AAAGTACAGA CTATTGACTA CTATACGTCT TGAAGAGGGA TTGAAACATA
GTCCCA---- TTTTTCTTAT GT-----AGG TTAC-----A TGTAGCTACT
TTCCAAAGGG TTTTTATTAC TTCGACGAAT TTATGAACGA TCCATCTACT
TTCCAAAGGG TTTCTATTAC TTTGACAAGT TTATGGATGA TCCGGCAGCT
T--------- ---------- ---------- ---------- -TATCGTCGT
AAATCCTGTT GCAATGCAGA GATTTTACGG AGACACGGAT ATTTTCCTGT
AAACCCTGTT GCAATGCAGA AATTCTATGA TGACACTGAC ATATTTTTGT
AACATTCTTC TA-------- -TCAGTGTAT ATAAGTTAA- ----------
TCTGGTCTTC TCGGCTAATA TTCAGGAACA CGAGGTTGCT GAGCTGGCAA
TTTGGTCTTC TCGGCTGATA TTCAGGGATC AAAAGTCGAG GAGATGGCGA
-------ACT CTTAAACATA TGTGTTACGA GTCCAGGAAA TA--TTTACG
TTTCTGCGGT CAAAAGAATG GGAGGT-GAA GTCGAAAGAG TGATTTTACA
TGAATGCAGT CATAAGTATG GGAGGC-CAA ATCGAAGAAG TGGTTTTACA
ACTTGACAAG TTCTTGCTGC ----AATAGT GC-------- ----------
GC--TTCAAG TACTTCCCAC TGTGAAAAGT GAAGGTAATA TCTTGTAATA
AC--TTCAAG TACTTTCCTC TGTCAACAGT GAAGGTATG- ----------
-------CTA GCA-AAAAT- ---------- ---------- ---------A
AGAAA--CGT TCGGA----- ---TGTACTA ATTTCGGGAA AAA------A
--CGTA-CTT GCAGAATATT CCATGTAATT ATTACTAATC CAAACTCCAA
GAGCTGAAGT TGAGGAAATG GACAACCATA AGTTTGCCTT CGATCT----
AAACAGATAT GAAGGAGGGG TACTATGATA AGCTGGAATT TCTACTACAA
AAGCAGAAAT GAAGGACGGG TTTTACAATA AGCTCGAGTT TCTACTACAG
--TGATTGAA AATCATA--T CGATGGCTAC TGTTACTGGT CTTTCTGTT-
GGTCAAAGAA ACACCTACTT TGTTGG---- ---TAATGGC ATTTGAGCTG
GGTCAACAGA ATACATACTT CGTCAG---- ---TCATGGC ATTTGAGCTG
---------- -CCCGAATTA TAATATATGA ---------- -----TGTTC
ACTGAGAGGA ATGCTAGTTA CGCATTTGAC CTCGTTCGAA AGCATTTCTC
ACAGAGAGGA ACTCCAGCTA TGCCATGGAA CTCGTCCGAA AGCATTTTGC
TATTAATGAT TTTATTCATA GTACTTCATG GAATTTGGCG TAAAGCC---
TAACGATAAT CAAGAACCGA GCTATCCCTA TG--TCAAGG T------ATA
AAATAATAAT CCTGTACCAA GTTATCCATA TG--TCAAGG TGAGGTCATA
---------- ---------- ---------- ---------- ----------
A--------- ---------- ---------- ---------- ----------
GATAAG-AAA AAGA------ ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ----CATATT TTATTACAGC
---------- ---------- ---------- ----CTAATC TTATTTTAGA
---------- -----TTCCT AAGATA---- ---TGTTTCG GACTCTGCAA
ATGGATAT-- -----GTTGG CATTGATTTT AACAACGCAT GCTCATGCAG
ACCAATGG-- -----CTT-- ---------- -----CTTCT GGTATTGCAG
AA-------- ---TGCCGTC AGGT------ -GCATATTAG ATCCT---CC
AGATTACTGA CATTGAAACC GAACAACGGA GGCTCAGTCG TTCCCAGACA
AGACTTCTGA TGTTGAAGCC AAATC----- -GTACATTAG TACCTAAGCA
AATAG----- ---------- ---------- ---------- -----TAGCA
ATTGGACGAA TCGCCAGGTG TCCAGTTTCC AGAACTTTCC ACCCTCGACG
ATTAGATGAA TCATCTGGTG TCCAGTTTCC CGATCTTTCT TCCCTTGATG
CATTTTTAGA GAATCGGACA CG----GGTG CAGC------ ----------
CATACTTGAA ACATTGGGGA ACCCATAGTG TAACTCAAAG CAAGACACTG
CTTACTTGCA ATACTGGGCA AGTCACAGTG CAACACAAAG CAAGACACTG
---------- ---------- --AACATTTT GCAGAGTCCA AGCAACATAT
TACACATGGA TCAACGAGAA GGGGCAAATC GTAGCA--CA AAGAACGTAC
TATACTTGGA TCAACGAAAA GGGACGCATT GCAGCG--CA AAGAACATAC
---------- ---------C TCCTTAGTAG TCCAAATAAT TTCCGAGTCA
AAAGAACTTA ACTCAAACGC TTCAAAGATC TCCGAAAAAC TAACGAGCTG
AAAGAACTTC ACAATAATGC TTCTGAGATT TCTTACAAAC TCTTGACAAG
CGAGTAGTAA ACCTTCAAAT AATATTTAAA AGTACT---- ----------
TAACAACCCG ACAATCAAAA CAGGCGATAG GGTTCTCTTG GTTTATATAC
CCAAAAACCA ACCATCAAGC CTGGTGACAG AGTGCTCTTG GTCTACATCC
---AACTTTA TTACTGTGAA TTTTCCT--- ---------- -----AAATA
CTGGCCTAGA TTTCATCGAT GCCTTCTTCG GTTGTTTGAG AGCCCGTGTA
CGGGTCTAGA TTTTGTTGAC GCCTTCTTCG GGTGTCTGAG GGCTGGAGTC
TTACCTGTTA C--TGATCCC TTTCTGCAGT AAGCAACAGC AGAAGAATCG
ATACCCGTGC CCGCAATTCC ACCC------ GATCCGCTGC AGAAAACCGG
ATACCTGTTC CAACAATTCC TCCC------ GAACCATCAC AGAAGGGTGG
---ACATGAA TAGCCTCGAG GAGGAGTTGT TCGGTAAAGT ACA-------
GCAACAAGCA CTTCTCCACA TTTCCAACAT T-TCCAAAGC ATGCAATGCA
---ACAAGCA TTACTTCATG TGGTTAATAT T-GCTAAATC ATGCAATGCA
---------- ----AACTAT TGACTATTAC ACCATAGT-- ----------
GTTGCCATAT TATCAACTGT TAGTTACCAC ATCGCGGTCA AGGCTTCTTC
GTGGCAATTC TGTCAACACT TAGTTATCAC ATAACTGTTA TGGCAGTTTC
---------- ---------C TTGAAGATAA C--------- ---------T
GGCAAGAAAC ATGCTGTCCC TTAAGGGAAA AGACAAAAAT ACCCCTCGCT
TGCAAAAAAC ATGGTATTTC TTAAAGGAAA CAACAAATGC TCCCTACGCT
GGATT----- ----TGATCA TATACCGATG GGT------- ----------
GGCCCGATCT GCCGTGGCTC CACACCGATT CTTGGGTGAA GAAATCAAGA
GGCCTGATCT GCCATGGTTG TATACAGACT CTTATGTCAA GAAAGCCAAA
TTTTACTACT TT---GG--T CACCAAAAGC CATCGGGAAT CCAGTTGCAA
ATCACCATGC AAAACGGCAG T---ACGAGT CTTTGCCTCG TGATTTGTGC
TTTTCCTCGT CA---GGTGG TACTATGAAC CTTTATCAGA TGACTTGTGC
TGCAGAGATT TTACA----- --------AT GACACGAACG TCTTCTTGTT
TTCCTGCAGT TCACATCGGG GTCCACAGGA GAGCCGAAGG GAGTTATGAT
TTTCTCCAAT TTACATCAGG CTCTACAGGT GACGCCAAAG GAGTCATTAT
TTGGTCTTAT GGTAACTCGG TCGACATAGT AGGATTAA-- ---AGGAGAC
---AACTCAC GGTGGAATAA TCCACAATGT GAAGACGATG AGAAGCCAAT
---AACTCAT GGTGGGCTCA TTCATAATGT AAAGTTAATG CGAAGAAGAT
C---GAGCTC CTCATAAAAG GTGTT-GAAA GT-------- ----------
ACAAGAGCAC CTCGAATACG GTACTTGTGA GCTGGCTGCC ACAGTACCAC
ACAGGAGCAC ATCAAACACA GTTTTAGTAA GTTGGCTGCC TCAATATCAT
---------- --ATGGGAGG CTTTGTTGAA AA-------- --AGTCGTTT
GATATGGGAC TCATCGGTGG CCTTTTCACT AGCATGGTCA GTGGCGGTTC
GACATGGGAC TCATTGGAGG ACTCTTTACA AGTATGGTGT CTGGTGGATC
TGCAA----- -------CGA AAGC-----T CAAATATTTC C-CTCATGTT
CGCAATTTTG TTCTCTCCGA CGGTGTTTAT CCGGAACCCT TTATTATGGC
TGCAATTTTG TTTTCTCCAC TGACATTTAT CAGGAATCCC CTCTTATGGC
AACA------ --------AC CAAGGTACTC ---------- ----------
TGCAGACTAT AACCACGTAT CGCGCCACAC ACAGTGCGGG CCCCAATTTC
TGCAGACTAT CAGCACATAC CATGCGACTC ATAGTGCTGG CCCCAATTTT
----CTGAAA ---------- ---------- ---------A GACTGCGGAA
GCGTTCGAGC TGCTTATTCG TAGGCTAGAA GCCGATGAAA AAGGGCAGAA
GCCTTTGAAC TCCTCGTTCG CAGATTAGAG TCAAAC---A AATCACAAAG
TTTTCATTT- CAGTTTATGT TTTATGCACT CACAATCA-- --TGAATCA-
GTTCGATTTG TCTTCGATGG TTTTT---CT CATGGTTGCT GCAGAACCAA
TTTTGATTTG TCCTCCCTAA TTTTT---CT CATGATTGCT GCTGAACCAA
---------- ----CTTATA AGT---CT-- -----ATGGC TAAACTACTT
TCAGAGCAGA AACTATGAGA AGG---TTTC TGCTGCTGAC TCAGCCTTTC
TAAGAGCGAC AACCCTTAGA AAA---TTCC TTAAGCTGAC TCAATCTTTT
TGAATCTTTT ---------- ---------- ---------- ----ACATTG
GGGCTCGATC GGGGGGCCAT GGCCCCCGGC TATGGATTAG CAGAGAACTG
GGACTCTCTC AAGAGGTATT ATCTCCTGGT TATGGATTAG CAGAAAACTG
TCGGTGTGTA TAACTAATAT ATTTTCATTT TAAGATTTGA ACTATTATCG
C--GTTTACG TGTGCAGTGC GT----ATGG AGAAGGTGAA GAGATGTTGG
T--GTGTATG TAAGTAGTGC AT----ATGG AGAGGGTAAA GAGATTATGG
TTCTATT--- --GATTGTGT GCTTCGAAAT GACAGATATG AAG-----GA
TT-GATTGGA ATGAACGAGT TTGCTGTGGC TACATTGATA ACAATGATGA
TT-GATTGGC AAGATAGGGT GTGTTGTGGT TACATAAATT CAGATGATGA
AGGCTTTTAT G-AGAAAGTG GAAAATC--- ---------- --AACT----
TGAAGTTCAA A-TCAAGATT GTTGATCAAG AAACGGGCGA AGAGTGTGAA
AGATGTTGAT A-TAAAAATA GTTGATCCCG AGACTGGAAT AGAGCT----
-------ACA AGGTCAACAA AACACGTATT AT-------- GTTGGCGGGT
AAATCGGAAA AAGAAGGGGA GATTTGGATT AGTAGTCCAA GTAGTGGAGT
--ATCTGAAA AAGAAGGGGA GGTATGGATT AGAAGTCCAA GTTCAGGAGT
TG------AT GGCATT--CG AGTTGA---- ---------- ----------
TGGATACTGG GACAATGAGG AGTTGACCAA AACGACCTTC GAAAACAAAC
TGGGTACTGG GGCATGGAGG AGTTAAGCGA AAAGACTTAC AGAAATGAGA
---------- ---CAGAAAG GAATGCAT-- ---------- CTTACGCCAT
TGAATAGCGC CCATGGCAGG AAGTACATTC GAACGGGGGA TTTGGGAAGA
TTTATGACCA TCCAGGGAAG AAATACATGC GAACTGGAGA CTTGGGACGA
GGCTCTAATG CACAAGCATT --TCGCTAGT AA-------- ---TAATCCT
ATCGTGGACG GAAAGTTGTT CGTCACGGGC CGTATAAAGG ATCTCATAAT
ATAATTGATG GAAAATTATT CATCACTGGG AGAATAAAAG ATCTAATCAT
---------- ----ATACCA GCCTTCCCAT ATGTTAAGGT AGGCTACTTC
TGTTTCTGGA AGGAATA-TA TATTCGTCAG ATATCGAAAA A---------
TGTTGCTGGA AGGAATA-TA TACTCCTCGG ATATTGAAAA A---------
AGAGTTCAGA TATCTTCTTT TTTTTGATGT TATGGAT--C GTGTTACAGT
ACCGTCGAGA ATTCGTGCCA GCTAGTACGC CCTGGATGCT GTGCTGCAAT
ACAGTTGAGA TCTCATCTGA ACTTATACGA CCTGGCTGCT GTGCTGCTGT
TTCTCGATTT CATTATTTAA TACCCTATCA AAA-GGGATA T--CATTTCC
CGGT--GTTC CGAAGGAGAT ACTTCTATCG AAAGGGAATA TTCCATTTCC
TGGT--GTTC CAAAAGAGAT TCTTATGTCT AAA-GGAATA T--CAGTTCA
TAAAGGTTCT GATCAACTTG GATTGGTTGT AGTCGCAGAA GCTTAT----
GGAAACTTCC GATCATGTTG GATTGGTGGT GATTGCCGAG GTTCGAGAAG
AGAAATTACT GATGAAGTTG GATTGGTAGT AATTGCGGAG GTTCGAGAAG
-----AAATC ACTTCCTTGT GAAGTTATTA AACGTATCTG TACTCGTGTT
CAGTCAAGTC CGGTTTTAAG GAAGCTGTTA AACGAATTCA GGCATGCGTT
TA---AAGCC GGCTCTGAAA GAAATCATGA GACATATTCA GACATGTGTT
GTAGAAGAAC ATGGTGTATC TATTGCCTCC ATTATGACTA TCAAACCGAG
GCGGAAGAAC ATGGGATCAT TGTCTCCTCC GTTATTCTGA TCAAACCGAG
GCAGAAGAAC ATGGCATTGT TGCTGCTTCG ATTGTGTTGA TAAAACCAAG
GTCCATTAGT AAGACCACGT CAGGGAAAAT CAGGAGATTT GAAGTAGCTA
GACCATTAGC AAGACGACAT CCGGCAAAAT TAAACGGTAC GAATGTCTTC
ATCCATTAGT AAGACAACAT CAGGAAAAAT CAAGAGATTT GAATGTCTCA
AAAGATTTGT TGATGGAACA TTGAGTGTGG TGGAAGACGC AACCGATGGA
AAAAGTTTAA AAACGGAACT TTCGATGTCG TTCATCAG-- ----------
AAAAGTTTGC TGATGGAGCT TTGAATATTG TCGACCAGCT TGTTGCAGGA
GAAAAATCAT CACGGGAGTC TA---AAGAT GGTTCCGTAC CATAAATCAC
---------- ----TCCA-- -ACGGAAC-T GTATCGGAAA ATCGAGCCGA
GAAAAATTGC CAGATCCATC CACGGAACAT ATATTGCAAT CTCAAATCAC
AAGAAATTTA GCTCTGGG-- -AAATGATAA AATTAGCAAG GGACAGATCA
GAAAAATAAT CATCCAATTC CGAATTCTGA TATAACAAAG AGTGACATAG
GCCAGATCAC CCTTCAGT-- -AAATTTCAG TATTAACAAG AGGGATATAA
TAGAATTTCT GAAGGAGCTG CTCTATGAGA TGACGGAGGT TCTCAAGTCT
TGAACTTTCT GATCGAGCTT CTCTCTCAAA TGACCGGAAT TTCTACAGCA
TCAATTTTTT GATGGAGCTG CTCTCTGAAA TGACTGGAAC TTCTACTGTT
AAAATCTCCA TCACTGAGAG TCTGGTGTCT TATGGTGTCG ACTCAATTGG
AAAATCTCCA CCAATGAAAG CCTTGTATCT TATGGGGTGG ATTCCATAGG
GAAATGTCGG CAACTGAAAG CCTTGTATCT TATGGGATTG ATTCTATTGG
CGTTGTTCGA GCAGCTCAGA AAATTTCAAA CTTTCTCGGA GTTCTAGTTG
TGTCGTTCGA GCCGCTCAAA AGCTCTCCGA TTTTCTTGGA GTCCCAGTTG
TGTGGTTCGA GCAGCTCAAA AACTTTCAGA TTTCCTCGGA GTTTCAGTTG
GAGCGATAGA CATCTTCTCA GCAACCTGCA TTGAAGACTT AGCAGACTTT
GCGCAATCGA CATATTCACA GCGACATGCA TCGATGACCT GGCGAAATTT
GTGCAATTGA TATTTTTACA GCAACCTGCA TCGATGACTT GGCAAATTTT
GCTGAAGATC TTTTGAGAAA ATCCCACCCC GAGTTGGTCA CAACTACCTC
TCAGACAACC TATTAAAGAA GTCTCGTCCT AAATCAG--- ----------
GCAGAGAATC TTCTAAAGAA GTCCCGTCCT CAATCAGTAG CCAGTTCAAC
CGGTTCTT-- ---------- -G----TGAA AGCAAGATGA GC----TCAA
--------CA AAAAC----- -GAAGGCGAA AGTACCATCA ACAAAAACAA
CCATTCAACA GAGAC----- -GAAATCAAA ATCAAAATCA AC---TTCAG
CCA------- ---------- ---------- --------TG GTTTTTCCAA
CCTTTTCCGA AGCTTCTTCG TCAAGAAAGC TGCAAATCTG GTTCATGCAA
CTATTCTCAA GGCTTCTCCA ACTCATAAGC TGGGTATTTG GCTGTTTCAG
CGTTTAGCTC TTGCTTATGT CTCTTTCATG CTGATATTTC CTGCGTGCAT
ATCTTAGCAC TTGCTTACGT TTGCTTCTTG CTGATTTTTC CCGCTTACCT
CTCATAGCAC TTGCTTATGT TTCCTTCTTG CTGATATTTC CTGCATATCT
ATCGGTTTCC ACTTTCAAAT TTTTCATTTC TGCATGCCAA ATGCTGATAC
CTCGATTTCC GCCTTTACGT ATT-----CT TACAAGCCAC AGAAAAATAC
ATCAGTTTCT ACATTTACAC ATTGGATATA TGCTGGCCAC AATCCAATGC
AATCAGCCCC TTGGTTTGGT TACATTGTTT CCTTAGTATG TGCTCCACTT
ATGCA----- -----CTGCT TACTTGATTA GCCTATTTTG TGCTCCGCTT
ACACAAGTCT CTACTTCGGT TATTTGATTT CTTTAGCGTG TGCTCCTCTT
TCTTGGATAT TATGTG---- ---------- -----CATAG GTTTTCTTGG
TCTTGGATGC TGTGCCTTTT CTCGACATGC ATTAGCATAT CCTTTTTGGG
GCTTGGATAC TATGCATTTT CTTCACTAGC ATTGCAATAG GATTCTTTGG
AAAGTCATTT CTACAACCAA ACTATATTTT GAACCCTGAG GTGTCCATCT
GACACCATTT CTTCAGACAA ACTACGCCTT ATTTCCCGAG GTTTCGATTT
GAGTCCATAT CTGCAACCAA ACTATGCATT GTTCCCTGAA ATGTCGATCT
GGTCTGCTGA TTTTGTCAAG TGGTGGACAC TTTATAAGGC TCAAGAAATA
GGTCTACCGA ATTCGTGAAG TGGTGGGCCC TGCACAAGGC TGAGGAGGTT
GGTCAGTTGA ATTTGCCAAG TGGTGGGCAC TTCACAAGGT TCAAGAAGTT
TCTTCAAAAG TCTTAGCAGT GCATTTGAGA GGAACGGTAT TCATCAATTA
TCTTCGAAAG TTCTTGCAGT TCACTTAAGA GGCACTGTGT TCCTCAACTA
TCTTCGAAAG TTCTTGCAGT TCACCTGAGA GGCACTGTGT TCATCAACTA
TTGGTTCGAG ATGCTCGGGG AAAATATTAC ATCATCTCCT ATTCTTGATA
CTGGTTCAGA ATGCTGGGAG CTAAAGTCGC CTCGTCAGCA TTAATCGACA
CTGGTTTAGG ATGCTAGGAG CTAAGATTCC ATCTTCTGCA GTGCTCGATA
CTGTCGACAT CACAGATCCA TCACTAGTTT CTATTGGAGA GCAAGCTGTG
CCATCGACAT CACGGATCCA TTTCTAGTTT CGATCGGAGA AGATGCTGTG
CAATTGACAT CTCTGATCCA TTTTTAGTTT CGATTGGAGA AGAAGCTGTA
ATCAATGAAG GAGTATTGCT TCAAAGTCAT GAAGTGAAAA ATGGTGTCTT
CTGGCAGAGG GAGCGTTAAT CCAAAGCCAT GAAGTGAAAA ATGGAGTTTT
CTTGCAGAAG GAGTACTGAT TCAAAGCCAT GAAGTGAAAA ATGGAATCTT
GAGTTTCAAC CCGATAAGAA TCGGCCAAAA ATCGTCGATT GGTCCTTATG
GAGTTTCTAT CCAATGAGAA TCGGATCAAG ATCTTCGGTG GGGCCCTATG
GAATCTATAT CCTACCAGAA TTGGTTCAAG ATCTTCAGTG GGACCGTATG
CTGTTGTTCA AAGAGGAAGC ATTGTGGAAG ATGGTACTCA TGTGCTTGCT
CTTTGCTTCA GAAAGGCGTC GAGGTACATG ATGGAGACGA AGTACTTGCC
CTCTAATTCA GAAAGGCGCT GTGGTTGAAG ATGGAGCTGA AATTCTCGCC
CTCAACACCA GC-------- -AAGACTGCA GTTAA---TG TTAAGGCTAC
CTCA------ ---------- CAA--CTGAG GTTAGTACTT CTGATGTGGA
CTGAATTCTG GTGAAGGAGG CAACTCTGAG GCCAAGGTTT TTAATGCTGA
AAGCACCAAA AAGGTCAGT- --AGAATTAG AA-------- ----AAAGAA
TCATTTCCAA A--------- ---------- ---------- ----------
GAGTTTTCAA AAGGTCAGTC --ATAACGAA CATCTATATG CTCCACAAAA
ACTTTCATCA ACT------- -TAGTTGCTG GACTCTTGAA AAAAG-CTTT
---------- ---------- ---------- ---------- ----------
G-TTTTGATT ACT--TCTCT CTAGAATATC TACTAATAAA ATGCA-TTTT
AGACTGA--- ---------- --CACATTTC TGT------- ----------
---------- ---------- ---------- ---------- ----------
TTATTGAATT CACACCCATT G-CACTTTGA TATGTAAGAC AGAATTGCCT
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
AAAACGGAGC ACAATTTACG AATAATGAGT AGGTTAAGGA TCAAATGGCA
---------- ---------- ---TTTCTAT CATGATTACA GGACAAATCG
---------- ---------- ---------- ---------A GGGTAAAATG
TTGACTAACC ATTGATTT-- -CATTTCTAT CACAACCACA GGGAAAAACG
ACACGAAAGA GCATGAAGGG GAGTCATGGA AAT--CATGA ACAT-ATTAA
GGAAAACAAA TCATCAGCAA AAACTGTGGG AATTATGCCA TGGTAATT--
GTACAGCATA GCCTCCACAA AAATAGTGGA AACTCTGTAA CCATTATTCA
ATATTCTC-- -CATATACAT GGTCGGCTTT CTCAGCAGCT TATCGGCAGC
-ACTTTCTCG GTATATATAC AATCGGTTTT CTTGGCAGTT TATCGGCGGC
GAGTTTTTTG GCATCTACAT GATTGGTTTT CTCAGCAGCT TATCAGCAGC
TGTTTCTTAC TTCATTTACC TTTCGCTATG GCAAAACCCG CCTTCACTAA
GGTTTCTTAT TTCGCCTACC TCTATATAAT GCAAAAGCCT CCTATTATGC
GATGTCTTAT TTAATCTACC TCCAGATAAT GCAAAAACCT CCCTCACTTC
AACATTTCAG GTTCTTTTAC TTAGCTGGAG CCTTTCATTG GTTTCCATAC
ATCACTTTGC ATTCATATCT GTTTCTGGAG CATTTCATTG GCTACCTTAC
ATCATTTTTC TTTCTTCTGT TTATCTGGAG CCTTCCATTG GTTACCGTAC
ACGATCGCCA CATATGCTAC ATTGTTTGAC AATTCCTATT CAGGTTCATT
ACCATAGTTG CATATATAGT AATACTCGAT TCTGCCCCTT CAAATCCGAT
ACGATTGTTG CATGTGCAGT TATGTTTACG AGCATCCCGT CAAATCCGAT
CACTTTTGCT ACCACTATTG CCATTTTTTA CATATGTC-- --ATGGTCTT
CGCCTTTGCA ATCTCCATAG CCA----CAG CTTACACAAT TTACGGCCTT
CGATTTTGCT GTAGCCATAG CAA----TGG CATATGCCAC TTATGGCCTT
ATCCTTAGCT TATTCACTTG TTTGGTAAAC CATGTTGTCC ATCAAAAAAG
GTCCTCAGCT TTTTCACTTG CTTCCTCAA- -ATCATACTT AGAAAAAACA
CTCCTCTGTT TCTTCACTTG CCTCTTGAAC TACTGTATTT CGGGAAAACA
AAATGGACAT GATG------ --AAGACTTG TCTTGTCCAT CGTGTCAACG
AGACTTGTCC GAAAAACCCT TCCGAAAATG GCTAATACAC AGAATTTTGA
AGATCTGTAC AAAGCACCGT TCAGAACTTT TCTTCTTCGT CAAATTATGA
TTGCATGTCA CATAAGGTTC ACAAAATTCA TGTCTGGAAC AGAGTTGTTC
CATCATGCCA CGTCAGATTC GCGAAATTCA TCTCTGGAAC AGAAGCTTTC
TCGCCTGCCA TGTAAGATTT ACCAAGTTTC TTACTGGAAC AGAAGCTTTC
TGCATTTATT TACGTCGACT AGGTGCAAAA ATTGGCCAAC ATTGTTCCAT
TGTTTTTATC TACGCCAAAT GGGAGCCAAA ATTGGTAATC ATTGTTCGAT
TGCGTGTACT TGCGCTGTCT TGGAACAAAA GTTGGGAGGC ATTGTTCAAT
CAGAGCCATT AACCCCATTT CAGAACCAAA CCTCATCTCA ATAGGCAATG
CAGAGCTATT AATCCCATTT TAGACCCGGA GCTCGTTTCA CTAGCCGACG
TCGAGCCATC AATCCGGTGA CTAACCCCAA ATTGATATCA GTTGCAGATG
GTGTGCATTT AGGGGACTTC AGTAGAATCG TTCCCAGATT ATATACCTC-
GTGTTGATTT AGGGGACTTC AGCCGTTTAA TCCCAGGTTA TTACACCTCA
GAGTACATTT GGGGGACTTC AGCAGAATTA TTCCTGGCTA TTACACATC-
--TAGTGACT ATGTGTCTAG TAAAATTGAG ATTCAAGATA ACTCGGTTAT
AAAAACGGCT ACTTTTCAGG CGGAATCGAG GTACAAGACA ATGCTGTAAT
--AAATGGCT ATGTTCAAGG GAAGATTGAG ATACAAGATA ATTCAGTTGC
CGGTAGTCAG GGTCTTGTTC TCCCAGGTTC TGTCATTGAG AAGGATGTTA
AGGGAGCCAA GCACTAATCC TCCCTGGTTC GGTTCTTGAA AAATACGTAA
AGGGTGTCAA AGTTTAGTCC TCCCTGGTTC TGTCCTTGAG AAAGACGTCA
TTCTTGGTGC AATCTCAGTT GCTCCCATGA ACTCAGTACT TCAGCATGGT
TTCTCGGTGC ACTTTCGGTT GCTCCGGCGA ACAGAATCCT CCACAGTGGC
TTCTTGGGGC ACTCTCGGTG GCTCCCATGA GTTCAGTTCT TCAGAGCGGT
GGTGTCTTTG TTGGCTCCAA AAATCCAGTT TTGGTCAAAA GCAAGTCTTA
GGTGTTTATG TTGGTTCTCC ATTTCCTTCG ATGGTGAAGA ACACTACCCA
GGTGTTTTTG TTGGGTCTCC AACTCCAGTT ATGGTTAAGA ATGCCATTCA
CTCGTTGGAT GATCGAATCG AGGAGATGGA CTTGAAATAC AAAAAGGTGC
AAGTTTCGAC GACAGAATCG AAGAAATGGA CACGAAATAT CGAAAGGTGT
TTCTTTTGAT GAGAGGTTAG AGGAGATGGA CATGAAGTAC AAGAAGGTTC
TTGGAAATCT AGCTGCAAAT TTTGCTGCTT CAACTCTTAA AGTGAAATCG
TAGGAAATCT CGCAGCAAAT TTAGCAGCCA CAACCTTAAA AGTAAAATCG
TTGGGAATCT TGCTGCAAAT TTGGCGGGCA CAACTCTCAA AGTAAACTCT
AGATTCTTCC ATCGAATCGG TGCTGCAGGA AAGGGATGCT TAAGCCTCTA
AGGTACTTCC ATAGAATCGG TGCAGCTGGC AAGGGATTTT TAAAGCTTTA
AGATACTTTC ACCGGATTGG CGCTGCTGGT AAGGGATTTC TAAGGCTCTA
CAATGACATC CC---AGGTT TTGCAGATCA CAAGATTTTC AGTCCTGGCA
CGATCACCTT CCGAAGAATC TCCCACATCA TAGAATATTC TCAACGGGGA
TAACAACATA CC---AACCT TGCCAGACCA TGAAATCTTC TCTCCTGGAA
TGACATACCG AGTTATTATG AGGCACAGTA ATTGTTTGAG TTCTGATGAT
AGGAATATTC CATCATCCTT CGGCACAGCA ACTGCTTGAG CTCTGACGAC
AGCAATACTC CATTGTGATC CGGCACAGTA ATTGCCTAAG TTCTGATGAT
GATGCACGAC TTGATCCGCG TGGTGCAGCA ATAAGGATCT TATCAAATGG
GACGCACGCC TCGATCCACG TGGTGCAGCC ATACGAAT-- ----AACCA-
GATGCAAGAC TCGATCCACG TGGCGCAGCA ATAAGGATCC TATCAAGCAA
AACTGATGAG AACAGTTCAA TTCTTGATCT GACACTAAAG ACTGGTAAGG
--CTGAGGAT AATTCCCCAT TGCTCGATTT AACCTTAAAA ACTGGCAATG
TGCTGAACAA ACATCTCCAC TGCTGGATCT GACCCTAAAG ACAGGCAATG
CATTTCATAC TCGGACGATA GGTGACTTTG CAACATGGCT TGTTTGTGGA
CATTTCATGC ACGGACAATA GGGGACTTCG CCACGTGGCT AGTCTGTGGG
CATTCCATGC GCGAACTATA GGTGACTTTG CAACATGGCT CGTCTGCGGA
GCAGCAGCAC GGGAAGAACA CGTGAAGCAT GCTCCACATG TCCGGGATGC
GCAGCTGCGC GAGAGGAGCA TGTGAAGCAC TATCCGCACA TTCGTGACGC
GCTGCTGCAC GAGAAGAGCA TGTGAAGCAT GCTCCACATA TTCGTGATGC
AATGTGGGGT TCTCTTCGAC AGGCATACTC CTACACCGAG CTGCATTACT
AATGTGGGGA TCTTTGAGGA GAGCCGATTC TTACACCGAG CTTCACTATT
AATGTGGAGC TCCCTGAGAA CAGCTGACTC TTACACTGAG CTCCATTATT
ATTCAAACAT TTGCAGGCTT TTCAGATTCA AAAATGATCA AGAGATGTAT
ACTCGAACAT CACACGGCTT TTCCGGTTCG AAGATGGCCA AGAAATGTAC
ATTCAAACAT TTGCAGGCTG TTTAGGTTCA CGGATGGACA AGAAATGTAT
GTCAAATTCA AGTTAAGGCC TTTTGACAAC AAC------- ----------
GTCAGATTCA AATTGAGGCC GTTCGATAAG AAT------- ----------
GTCAAATTTA AGCTGAGGCC TTTTGATAGG AAG------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
---------- ---------- -----ATCGG TGAGGACTCT GGTGAGGTGA
---------- ---------- -----ATATC AGAGGAAAAT GGAAAAGTGG
---------- ---------- -----TTCGA TGAGGACACT GGCAAGGTAG
AGCCTAGAGG CGTACTTCCA CCAGAGACCG GTGCAATTCC AAGAGATGAG
AGCCCACAGG TGTACTTCCG CCAGAAACTG GGGCTATTCC GAGAGATGAA
AACCAAGGGG TGTACTTCCA CCAGAAACAG GTGCAATCCC AAGAGACGAG
AATGACAACC GTCCATTACG CTTCCTTGAT GAAGATTTCC AACATCGTCT
AACGACAAAC GCCCTCTGCT TTTCCTGGCC GATGATTTTA AGAAACG---
AATGATAAGC GCCCTTTACT TTTCCTCGAG GATGATTTTC AGCGTCG---
GCACACACCC GAGAAGGTGA AACAATGAAT TTTACTAAAT ACATTCGAGT
---------- ---------- ---GGTGGAT TCTTC----- ----------
---------- ---------- ---AGTAAAT TCTCC----- ----------
CTTTTGGATT TTCAAAGTGG TTCGTAAATC TTGAAAATGA ACAGGTCCGT
---------- ---------- ---------- -------GGA TAAAGTTCGT
---------- ---------- ---------- -------AGA GAAGGTACGT
TATGTTCTAC AACTACAAAT GCGATCAATA CCGGAGGATG AAGTGAATCG
TATGTTCTTC AGCTGCAAAT TCGACCAATC ACAAACGACG AAAGGGAAAG
TATGTTCTTC AATTGCACAC CCGATCAGTC CCAGAGGATG AAAACACACG
GGAGGCAGCA CTTGATTGCA CCAAGCCATG GGATGAGATT GAATTTCCTT
GGAGATTGCA CTCGACTGTA CAAGGCCATG GAACGAAACG GAGTTCCCAC
AGAAAATGCA CTTGACATCA CCAAGCCATG GGATGAAACT GAGTTTCGAG
ATATTGATAT AGGAGACATA ACTATTGATG CAACACTAAC CAAAGAAGAA
ATTTCGATGT GGGAGAGATT ACAATCGAAC GAATCCTGAC CAAAGAGGAA
TCATTGATGT GGGAGAGGTA ACCATTGATG AAGTGCTCAC CAAGGAAGAA
TCAGAAGATC TAGAGTTCAA TCCATTTCTC AAATGTCATG AAGTGGATGT
TCCGAAGAAC TCGAGTTCAA CCCTTTCCTT AGATGCCACG AAATCGATGT
TCTGAAAAGC TAGAGTTTAA TCCGTTTCTC AGATGTCGTG AAGTGGATGT
CATTCGAGCA ACGTCATGTA ATCAAAGCGC GTCCATGGAC CACGGACGTT
CATAAGGGCT TCGTCTTGTA GTGAAAGTGC ATCAATGGAT CACGGCCGTT
CATCAGGGCA ACATCGTGCA ATCAAAGTGC ATCGATGGAT CATGGCCGTT
CAGTCGTTTA CGACATATGC CAGCACCTGA GAAACAGAAA ACCTCTCCCA
CATTAGTGTA CTCGATCTGC CAACATCTTC GAAACAAGAA ACCACTTCCT
CAATAGTATA TGCAATATGC CAACATCTGC GCAACAAAAA GCCTCTTCCT
GAAGCTTGGA GAATCTTCTT AGATCAATCT GATGTTAAAA TGGACTTCAC
CAAGCATGGA GGGCTTTCTT GAATCAATCC GACGTGAAAA TCGACCTCTC
GAGGCTTGGA AGATGTTCTT GGATAAATCT GATGTCAAAG TCGACCTCTC
CGGCTGTCCC CTTGCATCG- --ACATTGGA GAAAAAGTAC TACCAAAGTC
GGGCTGTCCA ATGGCAGCTC AAAAACTCGA GAAAAGTCGA TACAGAAGTG
TGGCTGTCCA ATGGCAGCT- --GCATTCGA GAACAAG--- TGCAAGAGAA
GTCATGCTAG CAAGATCTTG GTACGTGACA ATGTGGCTAA TATCCGTTCA
GTGACATTAG CGAGGCCATG GTACGTGACA TTGTGGATGA TGTCAGCTCA
GTAACACTGG CAAGACCTTG GTATGTAACC TTGTGGCTGA TGTCGGGTCA
GCCGTTCTTG CAAACATTTC TTCCATATTT TCTCATGGGA TTCATAATCT
GCCGTTTCTG CAGATTTTTC TACCATACTT CCTCATGGGG CTAGTGATTT
ACCATTTCTA CAAACTTTCC TACCTTATTT TCTGATGGGA TTAGTAATAT
TTGCTCCATT GAAGTTTCTT TTCTACACAT CGGATACCAT AAAAGTCCAA
TTCCACCGAT GAAATTCATC TTCCACCAGA ACAAGATCAG TAAAACAGAG
TTGCTCCTCT GAACTGCATC TTCTACGTAA ATAAGATCAC AATGACTAAG
---AAGCATT GGCTGCTACC TTTGTTTTGG GTTGTGACAG GACTTTTAGG
CACATGCATT TCTTGCTGCC ACTTTTCTGG ATTTGCTCGG GCGTTTTATC
---ATGTATT GGTTGCTGCC AATCTTTTGG CTTTCTTCTG GAATTTTAGG
TGGAGTTCTA TGTGCTTTTG CAAAATGGAT TCTTGTAGGA ATCAAGAAAG
AGGGCTCGTT TGCGCTATAT GTAAATGGAT TCTCGTGGGT AAGAAGAAAG
AGGTCTTGTT TGTGCTGTGA GTAAATGGAT TCTCGTTGGA AGAAAGAAAG
ACGGCGAATG TGAGCTAATT TGGAGTAAAG GGATCTTCTG CGATACGATT
AAGGCGAAAA CGAACCGATA TGGAGTGTGG GGATATTCAT GGACACTATT
ACGGTAAAGT AGAGCCAATT TGGAGTGCAG GGATATTTAT GGACACTACT
TGGCAAGCAA TAAGGACACT AGTAGGAGAC TATTTCATGG AAATGAAAAG
TGGCAGGCTG TAAGAACACT GGCTGGCGAC TATTTCATGG AAATGACAGG
TGGCAGGCTA TAAGGACACT AGTTGAAGAA TATTTCATGG AAATGACAGG
TGGATCGTTC TTATTCGGAA TTTGGATGAA GCTTATGGGA TCAGAAGTTG
CGGCACGTTT ATATTCAACG TGTGGATGAA TCTAATGGGA TCGGAGGTTG
CGGATCATTT CTGTTTGGAA TTTGGATGAA ACTCATGGGG GCAGAAATCG
CTTGGGACGA AGGCACATAC ATCAACAGCA TGGGAGCAGT TTTGAATCCT
------ACAG AGGCGCTTAC GTGGACACCA TGGGAGCTGT GTTAAATCCT
CCTGGAAAGA GGGAGTTTAT GTGGACAGCA TGGGTGCAAT TTTGAATCCT
GAAATGGTGA GAATCGACAA GTATGGATCG ATCGAGAGAG AAGCTTTGCT
GAACTGGTGA AGATAGAAGA GAATGGCTCT GTGGAGAGAG AAGCTTTGCT
GAACTGATTA GGATTGAAGA ATATGGCTGT CTGGAGCGAG AGGCTTTGCT
CTTTGGACAC ATATATGAAG GTGAAGGAGG CAAAGTGAAG TATGGCAAAA
GTTTGGACAT ATATATGAAG GCGAAGGGGG TAAAGTTAAG TACGGAAAAA
CTTCGGGCAC ATATATGAAG GTGAGGAAGG GAAGGTGAAG TACGGGAAGA
TCAAGATTGG AGAAAGTGGC TATGTGGGAA GCAGAGCTGT GGCAATGCCA
TCGTCGTTAA GAAAGGCGGA TTTGTCGGGA GCAGAGCAGT GGCAATGCCG
TTGTGGTTAG AAAAGGCGCG TTTGTGGGAA GCCGAGCAGT GGCCATGCCC
AGGGTAACTA TTGACACTAA ----AGGTAC TCTTGGAGAT CTCTCTCTTG
GGAGTTACTG TGGGTGGTGG TGGTGGTTGC -TTAGGTGCT CTGTCTTTAG
GGAGTGACTC TGGACAACGG ----AGCTAC ACTCGGGGCC CTCTCGTTAG
CCATGAAGGG AGAGCTCGTA AAATAG---- ---------- ----------
CTATGAAGGA AGAGTTTGTT AACTAA---- ---------- ----------
CGATGAAGGG AGAGTTTGTC AAGTGATGCA TGGTAAAATA AAGCATAAAG
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
GATTTTCAAG TCATTTTCTG ATGATAAAAT GCTCCAAGCA AATTCATTGC
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
ATACTTATCA GAAAACATAT TTGTGATTTA GAAACTCACT CGTCTTCTGA
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
AATTATTATA CGTTAGCACG TTTGACTTGG AACTTCTACC ACAATAGTTA
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
TACCAAAGAT GATCCTATTC TTCAGGTACA CCGATTACAA TTTTCAGTGC
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
CCAATTATCT CCTTTTAATC TATCTTTAGT TTTCTTTCAT TATACTCTCC
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
TTATAATCTA AAGGCAAAAG TCAAACTTAC TTTTCGTCGG ATCCAGGACT
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
ACTTTTCACC CATCTACCAC TTATTTTTGT CACATTGTAA TTTAAACAAC
---------- ---------- ---------- ---------- ----------
---------- ---------- ---------- ---------- ----------
TTACTATGAA AATCAAACAA CTTGTCCAAT TTCCTTCAAT AAATATACGA
-------
-------
ATTTCAG
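
In case it helps to rule out the file itself, here is a minimal sketch of a sanity check for an interleaved PHYLIP alignment like the one above. It only confirms that the number of taxa and the sequence length declared in the header (here `3 8807`) match what is actually in the file. It is not DOMINO's own parser; the function names `read_interleaved_phylip` and `check_alignment` are made up for illustration, and it assumes the relaxed layout where each name in the first block is separated from the sequence by whitespace.

```python
def read_interleaved_phylip(text):
    """Parse relaxed interleaved PHYLIP text into (ntax, nchar, {name: sequence}).

    Assumption: names contain no spaces and appear only in the first block;
    later blocks list the taxa in the same order. Blank lines are ignored.
    """
    lines = [ln for ln in text.splitlines() if ln.strip()]
    ntax, nchar = (int(tok) for tok in lines[0].split()[:2])
    names, seqs = [], []
    for i, line in enumerate(lines[1:]):
        tokens = line.split()
        if i < ntax:
            # First block: first token is the taxon name, the rest is sequence.
            names.append(tokens[0])
            seqs.append("".join(tokens[1:]))
        else:
            # Later blocks: sequence only, cycling through the taxa in order.
            seqs[i % ntax] += "".join(tokens)
    return ntax, nchar, dict(zip(names, seqs))


def check_alignment(text):
    """Return a list of human-readable problems (empty if header and data agree)."""
    ntax, nchar, seqs = read_interleaved_phylip(text)
    problems = []
    if len(seqs) != ntax:
        problems.append(f"header says {ntax} taxa, found {len(seqs)} distinct names")
    for name, seq in seqs.items():
        if len(seq) != nchar:
            problems.append(f"{name}: header says {nchar} sites, found {len(seq)}")
    return problems


# Tiny made-up alignment (not real data): 2 taxa, 12 sites in two blocks.
example = """2 12
TaxonA ACGT-ACGTA
TaxonB ACGTTACG-A
AC
A-
"""

if __name__ == "__main__":
    print(check_alignment(example) or "header and sequences are consistent")
```

To run it on a real file, read the file contents into a string and pass that string to `check_alignment`. An empty result only means the header and the sequence lengths agree; it says nothing about how DOMINO itself reads the file.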