; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC05G094580 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC05G094580
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionMuDRA-like transposase
Genome locationCmU531Chr05:17921118..17925174
RNA-Seq ExpressionCmUC05G094580
SyntenyCmUC05G094580
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052461.1 MuDRA-like transposase [Cucumis melo var. makuwa]6.2e-1629.17Show/hide
Query:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT
        +++ ++HGG WD+ +++Y   VLKGI+VP  +T+++L+ ++Y +A V+P+KF+I IR  Y++  +   P   ++ND D++F ++  N    P  L  + T
Subjt:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT

Query:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF
           S      + + N D N        Q+ N       PI  DR   +E   GE VE    H+ +G +  + ES  +   + DT   +   ++   + +F
Subjt:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF

Query:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR
        D                      +S    T G+G S     SS    ++VGQIFF K D+ M+LS+L M +NF+  V+KS K++  +R
Subjt:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR

KAA0054865.1 MuDRA-like transposase [Cucumis melo var. makuwa]6.2e-1629.17Show/hide
Query:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT
        +++ ++HGG WD+ +++Y   VLKGI+VP  +T+++L+ ++Y +A V+P+KF+I IR  Y++  +   P   ++ND D++F ++  N    P  L  + T
Subjt:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT

Query:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF
           S      + + N D N        Q+ N       PI  DR   +E   GE VE    H+ +G +  + ES  +   + DT   +   ++   + +F
Subjt:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF

Query:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR
        D                      +S    T G+G S     SS    ++VGQIFF K D+ M+LS+L M +NF+  V+KS K++  +R
Subjt:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR

TYK27211.1 MuDRA-like transposase [Cucumis melo var. makuwa]4.8e-1629.17Show/hide
Query:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT
        +++ ++HGG WD+ +++Y   VLKGI+VP  +T+++L+ ++Y +A V+P+KF+I IR  Y++  +   P   ++ND D++F ++  N    P  L  + T
Subjt:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT

Query:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF
           S      + + N D N        Q+ N       PI  DR   +E   GE VE    H+ +G +  + ES  +   + DT   +   ++   + +F
Subjt:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF

Query:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR
        D                      +S    T G+G S     SS    ++VGQIFF K D+ M+LS+L M +NF+  V+KS K++  +R
Subjt:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR

XP_038891938.1 uncharacterized protein LOC120081277 isoform X1 [Benincasa hispida]2.6e-2230.33Show/hide
Query:  FITRSEERELIQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKLDSYGPAVCITNDNDIRFLLVKTNQNRPQ
        F    ++ + +  +  +GG WD+ +    R  L GI+VP++L Y+E+K  +YG+  V+ S+F+++++V+YKL+   P   I ND+ IRFLL + + +R Q
Subjt:  FITRSEERELIQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKLDSYGPAVCITNDNDIRFLLVKTNQNRPQ

Query:  ILVKQHTIGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRG-DAPIV-----------DDR----NFNSE--------------GEWVENSHDDVGCD
        + V   +  + +I     ++  DDT    ER   + +  R+   D P+            +DR      NSE               ++++  H +  C 
Subjt:  ILVKQHTIGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRG-DAPIV-----------DDR----NFNSE--------------GEWVENSHDDVGCD

Query:  PHVNESMPAIVPDTQLTQPGHLFGNVHRMFDTSV---SCGTSGAGPSIMQPHSSFGMV-VEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR
        P V+ +       T L +     G+V      SV   S  +S     IM   SSF +  VEVGQ+FFSK+D+KM+LS+L + +NFE +VRKS K L+ ++
Subjt:  PHVNESMPAIVPDTQLTQPGHLFGNVHRMFDTSV---SCGTSGAGPSIMQPHSSFGMV-VEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR

XP_038891939.1 uncharacterized protein LOC120081277 isoform X2 [Benincasa hispida]9.9e-2230.69Show/hide
Query:  EQEFITRSEERELIQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKLDSYGPAVCITNDNDIRFLLVKTNQN
        E+    R +++++    I +GG WD+ +    R  L GI+VP++L Y+E+K  +YG+  V+ S+F+++++V+YKL+   P   I ND+ IRFLL + + +
Subjt:  EQEFITRSEERELIQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKLDSYGPAVCITNDNDIRFLLVKTNQN

Query:  RPQILVKQHTIGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRG-DAPIV-----------DDR----NFNSE--------------GEWVENSHDDV
        R Q+ V   +  + +I     ++  DDT    ER   + +  R+   D P+            +DR      NSE               ++++  H + 
Subjt:  RPQILVKQHTIGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRG-DAPIV-----------DDR----NFNSE--------------GEWVENSHDDV

Query:  GCDPHVNESMPAIVPDTQLTQPGHLFGNVHRMFDTSV---SCGTSGAGPSIMQPHSSFGMV-VEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLY
         C P V+ +       T L +     G+V      SV   S  +S     IM   SSF +  VEVGQ+FFSK+D+KM+LS+L + +NFE +VRKS K L+
Subjt:  GCDPHVNESMPAIVPDTQLTQPGHLFGNVHRMFDTSV---SCGTSGAGPSIMQPHSSFGMV-VEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLY

Query:  SMR
         ++
Subjt:  SMR

TrEMBL top hitse value%identityAlignment
A0A5A7UG00 MuDRA-like transposase3.0e-1629.17Show/hide
Query:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT
        +++ ++HGG WD+ +++Y   VLKGI+VP  +T+++L+ ++Y +A V+P+KF+I IR  Y++  +   P   ++ND D++F ++  N    P  L  + T
Subjt:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT

Query:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF
           S      + + N D N        Q+ N       PI  DR   +E   GE VE    H+ +G +  + ES  +   + DT   +   ++   + +F
Subjt:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF

Query:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR
        D                      +S    T G+G S     SS    ++VGQIFF K D+ M+LS+L M +NF+  V+KS K++  +R
Subjt:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR

A0A5D3BSX9 MuDRA-like transposase3.0e-1629.17Show/hide
Query:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT
        +++ ++HGG WD+ +++Y   VLKGI+VP  +T+++L+ ++Y +A V+P+KF+I IR  Y++  +   P   ++ND D++F ++  N    P  L  + T
Subjt:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT

Query:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF
           S      + + N D N        Q+ N       PI  DR   +E   GE VE    H+ +G +  + ES  +   + DT   +   ++   + +F
Subjt:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF

Query:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR
        D                      +S    T G+G S     SS    ++VGQIFF K D+ M+LS+L M +NF+  V+KS K++  +R
Subjt:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR

A0A5D3CLG9 MuDRA-like transposase3.0e-1629.17Show/hide
Query:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT
        +++ ++HGG WD+ +++Y   VLKGI+VP  +T+++L+ ++Y +A V+P+KF+I IR  Y++  +   P   ++ND D++F ++  N    P  L  + T
Subjt:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT

Query:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF
           S      + + N D N        Q+ N       PI  DR   +E   GE VE    H+ +G +  + ES  +   + DT   +   ++   + +F
Subjt:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF

Query:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR
        D                      +S    T G+G S     SS    ++VGQIFF K D+ M+LS+L M +NF+  V+KS K++  +R
Subjt:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR

A0A5D3DU45 MuDRA-like transposase2.3e-1629.17Show/hide
Query:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT
        +++ ++HGG WD+ +++Y   VLKGI+VP  +T+++L+ ++Y +A V+P+KF+I IR  Y++  +   P   ++ND D++F ++  N    P  L  + T
Subjt:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT

Query:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF
           S      + + N D N        Q+ N       PI  DR   +E   GE VE    H+ +G +  + ES  +   + DT   +   ++   + +F
Subjt:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF

Query:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR
        D                      +S    T G+G S     SS    ++VGQIFF K D+ M+LS+L M +NF+  V+KS K++  +R
Subjt:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR

Q5GIT1 MuDRA transposase-like3.0e-1629.17Show/hide
Query:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT
        +++ ++HGG WD+ +++Y   VLKGI+VP  +T+++L+ ++Y +A V+P+KF+I IR  Y++  +   P   ++ND D++F ++  N    P  L  + T
Subjt:  IQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFNIVIRVQYKL--DSYGPAVCITNDNDIRFLLVKTNQ-NRPQILVKQHT

Query:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF
           S      + + N D N        Q+ N       PI  DR   +E   GE VE    H+ +G +  + ES  +   + DT   +   ++   + +F
Subjt:  IGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSE---GEWVENS--HDDVGCDPHVNESMPAI--VPDTQLTQPGHLFGNVHRMF

Query:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR
        D                      +S    T G+G S     SS    ++VGQIFF K D+ M+LS+L M +NF+  V+KS K++  +R
Subjt:  D----------------------TSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTAACGACTTAGAGCGAGATACGAGATTACACGAAGTTAGGGTCGAGAGTAGGACACATGGTGGTAGGAATGAGGTAAGTAGTAAAGAAGAGGGCCGAGAG
CGAGAAGACCGAGAGCAAGAGTTCATAACAAGGAGCGAGGAGCGAGAGTTAATACAATTGTTTATTCAACATGGTGGGACATGGGATGACATCAAACAGAGGTAC
AACAGAGTTGTTTTAAAAGGCATCATTGTGCCAATCACATTAACATACCAAGAACTGAAGGACCAGGTATATGGAATAGCAAGGGTCAATCCATCAAAATTCAAC
ATTGTAATAAGGGTTCAATATAAGTTGGACTCGTATGGACCTGCGGTATGCATAACCAACGACAACGACATTAGATTTTTGTTGGTAAAGACAAATCAAAATAGA
CCTCAGATTCTTGTCAAACAACACACCATTGGTGAGTCTGACATTCTCCCTTCTGTGGTATCCATGGAAAATGATGACACTAATCCATGGGATGAAAGAGGTGTT
TCACAACATCATAATGGCCGGAAGAGGGGAGATGCACCCATAGTTGATGATCGAAATTTCAATTCTGAAGGTGAGTGGGTTGAGAATAGTCATGATGATGTTGGG
TGTGATCCTCATGTAAATGAGTCCATGCCTGCTATAGTGCCTGATACGCAGCTAACACAACCTGGTCATCTTTTTGGAAATGTGCATAGGATGTTTGATACATCT
GTTAGTTGTGGTACTTCAGGAGCTGGACCATCTATCATGCAACCACATTCTTCATTTGGTATGGTCGTGGAGGTAGGACAAATCTTCTTCTCGAAGGATGATGTG
AAGATGAAGTTGTCCATGCTAGTTATGATGGAAAATTTTGAGATGCAAGTCCGGAAGTCAAATAAGAAATTGTATAGTATGAGGAAGAATATTAAAAAGGATTTC
AAGGACGTAGCAGTGACCAAGTTATTTGATGATGCTGCCAGAGCATTTAGGGAGTTCGAGTTCAAAGCTTTGTGGGATCAAATCCTTCCCCCATCAGGGTTGTTT
GTGTTGGTCATTAATGGCTACAAAGAATTCCATCGGTTGGAGAACAATGTCGAGTCAAGAAATGTGGTCGATGTGGTGGAACGGAACACAATAGAGCAAAATGTA
ATGAGCCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTAACGACTTAGAGCGAGATACGAGATTACACGAAGTTAGGGTCGAGAGTAGGACACATGGTGGTAGGAATGAGGTAAGTAGTAAAGAAGAGGGCCGAGAG
CGAGAAGACCGAGAGCAAGAGTTCATAACAAGGAGCGAGGAGCGAGAGTTAATACAATTGTTTATTCAACATGGTGGGACATGGGATGACATCAAACAGAGGTAC
AACAGAGTTGTTTTAAAAGGCATCATTGTGCCAATCACATTAACATACCAAGAACTGAAGGACCAGGTATATGGAATAGCAAGGGTCAATCCATCAAAATTCAAC
ATTGTAATAAGGGTTCAATATAAGTTGGACTCGTATGGACCTGCGGTATGCATAACCAACGACAACGACATTAGATTTTTGTTGGTAAAGACAAATCAAAATAGA
CCTCAGATTCTTGTCAAACAACACACCATTGGTGAGTCTGACATTCTCCCTTCTGTGGTATCCATGGAAAATGATGACACTAATCCATGGGATGAAAGAGGTGTT
TCACAACATCATAATGGCCGGAAGAGGGGAGATGCACCCATAGTTGATGATCGAAATTTCAATTCTGAAGGTGAGTGGGTTGAGAATAGTCATGATGATGTTGGG
TGTGATCCTCATGTAAATGAGTCCATGCCTGCTATAGTGCCTGATACGCAGCTAACACAACCTGGTCATCTTTTTGGAAATGTGCATAGGATGTTTGATACATCT
GTTAGTTGTGGTACTTCAGGAGCTGGACCATCTATCATGCAACCACATTCTTCATTTGGTATGGTCGTGGAGGTAGGACAAATCTTCTTCTCGAAGGATGATGTG
AAGATGAAGTTGTCCATGCTAGTTATGATGGAAAATTTTGAGATGCAAGTCCGGAAGTCAAATAAGAAATTGTATAGTATGAGGAAGAATATTAAAAAGGATTTC
AAGGACGTAGCAGTGACCAAGTTATTTGATGATGCTGCCAGAGCATTTAGGGAGTTCGAGTTCAAAGCTTTGTGGGATCAAATCCTTCCCCCATCAGGGTTGTTT
GTGTTGGTCATTAATGGCTACAAAGAATTCCATCGGTTGGAGAACAATGTCGAGTCAAGAAATGTGGTCGATGTGGTGGAACGGAACACAATAGAGCAAAATGTA
ATGAGCCACTGA
Protein sequenceShow/hide protein sequence
MFNDLERDTRLHEVRVESRTHGGRNEVSSKEEGREREDREQEFITRSEERELIQLFIQHGGTWDDIKQRYNRVVLKGIIVPITLTYQELKDQVYGIARVNPSKFN
IVIRVQYKLDSYGPAVCITNDNDIRFLLVKTNQNRPQILVKQHTIGESDILPSVVSMENDDTNPWDERGVSQHHNGRKRGDAPIVDDRNFNSEGEWVENSHDDVG
CDPHVNESMPAIVPDTQLTQPGHLFGNVHRMFDTSVSCGTSGAGPSIMQPHSSFGMVVEVGQIFFSKDDVKMKLSMLVMMENFEMQVRKSNKKLYSMRKNIKKDF
KDVAVTKLFDDAARAFREFEFKALWDQILPPSGLFVLVINGYKEFHRLENNVESRNVVDVVERNTIEQNVMSH