; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G22340 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G22340
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotransposon protein
Genome locationClcChr02:34441141..34442164
RNA-Seq ExpressionClc02G22340
SyntenyClc02G22340
Gene Ontology termsNA
InterPro domainsIPR024752 - Myb/SANT-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]3.4e-4637.17Show/hide
Query:  EASGSRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVER
        E   S +R  KH WT EE+  LVECLV+ V +G WR+DN TFRPG+L+ + RMM  KIPG +I  S  ++SR++ +KR + A+AEM GP CSGFGWN E+
Subjt:  EASGSRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVER

Query:  KCIDGEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSG
        KCI  E E+FD W             V YD+L+ VF KDR TG  A + A++GS   P  +    D +     DF   Y P    +     E  +   S 
Subjt:  KCIDGEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSG

Query:  RGSGSSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYC
        R + SS  S+R R     +  ++VR   +   + +  IA+WP++    A + R+E+   L++IP L++     + R L+ +   +  F++ P   KY YC
Subjt:  RGSGSSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYC

Query:  MQVL
          +L
Subjt:  MQVL

KAA0035621.1 retrotransposon protein [Cucumis melo var. makuwa]4.4e-4637.33Show/hide
Query:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID
        S +R  KH WT EE+  LVECLV+ V +G WR+DN TFRPG+L+ + RMM  KIPG +I  S  ++SR++ +KR + A+AEM GP CSGFGWN E+KCI 
Subjt:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID

Query:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG
         E E+FD W             V YD+L+ VF KDR TG  A + A++GS   P  +    D +     DF   Y P    +     E  +   S R + 
Subjt:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG

Query:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL
        SS  S+R R     +  ++VR   +   + +  IA+WP++    A + R+E+   L++IP L++     + R L+ +   +  F++ P   KY YC  +L
Subjt:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL

KAA0038122.1 retrotransposon protein [Cucumis melo var. makuwa]4.4e-4637.33Show/hide
Query:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID
        S +R  KH WT EE+  LVECLV+ V +G WR+DN TFRPG+L+ + RMM  KIPG +I  S  ++SR++ +KR + A+AEM GP CSGFGWN E+KCI 
Subjt:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID

Query:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG
         E E+FD W             V YD+L+ VF KDR TG  A + A++GS   P  +    D +     DF   Y P    +     E  +   S R + 
Subjt:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG

Query:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL
        SS  S+R R     +  ++VR   +   + +  IA+WP++    A + R+E+   L++IP L++     + R L+ +   +  F++ P   KY YC  +L
Subjt:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL

KAA0050106.1 retrotransposon protein [Cucumis melo var. makuwa]8.4e-5341.58Show/hide
Query:  SGSRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKC
        + + ++A KH WT   D +LVECL+Q V+ G WRADN TF+ G+L  + ++M++KI G +IQV+P+L+SRV+ LK+QY AIAEM+GP CSGFGWN ERKC
Subjt:  SGSRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKC

Query:  IDGEAEIFDAWVKYDDLAIVFDKDRTTGSHATTTAEVGSEPVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPS--GRGSGSSLRSRRSRSS
        I+ E  +FD WVK                   T  ++      EE++ DI      + E+F IP+P     P  ED  +TP+     +GSS  S++ RS 
Subjt:  IDGEAEIFDAWVKYDDLAIVFDKDRTTGSHATTTAEVGSEPVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPS--GRGSGSSLRSRRSRSS

Query:  SIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVLGR
        S G+  +  R   +  +K I  IA W     ++     + LY +LQ+IPG+ V   L VA SLL DP +L  F+D+P +WKY  CM++LGR
Subjt:  SIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVLGR

KAA0057083.1 retrotransposon protein [Cucumis melo var. makuwa]6.9e-4737.67Show/hide
Query:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID
        S +R  KH WT EE+  LVECLV+ V +G WR+DN TFRPG+L+ + RMM  KIPG +I  S  ++SR++ +KR + A+AEM GP CSGFGWN E+KCI 
Subjt:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID

Query:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG
         E E+FD W             V YD+L+ VF KDR TG  A + A++GS   P  + E  D +     DF   Y P    +     E  +   S R + 
Subjt:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG

Query:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL
        SS  S+R R     +  ++VR   +   + +  IA+WP++    A + R+E+   L++IP L++     + R L+ +   +  F++ P   KY YC  +L
Subjt:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL

TrEMBL top hitse value%identityAlignment
A0A5A7U7F7 Retrotransposon protein4.1e-5341.58Show/hide
Query:  SGSRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKC
        + + ++A KH WT   D +LVECL+Q V+ G WRADN TF+ G+L  + ++M++KI G +IQV+P+L+SRV+ LK+QY AIAEM+GP CSGFGWN ERKC
Subjt:  SGSRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKC

Query:  IDGEAEIFDAWVKYDDLAIVFDKDRTTGSHATTTAEVGSEPVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPS--GRGSGSSLRSRRSRSS
        I+ E  +FD WVK                   T  ++      EE++ DI      + E+F IP+P     P  ED  +TP+     +GSS  S++ RS 
Subjt:  IDGEAEIFDAWVKYDDLAIVFDKDRTTGSHATTTAEVGSEPVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPS--GRGSGSSLRSRRSRSS

Query:  SIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVLGR
        S G+  +  R   +  +K I  IA W     ++     + LY +LQ+IPG+ V   L VA SLL DP +L  F+D+P +WKY  CM++LGR
Subjt:  SIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVLGR

A0A5A7UME4 Retrotransposon protein3.3e-4737.67Show/hide
Query:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID
        S +R  KH WT EE+  LVECLV+ V +G WR+DN TFRPG+L+ + RMM  KIPG +I  S  ++SR++ +KR + A+AEM GP CSGFGWN E+KCI 
Subjt:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID

Query:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG
         E E+FD W             V YD+L+ VF KDR TG  A + A++GS   P  + E  D +     DF   Y P    +     E  +   S R + 
Subjt:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG

Query:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL
        SS  S+R R     +  ++VR   +   + +  IA+WP++    A + R+E+   L++IP L++     + R L+ +   +  F++ P   KY YC  +L
Subjt:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL

A0A5D3CBF7 Retrotransposon protein2.2e-4637.33Show/hide
Query:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID
        S +R  KH WT EE+  LVECLV+ V +G WR+DN TFRPG+L+ + RMM  KIPG +I  S  ++SR++ +KR + A+AEM GP CSGFGWN E+KCI 
Subjt:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID

Query:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG
         E E+FD W             V YD+L+ VF KDR TG  A + A++GS   P  +    D +     DF   Y P    +     E  +   S R + 
Subjt:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG

Query:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL
        SS  S+R R     +  ++VR   +   + +  IA+WP++    A + R+E+   L++IP L++     + R L+ +   +  F++ P   KY YC  +L
Subjt:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL

A0A5D3DPR5 Retrotransposon protein2.2e-4637.33Show/hide
Query:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID
        S +R  KH WT EE+  LVECLV+ V +G WR+DN TFRPG+L+ + RMM  KIPG +I  S  ++SR++ +KR + A+AEM GP CSGFGWN E+KCI 
Subjt:  SRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCID

Query:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG
         E E+FD W             V YD+L+ VF KDR TG  A + A++GS   P  +    D +     DF   Y P    +     E  +   S R + 
Subjt:  GEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSG

Query:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL
        SS  S+R R     +  ++VR   +   + +  IA+WP++    A + R+E+   L++IP L++     + R L+ +   +  F++ P   KY YC  +L
Subjt:  SSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVL

E5GCB5 Retrotransposon protein1.6e-4637.17Show/hide
Query:  EASGSRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVER
        E   S +R  KH WT EE+  LVECLV+ V +G WR+DN TFRPG+L+ + RMM  KIPG +I  S  ++SR++ +KR + A+AEM GP CSGFGWN E+
Subjt:  EASGSRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVER

Query:  KCIDGEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSG
        KCI  E E+FD W             V YD+L+ VF KDR TG  A + A++GS   P  +    D +     DF   Y P    +     E  +   S 
Subjt:  KCIDGEAEIFDAW-------------VKYDDLAIVFDKDRTTGSHATTTAEVGSE--PVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSG

Query:  RGSGSSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYC
        R + SS  S+R R     +  ++VR   +   + +  IA+WP++    A + R+E+   L++IP L++     + R L+ +   +  F++ P   KY YC
Subjt:  RGSGSSLRSRRSRSSSIGEYNEVVREGFQLLTKSIDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYC

Query:  MQVL
          +L
Subjt:  MQVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCATCCGGGTCACGAGCAAGAGCCGCTAAACATGTATGGACGGATGAGGAGGATAGAATCCTCGTGGAGTGTTTGGTCCAGTGTGTGCAGTCTGGACACTGGCG
AGCTGATAACGAGACTTTTCGACCTGGATTCCTATCAAACATACTACGGATGATGCAGCAGAAGATACCAGGGTGTTCCATACAGGTCAGCCCACATCTGGAGTCAAGGG
TCAGGACATTGAAGAGACAGTATAGCGCGATCGCTGAAATGTTGGGCCCAGGCTGTAGTGGGTTTGGGTGGAATGTGGAGCGCAAATGTATTGACGGTGAGGCGGAGATA
TTTGACGCATGGGTCAAGTATGATGACTTGGCCATCGTATTCGACAAAGATAGAACCACAGGGAGTCATGCAACCACCACTGCAGAGGTCGGATCTGAACCTGTTATGGA
AGAGGAGAACGAGGACATCCTGAACAACCAGTCCCCGGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGCTAGCTCGCCCCCGTCAGAGGACTATTCGACTACCC
CCAGCGGTAGAGGGTCTGGGAGTAGCTTAAGGAGTAGGAGGTCCAGAAGTTCATCGATTGGAGAGTACAACGAGGTGGTTCGTGAGGGATTCCAACTTCTGACGAAGTCT
ATTGACGGCATTGCACAGTGGCCTGTCATGAACGAGGACCTGGCAAGGCGTCGTCGTCGAGAACTATACGCCGAGCTGCAATCCATTCCTGGTTTGTCAGTACAGTATGG
ATTGACTGTTGCACGATCATTACTTGCAGATCCAATGCTGTTAAGCCACTTTGTGGACTTCCCACCACAGTGGAAGTACGACTATTGTATGCAAGTCCTCGGGCGACCAC
GGGATCCAGCACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCATCCGGGTCACGAGCAAGAGCCGCTAAACATGTATGGACGGATGAGGAGGATAGAATCCTCGTGGAGTGTTTGGTCCAGTGTGTGCAGTCTGGACACTGGCG
AGCTGATAACGAGACTTTTCGACCTGGATTCCTATCAAACATACTACGGATGATGCAGCAGAAGATACCAGGGTGTTCCATACAGGTCAGCCCACATCTGGAGTCAAGGG
TCAGGACATTGAAGAGACAGTATAGCGCGATCGCTGAAATGTTGGGCCCAGGCTGTAGTGGGTTTGGGTGGAATGTGGAGCGCAAATGTATTGACGGTGAGGCGGAGATA
TTTGACGCATGGGTCAAGTATGATGACTTGGCCATCGTATTCGACAAAGATAGAACCACAGGGAGTCATGCAACCACCACTGCAGAGGTCGGATCTGAACCTGTTATGGA
AGAGGAGAACGAGGACATCCTGAACAACCAGTCCCCGGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGCTAGCTCGCCCCCGTCAGAGGACTATTCGACTACCC
CCAGCGGTAGAGGGTCTGGGAGTAGCTTAAGGAGTAGGAGGTCCAGAAGTTCATCGATTGGAGAGTACAACGAGGTGGTTCGTGAGGGATTCCAACTTCTGACGAAGTCT
ATTGACGGCATTGCACAGTGGCCTGTCATGAACGAGGACCTGGCAAGGCGTCGTCGTCGAGAACTATACGCCGAGCTGCAATCCATTCCTGGTTTGTCAGTACAGTATGG
ATTGACTGTTGCACGATCATTACTTGCAGATCCAATGCTGTTAAGCCACTTTGTGGACTTCCCACCACAGTGGAAGTACGACTATTGTATGCAAGTCCTCGGGCGACCAC
GGGATCCAGCACCATGA
Protein sequenceShow/hide protein sequence
MEASGSRARAAKHVWTDEEDRILVECLVQCVQSGHWRADNETFRPGFLSNILRMMQQKIPGCSIQVSPHLESRVRTLKRQYSAIAEMLGPGCSGFGWNVERKCIDGEAEI
FDAWVKYDDLAIVFDKDRTTGSHATTTAEVGSEPVMEEENEDILNNQSPDFENFYIPDPPFASSPPSEDYSTTPSGRGSGSSLRSRRSRSSSIGEYNEVVREGFQLLTKS
IDGIAQWPVMNEDLARRRRRELYAELQSIPGLSVQYGLTVARSLLADPMLLSHFVDFPPQWKYDYCMQVLGRPRDPAP