; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003895 (gene) of Snake gourd v1 genome

Gene IDTan0003895
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposon protein
Genome locationLG09:1165139..1166166
RNA-Seq ExpressionTan0003895
SyntenyTan0003895
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050106.1 retrotransposon protein [Cucumis melo var. makuwa]1.1e-5744.74Show/hide
Query:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK
        M +   K +KH R    DEVLV+ L+QLV++GGW+ DNGTF+ GYLVQVQKLMKEK+ GS+IQV                           GFGWN+ERK
Subjt:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK

Query:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP
        CIE EKS+FDDWVK                                          T  DI +DD      D  IP+PH  E   GED+ +T  S     
Subjt:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP

Query:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYPPEWKYKCCMRIL
          S  SKKRR   G+L + FR+ MR  S  I KIA W +E  E+ES+  K+L  +LQ+IPGMD+DDCL +AE+LL D T  H+FLDYP EWKY+ CMRIL
Subjt:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYPPEWKYKCCMRIL

Query:  GRQP
        GRQP
Subjt:  GRQP

KAA0063789.1 retrotransposon protein [Cucumis melo var. makuwa]6.3e-5041.37Show/hide
Query:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQV-QKLMKEKLPGSDIQVGFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKP
        M +   K +KH    +EDE LV+ L+QLV++GGW+ DN TF+PGYL  V  K+M+           FGWN+ERKCIE EKS+FDDWVKGH +ARGL NK 
Subjt:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQV-QKLMKEKLPGSDIQVGFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKP

Query:  FPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRC
        FP+F DL ++FG+D+AT  R +TP+ M S    D  +DD      D  IP+PH  +   GED+++T  S       S  SKKRR   G+L   F +    
Subjt:  FPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRC

Query:  ASTHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYPPEWKYKCCMRILGRQP
                                                     E+LL D T  H+FLDYP EWKY+ CMRILGRQP
Subjt:  ASTHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYPPEWKYKCCMRILGRQP

TYJ96933.1 retrotransposon protein [Cucumis melo var. makuwa]5.9e-4846.41Show/hide
Query:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK
        M +   K +KH    ++D+ LV+ L+QLV++GGW+ +N TF+P YLVQVQKLMKEK+P S+IQV                            FGWN+ERK
Subjt:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK

Query:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP
        CIE EKS+FDDWVKGH +ARGL NKPF +F DL I+FG+DKAT  R +  ++MAS T  D  +DD      D  IP+PH  E   GED+ +T  S     
Subjt:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP

Query:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALW
          S  SKKRR   G+L + FR+ M+  S  I KIA W
Subjt:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALW

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]8.2e-6649.81Show/hide
Query:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQVGFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPF
        M +   K +KH    +EDEVLV+ L+QLV++GGW+ DNGTF+ GYL Q   + +   P      GFGWN+ +KCIE EK +FDDWVKGH +A+GL NKPF
Subjt:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQVGFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPF

Query:  PFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRCA
        P+F DL ++FG+D+AT  R +TP++M+S T  D  +DD      D  IP+PH  E   GED+ +T  S       S  SKKRR   G+L + FR+ MR  
Subjt:  PFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRCA

Query:  STHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYP
        S  I KIA W +E  E+ES+  K+L  ELQ+IPGMD+DDCL +AE+LL D T  H+FLDYP
Subjt:  STHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYP

XP_008455678.1 PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo]5.9e-4846.41Show/hide
Query:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK
        M +   K +KH    ++D+ LV+ L+QLV++GGW+ +N TF+P YLVQVQKLMKEK+P S+IQV                            FGWN+ERK
Subjt:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK

Query:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP
        CIE EKS+FDDWVKGH +ARGL NKPF +F DL I+FG+DKAT  R +  ++MAS T  D  +DD      D  IP+PH  E   GED+ +T  S     
Subjt:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP

Query:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALW
          S  SKKRR   G+L + FR+ M+  S  I KIA W
Subjt:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALW

TrEMBL top hitse value%identityAlignment
A0A1S3C252 uncharacterized protein At2g29880-like2.9e-4846.41Show/hide
Query:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK
        M +   K +KH    ++D+ LV+ L+QLV++GGW+ +N TF+P YLVQVQKLMKEK+P S+IQV                            FGWN+ERK
Subjt:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK

Query:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP
        CIE EKS+FDDWVKGH +ARGL NKPF +F DL I+FG+DKAT  R +  ++MAS T  D  +DD      D  IP+PH  E   GED+ +T  S     
Subjt:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP

Query:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALW
          S  SKKRR   G+L + FR+ M+  S  I KIA W
Subjt:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALW

A0A5A7U7F7 Retrotransposon protein5.2e-5844.74Show/hide
Query:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK
        M +   K +KH R    DEVLV+ L+QLV++GGW+ DNGTF+ GYLVQVQKLMKEK+ GS+IQV                           GFGWN+ERK
Subjt:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK

Query:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP
        CIE EKS+FDDWVK                                          T  DI +DD      D  IP+PH  E   GED+ +T  S     
Subjt:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP

Query:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYPPEWKYKCCMRIL
          S  SKKRR   G+L + FR+ MR  S  I KIA W +E  E+ES+  K+L  +LQ+IPGMD+DDCL +AE+LL D T  H+FLDYP EWKY+ CMRIL
Subjt:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYPPEWKYKCCMRIL

Query:  GRQP
        GRQP
Subjt:  GRQP

A0A5A7VE44 Retrotransposon protein3.1e-5041.37Show/hide
Query:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQV-QKLMKEKLPGSDIQVGFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKP
        M +   K +KH    +EDE LV+ L+QLV++GGW+ DN TF+PGYL  V  K+M+           FGWN+ERKCIE EKS+FDDWVKGH +ARGL NK 
Subjt:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQV-QKLMKEKLPGSDIQVGFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKP

Query:  FPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRC
        FP+F DL ++FG+D+AT  R +TP+ M S    D  +DD      D  IP+PH  +   GED+++T  S       S  SKKRR   G+L   F +    
Subjt:  FPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRC

Query:  ASTHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYPPEWKYKCCMRILGRQP
                                                     E+LL D T  H+FLDYP EWKY+ CMRILGRQP
Subjt:  ASTHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYPPEWKYKCCMRILGRQP

A0A5D3BC95 Retrotransposon protein2.9e-4846.41Show/hide
Query:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK
        M +   K +KH    ++D+ LV+ L+QLV++GGW+ +N TF+P YLVQVQKLMKEK+P S+IQV                            FGWN+ERK
Subjt:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQV---------------------------GFGWNDERK

Query:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP
        CIE EKS+FDDWVKGH +ARGL NKPF +F DL I+FG+DKAT  R +  ++MAS T  D  +DD      D  IP+PH  E   GED+ +T  S     
Subjt:  CIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGP

Query:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALW
          S  SKKRR   G+L + FR+ M+  S  I KIA W
Subjt:  VGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALW

A0A5D3C7T4 Uncharacterized protein4.0e-6649.81Show/hide
Query:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQVGFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPF
        M +   K +KH    +EDEVLV+ L+QLV++GGW+ DNGTF+ GYL Q   + +   P      GFGWN+ +KCIE EK +FDDWVKGH +A+GL NKPF
Subjt:  MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQVGFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPF

Query:  PFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRCA
        P+F DL ++FG+D+AT  R +TP++M+S T  D  +DD      D  IP+PH  E   GED+ +T  S       S  SKKRR   G+L + FR+ MR  
Subjt:  PFFDDLAIIFGKDKATDTRAETPIDMASVTGTDI-DDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRCA

Query:  STHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYP
        S  I KIA W +E  E+ES+  K+L  ELQ+IPGMD+DDCL +AE+LL D T  H+FLDYP
Subjt:  STHIEKIALWPKENHEMESARRKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G02210.1 unknown protein6.0e-0631.91Show/hide
Query:  GFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFG
        GF W++ER+ +  + +++ D++K H  AR    +P P++ DL ++ G
Subjt:  GFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFG

AT4G02210.2 unknown protein6.0e-0631.91Show/hide
Query:  GFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFG
        GF W++ER+ +  + +++ D++K H  AR    +P P++ DL ++ G
Subjt:  GFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFG

AT5G27260.1 unknown protein8.7e-0524.18Show/hide
Query:  GFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDIDDDTSPEFHDLPIPDPHVSETVFGEDIST-
        GFGW+   K       ++ D++K H + + LR   F FFD+L IIFG+  AT   A    D          ++   E+ D    + +  +T    + S  
Subjt:  GFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIFGKDKATDTRAETPIDMASVTGTDIDDDTSPEFHDLPIPDPHVSETVFGEDIST-

Query:  ----TSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALWPKENHEMESARRK-QLCGELQSIPGMDMDDCL
             S+  ++ P   +  +KR       ++   S M   S+ I  I    +E  + E A++K  +   ++ I   D+D+C+
Subjt:  ----TSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALWPKENHEMESARRK-QLCGELQSIPGMDMDDCL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAACTCCTAAAGAAAAAGGTTCTAAGCATGTACGAAATATGATGGAGGACGAGGTGTTAGTCCAAAGTTTGATGCAACTTGTTCAACAAGGTGGGTGGAAAGGTGA
TAATGGGACTTTTAGGCCTGGATATCTAGTTCAAGTACAAAAATTAATGAAGGAAAAGCTACCAGGAAGCGACATACAGGTTGGTTTTGGATGGAATGACGAGCGAAAGT
GTATTGAAGAAGAGAAATCAATTTTTGATGACTGGGTTAAGGGTCACCTCCACGCCCGAGGTCTCAGGAACAAACCATTTCCATTCTTCGATGACCTAGCAATCATCTTC
GGGAAAGACAAGGCAACCGACACGAGGGCGGAGACTCCGATTGACATGGCCTCAGTGACAGGCACAGACATAGACGATGACACTAGTCCAGAGTTCCATGATCTCCCTAT
ACCTGATCCACATGTATCTGAAACTGTGTTTGGGGAGGACATATCTACCACATCTAACTCGAGGAATGATGGGCCAGTAGGATCCATAGTGAGTAAGAAAAGGCGCATGG
TGCATGGGGAACTCACTGAGGTTTTTCGCTCAGAGATGCGATGTGCCTCGACCCATATCGAGAAGATCGCCCTTTGGCCCAAAGAGAATCATGAAATGGAGTCTGCTCGA
CGTAAACAGTTGTGTGGAGAACTTCAATCTATCCCTGGAATGGATATGGATGATTGTTTAGCCATAGCCGAGACTTTACTAGCTGACACCACTAAATTTCACTCCTTCCT
TGACTACCCACCGGAGTGGAAGTATAAGTGTTGCATGCGGATCTTAGGAAGACAACCAGGGCCCTCCTCTTCGGGATCATCTCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAACTCCTAAAGAAAAAGGTTCTAAGCATGTACGAAATATGATGGAGGACGAGGTGTTAGTCCAAAGTTTGATGCAACTTGTTCAACAAGGTGGGTGGAAAGGTGA
TAATGGGACTTTTAGGCCTGGATATCTAGTTCAAGTACAAAAATTAATGAAGGAAAAGCTACCAGGAAGCGACATACAGGTTGGTTTTGGATGGAATGACGAGCGAAAGT
GTATTGAAGAAGAGAAATCAATTTTTGATGACTGGGTTAAGGGTCACCTCCACGCCCGAGGTCTCAGGAACAAACCATTTCCATTCTTCGATGACCTAGCAATCATCTTC
GGGAAAGACAAGGCAACCGACACGAGGGCGGAGACTCCGATTGACATGGCCTCAGTGACAGGCACAGACATAGACGATGACACTAGTCCAGAGTTCCATGATCTCCCTAT
ACCTGATCCACATGTATCTGAAACTGTGTTTGGGGAGGACATATCTACCACATCTAACTCGAGGAATGATGGGCCAGTAGGATCCATAGTGAGTAAGAAAAGGCGCATGG
TGCATGGGGAACTCACTGAGGTTTTTCGCTCAGAGATGCGATGTGCCTCGACCCATATCGAGAAGATCGCCCTTTGGCCCAAAGAGAATCATGAAATGGAGTCTGCTCGA
CGTAAACAGTTGTGTGGAGAACTTCAATCTATCCCTGGAATGGATATGGATGATTGTTTAGCCATAGCCGAGACTTTACTAGCTGACACCACTAAATTTCACTCCTTCCT
TGACTACCCACCGGAGTGGAAGTATAAGTGTTGCATGCGGATCTTAGGAAGACAACCAGGGCCCTCCTCTTCGGGATCATCTCACTAG
Protein sequenceShow/hide protein sequence
MTTPKEKGSKHVRNMMEDEVLVQSLMQLVQQGGWKGDNGTFRPGYLVQVQKLMKEKLPGSDIQVGFGWNDERKCIEEEKSIFDDWVKGHLHARGLRNKPFPFFDDLAIIF
GKDKATDTRAETPIDMASVTGTDIDDDTSPEFHDLPIPDPHVSETVFGEDISTTSNSRNDGPVGSIVSKKRRMVHGELTEVFRSEMRCASTHIEKIALWPKENHEMESAR
RKQLCGELQSIPGMDMDDCLAIAETLLADTTKFHSFLDYPPEWKYKCCMRILGRQPGPSSSGSSH