; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014870 (gene) of Snake gourd v1 genome

Gene IDTan0014870
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposon protein
Genome locationLG02:81499425..81500468
RNA-Seq ExpressionTan0014870
SyntenyTan0014870
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050106.1 retrotransposon protein [Cucumis melo var. makuwa]7.1e-5641.96Show/hide
Query:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKLF-------------------------------------NGFGWNDERK
        MA+T++K +KH WT++ DEVLV+CLL +V++GGWRADN TF+ GYLVQVQKL                                      +GFGWN+ERK
Subjt:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKLF-------------------------------------NGFGWNDERK

Query:  CIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRGNQCDGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDII
        CIEAEK++FDDWVK    AR +                       + D+++  +D  I +PH  +P S               GSS  SK+R+   GD++
Subjt:  CIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRGNQCDGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDII

Query:  GVFRTEMHWASTQLERIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE
          FR  M   S ++ +I  W +EK E+ES+  KRLY +LQ IPG+D+DDCL +AE+LL D +  H+FLDYPAEWKY+ CMRILGR+
Subjt:  GVFRTEMHWASTQLERIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE

KAA0063789.1 retrotransposon protein [Cucumis melo var. makuwa]1.0e-4939.48Show/hide
Query:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQV-----QKLFNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDE
        MA+T++K +KH WT++EDE LV+CLL +V++GGWRADN+TF+PGYL  V     +  ++ FGWN+ERKCIEAEK++FDDWVK H +ARGL NK FPYF +
Subjt:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQV-----QKLFNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDE

Query:  LSIIFGKDR------------GNQC-----DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLE
        L ++FG+DR            G+Q      + D+++  +D  I +PH   P S    T          GSS  SK+R+   GD++  F            
Subjt:  LSIIFGKDR------------GNQC-----DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLE

Query:  RIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE
                                               E+LL D +  H+FLDYPAEWKY+ CMRILGR+
Subjt:  RIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE

KAA0065306.1 retrotransposon protein [Cucumis melo var. makuwa]2.5e-4540.08Show/hide
Query:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF
        MA+ ++K +KH WT++EDEVLV+CLL +V++GGWRADN TF+  YL Q   +        + FGWN+ERK                  A G R K     
Subjt:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF

Query:  DELSIIFGKDRGNQC-DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELES
            +  G        + D+++  +D  I +PH  +P S               G    SK+R+   GD++  FR  M   S ++ +I  W +EK E+ES
Subjt:  DELSIIFGKDRGNQC-DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELES

Query:  TRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE
        +  KRLY ELQ IPG+D+DDCL +AE+LL D +  H+FLDYPAEWKY+ CMRILGR+
Subjt:  TRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE

TYK06362.1 retrotransposon protein [Cucumis melo var. makuwa]7.6e-5042.19Show/hide
Query:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF
        MA+T++K +KH WT++EDEVLV CLL +V++GGWRADN TF+PGYL Q   +        +GFGWN+ERKCIEAEK++FDDWVK HP ARGL NKPFPYF
Subjt:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF

Query:  DELSIIFGKDRGNQCDGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELEST
         +L ++FG+DR           + D+P  +PH  +P S               GSS  SK+R+   GD++  FR                          
Subjt:  DELSIIFGKDRGNQCDGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELEST

Query:  RPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE
                                E+LL D +  H+FLDYPAEWKY+ CMRILGR+
Subjt:  RPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]1.2e-6346.9Show/hide
Query:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF
        MA+T++K +KH WT++EDEVLV+CLL +V++GGWRADN TF+ GYL Q   +        +GFGWN+ +KCIE EK +FDDWVK HP+A+GL NKPFPYF
Subjt:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF

Query:  DELSIIFGKDR--GNQC---------------DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQ
         +L ++FG+DR  G +C               + D+++  +D  I +PH  +P S           T   GSS  SK+R+   GD++  FR  M   S +
Subjt:  DELSIIFGKDR--GNQC---------------DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQ

Query:  LERIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYP
        + +I  W +EK E+ES+  KRLYAELQ IPG+D+DDCL +AE+LL D +  H+FLDYP
Subjt:  LERIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYP

TrEMBL top hitse value%identityAlignment
A0A5A7U7F7 Retrotransposon protein3.5e-5641.96Show/hide
Query:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKLF-------------------------------------NGFGWNDERK
        MA+T++K +KH WT++ DEVLV+CLL +V++GGWRADN TF+ GYLVQVQKL                                      +GFGWN+ERK
Subjt:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKLF-------------------------------------NGFGWNDERK

Query:  CIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRGNQCDGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDII
        CIEAEK++FDDWVK    AR +                       + D+++  +D  I +PH  +P S               GSS  SK+R+   GD++
Subjt:  CIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRGNQCDGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDII

Query:  GVFRTEMHWASTQLERIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE
          FR  M   S ++ +I  W +EK E+ES+  KRLY +LQ IPG+D+DDCL +AE+LL D +  H+FLDYPAEWKY+ CMRILGR+
Subjt:  GVFRTEMHWASTQLERIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE

A0A5A7VE44 Retrotransposon protein4.8e-5039.48Show/hide
Query:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQV-----QKLFNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDE
        MA+T++K +KH WT++EDE LV+CLL +V++GGWRADN+TF+PGYL  V     +  ++ FGWN+ERKCIEAEK++FDDWVK H +ARGL NK FPYF +
Subjt:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQV-----QKLFNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDE

Query:  LSIIFGKDR------------GNQC-----DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLE
        L ++FG+DR            G+Q      + D+++  +D  I +PH   P S    T          GSS  SK+R+   GD++  F            
Subjt:  LSIIFGKDR------------GNQC-----DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLE

Query:  RIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE
                                               E+LL D +  H+FLDYPAEWKY+ CMRILGR+
Subjt:  RIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE

A0A5A7VG45 Retrotransposon protein1.2e-4540.08Show/hide
Query:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF
        MA+ ++K +KH WT++EDEVLV+CLL +V++GGWRADN TF+  YL Q   +        + FGWN+ERK                  A G R K     
Subjt:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF

Query:  DELSIIFGKDRGNQC-DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELES
            +  G        + D+++  +D  I +PH  +P S               G    SK+R+   GD++  FR  M   S ++ +I  W +EK E+ES
Subjt:  DELSIIFGKDRGNQC-DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELES

Query:  TRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE
        +  KRLY ELQ IPG+D+DDCL +AE+LL D +  H+FLDYPAEWKY+ CMRILGR+
Subjt:  TRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE

A0A5D3C542 Retrotransposon protein3.7e-5042.19Show/hide
Query:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF
        MA+T++K +KH WT++EDEVLV CLL +V++GGWRADN TF+PGYL Q   +        +GFGWN+ERKCIEAEK++FDDWVK HP ARGL NKPFPYF
Subjt:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF

Query:  DELSIIFGKDRGNQCDGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELEST
         +L ++FG+DR           + D+P  +PH  +P S               GSS  SK+R+   GD++  FR                          
Subjt:  DELSIIFGKDRGNQCDGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELEST

Query:  RPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE
                                E+LL D +  H+FLDYPAEWKY+ CMRILGR+
Subjt:  RPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYPAEWKYKCCMRILGRE

A0A5D3C7T4 Uncharacterized protein5.9e-6446.9Show/hide
Query:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF
        MA+T++K +KH WT++EDEVLV+CLL +V++GGWRADN TF+ GYL Q   +        +GFGWN+ +KCIE EK +FDDWVK HP+A+GL NKPFPYF
Subjt:  MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKL-------FNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYF

Query:  DELSIIFGKDR--GNQC---------------DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQ
         +L ++FG+DR  G +C               + D+++  +D  I +PH  +P S           T   GSS  SK+R+   GD++  FR  M   S +
Subjt:  DELSIIFGKDR--GNQC---------------DGDINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQ

Query:  LERIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYP
        + +I  W +EK E+ES+  KRLYAELQ IPG+D+DDCL +AE+LL D +  H+FLDYP
Subjt:  LERIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETLLADISKFHSFLDYP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G24960.1 unknown protein7.2e-0640.43Show/hide
Query:  NGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIF
        +GF W++ R  I A+ A++D ++K HP AR  R K  P +++L  IF
Subjt:  NGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIF

AT2G24960.2 unknown protein3.2e-0640Show/hide
Query:  NGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKD
        NGF W+  R  + A+  I++ +++AHP AR  R K  P +  L  IFGK+
Subjt:  NGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKD

AT4G02210.1 unknown protein3.8e-0736.21Show/hide
Query:  NGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRG---NQC
        +GF W++ER+ + A+  ++ D++KAH  AR    +P PY+ +L ++ G D G   N+C
Subjt:  NGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRG---NQC

AT4G02210.2 unknown protein3.8e-0736.21Show/hide
Query:  NGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRG---NQC
        +GF W++ER+ + A+  ++ D++KAH  AR    +P PY+ +L ++ G D G   N+C
Subjt:  NGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRG---NQC

AT5G27260.1 unknown protein1.1e-0925.4Show/hide
Query:  VQVQKLFNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIF------GKDRGNQCDGDINMTFQ--DLP-------IHDPHAYDPTS
        + +Q+  +GFGW+   K   A   ++ D++KAHP+ + LR   F +FDEL IIF      GK+    CD    +T++  + P         + + YD T+
Subjt:  VQVQKLFNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIF------GKDRGNQCDGDINMTFQ--DLP-------IHDPHAYDPTS

Query:  ARICTPHLYPRTMGRGSSSGSK----RRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELESTRPK-RLYAELQVIPGIDMDDCLQ
            + H Y   M  G+S   K    +R   +        + M   S+++  I+   +E+ + E  + K  ++  ++ I   D+D+C++
Subjt:  ARICTPHLYPRTMGRGSSSGSK----RRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELESTRPK-RLYAELQVIPGIDMDDCLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAATACAAGTACAAAGAATTCTAAACACATGTGGACTTCAGTGGAGGATGAGGTATTGGTTCAGTGCCTACTACATGTCGTGCAACAGGGGGGGTGGAGAGCTGA
TAATGACACATTTCGACCTGGGTACTTAGTACAAGTACAAAAATTGTTTAATGGGTTTGGGTGGAATGACGAACGTAAGTGCATTGAGGCAGAGAAAGCAATTTTCGATG
ACTGGGTTAAGGCACACCCTCATGCTCGGGGTCTTAGGAACAAGCCATTTCCATACTTCGACGAGTTATCAATTATATTCGGTAAAGACAGGGGCAATCAGTGCGATGGC
GACATCAACATGACTTTTCAAGATCTCCCAATCCACGACCCACACGCATACGACCCAACATCGGCGAGGATATGTACGCCACACCTATATCCGAGAACGATGGGGCGGGG
ATCATCAAGTGGGTCGAAGAGACGCAAAGTGAAACAAGGGGACATTATTGGCGTATTTCGTACAGAGATGCATTGGGCGTCAACACAACTAGAGAGAATTGTCTTGTGGC
CTAAAGAGAAGGATGAACTAGAGTCGACCCGACCCAAACGACTATATGCAGAACTTCAAGTTATCCCTGGTATAGATATGGATGATTGTTTACAGATTGCTGAGACTCTG
TTGGCCGATATATCCAAATTCCACTCATTCCTCGACTACCCAGCTGAATGGAAATACAAATGTTGCATGCGTATCTTGGGAAGGGAGGCATGTTCCTCCATTAATCTATC
TTTGGTTGGACCCTTATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAATACAAGTACAAAGAATTCTAAACACATGTGGACTTCAGTGGAGGATGAGGTATTGGTTCAGTGCCTACTACATGTCGTGCAACAGGGGGGGTGGAGAGCTGA
TAATGACACATTTCGACCTGGGTACTTAGTACAAGTACAAAAATTGTTTAATGGGTTTGGGTGGAATGACGAACGTAAGTGCATTGAGGCAGAGAAAGCAATTTTCGATG
ACTGGGTTAAGGCACACCCTCATGCTCGGGGTCTTAGGAACAAGCCATTTCCATACTTCGACGAGTTATCAATTATATTCGGTAAAGACAGGGGCAATCAGTGCGATGGC
GACATCAACATGACTTTTCAAGATCTCCCAATCCACGACCCACACGCATACGACCCAACATCGGCGAGGATATGTACGCCACACCTATATCCGAGAACGATGGGGCGGGG
ATCATCAAGTGGGTCGAAGAGACGCAAAGTGAAACAAGGGGACATTATTGGCGTATTTCGTACAGAGATGCATTGGGCGTCAACACAACTAGAGAGAATTGTCTTGTGGC
CTAAAGAGAAGGATGAACTAGAGTCGACCCGACCCAAACGACTATATGCAGAACTTCAAGTTATCCCTGGTATAGATATGGATGATTGTTTACAGATTGCTGAGACTCTG
TTGGCCGATATATCCAAATTCCACTCATTCCTCGACTACCCAGCTGAATGGAAATACAAATGTTGCATGCGTATCTTGGGAAGGGAGGCATGTTCCTCCATTAATCTATC
TTTGGTTGGACCCTTATGTTGA
Protein sequenceShow/hide protein sequence
MANTSTKNSKHMWTSVEDEVLVQCLLHVVQQGGWRADNDTFRPGYLVQVQKLFNGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSIIFGKDRGNQCDG
DINMTFQDLPIHDPHAYDPTSARICTPHLYPRTMGRGSSSGSKRRKVKQGDIIGVFRTEMHWASTQLERIVLWPKEKDELESTRPKRLYAELQVIPGIDMDDCLQIAETL
LADISKFHSFLDYPAEWKYKCCMRILGREACSSINLSLVGPLC