; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg034577 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg034577
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold4:11640274..11647804
RNA-Seq ExpressionSpg034577
SyntenySpg034577
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]8.1e-2733.95Show/hide
Query:  PRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLS
        P F+   I   GW QFC         +VREFYAN+ D  +  V ++ V V ++   IN++F L++     + +     +++QL   + EV IEGA W++S
Subjt:  PRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLS

Query:  KTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDC-WRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLI
             T     LK  A  W  F+  R +P+TH  TV++DRVLL ++IL  +S+++ +I   EI  C   +K G L+FP+ IT L  +A VP  +D+  + 
Subjt:  KTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDC-WRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLI

Query:  DKGIIDTPNLARLQR
        + G I T +++R+ +
Subjt:  DKGIIDTPNLARLQR

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]4.4e-3338.98Show/hide
Query:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNL
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA  E     +VREFYAN+ D  E  V +RGV V WS   INA+F L
Subjt:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNL

Query:  QDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E +   +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.2e-3339.36Show/hide
Query:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNL
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA  E     +VREFYAN+ D EE  V +RGV V WS   INA+F L
Subjt:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNL

Query:  QDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E +   +   L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]4.3e-2834.96Show/hide
Query:  RFINNLARAKYVEMLR-------RDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQ
        +F +  A  +Y E ++       ++F+++     + P F+   I+   W  FCA  E     +VREFY N+ + ++  V IRGV V  S   IN +F+L 
Subjt:  RFINNLARAKYVEMLR-------RDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQ

Query:  DFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL
        D P    +E V   +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++ V L +++L   SI+VG++I  EI 
Subjt:  DFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL

Query:  DCWRKKVGKLFFPNTITMLCRRAGVP
         C  +K G LFFP+ IT +CR    P
Subjt:  DCWRKKVGKLFFPNTITMLCRRAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]9.5e-2843.01Show/hide
Query:  IVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQD--FPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK
        +VREFYAN+ D EE  + +RGV V WS   INA+F L D    H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQD--FPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK

Query:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
         RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+  TQE
Subjt:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.1e-3338.98Show/hide
Query:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNL
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA  E     +VREFYAN+ D  E  V +RGV V WS   INA+F L
Subjt:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNL

Query:  QDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E +   +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)5.6e-3439.36Show/hide
Query:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNL
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA  E     +VREFYAN+ D EE  V +RGV V WS   INA+F L
Subjt:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNL

Query:  QDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E +   +   L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL

A0A2P5DAQ2 Uncharacterized protein2.1e-2834.96Show/hide
Query:  RFINNLARAKYVEMLR-------RDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQ
        +F +  A  +Y E ++       ++F+++     + P F+   I+   W  FCA  E     +VREFY N+ + ++  V IRGV V  S   IN +F+L 
Subjt:  RFINNLARAKYVEMLR-------RDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQ

Query:  DFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL
        D P    +E V   +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++ V L +++L   SI+VG++I  EI 
Subjt:  DFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL

Query:  DCWRKKVGKLFFPNTITMLCRRAGVP
         C  +K G LFFP+ IT +CR    P
Subjt:  DCWRKKVGKLFFPNTITMLCRRAGVP

A0A2P5DXM3 Uncharacterized protein4.6e-2843.01Show/hide
Query:  IVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQD--FPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK
        +VREFYAN+ D EE  + +RGV V WS   INA+F L D    H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQD--FPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK

Query:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
         RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+  TQE
Subjt:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

W9QTD9 Uncharacterized protein3.9e-2733.95Show/hide
Query:  PRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLS
        P F+   I   GW QFC         +VREFYAN+ D  +  V ++ V V ++   IN++F L++     + +     +++QL   + EV IEGA W++S
Subjt:  PRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLS

Query:  KTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDC-WRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLI
             T     LK  A  W  F+  R +P+TH  TV++DRVLL ++IL  +S+++ +I   EI  C   +K G L+FP+ IT L  +A VP  +D+  + 
Subjt:  KTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDC-WRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLI

Query:  DKGIIDTPNLARLQR
        + G I T +++R+ +
Subjt:  DKGIIDTPNLARLQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACGAGAGCGAGAAAAGAAAGGGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTGAAGTGCAGAAAAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGC
GACTGTTACTGCCACAGTAGAAGAAGAAAGCCCGAAACAACCAGAGGAAAATACCGAGGAAAATACCGAGCAGAGGGTCGCGGATACAGAAGTTCAAGAGGAGCGAACCG
AGGAAGTTCGAGAAGAAATTACAGAGGAAGTTCAAGAAAAGCAGGCCGAGGATGTACAGATGCAACAGGCAAAAGATGTTCAGGTAACGGATACTGAGCCAGTGCAGGAG
GCTCAAGTGGAGGTGATCATGCCAGAGGTACCAAAACCTCGCCGCGTTAAGAGGAAAGCAGGCCGCGCTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCGACCACTTA
TTCTGAAAGAGAAAATGCAAGAAGAGAGGAACGGGAAAAGAAGGAAGCTGAGGACAAGGCAAGAGAAGAAGAAGCAAAGAAAGCGGAAGAAGAGATTTTGCTCAAGCGAA
GGGCGGAAAAGGGCAAAAGTGTGGCTGAAGCATCGGAAGAACCTGACGAGATTGAGGAATCGAGATTTCCGTACAATCGCTTCATCAATAACCTTGCTCGGGCAAAGTAT
GTTGAGATGCTGAGAAGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATCTGCCACGGTTCTTGAGGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTGTGCAAA
GTCGGAACCTGTTAATTCAAACATTGTTCGAGAATTTTACGCCAATATTGACGATCACGAAGAATTTCAGGTTATCATTCGAGGAGTGCCCGTTGACTGGAGCCCAGGAG
ACATTAATGCTTTGTTCAACCTCCAGGACTTTCCACACGCAGGCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAACTAAATGCGGCTGTCCGAGAGGTTGGCATT
GAGGGGGCCCAGTGGAGACTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAGAGCGAGGCCAATACATGGATGGGTTTCATCAAGCTGCGCTTACTGCC
GACAACTCACGACTCAACGGTGTCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAAGATAATTTCTTCTGAGATTCTTGATT
GCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGGATGAGGATGATGTGCCACTAATAGACAAGGGA
ATAATTGACACACCAAATCTAGCTAGGCTTCAGAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTCCAA
AACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCGGGAGTTTGAGGCATGGGTATACTGCACCATGAAGTGGGTCATCCCGTGCTTGA
GAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAAAATGAAAATATAAACCCCTTGAAGATGAGTTTTGAATGTCTGATGATAGAGCTAGGCTGTGCAGAGCTTGGTTTT
GCAGAGTGCTCAGGGAAGGTTGAATATGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAGAACTGCCACCATGAGCTGGCAGAGGCAGA
TTGGTGGCTGGACCTGTTTTCAACGTGGAAGGCTTCATTTAATGAGGTGTTATTCCCAGCTGGTGTAAAAAGCTATTCTGATCTTTTAGTGGCTGGAGAGGAAGCAAGTT
GCGGTTGGGGAGATGAAATTCAGAGCAAAAACAAAGGAAAAGAGGAAGGAGCTGCTGTGGAGTCCTTATTCAGCTTGGGAAAGCTGGATTTGAGCTTTGGAGCTTCTTCC
TTAACCGTTTCTTCTTCTCTTTATCTTCCCATTGGTATTGGTTGTGTAAAAAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACGAGAGCGAGAAAAGAAAGGGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTGAAGTGCAGAAAAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGC
GACTGTTACTGCCACAGTAGAAGAAGAAAGCCCGAAACAACCAGAGGAAAATACCGAGGAAAATACCGAGCAGAGGGTCGCGGATACAGAAGTTCAAGAGGAGCGAACCG
AGGAAGTTCGAGAAGAAATTACAGAGGAAGTTCAAGAAAAGCAGGCCGAGGATGTACAGATGCAACAGGCAAAAGATGTTCAGGTAACGGATACTGAGCCAGTGCAGGAG
GCTCAAGTGGAGGTGATCATGCCAGAGGTACCAAAACCTCGCCGCGTTAAGAGGAAAGCAGGCCGCGCTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCGACCACTTA
TTCTGAAAGAGAAAATGCAAGAAGAGAGGAACGGGAAAAGAAGGAAGCTGAGGACAAGGCAAGAGAAGAAGAAGCAAAGAAAGCGGAAGAAGAGATTTTGCTCAAGCGAA
GGGCGGAAAAGGGCAAAAGTGTGGCTGAAGCATCGGAAGAACCTGACGAGATTGAGGAATCGAGATTTCCGTACAATCGCTTCATCAATAACCTTGCTCGGGCAAAGTAT
GTTGAGATGCTGAGAAGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATCTGCCACGGTTCTTGAGGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTGTGCAAA
GTCGGAACCTGTTAATTCAAACATTGTTCGAGAATTTTACGCCAATATTGACGATCACGAAGAATTTCAGGTTATCATTCGAGGAGTGCCCGTTGACTGGAGCCCAGGAG
ACATTAATGCTTTGTTCAACCTCCAGGACTTTCCACACGCAGGCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAACTAAATGCGGCTGTCCGAGAGGTTGGCATT
GAGGGGGCCCAGTGGAGACTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAGAGCGAGGCCAATACATGGATGGGTTTCATCAAGCTGCGCTTACTGCC
GACAACTCACGACTCAACGGTGTCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAAGATAATTTCTTCTGAGATTCTTGATT
GCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGGATGAGGATGATGTGCCACTAATAGACAAGGGA
ATAATTGACACACCAAATCTAGCTAGGCTTCAGAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTCCAA
AACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCGGGAGTTTGAGGCATGGGTATACTGCACCATGAAGTGGGTCATCCCGTGCTTGA
GAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAAAATGAAAATATAAACCCCTTGAAGATGAGTTTTGAATGTCTGATGATAGAGCTAGGCTGTGCAGAGCTTGGTTTT
GCAGAGTGCTCAGGGAAGGTTGAATATGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAGAACTGCCACCATGAGCTGGCAGAGGCAGA
TTGGTGGCTGGACCTGTTTTCAACGTGGAAGGCTTCATTTAATGAGGTGTTATTCCCAGCTGGTGTAAAAAGCTATTCTGATCTTTTAGTGGCTGGAGAGGAAGCAAGTT
GCGGTTGGGGAGATGAAATTCAGAGCAAAAACAAAGGAAAAGAGGAAGGAGCTGCTGTGGAGTCCTTATTCAGCTTGGGAAAGCTGGATTTGAGCTTTGGAGCTTCTTCC
TTAACCGTTTCTTCTTCTCTTTATCTTCCCATTGGTATTGGTTGTGTAAAAAGCTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERENEEEEVPVTPEVQKRAEEQEKATEVATVTATVEEESPKQPEENTEENTEQRVADTEVQEERTEEVREEITEEVQEKQAEDVQMQQAKDVQVTDTEPVQE
AQVEVIMPEVPKPRRVKRKAGRARVVRTDTPSPPTTYSERENARREEREKKEAEDKAREEEAKKAEEEILLKRRAEKGKSVAEASEEPDEIEESRFPYNRFINNLARAKY
VEMLRRDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKSEPVNSNIVREFYANIDDHEEFQVIIRGVPVDWSPGDINALFNLQDFPHAGFNEMVVAPSNDQLNAAVREVGI
EGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKG
IIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQLHSKTFCLSIFSGLVVAAAKKIREFEAWVYCTMKWVIPCLRAYDCRAALSLKNENINPLKMSFECLMIELGCAELGF
AECSGKVEYVAGRLEGANSVLQQNWEQNCHHELAEADWWLDLFSTWKASFNEVLFPAGVKSYSDLLVAGEEASCGWGDEIQSKNKGKEEGAAVESLFSLGKLDLSFGASS
LTVSSSLYLPIGIGCVKS