; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004995 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004995
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold4:10072712..10077126
RNA-Seq ExpressionSpg004995
SyntenySpg004995
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]5.1e-2635.55Show/hide
Query:  QFCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKS
        QFC  P      +VREFYAN+ D  +  V V+ V V ++  AIN++F L++     + +     +++QL   + EV IEGA W++S     T     LK 
Subjt:  QFCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKS

Query:  EANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDC-WRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQ
         A  W  F+  R +P+TH  TV++DRVLL ++IL  +S+++  I   EI  C   +K G L+FP+ IT L  +A VP  +D+  + + G I T +++R+ 
Subjt:  EANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDC-WRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQ

Query:  --RTQEARQGE
          R   A +GE
Subjt:  --RTQEARQGE

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.2e-3040.47Show/hide
Query:  EEEAKKAEEEILLKRR---AEKGKSVAEASEEPDEIEESRFPYNRFVNNLARANQFCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINAL
        E EA +   E  ++ R   AEKG  V + SE   ++     P+   V       QFCA PE     +VREFYAN+ D  E  V VRGV V WS  AINA+
Subjt:  EEEAKKAEEEILLKRR---AEKGKSVAEASEEPDEIEESRFPYNRFVNNLARANQFCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINAL

Query:  FNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIIS
        F L D P    +E +   +   L   ++ V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS+DR+LL  ++L   SI+VGR+I 
Subjt:  FNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIIS

Query:  SEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
        SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  SEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.8e-2938.89Show/hide
Query:  ERENARREEQEKKEAEDKAR-----EEEAKKAEEEILLKRR---AEKGKSVAEASEEPDEIEESRFPYNRFVNNLARANQFCAKPEPVNSNIVREFYANI
        ER +  R     K    KA      E EA     E  ++ R   AEKG  V + SE   ++     P+   V       QFCA PE     +VREFYAN+
Subjt:  ERENARREEQEKKEAEDKAR-----EEEAKKAEEEILLKRR---AEKGKSVAEASEEPDEIEESRFPYNRFVNNLARANQFCAKPEPVNSNIVREFYANI

Query:  DDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDST
         D EE  V VRGV V WS  AINA+F L D P    +E +   +   L   ++ V   GA+W +S     T   + L   A  W  F+K RLLPTTH  T
Subjt:  DDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDST

Query:  VSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQEARQGET
        VS+DR+LL  ++L   SI+VGR+I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+     A++G T
Subjt:  VSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQEARQGET

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]5.1e-2641.14Show/hide
Query:  FCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSE
        FCA PE     +VREFY N+ + ++  V +RGV V  S  AIN +F+L D P    +E V   +  +L   ++ V I GA+W +S     T   + L   
Subjt:  FCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSE

Query:  ANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVP
        A  W  F+K RLLPTTH  TVS++ V L +++L   SI+VGR+I  EI  C  +K G LFFP+ IT +CR    P
Subjt:  ANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]3.8e-2944.56Show/hide
Query:  IVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQD--FPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK
        +VREFYAN+ D EE  + VRGV V WS  AINA+F L D    H+ F E +  P   +L   ++ V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQD--FPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK

Query:  LRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
         RLLPTTH   VS+DR+LL  ++L   SI+VGR+I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+  TQE
Subjt:  LRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)5.7e-3140.47Show/hide
Query:  EEEAKKAEEEILLKRR---AEKGKSVAEASEEPDEIEESRFPYNRFVNNLARANQFCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINAL
        E EA +   E  ++ R   AEKG  V + SE   ++     P+   V       QFCA PE     +VREFYAN+ D  E  V VRGV V WS  AINA+
Subjt:  EEEAKKAEEEILLKRR---AEKGKSVAEASEEPDEIEESRFPYNRFVNNLARANQFCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINAL

Query:  FNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIIS
        F L D P    +E +   +   L   ++ V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS+DR+LL  ++L   SI+VGR+I 
Subjt:  FNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIIS

Query:  SEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
        SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  SEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.8e-2938.89Show/hide
Query:  ERENARREEQEKKEAEDKAR-----EEEAKKAEEEILLKRR---AEKGKSVAEASEEPDEIEESRFPYNRFVNNLARANQFCAKPEPVNSNIVREFYANI
        ER +  R     K    KA      E EA     E  ++ R   AEKG  V + SE   ++     P+   V       QFCA PE     +VREFYAN+
Subjt:  ERENARREEQEKKEAEDKAR-----EEEAKKAEEEILLKRR---AEKGKSVAEASEEPDEIEESRFPYNRFVNNLARANQFCAKPEPVNSNIVREFYANI

Query:  DDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDST
         D EE  V VRGV V WS  AINA+F L D P    +E +   +   L   ++ V   GA+W +S     T   + L   A  W  F+K RLLPTTH  T
Subjt:  DDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDST

Query:  VSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQEARQGET
        VS+DR+LL  ++L   SI+VGR+I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+     A++G T
Subjt:  VSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQEARQGET

A0A2P5DAQ2 Uncharacterized protein2.5e-2641.14Show/hide
Query:  FCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSE
        FCA PE     +VREFY N+ + ++  V +RGV V  S  AIN +F+L D P    +E V   +  +L   ++ V I GA+W +S     T   + L   
Subjt:  FCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSE

Query:  ANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVP
        A  W  F+K RLLPTTH  TVS++ V L +++L   SI+VGR+I  EI  C  +K G LFFP+ IT +CR    P
Subjt:  ANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVP

A0A2P5DXM3 Uncharacterized protein1.8e-2944.56Show/hide
Query:  IVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQD--FPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK
        +VREFYAN+ D EE  + VRGV V WS  AINA+F L D    H+ F E +  P   +L   ++ V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQD--FPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK

Query:  LRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
         RLLPTTH   VS+DR+LL  ++L   SI+VGR+I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+  TQE
Subjt:  LRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

W9QTD9 Uncharacterized protein2.5e-2635.55Show/hide
Query:  QFCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKS
        QFC  P      +VREFYAN+ D  +  V V+ V V ++  AIN++F L++     + +     +++QL   + EV IEGA W++S     T     LK 
Subjt:  QFCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKRTFQAAYLKS

Query:  EANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDC-WRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQ
         A  W  F+  R +P+TH  TV++DRVLL ++IL  +S+++  I   EI  C   +K G L+FP+ IT L  +A VP  +D+  + + G I T +++R+ 
Subjt:  EANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDC-WRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQ

Query:  --RTQEARQGE
          R   A +GE
Subjt:  --RTQEARQGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACAAGAGCGAGGAAAGAACGGGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTGAAGTGCAGAAAGTTAAAGCGAAGAAGAAAAGGACCCCGGAGGAGAA
AGAAGCCAAAAGACGTAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGTGACTGTTACTGCCACAGTAGAAGAAGAAAGCCTGAAACAACCAGAGG
AAAATACCGAGGAAAATACCGAGCAGAGGGTCGCGGATACAGAAGAAGAGGAGCGAACAGAAGAGGTTCGAGAAGAAATTACAGAGGAAGTTCAAGAAAAGCAGGCCGAG
GATGTTCAAATGCAACAGGCAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCAAGTGGAGGTGATCATGCCGGAGGTACCAAAGCGTCGCCGCGTTAA
GAGGAAAGCAGGCCGAGCTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCGACCACTGATTCTGAAAGAGAAAATGCAAGAAGAGAGGAACAGGAAAAGAAGGAAGCTG
AGGACAAGGCAAGAGAAGAAGAAGCAAAGAAAGCGGAAGAGGAGATTTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAGCGTGGCTGAAGCATCGGAAGAACCTGACGAG
ATTGAGGAATCGAGATTTCCGTACAATCGCTTCGTCAATAACCTTGCTCGGGCAAATCAATTTTGTGCAAAGCCGGAACCTGTTAATTCCAACATTGTTCGAGAATTTTA
CGCCAATATTGACGATCACGAAGAGTTTCAGGTTATCGTTCGAGGAGTGCCCGTTGACTGGAGCCCAGGAGCCATTAATGCTTTGTTCAACCTCCAGGACTTTCCACACG
CAGGCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAACTAAATGCGGCTGTCCAAGAGGTTGGCATTGAGGGGGCCCAGTGGAGACTGTCGAAGACGGAAAAGCGC
ACATTTCAGGCTGCTTATTTGAAGAGCGAGGCCAATACATGGATGGGTTTCATCAAGCTGCGCTTACTGCCGACAACTCACGACTCAACGGTGTCTCGAGACCGGGTTTT
GCTTGGCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAGGATAATTTCTTCTGAGATTCTTGATTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACA
CTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGGATGAGGATGATGTGCCATTAATAGACAAGGGGATAATTGACACACCAAATCTGGCTAGGCTTCAGAGGACG
CAGGAAGCACGCCAAGGAGAAACCTTTTGCTTGAGCATTTTCTCTGACCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGTGGGTCATCCCGTGCTTAAAAGCTTATGA
CTGTAGGGCTGCTTTAAGTCTGAAAAACAAGAATTTGAACCCCTTGAAAATGTATTTTGATATGTCTGATAATAGAGTTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAA
AAGTGGTGATTATTTGTCCATGCCGGAATAATTATTTTGCTACAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGTTGGGCGACTTGAGGGAGCAAATTCTGTG
CTGCAGCAAAACTGGGAGCAGAACTGCCACGTCACAGCTCGAACCCAACAAGAGCCAAAGGGAAAATTTGAAAAATATACCTCACTGTCCGCTCCTCTTGAACAGGTACT
GGCAGTAGTACAAAATACAGATTTGTTGAAACGAGCAGGAAGGTTGAAGTCAGATCCAGACAAGAGATATAGGAACAAGTACTGCATGTACCATGGGGACCACGACCACA
TAACCCGGGAATGCATACAATTACGAGATGAAATAGAAACCCTAATCAGAGAGGGTTATTTGAAGGAGCTTGCTGGGAATGACAGAGGCAAAAGACCATCACGGCTAGAG
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACAAGAGCGAGGAAAGAACGGGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTGAAGTGCAGAAAGTTAAAGCGAAGAAGAAAAGGACCCCGGAGGAGAA
AGAAGCCAAAAGACGTAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGTGACTGTTACTGCCACAGTAGAAGAAGAAAGCCTGAAACAACCAGAGG
AAAATACCGAGGAAAATACCGAGCAGAGGGTCGCGGATACAGAAGAAGAGGAGCGAACAGAAGAGGTTCGAGAAGAAATTACAGAGGAAGTTCAAGAAAAGCAGGCCGAG
GATGTTCAAATGCAACAGGCAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCAAGTGGAGGTGATCATGCCGGAGGTACCAAAGCGTCGCCGCGTTAA
GAGGAAAGCAGGCCGAGCTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCGACCACTGATTCTGAAAGAGAAAATGCAAGAAGAGAGGAACAGGAAAAGAAGGAAGCTG
AGGACAAGGCAAGAGAAGAAGAAGCAAAGAAAGCGGAAGAGGAGATTTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAGCGTGGCTGAAGCATCGGAAGAACCTGACGAG
ATTGAGGAATCGAGATTTCCGTACAATCGCTTCGTCAATAACCTTGCTCGGGCAAATCAATTTTGTGCAAAGCCGGAACCTGTTAATTCCAACATTGTTCGAGAATTTTA
CGCCAATATTGACGATCACGAAGAGTTTCAGGTTATCGTTCGAGGAGTGCCCGTTGACTGGAGCCCAGGAGCCATTAATGCTTTGTTCAACCTCCAGGACTTTCCACACG
CAGGCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAACTAAATGCGGCTGTCCAAGAGGTTGGCATTGAGGGGGCCCAGTGGAGACTGTCGAAGACGGAAAAGCGC
ACATTTCAGGCTGCTTATTTGAAGAGCGAGGCCAATACATGGATGGGTTTCATCAAGCTGCGCTTACTGCCGACAACTCACGACTCAACGGTGTCTCGAGACCGGGTTTT
GCTTGGCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAGGATAATTTCTTCTGAGATTCTTGATTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACA
CTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGGATGAGGATGATGTGCCATTAATAGACAAGGGGATAATTGACACACCAAATCTGGCTAGGCTTCAGAGGACG
CAGGAAGCACGCCAAGGAGAAACCTTTTGCTTGAGCATTTTCTCTGACCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGTGGGTCATCCCGTGCTTAAAAGCTTATGA
CTGTAGGGCTGCTTTAAGTCTGAAAAACAAGAATTTGAACCCCTTGAAAATGTATTTTGATATGTCTGATAATAGAGTTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAA
AAGTGGTGATTATTTGTCCATGCCGGAATAATTATTTTGCTACAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGTTGGGCGACTTGAGGGAGCAAATTCTGTG
CTGCAGCAAAACTGGGAGCAGAACTGCCACGTCACAGCTCGAACCCAACAAGAGCCAAAGGGAAAATTTGAAAAATATACCTCACTGTCCGCTCCTCTTGAACAGGTACT
GGCAGTAGTACAAAATACAGATTTGTTGAAACGAGCAGGAAGGTTGAAGTCAGATCCAGACAAGAGATATAGGAACAAGTACTGCATGTACCATGGGGACCACGACCACA
TAACCCGGGAATGCATACAATTACGAGATGAAATAGAAACCCTAATCAGAGAGGGTTATTTGAAGGAGCTTGCTGGGAATGACAGAGGCAAAAGACCATCACGGCTAGAG
TAA
Protein sequenceShow/hide protein sequence
MAKTRARKERENEEEEVPVTPEVQKVKAKKKRTPEEKEAKRRRRQQRAEEQEKATEVVTVTATVEEESLKQPEENTEENTEQRVADTEEEERTEEVREEITEEVQEKQAE
DVQMQQAEDVQVTDNEPVQEAQVEVIMPEVPKRRRVKRKAGRARVVRTDTPSPPTTDSERENARREEQEKKEAEDKAREEEAKKAEEEILLKRRAEKGKSVAEASEEPDE
IEESRFPYNRFVNNLARANQFCAKPEPVNSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVQEVGIEGAQWRLSKTEKR
TFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLGFAILRSMSIDVGRIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRT
QEARQGETFCLSIFSDLVVAAAKKILEWVIPCLKAYDCRAALSLKNKNLNPLKMYFDMSDNRVKLWQVLRIELKVVIICPCRNNYFATAELGFAECSESVVGRLEGANSV
LQQNWEQNCHVTARTQQEPKGKFEKYTSLSAPLEQVLAVVQNTDLLKRAGRLKSDPDKRYRNKYCMYHGDHDHITRECIQLRDEIETLIREGYLKELAGNDRGKRPSRLE