; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001036 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001036
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold8:35193292..35196470
RNA-Seq ExpressionSpg001036
SyntenySpg001036
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]3.5e-2935.16Show/hide
Query:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLS
        P F+   I  HGW +FC  P +    LVREFYAN+         V+ ++V ++  AIN+++ L+      Y + A   ++EQL  V+ EV IEGA WQ+S
Subjt:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLS

Query:  KTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEIFGC-WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILF
             T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  + +++ +I   EI  C   +K G L+FP+ IT L   A VP  + + I+ 
Subjt:  KTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEIFGC-WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILF

Query:  DKGIIDMPNLARLQRMQEV
        + G I   +++R+ + + V
Subjt:  DKGIIDMPNLARLQRMQEV

KAE8661093.1 hypothetical protein F3Y22_tig00116939pilonHSYRG00213 [Hibiscus syriacus]4.6e-2932.44Show/hide
Query:  RFVNNFARAKYAELLKRDFLFERGF------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQN
        +F N+ A+A++     R   FE GF       G     +   +    W++F   P SVNA LV+EFYANI +   +   VRG ++ ++ +AIN  ++LQ+
Subjt:  RFVNNFARAKYAELLKRDFLFERGF------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQN

Query:  F--PHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEI
            HAT+ E A    + +   V+ ++  E  +W   +T + +     L+  A  W  F++ +++PT+H++TVS  R+LL  +I+ S  IDVG II  ++
Subjt:  F--PHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEI

Query:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARLQRMQEVRQGGLVH
          C  KK   L FPN IT LCR   V E+  D IL     I+   L  L  ++  +    VH
Subjt:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARLQRMQEVRQGGLVH

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.7e-3234.72Show/hide
Query:  ASEEHDEIE-EQQLLDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRG
        A + H  ++ E +  + R+ NN        +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+         VRG
Subjt:  ASEEHDEIE-EQQLLDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRG

Query:  IEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI
        ++V WS  AINA++ L + P   ++E     +   L  V+  V + GA+W +S     T   + L   A  W  F++  +LPTTH  TVS++R+LL  ++
Subjt:  IEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI

Query:  LRSLGIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARL
        L    I+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  LRSLGIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.3e-3336.55Show/hide
Query:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNL
        +F    A  +Y   +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+   E     VRG++V WS  AINA++ L
Subjt:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNL

Query:  QNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEI
         + P   ++E     + + L  V+  V   GA+W +S     T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L    I+VG++I  EI
Subjt:  QNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEI

Query:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARL
          C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]2.0e-2933.63Show/hide
Query:  RFVNNFARAKYAE-------LLKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQ
        +F +  A  +Y E        ++++F+++     E P F+   I  H W+ FC+ PE     LVREFY N+   +     +RG++V  S  AIN +++L 
Subjt:  RFVNNFARAKYAE-------LLKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQ

Query:  NFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEIF
        + P   ++E     +  +L  V+  V I GA+W +S     T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L    I+VG++I  EI 
Subjt:  NFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEIF

Query:  GCWKKKVGKLFFPNTITMLCRGAGVP
         C  +K G LFFP+ IT +CR    P
Subjt:  GCWKKKVGKLFFPNTITMLCRGAGVP

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.8e-3234.72Show/hide
Query:  ASEEHDEIE-EQQLLDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRG
        A + H  ++ E +  + R+ NN        +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+         VRG
Subjt:  ASEEHDEIE-EQQLLDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRG

Query:  IEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI
        ++V WS  AINA++ L + P   ++E     +   L  V+  V + GA+W +S     T   + L   A  W  F++  +LPTTH  TVS++R+LL  ++
Subjt:  IEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI

Query:  LRSLGIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARL
        L    I+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  LRSLGIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)1.1e-3336.55Show/hide
Query:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNL
        +F    A  +Y   +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+   E     VRG++V WS  AINA++ L
Subjt:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNL

Query:  QNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEI
         + P   ++E     + + L  V+  V   GA+W +S     T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L    I+VG++I  EI
Subjt:  QNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEI

Query:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARL
          C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARL

A0A2P5DAQ2 Uncharacterized protein9.9e-3033.63Show/hide
Query:  RFVNNFARAKYAE-------LLKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQ
        +F +  A  +Y E        ++++F+++     E P F+   I  H W+ FC+ PE     LVREFY N+   +     +RG++V  S  AIN +++L 
Subjt:  RFVNNFARAKYAE-------LLKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQ

Query:  NFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEIF
        + P   ++E     +  +L  V+  V I GA+W +S     T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L    I+VG++I  EI 
Subjt:  NFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEIF

Query:  GCWKKKVGKLFFPNTITMLCRGAGVP
         C  +K G LFFP+ IT +CR    P
Subjt:  GCWKKKVGKLFFPNTITMLCRGAGVP

A0A6A2WM54 Uncharacterized protein2.2e-2932.44Show/hide
Query:  RFVNNFARAKYAELLKRDFLFERGF------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQN
        +F N+ A+A++     R   FE GF       G     +   +    W++F   P SVNA LV+EFYANI +   +   VRG ++ ++ +AIN  ++LQ+
Subjt:  RFVNNFARAKYAELLKRDFLFERGF------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQN

Query:  F--PHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEI
            HAT+ E A    + +   V+ ++  E  +W   +T + +     L+  A  W  F++ +++PT+H++TVS  R+LL  +I+ S  IDVG II  ++
Subjt:  F--PHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEI

Query:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARLQRMQEVRQGGLVH
          C  KK   L FPN IT LCR   V E+  D IL     I+   L  L  ++  +    VH
Subjt:  FGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGIIDMPNLARLQRMQEVRQGGLVH

W9QTD9 Uncharacterized protein1.7e-2935.16Show/hide
Query:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLS
        P F+   I  HGW +FC  P +    LVREFYAN+         V+ ++V ++  AIN+++ L+      Y + A   ++EQL  V+ EV IEGA WQ+S
Subjt:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEQLSDVVREVGIEGAQWQLS

Query:  KTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEIFGC-WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILF
             T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  + +++ +I   EI  C   +K G L+FP+ IT L   A VP  + + I+ 
Subjt:  KTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEIFGC-WKKKVGKLFFPNTITMLCRGAGVPEDEGDVILF

Query:  DKGIIDMPNLARLQRMQEV
        + G I   +++R+ + + V
Subjt:  DKGIIDMPNLARLQRMQEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACAAGAGCGCGAAAAGAAAGGGAGAATGAGGAAGAAGAGGTACCTGTTACCCCTGAAGTGCAGAAAGTTAAAGCAAAGAAGAAAAGAACCCCGGAGGAGAA
ACAAGCCAAAAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGTTGCTGCCACAGTTGAAGAAGGAGACCTGCAAGAACCTGATGTACAGA
ACCCAGAGGAGGCTGAGCAGAGAGTCGCGGATACGAAAGAAGGGGGCGAACAGAAGAAGTTCAAGAGGAGCGAACCGAGGAAGTTCAAGAAGAAATTACAGAGGAAGTTC
AAGAACAGCAGGCCAAGGATGTTCAAATGCAACAGGCAGAAGAGGTTCAGGAAAGCAGGCCGCGCTAGGGTTGTCCGAACTGATACTCCTTCGCCTCTGACCACGGATTC
TGAAAGAGAGAATGAAGAGAGAGTAGAGCGTGAGAAGAAGGAAGCCGAGGGAAGAGCAAGAGAAGAGGCAGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGG
CGGAAAAGGGCAAAAATGTTGTTGGAGCATCAGAAGAGCACGATGAAATAGAAGAGCAACAGTTACTGGATGATCGCTTCGTCAACAATTTTGCCAGAGCAAAATACGCT
GAGCTTTTGAAAAGGGACTTCCTGTTTGAGAGGGGATTTAGTGGTGAGCTTCCGCATTTTCTGAGGACTGGTATTGCGAATCATGGATGGGAACGATTCTGTTCAAAACC
CGAATCTGTAAACGCACAGTTAGTGCGCGAATTCTATGCAAATATCGACCGAGAAGAGGGTTTCCTAGCAGTTGTTCGAGGTATTGAGGTCGACTGGAGTCCTAGTGCTA
TCAACGCACTGTATAACCTTCAGAACTTCCCCCATGCGACATATAATGAGATGGCTGTTGCGCCATCTAATGAGCAGTTAAGTGATGTTGTGCGGGAGGTAGGTATTGAA
GGGGCACAATGGCAGCTATCCAAGACTCAGAAGAGGACATTCCAATCGGCTTATTTGAAAAGGGAAGCAAATACGTGGATGGGATTTATCAGACAAAGGATGCTTCCAAC
GACTCATGACTCGACAGTTTCTAGGGAACGAGTGCTTTTGGCTTTCGCTATTTTGAGGTCTCTCGGTATTGATGTGGGAAAAATTATTGCTGATGAAATATTTGGTTGTT
GGAAGAAGAAAGTGGGGAAACTGTTCTTTCCGAACACAATCACAATGCTTTGTAGAGGAGCAGGGGTTCCGGAAGATGAAGGGGATGTGATTCTGTTTGACAAGGGAATC
ATTGACATGCCTAACTTGGCACGGCTCCAGCGTATGCAGGAGGTACGTCAGGGTGGGCTGGTCCACGGCATCAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAG
GCAGGAGTTTGCTGAACGCTTGGACTGGTTAAGCTTAATTAGATCACGAGTATTTAGCCTAATTGGTGATGAGTTTGAGGCATGGGTATACTGCACCATAAAGTGGGACA
TCCCGTGCTTAAGAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAAAACAAAAATTTAAACCCCTTGAAAATGTGTTTTGATATGTCTGATAATAGAGCTAGGCTGTGG
CAAGTTCTTCTAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCG
ACTTAAGGGAGCGGATTTTATGCTGGAGCAAACCCGGAGGCAAAACTGCCACGTCACATCTCGTTTGCCAATTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACAAGAGCGCGAAAAGAAAGGGAGAATGAGGAAGAAGAGGTACCTGTTACCCCTGAAGTGCAGAAAGTTAAAGCAAAGAAGAAAAGAACCCCGGAGGAGAA
ACAAGCCAAAAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGTTGCTGCCACAGTTGAAGAAGGAGACCTGCAAGAACCTGATGTACAGA
ACCCAGAGGAGGCTGAGCAGAGAGTCGCGGATACGAAAGAAGGGGGCGAACAGAAGAAGTTCAAGAGGAGCGAACCGAGGAAGTTCAAGAAGAAATTACAGAGGAAGTTC
AAGAACAGCAGGCCAAGGATGTTCAAATGCAACAGGCAGAAGAGGTTCAGGAAAGCAGGCCGCGCTAGGGTTGTCCGAACTGATACTCCTTCGCCTCTGACCACGGATTC
TGAAAGAGAGAATGAAGAGAGAGTAGAGCGTGAGAAGAAGGAAGCCGAGGGAAGAGCAAGAGAAGAGGCAGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGG
CGGAAAAGGGCAAAAATGTTGTTGGAGCATCAGAAGAGCACGATGAAATAGAAGAGCAACAGTTACTGGATGATCGCTTCGTCAACAATTTTGCCAGAGCAAAATACGCT
GAGCTTTTGAAAAGGGACTTCCTGTTTGAGAGGGGATTTAGTGGTGAGCTTCCGCATTTTCTGAGGACTGGTATTGCGAATCATGGATGGGAACGATTCTGTTCAAAACC
CGAATCTGTAAACGCACAGTTAGTGCGCGAATTCTATGCAAATATCGACCGAGAAGAGGGTTTCCTAGCAGTTGTTCGAGGTATTGAGGTCGACTGGAGTCCTAGTGCTA
TCAACGCACTGTATAACCTTCAGAACTTCCCCCATGCGACATATAATGAGATGGCTGTTGCGCCATCTAATGAGCAGTTAAGTGATGTTGTGCGGGAGGTAGGTATTGAA
GGGGCACAATGGCAGCTATCCAAGACTCAGAAGAGGACATTCCAATCGGCTTATTTGAAAAGGGAAGCAAATACGTGGATGGGATTTATCAGACAAAGGATGCTTCCAAC
GACTCATGACTCGACAGTTTCTAGGGAACGAGTGCTTTTGGCTTTCGCTATTTTGAGGTCTCTCGGTATTGATGTGGGAAAAATTATTGCTGATGAAATATTTGGTTGTT
GGAAGAAGAAAGTGGGGAAACTGTTCTTTCCGAACACAATCACAATGCTTTGTAGAGGAGCAGGGGTTCCGGAAGATGAAGGGGATGTGATTCTGTTTGACAAGGGAATC
ATTGACATGCCTAACTTGGCACGGCTCCAGCGTATGCAGGAGGTACGTCAGGGTGGGCTGGTCCACGGCATCAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAG
GCAGGAGTTTGCTGAACGCTTGGACTGGTTAAGCTTAATTAGATCACGAGTATTTAGCCTAATTGGTGATGAGTTTGAGGCATGGGTATACTGCACCATAAAGTGGGACA
TCCCGTGCTTAAGAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAAAACAAAAATTTAAACCCCTTGAAAATGTGTTTTGATATGTCTGATAATAGAGCTAGGCTGTGG
CAAGTTCTTCTAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCG
ACTTAAGGGAGCGGATTTTATGCTGGAGCAAACCCGGAGGCAAAACTGCCACGTCACATCTCGTTTGCCAATTTCATGA
Protein sequenceShow/hide protein sequence
MAKTRARKERENEEEEVPVTPEVQKVKAKKKRTPEEKQAKRRRRQQRAEEQEKATEVVAATVEEGDLQEPDVQNPEEAEQRVADTKEGGEQKKFKRSEPRKFKKKLQRKF
KNSRPRMFKCNRQKRFRKAGRARVVRTDTPSPLTTDSERENEERVEREKKEAEGRAREEAEKKAEEERLLKRRAEKGKNVVGASEEHDEIEEQQLLDDRFVNNFARAKYA
ELLKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDREEGFLAVVRGIEVDWSPSAINALYNLQNFPHATYNEMAVAPSNEQLSDVVREVGIE
GAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLGIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGAGVPEDEGDVILFDKGI
IDMPNLARLQRMQEVRQGGLVHGINTILEQLALSASRQEFAERLDWLSLIRSRVFSLIGDEFEAWVYCTIKWDIPCLRAYDCRAALSLKNKNLNPLKMCFDMSDNRARLW
QVLLIELKVVIICPCRKNYFAAAELGFAECSESVAGRLKGADFMLEQTRRQNCHVTSRLPIS