; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031214 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031214
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold8:31009567..31017667
RNA-Seq ExpressionSpg031214
SyntenySpg031214
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]4.0e-2828.44Show/hide
Query:  FVNNFPRAKYVELLKRDFLFERGF------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNF
        FV+   +  Y  +  R   FE GF      +  L   +   +  H W++F   P  VNA +V+EFY+NI +      +VRGI + ++P+AIN  + LQ  
Subjt:  FVNNFPRAKYVELLKRDFLFERGF------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNF

Query:  PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGC
                A    +E     + ++ + G +W   + +++T     L      W  F++ +++PT+H++TVS +R+LL  +IL    ID+GKII +    C
Subjt:  PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGC

Query:  WKKKVGKLFFPNTITMLCRGVGVPEDEGDEVRQG--GL-------------------------------VHGINTILEQLALSTSRQEFAE--RQALTFW
         K++   L FPN IT LCR   V E+  DE+  G  GL                               V   +T LEQ A+  + Q   +   + + ++
Subjt:  WKKKVGKLFFPNTITMLCRGVGVPEDEGDEVRQG--GL-------------------------------VHGINTILEQLALSTSRQEFAE--RQALTFW

Query:  SYVKNRDANLKKALQENFSK
        +Y K RDA L  AL E+  +
Subjt:  SYVKNRDANLKKALQENFSK

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.7e-2934.57Show/hide
Query:  ASEEHDEIE-EQQLLDDRFVNNFPRAKYVELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRG
        A + H  ++ E +  + R+ NN        +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+         VRG
Subjt:  ASEEHDEIE-EQQLLDDRFVNNFPRAKYVELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRG

Query:  IEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI
        ++V WS  AINA++ L + P   ++E     +   L   +  V + GA+W +S     T   + L   A  W  F++  +LPTTH  TVS++R+LL  ++
Subjt:  IEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI

Query:  LRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVP
        L   +I+VG++I  EI  C  +K G LFFP+ IT LCR    P
Subjt:  LRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVP

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]8.5e-3932.69Show/hide
Query:  GELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQW
        G+LP F+   I  H W++FC+ PE     LVREFYAN+   E     VRG++V WS  AINA++ L + P   ++E     + + L   +  V   GA+W
Subjt:  GELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQW

Query:  QLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVP-------
         +S     T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   +I+VG++I  EI  C  +K G LFFP+ IT LCR    P       
Subjt:  QLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVP-------

Query:  -----------------EDEGDEVRQ---------------GGLVHGINTILEQLALSTSRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSK
                         E   +  +Q               G ++  +  + ++L+    +Q       +   +Q   FW+Y K RD  LKKALQ NF++
Subjt:  -----------------EDEGDEVRQ---------------GGLVHGINTILEQLALSTSRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSK

Query:  PYPALPAFPEDL
        P P  PAFP+++
Subjt:  PYPALPAFPEDL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.8e-2834.63Show/hide
Query:  LKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSD
        ++++F+++     E P F+   I  H W+ FC+ PE     LVREFY N+   +     +RG++V  S  AIN +++L + P   ++E     +  +L  
Subjt:  LKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSD

Query:  AVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCR
         +  V I GA+W +S     T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L   +I+VG++I  EI  C  +K G LFFP+ IT +CR
Subjt:  AVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCR

Query:  GVGVP
            P
Subjt:  GVGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]7.0e-3334.32Show/hide
Query:  LVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQR
        LVREFYAN+   E     VRG++V WS  AINA++ L + P   ++E     +  +L   +  V   GA+W +S     T   + L   A  W  F++ R
Subjt:  LVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVPEDEGDEVRQGGLVHGI--------------------
        +LPTTH   VS++R+LL  ++L   +I+VG++I  EI  C  +K G LFFP+ IT LCR      +E +++   G +  I                    
Subjt:  MLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVPEDEGDEVRQGGLVHGI--------------------

Query:  -----------NTILEQLAL---STSRQEFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL
                     +L+QL       S+QE   +Q   FW+Y K RD  LKKALQ NF++P P  PAFP+++
Subjt:  -----------NTILEQLAL---STSRQEFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.3e-2934.57Show/hide
Query:  ASEEHDEIE-EQQLLDDRFVNNFPRAKYVELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRG
        A + H  ++ E +  + R+ NN        +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+         VRG
Subjt:  ASEEHDEIE-EQQLLDDRFVNNFPRAKYVELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRG

Query:  IEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI
        ++V WS  AINA++ L + P   ++E     +   L   +  V + GA+W +S     T   + L   A  W  F++  +LPTTH  TVS++R+LL  ++
Subjt:  IEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI

Query:  LRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVP
        L   +I+VG++I  EI  C  +K G LFFP+ IT LCR    P
Subjt:  LRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVP

A0A2P5BCG4 Uncharacterized protein (Fragment)4.1e-3932.69Show/hide
Query:  GELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQW
        G+LP F+   I  H W++FC+ PE     LVREFYAN+   E     VRG++V WS  AINA++ L + P   ++E     + + L   +  V   GA+W
Subjt:  GELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQW

Query:  QLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVP-------
         +S     T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   +I+VG++I  EI  C  +K G LFFP+ IT LCR    P       
Subjt:  QLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVP-------

Query:  -----------------EDEGDEVRQ---------------GGLVHGINTILEQLALSTSRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSK
                         E   +  +Q               G ++  +  + ++L+    +Q       +   +Q   FW+Y K RD  LKKALQ NF++
Subjt:  -----------------EDEGDEVRQ---------------GGLVHGINTILEQLALSTSRQ-------EFAERQALTFWSYVKNRDANLKKALQENFSK

Query:  PYPALPAFPEDL
        P P  PAFP+++
Subjt:  PYPALPAFPEDL

A0A2P5DAQ2 Uncharacterized protein8.6e-2934.63Show/hide
Query:  LKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSD
        ++++F+++     E P F+   I  H W+ FC+ PE     LVREFY N+   +     +RG++V  S  AIN +++L + P   ++E     +  +L  
Subjt:  LKRDFLFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSD

Query:  AVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCR
         +  V I GA+W +S     T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L   +I+VG++I  EI  C  +K G LFFP+ IT +CR
Subjt:  AVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCR

Query:  GVGVP
            P
Subjt:  GVGVP

A0A2P5DXM3 Uncharacterized protein3.4e-3334.32Show/hide
Query:  LVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQR
        LVREFYAN+   E     VRG++V WS  AINA++ L + P   ++E     +  +L   +  V   GA+W +S     T   + L   A  W  F++ R
Subjt:  LVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVPEDEGDEVRQGGLVHGI--------------------
        +LPTTH   VS++R+LL  ++L   +I+VG++I  EI  C  +K G LFFP+ IT LCR      +E +++   G +  I                    
Subjt:  MLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVPEDEGDEVRQGGLVHGI--------------------

Query:  -----------NTILEQLAL---STSRQEFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL
                     +L+QL       S+QE   +Q   FW+Y K RD  LKKALQ NF++P P  PAFP+++
Subjt:  -----------NTILEQLAL---STSRQEFAERQALTFWSYVKNRDANLKKALQENFSKPYPALPAFPEDL

A0A6A2ZUE4 Uncharacterized protein1.9e-2828.44Show/hide
Query:  FVNNFPRAKYVELLKRDFLFERGF------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNF
        FV+   +  Y  +  R   FE GF      +  L   +   +  H W++F   P  VNA +V+EFY+NI +      +VRGI + ++P+AIN  + LQ  
Subjt:  FVNNFPRAKYVELLKRDFLFERGF------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNF

Query:  PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGC
                A    +E     + ++ + G +W   + +++T     L      W  F++ +++PT+H++TVS +R+LL  +IL    ID+GKII +    C
Subjt:  PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGC

Query:  WKKKVGKLFFPNTITMLCRGVGVPEDEGDEVRQG--GL-------------------------------VHGINTILEQLALSTSRQEFAE--RQALTFW
         K++   L FPN IT LCR   V E+  DE+  G  GL                               V   +T LEQ A+  + Q   +   + + ++
Subjt:  WKKKVGKLFFPNTITMLCRGVGVPEDEGDEVRQG--GL-------------------------------VHGINTILEQLALSTSRQEFAE--RQALTFW

Query:  SYVKNRDANLKKALQENFSK
        +Y K RDA L  AL E+  +
Subjt:  SYVKNRDANLKKALQENFSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCGCCTGGGGAGAGGTTCGAGCTTGATCCAGAAATCGAGAGGACATTCAGGATAAGAAGAAGAGAGCAGCGTAGACAGCAGAATCAAATGGCTGACATACC
GCATCTACCGCAGGGTCCTGAAGGTCTAGCGAACCCCCAGCAGAATCGTGTGCTGCAGCAAAACCCGCCGCTGGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATC
CTATCTTGAATGTGACAGTGATTAGTCATCAGCAGCCGCCAGCTGTGGAGCCTGCTGCAGTGGTATTGCCCCAGCAAAATAAGCAGGCTTTGCCCGAGCAAAATTCGGGG
AATTCTCTTGAGGCAATGATGAAAGAATTTATGGCTCGTACAGACGCTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAA
TGAGCTGAAAGCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGAGTGGTAAACCCCTAG
AAGAAAGAAAAGAGCCTAGTAAAACCCAGGATATAGATAATAATTGTGATAGGAATGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAATAAA
GATGCTGGAGCATCTGGATTTTATTTTGCTCAAGCCTTATTTATTGGCAACCCACGATACGTTTCTCTTACGTCATCCTTCTTGCTTCAATCTTTTGCTTTCTTTTTCGT
TTTCATTCTCTGTAAATCCCTTGAGACTTCCATGGCGAAAACAAGAGCGCGAAAAGAAAGAGAGAATGAGGAAGAAGAGGTACCTGTTACCCCTGAAGTGCAGAAAGTTA
AAGCAAAGAAGAAAAAAACCCCGGAGGAGAAAGAAGCCAAAAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAGCAGAGGTTGTTGCTGCCACAGTTGAA
GAAGGAGACCCACAAGAACCTGATGTACAGAACCAAGAGGAGGCTGAGCAGAGAGTCGCGGATACGGAAGAAAGGGGGCAAACAGAAGAAGTTCAAGAGGAGCGAACCGA
GGAAGTTCAAGAAGAAGTTATAGAGGAAGTTCAAGAACAGCAGGCCGAGGATGTTCAAATGCAACAGGCAGAAGAGGTTCAGGTACCGGATAATGAGCCAGTGCAAGACG
CTCAAGTAGAGGTGATCATGCCGGAGGTACCAAAGCGTCGCCGCGTTAAGAGGAAACCAGGCCGCGCTAGGGTTGTCCGAACTGATACTCCTTCGCCTCTGACCACGGAT
TCTGAAAGAGAGAATGCAGAGAGAGTAGAGCGTGAGAAGAAGGAAGCCGAGGAAAGAGCAAGAGAAGAGCGTGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAG
GGCGGAAAAGGGCAAAAATGTTGCTGAAGCATCAGAAGAGCACGATGAAATAGAAGAGCAACAGTTACTGGATGATCGCTTCGTCAACAATTTTCCCAGAGCAAAATACG
TTGAGCTTCTGAAAAGGGACTTCCTGTTTGAGAGGGGATTTAGTGGTGAGCTCCCGCATTTTCTGAGGACTGGTATTGCGAATCATGGATGGGAACGATTCTGTTCAAAA
CCCGAATCTGTAAACGCGCAGTTAGTGCGCGAATTCTATGCAAATATCGACCAAGAAGAAGGTTTCCTAGCAGTTGTTCGAGGTATTGAGGTCGACTGGAGTCCTAGTGC
TATCAACGCATTGTATAACCTTCAGAACTTCCCCCATGCGGCATATAATGAGATGGCTGTCGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTAGGTATTG
AAGGGGCACAATGGCAGCTATCCAAGACTCAGAAGAGGACATTCCAATCGGCTTATTTGAAAAGGGAAGCAAATACGTGGATGGGATTTATCAGACAAAGGATGCTTCCA
ACGACTCATGACTCGACAGTTTCTAGGGAACGAGTGCTTTTGGCTTTCGCTATTTTGAGGTCTCTCAATATTGATGTGGGAAAAATTATTGCTGATGAAATATTTGGTTG
TTGGAAGAAGAAAGTGGGGAAACTATTCTTTCCGAACACAATCACAATGCTTTGTAGAGGAGTAGGGGTTCCGGAAGATGAAGGGGATGAGGTACGTCAGGGTGGGCTGG
TCCACGGCATCAACACGATTTTAGAACAACTAGCACTTTCGACCAGCAGGCAGGAGTTTGCTGAACGGCAAGCTTTGACTTTCTGGAGCTATGTTAAAAATCGTGATGCC
AATCTGAAGAAGGCGCTGCAAGAGAATTTTTCGAAACCATATCCAGCCCTTCCAGCATTCCCTGAAGATTTATTCAACCCCTGGATTCCGCCCCCACCAATGGAAGAAGG
AGAAGAGGAAGATGAAAATGAACCGGGCCAAGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCGCCTGGGGAGAGGTTCGAGCTTGATCCAGAAATCGAGAGGACATTCAGGATAAGAAGAAGAGAGCAGCGTAGACAGCAGAATCAAATGGCTGACATACC
GCATCTACCGCAGGGTCCTGAAGGTCTAGCGAACCCCCAGCAGAATCGTGTGCTGCAGCAAAACCCGCCGCTGGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATC
CTATCTTGAATGTGACAGTGATTAGTCATCAGCAGCCGCCAGCTGTGGAGCCTGCTGCAGTGGTATTGCCCCAGCAAAATAAGCAGGCTTTGCCCGAGCAAAATTCGGGG
AATTCTCTTGAGGCAATGATGAAAGAATTTATGGCTCGTACAGACGCTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAA
TGAGCTGAAAGCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGAGTGGTAAACCCCTAG
AAGAAAGAAAAGAGCCTAGTAAAACCCAGGATATAGATAATAATTGTGATAGGAATGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAATAAA
GATGCTGGAGCATCTGGATTTTATTTTGCTCAAGCCTTATTTATTGGCAACCCACGATACGTTTCTCTTACGTCATCCTTCTTGCTTCAATCTTTTGCTTTCTTTTTCGT
TTTCATTCTCTGTAAATCCCTTGAGACTTCCATGGCGAAAACAAGAGCGCGAAAAGAAAGAGAGAATGAGGAAGAAGAGGTACCTGTTACCCCTGAAGTGCAGAAAGTTA
AAGCAAAGAAGAAAAAAACCCCGGAGGAGAAAGAAGCCAAAAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAGCAGAGGTTGTTGCTGCCACAGTTGAA
GAAGGAGACCCACAAGAACCTGATGTACAGAACCAAGAGGAGGCTGAGCAGAGAGTCGCGGATACGGAAGAAAGGGGGCAAACAGAAGAAGTTCAAGAGGAGCGAACCGA
GGAAGTTCAAGAAGAAGTTATAGAGGAAGTTCAAGAACAGCAGGCCGAGGATGTTCAAATGCAACAGGCAGAAGAGGTTCAGGTACCGGATAATGAGCCAGTGCAAGACG
CTCAAGTAGAGGTGATCATGCCGGAGGTACCAAAGCGTCGCCGCGTTAAGAGGAAACCAGGCCGCGCTAGGGTTGTCCGAACTGATACTCCTTCGCCTCTGACCACGGAT
TCTGAAAGAGAGAATGCAGAGAGAGTAGAGCGTGAGAAGAAGGAAGCCGAGGAAAGAGCAAGAGAAGAGCGTGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAG
GGCGGAAAAGGGCAAAAATGTTGCTGAAGCATCAGAAGAGCACGATGAAATAGAAGAGCAACAGTTACTGGATGATCGCTTCGTCAACAATTTTCCCAGAGCAAAATACG
TTGAGCTTCTGAAAAGGGACTTCCTGTTTGAGAGGGGATTTAGTGGTGAGCTCCCGCATTTTCTGAGGACTGGTATTGCGAATCATGGATGGGAACGATTCTGTTCAAAA
CCCGAATCTGTAAACGCGCAGTTAGTGCGCGAATTCTATGCAAATATCGACCAAGAAGAAGGTTTCCTAGCAGTTGTTCGAGGTATTGAGGTCGACTGGAGTCCTAGTGC
TATCAACGCATTGTATAACCTTCAGAACTTCCCCCATGCGGCATATAATGAGATGGCTGTCGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTAGGTATTG
AAGGGGCACAATGGCAGCTATCCAAGACTCAGAAGAGGACATTCCAATCGGCTTATTTGAAAAGGGAAGCAAATACGTGGATGGGATTTATCAGACAAAGGATGCTTCCA
ACGACTCATGACTCGACAGTTTCTAGGGAACGAGTGCTTTTGGCTTTCGCTATTTTGAGGTCTCTCAATATTGATGTGGGAAAAATTATTGCTGATGAAATATTTGGTTG
TTGGAAGAAGAAAGTGGGGAAACTATTCTTTCCGAACACAATCACAATGCTTTGTAGAGGAGTAGGGGTTCCGGAAGATGAAGGGGATGAGGTACGTCAGGGTGGGCTGG
TCCACGGCATCAACACGATTTTAGAACAACTAGCACTTTCGACCAGCAGGCAGGAGTTTGCTGAACGGCAAGCTTTGACTTTCTGGAGCTATGTTAAAAATCGTGATGCC
AATCTGAAGAAGGCGCTGCAAGAGAATTTTTCGAAACCATATCCAGCCCTTCCAGCATTCCCTGAAGATTTATTCAACCCCTGGATTCCGCCCCCACCAATGGAAGAAGG
AGAAGAGGAAGATGAAAATGAACCGGGCCAAGAGGACTGA
Protein sequenceShow/hide protein sequence
MSDPPGERFELDPEIERTFRIRRREQRRQQNQMADIPHLPQGPEGLANPQQNRVLQQNPPLEQNEQQNNQAENPILNVTVISHQQPPAVEPAAVVLPQQNKQALPEQNSG
NSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERKEPSKTQDIDNNCDRNVVVEKELESGQGAGGSNK
DAGASGFYFAQALFIGNPRYVSLTSSFLLQSFAFFFVFILCKSLETSMAKTRARKERENEEEEVPVTPEVQKVKAKKKKTPEEKEAKRRRRQQRAEEQEKAAEVVAATVE
EGDPQEPDVQNQEEAEQRVADTEERGQTEEVQEERTEEVQEEVIEEVQEQQAEDVQMQQAEEVQVPDNEPVQDAQVEVIMPEVPKRRRVKRKPGRARVVRTDTPSPLTTD
SERENAERVEREKKEAEERAREEREKKAEEERLLKRRAEKGKNVAEASEEHDEIEEQQLLDDRFVNNFPRAKYVELLKRDFLFERGFSGELPHFLRTGIANHGWERFCSK
PESVNAQLVREFYANIDQEEGFLAVVRGIEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQRMLP
TTHDSTVSRERVLLAFAILRSLNIDVGKIIADEIFGCWKKKVGKLFFPNTITMLCRGVGVPEDEGDEVRQGGLVHGINTILEQLALSTSRQEFAERQALTFWSYVKNRDA
NLKKALQENFSKPYPALPAFPEDLFNPWIPPPPMEEGEEEDENEPGQED