CuGenDBv2

Spg026510 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg026510
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: peptidyl-prolyl cis-trans isomerase G isoform X2
Genome location: scaffold8:2183659..2194021
RNA-Seq Expression: Spg026510
Synteny: Spg026510
Gene Ontology terms: GO:0016575 - histone deacetylation (biological process)
InterPro domains: IPR013951 - Histone deacetylation protein Rxt3
IPR026960 - Reverse transcriptase zinc-binding domain
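
The genome location field uses a `seqid:start..end` layout. As a minimal sketch, the following Python splits that string and computes the span of the gene locus. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical and is not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("scaffold8:2183659..2194021")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus covers 10363 bp of scaffold8.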


Homology
GenBank top hits (columns: e value, % identity, alignment)
GFZ18702.1 zinc finger CCCH domain protein [Actinidia rufa]; e value: 3.0e-45; % identity: 48.22
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRV---------TSLSNSWEFLLDSVDFRIPTS
        ++GK E ++VVY+VGECMQEL+KLWKE+ESSQ DK  E+SQ  PTLEIRIPAEHVTATNRQ +  L   +         T   +S  + + S   R+P  
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRV---------TSLSNSWEFLLDSVDFRIPTS

Query:  VTNLSLPSRLSSAVCKMTE---SPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASC-----SSFSPGFHV
          +  LPS  S  + K++     P++     +   ++    T +G           VCQT  +                +L S+SC      S +    V
Subjt:  VTNLSLPSRLSSAVCKMTE---SPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASC-----SSFSPGFHV

Query:  RGGQLWGTDVYTYDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
        RGGQLWGTDVYTYDSDLVAVLMHTGYCRPTASPPPPA+QELRATIRVLPPQDC
Subjt:  RGGQLWGTDVYTYDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC

KAG5528690.1 hypothetical protein RHGRI_029384 [Rhododendron griersonianum]; e value: 2.6e-44; % identity: 48.73
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        +QGK E S+VVY+VGECM+EL+KLWKE+E+SQ D+ SE+SQN PTLEIRIPAEHVTATNRQ   S+R    SL+        S+ F +            
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                T  P R  V    +F +          S   +        V AF G              L+  +         VRGGQLWGTDVYTYDSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
        VA+LMHTG+CRPTASPPPPAIQELRAT+RVLPPQDC
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC

KAG6604709.1 hypothetical protein SDJN03_02026, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.2e-41; % identity: 47.03
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        LQGKPEVSSVVYKVGECMQELIKLWKEHE SQIDKN ES QN+PTLEIRIPAEHVTATNRQ                                       
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                                                                                          VRGGQLWGTDVYTYDSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
        VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC

XP_022947594.1 uncharacterized protein LOC111451415 [Cucurbita moschata]; e value: 1.2e-41; % identity: 47.03
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        LQGKPEVSSVVYKVGECMQELIKLWKEHE SQIDKN ES QN+PTLEIRIPAEHVTATNRQ                                       
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                                                                                          VRGGQLWGTDVYTYDSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
        VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC

XP_023532462.1 uncharacterized protein LOC111794622 isoform X1 [Cucurbita pepo subsp. pepo]; e value: 1.2e-41; % identity: 47.03
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        LQGKPEVSSVVYKVGECMQELIKLWKEHE SQIDKN ES QN+PTLEIRIPAEHVTATNRQ                                       
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                                                                                          VRGGQLWGTDVYTYDSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
        VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3CDT5 uncharacterized protein LOC103499596 isoform X1; e value: 1.4e-40; % identity: 47.03
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        LQGKPEVSSVVYKVGECMQELIKLWKEHE SQIDKN ESSQNIPTLEIRIPAEHV ATNRQ                                       
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                                                                                          VRGGQLWGTDVYTYDSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
        VAVLMHTGYCR TASPPPPAIQELRATIRVLPPQDC
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC

A0A6J1DZF3 zinc finger CCCH domain-containing protein 13; e value: 6.4e-41; % identity: 46.61
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        LQGKPEVSSVVYKVGECMQELIKLWKE+ES+QIDK  ESSQNIPTLEIRIPAEHV+ATNRQ                                       
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                                                                                          VRGGQLWGTDVYTYDSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
        VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC

A0A6J1G6W1 uncharacterized protein LOC111451415; e value: 5.8e-42; % identity: 47.03
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        LQGKPEVSSVVYKVGECMQELIKLWKEHE SQIDKN ES QN+PTLEIRIPAEHVTATNRQ                                       
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                                                                                          VRGGQLWGTDVYTYDSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
        VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC

A0A6J1I0D2 uncharacterized protein LOC111469718; e value: 2.2e-41; % identity: 46.61
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        LQGKPEVSSVVYKVGECMQELIKLWKEHE SQIDKN ES QN+P LEIRIPAEHVTATNRQ                                       
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                                                                                          VRGGQLWGTDVYTYDSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
        VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC

A0A7J0H6I6 Zinc finger CCCH domain protein; e value: 1.5e-45; % identity: 48.22
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRV---------TSLSNSWEFLLDSVDFRIPTS
        ++GK E ++VVY+VGECMQEL+KLWKE+ESSQ DK  E+SQ  PTLEIRIPAEHVTATNRQ +  L   +         T   +S  + + S   R+P  
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRV---------TSLSNSWEFLLDSVDFRIPTS

Query:  VTNLSLPSRLSSAVCKMTE---SPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASC-----SSFSPGFHV
          +  LPS  S  + K++     P++     +   ++    T +G           VCQT  +                +L S+SC      S +    V
Subjt:  VTNLSLPSRLSSAVCKMTE---SPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASC-----SSFSPGFHV

Query:  RGGQLWGTDVYTYDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC
        RGGQLWGTDVYTYDSDLVAVLMHTGYCRPTASPPPPA+QELRATIRVLPPQDC
Subjt:  RGGQLWGTDVYTYDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDC

SwissProt top hits (columns: e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750; e value: 1.9e-05; % identity: 33.33
Query:  RDVRFWSLDPSAGFSCRSFFHFLV---NPSPARESVFSCLWKVKVPKKVLFFVWQVILGCVNTFDRLSRVKAPVVGPFCCILCQKAEEDLDHLLWD
        RD   W       FS RS +  L     P P   S F+CLWKV+VP++V  F+W V    V T +   R +  +     C +C+   E + H+L D
Subjt:  RDVRFWSLDPSAGFSCRSFFHFLV---NPSPARESVFSCLWKVKVPKKVLFFVWQVILGCVNTFDRLSRVKAPVVGPFCCILCQKAEEDLDHLLWD

Arabidopsis top hits (columns: e value, % identity, alignment)
AT5G08450.1 CONTAINS InterPro DOMAIN/s: Histone deacetylation protein Rxt3 (InterPro:IPR013951); Has 34444 Blast hits to 20801 proteins in 1175 species: Archae - 64; Bacteria - 2390; Metazoa - 15568; Fungi - 3729; Plants - 1886; Viruses - 208; Other Eukaryotes - 10599 (source: NCBI BLink); e value: 1.3e-33; % identity: 39.15
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        +QGK EVS VVYKVGECMQELIKLWKE++ S  DK+ + + N PTLE+RIPAEHVTATNRQ                                       
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                                                                                          VRGGQLWGTD+YT DSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQD
        VAVLMHTGYCRPTASPPPP +QELR TIRVLP QD
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQD

AT5G08450.2 CONTAINS InterPro DOMAIN/s: Histone deacetylation protein Rxt3 (InterPro:IPR013951); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink); e value: 1.3e-33; % identity: 39.15
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        +QGK EVS VVYKVGECMQELIKLWKE++ S  DK+ + + N PTLE+RIPAEHVTATNRQ                                       
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                                                                                          VRGGQLWGTD+YT DSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQD
        VAVLMHTGYCRPTASPPPP +QELR TIRVLP QD
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQD

AT5G08450.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Histone deacetylation protein Rxt3 (InterPro:IPR013951); Has 34444 Blast hits to 20801 proteins in 1175 species: Archae - 64; Bacteria - 2390; Metazoa - 15568; Fungi - 3729; Plants - 1886; Viruses - 208; Other Eukaryotes - 10599 (source: NCBI BLink); e value: 1.3e-33; % identity: 39.15
Query:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR
        +QGK EVS VVYKVGECMQELIKLWKE++ S  DK+ + + N PTLE+RIPAEHVTATNRQ                                       
Subjt:  LQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQAVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSR

Query:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL
                                                                                          VRGGQLWGTD+YT DSDL
Subjt:  LSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVSASCSSFSPGFHVRGGQLWGTDVYTYDSDL

Query:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQD
        VAVLMHTGYCRPTASPPPP +QELR TIRVLP QD
Subjt:  VAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQD


Sequences
CDS sequence
ATGGCCCTCTTATCTTTGCTTGAGGGGGTTAGCCTTAGACCGAACTGGAGGGATGTGCGTTTCTGGAGTCTCGACCCCTCAGCGGGCTTTTCTTGCAGATCGTTTTTTCA
TTTCTTGGTCAATCCCTCCCCGGCTAGAGAGTCTGTTTTTTCGTGTCTTTGGAAGGTAAAAGTTCCGAAGAAAGTCTTGTTCTTTGTTTGGCAGGTCATCTTGGGCTGTG
TTAACACTTTCGATAGGCTTTCGAGAGTGAAGGCTCCTGTGGTTGGCCCTTTCTGTTGCATTCTATGTCAGAAGGCAGAGGAAGATCTTGATCACTTGTTATGGGATTGG
AGACGATCGAGGAGTTCCTCTTCCATCCGCCGTTTCAGGAGAAAGGAAAGTTCTTGTGGCATGCTGACATTTGTGCTATTTTGTGTGGTTTATGGGGGGAAAGGAACAAT
AGAATTTTTAGAGGGATCGAGAGACATTCTTGTGAGATTACAAGGAAAGCCTGAAGTCTCATCTGTGGTTTATAAAGTTGGTGAATGCATGCAAGAACTAATAAAGTTGT
GGAAGGAACATGAATCGTCACAGATAGATAAAAATAGTGAAAGCTCCCAGAATATCCCCACTCTAGAAATTCGAATACCAGCTGAACATGTTACTGCTACAAATAGGCAG
GCTGTGGAAAGTCTGAGGAGGAGAGTAACCAGCCTCTCGAATAGCTGGGAGTTCCTTTTGGACTCTGTTGACTTCCGCATTCCTACTTCGGTTACTAACTTGTCGCTCCC
TTCTCGGCTATCTTCAGCTGTTTGCAAGATGACTGAGTCACCTCAGAGGATGACTGTCCCTTACTCACGTTTTTTCCAGCTACCAAGGCCAATAACATTCGTTGGTTATC
CTTCATGTCTAGCGATATCTGTTCAAGCTGTTTGCCAAACAGTAGCAGCGTTTCCTGGACCTCACTCATTTCTTTATCGTGTCCATCCAGTCGTTCCTCTACTTGTTTCT
GCGTCATGCTCCTCATTTTCTCCAGGATTTCACGTCAGGGGTGGACAGCTGTGGGGAACAGATGTGTACACATATGATTCAGATCTCGTTGCTGTTCTCATGCACACAGG
CTACTGTCGTCCAACAGCTTCTCCTCCTCCACCTGCAATCCAGGAGTTGCGGGCAACGATCAGAGTATTACCTCCACAAGACTGTAAGTCTGTTTTTCGTTATCTTGATT
CTTGTTTAGTAGATGGCTAA
mRNA sequence
ATGGCCCTCTTATCTTTGCTTGAGGGGGTTAGCCTTAGACCGAACTGGAGGGATGTGCGTTTCTGGAGTCTCGACCCCTCAGCGGGCTTTTCTTGCAGATCGTTTTTTCA
TTTCTTGGTCAATCCCTCCCCGGCTAGAGAGTCTGTTTTTTCGTGTCTTTGGAAGGTAAAAGTTCCGAAGAAAGTCTTGTTCTTTGTTTGGCAGGTCATCTTGGGCTGTG
TTAACACTTTCGATAGGCTTTCGAGAGTGAAGGCTCCTGTGGTTGGCCCTTTCTGTTGCATTCTATGTCAGAAGGCAGAGGAAGATCTTGATCACTTGTTATGGGATTGG
AGACGATCGAGGAGTTCCTCTTCCATCCGCCGTTTCAGGAGAAAGGAAAGTTCTTGTGGCATGCTGACATTTGTGCTATTTTGTGTGGTTTATGGGGGGAAAGGAACAAT
AGAATTTTTAGAGGGATCGAGAGACATTCTTGTGAGATTACAAGGAAAGCCTGAAGTCTCATCTGTGGTTTATAAAGTTGGTGAATGCATGCAAGAACTAATAAAGTTGT
GGAAGGAACATGAATCGTCACAGATAGATAAAAATAGTGAAAGCTCCCAGAATATCCCCACTCTAGAAATTCGAATACCAGCTGAACATGTTACTGCTACAAATAGGCAG
GCTGTGGAAAGTCTGAGGAGGAGAGTAACCAGCCTCTCGAATAGCTGGGAGTTCCTTTTGGACTCTGTTGACTTCCGCATTCCTACTTCGGTTACTAACTTGTCGCTCCC
TTCTCGGCTATCTTCAGCTGTTTGCAAGATGACTGAGTCACCTCAGAGGATGACTGTCCCTTACTCACGTTTTTTCCAGCTACCAAGGCCAATAACATTCGTTGGTTATC
CTTCATGTCTAGCGATATCTGTTCAAGCTGTTTGCCAAACAGTAGCAGCGTTTCCTGGACCTCACTCATTTCTTTATCGTGTCCATCCAGTCGTTCCTCTACTTGTTTCT
GCGTCATGCTCCTCATTTTCTCCAGGATTTCACGTCAGGGGTGGACAGCTGTGGGGAACAGATGTGTACACATATGATTCAGATCTCGTTGCTGTTCTCATGCACACAGG
CTACTGTCGTCCAACAGCTTCTCCTCCTCCACCTGCAATCCAGGAGTTGCGGGCAACGATCAGAGTATTACCTCCACAAGACTGTAAGTCTGTTTTTCGTTATCTTGATT
CTTGTTTAGTAGATGGCTAA
Protein sequence
MALLSLLEGVSLRPNWRDVRFWSLDPSAGFSCRSFFHFLVNPSPARESVFSCLWKVKVPKKVLFFVWQVILGCVNTFDRLSRVKAPVVGPFCCILCQKAEEDLDHLLWDW
RRSRSSSSIRRFRRKESSCGMLTFVLFCVVYGGKGTIEFLEGSRDILVRLQGKPEVSSVVYKVGECMQELIKLWKEHESSQIDKNSESSQNIPTLEIRIPAEHVTATNRQ
AVESLRRRVTSLSNSWEFLLDSVDFRIPTSVTNLSLPSRLSSAVCKMTESPQRMTVPYSRFFQLPRPITFVGYPSCLAISVQAVCQTVAAFPGPHSFLYRVHPVVPLLVS
ASCSSFSPGFHVRGGQLWGTDVYTYDSDLVAVLMHTGYCRPTASPPPPAIQELRATIRVLPPQDCKSVFRYLDSCLVDG
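
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is a minimal illustration in plain Python, not the method the database used: the `translate` helper is hypothetical, it assumes the standard nuclear codon table applies, and it only checks the first 30 nucleotides of the CDS against the first 10 residues of the protein.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the CDS and first 10 residues of the protein, copied from this record.
cds_prefix = "ATGGCCCTCTTATCTTTGCTTGAGGGGGTT"
protein_prefix = "MALLSLLEGV"
print(translate(cds_prefix) == protein_prefix)
```

To check the whole record, join the CDS lines into one string before calling `translate` and compare the result with the joined protein lines.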