CuGenDBv2

Spg025402 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg025402
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold13:41984145..41989069
RNA-Seq Expression: Spg025402
Synteny: Spg025402
Gene Ontology terms:
GO:2000767 - positive regulation of cytoplasmic translation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0003727 - single-stranded RNA binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0045182 - translation regulator activity (molecular function)
InterPro domains: NA
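The genome location field follows a `scaffold:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the Python below uses a hypothetical helper, `parse_location`, which is not part of CuGenDB. The span length it prints assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re


def parse_location(loc):
    """Split a 'scaffold:start..end' string into (scaffold, start, end).

    Hypothetical helper for illustration; not a CuGenDB function.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


scaffold, start, end = parse_location("scaffold13:41984145..41989069")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(scaffold, start, end, span)
```

Under a different coordinate convention (for example 0-based, half-open), the span would be `end - start` instead.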


Homology
GenBank top hits | e value | % identity
KAG6585819.1 DNA-binding protein HEXBP, partial [Cucurbita argyrosperma subsp. sororia] | 2.8e-124 | 91.63
Query:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL
        MSS LCSSIRALAPQ W + NLRF+ILLQSKRC+ FAPRF+ACLSSNDDSVAIPKP PLAFDP EE+YGLGVDLKPRNS SS+PEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC GKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIK+ +
Subjt:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE

XP_022951579.1 uncharacterized protein LOC111454354 isoform X1 [Cucurbita moschata] | 2.8e-124 | 91.63
Query:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL
        MSS LCSSIRALAPQ W + NLRF+ILLQSKRC+ FAPRF+ACLSSNDDSVAIPKP PLAFDP EE+YGLGVDLKPRNS SS+PEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC GKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIK+ +
Subjt:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE

XP_022951581.1 uncharacterized protein LOC111454354 isoform X2 [Cucurbita moschata] | 2.8e-124 | 91.63
Query:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL
        MSS LCSSIRALAPQ W + NLRF+ILLQSKRC+ FAPRF+ACLSSNDDSVAIPKP PLAFDP EE+YGLGVDLKPRNS SS+PEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC GKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIK+ +
Subjt:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE

XP_023002093.1 uncharacterized protein LOC111496065 isoform X1 [Cucurbita maxima] | 2.8e-124 | 91.63
Query:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL
        MSS LCSSIRALAPQ W + NLRF+ILLQSKRC+ FAPRF+ACLSSNDDSVAIPKP PLAFDP EE+YGLGVDLKPRNS SS+PEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC GKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIK+ +
Subjt:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE

XP_023002096.1 uncharacterized protein LOC111496065 isoform X2 [Cucurbita maxima] | 2.8e-124 | 91.63
Query:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL
        MSS LCSSIRALAPQ W + NLRF+ILLQSKRC+ FAPRF+ACLSSNDDSVAIPKP PLAFDP EE+YGLGVDLKPRNS SS+PEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC GKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIK+ +
Subjt:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
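The middle line of each alignment block marks identical residues with the letter itself, conservative substitutions with `+`, and other mismatches with a space. As a rough sketch, the Python below counts identities from that midline for the final 39-column block of the GenBank alignments above. The function name `midline_identity` is made up for illustration, and the approach assumes a gap-free block whose midline has not lost trailing spaces.

```python
def midline_identity(midline):
    """Count identical columns in a BLAST-style midline.

    Letters mark identities; '+' and spaces mark non-identical columns.
    Returns (identities, columns). Illustrative helper, not a BLAST API.
    """
    identities = sum(ch.isalpha() for ch in midline)
    return identities, len(midline)


# Final block of the GenBank alignments listed above.
query = "SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE"
midline = "SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIK+ +"

identities, columns = midline_identity(midline)
assert columns == len(query)  # the midline must span the whole block
print(f"{identities}/{columns} identical = {100 * identities / columns:.2f}%")
```

The reported % identity covers the whole alignment rather than one block; 91.63 would correspond to 219 identical columns out of 239.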

TrEMBL top hits | e value | % identity
A0A1S3CPF7 uncharacterized protein LOC103503314 isoform X2 | 1.4e-121 | 89.96
Query:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL
        MSSSLC SI AL P YW  KN  F++LLQS RCICFAPRF+ACL  NDDSVAIPKPAPLAFDPAEELYGL VDLKPRNSASS+PEPRSWFGPNGQYIKEL
Subjt:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC GKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        SRSLKSLNAKTG+FSKRM+IIHRDP LHAQRVAAIK+ +
Subjt:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE

A0A6J1GHY9 uncharacterized protein LOC111454354 isoform X2 | 1.4e-124 | 91.63
Query:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL
        MSS LCSSIRALAPQ W + NLRF+ILLQSKRC+ FAPRF+ACLSSNDDSVAIPKP PLAFDP EE+YGLGVDLKPRNS SS+PEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC GKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIK+ +
Subjt:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE

A0A6J1GJ81 uncharacterized protein LOC111454354 isoform X1 | 1.4e-124 | 91.63
Query:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL
        MSS LCSSIRALAPQ W + NLRF+ILLQSKRC+ FAPRF+ACLSSNDDSVAIPKP PLAFDP EE+YGLGVDLKPRNS SS+PEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC GKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIK+ +
Subjt:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE

A0A6J1KKD0 uncharacterized protein LOC111496065 isoform X2 | 1.4e-124 | 91.63
Query:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL
        MSS LCSSIRALAPQ W + NLRF+ILLQSKRC+ FAPRF+ACLSSNDDSVAIPKP PLAFDP EE+YGLGVDLKPRNS SS+PEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC GKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIK+ +
Subjt:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE

A0A6J1KMZ5 uncharacterized protein LOC111496065 isoform X1 | 1.4e-124 | 91.63
Query:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL
        MSS LCSSIRALAPQ W + NLRF+ILLQSKRC+ FAPRF+ACLSSNDDSVAIPKP PLAFDP EE+YGLGVDLKPRNS SS+PEPRSWFGPNGQYI+EL
Subjt:  MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVC GKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKI

Query:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIK+ +
Subjt:  SRSLKSLNAKTGLFSKRMKIIHRDPTLHAQRVAAIKRVE

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT3G44020.1 thylakoid lumenal P17.1 protein | 3.1e-04 | 38.3
Query:  LPCPSCRGRGYAPCTEC-----GIERSRADCSVCKGKGIVTCHQCLG
        +PC  C G G   C  C      +E    DC VCKG G++ C +C G
Subjt:  LPCPSCRGRGYAPCTEC-----GIERSRADCSVCKGKGIVTCHQCLG

AT5G20220.1 zinc knuckle (CCHC-type) family protein | 8.2e-82 | 66.38
Query:  QKNLRFYILLQSKRCICFAPRFLACLSS------NDDSVAIPKP--APLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKELPCPSCRGRGY
        +K  RF  LL       F PR ++  SS      ND SV+  +     + +DP+EEL+  GVD KPR  +  S EPRSWFGPNGQYI+ELPCP+CRGRGY
Subjt:  QKNLRFYILLQSKRCICFAPRFLACLSS------NDDSVAIPKP--APLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKELPCPSCRGRGY

Query:  APCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAK
          C+ CGIERSR DC  CKGKGI+TC +CLGD VIWEESIDE+PWEKARS+SP R+KEDDEVDNLEIK  +++KSKR+YQSP PEVG KISRSLKSLNAK
Subjt:  APCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAK

Query:  TGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        TGLFSKRMKIIHRDP LHAQRVAAIK+ +
Subjt:  TGLFSKRMKIIHRDPTLHAQRVAAIKRVE

AT5G20220.2 zinc knuckle (CCHC-type) family protein | 8.2e-82 | 66.38
Query:  QKNLRFYILLQSKRCICFAPRFLACLSS------NDDSVAIPKP--APLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKELPCPSCRGRGY
        +K  RF  LL       F PR ++  SS      ND SV+  +     + +DP+EEL+  GVD KPR  +  S EPRSWFGPNGQYI+ELPCP+CRGRGY
Subjt:  QKNLRFYILLQSKRCICFAPRFLACLSS------NDDSVAIPKP--APLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKELPCPSCRGRGY

Query:  APCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAK
          C+ CGIERSR DC  CKGKGI+TC +CLGD VIWEESIDE+PWEKARS+SP R+KEDDEVDNLEIK  +++KSKR+YQSP PEVG KISRSLKSLNAK
Subjt:  APCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAK

Query:  TGLFSKRMKIIHRDPTLHAQRVAAIKRVE
        TGLFSKRMKIIHRDP LHAQRVAAIK+ +
Subjt:  TGLFSKRMKIIHRDPTLHAQRVAAIKRVE


Sequences
CDS sequence
ATGTCCTCGTCACTCTGTTCATCAATTCGGGCATTGGCGCCGCAGTACTGGCCACAGAAGAACTTGAGATTCTACATATTGCTTCAATCCAAGAGATGTATATGCTTCGC
ACCTCGCTTCCTTGCCTGTTTGAGCAGCAACGACGACTCTGTTGCAATCCCCAAACCCGCTCCGCTGGCTTTCGATCCTGCGGAGGAGTTGTACGGACTTGGCGTCGATT
TAAAGCCAAGGAATTCAGCTTCTAGCTCACCTGAACCCAGGTCCTGGTTTGGCCCAAATGGTCAGTATATTAAAGAGCTCCCATGTCCAAGTTGCCGTGGCAGGGGCTAT
GCGCCGTGTACGGAATGTGGAATTGAAAGATCCCGTGCAGACTGTTCCGTGTGTAAAGGAAAGGGTATAGTGACCTGCCACCAGTGCTTGGGAGATCGTGTCATATGGGA
AGAGTCCATTGATGAACAACCGTGGGAGAAAGCCCGCTCCACTTCTCCATTAAGAATGAAGGAAGACGATGAAGTTGATAACTTGGAAATAAAGCTGGAAGAAAAGAAGA
AATCAAAGCGTGTTTACCAATCACCTCCTCCTGAAGTTGGATTAAAGATCAGTCGATCATTAAAAAGTCTCAATGCCAAAACAGGTCTATTTAGTAAGAGAATGAAGATT
ATCCATCGTGACCCCACTCTTCATGCTCAGAGAGTGGCTGCAATTAAGCGAGTGGAAGACTTTGTTAAAGTAGTTGCCGGGCATGGGTTTTCTCCACCCATGGCCCTTAG
GTTGTTCGCCCCTTTTGTGATTAATATATCTCTTTGA
mRNA sequence
ATGTCCTCGTCACTCTGTTCATCAATTCGGGCATTGGCGCCGCAGTACTGGCCACAGAAGAACTTGAGATTCTACATATTGCTTCAATCCAAGAGATGTATATGCTTCGC
ACCTCGCTTCCTTGCCTGTTTGAGCAGCAACGACGACTCTGTTGCAATCCCCAAACCCGCTCCGCTGGCTTTCGATCCTGCGGAGGAGTTGTACGGACTTGGCGTCGATT
TAAAGCCAAGGAATTCAGCTTCTAGCTCACCTGAACCCAGGTCCTGGTTTGGCCCAAATGGTCAGTATATTAAAGAGCTCCCATGTCCAAGTTGCCGTGGCAGGGGCTAT
GCGCCGTGTACGGAATGTGGAATTGAAAGATCCCGTGCAGACTGTTCCGTGTGTAAAGGAAAGGGTATAGTGACCTGCCACCAGTGCTTGGGAGATCGTGTCATATGGGA
AGAGTCCATTGATGAACAACCGTGGGAGAAAGCCCGCTCCACTTCTCCATTAAGAATGAAGGAAGACGATGAAGTTGATAACTTGGAAATAAAGCTGGAAGAAAAGAAGA
AATCAAAGCGTGTTTACCAATCACCTCCTCCTGAAGTTGGATTAAAGATCAGTCGATCATTAAAAAGTCTCAATGCCAAAACAGGTCTATTTAGTAAGAGAATGAAGATT
ATCCATCGTGACCCCACTCTTCATGCTCAGAGAGTGGCTGCAATTAAGCGAGTGGAAGACTTTGTTAAAGTAGTTGCCGGGCATGGGTTTTCTCCACCCATGGCCCTTAG
GTTGTTCGCCCCTTTTGTGATTAATATATCTCTTTGA
Protein sequence
MSSSLCSSIRALAPQYWPQKNLRFYILLQSKRCICFAPRFLACLSSNDDSVAIPKPAPLAFDPAEELYGLGVDLKPRNSASSSPEPRSWFGPNGQYIKELPCPSCRGRGY
APCTECGIERSRADCSVCKGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPPPEVGLKISRSLKSLNAKTGLFSKRMKI
IHRDPTLHAQRVAAIKRVEDFVKVVAGHGFSPPMALRLFAPFVINISL
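The CDS and protein sequences above can be checked against each other by translating the CDS. The Python below is a minimal sketch that assumes the standard nuclear genetic code and reading frame 1; `translate` and `CODON_TABLE` are illustrative names, not CuGenDB tooling. Only the first 30 and last 18 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS above: matches the protein's first 10 residues.
print(translate("ATGTCCTCGTCACTCTGTTCATCAATTCGG"))  # MSSSLCSSIR

# Last 18 nt of the CDS above, ending in the TGA stop codon.
print(translate("ATTAATATATCTCTTTGA"))  # INISL
```

To check the whole record, join the wrapped CDS lines into one string first, then compare the result with the joined protein lines.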