; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr013236 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr013236
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNudix hydrolase domain-containing protein
Genome locationtig00153764:76346..77233
RNA-Seq ExpressionSgr013236
SyntenySgr013236
Gene Ontology termsNA
InterPro domainsIPR015797 - NUDIX hydrolase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138298.1 uncharacterized protein LOC111009508 [Momordica charantia]1.8e-14087.46Show/hide
Query:  MIVDMSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNP-RGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEWLK
        MI+DM SPPPPLPPPH ISNLTHLNKS PLPD +LAALSLFVFFSSSS KSFKFPL  FQ NP R FLKIPSMSLS P+PKTR DNH+FASPQSLS+WL 
Subjt:  MIVDMSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNP-RGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEWLK

Query:  PRLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEEL
        PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSL+DSNPP+RTVQVVSLRI+DKHNRVLVESHQ+LSDGT+RNRNRPLSEKMKPNETPESAVYRAVKEEL
Subjt:  PRLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEEL

Query:  GSIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADK-SVSVKKHYWKWVSADSVDS
        GSIIGD DCCEIV IVP+SY+MKIEERNSVSYPGLPACYVLHSMDV VEGLP+ EFCTVEEEEY +SEETEIA K +VSVKKH+WKWVSADSVDS
Subjt:  GSIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADK-SVSVKKHYWKWVSADSVDS

XP_022922297.1 uncharacterized protein LOC111430319 [Cucurbita moschata]2.3e-12779.79Show/hide
Query:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN----PKTRHDNHNFASPQSLSEWLKP
        M S PPP+PPP  IS+L HL +S PLPD FLAALSLFVF SSSS++SFKFPL   Q NPR FLK PSMS S PN    P   H  H F SPQSLS+WLKP
Subjt:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN----PKTRHDNHNFASPQSLSEWLKP

Query:  RLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFASWGVKPGTKNVHNLWLELSEGETSL+DSNPP+RTVQV+SLRIID H R+L+ESHQQLSDGT+RNRNRPLSEKMKPNETPESAVYRAVKEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD
        SI+GD DC EIV IVPDSY+MKIEERNS SYPGLPACYVLHSMDVLVEGLP+ +FCTVEEEEY  SEET IAD++VSVKKH+WKWVS DS+D
Subjt:  SIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD

XP_022974352.1 uncharacterized protein LOC111472979 [Cucurbita maxima]2.5e-12679.11Show/hide
Query:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN----PKTRHDNHNFASPQSLSEWLKP
        M S PPP+PPP  IS+L HL +S PLPD FLAALSLFVF SSSS++SFK PL   Q NPR FLK PSMS S PN    P   H  H FASPQSLS+WLKP
Subjt:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN----PKTRHDNHNFASPQSLSEWLKP

Query:  RLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFASWGVKPGTKNVHNLWLELSEGETSL+DS PP+RTVQV+SLRIID H R+L+ESHQQLSDGT+RNRNRPLSEKMKPNETPESAVYRAVKEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD
        SI+GD DC EIV IVPDSY+MKIEERNS SYPGLPACYVLHSMDVLVEGLP+ +FCTVEEEEY  SEE+ IAD++VSVKKH+WKWVS DS+D
Subjt:  SIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD

XP_023550799.1 uncharacterized protein LOC111808831 [Cucurbita pepo subsp. pepo]5.1e-12779.45Show/hide
Query:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN----PKTRHDNHNFASPQSLSEWLKP
        M S PPP+PPP  IS+L HL +S PLPD FLAALSLFVF SSSS++SFKFPL   Q NPR FLK PSMS S PN    P   H  H F SPQSLS+WLKP
Subjt:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN----PKTRHDNHNFASPQSLSEWLKP

Query:  RLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFASWGVKPGTKNVHNLWLELSEGETSL+DSNPP+RTVQV+SLRIID H R+L+ESHQQLSDGT+RNRNRPLSEKMKPNETPESAVYRAVKEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD
        SI+GD DC EIV IVPDSY+MKIEERNS SYPGLPACYVLHSMDV+VEGLP+ +FCTVEEEEY  SEET IAD++VSVKKH+WKWVS DS+D
Subjt:  SIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD

XP_038874473.1 uncharacterized protein LOC120067121 isoform X2 [Benincasa hispida]1.5e-12380.07Show/hide
Query:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFF-SSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN-PKTRHDNHNFASPQSLSEWLKPRL
        M S P P+PPP   SNL HLNKS  LPD FLAALSLFVFF SSSS+KSFKFP FS QLNPR FLKIPS S+     P ++  +  F SPQSLSEWLKPRL
Subjt:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFF-SSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN-PKTRHDNHNFASPQSLSEWLKPRL

Query:  PSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELGSI
        PSDSFASWGV PGTKNVHNLWLE+S+GETSL+DSNPP+RT+ V+SLRI+D H+RVLVESHQQLSDGT+RNRNRPLSEKMKPNETPESAVYRAVKEELGSI
Subjt:  PSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELGSI

Query:  IGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSA
        IGD DC +IV IVPDSY+MKIEERNSVSYPGLPACYVLHSMDV VEGLPEGEFCTVEEEEYG SEET IAD++VSVKKH+WKWV +
Subjt:  IGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSA

TrEMBL top hitse value%identityAlignment
A0A0A0KJQ6 Uncharacterized protein2.8e-11575.17Show/hide
Query:  PPPPLPP--PHSISNLTHLNKS-APLPDLFLAALSLFVFF--SSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEWLKPRL
        PPPP+PP  P  ISNLTHLNKS A LPD FLAALSLF F   SSSS+KSFKFP FS QLNPR F KIPS+S+  PN  ++  +  F SPQSLSEWL+PRL
Subjt:  PPPPLPP--PHSISNLTHLNKS-APLPDLFLAALSLFVFF--SSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEWLKPRL

Query:  PSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELGSI
        PS SFASWGV PGTKN+HNLWLE+S+GETSL+DSNPP+RT+ V+SLRIID H+R+L+ESHQQLSDGT+RNRNRPLSEKMKPNETPESAVYRAV+EELGSI
Subjt:  PSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELGSI

Query:  IGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD
        +GD D  ++V IVPDSY++KIEER+SVSYPGL A YVLHSMDV VEGLP+G+FCTVEEEEY  SE+T IAD +VSVKKH+WKWVS +SVD
Subjt:  IGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD

A0A1S3AUP2 uncharacterized protein LOC1034830016.1e-11876.12Show/hide
Query:  PPPPLPP--PHSISNLTHLNKS-APLPDLFLAALSLFVFFSSSS-TKSFKFPLFSFQLNPRGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEWLKPRLP
        PPPP+PP  P  ISNLTHLNKS A LPD FLAALSLF FFSSSS +KSFKFP FS QLNPR FLKIPS S+  PN  ++  +  F SPQSLSEWL+PRLP
Subjt:  PPPPLPP--PHSISNLTHLNKS-APLPDLFLAALSLFVFFSSSS-TKSFKFPLFSFQLNPRGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEWLKPRLP

Query:  SDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELGSII
        S SFASWGV PGTKN+HNLWLE+S+GETSL+DSNPP+R + V+SLRIID H+R+L+ESHQQLSDGT+RNRNRPLSEKMKPNETPESAVYRAV+EELGSI+
Subjt:  SDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELGSII

Query:  GDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD
         D DC  +V IVPDSY++KIEER+SVSYPGLPACYVLHSMD+ VEGLP+G+FCTVE+EEY  SEET IAD++VSVKKH+WKWVS +SVD
Subjt:  GDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD

A0A6J1CCN3 uncharacterized protein LOC1110095088.7e-14187.46Show/hide
Query:  MIVDMSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNP-RGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEWLK
        MI+DM SPPPPLPPPH ISNLTHLNKS PLPD +LAALSLFVFFSSSS KSFKFPL  FQ NP R FLKIPSMSLS P+PKTR DNH+FASPQSLS+WL 
Subjt:  MIVDMSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNP-RGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEWLK

Query:  PRLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEEL
        PRLPSDSFASWGVKPGTKNVHNLWLE+SEGETSL+DSNPP+RTVQVVSLRI+DKHNRVLVESHQ+LSDGT+RNRNRPLSEKMKPNETPESAVYRAVKEEL
Subjt:  PRLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEEL

Query:  GSIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADK-SVSVKKHYWKWVSADSVDS
        GSIIGD DCCEIV IVP+SY+MKIEERNSVSYPGLPACYVLHSMDV VEGLP+ EFCTVEEEEY +SEETEIA K +VSVKKH+WKWVSADSVDS
Subjt:  GSIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADK-SVSVKKHYWKWVSADSVDS

A0A6J1E2U8 uncharacterized protein LOC1114303191.1e-12779.79Show/hide
Query:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN----PKTRHDNHNFASPQSLSEWLKP
        M S PPP+PPP  IS+L HL +S PLPD FLAALSLFVF SSSS++SFKFPL   Q NPR FLK PSMS S PN    P   H  H F SPQSLS+WLKP
Subjt:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN----PKTRHDNHNFASPQSLSEWLKP

Query:  RLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFASWGVKPGTKNVHNLWLELSEGETSL+DSNPP+RTVQV+SLRIID H R+L+ESHQQLSDGT+RNRNRPLSEKMKPNETPESAVYRAVKEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD
        SI+GD DC EIV IVPDSY+MKIEERNS SYPGLPACYVLHSMDVLVEGLP+ +FCTVEEEEY  SEET IAD++VSVKKH+WKWVS DS+D
Subjt:  SIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD

A0A6J1IA21 uncharacterized protein LOC1114729791.2e-12679.11Show/hide
Query:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN----PKTRHDNHNFASPQSLSEWLKP
        M S PPP+PPP  IS+L HL +S PLPD FLAALSLFVF SSSS++SFK PL   Q NPR FLK PSMS S PN    P   H  H FASPQSLS+WLKP
Subjt:  MSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPN----PKTRHDNHNFASPQSLSEWLKP

Query:  RLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFASWGVKPGTKNVHNLWLELSEGETSL+DS PP+RTVQV+SLRIID H R+L+ESHQQLSDGT+RNRNRPLSEKMKPNETPESAVYRAVKEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD
        SI+GD DC EIV IVPDSY+MKIEERNS SYPGLPACYVLHSMDVLVEGLP+ +FCTVEEEEY  SEE+ IAD++VSVKKH+WKWVS DS+D
Subjt:  SIIGDFDCCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G24460.1 unknown protein1.0e-8055.45Show/hide
Query:  MIVDMSSPPPPLPPPHSISNLTHLNK--SAPLPDLFLAALS-LFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEW
        M V  S P  PL    +I+N    N   ++ LPD+FLAA+S LF++ S     S     FSF LNPR   +    ++SR +P        FA+PQSLS+W
Subjt:  MIVDMSSPPPPLPPPHSISNLTHLNK--SAPLPDLFLAALS-LFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEW

Query:  LKPRLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKE
        L+ RLPSDSFA+WGVKPGTKNVHNLWLELS+GETSL+DS PP+RTV VV++R+I K+ R+LVE+HQ+LSDG++R R RPLSEKMKP E+P+ AV+RA+KE
Subjt:  LKPRLPSDSFASWGVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKE

Query:  ELGSII-GDFD-CCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYG-----RSEETEIADKSVSVKKHYWKWVSADS
        ELGSI  GD D   + + I+P +Y  ++EERNS+SYPGLPA Y LHS++  VEGLPE +FCT E+E  G      S ET  A  +V+VK+HYWKWVS DS
Subjt:  ELGSII-GDFD-CCEIVTIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYG-----RSEETEIADKSVSVKKHYWKWVSADS

Query:  VDS
        + S
Subjt:  VDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGTTGATATGTCATCGCCCCCACCTCCACTTCCACCGCCCCACTCCATCTCCAATCTTACTCACCTCAACAAATCCGCGCCTCTTCCTGATCTTTTCCTCGCTGC
TCTCTCTCTTTTCGTTTTCTTCTCTTCTTCCTCCACCAAATCCTTCAAATTCCCTCTTTTCTCTTTTCAATTAAACCCTCGCGGTTTTCTCAAGATACCCTCCATGTCCC
TCTCACGTCCCAACCCCAAAACACGCCATGACAATCACAACTTCGCATCTCCTCAATCCCTCTCCGAATGGCTCAAACCTCGCTTGCCCTCCGACTCTTTTGCTTCTTGG
GGTGTAAAGCCTGGCACCAAGAACGTCCACAACCTCTGGCTCGAGCTCTCAGAAGGAGAAACTTCCCTTTCCGACTCAAACCCTCCCCTTCGCACCGTTCAGGTCGTTTC
TCTTCGAATTATTGATAAACATAACCGAGTTCTCGTCGAATCGCACCAGCAACTATCCGATGGCACCGTACGGAATCGAAATCGACCGTTGTCGGAGAAAATGAAGCCGA
ATGAGACCCCTGAATCCGCGGTTTACCGGGCAGTGAAAGAAGAGCTCGGTTCGATTATTGGCGATTTCGATTGTTGTGAAATTGTGACGATTGTGCCAGATTCGTATCAA
ATGAAGATTGAGGAGCGGAACTCGGTTTCATACCCAGGTTTGCCGGCTTGTTACGTTTTGCATTCGATGGATGTTTTGGTTGAAGGTTTACCCGAGGGGGAGTTCTGCAC
AGTGGAGGAGGAGGAGTACGGAAGATCTGAGGAGACAGAGATTGCGGACAAGTCTGTGTCCGTGAAGAAGCATTATTGGAAATGGGTTAGTGCTGATTCTGTGGATTCTT
TAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCGTTGATATGTCATCGCCCCCACCTCCACTTCCACCGCCCCACTCCATCTCCAATCTTACTCACCTCAACAAATCCGCGCCTCTTCCTGATCTTTTCCTCGCTGC
TCTCTCTCTTTTCGTTTTCTTCTCTTCTTCCTCCACCAAATCCTTCAAATTCCCTCTTTTCTCTTTTCAATTAAACCCTCGCGGTTTTCTCAAGATACCCTCCATGTCCC
TCTCACGTCCCAACCCCAAAACACGCCATGACAATCACAACTTCGCATCTCCTCAATCCCTCTCCGAATGGCTCAAACCTCGCTTGCCCTCCGACTCTTTTGCTTCTTGG
GGTGTAAAGCCTGGCACCAAGAACGTCCACAACCTCTGGCTCGAGCTCTCAGAAGGAGAAACTTCCCTTTCCGACTCAAACCCTCCCCTTCGCACCGTTCAGGTCGTTTC
TCTTCGAATTATTGATAAACATAACCGAGTTCTCGTCGAATCGCACCAGCAACTATCCGATGGCACCGTACGGAATCGAAATCGACCGTTGTCGGAGAAAATGAAGCCGA
ATGAGACCCCTGAATCCGCGGTTTACCGGGCAGTGAAAGAAGAGCTCGGTTCGATTATTGGCGATTTCGATTGTTGTGAAATTGTGACGATTGTGCCAGATTCGTATCAA
ATGAAGATTGAGGAGCGGAACTCGGTTTCATACCCAGGTTTGCCGGCTTGTTACGTTTTGCATTCGATGGATGTTTTGGTTGAAGGTTTACCCGAGGGGGAGTTCTGCAC
AGTGGAGGAGGAGGAGTACGGAAGATCTGAGGAGACAGAGATTGCGGACAAGTCTGTGTCCGTGAAGAAGCATTATTGGAAATGGGTTAGTGCTGATTCTGTGGATTCTT
TAGTTTAA
Protein sequenceShow/hide protein sequence
MIVDMSSPPPPLPPPHSISNLTHLNKSAPLPDLFLAALSLFVFFSSSSTKSFKFPLFSFQLNPRGFLKIPSMSLSRPNPKTRHDNHNFASPQSLSEWLKPRLPSDSFASW
GVKPGTKNVHNLWLELSEGETSLSDSNPPLRTVQVVSLRIIDKHNRVLVESHQQLSDGTVRNRNRPLSEKMKPNETPESAVYRAVKEELGSIIGDFDCCEIVTIVPDSYQ
MKIEERNSVSYPGLPACYVLHSMDVLVEGLPEGEFCTVEEEEYGRSEETEIADKSVSVKKHYWKWVSADSVDSLV