; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012835 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012835
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionmRNA decay activator protein ZFP36L2-like isoform X2
Genome locationtig00153572:133074..135656
RNA-Seq ExpressionSgr012835
SyntenySgr012835
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601766.1 hypothetical protein SDJN03_06999, partial [Cucurbita argyrosperma subsp. sororia]6.8e-9664.53Show/hide
Query:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR
        ME RQGKRSSK SIIKGLN TV PI IQMDNHLVIGGTGHY  ETN K PT+LDRYAEK DRSTPIV YFRSRSPVVGKLPSV DTFSTPVADVINANYR
Subjt:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR

Query:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS
        S+IINH+ASSTSGS SFGS  GGSSLSPLSAIENLET P+RSPQIYGTP+KV EEVIVMD ILISSV  GAKT++SA+D+ SGGGSS  KN YRTDICRS
Subjt:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS

Query:  WEILEAAATATNASLRMERRIFAQLVYLL--------KANLRLACSVL----QFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGH
        WE         + S R   +   Q+ YL+        K  +R     L    + GE Y  HG K RN+  +LTGIA TT QSN +DT  K TE+SRREGH
Subjt:  WEILEAAATATNASLRMERRIFAQLVYLL--------KANLRLACSVL----QFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGH

Query:  TTTSSSTSIRHMNTKPTPISTISIIDWSPEDDGIKIVLPNSKTT
            S  SI ++NT           DWSPEDD I+ ++P +K+T
Subjt:  TTTSSSTSIRHMNTKPTPISTISIIDWSPEDDGIKIVLPNSKTT

KAG7032491.1 mRNA decay activator protein ZFP36L3, partial [Cucurbita argyrosperma subsp. argyrosperma]2.4e-9665.36Show/hide
Query:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR
        ME RQGKRSSK SIIKGLN TV PI IQMDNHLVIGGTGHY  ETN K PT+LDRYAEK DRSTPIV YFRSRSPVVGKLPSV DTFSTPVADVINANYR
Subjt:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR

Query:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS
        S+IINH+ASSTSGS SFGS  GGSSLSPLSAIENLET P+RSPQIYGTP+KV EEVIVMD ILISSV  GAKT++SA+D+ SGGGSS  KN YRTDICRS
Subjt:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS

Query:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHM
        WE   +            +     +   LK   +L       GE Y  HG K RN+  +LTGIA TT QSN +DT  K TE+SRREGH    S  SI ++
Subjt:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHM

Query:  NTKPTPISTISIIDWSPEDDGIKIVLPNSKTT
        NT           DWSPEDD I+ ++P +K+T
Subjt:  NTKPTPISTISIIDWSPEDDGIKIVLPNSKTT

XP_022154583.1 uncharacterized protein LOC111021811 [Momordica charantia]3.4e-11171.73Show/hide
Query:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR
        ME RQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY  ETNCKSPTMLDRY+EKLDRST IVRYFRSRSPV+GKLPSV DTFSTPV DVINANYR
Subjt:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR

Query:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSVP-GAKTMRSAIDTASGGGSSSGKNLYRTDICRS
        SAI+NHYASSTS S SFGS GGGSSLSPLSAIENLETP  RSPQIYGTPVKVDEEVIVMDGILISSVP GAKTMRSA ++ +GGGS SGKNLYRTDICRS
Subjt:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSVP-GAKTMRSAIDTASGGGSSSGKNLYRTDICRS

Query:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACS-VLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRH
        WE   +            +         +K   +++ S    +G S      KF +S      +AVTT  S VIDTIN S EVSR EG  T     SIRH
Subjt:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACS-VLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRH

Query:  MNTKPTPISTISIIDWSPEDDGIKIVLPNSKTTERK
        MNTKP+ ISTISI+DWSPEDDGIKIVLPNSK+T+R+
Subjt:  MNTKPTPISTISIIDWSPEDDGIKIVLPNSKTTERK

XP_022930093.1 uncharacterized protein LOC111436579 isoform X1 [Cucurbita moschata]5.4e-9364.16Show/hide
Query:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR
        ME RQGKRSSK S IKGLN TV PI IQMDNHLVIGGTGHY  ETN K PT+LDRYAEK DRSTPIV YFRSRSPVVGKLPSV DTFSTPVADVINANYR
Subjt:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR

Query:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS
        S+IINH+ASSTSGS SFGS  GGSSLSPLSAIENLET P+RSPQIYGTP+KV EEVIVMD ILISSV  GAKT++SA+++  GGGSS  KN YRTDICRS
Subjt:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS

Query:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHM
        WE   +            +     +   LK   +       FGE Y  HG K RN+  +LTGIA TT QSN  DT  K TE+SRR GH    S  SI ++
Subjt:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHM

Query:  NTKPTPISTISIIDWSPEDDGIKIVLPNSKTT
        NT           DWSPEDD I+ ++P +K+T
Subjt:  NTKPTPISTISIIDWSPEDDGIKIVLPNSKTT

XP_023536031.1 uncharacterized protein LOC111797287 isoform X1 [Cucurbita pepo subsp. pepo]8.4e-9464.46Show/hide
Query:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR
        ME RQGKRSSK +IIKGLN TV PI IQMDNHLVIGGTGHY  ETN K PT+LDRYAEK +RSTPIV YFRSRSPVVGKLPSV DTFSTPVADVINANYR
Subjt:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR

Query:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS
        S+IINH+ASSTSGS SFGS  GGSSLSPLSAIENLET P+RSPQIYGTP+KV EEVIVMD ILISSV  GAKT++SA+D+ SGGGSS  KN YRTDICRS
Subjt:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS

Query:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHM
        WE   +            +     +   LK   +       FGE Y  HG K RN+  +LTGIA TT QSN +D I   TE+SRR GH    S  SI ++
Subjt:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHM

Query:  NTKPTPISTISIIDWSPEDDGIKIVLPNSKTT
        NT           DWSPEDD I++V+P +K+T
Subjt:  NTKPTPISTISIIDWSPEDDGIKIVLPNSKTT

TrEMBL top hitse value%identityAlignment
A0A6J1DKP7 uncharacterized protein LOC1110218111.6e-11171.73Show/hide
Query:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR
        ME RQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY  ETNCKSPTMLDRY+EKLDRST IVRYFRSRSPV+GKLPSV DTFSTPV DVINANYR
Subjt:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR

Query:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSVP-GAKTMRSAIDTASGGGSSSGKNLYRTDICRS
        SAI+NHYASSTS S SFGS GGGSSLSPLSAIENLETP  RSPQIYGTPVKVDEEVIVMDGILISSVP GAKTMRSA ++ +GGGS SGKNLYRTDICRS
Subjt:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSVP-GAKTMRSAIDTASGGGSSSGKNLYRTDICRS

Query:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACS-VLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRH
        WE   +            +         +K   +++ S    +G S      KF +S      +AVTT  S VIDTIN S EVSR EG  T     SIRH
Subjt:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACS-VLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRH

Query:  MNTKPTPISTISIIDWSPEDDGIKIVLPNSKTTERK
        MNTKP+ ISTISI+DWSPEDDGIKIVLPNSK+T+R+
Subjt:  MNTKPTPISTISIIDWSPEDDGIKIVLPNSKTTERK

A0A6J1EPR1 uncharacterized protein LOC111436459 isoform X11.0e-9263.86Show/hide
Query:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR
        ME RQGKRSSK S IKGLN TV PI IQMDNHLVIGGTGHY  ETN K PT+LDRYAEK DRSTPIV YFRSRSPVVGKLPSV DTFSTPVADVINANYR
Subjt:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR

Query:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS
        S+IINH+ASSTSGS SFGS  GGSSLSPLSAIENLET P+RSPQIYGTP+KV EEVIVMD ILISSV  GAKT++SA+++  GGGSS  KN YR DICRS
Subjt:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS

Query:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHM
        WE   +            +     +   LK   +       FGE Y  HG K RN+  +LTGIA TT QSN  DT  K TE+SRR GH    S  SI ++
Subjt:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHM

Query:  NTKPTPISTISIIDWSPEDDGIKIVLPNSKTT
        NT           DWSPEDD I+ ++P +K+T
Subjt:  NTKPTPISTISIIDWSPEDDGIKIVLPNSKTT

A0A6J1EQM3 uncharacterized protein LOC111436579 isoform X12.6e-9364.16Show/hide
Query:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR
        ME RQGKRSSK S IKGLN TV PI IQMDNHLVIGGTGHY  ETN K PT+LDRYAEK DRSTPIV YFRSRSPVVGKLPSV DTFSTPVADVINANYR
Subjt:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR

Query:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS
        S+IINH+ASSTSGS SFGS  GGSSLSPLSAIENLET P+RSPQIYGTP+KV EEVIVMD ILISSV  GAKT++SA+++  GGGSS  KN YRTDICRS
Subjt:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS

Query:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHM
        WE   +            +     +   LK   +       FGE Y  HG K RN+  +LTGIA TT QSN  DT  K TE+SRR GH    S  SI ++
Subjt:  WEILEAAATATNASLRMERRIFAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHM

Query:  NTKPTPISTISIIDWSPEDDGIKIVLPNSKTT
        NT           DWSPEDD I+ ++P +K+T
Subjt:  NTKPTPISTISIIDWSPEDDGIKIVLPNSKTT

A0A6J1ICU5 uncharacterized protein LOC111472626 isoform X12.5e-9163.06Show/hide
Query:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR
        ME RQGKRSSK SIIKGLN TV PI IQMDNHLVIGGTGHY  ETN K PT+LDRYAEK DRSTPIV YFRSRSPVVGKLPS  DT STPVADVINA+YR
Subjt:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR

Query:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS
        S+IINH+ASSTSGS SFGS  GGSSLSPLSA+ENLET P+RSPQIYGTP+KV EEVIVMD ILISSV  GAKT++SA+D+ SGGGSS  KN YRTDICRS
Subjt:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS

Query:  WEILEAAATATNASLRMERRI-FAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRH
        WE         + S R   +  FA     L+       +  +FGE Y  HG K RN+  +LTGIA TT  +++  T    TE++RR GH    S  SI +
Subjt:  WEILEAAATATNASLRMERRI-FAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRH

Query:  MNTKPTPISTISIIDWSPEDDGIKIVLPNSKTT
        +NT           DWSP+DDGI++V+P +K+T
Subjt:  MNTKPTPISTISIIDWSPEDDGIKIVLPNSKTT

A0A6J1IEX1 uncharacterized protein LOC111472616 isoform X12.2e-9263.36Show/hide
Query:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR
        ME RQGKRSSK SIIKGLN TV PI IQMDNHLVIGGTGHY  ETN K PT+LDRYAEK DRSTPIV YFRSRSPVVGKLPS  DTFSTPVADVINA+YR
Subjt:  MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHY--ETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYR

Query:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS
        S+IINH+ASSTSGS SFGS  GGSSLSPLSA+ENLET P+RSPQIYGTP+KV EEVIVMD ILISSV  GAKT++SA+D+ SGGGSS  KN YRTDICRS
Subjt:  SAIINHYASSTSGSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSV-PGAKTMRSAIDTASGGGSSSGKNLYRTDICRS

Query:  WEILEAAATATNASLRMERRI-FAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRH
        WE         + S R   +  FA     L+       +  +FGE Y  HG K RN+  +LTGIA TT  +++  T    TE++RR GH    S  SI +
Subjt:  WEILEAAATATNASLRMERRI-FAQLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRH

Query:  MNTKPTPISTISIIDWSPEDDGIKIVLPNSKTT
        +NT           DWSP+DDGI++V+P +K+T
Subjt:  MNTKPTPISTISIIDWSPEDDGIKIVLPNSKTT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATATCGTCAAGGCAAGAGGAGCTCCAAAACGTCCATTATCAAAGGCCTAAACGTTACGGTTCCGCCGATCAACATTCAGATGGACAATCACTTGGTTATCGGCGG
GACCGGACACTATGAAACAAACTGTAAAAGTCCGACGATGCTCGATCGTTACGCTGAGAAGCTTGACCGTTCCACACCGATCGTAAGGTACTTCCGCTCTCGATCTCCGG
TAGTCGGAAAATTACCATCCGTCGTTGATACGTTTTCCACTCCGGTAGCGGACGTCATCAATGCCAACTACCGCTCCGCTATTATCAATCACTACGCTAGCAGCACTTCA
GGAAGCGGTAGCTTCGGTTCTTGGGGGGGAGGATCGTCATTGTCACCACTTTCGGCTATAGAGAATTTGGAAACGCCACCTCTAAGGTCTCCGCAAATTTATGGAACTCC
GGTGAAGGTGGATGAAGAAGTGATAGTAATGGATGGAATTTTGATCAGTTCGGTCCCTGGAGCGAAAACGATGAGGTCTGCTATAGATACTGCAAGCGGCGGAGGTTCGT
CGTCGGGCAAAAATCTGTACAGAACGGATATTTGCCGCTCCTGGGAGATTCTGGAAGCTGCCGCTACGGCCACAAATGCCAGTTTGCGCATGGAAAGGAGGATCTTCGCC
CAGCTCGTTTACCTGTTAAAAGCAAACCTAAGATTAGCATGTTCTGTACTACAGTTCGGCGAATCATATTCTATACATGGCCCCAAGTTCCGCAACAGTACTCAGGCATT
AACGGGAATAGCTGTAACAACATTGCAGTCAAACGTAATAGATACAATCAACAAAAGTACTGAAGTTAGCAGAAGGGAAGGTCATACAACAACGTCGTCCTCAACTTCAA
TACGACACATGAACACTAAGCCAACCCCCATCTCTACCATTTCCATCATCGACTGGTCACCAGAAGACGATGGCATCAAAATTGTTCTCCCCAATTCAAAGACCACTGAA
AGGAAGATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAATATCGTCAAGGCAAGAGGAGCTCCAAAACGTCCATTATCAAAGGCCTAAACGTTACGGTTCCGCCGATCAACATTCAGATGGACAATCACTTGGTTATCGGCGG
GACCGGACACTATGAAACAAACTGTAAAAGTCCGACGATGCTCGATCGTTACGCTGAGAAGCTTGACCGTTCCACACCGATCGTAAGGTACTTCCGCTCTCGATCTCCGG
TAGTCGGAAAATTACCATCCGTCGTTGATACGTTTTCCACTCCGGTAGCGGACGTCATCAATGCCAACTACCGCTCCGCTATTATCAATCACTACGCTAGCAGCACTTCA
GGAAGCGGTAGCTTCGGTTCTTGGGGGGGAGGATCGTCATTGTCACCACTTTCGGCTATAGAGAATTTGGAAACGCCACCTCTAAGGTCTCCGCAAATTTATGGAACTCC
GGTGAAGGTGGATGAAGAAGTGATAGTAATGGATGGAATTTTGATCAGTTCGGTCCCTGGAGCGAAAACGATGAGGTCTGCTATAGATACTGCAAGCGGCGGAGGTTCGT
CGTCGGGCAAAAATCTGTACAGAACGGATATTTGCCGCTCCTGGGAGATTCTGGAAGCTGCCGCTACGGCCACAAATGCCAGTTTGCGCATGGAAAGGAGGATCTTCGCC
CAGCTCGTTTACCTGTTAAAAGCAAACCTAAGATTAGCATGTTCTGTACTACAGTTCGGCGAATCATATTCTATACATGGCCCCAAGTTCCGCAACAGTACTCAGGCATT
AACGGGAATAGCTGTAACAACATTGCAGTCAAACGTAATAGATACAATCAACAAAAGTACTGAAGTTAGCAGAAGGGAAGGTCATACAACAACGTCGTCCTCAACTTCAA
TACGACACATGAACACTAAGCCAACCCCCATCTCTACCATTTCCATCATCGACTGGTCACCAGAAGACGATGGCATCAAAATTGTTCTCCCCAATTCAAAGACCACTGAA
AGGAAGATGTAA
Protein sequenceShow/hide protein sequence
MEYRQGKRSSKTSIIKGLNVTVPPINIQMDNHLVIGGTGHYETNCKSPTMLDRYAEKLDRSTPIVRYFRSRSPVVGKLPSVVDTFSTPVADVINANYRSAIINHYASSTS
GSGSFGSWGGGSSLSPLSAIENLETPPLRSPQIYGTPVKVDEEVIVMDGILISSVPGAKTMRSAIDTASGGGSSSGKNLYRTDICRSWEILEAAATATNASLRMERRIFA
QLVYLLKANLRLACSVLQFGESYSIHGPKFRNSTQALTGIAVTTLQSNVIDTINKSTEVSRREGHTTTSSSTSIRHMNTKPTPISTISIIDWSPEDDGIKIVLPNSKTTE
RKM