; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025960 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025960
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationtig00153017:1840819..1848381
RNA-Seq ExpressionSgr025960
SyntenySgr025960
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579221.1 hypothetical protein SDJN03_23669, partial [Cucurbita argyrosperma subsp. sororia]6.4e-8988.32Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MATSL+ +AI+ASLHLIAFV AIGAEMRRS ATV PDEYDETTHCVYDSDAST YGLVAFGLLLISHTVL  VTRCLC GKGLKSGGSTVCAIILF+VSW
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTG
        HLFLGAESLLLAGSVRNAYHTKFRG LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY  HSRADTGGWQKHQNEGVGMV  +F+ HEQH R TG
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTG

KAG7016734.1 hypothetical protein SDJN02_21844 [Cucurbita argyrosperma subsp. argyrosperma]1.7e-8988.83Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MATSL+ +AI+ASLHLIAFV AIGAEMRRS ATV PDEYDETTHCVYDSDAST YGLVAFGLLLISHTVL  VTRCLC GKGLKSGGSTVCAIILF+VSW
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTG
        HLFLGAESLLLAGSVRNAYHTKFRG LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY AHSRADTGGWQKHQNEGVGMV  +F+ HEQH R TG
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTG

XP_022939105.1 uncharacterized protein LOC111445108 [Cucurbita moschata]5.8e-9088.89Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MATSL+ +AI+ASLHLIAFV AIGAEMRRS ATV PDEYDETTHCVYDSDAST YGLVAFGLLLISHTVL  VTRCLC GKGLKSGGSTVCAIILF+VSW
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTGR
        HLFLGAESLLLAGSVRNAYHTKFRG LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY AHSRADTGGWQKHQNEGVGMV  +F+ HEQH R TGR
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTGR

XP_022992987.1 uncharacterized protein LOC111489148 [Cucurbita maxima]4.9e-8988.32Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MATSL+ +AI+ASLHLIAFV AIGAEMRRS ATV PDEYDETTHCVYDSDAST YGLVAFGLLLISHTVL  VTRCLC GKGLKSGGSTVCAIILF+VSW
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTG
        HLFLGAESLLLAGSVRNAYHTKFRG LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY AHSRADTGGW+KHQNEGVGMV  +F+ HEQH R TG
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTG

XP_023550275.1 uncharacterized protein LOC111808498 [Cucurbita pepo subsp. pepo]1.1e-8888.32Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MATSL+ +AI+ASLHLIAFV AIGAEMRRS ATV PDEYDETTHCVYDSDAST YGLVAFGLLLISHTVL  VTRCLC GKGLKSGGSTVCAIILF+VSW
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTG
        HL LGAESLLLAGSVRNAYHTKFRG LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY AHSRADTGGWQKHQNEGVGMV  +F+ HEQH R TG
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTG

TrEMBL top hitse value%identityAlignment
A0A0A0KV04 Uncharacterized protein5.7e-8383.51Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MATSL +I I+ASLHLIAFV AIGAEMRRS A VVPD+YDETT+CVYDSDAST YGLVAFGLLLIS TVL  VTRCLC GKGL+SGGSTVCA+I F++SW
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTR
        HLFLGAESLLLAGSVRNAYHTK+R VLP+ NLSCAMLRRGVFAAAAALTFLSLVFSILYY  HSRADTGGWQKHQNEG+GM  SN  Q EQH R
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTR

A0A1S3CSX8 uncharacterized protein LOC1035045962.4e-8182.47Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MAT+L +I I+ASLHLIAFV AIGAEMRRS A VVPD+YDETT+CVYDSDAST YGLVAFGLLLIS TVL  VTRCLC GKGLKSGGSTVCA+I F++SW
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTR
        HLFLGAESLLLAGSVRNAYHTK+R  LP+ NLSCAMLRRGVFAAAAALTFLSLVFSILYY  HSRADTGGWQKHQNEG+GM  SN  Q E H R
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTR

A0A6J1DHA0 uncharacterized protein LOC1110200442.2e-8788.83Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MATSLS+IAI+ASLHLIAFVLAIGAEMRRS ATVVPDEYDETTHCVYDSDASTAYGL AFGLLLISHT+L AVTRCLC GKGLKSGGSTVCAI+LFVVSW
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNE-GVGMVDSNFAQHEQHTRHT
         LFLGAESLLLAGSVRNAYHTKFRG LPIKNLSC  LR GVFAAAAA+TFLSLVFSILYY  HSRADTGGWQKHQNE GVGMV S+ AQHEQH R T
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNE-GVGMVDSNFAQHEQHTRHT

A0A6J1FLQ5 uncharacterized protein LOC1114451082.8e-9088.89Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MATSL+ +AI+ASLHLIAFV AIGAEMRRS ATV PDEYDETTHCVYDSDAST YGLVAFGLLLISHTVL  VTRCLC GKGLKSGGSTVCAIILF+VSW
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTGR
        HLFLGAESLLLAGSVRNAYHTKFRG LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY AHSRADTGGWQKHQNEGVGMV  +F+ HEQH R TGR
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTGR

A0A6J1JRH3 uncharacterized protein LOC1114891482.4e-8988.32Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MATSL+ +AI+ASLHLIAFV AIGAEMRRS ATV PDEYDETTHCVYDSDAST YGLVAFGLLLISHTVL  VTRCLC GKGLKSGGSTVCAIILF+VSW
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTG
        HLFLGAESLLLAGSVRNAYHTKFRG LPIKNLSC+MLRRGVFAAAAALTFLSLVFSILYY AHSRADTGGW+KHQNEGVGMV  +F+ HEQH R TG
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)4.0e-2842.68Show/hide
Query:  SLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSWHLF
        S  +  +V +L L+AF  +I AE RRS    + D    TT CVYDSD +T YG+ AF  LL S ++L +VT+C+C G+ L  G     +II F+ SW  F
Subjt:  SLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSWHLF

Query:  LGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYY
        L AE+ ++AG+ +NAYHTK+   L  +  SCA LR+G+F A A     ++V ++ YY
Subjt:  LGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYY

AT1G52910.1 Protein of unknown function (DUF1218)9.7e-3549.71Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MA+ L +I IV  L LIA  LAI AE RRS   VVPD   E  HC Y SD +T+YG  AF LL IS  ++   +RC C GK LK GGS  C I+LF++ W
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKN-LSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKH
          FL AE  LLAGS+RNAYHT +R +  I+N  SC ++R+GVFAA A+    + + S  YY ++SRA  G    H
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKN-LSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKH

AT1G68220.1 Protein of unknown function (DUF1218)4.8e-5860.71Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGS-TVCAIILFVVS
        MA S+SI+ +V +LHL+AFV A GAE RRS A  VPD+YDE T C Y ++AST YG+ AFGLLL+S  V+N VT+CLC GKGL +G S TV AI+ FVVS
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGS-TVCAIILFVVS

Query:  WHLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEG--VGMVDSNFAQHEQHT
        W  FLGAE+ LL GS RNAYHTK  G+   K LSCA+L  GVFAA AA T +SL+ +ILYY AHS+ADTGGW+KHQN+G  +GM   + A  +Q+T
Subjt:  WHLFLGAESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEG--VGMVDSNFAQHEQHT

AT3G15480.1 Protein of unknown function (DUF1218)3.5e-3247.9Show/hide
Query:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW
        MA+ L ++ IV  L LIA  LAI AE RRS   V  D   +  +CVY +D +T+YG  AF LL +S  ++ A +RC C GK L  GGS  CAIILF++ W
Subjt:  MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSW

Query:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKN-LSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRA
          FL AE  LLA S+RNAYHT++R +  +++  SC ++R+GVFAA AA T  + + S  YY  +SRA
Subjt:  HLFLGAESLLLAGSVRNAYHTKFRGVLPIKN-LSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRA

AT4G27435.1 Protein of unknown function (DUF1218)2.0e-3249.38Show/hide
Query:  IIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSWHLFLGA
        + AIV   +LIAF LA+ AE RRS A VV D   +  +CVYDSD +T YG+ AF   + S  ++  V+RC C GK LK GGS   A+ILF+VSW  FL A
Subjt:  IIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSWHLFLGA

Query:  ESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRA
        E  LLAGSV NAYHTK+R +       C  LR+GVFAA A+  F + + S  YY+ +  A
Subjt:  ESLLLAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACCTCTTTGTCCATCATAGCCATCGTCGCCTCTCTGCACCTCATCGCCTTCGTCCTCGCCATCGGAGCTGAAATGCGCCGGAGCCATGCAACGGTAGTGCCGGA
TGAGTACGACGAGACTACTCACTGCGTCTACGATTCCGATGCGTCGACGGCGTACGGTCTTGTGGCGTTTGGTTTGCTATTGATAAGCCACACTGTGCTTAACGCTGTCA
CGAGGTGTCTCTGCGGCGGCAAAGGCTTGAAAAGTGGAGGATCCACTGTCTGTGCTATCATCCTCTTCGTCGTTTCTTGGCATCTGTTCTTGGGAGCCGAATCTTTGTTG
CTGGCTGGGTCTGTTAGGAATGCCTACCACACCAAGTTCAGAGGAGTACTGCCTATCAAAAACTTGTCCTGTGCCATGCTTCGACGGGGGGTGTTCGCCGCCGCAGCAGC
CTTGACATTTCTGTCATTGGTGTTCTCGATCCTTTACTACTGGGCGCACTCGAGAGCCGATACTGGAGGTTGGCAGAAGCATCAGAATGAAGGTGTTGGCATGGTGGATT
CTAATTTTGCACAGCACGAGCAGCACACTCGACACACTGGTAGATCAAAATATCAACTTGTTCTTGTTCTTGCTAAGCTACGAATGTGTAAAGTGATTGCATACTTCAAT
CTAGATGTGGAAAATGTAGTTGTCGCATATGACGGAGTGGCGAGGAGGACGGCTTCCCCGCCGGCTGATGAGAATCGGTTGGTCGTCAACGAGGTCGAATTCTTTTCCTA
CAAGAAACCCACTAATATTATCACCACTATCAATAACGAAGATTGGCTTGCACCTGCTGACTCTAACACTGGAAGCGATCGAGCGATGGTGGATGATGGGATTTCATGGA
TGGTGAAGACAAGCCAGCCAAAAACTGAGAAGATTATGAATATGCTAAAATCTTTGACACCAATTAAGCAAATGCATATATATATTGAACATGTTGAGATAGTTAATATC
GAAGAAGGGGAGGTAGAAAAAGGGGTGGATGTTGTTGTACTAAGGGGTGAGAGTTTATCTTCGAAGGAAGACACCAGTGATAAAAGTGTGGACATGAATGGCATTGTAGA
TAGTGATTATGAAATACTTGATGACAATATTCTTTTTGATGAAAATGTGGATGAGCATGAGGTTGGGTTAGAGGGAAGTATGGAAGGGGTGGGACATGAGAACTTAGGAA
GAAGTGGTATGATGCAAAGAAAAAGGTTGGGAGTGTATGATGAGGTTGATTTTGAACCTATGGATGATGGGATGCAGTCCGATTATGCATCATCTAAGGAATTACATTTT
CCTTGCCACTCTAATGATGAAAGGGATGATGAAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTACCTCTTTGTCCATCATAGCCATCGTCGCCTCTCTGCACCTCATCGCCTTCGTCCTCGCCATCGGAGCTGAAATGCGCCGGAGCCATGCAACGGTAGTGCCGGA
TGAGTACGACGAGACTACTCACTGCGTCTACGATTCCGATGCGTCGACGGCGTACGGTCTTGTGGCGTTTGGTTTGCTATTGATAAGCCACACTGTGCTTAACGCTGTCA
CGAGGTGTCTCTGCGGCGGCAAAGGCTTGAAAAGTGGAGGATCCACTGTCTGTGCTATCATCCTCTTCGTCGTTTCTTGGCATCTGTTCTTGGGAGCCGAATCTTTGTTG
CTGGCTGGGTCTGTTAGGAATGCCTACCACACCAAGTTCAGAGGAGTACTGCCTATCAAAAACTTGTCCTGTGCCATGCTTCGACGGGGGGTGTTCGCCGCCGCAGCAGC
CTTGACATTTCTGTCATTGGTGTTCTCGATCCTTTACTACTGGGCGCACTCGAGAGCCGATACTGGAGGTTGGCAGAAGCATCAGAATGAAGGTGTTGGCATGGTGGATT
CTAATTTTGCACAGCACGAGCAGCACACTCGACACACTGGTAGATCAAAATATCAACTTGTTCTTGTTCTTGCTAAGCTACGAATGTGTAAAGTGATTGCATACTTCAAT
CTAGATGTGGAAAATGTAGTTGTCGCATATGACGGAGTGGCGAGGAGGACGGCTTCCCCGCCGGCTGATGAGAATCGGTTGGTCGTCAACGAGGTCGAATTCTTTTCCTA
CAAGAAACCCACTAATATTATCACCACTATCAATAACGAAGATTGGCTTGCACCTGCTGACTCTAACACTGGAAGCGATCGAGCGATGGTGGATGATGGGATTTCATGGA
TGGTGAAGACAAGCCAGCCAAAAACTGAGAAGATTATGAATATGCTAAAATCTTTGACACCAATTAAGCAAATGCATATATATATTGAACATGTTGAGATAGTTAATATC
GAAGAAGGGGAGGTAGAAAAAGGGGTGGATGTTGTTGTACTAAGGGGTGAGAGTTTATCTTCGAAGGAAGACACCAGTGATAAAAGTGTGGACATGAATGGCATTGTAGA
TAGTGATTATGAAATACTTGATGACAATATTCTTTTTGATGAAAATGTGGATGAGCATGAGGTTGGGTTAGAGGGAAGTATGGAAGGGGTGGGACATGAGAACTTAGGAA
GAAGTGGTATGATGCAAAGAAAAAGGTTGGGAGTGTATGATGAGGTTGATTTTGAACCTATGGATGATGGGATGCAGTCCGATTATGCATCATCTAAGGAATTACATTTT
CCTTGCCACTCTAATGATGAAAGGGATGATGAAAGATGA
Protein sequenceShow/hide protein sequence
MATSLSIIAIVASLHLIAFVLAIGAEMRRSHATVVPDEYDETTHCVYDSDASTAYGLVAFGLLLISHTVLNAVTRCLCGGKGLKSGGSTVCAIILFVVSWHLFLGAESLL
LAGSVRNAYHTKFRGVLPIKNLSCAMLRRGVFAAAAALTFLSLVFSILYYWAHSRADTGGWQKHQNEGVGMVDSNFAQHEQHTRHTGRSKYQLVLVLAKLRMCKVIAYFN
LDVENVVVAYDGVARRTASPPADENRLVVNEVEFFSYKKPTNIITTINNEDWLAPADSNTGSDRAMVDDGISWMVKTSQPKTEKIMNMLKSLTPIKQMHIYIEHVEIVNI
EEGEVEKGVDVVVLRGESLSSKEDTSDKSVDMNGIVDSDYEILDDNILFDENVDEHEVGLEGSMEGVGHENLGRSGMMQRKRLGVYDEVDFEPMDDGMQSDYASSKELHF
PCHSNDERDDER