; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028927 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028927
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationtig00153210:1685615..1686527
RNA-Seq ExpressionSgr028927
SyntenySgr028927
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601276.1 hypothetical protein SDJN03_06509, partial [Cucurbita argyrosperma subsp. sororia]2.6e-8785.28Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW
        MA+SAIVL TITSLHL+AFVLAVGAERRRSTA VVPDEYDE TYCVYGTDASTVYGLSAFGLLL+SQ VVNGVTRCFCCGKGL+SGK TTVAIFFFVFSW
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW

Query:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNE-GLGMAAAPHDLKQTTHFEK
        ISFVGAE+ LLAGSARNAYHTKYRAAFG +DLSCATLRKGVFAG  AMT+LS+VGSIL+YWAHSKADTGGWQK QNE G+G+AA  HDLKQ    +K
Subjt:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNE-GLGMAAAPHDLKQTTHFEK

XP_008446636.1 PREDICTED: uncharacterized protein LOC103489308 [Cucumis melo]2.6e-8787.05Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW
        MAVSAIVLATI SLHL+AFVLAVGAERRRSTANVVPDEYDE+TYCVYGTDASTVYGLSAFGLLL+SQ VVNGVTRCFCCGKGL+SGKTTTVAIFFFVFSW
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW

Query:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAAAPH---DLKQ
        ISF+GAE+ LLAGSARNAYHTKYRA FGV+ LSCATLRKGVFAG AAMT+LS+VGSILFYW HSKADTGGW+KHQNEG+GM AA     D+KQ
Subjt:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAAAPH---DLKQ

XP_022986328.1 uncharacterized protein LOC111484079 [Cucurbita maxima]5.3e-8885.35Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW
        MA+SAIVL TITSLHL+AFVLAVGAERRRSTA VVPDEYDE TYCVYGTDASTVYGLSAFGLLL+SQ VVNGVTRCFCCGKGL+SGK TTVAIFFFVFSW
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW

Query:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNE-GLGMAAAPHDLKQTTHFEKP
        ISFVGAE+ LLAGSARNAYHTKYRAAFG +DLSCATLRKGVFAG  AMT+LS+VGSIL+YWAHSKADTGGWQK QNE G+G+AA  HDLKQ    +KP
Subjt:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNE-GLGMAAAPHDLKQTTHFEKP

XP_023514518.1 uncharacterized protein LOC111778772 [Cucurbita pepo subsp. pepo]2.0e-8785.28Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW
        MA+SAIVL TITSLHL+AFVLAVGAERRRSTA VVPDEYDE TYCVYGTDASTVYGLSAFGLLL+SQ VVNGVTRCFCCGKGL+SGK TTVAIFFFVFSW
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW

Query:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNE-GLGMAAAPHDLKQTTHFEK
        ISFVGAE+ LLAGSARNAYHTKYRAA G +DLSCATLRKGVFAG  AMT+LS+VGSIL+YWAHSKADTGGWQK QNE G+G+AA  HDLKQ  H +K
Subjt:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNE-GLGMAAAPHDLKQTTHFEK

XP_038892755.1 uncharacterized protein LOC120081728 [Benincasa hispida]1.0e-9187.24Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW
        MA+SAIVLATITSLHL+AFVLAVGAERRRSTA VVPDEYDE+TYC+YGTDASTVYGLSAFGLLL+SQ VVNGVTRCFCCGKGL+SGKTTTVAIFFFVFSW
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW

Query:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAAAPHDLKQTTHFEK
        ISFVGAE+ LLAGSARNAYHTKYRAAFGV++LSC TLRKGVFAG AAMT+LS+VGSIL+YW HSKADTGGW+KHQNEG+GMAA  HDLKQ   FEK
Subjt:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAAAPHDLKQTTHFEK

TrEMBL top hitse value%identityAlignment
A0A0A0KWH0 Uncharacterized protein9.1e-8688.11Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW
        MAVSAIVLATITSLHL+AFVLAVGAERRRSTAN+VPDEYDE+TYCVYGTDASTVYGLSAFGLLL+SQ VVNGVTRCFCCGKGL+SGKTTTVAIFFFVFSW
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW

Query:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQ-NEGLGMAAA
        ISF+GAE+ LLAGSARNAYHTKYRA FGV+ LSCATLRKGVFAG AAMT+LS+VGSIL+YW HSKADTGGW+KHQ NEG+GM  A
Subjt:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQ-NEGLGMAAA

A0A1S3BF21 uncharacterized protein LOC1034893081.3e-8787.05Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW
        MAVSAIVLATI SLHL+AFVLAVGAERRRSTANVVPDEYDE+TYCVYGTDASTVYGLSAFGLLL+SQ VVNGVTRCFCCGKGL+SGKTTTVAIFFFVFSW
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW

Query:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAAAPH---DLKQ
        ISF+GAE+ LLAGSARNAYHTKYRA FGV+ LSCATLRKGVFAG AAMT+LS+VGSILFYW HSKADTGGW+KHQNEG+GM AA     D+KQ
Subjt:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAAAPH---DLKQ

A0A5D3CD50 Uncharacterized protein1.3e-8787.05Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW
        MAVSAIVLATI SLHL+AFVLAVGAERRRSTANVVPDEYDE+TYCVYGTDASTVYGLSAFGLLL+SQ VVNGVTRCFCCGKGL+SGKTTTVAIFFFVFSW
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW

Query:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAAAPH---DLKQ
        ISF+GAE+ LLAGSARNAYHTKYRA FGV+ LSCATLRKGVFAG AAMT+LS+VGSILFYW HSKADTGGW+KHQNEG+GM AA     D+KQ
Subjt:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAAAPH---DLKQ

A0A6J1GYZ6 uncharacterized protein LOC111458800 isoform X16.3e-8784.77Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW
        MA+SAIVL TITSLHL+AFVLAVGAERRRSTA VVPDEYDE TYCVYGTDASTVYGLSAFGLLL+SQ VVNGVTRCFCCGKGL+SGK TTVAIFFFVFSW
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW

Query:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNE-GLGMAAAPHDLKQTTHFEK
        ISFVGAE+ LLAGSARNAYHTKYRAAFG +DLSCATLRKGVFAG  AMT+LS+VGSI +YWAHSKADTGGWQK QNE G+G+AA  HDLKQ    +K
Subjt:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNE-GLGMAAAPHDLKQTTHFEK

A0A6J1JG73 uncharacterized protein LOC1114840792.6e-8885.35Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW
        MA+SAIVL TITSLHL+AFVLAVGAERRRSTA VVPDEYDE TYCVYGTDASTVYGLSAFGLLL+SQ VVNGVTRCFCCGKGL+SGK TTVAIFFFVFSW
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSW

Query:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNE-GLGMAAAPHDLKQTTHFEKP
        ISFVGAE+ LLAGSARNAYHTKYRAAFG +DLSCATLRKGVFAG  AMT+LS+VGSIL+YWAHSKADTGGWQK QNE G+G+AA  HDLKQ    +KP
Subjt:  ISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNE-GLGMAAAPHDLKQTTHFEKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52910.1 Protein of unknown function (DUF1218)1.6e-3444.77Show/hide
Query:  SAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSWISF
        S +V+  +  L L+A  LA+ AE+RRS   VVPD   E  +C YG+D +T YG  AF LL +SQ ++   +RCFCCGK L  G +    I  F+  W+ F
Subjt:  SAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSWISF

Query:  VGAEVCLLAGSARNAYHTKYRAAFGVKD-LSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKH
        + AEVCLLAGS RNAYHT YR  + +++  SC  +RKGVFA GA+  L + + S  +Y ++S+A  G    H
Subjt:  VGAEVCLLAGSARNAYHTKYRAAFGVKD-LSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKH

AT1G61065.1 Protein of unknown function (DUF1218)1.3e-3142.78Show/hide
Query:  SAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSWISF
        S ++L  +    L+AF LAV AE+RR+T  +  +  D  +YCVY  D +T  G+ +F +LL SQ ++   +RC CCG+ L    + + AIF F+ +W+ F
Subjt:  SAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSWISF

Query:  VGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAA
          A+VCLLAGS RNAYHTKYR  FG    SC +LRKGVF  GAA  +L+ + S L+Y   S+A    +Q  ++ G+ M++
Subjt:  VGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAA

AT1G68220.1 Protein of unknown function (DUF1218)9.4e-6765.5Show/hide
Query:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTV-AIFFFVFS
        MAVS  +L  +T+LHL+AFV A GAERRRSTA  VPD+YDE+T C YGT+ASTVYG+SAFGLLLVSQ VVNGVT+C C GKGLV+G + TV AI FFV S
Subjt:  MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTV-AIFFFVFS

Query:  WISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMA-AAPHDL--KQTTHFEK
        W+SF+GAE CLL GSARNAYHTK    +  K+LSCA L  GVFA GAA TL+SL+ +IL+Y AHSKADTGGW+KHQN+G+ +    P D   +Q T F K
Subjt:  WISFVGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMA-AAPHDL--KQTTHFEK

AT3G15480.1 Protein of unknown function (DUF1218)3.3e-3548.17Show/hide
Query:  SAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSWISF
        S +V+  +  L L+A  LA+ AE+RRS   V  D   +  YCVYGTD +T YG  AF LL VSQ ++   +RCFCCGK L  G +   AI  F+  W+ F
Subjt:  SAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSWISF

Query:  VGAEVCLLAGSARNAYHTKYRAAFGVKD-LSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKA
        + AE+CLLA S RNAYHT+YR  + V+D  SC  +RKGVFA GAA TL + + S  +Y  +S+A
Subjt:  VGAEVCLLAGSARNAYHTKYRAAFGVKD-LSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKA

AT4G27435.1 Protein of unknown function (DUF1218)7.2e-3547.85Show/hide
Query:  SAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSWISF
        S IV A +   +L+AF LAV AE+RRSTA VV D   +  YCVY +D +T YG+ AF   + SQ ++  V+RCFCCGK L  G +  +A+  F+ SW+ F
Subjt:  SAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSWISF

Query:  VGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKA
        + AE+CLLAGS  NAYHTKYR  F      C TLRKGVFA GA+    + + S  +Y+ +  A
Subjt:  VGAEVCLLAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTGTCGGCGATTGTCTTGGCTACCATCACTTCTCTACATCTCATGGCCTTTGTTCTCGCCGTGGGCGCCGAGCGCCGCCGCAGCACGGCGAATGTGGTCCCGGA
TGAGTACGACGAGCAGACGTACTGCGTGTACGGCACCGACGCGTCGACGGTGTACGGACTGTCGGCGTTCGGTTTGCTTCTTGTAAGCCAGACGGTGGTTAACGGTGTCA
CTAGGTGTTTCTGCTGTGGGAAGGGTTTGGTTAGTGGAAAAACCACCACCGTCGCCATCTTCTTCTTTGTCTTCTCCTGGATCAGCTTTGTGGGAGCTGAGGTTTGCCTG
TTGGCGGGATCGGCGAGGAACGCGTACCACACCAAGTACAGGGCGGCGTTCGGCGTCAAAGACTTATCGTGTGCGACCCTTCGGAAGGGAGTGTTCGCCGGCGGCGCCGC
CATGACACTGCTGTCGCTGGTGGGATCGATTCTGTTCTACTGGGCGCACTCTAAAGCCGACACCGGAGGTTGGCAGAAGCACCAGAACGAGGGCCTCGGCATGGCGGCGG
CTCCTCACGATCTCAAACAAACCACTCATTTTGAGAAACCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGTGTCGGCGATTGTCTTGGCTACCATCACTTCTCTACATCTCATGGCCTTTGTTCTCGCCGTGGGCGCCGAGCGCCGCCGCAGCACGGCGAATGTGGTCCCGGA
TGAGTACGACGAGCAGACGTACTGCGTGTACGGCACCGACGCGTCGACGGTGTACGGACTGTCGGCGTTCGGTTTGCTTCTTGTAAGCCAGACGGTGGTTAACGGTGTCA
CTAGGTGTTTCTGCTGTGGGAAGGGTTTGGTTAGTGGAAAAACCACCACCGTCGCCATCTTCTTCTTTGTCTTCTCCTGGATCAGCTTTGTGGGAGCTGAGGTTTGCCTG
TTGGCGGGATCGGCGAGGAACGCGTACCACACCAAGTACAGGGCGGCGTTCGGCGTCAAAGACTTATCGTGTGCGACCCTTCGGAAGGGAGTGTTCGCCGGCGGCGCCGC
CATGACACTGCTGTCGCTGGTGGGATCGATTCTGTTCTACTGGGCGCACTCTAAAGCCGACACCGGAGGTTGGCAGAAGCACCAGAACGAGGGCCTCGGCATGGCGGCGG
CTCCTCACGATCTCAAACAAACCACTCATTTTGAGAAACCGTAA
Protein sequenceShow/hide protein sequence
MAVSAIVLATITSLHLMAFVLAVGAERRRSTANVVPDEYDEQTYCVYGTDASTVYGLSAFGLLLVSQTVVNGVTRCFCCGKGLVSGKTTTVAIFFFVFSWISFVGAEVCL
LAGSARNAYHTKYRAAFGVKDLSCATLRKGVFAGGAAMTLLSLVGSILFYWAHSKADTGGWQKHQNEGLGMAAAPHDLKQTTHFEKP