CuGenDBv2

Sgr011868 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr011868
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111488680
Genome location: tig00153107:104537..105484
RNA-Seq Expression: Sgr011868
Synteny: Sgr011868
Gene Ontology terms:
    GO:0071669 - plant-type cell wall organization or biogenesis (biological process)
    GO:0005783 - endoplasmic reticulum (cellular component)
    GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
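
The genome location in the record above follows a `contig:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this notation, but not stated on this page), the span can be parsed and its length computed as follows. The names `Location` and `parse_location` are illustrative and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A contig name plus a start/end pair (assumed 1-based, inclusive)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("tig00153107:104537..105484")
print(loc.contig, loc.length)  # tig00153107 948
```

Under the inclusive-coordinate assumption, the gene spans 948 bp on contig tig00153107.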


Homology
GenBank top hits (e value, % identity, alignment)
KAG6575310.1 hypothetical protein SDJN03_25949, partial [Cucurbita argyrosperma subsp. sororia]; e value: 2.9e-128; % identity: 75.42
Query:  PLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADV-LSRGQICLDQSKRFLLPSGPVSLPVFLFVLAK
        PL +  LTFFKC RWQLEET+DK +CPYHYYCD+IYPGDYPP VDLLVL FT ATY++TLL M+ADV  SRG+ C DQ KRFLLPSGPVSLP+FLFVL K
Subjt:  PLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADV-LSRGQICLDQSKRFLLPSGPVSLPVFLFVLAK

Query:  GHRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYR
        GHRINT+FPLFL+GPAILHLVYISALTFD NG DKDIKYVFFEASTMSGILHASLNLD++ILPYYTGLDAL+ S FSGECPSCVCR +PLVVGGR +SYR
Subjt:  GHRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYR

Query:  GWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE
        GWS TT VVVCALCTRI+CR+SGEK  R+A+V R +LEGL W+ IT DCVYLSRN   ER  +Q  A+G VF LVF+HV+K+ RRWQLM   G + E
Subjt:  GWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE

KAG7013842.1 hypothetical protein SDJN02_24011, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 6.4e-128; % identity: 75.42
Query:  PLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADV-LSRGQICLDQSKRFLLPSGPVSLPVFLFVLAK
        PL +  LTFFKC RWQLEET+DK +CPYHYYCD+IYPGDYPP VDLLVL FT ATY++TLL M+ADV  SRG+ C DQ KRFLLPSGPVSLP+FLFVL K
Subjt:  PLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADV-LSRGQICLDQSKRFLLPSGPVSLPVFLFVLAK

Query:  GHRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYR
        GHRINT+FPLFL+GPAILHLVYISALTFD NG DKDIKYVFFEASTMSGILHASLNLD++ILPYYTGLDAL+ S FSGECPSCVCR  PLVVGGR +SYR
Subjt:  GHRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYR

Query:  GWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE
        GWS TT VVVCALCTRI+CR+SGEK  R+A+V R +LEGL W+ IT DCVYLSRN   ER  +Q  A+G VF LVF+HV+K+ RRWQLM   G + E
Subjt:  GWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE

XP_022929796.1 uncharacterized protein LOC111436298 [Cucurbita moschata]; e value: 1.9e-127; % identity: 74.75
Query:  PLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADV-LSRGQICLDQSKRFLLPSGPVSLPVFLFVLAK
        PL + HLTFFKC RWQLEET+DK +CPYHYYCD++YPG+YPP VDLLVL FT ATY++TLL M+ADV  SRG+ C DQ KRFLLPSGPVSLP+FLFVL K
Subjt:  PLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADV-LSRGQICLDQSKRFLLPSGPVSLPVFLFVLAK

Query:  GHRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYR
        GHRINT+FPLFL+GPAILHLVYISALTFD NG DKDIKYVFFEASTMSGILHASLNLD++ILPYYTGLDAL+ S FSGECPSCVCR +PLVVGGR +SYR
Subjt:  GHRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYR

Query:  GWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE
        GWS TT VVVCALCTRI+CR+SGEK  R+A V R +LEGL W+ IT DCVYLS N   ER  +Q  A+G VF LVF+HV+K+ RRWQLM   G + E
Subjt:  GWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE

XP_023549027.1 uncharacterized protein LOC111807512 [Cucurbita pepo subsp. pepo]; e value: 6.4e-128; % identity: 75.68
Query:  LQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADV-LSRGQICLDQSKRFLLPSGPVSLPVFLFVLAKG
        L + HLTFFKC RWQLEET+DK +CPYHYYCD++YPGDYPP VDLLVL FT ATY++TLL M+ADV  SRG+ C DQ KRFLLPSGPVSLP+FLFVL KG
Subjt:  LQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADV-LSRGQICLDQSKRFLLPSGPVSLPVFLFVLAKG

Query:  HRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRG
        HRINT+FPLFL+GPAILHLVYISALTFD NG DKDIKYVFFEASTMSGILHASLNLD++ILPYYTGLDAL+ S FSGECPSCVCRN+PLVVGGRF+SYRG
Subjt:  HRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRG

Query:  WSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE
        WS TT VVVCALCTRI+CR+SGEK  R+A+V R +LEGL W+ ITLDCVYLS N   ER  +Q  A+G VF LVF+HV+K+ RR QLM   G + E
Subjt:  WSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE

XP_038875593.1 uncharacterized protein LOC120068005 [Benincasa hispida]; e value: 9.3e-127; % identity: 74.32
Query:  LQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLSRGQICLDQSKRFLLPSGPVSLPVFLFVLAKGH
        L + H TFFKC RWQLEET+DK SCP+HYYCDSIYPGDYP  +DLLVL FTAATY++TLL M+AD+  RG+ CLDQ K+FLLPSGP SLP+FLFVLAKG+
Subjt:  LQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLSRGQICLDQSKRFLLPSGPVSLPVFLFVLAKGH

Query:  RINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRGWS
        RINT+FPLFL+GP ILH+VYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALV S FSGECPSCVCRN PL VGGRF+SYRGWS
Subjt:  RINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRGWS

Query:  ATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEE
         TT VVVC LCTRI+CR++G+K  RK +V + LLEGL W+LIT DCVYLS N   ER  LQ   +G VF LVF+H++K+ RRW LM   G +
Subjt:  ATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K9S3 Uncharacterized protein; e value: 8.7e-123; % identity: 72.24
Query:  LQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLS-RGQICLDQSKRFLLPSGPVSLPVFLFVLAKG
        L +  L FFKC RWQLEET+DK SCP+HYYCD+IYPGDYPP +DLLVL FTA TYL+TLL M+ D+ S RG+ C DQ K+FLLPSGP SLPVFLFVLAKG
Subjt:  LQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLS-RGQICLDQSKRFLLPSGPVSLPVFLFVLAKG

Query:  HRINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRGW
        HRINT+FPLFL+GP IL L+YISALTFDNGADKDIKYVFFEASTMSGILHASLNLD VILPYYTGLDAL+ S FSGEC SCVCRN PLVVGGRF+SYRGW
Subjt:  HRINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRGW

Query:  SATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAG--EELEKV
        S+TT V+VC LC RI+ R++G +  RK +  + LLEGL W+LIT DCVYLS N  AER  LQ   +G VF LVFIHV+K+ RRWQLM C    ++L+KV
Subjt:  SATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAG--EELEKV

A0A1S3CC89 uncharacterized protein LOC103498828; e value: 8.7e-123; % identity: 71.57
Query:  LQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLS-RGQICLDQSKRFLLPSGPVSLPVFLFVLAKG
        L +  L FFKC RWQLEET+DK SCP+HYYCD+IYPGDYP  +DLLVL FTA TY++TLL M+ D+ S RG+ C DQ K+FLLPSGP SLPVFLFVLAKG
Subjt:  LQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLS-RGQICLDQSKRFLLPSGPVSLPVFLFVLAKG

Query:  HRINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRGW
        HRINT+FPLFL+GP IL L+YISALTFDNGADKDIKYVFFEASTMSGILHASLNLD VILPYYTGLDAL+ S FSGEC SCVCRN PLVVGGRF++YRGW
Subjt:  HRINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRGW

Query:  SATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAG--EELEKV
        S+TT V+VC LCTRI+CR++G +  RK +  + LLEGL W+LIT DCVYLS N  AER  LQ   +G VF LVF+HV+K+ RRWQLM C     +L+KV
Subjt:  SATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAG--EELEKV

A0A2I4FR44 uncharacterized protein LOC109001329; e value: 6.1e-108; % identity: 67.47
Query:  AAFRPAWPL-QDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLSRGQICLDQSKRFLLPSGPVSLPV
        +AF+P W + +D+ + FFKC+RWQLEETMD I CPYHY+CDS YPG+YPP VD+LV  FT A+YL TL+IMV D+  RGQ  L QSKR+LLPSGPVSLPV
Subjt:  AAFRPAWPL-QDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLSRGQICLDQSKRFLLPSGPVSLPV

Query:  FLFVLAKGHRINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGG
         L  LAKGHRI+ VFPL  VGPAIL LV+ISALTFD+GA+KD+KY FFEAST+SGILHASL LDS++LPYYTG DALV STFSGEC SCVCRND L+VGG
Subjt:  FLFVLAKGHRINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGG

Query:  RFMSYRGWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMA
          +SYRGWS TT  VV  LC R++CRL GEKT    ++ R  LE +AWILI +D V+L  NSP ER TL+ AAFGG+F L+ +HVLK A
Subjt:  RFMSYRGWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMA

A0A6J1EP56 uncharacterized protein LOC111436298; e value: 9.0e-128; % identity: 74.75
Query:  PLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADV-LSRGQICLDQSKRFLLPSGPVSLPVFLFVLAK
        PL + HLTFFKC RWQLEET+DK +CPYHYYCD++YPG+YPP VDLLVL FT ATY++TLL M+ADV  SRG+ C DQ KRFLLPSGPVSLP+FLFVL K
Subjt:  PLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADV-LSRGQICLDQSKRFLLPSGPVSLPVFLFVLAK

Query:  GHRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYR
        GHRINT+FPLFL+GPAILHLVYISALTFD NG DKDIKYVFFEASTMSGILHASLNLD++ILPYYTGLDAL+ S FSGECPSCVCR +PLVVGGR +SYR
Subjt:  GHRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYR

Query:  GWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE
        GWS TT VVVCALCTRI+CR+SGEK  R+A V R +LEGL W+ IT DCVYLS N   ER  +Q  A+G VF LVF+HV+K+ RRWQLM   G + E
Subjt:  GWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE

A0A6J1JX99 LOW QUALITY PROTEIN: uncharacterized protein LOC111488680; e value: 1.3e-126; % identity: 73.74
Query:  PLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMV-ADVLSRGQICLDQSKRFLLPSGPVSLPVFLFVLAK
        PL +  LTFFKC RWQLEET+DK +CPYHYYCD++YPGDYPP +DLLVL FT ATY++TLL M+   + SRG+ C DQ KRFLLPSGPVSLP+FLFVL K
Subjt:  PLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMV-ADVLSRGQICLDQSKRFLLPSGPVSLPVFLFVLAK

Query:  GHRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYR
        GHRINT+FPLFL+GPAIL LVYISALTFD NG+DKDIKYVFFEASTMSGILHASLNLD++I+PYYTGLDAL+ S FSGECPSCVCRN+PLVVGG+F+SYR
Subjt:  GHRINTVFPLFLVGPAILHLVYISALTFD-NGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYR

Query:  GWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE
        GWS TT VVVCALCTRI+CR+SGEK  R+A+V R +LEGL W  ITLDCVYLS N   ER  +Q  A+G VF LVF+HV+K+ RRWQLMS  G + E
Subjt:  GWSATTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G41610.1 unknown protein; e value: 7.9e-84; % identity: 52.69
Query:  TFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLSRGQI------CLDQSKRFLLPSGPVSLPVFLFVLAKGH
        TFFKC +WQ E+T+D I+CP+HY+CDSIY GDYP + D+LV  F   TYL TL+++V  V+SR +         D+++R+LLPSGP+SLP+ + +LAKG 
Subjt:  TFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLSRGQI------CLDQSKRFLLPSGPVSLPVFLFVLAKGH

Query:  RINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRGWS
        RINT+FP+ + GPAIL LV +S L F+N  +K+  +VFFEAST+SGILHASL LD+VILPYYTG DALV STFSG C SC+CR +PL+VGG+ +SYRGWS
Subjt:  RINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRGWS

Query:  ATTLVVVCALCTRIICRLSGEK-TTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTL-QAAAFGGVFALVFIHVL
        +TT +VV  L  RIIC+L  E+   ++ +V +++++GL  +++  DCVYL+  SP E   L +   FG +  L+ ++V+
Subjt:  ATTLVVVCALCTRIICRLSGEK-TTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTL-QAAAFGGVFALVFIHVL
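
The % identity values reported for the hits above can be related to the alignment blocks themselves. The sketch below assumes the conventional BLAST pairwise layout, in which the row between Query and Subjt repeats the letter for an identical residue, shows `+` for a conservative substitution, and is blank otherwise. It counts letters in the middle rows and divides by the aligned length taken from the query rows, because trailing blanks in the middle rows may have been lost. The function name `percent_identity` is illustrative, and the result for a full hit may differ from the reported value if the page shows only part of an alignment.

```python
def percent_identity(query_rows, midline_rows):
    """Estimate % identity from BLAST-style query rows and midline rows.

    query_rows   -- aligned query strings (gaps included), one per block
    midline_rows -- the matching middle rows; letters mark identities
    """
    aligned_length = sum(len(row) for row in query_rows)
    if aligned_length == 0:
        raise ValueError("no aligned positions supplied")
    # Only letters count as identities; '+' and blanks do not.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / aligned_length


# Toy example using the first nine columns of the first GenBank alignment.
print(round(percent_identity(["PLQDIHLTF"], ["PL +  LTF"]), 2))  # 55.56
```

Taking the denominator from the query rows keeps gap columns in the aligned length, which is how BLAST-style identity percentages are usually defined.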


Sequences
CDS sequence
ATGAACTGGCCACTCCTGCACTCCGATCAATCGGCAGCCTTCCGCCCTGCATGGCCTCTTCAAGATATCCACTTGACCTTCTTCAAATGCATCCGATGGCAACTGGAAGA
GACGATGGACAAAATCAGCTGCCCTTATCACTACTACTGCGACAGCATTTACCCCGGCGACTATCCTCCGGTAGTTGATCTTCTGGTTCTCGCCTTCACGGCGGCGACGT
ACTTGGCGACCCTTTTGATTATGGTGGCGGATGTGTTGTCTCGTGGGCAGATTTGTTTAGATCAGTCCAAGAGATTTCTGTTACCATCTGGGCCAGTTTCTCTCCCGGTG
TTCCTCTTTGTCTTGGCCAAGGGCCACCGTATCAACACCGTCTTCCCTCTCTTCCTCGTCGGTCCGGCGATTCTCCACCTGGTTTACATCTCCGCGCTCACGTTCGACAA
TGGAGCCGACAAGGACATCAAGTACGTCTTCTTCGAAGCCTCGACAATGTCTGGGATTCTTCACGCTAGCTTAAACTTGGACTCTGTTATCCTCCCTTATTACACGGGGT
TGGATGCTTTGGTTGCGTCGACGTTTTCCGGCGAATGCCCATCATGTGTTTGCAGAAATGACCCATTAGTGGTGGGAGGAAGATTCATGTCTTACAGGGGCTGGTCGGCT
ACAACGTTAGTTGTCGTGTGTGCTCTGTGTACGAGAATCATCTGTCGGCTGTCTGGGGAGAAGACGACAAGAAAAGCCATGGTTTTCAGGTCGTTGCTGGAAGGGTTGGC
TTGGATCTTGATAACGTTGGACTGTGTTTATCTGAGCAGAAACTCACCGGCGGAGAGGACGACGTTGCAGGCTGCTGCGTTTGGCGGCGTATTTGCTTTGGTGTTCATTC
ATGTGCTCAAAATGGCGAGACGATGGCAGTTGATGTCTTGCGCCGGCGAGGAGTTGGAGAAAGTGTAA
mRNA sequence
ATGAACTGGCCACTCCTGCACTCCGATCAATCGGCAGCCTTCCGCCCTGCATGGCCTCTTCAAGATATCCACTTGACCTTCTTCAAATGCATCCGATGGCAACTGGAAGA
GACGATGGACAAAATCAGCTGCCCTTATCACTACTACTGCGACAGCATTTACCCCGGCGACTATCCTCCGGTAGTTGATCTTCTGGTTCTCGCCTTCACGGCGGCGACGT
ACTTGGCGACCCTTTTGATTATGGTGGCGGATGTGTTGTCTCGTGGGCAGATTTGTTTAGATCAGTCCAAGAGATTTCTGTTACCATCTGGGCCAGTTTCTCTCCCGGTG
TTCCTCTTTGTCTTGGCCAAGGGCCACCGTATCAACACCGTCTTCCCTCTCTTCCTCGTCGGTCCGGCGATTCTCCACCTGGTTTACATCTCCGCGCTCACGTTCGACAA
TGGAGCCGACAAGGACATCAAGTACGTCTTCTTCGAAGCCTCGACAATGTCTGGGATTCTTCACGCTAGCTTAAACTTGGACTCTGTTATCCTCCCTTATTACACGGGGT
TGGATGCTTTGGTTGCGTCGACGTTTTCCGGCGAATGCCCATCATGTGTTTGCAGAAATGACCCATTAGTGGTGGGAGGAAGATTCATGTCTTACAGGGGCTGGTCGGCT
ACAACGTTAGTTGTCGTGTGTGCTCTGTGTACGAGAATCATCTGTCGGCTGTCTGGGGAGAAGACGACAAGAAAAGCCATGGTTTTCAGGTCGTTGCTGGAAGGGTTGGC
TTGGATCTTGATAACGTTGGACTGTGTTTATCTGAGCAGAAACTCACCGGCGGAGAGGACGACGTTGCAGGCTGCTGCGTTTGGCGGCGTATTTGCTTTGGTGTTCATTC
ATGTGCTCAAAATGGCGAGACGATGGCAGTTGATGTCTTGCGCCGGCGAGGAGTTGGAGAAAGTGTAA
Protein sequence
MNWPLLHSDQSAAFRPAWPLQDIHLTFFKCIRWQLEETMDKISCPYHYYCDSIYPGDYPPVVDLLVLAFTAATYLATLLIMVADVLSRGQICLDQSKRFLLPSGPVSLPV
FLFVLAKGHRINTVFPLFLVGPAILHLVYISALTFDNGADKDIKYVFFEASTMSGILHASLNLDSVILPYYTGLDALVASTFSGECPSCVCRNDPLVVGGRFMSYRGWSA
TTLVVVCALCTRIICRLSGEKTTRKAMVFRSLLEGLAWILITLDCVYLSRNSPAERTTLQAAAFGGVFALVFIHVLKMARRWQLMSCAGEELEKV
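
The protein sequence above is the conceptual translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code applies to this gene, the relationship can be checked with a small translator. The names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGAACTGGCCACTCCTGCAC"))  # MNWPLLH
```

Passing the full CDS, with its line breaks, to `translate` should yield a string that can be compared directly with the protein sequence once the protein's own line breaks are removed.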