CuGenDBv2

Sgr006124 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr006124
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: late embryogenesis abundant protein At1g64065
Genome location: tig00004396:54520..55113
RNA-Seq Expression: Sgr006124
Synteny: Sgr006124
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
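The genome location field packs a contig name and a coordinate range into one string. A minimal sketch of how it could be unpacked follows; the function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00004396:54520..55113")
print(contig, span_length(start, end))  # tig00004396 594
```

Under that assumption the span is 594 bp, which equals the length of the CDS listed under Sequences.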


Homology
GenBank top hits (e value, % identity, alignment)
XP_022145906.1 late embryogenesis abundant protein At1g64065 [Momordica charantia]; e value: 4.7e-68; % identity: 72.49
Query:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI
        SRRRASE+CINACC+YLF  A   C+A+LILGL VVRVK PT KL+SVAV+NL+YGFSP+PF+ ATLIAE+T+EN NFG+FKYEE T+ + IYYGVA GI
Subjt:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI

Query:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
        G V+RVSVNAK  K+ +F+V+VK NAS+ DVDY S+DLA LK MNMSCIAEFEGR+RLLKLFKE KVS+LKCTMTLN +SH + NLACQ
Subjt:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ

XP_022929131.1 late embryogenesis abundant protein At1g64065 [Cucurbita moschata]; e value: 4.7e-76; % identity: 80.95
Query:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI
        SRR+ASE+CIN  C+YLF IA V CIA LILGLVVVRVK PT+KLTSV V+NL+YGFSP PFM ATLIAE+TMEN NFG+FKYEE TN TLIYYGVAVGI
Subjt:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI

Query:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
        G V+ VSVNAK  KK NF VKVKPN+S VDVDYFSHDLA LKTMNMSCIAEF+GR+RLLKLFKEKKVS+LKCTM+LNL SHGVQNLACQ
Subjt:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ

XP_022970100.1 late embryogenesis abundant protein At1g64065 [Cucurbita maxima]; e value: 2.7e-76; % identity: 80.95
Query:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI
        SRR+ASE+CIN  C+YLF IA V CIA LILGLVVVRVK PT+KLTSV V+NL+YGFSP PFM ATLIAE+TMEN NFG+FKYEE TN TLIYYGVAVGI
Subjt:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI

Query:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
        G V+ VSVNAK  K  NFTVKVKPN+S VDVDYFSHDLA LKTMNMSCIAEF+GR+RLLKLFKEKKVS+LKCTM+LNL+SHGVQNLACQ
Subjt:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ

XP_023550526.1 late embryogenesis abundant protein At1g64065 [Cucurbita pepo subsp. pepo]; e value: 7.2e-77; % identity: 81.48
Query:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI
        SRR+ASE+CIN  C+YLF IA V CIA LILGLVVVRVK PT+KLTSV V+NL+YGFSP PFM ATLIAE+TMEN NFG+FKYEE TN TLIYYGVAVGI
Subjt:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI

Query:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
        G V+ VSVNAK  KK NFTVKVKPN+S VDVDYFSHDLA LKTMNMSCIAEF+GR+RLLKLFKEKKVS+LKCTM+LNL+SHGVQNLACQ
Subjt:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ

XP_038874293.1 late embryogenesis abundant protein At1g64065-like [Benincasa hispida]; e value: 2.6e-63; % identity: 71.51
Query:  RRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGIGR
        R A  +C++  C++LF  A+  CI  L L LVV+RVK+PT+KLT VAV++L YGFSP PFM ATLI E+TMEN NFG+FKYEE  NVTLIY GV VGIG 
Subjt:  RRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGIGR

Query:  VRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLAC
        V+RVSVNAK  +K NFTVKV+PN+S VDVDYFS+DLARLKTMNMSCIA+FEGR  LLKLFK KK+SVLKC+M+LNLTSHGVQNLAC
Subjt:  VRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLAC

TrEMBL top hits (e value, % identity, alignment)
A0A1S3AUE7 late embryogenesis abundant protein At1g64065; e value: 4.1e-62; % identity: 68.75
Query:  SAGGSRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEET-NVTLIYYGV
        +A  SRR++S++C+NA C+ LF  A   CIA L  GLVV+RVK PT+KLTSVAV+NL+YGFSP PFM ATL  E+TMEN N+G F+YE   NVTLIYYGV
Subjt:  SAGGSRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEET-NVTLIYYGV

Query:  AVGIGRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLAC
         VGIG V+R+SVNAK  +K  F VKVKPN   V+VDYFS DLARLKTMNMS  AEFEG+I LLKLFKEKK+SV+KC+ +LNLTSHGVQNLAC
Subjt:  AVGIGRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLAC

A0A5A7TMT1 Late embryogenesis abundant protein; e value: 4.1e-62; % identity: 68.75
Query:  SAGGSRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEET-NVTLIYYGV
        +A  SRR++S++C+NA C+ LF  A   CIA L  GLVV+RVK PT+KLTSVAV+NL+YGFSP PFM ATL  E+TMEN N+G F+YE   NVTLIYYGV
Subjt:  SAGGSRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEET-NVTLIYYGV

Query:  AVGIGRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLAC
         VGIG V+R+SVNAK  +K  F VKVKPN   V+VDYFS DLARLKTMNMS  AEFEG+I LLKLFKEKK+SV+KC+ +LNLTSHGVQNLAC
Subjt:  AVGIGRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLAC

A0A6J1CXS7 late embryogenesis abundant protein At1g64065; e value: 2.3e-68; % identity: 72.49
Query:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI
        SRRRASE+CINACC+YLF  A   C+A+LILGL VVRVK PT KL+SVAV+NL+YGFSP+PF+ ATLIAE+T+EN NFG+FKYEE T+ + IYYGVA GI
Subjt:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI

Query:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
        G V+RVSVNAK  K+ +F+V+VK NAS+ DVDY S+DLA LK MNMSCIAEFEGR+RLLKLFKE KVS+LKCTMTLN +SH + NLACQ
Subjt:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ

A0A6J1EM85 late embryogenesis abundant protein At1g64065; e value: 2.3e-76; % identity: 80.95
Query:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI
        SRR+ASE+CIN  C+YLF IA V CIA LILGLVVVRVK PT+KLTSV V+NL+YGFSP PFM ATLIAE+TMEN NFG+FKYEE TN TLIYYGVAVGI
Subjt:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI

Query:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
        G V+ VSVNAK  KK NF VKVKPN+S VDVDYFSHDLA LKTMNMSCIAEF+GR+RLLKLFKEKKVS+LKCTM+LNL SHGVQNLACQ
Subjt:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ

A0A6J1I1W2 late embryogenesis abundant protein At1g64065; e value: 1.3e-76; % identity: 80.95
Query:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI
        SRR+ASE+CIN  C+YLF IA V CIA LILGLVVVRVK PT+KLTSV V+NL+YGFSP PFM ATLIAE+TMEN NFG+FKYEE TN TLIYYGVAVGI
Subjt:  SRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEE-TNVTLIYYGVAVGI

Query:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
        G V+ VSVNAK  K  NFTVKVKPN+S VDVDYFSHDLA LKTMNMSCIAEF+GR+RLLKLFKEKKVS+LKCTM+LNL+SHGVQNLACQ
Subjt:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ

SwissProt top hits (e value, % identity, alignment)
Q6DST1 Late embryogenesis abundant protein At1g64065; e value: 1.5e-16; % identity: 32.8
Query:  RRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENL-NYGFSPDPFMGATLIAEMTMENLNFGQFKYEETNVTLIY--YGVAVGI
        RR +E+    C VY   I  ++    LIL  + +R+  P ++  S++  +L + G S +P+  ATL++++++ N NFG F++E++ + ++Y  +GV VG 
Subjt:  RRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENL-NYGFSPDPFMGATLIAEMTMENLNFGQFKYEETNVTLIY--YGVAVGI

Query:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
         ++    V A    ++   V    +  L+D      DL RL  + +  +AE  GRI++L   K  KVSV+ CTM LNLT   +QNL C+
Subjt:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ

Arabidopsis top hits (e value, % identity, alignment)
AT1G64065.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family; e value: 1.1e-17; % identity: 32.8
Query:  RRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENL-NYGFSPDPFMGATLIAEMTMENLNFGQFKYEETNVTLIY--YGVAVGI
        RR +E+    C VY   I  ++    LIL  + +R+  P ++  S++  +L + G S +P+  ATL++++++ N NFG F++E++ + ++Y  +GV VG 
Subjt:  RRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENL-NYGFSPDPFMGATLIAEMTMENLNFGQFKYEETNVTLIY--YGVAVGI

Query:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
         ++    V A    ++   V    +  L+D      DL RL  + +  +AE  GRI++L   K  KVSV+ CTM LNLT   +QNL C+
Subjt:  GRVRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
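The % identity values above can in principle be recomputed from the alignment blocks. In BLAST-style pairwise output, the unlabeled middle row shows a letter where the two residues are identical, `+` where the substitution scores as conservative, and a space otherwise. The sketch below counts those symbols. The helper names are hypothetical, the rows are assumed to be passed in with the `Query:` prefix and leading padding already stripped, and the example uses a short made-up fragment, not a full alignment from this record.

```python
from typing import Iterable, Tuple


def midline_counts(midline: str) -> Tuple[int, int]:
    """Return (identities, positives) for one middle row of an alignment."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")  # positives include identities
    return identities, positives


def percent_identity(query_rows: Iterable[str], midline_rows: Iterable[str]) -> float:
    """Identities divided by alignment length (gap columns included), as a percentage."""
    length = sum(len(row) for row in query_rows)
    identities = sum(midline_counts(row)[0] for row in midline_rows)
    return 100.0 * identities / length


# Toy fragment for illustration only (8 aligned columns, 6 identical).
print(midline_counts("SRR+ASE+"))                   # (6, 8)
print(percent_identity(["SRRRASEQ"], ["SRR+ASE+"]))  # 75.0
```

Only the query row and the middle row are needed for this count, so the calculation does not depend on how the subject row is displayed.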


Sequences
CDS sequence
ATGGCGCCAGAAACTTCCGCCGGGGGGTCACGGCGCAGAGCATCAGAACAATGCATCAACGCCTGCTGCGTTTACCTTTTCGTCATCGCCACCGTCGTCTGCATCGCCCT
CCTGATTCTCGGCCTCGTCGTCGTTCGTGTCAAAATCCCCACCCTCAAACTGACTTCCGTCGCCGTCGAGAATCTGAACTACGGCTTCTCCCCGGACCCTTTCATGGGCG
CCACCTTGATCGCCGAGATGACGATGGAGAATCTGAATTTCGGGCAGTTCAAGTACGAGGAAACCAACGTCACTCTGATTTACTACGGCGTGGCCGTCGGGATCGGCCGA
GTGAGAAGGGTCTCTGTAAATGCAAAGGACAGAAAAAAGATGAACTTTACTGTGAAAGTGAAGCCGAACGCGAGCCTGGTTGATGTCGATTACTTCAGCCATGATCTGGC
GAGGTTGAAGACGATGAATATGAGCTGCATCGCCGAGTTTGAGGGTCGGATTCGTCTGTTGAAGTTGTTCAAAGAGAAGAAAGTTTCAGTGTTGAAATGCACCATGACTT
TGAACTTGACCTCCCATGGCGTCCAGAATCTTGCTTGCCAATAG
mRNA sequence
ATGGCGCCAGAAACTTCCGCCGGGGGGTCACGGCGCAGAGCATCAGAACAATGCATCAACGCCTGCTGCGTTTACCTTTTCGTCATCGCCACCGTCGTCTGCATCGCCCT
CCTGATTCTCGGCCTCGTCGTCGTTCGTGTCAAAATCCCCACCCTCAAACTGACTTCCGTCGCCGTCGAGAATCTGAACTACGGCTTCTCCCCGGACCCTTTCATGGGCG
CCACCTTGATCGCCGAGATGACGATGGAGAATCTGAATTTCGGGCAGTTCAAGTACGAGGAAACCAACGTCACTCTGATTTACTACGGCGTGGCCGTCGGGATCGGCCGA
GTGAGAAGGGTCTCTGTAAATGCAAAGGACAGAAAAAAGATGAACTTTACTGTGAAAGTGAAGCCGAACGCGAGCCTGGTTGATGTCGATTACTTCAGCCATGATCTGGC
GAGGTTGAAGACGATGAATATGAGCTGCATCGCCGAGTTTGAGGGTCGGATTCGTCTGTTGAAGTTGTTCAAAGAGAAGAAAGTTTCAGTGTTGAAATGCACCATGACTT
TGAACTTGACCTCCCATGGCGTCCAGAATCTTGCTTGCCAATAG
Protein sequence
MAPETSAGGSRRRASEQCINACCVYLFVIATVVCIALLILGLVVVRVKIPTLKLTSVAVENLNYGFSPDPFMGATLIAEMTMENLNFGQFKYEETNVTLIYYGVAVGIGR
VRRVSVNAKDRKKMNFTVKVKPNASLVDVDYFSHDLARLKTMNMSCIAEFEGRIRLLKLFKEKKVSVLKCTMTLNLTSHGVQNLACQ
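The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below is one way to do that, assuming the standard nuclear genetic code applies (the record does not say which table was used); `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). ASSUMPTION: the standard table applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Raises KeyError on ambiguous bases such as N.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 21 and last 9 nucleotides of the CDS listed above.
print(translate("ATGGCGCCAGAAACTTCCGCC"))  # MAPETSA
print(translate("TGCCAATAG"))              # CQ*
```

The two fragments reproduce the start (MAPETSA) and end (CQ, followed by a stop codon) of the protein sequence. The full 594-nt CDS would give 197 residues plus the stop, matching the listed protein length.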