; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027178 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027178
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLEA_2 domain-containing protein
Genome locationtig00153048:1804521..1810926
RNA-Seq ExpressionSgr027178
SyntenySgr027178
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148717.1 uncharacterized protein LOC101219269 [Cucumis sativus]2.6e-2746.7Show/hide
Query:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLR-
        FP+DPSLQLVRLKLNRVKV             V   + +  + SL        +G  G+ LG+VSSEGGRVSARGSSYVNATLDLNG EV+H+    L  
Subjt:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLR-

Query:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE
        +   +         EG MGLFF KIPIK                               +VSCEV VNTN+QTIEHQDCYPE
Subjt:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE

XP_008463384.1 PREDICTED: uncharacterized protein LOC103501551 [Cucumis melo]1.0e-2646.15Show/hide
Query:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLR-
        FP+DPSLQLVRLKLNRVKV             V   + +  + SL        +G  G+ LG+VSS GGRVSARGSSYVNATLDLNG EV+H+    L  
Subjt:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLR-

Query:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE
        +   +       + EG MGLFF KIPIK                               +VSCEV VNTN+QTIEHQDCYPE
Subjt:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE

XP_022144909.1 uncharacterized protein LOC111014473 [Momordica charantia]5.9e-2746.15Show/hide
Query:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHE-FFTCLR
        FPADPSLQLVRLKLNR+KV                 + ++ + SL        +G  G+ LGFVSSEGGRVSARG SYVNATLDLNG EVIH+  +    
Subjt:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHE-FFTCLR

Query:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE
        +   +       + EGYMGLFF K PIK                               +VSCEV VNTND+TIEHQDCYPE
Subjt:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE

XP_023530779.1 uncharacterized protein LOC111793228 isoform X1 [Cucurbita pepo subsp. pepo]1.9e-2543.55Show/hide
Query:  SLSTFPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFT
        S   FP+DPSLQLVRLKLN  KV                 + +  + SL        +G  GK LGFVSS+GGRVSARGSSYVNAT+DLNG EVIH+ F 
Subjt:  SLSTFPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFT

Query:  CLR-IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE
         L+ +   +     + + EG+MG FF K PIK                               +VSC+V VNT  QTIEHQDCYPE
Subjt:  CLR-IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE

XP_038878687.1 uncharacterized protein LOC120070868 [Benincasa hispida]1.0e-2646.99Show/hide
Query:  FPADPSLQLVRLKLNRVKVVCCLLSSSTYLSLF--------------------PLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLR
        FPADPS QLVRLKLN VKV       S  LS F                     +G  GK LGFVSSEGGRVSARGSSYVNATLDLNG EV+H+    L 
Subjt:  FPADPSLQLVRLKLNRVKVVCCLLSSSTYLSLF--------------------PLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLR

Query:  -IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE
         +   +       + EG MGLFF K PIK                               KVSCEV VN N+QTIEHQDCYPE
Subjt:  -IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE

TrEMBL top hitse value%identityAlignment
A0A0A0LTV4 LEA_2 domain-containing protein1.3e-2746.7Show/hide
Query:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLR-
        FP+DPSLQLVRLKLNRVKV             V   + +  + SL        +G  G+ LG+VSSEGGRVSARGSSYVNATLDLNG EV+H+    L  
Subjt:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLR-

Query:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE
        +   +         EG MGLFF KIPIK                               +VSCEV VNTN+QTIEHQDCYPE
Subjt:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE

A0A1S3CJK6 uncharacterized protein LOC1035015514.9e-2746.15Show/hide
Query:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLR-
        FP+DPSLQLVRLKLNRVKV             V   + +  + SL        +G  G+ LG+VSS GGRVSARGSSYVNATLDLNG EV+H+    L  
Subjt:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLR-

Query:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE
        +   +       + EG MGLFF KIPIK                               +VSCEV VNTN+QTIEHQDCYPE
Subjt:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE

A0A6J1CTN0 uncharacterized protein LOC1110144732.9e-2746.15Show/hide
Query:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHE-FFTCLR
        FPADPSLQLVRLKLNR+KV                 + ++ + SL        +G  G+ LGFVSSEGGRVSARG SYVNATLDLNG EVIH+  +    
Subjt:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHE-FFTCLR

Query:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE
        +   +       + EGYMGLFF K PIK                               +VSCEV VNTND+TIEHQDCYPE
Subjt:  IWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE

A0A6J1HAC8 uncharacterized protein LOC1114615744.6e-2543.96Show/hide
Query:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLRI
        FP+DPSLQLVRLKLN V V                 + ++ + SL        +G  G+ LGFVSS+GGRVSARGSSYVNATLDLNG ++IH+ F  L  
Subjt:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLRI

Query:  WHR-VSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE
          + +       + EG MGLFF K PIK +                              VSCEV V+TN QTIEHQDCYPE
Subjt:  WHR-VSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE

A0A6J1JI07 uncharacterized protein LOC1114852801.6e-2544.51Show/hide
Query:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLRI
        FP+DPSLQLVRLKLN VKV                 + +  + SL        +G  G+ LGFVSS+GGRVSARGSSYVNATLDLNG ++IH+ F  L  
Subjt:  FPADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLRI

Query:  WHR-VSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE
          + +       + EG MGLFF K PIK +                              VSCEV V+TN QTIEHQDCYPE
Subjt:  WHR-VSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.4e-1027.49Show/hide
Query:  HFSPPLRRS--LSTF-------------PADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSA
        H SPP RR   +S F             P+DP ++++R+K++ V V             V   +S++   S         +   GKTLG VSS+GG V+A
Subjt:  HFSPPLRRS--LSTF-------------PADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSA

Query:  RGSSYVNATLDLNGCEVIHEFFTCLRIWHRVSSHSIR----RQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTN
         GSSY++A  +L+G  V   F   + + H ++  S+      ++ G +G+ FF+ P+K                               KV+C + V+T 
Subjt:  RGSSYVNATLDLNGCEVIHEFFTCLRIWHRVSSHSIR----RQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTN

Query:  DQTIEHQDCYP
        +QTI  Q C P
Subjt:  DQTIEHQDCYP

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.7e-0829.75Show/hide
Query:  HFSPPLRRS--LSTF-------------PADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSA
        H SPP RR   +S F             P+DP ++++R+K++ V V             V   +S++   S         +   GKTLG VSS+GG V+A
Subjt:  HFSPPLRRS--LSTF-------------PADPSLQLVRLKLNRVKV-------------VCCLLSSSTYLSL------FPLGVPGKTLGFVSSEGGRVSA

Query:  RGSSYVNATLDLNGCEVIHEFFTCLRIWHRVSSHSIR----RQSEGYMGLFFFKIPIK
         GSSY++A  +L+G  V   F   + + H ++  S+      ++ G +G+ FF+ P+K
Subjt:  RGSSYVNATLDLNGCEVIHEFFTCLRIWHRVSSHSIR----RQSEGYMGLFFFKIPIK

AT2G25735.1 unknown protein4.4e-0464.71Show/hide
Query:  YDPCSYSQNFDQG---NAADELDNLSRSFSARFA
        Y+PC YS NFDQG   +  DE +NLSRSFS RFA
Subjt:  YDPCSYSQNFDQG---NAADELDNLSRSFSARFA

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.8e-1629.63Show/hide
Query:  LYPLRRHFSPPLRRSLS------------------TFPADPSLQLVRLKLNRVKVV-------------CCLLSSSTYLSL------FPLGVPGKTLGFV
        L P RRH  P L R+L                    +P+DP + + R+ LN + VV                + +  + SL        +G  G+ LG V
Subjt:  LYPLRRHFSPPLRRSLS------------------TFPADPSLQLVRLKLNRVKVV-------------CCLLSSSTYLSL------FPLGVPGKTLGFV

Query:  SSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCL-RIWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCE
         S+GG + AR SSY++ATL+L+G EV+H+    +  +   V       Q +G +G+  F IP                              I+GKVSCE
Subjt:  SSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCL-RIWHRVSSHSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCE

Query:  VSVNTNDQTIEHQDCY
        V VN N+Q I HQDC+
Subjt:  VSVNTNDQTIEHQDCY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTGCGCCGCTGTACCCTCTACGCCGCCACTTTTCTCCTCCTCTCCGCCGTAGTCTGTCTACTTTTCCCGCTGATCCGTCGCTCCAACTCGTCCGATTGAAACT
CAATCGCGTCAAAGTCGTTTGTTGCCTGTTGTCGTCCTCGACCTATCTTTCTCTGTTTCCGTTAGGGGTACCGGGGAAGACATTAGGATTTGTGAGCTCGGAGGGCGGCC
GAGTGTCTGCTCGTGGGTCATCTTACGTAAATGCCACTCTCGATTTGAATGGGTGCGAGGTCATTCACGAGTTTTTTACTTGCTTGAGGATTTGGCATCGGGTATCATCC
CATTCGATACGGAGACAAAGTGAAGGATACATGGGGCTTTTCTTTTTCAAAATCCCGATTAAGAAATCATTTCTTGGTCGTGCGAGTTGCTTTGCTTGCGGCACTCTTAA
ATCATCGCCAAATGCTCAATTTTCTACTATGCCGTTGATTGAGGGCAAGGTGTCGTGTGAGGTATCTGTGAATACAAATGACCAAACGATTGAACATCAAGATTGCTACC
CTGAGTTTTTCCTTGCCGATCCATTCTTTACCATCCGGCCCACCCAAACGTACTCGCTCTGGTGTAGTCTGCAACGAAACGACCCAGGGTGCCCCGTCGGCGAGCACAAG
CACCGCCCTTTGGCCGACCCACCGCCACTCACAGCCGCCGCTTCATGGATATTTGCTGCTGCCGCAGCCGCCCCCTGCAAGAGGAACCAATTGCATTGTGAGGCCAACTC
CCTCCGGCTTACCAGAGGCAGCCCAAATAGAAAAACGGACTTGCTCCGTCGTTTCTCCGCCGCTCAGTCATTATCTCCTTCTTCAACTCCGCCTGCTCCTCTGGAAGTTG
GGGCAGCAAAACGATCAGACTGGGGCAATGCCCATGCGAAGGGATGGAGTACTGCGGAGATCGACGTTGAAGAGTGGCGGAAACAAGCGGAGCCTGTGGAGGGCTTCTTC
CTCAGGAAGTTTCCTTACGATCCTTGCTCTTATTCTCAGAATTTCGATCAAGGCAACGCCGCCGACGAACTCGATAACCTCTCCCGCTCCTTCTCCGCCCGCTTCGCCCT
CGCCGATGCTGCTTCCAGGATCTTCCTCGACAAAACAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTGCGCCGCTGTACCCTCTACGCCGCCACTTTTCTCCTCCTCTCCGCCGTAGTCTGTCTACTTTTCCCGCTGATCCGTCGCTCCAACTCGTCCGATTGAAACT
CAATCGCGTCAAAGTCGTTTGTTGCCTGTTGTCGTCCTCGACCTATCTTTCTCTGTTTCCGTTAGGGGTACCGGGGAAGACATTAGGATTTGTGAGCTCGGAGGGCGGCC
GAGTGTCTGCTCGTGGGTCATCTTACGTAAATGCCACTCTCGATTTGAATGGGTGCGAGGTCATTCACGAGTTTTTTACTTGCTTGAGGATTTGGCATCGGGTATCATCC
CATTCGATACGGAGACAAAGTGAAGGATACATGGGGCTTTTCTTTTTCAAAATCCCGATTAAGAAATCATTTCTTGGTCGTGCGAGTTGCTTTGCTTGCGGCACTCTTAA
ATCATCGCCAAATGCTCAATTTTCTACTATGCCGTTGATTGAGGGCAAGGTGTCGTGTGAGGTATCTGTGAATACAAATGACCAAACGATTGAACATCAAGATTGCTACC
CTGAGTTTTTCCTTGCCGATCCATTCTTTACCATCCGGCCCACCCAAACGTACTCGCTCTGGTGTAGTCTGCAACGAAACGACCCAGGGTGCCCCGTCGGCGAGCACAAG
CACCGCCCTTTGGCCGACCCACCGCCACTCACAGCCGCCGCTTCATGGATATTTGCTGCTGCCGCAGCCGCCCCCTGCAAGAGGAACCAATTGCATTGTGAGGCCAACTC
CCTCCGGCTTACCAGAGGCAGCCCAAATAGAAAAACGGACTTGCTCCGTCGTTTCTCCGCCGCTCAGTCATTATCTCCTTCTTCAACTCCGCCTGCTCCTCTGGAAGTTG
GGGCAGCAAAACGATCAGACTGGGGCAATGCCCATGCGAAGGGATGGAGTACTGCGGAGATCGACGTTGAAGAGTGGCGGAAACAAGCGGAGCCTGTGGAGGGCTTCTTC
CTCAGGAAGTTTCCTTACGATCCTTGCTCTTATTCTCAGAATTTCGATCAAGGCAACGCCGCCGACGAACTCGATAACCTCTCCCGCTCCTTCTCCGCCCGCTTCGCCCT
CGCCGATGCTGCTTCCAGGATCTTCCTCGACAAAACAACTTGA
Protein sequenceShow/hide protein sequence
MASAPLYPLRRHFSPPLRRSLSTFPADPSLQLVRLKLNRVKVVCCLLSSSTYLSLFPLGVPGKTLGFVSSEGGRVSARGSSYVNATLDLNGCEVIHEFFTCLRIWHRVSS
HSIRRQSEGYMGLFFFKIPIKKSFLGRASCFACGTLKSSPNAQFSTMPLIEGKVSCEVSVNTNDQTIEHQDCYPEFFLADPFFTIRPTQTYSLWCSLQRNDPGCPVGEHK
HRPLADPPPLTAAASWIFAAAAAAPCKRNQLHCEANSLRLTRGSPNRKTDLLRRFSAAQSLSPSSTPPAPLEVGAAKRSDWGNAHAKGWSTAEIDVEEWRKQAEPVEGFF
LRKFPYDPCSYSQNFDQGNAADELDNLSRSFSARFALADAASRIFLDKTT