CuGenDBv2

Sgr015227 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015227
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: LEA_2 domain-containing protein
Genome location: tig00003063:900252..911826
RNA-Seq Expression: Sgr015227
Synteny: Sgr015227
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits
KAG6589033.1 hypothetical protein SDJN03_17598, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.6e-20; % identity: 80)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK
        MEEFY++R+SSRKV T+V+GHQVPLYGGIS IGNWR+QR +GVEV LNLTV VRSRAYILG+LVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK

XP_022135688.1 uncharacterized protein LOC111007587 isoform X1 [Momordica charantia] (e value: 9.9e-23; % identity: 89.23)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK
        M EFYE+R+SSR+VATAVAGHQVPLYGGI+VIGNWREQR EGVEVPLNLTV VRSRAYILGKLVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK

XP_022928427.1 uncharacterized protein LOC111435243 [Cucurbita moschata] (e value: 4.6e-20; % identity: 80)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK
        MEEFY++R+SSRKV T+V+GHQVPLYGGIS IGNWR+QR +GVEV LNLTV VRSRAYILG+LVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK

XP_022989441.1 uncharacterized protein LOC111486495 [Cucurbita maxima] (e value: 4.6e-20; % identity: 80)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK
        MEEFY++R+SSRKV T+V+GHQVPLYGGIS IGNWR+QR +GVEV LNLTV VRSRAYILG+LVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK

XP_038888376.1 uncharacterized protein LOC120078225 [Benincasa hispida] (e value: 2.3e-19; % identity: 76.12)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLE--GVEVPLNLTVGVRSRAYILGKLVK
        MEEFY++R+SSR+V T+VAGHQ+PLYGGIS IGNWR+QR +  GVE+PLNLTV VRSRAYILG+LVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLE--GVEVPLNLTVGVRSRAYILGKLVK

TrEMBL top hits
A0A0A0K4T2 LEA_2 domain-containing protein (e value: 4.2e-19; % identity: 77.61)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLE--GVEVPLNLTVGVRSRAYILGKLVK
        MEEFY++R+SSR+V T+VAGHQVPLYGGIS IGNWR+QR +  GVEV LNLTV VRSRAYILG+LVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLE--GVEVPLNLTVGVRSRAYILGKLVK

A0A1S3BJ42 uncharacterized protein LOC103490245 (e value: 9.3e-19; % identity: 76.12)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLE--GVEVPLNLTVGVRSRAYILGKLVK
        MEEFY++R+SSR++ T+VAGHQVPLYGGIS IGNWR+QR +  GVEV LNLTV VRSRAYILG+LVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLE--GVEVPLNLTVGVRSRAYILGKLVK

A0A6J1C5K4 uncharacterized protein LOC111007587 isoform X1 (e value: 4.8e-23; % identity: 89.23)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK
        M EFYE+R+SSR+VATAVAGHQVPLYGGI+VIGNWREQR EGVEVPLNLTV VRSRAYILGKLVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK

A0A6J1EJW4 uncharacterized protein LOC111435243 (e value: 2.2e-20; % identity: 80)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK
        MEEFY++R+SSRKV T+V+GHQVPLYGGIS IGNWR+QR +GVEV LNLTV VRSRAYILG+LVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK

A0A6J1JK28 uncharacterized protein LOC111486495 (e value: 2.2e-20; % identity: 80)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK
        MEEFY++R+SSRKV T+V+GHQVPLYGGIS IGNWR+QR +GVEV LNLTV VRSRAYILG+LVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G45688.1 unknown protein (e value: 5.8e-05; % identity: 32.04)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVI-------GNWREQRLEG------------VEVPLNLTVGVRSRAYILGKLVKLSDLQPHEAWISKRHR
        +++FY+ RKS R V   V G ++PLYG  S +          + ++ +G              VP+ L+  VRSRAY+LGKLV+    +  E  I+  H+
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVI-------GNWREQRLEG------------VEVPLNLTVGVRSRAYILGKLVKLSDLQPHEAWISKRHR

Query:  QRN
          N
Subjt:  QRN

AT2G41990.1 CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864) (e value: 3.3e-08; % identity: 45.31)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLV
        M +F   R     V T V GHQ+PLYGG+S         L+ + +PLNLT+ + S+AYILG+LV
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLV

AT4G35170.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 7.8e-10; % identity: 49.23)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK
        M EF + RKS R + T V G Q+PLYGG+  +   R +  + V +PLNLT  +R+RAY+LG+LVK
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVK

AT5G42860.1 unknown protein (e value: 9.9e-05; % identity: 34.94)
Query:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVI-------GNWREQRLEG-----------VEVPLNLTVGVRSRAYILGKLVK
        +++FY+ RKS R V   V G ++PLYG  S +          + ++ +G             VP+ L   VRSRAY+LGKLV+
Subjt:  MEEFYEERKSSRKVATAVAGHQVPLYGGISVI-------GNWREQRLEG-----------VEVPLNLTVGVRSRAYILGKLVK


Sequences
CDS sequence
ATAATGAAATGGGGGTGGGGGGGGGGGGATTGTATAAATTGAAGTATTTTTGGGGGGTGAGAGATGAAATTTGTGGAGAAATTCGGGGTGAGACTTTTCAGACATTGGGA
ATGGAAGAGTTCTATGAAGAAAGGAAGAGCTCGCGGAAGGTGGCAACGGCGGTGGCGGGGCATCAAGTTCCTCTGTACGGCGGGATCTCGGTGATCGGAAACTGGAGAGA
GCAGCGGCTGGAGGGGGTGGAGGTGCCGCTGAACCTGACGGTGGGGGTGAGGTCTAGGGCTTACATTCTGGGGAAACTGGTGAAGCTTTCTGATCTTCAGCCACATGAAG
CATGGATAAGCAAACGTCACCGGCAGCGCAATCCCGCCGACGAGCCCCGCCATGCTCCCCAGAAATGGGATCGCCACCGCCACGAAGAAACACAGATCTGGAACGAGCTC
ATGGCATTGATTATCACTAATAAGCTCGTTAAGCCGATGAGAAACTGTGAGGTGTCGCGGGCATGGAAGGCGTACAGAGCTGATTTGATCGTCAGTTACCTGTATCTCAA
GGACGAGATTGTGCCCTCTGAAAGCAAAAGCGACGATTCCCAGAGCGTTCAATGCGTCAAAGGCCCTAGCCATGTTGGTGTTGGCTTTCACCGGATCGTAGGAAACGCCG
GGCATTCTGCCCTTCACGACGGAGATCATCCATATCAACGTGCAGTAGCCGATGGCACTAACAGGACGGCGGCGCAGGTGAAAACCAAGTACCATTCGGTCGGCGTTAAG
GAATCCGCTCTGCAGTTGGTTCCGCAAACGATTTTGAAGAATGTGTTCGACGTTGAGCCTCCGATGATTATCAGAGCCACGCAGGTGCCGGCGGAGAGTATAAAGTTGCC
ATATGAACGCCACCGTCAAGCTTATGATCCCTCCGGCCCTGCAACCAGAAGCTTTCAAAAACCCATTGAATTCAAACTTATTGTAAACGATCAGTGGTTTTATTTACCAT
CCGAGAATGGTGAAGGCGACGGGGAGCACCAGAGCCTGGATTCCGATGCCGGAGCAGAGGGTGTGGAAGGCGGCGTAGAAGGCGTTGCCGTTTCTCGACTCGGTGATCGG
AAGCCAGGCGTCGTGGGGGTCCAGCCTCGTCAAGTTGAGGGCCCGTCGGATCGGACTTCCGATCGGAGTGATATTGAACCGCGGTCTAGGGCTCGACAGTCTCGGAGTTT
TGCTCGCAATTTCTATCTGGTCGTCGCCGGAGAGCAGCGGCGAGCGGAACCCACAAATCATGACTTGCTGGCCAGCCTCCGCCACCGTCGACGAGAACAAAGAAGAAACA
CAGCATCTGGCCGCCGGAGCAATGGACAAAACCAGAGCTTGCTCCACTTCCCGGTCGACTTGTCGTCGGATCAAAGACACAGCATAGTCCATGAACGCCGCATCACACGG
CAGCTTGATCGGCCCGTCCCTCGGCAGCCCGAACTCCTCCTCCGACATCTTCAACAGCTCCCTGAACACGTAGCTCCCCAGATATCTTATCGGAAAAACGAACCGCTTCT
GATCGACTGTGTAGACGACGAAGTGGCCCTTGTCAGCCACCGTGGAGGAGCTTCTGGTTCGCGGCAACGATATCCTCTTCCGCCGCTGAGAGGCGGCCGCCGTCTTCTGC
CATTTCCTGGCTATTCTAATTAG
mRNA sequence
ATAATGAAATGGGGGTGGGGGGGGGGGGATTGTATAAATTGAAGTATTTTTGGGGGGTGAGAGATGAAATTTGTGGAGAAATTCGGGGTGAGACTTTTCAGACATTGGGA
ATGGAAGAGTTCTATGAAGAAAGGAAGAGCTCGCGGAAGGTGGCAACGGCGGTGGCGGGGCATCAAGTTCCTCTGTACGGCGGGATCTCGGTGATCGGAAACTGGAGAGA
GCAGCGGCTGGAGGGGGTGGAGGTGCCGCTGAACCTGACGGTGGGGGTGAGGTCTAGGGCTTACATTCTGGGGAAACTGGTGAAGCTTTCTGATCTTCAGCCACATGAAG
CATGGATAAGCAAACGTCACCGGCAGCGCAATCCCGCCGACGAGCCCCGCCATGCTCCCCAGAAATGGGATCGCCACCGCCACGAAGAAACACAGATCTGGAACGAGCTC
ATGGCATTGATTATCACTAATAAGCTCGTTAAGCCGATGAGAAACTGTGAGGTGTCGCGGGCATGGAAGGCGTACAGAGCTGATTTGATCGTCAGTTACCTGTATCTCAA
GGACGAGATTGTGCCCTCTGAAAGCAAAAGCGACGATTCCCAGAGCGTTCAATGCGTCAAAGGCCCTAGCCATGTTGGTGTTGGCTTTCACCGGATCGTAGGAAACGCCG
GGCATTCTGCCCTTCACGACGGAGATCATCCATATCAACGTGCAGTAGCCGATGGCACTAACAGGACGGCGGCGCAGGTGAAAACCAAGTACCATTCGGTCGGCGTTAAG
GAATCCGCTCTGCAGTTGGTTCCGCAAACGATTTTGAAGAATGTGTTCGACGTTGAGCCTCCGATGATTATCAGAGCCACGCAGGTGCCGGCGGAGAGTATAAAGTTGCC
ATATGAACGCCACCGTCAAGCTTATGATCCCTCCGGCCCTGCAACCAGAAGCTTTCAAAAACCCATTGAATTCAAACTTATTGTAAACGATCAGTGGTTTTATTTACCAT
CCGAGAATGGTGAAGGCGACGGGGAGCACCAGAGCCTGGATTCCGATGCCGGAGCAGAGGGTGTGGAAGGCGGCGTAGAAGGCGTTGCCGTTTCTCGACTCGGTGATCGG
AAGCCAGGCGTCGTGGGGGTCCAGCCTCGTCAAGTTGAGGGCCCGTCGGATCGGACTTCCGATCGGAGTGATATTGAACCGCGGTCTAGGGCTCGACAGTCTCGGAGTTT
TGCTCGCAATTTCTATCTGGTCGTCGCCGGAGAGCAGCGGCGAGCGGAACCCACAAATCATGACTTGCTGGCCAGCCTCCGCCACCGTCGACGAGAACAAAGAAGAAACA
CAGCATCTGGCCGCCGGAGCAATGGACAAAACCAGAGCTTGCTCCACTTCCCGGTCGACTTGTCGTCGGATCAAAGACACAGCATAGTCCATGAACGCCGCATCACACGG
CAGCTTGATCGGCCCGTCCCTCGGCAGCCCGAACTCCTCCTCCGACATCTTCAACAGCTCCCTGAACACGTAGCTCCCCAGATATCTTATCGGAAAAACGAACCGCTTCT
GATCGACTGTGTAGACGACGAAGTGGCCCTTGTCAGCCACCGTGGAGGAGCTTCTGGTTCGCGGCAACGATATCCTCTTCCGCCGCTGAGAGGCGGCCGCCGTCTTCTGC
CATTTCCTGGCTATTCTAATTAG
Protein sequence
NEMGVGGGGLYKLKYFWGVRDEICGEIRGETFQTLGMEEFYEERKSSRKVATAVAGHQVPLYGGISVIGNWREQRLEGVEVPLNLTVGVRSRAYILGKLVKLSDLQPHEA
WISKRHRQRNPADEPRHAPQKWDRHRHEETQIWNELMALIITNKLVKPMRNCEVSRAWKAYRADLIVSYLYLKDEIVPSESKSDDSQSVQCVKGPSHVGVGFHRIVGNAG
HSALHDGDHPYQRAVADGTNRTAAQVKTKYHSVGVKESALQLVPQTILKNVFDVEPPMIIRATQVPAESIKLPYERHRQAYDPSGPATRSFQKPIEFKLIVNDQWFYLPS
ENGEGDGEHQSLDSDAGAEGVEGGVEGVAVSRLGDRKPGVVGVQPRQVEGPSDRTSDRSDIEPRSRARQSRSFARNFYLVVAGEQRRAEPTNHDLLASLRHRRREQRRNT
ASGRRSNGQNQSLLHFPVDLSSDQRHSIVHERRITRQLDRPVPRQPELLLRHLQQLPEHVAPQISYRKNEPLLIDCVDDEVALVSHRGGASGSRQRYPLPPLRGGRRLLP
FPGYSN