; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr000789 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr000789
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLate embryogenesis abundant protein (LEA) family protein
Genome locationtig00000536:122632..124432
RNA-Seq ExpressionSgr000789
SyntenySgr000789
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8543454.1 hypothetical protein F0562_021051 [Nyssa sinensis]3.9e-2753.46Show/hide
Query:  SSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRP--------RHASSQVAA--AAADGAAQDVKRA------TEKAAEGVKDKAATAAEEVTQKTKEVAG
        +S+A T +L K H   +++VRSPWSSRVFV V P        RH ++  +A   A+D A + V  A       +KA + +K+KAA+ AE+V QKT++VAG
Subjt:  SSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRP--------RHASSQVAA--AAADGAAQDVKRA------TEKAAEGVKDKAATAAEEVTQKTKEVAG

Query:  KVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKS
        KV+ETAQDLA KAKQT QDAWGSVKDTTQ IKE V GKAEESKE IK+ AE+ K+++ +
Subjt:  KVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKS

XP_008457975.1 PREDICTED: uncharacterized protein At4g13230 [Cucumis melo]3.5e-5282.31Show/hide
Query:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MAS LA +ATLPKL RAGS++V +PWSS+VFVS+RPRH SSQV +AAADGAAQ VKR TEKAAE VKDKA +AAEEVTQKTK+VAGKVSETAQD+AGKAK
Subjt:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC
        QTV+DAWGSVKDTTQNIKEKVVGKAEESKEAIKD+AEN+K N+KS C
Subjt:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC

XP_011659168.1 uncharacterized protein At4g13230 [Cucumis sativus]8.6e-5178.91Show/hide
Query:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MAS LA +AT+PKL RAGS++V +PWSS VFVS+RPRH SSQV AAAAD AA  VKR TE+AAE VKDKA +AA+EVTQ+TKEVAGKVSETAQD+AGKAK
Subjt:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC
        QTV+DAWGSVKDTTQNIK+KVVGKAEESKEAIKD+AEN+K+N+KS C
Subjt:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC

XP_022152517.1 uncharacterized protein At4g13230 [Momordica charantia]6.4e-5485.91Show/hide
Query:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQ--VAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGK
        MASSLA+ AT+ KL+RAGSVLVRSPWSSRVFVSVRPRHASS    AAAAADGAAQ VKRATEKAAE VKDKA++AAEEVTQKTKEVAGKVSETAQDLAGK
Subjt:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQ--VAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC
        AKQTVQ+AWGSVKDTTQNIKEKVVGKAEESKEAIKD+A+N+K  +KS C
Subjt:  AKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC

XP_038897424.1 uncharacterized protein At4g13230 [Benincasa hispida]2.9e-5485.03Show/hide
Query:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MAS LA +ATLPKLHRA S+ VR+PWSS+VFVSVRP+H SS+VAAAAADGAAQ VKRATEKAAE VKDKA +AA+EVTQKTKEVAGKVSETAQDLAGKAK
Subjt:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC
        QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKD+A+NLK+N+K+ C
Subjt:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC

TrEMBL top hitse value%identityAlignment
A0A0A0K9U4 Uncharacterized protein4.2e-5178.91Show/hide
Query:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MAS LA +AT+PKL RAGS++V +PWSS VFVS+RPRH SSQV AAAAD AA  VKR TE+AAE VKDKA +AA+EVTQ+TKEVAGKVSETAQD+AGKAK
Subjt:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC
        QTV+DAWGSVKDTTQNIK+KVVGKAEESKEAIKD+AEN+K+N+KS C
Subjt:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC

A0A1S3C7D4 uncharacterized protein At4g132301.7e-5282.31Show/hide
Query:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MAS LA +ATLPKL RAGS++V +PWSS+VFVS+RPRH SSQV +AAADGAAQ VKR TEKAAE VKDKA +AAEEVTQKTK+VAGKVSETAQD+AGKAK
Subjt:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC
        QTV+DAWGSVKDTTQNIKEKVVGKAEESKEAIKD+AEN+K N+KS C
Subjt:  QTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC

A0A5B7AK94 Uncharacterized protein (Fragment)2.6e-2954.04Show/hide
Query:  SSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRP-------RHASSQVAAAAADGAAQDVKRATEKAAEG----------VKDKAATAAEEVTQKTKEVA
        +S+A T TL K      + VRSPWSSRVFV + P       RH S+  A +A + AA    +  + A +G          +K+KAA+ AE+VT K+K+VA
Subjt:  SSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRP-------RHASSQVAAAAADGAAQDVKRATEKAAEG----------VKDKAATAAEEVTQKTKEVA

Query:  GKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSK
        GKVSETAQDLA KAKQT QDAWGSVKDTTQ IKE V GKAEESK++IKDSAEN+K+++ +K
Subjt:  GKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSK

A0A5J5BL23 Uncharacterized protein1.9e-2753.46Show/hide
Query:  SSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRP--------RHASSQVAA--AAADGAAQDVKRA------TEKAAEGVKDKAATAAEEVTQKTKEVAG
        +S+A T +L K H   +++VRSPWSSRVFV V P        RH ++  +A   A+D A + V  A       +KA + +K+KAA+ AE+V QKT++VAG
Subjt:  SSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRP--------RHASSQVAA--AAADGAAQDVKRA------TEKAAEGVKDKAATAAEEVTQKTKEVAG

Query:  KVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKS
        KV+ETAQDLA KAKQT QDAWGSVKDTTQ IKE V GKAEESKE IK+ AE+ K+++ +
Subjt:  KVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKS

A0A6J1DGG7 uncharacterized protein At4g132303.1e-5485.91Show/hide
Query:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQ--VAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGK
        MASSLA+ AT+ KL+RAGSVLVRSPWSSRVFVSVRPRHASS    AAAAADGAAQ VKRATEKAAE VKDKA++AAEEVTQKTKEVAGKVSETAQDLAGK
Subjt:  MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQ--VAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC
        AKQTVQ+AWGSVKDTTQNIKEKVVGKAEESKEAIKD+A+N+K  +KS C
Subjt:  AKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC

SwissProt top hitse value%identityAlignment
Q8LFD5 Uncharacterized protein At4g132301.1e-0841.3Show/hide
Query:  VKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSK
        V   T    + V DKA  A ++V +K  E A  +S+ A +L  KAK T ++AW  VKDTT+ IK+ V GK EE+KE+IK +A+ +++++ +K
Subjt:  VKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSK

Arabidopsis top hitse value%identityAlignment
AT1G04670.1 unknown protein1.3e-0437.14Show/hide
Query:  RVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVK----------DKAATAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIK
        R FVS  PR    +  A       Q VK A E   EG K          D A+T A  VT+ TK+V  KV+ET   +  KAK +V    G+ K+ T  IK
Subjt:  RVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVK----------DKAATAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIK

Query:  EKVVG
         K++G
Subjt:  EKVVG

AT4G13230.1 Late embryogenesis abundant protein (LEA) family protein7.9e-1041.3Show/hide
Query:  VKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSK
        V   T    + V DKA  A ++V +K  E A  +S+ A +L  KAK T ++AW  VKDTT+ IK+ V GK EE+KE+IK +A+ +++++ +K
Subjt:  VKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSVKDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGCAGCTTGGCCGCAACAGCAACTCTTCCAAAGCTTCATCGTGCTGGATCTGTTCTTGTACGCAGTCCATGGAGCTCTAGGGTTTTCGTCTCTGTCAGACCAAG
ACATGCAAGTTCACAGGTTGCTGCGGCAGCAGCTGACGGAGCTGCTCAGGACGTCAAGCGAGCCACTGAGAAGGCTGCCGAAGGAGTCAAAGATAAGGCTGCCACTGCCG
CTGAGGAGGTGACTCAGAAAACAAAGGAAGTTGCTGGGAAGGTGAGTGAAACAGCACAAGATTTGGCTGGGAAAGCAAAGCAAACAGTTCAAGATGCTTGGGGATCTGTG
AAGGACACAACTCAGAACATCAAAGAGAAAGTGGTTGGCAAAGCTGAGGAATCCAAAGAAGCCATTAAAGACAGTGCAGAGAACCTCAAACAGAATGTCAAGTCAAAGTG
TTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGCAGCTTGGCCGCAACAGCAACTCTTCCAAAGCTTCATCGTGCTGGATCTGTTCTTGTACGCAGTCCATGGAGCTCTAGGGTTTTCGTCTCTGTCAGACCAAG
ACATGCAAGTTCACAGGTTGCTGCGGCAGCAGCTGACGGAGCTGCTCAGGACGTCAAGCGAGCCACTGAGAAGGCTGCCGAAGGAGTCAAAGATAAGGCTGCCACTGCCG
CTGAGGAGGTGACTCAGAAAACAAAGGAAGTTGCTGGGAAGGTGAGTGAAACAGCACAAGATTTGGCTGGGAAAGCAAAGCAAACAGTTCAAGATGCTTGGGGATCTGTG
AAGGACACAACTCAGAACATCAAAGAGAAAGTGGTTGGCAAAGCTGAGGAATCCAAAGAAGCCATTAAAGACAGTGCAGAGAACCTCAAACAGAATGTCAAGTCAAAGTG
TTGA
Protein sequenceShow/hide protein sequence
MASSLAATATLPKLHRAGSVLVRSPWSSRVFVSVRPRHASSQVAAAAADGAAQDVKRATEKAAEGVKDKAATAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQDAWGSV
KDTTQNIKEKVVGKAEESKEAIKDSAENLKQNVKSKC