; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh02G014200 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh02G014200
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionLEA_2 domain-containing protein
Genome locationCma_Chr02:8186379..8187657
RNA-Seq ExpressionCmaCh02G014200
SyntenyCmaCh02G014200
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647845.1 hypothetical protein Csa_000600 [Cucumis sativus]1.2e-6553.72Show/hide
Query:  MGEHTHSFPLPHSQAHHKTP-----------KNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSIS-NTNSSS-SSLNLTLIAE
        MGE + SFPL H QAHHK             + E SNKCFIY+FS FVFL VALLIF+LIV+RVNSP+I LSSIS  + S+S NTNSSS +SLNL+  AE
Subjt:  MGEHTHSFPLPHSQAHHKTP-----------KNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSIS-NTNSSS-SSLNLTLIAE

Query:  FSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDL
        F+VDNSNF PF FD  TV  +YGG+I GERSTGGGRA AKG+ RMNV+VE S +NVS     +GILN SSF K  GR+RLIH+ R+R+ SEISCS+NLDL
Subjt:  FSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDL

Query:  NTHQIQPRW---PYKSPHPPPPPTGINGGRRNIWRPAGDTPTKAKSIRKIHQYLLHLPLCYRFHRLHRRPDPRSSVAVKNLHYGFSSTPSMDATLIAELT
        NTHQIQ  W   P  +   P   T  +  RR   +   +T             +  L       R+       +SVAVK+L YGFS TP M+ATL  E+T
Subjt:  NTHQIQPRW---PYKSPHPPPPPTGINGGRRNIWRPAGDTPTKAKSIRKIHQYLLHLPLCYRFHRLHRRPDPRSSVAVKNLHYGFSSTPSMDATLIAELT

Query:  RENPNFGQF
         ENPN+G F
Subjt:  RENPNFGQF

KAG6606065.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia]9.4e-9896.43Show/hide
Query:  MGEHTHSFPLPHSQAHHKTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSVDNSNFSPFIF
        MGEH+HSFPLPHSQAHHKTPKNEPSNKCFIY+FS+FVFLCVALLIFSLIV+RVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFS+DNSNF PFIF
Subjt:  MGEHTHSFPLPHSQAHHKTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSVDNSNFSPFIF

Query:  DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQPRW
        DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRI LIHVLRKRIWSEISCSINLDLNTHQIQPRW
Subjt:  DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQPRW

XP_022958536.1 uncharacterized protein LOC111459739 [Cucurbita moschata]4.2e-9893.63Show/hide
Query:  MGEHTHSFPLPHSQAHHKTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSVDNSNFSPFIF
        MGEH+HSFPLPHSQAHHKTPKNEPSNKCFIY+FS+FVFLCVALLIFSLIV+RVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFS+DNSNF PFIF
Subjt:  MGEHTHSFPLPHSQAHHKTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSVDNSNFSPFIF

Query:  DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQPRWPYKS
        DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRI LIHVLRKRIWSEISCSINLDLNTHQIQPRW   +
Subjt:  DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQPRWPYKS

Query:  PHPP
         H P
Subjt:  PHPP

XP_023532981.1 uncharacterized protein LOC111794996 [Cucurbita pepo subsp. pepo]3.9e-9695Show/hide
Query:  MGEHTHSFPLPHSQAHHKTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSVDNSNFSPFIF
        M EHTHSFPLPHSQAHHKT KNEPSNKCFIYLFSAFVFLCVALLIFSLIV+RVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSVDNSNF PFIF
Subjt:  MGEHTHSFPLPHSQAHHKTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSVDNSNFSPFIF

Query:  DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQPRWPYKS
        DYVTVVFMYGGVIVGERS+GGGRAEAKGTTRMNVSVE SVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQI PRW  +S
Subjt:  DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQPRWPYKS

XP_038875090.1 late embryogenesis abundant protein At1g64065 [Benincasa hispida]1.7e-6265.22Show/hide
Query:  MGEHTHSFPLPHSQAHH-----------KTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFS
        M E + SFPL H QAHH           KT + E SNKCFIY+FS FVFL VA+LIF+LIV+RVNSP+I LSS+S+ KFSI+N NSSS SLNLT+IAEF+
Subjt:  MGEHTHSFPLPHSQAHH-----------KTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFS

Query:  VDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNT
        VDNSNF PF FD  TV  MYGG IVGE+STG GRAEAKG+ RMNV++EAS +N+SSD N  GILN++SF K  GR+RLIH+ R+R  SEI+CS+NLD+NT
Subjt:  VDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNT

Query:  HQIQPRW
        HQIQ  W
Subjt:  HQIQPRW

TrEMBL top hitse value%identityAlignment
A0A0A0KQT7 LEA_2 domain-containing protein2.5e-5664.11Show/hide
Query:  MGEHTHSFPLPHSQAHHKTP-----------KNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSIS-NTNSSS-SSLNLTLIAE
        MGE + SFPL H QAHHK             + E SNKCFIY+FS FVFL VALLIF+LIV+RVNSP+I LSSIS  + S+S NTNSSS +SLNL+  AE
Subjt:  MGEHTHSFPLPHSQAHHKTP-----------KNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSIS-NTNSSS-SSLNLTLIAE

Query:  FSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDL
        F+VDNSNF PF FD  TV  +YGG+I GERSTGGGRA AKG+ RMNV+VE S +NVS     +GILN SSF K  GR+RLIH+ R+R+ SEISCS+NLDL
Subjt:  FSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDL

Query:  NTHQIQPRW
        NTHQIQ  W
Subjt:  NTHQIQPRW

A0A1S3ATY3 late embryogenesis abundant protein At1g640651.6e-5863.64Show/hide
Query:  MGEHTHSFPLPHSQAHHKTPK-----------NEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSS--SLNLTLIAE
        MGE + SFPL H QAHHKT +            E SNKCFIY+FS FVFL VALLIF+LIV+RVNSP+I+LS++S+ KFS+SN N+SSS  SL+L+  A 
Subjt:  MGEHTHSFPLPHSQAHHKTPK-----------NEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSS--SLNLTLIAE

Query:  FSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDL
        F+VDNSNF PF FD  TV  +YGG+I GERSTGGGRAEAKG+ RMNV+VE S +NVS     +GIL++SSF K  GR+RLIHV R+R+ SEISCS+NLDL
Subjt:  FSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDL

Query:  NTHQIQPRW
        NTHQIQ  W
Subjt:  NTHQIQPRW

A0A5A7TL68 Late embryogenesis abundant protein3.5e-5863.64Show/hide
Query:  MGEHTHSFPLPHSQAHHKTPK-----------NEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSS--SLNLTLIAE
        MGE + SFPL H QAHHKT +            E SNKCFIY+FS FVFL VALLIF+LIV+RVNSP+I+LS++S+ KFS+SN N+SSS  SL+L+  A 
Subjt:  MGEHTHSFPLPHSQAHHKTPK-----------NEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSS--SLNLTLIAE

Query:  FSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDL
        F VDNSNF PF FD  TV  +YGG+I GERSTGGGRAEAKG+ RMNV+VE S +NVS     +GIL++SSF K  GR+RLIHV R+R+ SEISCS+NLDL
Subjt:  FSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDL

Query:  NTHQIQPRW
        NTHQIQ  W
Subjt:  NTHQIQPRW

A0A6J1H2C0 uncharacterized protein LOC1114597392.0e-9893.63Show/hide
Query:  MGEHTHSFPLPHSQAHHKTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSVDNSNFSPFIF
        MGEH+HSFPLPHSQAHHKTPKNEPSNKCFIY+FS+FVFLCVALLIFSLIV+RVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFS+DNSNF PFIF
Subjt:  MGEHTHSFPLPHSQAHHKTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSVDNSNFSPFIF

Query:  DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQPRWPYKS
        DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRI LIHVLRKRIWSEISCSINLDLNTHQIQPRW   +
Subjt:  DYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQPRWPYKS

Query:  PHPP
         H P
Subjt:  PHPP

A0A6J1I3M2 uncharacterized protein LOC1114688756.0e-5859.81Show/hide
Query:  MGEHTHSFPLPHSQAHHKTP-------------KNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAE
        M + + SFP+ H +AHHK+              + E SNKCFIY+FSAFVFL VA+LIF+LIV+RVNSP +  SS+SV KFS+SNTNSSS SLNLT+ A+
Subjt:  MGEHTHSFPLPHSQAHHKTP-------------KNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAE

Query:  FSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDL
         +VDNSNF PF FDY +V F+Y G IVG+ +TG GRA+AKGT  MNV+V+AS  N+S+D N S +LN+SSFA   GR+RLIH+ R+R  SEISCS+ LDL
Subjt:  FSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDL

Query:  NTHQIQPRW
        NTHQIQ  W
Subjt:  NTHQIQPRW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.9e-1226.86Show/hide
Query:  KCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKF-SISNTNS-SSSSLNLTLIAEFSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRA
        KC I + +  + L   +L     V RV  P I ++ + V    S++ TN       N+++I + SV N N + F +   T    Y G +VGE     G+A
Subjt:  KCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKF-SISNTNS-SSSSLNLTLIAEFSVDNSNFSPFIFDYVTVVFMYGGVIVGERSTGGGRA

Query:  EAKGTTRMNVSVEASVENVSSD------LNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQ
            T+RMNV+V+  ++ + SD      ++ SG++N+ S+ + GG+++++ +++K +  +++C++ +++    IQ
Subjt:  EAKGTTRMNVSVEASVENVSSD------LNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGAACACACCCACAGCTTTCCATTGCCACACTCCCAAGCTCACCACAAAACTCCCAAAAACGAACCATCCAACAAATGCTTCATCTACCTCTTCTCCGCCTTCGT
CTTCCTCTGCGTCGCCCTTCTGATCTTCTCTCTCATCGTTGTGCGCGTTAATTCCCCGACCATCGACCTCTCTTCCATCTCCGTCCGTAAGTTTTCCATCTCTAATACTA
ATTCCTCTTCCTCCTCGCTTAATCTGACCTTGATTGCTGAATTCTCCGTCGACAATTCGAACTTCAGTCCCTTCATTTTCGATTACGTCACCGTCGTTTTCATGTACGGC
GGCGTCATCGTCGGCGAAAGGAGTACTGGCGGGGGTAGGGCTGAGGCGAAGGGGACGACGAGGATGAATGTTTCTGTTGAAGCTTCTGTGGAGAATGTTAGCAGCGATTT
GAATGGTTCGGGGATTTTGAATATGAGTAGCTTTGCGAAATTTGGAGGGAGAATTCGTTTGATTCATGTTTTAAGGAAGAGGATTTGGTCGGAGATTAGTTGTTCCATTA
ATCTGGATTTGAACACTCATCAAATTCAGCCTCGTTGGCCCTACAAATCCCCTCATCCTCCTCCACCACCAACCGGAATTAATGGCGGCCGACGAAACATCTGGCGACCT
GCCGGCGACACTCCGACCAAGGCGAAAAGCATCAGAAAAATACATCAATACCTCCTGCATTTACCTCTTTGCTATCGCTTCCATCGCCTGCATCGTCGCCCTGACCCTCG
GTCTTCCGTCGCAGTAAAAAATCTGCACTACGGCTTCTCGTCGACCCCTTCCATGGACGCCACATTAATCGCCGAATTAACACGGGAAAATCCCAATTTCGGGCAGTTCA
ATGAAAAAGTGAAACCAAATCCGAGCTTCGTTAATGGTGTGTTACTTCAGCCACAATTTGGGAAGGTAGAAGACGATGAATATGAGCTGCATCACGGAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCGAACACACCCACAGCTTTCCATTGCCACACTCCCAAGCTCACCACAAAACTCCCAAAAACGAACCATCCAACAAATGCTTCATCTACCTCTTCTCCGCCTTCGT
CTTCCTCTGCGTCGCCCTTCTGATCTTCTCTCTCATCGTTGTGCGCGTTAATTCCCCGACCATCGACCTCTCTTCCATCTCCGTCCGTAAGTTTTCCATCTCTAATACTA
ATTCCTCTTCCTCCTCGCTTAATCTGACCTTGATTGCTGAATTCTCCGTCGACAATTCGAACTTCAGTCCCTTCATTTTCGATTACGTCACCGTCGTTTTCATGTACGGC
GGCGTCATCGTCGGCGAAAGGAGTACTGGCGGGGGTAGGGCTGAGGCGAAGGGGACGACGAGGATGAATGTTTCTGTTGAAGCTTCTGTGGAGAATGTTAGCAGCGATTT
GAATGGTTCGGGGATTTTGAATATGAGTAGCTTTGCGAAATTTGGAGGGAGAATTCGTTTGATTCATGTTTTAAGGAAGAGGATTTGGTCGGAGATTAGTTGTTCCATTA
ATCTGGATTTGAACACTCATCAAATTCAGCCTCGTTGGCCCTACAAATCCCCTCATCCTCCTCCACCACCAACCGGAATTAATGGCGGCCGACGAAACATCTGGCGACCT
GCCGGCGACACTCCGACCAAGGCGAAAAGCATCAGAAAAATACATCAATACCTCCTGCATTTACCTCTTTGCTATCGCTTCCATCGCCTGCATCGTCGCCCTGACCCTCG
GTCTTCCGTCGCAGTAAAAAATCTGCACTACGGCTTCTCGTCGACCCCTTCCATGGACGCCACATTAATCGCCGAATTAACACGGGAAAATCCCAATTTCGGGCAGTTCA
ATGAAAAAGTGAAACCAAATCCGAGCTTCGTTAATGGTGTGTTACTTCAGCCACAATTTGGGAAGGTAGAAGACGATGAATATGAGCTGCATCACGGAATTTGA
Protein sequenceShow/hide protein sequence
MGEHTHSFPLPHSQAHHKTPKNEPSNKCFIYLFSAFVFLCVALLIFSLIVVRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSVDNSNFSPFIFDYVTVVFMYG
GVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNGSGILNMSSFAKFGGRIRLIHVLRKRIWSEISCSINLDLNTHQIQPRWPYKSPHPPPPPTGINGGRRNIWRP
AGDTPTKAKSIRKIHQYLLHLPLCYRFHRLHRRPDPRSSVAVKNLHYGFSSTPSMDATLIAELTRENPNFGQFNEKVKPNPSFVNGVLLQPQFGKVEDDEYELHHGI