; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007955 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007955
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionLate embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Genome locationChr10:17834907..17835554
RNA-Seq ExpressionHG10007955
SyntenyHG10007955
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046596.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 1 [Cucumis melo var. makuwa]2.1e-8587.18Show/hide
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MT+SS DDSVPVPYTLL  NAAQQNVVVLSLYRP PCRHRRLLRL A YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+P V+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLI
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNG EVVHDVLYLLADLGKGIIPFDTETEVEGSMG+ FIK PIK  +I
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLI

XP_004148717.1 uncharacterized protein LOC101219269 [Cucumis sativus]8.9e-8985.44Show/hide
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MTSSS DDSVPVPYTL+P NAAQQNVVVLSLYRPPPCRHRRLLRLCA YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+PVV+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLIL-ILF
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS+GGRVSARGSSYVNATLDLNG EVVHDVLYLLADLGKGIIPFDTET+VEGSMG+ FIK PIK  +   +L 
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLIL-ILF

Query:  FTINQS
         T NQ+
Subjt:  FTINQS

XP_008463384.1 PREDICTED: uncharacterized protein LOC103501551 [Cucumis melo]7.8e-8583.98Show/hide
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MT+SS DDSVPVPYTLL  NAAQQNVVVLSLYRP PCRHRRLLRL A YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+P V+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLIL-ILF
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNG EVVHDVLYLLADLGKGIIPFDTETEVEGSMG+ FIK PIK  +   +L 
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLIL-ILF

Query:  FTINQS
         T NQ+
Subjt:  FTINQS

XP_022144909.1 uncharacterized protein LOC111014473 [Momordica charantia]7.6e-8078.82Show/hide
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MTSSSRDDSVPVPY+LLP NAA QNVVVLSLYRPP  R RRLLRLCA YSAAFLLL  VAFLLFP+DPSLQLVRLKLNR+KV LLPV+ LDLSF AS+RV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLILILFF
        RN NFFSLDY+Y+GVSVGYRGRRLG+VSS+GGRVSARG SYVNATLDLNGFEV+HD +YL+ DL  GI+PFDTETEVEG MG+ FIKFPIK  +   +F 
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLILILFF

Query:  TIN
          N
Subjt:  TIN

XP_038878687.1 uncharacterized protein LOC120070868 [Benincasa hispida]1.1e-8690.16Show/hide
Query:  MTSSSR-DDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPV-VALDLSFFASL
        MTSSSR DDSVPVPYTLLPQNAAQQNVVVLSLYR PPC+H RLLRLCALYSAAFLLLF VAFLLFP+DPS QLVRLKLN VKVHL+P  V+LDLSFFASL
Subjt:  MTSSSR-DDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPV-VALDLSFFASL

Query:  RVRNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIK
        RVRNKNFFSL YDYIGVSVGYRG+RLG+VSS+GGRVSARGSSYVNATLDLNG EVVHDVLYLLADLGKGIIPFDTETEVEGSMG+ FIKFPIK
Subjt:  RVRNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIK

TrEMBL top hitse value%identityAlignment
A0A0A0LTV4 LEA_2 domain-containing protein4.3e-8985.44Show/hide
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MTSSS DDSVPVPYTL+P NAAQQNVVVLSLYRPPPCRHRRLLRLCA YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+PVV+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLIL-ILF
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS+GGRVSARGSSYVNATLDLNG EVVHDVLYLLADLGKGIIPFDTET+VEGSMG+ FIK PIK  +   +L 
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLIL-ILF

Query:  FTINQS
         T NQ+
Subjt:  FTINQS

A0A1S3CJK6 uncharacterized protein LOC1035015513.8e-8583.98Show/hide
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MT+SS DDSVPVPYTLL  NAAQQNVVVLSLYRP PCRHRRLLRL A YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+P V+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLIL-ILF
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNG EVVHDVLYLLADLGKGIIPFDTETEVEGSMG+ FIK PIK  +   +L 
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLIL-ILF

Query:  FTINQS
         T NQ+
Subjt:  FTINQS

A0A5A7TX90 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 11.0e-8587.18Show/hide
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MT+SS DDSVPVPYTLL  NAAQQNVVVLSLYRP PCRHRRLLRL A YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+P V+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLI
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNG EVVHDVLYLLADLGKGIIPFDTETEVEGSMG+ FIK PIK  +I
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLI

A0A6J1CTN0 uncharacterized protein LOC1110144733.7e-8078.82Show/hide
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MTSSSRDDSVPVPY+LLP NAA QNVVVLSLYRPP  R RRLLRLCA YSAAFLLL  VAFLLFP+DPSLQLVRLKLNR+KV LLPV+ LDLSF AS+RV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLILILFF
        RN NFFSLDY+Y+GVSVGYRGRRLG+VSS+GGRVSARG SYVNATLDLNGFEV+HD +YL+ DL  GI+PFDTETEVEG MG+ FIKFPIK  +   +F 
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLILILFF

Query:  TIN
          N
Subjt:  TIN

A0A6J1JI07 uncharacterized protein LOC1114852802.7e-7576.73Show/hide
Query:  SSSRDDSVPVPYTLLPQN-AAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVR
        S S+D S+PVPY+ +P N AA QNVVVLSLYRPP  R RRLLRLCALYSAAFLLL  V FLLFPSDPSLQLVRLKLN VKV LLP V LDLSF AS+RVR
Subjt:  SSSRDDSVPVPYTLLPQN-AAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVR

Query:  NKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLILILFFT
        NKNFFSLDY+Y+GVSVG+RGRRLG+VSS GGRVSARGSSYVNATLDLNG +++HDV +LL DL KGIIPFDTETEVEGSMG+ FIKFPIK  +   +F  
Subjt:  NKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLILILFFT

Query:  IN
         N
Subjt:  IN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.7e-3437.37Show/hide
Query:  YTLLPQNAAQQ--NVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVRNKNFFSLDYD
        Y  LP +++ +  + V++S +  PP R R ++ +  +  A+ L+     ++ +PSDP ++++R+K++ V VH  PV ++D++   +L+V N + +S D+ 
Subjt:  YTLLPQNAAQQ--NVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVRNKNFFSLDYD

Query:  YIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLIL-ILFFTINQSFTR
         + V++ YRG+ LG+VSS GG V+A GSSY++A  +L+G  V  DV++L+ DL KG + FDT TE  G +G+LF +FP+K  +   IL  T+NQ+ +R
Subjt:  YIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLIL-ILFFTINQSFTR

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.7e-3338.12Show/hide
Query:  YTLLPQNAAQQ--NVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVRNKNFFSLDYD
        Y  LP +++ +  + V++S +  PP R R ++ +  +  A+ L+     ++ +PSDP ++++R+K++ V VH  PV ++D++   +L+V N + +S D+ 
Subjt:  YTLLPQNAAQQ--NVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVRNKNFFSLDYD

Query:  YIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKV
         + V++ YRG+ LG+VSS GG V+A GSSY++A  +L+G  V  DV++L+ DL KG + FDT TE  G +G+LF +FP+KV
Subjt:  YIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKV

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.0e-4546.6Show/hide
Query:  SSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHR-----RLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFAS
        +SS+ +   +PYT LP +   Q+V++L+ YR    RHR     R LR   L++A  LLL    +LL+PSDP + + R+ LN + V     +ALDLSF  +
Subjt:  SSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHR-----RLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFAS

Query:  LRVRNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLILI
        ++VRN++FFSLDYD + VS+GYRGR LG V S+GG + AR SSY++ATL+L+G EVVHDV+YL+ DL KG+IPFDT  +V+G +G+L    PI+  +   
Subjt:  LRVRNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLILI

Query:  LFFTIN
        ++  +N
Subjt:  LFFTIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCTCCAGTTCCAGGGACGATTCTGTCCCTGTGCCCTACACTCTTCTTCCCCAAAATGCTGCACAGCAAAACGTCGTCGTTTTATCCCTCTACCGTCCCCCTCCATG
CCGACATCGGCGGCTTCTCCGCCTCTGTGCTCTCTACTCCGCCGCCTTCCTCCTCCTCTTCACCGTTGCTTTTCTACTTTTCCCCTCCGATCCTTCGCTCCAACTCGTCC
GATTGAAACTCAATCGTGTCAAAGTCCATTTGTTGCCTGTTGTCGCCCTTGACCTTTCTTTCTTTGCTTCTCTTAGGGTTCGCAATAAGAACTTCTTCTCTCTCGATTAC
GATTACATTGGGGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATATGTGAGCTCTCAGGGCGGTCGAGTTTCTGCTCGAGGCTCTTCTTATGTGAATGCCACTCTCGA
CTTGAATGGGTTTGAAGTCGTTCACGACGTCTTGTACTTGCTTGCGGATCTGGGGAAGGGTATCATTCCCTTCGATACGGAGACGGAAGTGGAAGGATCCATGGGGATTC
TCTTTATCAAATTCCCGATTAAGGTAATGTTGATTTTGATTCTGTTTTTCACAATAAATCAAAGCTTTACTCGGAATTGTGAATTGGTTCGATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCTCCAGTTCCAGGGACGATTCTGTCCCTGTGCCCTACACTCTTCTTCCCCAAAATGCTGCACAGCAAAACGTCGTCGTTTTATCCCTCTACCGTCCCCCTCCATG
CCGACATCGGCGGCTTCTCCGCCTCTGTGCTCTCTACTCCGCCGCCTTCCTCCTCCTCTTCACCGTTGCTTTTCTACTTTTCCCCTCCGATCCTTCGCTCCAACTCGTCC
GATTGAAACTCAATCGTGTCAAAGTCCATTTGTTGCCTGTTGTCGCCCTTGACCTTTCTTTCTTTGCTTCTCTTAGGGTTCGCAATAAGAACTTCTTCTCTCTCGATTAC
GATTACATTGGGGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATATGTGAGCTCTCAGGGCGGTCGAGTTTCTGCTCGAGGCTCTTCTTATGTGAATGCCACTCTCGA
CTTGAATGGGTTTGAAGTCGTTCACGACGTCTTGTACTTGCTTGCGGATCTGGGGAAGGGTATCATTCCCTTCGATACGGAGACGGAAGTGGAAGGATCCATGGGGATTC
TCTTTATCAAATTCCCGATTAAGGTAATGTTGATTTTGATTCTGTTTTTCACAATAAATCAAAGCTTTACTCGGAATTGTGAATTGGTTCGATGTTGA
Protein sequenceShow/hide protein sequence
MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVRNKNFFSLDY
DYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLADLGKGIIPFDTETEVEGSMGILFIKFPIKVMLILILFFTINQSFTRNCELVRC