CuGenDBv2

Lsi06G008850 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi06G008850
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Genome location: chr06:18414262..18415167
RNA-Seq Expression: Lsi06G008850
Synteny: Lsi06G008850
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
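
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the Python below splits such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into its parts and compute the span length.

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1. The record itself does not state this.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

print(parse_location("chr06:18414262..18415167"))
# -> ('chr06', 18414262, 18415167, 906)
```

Under that assumption the gene spans 906 bp on chr06.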


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046596.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 1 [Cucumis melo var. makuwa] | e value: 1.3e-98 | % identity: 85.84
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MT+SS DDSVPVPYTLL  NAAQQNVVVLSLYRP PCRHRRLLRL A YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+P V+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIKFAIIEWMNS
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNG EVVHDVLYLL DLGKGIIPFDTETEVEGSMG+ FIK PIKFAII+W+NS
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIKFAIIEWMNS

Query:  SKTPFLCKQHDLNNAYSVL
        S++ FLCKQHD NN YS+L
Subjt:  SKTPFLCKQHDLNNAYSVL

XP_004148717.1 uncharacterized protein LOC101219269 [Cucumis sativus] | e value: 9.4e-89 | % identity: 89.53
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MTSSS DDSVPVPYTL+P NAAQQNVVVLSLYRPPPCRHRRLLRLCA YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+PVV+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS+GGRVSARGSSYVNATLDLNG EVVHDVLYLL DLGKGIIPFDTET+VEGSMG+ FIK PIK
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK

XP_008463384.1 PREDICTED: uncharacterized protein LOC103501551 [Cucumis melo] | e value: 8.2e-85 | % identity: 87.96
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MT+SS DDSVPVPYTLL  NAAQQNVVVLSLYRP PCRHRRLLRL A YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+P V+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNG EVVHDVLYLL DLGKGIIPFDTETEVEGSMG+ FIK PIK
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK

XP_022144909.1 uncharacterized protein LOC111014473 [Momordica charantia] | e value: 6.1e-80 | % identity: 82.72
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MTSSSRDDSVPVPY+LLP NAA QNVVVLSLYRPP  R RRLLRLCA YSAAFLLL  VAFLLFP+DPSLQLVRLKLNR+KV LLPV+ LDLSF AS+RV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
        RN NFFSLDY+Y+GVSVGYRGRRLG+VSS+GGRVSARG SYVNATLDLNGFEV+HD +YL+ DL  GI+PFDTETEVEG MG+ FIKFPIK
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK

XP_038878687.1 uncharacterized protein LOC120070868 [Benincasa hispida] | e value: 3.3e-86 | % identity: 89.64
Query:  MTSSSR-DDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPV-VALDLSFFASL
        MTSSSR DDSVPVPYTLLPQNAAQQNVVVLSLYR PPC+H RLLRLCALYSAAFLLLF VAFLLFP+DPS QLVRLKLN VKVHL+P  V+LDLSFFASL
Subjt:  MTSSSR-DDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPV-VALDLSFFASL

Query:  RVRNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
        RVRNKNFFSL YDYIGVSVGYRG+RLG+VSS+GGRVSARGSSYVNATLDLNG EVVHDVLYLL DLGKGIIPFDTETEVEGSMG+ FIKFPIK
Subjt:  RVRNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LTV4 LEA_2 domain-containing protein | e value: 4.6e-89 | % identity: 89.53
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MTSSS DDSVPVPYTL+P NAAQQNVVVLSLYRPPPCRHRRLLRLCA YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+PVV+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS+GGRVSARGSSYVNATLDLNG EVVHDVLYLL DLGKGIIPFDTET+VEGSMG+ FIK PIK
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK

A0A1S3CJK6 uncharacterized protein LOC103501551 | e value: 4.0e-85 | % identity: 87.96
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MT+SS DDSVPVPYTLL  NAAQQNVVVLSLYRP PCRHRRLLRL A YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+P V+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNG EVVHDVLYLL DLGKGIIPFDTETEVEGSMG+ FIK PIK
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK

A0A5A7TX90 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 1 | e value: 6.3e-99 | % identity: 85.84
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MT+SS DDSVPVPYTLL  NAAQQNVVVLSLYRP PCRHRRLLRL A YSAAFLLLF VAFLLFPSDPSLQLVRLKLNRVKVHL+P V+LDLSF  SLRV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIKFAIIEWMNS
        RNKNFFSL+Y+++GVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNG EVVHDVLYLL DLGKGIIPFDTETEVEGSMG+ FIK PIKFAII+W+NS
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIKFAIIEWMNS

Query:  SKTPFLCKQHDLNNAYSVL
        S++ FLCKQHD NN YS+L
Subjt:  SKTPFLCKQHDLNNAYSVL

A0A6J1CTN0 uncharacterized protein LOC111014473 | e value: 3.0e-80 | % identity: 82.72
Query:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV
        MTSSSRDDSVPVPY+LLP NAA QNVVVLSLYRPP  R RRLLRLCA YSAAFLLL  VAFLLFP+DPSLQLVRLKLNR+KV LLPV+ LDLSF AS+RV
Subjt:  MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRV

Query:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
        RN NFFSLDY+Y+GVSVGYRGRRLG+VSS+GGRVSARG SYVNATLDLNGFEV+HD +YL+ DL  GI+PFDTETEVEG MG+ FIKFPIK
Subjt:  RNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK

A0A6J1JI07 uncharacterized protein LOC111485280 | e value: 9.9e-76 | % identity: 79.27
Query:  SSSRDDSVPVPYTLLPQN-AAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVR
        S S+D S+PVPY+ +P N AA QNVVVLSLYRPP  R RRLLRLCALYSAAFLLL  V FLLFPSDPSLQLVRLKLN VKV LLP V LDLSF AS+RVR
Subjt:  SSSRDDSVPVPYTLLPQN-AAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVR

Query:  NKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIKFAI
        NKNFFSLDY+Y+GVSVG+RGRRLG+VSS GGRVSARGSSYVNATLDLNG +++HDV +LL DL KGIIPFDTETEVEGSMG+ FIKFPIK  +
Subjt:  NKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIKFAI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | e value: 1.7e-32 | % identity: 37.78
Query:  YTLLPQNAAQQ--NVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVRNKNFFSLDYD
        Y  LP +++ +  + V++S +  PP R R ++ +  +  A+ L+     ++ +PSDP ++++R+K++ V VH  PV ++D++   +L+V N + +S D+ 
Subjt:  YTLLPQNAAQQ--NVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVRNKNFFSLDYD

Query:  YIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
         + V++ YRG+ LG+VSS GG V+A GSSY++A  +L+G  V  DV++L+ DL KG + FDT TE  G +G+LF +FP+K
Subjt:  YIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | e value: 1.7e-32 | % identity: 37.78
Query:  YTLLPQNAAQQ--NVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVRNKNFFSLDYD
        Y  LP +++ +  + V++S +  PP R R ++ +  +  A+ L+     ++ +PSDP ++++R+K++ V VH  PV ++D++   +L+V N + +S D+ 
Subjt:  YTLLPQNAAQQ--NVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVRNKNFFSLDYD

Query:  YIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
         + V++ YRG+ LG+VSS GG V+A GSSY++A  +L+G  V  DV++L+ DL KG + FDT TE  G +G+LF +FP+K
Subjt:  YIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | e value: 6.2e-46 | % identity: 48.97
Query:  SSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHR-----RLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFAS
        +SS+ +   +PYT LP +   Q+V++L+ YR    RHR     R LR   L++A  LLL    +LL+PSDP + + R+ LN + V     +ALDLSF  +
Subjt:  SSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHR-----RLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFAS

Query:  LRVRNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
        ++VRN++FFSLDYD + VS+GYRGR LG V S+GG + AR SSY++ATL+L+G EVVHDV+YL+ DL KG+IPFDT  +V+G +G+L    PI+
Subjt:  LRVRNKNFFSLDYDYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIK
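
In the alignments above, the unlabeled middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch of how identities and positives can be counted from one block, the Python below takes a query line and its midline with the `Query:` prefix and leading padding already removed, so the two strings line up column by column. The function name `alignment_stats` is hypothetical, and the percentage it returns covers only the block passed in, not the whole-hit % identity reported in the tables.

```python
def alignment_stats(query, midline):
    """Count identities and positives for one aligned block.

    Assumption: `query` and `midline` are the same length and aligned
    column by column (prefixes such as 'Query:  ' already stripped).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # '+' marks a conservative substitution; positives include identities.
    positive = identical + midline.count("+")
    percent_identity = round(100 * identical / len(query), 2)
    return identical, positive, percent_identity

# Final block of the KAA0046596.1 alignment shown above.
print(alignment_stats("SKTPFLCKQHDLNNAYSVL", "S++ FLCKQHD NN YS+L"))
# -> (13, 16, 68.42)
```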


Sequences
CDS sequence
ATGACCTCCAGTTCCAGGGACGATTCTGTCCCTGTGCCCTACACTCTTCTTCCCCAAAATGCTGCACAGCAAAACGTCGTCGTTTTATCCCTCTACCGTCCCCCTCCATG
CCGACATCGGCGGCTTCTCCGCCTCTGTGCTCTCTACTCCGCCGCCTTCCTCCTCCTCTTCACCGTTGCTTTTCTACTTTTCCCCTCCGATCCTTCGCTCCAACTCGTCC
GATTGAAACTCAATCGTGTCAAAGTCCATTTGTTGCCTGTTGTCGCCCTTGACCTTTCTTTCTTTGCTTCTCTTAGGGTTCGCAATAAGAACTTCTTCTCTCTCGATTAC
GATTACATTGGGGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATATGTGAGCTCTCAGGGCGGTCGAGTTTCTGCTCGAGGCTCTTCTTATGTGAATGCCACTCTCGA
CTTGAATGGGTTTGAAGTCGTTCACGACGTCTTGTACTTGCTTGTGGATCTGGGGAAGGGTATCATTCCCTTCGATACGGAGACGGAAGTGGAAGGATCCATGGGGATTC
TCTTTATCAAATTCCCGATTAAGTTTGCAATCATTGAATGGATGAACAGTAGTAAGACTCCATTCCTTTGCAAACAGCACGATTTAAATAATGCTTATTCTGTTCTGGCG
ACCATTACACGTTTCTTAGTATAG
mRNA sequence
GTGTAAATCATTGATAGGCTGAAGCTACTACCTTAACCAAGGGCAAACGGAGTGTTTGGCTACAGCCACAATAAAACTCAGTTCCCATTTTAATTCTAATTCTCTCTAAG
CTTTAGCTAACAAACATGACCTCCAGTTCCAGGGACGATTCTGTCCCTGTGCCCTACACTCTTCTTCCCCAAAATGCTGCACAGCAAAACGTCGTCGTTTTATCCCTCTA
CCGTCCCCCTCCATGCCGACATCGGCGGCTTCTCCGCCTCTGTGCTCTCTACTCCGCCGCCTTCCTCCTCCTCTTCACCGTTGCTTTTCTACTTTTCCCCTCCGATCCTT
CGCTCCAACTCGTCCGATTGAAACTCAATCGTGTCAAAGTCCATTTGTTGCCTGTTGTCGCCCTTGACCTTTCTTTCTTTGCTTCTCTTAGGGTTCGCAATAAGAACTTC
TTCTCTCTCGATTACGATTACATTGGGGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATATGTGAGCTCTCAGGGCGGTCGAGTTTCTGCTCGAGGCTCTTCTTATGT
GAATGCCACTCTCGACTTGAATGGGTTTGAAGTCGTTCACGACGTCTTGTACTTGCTTGTGGATCTGGGGAAGGGTATCATTCCCTTCGATACGGAGACGGAAGTGGAAG
GATCCATGGGGATTCTCTTTATCAAATTCCCGATTAAGTTTGCAATCATTGAATGGATGAACAGTAGTAAGACTCCATTCCTTTGCAAACAGCACGATTTAAATAATGCT
TATTCTGTTCTGGCGACCATTACACGTTTCTTAGTATAG
Protein sequence
MTSSSRDDSVPVPYTLLPQNAAQQNVVVLSLYRPPPCRHRRLLRLCALYSAAFLLLFTVAFLLFPSDPSLQLVRLKLNRVKVHLLPVVALDLSFFASLRVRNKNFFSLDY
DYIGVSVGYRGRRLGYVSSQGGRVSARGSSYVNATLDLNGFEVVHDVLYLLVDLGKGIIPFDTETEVEGSMGILFIKFPIKFAIIEWMNSSKTPFLCKQHDLNNAYSVLA
TITRFLV
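
The protein sequence above corresponds to the CDS read codon by codon. The Python sketch below illustrates that relationship under the assumption that the standard nuclear genetic code applies; the helper name `translate` is hypothetical. It is checked here only against the first 30 and the last 24 nucleotides of the CDS shown above, not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino = CODON_TABLE[cds[i:i + 3]]
        if amino == "*":  # stop codon ends translation
            break
        protein.append(amino)
    return "".join(protein)

# First 30 nt of the CDS above.
print(translate("ATGACCTCCAGTTCCAGGGACGATTCTGTC"))  # -> MTSSSRDDSV
# Last 24 nt of the CDS above, ending in the TAG stop codon.
print(translate("ACCATTACACGTTTCTTAGTATAG"))        # -> TITRFLV
```

Both fragments agree with the start (`MTSSSRDDSV`) and end (`TITRFLV`) of the protein sequence listed above.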