; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G002650 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G002650
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionHeavy metal-associated domain containing protein
Genome locationchr04:2733810..2735092
RNA-Seq ExpressionLsi04G002650
SyntenyLsi04G002650
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR036163 - Heavy metal-associated domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013033.1 hypothetical protein SDJN02_25789 [Cucurbita argyrosperma subsp. argyrosperma]8.3e-3487.76Show/hide
Query:  RSMAAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIP-SRVSTIKRLFGSSCR
        ++ AAAGIGVRK KIFPHASSLASIESLSLPLVQEIV  ADI CAECQKKLANIL+KMNDTESVVVNLLDKKVILTR+LQIP SRVSTIKRL GSS R
Subjt:  RSMAAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIP-SRVSTIKRLFGSSCR

KGN52069.1 hypothetical protein Csa_009028 [Cucumis sativus]7.5e-3591.21Show/hide
Query:  GIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIPSRVSTIKRLFGSSCR
        GIGVRK++IFPHASSLASIESL+LPLVQEIVLTADI CAECQKKLANILSKMNDTESVVVNLLDKKVILTRR QIPSRVSTI+RLF S CR
Subjt:  GIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIPSRVSTIKRLFGSSCR

XP_022968296.1 uncharacterized protein LOC111467567 isoform X3 [Cucurbita maxima]1.7e-3491.58Show/hide
Query:  AAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIP-SRVSTIKRLFGSSCR
        AAAGIGVRK KIFPHASSLASIESLSLPLVQEIV TADI CAECQKKLANILSKMNDTESVVVNLLDKKVILTR+LQIP SRVST+KRL GSS R
Subjt:  AAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIP-SRVSTIKRLFGSSCR

XP_023542320.1 uncharacterized protein LOC111802252 [Cucurbita pepo subsp. pepo]1.1e-3390.53Show/hide
Query:  AAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIP-SRVSTIKRLFGSSCR
        AAAGIGVRK KIFPHASSLASIESLSLPLVQEIV  ADI CAECQKKLANIL+KMNDTESVVVNLLDKKVILTR+LQIP SRVSTIKRL GSS R
Subjt:  AAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIP-SRVSTIKRLFGSSCR

XP_038892707.1 uncharacterized protein LOC120081691 [Benincasa hispida]9.5e-3894.68Show/hide
Query:  AAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIPSRVSTIKRLFGSSCR
        AAAGIGVRK+KIFPHASSLASIESLSLPLVQEIVLTADI CAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIPSR+STIKRL GS CR
Subjt:  AAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIPSRVSTIKRLFGSSCR

TrEMBL top hitse value%identityAlignment
A0A0A0KT62 Uncharacterized protein3.6e-3591.21Show/hide
Query:  GIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIPSRVSTIKRLFGSSCR
        GIGVRK++IFPHASSLASIESL+LPLVQEIVLTADI CAECQKKLANILSKMNDTESVVVNLLDKKVILTRR QIPSRVSTI+RLF S CR
Subjt:  GIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIPSRVSTIKRLFGSSCR

A0A6J1DB71 uncharacterized protein LOC1110187073.2e-3181Show/hide
Query:  AAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIP-------SRVSTIKRLFGSSCR
        AA IGVRK +IFPHASSLASIESLSLPLVQEIVLTADIRC ECQ KLANILSK+NDTESVVVNLL+KKVILTRRLQ+P       S+V+TIKRL GSS R
Subjt:  AAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIP-------SRVSTIKRLFGSSCR

A0A6J1G176 uncharacterized protein LOC111449743 isoform X13.3e-2869.17Show/hide
Query:  AAAGIGVRKQKIFPHASSLASIESLSLPL---------------------------VQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILT
        AAAGIGVRK KIFPHASSLAS+ESLSLPL                           VQEIV  ADI CAECQKKLANIL+KMNDTESVVVNLLDKKVILT
Subjt:  AAAGIGVRKQKIFPHASSLASIESLSLPL---------------------------VQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILT

Query:  RRLQI-PSRVSTIKRLFGSS
        R+LQI  SRVSTIKRL GSS
Subjt:  RRLQI-PSRVSTIKRLFGSS

A0A6J1G188 uncharacterized protein LOC111449743 isoform X31.3e-3289.25Show/hide
Query:  AAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQI-PSRVSTIKRLFGSS
        AAAGIGVRK KIFPHASSLAS+ESLSLPLVQEIV  ADI CAECQKKLANIL+KMNDTESVVVNLLDKKVILTR+LQI  SRVSTIKRL GSS
Subjt:  AAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQI-PSRVSTIKRLFGSS

A0A6J1HUG9 uncharacterized protein LOC111467567 isoform X38.1e-3591.58Show/hide
Query:  AAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIP-SRVSTIKRLFGSSCR
        AAAGIGVRK KIFPHASSLASIESLSLPLVQEIV TADI CAECQKKLANILSKMNDTESVVVNLLDKKVILTR+LQIP SRVST+KRL GSS R
Subjt:  AAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIP-SRVSTIKRLFGSSCR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G68585.1 unknown protein4.6e-1450Show/hide
Query:  SSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILT------RRLQ------IPSRVSTIKRL
        +SLAS+ SLS+PL+QEIVL+ADIRC++CQ+K+A+I+++M +T S++V++L+KKV LT      RR+       +  ++STIKRL
Subjt:  SSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILT------RRLQ------IPSRVSTIKRL

AT2G35730.1 Heavy metal transport/detoxification superfamily protein1.5e-0440.3Show/hide
Query:  SLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIPSRVST
        +L   E LSLP  Q I + AD+ C  CQ +++ I+SKM   E  VV+ L KK+++ R    P  VS+
Subjt:  SLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIPSRVST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGAGCATGGCGGCGGCCGGAATCGGAGTTCGGAAGCAGAAGATTTTCCCTCATGCTTCTAGCCTGGCTTCCATTGAGTCCTTATCTCTGCCTCTTGTTCAGGAAAT
TGTATTAACAGCTGATATTCGGTGTGCTGAATGTCAGAAGAAACTGGCCAATATACTCTCCAAAATGAATGATACAGAGTCTGTGGTGGTGAATTTGTTGGACAAGAAAG
TGATATTGACTCGAAGATTGCAGATTCCATCCAGAGTTTCAACGATCAAGAGATTGTTTGGTTCTTCTTGCAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGCGGAGCATGGCGGCGGCCGGAATCGGAGTTCGGAAGCAGAAGATTTTCCCTCATGCTTCTAGCCTGGCTTCCATTGAGTCCTTATCTCTGCCTCTTGTTCAGGAAAT
TGTATTAACAGCTGATATTCGGTGTGCTGAATGTCAGAAGAAACTGGCCAATATACTCTCCAAAATGAATGATACAGAGTCTGTGGTGGTGAATTTGTTGGACAAGAAAG
TGATATTGACTCGAAGATTGCAGATTCCATCCAGAGTTTCAACGATCAAGAGATTGTTTGGTTCTTCTTGCAGATAA
Protein sequenceShow/hide protein sequence
MRSMAAAGIGVRKQKIFPHASSLASIESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQIPSRVSTIKRLFGSSCR