; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G020930 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G020930
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionUnknown protein
Genome locationchr04:28117924..28118785
RNA-Seq ExpressionLsi04G020930
SyntenyLsi04G020930
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145517.1 uncharacterized protein LOC101216814 isoform X2 [Cucumis sativus]1.5e-4882.81Show/hide
Query:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG
        ML +DQWL+AAMAD+TLVA+LL RLKQSQAVLPSKS LPM VPFTWGI+QPRSRMSTATA A     TV VRC DVVL+RN+KDVDSTRCSPTTPLSWSG
Subjt:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG

Query:  GASPSATLDGYEESSRPVTLSHAASRFK
        GASPSATLDG+EESSRP TLS AASRFK
Subjt:  GASPSATLDGYEESSRPVTLSHAASRFK

XP_016901256.1 PREDICTED: uncharacterized protein LOC103493773 isoform X1 [Cucumis melo]4.0e-4983.59Show/hide
Query:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG
        ML  DQWL+AAMADDTLVA+LL RLKQSQAVLPSKS LPM VPFTWGI+QPRSRMSTATA A     TV+VRC DVVL+RN+KDVDSTRCSPTTPLSWSG
Subjt:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG

Query:  GASPSATLDGYEESSRPVTLSHAASRFK
        GASPSATLDG+EESSRP TLS AASRFK
Subjt:  GASPSATLDGYEESSRPVTLSHAASRFK

XP_022976190.1 uncharacterized protein LOC111476653 [Cucurbita maxima]8.8e-4171.88Show/hide
Query:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG
        M+  D+WLTAAMA+D++VA+LL RLKQSQA  PSKS  PM +PF WG+RQPRSR        A AM  V VRC DVVLQRN+KDVDSTRCSPTTPLSWSG
Subjt:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG

Query:  GASPSATLDGYEESSRPVTLSHAASRFK
        GASPSATLDGYE SSR  TL H ASRFK
Subjt:  GASPSATLDGYEESSRPVTLSHAASRFK

XP_031739969.1 uncharacterized protein LOC101216814 isoform X1 [Cucumis sativus]1.2e-4880Show/hide
Query:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG
        ML +DQWL+AAMAD+TLVA+LL RLKQSQAVLPSKS LPM VPFTWGI+QPRSRMSTATA A     TV VRC DVVL+RN+KDVDSTRCSPTTPLSWSG
Subjt:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG

Query:  GASPSATLDGYEESSRPVTLSHAASRFKFSNLNAG
        GASPSATLDG+EESSRP TLS AASRFK     AG
Subjt:  GASPSATLDGYEESSRPVTLSHAASRFKFSNLNAG

XP_038897737.1 uncharacterized protein LOC120085676 [Benincasa hispida]3.8e-5282.27Show/hide
Query:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTAT-----AAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTP
        ML +DQWLTAAMADDTLVAELLFRLKQSQAVLPSKS LP+ VPFTWGIRQPRSRMSTAT     AAAA AM TVAVRC DVVL RN+KDVDSTRCSPTTP
Subjt:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTAT-----AAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTP

Query:  LSWSGGASPSATLDGYEESSRPVTLSHAASRFKFSNLNAGV
        LSWSGGASPSATLDG+E+SSRP TLS AASRFK +  N  V
Subjt:  LSWSGGASPSATLDGYEESSRPVTLSHAASRFKFSNLNAGV

TrEMBL top hitse value%identityAlignment
A0A0A0L0G9 Uncharacterized protein1.2e-4383.76Show/hide
Query:  MADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSGGASPSATLDGY
        MAD+TLVA+LL RLKQSQAVLPSKS LPM VPFTWGI+QPRSRMSTATA A     TV VRC DVVL+RN+KDVDSTRCSPTTPLSWSGGASPSATLDG+
Subjt:  MADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSGGASPSATLDGY

Query:  EESSRPVTLSHAASRFK
        EESSRP TLS AASRFK
Subjt:  EESSRPVTLSHAASRFK

A0A1S4DZ38 uncharacterized protein LOC103493773 isoform X11.9e-4983.59Show/hide
Query:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG
        ML  DQWL+AAMADDTLVA+LL RLKQSQAVLPSKS LPM VPFTWGI+QPRSRMSTATA A     TV+VRC DVVL+RN+KDVDSTRCSPTTPLSWSG
Subjt:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG

Query:  GASPSATLDGYEESSRPVTLSHAASRFK
        GASPSATLDG+EESSRP TLS AASRFK
Subjt:  GASPSATLDGYEESSRPVTLSHAASRFK

A0A5A7VE69 Uncharacterized protein1.9e-4983.59Show/hide
Query:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG
        ML  DQWL+AAMADDTLVA+LL RLKQSQAVLPSKS LPM VPFTWGI+QPRSRMSTATA A     TV+VRC DVVL+RN+KDVDSTRCSPTTPLSWSG
Subjt:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG

Query:  GASPSATLDGYEESSRPVTLSHAASRFK
        GASPSATLDG+EESSRP TLS AASRFK
Subjt:  GASPSATLDGYEESSRPVTLSHAASRFK

A0A6J1FJI3 uncharacterized protein LOC1114460212.1e-4071.43Show/hide
Query:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG
        M+ ND WLTAAMADD +V ELL RLKQSQAV PSKS LP+MVPFTWGIRQ RSR +T              RC D VL RN+KDVDSTR SPTTPLSWSG
Subjt:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG

Query:  GASPSATLDGYEESSRPVTLSHAASRFKFSNLN
        G SPSATLDGYEESSRP TLSHAASRFK +  N
Subjt:  GASPSATLDGYEESSRPVTLSHAASRFKFSNLN

A0A6J1IIT2 uncharacterized protein LOC1114766534.3e-4171.88Show/hide
Query:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG
        M+  D+WLTAAMA+D++VA+LL RLKQSQA  PSKS  PM +PF WG+RQPRSR        A AM  V VRC DVVLQRN+KDVDSTRCSPTTPLSWSG
Subjt:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG

Query:  GASPSATLDGYEESSRPVTLSHAASRFK
        GASPSATLDGYE SSR  TL H ASRFK
Subjt:  GASPSATLDGYEESSRPVTLSHAASRFK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15800.1 unknown protein1.0e-1038.4Show/hide
Query:  WLTAAMADDTLVAELLFRLKQSQAVLP-SKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWS------
        W+  AM DD+LVAE L  L  ++  LP +KS     +   W +RQPR++ +T                      R   D D TR SPTTPLSWS      
Subjt:  WLTAAMADDTLVAELLFRLKQSQAVLP-SKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWS------

Query:  -GGASPSATLDGYEESSRPVTLSHA
         GG   +A +DG+EESS  V LS A
Subjt:  -GGASPSATLDGYEESSRPVTLSHA

AT1G80610.1 unknown protein1.1e-0733.1Show/hide
Query:  LKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSGG
        +  + W+  AM+DD++VAE L RL+ S                     +P++R+        +A P   ++    V QR SK  D TR SPTTPLSWSG 
Subjt:  LKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSGG

Query:  ASPS------------ATLDGYEESSRPVTLSHAASRFKFSNLNA
         S S             T++G EESS  V  S    R K S  +A
Subjt:  ASPS------------ATLDGYEESSRPVTLSHAASRFKFSNLNA

AT4G32030.1 unknown protein1.2e-1443.55Show/hide
Query:  DQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSGG---
        D W+  A+ DD LV ELL RLK +  V+ S +   ++ P  WGIRQ RSR S                    VL    KDVDS R SP TPLSWSGG   
Subjt:  DQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSGG---

Query:  ----ASPSATLDGYEESSRPVTLS
            ASPSA  DG+E++SR  + S
Subjt:  ----ASPSATLDGYEESSRPVTLS

AT4G32030.2 unknown protein1.2e-1443.55Show/hide
Query:  DQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSGG---
        D W+  A+ DD LV ELL RLK +  V+ S +   ++ P  WGIRQ RSR S                    VL    KDVDS R SP TPLSWSGG   
Subjt:  DQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSGG---

Query:  ----ASPSATLDGYEESSRPVTLS
            ASPSA  DG+E++SR  + S
Subjt:  ----ASPSATLDGYEESSRPVTLS

AT5G25210.1 unknown protein1.9e-1233.59Show/hide
Query:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG
        M+ +D W   AM D  +VAELL +LK+++ +       P++    WGI+QPRSR                            +    +RCSP+TPLSWSG
Subjt:  MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSG

Query:  G-----ASPSATLDGYEESSRPVTLSHAASR
        G     +SPS  +DGYE +SR ++   + S+
Subjt:  G-----ASPSATLDGYEESSRPVTLSHAASR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCAAGAACGACCAATGGCTTACGGCTGCCATGGCCGACGACACTCTCGTCGCCGAGTTATTGTTCAGGTTGAAGCAGTCCCAGGCTGTATTGCCTTCCAAATCTTC
TCTTCCTATGATGGTGCCGTTTACTTGGGGGATTAGGCAACCCAGGTCTAGGATGTCTACGGCCACGGCGGCAGCGGCGGAAGCTATGCCTACGGTGGCTGTCAGATGTG
CCGATGTTGTCTTGCAGAGGAATAGTAAGGATGTTGACTCCACCAGATGCAGTCCTACGACGCCGCTTTCATGGAGCGGTGGAGCTTCTCCTTCCGCTACGCTTGACGGT
TATGAGGAGTCAAGCCGTCCCGTTACTCTCTCTCATGCTGCTTCCAGATTTAAGTTCTCAAATCTTAACGCTGGCGTTCTTAATTACGTCTTTTACTCGGCGGTGGTGTC
GGATGTGGATTTTAGGCGGCCGGCGGCGGCGTGGATTCTCAGA
mRNA sequenceShow/hide mRNA sequence
AAAAAGAGTGAGAGTGGGAGGCCCACGCCGAAGCTGACAAGGAAGTGGAACAAAGAAGGGAAACCTGAACAACCACCACATAGCAAACCCCACTCTCTCTCTTTCCTTCT
CTTTTTATCTATCTTCCTTCGTTCTCTCCCCCAATATTTATACGCTCCTTTACTCTCTCCGCCACCCTCTTCTCTTTTTTCTCTCTCTTTTCTTTCCCCCATTTCCCTTT
TTTCTCTCCGCCGATCATGCTCAAGAACGACCAATGGCTTACGGCTGCCATGGCCGACGACACTCTCGTCGCCGAGTTATTGTTCAGGTTGAAGCAGTCCCAGGCTGTAT
TGCCTTCCAAATCTTCTCTTCCTATGATGGTGCCGTTTACTTGGGGGATTAGGCAACCCAGGTCTAGGATGTCTACGGCCACGGCGGCAGCGGCGGAAGCTATGCCTACG
GTGGCTGTCAGATGTGCCGATGTTGTCTTGCAGAGGAATAGTAAGGATGTTGACTCCACCAGATGCAGTCCTACGACGCCGCTTTCATGGAGCGGTGGAGCTTCTCCTTC
CGCTACGCTTGACGGTTATGAGGAGTCAAGCCGTCCCGTTACTCTCTCTCATGCTGCTTCCAGATTTAAGTTCTCAAATCTTAACGCTGGCGTTCTTAATTACGTCTTTT
ACTCGGCGGTGGTGTCGGATGTGGATTTTAGGCGGCCGGCGGCGGCGTGGATTCTCAGA
Protein sequenceShow/hide protein sequence
MLKNDQWLTAAMADDTLVAELLFRLKQSQAVLPSKSSLPMMVPFTWGIRQPRSRMSTATAAAAEAMPTVAVRCADVVLQRNSKDVDSTRCSPTTPLSWSGGASPSATLDG
YEESSRPVTLSHAASRFKFSNLNAGVLNYVFYSAVVSDVDFRRPAAAWILR