; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G020020 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G020020
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDUF4050 domain-containing protein
Genome locationchr03:31388968..31392352
RNA-Seq ExpressionLsi03G020020
SyntenyLsi03G020020
Gene Ontology termsNA
InterPro domainsIPR025124 - Domain of unknown function DUF4050


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137841.1 uncharacterized protein LOC101221441 isoform X2 [Cucumis sativus]4.5e-5293.64Show/hide
Query:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV
        MEKMEIN RNPNLNGNGNHSS DSKVALNGKSNE PTF+NHAEIAWHERRREWVGDR+ENVQR PMEPILSWTTTYEDLL TAEPFQQPIPLAEMVDFLV
Subjt:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV

Query:  DIWHEDGLYD
        DIWHEDGLYD
Subjt:  DIWHEDGLYD

XP_008442713.1 PREDICTED: uncharacterized protein LOC103486506 isoform X1 [Cucumis melo]7.1e-5090.91Show/hide
Query:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV
        MEK EIN  NPNLNGNGNHSSGDSKVALNGKSNE PTF+NHAEIAWHERR+EWVGDRSENVQR P EPILSWT TYEDLL TAEPFQQPIPLAEMVDFLV
Subjt:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV

Query:  DIWHEDGLYD
        DIWHEDGLYD
Subjt:  DIWHEDGLYD

XP_022144600.1 uncharacterized protein LOC111014243 isoform X2 [Momordica charantia]5.5e-5091.59Show/hide
Query:  MEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIW
        MEIN RNPNLNGNGNHSSGD+KVA+NGKSNET TFVNHAEI WHERRREWVGDRSEN QRAPMEPILSWTTTYEDLL +AEPF+QPIPLAEMVDFLVDIW
Subjt:  MEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIW

Query:  HEDGLYD
        HEDGLYD
Subjt:  HEDGLYD

XP_031739473.1 uncharacterized protein LOC101221441 isoform X1 [Cucumis sativus]9.3e-5086.55Show/hide
Query:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPIL---------SWTTTYEDLLSTAEPFQQPIP
        MEKMEIN RNPNLNGNGNHSS DSKVALNGKSNE PTF+NHAEIAWHERRREWVGDR+ENVQR PMEPIL         SWTTTYEDLL TAEPFQQPIP
Subjt:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPIL---------SWTTTYEDLLSTAEPFQQPIP

Query:  LAEMVDFLVDIWHEDGLYD
        LAEMVDFLVDIWHEDGLYD
Subjt:  LAEMVDFLVDIWHEDGLYD

XP_038903277.1 uncharacterized protein LOC120089909 [Benincasa hispida]5.3e-5393.64Show/hide
Query:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV
        MEKMEIN RNPNLNG+GNHSSGDSKVALNGKSNETP F+NHAEIAWHERRREWVGD SENVQRAPMEPILSWTTTY DLL TAEPFQQPIPLAEMVDFLV
Subjt:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV

Query:  DIWHEDGLYD
        DIWHEDGLYD
Subjt:  DIWHEDGLYD

TrEMBL top hitse value%identityAlignment
A0A0A0LAJ4 DUF4050 domain-containing protein2.2e-5293.64Show/hide
Query:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV
        MEKMEIN RNPNLNGNGNHSS DSKVALNGKSNE PTF+NHAEIAWHERRREWVGDR+ENVQR PMEPILSWTTTYEDLL TAEPFQQPIPLAEMVDFLV
Subjt:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV

Query:  DIWHEDGLYD
        DIWHEDGLYD
Subjt:  DIWHEDGLYD

A0A1S3B705 uncharacterized protein LOC103486506 isoform X13.4e-5090.91Show/hide
Query:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV
        MEK EIN  NPNLNGNGNHSSGDSKVALNGKSNE PTF+NHAEIAWHERR+EWVGDRSENVQR P EPILSWT TYEDLL TAEPFQQPIPLAEMVDFLV
Subjt:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV

Query:  DIWHEDGLYD
        DIWHEDGLYD
Subjt:  DIWHEDGLYD

A0A5A7TM85 Uncharacterized protein3.4e-5090.91Show/hide
Query:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV
        MEK EIN  NPNLNGNGNHSSGDSKVALNGKSNE PTF+NHAEIAWHERR+EWVGDRSENVQR P EPILSWT TYEDLL TAEPFQQPIPLAEMVDFLV
Subjt:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLV

Query:  DIWHEDGLYD
        DIWHEDGLYD
Subjt:  DIWHEDGLYD

A0A6J1CS38 uncharacterized protein LOC111014243 isoform X22.6e-5091.59Show/hide
Query:  MEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIW
        MEIN RNPNLNGNGNHSSGD+KVA+NGKSNET TFVNHAEI WHERRREWVGDRSEN QRAPMEPILSWTTTYEDLL +AEPF+QPIPLAEMVDFLVDIW
Subjt:  MEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIW

Query:  HEDGLYD
        HEDGLYD
Subjt:  HEDGLYD

A0A6J1FBC2 uncharacterized protein LOC111442437 isoform X11.1e-4889.19Show/hide
Query:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRA-PMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFL
        MEKMEI+  NPNLNGNGNHSS DSKVA+N KSNET  FVNHAEIAWHE+RREWVGDR ENVQRA PMEPILSWTTTYEDLL TAEPFQQPIPLAEMVDFL
Subjt:  MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRA-PMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFL

Query:  VDIWHEDGLYD
        VDIWHEDGLYD
Subjt:  VDIWHEDGLYD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G15770.1 unknown protein7.7e-1851.9Show/hide
Query:  SNETPTFVNHAEIAWHERRREWVGD-RSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIWHEDGLY
        SNE   FVNH  + W++ R++WVGD RSE+ +    EPIL+   TYE LL + + F +PIPL EMV FLV++W E+GLY
Subjt:  SNETPTFVNHAEIAWHERRREWVGD-RSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIWHEDGLY

AT3G15770.2 unknown protein7.7e-1851.9Show/hide
Query:  SNETPTFVNHAEIAWHERRREWVGD-RSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIWHEDGLY
        SNE   FVNH  + W++ R++WVGD RSE+ +    EPIL+   TYE LL + + F +PIPL EMV FLV++W E+GLY
Subjt:  SNETPTFVNHAEIAWHERRREWVGD-RSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIWHEDGLY

AT3G54880.1 unknown protein9.3e-2454.55Show/hide
Query:  DSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIWHEDGLYD
        D K ++   S  T T VNH    W E R +WVGD+S   +    + I+SW+TTYEDLLST EPF + IPL EMVDFLVDIW+++GLYD
Subjt:  DSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIWHEDGLYD

AT5G03440.1 unknown protein2.1e-2352.34Show/hide
Query:  MEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIW
        ME N    N++ N   SS D +     KS+E   FVNHAEIAW E R++WVGD S      P EP++ +  TYEDLL++  PF +PIPLAEMVDFL DIW
Subjt:  MEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIW

Query:  HEDGLYD
        H DGL++
Subjt:  HEDGLYD

AT5G03440.2 unknown protein2.1e-2352.34Show/hide
Query:  MEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIW
        ME N    N++ N   SS D +     KS+E   FVNHAEIAW E R++WVGD S      P EP++ +  TYEDLL++  PF +PIPLAEMVDFL DIW
Subjt:  MEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIW

Query:  HEDGLYD
        H DGL++
Subjt:  HEDGLYD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAAATGGAAATCAATTGTAGAAATCCTAATTTGAATGGAAATGGGAACCATTCTTCGGGTGATTCAAAGGTTGCCTTGAATGGCAAGTCTAACGAGACGCCTAC
ATTCGTCAACCACGCGGAAATAGCTTGGCATGAAAGAAGAAGAGAATGGGTTGGTGACCGTTCTGAAAATGTGCAACGAGCACCAATGGAACCGATCTTGAGTTGGACAA
CGACTTATGAAGATCTTCTTTCAACTGCAGAGCCTTTTCAGCAACCTATTCCTTTAGCTGAAATGGTGGACTTCTTGGTTGATATCTGGCATGAAGATGGCCTCTATGAT
TAG
mRNA sequenceShow/hide mRNA sequence
CTAAAACAGAGAGGAAAAAAAAGAAGGGGGAATTTGATGATGACCTCCGGGAAGATCCGGAAGGCTCCATTATTGGTTCAATAAAAACTTAAATGAAGATCGTACGATCT
GTACCAGTCCCCAGCATTACCAAACTTCATAAAGTTGGATCATAATTGAATGGCAATGGGGTAACATTTTGAACGGTCTACAGAAGATGATAATCCGTATCCCCCAGAAA
AATACAAAAGTGAAATGCCATTAAAACAAGAGACCCAAAATGTAGGATTCCCAAATCCACCGCACCCCAGACTCTGTTCTTCTTCTTCATATGTTGGGACCCCAGATGGG
GTTTTCACAGTAACCTTCAACCACACATCTCTTTGCACTTGGCAGCATTTCACCCGTTCGATCAAACGAAACTGTCAGTATTCTTCGCCGTTTTGTGCAGAAGAATCCAT
TTTCTTCCCCCCTTTTTTCTGTAGCTAATGCTGAAAAGGGGGGACTGTTTCTTCTTCGGTAATTTCCTTAAAGATCAACCCCAATCTCAACGTTTCACTACAGGAGGATT
CACTGTTTACTCCTTTTTTCAGGGTCAAAAGGTGTTGTTTCGTGTAATGAAGAGAATGGATATTAAGCTAGAGCAGCAGAGAGTGCACTATTTTCAGTGATAGAGTGGAA
TATCCCTGGGGCAAGGGTCACTTCCTTGACATTTTGAATTTTATAGGATTAGTTTGAAGAGTAAGTGATTAGCTTTCAGTTTTGAACTAGACTCGGCACGATTAATGGAT
TCGGAATTTCGAGTAGTCTGAGGGAACCCTGTGGGGTTTTTGTTGGAGGAACAAACTTGATGGGACTAAGGAGTTGGCCGGATGAGTCAAGCAAGAAAATACTATTTTAT
GTTTCTGTAATTACCGTATCTCTCAGGAATAATTAGAGTTGTAGAAGCTGATATGCCATGCCATGAAACTAGAGAAGAACAAAAGAGCATTTTCTTTTCTTCTCTGCTGC
CTTTTAGAGTCTAAAAGATGATGACTGCACACAAGTAAAAGGGTCAAAATTGTTTTGGAGAAATGGGTTAAATTCCTGAGCAGGCAAGTCGGAAGGTCGAAGCACACATT
TCGCACTTTGGTTCATTGTCGAGCAAGGCAATCCCACTGAAATATGTTTCTTTTCTGCTCTGGTGTAATTTGTATTTGGTATAGGCTTTCAATGTCTTCATAGTTTTTTT
AAGTAGTAAAACACTTTCTAAATTCTTCCTTGACATCTGGCTAGATGCATATCAAAAAGTGTTTTAAGAAGCACTGAAGTGTTTATGGAGTAAACATTTGATTAGTGTTT
GAAGTTTTTACTTAATTTACTCATAAACACTACAAAATCTATGCCGAACACAATCTTAGCTGCTCCCACAGTTTCTTAGACACTTGAGTGTCTTTTCTTTTCATTTTCTA
TTTCTTACGCACAACACCAAATGCTGGTGGTTGTTTCTTGTTCTAAGTGAATGCAACGAAAAGACAGGAAATAATGAATTGGTTTCTTAGCTCTTTCTTCTTTCGGAGCA
TTTATCCCTTATTTTTTCAGCAAGTGAGAACGTACTTATTATTATAGAAGCTTTGCAACTAACAAATTTTATGGTTTTAGAATATACTATAAGTGTTATATTATAGCAGT
GTCTTCCCAAAGTATATTGTTCAAAACTTTATGCATTCATACAATTTTGTTGCAAGGGTTTTAAAACATGTCTTCTTATTAGGATTCAAACTCGGTTACGTATATGTAAT
ATTTTATTACTAATTATTGGAACAGCAAAATAGTTGATTGGAATTAGTTAGCACGAGACAAAGGTTTGTGTGATCTTGTGTAGTATGAGATGTGCAAGGGTGTAAAAATT
AACAAGATTTCTCTTCAATAGGGTGGAAATTTATGAACTTTACATTATATGATAGGGGCAAATATTTCAAATTAATCTATGTTTGCTCTTTATTAAAATATGAAAGCAGA
TACGTAAAAAGTAAATGTGTATTGTGCATATAGTATAACATTGACAATTTTAAAGGATAACAACAAATTGTCTCAAACATCAATCAAAACCATACTCTTGTCTCAATTTG
TTTTTCTCTATTCTCCATGAATTCTCCTGAAGGGACTCATTATCTTTATCTCATGTACAGGTCACAGTTTTTCAAGGCAATGGTTGTTCATTATGGAGAAAATGGAAATC
AATTGTAGAAATCCTAATTTGAATGGAAATGGGAACCATTCTTCGGGTGATTCAAAGGTTGCCTTGAATGGCAAGTCTAACGAGACGCCTACATTCGTCAACCACGCGGA
AATAGCTTGGCATGAAAGAAGAAGAGAATGGGTTGGTGACCGTTCTGAAAATGTGCAACGAGCACCAATGGAACCGATCTTGAGTTGGACAACGACTTATGAAGATCTTC
TTTCAACTGCAGAGCCTTTTCAGCAACCTATTCCTTTAGCTGAAATGGTGGACTTCTTGGTTGATATCTGGCATGAAGATGGCCTCTATGATTAGGCATTCTCATATAAT
CAATCTATCAGATTTGCATTTTTCTTTTTTCTTTGCTCCTTTTTCAAGGCTTCTATATTAATTTTTGCTATTATCATTGAACAAATTGATGATCAGAATAGATGGTTCTC
TTGTACTCTATATACTCTGATTTGTAACTAACAAAGTGAAGATTATGATTTTTTGTT
Protein sequenceShow/hide protein sequence
MEKMEINCRNPNLNGNGNHSSGDSKVALNGKSNETPTFVNHAEIAWHERRREWVGDRSENVQRAPMEPILSWTTTYEDLLSTAEPFQQPIPLAEMVDFLVDIWHEDGLYD