CuGenDBv2

Lsi04G000030 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G000030
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Protein of unknown function (DUF3531)
Genome location: chr04:46945..47439
RNA-Seq Expression: Lsi04G000030
Synteny: Lsi04G000030
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0034098.1 hypothetical protein E6C27_scaffold65G00860 [Cucumis melo var. makuwa]; e value: 2.7e-43; % identity: 79.07
Query:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTD-AKSSGTSARSRRLLKLREEKRKREHDRL
        MSV  GVGLGL FTN N +C F SNTR FPQS+ S PEFHPISLRSR LL ENG+DSKFD  STSTPTTTD  KSSGTSARSRRLLKLREEKRKREHDRL
Subjt:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTD-AKSSGTSARSRRLLKLREEKRKREHDRL

Query:  HNYPAWAK--------VLEDACKNDAELR
        HNYPAWAK        VLEDACKNDAELR
Subjt:  HNYPAWAK--------VLEDACKNDAELR

XP_004147093.1 uncharacterized protein LOC101211689 [Cucumis sativus]; e value: 7.5e-46; % identity: 84.3
Query:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDA-KSSGTSARSRRLLKLREEKRKREHDRL
        MSVF G+GLGL FTN N +C F SNTR FPQS+TS PEF PISLRSR LL ENG+DSKFD  STSTPT TDA KSSGTSARSRRLLKLREEKRKREHDRL
Subjt:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDA-KSSGTSARSRRLLKLREEKRKREHDRL

Query:  HNYPAWAKVLEDACKNDAELR
        HNYPAWAKVLEDACKNDAELR
Subjt:  HNYPAWAKVLEDACKNDAELR

XP_008445913.1 PREDICTED: uncharacterized protein LOC103488796 [Cucumis melo]; e value: 1.7e-45; % identity: 84.3
Query:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTD-AKSSGTSARSRRLLKLREEKRKREHDRL
        MSV  GVGLGL FTN N +C F SNTR FPQS+ S PEFHPISLRSR LL ENG+DSKFD  STSTPTTTD  KSSGTSARSRRLLKLREEKRKREHDRL
Subjt:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTD-AKSSGTSARSRRLLKLREEKRKREHDRL

Query:  HNYPAWAKVLEDACKNDAELR
        HNYPAWAKVLEDACKNDAELR
Subjt:  HNYPAWAKVLEDACKNDAELR

XP_022139315.1 uncharacterized protein LOC111010260 isoform X1 [Momordica charantia]; e value: 3.4e-46; % identity: 84.17
Query:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLH
        MSVFKGVGLGL F NAN SC FPSN RIF +SVTSN EF PISLR R LL ENGNDSKFDT+S+STPT  DAK SGT+AR RRLLKLREEKRKREHDRLH
Subjt:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLH

Query:  NYPAWAKVLEDACKNDAELR
        NYPAWAKVLEDACKNDAELR
Subjt:  NYPAWAKVLEDACKNDAELR
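
The % identity column can be cross-checked against an alignment's middle line. The sketch below is illustrative only: it assumes the usual BLAST convention that a letter on the middle line marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap column, and it assumes identity is reported as identical columns divided by total alignment columns. The function name `percent_identity` is hypothetical, and the middle line is copied from the XP_022139315.1 alignment above.

```python
# Middle line of the XP_022139315.1 alignment, copied verbatim from the record.
# Assumed convention: letter = identical residue, "+" = conservative
# substitution, space = mismatch or gap column.
MIDLINE = (
    "MSVFKGVGLGL F NAN SC FPSN RIF +SVTSN EF PISLR R LL ENGNDSKFDT+S+STPT  DAK SGT+AR RRLLKLREEKRKREHDRLH"
    "NYPAWAKVLEDACKNDAELR"
)


def percent_identity(midline: str) -> tuple[int, int, float]:
    """Return (identical columns, total columns, percent identity)."""
    identical = sum(ch.isalpha() for ch in midline)  # letters only, not "+" or " "
    columns = len(midline)                           # every aligned column counts
    return identical, columns, 100.0 * identical / columns


identical, columns, pct = percent_identity(MIDLINE)
print(f"{identical}/{columns} identical columns = {pct:.2f}%")
```

Under these assumptions the result agrees with the 84.17 reported for this hit. Alignments that contain gaps would need the gap columns included in the middle line for the same calculation to apply.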

XP_022139316.1 uncharacterized protein LOC111010260 isoform X2 [Momordica charantia]; e value: 3.4e-46; % identity: 84.17
Query:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLH
        MSVFKGVGLGL F NAN SC FPSN RIF +SVTSN EF PISLR R LL ENGNDSKFDT+S+STPT  DAK SGT+AR RRLLKLREEKRKREHDRLH
Subjt:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLH

Query:  NYPAWAKVLEDACKNDAELR
        NYPAWAKVLEDACKNDAELR
Subjt:  NYPAWAKVLEDACKNDAELR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BDB3 uncharacterized protein LOC103488796; e value: 8.1e-46; % identity: 84.3
Query:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTD-AKSSGTSARSRRLLKLREEKRKREHDRL
        MSV  GVGLGL FTN N +C F SNTR FPQS+ S PEFHPISLRSR LL ENG+DSKFD  STSTPTTTD  KSSGTSARSRRLLKLREEKRKREHDRL
Subjt:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTD-AKSSGTSARSRRLLKLREEKRKREHDRL

Query:  HNYPAWAKVLEDACKNDAELR
        HNYPAWAKVLEDACKNDAELR
Subjt:  HNYPAWAKVLEDACKNDAELR

A0A5A7STY8 Uncharacterized protein; e value: 1.3e-43; % identity: 79.07
Query:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTD-AKSSGTSARSRRLLKLREEKRKREHDRL
        MSV  GVGLGL FTN N +C F SNTR FPQS+ S PEFHPISLRSR LL ENG+DSKFD  STSTPTTTD  KSSGTSARSRRLLKLREEKRKREHDRL
Subjt:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTD-AKSSGTSARSRRLLKLREEKRKREHDRL

Query:  HNYPAWAK--------VLEDACKNDAELR
        HNYPAWAK        VLEDACKNDAELR
Subjt:  HNYPAWAK--------VLEDACKNDAELR

A0A6J1CBZ8 uncharacterized protein LOC111010260 isoform X1; e value: 1.6e-46; % identity: 84.17
Query:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLH
        MSVFKGVGLGL F NAN SC FPSN RIF +SVTSN EF PISLR R LL ENGNDSKFDT+S+STPT  DAK SGT+AR RRLLKLREEKRKREHDRLH
Subjt:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLH

Query:  NYPAWAKVLEDACKNDAELR
        NYPAWAKVLEDACKNDAELR
Subjt:  NYPAWAKVLEDACKNDAELR

A0A6J1CCA9 uncharacterized protein LOC111010260 isoform X2; e value: 1.6e-46; % identity: 84.17
Query:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLH
        MSVFKGVGLGL F NAN SC FPSN RIF +SVTSN EF PISLR R LL ENGNDSKFDT+S+STPT  DAK SGT+AR RRLLKLREEKRKREHDRLH
Subjt:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLH

Query:  NYPAWAKVLEDACKNDAELR
        NYPAWAKVLEDACKNDAELR
Subjt:  NYPAWAKVLEDACKNDAELR

A0A6J1HT13 uncharacterized protein LOC111466517; e value: 1.5e-39; % identity: 75.83
Query:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLH
        MSVF+ +GLGL F NA  +C F SN  IF +SVT N EF  ISLRSR +L ENG++S FD KS STPT TDAK SGT+ARSRRLLKLREEKRKREHDRLH
Subjt:  MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLH

Query:  NYPAWAKVLEDACKNDAELR
        NYPAWAKVLEDACKNDAELR
Subjt:  NYPAWAKVLEDACKNDAELR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G08400.1 Protein of unknown function (DUF3531); e value: 2.1e-09; % identity: 34.07
Query:  SCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAK-----------
        SC    N    P  V+    F        TL+  + N      +     +  + K SGT+AR RRLLK+REEKRKR++DRLH+YP+WAK           
Subjt:  SCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAK-----------

Query:  ----------------------VLEDACKNDAELR
                              VLE ACK+D ELR
Subjt:  ----------------------VLEDACKNDAELR

AT5G08400.2 Protein of unknown function (DUF3531); e value: 1.6e-14; % identity: 45.1
Query:  SCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAE
        SC    N    P  V+    F        TL+  + N      +     +  + K SGT+AR RRLLK+REEKRKR++DRLH+YP+WAKVLE ACK+D E
Subjt:  SCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAE

Query:  LR
        LR
Subjt:  LR


Sequences
CDS sequence
ATGTCCGTGTTCAAGGGTGTCGGATTAGGTTTAACTTTTACGAATGCCAATTTCAGTTGCTATTTTCCTTCTAATACAAGAATCTTCCCCCAATCTGTCACTTCAAACCC
TGAATTTCATCCAATTTCTCTTCGTTCTCGTACATTGCTTTTTGAAAATGGCAACGATTCTAAGTTTGACACCAAGAGCACTTCTACACCGACTACGACTGATGCTAAGA
GCTCTGGAACCTCTGCTAGAAGTCGTCGATTGCTAAAGCTTCGTGAAGAGAAGCGCAAACGAGAACATGATCGTCTCCACAATTACCCTGCCTGGGCGAAAGTGTTAGAA
GATGCCTGCAAAAACGATGCGGAATTACGACTGTTCTTTGTGATAGCATTGGCAATCCAGAGGAAATGA
mRNA sequence
ATGTCCGTGTTCAAGGGTGTCGGATTAGGTTTAACTTTTACGAATGCCAATTTCAGTTGCTATTTTCCTTCTAATACAAGAATCTTCCCCCAATCTGTCACTTCAAACCC
TGAATTTCATCCAATTTCTCTTCGTTCTCGTACATTGCTTTTTGAAAATGGCAACGATTCTAAGTTTGACACCAAGAGCACTTCTACACCGACTACGACTGATGCTAAGA
GCTCTGGAACCTCTGCTAGAAGTCGTCGATTGCTAAAGCTTCGTGAAGAGAAGCGCAAACGAGAACATGATCGTCTCCACAATTACCCTGCCTGGGCGAAAGTGTTAGAA
GATGCCTGCAAAAACGATGCGGAATTACGACTGTTCTTTGTGATAGCATTGGCAATCCAGAGGAAATGA
Protein sequence
MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLE
DACKNDAELRLFFVIALAIQRK
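
As a consistency check on the sequences above, the CDS can be translated and compared with the listed protein. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB; the sequences are copied from this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# CDS and protein copied from the record for Lsi04G000030.
CDS = (
    "ATGTCCGTGTTCAAGGGTGTCGGATTAGGTTTAACTTTTACGAATGCCAATTTCAGTTGCTATTTTCCTTCTAATACAAGAATCTTCCCCCAATCTGTCACTTCAAACCC"
    "TGAATTTCATCCAATTTCTCTTCGTTCTCGTACATTGCTTTTTGAAAATGGCAACGATTCTAAGTTTGACACCAAGAGCACTTCTACACCGACTACGACTGATGCTAAGA"
    "GCTCTGGAACCTCTGCTAGAAGTCGTCGATTGCTAAAGCTTCGTGAAGAGAAGCGCAAACGAGAACATGATCGTCTCCACAATTACCCTGCCTGGGCGAAAGTGTTAGAA"
    "GATGCCTGCAAAAACGATGCGGAATTACGACTGTTCTTTGTGATAGCATTGGCAATCCAGAGGAAATGA"
)
PROTEIN = (
    "MSVFKGVGLGLTFTNANFSCYFPSNTRIFPQSVTSNPEFHPISLRSRTLLFENGNDSKFDTKSTSTPTTTDAKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLE"
    "DACKNDAELRLFFVIALAIQRK"
)


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


translated = translate(CDS)
print(len(CDS), "nt ->", len(translated) - 1, "aa plus stop")
print("matches listed protein:", translated == PROTEIN + "*")
```

The comparison appends "*" to the listed protein because the CDS includes its terminal stop codon while the protein sequence does not.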