; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi10G001050 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi10G001050
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function (DUF1068)
Genome locationchr10:1619209..1622005
RNA-Seq ExpressionLsi10G001050
SyntenyLsi10G001050
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652279.1 hypothetical protein Csa_022101 [Cucumis sativus]1.7e-8889.42Show/hide
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS
        MAVK VGSCSPGLTKVGL+ MALCIA YILGPPLYWHFME L AFSSSS STCPPCFCDCSS TDFAFT+ELENTTFRDCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTA+WETRARQRGWRD+IV SRG +Q S
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

XP_004151898.1 uncharacterized protein LOC101219040 [Cucumis sativus]1.7e-8889.42Show/hide
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS
        MAVK VGSCSPGLTKVGL+ MALCIA YILGPPLYWHFME L AFSSSS STCPPCFCDCSS TDFAFT+ELENTTFRDCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTA+WETRARQRGWRD+IV SRG +Q S
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

XP_008455917.1 PREDICTED: uncharacterized protein LOC103495987 isoform X1 [Cucumis melo]5.6e-8788.89Show/hide
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS
        MA K VGS SPGLTKVGL FMA+CIA YILGPPLYWHF E LAAFSSSS STCPPCFCDCSS TDFAFT+EL+NTTFRDCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LT +WETRARQRGWRDDIV SRG VQTS
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

XP_022947173.1 uncharacterized protein LOC111451120 isoform X1 [Cucurbita moschata]1.7e-8385.94Show/hide
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS
        MAVK  GSCSPGLTKVGL F+ALCIA YILGPPLYWHF+E LA  SSSS STCPPCFCDCSS TDFAFT E ENTTFRDCVKHDSGMNEETE++FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS
        EELKLREAE +E HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTA+WE RARQRGWRDDIV S   R  VQTS
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS

XP_023534557.1 uncharacterized protein LOC111796098 isoform X1 [Cucurbita pepo subsp. pepo]3.4e-8486.98Show/hide
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS
        MAVK  GSCSPGLTKVGL F+ALCIA YILGPPLYWHFME LA  SSSS STCPPCFCDCSS TDFAFT E ENTTFRDCVKHDSGMNEETE+SFAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS
        EELKLREAE +E HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTA+WE RARQRGWRDDIV S   R  VQTS
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS

TrEMBL top hitse value%identityAlignment
A0A0A0LQC2 Uncharacterized protein8.5e-8989.42Show/hide
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS
        MAVK VGSCSPGLTKVGL+ MALCIA YILGPPLYWHFME L AFSSSS STCPPCFCDCSS TDFAFT+ELENTTFRDCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTA+WETRARQRGWRD+IV SRG +Q S
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

A0A1S3C1Z9 uncharacterized protein LOC103495987 isoform X12.7e-8788.89Show/hide
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS
        MA K VGS SPGLTKVGL FMA+CIA YILGPPLYWHF E LAAFSSSS STCPPCFCDCSS TDFAFT+EL+NTTFRDCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LT +WETRARQRGWRDDIV SRG VQTS
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

A0A1S3C2Q4 uncharacterized protein LOC103495987 isoform X28.5e-8185.19Show/hide
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS
        MA K VGS SPGLTKVGL FMA+CIA YILGPPLYWHF E LAAFSSSS STCPPCFCDCSS TDFAFT+        DCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LT +WETRARQRGWRDDIV SRG VQTS
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

A0A6J1G5W1 uncharacterized protein LOC111451120 isoform X18.2e-8485.94Show/hide
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS
        MAVK  GSCSPGLTKVGL F+ALCIA YILGPPLYWHF+E LA  SSSS STCPPCFCDCSS TDFAFT E ENTTFRDCVKHDSGMNEETE++FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS
        EELKLREAE +E HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTA+WE RARQRGWRDDIV S   R  VQTS
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS

A0A6J1I131 uncharacterized protein LOC111469880 isoform X14.1e-8385.42Show/hide
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS
        MAVK  GSCSPGLTKVGL F+ALCIA YILGPPLYWHFME LA  SSS  STCPPCFCDCSS TDFAFT E ENTTFRDCVKHDSGMNEETE+SFAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS
        E+LKLREA+ +E HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTA+WE RARQRGWRDDIV S   R  VQTS
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)3.2e-5662.57Show/hide
Query:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREAETLESH
        K+GL  + L +A YILGPPLYWH  EALAA S+SS   CP C C+CS+++     KEL N +F DC KHD  +NE+TEK++AELL+EELKLREAE+LE H
Subjt:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREAETLESH

Query:  RGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDI----VRSRGNVQTS
        + AD+ LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  LA QKKLT+ WE RARQ+GWR+      V+S+ NVQ +
Subjt:  RGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDI----VRSRGNVQTS

AT2G32580.1 Protein of unknown function (DUF1068)2.1e-4758.33Show/hide
Query:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREAETLESH
        KVGL  +AL +  YILGPPLYWH  EALA     S ++C  C CDCSS         L N +F DC K D  +NE+TEK++AELL+EELK REA ++E H
Subjt:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREAETLESH

Query:  RGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS
        +  D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QKKLT++WE RARQ+G++D   +S
Subjt:  RGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS

AT2G32580.2 Protein of unknown function (DUF1068)2.1e-3163.46Show/hide
Query:  DCVKHDSGMNEETEKSFAELLSEELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDD
        +C K D  +NE+TEK++AELL+EELK REA ++E H+  D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QKKLT++WE RARQ+G++D 
Subjt:  DCVKHDSGMNEETEKSFAELLSEELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDD

Query:  IVRS
          +S
Subjt:  IVRS

AT4G04360.1 Protein of unknown function (DUF1068)3.4e-4254.17Show/hide
Query:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREAETLESH
        KV    M LCI  YI GP LYWH  E +A    S  S+CPPC CDCSS    +    L N +F DC++H+ G +EE+E SF E+++EELKLREA+  E  
Subjt:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREAETLESH

Query:  RGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS
          AD  LL+AKK  SQYQKEADKC+ GMETCE ARE+AEA L  Q++L+ +WE RARQ GW++  V S
Subjt:  RGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS

AT4G30996.1 Protein of unknown function (DUF1068)2.7e-3145.06Show/hide
Query:  LYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTD-FAFTKELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREAETLESHRG
        L   A+  A  + GP LYW F +     S+ + S CPPC CDC            L N +  DC   D  + +E EK F +LL+EELKL+EA   E  R 
Subjt:  LYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTD-FAFTKELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREAETLESHRG

Query:  ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDD
         +++L EAK++ SQYQKEA+KCN+  E CE+ARERAEA L  ++K+T++WE RARQ GW  +
Subjt:  ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTGAAGCAAGTGGGTTCGTGCTCTCCAGGCTTGACGAAGGTGGGATTGTATTTTATGGCTCTTTGTATAGCGACTTACATTCTGGGTCCGCCTCTCTACTGGCA
TTTCATGGAGGCCTTGGCCGCTTTCTCTTCTTCCTCTTTCTCAACTTGCCCACCTTGCTTTTGTGACTGTTCTTCTCACACTGACTTTGCCTTCACTAAAGAGCTCGAAA
ACACTACTTTTAGAGATTGTGTGAAACATGACTCTGGCATGAATGAGGAAACAGAAAAGAGTTTTGCAGAGTTGTTGTCTGAGGAACTGAAACTGAGGGAAGCTGAAACT
TTGGAGAGTCATCGGGGCGCCGACATATCTCTGCTAGAAGCAAAGAAGATGACATCTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGCATGGAAACATGTGAAGC
AGCAAGGGAAAGGGCTGAAGCTACATTAGCTTCACAAAAGAAGTTAACAGCAATATGGGAGACTAGGGCTCGCCAAAGAGGATGGAGAGACGACATTGTTAGATCCCGTG
GTAACGTCCAAACCTCATAA
mRNA sequenceShow/hide mRNA sequence
CAAATTTACTTGAGCCAAAGTTGTATTTCCATTCCACTTTCCCTTCTATAATCCACCTTCCCCAACGGGTTCGTTCCCATTTTCCATTTCTGGACTACACGAAGGTCTTG
AAAATGGCAGTGAAGCAAGTGGGTTCGTGCTCTCCAGGCTTGACGAAGGTGGGATTGTATTTTATGGCTCTTTGTATAGCGACTTACATTCTGGGTCCGCCTCTCTACTG
GCATTTCATGGAGGCCTTGGCCGCTTTCTCTTCTTCCTCTTTCTCAACTTGCCCACCTTGCTTTTGTGACTGTTCTTCTCACACTGACTTTGCCTTCACTAAAGAGCTCG
AAAACACTACTTTTAGAGATTGTGTGAAACATGACTCTGGCATGAATGAGGAAACAGAAAAGAGTTTTGCAGAGTTGTTGTCTGAGGAACTGAAACTGAGGGAAGCTGAA
ACTTTGGAGAGTCATCGGGGCGCCGACATATCTCTGCTAGAAGCAAAGAAGATGACATCTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGCATGGAAACATGTGA
AGCAGCAAGGGAAAGGGCTGAAGCTACATTAGCTTCACAAAAGAAGTTAACAGCAATATGGGAGACTAGGGCTCGCCAAAGAGGATGGAGAGACGACATTGTTAGATCCC
GTGGTAACGTCCAAACCTCATAAAAGATCCGATGCTTATTGAACGAGCTGCTTTGACAGCAGCTTGGAATTCATGGAGCCAGAAGCCTACTGAGCAACTTTGATTAGTCT
ATGCTAGGAGCCCAGGCTTTTCCTTTACATAGTCCACTCACATTTCTTTAAACTTCCTCGTGTATATGGAAAAGATACAAACCAAACGTCACTCTCCCTTCCCTCCAAGA
GATCAAGACCCAAAACCTTTGGCAAGGTTATACAGCAAGCAAATTTTCCTGTGATGCTGATTGGACTGTTTAAAGTTTTGGTATAGCAATTAGTTAGACTACTCGTGTAA
CTTGTCGTCACTAGAAAGTTTAAAATATTGTCCTAGTCCATCAATATTGTATAATGGATCAAAGGAATTGCATGACATAAATTCTGGCGTGTATGTCAATAGCCATTTGT
TGAGCCTCA
Protein sequenceShow/hide protein sequence
MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREAET
LESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS