; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022187 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022187
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCopine domain-containing protein
Genome locationChr05:21766922..21767365
RNA-Seq ExpressionHG10022187
SyntenyHG10022187
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011655383.1 uncharacterized protein LOC105435522 [Cucumis sativus]1.1e-5381.05Show/hide
Query:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGE-IRKRFGRRSRRWFRIRRS------SIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRF
        MADLETASAGKFRYRRLRYSEEFEEAFGRSR GGGE IRKR+ RR R WFR+R        SIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSI+KV+KRF
Subjt:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGE-IRKRFGRRSRRWFRIRRS------SIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRF

Query:  RDGEAYLGDLFAGNYLFLQVNPSSMKCLKKKNHQR----FALQNFPQTYSLPS
        RDGEAYLGDLFAGNYLFLQVNPSSMKCLK  +H +    F ++NFPQTYSLPS
Subjt:  RDGEAYLGDLFAGNYLFLQVNPSSMKCLKKKNHQR----FALQNFPQTYSLPS

XP_016902323.1 PREDICTED: uncharacterized protein LOC107991631 [Cucumis melo]5.4e-5380.54Show/hide
Query:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRS------SIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFR
        MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGG IRKR+ R  R WFR+R        SIGRRLKKLRIPSLRKL+RRKWRLVNAMRGSI+KVLKRFR
Subjt:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRS------SIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFR

Query:  DGEAYLGDLFAGNYLFLQVNPSSMKCLKKKNHQR-FALQNFPQTYSLPS
        DGEAYLGDLFAGNYLFLQVNPSSMKC+K  + Q  F ++NF QTYSLP+
Subjt:  DGEAYLGDLFAGNYLFLQVNPSSMKCLKKKNHQR-FALQNFPQTYSLPS

XP_022922294.1 uncharacterized protein LOC111430312 [Cucurbita moschata]9.8e-4778.72Show/hide
Query:  MADLE-TASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAY
        M DL+ TA  GKFRYRRLRYSEEFEEAFGRSR G    R+RF  RS+RWFRIRR+SIGRRLKKLR+PSLRKLLRRK RLV AMRGSI+KV+KRFR+GEAY
Subjt:  MADLE-TASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAY

Query:  LGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSL
        LGDLFAGNYLFLQVNPSS+KC   KNH   ALQNF  TYSL
Subjt:  LGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSL

XP_023551709.1 uncharacterized protein LOC111809563 [Cucurbita pepo subsp. pepo]3.4e-4780.14Show/hide
Query:  MADLETASA-GKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAY
        M DL+T +A GKFRYRRLRYSEEFEEAFGRSR GG E R+RF  RS+RWFRIRR+SIGRRLKKLR+PSLRKLLRRK RLV AMRGSI+KV+KRFR+GEAY
Subjt:  MADLETASA-GKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAY

Query:  LGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSL
        LGDLFAGNYLFLQVNPSS+KC   KNH   ALQNF  TYSL
Subjt:  LGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSL

XP_038889218.1 uncharacterized protein LOC120079105 [Benincasa hispida]2.0e-5283.92Show/hide
Query:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRR-SSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAY
        MADLETA A KFRYRRL YSEEFEEAFGR RGG     +R  +RS+RWFRIR+ SSIGRRLKKLRIPSLRKLLRRK +LVNAMRGSISK+LKRFRDGEAY
Subjt:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRR-SSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAY

Query:  LGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSLPS
        LGDLFAGNYLFLQVNPSSMKCL  KNH RF LQNFPQTYSLPS
Subjt:  LGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSLPS

TrEMBL top hitse value%identityAlignment
A0A0A0KSD1 Uncharacterized protein5.3e-5481.05Show/hide
Query:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGE-IRKRFGRRSRRWFRIRRS------SIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRF
        MADLETASAGKFRYRRLRYSEEFEEAFGRSR GGGE IRKR+ RR R WFR+R        SIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSI+KV+KRF
Subjt:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGE-IRKRFGRRSRRWFRIRRS------SIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRF

Query:  RDGEAYLGDLFAGNYLFLQVNPSSMKCLKKKNHQR----FALQNFPQTYSLPS
        RDGEAYLGDLFAGNYLFLQVNPSSMKCLK  +H +    F ++NFPQTYSLPS
Subjt:  RDGEAYLGDLFAGNYLFLQVNPSSMKCLKKKNHQR----FALQNFPQTYSLPS

A0A1S4E272 uncharacterized protein LOC1079916312.6e-5380.54Show/hide
Query:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRS------SIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFR
        MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGG IRKR+ R  R WFR+R        SIGRRLKKLRIPSLRKL+RRKWRLVNAMRGSI+KVLKRFR
Subjt:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRS------SIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFR

Query:  DGEAYLGDLFAGNYLFLQVNPSSMKCLKKKNHQR-FALQNFPQTYSLPS
        DGEAYLGDLFAGNYLFLQVNPSSMKC+K  + Q  F ++NF QTYSLP+
Subjt:  DGEAYLGDLFAGNYLFLQVNPSSMKCLKKKNHQR-FALQNFPQTYSLPS

A0A6J1E2Z5 uncharacterized protein LOC1114303124.8e-4778.72Show/hide
Query:  MADLE-TASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAY
        M DL+ TA  GKFRYRRLRYSEEFEEAFGRSR G    R+RF  RS+RWFRIRR+SIGRRLKKLR+PSLRKLLRRK RLV AMRGSI+KV+KRFR+GEAY
Subjt:  MADLE-TASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAY

Query:  LGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSL
        LGDLFAGNYLFLQVNPSS+KC   KNH   ALQNF  TYSL
Subjt:  LGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSL

A0A6J1FMJ7 uncharacterized protein LOC1114453177.1e-4373.47Show/hide
Query:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAYL
        MADLE ASA KFRYRRLR+S+E E AF RSR G    ++RFG      FR+RR+SIGRRLKKLRIPSLRKL  RK RLVNAM GSISKVLKRFRDGEAYL
Subjt:  MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAYL

Query:  GDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSLPSLALPN
        GDLFAGNYLFLQ+NPSS+KCL   NH ++ LQNFP T+SLP  ALPN
Subjt:  GDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSLPSLALPN

A0A6J1JM01 uncharacterized protein LOC1114870963.4e-4578.01Show/hide
Query:  MADLE-TASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAY
        M DL+ TA  GKFRYRRLRYSEEFEEAFGRSR  G E R+RF  RS+RWF+IRR+SIGRRLKKLR+PSLRKLLRRK RLV AM+GSISKV+KRFR+GEAY
Subjt:  MADLE-TASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAY

Query:  LGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSL
        LGDLFAGNYLFLQVNPSS+KC   KNH   AL NF  TYSL
Subjt:  LGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQNFPQTYSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G14410.1 unknown protein3.0e-0938.26Show/hide
Query:  GGEIRKRFGRRSRRWFRIRRSSIGRRLK-KLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAYLGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQ
        G   R+R    S R+ R+      RR++ +++I  LR  +R+K    N ++  I K+LKR ++ +++ GDLFAGNYLF+QVNPSS+   K    + F  Q
Subjt:  GGEIRKRFGRRSRRWFRIRRSSIGRRLK-KLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAYLGDLFAGNYLFLQVNPSSMKCLKKKNHQRFALQ

Query:  --NFPQTYSLPSLAL
          NFP   SLP + +
Subjt:  --NFPQTYSLPSLAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGACCTCGAAACAGCATCGGCAGGCAAATTCCGGTACCGGAGATTGAGGTACAGCGAGGAATTTGAGGAGGCGTTTGGGAGATCCAGAGGAGGAGGAGGAGAGAT
TAGGAAGAGATTTGGAAGGAGATCGAGGCGGTGGTTCAGAATCAGAAGGAGTTCGATTGGGAGGAGGTTGAAGAAGCTGAGAATCCCCAGCTTGAGAAAGCTGTTGAGGA
GGAAATGGCGGCTGGTGAATGCTATGAGAGGCTCGATTTCGAAGGTTTTGAAGAGATTCAGAGATGGAGAAGCGTATTTAGGAGATCTTTTTGCGGGAAATTACTTGTTT
CTTCAGGTTAATCCAAGCTCCATGAAATGCTTGAAGAAGAAGAATCATCAGCGATTTGCTCTTCAAAATTTCCCCCAAACTTATTCCCTCCCAAGCTTAGCTTTACCTAA
TTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGACCTCGAAACAGCATCGGCAGGCAAATTCCGGTACCGGAGATTGAGGTACAGCGAGGAATTTGAGGAGGCGTTTGGGAGATCCAGAGGAGGAGGAGGAGAGAT
TAGGAAGAGATTTGGAAGGAGATCGAGGCGGTGGTTCAGAATCAGAAGGAGTTCGATTGGGAGGAGGTTGAAGAAGCTGAGAATCCCCAGCTTGAGAAAGCTGTTGAGGA
GGAAATGGCGGCTGGTGAATGCTATGAGAGGCTCGATTTCGAAGGTTTTGAAGAGATTCAGAGATGGAGAAGCGTATTTAGGAGATCTTTTTGCGGGAAATTACTTGTTT
CTTCAGGTTAATCCAAGCTCCATGAAATGCTTGAAGAAGAAGAATCATCAGCGATTTGCTCTTCAAAATTTCCCCCAAACTTATTCCCTCCCAAGCTTAGCTTTACCTAA
TTAG
Protein sequenceShow/hide protein sequence
MADLETASAGKFRYRRLRYSEEFEEAFGRSRGGGGEIRKRFGRRSRRWFRIRRSSIGRRLKKLRIPSLRKLLRRKWRLVNAMRGSISKVLKRFRDGEAYLGDLFAGNYLF
LQVNPSSMKCLKKKNHQRFALQNFPQTYSLPSLALPN