; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10016565 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10016565
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr03:5998600..5999720
RNA-Seq ExpressionHG10016565
SyntenyHG10016565
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143553.1 uncharacterized protein LOC101209623 isoform X1 [Cucumis sativus]1.5e-5577.71Show/hide
Query:  VSLLPPSRSLHNNAHAKVVNVDEKGLK-QAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIKMDGG
        +SL PPSRS HNNAHA +V ++EKGLK + ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLY  RKNS HI P  K KK+LLEVNISIKMDGG
Subjt:  VSLLPPSRSLHNNAHAKVVNVDEKGLK-QAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIKMDGG

Query:  TMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGK
        T MEI+ETKKEAP P E LR RTSRS+T +R+VPEMKRLDW KSLRS  AP+P +GK
Subjt:  TMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGK

XP_008440564.1 PREDICTED: uncharacterized protein LOC103484942 [Cucumis melo]3.7e-5981.53Show/hide
Query:  VSLLPPSRSLHNNAHAKVVNVDEKGLK-QAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIKMDGG
        +SLL PSRS HNNAHA +V ++EKGLK + ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLY HRKNS HIKP  K  K+LLEVNISIKMDGG
Subjt:  VSLLPPSRSLHNNAHAKVVNVDEKGLK-QAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIKMDGG

Query:  TMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGK
        T MEIRETKKEAPPP E LR RTSRS+TL+R+VPEMKRLDW KSLRS  AP+PFVGK
Subjt:  TMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGK

XP_022962743.1 uncharacterized protein LOC111463142 isoform X1 [Cucurbita moschata]2.1e-5472.35Show/hide
Query:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK
        LGL  +SLLPP     +NAHAK++ V EKGL  AISEEEMKIRREIE EIERDLEEEIKGGIYQQALRL RLYQHR  SG I PPVKTKK+LLEVNISIK
Subjt:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK

Query:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKM
        M+GGT MEI++     PPP    R+RT+RSDT +R VPE+KRLDWVKSLRS  AP P VGKNV F RNKM
Subjt:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKM

XP_022962744.1 uncharacterized protein LOC111463142 isoform X2 [Cucurbita moschata]2.1e-5472.35Show/hide
Query:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK
        LGL  +SLLPP     +NAHAK++ V EKGL  AISEEEMKIRREIE EIERDLEEEIKGGIYQQALRL RLYQHR  SG I PPVKTKK+LLEVNISIK
Subjt:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK

Query:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKM
        M+GGT MEI++     PPP    R+RT+RSDT +R VPE+KRLDWVKSLRS  AP P VGKNV F RNKM
Subjt:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKM

XP_038883628.1 uncharacterized protein LOC120074545 [Benincasa hispida]7.7e-6583.82Show/hide
Query:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK
        LGL D+SLLPPS S H N HAKVV V++KGL+ AISEEEMKIRREIENEIERDLEEEIKGGIYQ ALRL+RLYQHRK SGHIK P+KTKKDLLEVNISIK
Subjt:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK

Query:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKMAAP
        MDGGTMMEIRETKK APPP E LR RTSRSDTLA AVPEMKRLDW KSLRS  AP PFV +NVRFDRNKMA P
Subjt:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKMAAP

TrEMBL top hitse value%identityAlignment
A0A0A0KJZ6 Uncharacterized protein7.0e-5677.71Show/hide
Query:  VSLLPPSRSLHNNAHAKVVNVDEKGLK-QAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIKMDGG
        +SL PPSRS HNNAHA +V ++EKGLK + ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLY  RKNS HI P  K KK+LLEVNISIKMDGG
Subjt:  VSLLPPSRSLHNNAHAKVVNVDEKGLK-QAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIKMDGG

Query:  TMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGK
        T MEI+ETKKEAP P E LR RTSRS+T +R+VPEMKRLDW KSLRS  AP+P +GK
Subjt:  TMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGK

A0A1S3B1F7 uncharacterized protein LOC1034849421.8e-5981.53Show/hide
Query:  VSLLPPSRSLHNNAHAKVVNVDEKGLK-QAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIKMDGG
        +SLL PSRS HNNAHA +V ++EKGLK + ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLY HRKNS HIKP  K  K+LLEVNISIKMDGG
Subjt:  VSLLPPSRSLHNNAHAKVVNVDEKGLK-QAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIKMDGG

Query:  TMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGK
        T MEIRETKKEAPPP E LR RTSRS+TL+R+VPEMKRLDW KSLRS  AP+PFVGK
Subjt:  TMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGK

A0A6J1HFP1 uncharacterized protein LOC111463142 isoform X11.0e-5472.35Show/hide
Query:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK
        LGL  +SLLPP     +NAHAK++ V EKGL  AISEEEMKIRREIE EIERDLEEEIKGGIYQQALRL RLYQHR  SG I PPVKTKK+LLEVNISIK
Subjt:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK

Query:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKM
        M+GGT MEI++     PPP    R+RT+RSDT +R VPE+KRLDWVKSLRS  AP P VGKNV F RNKM
Subjt:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKM

A0A6J1HHY9 uncharacterized protein LOC111463142 isoform X21.0e-5472.35Show/hide
Query:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK
        LGL  +SLLPP     +NAHAK++ V EKGL  AISEEEMKIRREIE EIERDLEEEIKGGIYQQALRL RLYQHR  SG I PPVKTKK+LLEVNISIK
Subjt:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK

Query:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKM
        M+GGT MEI++     PPP    R+RT+RSDT +R VPE+KRLDWVKSLRS  AP P VGKNV F RNKM
Subjt:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKM

A0A6J1KUF8 uncharacterized protein LOC1114972926.6e-5472.94Show/hide
Query:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK
        LGL  +SLLPPS S H N HAK++ V EKGL  AISEEEMKIRREIE EIERDLEEEIKGGIYQQALRL RLYQHR  SG I PP+KTKK+LLEVNISIK
Subjt:  LGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIK

Query:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKM
        M+GGT MEI+E     PPP     +RT+RSDT +R VPE+KRLDWVKSLRS  AP P VGKNV F RNKM
Subjt:  MDGGTMMEIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G26290.1 unknown protein1.6e-1250Show/hide
Query:  ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHI-KPPVKTKKDLLEVNISIKMDGGTMMEIRETKKE
        +S++E +IR E+E EIER+LE E K GIY  AL+L RLY+ R+    +    ++  K +LEVNI+IKM+G T +EI E KKE
Subjt:  ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHI-KPPVKTKKDLLEVNISIKMDGGTMMEIRETKKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCAGCTAGGGTTGGGTGATGTGTCACTTTTGCCACCGAGTCGGTCTCTTCATAATAATGCTCATGCAAAAGTGGTAAATGTGGATGAGAAGGGTTTAAAGCAAGC
AATTTCTGAAGAAGAGATGAAGATCCGACGAGAAATTGAGAATGAAATAGAGAGAGATTTGGAGGAAGAGATTAAGGGAGGGATATACCAACAAGCTCTCCGCCTGCACC
GCCTTTACCAGCACCGGAAAAATTCCGGTCATATTAAGCCGCCGGTGAAAACCAAGAAAGACCTTTTGGAGGTAAATATCAGCATCAAAATGGATGGAGGGACTATGATG
GAAATAAGAGAGACCAAAAAGGAGGCGCCGCCGCCGACGGAACGTCTCCGGCAGAGGACTTCCCGGTCCGACACTCTTGCTCGAGCAGTCCCGGAGATGAAAAGGTTGGA
TTGGGTGAAGTCTCTTAGGTCGAGGCCGGCGCCGCTGCCATTTGTCGGGAAAAACGTCAGGTTTGATCGGAATAAAATGGCGGCGCCGATGGCCGGTGCCCGTGCCGTGC
AATAA
mRNA sequenceShow/hide mRNA sequence
ATGATCCAGCTAGGGTTGGGTGATGTGTCACTTTTGCCACCGAGTCGGTCTCTTCATAATAATGCTCATGCAAAAGTGGTAAATGTGGATGAGAAGGGTTTAAAGCAAGC
AATTTCTGAAGAAGAGATGAAGATCCGACGAGAAATTGAGAATGAAATAGAGAGAGATTTGGAGGAAGAGATTAAGGGAGGGATATACCAACAAGCTCTCCGCCTGCACC
GCCTTTACCAGCACCGGAAAAATTCCGGTCATATTAAGCCGCCGGTGAAAACCAAGAAAGACCTTTTGGAGGTAAATATCAGCATCAAAATGGATGGAGGGACTATGATG
GAAATAAGAGAGACCAAAAAGGAGGCGCCGCCGCCGACGGAACGTCTCCGGCAGAGGACTTCCCGGTCCGACACTCTTGCTCGAGCAGTCCCGGAGATGAAAAGGTTGGA
TTGGGTGAAGTCTCTTAGGTCGAGGCCGGCGCCGCTGCCATTTGTCGGGAAAAACGTCAGGTTTGATCGGAATAAAATGGCGGCGCCGATGGCCGGTGCCCGTGCCGTGC
AATAA
Protein sequenceShow/hide protein sequence
MIQLGLGDVSLLPPSRSLHNNAHAKVVNVDEKGLKQAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQHRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMM
EIRETKKEAPPPTERLRQRTSRSDTLARAVPEMKRLDWVKSLRSRPAPLPFVGKNVRFDRNKMAAPMAGARAVQ