; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018034 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018034
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA-directed DNA polymerase
Genome locationChr03:29352799..29353584
RNA-Seq ExpressionHG10018034
SyntenyHG10018034
Gene Ontology termsGO:0006261 - DNA-dependent DNA replication (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038899891.1 DNA polymerase I isoform X1 [Benincasa hispida]3.7e-5786.81Show/hide
Query:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF
        MA HHLHTATASASHICRN LGFIFTSK PNP RF  SSSSSSLRIQSLGHHS PSFS+LLSPKGYCSSSGSI+  NTVDS AT+H SSASSRSQQMLQF
Subjt:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF

Query:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML
        QDSLS+SL+YKEETGI+NPSDARVMLIDGTSVIYRAYYKLL  L
Subjt:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML

XP_038899892.1 DNA polymerase I isoform X2 [Benincasa hispida]3.7e-5786.81Show/hide
Query:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF
        MA HHLHTATASASHICRN LGFIFTSK PNP RF  SSSSSSLRIQSLGHHS PSFS+LLSPKGYCSSSGSI+  NTVDS AT+H SSASSRSQQMLQF
Subjt:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF

Query:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML
        QDSLS+SL+YKEETGI+NPSDARVMLIDGTSVIYRAYYKLL  L
Subjt:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML

XP_038899895.1 DNA polymerase I isoform X4 [Benincasa hispida]3.7e-5786.81Show/hide
Query:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF
        MA HHLHTATASASHICRN LGFIFTSK PNP RF  SSSSSSLRIQSLGHHS PSFS+LLSPKGYCSSSGSI+  NTVDS AT+H SSASSRSQQMLQF
Subjt:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF

Query:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML
        QDSLS+SL+YKEETGI+NPSDARVMLIDGTSVIYRAYYKLL  L
Subjt:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML

XP_038899896.1 DNA polymerase I isoform X5 [Benincasa hispida]3.7e-5786.81Show/hide
Query:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF
        MA HHLHTATASASHICRN LGFIFTSK PNP RF  SSSSSSLRIQSLGHHS PSFS+LLSPKGYCSSSGSI+  NTVDS AT+H SSASSRSQQMLQF
Subjt:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF

Query:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML
        QDSLS+SL+YKEETGI+NPSDARVMLIDGTSVIYRAYYKLL  L
Subjt:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML

XP_038899897.1 DNA polymerase I isoform X6 [Benincasa hispida]3.7e-5786.81Show/hide
Query:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF
        MA HHLHTATASASHICRN LGFIFTSK PNP RF  SSSSSSLRIQSLGHHS PSFS+LLSPKGYCSSSGSI+  NTVDS AT+H SSASSRSQQMLQF
Subjt:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF

Query:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML
        QDSLS+SL+YKEETGI+NPSDARVMLIDGTSVIYRAYYKLL  L
Subjt:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML

TrEMBL top hitse value%identityAlignment
A0A5A7V6D7 DNA polymerase I1.2e-5076.67Show/hide
Query:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF
        MASHHLHTATASASHICRN LGF+FTSK P P RFS+SSS    RI     HSFPS SLLLSPKGYCSSSGSIN+ N +D+ ATYH SSAS+R Q M+QF
Subjt:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF

Query:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGMLDLSVRM
        QDSLSNSL++KE+TGIDNP+DARVMLIDGTS+I+RAYYKLLGML +SVRM
Subjt:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGMLDLSVRM

A0A6J1D6K2 DNA-directed DNA polymerase1.8e-4978.91Show/hide
Query:  MASHHLH--TATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYH-SSSASSRSQQM
        MA HHLH  TATA+ASHICRN LG+IFTS+  +PLR    SSSSSL+IQSL HHSFPSFSLL SPKGYCSSSG +NA N VDSNATYH SSSASS+SQQM
Subjt:  MASHHLH--TATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYH-SSSASSRSQQM

Query:  LQFQDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML
        LQFQDSL NSL +KEE GID+PSDARVMLIDGTS+IYRAYYKLL  L
Subjt:  LQFQDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML

A0A6J1D7Q6 DNA-directed DNA polymerase1.8e-4978.91Show/hide
Query:  MASHHLH--TATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYH-SSSASSRSQQM
        MA HHLH  TATA+ASHICRN LG+IFTS+  +PLR    SSSSSL+IQSL HHSFPSFSLL SPKGYCSSSG +NA N VDSNATYH SSSASS+SQQM
Subjt:  MASHHLH--TATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYH-SSSASSRSQQM

Query:  LQFQDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML
        LQFQDSL NSL +KEE GID+PSDARVMLIDGTS+IYRAYYKLL  L
Subjt:  LQFQDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML

A0A6J1D847 uncharacterized protein LOC111017796 isoform X21.8e-4978.91Show/hide
Query:  MASHHLH--TATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYH-SSSASSRSQQM
        MA HHLH  TATA+ASHICRN LG+IFTS+  +PLR    SSSSSL+IQSL HHSFPSFSLL SPKGYCSSSG +NA N VDSNATYH SSSASS+SQQM
Subjt:  MASHHLH--TATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYH-SSSASSRSQQM

Query:  LQFQDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML
        LQFQDSL NSL +KEE GID+PSDARVMLIDGTS+IYRAYYKLL  L
Subjt:  LQFQDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML

A0A6J1HVF9 DNA-directed DNA polymerase3.6e-5079.86Show/hide
Query:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF
        MA HHLHTATASASHICRN LG+IFTSK  +PLRF  SSSSSSLRIQS GHH FPSFS+LLSPKGYCSSS S+NA   VDSNATYH S ASS SQQMLQ 
Subjt:  MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQF

Query:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML
        QDSLSNS + KEET ID+PSDARVMLIDGTS+IYRAY+KLL  L
Subjt:  QDSLSNSLSYKEETGIDNPSDARVMLIDGTSVIYRAYYKLLGML

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52050.1 5'-3' exonuclease family protein3.0e-0440.62Show/hide
Query:  SFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSYKEETGIDN---PSDARVMLIDGTSVIYRAYYKLLGMLD
        S  S SL  S K YCSS       N   S +T  S S     Q         S    +K E  + +    S+ RVMLIDGTS+IYRAYYKLL  L+
Subjt:  SFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSYKEETGIDN---PSDARVMLIDGTSVIYRAYYKLLGMLD

AT3G52050.2 5'-3' exonuclease family protein3.0e-0440.62Show/hide
Query:  SFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSYKEETGIDN---PSDARVMLIDGTSVIYRAYYKLLGMLD
        S  S SL  S K YCSS       N   S +T  S S     Q         S    +K E  + +    S+ RVMLIDGTS+IYRAYYKLL  L+
Subjt:  SFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSYKEETGIDN---PSDARVMLIDGTSVIYRAYYKLLGMLD

AT3G52050.3 5'-3' exonuclease family protein3.0e-0440.62Show/hide
Query:  SFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSYKEETGIDN---PSDARVMLIDGTSVIYRAYYKLLGMLD
        S  S SL  S K YCSS       N   S +T  S S     Q         S    +K E  + +    S+ RVMLIDGTS+IYRAYYKLL  L+
Subjt:  SFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSYKEETGIDN---PSDARVMLIDGTSVIYRAYYKLLGMLD

AT3G52050.4 5'-3' exonuclease family protein3.0e-0440.62Show/hide
Query:  SFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSYKEETGIDN---PSDARVMLIDGTSVIYRAYYKLLGMLD
        S  S SL  S K YCSS       N   S +T  S S     Q         S    +K E  + +    S+ RVMLIDGTS+IYRAYYKLL  L+
Subjt:  SFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSYKEETGIDN---PSDARVMLIDGTSVIYRAYYKLLGMLD

AT3G52050.5 5'-3' exonuclease family protein3.0e-0440.62Show/hide
Query:  SFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSYKEETGIDN---PSDARVMLIDGTSVIYRAYYKLLGMLD
        S  S SL  S K YCSS       N   S +T  S S     Q         S    +K E  + +    S+ RVMLIDGTS+IYRAYYKLL  L+
Subjt:  SFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSYKEETGIDN---PSDARVMLIDGTSVIYRAYYKLLGMLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCCCACCATCTTCACACTGCGACTGCTAGTGCCTCGCATATTTGTAGAAATTTGTTGGGTTTCATTTTCACCTCCAAATTGCCTAATCCTCTTCGTTTCTCCTC
TTCTTCTTCTTCGTCTTCTTTGAGGATACAGTCTCTTGGTCATCATTCTTTTCCGTCTTTCTCTCTTCTGCTATCCCCGAAGGGTTACTGTAGTTCATCCGGAAGTATAA
ATGCTCCCAATACCGTAGATAGCAATGCAACTTATCATAGTAGTTCTGCATCTTCTAGAAGCCAGCAAATGCTGCAATTCCAAGATTCATTATCGAATTCACTTTCGTAC
AAAGAAGAAACTGGAATTGATAATCCTTCAGATGCTAGAGTTATGCTCATTGATGGCACATCAGTCATTTATAGAGCATACTACAAGCTTTTGGGTATGTTAGACTTGTC
AGTCAGAATGTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCCCACCATCTTCACACTGCGACTGCTAGTGCCTCGCATATTTGTAGAAATTTGTTGGGTTTCATTTTCACCTCCAAATTGCCTAATCCTCTTCGTTTCTCCTC
TTCTTCTTCTTCGTCTTCTTTGAGGATACAGTCTCTTGGTCATCATTCTTTTCCGTCTTTCTCTCTTCTGCTATCCCCGAAGGGTTACTGTAGTTCATCCGGAAGTATAA
ATGCTCCCAATACCGTAGATAGCAATGCAACTTATCATAGTAGTTCTGCATCTTCTAGAAGCCAGCAAATGCTGCAATTCCAAGATTCATTATCGAATTCACTTTCGTAC
AAAGAAGAAACTGGAATTGATAATCCTTCAGATGCTAGAGTTATGCTCATTGATGGCACATCAGTCATTTATAGAGCATACTACAAGCTTTTGGGTATGTTAGACTTGTC
AGTCAGAATGTTTTAG
Protein sequenceShow/hide protein sequence
MASHHLHTATASASHICRNLLGFIFTSKLPNPLRFSSSSSSSSLRIQSLGHHSFPSFSLLLSPKGYCSSSGSINAPNTVDSNATYHSSSASSRSQQMLQFQDSLSNSLSY
KEETGIDNPSDARVMLIDGTSVIYRAYYKLLGMLDLSVRMF