CuGenDBv2

Tan0004427 (gene) of Snake gourd v1 genome

Gene ID: Tan0004427
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pol protein
Genome location: LG08:11785654..11787202
RNA-Seq Expression: Tan0004427
Synteny: Tan0004427
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033160.1 pol protein [Cucumis melo var. makuwa]; e value: 5.9e-48; % identity: 69.93
Query:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP
        MQTAQSRQKSY D RR+DLEFDV + VFLKV  M GV+ F+RKGKPSP F+  FEILEQIGPV YRLAL P LS+VH++FHVSMLRKYVPDP  VVD+EP
Subjt:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP

Query:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE
        L ++E+LSY EQP  +LA++ K+LRNR I LVK LW+N Q EE
Subjt:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE

KAA0054231.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 7.7e-48; % identity: 59.09
Query:  QTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEPL
        +TAQSRQKSY D RR+DLEFDV + VFLKV  M GV+ F+R+GK SP F+ PFEILE+IGPV YRLAL P LS+VH++FHVSMLRKYVPDP  VVD+EPL
Subjt:  QTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEPL

Query:  RLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEELIGNKRRKCEPTKVAYRSRSFVGSCRSRIFVDSP
         ++E+LSY EQP  +LAR+ K+LRNR I LVKVLW+N + EE    +      T  + R   F  S R     D P
Subjt:  RLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEELIGNKRRKCEPTKVAYRSRSFVGSCRSRIFVDSP

KAA0058449.1 pol protein [Cucumis melo var. makuwa]; e value: 4.5e-48; % identity: 62.05
Query:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP
        M TAQSRQKS+ D RR+DLEFDV + VFL V  M GV+ F++KGK SPHF+ PFEILE+IGPV YRLAL P  S+VH++FHVSMLRKYV DP  VVD+EP
Subjt:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP

Query:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEELIGNKRRKCEPTKVAYRSRSFVG
        L++NE+LSYEEQP  ILAR+ KVLRNR I+LVKVLW+N + EE I  +            +R  VG
Subjt:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEELIGNKRRKCEPTKVAYRSRSFVG

KAA0062520.1 pol protein [Cucumis melo var. makuwa]; e value: 1.7e-47; % identity: 68.53
Query:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP
        M TAQSRQKSY D RR+DLEF+V + VFLKV  M GV+ F+R+GK SP F+RPFEILE+IGPV YRLAL P LS+VH++FHVSMLRKYVPDP  VVD+EP
Subjt:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP

Query:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE
        L ++E+LSY EQP  +LAR+ KVLRNR I LVK+LW+N + EE
Subjt:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE

TYK22921.1 pol protein [Cucumis melo var. makuwa]; e value: 4.5e-48; % identity: 69.93
Query:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP
        MQTAQSRQKSY D RR+DLEFDV + VFLKV  M GV+ F+RKGKPSP F+  FEILEQIGPV YRLAL P LS+VH++FHVSMLRKYVPDP  VVD+EP
Subjt:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP

Query:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE
        L ++E+LSY EQP  +LA++ K+LRNR I LVK LW+N Q EE
Subjt:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SQH4 Pol protein; e value: 2.9e-48; % identity: 69.93
Query:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP
        MQTAQSRQKSY D RR+DLEFDV + VFLKV  M GV+ F+RKGKPSP F+  FEILEQIGPV YRLAL P LS+VH++FHVSMLRKYVPDP  VVD+EP
Subjt:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP

Query:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE
        L ++E+LSY EQP  +LA++ K+LRNR I LVK LW+N Q EE
Subjt:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE

A0A5A7UL17 Reverse transcriptase; e value: 3.8e-48; % identity: 59.09
Query:  QTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEPL
        +TAQSRQKSY D RR+DLEFDV + VFLKV  M GV+ F+R+GK SP F+ PFEILE+IGPV YRLAL P LS+VH++FHVSMLRKYVPDP  VVD+EPL
Subjt:  QTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEPL

Query:  RLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEELIGNKRRKCEPTKVAYRSRSFVGSCRSRIFVDSP
         ++E+LSY EQP  +LAR+ K+LRNR I LVKVLW+N + EE    +      T  + R   F  S R     D P
Subjt:  RLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEELIGNKRRKCEPTKVAYRSRSFVGSCRSRIFVDSP

A0A5A7V5L6 Reverse transcriptase; e value: 8.4e-48; % identity: 68.53
Query:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP
        M TAQSRQKSY D RR+DLEF+V + VFLKV  M GV+ F+R+GK SP F+RPFEILE+IGPV YRLAL P LS+VH++FHVSMLRKYVPDP  VVD+EP
Subjt:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP

Query:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE
        L ++E+LSY EQP  +LAR+ KVLRNR I LVK+LW+N + EE
Subjt:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE

A0A5D3BDH8 Pol protein; e value: 2.2e-48; % identity: 62.05
Query:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP
        M TAQSRQKS+ D RR+DLEFDV + VFL V  M GV+ F++KGK SPHF+ PFEILE+IGPV YRLAL P  S+VH++FHVSMLRKYV DP  VVD+EP
Subjt:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP

Query:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEELIGNKRRKCEPTKVAYRSRSFVG
        L++NE+LSYEEQP  ILAR+ KVLRNR I+LVKVLW+N + EE I  +            +R  VG
Subjt:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEELIGNKRRKCEPTKVAYRSRSFVG

A0A5D3DGY9 Pol protein; e value: 2.2e-48; % identity: 69.93
Query:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP
        MQTAQSRQKSY D RR+DLEFDV + VFLKV  M GV+ F+RKGKPSP F+  FEILEQIGPV YRLAL P LS+VH++FHVSMLRKYVPDP  VVD+EP
Subjt:  MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEP

Query:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE
        L ++E+LSY EQP  +LA++ K+LRNR I LVK LW+N Q EE
Subjt:  LRLNEDLSYEEQPTRILARDQKVLRNRAINLVKVLWKNQQEEE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCAAACTGCACAGAGTAGGCAGAAGAGCTATGATGACTCAAGGCGGAGGGACTTGGAGTTCGATGTGGACAACCATGTGTTCCTTAAGGTGCCCTCGATGATGGGAGT
TGTAAGTTTTGATCGGAAAGGGAAGCCGAGTCCACACTTTATAAGGCCATTCGAGATTTTAGAGCAGATTGGTCCAGTGACTTATCGATTGGCATTGCTGCCATTTCTCT
CTTCAGTTCATAATATCTTTCATGTCTCCATGTTGAGGAAATACGTGCCTGACCCAATGCGTGTGGTTGATTTTGAACCCTTGCGGTTGAACGAGGACCTGAGCTACGAG
GAGCAACCAACACGAATTCTCGCCAGAGACCAAAAGGTTCTCCGTAACCGAGCTATCAATCTGGTCAAGGTCTTATGGAAAAACCAACAAGAAGAGGAGCTAATTGGGAA
CAAGAGGAGGAAGTGCGAGCCAACTAAAGTCGCCTACCGAAGCCGTTCGTTCGTGGGTTCATGTCGGAGCCGCATTTTCGTGGATTCGCCGCCTCACCTTCTTGTGGCCG
ATTCACCATCTTTTGGTCCAGTTCTCAAGATGAGCCAACTAGTTCGGTGTAGATCGCCGTGCGGGCTTCCTAAAACTTGTTTCCTTTGGTCGATCCGCCGCCACTGA
mRNA sequence
GATCCGAGAATGTCCCGTCAAAGCCAACTCATGGGAAAGACAAGTGCAAGTACACCTGACGCGGCGACGGCGTACGTACGACCAAGTTATATTCCGCCGCTTATTTTACT
TGATGCTTGGTTGAGGACTGAAGAACCTCGCCTAAAGACCCGACGAGGCCACAAAGCGTTGCGATCATAACGATGGGGCAGGAGTTCCCCTCAAGGCTGGCCATGTTCGA
GGGATGAATCATGCCGCCGCACGGTCCGAAAGGTGCGACCGTGACACTAGGCACCTAGTTGAACTTCAGTAACTCGTTTCACCTCCAGACAGACGGTCAGATGGAACGGT
TGAACCAGGTGTTGGAGGACATGTTACGAGCATACGCTCTCGACTGTCCAGGCAGTTGGGATACCCATCTGCATCTGATGGAATTTGCTTACAATAATAGCTACCAAGCA
ACGATAGGTATGGCGCCGTTTGAAGCATCGTATGGAAGGAGGTGCAGAACTCCCGTTTACTGGGACGAGGTTGGTGAATGCTAGTTATTGGGTCCCGAGTTAATCCAAGT
TACGAATGATGTGATACAAAAGATCCGAGCGAGAATGCAAACTGCACAGAGTAGGCAGAAGAGCTATGATGACTCAAGGCGGAGGGACTTGGAGTTCGATGTGGACAACC
ATGTGTTCCTTAAGGTGCCCTCGATGATGGGAGTTGTAAGTTTTGATCGGAAAGGGAAGCCGAGTCCACACTTTATAAGGCCATTCGAGATTTTAGAGCAGATTGGTCCA
GTGACTTATCGATTGGCATTGCTGCCATTTCTCTCTTCAGTTCATAATATCTTTCATGTCTCCATGTTGAGGAAATACGTGCCTGACCCAATGCGTGTGGTTGATTTTGA
ACCCTTGCGGTTGAACGAGGACCTGAGCTACGAGGAGCAACCAACACGAATTCTCGCCAGAGACCAAAAGGTTCTCCGTAACCGAGCTATCAATCTGGTCAAGGTCTTAT
GGAAAAACCAACAAGAAGAGGAGCTAATTGGGAACAAGAGGAGGAAGTGCGAGCCAACTAAAGTCGCCTACCGAAGCCGTTCGTTCGTGGGTTCATGTCGGAGCCGCATT
TTCGTGGATTCGCCGCCTCACCTTCTTGTGGCCGATTCACCATCTTTTGGTCCAGTTCTCAAGATGAGCCAACTAGTTCGGTGTAGATCGCCGTGCGGGCTTCCTAAAAC
TTGTTTCCTTTGGTCGATCCGCCGCCACTGATTTCAGATCTGCAAGTAGGTTCGCCTGAGGTGTTTTCAGCCTTCATATTACCATTGGTCGGTAAATTTTAATTCTGATT
CGTGGTTTTGATTTAAAGGTCGTTGTTAAGTATGAATTCTGATTTGGTTAAGGTTGTCCTAGGACTTTATCAATGTTCGAAGGAAGGGCGACGAAATCTAAGCAAGGTCA
GCGTTGTCTTGATTCAAAATTTGGACGGATGTATGTGGAAACAGAGTCATTAGACCCTTAGGCTTCGTGTTTTAATGTTGTTGTGGATTGGTGGGTTAATTTTGGAGTTC
TGATTAAAC
Protein sequence
MQTAQSRQKSYDDSRRRDLEFDVDNHVFLKVPSMMGVVSFDRKGKPSPHFIRPFEILEQIGPVTYRLALLPFLSSVHNIFHVSMLRKYVPDPMRVVDFEPLRLNEDLSYE
EQPTRILARDQKVLRNRAINLVKVLWKNQQEEELIGNKRRKCEPTKVAYRSRSFVGSCRSRIFVDSPPHLLVADSPSFGPVLKMSQLVRCRSPCGLPKTCFLWSIRRH