CuGenDBv2

Tan0002663 (gene) of Snake gourd v1 genome

Gene ID: Tan0002663
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DNA/RNA polymerases superfamily protein
Genome location: LG01:74384939..74385325
RNA-Seq Expression: Tan0002663
Synteny: Tan0002663
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026280.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]; e value: 3.5e-32; % identity: 64.86
Query:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA
        G   S+ ESS P  + N+EEQ+  R+ QRL   +  AQ+D EKKYGIERLKALGATTF GTT+P DAE WL LIEKCF+V RC EDRKV+LAAFLLQ GA
Subjt:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA

Query:  DDWWKITESRK
        +DWW + ESR+
Subjt:  DDWWKITESRK

KAA0035225.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]; e value: 2.2e-34; % identity: 66.09
Query:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKG-
        G   S+ ESS P  + NMEEQ+  R+ QRL   + SAQ+DPEKKYGIERLKALGATTF GTT+PADAE WL LIEKCFRV RCPEDRKV+LA+FLLQ G 
Subjt:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKG-

Query:  ---ADDWWKITESRK
            +DWW++ ESR+
Subjt:  ---ADDWWKITESRK

KAA0036813.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]; e value: 2.7e-32; % identity: 63.96
Query:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA
        G   S+ ESS P+ + N+EEQ+  R+ QRL   + SAQ+DPEKKYG ERLKALGATTF GTT+P D E WL LIEKCFRV R  EDRKV+LAAFLLQ  A
Subjt:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA

Query:  DDWWKITESRK
        +DWW++ ESR+
Subjt:  DDWWKITESRK

TYJ95881.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]; e value: 2.7e-32; % identity: 63.96
Query:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA
        G   S+ ESS P+ + N+EEQ+  R+ QRL   + SAQ+DPEKKYG ERLKALGATTF GTT+P D E WL LIEKCFRV R  EDRKV+LAAFLLQ  A
Subjt:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA

Query:  DDWWKITESRK
        +DWW++ ESR+
Subjt:  DDWWKITESRK

XP_038896416.1 uncharacterized protein LOC120084680 [Benincasa hispida]; e value: 4.9e-34; % identity: 75
Query:  MEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGADDWWKITES
        ME+++F RITQRL  SVGS Q DPEKK+GIERLKALGATTF+GTTDP DAE+WL LIEKCF+VMRCPED KV+LA FLLQKG + WWK+ E+
Subjt:  MEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGADDWWKITES
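
The % identity column can be cross-checked against the alignment itself. The sketch below assumes the usual BLAST convention that a letter in the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch, and that % identity is identical positions divided by alignment length. The function name `percent_identity` is hypothetical; the two strings are copied from the XP_038896416.1 alignment above.

```python
# Query row and middle (match) line of the XP_038896416.1 alignment.
QUERY = (
    "MEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCF"
    "RVMRCPEDRKVDLAAFLLQKGADDWWKITES"
)
MIDLINE = (
    "ME+++F RITQRL  SVGS Q DPEKK+GIERLKALGATTF+GTTDP DAE+WL LIEKCF"
    "+VMRCPED KV+LA FLLQKG + WWK+ E+"
)


def percent_identity(query_row: str, midline: str) -> float:
    """Identical positions (letters in the midline) over alignment length.

    The alignment length is taken from the query row, which includes any
    gap characters, so whitespace damage in the midline does not matter.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query_row)


print(f"{percent_identity(QUERY, MIDLINE):.2f}")
```

For this hit the midline has 69 identical positions over 92 aligned columns, which agrees with the reported 75.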

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SJ99 DNA/RNA polymerases superfamily protein; e value: 1.7e-32; % identity: 64.86
Query:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA
        G   S+ ESS P  + N+EEQ+  R+ QRL   +  AQ+D EKKYGIERLKALGATTF GTT+P DAE WL LIEKCF+V RC EDRKV+LAAFLLQ GA
Subjt:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA

Query:  DDWWKITESRK
        +DWW + ESR+
Subjt:  DDWWKITESRK

A0A5A7T1M0 Reverse transcriptase; e value: 1.3e-32; % identity: 63.96
Query:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA
        G   S+ ESS P+ + N+EEQ+  R+ QRL   + SAQ+DPEKKYG ERLKALGATTF GTT+P D E WL LIEKCFRV R  EDRKV+LAAFLLQ  A
Subjt:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA

Query:  DDWWKITESRK
        +DWW++ ESR+
Subjt:  DDWWKITESRK

A0A5A7U067 Retrotrans_gag domain-containing protein; e value: 6.0e-30; % identity: 60.36
Query:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA
        G E S+ ESS    +VN+EEQ+  R+ Q L   + SAQ++ EK + IERLKALGATTF GTT+ ADAE WL LIEKCF+VMRC EDRKV+L  FLL+ G 
Subjt:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA

Query:  DDWWKITESRK
        +DWW++TESR+
Subjt:  DDWWKITESRK

A0A5D3BB91 Reverse transcriptase; e value: 1.3e-32; % identity: 63.96
Query:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA
        G   S+ ESS P+ + N+EEQ+  R+ QRL   + SAQ+DPEKKYG ERLKALGATTF GTT+P D E WL LIEKCFRV R  EDRKV+LAAFLLQ  A
Subjt:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGA

Query:  DDWWKITESRK
        +DWW++ ESR+
Subjt:  DDWWKITESRK

A0A5D3DES5 DNA/RNA polymerases superfamily protein; e value: 1.1e-34; % identity: 66.09
Query:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKG-
        G   S+ ESS P  + NMEEQ+  R+ QRL   + SAQ+DPEKKYGIERLKALGATTF GTT+PADAE WL LIEKCFRV RCPEDRKV+LA+FLLQ G 
Subjt:  GREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKG-

Query:  ---ADDWWKITESRK
            +DWW++ ESR+
Subjt:  ---ADDWWKITESRK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAGGTCGAAACTCAAGAAGCTACCGACGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCAAGTAATCCCCAACAACAGGTGAACATGGAGGAACAGATCTTCACGAG
GATAACTCAAAGATTAGGTGAAAGTGTTGGATCAGCACAAGCAGATCCAGAAAAGAAGTATGGCATTGAAAGACTGAAGGCATTAGGTGCAACAACATTTGAAGGCACGA
CAGATCCCGCTGATGCTGAGGTTTGGTTAAATTTGATTGAGAAGTGTTTTAGGGTCATGCGATGCCCTGAAGACAGGAAGGTCGATTTAGCAGCATTCTTGCTTCAGAAA
GGAGCAGATGATTGGTGGAAGATAACAGAGAGTAGAAAAGGGGAAGCTTGGAGCTAG
mRNA sequence
ATGGAGGTCGAAACTCAAGAAGCTACCGACGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCAAGTAATCCCCAACAACAGGTGAACATGGAGGAACAGATCTTCACGAG
GATAACTCAAAGATTAGGTGAAAGTGTTGGATCAGCACAAGCAGATCCAGAAAAGAAGTATGGCATTGAAAGACTGAAGGCATTAGGTGCAACAACATTTGAAGGCACGA
CAGATCCCGCTGATGCTGAGGTTTGGTTAAATTTGATTGAGAAGTGTTTTAGGGTCATGCGATGCCCTGAAGACAGGAAGGTCGATTTAGCAGCATTCTTGCTTCAGAAA
GGAGCAGATGATTGGTGGAAGATAACAGAGAGTAGAAAAGGGGAAGCTTGGAGCTAG
Protein sequence
MEVETQEATDDRGREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQK
GADDWWKITESRKGEAWS
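
The three sequences and the genome location can be checked against each other. The sketch below translates the CDS with the standard genetic code (NCBI translation table 1, assumed here because the record does not name a table) and compares the result with the listed protein. It also compares the CDS length with the span of the genome location, assuming the coordinates are 1-based and inclusive. The helper names `translate` and `span_length` are hypothetical.

```python
from itertools import product

# Sequences and location copied from this record.
CDS = (
    "ATGGAGGTCGAAACTCAAGAAGCTACCGACGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCAAGTAATCCCCAACAACAGGTGAACATGGAGGAACAGATCTTCACGAG"
    "GATAACTCAAAGATTAGGTGAAAGTGTTGGATCAGCACAAGCAGATCCAGAAAAGAAGTATGGCATTGAAAGACTGAAGGCATTAGGTGCAACAACATTTGAAGGCACGA"
    "CAGATCCCGCTGATGCTGAGGTTTGGTTAAATTTGATTGAGAAGTGTTTTAGGGTCATGCGATGCCCTGAAGACAGGAAGGTCGATTTAGCAGCATTCTTGCTTCAGAAA"
    "GGAGCAGATGATTGGTGGAAGATAACAGAGAGTAGAAAAGGGGAAGCTTGGAGCTAG"
)
PROTEIN = (
    "MEVETQEATDDRGREVSEGESSNPQQQVNMEEQIFTRITQRLGESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQK"
    "GADDWWKITESRKGEAWS"
)
LOCATION = "LG01:74384939..74385325"

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def span_length(location: str) -> int:
    """Length of a 'chrom:start..end' span with inclusive coordinates."""
    start, end = location.split(":")[1].split("..")
    return int(end) - int(start) + 1


print(translate(CDS) == PROTEIN + "*")
print(span_length(LOCATION), len(CDS))
```

For this record the translation reproduces the 128-residue protein followed by a stop codon, and both the genomic span and the CDS are 387 bp long. Together with the identical CDS and mRNA sequences, that is consistent with a single-exon gene model annotated without UTRs.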