; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004918 (gene) of Snake gourd v1 genome

Gene IDTan0004918
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG01:99622099..99622527
RNA-Seq ExpressionTan0004918
SyntenyTan0004918
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026280.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]5.6e-3159.84Show/hide
Query:  EAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRVMRCPEDRKV
        EA         G   S+ ESS P  E N+EEQ+  R+ QRL   +  AQ+  EKKYGIERLKALGATTF GTT+P D E WL LIEKCF+V RC EDRKV
Subjt:  EAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRVMRCPEDRKV

Query:  DLAAFLLQKGADDWWKITESRK
        +LAAFLLQ GA+DWW + ESR+
Subjt:  DLAAFLLQKGADDWWKITESRK

KAA0035225.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]7.8e-3358.52Show/hide
Query:  GKRGRQVEAGTQEATGD--RGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRV
        G+  +  +A T  A  +   G   S+ ESS P  E NMEEQ+  R+ QRL   + SAQ+ PEKKYGIERLKALGATTF GTT+PAD E WL LIEKCFRV
Subjt:  GKRGRQVEAGTQEATGD--RGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRV

Query:  MRCPEDRKVDLAAFLLQKG----ADDWWKITESRK
         RCPEDRKV+LA+FLLQ G     +DWW++ ESR+
Subjt:  MRCPEDRKVDLAAFLLQKG----ADDWWKITESRK

KAA0036813.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]8.6e-3258.21Show/hide
Query:  MVRGGKRGR-QVEAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKC
        M RG  R     EA         G   S+ ESS P+ E N+EEQ+  R+ QRL   + SAQ+ PEKKYG ERLKALGATTF GTT+P DVE WL LIEKC
Subjt:  MVRGGKRGR-QVEAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKC

Query:  FRVMRCPEDRKVDLAAFLLQKGADDWWKITESRK
        FRV R  EDRKV+LAAFLLQ  A+DWW++ ESR+
Subjt:  FRVMRCPEDRKVDLAAFLLQKGADDWWKITESRK

TYJ95881.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]3.9e-3256.85Show/hide
Query:  MVRGGKRGR-QVEAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKC
        M RG  R     EA         G   S+ ESS P+ E N+EEQ+  R+ QRL   + SAQ+ PEKKYG ERLKALGATTF GTT+P DVE WL LIEKC
Subjt:  MVRGGKRGR-QVEAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKC

Query:  FRVMRCPEDRKVDLAAFLLQKGADDWWKITESRK---GDAWSWKKF
        FRV R  EDRKV+LAAFLLQ  A+DWW++ ESR+   GD  SW +F
Subjt:  FRVMRCPEDRKVDLAAFLLQKGADDWWKITESRK---GDAWSWKKF

XP_038896416.1 uncharacterized protein LOC120084680 [Benincasa hispida]7.8e-3373.91Show/hide
Query:  MEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGADDWWKITES
        ME+++F RITQRLA SVGS Q  PEKK+GIERLKALGATTF+GTTDP D E+WL LIEKCF+VMRCPED KV+LA FLLQKG + WWK+ E+
Subjt:  MEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRVMRCPEDRKVDLAAFLLQKGADDWWKITES

TrEMBL top hitse value%identityAlignment
A0A5A7SJ99 DNA/RNA polymerases superfamily protein2.7e-3159.84Show/hide
Query:  EAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRVMRCPEDRKV
        EA         G   S+ ESS P  E N+EEQ+  R+ QRL   +  AQ+  EKKYGIERLKALGATTF GTT+P D E WL LIEKCF+V RC EDRKV
Subjt:  EAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRVMRCPEDRKV

Query:  DLAAFLLQKGADDWWKITESRK
        +LAAFLLQ GA+DWW + ESR+
Subjt:  DLAAFLLQKGADDWWKITESRK

A0A5A7T1M0 Reverse transcriptase4.2e-3258.21Show/hide
Query:  MVRGGKRGR-QVEAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKC
        M RG  R     EA         G   S+ ESS P+ E N+EEQ+  R+ QRL   + SAQ+ PEKKYG ERLKALGATTF GTT+P DVE WL LIEKC
Subjt:  MVRGGKRGR-QVEAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKC

Query:  FRVMRCPEDRKVDLAAFLLQKGADDWWKITESRK
        FRV R  EDRKV+LAAFLLQ  A+DWW++ ESR+
Subjt:  FRVMRCPEDRKVDLAAFLLQKGADDWWKITESRK

A0A5A7U067 Retrotrans_gag domain-containing protein1.5e-2953.1Show/hide
Query:  MVRGGKRGRQV-EAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKC
        M RG  R   V EA         G E S+ ESS    EVN+EEQ+  R+ Q L   + SAQ+  EK + IERLKALGATTF GTT+ AD E WL LIEKC
Subjt:  MVRGGKRGRQV-EAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKC

Query:  FRVMRCPEDRKVDLAAFLLQKGADDWWKITESRKGDA--WSWKKF
        F+VMRC EDRKV+L  FLL+ G +DWW++TESR+      SW +F
Subjt:  FRVMRCPEDRKVDLAAFLLQKGADDWWKITESRKGDA--WSWKKF

A0A5D3BB91 Reverse transcriptase1.9e-3256.85Show/hide
Query:  MVRGGKRGR-QVEAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKC
        M RG  R     EA         G   S+ ESS P+ E N+EEQ+  R+ QRL   + SAQ+ PEKKYG ERLKALGATTF GTT+P DVE WL LIEKC
Subjt:  MVRGGKRGR-QVEAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKC

Query:  FRVMRCPEDRKVDLAAFLLQKGADDWWKITESRK---GDAWSWKKF
        FRV R  EDRKV+LAAFLLQ  A+DWW++ ESR+   GD  SW +F
Subjt:  FRVMRCPEDRKVDLAAFLLQKGADDWWKITESRK---GDAWSWKKF

A0A5D3DES5 DNA/RNA polymerases superfamily protein3.8e-3358.52Show/hide
Query:  GKRGRQVEAGTQEATGD--RGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRV
        G+  +  +A T  A  +   G   S+ ESS P  E NMEEQ+  R+ QRL   + SAQ+ PEKKYGIERLKALGATTF GTT+PAD E WL LIEKCFRV
Subjt:  GKRGRQVEAGTQEATGD--RGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRV

Query:  MRCPEDRKVDLAAFLLQKG----ADDWWKITESRK
         RCPEDRKV+LA+FLLQ G     +DWW++ ESR+
Subjt:  MRCPEDRKVDLAAFLLQKG----ADDWWKITESRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGGTGGCAAACGAGGCAGGCAAGTGGAGGCCGGAACTCAAGAAGCTACTGGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAGATCTTCACGAGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCAGCACAAGCAAAACCAGAAAAAAAGTATGGCATTGAAAGACTGAAGG
CCTTAGGTGCAACAACATTTGAAGGCACGACAGATCCCGCTGATGTTGAGGTTTGGTTAAATCTGATTGAGAAGTGTTTTAGGGTCATGCGATGCCCTGAAGATAGGAAG
GTCGATTTAGCAGCATTTTTACTCCAGAAAGGAGCAGATGATTGGTGGAAGATAACAGAGAGCAGAAAAGGGGATGCTTGGAGCTGGAAAAAGTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGAGGTGGCAAACGAGGCAGGCAAGTGGAGGCCGGAACTCAAGAAGCTACTGGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAGATCTTCACGAGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCAGCACAAGCAAAACCAGAAAAAAAGTATGGCATTGAAAGACTGAAGG
CCTTAGGTGCAACAACATTTGAAGGCACGACAGATCCCGCTGATGTTGAGGTTTGGTTAAATCTGATTGAGAAGTGTTTTAGGGTCATGCGATGCCCTGAAGATAGGAAG
GTCGATTTAGCAGCATTTTTACTCCAGAAAGGAGCAGATGATTGGTGGAAGATAACAGAGAGCAGAAAAGGGGATGCTTGGAGCTGGAAAAAGTTTTGA
Protein sequenceShow/hide protein sequence
MVRGGKRGRQVEAGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQAKPEKKYGIERLKALGATTFEGTTDPADVEVWLNLIEKCFRVMRCPEDRK
VDLAAFLLQKGADDWWKITESRKGDAWSWKKF