; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020814 (gene) of Snake gourd v1 genome

Gene IDTan0020814
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNA-directed DNA polymerase (reverse transcriptase)-related family protein
Genome locationLG05:10996250..10999926
RNA-Seq ExpressionTan0020814
SyntenyTan0020814
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]8.9e-3136.11Show/hide
Query:  GLGSTHKGLFPWNWSYRVITHVP-----HLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQ
        G GST K  F   W  R  T  P        +  VA F+   G WDV  +      +D   IL +PIS  N+QDSW+WH+D+  +YS+RSGYK  M  K 
Subjt:  GLGSTHKGLFPWNWSYRVITHVP-----HLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQ

Query:  DKASCSENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVG-SVWDI
        +  S S N     W  +W + +PTKIK+F+WR+ H  +PT+Q+L  R + + P C +C    E+  HA F CK A + W  L P +  ++     S  ++
Subjt:  DKASCSENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVG-SVWDI

Query:  WLDATKLLTSEKLEIA
        W   T+ L  + L +A
Subjt:  WLDATKLLTSEKLEIA

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa]1.2e-2736.3Show/hide
Query:  VAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVWRTF
        V++++    +W++  L+   SP DV  IL +P+S   VQD W+WHHD +  YS+ +GY      ++ + S   N    WWK  W   +P K+K+F WR  
Subjt:  VAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVWRTF

Query:  HNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWE
         + +P ++SLF +K++ S  C +C    ET GHALF+C  A + W+
Subjt:  HNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWE

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]1.5e-3030.77Show/hide
Query:  GLGSTHKGLFPWNWSYRVITHVPHLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASC
        G+   H    P N  ++ +      S L VA+++ D+ +WD+  L    SP D+  IL +P+S  + +D W WH+D +  Y+++SGY      +    S 
Subjt:  GLGSTHKGLFPWNWSYRVITHVPHLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASC

Query:  SENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVGSVWDIWLDATK
        S  +   WW++ W + +P+K+++F WR  ++ LP +Q+LF RK+I S  C +C +  E+ GHALF+C  A   W+    ++           D  L  + 
Subjt:  SENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVGSVWDIWLDATK

Query:  LLTSEKLE
        +LT  +LE
Subjt:  LLTSEKLE

XP_030504959.1 uncharacterized protein LOC115719927 [Cannabis sativa]5.4e-2838.67Show/hide
Query:  LKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYK-AVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVW
        + V+  + +   W++  L  F  P DV  IL +P+S     D  +WHH  + SY+++SG+  A  L  +D  S S+ NL  WWK  W + +P KI++F W
Subjt:  LKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYK-AVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVW

Query:  RTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEE
        + +HN++PT+ +L  RK+IDS  C +C    E+ GHALF CK A + W+E
Subjt:  RTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEE

XP_030509050.1 uncharacterized protein LOC115723712 [Cannabis sativa]3.7e-2935.52Show/hide
Query:  ELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVW
        ++KVA+F+  S QWDV KL+QF +P DV  IL +P+S    +D  +WH+     Y+++SGYK +     D    S +    WW+  W +++P+KI++F W
Subjt:  ELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVW

Query:  RTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVGSVWDIWLDATKLLTSEKLEI
        R +H  LPT+  L  R +  SPQCP+C    ET  HA F C  A + W+     +        S  D  +  +  L +EK+E+
Subjt:  RTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVGSVWDIWLDATKLLTSEKLEI

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248744.3e-3136.11Show/hide
Query:  GLGSTHKGLFPWNWSYRVITHVP-----HLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQ
        G GST K  F   W  R  T  P        +  VA F+   G WDV  +      +D   IL +PIS  N+QDSW+WH+D+  +YS+RSGYK  M  K 
Subjt:  GLGSTHKGLFPWNWSYRVITHVP-----HLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQ

Query:  DKASCSENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVG-SVWDI
        +  S S N     W  +W + +PTKIK+F+WR+ H  +PT+Q+L  R + + P C +C    E+  HA F CK A + W  L P +  ++     S  ++
Subjt:  DKASCSENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVG-SVWDI

Query:  WLDATKLLTSEKLEIA
        W   T+ L  + L +A
Subjt:  WLDATKLLTSEKLEIA

A0A803NN93 Uncharacterized protein6.2e-3041.78Show/hide
Query:  VAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYK-AVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVWRT
        VA++++   QWD+ KL    S  DV +ILK+P+S     DSWVWH+    SY++ SGY  A  ++ Q+ +SCS  +   WWK  W + +P+K+K+F WR 
Subjt:  VAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYK-AVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVWRT

Query:  FHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFW
         HN LP++ +L  R++I+S  C +C    E+ GHALF+CK A   W
Subjt:  FHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFW

A0A803NTN0 Uncharacterized protein2.8e-3040.79Show/hide
Query:  SELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYK-AVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLF
        S L V+ F+ D  +W++  L  +  P DV  IL +P+S    QD  +WHH  +  Y+++SG+  A  L +Q+ +S S+ N RGWWK  W + +P KI++F
Subjt:  SELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYK-AVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLF

Query:  VWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEE
         W+  +NVLP + +LF +K+IDS  C +C    E+ GHALF CK A   W+E
Subjt:  VWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEE

A0A803PIB6 Uncharacterized protein7.4e-3130.77Show/hide
Query:  GLGSTHKGLFPWNWSYRVITHVPHLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASC
        G+   H    P N  ++ +      S L VA+++ D+ +WD+  L    SP D+  IL +P+S  + +D W WH+D +  Y+++SGY      +    S 
Subjt:  GLGSTHKGLFPWNWSYRVITHVPHLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASC

Query:  SENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVGSVWDIWLDATK
        S  +   WW++ W + +P+K+++F WR  ++ LP +Q+LF RK+I S  C +C +  E+ GHALF+C  A   W+    ++           D  L  + 
Subjt:  SENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVGSVWDIWLDATK

Query:  LLTSEKLE
        +LT  +LE
Subjt:  LLTSEKLE

A0A803QAN3 Uncharacterized protein4.8e-3035.09Show/hide
Query:  GLFPWNWSYRVITHVPHLS--ELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASCSENNL
        G  PW  SY     V +     L V+ F+ D  +W++  L +F  P DV  IL +P+S    QD  +WHH  +  Y+++SG+      ++   S + +  
Subjt:  GLFPWNWSYRVITHVPHLS--ELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASCSENNL

Query:  RGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEE
        R WW+  W + +P K+++F W+ FHN+LP + +LF +K+IDS  C +C    E+ GHALF CK A   W +
Subjt:  RGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEE

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657509.0e-1022.78Show/hide
Query:  WDVGKLEQFLSPQDVGDILKVPIS-QTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQS
        WD  K++ + +     ++  V +   T  +D   W   Q+  +S+RS Y+ + + +  +      N+  ++  +W VR+P ++K F+W   +  + T + 
Subjt:  WDVGKLEQFLSPQDVGDILKVPIS-QTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQS

Query:  LFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVGSVWDIWL
           R L  S  C VC    E+  H L  C      W  ++P+ +    +  S+++ WL
Subjt:  LFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVGSVWDIWL

Arabidopsis top hitse value%identityAlignment
AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.2e-0435.82Show/hide
Query:  WWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFW
        W K VW      K    +W +  + LPT Q L     I S  C +C    E+  H LF+C+FAA+ W
Subjt:  WWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFW

AT3G09510.1 Ribonuclease H-like superfamily protein1.0e-1629.03Show/hide
Query:  QNLAIGKNGLGSTHKGLFPWNW--SYRVITHVPHLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAV
        QN+ IG + +  +H    P N   +Y+ +T + +L E K + +      WD  K+ QF+   D G I ++ ++++   D  +W+++    Y++RSGY  +
Subjt:  QNLAIGKNGLGSTHKGLFPWNW--SYRVITHVPHLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAV

Query:  M--LSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFW
            S    A    +        +W + I  K+K F+WR     L T++ L  R +   P CP C +  E+  HALFTC FA   W
Subjt:  M--LSKQDKASCSENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFW

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.3e-0736.92Show/hide
Query:  WWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAK
        W   +W ++I  KIKL +W+  +N LP    L  R +   P C  C +  ET  H LF C FA +
Subjt:  WWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAAAATCCAAAATCTGGCCATTGGAAAAAACGGGCTCGGATCAACGCACAAAGGACTGTTTCCGTGGAATTGGTCTTACCGTGTTATTACGCATGTTCCACATTT
GAGTGAGCTGAAGGTAGCAGAGTTCGTTAATGATTCTGGGCAATGGGATGTTGGCAAATTAGAACAGTTCCTTTCTCCCCAAGATGTAGGAGATATTTTGAAAGTACCCA
TTTCACAAACAAATGTCCAGGATAGTTGGGTTTGGCATCATGATCAGAATTGTAGTTACTCTATCAGGAGTGGGTATAAGGCAGTTATGCTTTCTAAGCAAGACAAGGCA
TCCTGTAGTGAGAATAATTTAAGAGGGTGGTGGAAGGTAGTTTGGGTTGTAAGGATTCCAACAAAGATTAAACTGTTTGTATGGAGGACATTTCACAATGTTCTTCCAAC
TTCTCAGTCTCTATTTGACCGAAAATTGATTGACTCTCCCCAATGCCCTGTGTGCATGAAGGTTGCTGAGACAACGGGTCATGCATTATTCACGTGCAAGTTTGCTGCTA
AGTTCTGGGAGGAGTTGTTGCCCGAGATGAAGGGAGTTACTATTTGGGTTGGCTCAGTTTGGGACATTTGGCTTGATGCTACAAAGCTCCTGACATCTGAGAAGCTGGAA
ATTGCATATATGGGGGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAAAAATCCAAAATCTGGCCATTGGAAAAAACGGGCTCGGATCAACGCACAAAGGACTGTTTCCGTGGAATTGGTCTTACCGTGTTATTACGCATGTTCCACATTT
GAGTGAGCTGAAGGTAGCAGAGTTCGTTAATGATTCTGGGCAATGGGATGTTGGCAAATTAGAACAGTTCCTTTCTCCCCAAGATGTAGGAGATATTTTGAAAGTACCCA
TTTCACAAACAAATGTCCAGGATAGTTGGGTTTGGCATCATGATCAGAATTGTAGTTACTCTATCAGGAGTGGGTATAAGGCAGTTATGCTTTCTAAGCAAGACAAGGCA
TCCTGTAGTGAGAATAATTTAAGAGGGTGGTGGAAGGTAGTTTGGGTTGTAAGGATTCCAACAAAGATTAAACTGTTTGTATGGAGGACATTTCACAATGTTCTTCCAAC
TTCTCAGTCTCTATTTGACCGAAAATTGATTGACTCTCCCCAATGCCCTGTGTGCATGAAGGTTGCTGAGACAACGGGTCATGCATTATTCACGTGCAAGTTTGCTGCTA
AGTTCTGGGAGGAGTTGTTGCCCGAGATGAAGGGAGTTACTATTTGGGTTGGCTCAGTTTGGGACATTTGGCTTGATGCTACAAAGCTCCTGACATCTGAGAAGCTGGAA
ATTGCATATATGGGGGCTTAG
Protein sequenceShow/hide protein sequence
MTKIQNLAIGKNGLGSTHKGLFPWNWSYRVITHVPHLSELKVAEFVNDSGQWDVGKLEQFLSPQDVGDILKVPISQTNVQDSWVWHHDQNCSYSIRSGYKAVMLSKQDKA
SCSENNLRGWWKVVWVVRIPTKIKLFVWRTFHNVLPTSQSLFDRKLIDSPQCPVCMKVAETTGHALFTCKFAAKFWEELLPEMKGVTIWVGSVWDIWLDATKLLTSEKLE
IAYMGA