; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001433 (gene) of Snake gourd v1 genome

Gene IDTan0001433
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationLG08:71927482..71927934
RNA-Seq ExpressionTan0001433
SyntenyTan0001433
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MCH81586.1 RNA-directed DNA polymerase (Reverse transcriptase) [Trifolium medium]3.1e-2444.7Show/hide
Query:  NGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHG---SVHSLLQE-DGNWDTEKIQSNFNMEDAQHI
        NG FL A+LG NPS+ WRS+     L   GYRWK+G G  + I + PW  +   + PL+  +   H     V SL+    G WD E IQ+NFN  DA+ I
Subjt:  NGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHG---SVHSLLQE-DGNWDTEKIQSNFNMEDAQHI

Query:  LQIPRLGSPIDDEIVWKYNKRGIFTVKSAYRL
        L+IP L     DEI+W+Y+K+G+++VKSAYR+
Subjt:  LQIPRLGSPIDDEIVWKYNKRGIFTVKSAYRL

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]1.9e-2945.93Show/hide
Query:  GKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQIPR
        G FL+A LG+ PS+ WRSILWG DLF KGYRWK+G+G  +N+  DPW   +G   P+     + + SV  L++  G WD  K++ +F + +A  ILQ P 
Subjt:  GKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQIPR

Query:  LGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYAL
             DDEI+W  +K GIF+V+SAY LG++L   L
Subjt:  LGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYAL

XP_024033484.1 uncharacterized protein LOC112095607 [Citrus clementina]1.2e-2340.71Show/hide
Query:  FLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQIPRLG
        FL A LGSNP F WRSI+WG  + L G RW+IG G +V I    W       +P    +   +  V  L+ +D  WD  KI  +F+  DA  I+ +P   
Subjt:  FLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQIPRLG

Query:  SPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYALEAPSSNK
         P DD+I+W Y+K+G ++VKS Y++ LRL++ L  PSS++
Subjt:  SPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYALEAPSSNK

XP_030502765.1 uncharacterized protein LOC115717936 [Cannabis sativa]4.1e-2439.31Show/hide
Query:  SNGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQI
        SNG F+ + LGSNPS TWRS+ WG +L LKG RW++GSG  +N   D W       KP        +  V  L+ E   WD   +Q+NF+  D   IL I
Subjt:  SNGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQI

Query:  PRLGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYALEAPSSNKI
        P    P DD ++W ++  GI+ VKS Y+L + L    +  SS+ +
Subjt:  PRLGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYALEAPSSNKI

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]4.1e-2438.62Show/hide
Query:  SNGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQI
        SNG +L A LGSNPS TWRS++WG +L LKG RW++GSG ++N   D W       KP        +  V  L+ E   WD   +++NFN  D   +L I
Subjt:  SNGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQI

Query:  PRLGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYALEAPSSNKI
        P    P DD ++W  +  G++ VKS Y   + L    ++  SN I
Subjt:  PRLGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYALEAPSSNKI

TrEMBL top hitse value%identityAlignment
A0A1S8AC01 Ribonuclease H-like superfamily protein4.4e-2441.43Show/hide
Query:  FLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQIPRLG
        FL A LGSNPSF WRSI+WG  + L G RW+IG G +V I    W       +P    +   +  V  L+ +D  WD  KI  +F+  DA  I+ +P   
Subjt:  FLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQIPRLG

Query:  SPIDDEIVWKYNKRGIFTVKSAYRLGLRLRY-ALEAPSSN
         P DD+I+W Y+K+G ++VKS Y++ LRL++ AL + S N
Subjt:  SPIDDEIVWKYNKRGIFTVKSAYRLGLRLRY-ALEAPSSN

A0A392M2U4 RNA-directed DNA polymerase (Reverse transcriptase) (Fragment)1.5e-2444.7Show/hide
Query:  NGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHG---SVHSLLQE-DGNWDTEKIQSNFNMEDAQHI
        NG FL A+LG NPS+ WRS+     L   GYRWK+G G  + I + PW  +   + PL+  +   H     V SL+    G WD E IQ+NFN  DA+ I
Subjt:  NGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHG---SVHSLLQE-DGNWDTEKIQSNFNMEDAQHI

Query:  LQIPRLGSPIDDEIVWKYNKRGIFTVKSAYRL
        L+IP L     DEI+W+Y+K+G+++VKSAYR+
Subjt:  LQIPRLGSPIDDEIVWKYNKRGIFTVKSAYRL

A0A6J1DRA0 uncharacterized protein LOC1110224239.2e-3045.93Show/hide
Query:  GKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQIPR
        G FL+A LG+ PS+ WRSILWG DLF KGYRWK+G+G  +N+  DPW   +G   P+     + + SV  L++  G WD  K++ +F + +A  ILQ P 
Subjt:  GKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQIPR

Query:  LGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYAL
             DDEI+W  +K GIF+V+SAY LG++L   L
Subjt:  LGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYAL

A0A803NHG3 Uncharacterized protein2.6e-2439.42Show/hide
Query:  LKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWF----TIEGREKP----LMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHI
        L A  G++ SF WRS++WG ++ LKGYRW++G+G QV +++DPW     + +  +KP     +   DLTH S        G WD   I++NFN EDA+ I
Subjt:  LKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWF----TIEGREKP----LMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHI

Query:  LQIPRLGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLR
        L++P L   ++D+++W Y++ G +TV+S YR+   +R
Subjt:  LQIPRLGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLR

A0A803QJV0 Uncharacterized protein4.0e-2537.82Show/hide
Query:  SILSNGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWF----TIEGREKP----LMVASDLTHGSVHSLLQEDGNWDTEKIQSNF
        S   N   L A  G++ SF WRS++WG ++ LKGYRW++G+G QV +++DPW     + +  +KP     +   DLTH S        G WD   I++NF
Subjt:  SILSNGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWF----TIEGREKP----LMVASDLTHGSVHSLLQEDGNWDTEKIQSNF

Query:  NMEDAQHILQIPRLGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYALEAPSSNKI
        N EDA+ IL++P L   ++D+++W Y++ G +TV+S YR+   +R + EA S  ++
Subjt:  NMEDAQHILQIPRLGSPIDDEIVWKYNKRGIFTVKSAYRLGLRLRYALEAPSSNKI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein4.7e-1030.71Show/hide
Query:  LKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGN---WDTEKIQSNFNMEDAQHILQIPR
        L A +    S+ W S+L G+ L  KG R  IG G  + I  D         +PL         ++++L +  G+   WD  KI    +  D   I +I  
Subjt:  LKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGN---WDTEKIQSNFNMEDAQHILQIPR

Query:  LGSPIDDEIVWKYNKRGIFTVKSAYRL
          S   D+I+W YN  G +TV+S Y L
Subjt:  LGSPIDDEIVWKYNKRGIFTVKSAYRL

AT4G29090.1 Ribonuclease H-like superfamily protein2.3e-0930.53Show/hide
Query:  LKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIE--------GREKPLMVASDLTHGSVHSLLQEDG-NWDTEKIQSNFNMEDAQH
        L A LGS PSF W+SI    ++  +G R  +G+G  + I    W   +         R  P   AS  +   V  L+ E G  W  + I+  F   + + 
Subjt:  LKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIE--------GREKPLMVASDLTHGSVHSLLQEDG-NWDTEKIQSNFNMEDAQH

Query:  ILQIPRLGSPIDDEIVWKYNKRGIFTVKSAY
        I ++   G  I D   W Y   G +TVKS Y
Subjt:  ILQIPRLGSPIDDEIVWKYNKRGIFTVKSAY

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.9e-0431.03Show/hide
Query:  NGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPL
        +   ++ ++G+ PS+ WRSI+ G +L  +G    IG G    +  D W   E    PL
Subjt:  NGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGTCGATACTTTCAAACGGGAAGTTCCTAAAAGCAACTCTTGGCTCTAACCCATCTTTTACATGGCGGAGTATCTTATGGGGTCTGGATCTTTTCTTGAAAGGGTA
TAGATGGAAAATTGGGAGTGGACACCAAGTCAATATCATGGATGATCCATGGTTTACCATTGAAGGCCGGGAGAAACCGTTGATGGTTGCTTCCGATCTTACTCATGGCT
CAGTCCACTCCCTTCTACAGGAGGATGGTAACTGGGATACAGAAAAGATCCAGAGTAATTTTAACATGGAAGATGCCCAACATATATTACAAATTCCCCGTTTGGGCTCT
CCTATAGATGATGAAATTGTATGGAAATATAATAAACGTGGAATCTTCACAGTAAAGAGTGCGTACCGATTGGGACTTAGACTCCGCTATGCCTTGGAGGCGCCAAGCTC
TAACAAAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGGTCGATACTTTCAAACGGGAAGTTCCTAAAAGCAACTCTTGGCTCTAACCCATCTTTTACATGGCGGAGTATCTTATGGGGTCTGGATCTTTTCTTGAAAGGGTA
TAGATGGAAAATTGGGAGTGGACACCAAGTCAATATCATGGATGATCCATGGTTTACCATTGAAGGCCGGGAGAAACCGTTGATGGTTGCTTCCGATCTTACTCATGGCT
CAGTCCACTCCCTTCTACAGGAGGATGGTAACTGGGATACAGAAAAGATCCAGAGTAATTTTAACATGGAAGATGCCCAACATATATTACAAATTCCCCGTTTGGGCTCT
CCTATAGATGATGAAATTGTATGGAAATATAATAAACGTGGAATCTTCACAGTAAAGAGTGCGTACCGATTGGGACTTAGACTCCGCTATGCCTTGGAGGCGCCAAGCTC
TAACAAAATTTGA
Protein sequenceShow/hide protein sequence
MRSILSNGKFLKATLGSNPSFTWRSILWGLDLFLKGYRWKIGSGHQVNIMDDPWFTIEGREKPLMVASDLTHGSVHSLLQEDGNWDTEKIQSNFNMEDAQHILQIPRLGS
PIDDEIVWKYNKRGIFTVKSAYRLGLRLRYALEAPSSNKI