CuGenDBv2

Tan0007566 (gene) of Snake gourd v1 genome

Gene ID: Tan0007566
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG01:12053135..12053804
RNA-Seq Expression: Tan0007566
Synteny: Tan0007566
Gene Ontology terms:
GO:0006139 - nucleobase-containing compound metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
InterPro domains: IPR002156 - Ribonuclease H domain
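
The genome location field above packs the sequence ID and both coordinates into one string. The following is a minimal sketch of how it could be split and how the genomic span could be computed. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered by the interval.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return abs(end - start) + 1


seqid, start, end = parse_location("LG01:12053135..12053804")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 670 bases on LG01.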


Homology
GenBank top hits (e value, % identity, alignment)
OMO52016.1 reverse transcriptase [Corchorus capsularis] | e value: 1.3e-10 | % identity: 25.67
Query:  MDVYSYCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLE--AMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQ------------
        +DV   C VC K+PE+V H  F CKF+   W D +   + +T   ++    W +    K  +I +L+   +  W IWNN+N  ++++             
Subjt:  MDVYSYCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLE--AMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQ------------

Query:  -------ALSRQDDNCVLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVLMLNGEKEVLTQVG
               + S +    V++RD+ G +++     +++   SL+A+V+AL  G  LA + GI      +DSL  +  +N +  +  + G
Subjt:  -------ALSRQDDNCVLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVLMLNGEKEVLTQVG

XP_030926547.1 uncharacterized protein LOC115953156 [Quercus lobata] | e value: 2.5e-09 | % identity: 25.12
Query:  CPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQALSR-----------QDDNC
        CPVC    E+V HAL  C F++  W   L   L       S  D+ L  +   T++ LE  F   W+IW+N+NN++HK   LS            ++  C
Subjt:  CPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQALSR-----------QDDNC

Query:  ------------------------------------------VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVL
                                                  V++RD+NG ++ A+   +  Y S+  ++V AL  G+  A ++ +P + V +D+LT++ 
Subjt:  ------------------------------------------VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVL

Query:  MLN
         +N
Subjt:  MLN

XP_030936552.1 uncharacterized protein LOC115961769 [Quercus lobata] | e value: 2.7e-08 | % identity: 24.63
Query:  CPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQALSR-----------QDDNC
        CPVC     +V HAL TC F++  W   L   L       S  D+ L  +   T++ LE  F   W+IW+N+N++VHK   LS            ++  C
Subjt:  CPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQALSR-----------QDDNC

Query:  ------------------------------------------VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVL
                                                  V++RD NG ++ A+   +  Y S+  ++V AL  G+  A ++ +P + V  D+L ++ 
Subjt:  ------------------------------------------VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVL

Query:  MLN
         +N
Subjt:  MLN

XP_030943489.1 uncharacterized protein LOC115968280 [Quercus lobata] | e value: 1.3e-10 | % identity: 24.62
Query:  CPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHK----------------------
        CP C K PE++ H+L  C+FA + W       + ++S  +   D  L+ ++  T   LE  FV  WSIW N+N +VH+                      
Subjt:  CPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHK----------------------

Query:  RQALSRQDDNC-------------------------------VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLV
          + S QD  C                               V++RD+ G ++ A  NY+S   S+   +  A+  GI LA ++G+  + + +D+L+++
Subjt:  RQALSRQDDNC-------------------------------VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLV

XP_042950031.1 uncharacterized protein LOC122282138 [Carya illinoinensis] | e value: 1.2e-08 | % identity: 29.48
Query:  CPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHK----------RQALSRQDDNC-
        C +C    E   HALF C    + W D  P+ LG      S  D    A    + + L    V  W +WN +N  +++            ALS Q D   
Subjt:  CPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHK----------RQALSRQDDNC-

Query:  -----VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVLMLNGEKEVLT
             V++RD NG ++V +        S+ + +V AL  G++L AQ G+P + + ++SL LV  LN   + LT
Subjt:  -----VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVLMLNGEKEVLT

TrEMBL top hits (e value, % identity, alignment)
A0A1R3G1P9 Reverse transcriptase | e value: 6.3e-11 | % identity: 25.67
Query:  MDVYSYCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLE--AMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQ------------
        +DV   C VC K+PE+V H  F CKF+   W D +   + +T   ++    W +    K  +I +L+   +  W IWNN+N  ++++             
Subjt:  MDVYSYCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLE--AMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQ------------

Query:  -------ALSRQDDNCVLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVLMLNGEKEVLTQVG
               + S +    V++RD+ G +++     +++   SL+A+V+AL  G  LA + GI      +DSL  +  +N +  +  + G
Subjt:  -------ALSRQDDNCVLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVLMLNGEKEVLTQVG

A0A1R3GIN3 Reverse transcriptase | e value: 1.6e-06 | % identity: 24.66
Query:  MDVYSYCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAM-KTLTIEKLEATFVGTWSIWNNQN-------------------N
        + + S C VC     +V H  F C F+   W    P + G  S   S  D WL  + K   +  LE      W IWNN+N                   N
Subjt:  MDVYSYCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAM-KTLTIEKLEATFVGTWSIWNNQN-------------------N

Query:  LVHKRQALSRQDDNC-----------------------------------VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVF
        LV   +A +R+ +                                     V++RD+ G +L      V++ L SL+A+V A+  G+ +A   G+ +V   
Subjt:  LVHKRQALSRQDDNC-----------------------------------VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVF

Query:  TDSLTLVLMLNGEKEVLTQVGTI
        +DSL  +L LN     L + G++
Subjt:  TDSLTLVLMLNGEKEVLTQVGTI

A0A6J1CTE3 uncharacterized protein LOC111014578 | e value: 1.5e-07 | % identity: 38.14
Query:  CPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQ---ALSRQDDNCVLVRD
        C VC K+ ET  HALF CK A E W  +LP+         S+QD  L  +++L+    +   VG W+IWN++N +  +RQ   A  R D     VRD
Subjt:  CPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQ---ALSRQDDNCVLVRD

A0A7N2L6Z9 Reverse transcriptase domain-containing protein | e value: 1.2e-06 | % identity: 21.95
Query:  YCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQALS----------------
        +CPVC ++ E + H L TC FA   W       LG+   +  I+   L  +       L   F  +W+IW+N+N  VH    LS                
Subjt:  YCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQALS----------------

Query:  ---------------------------RQDDNC-----------VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTL
                                     D  C           V++RD +G ++ A+   +  Y  + W ++ A+  G+ LA ++ +P + + +D+L+ 
Subjt:  ---------------------------RQDDNC-----------VLVRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTL

Query:  VLMLN
        +L +N
Subjt:  VLMLN

A0A803NZB5 Uncharacterized protein | e value: 1.7e-08 | % identity: 27.1
Query:  SYCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQALSRQDDNC-------VL
        S C +C++  E+VGHALF C++A   W++         + T    D        L+  ++E  F   W+IWN +N +VH ++A     D+         +
Subjt:  SYCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQALSRQDDNC-------VL

Query:  VRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLV
        +RD+NG ++ A+   +     S   +  AL   +  A Q  +P   V +D+L +V
Subjt:  VRDANGALLVAMENYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G09510.1 Ribonuclease H-like superfamily protein | e value: 9.6e-04 | % identity: 27.78
Query:  CPVCRKKPETVGHALFTCKFAAEFWK----DMLPEML-------GVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKR
        CP C ++ E++ HALFTC FA   W+     ++   L        +++    +QDT +         KL   ++  W IW  +NN+V  +
Subjt:  CPVCRKKPETVGHALFTCKFAAEFWK----DMLPEML-------GVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKR


Sequences
CDS sequence
ATGGATGTTTATTCCTATTGCCCTGTATGCAGGAAGAAGCCTGAGACAGTAGGTCATGCTTTATTTACGTGTAAGTTTGCAGCAGAGTTCTGGAAGGATATGTTGCCAGA
AATGTTGGGTGTCACCTCCCGGACTTTCTCTATTCAGGACACTTGGCTTGAAGCTATGAAGACGCTGACAATCGAGAAGCTTGAAGCGACATTTGTTGGGACCTGGTCTA
TCTGGAACAATCAGAATAATTTAGTGCACAAAAGGCAAGCTTTAAGTAGGCAGGATGACAACTGTGTCCTTGTACGCGATGCCAATGGGGCTCTATTGGTTGCTATGGAG
AATTATGTTTCATATTACTTATCTTCTTTATGGGCTAAGGTGAATGCATTGCGTGATGGTATACGACTGGCTGCCCAAGTGGGTATTCCAAATGTTCATGTCTTTACAGA
TTCATTGACTTTGGTTTTAATGTTGAATGGTGAGAAGGAGGTATTGACCCAAGTTGGGACGATATAA
mRNA sequence
ATGGATGTTTATTCCTATTGCCCTGTATGCAGGAAGAAGCCTGAGACAGTAGGTCATGCTTTATTTACGTGTAAGTTTGCAGCAGAGTTCTGGAAGGATATGTTGCCAGA
AATGTTGGGTGTCACCTCCCGGACTTTCTCTATTCAGGACACTTGGCTTGAAGCTATGAAGACGCTGACAATCGAGAAGCTTGAAGCGACATTTGTTGGGACCTGGTCTA
TCTGGAACAATCAGAATAATTTAGTGCACAAAAGGCAAGCTTTAAGTAGGCAGGATGACAACTGTGTCCTTGTACGCGATGCCAATGGGGCTCTATTGGTTGCTATGGAG
AATTATGTTTCATATTACTTATCTTCTTTATGGGCTAAGGTGAATGCATTGCGTGATGGTATACGACTGGCTGCCCAAGTGGGTATTCCAAATGTTCATGTCTTTACAGA
TTCATTGACTTTGGTTTTAATGTTGAATGGTGAGAAGGAGGTATTGACCCAAGTTGGGACGATATAA
Protein sequence
MDVYSYCPVCRKKPETVGHALFTCKFAAEFWKDMLPEMLGVTSRTFSIQDTWLEAMKTLTIEKLEATFVGTWSIWNNQNNLVHKRQALSRQDDNCVLVRDANGALLVAME
NYVSYYLSSLWAKVNALRDGIRLAAQVGIPNVHVFTDSLTLVLMLNGEKEVLTQVGTI
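
The protein sequence above should follow from the CDS by codon-wise translation. The following is a minimal sketch of that check, assuming the standard genetic code (the record does not say which translation table was used). The `translate` function and the constant names are hypothetical, and only the first 30 and last 15 nucleotides of the CDS are used as spot checks.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record was translated with this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Spot checks against the start and end of the CDS listed in this record.
print(translate("ATGGATGTTTATTCCTATTGCCCTGTATGC"))  # start of the CDS
print(translate("GTTGGGACGATATAA"))                 # end of the CDS, incl. stop
```

The two excerpts translate to `MDVYSYCPVC` and `VGTI`, matching the start and end of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole protein.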