; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021005 (gene) of Snake gourd v1 genome

Gene IDTan0021005
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG08:9035014..9037831
RNA-Seq ExpressionTan0021005
SyntenyTan0021005
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4394198.1 hypothetical protein F8388_005832 [Cannabis sativa]6.2e-1529.46Show/hide
Query:  VGSGHQISIREDPWLLAEGWDKPLWV------------------DPNLIDL-----NDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVR
        VG G  ISI  D W+   G+ K  ++                  + NL  L     +D+   IL+IP T   T D   W FT  G ++V SGYH   +  
Subjt:  VGSGHQISIREDPWLLAEGWDKPLWV------------------DPNLIDL-----NDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVR

Query:  AHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPK-----------------------IWSCRNVLLHKQDSLDWERV
         HN+ +SSN++ ++  WK +W+ P+P K+K   W+  HDILPT  N   + I+ +                         IW+CRN  LH   S    +V
Subjt:  AHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPK-----------------------IWSCRNVLLHKQDSLDWERV

Query:  FLNTQLQFQEFTQSEATRIQVPNH
         L+  + F +  QS   +    NH
Subjt:  FLNTQLQFQEFTQSEATRIQVPNH

KAF4404563.1 hypothetical protein G4B88_005949 [Cannabis sativa]6.2e-1529.46Show/hide
Query:  VGSGHQISIREDPWLLAEGWDKPLWV------------------DPNLIDL-----NDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVR
        VG G  ISI  D W+   G+ K  ++                  + NL  L     +D+   IL+IP T   T D   W FT  G ++V SGYH   +  
Subjt:  VGSGHQISIREDPWLLAEGWDKPLWV------------------DPNLIDL-----NDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVR

Query:  AHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPK-----------------------IWSCRNVLLHKQDSLDWERV
         HN+ +SSN++ ++  WK +W+ P+P K+K   W+  HDILPT  N   + I+ +                         IW+CRN  LH   S    +V
Subjt:  AHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPK-----------------------IWSCRNVLLHKQDSLDWERV

Query:  FLNTQLQFQEFTQSEATRIQVPNH
         L+  + F +  QS   +    NH
Subjt:  FLNTQLQFQEFTQSEATRIQVPNH

PNY15111.1 ribonuclease H [Trifolium pratense]2.3e-1431.61Show/hide
Query:  VGSGHQISIREDPWL--------------LAEGWDKPLWVDPNLIDLN----------DDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSV
        +G+G  + IRED WL              L  G      +DP+    N          D+A  ILSIP +  L  D+I+W +   G +SV+S +HL   +
Subjt:  VGSGHQISIREDPWL--------------LAEGWDKPLWVDPNLIDLN----------DDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSV

Query:  RAHN--EASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGI
        + HN  + ++S+      +W+ IW  P+PN+++   W++  +ILPTRAN + KG+
Subjt:  RAHN--EASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGI

VVA38592.1 PREDICTED: reverse mRNAase, partial [Prunus dulcis]5.2e-1438.6Show/hide
Query:  NLIDLNDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVRAHNE-ASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWK
        N + L  D   I+ IP +     D IVW +   G+F+VKS Y + + V + +E  SSS+NS T  +W+ IW+  +P K+KI  W++ HDILPT+AN + K
Subjt:  NLIDLNDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVRAHNE-ASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWK

Query:  GIIMNPKIWSCRNV
        G+ M      C ++
Subjt:  GIIMNPKIWSCRNV

XP_023899813.1 uncharacterized protein LOC112011695 [Quercus suber]1.4e-1442.31Show/hide
Query:  ANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVRAH-NEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPKI
        A+ I SIP +  L TD+++W  T  G F+V+S YHL M+  +  +  SSSNNS  +  W  IWS P+P+KI+   W++ HD LPT++N L + +I     
Subjt:  ANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVRAH-NEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPKI

Query:  WSCR
         SCR
Subjt:  WSCR

TrEMBL top hitse value%identityAlignment
A0A2N9ED81 Uncharacterized protein2.3e-1531.36Show/hide
Query:  VGSGHQISIREDPWLLAEGWDKPLWVDP-----------NLID---------------LNDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGM
        VGSG +I I ED WL     + P  V P           NLI+               L ++A  IL IP +     D  VW  T  G++SV+SGYH+ +
Subjt:  VGSGHQISIREDPWLLAEGWDKPLWVDP-----------NLID---------------LNDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGM

Query:  SVRAHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPKIWSCRNV---LLH-----KQ-----DSLDWERVFLNTQ-L
        S  + +   SS+ S   Q+W +IWS  IP K++   W+  H+ LPTR N  ++ +I +P+  +C  V   +LH     KQ      ++ W    L  Q L
Subjt:  SVRAHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPKIWSCRNV---LLH-----KQ-----DSLDWERVFLNTQ-L

Query:  QFQE-FTQSEAT--RIQVPNHMTTTMEAWESPDEGW
         F E F    AT    ++    TTT   W   +  W
Subjt:  QFQE-FTQSEAT--RIQVPNHMTTTMEAWESPDEGW

A0A2N9IJM4 RNase H domain-containing protein2.3e-1531.36Show/hide
Query:  VGSGHQISIREDPWLLAEGWDKPLWVDP-----------NLID---------------LNDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGM
        VGSG +I I ED WL     + P  V P           NLI+               L ++A  IL IP +     D  VW  T  G++SV+SGYH+ +
Subjt:  VGSGHQISIREDPWLLAEGWDKPLWVDP-----------NLID---------------LNDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGM

Query:  SVRAHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPKIWSCRNV---LLH-----KQ-----DSLDWERVFLNTQ-L
        S  + +   SS+ S   Q+W +IWS  IP K++   W+  H+ LPTR N  ++ +I +P+  +C  V   +LH     KQ      ++ W    L  Q L
Subjt:  SVRAHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPKIWSCRNV---LLH-----KQ-----DSLDWERVFLNTQ-L

Query:  QFQE-FTQSEAT--RIQVPNHMTTTMEAWESPDEGW
         F E F    AT    ++    TTT   W   +  W
Subjt:  QFQE-FTQSEAT--RIQVPNHMTTTMEAWESPDEGW

A0A2N9J7E4 Uncharacterized protein4.6e-1632.12Show/hide
Query:  VGSGHQISIREDPWLLAEGWDKPLWVDPNLIDLNDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVRAHNEASSSNNSLTRQMWKSIWST
        VG+G++I I+   WLL EG  + L     LIDL  DA  IL IP +  +  D+I W     G +SV+SGY L +      +A SS       +WK IW  
Subjt:  VGSGHQISIREDPWLLAEGWDKPLWVDPNLIDLNDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVRAHNEASSSNNSLTRQMWKSIWST

Query:  PIPNKIKICYWKIIHDILPTRANFLWKGIIMNP--------------KIWSCRNVLLHKQDSLDWERVFLNTQLQFQEF---TQSEATRIQVP
         +P KIK   W+  HD LPT +    + ++ NP               +W+ RN   H   S  + +++   Q+  QE+   T  E    Q P
Subjt:  PIPNKIKICYWKIIHDILPTRANFLWKGIIMNP--------------KIWSCRNVLLHKQDSLDWERVFLNTQLQFQEF---TQSEATRIQVP

A0A7J6HHM0 zf-RVT domain-containing protein3.0e-1529.46Show/hide
Query:  VGSGHQISIREDPWLLAEGWDKPLWV------------------DPNLIDL-----NDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVR
        VG G  ISI  D W+   G+ K  ++                  + NL  L     +D+   IL+IP T   T D   W FT  G ++V SGYH   +  
Subjt:  VGSGHQISIREDPWLLAEGWDKPLWV------------------DPNLIDL-----NDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVR

Query:  AHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPK-----------------------IWSCRNVLLHKQDSLDWERV
         HN+ +SSN++ ++  WK +W+ P+P K+K   W+  HDILPT  N   + I+ +                         IW+CRN  LH   S    +V
Subjt:  AHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPK-----------------------IWSCRNVLLHKQDSLDWERV

Query:  FLNTQLQFQEFTQSEATRIQVPNH
         L+  + F +  QS   +    NH
Subjt:  FLNTQLQFQEFTQSEATRIQVPNH

A0A7J6IAE7 zf-RVT domain-containing protein3.0e-1529.46Show/hide
Query:  VGSGHQISIREDPWLLAEGWDKPLWV------------------DPNLIDL-----NDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVR
        VG G  ISI  D W+   G+ K  ++                  + NL  L     +D+   IL+IP T   T D   W FT  G ++V SGYH   +  
Subjt:  VGSGHQISIREDPWLLAEGWDKPLWV------------------DPNLIDL-----NDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVR

Query:  AHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPK-----------------------IWSCRNVLLHKQDSLDWERV
         HN+ +SSN++ ++  WK +W+ P+P K+K   W+  HDILPT  N   + I+ +                         IW+CRN  LH   S    +V
Subjt:  AHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDILPTRANFLWKGIIMNPK-----------------------IWSCRNVLLHKQDSLDWERV

Query:  FLNTQLQFQEFTQSEATRIQVPNH
         L+  + F +  QS   +    NH
Subjt:  FLNTQLQFQEFTQSEATRIQVPNH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein7.3e-0627.27Show/hide
Query:  WVDPNLIDLNDDANH--ILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHL-----GMSVRAHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDI
        W D  +    D ++H  I  I        D+I+W + T G ++V+SGY L       ++ A N    S +  TR     IW+ PI  K+K   W+ +   
Subjt:  WVDPNLIDLNDDANH--ILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHL-----GMSVRAHNEASSSNNSLTRQMWKSIWSTPIPNKIKICYWKIIHDI

Query:  LPTRANFLWKGIIMNPKIWSC
        L T      +G+ ++P    C
Subjt:  LPTRANFLWKGIIMNPKIWSC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATCGTTGGTAGTGGACATCAAATCAGCATCAGGGAGGATCCATGGTTGTTGGCGGAAGGATGGGATAAACCCCTCTGGGTGGATCCAAACTTGATAGACCTGAA
TGACGATGCAAATCACATTCTCTCAATCCCGCGAACTGGAGATCTGACCACTGACGAAATTGTTTGGAAGTTCACCACGAAAGGGGTCTTTTCGGTCAAGAGTGGCTACC
ATTTAGGTATGAGCGTTAGAGCTCATAATGAAGCTTCAAGTTCGAACAACTCCTTAACAAGACAAATGTGGAAGTCAATATGGAGTACGCCTATTCCAAACAAGATCAAG
ATTTGTTACTGGAAGATCATTCACGACATTCTCCCGACTCGAGCTAATTTTTTATGGAAGGGCATCATCATGAACCCAAAGATTTGGTCATGTCGGAATGTTTTGTTACA
TAAACAAGATAGTCTTGACTGGGAAAGGGTGTTCCTTAACACACAACTCCAATTTCAGGAGTTCACTCAGTCTGAGGCAACTAGGATCCAAGTTCCCAACCATATGACAA
CAACGATGGAAGCTTGGGAGTCGCCGGATGAGGGTTGGAGTACAATATACCTCTCTTGGTGGAATCTGACTCTTGAGAGGCTATACGACTTATCAACGGTGTTGACAATA
ATCGAACAGAGACAAGAGACTTTGCGAGGAAGATCAGGCAACGAGCAACTTCTTGGGCCATCATTTCTTTTCGCCACAATAGTCGGGAGACAAATATGGGCGCTCACAAA
CTCGCACAACGAGAAAAACACCTTCAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTATCGTTGGTAGTGGACATCAAATCAGCATCAGGGAGGATCCATGGTTGTTGGCGGAAGGATGGGATAAACCCCTCTGGGTGGATCCAAACTTGATAGACCTGAA
TGACGATGCAAATCACATTCTCTCAATCCCGCGAACTGGAGATCTGACCACTGACGAAATTGTTTGGAAGTTCACCACGAAAGGGGTCTTTTCGGTCAAGAGTGGCTACC
ATTTAGGTATGAGCGTTAGAGCTCATAATGAAGCTTCAAGTTCGAACAACTCCTTAACAAGACAAATGTGGAAGTCAATATGGAGTACGCCTATTCCAAACAAGATCAAG
ATTTGTTACTGGAAGATCATTCACGACATTCTCCCGACTCGAGCTAATTTTTTATGGAAGGGCATCATCATGAACCCAAAGATTTGGTCATGTCGGAATGTTTTGTTACA
TAAACAAGATAGTCTTGACTGGGAAAGGGTGTTCCTTAACACACAACTCCAATTTCAGGAGTTCACTCAGTCTGAGGCAACTAGGATCCAAGTTCCCAACCATATGACAA
CAACGATGGAAGCTTGGGAGTCGCCGGATGAGGGTTGGAGTACAATATACCTCTCTTGGTGGAATCTGACTCTTGAGAGGCTATACGACTTATCAACGGTGTTGACAATA
ATCGAACAGAGACAAGAGACTTTGCGAGGAAGATCAGGCAACGAGCAACTTCTTGGGCCATCATTTCTTTTCGCCACAATAGTCGGGAGACAAATATGGGCGCTCACAAA
CTCGCACAACGAGAAAAACACCTTCAGGTAG
Protein sequenceShow/hide protein sequence
MAIVGSGHQISIREDPWLLAEGWDKPLWVDPNLIDLNDDANHILSIPRTGDLTTDEIVWKFTTKGVFSVKSGYHLGMSVRAHNEASSSNNSLTRQMWKSIWSTPIPNKIK
ICYWKIIHDILPTRANFLWKGIIMNPKIWSCRNVLLHKQDSLDWERVFLNTQLQFQEFTQSEATRIQVPNHMTTTMEAWESPDEGWSTIYLSWWNLTLERLYDLSTVLTI
IEQRQETLRGRSGNEQLLGPSFLFATIVGRQIWALTNSHNEKNTFR