CuGenDBv2

Tan0000657 (gene) of Snake gourd v1 genome

Gene ID: Tan0000657
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG10:19989116..19990988
RNA-Seq Expression: Tan0000657
Synteny: Tan0000657
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR026960 - Reverse transcriptase zinc-binding domain
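
The genome location string in the summary above can be split into its parts programmatically. The following is a minimal sketch, not part of CuGenDB itself: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The source page does not confirm this.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1


seqid, start, end, span = parse_location("LG10:19989116..19990988")
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption, the locus spans 1873 positions on LG10.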


Homology
GenBank top hits | e value | % identity
KAA3459595.1 reverse transcriptase [Gossypium australe] | 6.0e-08 | 28.87
Query:  TWRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQGC------NKPLMVSDP-FKGCSSKDDS--SSLWKVLWNLDLRPKIKICAWKVLKDIIPSKAN
        TWR I   REL      WR+GN  ++ I N PW+   G        K L+   P F   +  +D   +  +K+LW L +  KIKI  W++ ++ +P   N
Subjt:  TWRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQGC------NKPLMVSDP-FKGCSSKDDS--SSLWKVLWNLDLRPKIKICAWKVLKDIIPSKAN

Query:  IIAKGIDTNALSILCRKKEKTTRHLFWEYSSEAIKAITSKQD
        +  + +  +    LCR   + + HL W  S + +++I +  D
Subjt:  IIAKGIDTNALSILCRKKEKTTRHLFWEYSSEAIKAITSKQD

KAF4362436.1 hypothetical protein F8388_012228 [Cannabis sativa] | 2.7e-08 | 25.16
Query:  WRRIMWGRELFYKGYKWRLGNDITIPI--------ENKPW------------------------IARQGCNKPLMVSDPFKGCSSKDDSSSLWKVLWNLD
        W+ I+WGRE+  +  +WR+ N  TI I        E+ PW                        I + G      ++     CS+ D + + WK+ WNL+
Subjt:  WRRIMWGRELFYKGYKWRLGNDITIPI--------ENKPW------------------------IARQGCNKPLMVSDPFKGCSSKDDSSSLWKVLWNLD

Query:  LRPKIKICAWKVLKDIIPSKANIIAKGIDTNALSILCRKKEKTTRHLFWEYSSEAIKAI
        L P++K+  WK+ +  +P+K+N+  +G+  +     C + E++  H  W  + E +K +
Subjt:  LRPKIKICAWKVLKDIIPSKANIIAKGIDTNALSILCRKKEKTTRHLFWEYSSEAIKAI

KAF7828452.1 reverse transcriptase [Senna tora] | 2.4e-09 | 29.63
Query:  WRRIMWGRELFYKGYKWRLGNDITIPIENKPWIA-------RQGCNKPL------MVSDPFKG----CSSKDDSSSLWKVLWNLDLRPKIKICAWKVLKD
        WR ++ GR+   K   W++GN  +I +    W+A       R   N  L        SD F G     SS  + + LW+V+WN   +PK+K+  WKV K+
Subjt:  WRRIMWGRELFYKGYKWRLGNDITIPIENKPWIA-------RQGCNKPL------MVSDPFKG----CSSKDDSSSLWKVLWNLDLRPKIKICAWKVLKD

Query:  IIPSKANIIAKGIDTNALSILCRKKEKTTRHLFWE
         IP++AN+  + ++ +   + C    ++T H   E
Subjt:  IIPSKANIIAKGIDTNALSILCRKKEKTTRHLFWE

MCH88735.1 RNA-directed DNA polymerase (Reverse transcriptase) [Trifolium medium] | 2.1e-08 | 25.86
Query:  WRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQGCNKPLMVSDPFKGCSSKDDSSSLWKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAKGIDTNA
        WR I   R +  +GY+WR+G+   IPI + PW+  +         +P     + +     W  LW L + PK+K   W++ +D +P++  +++KG++  +
Subjt:  WRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQGCNKPLMVSDPFKGCSSKDDSSSLWKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAKGIDTNA

Query:  LSILCRKKEKTTRHLF
          ++C++  +   H F
Subjt:  LSILCRKKEKTTRHLF

XP_021737769.1 uncharacterized protein LOC110704286 [Chenopodium quinoa] | 1.9e-09 | 25
Query:  MTWRRIMWGRELFYKGYKWRLG--NDITIPIENKPWIARQGCNKPLMVSDPFKGCSSKDDSSSL--WKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAK
        +T+ R +W         +W L   N I    E+  +  R   N  L+  +  +  +++ +S     W+V+W+ ++  K K+ AW+ +K+++ ++ N+  +
Subjt:  MTWRRIMWGRELFYKGYKWRLG--NDITIPIENKPWIARQGCNKPLMVSDPFKGCSSKDDSSSL--WKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAK

Query:  GIDTNALSILCRKKEKTTRHLFWEYSSEAIKAITSKQDDLSEIF-LVAEDIANLLPRCSSMTFEKCPRSSNNVAHTLARI
        GI+ + L  +C +  ++T H+      E  K I  +   L  I+  +  DI  L  +C S +F    R  N VAH LA++
Subjt:  GIDTNALSILCRKKEKTTRHLFWEYSSEAIKAITSKQDDLSEIF-LVAEDIANLLPRCSSMTFEKCPRSSNNVAHTLARI

TrEMBL top hits | e value | % identity
A0A392MNQ5 RNA-directed DNA polymerase (Reverse transcriptase) (Fragment) | 1.0e-08 | 25.86
Query:  WRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQGCNKPLMVSDPFKGCSSKDDSSSLWKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAKGIDTNA
        WR I   R +  +GY+WR+G+   IPI + PW+  +         +P     + +     W  LW L + PK+K   W++ +D +P++  +++KG++  +
Subjt:  WRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQGCNKPLMVSDPFKGCSSKDDSSSLWKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAKGIDTNA

Query:  LSILCRKKEKTTRHLF
          ++C++  +   H F
Subjt:  LSILCRKKEKTTRHLF

A0A7J6EVU5 Uncharacterized protein | 1.3e-08 | 25.16
Query:  WRRIMWGRELFYKGYKWRLGNDITIPI--------ENKPW------------------------IARQGCNKPLMVSDPFKGCSSKDDSSSLWKVLWNLD
        W+ I+WGRE+  +  +WR+ N  TI I        E+ PW                        I + G      ++     CS+ D + + WK+ WNL+
Subjt:  WRRIMWGRELFYKGYKWRLGNDITIPI--------ENKPW------------------------IARQGCNKPLMVSDPFKGCSSKDDSSSLWKVLWNLD

Query:  LRPKIKICAWKVLKDIIPSKANIIAKGIDTNALSILCRKKEKTTRHLFWEYSSEAIKAI
        L P++K+  WK+ +  +P+K+N+  +G+  +     C + E++  H  W  + E +K +
Subjt:  LRPKIKICAWKVLKDIIPSKANIIAKGIDTNALSILCRKKEKTTRHLFWEYSSEAIKAI

A0A803NKW3 Uncharacterized protein | 3.4e-09 | 25.61
Query:  MTWRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQGCNKPLM---------VSDPFKGCSSKDDSSS-----LWKVLWNLDLRPKIKICAWKVLKDI
        +TWR I+WG+EL  KG +W++G+   I     PWI      KPL+         V+D       + ++S+      WK  W+L +  K+ I  W+ ++D 
Subjt:  MTWRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQGCNKPLM---------VSDPFKGCSSKDDSSS-----LWKVLWNLDLRPKIKICAWKVLKDI

Query:  IPSKANIIAKGIDTNALSILCRKKEKTTRHL---------FWEYSSEAIKAITSKQDDLSEIFL
        +P    +  + I       LC ++++T  H          FW  S+ A++ +      L  I +
Subjt:  IPSKANIIAKGIDTNALSILCRKKEKTTRHL---------FWEYSSEAIKAITSKQDDLSEIFL

A0A803PEK8 Uncharacterized protein | 1.5e-09 | 25.22
Query:  WRRIMWGRELFYKGYKWRLGNDITIPIENKPWIAR----QGCNKP---------------------------------LMVSDPFKGCSSKDD-------
        WR +MWG+++   GY+WR+GN  T+ +   PW++R    +  +KP                                 L++S P  G   +D        
Subjt:  WRRIMWGRELFYKGYKWRLGNDITIPIENKPWIAR----QGCNKP---------------------------------LMVSDPFKGCSSKDD-------

Query:  ------------SSSL---------------WKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAKGIDTNALSILCRKKE----KTTRHLFWEYS-SEAI
                    +SSL               WK LW+L + PKIK   WK+  + IP+ AN+  +G+   AL  +C +      +TT H  WE   S+ +
Subjt:  ------------SSSL---------------WKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAKGIDTNALSILCRKKE----KTTRHLFWEYS-SEAI

Query:  KAITSKQDDLSEIFLVAEDIANLLPRCSSM
         A++  +DD+ +I    ED+ + L R + +
Subjt:  KAITSKQDDLSEIFLVAEDIANLLPRCSSM

A0A803PHH5 Uncharacterized protein | 1.7e-08 | 22.22
Query:  WRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQG---CNKPL----------------------------------------------MVSDPF---
        W+ I+WGRE+ Y+G +WR+GN  TI +    W+ R      N+P+                                              M+  PF   
Subjt:  WRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQG---CNKPL----------------------------------------------MVSDPF---

Query:  -------------------KGCSSKDDSSSLWKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAKGIDTNALSILCRKKEKTTRHLFWEYSSEAIKAI
                              S+ D + + WK+ W+L+L P++K+  WK+ ++ +P K N++ +G+  N +   C + E+T  H  W  + E +K +
Subjt:  -------------------KGCSSKDDSSSLWKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAKGIDTNALSILCRKKEKTTRHLFWEYSSEAIKAI

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGACTTGGCGGAGAATCATGTGGGGCAGAGAATTATTCTACAAAGGCTACAAATGGAGGCTGGGAAATGACATAACTATCCCAATTGAGAACAAACCTTGGATAGCAAG
GCAGGGCTGCAACAAACCTCTGATGGTAAGTGATCCGTTTAAAGGGTGCTCGAGTAAAGATGACTCCTCCTCCCTCTGGAAAGTTCTATGGAATCTTGATTTGCGCCCTA
AGATCAAGATCTGTGCCTGGAAAGTCCTTAAGGATATCATCCCTTCGAAAGCTAATATAATTGCCAAAGGAATTGATACTAATGCTCTAAGTATTTTGTGCAGGAAAAAA
GAAAAGACGACTAGACATCTATTCTGGGAGTACTCTTCAGAGGCCATCAAAGCCATTACATCCAAACAAGATGATCTCTCGGAGATTTTTCTAGTAGCAGAGGACATCGC
CAACCTCTTGCCTAGGTGTTCTTCGATGACCTTTGAGAAATGCCCAAGATCTAGCAACAATGTTGCTCACACTCTCGCCAGGATAGCCATGGACCCTCAGTTTCGCCATC
CAAGCAGGGAGGAACAGAGGGATTTCGAGGTTGGGGAGGAAGTTTTAATGACCTCTGGTTTTCCTGTGTGGTTTGTTAACCTCATTATTGAGGATGCTGATGTTTAA
mRNA sequence
GAAAAAAATTGAAATTGAAATTCTTATACTAGTATCATTAGTTCATTACATTATTAGAAAACAATTGAAATACAAAACAGTTCAATACTTAGCAGGAATGAAATTATGTC
CTAAATTCAAAAGAAACCAACCAAAAGATATCACAAACTTACAGAATGATATGAATAAGGTGATTTTTCTCGACGCCCAAAAAGTTTTACCCTTATACAGAATTCAATTG
GACCCCATCCCTCTAATTAAAAACAATTCCCAAGAGGCGGATTCTAGAAAAGCCGAGGGTGGATTGGGGTTCAAGGAGCTCGAGCTGTTTAACCAAGCAATGCTAGCTAA
ACAAAGCTAGCAGATTCTCAATCAACCTGAGAGCCTCCTCTCCAAAGTCCTACGTGGTTGTTATTTCAGGGCAGGTGATTTCTTATCGGCTCCCATAGGCAGGAAACCCT
CTATGACTTGGCGGAGAATCATGTGGGGCAGAGAATTATTCTACAAAGGCTACAAATGGAGGCTGGGAAATGACATAACTATCCCAATTGAGAACAAACCTTGGATAGCA
AGGCAGGGCTGCAACAAACCTCTGATGGTAAGTGATCCGTTTAAAGGGTGCTCGAGTAAAGATGACTCCTCCTCCCTCTGGAAAGTTCTATGGAATCTTGATTTGCGCCC
TAAGATCAAGATCTGTGCCTGGAAAGTCCTTAAGGATATCATCCCTTCGAAAGCTAATATAATTGCCAAAGGAATTGATACTAATGCTCTAAGTATTTTGTGCAGGAAAA
AAGAAAAGACGACTAGACATCTATTCTGGGAGTACTCTTCAGAGGCCATCAAAGCCATTACATCCAAACAAGATGATCTCTCGGAGATTTTTCTAGTAGCAGAGGACATC
GCCAACCTCTTGCCTAGGTGTTCTTCGATGACCTTTGAGAAATGCCCAAGATCTAGCAACAATGTTGCTCACACTCTCGCCAGGATAGCCATGGACCCTCAGTTTCGCCA
TCCAAGCAGGGAGGAACAGAGGGATTTCGAGGTTGGGGAGGAAGTTTTAATGACCTCTGGTTTTCCTGTGTGGTTTGTTAACCTCATTATTGAGGATGCTGATGTTTAA
Protein sequence
MTWRRIMWGRELFYKGYKWRLGNDITIPIENKPWIARQGCNKPLMVSDPFKGCSSKDDSSSLWKVLWNLDLRPKIKICAWKVLKDIIPSKANIIAKGIDTNALSILCRKK
EKTTRHLFWEYSSEAIKAITSKQDDLSEIFLVAEDIANLLPRCSSMTFEKCPRSSNNVAHTLARIAMDPQFRHPSREEQRDFEVGEEVLMTSGFPVWFVNLIIEDADV
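
The CDS and protein sequences above are related through the standard genetic code. As a minimal sketch of that relationship (the `translate` helper is hypothetical and not a CuGenDB tool, and it assumes the standard nuclear code applies to this gene), the first 30 nucleotides of the CDS can be translated and compared with the start of the protein sequence:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    as several lines can be passed directly. Codons containing
    characters outside T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
print(translate("ATGACTTGGCGGAGAATCATGTGGGGCAGA"))  # prints MTWRRIMWGR
```

The result matches the first ten residues of the listed protein sequence. Passing the full CDS, with its line breaks joined, should give the complete protein.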