; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS026760 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS026760
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold468:127730..131076
RNA-Seq ExpressionMS026760
SyntenyMS026760
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7845416.1 uncharacterized protein G2W53_002321 [Senna tora]1.5e-0924.63Show/hide
Query:  LPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQF---AHLRHS---------------------WPVSSMSFLLSNLQGRVY-----
        +PT E L  R ++I   C  C D  EDV H    C  +Q  W+ ++F   + L H+                     WP   +  ++  +Q   +     
Subjt:  LPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQF---AHLRHS---------------------WPVSSMSFLLSNLQGRVY-----

Query:  --------WRGSFGPTGKLGETLSATFPISDSGSFSCWV-----FVYEAEFVQMVAPHRWLGEEALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPV
                W     P  KL   + AT      GS    V         A    ++ P+  +  EA+AIK GL LA ++G   +M E+++     +L  P 
Subjt:  --------WRGSFGPTGKLGETLSATFPISDSGSFSCWV-----FVYEAEFVQMVAPHRWLGEEALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPV

Query:  DDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLIISLIIVDEQ
        D  S + A+  +I + C+S + + F++  R  N+ A  ++R A     N+VW +  PL +S +   +Q
Subjt:  DDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLIISLIIVDEQ

PRQ31856.1 putative ribonuclease H-like domain-containing protein [Rosa chinensis]1.3e-0830.38Show/hide
Query:  PTGKLGETLSATFPISDS-GSFSCWVFVYEAEFVQMVA-----PHRWLGEEALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNI
        P+G+L   ++  F ++D  G     V  +E + +  +A      H  L  EA A + GL L I+ G   I  E++S      L    ++LSEV  +L + 
Subjt:  PTGKLGETLSATFPISDS-GSFSCWVFVYEAEFVQMVA-----PHRWLGEEALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNI

Query:  HQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLIISLIIVDEQTFC
         +   S RS+R   T+RE N  A +LA LA    ++ VW++E P II  ++ ++   C
Subjt:  HQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLIISLIIVDEQTFC

XP_022135942.1 uncharacterized protein LOC111007775 [Momordica charantia]1.0e-0842.06Show/hide
Query:  EALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSI--RFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLIIS
        E LA +EG+ LAI  G+ P   ET+S   F LL    +D SE+G L S+I +  +SS  I   FSF +REGN+ AH LAR+ +      VWVEE    +S
Subjt:  EALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSI--RFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLIIS

Query:  LIIVDEQ
         +I  ++
Subjt:  LIIVDEQ

XP_022140628.1 uncharacterized protein LOC111011237 [Momordica charantia]3.1e-1042.11Show/hide
Query:  EALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPL
        EA+   EGL LA  +GV P++ ET+S   F L  QP +DLSE G ++         S    F+F  REGN AAH LAR A+      +W+E+ PL
Subjt:  EALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPL

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]3.3e-1226.24Show/hide
Query:  NRLPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQFAH------LRHSW-PVSSMSF---------LLSNLQGRVY-----------
        +RLPT   L  R + I N CYFCG   ED +HLFW C   ++ W+NS+F        LR S   +S   F         L +    R +           
Subjt:  NRLPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQFAH------LRHSW-PVSSMSF---------LLSNLQGRVY-----------

Query:  -----WRGSFGP----------TGKLGETLSATFPISDSGSF---------------SCWVFVYEAEFVQMVAPHRWLGE-------EALAIKEGLALAI
             W   +            TG++  T    +   D G +                  + ++      M A  ++L         EA+A  EGL LA 
Subjt:  -----WRGSFGP----------TGKLGETLSATFPISDSGSF---------------SCWVFVYEAEFVQMVAPHRWLGE-------EALAIKEGLALAI

Query:  NMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPL
         +G+ P +                +DLSE G ++         S    F+F  REGN AAH LAR A+  H   +W+E+ PL
Subjt:  NMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPL

TrEMBL top hitse value%identityAlignment
A0A2N9HEC7 Reverse transcriptase domain-containing protein2.2e-0924.7Show/hide
Query:  RNRLPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQFAHLRHSWPVSSMSFLLSNLQGR----VYWRGSFGPTGKLGETLSATF---
        R  LPT   L  R++ +   C  CG+  ED +H  W C  +QS W N +   LR +  V S++ +   LQ +     Y R +     K    L+  +   
Subjt:  RNRLPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQFAHLRHSWPVSSMSFLLSNLQGR----VYWRGSFGPTGKLGETLSATF---

Query:  ------------PISDSGSFSCWVFVYEAEFVQMVA-------PHRWLGEEALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNI
                       ++ +    V V ++  + M +       PH     EA A+K  +   + +G+     E +S      L+ P   L+  G L+++ 
Subjt:  ------------PISDSGSFSCWVFVYEAEFVQMVA-------PHRWLGEEALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNI

Query:  HQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLIISLII
               +S  FS   R+GN  AH LAR A++ +   VW+E  P  ++  I
Subjt:  HQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLIISLII

A0A6J1C467 uncharacterized protein LOC1110077754.9e-0942.06Show/hide
Query:  EALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSI--RFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLIIS
        E LA +EG+ LAI  G+ P   ET+S   F LL    +D SE+G L S+I +  +SS  I   FSF +REGN+ AH LAR+ +      VWVEE    +S
Subjt:  EALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSI--RFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLIIS

Query:  LIIVDEQ
         +I  ++
Subjt:  LIIVDEQ

A0A6J1CIF1 uncharacterized protein LOC1110112371.5e-1042.11Show/hide
Query:  EALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPL
        EA+   EGL LA  +GV P++ ET+S   F L  QP +DLSE G ++         S    F+F  REGN AAH LAR A+      +W+E+ PL
Subjt:  EALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPL

A0A6J1DAR4 uncharacterized protein LOC1110189541.6e-1226.24Show/hide
Query:  NRLPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQFAH------LRHSW-PVSSMSF---------LLSNLQGRVY-----------
        +RLPT   L  R + I N CYFCG   ED +HLFW C   ++ W+NS+F        LR S   +S   F         L +    R +           
Subjt:  NRLPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQFAH------LRHSW-PVSSMSF---------LLSNLQGRVY-----------

Query:  -----WRGSFGP----------TGKLGETLSATFPISDSGSF---------------SCWVFVYEAEFVQMVAPHRWLGE-------EALAIKEGLALAI
             W   +            TG++  T    +   D G +                  + ++      M A  ++L         EA+A  EGL LA 
Subjt:  -----WRGSFGP----------TGKLGETLSATFPISDSGSF---------------SCWVFVYEAEFVQMVAPHRWLGE-------EALAIKEGLALAI

Query:  NMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPL
         +G+ P +                +DLSE G ++         S    F+F  REGN AAH LAR A+  H   +W+E+ PL
Subjt:  NMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPL

M5VU98 Reverse transcriptase domain-containing protein1.5e-1026.12Show/hide
Query:  LPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQFAHLRHSWPVSSMSFLLSNLQGRVY-------------------WRGSFGPTGK
        LPT   L+ + + + + C FCGD++E  LH+   C    + W  S      H     S   ++   Q  V+                    R +  P+G+
Subjt:  LPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQFAHLRHSWPVSSMSFLLSNLQGRVY-------------------WRGSFGPTGK

Query:  LGETLSATF-PISDSGSFSCWVFVYEAEFVQMVAPHRWLGE-------EALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQ
        L       F P S  G+        +  FV  VA  + +GE       E LA +EG+ALA+++G    + E +S      + +   D S +G ++ ++  
Subjt:  LGETLSATF-PISDSGSFSCWVFVYEAEFVQMVAPHRWLGE-------EALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQ

Query:  RCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLII
              S  F FT RE N  AH+LAR  +    N +W E  P +I
Subjt:  RCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGPLII

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACCGTAATCGGCTCCCCACGTGTGAGGCCCTTGTCTGGAGGCGGTTGGCTATTCCGAACTTTTGTTACTTTTGTGGAGATGTCAGCGAGGACGTCCTTCATCTCTT
TTGGTTTTGTGATGTTGTGCAGTCGTTCTGGTTGAACTCGCAGTTTGCTCACCTTCGACACTCTTGGCCTGTCTCGTCTATGTCCTTCTTACTTTCTAATTTGCAGGGAA
GAGTCTATTGGCGTGGGTCCTTTGGGCCTACTGGGAAGTTGGGCGAGACACTATCTGCAACATTTCCGATCAGTGACAGCGGTTCATTCTCCTGCTGGGTCTTTGTGTAC
GAGGCGGAGTTTGTGCAGATGGTAGCCCCCCACAGATGGTTGGGTGAAGAGGCTTTGGCGATCAAGGAGGGGTTGGCGTTGGCAATTAATATGGGTGTTCGCCCGATTAT
GCCGGAAACTAATTCCTTACACTATTTTCGGCTATTGGATCAACCTGTGGATGATTTATCAGAGGTTGGTGCTCTCCTCTCAAATATTCATCAACGGTGTATTTCCTCGC
GCTCCATTAGATTCAGCTTCACCCATCGGGAAGGAAATAACGCTGCGCACCAGTTGGCTCGTCTTGCTATAACTAGACATTTGAATATAGTTTGGGTTGAGGAAGGACCT
TTAATCATTTCACTTATTATTGTTGATGAACAAACTTTCTGTGATACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTACCGTAATCGGCTCCCCACGTGTGAGGCCCTTGTCTGGAGGCGGTTGGCTATTCCGAACTTTTGTTACTTTTGTGGAGATGTCAGCGAGGACGTCCTTCATCTCTT
TTGGTTTTGTGATGTTGTGCAGTCGTTCTGGTTGAACTCGCAGTTTGCTCACCTTCGACACTCTTGGCCTGTCTCGTCTATGTCCTTCTTACTTTCTAATTTGCAGGGAA
GAGTCTATTGGCGTGGGTCCTTTGGGCCTACTGGGAAGTTGGGCGAGACACTATCTGCAACATTTCCGATCAGTGACAGCGGTTCATTCTCCTGCTGGGTCTTTGTGTAC
GAGGCGGAGTTTGTGCAGATGGTAGCCCCCCACAGATGGTTGGGTGAAGAGGCTTTGGCGATCAAGGAGGGGTTGGCGTTGGCAATTAATATGGGTGTTCGCCCGATTAT
GCCGGAAACTAATTCCTTACACTATTTTCGGCTATTGGATCAACCTGTGGATGATTTATCAGAGGTTGGTGCTCTCCTCTCAAATATTCATCAACGGTGTATTTCCTCGC
GCTCCATTAGATTCAGCTTCACCCATCGGGAAGGAAATAACGCTGCGCACCAGTTGGCTCGTCTTGCTATAACTAGACATTTGAATATAGTTTGGGTTGAGGAAGGACCT
TTAATCATTTCACTTATTATTGTTGATGAACAAACTTTCTGTGATACTTAA
Protein sequenceShow/hide protein sequence
MYRNRLPTCEALVWRRLAIPNFCYFCGDVSEDVLHLFWFCDVVQSFWLNSQFAHLRHSWPVSSMSFLLSNLQGRVYWRGSFGPTGKLGETLSATFPISDSGSFSCWVFVY
EAEFVQMVAPHRWLGEEALAIKEGLALAINMGVRPIMPETNSLHYFRLLDQPVDDLSEVGALLSNIHQRCISSRSIRFSFTHREGNNAAHQLARLAITRHLNIVWVEEGP
LIISLIIVDEQTFCDT