CuGenDBv2

MS027567 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS027567
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold238:83039..86425
RNA-Seq Expression: MS027567
Synteny: MS027567
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
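
The genome location field above packs a sequence name and a coordinate range into one string. The following is a minimal Python sketch of how such a string could be split and its span computed. The names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tooling, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'scaffold238:83039..86425'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(seqid, start, end)


loc = parse_location("scaffold238:83039..86425")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span covers 3387 positions on scaffold238.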


Homology
GenBank top hits (e value, % identity, alignment)
XP_010474595.1 PREDICTED: uncharacterized protein LOC104754152 [Camelina sativa] | e value: 6.6e-17 | % identity: 38.13
Query:  EEAWTSDSICREI-----IYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRI
        EE  T  +I R+      +  ADDSL FCKA + E   V   ++DY +A+ Q++N +KS + F   V  + +  ++S++G+S    +G+YLGIP SL   
Subjt:  EEAWTSDSICREI-----IYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRI

Query:  KNRDFARIKDKLWKVVQGWKANLFSVGGKEMSPQLVEIA
        KN+ F+ +KD+L   V GW + L S GGKE+  + V +A
Subjt:  KNRDFARIKDKLWKVVQGWKANLFSVGGKEMSPQLVEIA

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa] | e value: 6.6e-17 | % identity: 41.59
Query:  IYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQG
        ++ ADDSL+FC+A+      +KR L  Y +AS Q++N DKSV+ FSPN  +  +   Q I+GM +      YLG+P+   R K++ F+ IK+K+WK++  
Subjt:  IYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQG

Query:  WKANLFSVGGKEM
        W   +FS+GGKE+
Subjt:  WKANLFSVGGKEM

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa] | e value: 5.1e-17 | % identity: 43.64
Query:  ADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQGWKA
        ADDSL+FC+A+      +KR L  Y +AS Q++N DKSV+ FSPN  M  +   Q I+GM +      YLG+P+   R K++ F+ IK+K+WK++  W  
Subjt:  ADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQGWKA

Query:  NLFSVGGKEM
         +FS+GGKE+
Subjt:  NLFSVGGKEM

XP_030942886.1 uncharacterized protein LOC115967872 [Quercus lobata] | e value: 2.3e-17 | % identity: 46.77
Query:  SICR-----EIIYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFAR
        SICR       ++ ADDSL+FCKA+  E   +K  L  Y+  S QKIN DKS ++FSPN   + +  V SI+G    SS   YLG+PS + R K   FA 
Subjt:  SICR-----EIIYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFAR

Query:  IKDKLWKVVQGWKANLFSVGGKEM
        IKDK+ K + GWK+ L S+GGKE+
Subjt:  IKDKLWKVVQGWKANLFSVGGKEM

XP_030945984.1 uncharacterized protein LOC115970494 [Quercus lobata] | e value: 2.3e-17 | % identity: 43.64
Query:  ADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQGWKA
        +DDSLIFCKAS  E+ V+   L  Y  +S Q INF+KS ++FS N  +  R+ ++  MG+  V    +YLG+P+ + R K + F+ +KD++WK +QGWK 
Subjt:  ADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQGWKA

Query:  NLFSVGGKEM
         L S  GKE+
Subjt:  NLFSVGGKEM

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EFD5 Reverse transcriptase domain-containing protein | e value: 1.9e-17 | % identity: 41.94
Query:  SICR-----EIIYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFAR
        SICR       ++ ADDS+IFCKASI +   + + L  Y++AS QKIN  K+ L+FS N     R  + ++ G +  +    YLG+P  + R K R F  
Subjt:  SICR-----EIIYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFAR

Query:  IKDKLWKVVQGWKANLFSVGGKEM
        IKD++WK +QGWK  L S  GKE+
Subjt:  IKDKLWKVVQGWKANLFSVGGKEM

A0A2N9G5C3 Uncharacterized protein | e value: 6.4e-18 | % identity: 42.75
Query:  EEAWTSDSICR-----EIIYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRI
        E++    +ICR       ++ ADDS+IFC+AS  E  V++  L  Y++AS QKIN +K+  +FS N  +  R  + S+ G S  S    YLG+PS L R 
Subjt:  EEAWTSDSICR-----EIIYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRI

Query:  KNRDFARIKDKLWKVVQGWKANLFSVGGKEM
        K R F  IKD++WK +QGWK NL S  G+E+
Subjt:  KNRDFARIKDKLWKVVQGWKANLFSVGGKEM

A0A2N9ISH2 Reverse transcriptase domain-containing protein | e value: 2.4e-17 | % identity: 39.52
Query:  SICR-----EIIYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFAR
        SICR       ++ ADDS+IFCKAS      ++  L  Y  AS Q +N DK+ ++FS N  M+ R+ + + MG S+ +    YLG+P+ L R K R F+ 
Subjt:  SICR-----EIIYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFAR

Query:  IKDKLWKVVQGWKANLFSVGGKEM
        +K+++W+ +QGWK  L S  G+E+
Subjt:  IKDKLWKVVQGWKANLFSVGGKEM

A0A5B7AER2 Reverse transcriptase domain-containing protein | e value: 6.4e-18 | % identity: 44.07
Query:  IYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQG
        ++ ADDSL+F KA   ++  + R +  Y  AS Q+INFDKS L FSPNV    R+ ++ I+G+SL S    YLG+PS++ R K + F  IKD++W+ +QG
Subjt:  IYEADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQG

Query:  WKANLFSVGGKEMSPQLV
        WK  L S  G+E+  ++V
Subjt:  WKANLFSVGGKEMSPQLV

A0A803PAC6 Uncharacterized protein | e value: 2.4e-17 | % identity: 41.82
Query:  ADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQGWKA
        ADDS +FC ASI    ++K  L  Y++AS QK+NF KS L+FSPNV+++ +  +   + + + SS   YLG+P  + R K + F  + DK+W  +  WK 
Subjt:  ADDSLIFCKASIPEIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQGWKA

Query:  NLFSVGGKEM
         +FS GGKE+
Subjt:  NLFSVGGKEM

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATTGTTGCCTCCAAGATTTGGATTTCCAAGGGAACACTTTCACGTGGTGTAATCGAAGAAAACATGATCAAATCAGTGAGAGATTTGATCTTTTTTTTAGGCAATGA
GGCGTTTAGGGATCTTTGGGCAGAGTTAAGAGTAGTTCATCAGGATTGGGCCAGGTATGATCACAGGCCCATAATGATATCTTTGTTTGAAATGAGCTCTAGACCTATTC
GTCGAACGAGGGATTTCAAATTTGAAGAAGCCTGGACCAGTGATAGTATATGCAGGGAGATTATTTATGAGGCAGATGATAGCCTTATTTTCTGCAAGGCATCAATTCCT
GAAATATATGTTGTAAAGAGAACTCTGCTCGACTATAAGAAAGCCTCTGTTCAGAAGATCAATTTTGATAAATCGGTTCTCTGGTTCTCTCCTAATGTGAAGATGAAGTT
CAGGAAGTACGTGCAATCTATCATGGGGATGTCCTTAGTCTCTTCGTTGGGGAACTATCTTGGAATTCCGTCGAGCCTTTCAAGAATCAAGAATAGAGATTTTGCAAGGA
TTAAAGACAAGCTGTGGAAGGTGGTCCAAGGTTGGAAAGCAAACTTGTTTTCGGTGGGTGGAAAAGAGATGAGCCCTCAACTAGTCGAAATTGCTGCGATTAGGGAAGGT
TTGAAGATTGCAGTGCGATTAGGGCTCTCGAGGGTGATTATCGAGACGGACTCCCTGAGCTCCATTGCTCTTATTCGCGAGGGATCCCTTGTGCAAAATGAAATTTCGAA
CTGGCTAGCAGATATTCGTATGCTATCATAA
mRNA sequence
ATGATTGTTGCCTCCAAGATTTGGATTTCCAAGGGAACACTTTCACGTGGTGTAATCGAAGAAAACATGATCAAATCAGTGAGAGATTTGATCTTTTTTTTAGGCAATGA
GGCGTTTAGGGATCTTTGGGCAGAGTTAAGAGTAGTTCATCAGGATTGGGCCAGGTATGATCACAGGCCCATAATGATATCTTTGTTTGAAATGAGCTCTAGACCTATTC
GTCGAACGAGGGATTTCAAATTTGAAGAAGCCTGGACCAGTGATAGTATATGCAGGGAGATTATTTATGAGGCAGATGATAGCCTTATTTTCTGCAAGGCATCAATTCCT
GAAATATATGTTGTAAAGAGAACTCTGCTCGACTATAAGAAAGCCTCTGTTCAGAAGATCAATTTTGATAAATCGGTTCTCTGGTTCTCTCCTAATGTGAAGATGAAGTT
CAGGAAGTACGTGCAATCTATCATGGGGATGTCCTTAGTCTCTTCGTTGGGGAACTATCTTGGAATTCCGTCGAGCCTTTCAAGAATCAAGAATAGAGATTTTGCAAGGA
TTAAAGACAAGCTGTGGAAGGTGGTCCAAGGTTGGAAAGCAAACTTGTTTTCGGTGGGTGGAAAAGAGATGAGCCCTCAACTAGTCGAAATTGCTGCGATTAGGGAAGGT
TTGAAGATTGCAGTGCGATTAGGGCTCTCGAGGGTGATTATCGAGACGGACTCCCTGAGCTCCATTGCTCTTATTCGCGAGGGATCCCTTGTGCAAAATGAAATTTCGAA
CTGGCTAGCAGATATTCGTATGCTATCATAA
Protein sequence
MIVASKIWISKGTLSRGVIEENMIKSVRDLIFFLGNEAFRDLWAELRVVHQDWARYDHRPIMISLFEMSSRPIRRTRDFKFEEAWTSDSICREIIYEADDSLIFCKASIP
EIYVVKRTLLDYKKASVQKINFDKSVLWFSPNVKMKFRKYVQSIMGMSLVSSLGNYLGIPSSLSRIKNRDFARIKDKLWKVVQGWKANLFSVGGKEMSPQLVEIAAIREG
LKIAVRLGLSRVIIETDSLSSIALIREGSLVQNEISNWLADIRMLS
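
The CDS and protein sequences listed above are related by codon translation. The following is a minimal Python sketch of that step, assuming the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes but is not stated on this page. The function name `translate` and the table construction are illustrative and are not taken from the database's own pipeline.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order
# for the first, second, and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first four codons of the CDS above give the first four residues.
print(translate("ATGATTGTTGCC"))
```

Applied to the first twelve bases of the CDS (ATG ATT GTT GCC), the sketch yields MIVA, which matches the start of the protein sequence shown above.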