CuGenDBv2

Moc01g29250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g29250
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:20769307..20770246
RNA-Seq Expression: Moc01g29250
Synteny: Moc01g29250
Gene Ontology terms:
  GO:0090304 - nucleic acid metabolic process (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type


Homology

GenBank top hits (e value, % identity, alignment)

XP_022135942.1 uncharacterized protein LOC111007775 [Momordica charantia] (e value: 9.3e-24, % identity: 33.18)
Query:  EKGRIFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSNP--PQI---RQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQI
        E+  +F+    +ARN   F      L  ++ +W + Y++ Y++A   P  P +   R      W PP    +K+N DA+F   N +AGL I+IR+    +
Subjt:  EKGRIFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSNP--PQI---RQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQI

Query:  MATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPF-LQASFKFTKREGNEAAHLLARRALL
        + +A  ++ +V  V  AE LAA EG+ +A+E+G+ P ++E DS ++F L   + ED SEIG + S  R  V+   +   F F  REGN  AH LAR  ++
Subjt:  MATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPF-LQASFKFTKREGNEAAHLLARRALL

Query:  HHEQLILLEDW
             + +E+W
Subjt:  HHEQLILLEDW

XP_022139684.1 uncharacterized protein LOC111010533 [Momordica charantia] (e value: 6.4e-41, % identity: 48.58)
Query:  IFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSN--------PPQIRQTSEIR-------WHPPSYGVYKINTDASFSSSNSNAGLG-III
        +F+    + RN   FN    +   DLA W S+YI  ++A N+N            +Q+S+I        W P   GV+K+ TDASFSS + NAGLG III
Subjt:  IFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSN--------PPQIRQTSEIR-------WHPPSYGVYKINTDASFSSSNSNAGLG-III

Query:  RNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLL
        R++RGQ++A+ATKYL++V SVD+AEALAAVEGLRVAME+GI P+ LE DS+RI+ LF ++ E +S+ G II   +  +A  LQ S+ FTKR GN  AHLL
Subjt:  RNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLL

Query:  ARRALLHHEQLI
        ARRAL   E  +
Subjt:  ARRALLHHEQLI

XP_022140628.1 uncharacterized protein LOC111011237 [Momordica charantia] (e value: 5.4e-48, % identity: 58.1)
Query:  LAEWASSYIRAYRAANSN--PPQIRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMES
        L EWA+ Y+  +R ANSN  P ++  T+E+ W PP   +YKINTDASF +S+ +AGLGIIIRN RGQ+MA+ATKYL+N+ SVD AEA+ AVEGL++A + 
Subjt:  LAEWASSYIRAYRAANSN--PPQIRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMES

Query:  GIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWP
        G+ PV LE DS RIF LF++  ED+SE GEI+  A++     L ASF F KREGN+AAH+LARRALL  E  I +EDWP
Subjt:  GIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWP

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] (e value: 3.0e-54, % identity: 44.11)
Query:  MTLPCKLKVFFWRLCLDRLPTGASLCKRRVDVSKLCFFAEEKG----RIFIMC-----------FG-------------------------------SAR
        M +P K+KVF WRLCLDRLPTG +L KR V+++  C+F    G     +F +C           FG                               + R
Subjt:  MTLPCKLKVFFWRLCLDRLPTGASLCKRRVDVSKLCFFAEEKG----RIFIMC-----------FG-------------------------------SAR

Query:  NGKAFNPSGADLHG---DLAEWASSYIRAYRAANSNP--PQIRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSV
        N +AFN S   +     +L EWA+ Y   +R A SNP   ++  T+EI W PP  G+YKINTDASF +S+ +AGLGIII N RGQ+MA ATKYL+N+ SV
Subjt:  NGKAFNPSGADLHG---DLAEWASSYIRAYRAANSNP--PQIRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSV

Query:  DEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWP
        D AEA+AAVEGL++A E G++P                 +ED+SE GEI+  A++     L ASF F KREGN+AAH+LARRALL HE  I +EDWP
Subjt:  DEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWP

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia] (e value: 6.6e-22, % identity: 33.17)
Query:  IFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSNPPQ----IRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATAT
        + +    +ARN    + S      DL  W+ +Y++ Y+AA  +        R      W PP+  + K+N DA+F   +  AG+G+IIR+  G +  TA 
Subjt:  IFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSNPPQ----IRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATAT

Query:  KYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQ-ASFKFTKREGNEAAHLLARRALLHHEQL
        + L     VD  E  A  EG+ +A+E+G    ++E DS+RIF L   +  D SE+G + S  +  ++   +  SF FT R GN  AHLLA+ AL      
Subjt:  KYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQ-ASFKFTKREGNEAAHLLARRALLHHEQL

Query:  ILLEDWPN
        I +E+WP+
Subjt:  ILLEDWPN

TrEMBL top hits (e value, % identity, alignment)

A0A2N9J7E4 Uncharacterized protein (e value: 5.3e-25, % identity: 30.89)
Query:  LPCKLKVFFWRLCLDRLPTGASLCKRRVDVSKLCFFAEEKGRIFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSNPPQIRQT-SEIRWHP
        +P K+K F WR C D LPT + L +R+V  + LC     + ++      + RN    +P  +D +  +   A   ++ Y A  +     +QT  + RW  
Subjt:  LPCKLKVFFWRLCLDRLPTGASLCKRRVDVSKLCFFAEEKGRIFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSNPPQIRQT-SEIRWHP

Query:  PSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISD
        P    YK+N D +    +++ G+G++IR++ G  +AT ++ +  + +V+  EALAA   +  A E GI  VE+E D+  I K  N      +  G +I D
Subjt:  PSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISD

Query:  ARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWPN
        A+  +  F + S   T+R GN  AH LARRA   +   + LE+ P+
Subjt:  ARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWPN

A0A6J1CDQ4 uncharacterized protein LOC111010533 (e value: 3.1e-41, % identity: 48.58)
Query:  IFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSN--------PPQIRQTSEIR-------WHPPSYGVYKINTDASFSSSNSNAGLG-III
        +F+    + RN   FN    +   DLA W S+YI  ++A N+N            +Q+S+I        W P   GV+K+ TDASFSS + NAGLG III
Subjt:  IFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSN--------PPQIRQTSEIR-------WHPPSYGVYKINTDASFSSSNSNAGLG-III

Query:  RNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLL
        R++RGQ++A+ATKYL++V SVD+AEALAAVEGLRVAME+GI P+ LE DS+RI+ LF ++ E +S+ G II   +  +A  LQ S+ FTKR GN  AHLL
Subjt:  RNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLL

Query:  ARRALLHHEQLI
        ARRAL   E  +
Subjt:  ARRALLHHEQLI

A0A6J1CIF1 uncharacterized protein LOC111011237 (e value: 2.6e-48, % identity: 58.1)
Query:  LAEWASSYIRAYRAANSN--PPQIRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMES
        L EWA+ Y+  +R ANSN  P ++  T+E+ W PP   +YKINTDASF +S+ +AGLGIIIRN RGQ+MA+ATKYL+N+ SVD AEA+ AVEGL++A + 
Subjt:  LAEWASSYIRAYRAANSN--PPQIRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMES

Query:  GIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWP
        G+ PV LE DS RIF LF++  ED+SE GEI+  A++     L ASF F KREGN+AAH+LARRALL  E  I +EDWP
Subjt:  GIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWP

A0A6J1DAR4 uncharacterized protein LOC111018954 (e value: 1.4e-54, % identity: 44.11)
Query:  MTLPCKLKVFFWRLCLDRLPTGASLCKRRVDVSKLCFFAEEKG----RIFIMC-----------FG-------------------------------SAR
        M +P K+KVF WRLCLDRLPTG +L KR V+++  C+F    G     +F +C           FG                               + R
Subjt:  MTLPCKLKVFFWRLCLDRLPTGASLCKRRVDVSKLCFFAEEKG----RIFIMC-----------FG-------------------------------SAR

Query:  NGKAFNPSGADLHG---DLAEWASSYIRAYRAANSNP--PQIRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSV
        N +AFN S   +     +L EWA+ Y   +R A SNP   ++  T+EI W PP  G+YKINTDASF +S+ +AGLGIII N RGQ+MA ATKYL+N+ SV
Subjt:  NGKAFNPSGADLHG---DLAEWASSYIRAYRAANSNP--PQIRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSV

Query:  DEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWP
        D AEA+AAVEGL++A E G++P                 +ED+SE GEI+  A++     L ASF F KREGN+AAH+LARRALL HE  I +EDWP
Subjt:  DEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWP

M5XK32 Reverse transcriptase domain-containing protein (Fragment) (e value: 9.0e-25, % identity: 31.68)
Query:  TLPCKLKVFFWRLCLDRLPTGASLCKRRVDVSKLCFF----AEEKGRIFIMC-FGSARNGKAFNPSGADLHG---------DLAEWASSYIRAYRAANSN
        T+P KLK+F WR+  D LPT A+L K+ VD+  +C F     E    +  MC F  A     +N S    H          D+  +A  Y+  +  AN  
Subjt:  TLPCKLKVFFWRLCLDRLPTGASLCKRRVDVSKLCFF----AEEKGRIFIMC-FGSARNGKAFNPSGADLHG---------DLAEWASSYIRAYRAANSN

Query:  PPQI--RQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLF
        P ++  R    +RW  PS G  K N D +F  ++    +G++ R+  G  +A   K +  V+S + AE LAA EG+ +A+  G      E DS  +    
Subjt:  PPQI--RQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLF

Query:  NKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWPN
         +  +D S IG I+ D +     F  + F+FT RE N   H LAR  L + +  I  E  P+
Subjt:  NKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRALLHHEQLILLEDWPN

SwissProt top hits (e value, % identity, alignment)

No hits found

Arabidopsis top hits (e value, % identity, alignment)

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 2.7e-13, % identity: 29.41)
Query:  DLAEWASSYIRAYRAANSNPPQIRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESG
        DL EW    IR    +    PQ+ ++S  RW PP +   K NTDA+++  N   G+G ++RN +G++     + L  + SV EAE L A+    +++   
Subjt:  DLAEWASSYIRAYRAANSNPPQIRQTSEIRWHPPSYGVYKINTDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESG

Query:  IYP-VELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRAL--LHHEQLI--LLEDWPNSTL
         Y  V  E DS  + ++ N + E    +   I D +  ++ F +  F F  REGN  A  +AR +L  L+++  +  ++  W  S++
Subjt:  IYP-VELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKREGNEAAHLLARRAL--LHHEQLI--LLEDWPNSTL


Sequences

CDS sequence
ATGACTCTCCCTTGCAAACTTAAAGTGTTCTTTTGGCGGCTTTGCTTAGATAGGCTCCCCACTGGCGCCAGTCTATGCAAAAGGCGAGTAGACGTTTCCAAATTGTGTTT
TTTTGCGGAAGAAAAGGGGAGGATATTTATCATGTGTTTTGGGAGTGCAAGAAATGGGAAGGCCTTCAATCCAAGTGGGGCAGATCTTCATGGTGATTTAGCTGAATGGG
CTAGTTCCTACATCAGGGCATATAGAGCAGCAAATTCCAACCCTCCTCAGATTCGGCAAACAAGTGAAATCAGGTGGCACCCACCAAGTTATGGGGTTTATAAAATCAAT
ACAGATGCATCTTTTTCTTCATCGAATTCAAATGCTGGTCTGGGTATCATCATAAGAAATTATCGAGGACAAATAATGGCGACAGCAACAAAATACCTGCAGAATGTCAT
GTCGGTGGATGAAGCAGAAGCTTTAGCAGCGGTGGAAGGGTTGAGAGTAGCGATGGAGTCCGGCATATACCCTGTGGAATTGGAGATGGATTCCATTCGAATTTTCAAGC
TATTCAACAAGGAAATGGAAGACATTTCTGAGATTGGAGAAATTATTTCAGATGCCCGTGATAGTGTGGCTCCTTTTTTGCAAGCCTCTTTCAAATTTACGAAAAGGGAG
GGGAATGAAGCAGCTCACCTGCTTGCTAGGCGTGCTCTTCTGCACCACGAACAGTTAATTTTGCTCGAAGATTGGCCGAACTCTACTCTGTGCTTTCGTTGGAGTGTGTG
GATTGCTTAG

mRNA sequence
ATGACTCTCCCTTGCAAACTTAAAGTGTTCTTTTGGCGGCTTTGCTTAGATAGGCTCCCCACTGGCGCCAGTCTATGCAAAAGGCGAGTAGACGTTTCCAAATTGTGTTT
TTTTGCGGAAGAAAAGGGGAGGATATTTATCATGTGTTTTGGGAGTGCAAGAAATGGGAAGGCCTTCAATCCAAGTGGGGCAGATCTTCATGGTGATTTAGCTGAATGGG
CTAGTTCCTACATCAGGGCATATAGAGCAGCAAATTCCAACCCTCCTCAGATTCGGCAAACAAGTGAAATCAGGTGGCACCCACCAAGTTATGGGGTTTATAAAATCAAT
ACAGATGCATCTTTTTCTTCATCGAATTCAAATGCTGGTCTGGGTATCATCATAAGAAATTATCGAGGACAAATAATGGCGACAGCAACAAAATACCTGCAGAATGTCAT
GTCGGTGGATGAAGCAGAAGCTTTAGCAGCGGTGGAAGGGTTGAGAGTAGCGATGGAGTCCGGCATATACCCTGTGGAATTGGAGATGGATTCCATTCGAATTTTCAAGC
TATTCAACAAGGAAATGGAAGACATTTCTGAGATTGGAGAAATTATTTCAGATGCCCGTGATAGTGTGGCTCCTTTTTTGCAAGCCTCTTTCAAATTTACGAAAAGGGAG
GGGAATGAAGCAGCTCACCTGCTTGCTAGGCGTGCTCTTCTGCACCACGAACAGTTAATTTTGCTCGAAGATTGGCCGAACTCTACTCTGTGCTTTCGTTGGAGTGTGTG
GATTGCTTAG

Protein sequence
MTLPCKLKVFFWRLCLDRLPTGASLCKRRVDVSKLCFFAEEKGRIFIMCFGSARNGKAFNPSGADLHGDLAEWASSYIRAYRAANSNPPQIRQTSEIRWHPPSYGVYKIN
TDASFSSSNSNAGLGIIIRNYRGQIMATATKYLQNVMSVDEAEALAAVEGLRVAMESGIYPVELEMDSIRIFKLFNKEMEDISEIGEIISDARDSVAPFLQASFKFTKRE
GNEAAHLLARRALLHHEQLILLEDWPNSTLCFRWSVWIA
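
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB. Only the first line of the CDS is used here to keep the example short.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the CDS sequence of Moc01g29250, copied from this record.
cds_start = (
    "ATGACTCTCCCTTGCAAACTTAAAGTGTTCTTTTGGCGGCTTTGCTTAGATAGGCTCCCCACTGGCGCC"
    "AGTCTATGCAAAAGGCGAGTAGACGTTTCCAAATTGTGTTT"
)

print(translate(cds_start))  # prints MTLPCKLKVFFWRLCLDRLPTGASLCKRRVDVSKLC
```

The printed peptide matches the first 36 residues of the protein sequence in this record. Joining all CDS lines into a single string before calling `translate` would extend the check to the full protein.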