; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g06080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g06080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:4150307..4151647
RNA-Seq ExpressionMoc04g06080
SyntenyMoc04g06080
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EEC68887.1 hypothetical protein OsI_37529 [Oryza sativa Indica Group]5.5e-3227.55Show/hide
Query:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV
        ++ +Y   G  +    G N   AWRS  +G DL ++G  W++GN  S+    D WLPR+++ +PI  +   R + V  LI   G W+   I   F   + 
Subjt:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV

Query:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIYNDIIPESSTHFF-----WNCR--
        EVILNI +S R++ D + W PDK G FSV+SAYRL   L + +E  +S +    K W+  W+  V  K+KI  WR+ ++ +   +         W  R  
Subjt:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIYNDIIPESSTHFF-----WNCR--

Query:  --HGWCS----------KDFCDWMWSQRDRRSA--VEGN----------------------SWTTPPHHRWKLNTNTTWMDSLNRGGIGWILRDS-----
          HG  +          + + D ++  R    A  V+G                        W  P     KLN + ++  S  +GG+G ILR+S     
Subjt:  --HGWCS----------KDFCDWMWSQRDRRSA--VEGN----------------------SWTTPPHHRWKLNTNTTWMDSLNRGGIGWILRDS-----

Query:  ----------------------NGLKSINKEERLETIVESDCLELVNLLKGVDSDLTEISFSIDEALDLQ------EVMKMCSQFGKASLHI
                               GLK       L   VE+DC  +V LL+G+  D + ++  I EA  L        + K+C      S H+
Subjt:  ----------------------NGLKSINKEERLETIVESDCLELVNLLKGVDSDLTEISFSIDEALDLQ------EVMKMCSQFGKASLHI

XP_022149515.1 uncharacterized protein LOC111017927 [Momordica charantia]8.1e-5258.08Show/hide
Query:  ALIRSAGGWNESLIRNSFLEEEVEVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIY
        AL+     WNESLIRNSFLEEE ++ILNIPLS  NQHDEVIWGPDKK KFSVKS YRLGVHLASADEVQTSNSEEEAKKWKK WRT V +K+KICCWRIY
Subjt:  ALIRSAGGWNESLIRNSFLEEEVEVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIY

Query:  NDII------------------------PESSTHFFWNCRHG-WCSKD--------------------FCDWMWSQRDRRSAVEGNSWTTPPHHRWKL
        NDII                         ESSTH FWNCR+  + SK+                      +    +R RRS +EG+SWTTPPHHRW L
Subjt:  NDII------------------------PESSTHFFWNCRHG-WCSKD--------------------FCDWMWSQRDRRSAVEGNSWTTPPHHRWKL

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]5.7e-3751.08Show/hide
Query:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV
        +RG+Y K G FL+AKLG  P YAWRS  WGRDLF++GYRWK+GN  S+  + DPWLPR+ N+ P+     VR   V  L+   G W+E  +R SF+  E 
Subjt:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV

Query:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHL
        ++IL  PL  +++ DE+IWG DK G FSV+SAY LG+ L
Subjt:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHL

XP_030486845.1 uncharacterized protein LOC115703751 [Cannabis sativa]1.0e-3331.77Show/hide
Query:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV
        ++ RY+    FL A  G  P   W+   WG+DL  +G RWK+G+  S+    DPWLP   N++P+++ G      V +LI +   WN +L+ + F+  +V
Subjt:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV

Query:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKK-WKKFWRTSVSSKLKICCWRIYNDIIP-----------------
        E IL+IPL++    D +IW  +  G ++VKS Y +  +L   +E Q S S  ++++ WKKFWR S+ SK++I  WR  ND +P                 
Subjt:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKK-WKKFWRTSVSSKLKICCWRIYNDIIP-----------------

Query:  ------ESSTHFFWNC---RHGWCSKDF-CDWMWSQRDRRSAVEGNS-----------------WTTPPHHRWKLNTNTTWMDSLNRGGIGWILRDSNG
              ES  H  + C   R  W    F  D +   ++ +SA +  +                 W  PP  + KLNT+     S  R G G +LRDSNG
Subjt:  ------ESSTHFFWNC---RHGWCSKDF-CDWMWSQRDRRSAVEGNS-----------------WTTPPHHRWKLNTNTTWMDSLNRGGIGWILRDSNG

XP_030500352.1 uncharacterized protein LOC115715819 [Cannabis sativa]9.3e-3229.85Show/hide
Query:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV
        ++ RY+    FL+A +GH+P + W+   WGR LF  G RWKIG    +  A+DPW+PR + ++P    G   +  V  LI     W+ +L+   F   +V
Subjt:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV

Query:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIYNDIIPESST--------------
        + IL IPLS     D + W P   G +SV++ Y L   LA  D+  +S S   A  W   W  S+  K+KI  WR +ND +P ++               
Subjt:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIYNDIIPESST--------------

Query:  -HFFWNCRHGWCSKDFCD-------WMWSQRDRRSAVEGNSWTTPPHHRWKLNTNTTWMDSLNRGGIGWILRDSNG--LKSINK-------EERLETIVE
         H   +  + WCS    D          S      A     WT PP    K+N +     +    GIG ++R SNG  + +I+K          +E I  
Subjt:  -HFFWNCRHGWCSKDFCD-------WMWSQRDRRSAVEGNSWTTPPHHRWKLNTNTTWMDSLNRGGIGWILRDSNG--LKSINK-------EERLETIVE

Query:  SDCL----ELVNLLKGVDSDLTEIS
          CL    +L  L+  V++D   +S
Subjt:  SDCL----ELVNLLKGVDSDLTEIS

TrEMBL top hitse value%identityAlignment
A0A6J1D5Y4 uncharacterized protein LOC1110179273.9e-5258.08Show/hide
Query:  ALIRSAGGWNESLIRNSFLEEEVEVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIY
        AL+     WNESLIRNSFLEEE ++ILNIPLS  NQHDEVIWGPDKK KFSVKS YRLGVHLASADEVQTSNSEEEAKKWKK WRT V +K+KICCWRIY
Subjt:  ALIRSAGGWNESLIRNSFLEEEVEVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIY

Query:  NDII------------------------PESSTHFFWNCRHG-WCSKD--------------------FCDWMWSQRDRRSAVEGNSWTTPPHHRWKL
        NDII                         ESSTH FWNCR+  + SK+                      +    +R RRS +EG+SWTTPPHHRW L
Subjt:  NDII------------------------PESSTHFFWNCRHG-WCSKD--------------------FCDWMWSQRDRRSAVEGNSWTTPPHHRWKL

A0A6J1DRA0 uncharacterized protein LOC1110224232.7e-3751.08Show/hide
Query:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV
        +RG+Y K G FL+AKLG  P YAWRS  WGRDLF++GYRWK+GN  S+  + DPWLPR+ N+ P+     VR   V  L+   G W+E  +R SF+  E 
Subjt:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV

Query:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHL
        ++IL  PL  +++ DE+IWG DK G FSV+SAY LG+ L
Subjt:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHL

A0A803P9P5 Uncharacterized protein6.3e-3437.91Show/hide
Query:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV
        ++ RY K   FL+A++G  P   WRS  WG++L  +G RWK+GN   +  A DPWLP   ++KP++ +       V  LI +   WN SL++  FLE +V
Subjt:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV

Query:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIYNDIIP
          I +IPL++ +Q D++IW  +  G +SVKS Y L   L   +++  ++S    + WKKFW  ++ SK+KI  WR  +D +P
Subjt:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIYNDIIP

A0A803PAK3 Uncharacterized protein3.1e-3331.71Show/hide
Query:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV
        ++ RY +   FL A LG +P   WR   WG++L  +G RWK+GN   +  A  PWLP   ++KP+  RG      V  LI +   WN  L+   FL  +V
Subjt:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV

Query:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKK-WKKFWRTSVSSKLKICCWRIYNDIIP-----------------
         +I +IPL+    HD +IW  +  G +SVKS Y L   L   +E Q  +S  +A++ WKKFW   + SK++I  WR  +D +P                 
Subjt:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKK-WKKFWRTSVSSKLKICCWRIYNDIIP-----------------

Query:  ------ESSTHFFWNCRHGWCSKDFCDWMWSQRDRRSA---------VEGNSWTTPPHHRWKLNTNTTWMDSLNRGGIGWILRDSNG
              E+  H  + C+     K + D    Q    SA          + + W  PP    KLNT+       N+ G G +LRD +G
Subjt:  ------ESSTHFFWNCRHGWCSKDFCDWMWSQRDRRSA---------VEGNSWTTPPHHRWKLNTNTTWMDSLNRGGIGWILRDSNG

A0A803PC16 Uncharacterized protein2.0e-3532.01Show/hide
Query:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV
        ++ RY +   FL A LG  P   WR   WG++L  +G RWK+GN   +  A  PWLP   ++KP+  RG      V  LI +   WN  L+   FL  +V
Subjt:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEV

Query:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKK-WKKFWRTSVSSKLKICCWRIYNDIIP-----------------
         +I +IPL+    HD +IW  +  G +SVKS Y L   L   +E Q  +S  +A++ WKKFW   + SK++I  WR  +D +P                 
Subjt:  EVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKK-WKKFWRTSVSSKLKICCWRIYNDIIP-----------------

Query:  ------ESSTHFFWNCRHGWCSKDFCDWMWSQRDRRSAVEGNSWTTPPHHRWKLNTNTTWMDSLNRGGIGWILRDSNG
              E+  H  + C+    +  +     S     +A + + W  PP  R KLNT+       N+ G G +LRD +G
Subjt:  ------ESSTHFFWNCRHGWCSKDFCDWMWSQRDRRSAVEGNSWTTPPHHRWKLNTNTTWMDSLNRGGIGWILRDSNG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein5.3e-0924.16Show/hide
Query:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGG---WNESLIRNSFLE
        M+ RY K    L AK+     Y W S   G  L ++G R  IG+  ++    D  +   +  +P+      +E  ++ L    G    W++S I     +
Subjt:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGG---WNESLIRNSFLE

Query:  EEVEVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWR
         +   I  I L+   + D++IW  +  G+++V+S Y L  H  S +    +          + W   +  KLK   WR
Subjt:  EEVEVILNIPLSVRNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.0e-0432.31Show/hide
Query:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPI
        +R RY      ++  +G  P YAWRS   GR+L  RG    IG+ +      D W+  E    P+
Subjt:  MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGCAGGTACCACAAGGGAGGGGAATTCTTAAAAGCAAAATTGGGACACAATCCTCCCTATGCTTGGAGAAGTACATGGTGGGGGCGGGATCTATTCAGGAGGGG
ATATCGATGGAAAATTGGAAATGATTTAAGCGTGGCGGCTGCTGAAGATCCTTGGCTGCCTAGGGAGAATAACTACAAGCCTATTCTTGTTCGTGGAGCTGTCAGGGAGG
AGCATGTAGATGCTCTCATTCGAAGCGCTGGAGGGTGGAACGAAAGCTTGATTAGGAATTCTTTTCTAGAGGAGGAGGTTGAGGTCATTTTAAATATTCCTCTTTCGGTG
CGCAATCAACATGATGAAGTTATATGGGGGCCAGATAAGAAAGGGAAGTTCAGCGTTAAAAGTGCTTATAGATTGGGTGTTCACTTGGCTTCCGCTGATGAGGTCCAAAC
CTCAAATTCAGAGGAAGAAGCTAAGAAATGGAAGAAATTTTGGAGAACATCAGTGTCTTCAAAACTCAAGATTTGTTGCTGGCGAATTTACAACGACATCATTCCTGAGT
CATCCACACATTTTTTCTGGAACTGCAGGCACGGATGGTGCTCCAAGGATTTTTGCGACTGGATGTGGTCACAGCGGGATAGGCGAAGCGCAGTTGAAGGGAACAGTTGG
ACCACACCACCCCACCATAGATGGAAATTGAACACGAACACTACCTGGATGGACTCCCTGAATCGTGGTGGCATTGGCTGGATTCTTCGTGACTCTAATGGTCTGAAATC
TATCAACAAGGAAGAAAGGTTAGAAACCATTGTTGAGTCGGACTGCCTTGAACTAGTTAATCTGCTGAAAGGGGTGGATTCCGACCTAACTGAGATTAGCTTTTCCATCG
ACGAAGCTTTGGATCTACAAGAGGTCATGAAGATGTGCTCACAGTTTGGCAAAGCAAGCCTGCACATCGACTTTTTCCGGAAGTTAGGGGGGTGTTCTGGACTGTCGTGG
CTTTCTTCCTTGATGGAAGATGATGTTCGTGCGTGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGCAGGTACCACAAGGGAGGGGAATTCTTAAAAGCAAAATTGGGACACAATCCTCCCTATGCTTGGAGAAGTACATGGTGGGGGCGGGATCTATTCAGGAGGGG
ATATCGATGGAAAATTGGAAATGATTTAAGCGTGGCGGCTGCTGAAGATCCTTGGCTGCCTAGGGAGAATAACTACAAGCCTATTCTTGTTCGTGGAGCTGTCAGGGAGG
AGCATGTAGATGCTCTCATTCGAAGCGCTGGAGGGTGGAACGAAAGCTTGATTAGGAATTCTTTTCTAGAGGAGGAGGTTGAGGTCATTTTAAATATTCCTCTTTCGGTG
CGCAATCAACATGATGAAGTTATATGGGGGCCAGATAAGAAAGGGAAGTTCAGCGTTAAAAGTGCTTATAGATTGGGTGTTCACTTGGCTTCCGCTGATGAGGTCCAAAC
CTCAAATTCAGAGGAAGAAGCTAAGAAATGGAAGAAATTTTGGAGAACATCAGTGTCTTCAAAACTCAAGATTTGTTGCTGGCGAATTTACAACGACATCATTCCTGAGT
CATCCACACATTTTTTCTGGAACTGCAGGCACGGATGGTGCTCCAAGGATTTTTGCGACTGGATGTGGTCACAGCGGGATAGGCGAAGCGCAGTTGAAGGGAACAGTTGG
ACCACACCACCCCACCATAGATGGAAATTGAACACGAACACTACCTGGATGGACTCCCTGAATCGTGGTGGCATTGGCTGGATTCTTCGTGACTCTAATGGTCTGAAATC
TATCAACAAGGAAGAAAGGTTAGAAACCATTGTTGAGTCGGACTGCCTTGAACTAGTTAATCTGCTGAAAGGGGTGGATTCCGACCTAACTGAGATTAGCTTTTCCATCG
ACGAAGCTTTGGATCTACAAGAGGTCATGAAGATGTGCTCACAGTTTGGCAAAGCAAGCCTGCACATCGACTTTTTCCGGAAGTTAGGGGGGTGTTCTGGACTGTCGTGG
CTTTCTTCCTTGATGGAAGATGATGTTCGTGCGTGCTAA
Protein sequenceShow/hide protein sequence
MRGRYHKGGEFLKAKLGHNPPYAWRSTWWGRDLFRRGYRWKIGNDLSVAAAEDPWLPRENNYKPILVRGAVREEHVDALIRSAGGWNESLIRNSFLEEEVEVILNIPLSV
RNQHDEVIWGPDKKGKFSVKSAYRLGVHLASADEVQTSNSEEEAKKWKKFWRTSVSSKLKICCWRIYNDIIPESSTHFFWNCRHGWCSKDFCDWMWSQRDRRSAVEGNSW
TTPPHHRWKLNTNTTWMDSLNRGGIGWILRDSNGLKSINKEERLETIVESDCLELVNLLKGVDSDLTEISFSIDEALDLQEVMKMCSQFGKASLHIDFFRKLGGCSGLSW
LSSLMEDDVRAC