CuGenDBv2

Moc03g17040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g17040
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:11309490..11310832
RNA-Seq Expression: Moc03g17040
Synteny: Moc03g17040
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: NA
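
The genome location string for this gene (chr3:11309490..11310832) can be split into a chromosome name and two coordinates for downstream use. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr3:11309490..11310832")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr3 11309490 11310832 1343
```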


Homology
GenBank top hits
KYP52189.1 hypothetical protein KK1_025927 [Cajanus cajan] | e value: 1.2e-12 | % identity: 28.09
Query:  NGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPR-ESYFKLLTTSEENVELMVSELIN-GSGEWDVPKLNSMLMKEDVDLIRKIPLSRYNRRD-
        N SY W+S+   + + K+G+R+ V +G  I  +K     R  + +    T   +  LMV +LI+  +G WD+P L+++  ++D+  I+ +PL   N+ D 
Subjt:  NGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPR-ESYFKLLTTSEENVELMVSELIN-GSGEWDVPKLNSMLMKEDVDLIRKIPLSRYNRRD-

Query:  -GWL--RSGVPVEDEVVVNCDASCLVNTNRTRIRVSIRTGSYFWKYAMTNFYEIGFPPLH-AEVIAIRDGIRLAHRLELEKFVIESDCAEAINLLNGGET
          W   R G+        N DA+   +TN       +R     +K A T +Y  G  P H AEVIA  + +        E  +IE DC   ++ L+G   
Subjt:  -GWL--RSGVPVEDEVVVNCDASCLVNTNRTRIRVSIRTGSYFWKYAMTNFYEIGFPPLH-AEVIAIRDGIRLAHRLELEKFVIESDCAEAINLLNGGET

Query:  NKSDMGSWLLEIDEVRKRLKFVSFKHVLRRRNVVA
          S+ G  + +   +    K +S + + R+ N VA
Subjt:  NKSDMGSWLLEIDEVRKRLKFVSFKHVLRRRNVVA

OMO67823.1 reverse transcriptase [Corchorus capsularis] | e value: 4.8e-14 | % identity: 24.91
Query:  KNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENV-ELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSR------
        +N S+ W+SL+ GR + KEG R+ +  G  +  +      +   F+ L+  E  + ++ VSELI+  G W+V  L+ +   EDV+ I  +PLS+      
Subjt:  KNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENV-ELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSR------

Query:  -------------------YNRRDGWLRSGVPVEDEVVVNCDASCLVNTNRTRIRVSIRTGSYFWKYAMTNFYEIGFPPLHAEVIAIRDGIRLAHRLELE
                               D W     P      +N D +  V+++  ++ V +R  +   +++           + +E++AI  G+ +A    +E
Subjt:  -------------------YNRRDGWLRSGVPVEDEVVVNCDASCLVNTNRTRIRVSIRTGSYFWKYAMTNFYEIGFPPLHAEVIAIRDGIRLAHRLELE

Query:  KFVIESDCAEAINLLNGGETNKSDMGSWLLEIDEVRKRLKFVSFKHVLRRRNVVARMAVSTRTSI
        K V++SDC +AI  +  G  +  D    + EI          SFKHV R  NV+A        S+
Subjt:  KFVIESDCAEAINLLNGGETNKSDMGSWLLEIDEVRKRLKFVSFKHVLRRRNVVARMAVSTRTSI

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] | e value: 5.0e-11 | % identity: 34.69
Query:  MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLS
        + AE   N S+ W+SL WG++L  +GLR+ V +G +I+ +  K  P  S+FK+++  +  +  +V +L   SG+W+VP L  +   ++VD   +IPL+
Subjt:  MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLS

XP_022131662.1 uncharacterized protein LOC111004787 [Momordica charantia] | e value: 4.8e-14 | % identity: 41.58
Query:  KNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRYNRRDGW
        +N SYFWK  +WGRDL  +GLR  V +G TI  F     PR   F+ +T      ++ V++LIN +G+WDV  ++ +  +ED DLI  +P+S YN  D W
Subjt:  KNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRYNRRDGW

Query:  L
        +
Subjt:  L

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | e value: 1.5e-15 | % identity: 40.19
Query:  MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRY
        + A  +   SYFWK  +WGRDL  +GLR  V +G TI+AF     PR + FK L  +   ++  V+  I   G WDV  ++     ED DLI  +P+S Y
Subjt:  MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRY

Query:  NRRDGWL
        N +D WL
Subjt:  NRRDGWL

TrEMBL top hits
A0A5E4FZN9 PREDICTED: retrotransposon | e value: 2.4e-11 | % identity: 34.69
Query:  MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLS
        + AE   N S+ W+SL WG++L  +GLR+ V +G +I+ +  K  P  S+FK+++  +  +  +V +L   SG+W+VP L  +   ++VD   +IPL+
Subjt:  MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLS

A0A6J1BRN0 uncharacterized protein LOC111004787 | e value: 2.3e-14 | % identity: 41.58
Query:  KNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRYNRRDGW
        +N SYFWK  +WGRDL  +GLR  V +G TI  F     PR   F+ +T      ++ V++LIN +G+WDV  ++ +  +ED DLI  +P+S YN  D W
Subjt:  KNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRYNRRDGW

Query:  L
        +
Subjt:  L

A0A6J1DX30 uncharacterized protein LOC111024874 | e value: 7.3e-16 | % identity: 40.19
Query:  MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRY
        + A  +   SYFWK  +WGRDL  +GLR  V +G TI+AF     PR + FK L  +   ++  V+  I   G WDV  ++     ED DLI  +P+S Y
Subjt:  MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRY

Query:  NRRDGWL
        N +D WL
Subjt:  NRRDGWL

A0A7J6GI20 zf-RVT domain-containing protein | e value: 4.1e-11 | % identity: 45.65
Query:  MWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRYNRRD--GW
        MWG+DL K+GLRF V DG  IRAF+    PR + F +  + E N  L V  LI   G+WD+PKL  + + +DV+LI  I LS +   D  GW
Subjt:  MWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRYNRRD--GW

M5W5F3 Reverse transcriptase domain-containing protein (Fragment) | e value: 4.1e-11 | % identity: 35.71
Query:  MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLS
        + AE   N S+ W+SL WG++L  +GLR+ V  G +I+ +  K  P  S FK+++  +  +   V +L   SG+W+VP L  +   ++VD I +IPL+
Subjt:  MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLS

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCTGCTGAGGGTGATAAGAATGGATCTTACTTCTGGAAAAGTTTGATGTGGGGGCGCGATCTGCCAAAGGAGGGGCTGAGATTCTGGGTGAGAGATGGATGC
ACGATAAGGGCCTTTAAGGGTAAGTGTGCCCCTAGAGAATCATATTTCAAGCTCCTTACTACGTCAGAAGAGAATGTGGAATTGATGGTAAGTGAGCTGATCAAT
GGGAGTGGAGAATGGGATGTGCCGAAGCTAAATTCTATGCTGATGAAGGAAGATGTTGATCTGATTAGAAAAATTCCTCTTAGCAGATATAACAGGAGGGATGGC
TGGTTGAGAAGCGGGGTACCTGTTGAGGACGAGGTTGTTGTGAACTGTGACGCGTCCTGCCTGGTGAACACGAATCGAACTAGAATTCGAGTTTCCATCAGGACG
GGATCTTACTTTTGGAAGTATGCAATGACAAATTTCTATGAGATTGGTTTCCCTCCCCTGCATGCAGAGGTGATTGCGATTAGAGATGGGATCCGGCTGGCCCAT
CGTCTGGAGCTGGAAAAATTTGTGATAGAGTCAGATTGTGCTGAAGCAATTAACCTCCTAAATGGTGGGGAGACTAATAAATCTGATATGGGTTCCTGGCTTTTG
GAAATTGATGAGGTAAGGAAGAGGTTGAAATTCGTTTCCTTCAAGCATGTGCTCAGGCGGAGGAATGTGGTCGCTAGAATGGCGGTTTCCACTAGAACTTCTATT
CTCTAA
mRNA sequence
ATGTCTGCTGAGGGTGATAAGAATGGATCTTACTTCTGGAAAAGTTTGATGTGGGGGCGCGATCTGCCAAAGGAGGGGCTGAGATTCTGGGTGAGAGATGGATGC
ACGATAAGGGCCTTTAAGGGTAAGTGTGCCCCTAGAGAATCATATTTCAAGCTCCTTACTACGTCAGAAGAGAATGTGGAATTGATGGTAAGTGAGCTGATCAAT
GGGAGTGGAGAATGGGATGTGCCGAAGCTAAATTCTATGCTGATGAAGGAAGATGTTGATCTGATTAGAAAAATTCCTCTTAGCAGATATAACAGGAGGGATGGC
TGGTTGAGAAGCGGGGTACCTGTTGAGGACGAGGTTGTTGTGAACTGTGACGCGTCCTGCCTGGTGAACACGAATCGAACTAGAATTCGAGTTTCCATCAGGACG
GGATCTTACTTTTGGAAGTATGCAATGACAAATTTCTATGAGATTGGTTTCCCTCCCCTGCATGCAGAGGTGATTGCGATTAGAGATGGGATCCGGCTGGCCCAT
CGTCTGGAGCTGGAAAAATTTGTGATAGAGTCAGATTGTGCTGAAGCAATTAACCTCCTAAATGGTGGGGAGACTAATAAATCTGATATGGGTTCCTGGCTTTTG
GAAATTGATGAGGTAAGGAAGAGGTTGAAATTCGTTTCCTTCAAGCATGTGCTCAGGCGGAGGAATGTGGTCGCTAGAATGGCGGTTTCCACTAGAACTTCTATT
CTCTAA
Protein sequence
MSAEGDKNGSYFWKSLMWGRDLPKEGLRFWVRDGCTIRAFKGKCAPRESYFKLLTTSEENVELMVSELINGSGEWDVPKLNSMLMKEDVDLIRKIPLSRYNRRDG
WLRSGVPVEDEVVVNCDASCLVNTNRTRIRVSIRTGSYFWKYAMTNFYEIGFPPLHAEVIAIRDGIRLAHRLELEKFVIESDCAEAINLLNGGETNKSDMGSWLL
EIDEVRKRLKFVSFKHVLRRRNVVARMAVSTRTSIL
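
As a quick consistency check on the sequences above, the CDS can be translated and compared with the listed protein. The sketch below is illustrative and not part of the database record: the function name `translate` is hypothetical, the standard nuclear genetic code (NCBI translation table 1) is assumed, and only the first 39 and last 21 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nt and last 21 nt of the CDS listed for Moc03g17040.
cds_start = "ATGTCTGCTGAGGGTGATAAGAATGGATCTTACTTCTGG"
cds_end = "ACTAGAACTTCTATTCTCTAA"

print(translate(cds_start))  # MSAEGDKNGSYFW
print(translate(cds_end))    # TRTSIL
```

Both fragments agree with the start (MSAEGDKNGSYFW...) and end (...TRTSIL) of the protein sequence shown above. Passing the full CDS to the same function gives a complete comparison.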