; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g09430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g09430
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon 17.6
Genome locationchr9:7868563..7868973
RNA-Seq ExpressionMoc09g09430
SyntenyMoc09g09430
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031377418.1 uncharacterized protein LOC116192868 [Punica granatum]3.4e-4161.83Show/hide
Query:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD
        MVREGIVL H++S K IEV+RAK+++I KL  PT  KGVRSFLGHA FY RFIK+F++IS+PLC LL  D    FN+NCL+AF +L E L+  P+I+ P+
Subjt:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD

Query:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV
        W LPFELMCDA+++AVGA+LGQ++GK+FH +
Subjt:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV

XP_031379021.1 uncharacterized protein LOC116194359 [Punica granatum]3.4e-4161.83Show/hide
Query:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD
        MVREGIVL H++S K IEV+RAK+++I KL  PT  KGVRSFLGHA FY RFIK+F++IS+PLC LL  D    FN+NCL+AF +L E L+  P+I+ P+
Subjt:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD

Query:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV
        W LPFELMCDA+++AVGA+LGQ++GK+FH +
Subjt:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV

XP_031382022.1 uncharacterized protein LOC116196442 [Punica granatum]3.4e-4161.83Show/hide
Query:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD
        MVREGIVL H++S K IEV+RAK+++I KL  PT  KGVRSFLGHA FY RFIK+F++IS+PLC LL  D    FN+NCL+AF +L E L+  P+I+ P+
Subjt:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD

Query:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV
        W LPFELMCDA+++AVGA+LGQ++GK+FH +
Subjt:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV

XP_031384834.1 uncharacterized protein LOC116198747 [Punica granatum]3.4e-4161.83Show/hide
Query:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD
        MVREGIVL H++S K IEV+RAK+++I KL  PT  KGVRSFLGHA FY RFIK+F++IS+PLC LL  D    FN+NCL+AF +L E L+  P+I+ P+
Subjt:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD

Query:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV
        W LPFELMCDA+++AVGA+LGQ++GK+FH +
Subjt:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV

XP_031390124.1 uncharacterized protein LOC116202669 [Punica granatum]3.4e-4161.83Show/hide
Query:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD
        MVREGIVL H++S K IEV+RAK+++I KL  PT  KGVRSFLGHA FY RFIK+F++IS+PLC LL  D    FN+NCL+AF +L E L+  P+I+ P+
Subjt:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD

Query:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV
        W LPFELMCDA+++AVGA+LGQ++GK+FH +
Subjt:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV

TrEMBL top hitse value%identityAlignment
A0A6P8CBX2 Reverse transcriptase1.6e-4161.83Show/hide
Query:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD
        MVREGIVL H++S K IEV+RAK+++I KL  PT  KGVRSFLGHA FY RFIK+F++IS+PLC LL  D    FN+NCL+AF +L E L+  P+I+ P+
Subjt:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD

Query:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV
        W LPFELMCDA+++AVGA+LGQ++GK+FH +
Subjt:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV

A0A6P8CDK6 Reverse transcriptase1.6e-4161.83Show/hide
Query:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD
        MVREGIVL H++S K IEV+RAK+++I KL  PT  KGVRSFLGHA FY RFIK+F++IS+PLC LL  D    FN+NCL+AF +L E L+  P+I+ P+
Subjt:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD

Query:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV
        W LPFELMCDA+++AVGA+LGQ++GK+FH +
Subjt:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV

A0A6P8CP09 uncharacterized protein LOC1161928681.6e-4161.83Show/hide
Query:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD
        MVREGIVL H++S K IEV+RAK+++I KL  PT  KGVRSFLGHA FY RFIK+F++IS+PLC LL  D    FN+NCL+AF +L E L+  P+I+ P+
Subjt:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD

Query:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV
        W LPFELMCDA+++AVGA+LGQ++GK+FH +
Subjt:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV

A0A6P8CRA8 uncharacterized protein LOC1161987471.6e-4161.83Show/hide
Query:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD
        MVREGIVL H++S K IEV+RAK+++I KL  PT  KGVRSFLGHA FY RFIK+F++IS+PLC LL  D    FN+NCL+AF +L E L+  P+I+ P+
Subjt:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD

Query:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV
        W LPFELMCDA+++AVGA+LGQ++GK+FH +
Subjt:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV

A0A6P8D6K9 uncharacterized protein LOC1162026691.6e-4161.83Show/hide
Query:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD
        MVREGIVL H++S K IEV+RAK+++I KL  PT  KGVRSFLGHA FY RFIK+F++IS+PLC LL  D    FN+NCL+AF +L E L+  P+I+ P+
Subjt:  MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPD

Query:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV
        W LPFELMCDA+++AVGA+LGQ++GK+FH +
Subjt:  WNLPFELMCDANNFAVGAMLGQQKGKIFHHV

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.9e-1335.54Show/hide
Query:  REGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDC-LSNFNENCLKAFEILNEALSLVPIIIEPDW
        +E   L H ++   I+ N  KI+ I K  +PT  K +++FLG   +Y +FI NFA I+KP+ + L  +  +   N     AF+ L   +S  PI+  PD+
Subjt:  REGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDC-LSNFNENCLKAFEILNEALSLVPIIIEPDW

Query:  NLPFELMCDANNFAVGAMLGQ
           F L  DA++ A+GA+L Q
Subjt:  NLPFELMCDANNFAVGAMLGQ

P10401 Retrovirus-related Pol polyprotein from transposon gypsy9.3e-1036.28Show/hide
Query:  KIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLS-----------NFNENCLKAFEILNEALSLVPIIIE-PDWNLPFELMCD
        K+  I +   P  V  VRSFLG AS+Y  FIK+FA I++P+  +L  +  S            FNE    AF+ L   L+   +I++ PD+  PF+L  D
Subjt:  KIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLS-----------NFNENCLKAFEILNEALSLVPIIIE-PDWNLPFELMCD

Query:  ANNFAVGAMLGQQ
        A+   +GA+L Q+
Subjt:  ANNFAVGAMLGQQ

P20825 Retrovirus-related Pol polyprotein from transposon 2973.4e-1234.71Show/hide
Query:  REGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDC-LSNFNENCLKAFEILNEALSLVPIIIEPDW
        +E   L H ++   I+ N  K+  IV   +PT  K +R+FLG   +Y +FI N+A I+KP+   L     +       ++AFE L   +   PI+  PD+
Subjt:  REGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDC-LSNFNENCLKAFEILNEALSLVPIIIEPDW

Query:  NLPFELMCDANNFAVGAMLGQ
           F L  DA+N A+GA+L Q
Subjt:  NLPFELMCDANNFAVGAMLGQ

P92523 Uncharacterized mitochondrial protein AtMg008604.2e-1035.42Show/hide
Query:  HQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPDWNLPF
        H ISG+ +  + AK++ +V    P     +R FLG   +Y RF+KN+ +I +PL +LL  + L  + E    AF+ L  A++ +P++  PD  LPF
Subjt:  HQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPDWNLPF

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.9e-1030.95Show/hide
Query:  LDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQL-----LNVDCLSN------FNENCLKAFEILNEALSLVPII
        L + ++   I+ +  K+  I ++  PT VK ++ FLG  S+Y +FI+++A+++KPL  L      N+    +       +E  L++F  L   L    I+
Subjt:  LDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQL-----LNVDCLSN------FNENCLKAFEILNEALSLVPII

Query:  IEPDWNLPFELMCDANNFAVGAMLGQ
          P +  PF L  DA+N+A+GA+L Q
Subjt:  IEPDWNLPFELMCDANNFAVGAMLGQ

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.0e-1135.42Show/hide
Query:  HQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPDWNLPF
        H ISG+ +  + AK++ +V    P     +R FLG   +Y RF+KN+ +I +PL +LL  + L  + E    AF+ L  A++ +P++  PD  LPF
Subjt:  HQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPDWNLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTACGCGAAGGAATAGTTTTGGATCATCAAATTTCGGGAAAAAGAATTGAAGTCAATAGGGCAAAAATAGATTTGATTGTCAAACTATCGCTACCCACTTAT
GTAAAAGGTGTTAGAAGTTTTCTAGGACACGCGAGTTTTTACCCTCGTTTTATTAAAAATTTTGCACAAATTTCTAAACCCTTATGTCAATTATTAAATGTGGAT
TGTTTATCTAATTTTAATGAAAATTGTTTAAAAGCTTTTGAGATACTGAATGAAGCACTTAGTTTAGTGCCCATTATAATTGAACCTGACTGGAACTTGCCATTT
GAACTTATGTGTGATGCCAATAATTTTGCAGTGGGTGCAATGTTAGGGCAACAAAAAGGTAAAATTTTTCATCATGTTTCAACGGTAGTTAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTACGCGAAGGAATAGTTTTGGATCATCAAATTTCGGGAAAAAGAATTGAAGTCAATAGGGCAAAAATAGATTTGATTGTCAAACTATCGCTACCCACTTAT
GTAAAAGGTGTTAGAAGTTTTCTAGGACACGCGAGTTTTTACCCTCGTTTTATTAAAAATTTTGCACAAATTTCTAAACCCTTATGTCAATTATTAAATGTGGAT
TGTTTATCTAATTTTAATGAAAATTGTTTAAAAGCTTTTGAGATACTGAATGAAGCACTTAGTTTAGTGCCCATTATAATTGAACCTGACTGGAACTTGCCATTT
GAACTTATGTGTGATGCCAATAATTTTGCAGTGGGTGCAATGTTAGGGCAACAAAAAGGTAAAATTTTTCATCATGTTTCAACGGTAGTTAGTTGA
Protein sequenceShow/hide protein sequence
MVREGIVLDHQISGKRIEVNRAKIDLIVKLSLPTYVKGVRSFLGHASFYPRFIKNFAQISKPLCQLLNVDCLSNFNENCLKAFEILNEALSLVPIIIEPDWNLPF
ELMCDANNFAVGAMLGQQKGKIFHHVSTVVS