; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g07690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g07690
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:5350754..5360557
RNA-Seq ExpressionMoc03g07690
SyntenyMoc03g07690
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:1901576 - organic substance biosynthetic process (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]1.5e-0861.82Show/hide
Query:  KSKSDVKS----FLGYGGTKIDYIFWDDQNKKISRSKNMIFNEGVLYKDKTKVDS
        +SK D KS    F+GYG T+  Y FWDDQN+KI RSKN++FNEG LYKDK K  S
Subjt:  KSKSDVKS----FLGYGGTKIDYIFWDDQNKKISRSKNMIFNEGVLYKDKTKVDS

XP_022154000.1 uncharacterized protein LOC111021367 isoform X1 [Momordica charantia]6.7e-0935.97Show/hide
Query:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTTMCMQHRKLSLLSEEDDIE----ERKVLYVTRKGLYDGK-KNIDQL--
        MKK +D+ TK+F LARGLGNKYKD++ +ML++ P PTYNQFV  L+ H+           L L+SE D+ +    ++ + + T++G   G+ +N +Q   
Subjt:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTTMCMQHRKLSLLSEEDDIE----ERKVLYVTRKGLYDGK-KNIDQL--

Query:  -TFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTR
         +F G  S N+ P    + +      ESTQ + +N + +
Subjt:  -TFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTR

XP_022154001.1 uncharacterized protein LOC111021367 isoform X2 [Momordica charantia]6.7e-0935.97Show/hide
Query:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTTMCMQHRKLSLLSEEDDIE----ERKVLYVTRKGLYDGK-KNIDQL--
        MKK +D+ TK+F LARGLGNKYKD++ +ML++ P PTYNQFV  L+ H+           L L+SE D+ +    ++ + + T++G   G+ +N +Q   
Subjt:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTTMCMQHRKLSLLSEEDDIE----ERKVLYVTRKGLYDGK-KNIDQL--

Query:  -TFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTR
         +F G  S N+ P    + +      ESTQ + +N + +
Subjt:  -TFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTR

XP_022154021.1 uncharacterized protein LOC111021379 [Momordica charantia]4.2e-1169.81Show/hide
Query:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTT
        MKK +DD+TKVF LARGLG KYKD++T+ML+K P P+YNQFVL LK HDQ  T
Subjt:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTT

XP_022156301.1 uncharacterized protein LOC111023227 [Momordica charantia]7.6e-2980.22Show/hide
Query:  LTFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTRSSGTTKRKGRCNTRGLNADKHVQNHGLIEINIEEEDGKLVCSHSSKLVSQIG
        +T   MSSSNLGPGN++SYTSNVDRIE+TQ  EA+QTT+SSGTTKRKGR NT GLN DKHVQ+HGLIEINIEEED K VCSHS KLVSQIG
Subjt:  LTFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTRSSGTTKRKGRCNTRGLNADKHVQNHGLIEINIEEEDGKLVCSHSSKLVSQIG

TrEMBL top hitse value%identityAlignment
A0A438IVY1 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-0858.62Show/hide
Query:  KSKSDVKS----FLGYGGTKIDYIFWDDQNKKISRSKNMIFNEGVLYKDKTKVDSRST
        +SK DVKS    F+GYG  K  Y FWD+QNKKI RS+N+IFNE V+YKD++ V S  T
Subjt:  KSKSDVKS----FLGYGGTKIDYIFWDDQNKKISRSKNMIFNEGVLYKDKTKVDSRST

A0A6J1DIE6 uncharacterized protein LOC111021367 isoform X13.2e-0935.97Show/hide
Query:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTTMCMQHRKLSLLSEEDDIE----ERKVLYVTRKGLYDGK-KNIDQL--
        MKK +D+ TK+F LARGLGNKYKD++ +ML++ P PTYNQFV  L+ H+           L L+SE D+ +    ++ + + T++G   G+ +N +Q   
Subjt:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTTMCMQHRKLSLLSEEDDIE----ERKVLYVTRKGLYDGK-KNIDQL--

Query:  -TFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTR
         +F G  S N+ P    + +      ESTQ + +N + +
Subjt:  -TFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTR

A0A6J1DME2 uncharacterized protein LOC111021367 isoform X23.2e-0935.97Show/hide
Query:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTTMCMQHRKLSLLSEEDDIE----ERKVLYVTRKGLYDGK-KNIDQL--
        MKK +D+ TK+F LARGLGNKYKD++ +ML++ P PTYNQFV  L+ H+           L L+SE D+ +    ++ + + T++G   G+ +N +Q   
Subjt:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTTMCMQHRKLSLLSEEDDIE----ERKVLYVTRKGLYDGK-KNIDQL--

Query:  -TFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTR
         +F G  S N+ P    + +      ESTQ + +N + +
Subjt:  -TFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTR

A0A6J1DMG5 uncharacterized protein LOC1110213792.0e-1169.81Show/hide
Query:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTT
        MKK +DD+TKVF LARGLG KYKD++T+ML+K P P+YNQFVL LK HDQ  T
Subjt:  MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTT

A0A6J1DUJ0 uncharacterized protein LOC1110232273.7e-2980.22Show/hide
Query:  LTFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTRSSGTTKRKGRCNTRGLNADKHVQNHGLIEINIEEEDGKLVCSHSSKLVSQIG
        +T   MSSSNLGPGN++SYTSNVDRIE+TQ  EA+QTT+SSGTTKRKGR NT GLN DKHVQ+HGLIEINIEEED K VCSHS KLVSQIG
Subjt:  LTFLGMSSSNLGPGNQRSYTSNVDRIESTQLEEANQTTRSSGTTKRKGRCNTRGLNADKHVQNHGLIEINIEEEDGKLVCSHSSKLVSQIG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAGCTTATTGATGATGTTACTAAAGTGTTTCCTTTAGCTAGAGGTTTAGGCAACAAATACAAAGATTATAAGACTTCTATGCTTGCAAAACCACCTTGC
CCCACATATAATCAGTTTGTTCTTCGGTTGAAAGTACATGATCAGCACACTACAATGTGTATGCAGCACCGTAAGCTTTCCTTGCTCAGTGAGGAAGACGATATC
GAGGAGAGGAAAGTTCTTTACGTTACGAGGAAGGGGCTTTATGATGGGAAGAAAAATATAGATCAACTGACCTTCCTTGGGATGTCATCTTCAAATCTTGGACCA
GGAAACCAACGGAGTTACACAAGCAATGTCGATAGAATAGAAAGCACTCAATTGGAAGAGGCTAATCAAACAACACGGAGTTCTGGCACGACTAAAAGAAAAGGT
AGATGCAACACTAGAGGGCTGAATGCTGATAAACATGTCCAAAATCATGGTCTGATAGAAATTAATATCGAAGAAGAGGATGGCAAGCTAGTTTGTAGCCATAGT
TCGAAATTGGTCTCTCAAATTGGGGAACAAATGGTTGAACTGAAAAATGCACCAGTAGAAGAAGGTGCAAAACCTCCTTTAGCTCGACAAATCAGTGTTCTAGTA
CTAGGCAGGCGAGACAAGAGTAAATCTGATGTCAAATCCTTTCTTGGGTATGGTGGAACTAAGATAGACTACATATTTTGGGATGACCAGAACAAGAAAATTAGC
AGAAGCAAGAACATGATCTTCAATGAAGGAGTTCTTTACAAAGATAAAACTAAAGTAGATTCAAGAAGTACATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAAAGCTTATTGATGATGTTACTAAAGTGTTTCCTTTAGCTAGAGGTTTAGGCAACAAATACAAAGATTATAAGACTTCTATGCTTGCAAAACCACCTTGC
CCCACATATAATCAGTTTGTTCTTCGGTTGAAAGTACATGATCAGCACACTACAATGTGTATGCAGCACCGTAAGCTTTCCTTGCTCAGTGAGGAAGACGATATC
GAGGAGAGGAAAGTTCTTTACGTTACGAGGAAGGGGCTTTATGATGGGAAGAAAAATATAGATCAACTGACCTTCCTTGGGATGTCATCTTCAAATCTTGGACCA
GGAAACCAACGGAGTTACACAAGCAATGTCGATAGAATAGAAAGCACTCAATTGGAAGAGGCTAATCAAACAACACGGAGTTCTGGCACGACTAAAAGAAAAGGT
AGATGCAACACTAGAGGGCTGAATGCTGATAAACATGTCCAAAATCATGGTCTGATAGAAATTAATATCGAAGAAGAGGATGGCAAGCTAGTTTGTAGCCATAGT
TCGAAATTGGTCTCTCAAATTGGGGAACAAATGGTTGAACTGAAAAATGCACCAGTAGAAGAAGGTGCAAAACCTCCTTTAGCTCGACAAATCAGTGTTCTAGTA
CTAGGCAGGCGAGACAAGAGTAAATCTGATGTCAAATCCTTTCTTGGGTATGGTGGAACTAAGATAGACTACATATTTTGGGATGACCAGAACAAGAAAATTAGC
AGAAGCAAGAACATGATCTTCAATGAAGGAGTTCTTTACAAAGATAAAACTAAAGTAGATTCAAGAAGTACATAG
Protein sequenceShow/hide protein sequence
MKKLIDDVTKVFPLARGLGNKYKDYKTSMLAKPPCPTYNQFVLRLKVHDQHTTMCMQHRKLSLLSEEDDIEERKVLYVTRKGLYDGKKNIDQLTFLGMSSSNLGP
GNQRSYTSNVDRIESTQLEEANQTTRSSGTTKRKGRCNTRGLNADKHVQNHGLIEINIEEEDGKLVCSHSSKLVSQIGEQMVELKNAPVEEGAKPPLARQISVLV
LGRRDKSKSDVKSFLGYGGTKIDYIFWDDQNKKISRSKNMIFNEGVLYKDKTKVDSRST