CuGenDBv2

Moc04g19030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g19030
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon 17.6
Genome location: chr4:13931292..13933801
RNA-Seq Expression: Moc04g19030
Synteny: Moc04g19030
Gene Ontology terms: NA
InterPro domains: NA
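
The genome location can be converted into a locus length. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr4:13931292..13933801")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, span)
```

Under that assumption the locus spans 2510 bp on chr4.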


Homology
GenBank top hits (e value, % identity, alignment)
KAA3458760.1 Integrase, catalytic core [Gossypium australe] (e value: 4.0e-07; % identity: 33.57)
Query:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSEELQIEELLNQLTEELCIVFEAED-----------NEAKLSQSVESRVQPEGILER
        RLGEFE+ ALT+ C+ +L  KL  K+ DLGSFTIL S      +EE  +   +    E     F  ++           NEA +++ +   ++ + + +R
Subjt:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSEELQIEELLNQLTEELCIVFEAED-----------NEAKLSQSVESRVQPEGILER

Query:  ---AFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG
           +F+ L+L     +P +PS+E+   LELK+LP HLK  + G
Subjt:  ---AFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG

XP_012453456.1 PREDICTED: uncharacterized protein LOC105775492 [Gossypium raimondii] (e value: 1.5e-06; % identity: 31.22)
Query:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSEEL-QIEELLNQL--------------------------------TEELC------
        RLGEFETVALT+EC A+L  KL  K+ + GSFTIL SIE+  +S+ L  + E +N +                                T+E C      
Subjt:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSEEL-QIEELLNQL--------------------------------TEELC------

Query:  ----------IVFEAEDNEAKLSQSVESRVQPE--GILE---------RAFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG
                    +   DNE    +  E  V  E  G++E         R+F+SL   +   +P +PS++    LELK LP+HLK ++ G
Subjt:  ----------IVFEAEDNEAKLSQSVESRVQPE--GILE---------RAFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG

XP_022147186.1 uncharacterized protein LOC111016198 [Momordica charantia] (e value: 2.2e-13; % identity: 50.53)
Query:  LVSIEDDLLSEELQIEELLNQLTEELCIVFEAEDNEAKLSQSVESRVQPEGILERAFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG
        L+ + DDL SEE+Q EELL+QL EE+  +FE ++ EAKL Q   +    + + ++AFES EL+D +Q  L+ SVEKA KLELK LP HLK  + G
Subjt:  LVSIEDDLLSEELQIEELLNQLTEELCIVFEAEDNEAKLSQSVESRVQPEGILERAFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG

XP_022152112.1 uncharacterized protein LOC111019902 [Momordica charantia] (e value: 1.9e-09; % identity: 40.98)
Query:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIED--------------------------------------------------DLLSEELQIE
        RLGEFETVA TKECSAIL GKL QKMGD GSFTI +SI                                                     LL++E+Q+E
Subjt:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIED--------------------------------------------------DLLSEELQIE

Query:  ELLNQLTEELCIVFEAEDNEAK
        ELL+QL +EL I+FE E+ EA+
Subjt:  ELLNQLTEELCIVFEAEDNEAK

XP_022157452.1 uncharacterized protein LOC111024147 [Momordica charantia] (e value: 3.6e-08; % identity: 36.46)
Query:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSE---ELQIEELLNQLT-----------------EELCIV----FEAEDN-------
        RLGEFETV LTKEC  ILT K+ QKM D GSFTI VSI    + +   +L     L  LT                 EE+ I+    F A+ N       
Subjt:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSE---ELQIEELLNQLT-----------------EELCIV----FEAEDN-------

Query:  ------------EAKLSQ---------SVESRVQPEGILERAFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG
                    EA L +         S++   +      R  E L+LE+ +Q+ LKPSVE+  KLELK LP HLK  + G
Subjt:  ------------EAKLSQ---------SVESRVQPEGILERAFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG

TrEMBL top hits (e value, % identity, alignment)
A0A5B6UKZ2 Integrase, catalytic core (e value: 1.9e-07; % identity: 33.57)
Query:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSEELQIEELLNQLTEELCIVFEAED-----------NEAKLSQSVESRVQPEGILER
        RLGEFE+ ALT+ C+ +L  KL  K+ DLGSFTIL S      +EE  +   +    E     F  ++           NEA +++ +   ++ + + +R
Subjt:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSEELQIEELLNQLTEELCIVFEAED-----------NEAKLSQSVESRVQPEGILER

Query:  ---AFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG
           +F+ L+L     +P +PS+E+   LELK+LP HLK  + G
Subjt:  ---AFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG

A0A6J1D1L0 uncharacterized protein LOC111016198 (e value: 1.0e-13; % identity: 50.53)
Query:  LVSIEDDLLSEELQIEELLNQLTEELCIVFEAEDNEAKLSQSVESRVQPEGILERAFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG
        L+ + DDL SEE+Q EELL+QL EE+  +FE ++ EAKL Q   +    + + ++AFES EL+D +Q  L+ SVEKA KLELK LP HLK  + G
Subjt:  LVSIEDDLLSEELQIEELLNQLTEELCIVFEAEDNEAKLSQSVESRVQPEGILERAFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG

A0A6J1DFB0 uncharacterized protein LOC111019902 (e value: 9.2e-10; % identity: 40.98)
Query:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIED--------------------------------------------------DLLSEELQIE
        RLGEFETVA TKECSAIL GKL QKMGD GSFTI +SI                                                     LL++E+Q+E
Subjt:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIED--------------------------------------------------DLLSEELQIE

Query:  ELLNQLTEELCIVFEAEDNEAK
        ELL+QL +EL I+FE E+ EA+
Subjt:  ELLNQLTEELCIVFEAEDNEAK

A0A6J1DUI2 uncharacterized protein LOC111024147 (e value: 1.7e-08; % identity: 36.46)
Query:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSE---ELQIEELLNQLT-----------------EELCIV----FEAEDN-------
        RLGEFETV LTKEC  ILT K+ QKM D GSFTI VSI    + +   +L     L  LT                 EE+ I+    F A+ N       
Subjt:  RLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSE---ELQIEELLNQLT-----------------EELCIV----FEAEDN-------

Query:  ------------EAKLSQ---------SVESRVQPEGILERAFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG
                    EA L +         S++   +      R  E L+LE+ +Q+ LKPSVE+  KLELK LP HLK  + G
Subjt:  ------------EAKLSQ---------SVESRVQPEGILERAFESLELEDSDQEPLKPSVEKAHKLELKTLPIHLKRIFFG

A0A6J1DUI3 uncharacterized protein LOC111023221 (e value: 2.8e-06; % identity: 86.49)
Query:  LGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSI
        LGEF+TVALTKECSAILT KL QKMGD GSFTI VSI
Subjt:  LGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSI
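
The % identity reported for a hit can be recomputed from the middle line of its alignment. The sketch below assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a similar residue, a space a mismatch or gap) and that identity is taken over the full aligned length; `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midline):
    """Percent identity from a BLAST-style alignment midline.

    Letters mark identical residues; '+' marks a similar residue and a
    space marks a mismatch or gap, so only letters are counted.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# Midline of the A0A6J1DUI3 alignment, copied without its leading indent.
midline = "LGEF+TVALTKECSAILT KL QKMGD GSFTI VSI"
print(f"{percent_identity(midline):.2f}")
```

For this hit the midline has 32 identical positions out of 37 aligned columns, which gives the listed 86.49.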

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGATTAGGGGAATTTGAAACAGTGGCCCTTACCAAAGAGTGTAGTGCCATTTTGACAGGAAAACTTGCTCAAAAGATGGGAGACCTAGGAAGCTTCACCATTCTTGT
ATCCATAGAAGATGATTTGTTGAGTGAAGAATTACAAATAGAGGAGCTCCTAAACCAGTTAACAGAAGAGCTATGCATAGTTTTTGAAGCTGAAGATAACGAAGCCAAGT
TGAGTCAATCTGTTGAGAGTAGAGTACAACCAGAGGGAATATTAGAAAGGGCATTTGAATCATTGGAGTTAGAGGACAGTGATCAAGAGCCGTTGAAGCCATCTGTGGAA
AAAGCTCATAAGTTGGAACTGAAAACTTTGCCAATCCACTTGAAGAGAATCTTCTTTGGATGA
mRNA sequence
ATGAGATTAGGGGAATTTGAAACAGTGGCCCTTACCAAAGAGTGTAGTGCCATTTTGACAGGAAAACTTGCTCAAAAGATGGGAGACCTAGGAAGCTTCACCATTCTTGT
ATCCATAGAAGATGATTTGTTGAGTGAAGAATTACAAATAGAGGAGCTCCTAAACCAGTTAACAGAAGAGCTATGCATAGTTTTTGAAGCTGAAGATAACGAAGCCAAGT
TGAGTCAATCTGTTGAGAGTAGAGTACAACCAGAGGGAATATTAGAAAGGGCATTTGAATCATTGGAGTTAGAGGACAGTGATCAAGAGCCGTTGAAGCCATCTGTGGAA
AAAGCTCATAAGTTGGAACTGAAAACTTTGCCAATCCACTTGAAGAGAATCTTCTTTGGATGA
Protein sequence
MRLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSEELQIEELLNQLTEELCIVFEAEDNEAKLSQSVESRVQPEGILERAFESLELEDSDQEPLKPSVE
KAHKLELKTLPIHLKRIFFG
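
The CDS and protein sequences listed here can be checked against each other by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code; `translate`, `CDS`, and `PROTEIN` are illustrative names, and the sequences are copied from this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

CDS = (
    "ATGAGATTAGGGGAATTTGAAACAGTGGCCCTTACCAAAGAGTGTAGTGCCATTTTGACAGGAAAACTTGCTCAAAAGATGGGAGACCTAGGAAGCTTCACCATTCTTGT"
    "ATCCATAGAAGATGATTTGTTGAGTGAAGAATTACAAATAGAGGAGCTCCTAAACCAGTTAACAGAAGAGCTATGCATAGTTTTTGAAGCTGAAGATAACGAAGCCAAGT"
    "TGAGTCAATCTGTTGAGAGTAGAGTACAACCAGAGGGAATATTAGAAAGGGCATTTGAATCATTGGAGTTAGAGGACAGTGATCAAGAGCCGTTGAAGCCATCTGTGGAA"
    "AAAGCTCATAAGTTGGAACTGAAAACTTTGCCAATCCACTTGAAGAGAATCTTCTTTGGATGA"
)
PROTEIN = (
    "MRLGEFETVALTKECSAILTGKLAQKMGDLGSFTILVSIEDDLLSEELQIEELLNQLTEELCIVFEAEDNEAKLSQSVESRVQPEGILERAFESLELEDSDQEPLKPSVE"
    "KAHKLELKTLPIHLKRIFFG"
)

# Report the CDS length and whether its translation matches the listed protein.
print(len(CDS), translate(CDS) == PROTEIN)
```

The same function can be applied to the mRNA sequence, which in this record is identical to the CDS.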