CuGenDBv2

Moc08g30690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g30690
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Integrase catalytic domain-containing protein
Genome location: chr8:21979462..21980151
RNA-Seq Expression: Moc08g30690
Synteny: Moc08g30690
Gene Ontology terms: GO:0016020 - membrane (cellular component)
                     GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR029472 - Retrotransposon Copia-like, N-terminal


Homology
GenBank top hits (e value, % identity, alignment)
GFY98609.1 haloacid dehalogenase-like hydrolase (HAD) superfamily protein [Actinidia rufa] (e value: 3.1e-71; identity: 61.75%)
Query:  IAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTG---DLLHSWIICNNVVISWILNSLSKEISASVLFSD
        I  +A +  ++PYFLHHSD   LVLVS  LT +NY SW+R+M+IAL++KNK+GF+DGSI +P G   +LL+SWI  NNVVISWILNS+SKEISAS++FS 
Subjt:  IAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTG---DLLHSWIICNNVVISWILNSLSKEISASVLFSD

Query:  SAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLL
        SA EIW+DLK+RFQ+ N PRIFQL R+L N VQDQ  VS +FT+LKT+W ELN+YRP+C+CG C+CGGVK++ +  Q +++M FLM L+ SF+Q+R QLL
Subjt:  SAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLL

Query:  LMEPEPTINRAFSLVSQ
        LM+P P IN+ FSL+SQ
Subjt:  LMEPEPTINRAFSLVSQ

KAA8543184.1 hypothetical protein F0562_021321 [Nyssa sinensis] (e value: 1.9e-73; identity: 64.79%)
Query:  AIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTG---DLLHSWIICNNVVISWILNSLSKEISASVLFSDSARE
        AIE+ +NPY+LHHSD+   +LVS  LT ENYT+WSR+MLIAL++KNK+GFVDGSI+ P G   +LL+SWI  NN+VISWILNS+SKEISAS++F+ SARE
Subjt:  AIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTG---DLLHSWIICNNVVISWILNSLSKEISASVLFSDSARE

Query:  IWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLLLMEP
        IWLDL++RFQ++NRPRIFQL R+L NL Q+Q SVS +FT+LKT+W EL++YR +C+CG+CSCGGVK +    Q +++M FLMGL++SFSQ+R QLLLM+P
Subjt:  IWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLLLMEP

Query:  EPTINRAFSLVSQ
         P INR FSL+ Q
Subjt:  EPTINRAFSLVSQ

XP_022152756.1 uncharacterized protein LOC111020399 [Momordica charantia] (e value: 2.7e-115; identity: 93.01%)
Query:  MADDFINPTASTPTVIAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSL
        MADDFIN TAST T+IA IAIEQYTNPYFLHHSDNTSLVLVSDPLT ENYTSWSRSMLIALT+KNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSL
Subjt:  MADDFINPTASTPTVIAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSL

Query:  SKEISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGL
        SKEISAS+LFSDSAREIWLDLKERF+KQNRPRIFQL RDLSNLVQDQLSVSA+FT LKTLWTELNSY PSCT GRCSCGGVKEIIAFQQQ+HVMCFLMGL
Subjt:  SKEISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGL

Query:  NESFSQLRAQLLLMEPEPTINRAFSLVSQ
        NESFSQLR QLLLMEPEPTINR FSLVSQ
Subjt:  NESFSQLRAQLLLMEPEPTINRAFSLVSQ

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia] (e value: 3.4e-94; identity: 77.97%)
Query:  DDFINPTASTPTVIAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSLSK
        DD +NPTA     + PI +EQ+ NPYFLHHSDNTSLVLVSD LT ENYTSWSRS++IALT+KNK+GFVDGSI RPT   LHSWIICNNVVISWI NSLSK
Subjt:  DDFINPTASTPTVIAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSLSK

Query:  EISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNE
        +ISASVLFSDSA EIWLDLKERFQ+QNRPRIFQL R+LSNL QDQLSV+A+FTRLKTLW+EL  YRP+C+CGRCS GGVK I A  QQ++VM FLMGLN 
Subjt:  EISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNE

Query:  SFSQLRAQLLLMEPEPTINRAFSLVSQ
        SFSQ+RAQLLLMEP PTINRAF+LV+Q
Subjt:  SFSQLRAQLLLMEPEPTINRAFSLVSQ

XP_038874906.1 uncharacterized protein LOC120067409 [Benincasa hispida] (e value: 1.8e-71; identity: 62.21%)
Query:  NPTASTPTVIAPIA---IEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSLSKE
        +P+++  T  AP+    ++QY+  YFLHHSD+TSLVLVSD LT  NY+SWS+SM ++ T+KNK+GF+DG++ +P GDL +SWIICN+VV +WI N+LSK+
Subjt:  NPTASTPTVIAPIA---IEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSLSKE

Query:  ISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNES
        I+ASV FSDS REIWLDL++R+Q +  P IFQ  R+LSNLVQDQLSV A+FT+LKT W EL SY+P C+CGRC+CGGVKE++ + Q +HVM FLMGLN S
Subjt:  ISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNES

Query:  FSQLRAQLLLMEPEPTI
        FSQ+R  LLL  PEPTI
Subjt:  FSQLRAQLLLMEPEPTI

TrEMBL top hits (e value, % identity, alignment)
A0A5J5BKC2 Uncharacterized protein (e value: 9.3e-74; identity: 64.79%)
Query:  AIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTG---DLLHSWIICNNVVISWILNSLSKEISASVLFSDSARE
        AIE+ +NPY+LHHSD+   +LVS  LT ENYT+WSR+MLIAL++KNK+GFVDGSI+ P G   +LL+SWI  NN+VISWILNS+SKEISAS++F+ SARE
Subjt:  AIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTG---DLLHSWIICNNVVISWILNSLSKEISASVLFSDSARE

Query:  IWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLLLMEP
        IWLDL++RFQ++NRPRIFQL R+L NL Q+Q SVS +FT+LKT+W EL++YR +C+CG+CSCGGVK +    Q +++M FLMGL++SFSQ+R QLLLM+P
Subjt:  IWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLLLMEP

Query:  EPTINRAFSLVSQ
         P INR FSL+ Q
Subjt:  EPTINRAFSLVSQ

A0A6J1D5E3 uncharacterized protein LOC111017196 (e value: 8.2e-70; identity: 76.3%)
Query:  MLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSLSKEISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTR
        M IALT+KNK G VDGSI RP  + L+SWIICNNVVI+WILNSLSKEISASVLF+DSAREIWLDL+ERFQ+QNRPRIFQL RDLS LVQDQLSVSA+FT+
Subjt:  MLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSLSKEISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTR

Query:  LKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLLLMEPEPTINRAFSLVSQ
        LKTLWTEL +YRP+C+CGRC+CGGVK ++ + Q ++VMCFLMGLN+SFSQ+RA LLLM P PTIN AF L++Q
Subjt:  LKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLLLMEPEPTINRAFSLVSQ

A0A6J1DIP8 uncharacterized protein LOC111020399 (e value: 1.3e-115; identity: 93.01%)
Query:  MADDFINPTASTPTVIAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSL
        MADDFIN TAST T+IA IAIEQYTNPYFLHHSDNTSLVLVSDPLT ENYTSWSRSMLIALT+KNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSL
Subjt:  MADDFINPTASTPTVIAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSL

Query:  SKEISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGL
        SKEISAS+LFSDSAREIWLDLKERF+KQNRPRIFQL RDLSNLVQDQLSVSA+FT LKTLWTELNSY PSCT GRCSCGGVKEIIAFQQQ+HVMCFLMGL
Subjt:  SKEISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGL

Query:  NESFSQLRAQLLLMEPEPTINRAFSLVSQ
        NESFSQLR QLLLMEPEPTINR FSLVSQ
Subjt:  NESFSQLRAQLLLMEPEPTINRAFSLVSQ

A0A6J1DNP7 uncharacterized protein LOC111022065 (e value: 1.6e-94; identity: 77.97%)
Query:  DDFINPTASTPTVIAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSLSK
        DD +NPTA     + PI +EQ+ NPYFLHHSDNTSLVLVSD LT ENYTSWSRS++IALT+KNK+GFVDGSI RPT   LHSWIICNNVVISWI NSLSK
Subjt:  DDFINPTASTPTVIAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSLSK

Query:  EISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNE
        +ISASVLFSDSA EIWLDLKERFQ+QNRPRIFQL R+LSNL QDQLSV+A+FTRLKTLW+EL  YRP+C+CGRCS GGVK I A  QQ++VM FLMGLN 
Subjt:  EISASVLFSDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNE

Query:  SFSQLRAQLLLMEPEPTINRAFSLVSQ
        SFSQ+RAQLLLMEP PTINRAF+LV+Q
Subjt:  SFSQLRAQLLLMEPEPTINRAFSLVSQ

A0A7J0FKC9 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein (e value: 1.5e-71; identity: 61.75%)
Query:  IAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTG---DLLHSWIICNNVVISWILNSLSKEISASVLFSD
        I  +A +  ++PYFLHHSD   LVLVS  LT +NY SW+R+M+IAL++KNK+GF+DGSI +P G   +LL+SWI  NNVVISWILNS+SKEISAS++FS 
Subjt:  IAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTG---DLLHSWIICNNVVISWILNSLSKEISASVLFSD

Query:  SAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLL
        SA EIW+DLK+RFQ+ N PRIFQL R+L N VQDQ  VS +FT+LKT+W ELN+YRP+C+CG C+CGGVK++ +  Q +++M FLM L+ SF+Q+R QLL
Subjt:  SAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLL

Query:  LMEPEPTINRAFSLVSQ
        LM+P P IN+ FSL+SQ
Subjt:  LMEPEPTINRAFSLVSQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 1.1e-26; identity: 31.13%)
Query:  NPYFL----HHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPT--GDLLHSWIICNNVVISWILNSLSKEISASVLFSDSAREIWL
        +PY+L    HH  + S+  +S     +NY +W       L +  K GF+DG++ +P     L   W  CN +V+ W++NS++ ++  SV+++++A ++W 
Subjt:  NPYFL----HHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPT--GDLLHSWIICNNVVISWILNSLSKEISASVLFSDSAREIWL

Query:  DLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYR--PSCTCGRCSCGGVKEIIAFQQQKHVMCFLMG--LNESFSQLRAQLLLME
        DL+  F      +I+QL R L+ L Q   SV  +F +L  +W EL+ Y   P C CG C+C   K     ++++    FLMG  LN+ F  +  +++  +
Subjt:  DLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYR--PSCTCGRCSCGGVKEIIAFQQQKHVMCFLMG--LNESFSQLRAQLLLME

Query:  PEPTINRAFSLV
        P P+++ AF++V
Subjt:  PEPTINRAFSLV


Sequences
CDS sequence
ATGGCGGACGATTTCATCAATCCGACTGCATCCACGCCTACTGTTATCGCTCCGATTGCAATCGAGCAATATACCAACCCTTATTTCTTGCATCACTCTGATAATACAAG
CCTTGTTCTTGTATCTGATCCTTTGACATATGAGAATTATACATCATGGAGTAGATCGATGTTGATTGCTCTCACGATGAAGAACAAAGTTGGTTTTGTTGACGGATCTA
TCGTTCGGCCTACTGGAGATCTTCTTCACTCTTGGATCATCTGCAACAACGTTGTAATTTCTTGGATCTTGAACTCTCTGTCAAAAGAAATCTCTGCAAGCGTTTTATTT
TCTGATTCGGCCCGTGAAATTTGGCTCGATTTGAAAGAGCGTTTTCAGAAACAGAACCGCCCTCGCATCTTTCAATTATGCCGAGATCTATCCAATTTGGTGCAAGATCA
ACTTTCTGTAAGTGCTCATTTCACTCGTTTAAAAACACTATGGACAGAACTCAACTCTTATCGCCCCTCTTGCACTTGTGGTCGATGTTCGTGTGGTGGTGTGAAGGAGA
TTATTGCTTTTCAGCAACAAAAACACGTTATGTGTTTTCTTATGGGGCTGAATGAATCTTTCAGTCAATTGCGGGCACAATTACTCCTTATGGAACCTGAACCAACTATC
AATCGAGCCTTCTCCTTGGTTTCTCAATAG
mRNA sequence
ATGGCGGACGATTTCATCAATCCGACTGCATCCACGCCTACTGTTATCGCTCCGATTGCAATCGAGCAATATACCAACCCTTATTTCTTGCATCACTCTGATAATACAAG
CCTTGTTCTTGTATCTGATCCTTTGACATATGAGAATTATACATCATGGAGTAGATCGATGTTGATTGCTCTCACGATGAAGAACAAAGTTGGTTTTGTTGACGGATCTA
TCGTTCGGCCTACTGGAGATCTTCTTCACTCTTGGATCATCTGCAACAACGTTGTAATTTCTTGGATCTTGAACTCTCTGTCAAAAGAAATCTCTGCAAGCGTTTTATTT
TCTGATTCGGCCCGTGAAATTTGGCTCGATTTGAAAGAGCGTTTTCAGAAACAGAACCGCCCTCGCATCTTTCAATTATGCCGAGATCTATCCAATTTGGTGCAAGATCA
ACTTTCTGTAAGTGCTCATTTCACTCGTTTAAAAACACTATGGACAGAACTCAACTCTTATCGCCCCTCTTGCACTTGTGGTCGATGTTCGTGTGGTGGTGTGAAGGAGA
TTATTGCTTTTCAGCAACAAAAACACGTTATGTGTTTTCTTATGGGGCTGAATGAATCTTTCAGTCAATTGCGGGCACAATTACTCCTTATGGAACCTGAACCAACTATC
AATCGAGCCTTCTCCTTGGTTTCTCAATAG
Protein sequence
MADDFINPTASTPTVIAPIAIEQYTNPYFLHHSDNTSLVLVSDPLTYENYTSWSRSMLIALTMKNKVGFVDGSIVRPTGDLLHSWIICNNVVISWILNSLSKEISASVLF
SDSAREIWLDLKERFQKQNRPRIFQLCRDLSNLVQDQLSVSAHFTRLKTLWTELNSYRPSCTCGRCSCGGVKEIIAFQQQKHVMCFLMGLNESFSQLRAQLLLMEPEPTI
NRAFSLVSQ