; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g13580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g13580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr4:10489631..10491030
RNA-Seq ExpressionMoc04g13580
SyntenyMoc04g13580
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]6.3e-1750.96Show/hide
Query:  QMKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLAD
        +MKLL+WN RGLGS  K ALIK +I  Y+P+ VIL E  L   +  IIKS W S+ INW   NA+GSSGGIL LW+    ++    E  FSLS NF L +
Subjt:  QMKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLAD

Query:  GFSF
          S+
Subjt:  GFSF

RVX11275.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]3.5e-1542.27Show/hide
Query:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCL
        MK++SWN RGLGS KK  ++K  +S   P++V++QE K    D  ++ S+WS    +W+AL A+G+SGGIL +W+      +E++ G FS+SI F +
Subjt:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCL

TYK11012.1 uncharacterized protein E5676_scaffold874G00540 [Cucumis melo var. makuwa]6.3e-1750.96Show/hide
Query:  QMKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLAD
        +MKLL+WN RGLGS  K ALIK +I  Y+P+ VIL E  L   +  IIKS W S+ INW   NA+GSSGGIL LW+    ++    E  FSLS NF L +
Subjt:  QMKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLAD

Query:  GFSF
          S+
Subjt:  GFSF

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]1.2e-1849.51Show/hide
Query:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLADG
        M +L+WNVRGLGS  K A IK +I+   P++VIL E K +S++   IKSLWSS  I W++L+A+G+SGGI+ LW++   +  E+I G FS+S++F LAD 
Subjt:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLADG

Query:  FSF
        F++
Subjt:  FSF

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]1.6e-3168.93Show/hide
Query:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLADG
        MK L+WNVRGL SWKK ALIKQ IS  NPN+VILQE KL+ +D LI+KSLWS+HGINWSAL+A+G + GIL LWN  D    E+IEG FSL+INFCL+DG
Subjt:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLADG

Query:  FSF
        F F
Subjt:  FSF

TrEMBL top hitse value%identityAlignment
A0A438JQQ0 LINE-1 retrotransposable element ORF2 protein1.7e-1542.27Show/hide
Query:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCL
        MK++SWN RGLGS KK  ++K  +S   P++V++QE K    D  ++ S+WS    +W+AL A+G+SGGIL +W+      +E++ G FS+SI F +
Subjt:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCL

A0A5A7UV84 Reverse transcriptase domain-containing protein3.1e-1750.96Show/hide
Query:  QMKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLAD
        +MKLL+WN RGLGS  K ALIK +I  Y+P+ VIL E  L   +  IIKS W S+ INW   NA+GSSGGIL LW+    ++    E  FSLS NF L +
Subjt:  QMKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLAD

Query:  GFSF
          S+
Subjt:  GFSF

A0A5D3CI86 Reverse transcriptase domain-containing protein3.1e-1750.96Show/hide
Query:  QMKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLAD
        +MKLL+WN RGLGS  K ALIK +I  Y+P+ VIL E  L   +  IIKS W S+ INW   NA+GSSGGIL LW+    ++    E  FSLS NF L +
Subjt:  QMKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLAD

Query:  GFSF
          S+
Subjt:  GFSF

A0A6J1CVN2 uncharacterized protein LOC1110146575.6e-1949.51Show/hide
Query:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLADG
        M +L+WNVRGLGS  K A IK +I+   P++VIL E K +S++   IKSLWSS  I W++L+A+G+SGGI+ LW++   +  E+I G FS+S++F LAD 
Subjt:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLADG

Query:  FSF
        F++
Subjt:  FSF

A0A6J1E2G6 uncharacterized protein LOC1110254057.5e-3268.93Show/hide
Query:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLADG
        MK L+WNVRGL SWKK ALIKQ IS  NPN+VILQE KL+ +D LI+KSLWS+HGINWSAL+A+G + GIL LWN  D    E+IEG FSL+INFCL+DG
Subjt:  MKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSGGILFLWNKSDFAMDEIIEGDFSLSINFCLADG

Query:  FSF
        F F
Subjt:  FSF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGCGGCTTCATGCAGTCTCTAGTTCCTCTTCCAGCCTCCGTGAACTCATGGAACTCAAGATTAAGGTCAAGAGCAACAACATGGGCTTTATCCCGGCAACCGTCGA
GCTCCCTCCATCCCTCACTCATGAAGACATTATTACCGTCCATATTGACCCTTTTTTCATTGTCGAAAACCTGGTGGGTCGCAGACACTATGCTCGTGGTGGACATAGAA
ACCCCCCGAAATCAACAGCCGGAAAAGCACCCGCCGGAAAACCTCCCTCTCGGCGCCAGACCTTCGCCGTCGTGGTTCCCAACCTTGCAGTTACTGCTGAGACTGACACA
TGGGCCCACAAATCTGGAGACCCGACACTGTCGTCTGCGTCTCGACAGACATCCCTGACAAAGTCCAAAGGGAAGGAAAAAGTCATTGACTTTTCTGCCCCTCTTCCCTT
GAGCTCCCCACCTCAGATGAAGCTTCTCTCATGGAATGTTAGAGGGTTGGGCTCATGGAAGAAAATAGCCCTTATAAAACAGTCAATCTCCCACTATAATCCAAATCTTG
TTATTCTACAAGAAAATAAGCTCGCATCTGTTGACCCCCTCATCATCAAGTCCCTTTGGAGCTCACATGGGATCAATTGGTCCGCCCTCAATGCTACGGGTTCTAGCGGA
GGGATCCTTTTTCTTTGGAACAAATCAGACTTCGCTATGGATGAGATCATTGAAGGTGATTTCTCCCTCTCCATTAATTTTTGTCTTGCTGATGGCTTCTCTTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGCGGCTTCATGCAGTCTCTAGTTCCTCTTCCAGCCTCCGTGAACTCATGGAACTCAAGATTAAGGTCAAGAGCAACAACATGGGCTTTATCCCGGCAACCGTCGA
GCTCCCTCCATCCCTCACTCATGAAGACATTATTACCGTCCATATTGACCCTTTTTTCATTGTCGAAAACCTGGTGGGTCGCAGACACTATGCTCGTGGTGGACATAGAA
ACCCCCCGAAATCAACAGCCGGAAAAGCACCCGCCGGAAAACCTCCCTCTCGGCGCCAGACCTTCGCCGTCGTGGTTCCCAACCTTGCAGTTACTGCTGAGACTGACACA
TGGGCCCACAAATCTGGAGACCCGACACTGTCGTCTGCGTCTCGACAGACATCCCTGACAAAGTCCAAAGGGAAGGAAAAAGTCATTGACTTTTCTGCCCCTCTTCCCTT
GAGCTCCCCACCTCAGATGAAGCTTCTCTCATGGAATGTTAGAGGGTTGGGCTCATGGAAGAAAATAGCCCTTATAAAACAGTCAATCTCCCACTATAATCCAAATCTTG
TTATTCTACAAGAAAATAAGCTCGCATCTGTTGACCCCCTCATCATCAAGTCCCTTTGGAGCTCACATGGGATCAATTGGTCCGCCCTCAATGCTACGGGTTCTAGCGGA
GGGATCCTTTTTCTTTGGAACAAATCAGACTTCGCTATGGATGAGATCATTGAAGGTGATTTCTCCCTCTCCATTAATTTTTGTCTTGCTGATGGCTTCTCTTTCTAG
Protein sequenceShow/hide protein sequence
MWRLHAVSSSSSSLRELMELKIKVKSNNMGFIPATVELPPSLTHEDIITVHIDPFFIVENLVGRRHYARGGHRNPPKSTAGKAPAGKPPSRRQTFAVVVPNLAVTAETDT
WAHKSGDPTLSSASRQTSLTKSKGKEKVIDFSAPLPLSSPPQMKLLSWNVRGLGSWKKIALIKQSISHYNPNLVILQENKLASVDPLIIKSLWSSHGINWSALNATGSSG
GILFLWNKSDFAMDEIIEGDFSLSINFCLADGFSF