; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g08050 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g08050
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr9:6414067..6414957
RNA-Seq ExpressionMoc09g08050
SyntenyMoc09g08050
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0016070 - RNA metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004518 - nuclease activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN74312.1 hypothetical protein VITISV_037520 [Vitis vinifera]3.9e-1831.79Show/hide
Query:  KLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEIIEG--------------
        K+LSWN RGLGS KKR  +++ +S  NP++V+ QETK    D  ++ S+W    ++W  L A G+SGGI+ LW+   F  +E + G              
Subjt:  KLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEIIEG--------------

Query:  ------IYGPSTIETRHCLWKELQDLSFLCEEQWILAG---------TSMSPDGLGKNLM----------LVEPQPLSWVAWTWFDDEAQIPQVC
              +YGP+    R   W ELQDL  L   +W + G           M    L  N+           L++P PL   A+TW     Q+  +C
Subjt:  ------IYGPSTIETRHCLWKELQDLSFLCEEQWILAG---------TSMSPDGLGKNLM----------LVEPQPLSWVAWTWFDDEAQIPQVC

VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis]5.0e-1839.57Show/hide
Query:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDP----------DFTVA-EIIE---
        MK++SWN+RGLGS +KR L+K+ + R+ P++VIL ETK   VD  ++  +W S    W    ++G SGGI  LWN            DF+V+  I+E   
Subjt:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDP----------DFTVA-EIIE---

Query:  ------GIYGPSTIETRHCLWKELQDLSFLCEEQWILAG
              GIYGP     R   W+EL DL   C ++W L G
Subjt:  ------GIYGPSTIETRHCLWKELQDLSFLCEEQWILAG

XP_021820446.1 uncharacterized protein LOC110762145 [Prunus avium]2.3e-1837.41Show/hide
Query:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDP-----DFTVAEI-----------
        MK++SWN+RGLGS +KR ++K+ ++R+ P++VILQETK   +D  ++ S+W S   +W  + + G SGGI+ +WN       D  +AE            
Subjt:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDP-----DFTVAEI-----------

Query:  ----IEGIYGPSTIETRHCLWKELQDLSFLCEEQWILAG
            + GIYGP     R   W EL  L  LC E W + G
Subjt:  ----IEGIYGPSTIETRHCLWKELQDLSFLCEEQWILAG

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]3.2e-2041.61Show/hide
Query:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEII---------------
        M +L+WNVRGLGS  KRA IK +I+ + P++VIL ETK +S++   IKSLWSS  I W++L+A G+SGGI+ LW+    +  E+I               
Subjt:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEII---------------

Query:  -----EGIYGPSTIETRHCLWKELQDLSFLCEEQWIL
              G+Y P   + R   W+EL DL+ LC   W+L
Subjt:  -----EGIYGPSTIETRHCLWKELQDLSFLCEEQWIL

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]1.3e-3759.71Show/hide
Query:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEIIE--------------
        MK L+WNVRGL SWKK ALIKQ ISR+NPN+VILQETKL+ +D  I+KSLWS+HGINWS L+A G + GIL LWNDPD   AE+IE              
Subjt:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEIIE--------------

Query:  ------GIYGPSTIETRHCLWKELQDLSFLCEEQWILAG
              GIYGPST E  +  W+EL DLS LCE  WILAG
Subjt:  ------GIYGPSTIETRHCLWKELQDLSFLCEEQWILAG

TrEMBL top hitse value%identityAlignment
A0A6J1CVN2 uncharacterized protein LOC1110146571.5e-2041.61Show/hide
Query:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEII---------------
        M +L+WNVRGLGS  KRA IK +I+ + P++VIL ETK +S++   IKSLWSS  I W++L+A G+SGGI+ LW+    +  E+I               
Subjt:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEII---------------

Query:  -----EGIYGPSTIETRHCLWKELQDLSFLCEEQWIL
              G+Y P   + R   W+EL DL+ LC   W+L
Subjt:  -----EGIYGPSTIETRHCLWKELQDLSFLCEEQWIL

A0A6J1E2G6 uncharacterized protein LOC1110254056.2e-3859.71Show/hide
Query:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEIIE--------------
        MK L+WNVRGL SWKK ALIKQ ISR+NPN+VILQETKL+ +D  I+KSLWS+HGINWS L+A G + GIL LWNDPD   AE+IE              
Subjt:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEIIE--------------

Query:  ------GIYGPSTIETRHCLWKELQDLSFLCEEQWILAG
              GIYGPST E  +  W+EL DLS LCE  WILAG
Subjt:  ------GIYGPSTIETRHCLWKELQDLSFLCEEQWILAG

A0A6P5T1U8 uncharacterized protein LOC1107621451.1e-1837.41Show/hide
Query:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDP-----DFTVAEI-----------
        MK++SWN+RGLGS +KR ++K+ ++R+ P++VILQETK   +D  ++ S+W S   +W  + + G SGGI+ +WN       D  +AE            
Subjt:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDP-----DFTVAEI-----------

Query:  ----IEGIYGPSTIETRHCLWKELQDLSFLCEEQWILAG
            + GIYGP     R   W EL  L  LC E W + G
Subjt:  ----IEGIYGPSTIETRHCLWKELQDLSFLCEEQWILAG

A0A803P8A0 Uncharacterized protein1.2e-2035.68Show/hide
Query:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWND----------PDFTVAEII-----
        MK+L+WN+RG G   KRA IK +I + NP++VILQE K A+VD   I S+W S    W  L A+G SGG L +W+            +F+++ +I     
Subjt:  MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWND----------PDFTVAEII-----

Query:  -----EGIYGPSTIETRHCLWKELQDLSFLCEEQWILAG-------------TSMSP------DGLGKNLMLVEPQPLSWVAWTW
              G+YGP + + RH  W EL  LS +C E W + G             +S S       DGL + L L++P+ L   ++TW
Subjt:  -----EGIYGPSTIETRHCLWKELQDLSFLCEEQWILAG-------------TSMSP------DGLGKNLMLVEPQPLSWVAWTW

A5B978 Reverse transcriptase domain-containing protein1.9e-1831.79Show/hide
Query:  KLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEIIEG--------------
        K+LSWN RGLGS KKR  +++ +S  NP++V+ QETK    D  ++ S+W    ++W  L A G+SGGI+ LW+   F  +E + G              
Subjt:  KLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEIIEG--------------

Query:  ------IYGPSTIETRHCLWKELQDLSFLCEEQWILAG---------TSMSPDGLGKNLM----------LVEPQPLSWVAWTWFDDEAQIPQVC
              +YGP+    R   W ELQDL  L   +W + G           M    L  N+           L++P PL   A+TW     Q+  +C
Subjt:  ------IYGPSTIETRHCLWKELQDLSFLCEEQWILAG---------TSMSPDGLGKNLM----------LVEPQPLSWVAWTWFDDEAQIPQVC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTTCTCTCTTGGAATGTCAGAGGTTTGGGCTCATGGAAGAAAAGAGCCCTCATAAAGCAGTCCATCTCCCGTATTAATCCAAATCTTGTCATTTTACAAGAGAC
AAAACTCGCTTCTGTCGACCCCTTCATTATCAAGTCCCTTTGGAGCTCTCATGGGATTAACTGGTCCACCCTCAATGCTGTGGGATCTAGTGGAGGGATTCTTTTTCTTT
GGAACGACCCTGACTTCACCGTAGCTGAGATCATTGAAGGCATCTATGGTCCCTCTACCATAGAGACCCGCCATTGTTTATGGAAAGAGCTTCAAGATCTCTCCTTCCTC
TGTGAGGAACAATGGATTTTAGCAGGGACTTCAATGTCTCCAGATGGTCTTGGGAAAAATCTCATGCTAGTGGAGCCACAACCCCTCAGTTGGGTGGCCTGGACATGGTT
TGATGATGAAGCTCAAATCCCTCAAGTTTGCGTTAAAAGTTTGGAATGCAGAACATTTCGGCCGGCTGCAATCACATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCTTCTCTCTTGGAATGTCAGAGGTTTGGGCTCATGGAAGAAAAGAGCCCTCATAAAGCAGTCCATCTCCCGTATTAATCCAAATCTTGTCATTTTACAAGAGAC
AAAACTCGCTTCTGTCGACCCCTTCATTATCAAGTCCCTTTGGAGCTCTCATGGGATTAACTGGTCCACCCTCAATGCTGTGGGATCTAGTGGAGGGATTCTTTTTCTTT
GGAACGACCCTGACTTCACCGTAGCTGAGATCATTGAAGGCATCTATGGTCCCTCTACCATAGAGACCCGCCATTGTTTATGGAAAGAGCTTCAAGATCTCTCCTTCCTC
TGTGAGGAACAATGGATTTTAGCAGGGACTTCAATGTCTCCAGATGGTCTTGGGAAAAATCTCATGCTAGTGGAGCCACAACCCCTCAGTTGGGTGGCCTGGACATGGTT
TGATGATGAAGCTCAAATCCCTCAAGTTTGCGTTAAAAGTTTGGAATGCAGAACATTTCGGCCGGCTGCAATCACATAA
Protein sequenceShow/hide protein sequence
MKLLSWNVRGLGSWKKRALIKQSISRINPNLVILQETKLASVDPFIIKSLWSSHGINWSTLNAVGSSGGILFLWNDPDFTVAEIIEGIYGPSTIETRHCLWKELQDLSFL
CEEQWILAGTSMSPDGLGKNLMLVEPQPLSWVAWTWFDDEAQIPQVCVKSLECRTFRPAAIT