; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS007047 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS007047
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationscaffold430:106968..107348
RNA-Seq ExpressionMS007047
SyntenyMS007047
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa]1.4e-2043.65Show/hide
Query:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS
        SSKD+GW  + S G+ GGIL++WD S+I V E L+G  S+SI    S   +  ITNVYGP DY + + +W  L  +SG+    WC+GG  N++RW  +  
Subjt:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS

Query:  SGGRITRSMRKFNCIIGELDLTEVPL
           + TR MR+FN  I  L++ E+PL
Subjt:  SGGRITRSMRKFNCIIGELDLTEVPL

XP_019074274.1 PREDICTED: uncharacterized protein LOC109122247 [Vitis vinifera]2.0e-1939.68Show/hide
Query:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS
        ++++  WA L + G SGGIL +WD  +++  EV+ G+ S+SI  +L+   ++ ++ VYGP +    K LW EL DI+G +   WC+GGDFNV R  S++ 
Subjt:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS

Query:  SGGRITRSMRKFNCIIGELDLTEVPL
         G R+T SM+ F+  I + +L ++PL
Subjt:  SGGRITRSMRKFNCIIGELDLTEVPL

XP_022154822.1 uncharacterized protein LOC111021983 [Momordica charantia]8.5e-3159.06Show/hide
Query:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS
        SSKDVGWACLNS+                                                DYR+ KRLWSELRDISGFSEKFWCLGGDFNVSRWPSD+S
Subjt:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS

Query:  SGGRITRSMRKFNCIIGELDLTEVPLS
        SGGRITRSMRKFNCIIGELDLTEVPLS
Subjt:  SGGRITRSMRKFNCIIGELDLTEVPLS

XP_031739979.1 uncharacterized protein LOC116403332 [Cucumis sativus]1.0e-2045.6Show/hide
Query:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS
        SS  VGWA L + G SGGIL LW E  I V + ++G  SISI    +   +  IT VYGP+ YR   + W EL  + G   + WC+GGDFNV RW +++S
Subjt:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS

Query:  SGGRITRSMRKFNCIIGELDLTEVP
        S    TRSM +F  +I ELD+ ++P
Subjt:  SGGRITRSMRKFNCIIGELDLTEVP

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida]1.2e-2144.88Show/hide
Query:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS
        SSK+VG A + ++GKSGG+L++WD+S+I VS + +   S+SI     N     ITNVYGP DY++ +RLW+EL  ++   +  WC+GGDFN  R   +R 
Subjt:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS

Query:  SGGRITRSMRKFNCIIGELDLTEVPLS
          G+ TR M  FN  I   +L E+PLS
Subjt:  SGGRITRSMRKFNCIIGELDLTEVPLS

TrEMBL top hitse value%identityAlignment
A0A438GDE7 LINE-1 retrotransposable element ORF2 protein9.6e-2039.68Show/hide
Query:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS
        ++++  WA L + G SGGIL +WD  +++  EV+ G+ S+SI  +L+   ++ ++ VYGP +    K LW EL DI+G +   WC+GGDFNV R  S++ 
Subjt:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS

Query:  SGGRITRSMRKFNCIIGELDLTEVPL
         G R+T SM+ F+  I + +L ++PL
Subjt:  SGGRITRSMRKFNCIIGELDLTEVPL

A0A438GQR2 Uncharacterized protein9.6e-2039.68Show/hide
Query:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS
        ++++  WA L + G SGGIL +WD  +++  EV+ G+ S+SI  +L+   ++ ++ VYGP +    K LW EL DI+G +   WC+GGDFNV R  S++ 
Subjt:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS

Query:  SGGRITRSMRKFNCIIGELDLTEVPL
         G R+T SM+ F+  I + +L ++PL
Subjt:  SGGRITRSMRKFNCIIGELDLTEVPL

A0A5D3BHE3 Uncharacterized protein6.6e-2143.65Show/hide
Query:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS
        SSKD+GW  + S G+ GGIL++WD S+I V E L+G  S+SI    S   +  ITNVYGP DY + + +W  L  +SG+    WC+GG  N++RW  +  
Subjt:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS

Query:  SGGRITRSMRKFNCIIGELDLTEVPL
           + TR MR+FN  I  L++ E+PL
Subjt:  SGGRITRSMRKFNCIIGELDLTEVPL

A0A6J1DMQ9 uncharacterized protein LOC1110219834.1e-3159.06Show/hide
Query:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS
        SSKDVGWACLNS+                                                DYR+ KRLWSELRDISGFSEKFWCLGGDFNVSRWPSD+S
Subjt:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS

Query:  SGGRITRSMRKFNCIIGELDLTEVPLS
        SGGRITRSMRKFNCIIGELDLTEVPLS
Subjt:  SGGRITRSMRKFNCIIGELDLTEVPLS

A5CAA2 Reverse transcriptase domain-containing protein9.6e-2039.68Show/hide
Query:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS
        ++++  WA L + G SGGIL +WD  +++  EV+ G+ S+SI  +L+   ++ ++ VYGP +    K LW EL DI+G +   WC+GGDFNV R  S++ 
Subjt:  SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRS

Query:  SGGRITRSMRKFNCIIGELDLTEVPL
         G R+T SM+ F+  I + +L ++PL
Subjt:  SGGRITRSMRKFNCIIGELDLTEVPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGCTCCAAGGATGTAGGTTGGGCATGTTTGAATTCGGAAGGTAAATCAGGAGGTATTTTATCTTTATGGGATGAAAGCAGAATCGCAGTATCTGAAGTTCTAGAAGGCAC
TTGCTCAATTAGTATAGTAGTCTCTCTTTCCAATTTGAATAATGTGGTGATAACAAATGTTTATGGCCCAACAGATTACAGAGATAGTAAGCGATTATGGTCAGAACTCA
GGGATATTTCAGGCTTCAGTGAAAAATTCTGGTGCTTGGGTGGTGATTTTAACGTATCAAGATGGCCTTCAGACAGATCTTCAGGGGGACGTATTACTAGGAGCATGAGA
AAATTTAATTGTATCATTGGAGAGCTTGACCTTACGGAGGTTCCCTTATCT
mRNA sequenceShow/hide mRNA sequence
AGCTCCAAGGATGTAGGTTGGGCATGTTTGAATTCGGAAGGTAAATCAGGAGGTATTTTATCTTTATGGGATGAAAGCAGAATCGCAGTATCTGAAGTTCTAGAAGGCAC
TTGCTCAATTAGTATAGTAGTCTCTCTTTCCAATTTGAATAATGTGGTGATAACAAATGTTTATGGCCCAACAGATTACAGAGATAGTAAGCGATTATGGTCAGAACTCA
GGGATATTTCAGGCTTCAGTGAAAAATTCTGGTGCTTGGGTGGTGATTTTAACGTATCAAGATGGCCTTCAGACAGATCTTCAGGGGGACGTATTACTAGGAGCATGAGA
AAATTTAATTGTATCATTGGAGAGCTTGACCTTACGGAGGTTCCCTTATCT
Protein sequenceShow/hide protein sequence
SSKDVGWACLNSEGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYGPTDYRDSKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDRSSGGRITRSMR
KFNCIIGELDLTEVPLS