; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002613 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002613
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationscaffold111:148028..148615
RNA-Seq ExpressionMS002613
SyntenyMS002613
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045000.1 uncharacterized protein E6C27_scaffold74G003100 [Cucumis melo var. makuwa]1.4e-2735.5Show/hide
Query:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---
        V L   YDE C   + +RI     T E         F    P FG +  PL A+ W+L LE +F  + CSD++ VSFA++ L++ A  W + L  ++   
Subjt:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---

Query:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER
           +TW+  K+ F +RY P W++ E F  LC L QGD TV EYD++F +L  LA E I ++A ++ +F +GLR + + +I   +++ YA++R+ A+  E+
Subjt:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER

KAG6577099.1 hypothetical protein SDJN03_24673, partial [Cucurbita argyrosperma subsp. sororia]1.8e-2736.6Show/hide
Query:  DEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGED-GPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---APPITW
        DE C         + A     E   KF +F    P + GED GPL A+ WVL LE +F  + CSDE+ VSFA + L++ A+ W I  ++ +      +TW
Subjt:  DEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGED-GPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---APPITW

Query:  KVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER
        + FK  F +RY PSW++ E    L KL QG++TV+EYD+EF  L  L  E   +D++++ +F+ GLR +   ++   D++ Y+D+R++A+ +E+
Subjt:  KVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER

TYK16471.1 uncharacterized protein E5676_scaffold21G002740 [Cucumis melo var. makuwa]2.8e-2836Show/hide
Query:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---
        V L   YDE C   + +RI     T E         F    P FG +  PL A+ W+L LE +F  + CSD++ VSFA++ L++ A  W + L  ++   
Subjt:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---

Query:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER
           +TW+ FK+ F +RY P W++ E F  LC L QGD TV EYD++F +L  LA E I ++A ++ +F +GLR + + +I   +++ YA++R+ A+  E+
Subjt:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER

XP_004147729.1 uncharacterized protein LOC101209793 [Cucumis sativus]8.0e-2836.5Show/hide
Query:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---
        V L   YDE C   + MRI     T E         F    P FG +  PL A+ W+L LE +F  + CSDE  VSFA + L++ A  W + +  ++   
Subjt:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---

Query:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER
           +TW+ FK+ F +RY PSW++ E F  LC L QGD TV EYD++F +L  LA E I ++A ++ +F +GLR + + ++   ++   A+IR+ A+ +E+
Subjt:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER

XP_022136646.1 uncharacterized protein LOC111008300 [Momordica charantia]7.1e-9397.66Show/hide
Query:  AEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNVAPPITWKVFKKEFLERYHPSWMQKENFT
        +EHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNVAPPITWK FKKEFLERYHPSWMQKENFT
Subjt:  AEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNVAPPITWKVFKKEFLERYHPSWMQKENFT

Query:  MLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKIDSMQYADIRDAAIKLERDF
        MLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLR EFKLKIDSMQYADIRD AIKLERDF
Subjt:  MLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKIDSMQYADIRDAAIKLERDF

TrEMBL top hitse value%identityAlignment
A0A0A0L042 Retrotrans_gag domain-containing protein3.9e-2836.5Show/hide
Query:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---
        V L   YDE C   + MRI     T E         F    P FG +  PL A+ W+L LE +F  + CSDE  VSFA + L++ A  W + +  ++   
Subjt:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---

Query:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER
           +TW+ FK+ F +RY PSW++ E F  LC L QGD TV EYD++F +L  LA E I ++A ++ +F +GLR + + ++   ++   A+IR+ A+ +E+
Subjt:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER

A0A5A7TP79 Retrotrans_gag domain-containing protein6.6e-2835.5Show/hide
Query:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---
        V L   YDE C   + +RI     T E         F    P FG +  PL A+ W+L LE +F  + CSD++ VSFA++ L++ A  W + L  ++   
Subjt:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---

Query:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER
           +TW+  K+ F +RY P W++ E F  LC L QGD TV EYD++F +L  LA E I ++A ++ +F +GLR + + +I   +++ YA++R+ A+  E+
Subjt:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER

A0A5D3CX20 Retrotrans_gag domain-containing protein1.3e-2836Show/hide
Query:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---
        V L   YDE C   + +RI     T E         F    P FG +  PL A+ W+L LE +F  + CSD++ VSFA++ L++ A  W + L  ++   
Subjt:  VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---

Query:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER
           +TW+ FK+ F +RY P W++ E F  LC L QGD TV EYD++F +L  LA E I ++A ++ +F +GLR + + +I   +++ YA++R+ A+  E+
Subjt:  APPITWKVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER

A0A6J1C434 uncharacterized protein LOC1110083003.4e-9397.66Show/hide
Query:  AEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNVAPPITWKVFKKEFLERYHPSWMQKENFT
        +EHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNVAPPITWK FKKEFLERYHPSWMQKENFT
Subjt:  AEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNVAPPITWKVFKKEFLERYHPSWMQKENFT

Query:  MLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKIDSMQYADIRDAAIKLERDF
        MLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLR EFKLKIDSMQYADIRD AIKLERDF
Subjt:  MLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKIDSMQYADIRDAAIKLERDF

A0A6J1EYB6 uncharacterized protein LOC111437567 isoform X15.6e-2736.08Show/hide
Query:  DEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGED-GPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---APPITW
        DE C         + A     E   KF +F    P + GED  PL A+ WVL LE +F  + CSDE+ VSFA + L++ A+ W I  ++ +      +TW
Subjt:  DEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGED-GPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNV---APPITW

Query:  KVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER
        + FK  F +RY PSW++ E    L KL QG++TV+EYD+EF  L  L  E   +D++++ +F+ GLR +   ++   D++ Y+D+R++A+ +E+
Subjt:  KVFKKEFLERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKI---DSMQYADIRDAAIKLER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GTCACTTTAGAGACTTTTTACGACGAGTTTTGCGAATTTGAGAACAGAATGAGGATTTATATGCATGCTACTACTGCTGAGCATGAACGTAGAGACAAGTTTCGGCGATT
TAACGCTCGTAGTCCTAAATTTGGAGGCGAGGATGGCCCCCTAAAGGCCAGACTCTGGGTTCTTAAGTTGGAGAGATTGTTTAAGTTTTTAGATTGCTCCGACGAAGAAA
ATGTTTCGTTTGCCACTATCATGCTCGAAAACGATGCTATCGATTGGGCGATTTGGCTGGAAAAAAATGTAGCTCCTCCAATCACATGGAAGGTCTTCAAGAAGGAATTC
CTTGAAAGGTACCATCCATCTTGGATGCAGAAAGAAAACTTTACTATGCTCTGCAAGCTGGTACAAGGAGATAAAACTGTGGTTGAATATGATAAAGAATTTGATAGATT
GTTTCATCTTGCCGGTGAGTGCATTCAAAATGATGCAATGAAATCTTCCATGTTTTTCAATGGGTTGAGGTTTGAATTCAAGCTAAAAATTGACAGCATGCAGTACGCCG
ATATTAGAGATGCGGCAATAAAATTGGAGCGAGACTTT
mRNA sequenceShow/hide mRNA sequence
GTCACTTTAGAGACTTTTTACGACGAGTTTTGCGAATTTGAGAACAGAATGAGGATTTATATGCATGCTACTACTGCTGAGCATGAACGTAGAGACAAGTTTCGGCGATT
TAACGCTCGTAGTCCTAAATTTGGAGGCGAGGATGGCCCCCTAAAGGCCAGACTCTGGGTTCTTAAGTTGGAGAGATTGTTTAAGTTTTTAGATTGCTCCGACGAAGAAA
ATGTTTCGTTTGCCACTATCATGCTCGAAAACGATGCTATCGATTGGGCGATTTGGCTGGAAAAAAATGTAGCTCCTCCAATCACATGGAAGGTCTTCAAGAAGGAATTC
CTTGAAAGGTACCATCCATCTTGGATGCAGAAAGAAAACTTTACTATGCTCTGCAAGCTGGTACAAGGAGATAAAACTGTGGTTGAATATGATAAAGAATTTGATAGATT
GTTTCATCTTGCCGGTGAGTGCATTCAAAATGATGCAATGAAATCTTCCATGTTTTTCAATGGGTTGAGGTTTGAATTCAAGCTAAAAATTGACAGCATGCAGTACGCCG
ATATTAGAGATGCGGCAATAAAATTGGAGCGAGACTTT
Protein sequenceShow/hide protein sequence
VTLETFYDEFCEFENRMRIYMHATTAEHERRDKFRRFNARSPKFGGEDGPLKARLWVLKLERLFKFLDCSDEENVSFATIMLENDAIDWAIWLEKNVAPPITWKVFKKEF
LERYHPSWMQKENFTMLCKLVQGDKTVVEYDKEFDRLFHLAGECIQNDAMKSSMFFNGLRFEFKLKIDSMQYADIRDAAIKLERDF