; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g14640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g14640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag protease polyprotein
Genome locationchr9:12589538..12590447
RNA-Seq ExpressionMoc09g14640
SyntenyMoc09g14640
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]1.6e-6075.16Show/hide
Query:  DFINTYPALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEF
        D ++  P LA+AWLS METIF YMRC +EQKVQ  +FMLKDDA LWWES ER IDV GGP+TWLQFK+AFF QYYPAIT +RKQ EFLNLKQ N+SVEE+
Subjt:  DFINTYPALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEF

Query:  EREFTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALIDN
        +REFTKLSRFAP+LVDTE+ K ERFI+ LKDE +GFVA LSPPDYA ALR AALIDN
Subjt:  EREFTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALIDN

XP_038882311.1 uncharacterized protein LOC120073551 [Benincasa hispida]8.7e-3853.01Show/hide
Query:  RAYYSDFDDGDFINTYPALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNL
        R YY    DG   N  P  A+ WLSS+E IF++MRC +E K+Q A+FML  +A +WW S E+ ID GG   TW QFK+ F+ +Y+ A T + KQAEFLN 
Subjt:  RAYYSDFDDGDFINTYPALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNL

Query:  KQGNKSVEEFEREFTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALID
        KQG  SVEE+E++F KLS FAPKLV TE+ +T  FI GLK  ++G V AL    YA AL AA  ID
Subjt:  KQGNKSVEEFEREFTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALID

XP_038885815.1 uncharacterized protein LOC120076109 [Benincasa hispida]8.7e-3855.33Show/hide
Query:  PALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTK
        P  A+ WLSS+ETIF++MRCP+E K+Q AIFML  +A +WW S E+ ID GG    W QFK+ F+ +Y+ A T++ KQAEFLNLKQG  SVE++E+EF K
Subjt:  PALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTK

Query:  LSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALID
        LSRF P+LV T++ +TERFI  L+  ++G V AL    Y  ALRAA  ID
Subjt:  LSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALID

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida]1.4e-4054Show/hide
Query:  PALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTK
        P  A+ W+S +ETIF YM+CP++QKVQ A+FML D A +WW+ AER + VGG P+TW QFK+ F+ +Y+ A  ++ KQ EFL L+QG++SVEE+++EF  
Subjt:  PALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTK

Query:  LSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALID
        LSRFAP+LV TE+ + ERFI GLK+ I+G V A  P  +  ALR AA +D
Subjt:  LSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALID

XP_038895970.1 uncharacterized protein LOC120084143 [Benincasa hispida]1.1e-3754.67Show/hide
Query:  PALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTK
        P     WLSS+ETIF++MRCP+E  +Q A+FML  +  +WW SAE+ ID+GG   TW +FK+ F+ +Y+ A T++ KQAEFLNL QG  SVEE+E+EF K
Subjt:  PALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTK

Query:  LSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALID
        LS F PKLV TE+ + ERFI GL+  +QG V AL    YA  LRAA  ID
Subjt:  LSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALID

TrEMBL top hitse value%identityAlignment
A0A5A7T7E7 Ty3-gypsy retrotransposon protein2.8e-3449.31Show/hide
Query:  PALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTK
        P  A  WLSS+ETIF YM+CP++QKVQ AIFML D    WWE+ ER +      ITW QFK++F+ +++PA  +  K+ EFLNL+QG+ +VE+++ EF  
Subjt:  PALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTK

Query:  LSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALR
        LSRFAP+++ TE+ + ++F+ GL+ +IQG V A  P  +A ALR
Subjt:  LSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALR

A0A5A7VJF1 Reverse transcriptase3.7e-3448.15Show/hide
Query:  RAYYSDFDDGDFINTYPALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNL
        R Y S   DG   N  P  A  WL+S+ETIF YM+CP++QKVQ A+F L+D    WWE+AER +      ITW QFK+ F+ +++ A  +  K  EFLNL
Subjt:  RAYYSDFDDGDFINTYPALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNL

Query:  KQGNKSVEEFEREFTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAA
        +QG+ +VE+++ EF  LSRFAP +V  ES +TE+F+ GL+ ++QG V AL P  +A ALR A
Subjt:  KQGNKSVEEFEREFTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAA

A0A5D3E4V0 Reverse transcriptase3.7e-3442.13Show/hide
Query:  IPPVVQLTGQTGNPPMGQTPGQTGNPPIGSNFWTSRAYYSDFDDGDFINTYPALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSID
        + P VQ   Q  NP     P Q        +    R Y     DG   +  P  A  WLSS+ETIF YM+CP++QKVQ A+FML D    WWE+ ER + 
Subjt:  IPPVVQLTGQTGNPPMGQTPGQTGNPPIGSNFWTSRAYYSDFDDGDFINTYPALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSID

Query:  VGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAA
           G ITW QFK++FF +++ A  +  K+ EFLNL+Q + +VE+++ EF  LSRFAP+++ TE+ + ++F+ GL+ +IQG V A  P  +A ALR A
Subjt:  VGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAA

A0A6J1DSJ6 uncharacterized protein LOC1110235127.9e-6175.16Show/hide
Query:  DFINTYPALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEF
        D ++  P LA+AWLS METIF YMRC +EQKVQ  +FMLKDDA LWWES ER IDV GGP+TWLQFK+AFF QYYPAIT +RKQ EFLNLKQ N+SVEE+
Subjt:  DFINTYPALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEF

Query:  EREFTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALIDN
        +REFTKLSRFAP+LVDTE+ K ERFI+ LKDE +GFVA LSPPDYA ALR AALIDN
Subjt:  EREFTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALIDN

A0A6J1EKD9 uncharacterized protein LOC1114354605.7e-3548.37Show/hide
Query:  PALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSI---DVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFERE
        P L ++W+ S+ETIF +M CP++QKV+ A FMLK +A  WW++A++++   D    PI W + K AF  +YYPA+  +  +  F++LKQGN +VEE+E E
Subjt:  PALADAWLSSMETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSI---DVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFERE

Query:  FTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALID
        FT+LSRFA + +DTE K+T +FI+GL+ EIQG VAA++   Y  AL AA+++D
Subjt:  FTKLSRFAPKLVDTESKKTERFIMGLKDEIQGFVAALSPPDYAIALRAAALID

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCCACGTAAAAGAATGTCTGTGAGACGGGGTGGGCTAGATAGGGATGTTGACCCTGAGACAATAGAGTGGACAGTAAATAACCAAACTTCAGGTCAGATAGAGAA
TCCACCAATGGTTCAAACTACTGATCAGACGGGAATTCCACCAGTGGTTCAACTTACTGGTCAGACGGGGAATCCACCAATGGGTCAAACTCCTGGACAAACGGGGAATC
CACCAATTGGGTCAAACTTTTGGACAAGCAGAGCCTACTATAGCGACTTTGATGATGGAGACTTTATAAACACTTATCCAGCGTTGGCAGACGCTTGGTTGTCGTCAATG
GAGACCATTTTTTATTATATGAGGTGTCCGGATGAACAAAAAGTGCAGTATGCTATCTTTATGCTAAAAGATGATGCCCTTTTATGGTGGGAGTCTGCAGAAAGGTCTAT
TGATGTGGGTGGAGGCCCAATCACATGGTTGCAGTTTAAGGATGCTTTCTTCCTACAGTATTACCCAGCGATCACCCAGTTCAGGAAACAAGCGGAGTTTTTAAACCTAA
AGCAAGGTAACAAATCAGTGGAAGAATTTGAGAGGGAATTCACAAAATTGTCTCGTTTTGCCCCTAAGCTAGTAGACACAGAGTCCAAGAAGACCGAACGATTCATAATG
GGCCTAAAGGATGAGATTCAAGGCTTCGTGGCAGCTCTCTCTCCACCAGATTATGCTATAGCACTTCGAGCAGCTGCATTGATTGATAATTTATTCCTCCTCTTACTAAT
GGAGAAAATATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCCACGTAAAAGAATGTCTGTGAGACGGGGTGGGCTAGATAGGGATGTTGACCCTGAGACAATAGAGTGGACAGTAAATAACCAAACTTCAGGTCAGATAGAGAA
TCCACCAATGGTTCAAACTACTGATCAGACGGGAATTCCACCAGTGGTTCAACTTACTGGTCAGACGGGGAATCCACCAATGGGTCAAACTCCTGGACAAACGGGGAATC
CACCAATTGGGTCAAACTTTTGGACAAGCAGAGCCTACTATAGCGACTTTGATGATGGAGACTTTATAAACACTTATCCAGCGTTGGCAGACGCTTGGTTGTCGTCAATG
GAGACCATTTTTTATTATATGAGGTGTCCGGATGAACAAAAAGTGCAGTATGCTATCTTTATGCTAAAAGATGATGCCCTTTTATGGTGGGAGTCTGCAGAAAGGTCTAT
TGATGTGGGTGGAGGCCCAATCACATGGTTGCAGTTTAAGGATGCTTTCTTCCTACAGTATTACCCAGCGATCACCCAGTTCAGGAAACAAGCGGAGTTTTTAAACCTAA
AGCAAGGTAACAAATCAGTGGAAGAATTTGAGAGGGAATTCACAAAATTGTCTCGTTTTGCCCCTAAGCTAGTAGACACAGAGTCCAAGAAGACCGAACGATTCATAATG
GGCCTAAAGGATGAGATTCAAGGCTTCGTGGCAGCTCTCTCTCCACCAGATTATGCTATAGCACTTCGAGCAGCTGCATTGATTGATAATTTATTCCTCCTCTTACTAAT
GGAGAAAATATAG
Protein sequenceShow/hide protein sequence
MPPRKRMSVRRGGLDRDVDPETIEWTVNNQTSGQIENPPMVQTTDQTGIPPVVQLTGQTGNPPMGQTPGQTGNPPIGSNFWTSRAYYSDFDDGDFINTYPALADAWLSSM
ETIFYYMRCPDEQKVQYAIFMLKDDALLWWESAERSIDVGGGPITWLQFKDAFFLQYYPAITQFRKQAEFLNLKQGNKSVEEFEREFTKLSRFAPKLVDTESKKTERFIM
GLKDEIQGFVAALSPPDYAIALRAAALIDNLFLLLLMEKI