; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g15060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g15060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag protease polyprotein
Genome locationchr10:11440259..11440840
RNA-Seq ExpressionMoc10g15060
SyntenyMoc10g15060
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0009987 - cellular process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]3.2e-4369.92Show/hide
Query:  MVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVT
        +VQ     +   + Q  G +S +AKYLRDFKKYD  SFDGLSVD  LAEAWLS +ETI  YMRC EEQKVQC VFMLKDDAFLWWE  +R IDVSGGPVT
Subjt:  MVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVT

Query:  WLQFKEAFFQQYYPTITRFRKQAD-SNLKQGNR
        WLQFKEAFFQQYYP IT +RKQ +  NLKQ NR
Subjt:  WLQFKEAFFQQYYPTITRFRKQAD-SNLKQGNR

XP_038883046.1 uncharacterized protein LOC120074107 [Benincasa hispida]2.2e-2847.3Show/hide
Query:  ANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFL
        A +E   Q+ +Q E  P+ QL    +  P  +    +S +AK+LRDF+KYD  SFDG   D T A+ WLSSIETI  +MRCPEE K+QC VFML  +  +
Subjt:  ANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFL

Query:  WWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRFRKQAD-SNLKQG
        WW   ++ ID  G   TW QFKE F+++Y+   TR+ KQA+  NLKQG
Subjt:  WWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRFRKQAD-SNLKQG

XP_038887018.1 uncharacterized protein LOC120077183 [Benincasa hispida]3.2e-2755.86Show/hide
Query:  MSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRF
        +S +AK+LRDF+K+D+ SFDG   D T A+ WLSSIETI H+MRC EE K+QC VFML  +A +WW  A++ ID SGG  TW QFKE F++ Y+   TR+
Subjt:  MSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRF

Query:  RKQAD-SNLKQ
         KQ +  NLKQ
Subjt:  RKQAD-SNLKQ

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida]1.0e-2850Show/hide
Query:  PPMVQLPG-QMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGG
        PP  Q+P  Q +N    Q+    S +AK+LRDFKKY+  +F+G   D T AE W+S IETI  YM+CPE+QKVQC VFML D A +WW+ A+R + V G 
Subjt:  PPMVQLPG-QMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGG

Query:  PVTWLQFKEAFFQQYYPTITRFRKQAD-SNLKQGNR
        PVTW QFKE F+ +Y+    R+ KQ +   L+QG+R
Subjt:  PVTWLQFKEAFFQQYYPTITRFRKQAD-SNLKQGNR

XP_038896687.1 uncharacterized protein LOC120084949 [Benincasa hispida]3.8e-2851.26Show/hide
Query:  QTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYP
        Q+   +S +AK+LRDF+KY+  +F+G   DLT AE W+SSIETI  YM+CPE+QKVQC +FML D A +WW+ A+R + V G PVTW QFKE F+ +Y+ 
Subjt:  QTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYP

Query:  TITRFRKQAD-SNLKQGNR
           ++ KQ +   L+QG+R
Subjt:  TITRFRKQAD-SNLKQGNR

TrEMBL top hitse value%identityAlignment
A0A5A7SW90 Reverse transcriptase5.5e-2542.58Show/hide
Query:  RLGGLGGGANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVF
        R GG GG        +      + P VQ   Q  NP        +S +AK+LRDF+KY+  +FDG   D T A+ WLSS+ETI  YM+CPE+QKVQC VF
Subjt:  RLGGLGGGANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVF

Query:  MLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITR-FRKQADSNLKQ
        ML D    WWE  +R +    G +TW QFKE+FF +++    R  ++Q   NL+Q
Subjt:  MLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITR-FRKQADSNLKQ

A0A5A7T014 Gag protease polyprotein9.4e-2541.78Show/hide
Query:  ETVVQSPSQTEIPPMVQLPGQMENPPMGQ-TSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWW
        + ++Q   Q +  P    P     P + Q     +S +AK+LRDF+KY++ +F+G   D T A+ WLSS+ETI  YM+CPE+QKVQCVVFML D   +WW
Subjt:  ETVVQSPSQTEIPPMVQLPGQMENPPMGQ-TSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWW

Query:  ECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITR-FRKQADSNLKQG
        E  +R +    G +TW QFKE+F+ +++    R  ++Q   NL+QG
Subjt:  ECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITR-FRKQADSNLKQG

A0A5D3BCA2 Gag protease polyprotein9.4e-2541.78Show/hide
Query:  ETVVQSPSQTEIPPMVQLPGQMENPPMGQ-TSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWW
        + ++Q   Q +  P    P     P + Q     +S +AK+LRDF+KY++ +F+G   D T A+ WLSS+ETI  YM+CPE+QKVQCVVFML D   +WW
Subjt:  ETVVQSPSQTEIPPMVQLPGQMENPPMGQ-TSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWW

Query:  ECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITR-FRKQADSNLKQG
        E  +R +    G +TW QFKE+F+ +++    R  ++Q   NL+QG
Subjt:  ECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITR-FRKQADSNLKQG

A0A5D3E4V0 Reverse transcriptase5.5e-2542.58Show/hide
Query:  RLGGLGGGANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVF
        R GG GG        +      + P VQ   Q  NP        +S +AK+LRDF+KY+  +FDG   D T A+ WLSS+ETI  YM+CPE+QKVQC VF
Subjt:  RLGGLGGGANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVF

Query:  MLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITR-FRKQADSNLKQ
        ML D    WWE  +R +    G +TW QFKE+FF +++    R  ++Q   NL+Q
Subjt:  MLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITR-FRKQADSNLKQ

A0A6J1DSJ6 uncharacterized protein LOC1110235121.5e-4369.92Show/hide
Query:  MVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVT
        +VQ     +   + Q  G +S +AKYLRDFKKYD  SFDGLSVD  LAEAWLS +ETI  YMRC EEQKVQC VFMLKDDAFLWWE  +R IDVSGGPVT
Subjt:  MVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVT

Query:  WLQFKEAFFQQYYPTITRFRKQAD-SNLKQGNR
        WLQFKEAFFQQYYP IT +RKQ +  NLKQ NR
Subjt:  WLQFKEAFFQQYYPTITRFRKQAD-SNLKQGNR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACTGGGTGGGCTTGGTGGAGGTGCAAACCTTGAGACAGTGGTTCAATCTCCTAGTCAGACGGAAATTCCACCAATGGTTCAACTTCCTGGTCAGATGGAG
AATCCACCAATGGGTCAAACTTCTGGACACATGTCAACAAAGGCTAAATATCTACGAGATTTTAAGAAGTACGACACTCACTCTTTTGATGGACTATCTGTAGAT
TTGACGTTGGCAGAGGCTTGGTTGTCATCGATAGAGACTATCTCTCATTACATGAGGTGTCCGGAGGAACAAAAAGTGCAGTGTGTAGTCTTTATGCTGAAAGAT
GATGCCTTTTTGTGGTGGGAGTGTGCCAAGAGGTCTATTGATGTGAGTGGAGGCCCGGTCACATGGTTGCAGTTCAAGGAGGCTTTCTTTCAACAATATTACCCA
ACGATCACCCGGTTTAGGAAACAAGCGGATTCGAACTTGAAGCAAGGCAATAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGACTGGGTGGGCTTGGTGGAGGTGCAAACCTTGAGACAGTGGTTCAATCTCCTAGTCAGACGGAAATTCCACCAATGGTTCAACTTCCTGGTCAGATGGAG
AATCCACCAATGGGTCAAACTTCTGGACACATGTCAACAAAGGCTAAATATCTACGAGATTTTAAGAAGTACGACACTCACTCTTTTGATGGACTATCTGTAGAT
TTGACGTTGGCAGAGGCTTGGTTGTCATCGATAGAGACTATCTCTCATTACATGAGGTGTCCGGAGGAACAAAAAGTGCAGTGTGTAGTCTTTATGCTGAAAGAT
GATGCCTTTTTGTGGTGGGAGTGTGCCAAGAGGTCTATTGATGTGAGTGGAGGCCCGGTCACATGGTTGCAGTTCAAGGAGGCTTTCTTTCAACAATATTACCCA
ACGATCACCCGGTTTAGGAAACAAGCGGATTCGAACTTGAAGCAAGGCAATAGATGA
Protein sequenceShow/hide protein sequence
MRLGGLGGGANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKD
DAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRFRKQADSNLKQGNR