CuGenDBv2

Moc03g18450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g18450
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr3:12178143..12179045
RNA-Seq Expression: Moc03g18450
Synteny: Moc03g18450
Gene Ontology terms: NA
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
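
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, the span it covers can be computed as below, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention). The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    chrom, span = location.split(":")
    start, end = (int(x) for x in span.split(".."))
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr3:12178143..12179045")
print(chrom, span_length(start, end))  # chr3 903
```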


Homology
GenBank top hits (e value, % identity, alignment)
XP_022144016.1 uncharacterized protein LOC111013805 [Momordica charantia] (e value: 3.1e-50; % identity: 49.18)
Query:  PAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRRLG-----------------------------------------
        P +       KK Q  +FK+FL+VL+Q+H+N+PLVEALEQMP YV+FLK+ILTKKR LG                                         
Subjt:  PAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRRLG-----------------------------------------

Query:  -----MGA-----------------IKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVR
             MGA                  +PTTVT QLAD SI +P+GK EDV VQV KF FPADFI+LD+DA +EVPIILGR FLATG+ALVDV+KGE+T+ 
Subjt:  -----MGA-----------------IKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVR

Query:  VQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFMLE
        VQDQE+KFSV+ SMK+ A+++EC+VL++LDEA M  L  E MLE
Subjt:  VQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFMLE

XP_022149795.1 uncharacterized protein LOC111018140 [Momordica charantia] (e value: 7.3e-52; % identity: 45.2)
Query:  PEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKK---------
        P + +++K  AS    T + K+ E    PV   +Y P  PYPK LQKK Q  QF++FL+VL+Q+HI +PLVEALEQM TYV+FLKDILTKK         
Subjt:  PEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKK---------

Query:  ------------------------------------------------------RRLGMGAIKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFI
                                                               +LG+G  +PTT+T QLAD SI   + K EDVLVQV KFIFP DFI
Subjt:  ------------------------------------------------------RRLGMGAIKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFI

Query:  VLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFMLEH
        +LD+D  +EVPII+GR FL+TG+ALVDV KGE+ + VQD E+KFSVY+SMK PA+++EC+VL++L+EA +   ++E MLEH
Subjt:  VLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFMLEH

XP_022155208.1 uncharacterized protein LOC111022348 [Momordica charantia] (e value: 6.8e-50; % identity: 46.15)
Query:  MNTDEAESKEDHQEPEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKD
        M+     SKE    P + +++K+ AS  + T + K+ E    P    +Y    PYPKRLQKK ++ QF++FL+VL+Q+++N+PLVEAL+QMPTYV+FLKD
Subjt:  MNTDEAESKEDHQEPEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKD

Query:  ILTKK---------------------------------------------------------------RRLGMGAIKPTTVTFQLADWSIANPKGKTEDV
        I TKK                                                               R+LGMG  +PTTVT QLAD SI  P+GK EDV
Subjt:  ILTKK---------------------------------------------------------------RRLGMGAIKPTTVTFQLADWSIANPKGKTEDV

Query:  LVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADEC
        LVQV KFIFPADFI+LD+DA +EVPIILGR FL TG+ALVDV KGE+T+RVQDQE KFS+Y+SMK+P   +EC
Subjt:  LVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADEC

XP_022156989.1 uncharacterized protein LOC111023818 [Momordica charantia] (e value: 1.8e-50; % identity: 45.49)
Query:  QSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRR-------------------------------
        + K+ + +   VP  PYP+RLQKKNQ+ QF +FLEVL+Q+HIN+PL+EALEQMP YVKFLKDIL KKRR                               
Subjt:  QSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRR-------------------------------

Query:  --------------------------------LGMGAIKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKA
                                        LG+G  +P TVT QLAD SI   +GK EDVLVQV KFIFPADFI+LD++A +E+PIILGR FL+TG+A
Subjt:  --------------------------------LGMGAIKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKA

Query:  LVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFML
        L+DVH GE+T+RV DQ++  S++ S+KYP D +EC+ LR+ D+     + +E +L
Subjt:  LVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFML

XP_030497888.1 uncharacterized protein LOC115713544 [Cannabis sativa] (e value: 3.0e-53; % identity: 58.7)
Query:  PAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRR-----------------LGMGAIKPTTVTFQLADWSIANPKGK
        P  P+P R +K+  +GQF RFL+VL+Q+HIN+PLVEALEQMPTYVKFLKDILTKKRR                 LG+G  +PTTVT QL D S+A+P+GK
Subjt:  PAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRR-----------------LGMGAIKPTTVTFQLADWSIANPKGK

Query:  TEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLD
         EDV VQV KFIFPADFI+LD++A +EVPIILGR FLATG+ L+DV  GE+T+RV DQ++ F+V+ +M++P + +EC+ L V+D
Subjt:  TEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLD

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CS22 uncharacterized protein LOC111013805 (e value: 1.5e-50; % identity: 49.18)
Query:  PAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRRLG-----------------------------------------
        P +       KK Q  +FK+FL+VL+Q+H+N+PLVEALEQMP YV+FLK+ILTKKR LG                                         
Subjt:  PAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRRLG-----------------------------------------

Query:  -----MGA-----------------IKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVR
             MGA                  +PTTVT QLAD SI +P+GK EDV VQV KF FPADFI+LD+DA +EVPIILGR FLATG+ALVDV+KGE+T+ 
Subjt:  -----MGA-----------------IKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVR

Query:  VQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFMLE
        VQDQE+KFSV+ SMK+ A+++EC+VL++LDEA M  L  E MLE
Subjt:  VQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFMLE

A0A6J1D3P6 uncharacterized protein LOC111017014 (e value: 1.6e-49; % identity: 43.82)
Query:  ESKEDHQEPEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKR
        E ++   E   P    +  + S +T  LKK +Q++     A Y P  PYPKRLQKK Q  QFK+ L+VL+Q+H+N+P VEALEQ+P YV+FLK+IL KKR
Subjt:  ESKEDHQEPEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKR

Query:  ---------------------------------------------------------------RLGMGAIKPTTVTFQLADWSIANPKGKTEDVLVQVRK
                                                                       +LG+G  +PTTVT QLAD S+ +P+GK EDVLVQV K
Subjt:  ---------------------------------------------------------------RLGMGAIKPTTVTFQLADWSIANPKGKTEDVLVQVRK

Query:  FIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSS
        FIFP DFI+LD+DA +EV II+ R FLAT +ALV+VHKG++T+RV DQE+KFSVY SM +P  A+EC V+++LDEA M  L +
Subjt:  FIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSS

A0A6J1D6R2 uncharacterized protein LOC111018140 (e value: 3.5e-52; % identity: 45.2)
Query:  PEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKK---------
        P + +++K  AS    T + K+ E    PV   +Y P  PYPK LQKK Q  QF++FL+VL+Q+HI +PLVEALEQM TYV+FLKDILTKK         
Subjt:  PEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKK---------

Query:  ------------------------------------------------------RRLGMGAIKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFI
                                                               +LG+G  +PTT+T QLAD SI   + K EDVLVQV KFIFP DFI
Subjt:  ------------------------------------------------------RRLGMGAIKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFI

Query:  VLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFMLEH
        +LD+D  +EVPII+GR FL+TG+ALVDV KGE+ + VQD E+KFSVY+SMK PA+++EC+VL++L+EA +   ++E MLEH
Subjt:  VLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFMLEH

A0A6J1DMD0 uncharacterized protein LOC111022348 (e value: 3.3e-50; % identity: 46.15)
Query:  MNTDEAESKEDHQEPEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKD
        M+     SKE    P + +++K+ AS  + T + K+ E    P    +Y    PYPKRLQKK ++ QF++FL+VL+Q+++N+PLVEAL+QMPTYV+FLKD
Subjt:  MNTDEAESKEDHQEPEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKD

Query:  ILTKK---------------------------------------------------------------RRLGMGAIKPTTVTFQLADWSIANPKGKTEDV
        I TKK                                                               R+LGMG  +PTTVT QLAD SI  P+GK EDV
Subjt:  ILTKK---------------------------------------------------------------RRLGMGAIKPTTVTFQLADWSIANPKGKTEDV

Query:  LVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADEC
        LVQV KFIFPADFI+LD+DA +EVPIILGR FL TG+ALVDV KGE+T+RVQDQE KFS+Y+SMK+P   +EC
Subjt:  LVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADEC

A0A6J1DV77 uncharacterized protein LOC111023818 (e value: 8.7e-51; % identity: 45.49)
Query:  QSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRR-------------------------------
        + K+ + +   VP  PYP+RLQKKNQ+ QF +FLEVL+Q+HIN+PL+EALEQMP YVKFLKDIL KKRR                               
Subjt:  QSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRR-------------------------------

Query:  --------------------------------LGMGAIKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKA
                                        LG+G  +P TVT QLAD SI   +GK EDVLVQV KFIFPADFI+LD++A +E+PIILGR FL+TG+A
Subjt:  --------------------------------LGMGAIKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKA

Query:  LVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFML
        L+DVH GE+T+RV DQ++  S++ S+KYP D +EC+ LR+ D+     + +E +L
Subjt:  LVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAEMMALSSEFML

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAACACAGATGAAGCTGAATCTAAAGAGGATCATCAGGAGCCCGAACAACCTAAGTCAAGTAAATCTGGAGCATCCGCGTCCAACAGAACCGAGGAATTGAAAAAGCA
TGAGCAATCAAAAATTCCTGTGAGAATAGCAGATTATGTTCCAGCAGTGCCTTACCCGAAGCGATTGCAGAAAAAGAATCAGGAAGGGCAGTTCAAAAGGTTCCTTGAAG
TGCTTGAACAGATACATATAAATATGCCTCTGGTGGAAGCTTTGGAACAAATGCCTACATATGTAAAGTTCTTGAAGGACATCCTCACTAAAAAGAGACGGCTAGGCATG
GGTGCAATTAAGCCAACAACAGTCACCTTTCAGCTAGCTGATTGGTCCATTGCTAATCCAAAAGGAAAAACTGAGGATGTGCTGGTTCAAGTACGCAAATTTATATTCCC
TGCTGATTTTATTGTTCTTGATTTTGATGCACATGAGGAAGTGCCCATCATCTTGGGAAGATCATTTTTGGCAACAGGCAAAGCACTAGTGGATGTACACAAGGGAGAGG
TCACTGTAAGGGTACAAGACCAAGAGATCAAATTCTCGGTATATGAATCTATGAAATATCCTGCGGATGCTGATGAGTGTGCTGTTCTAAGAGTACTAGATGAAGCTGAA
ATGATGGCTCTAAGTTCGGAATTTATGCTTGAGCACCAGAGTTAG
mRNA sequence
ATGAACACAGATGAAGCTGAATCTAAAGAGGATCATCAGGAGCCCGAACAACCTAAGTCAAGTAAATCTGGAGCATCCGCGTCCAACAGAACCGAGGAATTGAAAAAGCA
TGAGCAATCAAAAATTCCTGTGAGAATAGCAGATTATGTTCCAGCAGTGCCTTACCCGAAGCGATTGCAGAAAAAGAATCAGGAAGGGCAGTTCAAAAGGTTCCTTGAAG
TGCTTGAACAGATACATATAAATATGCCTCTGGTGGAAGCTTTGGAACAAATGCCTACATATGTAAAGTTCTTGAAGGACATCCTCACTAAAAAGAGACGGCTAGGCATG
GGTGCAATTAAGCCAACAACAGTCACCTTTCAGCTAGCTGATTGGTCCATTGCTAATCCAAAAGGAAAAACTGAGGATGTGCTGGTTCAAGTACGCAAATTTATATTCCC
TGCTGATTTTATTGTTCTTGATTTTGATGCACATGAGGAAGTGCCCATCATCTTGGGAAGATCATTTTTGGCAACAGGCAAAGCACTAGTGGATGTACACAAGGGAGAGG
TCACTGTAAGGGTACAAGACCAAGAGATCAAATTCTCGGTATATGAATCTATGAAATATCCTGCGGATGCTGATGAGTGTGCTGTTCTAAGAGTACTAGATGAAGCTGAA
ATGATGGCTCTAAGTTCGGAATTTATGCTTGAGCACCAGAGTTAG
Protein sequence
MNTDEAESKEDHQEPEQPKSSKSGASASNRTEELKKHEQSKIPVRIADYVPAVPYPKRLQKKNQEGQFKRFLEVLEQIHINMPLVEALEQMPTYVKFLKDILTKKRRLGM
GAIKPTTVTFQLADWSIANPKGKTEDVLVQVRKFIFPADFIVLDFDAHEEVPIILGRSFLATGKALVDVHKGEVTVRVQDQEIKFSVYESMKYPADADECAVLRVLDEAE
MMALSSEFMLEHQS
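
The relationship between the CDS and protein sequences above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` function and `CODON_TABLE` name are illustrative and not part of the database. It is shown here on the first 21 nucleotides of the CDS only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGAACACAGATGAAGCTGAA"))  # MNTDEAE
```

Passing the full CDS (with the line breaks left in) to `translate` and comparing the result with the listed protein sequence would be one way to confirm that the two entries agree.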