CuGenDBv2

Moc04g15320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g15320
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr4:11565133..11568993
RNA-Seq Expression: Moc04g15320
Synteny: Moc04g15320
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domains:
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
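
The genome location field above follows a chromosome:start..end pattern. A minimal Python sketch for splitting it into its parts is given here; the helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr4:11565133..11568993")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```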


Homology
GenBank top hits (e value, % identity, alignment)
XP_022154299.1 uncharacterized protein LOC111021593 [Momordica charantia] (e value: 3.9e-73; % identity: 75.25)
Query:  RLGQRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGEL
        RLGQ+AP  V TQ GNQ+ARVFALTR+E T+AE  VTGTVLV N PAYVLFDSGSS TFISTAFVRQ  L+L PLGFLL VSTPSGSV+I+SQMV+ G L
Subjt:  RLGQRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGEL

Query:  SFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLDEL
        SFD Q L ARLIQLD+RDFDVILGMDWLATNQA+INCS++EVSFQLP G SF FKGVTGGV R VSAL+AR LLQ GAWG+LASVVD     P +D +
Subjt:  SFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLDEL

XP_022154844.1 uncharacterized protein LOC111022005 [Momordica charantia] (e value: 1.0e-73; % identity: 85.63)
Query:  RLGQRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGEL
        RLGQRAP T+ TQGGN+RARVFALTRKEA DAE  VTG VLVHNVP Y LFDS SSHTFISTAFVRQATLK+E LG LLSVSTPSG+V+IASQMVRA +L
Subjt:  RLGQRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGEL

Query:  SFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLL
        SFDNQTL+ARLIQLD+RDFDVILGMDWLATNQANINC RREVSFQLPSGRSFTFKGVTGGV +AVS LKARR L
Subjt:  SFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLL

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia] (e value: 3.2e-67; % identity: 59.2)
Query:  VGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-------------RLG-------QRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVP
        +GSSSGVKRK +   + Q  R  Q   Q+Q  PP              LG       Q+ P   + QGG QRARVFALTR +   AE  VTGT+LV ++P
Subjt:  VGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-------------RLG-------QRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVP

Query:  AYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQL
        AY LFDSGSSH+FI++ FVR A L+LE LGFLLSVSTPSGSVL+ SQ+V+ G+LSFD QT E +LIQLDM+DFDVILGMDWLA N+ANINCS++EVSF+L
Subjt:  AYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQL

Query:  PSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD
        PSG++FTFK V  GV R VSALKA  LLQ GAW YLASVVD     P ++
Subjt:  PSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia] (e value: 3.6e-66; % identity: 54.61)
Query:  VGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-----------------------------------------RLGQRAPPTVSTQGGNQRARVFALT
        +GSSSGVKRK +   + QP R  Q   Q+Q  PP                                          LGQR P T + QGG  RARVFALT
Subjt:  VGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-----------------------------------------RLGQRAPPTVSTQGGNQRARVFALT

Query:  RKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGM
        R +   AE  VT TVLV ++PAY LFDSGSSH+FI++ FV  A L+LE LGFLLSVSTPSGSVL+ SQ+V+ G+LSFD QTLE +LIQLDM+DFDVILGM
Subjt:  RKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGM

Query:  DWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD
        DWLA N+ANI+CS+++VSF+LPSG++FTFKGV  GV R V ALKA  LLQ GAW YLASVVD     P ++
Subjt:  DWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia] (e value: 6.2e-95; % identity: 71.23)
Query:  MDKDISNRVQPLVEVGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-----------------------------------------RLGQRAPPTVS
        MD D+SN VQPLVEVGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP                                         RLGQRA PTVS
Subjt:  MDKDISNRVQPLVEVGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-----------------------------------------RLGQRAPPTVS

Query:  TQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARL
        TQG                       GT LVHNVPAYVLFD GSSHTFISTAFVRQATL+LEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARL
Subjt:  TQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARL

Query:  IQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD
        IQLDMRDFDVILGMDWLATNQANINCS+REVSFQLPSGRSFTFKGV+GGV RAVSALKARRLL NGAW YLASVVDI    P +D
Subjt:  IQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DLN2 uncharacterized protein LOC111021593 (e value: 1.9e-73; % identity: 75.25)
Query:  RLGQRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGEL
        RLGQ+AP  V TQ GNQ+ARVFALTR+E T+AE  VTGTVLV N PAYVLFDSGSS TFISTAFVRQ  L+L PLGFLL VSTPSGSV+I+SQMV+ G L
Subjt:  RLGQRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGEL

Query:  SFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLDEL
        SFD Q L ARLIQLD+RDFDVILGMDWLATNQA+INCS++EVSFQLP G SF FKGVTGGV R VSAL+AR LLQ GAWG+LASVVD     P +D +
Subjt:  SFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLDEL

A0A6J1DNG3 uncharacterized protein LOC111022005 (e value: 5.0e-74; % identity: 85.63)
Query:  RLGQRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGEL
        RLGQRAP T+ TQGGN+RARVFALTRKEA DAE  VTG VLVHNVP Y LFDS SSHTFISTAFVRQATLK+E LG LLSVSTPSG+V+IASQMVRA +L
Subjt:  RLGQRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGEL

Query:  SFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLL
        SFDNQTL+ARLIQLD+RDFDVILGMDWLATNQANINC RREVSFQLPSGRSFTFKGVTGGV +AVS LKARR L
Subjt:  SFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLL

A0A6J1DTA8 uncharacterized protein LOC111024114 (e value: 1.6e-67; % identity: 59.2)
Query:  VGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-------------RLG-------QRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVP
        +GSSSGVKRK +   + Q  R  Q   Q+Q  PP              LG       Q+ P   + QGG QRARVFALTR +   AE  VTGT+LV ++P
Subjt:  VGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-------------RLG-------QRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVP

Query:  AYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQL
        AY LFDSGSSH+FI++ FVR A L+LE LGFLLSVSTPSGSVL+ SQ+V+ G+LSFD QT E +LIQLDM+DFDVILGMDWLA N+ANINCS++EVSF+L
Subjt:  AYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQL

Query:  PSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD
        PSG++FTFK V  GV R VSALKA  LLQ GAW YLASVVD     P ++
Subjt:  PSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD

A0A6J1DWP4 uncharacterized protein LOC111025215 (e value: 1.7e-66; % identity: 54.61)
Query:  VGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-----------------------------------------RLGQRAPPTVSTQGGNQRARVFALT
        +GSSSGVKRK +   + QP R  Q   Q+Q  PP                                          LGQR P T + QGG  RARVFALT
Subjt:  VGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-----------------------------------------RLGQRAPPTVSTQGGNQRARVFALT

Query:  RKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGM
        R +   AE  VT TVLV ++PAY LFDSGSSH+FI++ FV  A L+LE LGFLLSVSTPSGSVL+ SQ+V+ G+LSFD QTLE +LIQLDM+DFDVILGM
Subjt:  RKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGM

Query:  DWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD
        DWLA N+ANI+CS+++VSF+LPSG++FTFKGV  GV R V ALKA  LLQ GAW YLASVVD     P ++
Subjt:  DWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD

A0A6J1DYU5 uncharacterized protein LOC111025517 (e value: 3.0e-95; % identity: 71.23)
Query:  MDKDISNRVQPLVEVGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-----------------------------------------RLGQRAPPTVS
        MD D+SN VQPLVEVGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP                                         RLGQRA PTVS
Subjt:  MDKDISNRVQPLVEVGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPP-----------------------------------------RLGQRAPPTVS

Query:  TQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARL
        TQG                       GT LVHNVPAYVLFD GSSHTFISTAFVRQATL+LEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARL
Subjt:  TQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARL

Query:  IQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD
        IQLDMRDFDVILGMDWLATNQANINCS+REVSFQLPSGRSFTFKGV+GGV RAVSALKARRLL NGAW YLASVVDI    P +D
Subjt:  IQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARRLLQNGAWGYLASVVDIMPCRPVLD

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
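
In the alignment blocks above, the middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below counts identities for a single block from its query line and middle line. The function name `block_identity` is hypothetical, and because the reported % identity covers a whole alignment, a per-block count need not reproduce it.

```python
def block_identity(query_line: str, midline: str) -> tuple[int, int]:
    """Return (identical_positions, block_length) for one alignment block.

    query_line is the full 'Query:  SEQUENCE' line; midline is the line
    directly below it, indented to the same column as the sequence.
    """
    query_line = query_line.rstrip()
    seq = query_line.split(None, 1)[1]      # sequence part after the label
    start = len(query_line) - len(seq)      # column where the sequence begins
    # Pad the midline in case trailing blanks were trimmed from it.
    mid = midline[start:start + len(seq)].ljust(len(seq))
    identical = sum(ch.isalpha() for ch in mid)  # letters mark identities
    return identical, len(seq)


# Illustrative fragment taken from the start of the first GenBank block.
identical, length = block_identity(
    "Query:  RLGQRAPPTV",
    "        RLGQ+AP  V",
)
print(identical, length, round(100 * identical / length, 2))
```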

Sequences
CDS sequence
ATGGATAAGGATATCTCCAATAGGGTCCAACCTCTGGTGGAAGTCGGATCATCTTCAGGTGTGAAAAGGAAGGTCTCTCCGGCTTACGCCGACCAGCCATTTAGAGCACC
CCAGCGCCCGGCTCAGCAGCAGGGCCTGCCACCAAGGCTAGGCCAGAGGGCTCCCCCAACAGTTTCGACGCAGGGAGGTAATCAGAGGGCTCGTGTCTTCGCACTTACTC
GCAAGGAAGCGACGGATGCCGAAATCTTTGTCACAGGTACGGTCTTAGTCCATAATGTGCCTGCTTATGTATTGTTTGACTCGGGATCGAGTCACACCTTCATCTCTACT
GCGTTTGTTCGTCAGGCAACCCTCAAACTAGAGCCGTTAGGGTTTCTGCTGTCAGTTTCTACACCTTCAGGGTCGGTTTTGATTGCTAGTCAAATGGTGAGAGCAGGTGA
GTTATCTTTTGACAATCAGACCCTAGAGGCAAGATTGATCCAACTGGACATGCGGGATTTTGACGTCATTTTGGGCATGGATTGGCTAGCTACCAACCAAGCCAACATTA
ATTGCTCGAGGAGGGAAGTCTCTTTCCAACTACCTTCGGGTCGGAGCTTTACGTTTAAAGGAGTTACGGGTGGAGTTCTAAGGGCAGTCTCAGCGTTGAAGGCAAGACGC
CTTTTACAAAATGGTGCCTGGGGATATTTGGCCAGTGTCGTCGACATTATGCCTTGTAGACCAGTACTAGACGAGTTGGATCGTTCTGAGGTGGAGTTAGCGGTGGAAGA
TATCTCGGCAGTGTTAGCTCGACTCTCGGTTGAACCCACTTTAAGACAGCGGGTCATCGCTGCACAGAAGGGAGATCCCAGCCTGAGCAAGGGTTTCGGTATGGTAGATG
AAACCTTCTGTTATAAGGAGGTACCCATTGAGATCGTAGCAAGAGAGACCAAGGTGCTGCGGAATCGGGCAATTGACTTGGTGAAGGTCTTGTGGAGGAATCACCAAGTG
GAGAAAGTAACCTGGGAAAGGGAAGACGAGATTAGAGCCCGATACCCTGAATTATTCGAACGACGAAGTTTCGAGGACGAAAGTTTCTAA
mRNA sequence
ATGGATAAGGATATCTCCAATAGGGTCCAACCTCTGGTGGAAGTCGGATCATCTTCAGGTGTGAAAAGGAAGGTCTCTCCGGCTTACGCCGACCAGCCATTTAGAGCACC
CCAGCGCCCGGCTCAGCAGCAGGGCCTGCCACCAAGGCTAGGCCAGAGGGCTCCCCCAACAGTTTCGACGCAGGGAGGTAATCAGAGGGCTCGTGTCTTCGCACTTACTC
GCAAGGAAGCGACGGATGCCGAAATCTTTGTCACAGGTACGGTCTTAGTCCATAATGTGCCTGCTTATGTATTGTTTGACTCGGGATCGAGTCACACCTTCATCTCTACT
GCGTTTGTTCGTCAGGCAACCCTCAAACTAGAGCCGTTAGGGTTTCTGCTGTCAGTTTCTACACCTTCAGGGTCGGTTTTGATTGCTAGTCAAATGGTGAGAGCAGGTGA
GTTATCTTTTGACAATCAGACCCTAGAGGCAAGATTGATCCAACTGGACATGCGGGATTTTGACGTCATTTTGGGCATGGATTGGCTAGCTACCAACCAAGCCAACATTA
ATTGCTCGAGGAGGGAAGTCTCTTTCCAACTACCTTCGGGTCGGAGCTTTACGTTTAAAGGAGTTACGGGTGGAGTTCTAAGGGCAGTCTCAGCGTTGAAGGCAAGACGC
CTTTTACAAAATGGTGCCTGGGGATATTTGGCCAGTGTCGTCGACATTATGCCTTGTAGACCAGTACTAGACGAGTTGGATCGTTCTGAGGTGGAGTTAGCGGTGGAAGA
TATCTCGGCAGTGTTAGCTCGACTCTCGGTTGAACCCACTTTAAGACAGCGGGTCATCGCTGCACAGAAGGGAGATCCCAGCCTGAGCAAGGGTTTCGGTATGGTAGATG
AAACCTTCTGTTATAAGGAGGTACCCATTGAGATCGTAGCAAGAGAGACCAAGGTGCTGCGGAATCGGGCAATTGACTTGGTGAAGGTCTTGTGGAGGAATCACCAAGTG
GAGAAAGTAACCTGGGAAAGGGAAGACGAGATTAGAGCCCGATACCCTGAATTATTCGAACGACGAAGTTTCGAGGACGAAAGTTTCTAA
Protein sequence
MDKDISNRVQPLVEVGSSSGVKRKVSPAYADQPFRAPQRPAQQQGLPPRLGQRAPPTVSTQGGNQRARVFALTRKEATDAEIFVTGTVLVHNVPAYVLFDSGSSHTFIST
AFVRQATLKLEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGVTGGVLRAVSALKARR
LLQNGAWGYLASVVDIMPCRPVLDELDRSEVELAVEDISAVLARLSVEPTLRQRVIAAQKGDPSLSKGFGMVDETFCYKEVPIEIVARETKVLRNRAIDLVKVLWRNHQV
EKVTWEREDEIRARYPELFERRSFEDESF
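
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the helper name `translate` is hypothetical and is not part of CuGenDB. The two calls at the end use the first 18 and the last 12 nucleotides of the CDS shown above.

```python
BASES = "TCAG"
# Standard genetic code, one amino acid per codon in TCAG order ('*' = stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks and spaces allowed) codon by codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGGATAAGGATATCTCC"))  # first six codons of the CDS
print(translate("GAAAGTTTCTAA"))        # last four codons, including the stop
```

Pasting the full CDS into `translate` and stripping the trailing `*` gives a string that can be compared directly with the protein sequence.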