; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g12800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g12800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr6:9892863..9893866
RNA-Seq ExpressionMoc06g12800
SyntenyMoc06g12800
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050721.1 methionine aminopeptidase 1B [Cucumis melo var. makuwa]2.7e-1529.68Show/hide
Query:  IKKAMSTLCKKAEEQRKMSEKIYIMLEGFVLE-IISGKSLSISQLGEYSMQKDKPTEETS----------------RIIQIHDQGLDWNKFKKVEMSIF-
        I+  +S + K  E+Q++M   + +M+E    +  + G+  S S   E ++ K K  E TS                +  +I +   D +KFKKVEM +F 
Subjt:  IKKAMSTLCKKAEEQRKMSEKIYIMLEGFVLE-IISGKSLSISQLGEYSMQKDKPTEETS----------------RIIQIHDQGLDWNKFKKVEMSIF-

Query:  -------------------TEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSR
                            E +V +++  F+ L  PL+ + +EV++  F+NGL P +  EV    P RL ++M  AQL+E+  +   EA+ N +  TS 
Subjt:  -------------------TEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSR

Query:  VSAVGSKATMKTLESTLTSTVTLASKAGTTTPIDT-------APTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCK
                T+   ++T T+TVT  +K  T  PI T           ++E   +RL + E++ +KEKGLCF+C +KY+ DH+CK
Subjt:  VSAVGSKATMKTLESTLTSTVTLASKAGTTTPIDT-------APTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCK

RVW39682.1 Retrovirus-related Pol polyprotein from transposon 297 [Vitis vinifera]1.5e-1335.4Show/hide
Query:  EGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADE-NAID--RTSRVSAVGSKATMKTLESTL
        +G+VA ++  FE L TPL  +S+EV++  F+NGL P +  E+   +P  LG +M  AQ +ED  +A++ A E N  +  R++++ +  ++   K  E+  
Subjt:  EGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADE-NAID--RTSRVSAVGSKATMKTLESTL

Query:  TSTVTLASKAGTTTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE
        T  V +  K           + ++E   KRL E E + ++EKGLCFKCE+K++  HRCK E
Subjt:  TSTVTLASKAGTTTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE

RVX08292.1 Heavy metal-associated isoprenylated plant protein 46 [Vitis vinifera]1.9e-1332.6Show/hide
Query:  IQIHDQGLDWNKFKKVEMSIFT-EGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDR
        I  H     W +   V +  F  +G+VA  +  FE + TPL  +SDEV++  F+NGL P +  E+   +P  LG +M  AQ +ED  + +  A E    +
Subjt:  IQIHDQGLDWNKFKKVEMSIFT-EGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDR

Query:  TSRVSAVGSKATMKTLESTLTSTVTLASKAGTTTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE
         +++ +  ++   K  E+  T  V ++ K             ++E   KRL+E E + +KEKGLCFKC++K++  HRCK E
Subjt:  TSRVSAVGSKATMKTLESTLTSTVTLASKAGTTTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE

XP_022131713.1 uncharacterized protein LOC111004816 [Momordica charantia]3.2e-3254.88Show/hide
Query:  MSIFTEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSRVSAVGSKATMKTLES
        M+I  EGSV ++QE FEAL      L +EVL+  +LNGL+PI+  EVLA++PT L QIM RAQLIED A   QE  E   +R S+    G K  +KT E+
Subjt:  MSIFTEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSRVSAVGSKATMKTLES

Query:  TLTSTVTLASKAGT-TTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE
        T T +VTLASK GT  T     PT KKE  YKRL E+EYRK+ E GLCF+CEKKY+V HRC+N+
Subjt:  TLTSTVTLASKAGT-TTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE

XP_022148929.1 uncharacterized protein LOC111017476 [Momordica charantia]8.1e-4449.38Show/hide
Query:  MIVKELKTRCNATE-EINDIKKAMSTLCKKAEEQRKMSEKIYIMLEGFVLEIISGKSLSISQLGEYSMQKDKPTEETSRIIQIHDQGLDWNKFKKVEMSI
        M+VKEL++RC A E E+ DIK A++ +  K EEQR MSE+I  MLE F+ EII GK+ SISQLGE  +QK+K TEE +R++QI+DQ  D NKFKKVEM I
Subjt:  MIVKELKTRCNATE-EINDIKKAMSTLCKKAEEQRKMSEKIYIMLEGFVLEIISGKSLSISQLGEYSMQKDKPTEETSRIIQIHDQGLDWNKFKKVEMSI

Query:  F----------------------------------------------------TEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATE
        F                                                     EGSVA++QE FEAL  PL +LSDEVL+C FLNGLD +V  +VLATE
Subjt:  F----------------------------------------------------TEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATE

Query:  PTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSRVSAVGSK
        P  L QIMHRAQL+EDIA AVQEA E A+DR ++V+ +G K
Subjt:  PTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSRVSAVGSK

TrEMBL top hitse value%identityAlignment
A0A438DW91 Retrovirus-related Pol polyprotein from transposon 2977.2e-1435.4Show/hide
Query:  EGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADE-NAID--RTSRVSAVGSKATMKTLESTL
        +G+VA ++  FE L TPL  +S+EV++  F+NGL P +  E+   +P  LG +M  AQ +ED  +A++ A E N  +  R++++ +  ++   K  E+  
Subjt:  EGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADE-NAID--RTSRVSAVGSKATMKTLESTL

Query:  TSTVTLASKAGTTTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE
        T  V +  K           + ++E   KRL E E + ++EKGLCFKCE+K++  HRCK E
Subjt:  TSTVTLASKAGTTTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE

A0A438JH68 Heavy metal-associated isoprenylated plant protein 469.4e-1432.6Show/hide
Query:  IQIHDQGLDWNKFKKVEMSIFT-EGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDR
        I  H     W +   V +  F  +G+VA  +  FE + TPL  +SDEV++  F+NGL P +  E+   +P  LG +M  AQ +ED  + +  A E    +
Subjt:  IQIHDQGLDWNKFKKVEMSIFT-EGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDR

Query:  TSRVSAVGSKATMKTLESTLTSTVTLASKAGTTTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE
         +++ +  ++   K  E+  T  V ++ K             ++E   KRL+E E + +KEKGLCFKC++K++  HRCK E
Subjt:  TSRVSAVGSKATMKTLESTLTSTVTLASKAGTTTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE

A0A5A7UAX7 Methionine aminopeptidase1.3e-1529.68Show/hide
Query:  IKKAMSTLCKKAEEQRKMSEKIYIMLEGFVLE-IISGKSLSISQLGEYSMQKDKPTEETS----------------RIIQIHDQGLDWNKFKKVEMSIF-
        I+  +S + K  E+Q++M   + +M+E    +  + G+  S S   E ++ K K  E TS                +  +I +   D +KFKKVEM +F 
Subjt:  IKKAMSTLCKKAEEQRKMSEKIYIMLEGFVLE-IISGKSLSISQLGEYSMQKDKPTEETS----------------RIIQIHDQGLDWNKFKKVEMSIF-

Query:  -------------------TEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSR
                            E +V +++  F+ L  PL+ + +EV++  F+NGL P +  EV    P RL ++M  AQL+E+  +   EA+ N +  TS 
Subjt:  -------------------TEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSR

Query:  VSAVGSKATMKTLESTLTSTVTLASKAGTTTPIDT-------APTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCK
                T+   ++T T+TVT  +K  T  PI T           ++E   +RL + E++ +KEKGLCF+C +KY+ DH+CK
Subjt:  VSAVGSKATMKTLESTLTSTVTLASKAGTTTPIDT-------APTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCK

A0A6J1BU87 uncharacterized protein LOC1110048161.5e-3254.88Show/hide
Query:  MSIFTEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSRVSAVGSKATMKTLES
        M+I  EGSV ++QE FEAL      L +EVL+  +LNGL+PI+  EVLA++PT L QIM RAQLIED A   QE  E   +R S+    G K  +KT E+
Subjt:  MSIFTEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSRVSAVGSKATMKTLES

Query:  TLTSTVTLASKAGT-TTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE
        T T +VTLASK GT  T     PT KKE  YKRL E+EYRK+ E GLCF+CEKKY+V HRC+N+
Subjt:  TLTSTVTLASKAGT-TTPIDTAPTAKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE

A0A6J1D5H9 uncharacterized protein LOC1110174763.9e-4449.38Show/hide
Query:  MIVKELKTRCNATE-EINDIKKAMSTLCKKAEEQRKMSEKIYIMLEGFVLEIISGKSLSISQLGEYSMQKDKPTEETSRIIQIHDQGLDWNKFKKVEMSI
        M+VKEL++RC A E E+ DIK A++ +  K EEQR MSE+I  MLE F+ EII GK+ SISQLGE  +QK+K TEE +R++QI+DQ  D NKFKKVEM I
Subjt:  MIVKELKTRCNATE-EINDIKKAMSTLCKKAEEQRKMSEKIYIMLEGFVLEIISGKSLSISQLGEYSMQKDKPTEETSRIIQIHDQGLDWNKFKKVEMSI

Query:  F----------------------------------------------------TEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATE
        F                                                     EGSVA++QE FEAL  PL +LSDEVL+C FLNGLD +V  +VLATE
Subjt:  F----------------------------------------------------TEGSVAKFQEPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATE

Query:  PTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSRVSAVGSK
        P  L QIMHRAQL+EDIA AVQEA E A+DR ++V+ +G K
Subjt:  PTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSRVSAVGSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGATAGTGAAAGAGTTGAAGACAAGGTGCAATGCGACGGAGGAGATTAATGACATCAAGAAAGCAATGTCGACCTTGTGTAAGAAGGCGGAGGAACAACGAAAGAT
GAGTGAAAAAATCTACATAATGCTGGAAGGATTTGTATTGGAGATCATCAGTGGTAAATCCCTGTCAATCTCGCAACTGGGCGAGTACTCGATGCAGAAGGATAAGCCGA
CGGAAGAGACGTCGAGGATTATCCAAATCCACGACCAAGGATTGGATTGGAACAAATTCAAGAAGGTTGAGATGTCTATATTCACAGAAGGAAGTGTTGCCAAGTTTCAA
GAGCCATTCGAAGCATTGTTTACTCCCCTAGCTAAACTTTCTGATGAGGTTCTCCAATGTGCCTTCCTGAATGGGCTTGATCCTATTGTACTGGTAGAGGTCTTGGCTAC
AGAGCCCACGCGCCTAGGCCAAATTATGCACAGGGCCCAACTGATTGAGGATATTGCAATAGCAGTTCAAGAAGCAGATGAGAATGCGATTGATCGAACCAGCAGGGTTA
GTGCCGTGGGAAGTAAGGCAACTATGAAGACGTTGGAGTCCACCCTCACTAGTACGGTCACATTGGCTAGCAAAGCAGGAACAACGACGCCTATAGACACAGCACCGACA
GCCAAGAAAGAAGCCACTTATAAGCGCTTGATAGAGGATGAGTATCGGAAGCAGAAGGAGAAGGGCCTTTGTTTTAAATGTGAGAAAAAATACACTGTTGACCATCGGTG
TAAGAATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGATGATAGTGAAAGAGTTGAAGACAAGGTGCAATGCGACGGAGGAGATTAATGACATCAAGAAAGCAATGTCGACCTTGTGTAAGAAGGCGGAGGAACAACGAAAGAT
GAGTGAAAAAATCTACATAATGCTGGAAGGATTTGTATTGGAGATCATCAGTGGTAAATCCCTGTCAATCTCGCAACTGGGCGAGTACTCGATGCAGAAGGATAAGCCGA
CGGAAGAGACGTCGAGGATTATCCAAATCCACGACCAAGGATTGGATTGGAACAAATTCAAGAAGGTTGAGATGTCTATATTCACAGAAGGAAGTGTTGCCAAGTTTCAA
GAGCCATTCGAAGCATTGTTTACTCCCCTAGCTAAACTTTCTGATGAGGTTCTCCAATGTGCCTTCCTGAATGGGCTTGATCCTATTGTACTGGTAGAGGTCTTGGCTAC
AGAGCCCACGCGCCTAGGCCAAATTATGCACAGGGCCCAACTGATTGAGGATATTGCAATAGCAGTTCAAGAAGCAGATGAGAATGCGATTGATCGAACCAGCAGGGTTA
GTGCCGTGGGAAGTAAGGCAACTATGAAGACGTTGGAGTCCACCCTCACTAGTACGGTCACATTGGCTAGCAAAGCAGGAACAACGACGCCTATAGACACAGCACCGACA
GCCAAGAAAGAAGCCACTTATAAGCGCTTGATAGAGGATGAGTATCGGAAGCAGAAGGAGAAGGGCCTTTGTTTTAAATGTGAGAAAAAATACACTGTTGACCATCGGTG
TAAGAATGAATAG
Protein sequenceShow/hide protein sequence
MMIVKELKTRCNATEEINDIKKAMSTLCKKAEEQRKMSEKIYIMLEGFVLEIISGKSLSISQLGEYSMQKDKPTEETSRIIQIHDQGLDWNKFKKVEMSIFTEGSVAKFQ
EPFEALFTPLAKLSDEVLQCAFLNGLDPIVLVEVLATEPTRLGQIMHRAQLIEDIAIAVQEADENAIDRTSRVSAVGSKATMKTLESTLTSTVTLASKAGTTTPIDTAPT
AKKEATYKRLIEDEYRKQKEKGLCFKCEKKYTVDHRCKNE