CuGenDBv2

Moc08g29430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g29430
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ty3/gypsy retrotransposon protein
Genome location: chr8:21134305..21136313
RNA-Seq Expression: Moc08g29430
Synteny: Moc08g29430
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
    IPR001969 - Aspartic peptidase, active site
    IPR021109 - Aspartic peptidase domain superfamily
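The genome location field above is written as chromosome:start..end. The following is a minimal sketch of parsing that string and computing the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the names Location and parse_location are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count (assumption).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(chrom, start, end)


loc = parse_location("chr8:21134305..21136313")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene works out to 2009 bp.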


Homology
GenBank top hits (e value, % identity, alignment)
XP_019054932.1 PREDICTED: uncharacterized protein LOC109115387 [Nelumbo nucifera] (e value: 1.3e-13; % identity: 33.72)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH
        +K+ G + SR+VV+L+DSGASHNFI+E+LV+ L LP +P+  +G+ +G GD V+ S VC+  +  +N P   ++ D   LKL                  
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH

Query:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG
                  G   V        +  +    W +  M   L   R+ LQGD SL  SQVSL +MM  ++ EG
Subjt:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG

XP_022897442.1 uncharacterized protein LOC111411108 [Olea europaea var. sylvestris] (e value: 5.2e-15; % identity: 35.47)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH
        MKL G +  R+VVVL+DSGASHNFI+  +V+ L LP+S +  YG+I+GTG +V+   +C+     V+   P L+ ++  L L           +G++   
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH

Query:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG
          M++++ K+GR                Q  W+S  M F +   +I LQGD SL ++ VSLK+M KA + +G
Subjt:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG

XP_031737572.1 uncharacterized protein LOC116402461 [Cucumis sativus] (e value: 4.3e-17; % identity: 37.43)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH
        +K+ G +R R+VVVLID GA+HNFIAE +V+ L + +    +YG++LGTG  V+A+ VCK                 + LK+              + TH
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH

Query:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGE
            F+   +G   V   +       +V + ++ SEM+F   EW + LQGDRSLV+SQVSLKSMMK    E
Subjt:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGE

XP_038904464.1 uncharacterized protein LOC120090832 [Benincasa hispida] (e value: 2.9e-21; % identity: 38.95)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH
        +K+ G++  + VVVLIDSGASHNFI + LV  L LP  P+ SYGI+LG G SV+ + VCKG   ++N    T++ D   L L                  
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH

Query:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG
                  G   V   +   +    V+  W +SEM+FQ+ +W++HL+G+R+L+K+Q+SLKSMMK +R EG
Subjt:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG

XP_038907170.1 uncharacterized protein LOC120092972 [Benincasa hispida] (e value: 1.3e-21; % identity: 40.7)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH
        +KL+G+V    VVVLIDSGA+HNFI + LV  L+LP SP+ S GI+LGTG SV+ + +CKG   ++N P  T++ D                        
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH

Query:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG
            F    +G   V   +   +   +V+  W +S+MDFQ+ E R+HL+ DRSLVKSQ+SLKSMMK +   G
Subjt:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG

TrEMBL top hits (e value, % identity, alignment)
A0A1U8Q8J7 uncharacterized protein LOC109115387 (e value: 6.2e-14; % identity: 33.72)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH
        +K+ G + SR+VV+L+DSGASHNFI+E+LV+ L LP +P+  +G+ +G GD V+ S VC+  +  +N P   ++ D   LKL                  
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH

Query:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG
                  G   V        +  +    W +  M   L   R+ LQGD SL  SQVSL +MM  ++ EG
Subjt:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG

A0A5C7HUZ0 Chromo domain-containing protein (e value: 1.2e-12; % identity: 33.71)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPK------TMVNHPFPTLLTDQIVLKLIVEHFFYTYASV
        MK+ G V  ++VV+LID GA+HNFI+ +LV+ L LPI+   +YG+ +GTGDSV+   +CKG         +V    P  L    V+  I        A++
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPK------TMVNHPFPTLLTDQIVLKLIVEHFFYTYASV

Query:  GTTKTHWKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG
        G T T+WK++ +K                               FQL    + L+ D SL K+ VSLK+MM+  + EG
Subjt:  GTTKTHWKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG

A0A5C7IJS7 Uncharacterized protein (e value: 3.1e-13; % identity: 34.27)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPK------TMVNHPFPTLLTDQIVLKLIVEHFFYTYASV
        MK+ G V  ++VV LID GA+HNFI+  LV+ L LPI+ + +YG+ +GTGDSV+   +CKG         +V    P  L    V+  I        A++
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPK------TMVNHPFPTLLTDQIVLKLIVEHFFYTYASV

Query:  GTTKTHWKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG
        G T T+WK++ +K                               FQL    + L+GD SL K+ VSLK+MM+  + EG
Subjt:  GTTKTHWKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG

A0A5D3BSP2 Ty3/gypsy retrotransposon protein (e value: 2.6e-12; % identity: 29.76)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH
        MK+ G ++ R+V++LID GA+HNFI+E LV+ L LP+  +  YG+ILG+G +VQ   +C+                                +V    ++
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH

Query:  WKMR--FIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMK
        WK++  F+  ++G   V   +    +       W++  + F     +I ++GD SL KS++SLKSM+K
Subjt:  WKMR--FIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMK

A0A803PSM5 Uncharacterized protein (e value: 9.0e-13; % identity: 33.14)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH
        MKL G +  + V VLIDSGA+HNFI+  +V    +PI+ +  YGI+LGTGD V+A  VC                 Q+ L+L            G  +  
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTH

Query:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG
            F+  ++G   V   +        +Q  WR+  M F+     + LQGD SL KSQ+SLK+M+  +   G
Subjt:  WKMRFIKGKVGRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G29750.1 Eukaryotic aspartyl protease family protein (e value: 2.3e-05; % identity: 31.43)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPT---LLTDQIVLKLIVEHFFYTYASVGTT
        M+  G +   KVVV IDSGA+ NFI   L   L LP S +    ++LG    +Q+   C G +  V     T   LL D     + V   +   + +G T
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPT---LLTDQIVLKLIVEHFFYTYASVGTT

Query:  KTHWK
          +W+
Subjt:  KTHWK

AT3G30770.1 Eukaryotic aspartyl protease family protein (e value: 4.7e-06; % identity: 37.88)
Query:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMV
        M+  G +   KVVV+IDSGA++NFI++ L   L LP S +    ++LG    +Q    C G   +V
Subjt:  MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMV


Sequences
CDS sequence
ATGAAGCTCGTCGGGAGTGTCCGCAGTCGGAAGGTCGTAGTGTTGATCGATAGTGGAGCTTCCCACAACTTCATCGCTGAAAGTTTAGTACGAGGGTTGGATCTGCCTAT
CTCACCCTCATTCAGTTATGGCATAATCTTGGGAACGGGCGATTCCGTGCAAGCCTCAAAAGTATGTAAAGGTCCTAAAACGATGGTGAATCACCCATTTCCAACTCTTT
TGACTGACCAAATCGTGCTCAAACTGATAGTGGAACACTTCTTCTACACGTATGCATCGGTTGGAACCACCAAAACGCATTGGAAGATGCGTTTCATTAAGGGCAAAGTT
GGAAGAAAAACTGTTTTCTGCAGGCTTCCCAGGCGCCTGGCGCCTCCTGAGGTCCAATACGGTTGGCGATCATCTGAGATGGATTTTCAGCTTCGAGAATGGAGAATTCA
TCTTCAAGGAGACCGTAGTCTCGTGAAATCTCAGGTTTCATTGAAATCCATGATGAAGGCCATTAGGGGGGAAGGATAG
mRNA sequence
ATGAAGCTCGTCGGGAGTGTCCGCAGTCGGAAGGTCGTAGTGTTGATCGATAGTGGAGCTTCCCACAACTTCATCGCTGAAAGTTTAGTACGAGGGTTGGATCTGCCTAT
CTCACCCTCATTCAGTTATGGCATAATCTTGGGAACGGGCGATTCCGTGCAAGCCTCAAAAGTATGTAAAGGTCCTAAAACGATGGTGAATCACCCATTTCCAACTCTTT
TGACTGACCAAATCGTGCTCAAACTGATAGTGGAACACTTCTTCTACACGTATGCATCGGTTGGAACCACCAAAACGCATTGGAAGATGCGTTTCATTAAGGGCAAAGTT
GGAAGAAAAACTGTTTTCTGCAGGCTTCCCAGGCGCCTGGCGCCTCCTGAGGTCCAATACGGTTGGCGATCATCTGAGATGGATTTTCAGCTTCGAGAATGGAGAATTCA
TCTTCAAGGAGACCGTAGTCTCGTGAAATCTCAGGTTTCATTGAAATCCATGATGAAGGCCATTAGGGGGGAAGGATAG
Protein sequence
MKLVGSVRSRKVVVLIDSGASHNFIAESLVRGLDLPISPSFSYGIILGTGDSVQASKVCKGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTHWKMRFIKGKV
GRKTVFCRLPRRLAPPEVQYGWRSSEMDFQLREWRIHLQGDRSLVKSQVSLKSMMKAIRGEG
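
The protein sequence can be cross-checked against the CDS by translating the nucleotides codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly. The translate function is a hypothetical helper written for illustration, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), laid out in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt of the CDS listed above.
print(translate("ATGAAGCTCGTCGGGAGTGTCCGCAGTCGGAAG"))  # MKLVGSVRSRK
```

Pasting the full 519 nt CDS into translate gives a peptide that can be compared directly with the protein sequence shown above.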