; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g17080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g17080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome locationchr4:12602717..12610168
RNA-Seq ExpressionMoc04g17080
SyntenyMoc04g17080
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]1.1e-3764.71Show/hide
Query:  MNEPKMRAAKAKAAEAKKKVVAPEPVDTIELDLSEGEEVETQWNAANLATRTSLMK-------------------------------------------S
        MNEPK RAAKAKAAEAKKKVVAP PVD IELDLSEGE+VET WNAANLATRTSLMK                                            
Subjt:  MNEPKMRAAKAKAAEAKKKVVAPEPVDTIELDLSEGEEVETQWNAANLATRTSLMK-------------------------------------------S

Query:  LREFYATVHPQSHIAIVCGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTLE
        +REFYA  HPQSHIAIV GKEIRFDATQINYTFNIKNI+DAVGNKMLVTPTLE
Subjt:  LREFYATVHPQSHIAIVCGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTLE

XP_022156881.1 uncharacterized protein LOC111023717 [Momordica charantia]5.2e-4078.76Show/hide
Query:  PSIDSVH------VVKEFLDVFPKDLPGLPLSLDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQRTFKDESF
        PS+D+VH      ++++++      L   PL LDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQRTFKDESF
Subjt:  PSIDSVH------VVKEFLDVFPKDLPGLPLSLDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQRTFKDESF

Query:  LRGEVCNAPRYSG
        LRGEVCNAPRYSG
Subjt:  LRGEVCNAPRYSG

XP_022156985.1 uncharacterized protein LOC111023814 [Momordica charantia]1.0e-2746.15Show/hide
Query:  HAASTVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQT----------------------------
        + +S VLVH VPAY LFDSGSSHT IS AFV QANL LEPLGFLLSVSTPSGS M  SQMVR G+LS  + T                            
Subjt:  HAASTVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQT----------------------------

Query:  ---------------------------------------LERGAWGYLASVIDISKTTPSIDSVHVVKEFLDVFPKDLPGLP
                                               L+ GAWGYL SV+D SK TPS+DSV V  EF DVFP+DLPGLP
Subjt:  ---------------------------------------LERGAWGYLASVIDISKTTPSIDSVHVVKEFLDVFPKDLPGLP

XP_022157851.1 uncharacterized protein LOC111024467 [Momordica charantia]4.8e-3042.92Show/hide
Query:  HNASHSAVAHAAS-------TVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQTLERGAWGYLASV
        H    S VA A S         +VH  P     D       + +  V  A L +EP      ++   G   +  +    G          RG+   + S 
Subjt:  HNASHSAVAHAAS-------TVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQTLERGAWGYLASV

Query:  IDISKTTPSIDSVH------VVKEFLDVFPKDLPGLPLSLDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQR
           +   PS+++VH      ++++++      L   PL LDETL Y +V IEILAKETKVLRN +IDLVK+ WRNHQVEE TWEREDEI+ARY ELFDQR
Subjt:  IDISKTTPSIDSVH------VVKEFLDVFPKDLPGLPLSLDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQR

Query:  TFKDESFLRGEVCNAPRYS
        TF+DESFLRGEVCNAP YS
Subjt:  TFKDESFLRGEVCNAPRYS

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]1.4e-3251.09Show/hide
Query:  VAHAASTVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQTLE------------------------
        V+    T LVH VPAYVLFD GSSHT IS AFVRQA LELEPLGFLLSVSTPSGSV+IASQMVRAGELS+DNQTLE                        
Subjt:  VAHAASTVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQTLE------------------------

Query:  -------------------------------------------RGAWGYLASVIDISKTTPSIDSVHVVKEFLDVFPKDLPGLP
                                                    GAW YLASV+DIS T PSIDS HVVK F DVFP+DL GLP
Subjt:  -------------------------------------------RGAWGYLASVIDISKTTPSIDSVHVVKEFLDVFPKDLPGLP

TrEMBL top hitse value%identityAlignment
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195155.2e-3864.71Show/hide
Query:  MNEPKMRAAKAKAAEAKKKVVAPEPVDTIELDLSEGEEVETQWNAANLATRTSLMK-------------------------------------------S
        MNEPK RAAKAKAAEAKKKVVAP PVD IELDLSEGE+VET WNAANLATRTSLMK                                            
Subjt:  MNEPKMRAAKAKAAEAKKKVVAPEPVDTIELDLSEGEEVETQWNAANLATRTSLMK-------------------------------------------S

Query:  LREFYATVHPQSHIAIVCGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTLE
        +REFYA  HPQSHIAIV GKEIRFDATQINYTFNIKNI+DAVGNKMLVTPTLE
Subjt:  LREFYATVHPQSHIAIVCGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTLE

A0A6J1DRW8 uncharacterized protein LOC1110238144.9e-2846.15Show/hide
Query:  HAASTVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQT----------------------------
        + +S VLVH VPAY LFDSGSSHT IS AFV QANL LEPLGFLLSVSTPSGS M  SQMVR G+LS  + T                            
Subjt:  HAASTVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQT----------------------------

Query:  ---------------------------------------LERGAWGYLASVIDISKTTPSIDSVHVVKEFLDVFPKDLPGLP
                                               L+ GAWGYL SV+D SK TPS+DSV V  EF DVFP+DLPGLP
Subjt:  ---------------------------------------LERGAWGYLASVIDISKTTPSIDSVHVVKEFLDVFPKDLPGLP

A0A6J1DWA8 uncharacterized protein LOC1110237172.5e-4078.76Show/hide
Query:  PSIDSVH------VVKEFLDVFPKDLPGLPLSLDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQRTFKDESF
        PS+D+VH      ++++++      L   PL LDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQRTFKDESF
Subjt:  PSIDSVH------VVKEFLDVFPKDLPGLPLSLDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQRTFKDESF

Query:  LRGEVCNAPRYSG
        LRGEVCNAPRYSG
Subjt:  LRGEVCNAPRYSG

A0A6J1DYU5 uncharacterized protein LOC1110255176.6e-3351.09Show/hide
Query:  VAHAASTVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQTLE------------------------
        V+    T LVH VPAYVLFD GSSHT IS AFVRQA LELEPLGFLLSVSTPSGSV+IASQMVRAGELS+DNQTLE                        
Subjt:  VAHAASTVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQTLE------------------------

Query:  -------------------------------------------RGAWGYLASVIDISKTTPSIDSVHVVKEFLDVFPKDLPGLP
                                                    GAW YLASV+DIS T PSIDS HVVK F DVFP+DL GLP
Subjt:  -------------------------------------------RGAWGYLASVIDISKTTPSIDSVHVVKEFLDVFPKDLPGLP

A0A6J1DZD5 uncharacterized protein LOC1110244672.3e-3042.92Show/hide
Query:  HNASHSAVAHAAS-------TVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQTLERGAWGYLASV
        H    S VA A S         +VH  P     D       + +  V  A L +EP      ++   G   +  +    G          RG+   + S 
Subjt:  HNASHSAVAHAAS-------TVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQTLERGAWGYLASV

Query:  IDISKTTPSIDSVH------VVKEFLDVFPKDLPGLPLSLDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQR
           +   PS+++VH      ++++++      L   PL LDETL Y +V IEILAKETKVLRN +IDLVK+ WRNHQVEE TWEREDEI+ARY ELFDQR
Subjt:  IDISKTTPSIDSVH------VVKEFLDVFPKDLPGLPLSLDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQR

Query:  TFKDESFLRGEVCNAPRYS
        TF+DESFLRGEVCNAP YS
Subjt:  TFKDESFLRGEVCNAPRYS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGAACCTAAAATGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGAGCCAGTTGATACAATCGAACTAGACTTGTCCGAGGGCGA
GGAGGTCGAGACGCAATGGAACGCGGCGAATTTAGCCACTCGCACTTCATTAATGAAATCCCTCAGAGAGTTCTACGCTACTGTTCATCCCCAGTCACATATAGCCATAG
TGTGTGGGAAGGAAATAAGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTAAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTAGAA
CAGCTCGGTGAGGCTCTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAGACCCGAGGATGTTTCACTAGCTGCTGCAGG
ATGGTTGTATATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGGCACTCAGGATAGGGCCCTGCTGGTTTATGCCATGCTAAAGGGCATAGATGTGAATT
ATGGAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTTGCGACAAGGTGTGCAGCTC
CCTGCGGATCAAATAAAGAGAGATGCCCCAGTTGTGGAAGAGAAGAATATTCGGAGAATTATCGCCCATGCGCTACAAAGAAAGGAAGGTACTGGGACGTCTCATACATC
GGAGATCCGTCGTCTCCGAGACGAGAACCAACAGCTGCGAGATCAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGTCGGAAACATCCTCATGGGAGGCG
CCAGGCGCCTGGGAAGGTTGCAGAAAACTGTTTTTCTTCCAACTTTGCCCTTAATGAAACGTGTCTTCCAATGCATTTTGGTGGTTCTTACCGATGCATACGTGTGCGCT
TGCATCTATATAATTGTTACCACGCTTAGTGCGTGTGTTCAGCATAACGCCAGTCACTCTGCAGTAGCGCATGCTGCCAGTACGGTATTAGTCCATAAAGTGCCTGCTTA
CGTATTGTTTGATTCGGGGTCAAGTCACACTATTATTTCCAATGCATTTGTTCGTCAAGCAAACCTTGAACTAGAGCCGTTAGGTTTTTTGTTGTCAGTATCTACGCCAT
CAGGGTCAGTTATGATTGCTAGTCAAATGGTGAGAGCAGGCGAGTTATCCTACGACAATCAGACCCTGGAGCGTGGTGCTTGGGGTTATTTGGCGAGTGTCATCGACATT
AGTAAGACTACACCCAGTATCGACTCCGTCCACGTGGTTAAGGAATTCCTGGACGTGTTCCCTAAAGACCTCCCGGGGCTACCCCTGTCCTTAGATGAAACCTTGTGCTA
TATAGAGGTACATATTGAGATCTTAGCAAAAGAAACCAAGGTGCTGAGGAATCGGGCGATTGACTTGGTGAAGATCCTGTGGAGGAACCACCAAGTGGAGGAAGCTACCT
GGGAAAGGGAAGACGAGATCAGAGCCCGCTATCTTGAGTTGTTCGATCAACGAACTTTCAAGGACGAAAGTTTTTTAAGAGGGGAAGTCTGTAACGCCCCGCGTTACTCA
GGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATGAACCTAAAATGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGAGCCAGTTGATACAATCGAACTAGACTTGTCCGAGGGCGA
GGAGGTCGAGACGCAATGGAACGCGGCGAATTTAGCCACTCGCACTTCATTAATGAAATCCCTCAGAGAGTTCTACGCTACTGTTCATCCCCAGTCACATATAGCCATAG
TGTGTGGGAAGGAAATAAGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTAAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTAGAA
CAGCTCGGTGAGGCTCTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAGACCCGAGGATGTTTCACTAGCTGCTGCAGG
ATGGTTGTATATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGGCACTCAGGATAGGGCCCTGCTGGTTTATGCCATGCTAAAGGGCATAGATGTGAATT
ATGGAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTTGCGACAAGGTGTGCAGCTC
CCTGCGGATCAAATAAAGAGAGATGCCCCAGTTGTGGAAGAGAAGAATATTCGGAGAATTATCGCCCATGCGCTACAAAGAAAGGAAGGTACTGGGACGTCTCATACATC
GGAGATCCGTCGTCTCCGAGACGAGAACCAACAGCTGCGAGATCAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGTCGGAAACATCCTCATGGGAGGCG
CCAGGCGCCTGGGAAGGTTGCAGAAAACTGTTTTTCTTCCAACTTTGCCCTTAATGAAACGTGTCTTCCAATGCATTTTGGTGGTTCTTACCGATGCATACGTGTGCGCT
TGCATCTATATAATTGTTACCACGCTTAGTGCGTGTGTTCAGCATAACGCCAGTCACTCTGCAGTAGCGCATGCTGCCAGTACGGTATTAGTCCATAAAGTGCCTGCTTA
CGTATTGTTTGATTCGGGGTCAAGTCACACTATTATTTCCAATGCATTTGTTCGTCAAGCAAACCTTGAACTAGAGCCGTTAGGTTTTTTGTTGTCAGTATCTACGCCAT
CAGGGTCAGTTATGATTGCTAGTCAAATGGTGAGAGCAGGCGAGTTATCCTACGACAATCAGACCCTGGAGCGTGGTGCTTGGGGTTATTTGGCGAGTGTCATCGACATT
AGTAAGACTACACCCAGTATCGACTCCGTCCACGTGGTTAAGGAATTCCTGGACGTGTTCCCTAAAGACCTCCCGGGGCTACCCCTGTCCTTAGATGAAACCTTGTGCTA
TATAGAGGTACATATTGAGATCTTAGCAAAAGAAACCAAGGTGCTGAGGAATCGGGCGATTGACTTGGTGAAGATCCTGTGGAGGAACCACCAAGTGGAGGAAGCTACCT
GGGAAAGGGAAGACGAGATCAGAGCCCGCTATCTTGAGTTGTTCGATCAACGAACTTTCAAGGACGAAAGTTTTTTAAGAGGGGAAGTCTGTAACGCCCCGCGTTACTCA
GGTTAG
Protein sequenceShow/hide protein sequence
MNEPKMRAAKAKAAEAKKKVVAPEPVDTIELDLSEGEEVETQWNAANLATRTSLMKSLREFYATVHPQSHIAIVCGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTLE
QLGEALECVGKPSATWDLTTHGKVRLRPEDVSLAAAGWLYIVKNRILPTEHDEHGTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQL
PADQIKRDAPVVEEKNIRRIIAHALQRKEGTGTSHTSEIRRLRDENQQLRDQDHLGVQINQKKQKVGNILMGGARRLGRLQKTVFLPTLPLMKRVFQCILVVLTDAYVCA
CIYIIVTTLSACVQHNASHSAVAHAASTVLVHKVPAYVLFDSGSSHTIISNAFVRQANLELEPLGFLLSVSTPSGSVMIASQMVRAGELSYDNQTLERGAWGYLASVIDI
SKTTPSIDSVHVVKEFLDVFPKDLPGLPLSLDETLCYIEVHIEILAKETKVLRNRAIDLVKILWRNHQVEEATWEREDEIRARYLELFDQRTFKDESFLRGEVCNAPRYS
G