; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS017632 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS017632
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAspartic proteinase nepenthesin-2-like
Genome locationscaffold373:1043232..1043638
RNA-Seq ExpressionMS017632
SyntenyMS017632
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034471.1 Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyrosperma]5.3e-2349.37Show/hide
Query:  PFLFS-YSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GSS
        PFL S   + S SSSSS+TT+TLPLT F SL F                   H   PRT++N +     L   SYGAYS+ L+F             GSS
Subjt:  PFLFS-YSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GSS

Query:  LVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRC
        LVWFPCTA + C  CSFP V  ATI KFIPKLSS+AKIIG    KC+WIFGPN+K  C
Subjt:  LVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRC

XP_011657732.1 probable aspartyl protease At4g16563 [Cucumis sativus]1.2e-2246.54Show/hide
Query:  PFLFSYSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GSSL
        PFLFS  +   +SSSS+TT+ LPLT F S+ F                   H   P++++N +    +L   SYGAYSV L+F             GSSL
Subjt:  PFLFSYSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GSSL

Query:  VWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRCWN
        VWFPCTA + C RCSFP V  ATI+KF+PKLSS+ K++G    KCAWIFGPN+K RC N
Subjt:  VWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRCWN

XP_022925946.1 probable aspartyl protease At4g16563 [Cucurbita moschata]3.1e-2349.37Show/hide
Query:  PFLFS-YSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GSS
        PFL S   + S SSSSS+TT+TLPLT F SL F                   H   PRT++N +     L   SYGAYS+ L+F             GSS
Subjt:  PFLFS-YSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GSS

Query:  LVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRC
        LVWFPCTA + C  CSFP V  ATI KFIPKLSS+AKIIG    KC+WIFGPN+K  C
Subjt:  LVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRC

XP_023543736.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]1.5e-2248.77Show/hide
Query:  FPFRPFLFS-YSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF------------
        FP  PFL S   + S SSSSS+TT+TLPLT F SL F                   H   PR ++N +     L   SYGAYS+ L+F            
Subjt:  FPFRPFLFS-YSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF------------

Query:  -GSSLVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRC
         GSSLVWFPCTA + C  CSFP V  ATI KFIPKLSS+AKIIG    KC+WIFGPN+K  C
Subjt:  -GSSLVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRC

XP_038881211.1 probable aspartyl protease At4g16563 [Benincasa hispida]9.0e-2349.7Show/hide
Query:  FPFRPFLFS-YSIASTSSSSSNTTITLPLTAF------------------SSLEFQIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-----------
        FP  PFLFS + +  TSSSSS +TITLPLTAF                  +SL    H   P+T++ Q  S   L + SYGAYS+ L+F           
Subjt:  FPFRPFLFS-YSIASTSSSSSNTTITLPLTAF------------------SSLEFQIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-----------

Query:  --GSSLVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRCWN
          GSSLVWFPCTA + C  CSFP V  ATI KF+PKLSS+AKIIG    KCAWIFGPN+  RC N
Subjt:  --GSSLVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRCWN

TrEMBL top hitse value%identityAlignment
A0A0A0KHK2 Peptidase A1 domain-containing protein5.7e-2346.54Show/hide
Query:  PFLFSYSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GSSL
        PFLFS  +   +SSSS+TT+ LPLT F S+ F                   H   P++++N +    +L   SYGAYSV L+F             GSSL
Subjt:  PFLFSYSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GSSL

Query:  VWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRCWN
        VWFPCTA + C RCSFP V  ATI+KF+PKLSS+ K++G    KCAWIFGPN+K RC N
Subjt:  VWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRCWN

A0A5A7SGF9 Aspartic proteinase nepenthesin-2-like3.1e-2143.48Show/hide
Query:  FRPFLFSYSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GS
        F P  F +SI     +SS+++ITLPL  F S+ F                   H   P++++N +    +L   SYGAY+V L+F             GS
Subjt:  FRPFLFSYSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GS

Query:  SLVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRCWN
        SLVWFPCTA + C  CSFP V  ATI+KF+PKLSS+ KI+G    KCAWIFGPN+K RC N
Subjt:  SLVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRCWN

A0A5D3CAS4 Aspartic proteinase nepenthesin-2-like8.3e-2244.1Show/hide
Query:  FRPFLFSYSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GS
        F P  F +SI     +SS+++ITLPLT F S+ F                   H   P++++N +    +L   SYGAY+V L+F             GS
Subjt:  FRPFLFSYSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GS

Query:  SLVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRCWN
        SLVWFPCTA + C  CSFP V  ATI+KF+PKLSS+ KI+G    KCAWIFGPN+K RC N
Subjt:  SLVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRCWN

A0A6J1EDJ0 probable aspartyl protease At4g165631.5e-2349.37Show/hide
Query:  PFLFS-YSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GSS
        PFL S   + S SSSSS+TT+TLPLT F SL F                   H   PRT++N +     L   SYGAYS+ L+F             GSS
Subjt:  PFLFS-YSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------GSS

Query:  LVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRC
        LVWFPCTA + C  CSFP V  ATI KFIPKLSS+AKIIG    KC+WIFGPN+K  C
Subjt:  LVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRC

A0A6J1IMR7 probable aspartyl protease At4g165634.8e-2245.96Show/hide
Query:  FPFRPFLFSYSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------
        FP +  L    + S SSSSS+ T+TLPLTAF SL                     H   P+T++N +     L   SYGAYS+ L+F             
Subjt:  FPFRPFLFSYSIASTSSSSSNTTITLPLTAFSSLEF-----------------QIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSF-------------

Query:  GSSLVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRC
        GSSLVWFPCTA + C  CSFP V  ATI KFIPKLSS+A+IIG    KC+WIFGPN+K  C
Subjt:  GSSLVWFPCTAHFLCFRCSFP-VAIATITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFRC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52500.1 Eukaryotic aspartyl protease family protein9.0e-1348.81Show/hide
Query:  SYGAYSVFLSF-------------GSSLVWFPCTAHFLCFRCSFPVAIAT-ITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFR
        SYG YSV LSF             GSSLVW PCT+ +LC  C F     T I +FIPK SS++KIIG    KC +++GPNV+ R
Subjt:  SYGAYSVFLSF-------------GSSLVWFPCTAHFLCFRCSFPVAIAT-ITKFIPKLSSAAKIIGYGKWKCAWIFGPNVKFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AATTCCTTCCCATTCCGTCCCTTTCTATTTTCATATTCCATTGCTTCTACTTCCTCTTCTTCCTCTAACACCACCATCACACTCCCCCTCACCGCCTTCTCTTCTCTCGA
GTTCCAGATCCATGGAAATGGTCCAAGAACCGAAACAAATCAAGCGCATTCGCCGAGAAATTTGCTCACGTGTAGCTATGGCGCTTACTCAGTTTTTCTCAGCTTCGGAA
GTAGTCTCGTCTGGTTCCCCTGCACCGCCCATTTTCTCTGCTTCAGGTGTTCGTTTCCGGTGGCTATTGCGACGATTACGAAATTTATCCCCAAATTATCTTCTGCTGCG
AAGATTATCGGTTACGGAAAGTGGAAATGTGCTTGGATTTTTGGCCCTAATGTGAAATTTAGATGTTGGAAT
mRNA sequenceShow/hide mRNA sequence
AATTCCTTCCCATTCCGTCCCTTTCTATTTTCATATTCCATTGCTTCTACTTCCTCTTCTTCCTCTAACACCACCATCACACTCCCCCTCACCGCCTTCTCTTCTCTCGA
GTTCCAGATCCATGGAAATGGTCCAAGAACCGAAACAAATCAAGCGCATTCGCCGAGAAATTTGCTCACGTGTAGCTATGGCGCTTACTCAGTTTTTCTCAGCTTCGGAA
GTAGTCTCGTCTGGTTCCCCTGCACCGCCCATTTTCTCTGCTTCAGGTGTTCGTTTCCGGTGGCTATTGCGACGATTACGAAATTTATCCCCAAATTATCTTCTGCTGCG
AAGATTATCGGTTACGGAAAGTGGAAATGTGCTTGGATTTTTGGCCCTAATGTGAAATTTAGATGTTGGAAT
Protein sequenceShow/hide protein sequence
NSFPFRPFLFSYSIASTSSSSSNTTITLPLTAFSSLEFQIHGNGPRTETNQAHSPRNLLTCSYGAYSVFLSFGSSLVWFPCTAHFLCFRCSFPVAIATITKFIPKLSSAA
KIIGYGKWKCAWIFGPNVKFRCWN