; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002227 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002227
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationscaffold30:3369449..3376554
RNA-Seq ExpressionMS002227
SyntenyMS002227
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058227.1 aspartic proteinase CDR1-like [Cucumis melo var. makuwa]1.5e-28260.33Show/hide
Query:  TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS
        TTQAWCP ++QILPGPRHR HMKKPLWII+LVSFIIVFLICAYMYPPQ++ ACYIFSS GCK ITDWLPPAPARELTDEE++SHVVI+EILNSPI PSK+
Subjt:  TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS

Query:  TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF---------------
         KIAFMF+TPGSLPFEKLWDKFFNGHE KFTVYVHASKEKPTHVS HFLNRDIHS QVVWGKITMVDAE+RLLANALQ+PDNHHF               
Subjt:  TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF---------------

Query:  -----------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA
                         DPGPHGNGRYSEHMLPEVE+KHFRKGAQWFT+KRQHA+IV+ADNLYYSKFRDYC+PGLEGHNCIADEHYLPTFFNMIDPTGIA
Subjt:  -----------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA

Query:  NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK----------------------------------------------------
        NWSVTHVDWSERKWHPKSYRAEDITYELLQ ITSIDVSVHVTSD+K                                                      
Subjt:  NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK----------------------------------------------------

Query:  ----------------------SGHACGMVCRG---HVTY------------LQEKSIPR----------------------------------------
                              S  A  ++ R    H  Y             Q  SI R                                        
Subjt:  ----------------------SGHACGMVCRG---HVTY------------LQEKSIPR----------------------------------------

Query:  --------------LWISCC-------------------------------------------------------------------------KIRKANL
                      LW+ C                                                                          KI+K NL
Subjt:  --------------LWISCC-------------------------------------------------------------------------KIRKANL

Query:  TFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQ
        TFGCGHMN KTN DDTYNGVFGLG YP+ITMATQLGNKFSYCIGDI +PLY HNHL LG G++IEGDSTPLQI FGHYYV+L+ ISVGSK LKIDPNAF+
Subjt:  TFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQ

Query:  MTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP
        ++SDG GGVLIDSGMTYTKL NGGFELL+DEI DLMKG+LERIPT+RKFEGLC+KGVV RDLVG P VTFHFA GADLVLESGSLFRQHGGDRFCLA+LP
Subjt:  MTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP

Query:  SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD
        SNSE++NLSVIG+LAQQNYNVGFDLEQMKVFFRRIDCQLL +
Subjt:  SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD

KAG6589695.1 Glycosyltransferase BC10, partial [Cucurbita argyrosperma subsp. sororia]3.3e-27459.36Show/hide
Query:  QAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTK
        Q +C  ++QI PGPR+R HMK+PLWIIILVSFIIVFLICAYMYPPQ++ ACYIFSS+GCK ITDWLPPAPARELTDEE+ASHV I+EILNSPI PSK+ K
Subjt:  QAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTK

Query:  IAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF-----------------
        +AFMF+TPGSLPFEKLWDKFFNGHEGKFTVYVHASKE+PTHVSSHFL+RDIHSDQVVWGKITMVDAE+RLLANAL DPDNHHF                 
Subjt:  IAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF-----------------

Query:  ---------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIANW
                       DPGPHGNGRYSEHMLPEVE+KHFRKGAQWFT+KRQHAIIV+ADNLYYSKFR+YCQPGLEG NCIADEHYLPTFFNMIDPTGIANW
Subjt:  ---------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIANW

Query:  SVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDE---------------------------------------------------------
        SVTHVDWSERKWHPKSYRAE IT ELLQ ITSIDVSVHVTSD+                                                         
Subjt:  SVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDE---------------------------------------------------------

Query:  --------------------------------------------------------------KYKSGHACGMVCRGHVT-------YLQEKSI-------
                                                                      K K   + G V R +++       +L   SI       
Subjt:  --------------------------------------------------------------KYKSGHACGMVCRGHVT-------YLQEKSI-------

Query:  --------PRLWISCC--------------------------------------------------------------------------KIRKANLTFG
                  LW+ C                                                                           KIRK NLTFG
Subjt:  --------PRLWISCC--------------------------------------------------------------------------KIRKANLTFG

Query:  CGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQMTS
        CGHMN KTN DD+YNGVFGLG YP ITMATQLGNKFSYCIGDI DPLY HN L LG GA++EGDSTPL+I FGHYYV+LEGISVG+KRL IDP AFQMT 
Subjt:  CGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQMTS

Query:  DGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLPSNS
        DGRGGVLIDSGMTYTKL NGGFELL+DEI DL  G+LERIPTRR+FEGLC+KGVV RDL+GLPPVTFHFA GADLVLESGSLFRQHGGDRFCLA+LPSNS
Subjt:  DGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLPSNS

Query:  EMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD
        EM+NLSVIG+LAQQNYNV FDLEQMKVFF RIDCQLL D
Subjt:  EMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD

OMO53064.1 Peptidase A1 [Corchorus olitorius]8.5e-19847.03Show/hide
Query:  QAW---CPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSK
        QAW     GD+QILPG RHRP +K+P+WII LVSF+ +FL+CAY+YPP  N+ACY+FSS+GCKA+ DWLPP+P RELTDEEIAS VVI++ILN P   SK
Subjt:  QAW---CPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSK

Query:  STKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------
        ++KIAFMF++P SLPFEKLWD FF GHEGKF+VYVHASKEKP HVS +FLNRDI SD+VVWGKI+MVDAE+RLLA+AL+DPDN HF              
Subjt:  STKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------

Query:  ------------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGI
                          DPGPHGNGRYSEHMLPEVEKK FRKGAQWF+++RQHA+IVMAD+LYYS+FRDYC+PGL+G NCIADEHYLPTFFNM+DP GI
Subjt:  ------------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGI

Query:  ANWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEK-------------------------------------------------YKSG
        ANWSVTHVDWSERKWHPKSYR +D+T +LL+ ITSID+SVHVTSDEK                                                 Y S 
Subjt:  ANWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEK-------------------------------------------------YKSG

Query:  -----HACGMV--CRGHVTYLQEKS-------------------------------IPRL----------WISC--------------------------
             HA  +       + YLQ K+                                P+L          W+ C                          
Subjt:  -----HACGMV--CRGHVTYLQEKS-------------------------------IPRL----------WISC--------------------------

Query:  -------------------CK-----------------------------IRKANLTFGCGHMNTKTNRDDT-----YNGVFGLGGYPHITMATQLGNKF
                           CK                             +   N+ FGC    +    DD+     +NG+FGLG  P +++  ++GNKF
Subjt:  -------------------CK-----------------------------IRKANLTFGCGHMNTKTNRDDT-----YNGVFGLGGYPHITMATQLGNKF

Query:  SYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGV
        SYCIG++ D  Y +N L +G GA IEG STPL+   G YYVSLEGIS+G   L IDP  F+ T +G GGV+IDSG   + L    +  L   + +L+ G+
Subjt:  SYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGV

Query:  LERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP-----SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRR
        L+ +  + +   +CY G V RDLVG P VTFH A GADLVL++GSLF       FCLA+ P      N E  +LSVIGV+AQQNYNV +DL   K++ +R
Subjt:  LERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP-----SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRR

Query:  IDCQLLGD
        IDCQLL D
Subjt:  IDCQLLGD

OMO72814.1 Peptidase A1 [Corchorus capsularis]3.6e-19646.61Show/hide
Query:  QAW---CPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSK
        QAW     GD+QILPG RHRP +K+P+WII LVSF+ +FL+CAY+YPP  N+ACY+FSS+GCKA+ DWLPP+P RELTDEEIAS VVI++ILN P   SK
Subjt:  QAW---CPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSK

Query:  STKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------
        ++KIAFMF++P SLPFEKLWD FF GHEGKF+VYVHASKEKP HVS +FLNRDI SD+VVWGKI+MVDAE+RLLA+AL+DPDN HF              
Subjt:  STKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------

Query:  ------------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGI
                          DPGPHGNGRYSEHMLPEVEKK FRKGAQWF+++RQHA+IVMAD+LYYS+FRDYC+PGL+G NCIADEHYLPTFFNM+DP GI
Subjt:  ------------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGI

Query:  ANWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK---------------------------------------------------
        ANWSVTHVDWSERKWHPKSYRA+D+T +LL+ ITSID+SVHVTSDEK +                                                   
Subjt:  ANWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK---------------------------------------------------

Query:  -------SGHACGMV--CRGHVTYLQEKS-------------------------------IPRL----------WISC----------------------
               + HA  +       + YLQ K+                                P+L          W+ C                      
Subjt:  -------SGHACGMV--CRGHVTYLQEKS-------------------------------IPRL----------WISC----------------------

Query:  ----------------------CKIRKA-----------------------------NLTFGCGHMNTKTNRDDT-----YNGVFGLGGYPHITMATQLG
                              CK                                 N++FGC    +    DD+     +NG+FGLG  P +++  ++G
Subjt:  ----------------------CKIRKA-----------------------------NLTFGCGHMNTKTNRDDT-----YNGVFGLGGYPHITMATQLG

Query:  NKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLM
        NKFSYCIG+I D  Y +N L +G  A IEG STPL+   G YYVSLEGIS+G   L IDP  F+ T +G GGV+IDSG   + L    +  L   +  L+
Subjt:  NKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLM

Query:  KGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP-----SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVF
         G+L+++  + +   +CY G V RDLVG P VTFH A GADLVL++GSLF       FCLA+ P      N E  +LSVIGV+AQQNYNV +DL    ++
Subjt:  KGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP-----SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVF

Query:  FRRIDCQLLGD
         +RIDCQLL D
Subjt:  FRRIDCQLLGD

TYK28585.1 Peptidase A1 [Cucumis melo var. makuwa]1.0e-28360.45Show/hide
Query:  TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS
        TTQAWCP ++QILPGPRHR HMKKPLWII+LVSFIIVFLICAYMYPPQ++ ACYIFSS+GCK ITDWLPPAPARELTDEE++SHVVI+EILNSPI PSK+
Subjt:  TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS

Query:  TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF---------------
         KIAFMF+TPGSLPFEKLWDKFFNGHE KFTVYVHASKEKPTHVS HFLNRDIHS QVVWGKITMVDAE+RLLANALQ+PDNHHF               
Subjt:  TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF---------------

Query:  -----------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA
                         DPGPHGNGRYSEHMLPEVE+KHFRKGAQWFT+KRQHA+IV+ADNLYYSKFRDYC+PGLEGHNCIADEHYLPTFFNMIDPTGIA
Subjt:  -----------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA

Query:  NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK----------------------------------------------------
        NWSVTHVDWSERKWHPKSYRAEDITYELLQ ITSIDVSVHVTSD+K                                                      
Subjt:  NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK----------------------------------------------------

Query:  ----------------------SGHACGMVCRG---HVTY------------LQEKSIPR----------------------------------------
                              S  A  ++ R    H  Y             Q  SI R                                        
Subjt:  ----------------------SGHACGMVCRG---HVTY------------LQEKSIPR----------------------------------------

Query:  --------------LWISCC-------------------------------------------------------------------------KIRKANL
                      LW+ C                                                                          KI+K NL
Subjt:  --------------LWISCC-------------------------------------------------------------------------KIRKANL

Query:  TFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQ
        TFGCGHMN KTN DDTYNGVFGLG YPHITMATQLGNKFSYCIGDI +PLY HNHL LG G++IEGDSTPLQI FGHYYV+L+ ISVGSK LKIDPNAF+
Subjt:  TFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQ

Query:  MTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP
        ++SDG GGVLIDSGMTYTKL NGGFELL+DEI DLMKG+LERIPT+RKFEGLC+KGVV RDLVG P VTFHFA GADLVLESGSLFRQHGGDRFCLA+LP
Subjt:  MTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP

Query:  SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD
        SNSE++NLSVIG+LAQQNYNVGFDLEQMKVFFRRIDCQLL +
Subjt:  SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD

TrEMBL top hitse value%identityAlignment
A0A1R3G4R0 Peptidase A14.1e-19847.03Show/hide
Query:  QAW---CPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSK
        QAW     GD+QILPG RHRP +K+P+WII LVSF+ +FL+CAY+YPP  N+ACY+FSS+GCKA+ DWLPP+P RELTDEEIAS VVI++ILN P   SK
Subjt:  QAW---CPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSK

Query:  STKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------
        ++KIAFMF++P SLPFEKLWD FF GHEGKF+VYVHASKEKP HVS +FLNRDI SD+VVWGKI+MVDAE+RLLA+AL+DPDN HF              
Subjt:  STKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------

Query:  ------------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGI
                          DPGPHGNGRYSEHMLPEVEKK FRKGAQWF+++RQHA+IVMAD+LYYS+FRDYC+PGL+G NCIADEHYLPTFFNM+DP GI
Subjt:  ------------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGI

Query:  ANWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEK-------------------------------------------------YKSG
        ANWSVTHVDWSERKWHPKSYR +D+T +LL+ ITSID+SVHVTSDEK                                                 Y S 
Subjt:  ANWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEK-------------------------------------------------YKSG

Query:  -----HACGMV--CRGHVTYLQEKS-------------------------------IPRL----------WISC--------------------------
             HA  +       + YLQ K+                                P+L          W+ C                          
Subjt:  -----HACGMV--CRGHVTYLQEKS-------------------------------IPRL----------WISC--------------------------

Query:  -------------------CK-----------------------------IRKANLTFGCGHMNTKTNRDDT-----YNGVFGLGGYPHITMATQLGNKF
                           CK                             +   N+ FGC    +    DD+     +NG+FGLG  P +++  ++GNKF
Subjt:  -------------------CK-----------------------------IRKANLTFGCGHMNTKTNRDDT-----YNGVFGLGGYPHITMATQLGNKF

Query:  SYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGV
        SYCIG++ D  Y +N L +G GA IEG STPL+   G YYVSLEGIS+G   L IDP  F+ T +G GGV+IDSG   + L    +  L   + +L+ G+
Subjt:  SYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGV

Query:  LERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP-----SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRR
        L+ +  + +   +CY G V RDLVG P VTFH A GADLVL++GSLF       FCLA+ P      N E  +LSVIGV+AQQNYNV +DL   K++ +R
Subjt:  LERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP-----SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRR

Query:  IDCQLLGD
        IDCQLL D
Subjt:  IDCQLLGD

A0A1R3HR95 Peptidase A11.7e-19646.61Show/hide
Query:  QAW---CPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSK
        QAW     GD+QILPG RHRP +K+P+WII LVSF+ +FL+CAY+YPP  N+ACY+FSS+GCKA+ DWLPP+P RELTDEEIAS VVI++ILN P   SK
Subjt:  QAW---CPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSK

Query:  STKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------
        ++KIAFMF++P SLPFEKLWD FF GHEGKF+VYVHASKEKP HVS +FLNRDI SD+VVWGKI+MVDAE+RLLA+AL+DPDN HF              
Subjt:  STKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------

Query:  ------------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGI
                          DPGPHGNGRYSEHMLPEVEKK FRKGAQWF+++RQHA+IVMAD+LYYS+FRDYC+PGL+G NCIADEHYLPTFFNM+DP GI
Subjt:  ------------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGI

Query:  ANWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK---------------------------------------------------
        ANWSVTHVDWSERKWHPKSYRA+D+T +LL+ ITSID+SVHVTSDEK +                                                   
Subjt:  ANWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK---------------------------------------------------

Query:  -------SGHACGMV--CRGHVTYLQEKS-------------------------------IPRL----------WISC----------------------
               + HA  +       + YLQ K+                                P+L          W+ C                      
Subjt:  -------SGHACGMV--CRGHVTYLQEKS-------------------------------IPRL----------WISC----------------------

Query:  ----------------------CKIRKA-----------------------------NLTFGCGHMNTKTNRDDT-----YNGVFGLGGYPHITMATQLG
                              CK                                 N++FGC    +    DD+     +NG+FGLG  P +++  ++G
Subjt:  ----------------------CKIRKA-----------------------------NLTFGCGHMNTKTNRDDT-----YNGVFGLGGYPHITMATQLG

Query:  NKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLM
        NKFSYCIG+I D  Y +N L +G  A IEG STPL+   G YYVSLEGIS+G   L IDP  F+ T +G GGV+IDSG   + L    +  L   +  L+
Subjt:  NKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLM

Query:  KGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP-----SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVF
         G+L+++  + +   +CY G V RDLVG P VTFH A GADLVL++GSLF       FCLA+ P      N E  +LSVIGV+AQQNYNV +DL    ++
Subjt:  KGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP-----SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVF

Query:  FRRIDCQLLGD
         +RIDCQLL D
Subjt:  FRRIDCQLLGD

A0A5A7UU11 Aspartic proteinase CDR1-like7.1e-28360.33Show/hide
Query:  TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS
        TTQAWCP ++QILPGPRHR HMKKPLWII+LVSFIIVFLICAYMYPPQ++ ACYIFSS GCK ITDWLPPAPARELTDEE++SHVVI+EILNSPI PSK+
Subjt:  TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS

Query:  TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF---------------
         KIAFMF+TPGSLPFEKLWDKFFNGHE KFTVYVHASKEKPTHVS HFLNRDIHS QVVWGKITMVDAE+RLLANALQ+PDNHHF               
Subjt:  TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF---------------

Query:  -----------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA
                         DPGPHGNGRYSEHMLPEVE+KHFRKGAQWFT+KRQHA+IV+ADNLYYSKFRDYC+PGLEGHNCIADEHYLPTFFNMIDPTGIA
Subjt:  -----------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA

Query:  NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK----------------------------------------------------
        NWSVTHVDWSERKWHPKSYRAEDITYELLQ ITSIDVSVHVTSD+K                                                      
Subjt:  NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK----------------------------------------------------

Query:  ----------------------SGHACGMVCRG---HVTY------------LQEKSIPR----------------------------------------
                              S  A  ++ R    H  Y             Q  SI R                                        
Subjt:  ----------------------SGHACGMVCRG---HVTY------------LQEKSIPR----------------------------------------

Query:  --------------LWISCC-------------------------------------------------------------------------KIRKANL
                      LW+ C                                                                          KI+K NL
Subjt:  --------------LWISCC-------------------------------------------------------------------------KIRKANL

Query:  TFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQ
        TFGCGHMN KTN DDTYNGVFGLG YP+ITMATQLGNKFSYCIGDI +PLY HNHL LG G++IEGDSTPLQI FGHYYV+L+ ISVGSK LKIDPNAF+
Subjt:  TFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQ

Query:  MTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP
        ++SDG GGVLIDSGMTYTKL NGGFELL+DEI DLMKG+LERIPT+RKFEGLC+KGVV RDLVG P VTFHFA GADLVLESGSLFRQHGGDRFCLA+LP
Subjt:  MTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP

Query:  SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD
        SNSE++NLSVIG+LAQQNYNVGFDLEQMKVFFRRIDCQLL +
Subjt:  SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD

A0A5D3DZ20 Peptidase A14.9e-28460.45Show/hide
Query:  TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS
        TTQAWCP ++QILPGPRHR HMKKPLWII+LVSFIIVFLICAYMYPPQ++ ACYIFSS+GCK ITDWLPPAPARELTDEE++SHVVI+EILNSPI PSK+
Subjt:  TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS

Query:  TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF---------------
         KIAFMF+TPGSLPFEKLWDKFFNGHE KFTVYVHASKEKPTHVS HFLNRDIHS QVVWGKITMVDAE+RLLANALQ+PDNHHF               
Subjt:  TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF---------------

Query:  -----------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA
                         DPGPHGNGRYSEHMLPEVE+KHFRKGAQWFT+KRQHA+IV+ADNLYYSKFRDYC+PGLEGHNCIADEHYLPTFFNMIDPTGIA
Subjt:  -----------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA

Query:  NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK----------------------------------------------------
        NWSVTHVDWSERKWHPKSYRAEDITYELLQ ITSIDVSVHVTSD+K                                                      
Subjt:  NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYK----------------------------------------------------

Query:  ----------------------SGHACGMVCRG---HVTY------------LQEKSIPR----------------------------------------
                              S  A  ++ R    H  Y             Q  SI R                                        
Subjt:  ----------------------SGHACGMVCRG---HVTY------------LQEKSIPR----------------------------------------

Query:  --------------LWISCC-------------------------------------------------------------------------KIRKANL
                      LW+ C                                                                          KI+K NL
Subjt:  --------------LWISCC-------------------------------------------------------------------------KIRKANL

Query:  TFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQ
        TFGCGHMN KTN DDTYNGVFGLG YPHITMATQLGNKFSYCIGDI +PLY HNHL LG G++IEGDSTPLQI FGHYYV+L+ ISVGSK LKIDPNAF+
Subjt:  TFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNAFQ

Query:  MTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP
        ++SDG GGVLIDSGMTYTKL NGGFELL+DEI DLMKG+LERIPT+RKFEGLC+KGVV RDLVG P VTFHFA GADLVLESGSLFRQHGGDRFCLA+LP
Subjt:  MTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAVLP

Query:  SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD
        SNSE++NLSVIG+LAQQNYNVGFDLEQMKVFFRRIDCQLL +
Subjt:  SNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD

A0A6J1BZQ0 uncharacterized protein LOC111006958 isoform X11.4e-18590.75Show/hide
Query:  TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS
        TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS
Subjt:  TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKS

Query:  TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF---------------
        TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF               
Subjt:  TKIAFMFMTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF---------------

Query:  -----------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA
                         DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA
Subjt:  -----------------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIA

Query:  NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEK
        NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEK
Subjt:  NWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEK

SwissProt top hitse value%identityAlignment
Q3EBM5 Probable aspartic protease At2g356151.8e-2532.68Show/hide
Query:  FGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGN----KFSYCIGDITDPLYAHNHLFLGHGAF---IEGD----STPL--QIIFGHYYVSLEGISVG
        FGCG+ N  T  D+T +G+ GLGG  H+++ +QLG+    KFSYC+   +      + + LG  +    +  D    STPL  +    +YY++LE ISVG
Subjt:  FGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGN----KFSYCIGDITDPLYAHNHLFLGHGAF---IEGD----STPL--QIIFGHYYVSLEGISVG

Query:  SKRL-----KIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESG
         K++       +PN   + S+  G ++IDSG T T L  G F+     + + + G  +R+   +     C+K   G   +GLP +T HF +GAD+ L   
Subjt:  SKRL-----KIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESG

Query:  SLFRQHGGDRFCLAVLPSNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDC
        + F +   D  CL+++P+      +++ G  AQ ++ VG+DLE   V F+ +DC
Subjt:  SLFRQHGGDRFCLAVLPSNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDC

Q65XS5 Glycosyltransferase BC104.6e-2930.54Show/hide
Query:  DEEIASHVVIQEILNSPIAPSKSTKIAFMFMTPGSLPFEKLWDKFFNG-HEGKFTVYVHASK----EKPTHVSSHFLNRDI-HSDQVVWGKITMVDAEKR
        +EE+A  V   E+  +P+ P  + ++AF+F+    LP + +WD FF G  EG+F+++VH+       + T  S  F NR + +S QV WG+ +M++AE+ 
Subjt:  DEEIASHVVIQEILNSPIAPSKSTKIAFMFMTPGSLPFEKLWDKFFNG-HEGKFTVYVHASK----EKPTHVSSHFLNRDI-HSDQVVWGKITMVDAEKR

Query:  LLANALQDPDNHHF-------DPGPHGN----------------------GRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQ--
        LLA+AL+DP N  F        P  + N                      GRY+  M P +  +++RKG+QW  L R+HA +V+ D     +F+ +C+  
Subjt:  LLANALQDPDNHHF-------DPGPHGN----------------------GRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQ--

Query:  ---------------PGLEGHNCIADEHYLPTFFNMID-PTGIANWSVTHVDW--------SERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEK
                          + HNCI DEHY+ T          +   SVTH  W          R WHP +Y+  D T  L++ I  ID   + T + K
Subjt:  ---------------PGLEGHNCIADEHYLPTFFNMID-PTGIANWSVTHVDW--------SERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEK

Q766C2 Aspartic proteinase nepenthesin-21.4e-1727.64Show/hide
Query:  NLTFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLG-NKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIF------GHYYVSLEGISVGSKR
        N+ FGCG  N    + +   G+ G+G  P +++ +QLG  +FSYC+        + + L LG  A    + +P   +        +YY++L+GI+VG   
Subjt:  NLTFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLG-NKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIF------GHYYVSLEGISVGSKR

Query:  LKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGG
        L I  + FQ+  DG GG++IDSG T T L    +  +     D +   L  +         C++       V +P ++  F  G   + E   L     G
Subjt:  LKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGG

Query:  DRFCLAVLPSNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDC
           CLA+   +S  + +S+ G + QQ   V +DL+ + V F    C
Subjt:  DRFCLAVLPSNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDC

Q766C3 Aspartic proteinase nepenthesin-13.9e-2031.17Show/hide
Query:  NLTFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLG-NKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIF------GHYYVSLEGISVGSKR
        N+TFGCG  N    + +   G+ G+G  P +++ +QL   KFSYC+  I       ++L LG  A      +P   +         YY++L G+SVGS R
Subjt:  NLTFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLG-NKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIF------GHYYVSLEGISVGSKR

Query:  LKIDPNAFQMTS-DGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHG
        L IDP+AF + S +G GG++IDSG T T   N  ++ +  E    +   L  +        LC++       + +P    HF  G DL L S + F    
Subjt:  LKIDPNAFQMTS-DGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHG

Query:  GDRFCLAVLPSNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDC
            CLA+    S    +S+ G + QQN  V +D     V F    C
Subjt:  GDRFCLAVLPSNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDC

Q9SV77 Aspartyl protease UND1.4e-2231.82Show/hide
Query:  FGCGHMNTKTNRDDTY---NGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNA
        FGC      T  D +Y    G+ GL G    ++  + G+KFS+C+G+I++P  +HN L LG GA ++G  T + I  GH    LE I VG          
Subjt:  FGCGHMNTKTNRDDTY---NGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVGSKRLKIDPNA

Query:  FQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLM-KGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLA
         ++T D    V +D+G T + L+   +    D   DL+    L   PT      LCYK      L  +  V F F  GA+L +   ++F Q G       
Subjt:  FQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLM-KGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLA

Query:  VLPSNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQL
         + +N E  +  +IGV+A Q YNVG+DL     +  + DC +
Subjt:  VLPSNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQL

Arabidopsis top hitse value%identityAlignment
AT2G19160.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein5.1e-12460.53Show/hide
Query:  LQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMFMT
        +Q LPG RHR   +KP+WII ++S I +F+I AYM+P  S +ACY+FSSKGCK +TDWLPP+  RE +D+EIA+ VVI EIL+SP    KS+KIAFMF+T
Subjt:  LQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMFMT

Query:  PGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF------------------------
        PG+LPFEKLWD FF GHEGKF+VY+HASK+ P H S +FLNR+I SD+VVWG+I+M+DAE+RLL NAL+DP+N  F                        
Subjt:  PGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF------------------------

Query:  --------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGH-NCIADEHYLPTFFNMIDPTGIANWSVTHVD
                DPGPHG GR+ +HMLPE+ ++ FRKGAQWF++KRQHA++ +ADNLYYSKFRDYC PG+EG+ NCIADEHYLPTFF M+DPTGIANW+VT+VD
Subjt:  --------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGH-NCIADEHYLPTFFNMIDPTGIANWSVTHVD

Query:  WSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDE
        WSERKWHP+ Y  EDIT EL++ I+SID    VTS++
Subjt:  WSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDE

AT4G25870.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.7e-11656.89Show/hide
Query:  QILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQ-SNSACY-IFSSKGCK-AITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMF
        +IL GPRH   +KKPLW+++ VS   + LIC +MYP    +S+C+ ++S++GC+ A++ WL P   R+ TDEEIA+  V+++IL +P   + ++KIAF+F
Subjt:  QILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQ-SNSACY-IFSSKGCK-AITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMF

Query:  MTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF----------------------
        +TPG+LPFEKLWD+FF GHEGKF++Y+H SKE+P H+S HF +R+IHSD+V WG+I+MVDAEKRLL +AL+DPDN HF                      
Subjt:  MTPGSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF----------------------

Query:  ----------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEG-HNCIADEHYLPTFFNMIDPTGIANWSVTH
                  DPGPHG GR+ EHMLPE+ K+ FRKGAQWFT+KRQHAIIVMAD LYYSKFR+YC PG+E   NCIADEHYLPTFFNMIDP GI+NWSVT 
Subjt:  ----------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEG-HNCIADEHYLPTFFNMIDPTGIANWSVTH

Query:  VDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKY
        VDWSER+WHPK+Y   +I+ E ++ +TS D+SVHVTS  K+
Subjt:  VDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKY

AT4G30060.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein4.9e-11960.78Show/hide
Query:  LPGPRHR--PHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMFMTP
        LPG R+R  P  ++ +WII+++S I +F I AYMYP  S  ACY+ SS+GCKA+ DWLPP+  RE +D+EIA+ VVI+EIL+SP    K++KIAFMF+TP
Subjt:  LPGPRHR--PHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMFMTP

Query:  GSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF-------------------------
        G+LPFE+LWD+FF GHEGKF+VY+HASKE+P H S +FLNR+I SD+VVWG+I+MVDAE+RLLANAL+D  N  F                         
Subjt:  GSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF-------------------------

Query:  -------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGH-NCIADEHYLPTFFNMIDPTGIANWSVTHVDW
               DPG HG GR+  HMLPE+ KK FRKGAQWFT+KRQHA+  MAD+LYYSKFRDYC PG+E + NCIADEHYLPTFF+M+DP GIANW+VT VDW
Subjt:  -------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGH-NCIADEHYLPTFFNMIDPTGIANWSVTHVDW

Query:  SERKWHPKSYRAEDITYELLQKITSIDVSVHVTS
        SERKWHPK+Y  EDIT+ELL  +TS D  VHVTS
Subjt:  SERKWHPKSYRAEDITYELLQKITSIDVSVHVTS

AT5G57270.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.4e-11357.96Show/hide
Query:  PRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSN---SACYIFSSKGCK-AITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMFMTPG
        PRHR  +KKPL I++LV    V L+  YMYP  +N   SAC   SS+GC+ A++ WL P   R+ TDEE+A+ VVI++IL  P A +  +KIAFMF+TPG
Subjt:  PRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSN---SACYIFSSKGCK-AITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMFMTPG

Query:  SLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------------------
        +LPFEKLWDKFF G EG+F++Y+H S+ +P H+S HF +R+IHSD V WG+I+MVDAE+RLLANAL+DPDN HF                          
Subjt:  SLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------------------

Query:  ------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGH-NCIADEHYLPTFFNMIDPTGIANWSVTHVDWS
              D GPHG GR+ +HMLPE+ ++ FRKGAQWFT+KRQHA+IVMAD LYYSKFR+YC+PG+E + NCIADEHYLPTFF+M+DP GI+NWSVT+VDWS
Subjt:  ------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGH-NCIADEHYLPTFFNMIDPTGIANWSVTHVDWS

Query:  ERKWHPKSYRAEDITYELLQKITSIDVSVHVTS
        ER+WHPK+YRA D++ +LL+ ITS D+SVHVTS
Subjt:  ERKWHPKSYRAEDITYELLQKITSIDVSVHVTS

AT5G57270.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.4e-11357.96Show/hide
Query:  PRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSN---SACYIFSSKGCK-AITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMFMTPG
        PRHR  +KKPL I++LV    V L+  YMYP  +N   SAC   SS+GC+ A++ WL P   R+ TDEE+A+ VVI++IL  P A +  +KIAFMF+TPG
Subjt:  PRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSN---SACYIFSSKGCK-AITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMFMTPG

Query:  SLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------------------
        +LPFEKLWDKFF G EG+F++Y+H S+ +P H+S HF +R+IHSD V WG+I+MVDAE+RLLANAL+DPDN HF                          
Subjt:  SLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHF--------------------------

Query:  ------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGH-NCIADEHYLPTFFNMIDPTGIANWSVTHVDWS
              D GPHG GR+ +HMLPE+ ++ FRKGAQWFT+KRQHA+IVMAD LYYSKFR+YC+PG+E + NCIADEHYLPTFF+M+DP GI+NWSVT+VDWS
Subjt:  ------DPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQHAIIVMADNLYYSKFRDYCQPGLEGH-NCIADEHYLPTFFNMIDPTGIANWSVTHVDWS

Query:  ERKWHPKSYRAEDITYELLQKITSIDVSVHVTS
        ER+WHPK+YRA D++ +LL+ ITS D+SVHVTS
Subjt:  ERKWHPKSYRAEDITYELLQKITSIDVSVHVTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACAACTCAGGCATGGTGTCCTGGCGATTTGCAGATTTTGCCTGGGCCTCGACATCGCCCTCATATGAAGAAGCCTCTGTGGATCATCATCTTGGTCTCCTTCATCATTGT
CTTCCTTATCTGTGCATACATGTATCCACCTCAAAGTAATAGTGCCTGTTACATTTTTTCTTCTAAAGGTTGTAAGGCCATTACCGACTGGCTTCCGCCTGCTCCTGCAA
GAGAACTTACTGATGAAGAGATTGCTTCTCATGTTGTTATTCAAGAAATTTTGAATTCACCTATTGCACCATCCAAATCTACAAAGATAGCATTTATGTTTATGACTCCA
GGGTCTTTGCCGTTTGAGAAGCTGTGGGATAAGTTTTTCAATGGTCATGAGGGCAAATTCACTGTTTATGTCCATGCATCTAAGGAGAAACCAACTCATGTCAGTAGCCA
CTTCTTGAATCGGGATATTCATAGTGATCAGGTGGTGTGGGGTAAAATTACCATGGTTGATGCCGAAAAAAGATTGTTGGCAAATGCTCTTCAAGATCCAGATAATCACC
ATTTTGATCCTGGTCCACATGGAAATGGCAGGTATTCAGAGCACATGTTACCAGAAGTGGAGAAGAAACACTTTAGAAAAGGTGCTCAGTGGTTCACGTTGAAACGGCAG
CATGCTATAATAGTAATGGCTGACAATCTTTATTACTCAAAATTTCGTGATTACTGCCAGCCAGGTCTAGAAGGGCACAATTGCATAGCTGATGAGCACTACTTGCCAAC
CTTTTTCAATATGATTGATCCTACTGGAATAGCAAATTGGTCAGTAACACATGTTGATTGGTCTGAAAGAAAGTGGCATCCAAAATCGTATAGAGCAGAGGACATTACCT
ATGAGCTTTTACAGAAGATTACATCAATTGATGTGAGCGTGCATGTAACAAGTGACGAGAAGTACAAAAGTGGCCATGCTTGTGGAATGGTTTGCAGAGGCCATGTTACC
TATTTGCAAGAAAAGTCTATCCCCAGGCTCTGGATAAGTTGCTGTAAAATCAGAAAGGCGAATTTAACATTCGGGTGTGGCCATATGAACACCAAAACCAACAGAGACGA
CACCTACAATGGCGTCTTTGGGCTCGGCGGCTATCCCCACATAACAATGGCCACTCAACTCGGCAACAAATTCTCGTATTGCATCGGCGACATCACCGATCCTCTCTACG
CCCACAACCACCTTTTTCTGGGCCACGGCGCTTTCATCGAAGGCGACTCCACCCCTCTTCAAATCATCTTCGGCCATTACTACGTCTCTCTGGAGGGCATCAGCGTTGGA
TCCAAGAGGCTCAAAATCGACCCAAATGCCTTCCAAATGACTTCAGACGGCAGAGGCGGCGTTCTCATCGACTCTGGAATGACGTACACCAAGCTCACCAATGGCGGATT
TGAGCTTCTTTTCGACGAGATTGCGGATCTCATGAAGGGGGTTTTGGAGCGCATCCCGACGAGGAGGAAATTCGAGGGGCTGTGTTACAAGGGAGTTGTGGGTCGAGATC
TTGTTGGGCTGCCGCCGGTGACGTTCCATTTCGCCAGCGGCGCCGATTTGGTGCTGGAGTCTGGGAGTTTGTTCCGGCAGCACGGTGGGGATCGGTTCTGTTTGGCGGTT
TTGCCGAGCAATTCCGAGATGATGAATCTGTCTGTGATTGGGGTTTTGGCTCAACAGAACTATAATGTGGGTTTTGATCTTGAACAGATGAAGGTTTTCTTCCGTAGGAT
TGATTGTCAGCTTCTTGGCGAC
mRNA sequenceShow/hide mRNA sequence
ACAACTCAGGCATGGTGTCCTGGCGATTTGCAGATTTTGCCTGGGCCTCGACATCGCCCTCATATGAAGAAGCCTCTGTGGATCATCATCTTGGTCTCCTTCATCATTGT
CTTCCTTATCTGTGCATACATGTATCCACCTCAAAGTAATAGTGCCTGTTACATTTTTTCTTCTAAAGGTTGTAAGGCCATTACCGACTGGCTTCCGCCTGCTCCTGCAA
GAGAACTTACTGATGAAGAGATTGCTTCTCATGTTGTTATTCAAGAAATTTTGAATTCACCTATTGCACCATCCAAATCTACAAAGATAGCATTTATGTTTATGACTCCA
GGGTCTTTGCCGTTTGAGAAGCTGTGGGATAAGTTTTTCAATGGTCATGAGGGCAAATTCACTGTTTATGTCCATGCATCTAAGGAGAAACCAACTCATGTCAGTAGCCA
CTTCTTGAATCGGGATATTCATAGTGATCAGGTGGTGTGGGGTAAAATTACCATGGTTGATGCCGAAAAAAGATTGTTGGCAAATGCTCTTCAAGATCCAGATAATCACC
ATTTTGATCCTGGTCCACATGGAAATGGCAGGTATTCAGAGCACATGTTACCAGAAGTGGAGAAGAAACACTTTAGAAAAGGTGCTCAGTGGTTCACGTTGAAACGGCAG
CATGCTATAATAGTAATGGCTGACAATCTTTATTACTCAAAATTTCGTGATTACTGCCAGCCAGGTCTAGAAGGGCACAATTGCATAGCTGATGAGCACTACTTGCCAAC
CTTTTTCAATATGATTGATCCTACTGGAATAGCAAATTGGTCAGTAACACATGTTGATTGGTCTGAAAGAAAGTGGCATCCAAAATCGTATAGAGCAGAGGACATTACCT
ATGAGCTTTTACAGAAGATTACATCAATTGATGTGAGCGTGCATGTAACAAGTGACGAGAAGTACAAAAGTGGCCATGCTTGTGGAATGGTTTGCAGAGGCCATGTTACC
TATTTGCAAGAAAAGTCTATCCCCAGGCTCTGGATAAGTTGCTGTAAAATCAGAAAGGCGAATTTAACATTCGGGTGTGGCCATATGAACACCAAAACCAACAGAGACGA
CACCTACAATGGCGTCTTTGGGCTCGGCGGCTATCCCCACATAACAATGGCCACTCAACTCGGCAACAAATTCTCGTATTGCATCGGCGACATCACCGATCCTCTCTACG
CCCACAACCACCTTTTTCTGGGCCACGGCGCTTTCATCGAAGGCGACTCCACCCCTCTTCAAATCATCTTCGGCCATTACTACGTCTCTCTGGAGGGCATCAGCGTTGGA
TCCAAGAGGCTCAAAATCGACCCAAATGCCTTCCAAATGACTTCAGACGGCAGAGGCGGCGTTCTCATCGACTCTGGAATGACGTACACCAAGCTCACCAATGGCGGATT
TGAGCTTCTTTTCGACGAGATTGCGGATCTCATGAAGGGGGTTTTGGAGCGCATCCCGACGAGGAGGAAATTCGAGGGGCTGTGTTACAAGGGAGTTGTGGGTCGAGATC
TTGTTGGGCTGCCGCCGGTGACGTTCCATTTCGCCAGCGGCGCCGATTTGGTGCTGGAGTCTGGGAGTTTGTTCCGGCAGCACGGTGGGGATCGGTTCTGTTTGGCGGTT
TTGCCGAGCAATTCCGAGATGATGAATCTGTCTGTGATTGGGGTTTTGGCTCAACAGAACTATAATGTGGGTTTTGATCTTGAACAGATGAAGGTTTTCTTCCGTAGGAT
TGATTGTCAGCTTCTTGGCGAC
Protein sequenceShow/hide protein sequence
TTQAWCPGDLQILPGPRHRPHMKKPLWIIILVSFIIVFLICAYMYPPQSNSACYIFSSKGCKAITDWLPPAPARELTDEEIASHVVIQEILNSPIAPSKSTKIAFMFMTP
GSLPFEKLWDKFFNGHEGKFTVYVHASKEKPTHVSSHFLNRDIHSDQVVWGKITMVDAEKRLLANALQDPDNHHFDPGPHGNGRYSEHMLPEVEKKHFRKGAQWFTLKRQ
HAIIVMADNLYYSKFRDYCQPGLEGHNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEDITYELLQKITSIDVSVHVTSDEKYKSGHACGMVCRGHVT
YLQEKSIPRLWISCCKIRKANLTFGCGHMNTKTNRDDTYNGVFGLGGYPHITMATQLGNKFSYCIGDITDPLYAHNHLFLGHGAFIEGDSTPLQIIFGHYYVSLEGISVG
SKRLKIDPNAFQMTSDGRGGVLIDSGMTYTKLTNGGFELLFDEIADLMKGVLERIPTRRKFEGLCYKGVVGRDLVGLPPVTFHFASGADLVLESGSLFRQHGGDRFCLAV
LPSNSEMMNLSVIGVLAQQNYNVGFDLEQMKVFFRRIDCQLLGD