; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0022693 (gene) of Chayote v1 genome

Gene IDSed0022693
OrganismSechium edule (Chayote v1)
Descriptionproline-rich protein 4-like
Genome locationLG05:35067513..35069568
RNA-Seq ExpressionSed0022693
SyntenySed0022693
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588996.1 Proline-rich protein 2, partial [Cucurbita argyrosperma subsp. sororia]6.3e-13073.11Show/hide
Query:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF
        MQ+ S CFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIK ++AF+GLRVS+DCK KDG  ERKG AEL++EGKFKVLLP EALKDGKLKGKCF
Subjt:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK
        AQLHSASSTPC S D LESSM+VLKS G GK TFGLSG LKF+S TCVSAFFWH YHHPPLPPI FPPHPP       PPYH+KFF     PPPTPEE  
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK

Query:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPP
         PPVPE PPPVY+PPPV E PPPV +KPPPV +KPPPV +KPPPVY+  P   KKP PPVYE  PPV +  PP   KPP  + KPP      PPVY   P
Subjt:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPP

Query:  PVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH
        PVY   PPVY   PPVY   PPVY  KPPTPVYKHP+YK+LPPISK PPC PVPKVIP +PPKY SHPKFGKKFPPL P  PH
Subjt:  PVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH

KAG7015262.1 Translocon-associated protein subunit alpha [Cucurbita argyrosperma subsp. argyrosperma]2.2e-13072.96Show/hide
Query:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF
        MQ+ S CFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIK ++AF+GLRVS+DCK KDG  ERKG AEL++EGKFKVLLP EALKDGKLKGKCF
Subjt:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK
        AQLHSASSTPC S D LESSM+VLKS G GK TFGLSG LKF+S TCVSAFFWH YHHPPLPPI FPPHPP       PPYH+KFF     PPPTPEE  
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK

Query:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPV-EKKP------PPVYEKPPPVKKKPPPAYEKPP
         PPVPE PPPVY+PPPV E PPPV +KPPPV +KPPPV +KPPPVY+  P   KKP PPVYE  PPV E KP      PPVYE  PPV +  PP   KPP
Subjt:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPV-EKKP------PPVYEKPPPVKKKPPPAYEKPP

Query:  PVYEKP--PPVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH
            KP  PPVY   PPVY   PPVY   PPVY  KPPTPVYKHP+YK+LPPISK PPC PVPKVIP +PPKY SHPKFGKKFPPL P  PH
Subjt:  PVYEKP--PPVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH

XP_022989049.1 repetitive proline-rich cell wall protein 2-like isoform X1 [Cucurbita maxima]3.7e-13068.25Show/hide
Query:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF
        MQ  S CFFFWFFFLFA TFCHGSDLTTVEVVGVGECADCHKNNIK ++AF+GLRVSIDCK KDG  ERKG AEL++EGKFKVLLPTEALKDGKLKGKCF
Subjt:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK
        A LHSASSTPC S D LESSM+VLKS G GK TFGL G LKF+S TCVSAFFWH YHHPPLPPI FPPHPP       PPYH+KFF     PPPTPEE  
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK

Query:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYE---------------KPPPVEKKP---------PPVYEK
         PPVPE PPPVY+PPPV E PPPV +KPPPV +KPPPV +KPPPVYEKPPPV +KPPPVY+               KPP  E KP         PPVYE 
Subjt:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYE---------------KPPPVEKKP---------PPVYEK

Query:  PPPVKKKPPPAYEKP----------------PPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVP
         PPV +  PP   KP                PPVY   PPVY   PPVY   PPVY   PPVY  KPP PVYKHP+YK+LPPISK PPC PVPKVIP +P
Subjt:  PPPVKKKPPPAYEKP----------------PPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVP

Query:  PKYFSHPKFGKKFPPLPPSAPH
        PKY SHPKFGKKFPPL P  PH
Subjt:  PKYFSHPKFGKKFPPLPPSAPH

XP_022989050.1 proline-rich protein 4-like isoform X2 [Cucurbita maxima]1.8e-13271.68Show/hide
Query:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF
        MQ  S CFFFWFFFLFA TFCHGSDLTTVEVVGVGECADCHKNNIK ++AF+GLRVSIDCK KDG  ERKG AEL++EGKFKVLLPTEALKDGKLKGKCF
Subjt:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK
        A LHSASSTPC S D LESSM+VLKS G GK TFGL G LKF+S TCVSAFFWH YHHPPLPPI FPPHPP       PPYH+KFF     PPPTPEE  
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK

Query:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPVKKKPPPAYEKP--------
         PPVPE PPPVY+PPPV E PPPV +KPPPV +KPPPV +KPPPVYEKPPPV +KPPPVY+  P   KKP PPVYE  PPV +  PP   KP        
Subjt:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPVKKKPPPAYEKP--------

Query:  --------PPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH
                PPVY   PPVY   PPVY   PPVY   PPVY  KPP PVYKHP+YK+LPPISK PPC PVPKVIP +PPKY SHPKFGKKFPPL P  PH
Subjt:  --------PPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH

XP_022989051.1 proline-rich protein 4-like isoform X3 [Cucurbita maxima]1.4e-12972.38Show/hide
Query:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF
        MQ  S CFFFWFFFLFA TFCHGSDLTTVEVVGVGECADCHKNNIK ++AF+GLRVSIDCK KDG  ERKG AEL++EGKFKVLLPTEALKDGKLKGKCF
Subjt:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK
        A LHSASSTPC S D LESSM+VLKS G GK TFGL G LKF+S TCVSAFFWH YHHPPLPPI FPPHPP       PPYH+KFF     PPPTPEE  
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK

Query:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKP-------PPVYEKPPPVKKKPPPAYEKPPP
         PPVPE PPPVY+PPPV E PPPV +KPPPV +KPPPV +KPPPVYEKPPPV +KPPPVY+  P   KKP       PPVYE  PPV +  PP     PP
Subjt:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKP-------PPVYEKPPPVKKKPPPAYEKPPP

Query:  VYEKPPPVYKKPPPVYEKPPPVYEKP--PPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH
        VYE  PPV +  PP   KPP    KP  PPVY  KPP PVYKHP+YK+LPPISK PPC PVPKVIP +PPKY SHPKFGKKFPPL P  PH
Subjt:  VYEKPPPVYKKPPPVYEKPPPVYEKP--PPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH

TrEMBL top hitse value%identityAlignment
A0A6J1EKD0 proline-rich protein 4-like isoform X28.9e-12270.6Show/hide
Query:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF
        MQ+ S CF FWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIK ++AF+GLRVS+DCK KDG  ERKG AEL++EGKFKVLLP EALKDGKLKGKCF
Subjt:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK
        AQLHSASSTPC S D LESSM+VLKS G GK TFGLSG LKF+S TCVSAFFWH YHHPPLPPI FPPHPP       PPYH+KFF     PPPTPEE  
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK

Query:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPVKKKPPPAYEKPPPVYEKPP
         PPVPE PPPVY+PPPV E               PPPV +KPPPVYEKPPPV +KPPPVY+  P   KKP PPVYE  PPV +  PP   KPP    KPP
Subjt:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPVKKKPPPAYEKPPPVYEKPP

Query:  PVYKKPPPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH
            KPP      PPVY     V KPPTPVYKHP+YK+LPPISK PPC+PVPKVIP +PPKY SHPKFGKKFPPL P  PH
Subjt:  PVYKKPPPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH

A0A6J1ERR1 proline-rich protein 4-like isoform X13.6e-12370.87Show/hide
Query:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF
        MQ+ S CF FWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIK ++AF+GLRVS+DCK KDG  ERKG AEL++EGKFKVLLP EALKDGKLKGKCF
Subjt:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK
        AQLHSASSTPC S D LESSM+VLKS G GK TFGLSG LKF+S TCVSAFFWH YHHPPLPPI FPPHPP       PPYH+KFF     PPPTPEE  
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK

Query:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPP
         PPVPE PPPVY+PPPV E PPPV +KPPPV +KPPPV +KPPPVY+  P   KKP PPVYE  PPV +  PP   KPP  + KPP      PPV E  P
Subjt:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPP

Query:  PVYKKPPPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH
        P            PPVY     V KPPTPVYKHP+YK+LPPISK PPC+PVPKVIP +PPKY SHPKFGKKFPPL P  PH
Subjt:  PVYKKPPPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH

A0A6J1JIY3 proline-rich protein 4-like isoform X36.8e-13072.38Show/hide
Query:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF
        MQ  S CFFFWFFFLFA TFCHGSDLTTVEVVGVGECADCHKNNIK ++AF+GLRVSIDCK KDG  ERKG AEL++EGKFKVLLPTEALKDGKLKGKCF
Subjt:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK
        A LHSASSTPC S D LESSM+VLKS G GK TFGL G LKF+S TCVSAFFWH YHHPPLPPI FPPHPP       PPYH+KFF     PPPTPEE  
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK

Query:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKP-------PPVYEKPPPVKKKPPPAYEKPPP
         PPVPE PPPVY+PPPV E PPPV +KPPPV +KPPPV +KPPPVYEKPPPV +KPPPVY+  P   KKP       PPVYE  PPV +  PP     PP
Subjt:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKP-------PPVYEKPPPVKKKPPPAYEKPPP

Query:  VYEKPPPVYKKPPPVYEKPPPVYEKP--PPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH
        VYE  PPV +  PP   KPP    KP  PPVY  KPP PVYKHP+YK+LPPISK PPC PVPKVIP +PPKY SHPKFGKKFPPL P  PH
Subjt:  VYEKPPPVYKKPPPVYEKPPPVYEKP--PPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH

A0A6J1JLA8 proline-rich protein 4-like isoform X28.5e-13371.68Show/hide
Query:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF
        MQ  S CFFFWFFFLFA TFCHGSDLTTVEVVGVGECADCHKNNIK ++AF+GLRVSIDCK KDG  ERKG AEL++EGKFKVLLPTEALKDGKLKGKCF
Subjt:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK
        A LHSASSTPC S D LESSM+VLKS G GK TFGL G LKF+S TCVSAFFWH YHHPPLPPI FPPHPP       PPYH+KFF     PPPTPEE  
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK

Query:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPVKKKPPPAYEKP--------
         PPVPE PPPVY+PPPV E PPPV +KPPPV +KPPPV +KPPPVYEKPPPV +KPPPVY+  P   KKP PPVYE  PPV +  PP   KP        
Subjt:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKP-PPVYEKPPPVKKKPPPAYEKP--------

Query:  --------PPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH
                PPVY   PPVY   PPVY   PPVY   PPVY  KPP PVYKHP+YK+LPPISK PPC PVPKVIP +PPKY SHPKFGKKFPPL P  PH
Subjt:  --------PPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPH

A0A6J1JN98 repetitive proline-rich cell wall protein 2-like isoform X11.8e-13068.25Show/hide
Query:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF
        MQ  S CFFFWFFFLFA TFCHGSDLTTVEVVGVGECADCHKNNIK ++AF+GLRVSIDCK KDG  ERKG AEL++EGKFKVLLPTEALKDGKLKGKCF
Subjt:  MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK
        A LHSASSTPC S D LESSM+VLKS G GK TFGL G LKF+S TCVSAFFWH YHHPPLPPI FPPHPP       PPYH+KFF     PPPTPEE  
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPY----VLPPYHYKFFPPKVNPPPTPEEPK

Query:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYE---------------KPPPVEKKP---------PPVYEK
         PPVPE PPPVY+PPPV E PPPV +KPPPV +KPPPV +KPPPVYEKPPPV +KPPPVY+               KPP  E KP         PPVYE 
Subjt:  PPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYE---------------KPPPVEKKP---------PPVYEK

Query:  PPPVKKKPPPAYEKP----------------PPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVP
         PPV +  PP   KP                PPVY   PPVY   PPVY   PPVY   PPVY  KPP PVYKHP+YK+LPPISK PPC PVPKVIP +P
Subjt:  PPPVKKKPPPAYEKP----------------PPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVY--KPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVP

Query:  PKYFSHPKFGKKFPPLPPSAPH
        PKY SHPKFGKKFPPL P  PH
Subjt:  PKYFSHPKFGKKFPPLPPSAPH

SwissProt top hitse value%identityAlignment
P13983 Extensin7.2e-1243.22Show/hide
Query:  YYHHPPLPPISFPPHPPYVLPPYHYKFFPPK--VNPPPTPEEPKPPPVP----EIPPPVYKPPPVYEKPPPVEKKPP--PVEKKPPPV-EKKPPPVYEKP
        Y   PP    S  P P Y  PP  Y   PP    +PPP    P PPP P      PPP Y PPP Y  PPP     P  P+   PPPV    PPP Y  P
Subjt:  YYHHPPLPPISFPPHPPYVLPPYHYKFFPPK--VNPPPTPEEPKPPPVP----EIPPPVYKPPPVYEKPPPVEKKPP--PVEKKPPPV-EKKPPPVYEKP

Query:  PPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKP---PPVYEKPPPVYKKPPPVYEKPPPV---YEKPPPVYKPPTPVYKHPWYKVLPPISK
        PP    PPP    PPP    PPP YE+ PP    PPPAY  P   PP Y  PPP Y  PPP Y +PPP+   Y  PPP Y PP P    P Y   PP   
Subjt:  PPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKP---PPVYEKPPPVYKKPPPVYEKPPPV---YEKPPPVYKPPTPVYKHPWYKVLPPISK

Query:  PPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAP
        PPP  P     P  PP Y   P      PP P  +P
Subjt:  PPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAP

Q9LJ64 Pollen-specific leucine-rich repeat extensin-like protein 11.0e-1847.93Show/hide
Query:  PPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEK
        P  PP+  PP PP V  P      PP V  PP P    PPPV   PPPV+ PP     PPPV   PPPV   PPPV   PPPV+  PPPV   PPPV+  
Subjt:  PPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEK

Query:  PPPVEK-KPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYK-KPPPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPK
        PPPV+   PPPV+  PPP     PP    PPPV+  PPPV+   PPPV+  PPPV+  PPPV+ PP PV+  P     PP+  PPP    P  I + PP 
Subjt:  PPPVEK-KPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYK-KPPPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPK

Query:  YFSHPKFGKKFPPLPPS
         FS P   K   PLPP+
Subjt:  YFSHPKFGKKFPPLPPS

Q9SKP9 Proline-rich protein 24.5e-3038.17Show/hide
Query:  CFFFWFFFLFAA--TFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKD--GKLERKGVAELDKEGKFKVLLPTEAL-KDGKLKGKCFA
        C  F F     A    C    +  VEV+G  E      + IKI NAFSGLRV+I+CK  D  G    +G  E+D+ GKF + +P + +  DG LK  C+A
Subjt:  CFFFWFFFLFAA--TFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKD--GKLERKGVAELDKEGKFKVLLPTEAL-KDGKLKGKCFA

Query:  QLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVP
         L SA   PCP+HD LE+S +V  SK       GL   LKF    C+S FFWH      +P   FP  PP  LPP  +    PK+  P            
Subjt:  QLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVP

Query:  EIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKP
           PP+YKPP V             + KKP P +    P+Y+ P P+ K P P+Y+ P  + KKP      PP + K          P+Y+ P P+YK P
Subjt:  EIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKP

Query:  PPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPP--PCTPVPKVIPTVPPKYFSHPKFGKKFPPLP
          + +K  P   K  P+YK P P+YK P +K  PP+   P  PC P+PK  P  PPKY  HPKFG K+PP P
Subjt:  PPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPP--PCTPVPKVIPTVPPKYFSHPKFGKKFPPLP

Q9T0I5 Proline-rich protein 44.0e-4743.39Show/hide
Query:  QNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKD-GKLKGKCF
        + S  C       L +AT    S    VEVVG  E      + IK  +AFSGLRV+IDCK   G    KG   +D +GKF + +P + + D G LK +C+
Subjt:  QNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKD-GKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPIS-----FPPHPPYVLPPYHYKFFPPKVNPP------
        AQLHSA+ TPCP+HD LES+ +V  SK   K   GL   LKF    CVS FFW     P LPP       FP  PP  LPP+  K  PPK +PP      
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPIS-----FPPHPPYVLPPYHYKFFPPKVNPP------

Query:  -----PTPEEPKPPPVP--------EIPP--PVYKPPPVYEKPPPVEKKP----PPVEKKPPPVE-KKPPPVYEKPPPVE-KKPPPVYEKPPPV------
             P P++  PPPVP        E+PP  PVYKPPP  E PPP+ KKP    PP  + PPPV   KPPP  EKPPPV   KPPP  E PPPV      
Subjt:  -----PTPEEPKPPPVP--------EIPP--PVYKPPPVYEKPPPVEKKP----PPVEKKPPPVE-KKPPPVYEKPPPVE-KKPPPVYEKPPPV------

Query:  -------EKKPPPVYEKPPPVKKKPPPAYEKPPPV-YEKPPPVYKKPPPVYEKPP--PVYEKP-----PPVY---------KPPTPVYKHPWYKVLP---
               +  PPPV    PP KK  PP    PPPV   KPPP    PPP  E PP  PVY+ P     PP+Y          PP P+YK P   V+P   
Subjt:  -------EKKPPPVYEKPPPVKKKPPPAYEKPPPV-YEKPPPVYKKPPPVYEKPP--PVYEKP-----PPVY---------KPPTPVYKHPWYKVLP---

Query:  -----PISKPP-------PCTPVPKV-----IPTVPPKYFSHPKFGKKFPPLPP
             P+ KPP       PC P+P++      P +PPKY  HPKFG K+PPLPP
Subjt:  -----PISKPP-------PCTPVPKV-----IPTVPPKYFSHPKFGKKFPPLPP

Q9XIL9 Pollen-specific leucine-rich repeat extensin-like protein 35.3e-2349.78Show/hide
Query:  HHPPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVPEIPPPVYK-PPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYE-KPPPVEKKPPP
        H PP PP+  PP PP V  P      PP   PPP    P PPPV   PPPV+  PPPV+  PPPV   PPPV   PPPV   PPPVY   PPPV   PPP
Subjt:  HHPPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVPEIPPPVYK-PPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYE-KPPPVEKKPPP

Query:  VYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVYK--PPTPVYKHPWYKVLPP-ISKPPPCTPVPKVIP
        V+  PPPV   PPPVY  PP     PPP +  PPPV+  PPPV+  PPPVY  PPPVY  PPP  K  PP PVY  P   +LPP +S PP  TPV    P
Subjt:  VYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVYK--PPTPVYKHPWYKVLPP-ISKPPPCTPVPKVIP

Query:  TVPPKYFSHPKFGKKFPPLPPSAPH
          P +    P   ++F  +PP   H
Subjt:  TVPPKYFSHPKFGKKFPPLPPSAPH

Arabidopsis top hitse value%identityAlignment
AT2G15880.1 Leucine-rich repeat (LRR) family protein3.8e-2449.78Show/hide
Query:  HHPPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVPEIPPPVYK-PPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYE-KPPPVEKKPPP
        H PP PP+  PP PP V  P      PP   PPP    P PPPV   PPPV+  PPPV+  PPPV   PPPV   PPPV   PPPVY   PPPV   PPP
Subjt:  HHPPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVPEIPPPVYK-PPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYE-KPPPVEKKPPP

Query:  VYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVYK--PPTPVYKHPWYKVLPP-ISKPPPCTPVPKVIP
        V+  PPPV   PPPVY  PP     PPP +  PPPV+  PPPV+  PPPVY  PPPVY  PPP  K  PP PVY  P   +LPP +S PP  TPV    P
Subjt:  VYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVYK--PPTPVYKHPWYKVLPP-ISKPPPCTPVPKVIP

Query:  TVPPKYFSHPKFGKKFPPLPPSAPH
          P +    P   ++F  +PP   H
Subjt:  TVPPKYFSHPKFGKKFPPLPPSAPH

AT2G21140.1 proline-rich protein 23.2e-3138.17Show/hide
Query:  CFFFWFFFLFAA--TFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKD--GKLERKGVAELDKEGKFKVLLPTEAL-KDGKLKGKCFA
        C  F F     A    C    +  VEV+G  E      + IKI NAFSGLRV+I+CK  D  G    +G  E+D+ GKF + +P + +  DG LK  C+A
Subjt:  CFFFWFFFLFAA--TFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKD--GKLERKGVAELDKEGKFKVLLPTEAL-KDGKLKGKCFA

Query:  QLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVP
         L SA   PCP+HD LE+S +V  SK       GL   LKF    C+S FFWH      +P   FP  PP  LPP  +    PK+  P            
Subjt:  QLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVP

Query:  EIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKP
           PP+YKPP V             + KKP P +    P+Y+ P P+ K P P+Y+ P  + KKP      PP + K          P+Y+ P P+YK P
Subjt:  EIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKP

Query:  PPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPP--PCTPVPKVIPTVPPKYFSHPKFGKKFPPLP
          + +K  P   K  P+YK P P+YK P +K  PP+   P  PC P+PK  P  PPKY  HPKFG K+PP P
Subjt:  PPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPP--PCTPVPKVIPTVPPKYFSHPKFGKKFPPLP

AT3G19020.1 Leucine-rich repeat (LRR) family protein7.4e-2047.93Show/hide
Query:  PPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEK
        P  PP+  PP PP V  P      PP V  PP P    PPPV   PPPV+ PP     PPPV   PPPV   PPPV   PPPV+  PPPV   PPPV+  
Subjt:  PPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVPEIPPPVYKPPPVYEKPPPVEKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEK

Query:  PPPVEK-KPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYK-KPPPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPK
        PPPV+   PPPV+  PPP     PP    PPPV+  PPPV+   PPPV+  PPPV+  PPPV+ PP PV+  P     PP+  PPP    P  I + PP 
Subjt:  PPPVEK-KPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYK-KPPPVYEKPPPVYEKPPPVYKPPTPVYKHPWYKVLPPISKPPPCTPVPKVIPTVPPK

Query:  YFSHPKFGKKFPPLPPS
         FS P   K   PLPP+
Subjt:  YFSHPKFGKKFPPLPPS

AT4G38770.1 proline-rich protein 42.9e-4843.39Show/hide
Query:  QNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKD-GKLKGKCF
        + S  C       L +AT    S    VEVVG  E      + IK  +AFSGLRV+IDCK   G    KG   +D +GKF + +P + + D G LK +C+
Subjt:  QNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKD-GKLKGKCF

Query:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPIS-----FPPHPPYVLPPYHYKFFPPKVNPP------
        AQLHSA+ TPCP+HD LES+ +V  SK   K   GL   LKF    CVS FFW     P LPP       FP  PP  LPP+  K  PPK +PP      
Subjt:  AQLHSASSTPCPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPIS-----FPPHPPYVLPPYHYKFFPPKVNPP------

Query:  -----PTPEEPKPPPVP--------EIPP--PVYKPPPVYEKPPPVEKKP----PPVEKKPPPVE-KKPPPVYEKPPPVE-KKPPPVYEKPPPV------
             P P++  PPPVP        E+PP  PVYKPPP  E PPP+ KKP    PP  + PPPV   KPPP  EKPPPV   KPPP  E PPPV      
Subjt:  -----PTPEEPKPPPVP--------EIPP--PVYKPPPVYEKPPPVEKKP----PPVEKKPPPVE-KKPPPVYEKPPPVE-KKPPPVYEKPPPV------

Query:  -------EKKPPPVYEKPPPVKKKPPPAYEKPPPV-YEKPPPVYKKPPPVYEKPP--PVYEKP-----PPVY---------KPPTPVYKHPWYKVLP---
               +  PPPV    PP KK  PP    PPPV   KPPP    PPP  E PP  PVY+ P     PP+Y          PP P+YK P   V+P   
Subjt:  -------EKKPPPVYEKPPPVKKKPPPAYEKPPPV-YEKPPPVYKKPPPVYEKPP--PVYEKP-----PPVY---------KPPTPVYKHPWYKVLP---

Query:  -----PISKPP-------PCTPVPKV-----IPTVPPKYFSHPKFGKKFPPLPP
             P+ KPP       PC P+P++      P +PPKY  HPKFG K+PPLPP
Subjt:  -----PISKPP-------PCTPVPKV-----IPTVPPKYFSHPKFGKKFPPLPP

AT5G59170.1 Proline-rich extensin-like family protein1.1e-1044.68Show/hide
Query:  PPL---PPISFPPHPPYVLPPYHYKFFPP-KVNPPP----TPEEPKPPPVPEIPPPVYKPPPVYEKPPPVE-----KKPPPVEKKPPPVEKKPPPVYEKP
        PP+   PPI   P PPY  PP  Y   PP K  P P     P E  PPP+ + PPP   PPP+ + PPP +     KK PP E+ PPP++K PPP +  P
Subjt:  PPL---PPISFPPHPPYVLPPYHYKFFPP-KVNPPP----TPEEPKPPPVPEIPPPVYKPPPVYEKPPPVE-----KKPPPVEKKPPPVEKKPPPVYEKP

Query:  PPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKPPPVYEKPPPVYEKP-PPVYKPPTP--VYKHPWYKVLPPISKPPP
        PP+ KK PP  + PPP++K PPP  + PPP+KK PPP  + PPP+ + PPP+ K PPP  E PPP+   P PPV  PP P   Y HP  K  PP   P  
Subjt:  PPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKPPPVYEKPPPVYEKP-PPVYKPPTP--VYKHPWYKVLPPISKPPP

Query:  CTPVPKVIPTVPPKYFSHP-----KFGKKFPPLPP
        C P P+  P  P K +  P        KK+PP  P
Subjt:  CTPVPKVIPTVPPKYFSHP-----KFGKKFPPLPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAACTCTTCTGCCTGTTTCTTTTTCTGGTTTTTCTTCCTTTTTGCTGCAACTTTCTGCCATGGCAGTGACTTGACAACAGTTGAGGTTGTTGGTGTTGGAGAATG
TGCAGACTGCCACAAGAATAACATTAAAATCAGCAATGCCTTCTCAGGGCTTCGTGTAAGCATTGACTGCAAACAGAAAGATGGAAAGCTTGAAAGAAAAGGTGTTGCAG
AGCTTGATAAAGAAGGAAAGTTCAAAGTGTTGCTTCCAACTGAGGCTTTGAAAGATGGAAAGTTGAAGGGGAAATGCTTTGCACAACTTCATAGTGCTTCTTCTACTCCT
TGTCCATCTCATGATGCCTTGGAATCTTCCATGGTTGTGTTAAAATCCAAAGGTGAGGGAAAACAAACCTTTGGTTTGAGTGGTGGGCTCAAGTTCCAGAGTGCAACTTG
TGTATCTGCTTTCTTTTGGCATTACTACCATCATCCTCCTTTGCCTCCTATCTCTTTCCCTCCCCATCCTCCATATGTGCTCCCTCCTTACCATTACAAGTTCTTCCCTC
CTAAGGTCAACCCTCCACCCACCCCAGAAGAACCGAAACCACCCCCAGTCCCCGAAATCCCTCCGCCGGTTTACAAGCCTCCCCCGGTCTACGAGAAGCCTCCACCGGTC
GAGAAAAAGCCTCCACCGGTCGAGAAAAAGCCTCCGCCAGTCGAGAAAAAGCCTCCTCCTGTCTACGAGAAGCCTCCACCGGTTGAGAAAAAGCCTCCTCCTGTCTACGA
GAAGCCTCCACCGGTTGAGAAAAAGCCTCCTCCTGTCTACGAGAAGCCTCCACCGGTCAAGAAAAAGCCTCCTCCTGCCTACGAGAAGCCTCCCCCAGTCTACGAGAAGC
CTCCACCGGTCTACAAAAAGCCTCCCCCGGTCTATGAGAAGCCTCCACCGGTCTACGAAAAGCCTCCGCCGGTTTACAAGCCCCCAACACCAGTGTATAAGCATCCATGG
TACAAGGTTCTTCCTCCCATTTCAAAGCCTCCTCCATGTACACCAGTTCCTAAAGTCATCCCTACTGTTCCTCCAAAGTACTTCTCTCACCCCAAATTTGGCAAGAAGTT
CCCTCCTCTGCCTCCTTCAGCTCCACACCATTAA
mRNA sequenceShow/hide mRNA sequence
CTAAAAACCCTATAAAGTTATCCATTTGGATTCACTGTTTTGCACATTCATTTCCCTATTCATATCTCCATGCAGAACTCTTCTGCCTGTTTCTTTTTCTGGTTTTTCTT
CCTTTTTGCTGCAACTTTCTGCCATGGCAGTGACTTGACAACAGTTGAGGTTGTTGGTGTTGGAGAATGTGCAGACTGCCACAAGAATAACATTAAAATCAGCAATGCCT
TCTCAGGGCTTCGTGTAAGCATTGACTGCAAACAGAAAGATGGAAAGCTTGAAAGAAAAGGTGTTGCAGAGCTTGATAAAGAAGGAAAGTTCAAAGTGTTGCTTCCAACT
GAGGCTTTGAAAGATGGAAAGTTGAAGGGGAAATGCTTTGCACAACTTCATAGTGCTTCTTCTACTCCTTGTCCATCTCATGATGCCTTGGAATCTTCCATGGTTGTGTT
AAAATCCAAAGGTGAGGGAAAACAAACCTTTGGTTTGAGTGGTGGGCTCAAGTTCCAGAGTGCAACTTGTGTATCTGCTTTCTTTTGGCATTACTACCATCATCCTCCTT
TGCCTCCTATCTCTTTCCCTCCCCATCCTCCATATGTGCTCCCTCCTTACCATTACAAGTTCTTCCCTCCTAAGGTCAACCCTCCACCCACCCCAGAAGAACCGAAACCA
CCCCCAGTCCCCGAAATCCCTCCGCCGGTTTACAAGCCTCCCCCGGTCTACGAGAAGCCTCCACCGGTCGAGAAAAAGCCTCCACCGGTCGAGAAAAAGCCTCCGCCAGT
CGAGAAAAAGCCTCCTCCTGTCTACGAGAAGCCTCCACCGGTTGAGAAAAAGCCTCCTCCTGTCTACGAGAAGCCTCCACCGGTTGAGAAAAAGCCTCCTCCTGTCTACG
AGAAGCCTCCACCGGTCAAGAAAAAGCCTCCTCCTGCCTACGAGAAGCCTCCCCCAGTCTACGAGAAGCCTCCACCGGTCTACAAAAAGCCTCCCCCGGTCTATGAGAAG
CCTCCACCGGTCTACGAAAAGCCTCCGCCGGTTTACAAGCCCCCAACACCAGTGTATAAGCATCCATGGTACAAGGTTCTTCCTCCCATTTCAAAGCCTCCTCCATGTAC
ACCAGTTCCTAAAGTCATCCCTACTGTTCCTCCAAAGTACTTCTCTCACCCCAAATTTGGCAAGAAGTTCCCTCCTCTGCCTCCTTCAGCTCCACACCATTAAGAAAAAT
TTGTCTCCAAGAAATTAGTTAATGTGAATTGATCAAGATGAATGCTATCCTATATTGCTATACCTATATATTTTTACATATATATTTGAACTATGTCATTTGTAAGTAAG
AAAATTGCTTGGTCTCATATTTTATATATTATCTTCAAACTTTGTTGTTGATGTAATAAAGTCTTCAAGTGGAGATCACAAGAAGAAACAAATTTAGAGGGAAAGAGAGG
GGATCAGAGATTATATTATGGTGTGTCTAGAATTTGTTTTTGTATTGTAAGAGTTACTTGGAATTTGATGGTTAAGTTTGTTGTTGTAATGTTTCATTGGGATTTTTTAG
TTTAATTGCTCAAGTTGATCCATTGATTGAATATATATCTAATGGTTTTTAGAGCTATAATTAATGTATTTAATATGTTTTTGGTTGTGAA
Protein sequenceShow/hide protein sequence
MQNSSACFFFWFFFLFAATFCHGSDLTTVEVVGVGECADCHKNNIKISNAFSGLRVSIDCKQKDGKLERKGVAELDKEGKFKVLLPTEALKDGKLKGKCFAQLHSASSTP
CPSHDALESSMVVLKSKGEGKQTFGLSGGLKFQSATCVSAFFWHYYHHPPLPPISFPPHPPYVLPPYHYKFFPPKVNPPPTPEEPKPPPVPEIPPPVYKPPPVYEKPPPV
EKKPPPVEKKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVEKKPPPVYEKPPPVKKKPPPAYEKPPPVYEKPPPVYKKPPPVYEKPPPVYEKPPPVYKPPTPVYKHPW
YKVLPPISKPPPCTPVPKVIPTVPPKYFSHPKFGKKFPPLPPSAPHH