; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001464 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001464
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionproline-rich protein 4-like
Genome locationchr4:31640777..31641935
RNA-Seq ExpressionLag0001464
SyntenyLag0001464
Gene Ontology termsNA
InterPro domainsIPR003882 - Pistil-specific extensin-like protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588996.1 Proline-rich protein 2, partial [Cucurbita argyrosperma subsp. sororia]2.6e-13876.39Show/hide
Query:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF
        MQ+P VC FFWFFFLFAAT CHGSDLTTVEVVGVGECADCHKNNIKT+HAF+GLRVS++CK KDG FERKG AEL+E+GKFKVLLP E L+DGKLKGKCF
Subjt:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP
        AQLHSASS PC S DGLESSMIV KS G GK TFGLSG LKF+S TCVSAFFWHY HHPPLPPI   VFPPHPPLFGHPY  PPYHHKFFPP     PP 
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP

Query:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV--PIYKKPL--PPPVYKPKPPVYK----------------PKPPVYKPKPPVVYKPKPP
          K P    PPPVPE PPPV EKPPPVYEKPPP Y KPPPV  P  KKP    PPVY+PKPPV +                PKPPVY PKPP VY PKPP
Subjt:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV--PIYKKPL--PPPVYKPKPPVYK----------------PKPPVYKPKPPVVYKPKPP

Query:  VVYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP
         VY PKPPVY PKPPVY  KPPTPVYKHP+YKILPPISKLPPCPPVPKVIPV+PPKY SHPKFGKKFPPL PP+PHP
Subjt:  VVYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP

KAG7015262.1 Translocon-associated protein subunit alpha [Cucurbita argyrosperma subsp. argyrosperma]2.0e-13873.85Show/hide
Query:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF
        MQ+P VC FFWFFFLFAAT CHGSDLTTVEVVGVGECADCHKNNIKT+HAF+GLRVS++CK KDG FERKG AEL+E+GKFKVLLP E L+DGKLKGKCF
Subjt:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP
        AQLHSASS PC S DGLESSMIV KS G GK TFGLSG LKF+S TCVSAFFWHY HHPPLPPI   VFPPHPPLFGHPY  PPYHHKFFPP     PP 
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP

Query:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV--PIYKKPL--PPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPV---------------
          K P    PPPVPE PPPV EKPPPVYEKPPP Y KPPPV  P  KKP    PPVY+PKPPV +PKPPV +PKPP VY+PKPPV               
Subjt:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV--PIYKKPL--PPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPV---------------

Query:  --------------VYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP
                      VY PKPPVY PKPPVY  KPPTPVYKHP+YKILPPISKLPPCPPVPKVIPV+PPKY SHPKFGKKFPPL PP+PHP
Subjt:  --------------VYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP

XP_022989049.1 repetitive proline-rich cell wall protein 2-like isoform X1 [Cucurbita maxima]2.1e-13568.97Show/hide
Query:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF
        MQ P VC FFWFFFLFA T CHGSDLTTVEVVGVGECADCHKNNIKT+HAF+GLRVSI+CK KDG FERKG AEL+E+GKFKVLLPTE L+DGKLKGKCF
Subjt:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP
        A LHSASS PC S DGLESSMIV KS G GK TFGL G LKF+S TCVSAFFWHY HHPPLPPI   VFPPHPPLFGHPY  PPYHHKFFPP     PP 
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP

Query:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV-----PIYKKPLP-------------------PPVYKPKPPVYK---PKPPVYKPKPPV
          K P    PPPVPE PPPV EKPPPVYEKPPP Y KPPPV     P+Y+KP P                   PPVY+PKPPV +   PKPPVY+PKPPV
Subjt:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV-----PIYKKPLP-------------------PPVYKPKPPVYK---PKPPVYKPKPPV

Query:  -----------------------------VYKPKPPV------VYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYF
                                     VY PKPPV      VY PKPPVY PKPPVY  KPP PVYKHP+YKILPPISKLPPCPPVPKVIPV+PPKY 
Subjt:  -----------------------------VYKPKPPV------VYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYF

Query:  SHPKFGKKFPPLPPPIPHP
        SHPKFGKKFPPL PP+PHP
Subjt:  SHPKFGKKFPPLPPPIPHP

XP_022989050.1 proline-rich protein 4-like isoform X2 [Cucurbita maxima]6.4e-13772.36Show/hide
Query:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF
        MQ P VC FFWFFFLFA T CHGSDLTTVEVVGVGECADCHKNNIKT+HAF+GLRVSI+CK KDG FERKG AEL+E+GKFKVLLPTE L+DGKLKGKCF
Subjt:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP
        A LHSASS PC S DGLESSMIV KS G GK TFGL G LKF+S TCVSAFFWHY HHPPLPPI   VFPPHPPLFGHPY  PPYHHKFFPP     PP 
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP

Query:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV-----PIYKKPLP-------------PPVYK-----------------------PKPPV
          K P    PPPVPE PPPV EKPPPVYEKPPP Y KPPPV     P+Y+KP P             PPVY+                       PKPPV
Subjt:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV-----PIYKKPLP-------------PPVYK-----------------------PKPPV

Query:  YKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP
        Y PKPPVY PKPP VY PKPP VY PKPPVY PKPPVY  KPP PVYKHP+YKILPPISKLPPCPPVPKVIPV+PPKY SHPKFGKKFPPL PP+PHP
Subjt:  YKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP

XP_038888565.1 proline-rich protein 4-like [Benincasa hispida]2.1e-13577.31Show/hide
Query:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF
        M NP    F  FF L A+  CHG+DLTTVEVVGVGECADCHKNNIKT+HAFSGL VSI+CKQKDG+ ERKGVA+LDE+GKFKVLLPTEVL+DGKLKGKCF
Subjt:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLE--SSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFW-HYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVY-P
        AQLHSASS PCPSHDGLE  +SMIVFKSKGEGKQTFGL  GLKFKS TCVSAFFW HY+HHPPLPPIS PVFPPHPPL+ HPYL PPYHHK FPPKVY P
Subjt:  AQLHSASSAPCPSHDGLE--SSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFW-HYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVY-P

Query:  PPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPP--PVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKP
        PPT + P+PPPVP+KPPP YEKPPPVYEKPPP Y KPPPV  Y KP+PP  PVYK KP    PKPPVYKPKPPV    KPP   KP+            P
Subjt:  PPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPP--PVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKP

Query:  PTPV-YKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP
        P+PV YKHPWYKILPPISKLPPCPPVPKVIPVVPPKY SHPKFGKKFPPL P +PHP
Subjt:  PTPV-YKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP

TrEMBL top hitse value%identityAlignment
A0A6J1EKD0 proline-rich protein 4-like isoform X24.2e-13477.03Show/hide
Query:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF
        MQ+P VC  FWFFFLFAAT CHGSDLTTVEVVGVGECADCHKNNIKT+HAF+GLRVS++CK KDG FERKG AEL+E+GKFKVLLP E L+DGKLKGKCF
Subjt:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP
        AQLHSASS PC S DGLESSMIV KS G GK TFGLSG LKF+S TCVSAFFWHY HHPPLPPI   VFPPHPPLFGHPY  PPYHHKFFPP     PP 
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP

Query:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVY--K
          K P    PPPVPE PPPV EKPPPVYEKPPP Y KPPPV    KP P    KPKPPVY+PKPPV +PKPP   KP  P V +PKP    PKPPVY  K
Subjt:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVY--K

Query:  PPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP
        PPTPVYKHP+YKILPPISKLPPC PVPKVIPV+PPKY SHPKFGKKFPPL PP+PHP
Subjt:  PPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP

A0A6J1ERR1 proline-rich protein 4-like isoform X14.2e-13476.52Show/hide
Query:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF
        MQ+P VC  FWFFFLFAAT CHGSDLTTVEVVGVGECADCHKNNIKT+HAF+GLRVS++CK KDG FERKG AEL+E+GKFKVLLP E L+DGKLKGKCF
Subjt:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP
        AQLHSASS PC S DGLESSMIV KS G GK TFGLSG LKF+S TCVSAFFWHY HHPPLPPI   VFPPHPPLFGHPY  PPYHHKFFPP     PP 
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP

Query:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPV--VYKPKPPVYKPKPP---
          K P    PPPVPE PPPV EKPPPVYEKPPP Y KPPPV    KP P    KPKPPVY+PKPPV +PKPP    PKPPV     PKPPV +PKPP   
Subjt:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPV--VYKPKPPVYKPKPP---

Query:  --VYKPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP
          V KPPTPVYKHP+YKILPPISKLPPC PVPKVIPV+PPKY SHPKFGKKFPPL PP+PHP
Subjt:  --VYKPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP

A0A6J1JIY3 proline-rich protein 4-like isoform X31.3e-13573.83Show/hide
Query:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF
        MQ P VC FFWFFFLFA T CHGSDLTTVEVVGVGECADCHKNNIKT+HAF+GLRVSI+CK KDG FERKG AEL+E+GKFKVLLPTE L+DGKLKGKCF
Subjt:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP
        A LHSASS PC S DGLESSMIV KS G GK TFGL G LKF+S TCVSAFFWHY HHPPLPPI   VFPPHPPLFGHPY  PPYHHKFFPP     PP 
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP

Query:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV-----PIYKKPLPPPVY--------------KPKPPVYKPKPPVYKPKP--PVVYKPKP
          K P    PPPVPE PPPV EKPPPVYEKPPP Y KPPPV     P+Y+K  PPPVY              KPKPPVY+PKPPV +PKP  P VY+PKP
Subjt:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV-----PIYKKPLPPPVY--------------KPKPPVYKPKPPVYKPKP--PVVYKPKP

Query:  PV-----VYKPKPPVYK---PKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP
        PV        PKPPV +   PKPPVY  KPP PVYKHP+YKILPPISKLPPCPPVPKVIPV+PPKY SHPKFGKKFPPL PP+PHP
Subjt:  PV-----VYKPKPPVYK---PKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP

A0A6J1JLA8 proline-rich protein 4-like isoform X23.1e-13772.36Show/hide
Query:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF
        MQ P VC FFWFFFLFA T CHGSDLTTVEVVGVGECADCHKNNIKT+HAF+GLRVSI+CK KDG FERKG AEL+E+GKFKVLLPTE L+DGKLKGKCF
Subjt:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP
        A LHSASS PC S DGLESSMIV KS G GK TFGL G LKF+S TCVSAFFWHY HHPPLPPI   VFPPHPPLFGHPY  PPYHHKFFPP     PP 
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP

Query:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV-----PIYKKPLP-------------PPVYK-----------------------PKPPV
          K P    PPPVPE PPPV EKPPPVYEKPPP Y KPPPV     P+Y+KP P             PPVY+                       PKPPV
Subjt:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV-----PIYKKPLP-------------PPVYK-----------------------PKPPV

Query:  YKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP
        Y PKPPVY PKPP VY PKPP VY PKPPVY PKPPVY  KPP PVYKHP+YKILPPISKLPPCPPVPKVIPV+PPKY SHPKFGKKFPPL PP+PHP
Subjt:  YKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP

A0A6J1JN98 repetitive proline-rich cell wall protein 2-like isoform X11.0e-13568.97Show/hide
Query:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF
        MQ P VC FFWFFFLFA T CHGSDLTTVEVVGVGECADCHKNNIKT+HAF+GLRVSI+CK KDG FERKG AEL+E+GKFKVLLPTE L+DGKLKGKCF
Subjt:  MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP
        A LHSASS PC S DGLESSMIV KS G GK TFGL G LKF+S TCVSAFFWHY HHPPLPPI   VFPPHPPLFGHPY  PPYHHKFFPP     PP 
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKV--YPPP

Query:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV-----PIYKKPLP-------------------PPVYKPKPPVYK---PKPPVYKPKPPV
          K P    PPPVPE PPPV EKPPPVYEKPPP Y KPPPV     P+Y+KP P                   PPVY+PKPPV +   PKPPVY+PKPPV
Subjt:  TSKEP---VPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPV-----PIYKKPLP-------------------PPVYKPKPPVYK---PKPPVYKPKPPV

Query:  -----------------------------VYKPKPPV------VYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYF
                                     VY PKPPV      VY PKPPVY PKPPVY  KPP PVYKHP+YKILPPISKLPPCPPVPKVIPV+PPKY 
Subjt:  -----------------------------VYKPKPPV------VYKPKPPVYKPKPPVY--KPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYF

Query:  SHPKFGKKFPPLPPPIPHP
        SHPKFGKKFPPL PP+PHP
Subjt:  SHPKFGKKFPPLPPPIPHP

SwissProt top hitse value%identityAlignment
P13983 Extensin2.8e-1038.74Show/hide
Query:  YHHHPP--LPPISFPVFPPHPPLFGHP-----------YLLPPYHHKFFPPKVYPPPTSKEPVPPPVPE-----KPPPVYEKPPPVYEKPPPDYVKPPPV
        Y   PP  LP  S P++ P PP++  P           YL PP      PP   PPP + E  PPP P        PP Y  PPP Y  PPP Y +PPP+
Subjt:  YHHHPP--LPPISFPVFPPHPPLFGHP-----------YLLPPYHHKFFPPKVYPPPTSKEPVPPPVPE-----KPPPVYEKPPPVYEKPPPDYVKPPPV

Query:  PIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYK--PPTPVYKHPWYKILPPISKLPP--CPPVPKVIPVVPPKYFS
        P    P PP    P PP Y P PP Y P PP   +P PP      PP Y P PP Y   PP+P+Y  P     P +  LPP   PP P+ I + PP +  
Subjt:  PIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYK--PPTPVYKHPWYKILPPISKLPP--CPPVPKVIPVVPPKYFS

Query:  ----HPKFGKKFPPLPPPIPHP
             P +G+  PP PP    P
Subjt:  ----HPKFGKKFPPLPPPIPHP

Q9SKP9 Proline-rich protein 22.5e-4343.18Show/hide
Query:  VCLFFWFFFLFAA--TLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKD--GDFERKGVAELDEDGKFKVLLPTEVL-EDGKLKGKCF
        +CL F F     A    C    +  VEV+G  E      + IK  +AFSGLRV+IECK  D  G F  +G  E+DE GKF + +P +++ +DG LK  C+
Subjt:  VCLFFWFFFLFAA--TLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKD--GDFERKGVAELDEDGKFKVLLPTEVL-EDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHP-----PLPPISFP-VFPPHPPLFGHPYLLPPYHHKFFPPKV
        A L SA   PCP+HDGLE+S IVF SK       GL   LKF    C+S FFWH    P      LPP++FP +  P PP++  P ++P           
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHP-----PLPPISFP-VFPPHPPLFGHPYLLPPYHHKFFPPKV

Query:  YPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKP
              K+P PP +  K  P+Y+ P P+Y+ P P Y   PPV I KKP PP ++K   P+YKP  P+Y  KPPVV  PK     K  PP++K   P+YK 
Subjt:  YPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKP

Query:  PTPVYKHPWYKILPPISKLP--PCPPVPKVIPVVPPKYFSHPKFGKKFPPLP
        P P+YK P +K  PP+  +P  PCPP+PK  P  PPKY  HPKFG K+PP P
Subjt:  PTPVYKHPWYKILPPISKLP--PCPPVPKVIPVVPPKYFSHPKFGKKFPPLP

Q9T0I5 Proline-rich protein 43.2e-4640.49Show/hide
Query:  CLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLED-GKLKGKCFAQLHS
        CL      L +ATL   S    VEVVG  E      + IKT HAFSGLRV+I+CK   G F  KG   +D+ GKF + +P +++ D G LK +C+AQLHS
Subjt:  CLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLED-GKLKGKCFAQLHS

Query:  ASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPL----------PPISFPVF--PPHPPLFGHPYLLPPYHHKFFPP-
        A+  PCP+HDGLES+ IVF SK   K   GL   LKF    CVS FFW     PP           PP+  P F   P PP +  P  +PP    + PP 
Subjt:  ASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPL----------PPISFPVF--PPHPPLFGHPYLLPPYHHKFFPP-

Query:  --------KVYPPPTSKEPVPPPVP-EKPPPVYEKPPPVYEKP----PPDYVKPPPVPIYK------KPLPPPVYKPKPPVYKP----------------
                 VY PP  KE VPPPVP  KPPP  E PPP+ +KP    PP    PPPVP+YK      KP P PVYKP P +  P                
Subjt:  --------KVYPPPTSKEPVPPPVP-EKPPPVYEKPPPVYEKP----PPDYVKPPPVPIYK------KPLPPPVYKPKPPVYKP----------------

Query:  ----------KPPVYKPKPPVVYKPKPPVVYKPKP---------------PVYKP-----KPPVY---------KPPTPVYKH-------------PWYK
                  KPP  KP PP    P P  V+KP P               PVYKP      PP+Y          PP P+YK              P YK
Subjt:  ----------KPPVYKPKPPVVYKPKPPVVYKPKP---------------PVYKP-----KPPVY---------KPPTPVYKH-------------PWYK

Query:  ILPPISKLP--PCPPVPKV-----IPVVPPKYFSHPKFGKKFPPLPPPIPHP
          PP+  +P  PCPP+P++      P +PPKY  HPKFG K+PPLP   PHP
Subjt:  ILPPISKLP--PCPPVPKV-----IPVVPPKYFSHPKFGKKFPPLPPPIPHP

Q9T0K5 Leucine-rich repeat extensin-like protein 33.7e-1045.69Show/hide
Query:  PLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKP
        PLPP S P  PP  P+F  P  L        PP   PPP    P PPP P  PPPVY  PPP    PPP    PPP P+Y  P PPP   P PPVY P P
Subjt:  PLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKP

Query:  PVYKPKPPVVYKPKPPVVYKPKPPVYK-PKPPVYK----PPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYF--SHPKFGKKFPPLPPPIPH
        P   P PP VY P PP    P PPVY  P PPVY     PP+P    P Y   PP    P  PP P+  P  P  Y+  S P      PP  PP PH
Subjt:  PVYKPKPPVVYKPKPPVVYKPKPPVYK-PKPPVYK----PPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYF--SHPKFGKKFPPLPPPIPH

Q9XIL9 Pollen-specific leucine-rich repeat extensin-like protein 32.4e-1746.43Show/hide
Query:  HHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYK
        H PP PP+  P  PP PP++  P   PP +    PP V+ PP      PPPV   PPPV+  PPPV+  PPP  V  PP P+Y  P PPPV+ P PPV+ 
Subjt:  HHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYK

Query:  PKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKPPTPVYK--HPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP
        P PPV+ P PPV   P PP V+ P PPV+ P PPV+ PP PVY    P Y   PP  K PP PPV    P++PPK  S        PP   P+  P
Subjt:  PKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKPPTPVYK--HPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP

Arabidopsis top hitse value%identityAlignment
AT2G15880.1 Leucine-rich repeat (LRR) family protein1.7e-1846.43Show/hide
Query:  HHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYK
        H PP PP+  P  PP PP++  P   PP +    PP V+ PP      PPPV   PPPV+  PPPV+  PPP  V  PP P+Y  P PPPV+ P PPV+ 
Subjt:  HHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYK

Query:  PKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKPPTPVYK--HPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP
        P PPV+ P PPV   P PP V+ P PPV+ P PPV+ PP PVY    P Y   PP  K PP PPV    P++PPK  S        PP   P+  P
Subjt:  PKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKPPTPVYK--HPWYKILPPISKLPPCPPVPKVIPVVPPKYFSHPKFGKKFPPLPPPIPHP

AT2G21140.1 proline-rich protein 21.8e-4443.18Show/hide
Query:  VCLFFWFFFLFAA--TLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKD--GDFERKGVAELDEDGKFKVLLPTEVL-EDGKLKGKCF
        +CL F F     A    C    +  VEV+G  E      + IK  +AFSGLRV+IECK  D  G F  +G  E+DE GKF + +P +++ +DG LK  C+
Subjt:  VCLFFWFFFLFAA--TLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKD--GDFERKGVAELDEDGKFKVLLPTEVL-EDGKLKGKCF

Query:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHP-----PLPPISFP-VFPPHPPLFGHPYLLPPYHHKFFPPKV
        A L SA   PCP+HDGLE+S IVF SK       GL   LKF    C+S FFWH    P      LPP++FP +  P PP++  P ++P           
Subjt:  AQLHSASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHP-----PLPPISFP-VFPPHPPLFGHPYLLPPYHHKFFPPKV

Query:  YPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKP
              K+P PP +  K  P+Y+ P P+Y+ P P Y   PPV I KKP PP ++K   P+YKP  P+Y  KPPVV  PK     K  PP++K   P+YK 
Subjt:  YPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKP

Query:  PTPVYKHPWYKILPPISKLP--PCPPVPKVIPVVPPKYFSHPKFGKKFPPLP
        P P+YK P +K  PP+  +P  PCPP+PK  P  PPKY  HPKFG K+PP P
Subjt:  PTPVYKHPWYKILPPISKLP--PCPPVPKVIPVVPPKYFSHPKFGKKFPPLP

AT3G19020.1 Leucine-rich repeat (LRR) family protein2.4e-1243.28Show/hide
Query:  PPLPPISFPVFPPHPPLFGHP---YLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVY
        P  PP+  P  PP PP+   P   +  PP  H   PP   PPP    P PPPV   PPPV+  PPPV+  PPP +  PPPV       PPPV+ P PPV 
Subjt:  PPLPPISFPVFPPHPPLFGHP---YLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVY

Query:  K-PKPPVYKPKPPV-VYKPKPPVVYKPKPPVYK-PKPPVYKPPTPVYKHPWYKILPPISKLPP---CPPVPKVIPVVPPKYFSHPKFGKKFPPL--PPPI
          P PPV+ P PP  +Y P PP V+ P PPV+  P PPV+ PP PV+  P     PP+   PP    PP P   P  P   +S P      PP+  PPP 
Subjt:  K-PKPPVYKPKPPV-VYKPKPPVVYKPKPPVYK-PKPPVYKPPTPVYKHPWYKILPPISKLPP---CPPVPKVIPVVPPKYFSHPKFGKKFPPL--PPPI

Query:  P
        P
Subjt:  P

AT4G13340.1 Leucine-rich repeat (LRR) family protein2.6e-1145.69Show/hide
Query:  PLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKP
        PLPP S P  PP  P+F  P  L        PP   PPP    P PPP P  PPPVY  PPP    PPP    PPP P+Y  P PPP   P PPVY P P
Subjt:  PLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPPPVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKP

Query:  PVYKPKPPVVYKPKPPVVYKPKPPVYK-PKPPVYK----PPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYF--SHPKFGKKFPPLPPPIPH
        P   P PP VY P PP    P PPVY  P PPVY     PP+P    P Y   PP    P  PP P+  P  P  Y+  S P      PP  PP PH
Subjt:  PVYKPKPPVVYKPKPPVVYKPKPPVYK-PKPPVYK----PPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKYF--SHPKFGKKFPPLPPPIPH

AT4G38770.1 proline-rich protein 42.3e-4740.49Show/hide
Query:  CLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLED-GKLKGKCFAQLHS
        CL      L +ATL   S    VEVVG  E      + IKT HAFSGLRV+I+CK   G F  KG   +D+ GKF + +P +++ D G LK +C+AQLHS
Subjt:  CLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLED-GKLKGKCFAQLHS

Query:  ASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPL----------PPISFPVF--PPHPPLFGHPYLLPPYHHKFFPP-
        A+  PCP+HDGLES+ IVF SK   K   GL   LKF    CVS FFW     PP           PP+  P F   P PP +  P  +PP    + PP 
Subjt:  ASSAPCPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPL----------PPISFPVF--PPHPPLFGHPYLLPPYHHKFFPP-

Query:  --------KVYPPPTSKEPVPPPVP-EKPPPVYEKPPPVYEKP----PPDYVKPPPVPIYK------KPLPPPVYKPKPPVYKP----------------
                 VY PP  KE VPPPVP  KPPP  E PPP+ +KP    PP    PPPVP+YK      KP P PVYKP P +  P                
Subjt:  --------KVYPPPTSKEPVPPPVP-EKPPPVYEKPPPVYEKP----PPDYVKPPPVPIYK------KPLPPPVYKPKPPVYKP----------------

Query:  ----------KPPVYKPKPPVVYKPKPPVVYKPKP---------------PVYKP-----KPPVY---------KPPTPVYKH-------------PWYK
                  KPP  KP PP    P P  V+KP P               PVYKP      PP+Y          PP P+YK              P YK
Subjt:  ----------KPPVYKPKPPVVYKPKPPVVYKPKP---------------PVYKP-----KPPVY---------KPPTPVYKH-------------PWYK

Query:  ILPPISKLP--PCPPVPKV-----IPVVPPKYFSHPKFGKKFPPLPPPIPHP
          PP+  +P  PCPP+P++      P +PPKY  HPKFG K+PPLP   PHP
Subjt:  ILPPISKLP--PCPPVPKV-----IPVVPPKYFSHPKFGKKFPPLPPPIPHP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAACCCTTGTGTCTGTCTCTTCTTCTGGTTCTTCTTCCTCTTTGCTGCAACTCTCTGCCATGGCAGTGACTTGACAACAGTTGAGGTTGTTGGGGTTGGAGAATG
TGCAGACTGCCACAAGAATAACATTAAAACTAGCCATGCCTTCTCAGGGCTTCGTGTAAGCATTGAATGCAAACAGAAAGATGGAGACTTTGAAAGAAAAGGGGTTGCAG
AGCTTGATGAAGATGGAAAGTTCAAAGTGTTGCTTCCAACTGAGGTCTTGGAAGATGGAAAGTTGAAGGGGAAGTGCTTTGCACAGCTTCACAGTGCTTCTTCTGCTCCT
TGCCCATCTCATGATGGCTTGGAATCCTCAATGATTGTGTTCAAATCCAAAGGTGAGGGAAAACAAACATTTGGTTTGTCTGGTGGGCTCAAGTTCAAGAGTGCAACTTG
TGTTTCTGCCTTCTTTTGGCATTATCATCATCACCCTCCTTTGCCTCCTATCTCCTTCCCTGTCTTCCCTCCTCATCCTCCATTGTTTGGCCATCCATATTTGCTCCCTC
CTTACCATCACAAGTTCTTCCCTCCTAAGGTCTACCCTCCACCCACCTCGAAAGAGCCGGTACCTCCGCCGGTCCCCGAAAAGCCTCCACCAGTTTACGAGAAGCCTCCG
CCAGTCTACGAAAAGCCTCCACCTGATTACGTAAAGCCTCCACCAGTACCAATCTACAAGAAGCCTCTTCCACCACCAGTTTACAAGCCCAAACCACCGGTTTACAAGCC
AAAACCACCGGTTTACAAGCCGAAACCACCGGTCGTTTACAAGCCGAAACCACCAGTCGTTTACAAGCCGAAACCGCCAGTGTACAAGCCGAAACCACCAGTTTACAAGC
CTCCAACACCAGTCTATAAGCATCCATGGTACAAGATTCTCCCTCCAATTTCAAAGCTTCCTCCATGTCCACCAGTTCCAAAGGTCATCCCTGTTGTCCCTCCAAAGTAC
TTCTCTCACCCCAAGTTTGGGAAGAAGTTTCCTCCTCTGCCTCCTCCTATTCCACATCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGAACCCTTGTGTCTGTCTCTTCTTCTGGTTCTTCTTCCTCTTTGCTGCAACTCTCTGCCATGGCAGTGACTTGACAACAGTTGAGGTTGTTGGGGTTGGAGAATG
TGCAGACTGCCACAAGAATAACATTAAAACTAGCCATGCCTTCTCAGGGCTTCGTGTAAGCATTGAATGCAAACAGAAAGATGGAGACTTTGAAAGAAAAGGGGTTGCAG
AGCTTGATGAAGATGGAAAGTTCAAAGTGTTGCTTCCAACTGAGGTCTTGGAAGATGGAAAGTTGAAGGGGAAGTGCTTTGCACAGCTTCACAGTGCTTCTTCTGCTCCT
TGCCCATCTCATGATGGCTTGGAATCCTCAATGATTGTGTTCAAATCCAAAGGTGAGGGAAAACAAACATTTGGTTTGTCTGGTGGGCTCAAGTTCAAGAGTGCAACTTG
TGTTTCTGCCTTCTTTTGGCATTATCATCATCACCCTCCTTTGCCTCCTATCTCCTTCCCTGTCTTCCCTCCTCATCCTCCATTGTTTGGCCATCCATATTTGCTCCCTC
CTTACCATCACAAGTTCTTCCCTCCTAAGGTCTACCCTCCACCCACCTCGAAAGAGCCGGTACCTCCGCCGGTCCCCGAAAAGCCTCCACCAGTTTACGAGAAGCCTCCG
CCAGTCTACGAAAAGCCTCCACCTGATTACGTAAAGCCTCCACCAGTACCAATCTACAAGAAGCCTCTTCCACCACCAGTTTACAAGCCCAAACCACCGGTTTACAAGCC
AAAACCACCGGTTTACAAGCCGAAACCACCGGTCGTTTACAAGCCGAAACCACCAGTCGTTTACAAGCCGAAACCGCCAGTGTACAAGCCGAAACCACCAGTTTACAAGC
CTCCAACACCAGTCTATAAGCATCCATGGTACAAGATTCTCCCTCCAATTTCAAAGCTTCCTCCATGTCCACCAGTTCCAAAGGTCATCCCTGTTGTCCCTCCAAAGTAC
TTCTCTCACCCCAAGTTTGGGAAGAAGTTTCCTCCTCTGCCTCCTCCTATTCCACATCCTTAG
Protein sequenceShow/hide protein sequence
MQNPCVCLFFWFFFLFAATLCHGSDLTTVEVVGVGECADCHKNNIKTSHAFSGLRVSIECKQKDGDFERKGVAELDEDGKFKVLLPTEVLEDGKLKGKCFAQLHSASSAP
CPSHDGLESSMIVFKSKGEGKQTFGLSGGLKFKSATCVSAFFWHYHHHPPLPPISFPVFPPHPPLFGHPYLLPPYHHKFFPPKVYPPPTSKEPVPPPVPEKPPPVYEKPP
PVYEKPPPDYVKPPPVPIYKKPLPPPVYKPKPPVYKPKPPVYKPKPPVVYKPKPPVVYKPKPPVYKPKPPVYKPPTPVYKHPWYKILPPISKLPPCPPVPKVIPVVPPKY
FSHPKFGKKFPPLPPPIPHP