; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004545 (gene) of Chayote v1 genome

Gene IDSed0004545
OrganismSechium edule (Chayote v1)
DescriptionUnknown protein
Genome locationLG04:37577687..37580934
RNA-Seq ExpressionSed0004545
SyntenySed0004545
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida]1.7e-21372.08Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKP----------TAADPILSNKRPRDPKTPKNKKNKP--NPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA
        M+PYSEE+L EEV HLH+LWRRGPPRNPKP           AA+   SNKRP DPK   NKK KP   P  DSGPEWPCPEP+QNQPSTSSGW PIEP A
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKP----------TAADPILSNKRPRDPKTPKNKKNKP--NPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA

Query:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS
        TP AHPVSS ER  LAAL LQY+  +ACRGFFARNADSGSDE+ +E++   +GEM+ESEEYKFF KLFVEN+ELR +YEKN E G FCCLVCGGM K+K 
Subjt:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS

Query:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENH------DGVQTENVCILNDVNDKKNEVV
        GK+FKNC GLVQHSISIS TKKK AHRAFG VVCRVFGWDIDRLPTIVLKGEPL RSLA S +LKVQ EENH       GVQ ENV I  D  +KKNEVV
Subjt:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENH------DGVQTENVCILNDVNDKKNEVV

Query:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELEN-----LHVPESILKACKEFCEAFSTSMSDDDVSENNLI
          D  + KLEEE+TAEDP SN+KDLIS +N+DACK NDV +  EN DNS+ GMEESNAE++N     L VPESILKACKEFC AF TSMSD+DVSENNLI
Subjt:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELEN-----LHVPESILKACKEFCEAFSTSMSDDDVSENNLI

Query:  DGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVIC
        DG+GVEEREEFKFF KLFTENESLRRYYEN YDDGEFFCLAC G GKKMLKSFKTCGRLLQHTTSL KNK    PVQKP IAK+LK+K + H A S VIC
Subjt:  DGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVIC

Query:  RVLGWDIEKLPAVVLKGEPLGRSLTKP--AVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL
        +VLGWDIEKLPAVVLKGEPLGRSLTK   A L+DE VGN+VDNT       EDDS KIN ++ E++
Subjt:  RVLGWDIEKLPAVVLKGEPLGRSLTKP--AVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida]4.5e-21472.16Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKP----------TAADPILSNKRPRDPKTPKNKKNKP--NPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA
        M+PYSEE+L EEV HLH+LWRRGPPRNPKP           AA+   SNKRP DPK   NKK KP   P  DSGPEWPCPEP+QNQPSTSSGW PIEP A
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKP----------TAADPILSNKRPRDPKTPKNKKNKP--NPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA

Query:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS
        TP AHPVSS ER  LAAL LQY+  +ACRGFFARNADSGSDE+ +E++   +GEM+ESEEYKFF KLFVEN+ELR +YEKN E G FCCLVCGGM K+K 
Subjt:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS

Query:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENH------DGVQTENVCILNDVNDKKNEVV
        GK+FKNC GLVQHSISIS TKKK AHRAFG VVCRVFGWDIDRLPTIVLKGEPL RSLA S +LKVQ EENH       GVQ ENV I  D  +KKNEVV
Subjt:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENH------DGVQTENVCILNDVNDKKNEVV

Query:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELEN-----LHVPESILKACKEFCEAFSTSMSDDDVSENNLI
          D  + KLEEE+TAEDP SN+KDLIS +N+DACK NDV +  EN DNS+ GMEESNAE++N     L VPESILKACKEFC AF TSMSD+DVSENNLI
Subjt:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELEN-----LHVPESILKACKEFCEAFSTSMSDDDVSENNLI

Query:  DGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVIC
        DG+GVEEREEFKFF KLFTENESLRRYYEN YDDGEFFCLAC G GKKMLKSFKTCGRLLQHTTSL KNK    PVQKP IAK+LK+K + H A S VIC
Subjt:  DGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVIC

Query:  RVLGWDIEKLPAVVLKGEPLGRSLTKPAVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL
        +VLGWDIEKLPAVVLKGEPLGRSLTK    KDE VGN+VDNT       EDDS KIN ++ E++
Subjt:  RVLGWDIEKLPAVVLKGEPLGRSLTKPAVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL

XP_038899320.1 uncharacterized protein LOC120086655 isoform X3 [Benincasa hispida]6.1e-21171.73Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKP----------TAADPILSNKRPRDPKTPKNKKNKP--NPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA
        M+PYSEE+L EEV HLH+LWRRGPPRNPKP           AA+   SNKRP DPK   NKK KP   P  DSGPEWPCPEP+QNQPSTSSGW PIEP A
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKP----------TAADPILSNKRPRDPKTPKNKKNKP--NPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA

Query:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS
        TP AHPVSS ER  LAAL LQY+  +ACRGFFARNADSGSDE+ +E++   +GEM+ESEEYKFF KLFVEN+ELR +YEKN E G FCCLVCGGM K+K 
Subjt:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS

Query:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENH------DGVQTENVCILNDVNDKKNEVV
        GK+FKNC GLVQHSISIS TKKK AHRAFG VVCRVFGWDIDRLPTIVLKGEPL RSLA S +LK   EENH       GVQ ENV I  D  +KKNEVV
Subjt:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENH------DGVQTENVCILNDVNDKKNEVV

Query:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELEN-----LHVPESILKACKEFCEAFSTSMSDDDVSENNLI
          D  + KLEEE+TAEDP SN+KDLIS +N+DACK NDV +  EN DNS+ GMEESNAE++N     L VPESILKACKEFC AF TSMSD+DVSENNLI
Subjt:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELEN-----LHVPESILKACKEFCEAFSTSMSDDDVSENNLI

Query:  DGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVIC
        DG+GVEEREEFKFF KLFTENESLRRYYEN YDDGEFFCLAC G GKKMLKSFKTCGRLLQHTTSL KNK    PVQKP IAK+LK+K + H A S VIC
Subjt:  DGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVIC

Query:  RVLGWDIEKLPAVVLKGEPLGRSLTKP--AVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL
        +VLGWDIEKLPAVVLKGEPLGRSLTK   A L+DE VGN+VDNT       EDDS KIN ++ E++
Subjt:  RVLGWDIEKLPAVVLKGEPLGRSLTKP--AVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida]4.1e-21572.73Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKP----------TAADPILSNKRPRDPKTPKNKKNKP--NPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA
        M+PYSEE+L EEV HLH+LWRRGPPRNPKP           AA+   SNKRP DPK   NKK KP   P  DSGPEWPCPEP+QNQPSTSSGW PIEP A
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKP----------TAADPILSNKRPRDPKTPKNKKNKP--NPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA

Query:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS
        TP AHPVSS ER  LAAL LQY+  +ACRGFFARNADSGSDE+ +E++   +GEM+ESEEYKFF KLFVEN+ELR +YEKN E G FCCLVCGGM K+K 
Subjt:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS

Query:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENH------DGVQTENVCILNDVNDKKNEVV
        GK+FKNC GLVQHSISIS TKKK AHRAFG VVCRVFGWDIDRLPTIVLKGEPL RSLA S +LKVQ EENH       GVQ ENV I  D  +KKNEVV
Subjt:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENH------DGVQTENVCILNDVNDKKNEVV

Query:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGV
          D  + KLEEE+TAEDP SN+KDLIS +N+DACK NDV +  EN DNS+ GMEESNAE++NL VPESILKACKEFC AF TSMSD+DVSENNLIDG+GV
Subjt:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGV

Query:  EEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGW
        EEREEFKFF KLFTENESLRRYYEN YDDGEFFCLAC G GKKMLKSFKTCGRLLQHTTSL KNK    PVQKP IAK+LK+K + H A S VIC+VLGW
Subjt:  EEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGW

Query:  DIEKLPAVVLKGEPLGRSLTKP--AVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL
        DIEKLPAVVLKGEPLGRSLTK   A L+DE VGN+VDNT       EDDS KIN ++ E++
Subjt:  DIEKLPAVVLKGEPLGRSLTKP--AVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL

XP_038899322.1 uncharacterized protein LOC120086655 isoform X5 [Benincasa hispida]1.1e-19668.63Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKP----------TAADPILSNKRPRDPKTPKNKKNKP--NPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA
        M+PYSEE+L EEV HLH+LWRRGPPRNPKP           AA+   SNKRP DPK   NKK KP   P  DSGPEWPCPEP+QNQPSTSSGW PIEP A
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKP----------TAADPILSNKRPRDPKTPKNKKNKP--NPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA

Query:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS
        TP AHPVSS ER  LAAL LQY+  +ACRGFFARNADSGSDE+ +E++   +GEM+ESEEYKFF KLFVEN+ELR +YEKN E G FCCLVCGGM K+K 
Subjt:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS

Query:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENH------DGVQTENVCILNDVNDKKNEVV
        GK+FKNC GLVQHSISIS TKKK AHRAFG VVCRVFGWDIDRLPTIVLKGEPL RSLA S +LKVQ EENH       GVQ ENV I  D  +KKNEVV
Subjt:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENH------DGVQTENVCILNDVNDKKNEVV

Query:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGV
          D  + KLEEE+TAEDP SN+KDLIS +                                   VPESILKACKEFC AF TSMSD+DVSENNLIDG+GV
Subjt:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGV

Query:  EEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGW
        EEREEFKFF KLFTENESLRRYYEN YDDGEFFCLAC G GKKMLKSFKTCGRLLQHTTSL KNK    PVQKP IAK+LK+K + H A S VIC+VLGW
Subjt:  EEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGW

Query:  DIEKLPAVVLKGEPLGRSLTKP--AVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL
        DIEKLPAVVLKGEPLGRSLTK   A L+DE VGN+VDNT       EDDS KIN ++ E++
Subjt:  DIEKLPAVVLKGEPLGRSLTKP--AVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL

TrEMBL top hitse value%identityAlignment
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X15.4e-17363.54Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKPT-------AADPILSNKRPRDP-----KTPKNKKNKPNPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA
        M+PYS+E+L +EV +LHSLW RGPPRNPKPT        ADP  SNKRP DP     K  K KK + +PP DSGPEWPCPEP+QNQPSTSSGW PI+P A
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKPT-------AADPILSNKRPRDP-----KTPKNKKNKPNPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA

Query:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS
        TP A  VSS ER+ LAAL LQY+  +ACR FFARNADSGSDE+E+E++E  DGEM+ES+EY FF K+FVENEELR +YEKN E G FCCLVC GMGKKK 
Subjt:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS

Query:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEV--VSTDE
        GK+FKNC  LVQHSISIS TKKK AHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLA S DLKVQ EE H               D KNEV  VS +E
Subjt:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEV--VSTDE

Query:  NEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGVEERE
        +E KLEE KTAEDP SN+KDLIS EN+DA K  DV +  ENADNS+SGM ESN E++NLHV  +IL+ACKEF  AF  SM+DDDVSE      DG EERE
Subjt:  NEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGVEERE

Query:  EFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKN---KSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGWD
        EFKFF KLFTENE+LRRYYEN Y DGEF CLAC+  G+K +K FKTC RLLQH+T L KN   K G    QKP+  K+LK+  L H AY+ V+C+VLG D
Subjt:  EFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKN---KSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGWD

Query:  IEKLPAVVLKGEPLGRSLTKPAVLKDEPVGNAVDNTNESVVPVEDDSAKINYLK
        I+ LPA+VL GE LG SLTK  V K +   +    ++ +   VEDDS ++N L+
Subjt:  IEKLPAVVLKGEPLGRSLTKPAVLKDEPVGNAVDNTNESVVPVEDDSAKINYLK

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X22.4e-17363.72Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKPT-------AADPILSNKRPRDP-----KTPKNKKNKPNPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA
        M+PYS+E+L +EV +LHSLW RGPPRNPKPT        ADP  SNKRP DP     K  K KK + +PP DSGPEWPCPEP+QNQPSTSSGW PI+P A
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKPT-------AADPILSNKRPRDP-----KTPKNKKNKPNPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA

Query:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS
        TP A  VSS ER+ LAAL LQY+  +ACR FFARNADSGSDE+E+E++E  DGEM+ES+EY FF K+FVENEELR +YEKN E G FCCLVC GMGKKK 
Subjt:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS

Query:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEV--VSTDE
        GK+FKNC  LVQHSISIS TKKK AHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLA S DLKVQ EE H               D KNEV  VS +E
Subjt:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEV--VSTDE

Query:  NEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGVEERE
        +E KLEE KTAEDP SN+KDLIS EN+DA K  DV +  ENADNS+SGM ESN E++NLHV  +IL+ACKEF  AF  SM+DDDVSE      DG EERE
Subjt:  NEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGVEERE

Query:  EFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKN---KSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGWD
        EFKFF KLFTENE+LRRYYEN Y DGEF CLAC+  G+K +K FKTC RLLQH+T L KN   K G    QKP+  K+LK+  L H AY+ V+C+VLG D
Subjt:  EFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKN---KSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGWD

Query:  IEKLPAVVLKGEPLGRSLTKPAVLKDEPVGNAVDNTNESVVPVEDDSAKINYLK
        I+ LPA+VL GE LG SLTK  V KD+       +  + +  VEDDS ++N L+
Subjt:  IEKLPAVVLKGEPLGRSLTKPAVLKDEPVGNAVDNTNESVVPVEDDSAKINYLK

A0A5D3DXE1 Uncharacterized protein6.7e-17165.71Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKPT-------AADPILSNKRPRDP-----KTPKNKKNKPNPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA
        M+PYS+E+L +EV +LHSLW RGPPRNPKPT        ADP  SNKRP DP     K  K KK + +PP DSGPEWPCPEP+QNQPSTSSGW PI+P A
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPKPT-------AADPILSNKRPRDP-----KTPKNKKNKPNPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCA

Query:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS
        TP A  VSS ER+ LAAL LQY+  +ACR FFARNADSGSDE+E+E++E  DGEM+ES+EY FF K+FVENEELR +YEKN E G FCCLVC GMGKKK 
Subjt:  TPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKS

Query:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEV--VSTDE
        GK+FKNC  LVQHSISIS TKKK AHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLA S DLKVQ EE H               D KNEV  VS +E
Subjt:  GKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEV--VSTDE

Query:  NEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGVEERE
        +E KLEE KTAEDP SN+KDLIS EN+DA K  DV +  ENADNS+SGM ESN E++NLHV  +IL+ACKEF  AF  SM+DDDVSE      DG EERE
Subjt:  NEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGVEERE

Query:  EFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKN---KSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGWD
        EFKFF KLFTENE+LRRYYEN Y DGEF CLAC+  G+K +K FKTC RLLQH+T L KN   K G    QKP+  K+LK+  L H AY+ V+C+VLG D
Subjt:  EFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKN---KSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGWD

Query:  IEKLPAVVLKGEPLGRSLTKPAVLK
        I+ LPA+VL GE LG SLTK  V K
Subjt:  IEKLPAVVLKGEPLGRSLTKPAVLK

A0A6J1CJP3 uncharacterized protein LOC111012232 isoform X22.1e-17260.86Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPK----------PTAADPILSNKRPRDPKTPKNKKNK------PNPPHDSGPEWPCPEPLQNQPSTSSGWRPI
        M+PY E +L EEV HLHSLWRRGPP+N K             A+ I SNKRP  P+ PK KK K      P+ P +SGPEWPCPEP+QNQPSTSSGW  I
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPK----------PTAADPILSNKRPRDPKTPKNKKNK------PNPPHDSGPEWPCPEPLQNQPSTSSGWRPI

Query:  EPCATPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGS--DEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGG
        +PCATP A PVSS ER  L+AL LQY+  +ACRGFFARNADSGS  +E+E+E++E  DG + + EEYKFF K+FVEN EL ++YEKN E GSFCCLVCGG
Subjt:  EPCATPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGS--DEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGG

Query:  MGKKKSGKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEVV
        MGKKKSGKRFK+C GLVQHSISIS TKKK AHRAFGLV+CRV GWD+DRLP IVLKGEPL RSLA S + +VQ E+NH  V  E  C +   ND +    
Subjt:  MGKKKSGKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEVV

Query:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGV
           +NE KLEE+K AEDP SNAK+  S EN + CK+NDVNM  EN DNS+ GM     E++NL V + I KACKEF   FS S SD+      L DGDG+
Subjt:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGV

Query:  EEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGW
        EEREEFKFF KLFTEN+ LR YYE+ Y+DGEF CLAC+G GKK  K FKTCGRLLQH+TSL KN+ G         AK+LK+K L H AYS  +C+VLGW
Subjt:  EEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGW

Query:  DIEKLPAVVLKGEPLGRSLTKPAVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNE
        D+E+LP+VVLKGEPLGRSLTKP V KDE +GN   N + S  P+E+ S + + L+++
Subjt:  DIEKLPAVVLKGEPLGRSLTKPAVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNE

A0A6J1CM54 uncharacterized protein LOC111012232 isoform X13.0e-17160.18Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPK----------PTAADPILSNKRPRDPKTPKNKKNK------PNPPHDSGPEWPCPEPLQNQPSTSSGWRPI
        M+PY E +L EEV HLHSLWRRGPP+N K             A+ I SNKRP  P+ PK KK K      P+ P +SGPEWPCPEP+QNQPSTSSGW  I
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPPRNPK----------PTAADPILSNKRPRDPKTPKNKKNK------PNPPHDSGPEWPCPEPLQNQPSTSSGWRPI

Query:  EPCATPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGS--DEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGG
        +PCATP A PVSS ER  L+AL LQY+  +ACRGFFARNADSGS  +E+E+E++E  DG + + EEYKFF K+FVEN EL ++YEKN E GSFCCLVCGG
Subjt:  EPCATPTAHPVSSVEREILAALHLQYRAVEACRGFFARNADSGS--DEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGG

Query:  MGKKKSGKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEVV
        MGKKKSGKRFK+C GLVQHSISIS TKKK AHRAFGLV+CRV GWD+DRLP IVLKGEPL RSLA S + +VQ E+NH  V  E  C +   ND +    
Subjt:  MGKKKSGKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEVV

Query:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGV
           +NE KLEE+K AEDP SNAK+  S EN + CK+NDVNM  EN DNS+ GM     E++NL V + I KACKEF   FS S SD+      L DGDG+
Subjt:  STDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGV

Query:  EEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGW
        EEREEFKFF KLFTEN+ LR YYE+ Y+DGEF CLAC+G GKK  K FKTCGRLLQH+TSL KN+ G         AK+LK+K L H AYS  +C+VLGW
Subjt:  EEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGW

Query:  DIEKLPAVVLKGEPLGRSLTKPAVLKDEPV-GNAVDNTNESVV--PVEDDSAKINYLKNE
        D+E+LP+VVLKGEPLGRSLTKP V K  P   + + N N S+   P+E+ S + + L+++
Subjt:  DIEKLPAVVLKGEPLGRSLTKPAVLKDEPV-GNAVDNTNESVV--PVEDDSAKINYLKNE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G78810.1 unknown protein2.0e-5031.23Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPP-----------------RNPKPT--------------AADPILSNKRPRDPKTPKNKKNKPNPPHDSGPEWPCPE
        MN Y +E L +EV +LHSLW +GPP                 + P+P               A  P + ++ P +P+   N   +P P  DSG EWP  +
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPP-----------------RNPKPT--------------AADPILSNKRPRDPKTPKNKKNKPNPPHDSGPEWPCPE

Query:  PLQNQPSTSSGWRPIEPCATPTAHPVSSVEREILAALHLQYRAVEACRGFFARNA----DSGSDEDEDEDDEGKDGEMIE------SEEYKFFFKLFVEN
         +   PST SGW    PC      P+S+ E+E LAA  LQ      CR FF R +     S +  DE E DEG + + +E      S+E++F  ++F EN
Subjt:  PLQNQPSTSSGWRPIEPCATPTAHPVSSVEREILAALHLQYRAVEACRGFFARNA----DSGSDEDEDEDDEGKDGEMIE------SEEYKFFFKLFVEN

Query:  EELRSHYEKNSEDGSFCCLVCGGMGKKKSGKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEEN
         +L+ +YEKN+ +G F CLVCGG+G +KS ++FK+C  L+QHS++I  T  K  HRA   VVC V GWD++                             
Subjt:  EELRSHYEKNSEDGSFCCLVCGGMGKKKSGKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEEN

Query:  HDGVQTENVCILNDVNDKKNEVVSTDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFC
                           N VVS+ ++   + E   A +P S++K  I  E      K  V    E+A  ++  M+++ +E            A K+  
Subjt:  HDGVQTENVCILNDVNDKKNEVVSTDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFC

Query:  EAFSTSMSD--DDVSENNLIDGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCL-ACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKP
            T  +D  ++  + NL         EE +   K+F+EN  L+ YYE  Y+ G F CL  C  T KKMLK FK C  ++QH T               
Subjt:  EAFSTSMSD--DDVSENNLIDGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCL-ACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKP

Query:  RIAKLLKIKALTHCAYSLVICRVLGWDIEKLPAVVLKG
           K+ K+K   H  ++  +C +LGWD E LP  V+KG
Subjt:  RIAKLLKIKALTHCAYSLVICRVLGWDIEKLPAVVLKG

AT1G78810.2 unknown protein2.0e-5031.23Show/hide
Query:  MNPYSEEKLAEEVRHLHSLWRRGPP-----------------RNPKPT--------------AADPILSNKRPRDPKTPKNKKNKPNPPHDSGPEWPCPE
        MN Y +E L +EV +LHSLW +GPP                 + P+P               A  P + ++ P +P+   N   +P P  DSG EWP  +
Subjt:  MNPYSEEKLAEEVRHLHSLWRRGPP-----------------RNPKPT--------------AADPILSNKRPRDPKTPKNKKNKPNPPHDSGPEWPCPE

Query:  PLQNQPSTSSGWRPIEPCATPTAHPVSSVEREILAALHLQYRAVEACRGFFARNA----DSGSDEDEDEDDEGKDGEMIE------SEEYKFFFKLFVEN
         +   PST SGW    PC      P+S+ E+E LAA  LQ      CR FF R +     S +  DE E DEG + + +E      S+E++F  ++F EN
Subjt:  PLQNQPSTSSGWRPIEPCATPTAHPVSSVEREILAALHLQYRAVEACRGFFARNA----DSGSDEDEDEDDEGKDGEMIE------SEEYKFFFKLFVEN

Query:  EELRSHYEKNSEDGSFCCLVCGGMGKKKSGKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEEN
         +L+ +YEKN+ +G F CLVCGG+G +KS ++FK+C  L+QHS++I  T  K  HRA   VVC V GWD++                             
Subjt:  EELRSHYEKNSEDGSFCCLVCGGMGKKKSGKRFKNCFGLVQHSISISNTKKKHAHRAFGLVVCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEEN

Query:  HDGVQTENVCILNDVNDKKNEVVSTDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFC
                           N VVS+ ++   + E   A +P S++K  I  E      K  V    E+A  ++  M+++ +E            A K+  
Subjt:  HDGVQTENVCILNDVNDKKNEVVSTDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADNSLSGMEESNAELENLHVPESILKACKEFC

Query:  EAFSTSMSD--DDVSENNLIDGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCL-ACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKP
            T  +D  ++  + NL         EE +   K+F+EN  L+ YYE  Y+ G F CL  C  T KKMLK FK C  ++QH T               
Subjt:  EAFSTSMSD--DDVSENNLIDGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCL-ACQGTGKKMLKSFKTCGRLLQHTTSLVKNKSGIVPVQKP

Query:  RIAKLLKIKALTHCAYSLVICRVLGWDIEKLPAVVLKG
           K+ K+K   H  ++  +C +LGWD E LP  V+KG
Subjt:  RIAKLLKIKALTHCAYSLVICRVLGWDIEKLPAVVLKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCTTACTCCGAGGAAAAACTCGCCGAAGAGGTCCGCCATCTCCACTCTCTATGGCGTCGAGGCCCACCCAGAAACCCTAAACCCACTGCCGCGGATCCAATTCT
CTCCAACAAGAGACCTAGAGACCCCAAGACTCCAAAGAACAAGAAGAACAAACCAAACCCACCACATGACTCCGGCCCCGAGTGGCCCTGCCCCGAGCCGCTTCAAAATC
AGCCCTCCACCTCATCTGGGTGGCGGCCGATCGAGCCCTGCGCCACTCCCACGGCTCACCCCGTGTCGTCTGTAGAGCGGGAGATTCTCGCGGCGCTGCATTTGCAGTAC
AGGGCAGTCGAGGCCTGCCGGGGATTCTTCGCTAGAAATGCCGATTCAGGGAGTGACGAGGACGAGGACGAGGACGACGAGGGTAAGGATGGGGAAATGATTGAAAGTGA
AGAGTATAAGTTCTTTTTTAAGCTTTTTGTGGAGAATGAGGAACTTCGGAGTCATTACGAGAAGAATTCTGAAGATGGGTCGTTTTGTTGTTTGGTTTGTGGTGGAATGG
GGAAAAAGAAATCTGGGAAAAGGTTCAAGAATTGTTTTGGGCTTGTTCAGCATTCGATTTCGATATCGAACACCAAGAAGAAGCACGCTCACAGGGCTTTTGGATTGGTC
GTGTGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCCACGATTGTGTTGAAGGGCGAGCCTCTTGGTCGATCATTAGCCGGTTCTAGAGACTTGAAGGTTCAGCATGA
GGAAAATCATGACGGGGTTCAAACTGAAAATGTATGCATTTTGAATGATGTAAATGATAAGAAGAATGAAGTGGTTTCTACGGATGAGAATGAACATAAATTAGAGGAAG
AAAAGACAGCTGAAGATCCCATTTCTAATGCTAAAGATTTGATTTCTAGCGAGAATGAAGATGCTTGCAAGAAGAATGATGTCAATATGCCAACAGAAAATGCTGATAAT
TCACTTTCAGGCATGGAAGAAAGCAATGCAGAACTGGAAAACTTGCATGTACCTGAGTCAATTTTGAAAGCGTGCAAAGAATTTTGTGAAGCATTCTCCACATCTATGAG
TGACGATGATGTTAGTGAGAATAATTTAATCGATGGCGATGGAGTCGAGGAACGTGAAGAGTTCAAGTTCTTTTTTAAGTTGTTCACCGAGAACGAAAGCTTGAGAAGAT
ATTACGAGAACAAGTATGATGATGGAGAATTTTTCTGTTTAGCTTGTCAAGGAACAGGAAAGAAAATGTTGAAGAGTTTCAAGACATGTGGCCGCCTTCTCCAGCATACA
ACTTCTCTAGTAAAGAACAAATCAGGGATAGTACCAGTCCAGAAGCCTCGTATTGCTAAATTGTTGAAAATAAAGGCGTTGACTCATTGTGCATACAGTTTAGTCATATG
CAGGGTTCTTGGTTGGGACATCGAAAAACTTCCCGCAGTCGTGTTAAAAGGCGAACCTCTTGGTCGTTCCTTAACAAAGCCAGCTGTGTTGAAGGATGAACCTGTTGGTA
ATGCAGTGGATAATACGAATGAATCGGTTGTTCCGGTAGAAGATGACTCGGCAAAGATCAACTACTTGAAGAACGAGAATCTATGA
mRNA sequenceShow/hide mRNA sequence
AAAGAAAAAAAAACCGTGAAGAAAAAGGGGGAAAATTGCCTCCCGGTGATGTGAAGCAAAAGCGATGGTAGCTGAATAGGCAAAACCAGAATCCGACCACCATTGCTCTG
TTTCTCGATTCCGCCATTTTCGCCGTCAATGAATCCTTACTCCGAGGAAAAACTCGCCGAAGAGGTCCGCCATCTCCACTCTCTATGGCGTCGAGGCCCACCCAGAAACC
CTAAACCCACTGCCGCGGATCCAATTCTCTCCAACAAGAGACCTAGAGACCCCAAGACTCCAAAGAACAAGAAGAACAAACCAAACCCACCACATGACTCCGGCCCCGAG
TGGCCCTGCCCCGAGCCGCTTCAAAATCAGCCCTCCACCTCATCTGGGTGGCGGCCGATCGAGCCCTGCGCCACTCCCACGGCTCACCCCGTGTCGTCTGTAGAGCGGGA
GATTCTCGCGGCGCTGCATTTGCAGTACAGGGCAGTCGAGGCCTGCCGGGGATTCTTCGCTAGAAATGCCGATTCAGGGAGTGACGAGGACGAGGACGAGGACGACGAGG
GTAAGGATGGGGAAATGATTGAAAGTGAAGAGTATAAGTTCTTTTTTAAGCTTTTTGTGGAGAATGAGGAACTTCGGAGTCATTACGAGAAGAATTCTGAAGATGGGTCG
TTTTGTTGTTTGGTTTGTGGTGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTCAAGAATTGTTTTGGGCTTGTTCAGCATTCGATTTCGATATCGAACACCAAGAAGAA
GCACGCTCACAGGGCTTTTGGATTGGTCGTGTGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCCACGATTGTGTTGAAGGGCGAGCCTCTTGGTCGATCATTAGCCG
GTTCTAGAGACTTGAAGGTTCAGCATGAGGAAAATCATGACGGGGTTCAAACTGAAAATGTATGCATTTTGAATGATGTAAATGATAAGAAGAATGAAGTGGTTTCTACG
GATGAGAATGAACATAAATTAGAGGAAGAAAAGACAGCTGAAGATCCCATTTCTAATGCTAAAGATTTGATTTCTAGCGAGAATGAAGATGCTTGCAAGAAGAATGATGT
CAATATGCCAACAGAAAATGCTGATAATTCACTTTCAGGCATGGAAGAAAGCAATGCAGAACTGGAAAACTTGCATGTACCTGAGTCAATTTTGAAAGCGTGCAAAGAAT
TTTGTGAAGCATTCTCCACATCTATGAGTGACGATGATGTTAGTGAGAATAATTTAATCGATGGCGATGGAGTCGAGGAACGTGAAGAGTTCAAGTTCTTTTTTAAGTTG
TTCACCGAGAACGAAAGCTTGAGAAGATATTACGAGAACAAGTATGATGATGGAGAATTTTTCTGTTTAGCTTGTCAAGGAACAGGAAAGAAAATGTTGAAGAGTTTCAA
GACATGTGGCCGCCTTCTCCAGCATACAACTTCTCTAGTAAAGAACAAATCAGGGATAGTACCAGTCCAGAAGCCTCGTATTGCTAAATTGTTGAAAATAAAGGCGTTGA
CTCATTGTGCATACAGTTTAGTCATATGCAGGGTTCTTGGTTGGGACATCGAAAAACTTCCCGCAGTCGTGTTAAAAGGCGAACCTCTTGGTCGTTCCTTAACAAAGCCA
GCTGTGTTGAAGGATGAACCTGTTGGTAATGCAGTGGATAATACGAATGAATCGGTTGTTCCGGTAGAAGATGACTCGGCAAAGATCAACTACTTGAAGAACGAGAATCT
ATGAAGCAGTGTTGAAGGATGATGTGAAATGAGGGTCTTCAACTTCATAGCAGTTGACCAAATATTAGATAGAACTTTAGTAGGCAGTAGCTTTAGTAGTAAAATCTGGC
TTCAACAATGAGGGAATCTATGTTTTTTTTCTTCTCTTAATCTTTGTGACATCTTGTTTCG
Protein sequenceShow/hide protein sequence
MNPYSEEKLAEEVRHLHSLWRRGPPRNPKPTAADPILSNKRPRDPKTPKNKKNKPNPPHDSGPEWPCPEPLQNQPSTSSGWRPIEPCATPTAHPVSSVEREILAALHLQY
RAVEACRGFFARNADSGSDEDEDEDDEGKDGEMIESEEYKFFFKLFVENEELRSHYEKNSEDGSFCCLVCGGMGKKKSGKRFKNCFGLVQHSISISNTKKKHAHRAFGLV
VCRVFGWDIDRLPTIVLKGEPLGRSLAGSRDLKVQHEENHDGVQTENVCILNDVNDKKNEVVSTDENEHKLEEEKTAEDPISNAKDLISSENEDACKKNDVNMPTENADN
SLSGMEESNAELENLHVPESILKACKEFCEAFSTSMSDDDVSENNLIDGDGVEEREEFKFFFKLFTENESLRRYYENKYDDGEFFCLACQGTGKKMLKSFKTCGRLLQHT
TSLVKNKSGIVPVQKPRIAKLLKIKALTHCAYSLVICRVLGWDIEKLPAVVLKGEPLGRSLTKPAVLKDEPVGNAVDNTNESVVPVEDDSAKINYLKNENL