CuGenDBv2

Lag0037828 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0037828
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr2:9585325..9587949
RNA-Seq Expression: Lag0037828
Synteny: Lag0037828
Gene Ontology terms: NA
InterPro domains: NA
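
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits that string and computes the length of the locus. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper name `parse_location` is hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr2:9585325..9587949")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the locus spans 2625 bp; with a half-open convention the `+ 1` would be dropped.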


Homology
GenBank top hits | e value | % identity
XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida] | 2.4e-245 | 76.69
Query:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA
        M+PYSE+ LTEEVLHLH+LWRRGPPRNPKP HNHSST   A A+RNPSNKRP DPK+R NKKKKPR +P QDSGPEWPCPEP+QNQPSTSSGWP IEP A
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA

Query:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T  A PVSSEER N AALQLQYKG +ACRGFF R ADSGSDEE EEEE     NG MMESEEYKFFLKLFVENDELRGYYEKN E G FCCLVCGGM K+
Subjt:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE
        K GK+FKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLA+SG+ KVQPEENHVAKEHDS VQ+ENVAIS D+ +++NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE

Query:  VVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN
        VV +D  +Q LEEE+T EDPTSN KDLISG+N+   K NDV +QAEN DNS+ GM ES +EM+N     L V ESILKACKEF AAF TS SD+DVSENN
Subjt:  VVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN

Query:  LMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV
        L+DG+GVEEREEFKFF KLF ENESLRRYYENNYDDGEFFCL C GAG+KMLKSFKTCGRLLQHTTSL K K  KKPV KPHIAKM+KMK++AHRA S V
Subjt:  LMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV

Query:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKPDDPVEDDSAKINKVQDESIVDAVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDG
        ICKVLGWD+EKLPAVVLKGEPLG SLTK      QD    N  D+  EDDS KINK+Q+ES+ +AV + DD+ ED+STKVNQLQ +S GNA+GNMNDLDG
Subjt:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKPDDPVEDDSAKINKVQDESIVDAVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDG

Query:  VKENE
        VKE +
Subjt:  VKENE

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida] | 2.2e-246 | 76.78
Query:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA
        M+PYSE+ LTEEVLHLH+LWRRGPPRNPKP HNHSST   A A+RNPSNKRP DPK+R NKKKKPR +P QDSGPEWPCPEP+QNQPSTSSGWP IEP A
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA

Query:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T  A PVSSEER N AALQLQYKG +ACRGFF R ADSGSDEE EEEE     NG MMESEEYKFFLKLFVENDELRGYYEKN E G FCCLVCGGM K+
Subjt:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE
        K GK+FKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLA+SG+ KVQPEENHVAKEHDS VQ+ENVAIS D+ +++NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE

Query:  VVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN
        VV +D  +Q LEEE+T EDPTSN KDLISG+N+   K NDV +QAEN DNS+ GM ES +EM+N     L V ESILKACKEF AAF TS SD+DVSENN
Subjt:  VVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN

Query:  LMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV
        L+DG+GVEEREEFKFF KLF ENESLRRYYENNYDDGEFFCL C GAG+KMLKSFKTCGRLLQHTTSL K K  KKPV KPHIAKM+KMK++AHRA S V
Subjt:  LMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV

Query:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDT---NKPDDPVEDDSAKINKVQDESIVDAVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDGVK
        ICKVLGWD+EKLPAVVLKGEPLG SLTK    +D    N  D+  EDDS KINK+Q+ES+ +AV + DD+ ED+STKVNQLQ +S GNA+GNMNDLDGVK
Subjt:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDT---NKPDDPVEDDSAKINKVQDESIVDAVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDGVK

Query:  ENE
        E +
Subjt:  ENE

XP_038899320.1 uncharacterized protein LOC120086655 isoform X3 [Benincasa hispida] | 8.6e-243 | 76.36
Query:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA
        M+PYSE+ LTEEVLHLH+LWRRGPPRNPKP HNHSST   A A+RNPSNKRP DPK+R NKKKKPR +P QDSGPEWPCPEP+QNQPSTSSGWP IEP A
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA

Query:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T  A PVSSEER N AALQLQYKG +ACRGFF R ADSGSDEE EEEE     NG MMESEEYKFFLKLFVENDELRGYYEKN E G FCCLVCGGM K+
Subjt:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE
        K GK+FKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLA+SG+ K  PEENHVAKEHDS VQ+ENVAIS D+ +++NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE

Query:  VVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN
        VV +D  +Q LEEE+T EDPTSN KDLISG+N+   K NDV +QAEN DNS+ GM ES +EM+N     L V ESILKACKEF AAF TS SD+DVSENN
Subjt:  VVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMEN-----LHVSESILKACKEFFAAFSTSTSDDDVSENN

Query:  LMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV
        L+DG+GVEEREEFKFF KLF ENESLRRYYENNYDDGEFFCL C GAG+KMLKSFKTCGRLLQHTTSL K K  KKPV KPHIAKM+KMK++AHRA S V
Subjt:  LMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIV

Query:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKPDDPVEDDSAKINKVQDESIVDAVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDG
        ICKVLGWD+EKLPAVVLKGEPLG SLTK      QD    N  D+  EDDS KINK+Q+ES+ +AV + DD+ ED+STKVNQLQ +S GNA+GNMNDLDG
Subjt:  ICKVLGWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKPDDPVEDDSAKINKVQDESIVDAVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDG

Query:  VKENE
        VKE +
Subjt:  VKENE

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida] | 5.8e-247 | 77.33
Query:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA
        M+PYSE+ LTEEVLHLH+LWRRGPPRNPKP HNHSST   A A+RNPSNKRP DPK+R NKKKKPR +P QDSGPEWPCPEP+QNQPSTSSGWP IEP A
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA

Query:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T  A PVSSEER N AALQLQYKG +ACRGFF R ADSGSDEE EEEE     NG MMESEEYKFFLKLFVENDELRGYYEKN E G FCCLVCGGM K+
Subjt:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE
        K GK+FKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLA+SG+ KVQPEENHVAKEHDS VQ+ENVAIS D+ +++NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE

Query:  VVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMDGD
        VV +D  +Q LEEE+T EDPTSN KDLISG+N+   K NDV +QAEN DNS+ GM ES +EM+NL V ESILKACKEF AAF TS SD+DVSENNL+DG+
Subjt:  VVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMDGD

Query:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIVICKVL
        GVEEREEFKFF KLF ENESLRRYYENNYDDGEFFCL C GAG+KMLKSFKTCGRLLQHTTSL K K  KKPV KPHIAKM+KMK++AHRA S VICKVL
Subjt:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIVICKVL

Query:  GWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKPDDPVEDDSAKINKVQDESIVDAVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDGVKENE
        GWD+EKLPAVVLKGEPLG SLTK      QD    N  D+  EDDS KINK+Q+ES+ +AV + DD+ ED+STKVNQLQ +S GNA+GNMNDLDGVKE +
Subjt:  GWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKPDDPVEDDSAKINKVQDESIVDAVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDGVKENE

XP_038899322.1 uncharacterized protein LOC120086655 isoform X5 [Benincasa hispida] | 1.5e-231 | 74
Query:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA
        M+PYSE+ LTEEVLHLH+LWRRGPPRNPKP HNHSST   A A+RNPSNKRP DPK+R NKKKKPR +P QDSGPEWPCPEP+QNQPSTSSGWP IEP A
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSST---AVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA

Query:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T  A PVSSEER N AALQLQYKG +ACRGFF R ADSGSDEE EEEE     NG MMESEEYKFFLKLFVENDELRGYYEKN E G FCCLVCGGM K+
Subjt:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE
        K GK+FKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPL RSLA+SG+ KVQPEENHVAKEHDS VQ+ENVAIS D+ +++NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE

Query:  VVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMDGD
        VV +D  +Q LEEE+T EDPTSN KDLISG+                                   V ESILKACKEF AAF TS SD+DVSENNL+DG+
Subjt:  VVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMDGD

Query:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIVICKVL
        GVEEREEFKFF KLF ENESLRRYYENNYDDGEFFCL C GAG+KMLKSFKTCGRLLQHTTSL K K  KKPV KPHIAKM+KMK++AHRA S VICKVL
Subjt:  GVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPV-KPHIAKMMKMKILAHRAYSIVICKVL

Query:  GWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKPDDPVEDDSAKINKVQDESIVDAVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDGVKENE
        GWD+EKLPAVVLKGEPLG SLTK      QD    N  D+  EDDS KINK+Q+ES+ +AV + DD+ ED+STKVNQLQ +S GNA+GNMNDLDGVKE +
Subjt:  GWDMEKLPAVVLKGEPLGHSLTKP--GVPQDT---NKPDDPVEDDSAKINKVQDESIVDAVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDGVKENE

TrEMBL top hits | e value | % identity
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X1 | 4.4e-192 | 66.84
Query:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA
        M+PYS++ LT+EVL+LHSLW RGPPRNPKPTH+HSSTAVA  NPSNKRP DP  RKN   KKKKPR DPPQDSGPEWPCPEP+QNQPSTSSGWP I+P A
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA

Query:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T  AQ VSSEER+N AALQLQYKG +ACR FF R ADSGSDEEEEEEEE++G    MMES+EY FFLK+FVEN+ELR YYEKN E G FCCLVC GMGKK
Subjt:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE
        K GK+FKNC+ LVQHSISIS TKKKRAHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLANSGD KVQPEE HV                    D +NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE

Query:  V--VSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMD
        V  VSV+E+EQ LEE KT EDPTSN KDLISGEN+   K+ DV +Q ENADNSISGMGES  EM+NLHV  +IL+ACKEF AAF  S +DDDVSE     
Subjt:  V--VSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMD

Query:  GDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKV
         DG EEREEFKFF KLF ENE+LRRYYEN+Y DGEF CL CE AGRK +K FKTC RLLQH+T L K    K+  KP   K++KM +LAHRAY+ V+CKV
Subjt:  GDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKV

Query:  LGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKPDDPVEDDSAKINKVQDESIVDAVGSK-DDVAEDESTKVNQLQ
        LG D++ LPA+VL GE LG SLTK                  + ++K+QD+S V    S  DD+ ED+ST+VN+L+
Subjt:  LGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKPDDPVEDDSAKINKVQDESIVDAVGSK-DDVAEDESTKVNQLQ

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X2 | 9.8e-192 | 68.35
Query:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA
        M+PYS++ LT+EVL+LHSLW RGPPRNPKPTH+HSSTAVA  NPSNKRP DP  RKN   KKKKPR DPPQDSGPEWPCPEP+QNQPSTSSGWP I+P A
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA

Query:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T  AQ VSSEER+N AALQLQYKG +ACR FF R ADSGSDEEEEEEEE++G    MMES+EY FFLK+FVEN+ELR YYEKN E G FCCLVC GMGKK
Subjt:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE
        K GK+FKNC+ LVQHSISIS TKKKRAHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLANSGD KVQPEE HV                    D +NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE

Query:  V--VSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMD
        V  VSV+E+EQ LEE KT EDPTSN KDLISGEN+   K+ DV +Q ENADNSISGMGES  EM+NLHV  +IL+ACKEF AAF  S +DDDVSE     
Subjt:  V--VSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMD

Query:  GDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKV
         DG EEREEFKFF KLF ENE+LRRYYEN+Y DGEF CL CE AGRK +K FKTC RLLQH+T L K    K+  KP   K++KM +LAHRAY+ V+CKV
Subjt:  GDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKV

Query:  LGWDMEKLPAVVLKGEPLGHSLTKPGVPQD-------TNKPDDPVEDDSAKINKVQ
        LG D++ LPA+VL GE LG SLTK  V +D       ++  DD VEDDS ++N+++
Subjt:  LGWDMEKLPAVVLKGEPLGHSLTKPGVPQD-------TNKPDDPVEDDSAKINKVQ

A0A5D3DXE1 Uncharacterized protein | 1.4e-190 | 65.65
Query:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA
        M+PYS++ LT+EVL+LHSLW RGPPRNPKPTH+HSSTAVA  NPSNKRP DP  RKN   KKKKPR DPPQDSGPEWPCPEP+QNQPSTSSGWP I+P A
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKN---KKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSA

Query:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK
        T  AQ VSSEER+N AALQLQYKG +ACR FF R ADSGSDEEEEEEEE++G    MMES+EY FFLK+FVEN+ELR YYEKN E G FCCLVC GMGKK
Subjt:  TLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKK

Query:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE
        K GK+FKNC+ LVQHSISIS TKKKRAHRAFG VV RVFGWDIDRLPTIVLKGEPL RSLANSGD KVQPEE HV                    D +NE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENE

Query:  V--VSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMD
        V  VSV+E+EQ LEE KT EDPTSN KDLISGEN+   K+ DV +Q ENADNSISGMGES  EM+NLHV  +IL+ACKEF AAF  S +DDDVSE     
Subjt:  V--VSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMD

Query:  GDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKV
         DG EEREEFKFF KLF ENE+LRRYYEN+Y DGEF CL CE AGRK +K FKTC RLLQH+T L K    K+  KP   K++KM +LAHRAY+ V+CKV
Subjt:  GDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKV

Query:  LGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKPDDPVE---------------DDSAKINKVQDESIVDAVGSK-DDVAEDESTKVNQLQ
        LG D++ LPA+VL GE LG SLTK     D +K    ++               +++A + K+QD+S V    S  DD+ ED+ST+VN+L+
Subjt:  LGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKPDDPVE---------------DDSAKINKVQDESIVDAVGSK-DDVAEDESTKVNQLQ

A0A6J1CJP3 uncharacterized protein LOC111012232 isoform X2 | 1.7e-191 | 65.68
Query:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTA---VASRNPSNKRPRDPKDRKNKKKK----PRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTI
        M+PY E+ LTEEVLHLHSLWRRGPP+N K   NHS+ A   VA+R PSNKRP  P+  K KKKK    P PD PQ+SGPEWPCPEP+QNQPSTSSGWP I
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTA---VASRNPSNKRPRDPKDRKNKKKK----PRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTI

Query:  EPSATLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGG
        +P AT  AQPVSSEER   +ALQLQYK  +ACRGFF R ADSGS+ EEEEEEEEE N+GG+ + EEYKFFLK+FVEN EL  YYEKN E GSFCCLVCGG
Subjt:  EPSATLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGG

Query:  MGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDEND
        MGKKKSGKRFK+CVGLVQHSISISRTKKKRAHRAFG V+CRV GWD+DRLP IVLKGEPL RSLA+SG+ +VQPE+NHVAKE    V+S        END
Subjt:  MGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDEND

Query:  EENEVVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNL
        ++       +NE+ LEE+K  EDP SN K+  SGEN    KENDVNMQ EN DNSI GMG  K EM+NL V + I KACKEFFA FS STSD+      L
Subjt:  EENEVVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNL

Query:  MDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKK-PVKPHIAKMMKMKILAHRAYSIVI
         DGDG+EEREEFKFF KLF EN+ LR YYE+NY+DGEF CL CEGAG+K  K FKTCGRLLQH+TSLAK + G+  P     AKM+KMK LAHRAYS  +
Subjt:  MDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKK-PVKPHIAKMMKMKILAHRAYSIVI

Query:  CKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDT------NKPDDPVEDDSAKINKVQDESIV---DAVGSKDD
        CKVLGWD+E+LP+VVLKGEPLG SLTKPGV +D       +   DP+E+ S + +K++D+++    D VG+  D
Subjt:  CKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDT------NKPDDPVEDDSAKINKVQDESIV---DAVGSKDD

A0A6J1CM54 uncharacterized protein LOC111012232 isoform X1 | 1.4e-190 | 65
Query:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTA---VASRNPSNKRPRDPKDRKNKKKK----PRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTI
        M+PY E+ LTEEVLHLHSLWRRGPP+N K   NHS+ A   VA+R PSNKRP  P+  K KKKK    P PD PQ+SGPEWPCPEP+QNQPSTSSGWP I
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTA---VASRNPSNKRPRDPKDRKNKKKK----PRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTI

Query:  EPSATLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGG
        +P AT  AQPVSSEER   +ALQLQYK  +ACRGFF R ADSGS+ EEEEEEEEE N+GG+ + EEYKFFLK+FVEN EL  YYEKN E GSFCCLVCGG
Subjt:  EPSATLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGG

Query:  MGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDEND
        MGKKKSGKRFK+CVGLVQHSISISRTKKKRAHRAFG V+CRV GWD+DRLP IVLKGEPL RSLA+SG+ +VQPE+NHVAKE    V+S        END
Subjt:  MGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDEND

Query:  EENEVVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNL
        ++       +NE+ LEE+K  EDP SN K+  SGEN    KENDVNMQ EN DNSI GMG  K EM+NL V + I KACKEFFA FS STSD+      L
Subjt:  EENEVVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNL

Query:  MDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKK-PVKPHIAKMMKMKILAHRAYSIVI
         DGDG+EEREEFKFF KLF EN+ LR YYE+NY+DGEF CL CEGAG+K  K FKTCGRLLQH+TSLAK + G+  P     AKM+KMK LAHRAYS  +
Subjt:  MDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAGRKMLKSFKTCGRLLQHTTSLAKGKTGKK-PVKPHIAKMMKMKILAHRAYSIVI

Query:  CKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNK------------PDDPVEDDSAKINKVQDESIV---DAVGSKDD
        CKVLGWD+E+LP+VVLKGEPLG SLTKPGV + + K              DP+E+ S + +K++D+++    D VG+  D
Subjt:  CKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNK------------PDDPVEDDSAKINKVQDESIV---DAVGSKDD

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G78810.1 unknown protein | 4.2e-54 | 31.82
Query:  MNPYSEKLLTEEVLHLHSLWRRGPP-RNPKPTHNHS----------------------------STAVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSG
        MN Y ++ L +EV++LHSLW +GPP R P P+ N +                            +  + SRNP+N     P++  N  K+PRP    DSG
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPP-RNPKPTHNHS----------------------------STAVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSG

Query:  PEWPCPEPLQNQPSTSSGWPTIEPSATLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKAD------SGSDEEE--EEEEEEEGNNGGMMESEEYKFF
         EWP  + +   PST SGWP   P      +P+S+EE++  AA  LQ      CR FFGRK+       +G DE E  E +E++         S+E++F 
Subjt:  PEWPCPEPLQNQPSTSSGWPTIEPSATLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKAD------SGSDEEE--EEEEEEEGNNGGMMESEEYKFF

Query:  LKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDS
         ++F EN +L+ YYEKN+  G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD++                      
Subjt:  LKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDS

Query:  KVQPEENHVAKEHDSAVQSENVAISNDENDEENEVVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLH
                                        N VVS  ++ Q + E  +     S +            ++  V    E+A  ++  M ++ SE     
Subjt:  KVQPEENHVAKEHDSAVQSENVAISNDENDEENEVVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLH

Query:  VSESILKACKEFFAAFSTSTSD--DDVSENNLMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGA-GRKMLKSFKTCGRLLQHTTSL
               A K+ F    T  +D  ++  + NL         EE +   K+F+EN  L+ YYE NY+ G F CLVC  A  +KMLK FK C  ++QH T  
Subjt:  VSESILKACKEFFAAFSTSTSD--DDVSENNLMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGA-GRKMLKSFKTCGRLLQHTTSL

Query:  AKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLGWDMEKLPAVVLKG
                       K+ KMKI AH+ ++  +C++LGWD E LP  V+KG
Subjt:  AKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLGWDMEKLPAVVLKG

AT1G78810.2 unknown protein | 4.2e-54 | 31.82
Query:  MNPYSEKLLTEEVLHLHSLWRRGPP-RNPKPTHNHS----------------------------STAVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSG
        MN Y ++ L +EV++LHSLW +GPP R P P+ N +                            +  + SRNP+N     P++  N  K+PRP    DSG
Subjt:  MNPYSEKLLTEEVLHLHSLWRRGPP-RNPKPTHNHS----------------------------STAVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSG

Query:  PEWPCPEPLQNQPSTSSGWPTIEPSATLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKAD------SGSDEEE--EEEEEEEGNNGGMMESEEYKFF
         EWP  + +   PST SGWP   P      +P+S+EE++  AA  LQ      CR FFGRK+       +G DE E  E +E++         S+E++F 
Subjt:  PEWPCPEPLQNQPSTSSGWPTIEPSATLVAQPVSSEERQNHAALQLQYKGLEACRGFFGRKAD------SGSDEEE--EEEEEEEGNNGGMMESEEYKFF

Query:  LKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDS
         ++F EN +L+ YYEKN+  G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD++                      
Subjt:  LKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDS

Query:  KVQPEENHVAKEHDSAVQSENVAISNDENDEENEVVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLH
                                        N VVS  ++ Q + E  +     S +            ++  V    E+A  ++  M ++ SE     
Subjt:  KVQPEENHVAKEHDSAVQSENVAISNDENDEENEVVSVDENEQILEEEKTVEDPTSNVKDLISGENETIGKENDVNMQAENADNSISGMGESKSEMENLH

Query:  VSESILKACKEFFAAFSTSTSD--DDVSENNLMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGA-GRKMLKSFKTCGRLLQHTTSL
               A K+ F    T  +D  ++  + NL         EE +   K+F+EN  L+ YYE NY+ G F CLVC  A  +KMLK FK C  ++QH T  
Subjt:  VSESILKACKEFFAAFSTSTSD--DDVSENNLMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGA-GRKMLKSFKTCGRLLQHTTSL

Query:  AKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLGWDMEKLPAVVLKG
                       K+ KMKI AH+ ++  +C++LGWD E LP  V+KG
Subjt:  AKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLGWDMEKLPAVVLKG
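
In the alignment blocks above, the unlabelled line between Query and Subjt is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the snippet below counts identities in a single match-line segment. The helper name `midline_identity` is hypothetical, and the example uses only the short final segment of the first GenBank hit, so it is not expected to reproduce the whole-alignment % identity values listed in the tables.

```python
def midline_identity(midline: str, aligned_length: int) -> tuple[int, float]:
    """Count identical positions in a BLAST match line and return (count, percent)."""
    # Letters in the match line mark identities; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, 100.0 * identical / aligned_length


# Final segment of the first GenBank hit, copied from the alignment above.
query = "VKENE"
midline = "VKE +"
identical, percent = midline_identity(midline, len(query))
print(identical, round(percent, 1))
```

To estimate identity for a whole hit, the match-line segments of that hit would need to be concatenated and divided by the full alignment length, gaps included.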


Sequences
CDS sequence
ATGAATCCCTACTCCGAGAAACTACTCACCGAAGAGGTCCTCCATCTCCACTCTCTATGGCGCCGAGGCCCGCCGAGGAACCCTAAACCCACTCACAACCATTCATCCAC
CGCCGTCGCGAGTCGGAACCCCTCGAACAAGAGACCCAGAGACCCAAAGGATCGAAAGAACAAGAAGAAGAAACCACGCCCAGATCCACCGCAAGACTCCGGCCCCGAGT
GGCCCTGCCCGGAGCCGCTTCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGACGATCGAGCCCTCTGCCACTCTGGTGGCTCAGCCGGTATCGTCTGAAGAGCGGCAA
AATCATGCGGCGTTGCAATTGCAGTACAAGGGACTCGAGGCCTGCCGGGGATTTTTCGGTAGAAAGGCCGATTCGGGAAGTGACGAAGAGGAAGAGGAGGAGGAGGAAGA
AGAGGGGAATAATGGTGGGATGATGGAAAGTGAAGAGTATAAGTTCTTTTTGAAGCTGTTTGTGGAGAACGATGAACTTAGGGGTTACTACGAGAAGAATTCTGAAGGTG
GGTCGTTTTGTTGCTTGGTTTGTGGTGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTTAAGAATTGTGTTGGCCTCGTTCAACATTCGATTTCCATATCCAGGACAAAG
AAGAAGCGGGCTCATAGGGCTTTTGGACAGGTTGTATGCAGGGTTTTTGGATGGGATATTGATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCTCTTGGTCGATCATT
AGCCAATTCTGGAGACTCCAAGGTTCAGCCTGAGGAAAATCATGTGGCTAAAGAACATGACTCTGCGGTTCAGAGTGAAAATGTAGCCATTTCCAATGATGAAAATGACG
AGGAGAATGAAGTGGTCTCGGTGGATGAGAATGAACAGATATTGGAGGAAGAAAAGACAGTTGAAGATCCCACTTCTAATGTTAAAGATTTGATTTCTGGTGAGAATGAA
ACTATTGGCAAGGAGAATGATGTCAATATGCAAGCAGAAAATGCTGATAATTCAATTTCAGGCATGGGGGAAAGCAAATCCGAAATGGAAAACTTGCATGTGTCGGAGTC
GATTTTGAAAGCCTGTAAAGAATTTTTTGCAGCCTTCTCCACATCTACGAGTGACGATGATGTTAGTGAAAATAACTTAATGGATGGAGATGGAGTTGAGGAACGCGAAG
AGTTCAAGTTCTTTTTTAAGTTGTTCGCCGAGAATGAAAGCTTGAGAAGGTATTACGAGAACAACTATGATGATGGGGAATTTTTCTGTTTAGTTTGTGAAGGAGCAGGA
AGGAAAATGTTGAAGAGTTTTAAGACATGTGGCCGCCTTCTCCAGCATACAACATCTCTAGCGAAGGGGAAAACAGGAAAAAAACCAGTCAAGCCTCACATTGCTAAGAT
GATGAAAATGAAGATACTGGCTCATAGGGCATACAGTATAGTTATATGCAAGGTTCTTGGTTGGGACATGGAAAAGCTTCCTGCAGTCGTGTTAAAAGGCGAACCTCTCG
GTCATTCCTTAACAAAGCCGGGCGTGCCACAGGATACAAACAAGCCGGATGATCCTGTAGAAGATGACTCTGCAAAGATTAACAAAGTGCAGGATGAATCGATTGTTGAT
GCAGTTGGTAGTAAAGATGATGTCGCAGAAGATGAATCGACGAAGGTAAACCAATTGCAGAGTGAATCTGTTGGCAATGCAGTTGGTAATATGAATGATTTAGATGGTGT
AAAAGAAAATGAATGA
mRNA sequence
ATGAATCCCTACTCCGAGAAACTACTCACCGAAGAGGTCCTCCATCTCCACTCTCTATGGCGCCGAGGCCCGCCGAGGAACCCTAAACCCACTCACAACCATTCATCCAC
CGCCGTCGCGAGTCGGAACCCCTCGAACAAGAGACCCAGAGACCCAAAGGATCGAAAGAACAAGAAGAAGAAACCACGCCCAGATCCACCGCAAGACTCCGGCCCCGAGT
GGCCCTGCCCGGAGCCGCTTCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGACGATCGAGCCCTCTGCCACTCTGGTGGCTCAGCCGGTATCGTCTGAAGAGCGGCAA
AATCATGCGGCGTTGCAATTGCAGTACAAGGGACTCGAGGCCTGCCGGGGATTTTTCGGTAGAAAGGCCGATTCGGGAAGTGACGAAGAGGAAGAGGAGGAGGAGGAAGA
AGAGGGGAATAATGGTGGGATGATGGAAAGTGAAGAGTATAAGTTCTTTTTGAAGCTGTTTGTGGAGAACGATGAACTTAGGGGTTACTACGAGAAGAATTCTGAAGGTG
GGTCGTTTTGTTGCTTGGTTTGTGGTGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTTAAGAATTGTGTTGGCCTCGTTCAACATTCGATTTCCATATCCAGGACAAAG
AAGAAGCGGGCTCATAGGGCTTTTGGACAGGTTGTATGCAGGGTTTTTGGATGGGATATTGATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCTCTTGGTCGATCATT
AGCCAATTCTGGAGACTCCAAGGTTCAGCCTGAGGAAAATCATGTGGCTAAAGAACATGACTCTGCGGTTCAGAGTGAAAATGTAGCCATTTCCAATGATGAAAATGACG
AGGAGAATGAAGTGGTCTCGGTGGATGAGAATGAACAGATATTGGAGGAAGAAAAGACAGTTGAAGATCCCACTTCTAATGTTAAAGATTTGATTTCTGGTGAGAATGAA
ACTATTGGCAAGGAGAATGATGTCAATATGCAAGCAGAAAATGCTGATAATTCAATTTCAGGCATGGGGGAAAGCAAATCCGAAATGGAAAACTTGCATGTGTCGGAGTC
GATTTTGAAAGCCTGTAAAGAATTTTTTGCAGCCTTCTCCACATCTACGAGTGACGATGATGTTAGTGAAAATAACTTAATGGATGGAGATGGAGTTGAGGAACGCGAAG
AGTTCAAGTTCTTTTTTAAGTTGTTCGCCGAGAATGAAAGCTTGAGAAGGTATTACGAGAACAACTATGATGATGGGGAATTTTTCTGTTTAGTTTGTGAAGGAGCAGGA
AGGAAAATGTTGAAGAGTTTTAAGACATGTGGCCGCCTTCTCCAGCATACAACATCTCTAGCGAAGGGGAAAACAGGAAAAAAACCAGTCAAGCCTCACATTGCTAAGAT
GATGAAAATGAAGATACTGGCTCATAGGGCATACAGTATAGTTATATGCAAGGTTCTTGGTTGGGACATGGAAAAGCTTCCTGCAGTCGTGTTAAAAGGCGAACCTCTCG
GTCATTCCTTAACAAAGCCGGGCGTGCCACAGGATACAAACAAGCCGGATGATCCTGTAGAAGATGACTCTGCAAAGATTAACAAAGTGCAGGATGAATCGATTGTTGAT
GCAGTTGGTAGTAAAGATGATGTCGCAGAAGATGAATCGACGAAGGTAAACCAATTGCAGAGTGAATCTGTTGGCAATGCAGTTGGTAATATGAATGATTTAGATGGTGT
AAAAGAAAATGAATGA
Protein sequence
MNPYSEKLLTEEVLHLHSLWRRGPPRNPKPTHNHSSTAVASRNPSNKRPRDPKDRKNKKKKPRPDPPQDSGPEWPCPEPLQNQPSTSSGWPTIEPSATLVAQPVSSEERQ
NHAALQLQYKGLEACRGFFGRKADSGSDEEEEEEEEEEGNNGGMMESEEYKFFLKLFVENDELRGYYEKNSEGGSFCCLVCGGMGKKKSGKRFKNCVGLVQHSISISRTK
KKRAHRAFGQVVCRVFGWDIDRLPTIVLKGEPLGRSLANSGDSKVQPEENHVAKEHDSAVQSENVAISNDENDEENEVVSVDENEQILEEEKTVEDPTSNVKDLISGENE
TIGKENDVNMQAENADNSISGMGESKSEMENLHVSESILKACKEFFAAFSTSTSDDDVSENNLMDGDGVEEREEFKFFFKLFAENESLRRYYENNYDDGEFFCLVCEGAG
RKMLKSFKTCGRLLQHTTSLAKGKTGKKPVKPHIAKMMKMKILAHRAYSIVICKVLGWDMEKLPAVVLKGEPLGHSLTKPGVPQDTNKPDDPVEDDSAKINKVQDESIVD
AVGSKDDVAEDESTKVNQLQSESVGNAVGNMNDLDGVKENE
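
The CDS and protein sequences above can be cross-checked by translation. The following is a minimal sketch that assumes the standard nuclear genetic code, which this page does not state explicitly; `translate` and the table names are hypothetical helpers. It translates the first seven codons and the last four codons of the CDS, copied from the sequence above.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAATCCCTACTCCGAGAAA"  # first 21 nt of the CDS above
cds_end = "GAAAATGAATGA"             # last 12 nt of the CDS above
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to the first seven residues (MNPYSEK) and the last three residues (ENE) of the protein sequence, with the final TGA read as the stop codon.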