; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS006272 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS006272
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTranscription factor, putative
Genome locationscaffold96:1787902..1789905
RNA-Seq ExpressionMS006272
SyntenyMS006272
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143880.1 uncharacterized protein LOC101212238 [Cucumis sativus]0.0e+0090.22Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT
        MGSSFQN NFAANYSL+V K LGKSFQDGKTGAEDSADTILRLDS   SVPC SIS GMKRKWSLVE S+G  GQSVGSSLSLGFVHSSSS DSKGSSGT
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT

Query:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC
        ACTRVSSAKETDEESSMALDLDFSL+LG+++VASPKEPA+K LKVQK+KPK DLELSLSTGPSESDVTSIYQ FPSLQLSMEKPLTF ET N DDGETSC
Subjt:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC

Query:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
        CWKPGT QP +PTSLNPQVGY+F PVTE +IPPANVPDLSSSVLTMPKSSVTCTSGIT  QQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
Subjt:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ

Query:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
        K GCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTD+CIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
Subjt:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ

Query:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
         PGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
Subjt:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK

Query:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPI-----LNKTDKLKEVVREDYMNED
        RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVS+GPI     L+KT+K+K +V EDYMNED
Subjt:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPI-----LNKTDKLKEVVREDYMNED

Query:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
        LIKVGGKV PNL+  A +EA+KP ++SVL PEGRVHGGSLLAMLACSSGLGS+SRNAVSGP+QP+E HIM RSWV
Subjt:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV

XP_008437353.1 PREDICTED: uncharacterized protein LOC103482798 [Cucumis melo]0.0e+0091.11Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT
        MGSSFQN NFAANYSL+V K LGKSFQDGKTGAEDSADTILRLDS   SVPC SIS GMKRKWSLVE S+G  GQSVGSSLSLGFVHSSSS DSKGSSGT
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT

Query:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC
        ACTRVSSAKETDEESSMALDLDFSL+LGN++VASPKEPA+K LKVQK+KPK DLELSLSTGPSESDVTSIYQ FPSLQLSMEKPL F ET N DDGETSC
Subjt:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC

Query:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
        CWKPGT QP MPTSLNPQVGY+F PVTE VIPPANVPDLSSSVLTMPKSSVTCTSGIT  QQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
Subjt:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ

Query:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
        KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTD+CIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
Subjt:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ

Query:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
         PGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
Subjt:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK

Query:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPI-----LNKTDKLKEVVREDYMNED
        RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVS+GPI     L+KT+K+K +V EDYMNED
Subjt:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPI-----LNKTDKLKEVVREDYMNED

Query:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
        LIKVGGKV PNL+Q A +EA+KP ++SVLVPEGRVHGGSLLAMLACSSGLGS+SRNAVSGP+QP+E HIMPRSWV
Subjt:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV

XP_022145917.1 uncharacterized protein LOC111015265 [Momordica charantia]0.0e+0099.55Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDSGSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGTAC
        MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDSGSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGTAC
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDSGSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGTAC

Query:  TRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSCCW
        TRVSSAKETDEESSMALDLDFSLHLGNE VASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTF ETLNIDDGETSCCW
Subjt:  TRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSCCW

Query:  KPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKP
        KPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKP
Subjt:  KPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKP

Query:  GCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAP
        GCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAP
Subjt:  GCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAP

Query:  GCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRC
        GCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRC
Subjt:  GCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRC

Query:  KFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPILNKTDKLKEVVREDYMNEDLIKVGGK
        KFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPILNKTDKLKEVVREDYMNEDLIKVGGK
Subjt:  KFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPILNKTDKLKEVVREDYMNEDLIKVGGK

Query:  VCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
        VCPNLKQSAVNEAEKPLAS VLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
Subjt:  VCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV

XP_023550287.1 uncharacterized protein LOC111808504 [Cucurbita pepo subsp. pepo]0.0e+0088.18Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS----GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSS
        MGSS QN NFAANYSL+VLK LGKSFQDGKTGA DSADTILRLDS     SVPCRS+SKGMKRKWSLVE SVG  G+SVGSSLSLGFVHSSSS DSKGSS
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS----GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSS

Query:  GTACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGET
         T CTRVSSAK+TDEESSM LDLDFSL+LGN+KVASPKEPANKSLKV K KPK DLELSLSTG  ESDVTSIYQ  PSLQLSMEKPLTF ETLN DDGET
Subjt:  GTACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGET

Query:  SCCWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRR
        SC WKPGTVQP + TSLNPQVGY+FHPVTEKV+PPANVPDLSSSVLT+PKSSVTCTSGIT+Q Q+RF+RSSNSKICQV+GCGKGARGASGRCISHGGGRR
Subjt:  SCCWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRR

Query:  CQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRR
        CQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTD+CIAHGGGRRCN EGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRR
Subjt:  CQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRR

Query:  CQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGG
        CQ PGCTKGAQGSTMYCKAHGGGKRCT+PGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGG
Subjt:  CQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGG

Query:  GKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLG-----PILNKTDKLKEVVREDYMN
        GKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGT+PCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLG     P L++ +K+K VVREDYMN
Subjt:  GKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLG-----PILNKTDKLKEVVREDYMN

Query:  EDLIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
        EDLIKVGGKV PNL   A +EAE+P     ++PEGRVHGGSLLAMLACSSGLGSSSRN VSGPE  +EP  MPRSWV
Subjt:  EDLIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV

XP_038876062.1 uncharacterized protein LOC120068383 [Benincasa hispida]0.0e+0089.78Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT
        MGSSFQN NFAANYSL+VLK LGKS+QDG  GAEDSADTILRLDS   SVPC SI KG+KRKWSLVE S+G  GQSVGSSLSLGFVHSSSS DSKGSS T
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT

Query:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC
        ACTRVSSAKETDEESSMALDLDFSL+LG++K ASPKEPA+KSLKVQK+KPK DLELSLSTGPSESDVTSIYQ FPSLQLSMEKPL F ETLNIDDGETSC
Subjt:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC

Query:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
        CWK G  QP MPTSLNPQ GY+F PVTE VIPPANVPDLSSSVLTMPKSSVTCTSGIT+QQQ R+NR +NSK+CQVEGCGKGARGASGRCISHGGGRRCQ
Subjt:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ

Query:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
        KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTD+CIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
Subjt:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ

Query:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
         PGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
Subjt:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK

Query:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLG-----PILNKTDKLKEVVREDYMNED
        RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLG     P L+KT+K+K VV EDYMNED
Subjt:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLG-----PILNKTDKLKEVVREDYMNED

Query:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
        LIKVGGK  PNL+  A +EAEK + +SVL+PEGRVHGGSLLAMLACSSGLGSSSRNAVSGP+QP+EPHIMPRSWV
Subjt:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV

TrEMBL top hitse value%identityAlignment
A0A0A0KKF7 Uncharacterized protein0.0e+0090.22Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT
        MGSSFQN NFAANYSL+V K LGKSFQDGKTGAEDSADTILRLDS   SVPC SIS GMKRKWSLVE S+G  GQSVGSSLSLGFVHSSSS DSKGSSGT
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT

Query:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC
        ACTRVSSAKETDEESSMALDLDFSL+LG+++VASPKEPA+K LKVQK+KPK DLELSLSTGPSESDVTSIYQ FPSLQLSMEKPLTF ET N DDGETSC
Subjt:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC

Query:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
        CWKPGT QP +PTSLNPQVGY+F PVTE +IPPANVPDLSSSVLTMPKSSVTCTSGIT  QQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
Subjt:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ

Query:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
        K GCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTD+CIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
Subjt:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ

Query:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
         PGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
Subjt:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK

Query:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPI-----LNKTDKLKEVVREDYMNED
        RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVS+GPI     L+KT+K+K +V EDYMNED
Subjt:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPI-----LNKTDKLKEVVREDYMNED

Query:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
        LIKVGGKV PNL+  A +EA+KP ++SVL PEGRVHGGSLLAMLACSSGLGS+SRNAVSGP+QP+E HIM RSWV
Subjt:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV

A0A1S3ATY8 uncharacterized protein LOC1034827980.0e+0091.11Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT
        MGSSFQN NFAANYSL+V K LGKSFQDGKTGAEDSADTILRLDS   SVPC SIS GMKRKWSLVE S+G  GQSVGSSLSLGFVHSSSS DSKGSSGT
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT

Query:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC
        ACTRVSSAKETDEESSMALDLDFSL+LGN++VASPKEPA+K LKVQK+KPK DLELSLSTGPSESDVTSIYQ FPSLQLSMEKPL F ET N DDGETSC
Subjt:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC

Query:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
        CWKPGT QP MPTSLNPQVGY+F PVTE VIPPANVPDLSSSVLTMPKSSVTCTSGIT  QQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
Subjt:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ

Query:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
        KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTD+CIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
Subjt:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ

Query:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
         PGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
Subjt:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK

Query:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPI-----LNKTDKLKEVVREDYMNED
        RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVS+GPI     L+KT+K+K +V EDYMNED
Subjt:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPI-----LNKTDKLKEVVREDYMNED

Query:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
        LIKVGGKV PNL+Q A +EA+KP ++SVLVPEGRVHGGSLLAMLACSSGLGS+SRNAVSGP+QP+E HIMPRSWV
Subjt:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV

A0A5D3C4G5 Putative WRKY transcription factor 190.0e+0091.11Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT
        MGSSFQN NFAANYSL+V K LGKSFQDGKTGAEDSADTILRLDS   SVPC SIS GMKRKWSLVE S+G  GQSVGSSLSLGFVHSSSS DSKGSSGT
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS--GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGT

Query:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC
        ACTRVSSAKETDEESSMALDLDFSL+LGN++VASPKEPA+K LKVQK+KPK DLELSLSTGPSESDVTSIYQ FPSLQLSMEKPL F ET N DDGETSC
Subjt:  ACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSC

Query:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
        CWKPGT QP MPTSLNPQVGY+F PVTE VIPPANVPDLSSSVLTMPKSSVTCTSGIT  QQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ
Subjt:  CWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQ

Query:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
        KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTD+CIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ
Subjt:  KPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ

Query:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
         PGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK
Subjt:  APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGK

Query:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPI-----LNKTDKLKEVVREDYMNED
        RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVS+GPI     L+KT+K+K +V EDYMNED
Subjt:  RCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPI-----LNKTDKLKEVVREDYMNED

Query:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
        LIKVGGKV PNL+Q A +EA+KP ++SVLVPEGRVHGGSLLAMLACSSGLGS+SRNAVSGP+QP+E HIMPRSWV
Subjt:  LIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV

A0A6J1CY22 uncharacterized protein LOC1110152650.0e+0099.55Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDSGSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGTAC
        MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDSGSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGTAC
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDSGSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGTAC

Query:  TRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSCCW
        TRVSSAKETDEESSMALDLDFSLHLGNE VASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTF ETLNIDDGETSCCW
Subjt:  TRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSCCW

Query:  KPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKP
        KPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKP
Subjt:  KPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKP

Query:  GCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAP
        GCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAP
Subjt:  GCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAP

Query:  GCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRC
        GCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRC
Subjt:  GCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRC

Query:  KFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPILNKTDKLKEVVREDYMNEDLIKVGGK
        KFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPILNKTDKLKEVVREDYMNEDLIKVGGK
Subjt:  KFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGPILNKTDKLKEVVREDYMNEDLIKVGGK

Query:  VCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
        VCPNLKQSAVNEAEKPLAS VLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
Subjt:  VCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV

A0A6J1ELM2 uncharacterized protein LOC1114356810.0e+0088.18Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS----GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSS
        MGSS QN NFAANYSL+VLK LGKSFQDGKTGA DSADTILRLDS     SVPCRSISKGMKRKWSLVE SVG  G+SVGSSLSLGFVHSSSS DSKGSS
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDS----GSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSS

Query:  GTACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGET
         T CTRVSSAK+TDEESSM LDLDFSL+LGN+KVASPKEPANKSLKV K KPK DLELSLSTG  ESDVTSIYQ  PSLQLSMEKPLTF ETLN DDGET
Subjt:  GTACTRVSSAKETDEESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGET

Query:  SCCWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRR
        SC WKPGTVQP + TSLNPQVGY+FHPVTEKV+PPANVPDLSSSVLT+PKSSVTCTSGIT+Q Q+RF+RSSNSKICQV+GCGKGARGASGRCISHGGGRR
Subjt:  SCCWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRR

Query:  CQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRR
        CQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTD+CIAHGGGRRCN EGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRR
Subjt:  CQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRR

Query:  CQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGG
        CQ PGCTKGAQGSTMYCKAHGGGKRCT+PGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGG
Subjt:  CQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGG

Query:  GKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLG-----PILNKTDKLKEVVREDYMN
        GKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGT+PCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLG     P L++ +K+K VVREDYMN
Subjt:  GKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLG-----PILNKTDKLKEVVREDYMN

Query:  EDLIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV
        EDLIKVGGKV PNL   A +EAE+    S ++PEGRVHGGSLLAMLACSSGL SSSRN VSGPE  +EP  MPRSWV
Subjt:  EDLIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEPHIMPRSWV

SwissProt top hitse value%identityAlignment
Q9SZ67 Probable WRKY transcription factor 193.2e-7158.47Show/hide
Query:  SSSVLTMPKS-----SVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAE
        SSS+ T+P        ++  SG+++Q     N SS+SK+CQVEGC KGAR ASGRCISHGGGRRCQKP C KGAEG+TVYCKAHGGGRRC++LGCTK AE
Subjt:  SSSVLTMPKS-----SVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAE

Query:  GRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAE
        G TDFCIAHGGGRRCN E CTR+A G++  C++HGGG RC+   C KSA G    C +HGGG++C    CT  A+G +  C  HGGGKRC    CTK AE
Subjt:  GRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAE

Query:  GSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVA
        G +  C  HGGG+RC   G   CTK   G   FC A
Subjt:  GSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVA

Arabidopsis top hitse value%identityAlignment
AT1G64140.1 BEST Arabidopsis thaliana protein match is: loricrin-related (TAIR:AT5G64550.1)1.4e-21562.33Show/hide
Query:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDSGSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGTAC
        M S FQN  FA N S +  K LG+S Q       ++ADT LRLDS + P  S +KG+KRKW+L++ +           LSL   HSSSS DSKGSS TAC
Subjt:  MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDSGSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGTAC

Query:  TRVSSAKETDEESSMALDLDFSLHLGNEK-VASPKEPANKSLK-VQKMKPKFDLELSLSTGPS-ESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETS
        T +SSA+ET+E SSM ++LDFSLHLGNEK  AS K+PAN  +K +Q   PKFDLELSLS G S +S++T++ Q     Q       + A+ L  ++ E++
Subjt:  TRVSSAKETDEESSMALDLDFSLHLGNEK-VASPKEPANKSLK-VQKMKPKFDLELSLSTGPS-ESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETS

Query:  CC-WKPGTVQPTMPTSLNPQV-GYVFHPVTEKVIPPANVPDLSS-SVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGG
         C W+ G   PT+  S + +   ++ H     +IP A+V +LSS +  T P SS TCTSG+++Q + +   SS+SK+CQVEGC KGARGASGRCISHGGG
Subjt:  CC-WKPGTVQPTMPTSLNPQV-GYVFHPVTEKVIPPANVPDLSS-SVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGG

Query:  RRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGG
        RRCQK GCHKGAEGRTVYCKAHGGGRRC+ LGCTKSAEGRTDFCIAHGGGRRC+ E CTRAARG+SGLCIRHGGGKRCQ+ENCTKSAEGLSGLCISHGGG
Subjt:  RRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGG

Query:  RRCQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRH
        RRCQ+ GCTKGAQGSTM+CKAHGGGKRCT  GCTKGAEGSTPFCKGHGGGKRC FQG   C+KSVHGGTNFCVAHGGGKRCAVPECTKSARGRTD+CVRH
Subjt:  RRCQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRH

Query:  GGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEY-GTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGP-------ILNKTDKLKEVVR
        GGGKRC+ EGCGKSAQGSTDFCKAHGGGKRC+WG P +EY G     PC SFARGK GLCALH+ LVQD RVHGG+++           ++T+  +E   
Subjt:  GGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEY-GTQPCCPCNSFARGKMGLCALHSGLVQDKRVHGGVSLGP-------ILNKTDKLKEVVR

Query:  EDYMNEDLIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSL-LAMLACSSGLGSS
           MN D +K             +NE E  L    L PEGRVHGGSL +AML    G GS+
Subjt:  EDYMNEDLIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSL-LAMLACSSGLGSS

AT4G12020.1 protein kinase family protein2.3e-7258.47Show/hide
Query:  SSSVLTMPKS-----SVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAE
        SSS+ T+P        ++  SG+++Q     N SS+SK+CQVEGC KGAR ASGRCISHGGGRRCQKP C KGAEG+TVYCKAHGGGRRC++LGCTK AE
Subjt:  SSSVLTMPKS-----SVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAE

Query:  GRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAE
        G TDFCIAHGGGRRCN E CTR+A G++  C++HGGG RC+   C KSA G    C +HGGG++C    CT  A+G +  C  HGGGKRC    CTK AE
Subjt:  GRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAE

Query:  GSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVA
        G +  C  HGGG+RC   G   CTK   G   FC A
Subjt:  GSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVA

AT5G09670.1 loricrin-related6.3e-13160.25Show/hide
Query:  RSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCN-REGCTRAARGKSGLC
        R+SN + C+  GC KGARGASG CISHGGG+RCQKPGC+KGAE +T +CK HGGG+RC+HLGCTKSAEG+TDFCI+HGGGRRC   EGC +AARG+SGLC
Subjt:  RSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCN-REGCTRAARGKSGLC

Query:  IRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ-APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGG
        I+HGGGKRC  E+CT+SAEG +GLCISHGGG+RCQ   GC KGAQGST YCKAHGGGKRC   GC+KGAEGSTP CK HGGGKRC   GGGIC+KSVHGG
Subjt:  IRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ-APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGG

Query:  TNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQD
        TNFCVAHGGGKRC V  CTKSARGRTD CV+HGGGKRCK   C KSAQGSTDFCKAHGGGKRCSWG             C  FARGK GLCA H+ ++  
Subjt:  TNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQD

Query:  KRVHGGVS--LGP---------------------ILNKTDKLKEVVREDYMNEDLIKVGGKV-CPNLKQSAVNEAE-KPLASSVLVPEGRVHGGSLLAML
        +   G  S  +GP                     + + TD ++ +  E+      + +  +V  P+  +S  N  E +      +VPE RVHGG L+  L
Subjt:  KRVHGGVS--LGP---------------------ILNKTDKLKEVVREDYMNEDLIKVGGKV-CPNLKQSAVNEAE-KPLASSVLVPEGRVHGGSLLAML

AT5G09670.2 loricrin-related6.3e-13160.25Show/hide
Query:  RSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCN-REGCTRAARGKSGLC
        R+SN + C+  GC KGARGASG CISHGGG+RCQKPGC+KGAE +T +CK HGGG+RC+HLGCTKSAEG+TDFCI+HGGGRRC   EGC +AARG+SGLC
Subjt:  RSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCN-REGCTRAARGKSGLC

Query:  IRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ-APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGG
        I+HGGGKRC  E+CT+SAEG +GLCISHGGG+RCQ   GC KGAQGST YCKAHGGGKRC   GC+KGAEGSTP CK HGGGKRC   GGGIC+KSVHGG
Subjt:  IRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQ-APGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGG

Query:  TNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQD
        TNFCVAHGGGKRC V  CTKSARGRTD CV+HGGGKRCK   C KSAQGSTDFCKAHGGGKRCSWG             C  FARGK GLCA H+ ++  
Subjt:  TNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQD

Query:  KRVHGGVS--LGP---------------------ILNKTDKLKEVVREDYMNEDLIKVGGKV-CPNLKQSAVNEAE-KPLASSVLVPEGRVHGGSLLAML
        +   G  S  +GP                     + + TD ++ +  E+      + +  +V  P+  +S  N  E +      +VPE RVHGG L+  L
Subjt:  KRVHGGVS--LGP---------------------ILNKTDKLKEVVREDYMNEDLIKVGGKV-CPNLKQSAVNEAE-KPLASSVLVPEGRVHGGSLLAML

AT5G64550.1 loricrin-related3.2e-13562.78Show/hide
Query:  LTFAET----LNIDDGETSCCWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLT---------MPKSSVTCTSGITEQQQQRFNRSSNS
        LT+A+T      +D+G TS   + G   P++          +F P TE V  P+ + + S++  T           + SV+  S  +        R SN 
Subjt:  LTFAET----LNIDDGETSCCWKPGTVQPTMPTSLNPQVGYVFHPVTEKVIPPANVPDLSSSVLT---------MPKSSVTCTSGITEQQQQRFNRSSNS

Query:  KICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCN-REGCTRAARGKSGLCIRHGG
        K C+  GC KGARGASG CI HGGG+RCQK GC+KGAE +T +CKAHGGG+RCQHLGCTKSAEG+TD CI+HGGGRRC   EGC +AARGKSGLCI+HGG
Subjt:  KICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKSAEGRTDFCIAHGGGRRCN-REGCTRAARGKSGLCIRHGG

Query:  GKRCQKENCTKSAEGLSGLCISHGGGRRCQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVA
        GKRC+ E+CT+SAEG +GLCISHGGGRRCQ+ GCTKGAQGST YCKAHGGGKRC   GCTKGAEGSTP CK HGGGKRC F GGGIC KSVHGGT+FCVA
Subjt:  GKRCQKENCTKSAEGLSGLCISHGGGRRCQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVA

Query:  HGGGKRCAVPECTKSARGRTDYCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDK
        HGGGKRC V  CTKSARGRTD CV+HGGGKRCK +GC KSAQGSTDFCKAHGGGKRCSWG             C  FARGK GLCA H+ + QDK
Subjt:  HGGGKRCAVPECTKSARGRTDYCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKMGLCALHSGLVQDK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTCAAGCTTCCAGAACTTCAACTTTGCTGCCAATTATTCTTTGGATGTACTAAAGTTTTTGGGAAAGTCATTCCAAGATGGGAAAACTGGAGCTGAAGACAGTGC
AGATACCATTCTGCGACTTGATTCGGGATCAGTACCCTGTAGGTCCATATCAAAAGGAATGAAACGGAAGTGGAGTTTGGTTGAGAATTCCGTGGGTGGTCCTGGTCAGT
CAGTTGGGTCTTCTTTGTCTCTTGGGTTTGTACATTCCTCAAGCTCTATTGACAGTAAGGGGAGTTCAGGAACTGCATGCACCAGAGTGTCTTCTGCTAAAGAAACTGAT
GAGGAATCCTCAATGGCACTCGACCTGGATTTTTCTCTCCATCTTGGTAACGAGAAGGTAGCTAGCCCTAAAGAACCAGCCAATAAATCACTTAAAGTACAGAAAATGAA
ACCGAAGTTTGACTTGGAGTTAAGTCTCTCAACTGGGCCATCTGAATCTGATGTCACTTCAATTTATCAAAGTTTTCCCTCACTTCAGTTGTCAATGGAGAAACCATTGA
CATTTGCGGAGACCTTAAACATAGATGATGGAGAAACATCATGTTGTTGGAAGCCTGGAACTGTCCAGCCAACCATGCCCACCTCGTTGAATCCTCAGGTTGGCTACGTT
TTTCACCCAGTTACTGAGAAAGTAATTCCACCTGCCAATGTTCCAGACCTTTCCTCGAGTGTTTTAACCATGCCAAAAAGCTCAGTCACCTGTACTTCCGGGATAACAGA
ACAGCAGCAACAACGATTTAATCGTAGTAGTAATTCCAAGATATGTCAAGTTGAGGGATGTGGCAAGGGAGCCAGAGGTGCTTCTGGCCGTTGTATATCACATGGTGGTG
GTCGGAGGTGTCAAAAACCTGGATGCCACAAAGGAGCCGAGGGTCGCACTGTATATTGCAAGGCCCATGGAGGTGGACGGCGATGTCAACATTTGGGGTGCACCAAGAGT
GCAGAAGGGCGCACCGACTTCTGTATTGCCCATGGTGGGGGTAGAAGATGCAACCGTGAAGGATGCACCCGTGCAGCGAGAGGTAAATCAGGATTATGCATTCGTCATGG
GGGCGGAAAGAGATGTCAAAAGGAGAACTGCACCAAGAGTGCAGAAGGCCTCTCTGGTCTTTGTATCTCTCATGGTGGTGGTAGGCGATGCCAGGCGCCTGGGTGCACTA
AAGGGGCCCAAGGGAGCACCATGTACTGCAAGGCTCATGGTGGGGGAAAAAGGTGCACGGCTCCTGGTTGTACCAAAGGTGCTGAAGGGAGCACCCCATTTTGCAAGGGC
CACGGTGGTGGGAAAAGGTGTGGATTCCAAGGGGGTGGAATTTGTACAAAGAGTGTTCATGGGGGTACGAACTTCTGTGTGGCACATGGTGGGGGTAAGAGATGCGCTGT
TCCTGAGTGCACGAAAAGTGCTCGGGGAAGGACTGACTATTGCGTGCGTCACGGTGGGGGAAAGAGATGCAAGTTCGAAGGGTGTGGCAAAAGTGCACAAGGAAGCACAG
ATTTTTGCAAAGCGCATGGTGGAGGAAAGAGATGCTCTTGGGGTCATCCTGGGTCTGAGTATGGCACCCAACCCTGTTGCCCTTGCAATTCATTCGCCAGGGGGAAGATG
GGTCTCTGTGCACTTCATAGTGGTCTGGTGCAAGATAAGAGGGTTCATGGGGGTGTTTCACTCGGACCTATTCTCAATAAGACAGATAAGTTGAAGGAGGTGGTCAGGGA
AGATTATATGAACGAAGATCTCATCAAGGTAGGAGGCAAGGTTTGTCCAAATCTCAAACAATCTGCAGTCAATGAAGCAGAGAAACCATTGGCATCATCGGTTCTCGTTC
CCGAAGGTAGGGTGCACGGTGGAAGTCTATTAGCAATGCTTGCTTGCAGCTCGGGGCTGGGGTCAAGCAGTAGAAATGCAGTTTCTGGTCCTGAGCAGCCACTGGAACCT
CACATCATGCCTCGTAGTTGGGTG
mRNA sequenceShow/hide mRNA sequence
ATGGGCTCAAGCTTCCAGAACTTCAACTTTGCTGCCAATTATTCTTTGGATGTACTAAAGTTTTTGGGAAAGTCATTCCAAGATGGGAAAACTGGAGCTGAAGACAGTGC
AGATACCATTCTGCGACTTGATTCGGGATCAGTACCCTGTAGGTCCATATCAAAAGGAATGAAACGGAAGTGGAGTTTGGTTGAGAATTCCGTGGGTGGTCCTGGTCAGT
CAGTTGGGTCTTCTTTGTCTCTTGGGTTTGTACATTCCTCAAGCTCTATTGACAGTAAGGGGAGTTCAGGAACTGCATGCACCAGAGTGTCTTCTGCTAAAGAAACTGAT
GAGGAATCCTCAATGGCACTCGACCTGGATTTTTCTCTCCATCTTGGTAACGAGAAGGTAGCTAGCCCTAAAGAACCAGCCAATAAATCACTTAAAGTACAGAAAATGAA
ACCGAAGTTTGACTTGGAGTTAAGTCTCTCAACTGGGCCATCTGAATCTGATGTCACTTCAATTTATCAAAGTTTTCCCTCACTTCAGTTGTCAATGGAGAAACCATTGA
CATTTGCGGAGACCTTAAACATAGATGATGGAGAAACATCATGTTGTTGGAAGCCTGGAACTGTCCAGCCAACCATGCCCACCTCGTTGAATCCTCAGGTTGGCTACGTT
TTTCACCCAGTTACTGAGAAAGTAATTCCACCTGCCAATGTTCCAGACCTTTCCTCGAGTGTTTTAACCATGCCAAAAAGCTCAGTCACCTGTACTTCCGGGATAACAGA
ACAGCAGCAACAACGATTTAATCGTAGTAGTAATTCCAAGATATGTCAAGTTGAGGGATGTGGCAAGGGAGCCAGAGGTGCTTCTGGCCGTTGTATATCACATGGTGGTG
GTCGGAGGTGTCAAAAACCTGGATGCCACAAAGGAGCCGAGGGTCGCACTGTATATTGCAAGGCCCATGGAGGTGGACGGCGATGTCAACATTTGGGGTGCACCAAGAGT
GCAGAAGGGCGCACCGACTTCTGTATTGCCCATGGTGGGGGTAGAAGATGCAACCGTGAAGGATGCACCCGTGCAGCGAGAGGTAAATCAGGATTATGCATTCGTCATGG
GGGCGGAAAGAGATGTCAAAAGGAGAACTGCACCAAGAGTGCAGAAGGCCTCTCTGGTCTTTGTATCTCTCATGGTGGTGGTAGGCGATGCCAGGCGCCTGGGTGCACTA
AAGGGGCCCAAGGGAGCACCATGTACTGCAAGGCTCATGGTGGGGGAAAAAGGTGCACGGCTCCTGGTTGTACCAAAGGTGCTGAAGGGAGCACCCCATTTTGCAAGGGC
CACGGTGGTGGGAAAAGGTGTGGATTCCAAGGGGGTGGAATTTGTACAAAGAGTGTTCATGGGGGTACGAACTTCTGTGTGGCACATGGTGGGGGTAAGAGATGCGCTGT
TCCTGAGTGCACGAAAAGTGCTCGGGGAAGGACTGACTATTGCGTGCGTCACGGTGGGGGAAAGAGATGCAAGTTCGAAGGGTGTGGCAAAAGTGCACAAGGAAGCACAG
ATTTTTGCAAAGCGCATGGTGGAGGAAAGAGATGCTCTTGGGGTCATCCTGGGTCTGAGTATGGCACCCAACCCTGTTGCCCTTGCAATTCATTCGCCAGGGGGAAGATG
GGTCTCTGTGCACTTCATAGTGGTCTGGTGCAAGATAAGAGGGTTCATGGGGGTGTTTCACTCGGACCTATTCTCAATAAGACAGATAAGTTGAAGGAGGTGGTCAGGGA
AGATTATATGAACGAAGATCTCATCAAGGTAGGAGGCAAGGTTTGTCCAAATCTCAAACAATCTGCAGTCAATGAAGCAGAGAAACCATTGGCATCATCGGTTCTCGTTC
CCGAAGGTAGGGTGCACGGTGGAAGTCTATTAGCAATGCTTGCTTGCAGCTCGGGGCTGGGGTCAAGCAGTAGAAATGCAGTTTCTGGTCCTGAGCAGCCACTGGAACCT
CACATCATGCCTCGTAGTTGGGTG
Protein sequenceShow/hide protein sequence
MGSSFQNFNFAANYSLDVLKFLGKSFQDGKTGAEDSADTILRLDSGSVPCRSISKGMKRKWSLVENSVGGPGQSVGSSLSLGFVHSSSSIDSKGSSGTACTRVSSAKETD
EESSMALDLDFSLHLGNEKVASPKEPANKSLKVQKMKPKFDLELSLSTGPSESDVTSIYQSFPSLQLSMEKPLTFAETLNIDDGETSCCWKPGTVQPTMPTSLNPQVGYV
FHPVTEKVIPPANVPDLSSSVLTMPKSSVTCTSGITEQQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGRTVYCKAHGGGRRCQHLGCTKS
AEGRTDFCIAHGGGRRCNREGCTRAARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQAPGCTKGAQGSTMYCKAHGGGKRCTAPGCTKGAEGSTPFCKG
HGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTDYCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFARGKM
GLCALHSGLVQDKRVHGGVSLGPILNKTDKLKEVVREDYMNEDLIKVGGKVCPNLKQSAVNEAEKPLASSVLVPEGRVHGGSLLAMLACSSGLGSSSRNAVSGPEQPLEP
HIMPRSWV