; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012466 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012466
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein CHUP1, chloroplastic
Genome locationChr01:21472544..21476485
RNA-Seq ExpressionHG10012466
SyntenyHG10012466
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34042.1 hydroxyproline-rich glycoprotein family protein [Cucumis melo subsp. melo]3.8e-23193.12Show/hide
Query:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE
        MSSRGGRVSLKAMESPKRVVSVS VES+PQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQ+SHRRS S  G C NVEDCNGVKSGLQE
Subjt:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE

Query:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV
        KL FAEDLIKDLQSQLVELKEEL+KSQSLNLELQSQNDLLVRDLAAAEAKFA+ASNND+R+SV+E SQR  EDNQKLENGKLET P SS RN RDL+CK 
Subjt:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV

Query:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT
        PPPRA   PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLA+KADIETKGEFINGLIDKVLVAAHT
Subjt:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT

Query:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ
        DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALK LENEISFYKDDTNSPCE ALKKMASLLDKSER IQRLITLRSTVMHSYQ
Subjt:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ

Query:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQ
        DLKLPT+WMLDSGIMSKIKQASMNLAKMYMKRVKTELDS+RSSDKESNHESLLLQGIHFAYRTHQ
Subjt:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQ

KAA0052149.1 protein CHUP1 [Cucumis melo var. makuwa]3.5e-25392.48Show/hide
Query:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE
        MSSRGGRVSLKAMESPKRVVSVS VES+PQSG+KKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQ+SHRRS S  G C NVEDCNGVKSGLQE
Subjt:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE

Query:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV
        KL FAEDLIKDLQSQLVELKEEL+KSQSLNLELQSQNDLLVRDLAAAEAKFA+ASNND+R+SV+E SQR  EDNQKLENGKLET P SS RN RDL+CK 
Subjt:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV

Query:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT
        PPPRA   PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLA+KADIETKGEFINGLIDKVLVAAHT
Subjt:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT

Query:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ
        DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALK LENEISFYKDDTNSPCE ALKKMASLLDKSER IQRLITLRSTVMHSYQ
Subjt:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ

Query:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIV
        DLKLPT+WMLDSGIMSKIKQASMNLAKMYMKRVKTELDS+RSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQ+LG+SH+QG IV
Subjt:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIV

Query:  GIPSS
        GI SS
Subjt:  GIPSS

XP_004147632.1 protein CHUP1, chloroplastic [Cucumis sativus]2.3e-25291.5Show/hide
Query:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE
        MSSRGGRVSLKAMESPKRVVSVSAVES+PQSGVKKQSS+VSRSLTPN PKKGRDGENVGVSARTVNRGGLKQ+ HRRS SG G C NVEDCNGVKSGLQE
Subjt:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE

Query:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV
        KLCFAEDLIKDLQSQLVELKEEL KSQSLN ELQSQNDLLVRDLAAAEAKFA+ SNND+R+SV+E SQR+ EDNQKLENGKLET P SS RN RDL+CK 
Subjt:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV

Query:  PPPRA----PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAH
        PPPRA    PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLA+KADIETKGEFINGLIDKVLVAAH
Subjt:  PPPRA----PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAH

Query:  TDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSY
        TDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALK LENEISFYKDDTNSPCE ALKKMASLLDKSER IQRLITLRSTVMHSY
Subjt:  TDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSY

Query:  QDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWI
        Q+LKLPT+WMLDSGIMSKIKQASMNLAKMYMKRVKTELDS+RSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPR+++G+SH+QG I
Subjt:  QDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWI

Query:  VGIPSS
        VGI SS
Subjt:  VGIPSS

XP_008439003.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]1.6e-25392.67Show/hide
Query:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE
        MSSRGGRVSLKAMESPKRVVSVS VES+PQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQ+SHRRS S  G C NVEDCNGVKSGLQE
Subjt:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE

Query:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV
        KL FAEDLIKDLQSQLVELKEEL+KSQSLNLELQSQNDLLVRDLAAAEAKFA+ASNND+R+SV+E SQR  EDNQKLENGKLET P SS RN RDL+CK 
Subjt:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV

Query:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT
        PPPRA   PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLA+KADIETKGEFINGLIDKVLVAAHT
Subjt:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT

Query:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ
        DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALK LENEISFYKDDTNSPCE ALKKMASLLDKSER IQRLITLRSTVMHSYQ
Subjt:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ

Query:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIV
        DLKLPT+WMLDSGIMSKIKQASMNLAKMYMKRVKTELDS+RSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQ+LG+SH+QG IV
Subjt:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIV

Query:  GIPSS
        GI SS
Subjt:  GIPSS

XP_038902728.1 protein CHUP1, chloroplastic [Benincasa hispida]8.6e-25292.06Show/hide
Query:  SSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQEK
        SSRGGRVSLKAMESPKR+VSVSAVES+PQSGVKKQSS+V RSLTP APKKGRDGENVGV ARTVNRGGLKQ+SHRRS SGTGPCANVEDCNGVKSGLQEK
Subjt:  SSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQEK

Query:  LCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKVP
        LCFAEDLIKDLQSQLV LKEELQKSQSLN+ELQS NDLLVRDLAAAEAK A+ SNNDQRESVAE SQRN EDNQKL NGKLET P SS R+ RDLECK P
Subjt:  LCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKVP

Query:  PPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHTD
        PPRA   PPPPPLPVQSMPR AATQKSPDLVR+FHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLA+KADIETKGEFINGLIDKVLVAAHTD
Subjt:  PPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHTD

Query:  IEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQD
        IED+LKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALK LENEISFYKDDT+SPCE ALKKMASLLDKSER IQRLI LRST MHSYQD
Subjt:  IEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQD

Query:  LKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIVG
        LKLPT+WMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESN ESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLG+SHSQG IVG
Subjt:  LKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIVG

Query:  IPSS
        IPSS
Subjt:  IPSS

TrEMBL top hitse value%identityAlignment
A0A0A0L5G9 Uncharacterized protein1.1e-25291.5Show/hide
Query:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE
        MSSRGGRVSLKAMESPKRVVSVSAVES+PQSGVKKQSS+VSRSLTPN PKKGRDGENVGVSARTVNRGGLKQ+ HRRS SG G C NVEDCNGVKSGLQE
Subjt:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE

Query:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV
        KLCFAEDLIKDLQSQLVELKEEL KSQSLN ELQSQNDLLVRDLAAAEAKFA+ SNND+R+SV+E SQR+ EDNQKLENGKLET P SS RN RDL+CK 
Subjt:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV

Query:  PPPRA----PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAH
        PPPRA    PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLA+KADIETKGEFINGLIDKVLVAAH
Subjt:  PPPRA----PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAH

Query:  TDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSY
        TDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALK LENEISFYKDDTNSPCE ALKKMASLLDKSER IQRLITLRSTVMHSY
Subjt:  TDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSY

Query:  QDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWI
        Q+LKLPT+WMLDSGIMSKIKQASMNLAKMYMKRVKTELDS+RSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPR+++G+SH+QG I
Subjt:  QDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWI

Query:  VGIPSS
        VGI SS
Subjt:  VGIPSS

A0A1S4DT91 protein CHUP1, chloroplastic7.6e-25492.67Show/hide
Query:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE
        MSSRGGRVSLKAMESPKRVVSVS VES+PQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQ+SHRRS S  G C NVEDCNGVKSGLQE
Subjt:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE

Query:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV
        KL FAEDLIKDLQSQLVELKEEL+KSQSLNLELQSQNDLLVRDLAAAEAKFA+ASNND+R+SV+E SQR  EDNQKLENGKLET P SS RN RDL+CK 
Subjt:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV

Query:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT
        PPPRA   PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLA+KADIETKGEFINGLIDKVLVAAHT
Subjt:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT

Query:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ
        DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALK LENEISFYKDDTNSPCE ALKKMASLLDKSER IQRLITLRSTVMHSYQ
Subjt:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ

Query:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIV
        DLKLPT+WMLDSGIMSKIKQASMNLAKMYMKRVKTELDS+RSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQ+LG+SH+QG IV
Subjt:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIV

Query:  GIPSS
        GI SS
Subjt:  GIPSS

A0A5D3C1G2 Protein CHUP11.7e-25392.48Show/hide
Query:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE
        MSSRGGRVSLKAMESPKRVVSVS VES+PQSG+KKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQ+SHRRS S  G C NVEDCNGVKSGLQE
Subjt:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE

Query:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV
        KL FAEDLIKDLQSQLVELKEEL+KSQSLNLELQSQNDLLVRDLAAAEAKFA+ASNND+R+SV+E SQR  EDNQKLENGKLET P SS RN RDL+CK 
Subjt:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV

Query:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT
        PPPRA   PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLA+KADIETKGEFINGLIDKVLVAAHT
Subjt:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT

Query:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ
        DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALK LENEISFYKDDTNSPCE ALKKMASLLDKSER IQRLITLRSTVMHSYQ
Subjt:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ

Query:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIV
        DLKLPT+WMLDSGIMSKIKQASMNLAKMYMKRVKTELDS+RSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQ+LG+SH+QG IV
Subjt:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIV

Query:  GIPSS
        GI SS
Subjt:  GIPSS

A0A6J1G7Z6 protein CHUP1, chloroplastic8.2e-22484.21Show/hide
Query:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE
        MSSRGGRVS KAMESPKR+VSVSAV+S+PQS VKKQSSRVSRSLTPNAPKKGRDGENVGVSAR VNRGGLKQ S RR       C+NVEDCNGVKS LQ+
Subjt:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE

Query:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV
        KLCF EDLIKDLQSQLV LKEELQKSQSLNLELQS+NDLLVRDLAAAEAK ANASNNDQ  SV        E NQKLENGKL+  P +S RN +D E K 
Subjt:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV

Query:  -----PPPR-----APPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDK
             PPPR      PPPPPLPV+S+PR  A+QKSPDLVRLFHSL+KKEGKR PPLLGKPAAINAHNSIVGEIQNRSAHLLA+KADIETKGEFINGLIDK
Subjt:  -----PPPR-----APPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDK

Query:  VLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRS
        VLVAA+TDIED+LKFVDWLD QLSSLADERAVLKHFKWPEKKADAMREAAIEYRALK LENEIS YKDDTNSPCE ALKKMASLLDKSER IQRLI LR+
Subjt:  VLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRS

Query:  TVMHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKS
        TVMHSYQDLKLPT+WMLDSGI SKIKQASMNLAKMYMKRVKTEL+SIRSSDKESN ESLLLQG+HF YRTHQFAGGLDSETLCAFEEIKQWVPRQV+G S
Subjt:  TVMHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKS

Query:  HS-QGWIVGIPSS
        HS QGWIVGIPSS
Subjt:  HS-QGWIVGIPSS

E5GC44 Hydroxyproline-rich glycoprotein family protein1.8e-23193.12Show/hide
Query:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE
        MSSRGGRVSLKAMESPKRVVSVS VES+PQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQ+SHRRS S  G C NVEDCNGVKSGLQE
Subjt:  MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQE

Query:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV
        KL FAEDLIKDLQSQLVELKEEL+KSQSLNLELQSQNDLLVRDLAAAEAKFA+ASNND+R+SV+E SQR  EDNQKLENGKLET P SS RN RDL+CK 
Subjt:  KLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKV

Query:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT
        PPPRA   PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLA+KADIETKGEFINGLIDKVLVAAHT
Subjt:  PPPRA---PPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHT

Query:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ
        DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALK LENEISFYKDDTNSPCE ALKKMASLLDKSER IQRLITLRSTVMHSYQ
Subjt:  DIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQ

Query:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQ
        DLKLPT+WMLDSGIMSKIKQASMNLAKMYMKRVKTELDS+RSSDKESNHESLLLQGIHFAYRTHQ
Subjt:  DLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQ

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic1.7e-7249.83Show/hide
Query:  PPPRAPPPPPLPVQSMPRAAA----TQKSPDLVRLFHSLRKKEGKRD--PPLL--GKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVL
        PP   PPPPP P  ++ R A       ++P+LV  + SL K+E K++  P L+  G   +  A N+++GEI+NRS  LLAVKAD+ET+G+F+  L  +V 
Subjt:  PPPRAPPPPPLPVQSMPRAAA----TQKSPDLVRLFHSLRKKEGKRD--PPLL--GKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVL

Query:  VAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTV
         ++ TDIED+L FV WLD +LS L DERAVLKHF WPE KADA+REAA EY+ L +LE +++ + DD N  CE ALKKM  LL+K E+++  L+  R   
Subjt:  VAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTV

Query:  MHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK
        +  Y++  +P  W+ D+G++ KIK +S+ LAK YMKRV  ELDS+  SDK+ N E LLLQG+ FA+R HQFAGG D+E++ AFEE++
Subjt:  MHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein1.2e-10549.25Show/hide
Query:  ARTVNRGGL--------KQISHRRSFSGTGPCANVEDCNGVKSGLQEKLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFAN
        AR+VNR  +        + IS +   +     A  ++       L+EKL   E LIKDLQ Q++ LK EL+++++ N+EL+  N  L +DL +AEAK ++
Subjt:  ARTVNRGGL--------KQISHRRSFSGTGPCANVEDCNGVKSGLQEKLCFAEDLIKDLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFAN

Query:  ASNND--------------QRESVAEGSQRNVEDNQKLENGKLETHPPSSYR------------------NARDLECK--VPPPRAPPPPPLPVQSMPRA
         S+ND              QR   ++  Q  V+    +E+ +L    PS  R                    RD       PP   PPPPP P + + +A
Subjt:  ASNND--------------QRESVAEGSQRNVEDNQKLENGKLETHPPSSYR------------------NARDLECK--VPPPRAPPPPPLPVQSMPRA

Query:  AATQKSPDLVRLFHSLRKKEGKRD--PPLLGKPAAIN-AHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSL
        A  QKSP + +LF  L K++  R+    + G  + +N AHNSIVGEIQNRSAHL+A+KADIETKGEFIN LI KVL    +D+ED++KFVDWLD +L++L
Subjt:  AATQKSPDLVRLFHSLRKKEGKRD--PPLLGKPAAIN-AHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSL

Query:  ADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQDLKLPTSWMLDSGIMSKIK
        ADERAVLKHFKWPEKKAD ++EAA+EYR LK+LE E+S Y DD N     ALKKMA+LLDKSE+ I+RL+ LR + M SYQD K+P  WMLDSG++ KIK
Subjt:  ADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQDLKLPTSWMLDSGIMSKIK

Query:  QASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVP
        +AS+ LAK YM RV  EL S R+ D+ES  E+LLLQG+ FAYRTHQFAGGLD ETLCA EEIKQ VP
Subjt:  QASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVP

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein1.2e-7349.83Show/hide
Query:  PPPRAPPPPPLPVQSMPRAAA----TQKSPDLVRLFHSLRKKEGKRD--PPLL--GKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVL
        PP   PPPPP P  ++ R A       ++P+LV  + SL K+E K++  P L+  G   +  A N+++GEI+NRS  LLAVKAD+ET+G+F+  L  +V 
Subjt:  PPPRAPPPPPLPVQSMPRAAA----TQKSPDLVRLFHSLRKKEGKRD--PPLL--GKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVL

Query:  VAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTV
         ++ TDIED+L FV WLD +LS L DERAVLKHF WPE KADA+REAA EY+ L +LE +++ + DD N  CE ALKKM  LL+K E+++  L+  R   
Subjt:  VAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTV

Query:  MHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK
        +  Y++  +P  W+ D+G++ KIK +S+ LAK YMKRV  ELDS+  SDK+ N E LLLQG+ FA+R HQFAGG D+E++ AFEE++
Subjt:  MHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein1.2e-7349.83Show/hide
Query:  PPPRAPPPPPLPVQSMPRAAA----TQKSPDLVRLFHSLRKKEGKRD--PPLL--GKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVL
        PP   PPPPP P  ++ R A       ++P+LV  + SL K+E K++  P L+  G   +  A N+++GEI+NRS  LLAVKAD+ET+G+F+  L  +V 
Subjt:  PPPRAPPPPPLPVQSMPRAAA----TQKSPDLVRLFHSLRKKEGKRD--PPLL--GKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVL

Query:  VAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTV
         ++ TDIED+L FV WLD +LS L DERAVLKHF WPE KADA+REAA EY+ L +LE +++ + DD N  CE ALKKM  LL+K E+++  L+  R   
Subjt:  VAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTV

Query:  MHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK
        +  Y++  +P  W+ D+G++ KIK +S+ LAK YMKRV  ELDS+  SDK+ N E LLLQG+ FA+R HQFAGG D+E++ AFEE++
Subjt:  MHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein1.2e-7349.83Show/hide
Query:  PPPRAPPPPPLPVQSMPRAAA----TQKSPDLVRLFHSLRKKEGKRD--PPLL--GKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVL
        PP   PPPPP P  ++ R A       ++P+LV  + SL K+E K++  P L+  G   +  A N+++GEI+NRS  LLAVKAD+ET+G+F+  L  +V 
Subjt:  PPPRAPPPPPLPVQSMPRAAA----TQKSPDLVRLFHSLRKKEGKRD--PPLL--GKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVL

Query:  VAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTV
         ++ TDIED+L FV WLD +LS L DERAVLKHF WPE KADA+REAA EY+ L +LE +++ + DD N  CE ALKKM  LL+K E+++  L+  R   
Subjt:  VAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTV

Query:  MHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK
        +  Y++  +P  W+ D+G++ KIK +S+ LAK YMKRV  ELDS+  SDK+ N E LLLQG+ FA+R HQFAGG D+E++ AFEE++
Subjt:  MHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-6346.71Show/hide
Query:  KVPPPRAPPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKE---GKRDPPLLGKPA--AINAHNS---IVGEIQNRSAHLLAVKADIETKGEFINGLIDK
        K PPP  PPPPP P      +A  ++ P++V  +HSL +++    +RD    G  A  AI A+++   ++GEI+NRS +LLA+K D+ET+G+FI  LI +
Subjt:  KVPPPRAPPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKE---GKRDPPLLGKPA--AINAHNS---IVGEIQNRSAHLLAVKADIETKGEFINGLIDK

Query:  VLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRS
        V  AA +DIED++ FV WLD +LS L DERAVLKHF+WPE+KADA+REAA  Y  LK+L +E S +++D      +ALKKM +L +K E  +  L  +R 
Subjt:  VLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRS

Query:  TVMHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK
        +    ++  ++P  WML++GI S+IK AS+ LA  YMKRV  EL++I     E   E L++QG+ FA+R HQFAGG D+ET+ AFEE++
Subjt:  TVMHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCTCGCGGCGGAAGGGTTTCTTTGAAGGCTATGGAGTCGCCGAAGCGGGTGGTTTCTGTATCGGCAGTTGAATCGTCGCCTCAGTCTGGTGTGAAGAAGCAAAG
TTCGAGAGTTAGCAGATCTCTGACGCCGAATGCTCCGAAGAAGGGGAGGGATGGCGAGAATGTTGGAGTTTCGGCTCGAACGGTCAACCGTGGTGGTCTTAAGCAAATTT
CCCACCGGCGTTCTTTTTCTGGTACTGGTCCGTGTGCGAATGTTGAGGATTGTAATGGAGTTAAGAGTGGATTGCAGGAGAAGCTTTGTTTTGCGGAGGATTTGATTAAA
GATTTGCAGTCTCAATTGGTGGAGTTGAAGGAGGAGTTGCAGAAGTCTCAGAGCTTGAACCTAGAACTTCAATCGCAGAACGATTTGCTCGTTCGTGACCTAGCCGCTGC
TGAAGCGAAGTTCGCTAATGCTAGCAATAACGACCAGAGGGAGTCAGTTGCAGAGGGCTCGCAACGAAACGTCGAGGACAATCAGAAACTCGAAAATGGAAAGTTGGAGA
CCCATCCACCAAGCTCGTATCGGAATGCTAGAGATTTGGAATGCAAGGTGCCACCACCACGGGCGCCGCCGCCGCCGCCTCTTCCTGTCCAGTCCATGCCCCGAGCAGCG
GCCACACAGAAATCTCCAGACCTTGTACGCCTCTTTCACTCGTTAAGAAAGAAAGAGGGGAAGAGAGATCCTCCATTGTTGGGGAAACCAGCTGCGATCAATGCGCATAA
TAGCATTGTTGGGGAAATTCAGAACCGTTCTGCGCATCTTTTAGCGGTAAAAGCAGACATTGAAACCAAAGGAGAGTTCATCAATGGCCTCATTGACAAGGTGCTTGTTG
CAGCTCATACGGACATAGAGGATATCCTCAAGTTTGTCGATTGGCTTGATTCCCAACTTTCATCATTGGCTGATGAGCGAGCTGTGTTAAAGCATTTCAAGTGGCCTGAG
AAAAAAGCTGATGCCATGCGAGAAGCTGCCATAGAATACCGTGCACTCAAACGGTTGGAAAATGAGATCTCTTTTTACAAGGATGATACTAATTCTCCATGTGAGACAGC
CTTGAAGAAGATGGCGAGCTTGTTAGACAAGTCGGAGCGAGCCATACAACGGTTAATCACACTTCGGAGTACTGTCATGCATTCTTATCAAGACCTGAAACTCCCTACAA
GTTGGATGCTAGACTCCGGAATCATGAGTAAAATAAAGCAAGCTTCTATGAATCTAGCCAAGATGTACATGAAAAGGGTGAAAACGGAGCTGGATTCGATTCGTAGTTCG
GATAAAGAATCCAATCATGAATCTCTTCTACTTCAGGGAATTCATTTCGCATACAGAACTCACCAGTTTGCTGGAGGACTTGATTCAGAAACATTATGTGCTTTTGAGGA
AATAAAACAATGGGTTCCAAGACAAGTACTGGGAAAATCTCATTCTCAAGGATGGATAGTTGGCATACCATCATCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCATCTCGCGGCGGAAGGGTTTCTTTGAAGGCTATGGAGTCGCCGAAGCGGGTGGTTTCTGTATCGGCAGTTGAATCGTCGCCTCAGTCTGGTGTGAAGAAGCAAAG
TTCGAGAGTTAGCAGATCTCTGACGCCGAATGCTCCGAAGAAGGGGAGGGATGGCGAGAATGTTGGAGTTTCGGCTCGAACGGTCAACCGTGGTGGTCTTAAGCAAATTT
CCCACCGGCGTTCTTTTTCTGGTACTGGTCCGTGTGCGAATGTTGAGGATTGTAATGGAGTTAAGAGTGGATTGCAGGAGAAGCTTTGTTTTGCGGAGGATTTGATTAAA
GATTTGCAGTCTCAATTGGTGGAGTTGAAGGAGGAGTTGCAGAAGTCTCAGAGCTTGAACCTAGAACTTCAATCGCAGAACGATTTGCTCGTTCGTGACCTAGCCGCTGC
TGAAGCGAAGTTCGCTAATGCTAGCAATAACGACCAGAGGGAGTCAGTTGCAGAGGGCTCGCAACGAAACGTCGAGGACAATCAGAAACTCGAAAATGGAAAGTTGGAGA
CCCATCCACCAAGCTCGTATCGGAATGCTAGAGATTTGGAATGCAAGGTGCCACCACCACGGGCGCCGCCGCCGCCGCCTCTTCCTGTCCAGTCCATGCCCCGAGCAGCG
GCCACACAGAAATCTCCAGACCTTGTACGCCTCTTTCACTCGTTAAGAAAGAAAGAGGGGAAGAGAGATCCTCCATTGTTGGGGAAACCAGCTGCGATCAATGCGCATAA
TAGCATTGTTGGGGAAATTCAGAACCGTTCTGCGCATCTTTTAGCGGTAAAAGCAGACATTGAAACCAAAGGAGAGTTCATCAATGGCCTCATTGACAAGGTGCTTGTTG
CAGCTCATACGGACATAGAGGATATCCTCAAGTTTGTCGATTGGCTTGATTCCCAACTTTCATCATTGGCTGATGAGCGAGCTGTGTTAAAGCATTTCAAGTGGCCTGAG
AAAAAAGCTGATGCCATGCGAGAAGCTGCCATAGAATACCGTGCACTCAAACGGTTGGAAAATGAGATCTCTTTTTACAAGGATGATACTAATTCTCCATGTGAGACAGC
CTTGAAGAAGATGGCGAGCTTGTTAGACAAGTCGGAGCGAGCCATACAACGGTTAATCACACTTCGGAGTACTGTCATGCATTCTTATCAAGACCTGAAACTCCCTACAA
GTTGGATGCTAGACTCCGGAATCATGAGTAAAATAAAGCAAGCTTCTATGAATCTAGCCAAGATGTACATGAAAAGGGTGAAAACGGAGCTGGATTCGATTCGTAGTTCG
GATAAAGAATCCAATCATGAATCTCTTCTACTTCAGGGAATTCATTTCGCATACAGAACTCACCAGTTTGCTGGAGGACTTGATTCAGAAACATTATGTGCTTTTGAGGA
AATAAAACAATGGGTTCCAAGACAAGTACTGGGAAAATCTCATTCTCAAGGATGGATAGTTGGCATACCATCATCATAA
Protein sequenceShow/hide protein sequence
MSSRGGRVSLKAMESPKRVVSVSAVESSPQSGVKKQSSRVSRSLTPNAPKKGRDGENVGVSARTVNRGGLKQISHRRSFSGTGPCANVEDCNGVKSGLQEKLCFAEDLIK
DLQSQLVELKEELQKSQSLNLELQSQNDLLVRDLAAAEAKFANASNNDQRESVAEGSQRNVEDNQKLENGKLETHPPSSYRNARDLECKVPPPRAPPPPPLPVQSMPRAA
ATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAVKADIETKGEFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPE
KKADAMREAAIEYRALKRLENEISFYKDDTNSPCETALKKMASLLDKSERAIQRLITLRSTVMHSYQDLKLPTSWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSIRSS
DKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRQVLGKSHSQGWIVGIPSS