CuGenDBv2

PI0022541 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0022541
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: centromere protein C-like isoform X1
Genome location: chr01:27477740..27483684
RNA-Seq Expression: PI0022541
Synteny: PI0022541
Gene Ontology terms:
GO:0051315 - attachment of mitotic spindle microtubules to kinetochore (biological process)
GO:0051382 - kinetochore assembly (biological process)
GO:0051455 - attachment of spindle microtubules to kinetochore involved in homologous chromosome segregation (biological process)
GO:0000776 - kinetochore (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0019237 - centromeric DNA binding (molecular function)
InterPro domains: IPR028386 - Centromere protein C/Mif2/cnp3


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
KAA0058804.1 uncharacterized protein E6C27_scaffold339G002780 [Cucumis melo var. makuwa] | 0.0e+00 | 92.31
Query:  TMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAE
        TMVNEE R SDVIDPLAAYSGINLFPTAFGTL D SKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNS SM SEAATFLVKN+KNE A++KAE
Subjt:  TMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAE

Query:  ENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI
        ENPQERRPALNRKRARFSLKPDA  PPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQ GAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI
Subjt:  ENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI

Query:  TTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP
        TTEDDQNVDPSQVTFDSG+FSPLKLGTETHPSP+IIDSEKKTDEDVAF EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP
Subjt:  TTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP

Query:  LTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDP
        LTLEKLCLPDLEAIPT+NLKS+RGNLSKRSLISVDNQLQKTETLKSK+DNENLVN VSTPSS+RSPLASLSALNRRISLSNSSGD FSA+GIDRSPARDP
Subjt:  LTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDP

Query:  YLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLE
        YLFELGNHLSDAVGITE SSVSKL P LT+DG T+ANGI+PSKIL GDDSMSKISSSNILNV QVGGNTALSGT+AST+AK+V  SST VEINEKLSCLE
Subjt:  YLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLE

Query:  AQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQ
        AQAD V NMQI DH+GSASEQPKLSEVDLI+EYPVGIRSQLDQSAATCTENIVD SSRSSGTEHHDEMEDHEGSASEQPNSSKVD+IKEYPVGI  QLDQ
Subjt:  AQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQ

Query:  STTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
        STTTTCAE IVDG SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SG+RRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP 
Subjt:  STTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGKATMKVKSLVSNEYKDLVELAALH
        KGNGK TMKVKSLVSNEYKDLV+LAALH
Subjt:  KGNGKATMKVKSLVSNEYKDLVELAALH

XP_011659552.1 centromere protein C isoform X3 [Cucumis sativus] | 0.0e+00 | 93.15
Query:  MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMK
        M TM NEEARHSDVIDPLAAYSGINLF TAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSM SEAATFLVKN+KNEEAT+K
Subjt:  MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMK

Query:  AEENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS
        AEEN QERRPALNRKRARFSLKPDAR PPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS
Subjt:  AEENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS

Query:  SITTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQI
        SI TEDDQNVDPSQVTFDSGIFSPLKLGTETHPSP+IIDSEKKTDEDVAF EEEEEEELVASATKAENR+NDIL+EFLSGNCEDLEGDRAINILQERLQI
Subjt:  SITTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQI

Query:  KPLTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPAR
        KPLTLEKLCLPDLEAIPT+NLKSSR NLSKRSLISVDNQLQK E LKSKQDN NLVNPVSTPSS+RSPLASLSALNRRISLSNSS D FSA+GID+SP+R
Subjt:  KPLTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPAR

Query:  DPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSC
        DPYLFELGNHLSDAVG TEQSSVSKL P LT+DG TVANGIKPSKIL GDDSMS ISSSNILNVPQVGGNTALSGT+ASTEAK+V  SST VEINEKLSC
Subjt:  DPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSC

Query:  LEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQL
        LEAQADAV NMQI DHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVD SSRSSGTEH DEMEDHEGSASEQP SSKVDVIKEYPV I SQL
Subjt:  LEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQL

Query:  DQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS
        DQSTTTTCAENI DGASRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSG+RRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVS
Subjt:  DQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS

Query:  PAKGNGKATMKVKSLVSNEYKDLVELAALH
        PAKGNGK TMKVKSLVSNEYKDLVELAALH
Subjt:  PAKGNGKATMKVKSLVSNEYKDLVELAALH

XP_031745135.1 centromere protein C isoform X1 [Cucumis sativus] | 0.0e+00 | 92.14
Query:  MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMK
        M TM NEEARHSDVIDPLAAYSGINLF TAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSM SEAATFLVKN+KNEEAT+K
Subjt:  MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMK

Query:  AEENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG--------RS
        AEEN QERRPALNRKRARFSLKPDAR PPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG        RS
Subjt:  AEENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG--------RS

Query:  VRYKHQYSSITTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAIN
        VRYKHQYSSI TEDDQNVDPSQVTFDSGIFSPLKLGTETHPSP+IIDSEKKTDEDVAF EEEEEEELVASATKAENR+NDIL+EFLSGNCEDLEGDRAIN
Subjt:  VRYKHQYSSITTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAIN

Query:  ILQERLQIKPLTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAY
        ILQERLQIKPLTLEKLCLPDLEAIPT+NLKSSR NLSKRSLISVDNQLQK E LKSKQDN NLVNPVSTPSS+RSPLASLSALNRRISLSNSS D FSA+
Subjt:  ILQERLQIKPLTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAY

Query:  GIDRSPARDPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGV
        GID+SP+RDPYLFELGNHLSDAVG TEQSSVSKL P LT+DG TVANGIKPSKIL GDDSMS ISSSNILNVPQVGGNTALSGT+ASTEAK+V  SST V
Subjt:  GIDRSPARDPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGV

Query:  EINEKLSCLEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEY
        EINEKLSCLEAQADAV NMQI DHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVD SSRSSGTEH DEMEDHEGSASEQP SSKVDVIKEY
Subjt:  EINEKLSCLEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEY

Query:  PVGIHSQLDQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLAT
        PV I SQLDQSTTTTCAENI DGASRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSG+RRSTRFKTRPLEYWKGERLLYGRVHESL T
Subjt:  PVGIHSQLDQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLAT

Query:  VIGLKYVSPAKGNGKATMKVKSLVSNEYKDLVELAALH
        VIGLKYVSPAKGNGK TMKVKSLVSNEYKDLVELAALH
Subjt:  VIGLKYVSPAKGNGKATMKVKSLVSNEYKDLVELAALH

XP_031745136.1 centromere protein C isoform X2 [Cucumis sativus] | 0.0e+00 | 91.87
Query:  MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMK
        M TM NEEARHSDVIDPLAAYSGINLF TAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSM SEAATFLVKN+KNEEAT+K
Subjt:  MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMK

Query:  AEENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG--------RS
        AEEN QERRPALNRKRARFSLKPDAR PPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG        RS
Subjt:  AEENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG--------RS

Query:  VRYKHQYSSITTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAIN
        VRYKHQYSSI TEDDQNVDPSQVTFDSGIFSPLKLGTETHPSP+IIDSEKKTDEDVAF EEEEEEELVASATKAENR+NDIL+EFLSGNCEDLEGDRAIN
Subjt:  VRYKHQYSSITTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAIN

Query:  ILQERLQIKPLTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAY
        ILQERLQIKPLTLEKLCLPDLEAIPT+NLKSSR NLSKRSLISVDNQLQK E LKSKQDN NLVNPVSTPSS+RSPLASLSALNRRISLSNSS D FSA+
Subjt:  ILQERLQIKPLTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAY

Query:  GIDRSPARDPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGV
        GID+SP+RDPYLFELGNHLSDAVG TEQSSVSKL P LT+DG TVANGIKPSKIL GDDSMS ISSSNILNVPQVGGNTALSGT+ASTEAK+V  SST V
Subjt:  GIDRSPARDPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGV

Query:  EINEKLSCLEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEY
        EINEKLSCLEAQADAV NMQI DHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVD SSRSSGTEH DEMEDHEGSASEQP SSKVDVIKEY
Subjt:  EINEKLSCLEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEY

Query:  PVGIHSQLDQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLAT
        PV I SQLDQSTTTTCAENI DGASRSSGTDHHD   VKPKSRANKQ KGKKIS RQSLAGAGTTWQSG+RRSTRFKTRPLEYWKGERLLYGRVHESL T
Subjt:  PVGIHSQLDQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLAT

Query:  VIGLKYVSPAKGNGKATMKVKSLVSNEYKDLVELAALH
        VIGLKYVSPAKGNGK TMKVKSLVSNEYKDLVELAALH
Subjt:  VIGLKYVSPAKGNGKATMKVKSLVSNEYKDLVELAALH

XP_031745137.1 centromere protein C isoform X4 [Cucumis sativus] | 0.0e+00 | 93.01
Query:  MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMK
        M TM NEEARHSDVIDPLAAYSGINLF TAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSM SEAATFLVKN+KNEEAT+K
Subjt:  MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMK

Query:  AEENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS
        AEEN QERRPALNRKRARFSLKPDAR PPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG SVRYKHQYS
Subjt:  AEENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS

Query:  SITTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQI
        SI TEDDQNVDPSQVTFDSGIFSPLKLGTETHPSP+IIDSEKKTDEDVAF EEEEEEELVASATKAENR+NDIL+EFLSGNCEDLEGDRAINILQERLQI
Subjt:  SITTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQI

Query:  KPLTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPAR
        KPLTLEKLCLPDLEAIPT+NLKSSR NLSKRSLISVDNQLQK E LKSKQDN NLVNPVSTPSS+RSPLASLSALNRRISLSNSS D FSA+GID+SP+R
Subjt:  KPLTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPAR

Query:  DPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSC
        DPYLFELGNHLSDAVG TEQSSVSKL P LT+DG TVANGIKPSKIL GDDSMS ISSSNILNVPQVGGNTALSGT+ASTEAK+V  SST VEINEKLSC
Subjt:  DPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSC

Query:  LEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQL
        LEAQADAV NMQI DHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVD SSRSSGTEH DEMEDHEGSASEQP SSKVDVIKEYPV I SQL
Subjt:  LEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQL

Query:  DQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS
        DQSTTTTCAENI DGASRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSG+RRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVS
Subjt:  DQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS

Query:  PAKGNGKATMKVKSLVSNEYKDLVELAALH
        PAKGNGK TMKVKSLVSNEYKDLVELAALH
Subjt:  PAKGNGKATMKVKSLVSNEYKDLVELAALH

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A0A0K774 Uncharacterized protein | 0.0e+00 | 93.15
Query:  MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMK
        M TM NEEARHSDVIDPLAAYSGINLF TAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSM SEAATFLVKN+KNEEAT+K
Subjt:  MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMK

Query:  AEENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS
        AEEN QERRPALNRKRARFSLKPDAR PPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS
Subjt:  AEENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS

Query:  SITTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQI
        SI TEDDQNVDPSQVTFDSGIFSPLKLGTETHPSP+IIDSEKKTDEDVAF EEEEEEELVASATKAENR+NDIL+EFLSGNCEDLEGDRAINILQERLQI
Subjt:  SITTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQI

Query:  KPLTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPAR
        KPLTLEKLCLPDLEAIPT+NLKSSR NLSKRSLISVDNQLQK E LKSKQDN NLVNPVSTPSS+RSPLASLSALNRRISLSNSS D FSA+GID+SP+R
Subjt:  KPLTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPAR

Query:  DPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSC
        DPYLFELGNHLSDAVG TEQSSVSKL P LT+DG TVANGIKPSKIL GDDSMS ISSSNILNVPQVGGNTALSGT+ASTEAK+V  SST VEINEKLSC
Subjt:  DPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSC

Query:  LEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQL
        LEAQADAV NMQI DHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVD SSRSSGTEH DEMEDHEGSASEQP SSKVDVIKEYPV I SQL
Subjt:  LEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQL

Query:  DQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS
        DQSTTTTCAENI DGASRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSG+RRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVS
Subjt:  DQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS

Query:  PAKGNGKATMKVKSLVSNEYKDLVELAALH
        PAKGNGK TMKVKSLVSNEYKDLVELAALH
Subjt:  PAKGNGKATMKVKSLVSNEYKDLVELAALH

A0A1S3CDU5 uncharacterized protein LOC103499749 isoform X2 | 0.0e+00 | 91.91
Query:  TMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAE
        TMVNEE R SDVIDPLAAYSGINLFPTAFGTL DPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNS SM SEAATFLVKN+KNE A++KAE
Subjt:  TMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAE

Query:  ENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI
        ENPQERRPALNRKRARFSLKPDA  PPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQ GAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI
Subjt:  ENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI

Query:  TTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP
        TTEDDQNVDPSQVTFDSG+FSPLKLGTETHPSP+IIDSEKKTDEDVAF EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP
Subjt:  TTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP

Query:  LTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDP
        LTLEKLCLPDLEAIPT+NLKS+RGNLSKRSLISVDNQLQKTETLKSK+DNENLVN VSTPSS+RSPLASLSALNRRISLSNSSGD FSA+GIDRSPARDP
Subjt:  LTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDP

Query:  YLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLE
        YLFELGNHLSDAVGITE SSVSKL P LT+DG T+ANGI+PSKIL GDDSMSKISSSNILNV QVG NTALSGT+AST+AK+V  SST VEINEKLSCLE
Subjt:  YLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLE

Query:  AQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQ
        AQAD V NMQI DH+GSASEQPKLSEVDLI+EYPVGIRSQLDQSAATCTENIVD SSRSSGTEHHDEMEDHEGSASEQPNSSKVD+IKEYPVGI  QLDQ
Subjt:  AQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQ

Query:  S-TTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
        S TTTTCAE IVDG SRSSGTDHHDE  VKPKSRANKQRKGKKISGRQSLAGAGTTW+SG+RRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP
Subjt:  S-TTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  AKGNGKATMKVKSLVSNEYKDLVELAALH
         KGNGK TMKVKSLVSNEYKDLV+LAALH
Subjt:  AKGNGKATMKVKSLVSNEYKDLVELAALH

A0A1S3CDU7 uncharacterized protein LOC103499749 isoform X1 | 0.0e+00 | 92.18
Query:  TMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAE
        TMVNEE R SDVIDPLAAYSGINLFPTAFGTL DPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNS SM SEAATFLVKN+KNE A++KAE
Subjt:  TMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAE

Query:  ENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI
        ENPQERRPALNRKRARFSLKPDA  PPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQ GAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI
Subjt:  ENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI

Query:  TTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP
        TTEDDQNVDPSQVTFDSG+FSPLKLGTETHPSP+IIDSEKKTDEDVAF EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP
Subjt:  TTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP

Query:  LTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDP
        LTLEKLCLPDLEAIPT+NLKS+RGNLSKRSLISVDNQLQKTETLKSK+DNENLVN VSTPSS+RSPLASLSALNRRISLSNSSGD FSA+GIDRSPARDP
Subjt:  LTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDP

Query:  YLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLE
        YLFELGNHLSDAVGITE SSVSKL P LT+DG T+ANGI+PSKIL GDDSMSKISSSNILNV QVG NTALSGT+AST+AK+V  SST VEINEKLSCLE
Subjt:  YLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLE

Query:  AQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQ
        AQAD V NMQI DH+GSASEQPKLSEVDLI+EYPVGIRSQLDQSAATCTENIVD SSRSSGTEHHDEMEDHEGSASEQPNSSKVD+IKEYPVGI  QLDQ
Subjt:  AQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQ

Query:  S-TTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
        S TTTTCAE IVDG SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SG+RRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP
Subjt:  S-TTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  AKGNGKATMKVKSLVSNEYKDLVELAALH
         KGNGK TMKVKSLVSNEYKDLV+LAALH
Subjt:  AKGNGKATMKVKSLVSNEYKDLVELAALH

A0A1S4E341 uncharacterized protein LOC103499749 isoform X3 | 0.0e+00 | 88.48
Query:  TMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAE
        TMVNEE R SDVIDPLAAYSGINLFPTAFGTL DPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNS SM SEAATFLVKN+KNE A++KAE
Subjt:  TMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAE

Query:  ENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI
        ENPQERRPALNRKRARFSLKPDA  PPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQ GAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI
Subjt:  ENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI

Query:  TTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP
        TTEDDQNVDPSQVTFDSG+FSPLKLGTETHPSP+IIDSEKKTDEDVAF EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP
Subjt:  TTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP

Query:  LTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDP
        LTLEKLCLPDLEAIPT+NLKS+RGNLSKRSLISVDNQLQKTETLKSK+DNENLVN VSTPSS+RSPLASLSALNRRISLSNSS                 
Subjt:  LTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDP

Query:  YLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLE
                    VGITE SSVSKL P LT+DG T+ANGI+PSKIL GDDSMSKISSSNILNV QVG NTALSGT+AST+AK+V  SST VEINEKLSCLE
Subjt:  YLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLE

Query:  AQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQ
        AQAD V NMQI DH+GSASEQPKLSEVDLI+EYPVGIRSQLDQSAATCTENIVD SSRSSGTEHHDEMEDHEGSASEQPNSSKVD+IKEYPVGI  QLDQ
Subjt:  AQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQ

Query:  S-TTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
        S TTTTCAE IVDG SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SG+RRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP
Subjt:  S-TTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  AKGNGKATMKVKSLVSNEYKDLVELAALH
         KGNGK TMKVKSLVSNEYKDLV+LAALH
Subjt:  AKGNGKATMKVKSLVSNEYKDLVELAALH

A0A5A7UUE4 Uncharacterized protein | 0.0e+00 | 92.31
Query:  TMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAE
        TMVNEE R SDVIDPLAAYSGINLFPTAFGTL D SKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNS SM SEAATFLVKN+KNE A++KAE
Subjt:  TMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAE

Query:  ENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI
        ENPQERRPALNRKRARFSLKPDA  PPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQ GAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI
Subjt:  ENPQERRPALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSI

Query:  TTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP
        TTEDDQNVDPSQVTFDSG+FSPLKLGTETHPSP+IIDSEKKTDEDVAF EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP
Subjt:  TTEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAF-EEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKP

Query:  LTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDP
        LTLEKLCLPDLEAIPT+NLKS+RGNLSKRSLISVDNQLQKTETLKSK+DNENLVN VSTPSS+RSPLASLSALNRRISLSNSSGD FSA+GIDRSPARDP
Subjt:  LTLEKLCLPDLEAIPTINLKSSRGNLSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDP

Query:  YLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLE
        YLFELGNHLSDAVGITE SSVSKL P LT+DG T+ANGI+PSKIL GDDSMSKISSSNILNV QVGGNTALSGT+AST+AK+V  SST VEINEKLSCLE
Subjt:  YLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLE

Query:  AQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQ
        AQAD V NMQI DH+GSASEQPKLSEVDLI+EYPVGIRSQLDQSAATCTENIVD SSRSSGTEHHDEMEDHEGSASEQPNSSKVD+IKEYPVGI  QLDQ
Subjt:  AQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQ

Query:  STTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
        STTTTCAE IVDG SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SG+RRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP 
Subjt:  STTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGLRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGKATMKVKSLVSNEYKDLVELAALH
        KGNGK TMKVKSLVSNEYKDLV+LAALH
Subjt:  KGNGKATMKVKSLVSNEYKDLVELAALH

SwissProt top hits | e value | % identity (alignment follows each hit)
Q66LG9 Centromere protein C | 1.1e-56 | 29.23
Query:  DPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAEENPQERRPALNRK
        DPL AYSG++LFP    +L +P  P     DL   H  L+SM     S+  EQA++IL+     +Q                 +    N +ERRP L+RK
Subjt:  DPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAEENPQERRPALNRK

Query:  RARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQ
        R  FSL      PP  + P+FD  +    E+FF AY+K E A +E QKQTG+ + D+ +  PS   R RRPGI GR  R +K  ++     D  N++ S+
Subjt:  RARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQ

Query:  VTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAFEEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKPLTLEKLCLPDLEA
                  L+  T  H    +   +++ D+              S    +  +N++L + L+ + E+LEGD AI +L+ERLQIK   +EK  +P+ + 
Subjt:  VTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAFEEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKPLTLEKLCLPDLEA

Query:  IPTINLKSSRGN-LSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGI------DRSPAR---DPYLF
        +  +NLK+S  N  +++SL  + N L+ T  +  ++++ +      +P ++           +  S  N   D FS   I      D+ P+     P   
Subjt:  IPTINLKSSRGN-LSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGI------DRSPAR---DPYLF

Query:  ELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLEAQ-
        ++ N     VG  + +  S  N  + K      +G   S I  G          N    P +    ++S   ++   K+V   + G E++  +S   A  
Subjt:  ELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLEAQ-

Query:  --ADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRS-QLDQSAATCTENIVDESSRSSG-----TEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGI
           D   + +I++   +     + +  ++ + + V   S    Q A++ + N   E   + G      EH+  + + E   +   +  +V+   E     
Subjt:  --ADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRS-QLDQSAATCTENIVDESSRSSG-----TEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGI

Query:  HSQLDQSTTTTCAENIVDGASRSSGTDHHDEEQVKP---KSRANKQRKGKK------------------ISGRQSLAGAGTTWQSGLRRSTRFKTRPLEY
        H Q ++      +++ V   S++   +   ++Q+K    +SRA KQ KGK                    S R+SLA AGT  + G+RRSTR K+RPLEY
Subjt:  HSQLDQSTTTTCAENIVDGASRSSGTDHHDEEQVKP---KSRANKQRKGKK------------------ISGRQSLAGAGTTWQSGLRRSTRFKTRPLEY

Query:  WKGERLLYGRVHESLATVIGLKYVSPAKG-NGKATMKVKSLVSNEYKDLVELAALH
        W+GER LYGR+HESL TVIG+KY SP +G       KVKS VS+EYK LV+ AALH
Subjt:  WKGERLLYGRVHESLATVIGLKYVSPAKG-NGKATMKVKSLVSNEYKDLVELAALH

Arabidopsis top hits | e value | % identity (alignment follows each hit)
AT1G15660.1 centromere protein C | 7.7e-58 | 29.23
Query:  DPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAEENPQERRPALNRK
        DPL AYSG++LFP    +L +P  P     DL   H  L+SM     S+  EQA++IL+     +Q                 +    N +ERRP L+RK
Subjt:  DPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAEENPQERRPALNRK

Query:  RARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQ
        R  FSL      PP  + P+FD  +    E+FF AY+K E A +E QKQTG+ + D+ +  PS   R RRPGI GR  R +K  ++     D  N++ S+
Subjt:  RARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQ

Query:  VTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAFEEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKPLTLEKLCLPDLEA
                  L+  T  H    +   +++ D+              S    +  +N++L + L+ + E+LEGD AI +L+ERLQIK   +EK  +P+ + 
Subjt:  VTFDSGIFSPLKLGTETHPSPYIIDSEKKTDEDVAFEEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKPLTLEKLCLPDLEA

Query:  IPTINLKSSRGN-LSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGI------DRSPAR---DPYLF
        +  +NLK+S  N  +++SL  + N L+ T  +  ++++ +      +P ++           +  S  N   D FS   I      D+ P+     P   
Subjt:  IPTINLKSSRGN-LSKRSLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGI------DRSPAR---DPYLF

Query:  ELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLEAQ-
        ++ N     VG  + +  S  N  + K      +G   S I  G          N    P +    ++S   ++   K+V   + G E++  +S   A  
Subjt:  ELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGIKPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLEAQ-

Query:  --ADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRS-QLDQSAATCTENIVDESSRSSG-----TEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGI
           D   + +I++   +     + +  ++ + + V   S    Q A++ + N   E   + G      EH+  + + E   +   +  +V+   E     
Subjt:  --ADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRS-QLDQSAATCTENIVDESSRSSG-----TEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGI

Query:  HSQLDQSTTTTCAENIVDGASRSSGTDHHDEEQVKP---KSRANKQRKGKK------------------ISGRQSLAGAGTTWQSGLRRSTRFKTRPLEY
        H Q ++      +++ V   S++   +   ++Q+K    +SRA KQ KGK                    S R+SLA AGT  + G+RRSTR K+RPLEY
Subjt:  HSQLDQSTTTTCAENIVDGASRSSGTDHHDEEQVKP---KSRANKQRKGKK------------------ISGRQSLAGAGTTWQSGLRRSTRFKTRPLEY

Query:  WKGERLLYGRVHESLATVIGLKYVSPAKG-NGKATMKVKSLVSNEYKDLVELAALH
        W+GER LYGR+HESL TVIG+KY SP +G       KVKS VS+EYK LV+ AALH
Subjt:  WKGERLLYGRVHESLATVIGLKYVSPAKG-NGKATMKVKSLVSNEYKDLVELAALH


Sequences
CDS sequence
ATGACAACAATGGTGAACGAAGAAGCTCGACACTCCGATGTGATCGATCCTCTTGCTGCGTATTCTGGTATCAATCTCTTTCCGACCGCATTTGGTACTTTGCCGGATCC
GTCAAAGCCACATGATCTTGGAACAGACCTTGACGGCATCCACAAGCGCCTCAAATCCATGGTGTTAAGGAGTCCCAGTAAACTATTAGAGCAGGCCAGATCAATTTTAG
ATGGCAACTCAAATTCGATGCAATCTGAAGCTGCTACATTTCTTGTGAAGAATAAGAAAAATGAGGAAGCTACAATGAAGGCAGAGGAAAATCCACAGGAAAGAAGGCCG
GCCTTAAACCGAAAGCGGGCTAGGTTTTCTTTAAAACCTGATGCTAGGCACCCTCCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGACCCCGAGGAGTT
CTTTTTGGCCTATGAAAAGCATGAAAATGCCAAAAAAGAAATCCAAAAACAGACGGGAGCAGTTTTAAAGGACTTGAACCAACAAAATCCATCGACGAATACACGCCAGC
GGAGACCAGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATCAGAATGTAGATCCTTCACAAGTGACATTTGATTCAGGT
ATTTTCAGTCCATTGAAATTGGGCACAGAAACACACCCAAGTCCATATATAATTGACTCAGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGGAGGAGGAGGAGGAGGA
GCTCGTTGCTTCAGCTACGAAGGCAGAGAACAGAGTAAATGATATTTTGGATGAATTTCTCTCTGGCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTAC
AGGAGCGGTTGCAGATTAAACCTCTTACTTTAGAGAAATTATGCCTTCCAGATTTGGAAGCCATTCCAACAATAAATTTGAAATCTTCAAGAGGCAATCTGTCGAAGCGT
AGTTTGATCAGTGTGGACAATCAGTTACAAAAGACAGAAACTTTGAAATCTAAGCAGGACAATGAAAATTTGGTCAATCCTGTTTCTACACCATCATCAGTGAGAAGTCC
ATTGGCATCGTTATCAGCCCTAAATAGACGAATTTCACTTTCAAATTCATCAGGTGATCCATTTTCAGCTTATGGCATCGACCGATCTCCCGCAAGAGATCCTTACCTTT
TTGAACTCGGTAATCACTTATCTGATGCAGTTGGTATTACAGAGCAGTCGAGCGTTTCTAAATTGAATCCACCTTTAACCAAAGATGGCTGGACTGTAGCAAATGGAATT
AAACCATCCAAAATTCTTTTTGGAGACGATTCCATGTCTAAAATATCTTCAAGTAATATTTTAAATGTACCCCAAGTTGGTGGCAATACTGCTTTAAGTGGAACTCATGC
CAGCACGGAAGCTAAAGATGTTTGTGACAGCAGCACAGGCGTGGAAATAAATGAGAAATTGAGCTGTCTTGAAGCCCAAGCAGATGCGGTGGTTAATATGCAGATTTCAG
ATCACGAAGGATCAGCTTCTGAGCAACCAAAATTATCTGAGGTGGATCTAATCAAAGAGTACCCGGTTGGCATTCGGAGTCAGTTGGATCAATCAGCTGCTACTTGTACT
GAAAATATTGTTGATGAGTCGTCTAGAAGCAGTGGAACAGAACACCACGATGAGATGGAAGATCACGAAGGATCAGCTTCTGAGCAACCAAACTCATCTAAGGTGGATGT
GATTAAAGAGTACCCTGTCGGCATTCACAGTCAGTTGGATCAATCAACTACTACTACTTGTGCTGAAAATATTGTCGATGGGGCATCTAGAAGCAGTGGAACGGATCACC
ATGATGAGGAACAGGTCAAGCCAAAATCTCGTGCAAACAAACAACGCAAAGGCAAAAAGATTTCCGGGAGGCAAAGTCTTGCAGGTGCTGGTACAACGTGGCAAAGTGGG
TTGAGAAGAAGTACCAGGTTCAAAACACGACCCTTGGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGTGTACATGAGAGCCTAGCGACAGTAATCGGCTTGAAGTA
TGTGTCTCCAGCAAAAGGAAATGGCAAAGCAACCATGAAGGTGAAGTCTCTAGTCTCCAATGAGTACAAAGATCTCGTCGAGTTAGCAGCCCTTCACTGA
mRNA sequence
AAGTTTTGAAGGTCCTCATCTGCAAAACAATAAAAATTTGAACGGGAAAAAACAATACGAGTTTTCGCGCTTTGGTTCACTCGTTCGAATTTGGGGTGGTTCTAGGGTTC
GGTTCAAACATTGGAGATGACAACAATGGTGAACGAAGAAGCTCGACACTCCGATGTGATCGATCCTCTTGCTGCGTATTCTGGTATCAATCTCTTTCCGACCGCATTTG
GTACTTTGCCGGATCCGTCAAAGCCACATGATCTTGGAACAGACCTTGACGGCATCCACAAGCGCCTCAAATCCATGGTGTTAAGGAGTCCCAGTAAACTATTAGAGCAG
GCCAGATCAATTTTAGATGGCAACTCAAATTCGATGCAATCTGAAGCTGCTACATTTCTTGTGAAGAATAAGAAAAATGAGGAAGCTACAATGAAGGCAGAGGAAAATCC
ACAGGAAAGAAGGCCGGCCTTAAACCGAAAGCGGGCTAGGTTTTCTTTAAAACCTGATGCTAGGCACCCTCCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGA
AAGACCCCGAGGAGTTCTTTTTGGCCTATGAAAAGCATGAAAATGCCAAAAAAGAAATCCAAAAACAGACGGGAGCAGTTTTAAAGGACTTGAACCAACAAAATCCATCG
ACGAATACACGCCAGCGGAGACCAGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATCAGAATGTAGATCCTTCACAAGT
GACATTTGATTCAGGTATTTTCAGTCCATTGAAATTGGGCACAGAAACACACCCAAGTCCATATATAATTGACTCAGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGG
AGGAGGAGGAGGAGGAGCTCGTTGCTTCAGCTACGAAGGCAGAGAACAGAGTAAATGATATTTTGGATGAATTTCTCTCTGGCAATTGTGAAGATCTAGAAGGTGATCGA
GCCATCAACATATTACAGGAGCGGTTGCAGATTAAACCTCTTACTTTAGAGAAATTATGCCTTCCAGATTTGGAAGCCATTCCAACAATAAATTTGAAATCTTCAAGAGG
CAATCTGTCGAAGCGTAGTTTGATCAGTGTGGACAATCAGTTACAAAAGACAGAAACTTTGAAATCTAAGCAGGACAATGAAAATTTGGTCAATCCTGTTTCTACACCAT
CATCAGTGAGAAGTCCATTGGCATCGTTATCAGCCCTAAATAGACGAATTTCACTTTCAAATTCATCAGGTGATCCATTTTCAGCTTATGGCATCGACCGATCTCCCGCA
AGAGATCCTTACCTTTTTGAACTCGGTAATCACTTATCTGATGCAGTTGGTATTACAGAGCAGTCGAGCGTTTCTAAATTGAATCCACCTTTAACCAAAGATGGCTGGAC
TGTAGCAAATGGAATTAAACCATCCAAAATTCTTTTTGGAGACGATTCCATGTCTAAAATATCTTCAAGTAATATTTTAAATGTACCCCAAGTTGGTGGCAATACTGCTT
TAAGTGGAACTCATGCCAGCACGGAAGCTAAAGATGTTTGTGACAGCAGCACAGGCGTGGAAATAAATGAGAAATTGAGCTGTCTTGAAGCCCAAGCAGATGCGGTGGTT
AATATGCAGATTTCAGATCACGAAGGATCAGCTTCTGAGCAACCAAAATTATCTGAGGTGGATCTAATCAAAGAGTACCCGGTTGGCATTCGGAGTCAGTTGGATCAATC
AGCTGCTACTTGTACTGAAAATATTGTTGATGAGTCGTCTAGAAGCAGTGGAACAGAACACCACGATGAGATGGAAGATCACGAAGGATCAGCTTCTGAGCAACCAAACT
CATCTAAGGTGGATGTGATTAAAGAGTACCCTGTCGGCATTCACAGTCAGTTGGATCAATCAACTACTACTACTTGTGCTGAAAATATTGTCGATGGGGCATCTAGAAGC
AGTGGAACGGATCACCATGATGAGGAACAGGTCAAGCCAAAATCTCGTGCAAACAAACAACGCAAAGGCAAAAAGATTTCCGGGAGGCAAAGTCTTGCAGGTGCTGGTAC
AACGTGGCAAAGTGGGTTGAGAAGAAGTACCAGGTTCAAAACACGACCCTTGGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGTGTACATGAGAGCCTAGCGACAG
TAATCGGCTTGAAGTATGTGTCTCCAGCAAAAGGAAATGGCAAAGCAACCATGAAGGTGAAGTCTCTAGTCTCCAATGAGTACAAAGATCTCGTCGAGTTAGCAGCCCTT
CACTGAGAGTCGTCTACTAAAAGGAACAAAAAGCCTTGAAGCTTCTTAGATTTTGCATGTATAACAACAAGCAATTCTCTTTGAATACAAATAGCATCTAGTCTCTATGT
AAAGACTGTAGAGGAGAATTAGGCTTATGCCATTGCATTGTATATTTCTAAGCCTTTCTCTATCATATATATATCTATCAAGCTGTTTCGCTTGTGTATTTGAGCTCATG
TACTTGTCATACGATTCCATATTTTACCCATTGACATCTAGCTTTCTGTACCAATTTTCTAGAATGAGTTGTAATCCTTGGGCCATTTTGTTTTTTA
Protein sequence
MTTMVNEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMQSEAATFLVKNKKNEEATMKAEENPQERRP
ALNRKRARFSLKPDARHPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFDSG
IFSPLKLGTETHPSPYIIDSEKKTDEDVAFEEEEEEELVASATKAENRVNDILDEFLSGNCEDLEGDRAINILQERLQIKPLTLEKLCLPDLEAIPTINLKSSRGNLSKR
SLISVDNQLQKTETLKSKQDNENLVNPVSTPSSVRSPLASLSALNRRISLSNSSGDPFSAYGIDRSPARDPYLFELGNHLSDAVGITEQSSVSKLNPPLTKDGWTVANGI
KPSKILFGDDSMSKISSSNILNVPQVGGNTALSGTHASTEAKDVCDSSTGVEINEKLSCLEAQADAVVNMQISDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCT
ENIVDESSRSSGTEHHDEMEDHEGSASEQPNSSKVDVIKEYPVGIHSQLDQSTTTTCAENIVDGASRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSG
LRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGKATMKVKSLVSNEYKDLVELAALH
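
The CDS and protein sequences above should correspond under the standard genetic code. The following is a minimal sketch of how that could be checked, not part of the database record: the function name `translate` and the variable names are illustrative, and only the first 45 and last 24 nucleotides of the CDS are copied from the record to keep the example short. It assumes the standard nuclear codon table applies.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; '*' marks a stop codon.

    Trailing bases that do not fill a codon are ignored, and codons
    with characters other than A/C/G/T are reported as 'X'.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 45 nt and last 24 nt of the CDS, copied from the record above.
cds_start = "ATGACAACAATGGTGAACGAAGAAGCTCGACACTCCGATGTGATC"
cds_end = "GTCGAGTTAGCAGCCCTTCACTGA"

print(translate(cds_start))  # MTTMVNEEARHSDVI
print(translate(cds_end))    # VELAALH*
```

Both fragments agree with the start (MTTMVNEEARHSDVI) and end (VELAALH) of the listed protein sequence, with the final TGA codon acting as the stop. Passing the full CDS to the same function would allow a complete comparison.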