; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G21140 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G21140
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptioncentromere protein C-like isoform X1
Genome locationChr7:18115571..18121378
RNA-Seq ExpressionCSPI07G21140
SyntenyCSPI07G21140
Gene Ontology termsGO:0051315 - attachment of mitotic spindle microtubules to kinetochore (biological process)
GO:0051382 - kinetochore assembly (biological process)
GO:0051455 - attachment of spindle microtubules to kinetochore involved in homologous chromosome segregation (biological process)
GO:0000776 - kinetochore (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0019237 - centromeric DNA binding (molecular function)
InterPro domainsIPR028386 - Centromere protein C/Mif2/cnp3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058804.1 uncharacterized protein E6C27_scaffold339G002780 [Cucumis melo var. makuwa]0.0e+0092.46Show/hide
Query:  ITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKV
        +TM NEE R SDVIDPLAAYSGINLFPTAFGTL D SKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNS SMISEAATFLVKNEKNE A+VK 
Subjt:  ITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKV

Query:  EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS
        EEN QERRPALNRKRARFSLKPDA QPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQ GAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS
Subjt:  EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS

Query:  IATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIK
        I TEDDQNVDPSQVTFDSG+FSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENR+NDIL+EFLSGNCEDLEGDRAINILQERLQIK
Subjt:  IATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIK

Query:  PLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRD
        PLTLEKLCLPDLEAIPTMNLKS+R NLSKRSLISVDNQLQK E LKSK+DN NLVN VSTPSSMRSPLASLSALNRRISLSNSS DSFSAHGID+SP+RD
Subjt:  PLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRD

Query:  PYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCL
        PYLFELGNHLSDAVG TE SSVSKLKPLLTRDGGT+ANGI+PSKILSGDDSMS ISSSNILNV QVGGNTALSGTYAST+AKNVS SSTDVEINEKLSCL
Subjt:  PYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCL

Query:  EAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLD
        EAQAD VANMQI DH+GSASEQPKLSEVDLI+EYPVGIRSQLDQSAATCTENIVDGSSRSSGTEH DEMEDHEGSASEQP SSKVD+IKEYPV IQ QLD
Subjt:  EAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLD

Query:  QSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVSP
        QSTTTTCAE I DG SRSSGTDHHDEEQVKPKSRANKQ KGKKIS RQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESL TVIGLKYVSP
Subjt:  QSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVSP

Query:  AKGNGKPTMKVKSLVSNEYKDLVELAALH
         KGNGKPTMKVKSLVSNEYKDLV+LAALH
Subjt:  AKGNGKPTMKVKSLVSNEYKDLVELAALH

XP_011659552.1 centromere protein C isoform X3 [Cucumis sativus]0.0e+0099.59Show/hide
Query:  MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK
        MITMANEEARHSDVIDPLAAYSGINLF TAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK
Subjt:  MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK

Query:  VEENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS
         EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS
Subjt:  VEENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS

Query:  SIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQI
        SIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQI
Subjt:  SIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQI

Query:  KPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSR
        KPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSR
Subjt:  KPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSR

Query:  DPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSC
        DPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSC
Subjt:  DPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSC

Query:  LEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQL
        LEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQL
Subjt:  LEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQL

Query:  DQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS
        DQSTTTTCAENIADGASRSSGTDHHD EQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS
Subjt:  DQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS

Query:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH
        PAKGNGKPTMKVKSLVSNEYKDLVELAALH
Subjt:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH

XP_031745135.1 centromere protein C isoform X1 [Cucumis sativus]0.0e+0098.51Show/hide
Query:  MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK
        MITMANEEARHSDVIDPLAAYSGINLF TAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK
Subjt:  MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK

Query:  VEENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG--------RS
         EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG        RS
Subjt:  VEENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG--------RS

Query:  VRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAIN
        VRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAIN
Subjt:  VRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAIN

Query:  ILQERLQIKPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAH
        ILQERLQIKPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAH
Subjt:  ILQERLQIKPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAH

Query:  GIDQSPSRDPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDV
        GIDQSPSRDPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDV
Subjt:  GIDQSPSRDPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDV

Query:  EINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEY
        EINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEY
Subjt:  EINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEY

Query:  PVAIQSQLDQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTT
        PVAIQSQLDQSTTTTCAENIADGASRSSGTDHHD EQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTT
Subjt:  PVAIQSQLDQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTT

Query:  VIGLKYVSPAKGNGKPTMKVKSLVSNEYKDLVELAALH
        VIGLKYVSPAKGNGKPTMKVKSLVSNEYKDLVELAALH
Subjt:  VIGLKYVSPAKGNGKPTMKVKSLVSNEYKDLVELAALH

XP_031745136.1 centromere protein C isoform X2 [Cucumis sativus]0.0e+0098.24Show/hide
Query:  MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK
        MITMANEEARHSDVIDPLAAYSGINLF TAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK
Subjt:  MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK

Query:  VEENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG--------RS
         EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG        RS
Subjt:  VEENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG--------RS

Query:  VRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAIN
        VRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAIN
Subjt:  VRYKHQYSSIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAIN

Query:  ILQERLQIKPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAH
        ILQERLQIKPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAH
Subjt:  ILQERLQIKPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAH

Query:  GIDQSPSRDPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDV
        GIDQSPSRDPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDV
Subjt:  GIDQSPSRDPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDV

Query:  EINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEY
        EINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEY
Subjt:  EINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEY

Query:  PVAIQSQLDQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTT
        PVAIQSQLDQSTTTTCAENIADGASRSSGTDHHD   VKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTT
Subjt:  PVAIQSQLDQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTT

Query:  VIGLKYVSPAKGNGKPTMKVKSLVSNEYKDLVELAALH
        VIGLKYVSPAKGNGKPTMKVKSLVSNEYKDLVELAALH
Subjt:  VIGLKYVSPAKGNGKPTMKVKSLVSNEYKDLVELAALH

XP_031745137.1 centromere protein C isoform X4 [Cucumis sativus]0.0e+0099.45Show/hide
Query:  MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK
        MITMANEEARHSDVIDPLAAYSGINLF TAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK
Subjt:  MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK

Query:  VEENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS
         EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILG SVRYKHQYS
Subjt:  VEENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS

Query:  SIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQI
        SIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQI
Subjt:  SIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQI

Query:  KPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSR
        KPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSR
Subjt:  KPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSR

Query:  DPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSC
        DPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSC
Subjt:  DPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSC

Query:  LEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQL
        LEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQL
Subjt:  LEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQL

Query:  DQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS
        DQSTTTTCAENIADGASRSSGTDHHD EQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS
Subjt:  DQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS

Query:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH
        PAKGNGKPTMKVKSLVSNEYKDLVELAALH
Subjt:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH

TrEMBL top hitse value%identityAlignment
A0A0A0K774 Uncharacterized protein0.0e+0099.59Show/hide
Query:  MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK
        MITMANEEARHSDVIDPLAAYSGINLF TAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK
Subjt:  MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVK

Query:  VEENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS
         EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS
Subjt:  VEENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYS

Query:  SIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQI
        SIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQI
Subjt:  SIATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQI

Query:  KPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSR
        KPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSR
Subjt:  KPLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSR

Query:  DPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSC
        DPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSC
Subjt:  DPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSC

Query:  LEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQL
        LEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQL
Subjt:  LEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQL

Query:  DQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS
        DQSTTTTCAENIADGASRSSGTDHHD EQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS
Subjt:  DQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS

Query:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH
        PAKGNGKPTMKVKSLVSNEYKDLVELAALH
Subjt:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH

A0A1S3CDU5 uncharacterized protein LOC103499749 isoform X20.0e+0092.05Show/hide
Query:  ITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKV
        +TM NEE R SDVIDPLAAYSGINLFPTAFGTL DPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNS SMISEAATFLVKNEKNE A+VK 
Subjt:  ITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKV

Query:  EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS
        EEN QERRPALNRKRARFSLKPDA QPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQ GAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS
Subjt:  EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS

Query:  IATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIK
        I TEDDQNVDPSQVTFDSG+FSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENR+NDIL+EFLSGNCEDLEGDRAINILQERLQIK
Subjt:  IATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIK

Query:  PLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRD
        PLTLEKLCLPDLEAIPTMNLKS+R NLSKRSLISVDNQLQK E LKSK+DN NLVN VSTPSSMRSPLASLSALNRRISLSNSS DSFSAHGID+SP+RD
Subjt:  PLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRD

Query:  PYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCL
        PYLFELGNHLSDAVG TE SSVSKLKPLLTRDGGT+ANGI+PSKILSGDDSMS ISSSNILNV QVG NTALSGTYAST+AKNVS SSTDVEINEKLSCL
Subjt:  PYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCL

Query:  EAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLD
        EAQAD VANMQI DH+GSASEQPKLSEVDLI+EYPVGIRSQLDQSAATCTENIVDGSSRSSGTEH DEMEDHEGSASEQP SSKVD+IKEYPV IQ QLD
Subjt:  EAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLD

Query:  QS-TTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS
        QS TTTTCAE I DG SRSSGTDHHDE  VKPKSRANKQ KGKKIS RQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESL TVIGLKYVS
Subjt:  QS-TTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS

Query:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH
        P KGNGKPTMKVKSLVSNEYKDLV+LAALH
Subjt:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH

A0A1S3CDU7 uncharacterized protein LOC103499749 isoform X10.0e+0092.33Show/hide
Query:  ITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKV
        +TM NEE R SDVIDPLAAYSGINLFPTAFGTL DPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNS SMISEAATFLVKNEKNE A+VK 
Subjt:  ITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKV

Query:  EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS
        EEN QERRPALNRKRARFSLKPDA QPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQ GAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS
Subjt:  EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS

Query:  IATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIK
        I TEDDQNVDPSQVTFDSG+FSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENR+NDIL+EFLSGNCEDLEGDRAINILQERLQIK
Subjt:  IATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIK

Query:  PLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRD
        PLTLEKLCLPDLEAIPTMNLKS+R NLSKRSLISVDNQLQK E LKSK+DN NLVN VSTPSSMRSPLASLSALNRRISLSNSS DSFSAHGID+SP+RD
Subjt:  PLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRD

Query:  PYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCL
        PYLFELGNHLSDAVG TE SSVSKLKPLLTRDGGT+ANGI+PSKILSGDDSMS ISSSNILNV QVG NTALSGTYAST+AKNVS SSTDVEINEKLSCL
Subjt:  PYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCL

Query:  EAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLD
        EAQAD VANMQI DH+GSASEQPKLSEVDLI+EYPVGIRSQLDQSAATCTENIVDGSSRSSGTEH DEMEDHEGSASEQP SSKVD+IKEYPV IQ QLD
Subjt:  EAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLD

Query:  QS-TTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS
        QS TTTTCAE I DG SRSSGTDHHDEEQVKPKSRANKQ KGKKIS RQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESL TVIGLKYVS
Subjt:  QS-TTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS

Query:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH
        P KGNGKPTMKVKSLVSNEYKDLV+LAALH
Subjt:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH

A0A1S4E341 uncharacterized protein LOC103499749 isoform X30.0e+0088.77Show/hide
Query:  ITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKV
        +TM NEE R SDVIDPLAAYSGINLFPTAFGTL DPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNS SMISEAATFLVKNEKNE A+VK 
Subjt:  ITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKV

Query:  EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS
        EEN QERRPALNRKRARFSLKPDA QPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQ GAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS
Subjt:  EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS

Query:  IATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIK
        I TEDDQNVDPSQVTFDSG+FSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENR+NDIL+EFLSGNCEDLEGDRAINILQERLQIK
Subjt:  IATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIK

Query:  PLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRD
        PLTLEKLCLPDLEAIPTMNLKS+R NLSKRSLISVDNQLQK E LKSK+DN NLVN VSTPSSMRSPLASLSALNRRISLSNSS                
Subjt:  PLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRD

Query:  PYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCL
                     VG TE SSVSKLKPLLTRDGGT+ANGI+PSKILSGDDSMS ISSSNILNV QVG NTALSGTYAST+AKNVS SSTDVEINEKLSCL
Subjt:  PYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCL

Query:  EAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLD
        EAQAD VANMQI DH+GSASEQPKLSEVDLI+EYPVGIRSQLDQSAATCTENIVDGSSRSSGTEH DEMEDHEGSASEQP SSKVD+IKEYPV IQ QLD
Subjt:  EAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLD

Query:  QS-TTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS
        QS TTTTCAE I DG SRSSGTDHHDEEQVKPKSRANKQ KGKKIS RQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESL TVIGLKYVS
Subjt:  QS-TTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVS

Query:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH
        P KGNGKPTMKVKSLVSNEYKDLV+LAALH
Subjt:  PAKGNGKPTMKVKSLVSNEYKDLVELAALH

A0A5A7UUE4 Uncharacterized protein0.0e+0092.46Show/hide
Query:  ITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKV
        +TM NEE R SDVIDPLAAYSGINLFPTAFGTL D SKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNS SMISEAATFLVKNEKNE A+VK 
Subjt:  ITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKV

Query:  EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS
        EEN QERRPALNRKRARFSLKPDA QPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQ GAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS
Subjt:  EENLQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSS

Query:  IATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIK
        I TEDDQNVDPSQVTFDSG+FSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENR+NDIL+EFLSGNCEDLEGDRAINILQERLQIK
Subjt:  IATEDDQNVDPSQVTFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIK

Query:  PLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRD
        PLTLEKLCLPDLEAIPTMNLKS+R NLSKRSLISVDNQLQK E LKSK+DN NLVN VSTPSSMRSPLASLSALNRRISLSNSS DSFSAHGID+SP+RD
Subjt:  PLTLEKLCLPDLEAIPTMNLKSSRSNLSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRD

Query:  PYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCL
        PYLFELGNHLSDAVG TE SSVSKLKPLLTRDGGT+ANGI+PSKILSGDDSMS ISSSNILNV QVGGNTALSGTYAST+AKNVS SSTDVEINEKLSCL
Subjt:  PYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANGIKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCL

Query:  EAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLD
        EAQAD VANMQI DH+GSASEQPKLSEVDLI+EYPVGIRSQLDQSAATCTENIVDGSSRSSGTEH DEMEDHEGSASEQP SSKVD+IKEYPV IQ QLD
Subjt:  EAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLD

Query:  QSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVSP
        QSTTTTCAE I DG SRSSGTDHHDEEQVKPKSRANKQ KGKKIS RQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESL TVIGLKYVSP
Subjt:  QSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVSP

Query:  AKGNGKPTMKVKSLVSNEYKDLVELAALH
         KGNGKPTMKVKSLVSNEYKDLV+LAALH
Subjt:  AKGNGKPTMKVKSLVSNEYKDLVELAALH

SwissProt top hitse value%identityAlignment
Q66LG9 Centromere protein C1.1e-5931.91Show/hide
Query:  DPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKVEENLQERRPALNRK
        DPL AYSG++LFP    +L +P  P     DL   H  L+SM     S+  EQA++IL+                 + + +  +    N +ERRP L+RK
Subjt:  DPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKVEENLQERRPALNRK

Query:  RARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSIATEDDQNVDPSQV
        R  FSL     QPP  + P+FD  +    E+FF AY+K E A +E QKQTG+ + D+ +  PS   R RRPGI GR  R                 P + 
Subjt:  RARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSIATEDDQNVDPSQV

Query:  TFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIKPLTLEKLCLPDLEA
        +F    F+ + +  E       I SE+  +   A      + E+  S    +  +N++L + L+ + E+LEGD AI +L+ERLQIK   +EK  +P+ + 
Subjt:  TFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIKPLTLEKLCLPDLEA

Query:  IPTMNLKSSRSN-LSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGI------DQSPSR---DPYLF
        +  MNLK+S SN  +++SL  + N      ILK         N V+   +  SP        +  S  N   D FS   I      DQ PS     P   
Subjt:  IPTMNLKSSRSN-LSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGI------DQSPSR---DPYLF

Query:  ELGNHLSDAVGNTEQSSV--SKLKPLLTRDGGTVANGIKPSKILSGD--------DSMSNISSSNI-LNVP-QVGGNTALSGTYASTEAKNVSVSSTDVE
        ++ N     VG  + +S     +      D   + +GI  S  LS D        DS+SN SS+ +  NV  +  G         S   +N      D E
Subjt:  ELGNHLSDAVGNTEQSSV--SKLKPLLTRDGGTVANGIKPSKILSGD--------DSMSNISSSNI-LNVP-QVGGNTALSGTYASTEAKNVSVSSTDVE

Query:  INEKLSCLEAQAD----------AVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEME--DHEGSASEQP
        INE+   LE  A+           V    I   +G++S+ P  +  +        +            EN+  GS+     E+  E+    H+ +   + 
Subjt:  INEKLSCLEAQAD----------AVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEME--DHEGSASEQP

Query:  KSSKVDVIKEYPVAIQSQL--DQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGE
        + S    +K+    +  +   D+   T   E+ A   ++      ++ E+ KPK      H+GK  S R+SLA AGT  + GVRRSTR K+RPLEYW+GE
Subjt:  KSSKVDVIKEYPVAIQSQL--DQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGE

Query:  RLLYGRVHESLTTVIGLKYVSPAKG-NGKPTMKVKSLVSNEYKDLVELAALH
        R LYGR+HESLTTVIG+KY SP +G       KVKS VS+EYK LV+ AALH
Subjt:  RLLYGRVHESLTTVIGLKYVSPAKG-NGKPTMKVKSLVSNEYKDLVELAALH

Arabidopsis top hitse value%identityAlignment
AT1G15660.1 centromere protein C7.5e-6131.91Show/hide
Query:  DPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKVEENLQERRPALNRK
        DPL AYSG++LFP    +L +P  P     DL   H  L+SM     S+  EQA++IL+                 + + +  +    N +ERRP L+RK
Subjt:  DPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKVEENLQERRPALNRK

Query:  RARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSIATEDDQNVDPSQV
        R  FSL     QPP  + P+FD  +    E+FF AY+K E A +E QKQTG+ + D+ +  PS   R RRPGI GR  R                 P + 
Subjt:  RARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSIATEDDQNVDPSQV

Query:  TFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIKPLTLEKLCLPDLEA
        +F    F+ + +  E       I SE+  +   A      + E+  S    +  +N++L + L+ + E+LEGD AI +L+ERLQIK   +EK  +P+ + 
Subjt:  TFDSGIFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIKPLTLEKLCLPDLEA

Query:  IPTMNLKSSRSN-LSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGI------DQSPSR---DPYLF
        +  MNLK+S SN  +++SL  + N      ILK         N V+   +  SP        +  S  N   D FS   I      DQ PS     P   
Subjt:  IPTMNLKSSRSN-LSKRSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGI------DQSPSR---DPYLF

Query:  ELGNHLSDAVGNTEQSSV--SKLKPLLTRDGGTVANGIKPSKILSGD--------DSMSNISSSNI-LNVP-QVGGNTALSGTYASTEAKNVSVSSTDVE
        ++ N     VG  + +S     +      D   + +GI  S  LS D        DS+SN SS+ +  NV  +  G         S   +N      D E
Subjt:  ELGNHLSDAVGNTEQSSV--SKLKPLLTRDGGTVANGIKPSKILSGD--------DSMSNISSSNI-LNVP-QVGGNTALSGTYASTEAKNVSVSSTDVE

Query:  INEKLSCLEAQAD----------AVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEME--DHEGSASEQP
        INE+   LE  A+           V    I   +G++S+ P  +  +        +            EN+  GS+     E+  E+    H+ +   + 
Subjt:  INEKLSCLEAQAD----------AVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATCTENIVDGSSRSSGTEHRDEME--DHEGSASEQP

Query:  KSSKVDVIKEYPVAIQSQL--DQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGE
        + S    +K+    +  +   D+   T   E+ A   ++      ++ E+ KPK      H+GK  S R+SLA AGT  + GVRRSTR K+RPLEYW+GE
Subjt:  KSSKVDVIKEYPVAIQSQL--DQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGE

Query:  RLLYGRVHESLTTVIGLKYVSPAKG-NGKPTMKVKSLVSNEYKDLVELAALH
        R LYGR+HESLTTVIG+KY SP +G       KVKS VS+EYK LV+ AALH
Subjt:  RLLYGRVHESLTTVIGLKYVSPAKG-NGKPTMKVKSLVSNEYKDLVELAALH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAACAATGGCGAACGAAGAAGCTCGACACTCCGATGTTATCGATCCTCTTGCTGCTTATTCTGGTATCAATCTTTTTCCGACCGCATTTGGTACTTTGCCGGATCC
GTCAAAGCCACATGATCTTGGAACAGACCTCGACGGCATCCACAAGCGCCTCAAATCCATGGTGTTAAGGAGTCCCAGTAAACTATTAGAACAGGCCAGATCAATTTTAG
ATGGCAACTCAAATTCGATGATATCTGAAGCTGCCACATTTCTTGTGAAGAATGAGAAAAATGAGGAAGCTACAGTGAAGGTAGAGGAAAATCTTCAAGAAAGAAGGCCG
GCCTTAAACCGAAAGCGGGCTAGGTTTTCTTTAAAACCCGATGCTAGGCAACCTCCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGACCCCGAGGAGTT
CTTTTTGGCCTATGAAAAGCATGAAAATGCCAAAAAAGAAATCCAGAAGCAGACGGGAGCAGTTTTAAAGGACTTGAACCAACAAAATCCGTCGACGAATACACGCCAGC
GTAGACCGGGGATTCTTGGAAGATCTGTTAGATACAAGCATCAATATTCATCAATAGCAACTGAAGATGATCAGAATGTAGATCCTTCTCAAGTGACATTTGATTCAGGC
ATTTTCAGTCCATTGAAATTGGGCACAGAAACACACCCAAGTCCACATATAATTGACTCAGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGGAGGAGGAGGAGGAGGA
GGAGCTCGTTGCTTCAGCTACGAAGGCAGAGAACAGAATAAATGATATTTTGAATGAATTTCTCTCTGGTAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATAT
TACAGGAGCGCTTGCAGATTAAACCTCTTACTTTAGAGAAATTATGCCTTCCAGATTTAGAAGCCATTCCAACAATGAATTTGAAATCTTCAAGAAGCAATCTATCAAAG
CGTAGTTTGATCAGTGTGGACAATCAGTTACAAAAGATAGAAATTTTGAAATCTAAGCAGGACAATGTAAATTTGGTTAATCCTGTTTCTACACCATCATCAATGAGAAG
TCCATTGGCATCGTTATCAGCACTAAATAGACGGATTTCACTTTCAAATTCATCAAGTGATTCATTTTCAGCTCATGGCATTGACCAATCTCCATCAAGAGATCCTTACC
TTTTTGAACTCGGTAATCACTTATCTGATGCAGTTGGTAATACAGAGCAGTCAAGCGTTTCTAAGTTGAAGCCACTTTTAACCAGAGATGGTGGGACTGTAGCAAATGGA
ATTAAACCATCCAAAATTCTTTCTGGAGATGATTCCATGTCTAATATATCTTCAAGTAATATTTTAAATGTACCCCAAGTTGGGGGCAATACTGCTTTAAGTGGAACTTA
TGCCAGCACGGAGGCTAAAAATGTTAGTGTCAGCAGCACAGACGTGGAAATAAATGAGAAATTGAGTTGTCTTGAAGCCCAAGCAGATGCGGTGGCTAATATGCAGATTG
AAGATCACGAAGGATCAGCTTCTGAGCAACCAAAATTATCTGAGGTGGATCTAATCAAAGAGTACCCGGTTGGCATTCGGAGTCAGTTGGATCAATCAGCTGCTACTTGT
ACTGAAAATATTGTTGATGGGTCATCTAGAAGCAGTGGTACAGAACACCGCGATGAGATGGAAGATCATGAAGGATCAGCTTCTGAGCAACCAAAGTCATCTAAGGTGGA
TGTGATTAAAGAGTACCCAGTAGCCATTCAGAGTCAGTTGGATCAATCAACTACTACTACTTGTGCTGAAAATATTGCCGATGGGGCATCTAGAAGCAGTGGAACGGATC
ACCATGATGAGGAACAGGTCAAGCCAAAATCTCGTGCAAACAAACAACACAAAGGCAAAAAGATTTCTCGGAGGCAAAGCCTTGCAGGTGCTGGTACAACGTGGCAAAGT
GGGGTGAGAAGAAGTACCAGGTTCAAAACACGACCCTTGGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGTGTACATGAGAGCCTGACGACAGTAATCGGGTTGAA
GTATGTGTCTCCAGCAAAAGGAAATGGCAAACCAACCATGAAGGTGAAGTCTCTAGTCTCCAATGAGTACAAAGATCTCGTCGAGTTAGCAGCCCTTCACTGA
mRNA sequenceShow/hide mRNA sequence
GTTTTCGCGCTTTGGTTCACTCTTTCGAATTTGGAGTTGTTCTAGGGTTCGGTTCAAACGTTGGAGATGATAACAATGGCGAACGAAGAAGCTCGACACTCCGATGTTAT
CGATCCTCTTGCTGCTTATTCTGGTATCAATCTTTTTCCGACCGCATTTGGTACTTTGCCGGATCCGTCAAAGCCACATGATCTTGGAACAGACCTCGACGGCATCCACA
AGCGCCTCAAATCCATGGTGTTAAGGAGTCCCAGTAAACTATTAGAACAGGCCAGATCAATTTTAGATGGCAACTCAAATTCGATGATATCTGAAGCTGCCACATTTCTT
GTGAAGAATGAGAAAAATGAGGAAGCTACAGTGAAGGTAGAGGAAAATCTTCAAGAAAGAAGGCCGGCCTTAAACCGAAAGCGGGCTAGGTTTTCTTTAAAACCCGATGC
TAGGCAACCTCCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGACCCCGAGGAGTTCTTTTTGGCCTATGAAAAGCATGAAAATGCCAAAAAAGAAATCC
AGAAGCAGACGGGAGCAGTTTTAAAGGACTTGAACCAACAAAATCCGTCGACGAATACACGCCAGCGTAGACCGGGGATTCTTGGAAGATCTGTTAGATACAAGCATCAA
TATTCATCAATAGCAACTGAAGATGATCAGAATGTAGATCCTTCTCAAGTGACATTTGATTCAGGCATTTTCAGTCCATTGAAATTGGGCACAGAAACACACCCAAGTCC
ACATATAATTGACTCAGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGGAGCTCGTTGCTTCAGCTACGAAGGCAGAGAACAGAATAAATG
ATATTTTGAATGAATTTCTCTCTGGTAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGGAGCGCTTGCAGATTAAACCTCTTACTTTAGAGAAATTA
TGCCTTCCAGATTTAGAAGCCATTCCAACAATGAATTTGAAATCTTCAAGAAGCAATCTATCAAAGCGTAGTTTGATCAGTGTGGACAATCAGTTACAAAAGATAGAAAT
TTTGAAATCTAAGCAGGACAATGTAAATTTGGTTAATCCTGTTTCTACACCATCATCAATGAGAAGTCCATTGGCATCGTTATCAGCACTAAATAGACGGATTTCACTTT
CAAATTCATCAAGTGATTCATTTTCAGCTCATGGCATTGACCAATCTCCATCAAGAGATCCTTACCTTTTTGAACTCGGTAATCACTTATCTGATGCAGTTGGTAATACA
GAGCAGTCAAGCGTTTCTAAGTTGAAGCCACTTTTAACCAGAGATGGTGGGACTGTAGCAAATGGAATTAAACCATCCAAAATTCTTTCTGGAGATGATTCCATGTCTAA
TATATCTTCAAGTAATATTTTAAATGTACCCCAAGTTGGGGGCAATACTGCTTTAAGTGGAACTTATGCCAGCACGGAGGCTAAAAATGTTAGTGTCAGCAGCACAGACG
TGGAAATAAATGAGAAATTGAGTTGTCTTGAAGCCCAAGCAGATGCGGTGGCTAATATGCAGATTGAAGATCACGAAGGATCAGCTTCTGAGCAACCAAAATTATCTGAG
GTGGATCTAATCAAAGAGTACCCGGTTGGCATTCGGAGTCAGTTGGATCAATCAGCTGCTACTTGTACTGAAAATATTGTTGATGGGTCATCTAGAAGCAGTGGTACAGA
ACACCGCGATGAGATGGAAGATCATGAAGGATCAGCTTCTGAGCAACCAAAGTCATCTAAGGTGGATGTGATTAAAGAGTACCCAGTAGCCATTCAGAGTCAGTTGGATC
AATCAACTACTACTACTTGTGCTGAAAATATTGCCGATGGGGCATCTAGAAGCAGTGGAACGGATCACCATGATGAGGAACAGGTCAAGCCAAAATCTCGTGCAAACAAA
CAACACAAAGGCAAAAAGATTTCTCGGAGGCAAAGCCTTGCAGGTGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACCAGGTTCAAAACACGACCCTTGGAGTA
CTGGAAAGGTGAAAGGCTGTTGTACGGACGTGTACATGAGAGCCTGACGACAGTAATCGGGTTGAAGTATGTGTCTCCAGCAAAAGGAAATGGCAAACCAACCATGAAGG
TGAAGTCTCTAGTCTCCAATGAGTACAAAGATCTCGTCGAGTTAGCAGCCCTTCACTGAGAGTCATCTACTAAAAGGAACAAAAAGCCTTGAAGCTTCTTAGATTTTGCA
TGTATAACAACAAGCAATTCCCTTTGAATACAAACAACATCCAGTCTCTTTGTAAAAACTGTAGAGGAGAATTAGGCTTATGACATTGCATTGTATATTTCTAAGCCTTT
TCTCTATCATATATATATCCATCAAGCTGTTTCGCTTGTGTATTTGAGCTCATGTACTTTGTCATAAGATTTCATATTTTACCCATTGACAACTAGCTTTCTGTACCAAT
TTTCT
Protein sequenceShow/hide protein sequence
MITMANEEARHSDVIDPLAAYSGINLFPTAFGTLPDPSKPHDLGTDLDGIHKRLKSMVLRSPSKLLEQARSILDGNSNSMISEAATFLVKNEKNEEATVKVEENLQERRP
ALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYEKHENAKKEIQKQTGAVLKDLNQQNPSTNTRQRRPGILGRSVRYKHQYSSIATEDDQNVDPSQVTFDSG
IFSPLKLGTETHPSPHIIDSEKKTDEDVAFEEEEEEEELVASATKAENRINDILNEFLSGNCEDLEGDRAINILQERLQIKPLTLEKLCLPDLEAIPTMNLKSSRSNLSK
RSLISVDNQLQKIEILKSKQDNVNLVNPVSTPSSMRSPLASLSALNRRISLSNSSSDSFSAHGIDQSPSRDPYLFELGNHLSDAVGNTEQSSVSKLKPLLTRDGGTVANG
IKPSKILSGDDSMSNISSSNILNVPQVGGNTALSGTYASTEAKNVSVSSTDVEINEKLSCLEAQADAVANMQIEDHEGSASEQPKLSEVDLIKEYPVGIRSQLDQSAATC
TENIVDGSSRSSGTEHRDEMEDHEGSASEQPKSSKVDVIKEYPVAIQSQLDQSTTTTCAENIADGASRSSGTDHHDEEQVKPKSRANKQHKGKKISRRQSLAGAGTTWQS
GVRRSTRFKTRPLEYWKGERLLYGRVHESLTTVIGLKYVSPAKGNGKPTMKVKSLVSNEYKDLVELAALH