; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg09549 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg09549
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptioncentromere protein C isoform X1
Genome locationCarg_Chr17:6780295..6787091
RNA-Seq ExpressionCarg09549
SyntenyCarg09549
Gene Ontology termsGO:0051382 - kinetochore assembly (biological process)
GO:0000776 - kinetochore (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0019237 - centromeric DNA binding (molecular function)
InterPro domainsIPR028386 - Centromere protein C/Mif2/cnp3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575561.1 Centromere protein C, partial [Cucurbita argyrosperma subsp. sororia]9.1e-30679.84Show/hide
Query:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
        MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
Subjt:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE

Query:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
        NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLE A +       E+   T A  K L                        
Subjt:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR

Query:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
                    +  S     RR  I                                                RSVRYKHQYSSITSEDDQNVEPSQVT
Subjt:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT

Query:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPL-EEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ
        FESGSISPSILGTEKDASPPIICSEMKTNEEVPL EEEEEAFV      ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ
Subjt:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPL-EEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ

Query:  TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI
        TMNLRSSRGNLPERSLI+VDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSP                                
Subjt:  TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI

Query:  AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD
         EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD
Subjt:  AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD

Query:  EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR
        EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR
Subjt:  EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR

Query:  PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
        PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
Subjt:  PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH

KAG7014102.1 hypothetical protein SDJN02_24275, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
        MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
Subjt:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE

Query:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
        NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
Subjt:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR

Query:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
        FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
Subjt:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT

Query:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFVENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQTMNLRSS
        FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFVENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQTMNLRSS
Subjt:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFVENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQTMNLRSS

Query:  RGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIAEKLGVS
        RGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIAEKLGVS
Subjt:  RGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIAEKLGVS

Query:  RLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDEMEDHEG
        RLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDEMEDHEG
Subjt:  RLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDEMEDHEG

Query:  STSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRPLEYWKG
        STSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRPLEYWKG
Subjt:  STSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRPLEYWKG

Query:  ERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
        ERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
Subjt:  ERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH

XP_022953572.1 centromere protein C isoform X1 [Cucurbita moschata]0.0e+0083.25Show/hide
Query:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
        MVNE+ARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
Subjt:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE

Query:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
        NPQERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYERLE A +       E+   T A  K L                        
Subjt:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR

Query:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
                    +  S     RR  I                                                RSVRYKHQYSSITSEDDQNVEPSQVT
Subjt:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT

Query:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT
        FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV      ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT
Subjt:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT

Query:  MNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIA
         NLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPIST FSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIA
Subjt:  MNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIA

Query:  EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE
        EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSET ANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE
Subjt:  EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE

Query:  MEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP
        MEDHEGSTSEQPNTSKVD I+EYPIGIQTQLDQSIATCTENIVD PSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP
Subjt:  MEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP

Query:  LEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
        LEYWKGERLLYGRVHESLATVIGLKYVSP KGNGQPTLKVKSLVSSEYNELVELAALH
Subjt:  LEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH

XP_022992183.1 centromere protein C-like isoform X1 [Cucurbita maxima]0.0e+0082.48Show/hide
Query:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
        MVNE+ARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
Subjt:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE

Query:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
        NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLE A +       E+   T A  K L                        
Subjt:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR

Query:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
                    +  S     RR  I                                                RSVRYKHQYSSITSEDDQ VEPSQVT
Subjt:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT

Query:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPL-EEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ
        FESGSISPS LGTEKDASPPIICSEMKTNEEVP  EEEEEAFV      ENKVNKILDELLSANCEDLEGD+AINKLQECLQIKPINLEKLCLPDLEAIQ
Subjt:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPL-EEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ

Query:  TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI
        TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPIST FSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI
Subjt:  TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI

Query:  AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD
        AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDV+SISKISSSNVLNVPQAGA+AALSETHANMEAKDISGSS EVEVNEKLSFLEAQADAVAATNVLDD
Subjt:  AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD

Query:  EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR
        EMEDHEGSTSEQPNTSKVD I+EYPIGIQT LDQS ATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR
Subjt:  EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR

Query:  PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
        PLEYWKGERLLYGRVHESLATVIGLKYVSP KGNGQPTLKVKSLVSSEYNELVELAALH
Subjt:  PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH

XP_023548004.1 centromere protein C-like isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0082.87Show/hide
Query:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
        MVNE+ARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHD GTDLDGIHKHLKSMVSRNPSKLIEQARSILN NSNLMQSKAAT LVKNEKKEEAAANVEE
Subjt:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE

Query:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
        NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLE A +       E+   T A  K L                        
Subjt:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR

Query:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
                    +  S     RR  I                                                RSVRYKHQYSSITSEDDQNVEPSQVT
Subjt:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT

Query:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPL-EEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ
        FESGSISPSILGTEKDASPPIICSEMKTNEEVPL EEEEEAFV      ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ
Subjt:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPL-EEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ

Query:  TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI
        TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPIST FSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI
Subjt:  TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI

Query:  AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD
        AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD
Subjt:  AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD

Query:  EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR
        EMEDHEGSTSEQPNTSKVD I+EYP+G+QTQLDQS ATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR
Subjt:  EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR

Query:  PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
        PLEYWKGERLLYGRVHESLATVIGLKYVSP KGNGQPTLKVKSLVSSEYNELVELAALH
Subjt:  PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH

TrEMBL top hitse value%identityAlignment
A0A6J1GNL2 centromere protein C isoform X10.0e+0083.25Show/hide
Query:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
        MVNE+ARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
Subjt:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE

Query:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
        NPQERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYERLE A +       E+   T A  K L                        
Subjt:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR

Query:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
                    +  S     RR  I                                                RSVRYKHQYSSITSEDDQNVEPSQVT
Subjt:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT

Query:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT
        FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV      ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT
Subjt:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT

Query:  MNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIA
         NLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPIST FSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIA
Subjt:  MNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIA

Query:  EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE
        EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSET ANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE
Subjt:  EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE

Query:  MEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP
        MEDHEGSTSEQPNTSKVD I+EYPIGIQTQLDQSIATCTENIVD PSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP
Subjt:  MEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP

Query:  LEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
        LEYWKGERLLYGRVHESLATVIGLKYVSP KGNGQPTLKVKSLVSSEYNELVELAALH
Subjt:  LEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH

A0A6J1GNP2 centromere protein C isoform X23.2e-30479.42Show/hide
Query:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
        MVNE+ARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
Subjt:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE

Query:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
        NPQERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYERLE A +       E+   T A  K L                        
Subjt:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR

Query:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
                    +  S     RR  I                                                RSVRYKHQYSSITSEDDQNVEPSQVT
Subjt:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT

Query:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT
        FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV      ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT
Subjt:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT

Query:  MNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIA
         NLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPIST FSMRSPLASLSALTRRISLSNSP                             VGIA
Subjt:  MNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIA

Query:  EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE
        EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSET ANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE
Subjt:  EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE

Query:  MEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP
        MEDHEGSTSEQPNTSKVD I+EYPIGIQTQLDQSIATCTENIVD PSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP
Subjt:  MEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP

Query:  LEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
        LEYWKGERLLYGRVHESLATVIGLKYVSP KGNGQPTLKVKSLVSSEYNELVELAALH
Subjt:  LEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH

A0A6J1GQ29 centromere protein C isoform X31.1e-30178.89Show/hide
Query:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
        MVNE+ARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
Subjt:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE

Query:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
        NPQERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYERLE A +       E+   T A  K L                        
Subjt:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR

Query:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
                    +  S     RR  I                                                RSVRYKHQYSSITSEDDQNVEPSQVT
Subjt:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT

Query:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT
        FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV      ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT
Subjt:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQT

Query:  MNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIA
         NLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPIST FSMRSPLASLSALTRRISLSNSP                                 
Subjt:  MNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIA

Query:  EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE
        EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSET ANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE
Subjt:  EKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDE

Query:  MEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP
        MEDHEGSTSEQPNTSKVD I+EYPIGIQTQLDQSIATCTENIVD PSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP
Subjt:  MEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRP

Query:  LEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
        LEYWKGERLLYGRVHESLATVIGLKYVSP KGNGQPTLKVKSLVSSEYNELVELAALH
Subjt:  LEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH

A0A6J1JWV5 centromere protein C-like isoform X24.3e-30178.66Show/hide
Query:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
        MVNE+ARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
Subjt:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE

Query:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
        NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLE A +       E+   T A  K L                        
Subjt:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR

Query:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
                    +  S     RR  I                                                RSVRYKHQYSSITSEDDQ VEPSQVT
Subjt:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT

Query:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPL-EEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ
        FESGSISPS LGTEKDASPPIICSEMKTNEEVP  EEEEEAFV      ENKVNKILDELLSANCEDLEGD+AINKLQECLQIKPINLEKLCLPDLEAIQ
Subjt:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPL-EEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ

Query:  TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI
        TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPIST FSMRSPLASLSALTRRISLSNSP                             VGI
Subjt:  TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI

Query:  AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD
        AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDV+SISKISSSNVLNVPQAGA+AALSETHANMEAKDISGSS EVEVNEKLSFLEAQADAVAATNVLDD
Subjt:  AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD

Query:  EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR
        EMEDHEGSTSEQPNTSKVD I+EYPIGIQT LDQS ATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR
Subjt:  EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR

Query:  PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
        PLEYWKGERLLYGRVHESLATVIGLKYVSP KGNGQPTLKVKSLVSSEYNELVELAALH
Subjt:  PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH

A0A6J1JYG6 centromere protein C-like isoform X10.0e+0082.48Show/hide
Query:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
        MVNE+ARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE
Subjt:  MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEE

Query:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR
        NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLE A +       E+   T A  K L                        
Subjt:  NPQERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVAR

Query:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT
                    +  S     RR  I                                                RSVRYKHQYSSITSEDDQ VEPSQVT
Subjt:  FTYGCRCLLFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVT

Query:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPL-EEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ
        FESGSISPS LGTEKDASPPIICSEMKTNEEVP  EEEEEAFV      ENKVNKILDELLSANCEDLEGD+AINKLQECLQIKPINLEKLCLPDLEAIQ
Subjt:  FESGSISPSILGTEKDASPPIICSEMKTNEEVPL-EEEEEAFV------ENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQ

Query:  TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI
        TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPIST FSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI
Subjt:  TMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGI

Query:  AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD
        AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDV+SISKISSSNVLNVPQAGA+AALSETHANMEAKDISGSS EVEVNEKLSFLEAQADAVAATNVLDD
Subjt:  AEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDD

Query:  EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR
        EMEDHEGSTSEQPNTSKVD I+EYPIGIQT LDQS ATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR
Subjt:  EMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGNQREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTR

Query:  PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH
        PLEYWKGERLLYGRVHESLATVIGLKYVSP KGNGQPTLKVKSLVSSEYNELVELAALH
Subjt:  PLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH

SwissProt top hitse value%identityAlignment
Q66LG9 Centromere protein C5.4e-4326.99Show/hide
Query:  DPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEENP----QERRPA
        DPL AYSG+SLFP    +L  P  P     DL   H  L+SM     S+  EQA++IL                     E+   +V+ NP    +ERRP 
Subjt:  DPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEENP----QERRPA

Query:  LNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVARFTYGCRCL
        L+RKR  FSL     QPP  + P+FD  +    E+FF AY++ E A R          +      +  P    P   G P                    
Subjt:  LNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVARFTYGCRCL

Query:  LFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDD-QNVEPSQVTFESGSIS
                    GR+   ++ SF             F++ +                    L AS++ +        I SE   ++   + VT     + 
Subjt:  LFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDD-QNVEPSQVTFESGSIS

Query:  PSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFVENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQTMNLRSSRGNLPER
         S + T+KD                             +N +L +LL+ + E+LEGD AI  L+E LQIK  N+EK  +P+ + ++ MNL++S  N P R
Subjt:  PSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFVENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQTMNLRSSRGNLPER

Query:  SLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFS--------AHDLDQSSARNPSLFELSNHLSDAVGIAEKLGV
          +S    + +  N  + + + +S +P  T     SP   +   +     +  PGD           A D+  +S  N    ++++  +D+V   ++ G 
Subjt:  SLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFS--------AHDLDQSSARNPSLFELSNHLSDAVGIAEKLGV

Query:  SRLMSLLTKDDGTVAKGIK--------SPKILLGDVDSISKISSS----NV-LNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVA
                +DD  +  GI         +P I +  +DSIS  SS+    NV +       +  +SE+ AN    D      + E+NE+   LE  A+  +
Subjt:  SRLMSLLTKDDGTVAKGIK--------SPKILLGDVDSISKISSS----NV-LNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVA

Query:  AT-----NVLDDEMEDHEGSTSEQPNTS----------------------KVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQK----
                V +D +   +G++S+ PN +                      + +V      G+Q +    +   +    +   +   +D++ K + K    
Subjt:  AT-----NVLDDEMEDHEGSTSEQPNTS----------------------KVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQK----

Query:  --------------SRAGNQ------------------REGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
                      SRA  Q                   EGK  S RKSLA AGT  +GGVRRSTR K+RPLEYW+GER LYGR+HESL TVIG+KY SP
Subjt:  --------------SRAGNQ------------------REGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  GKG-NGQPTLKVKSLVSSEYNELVELAALH
        G+G       KVKS VS EY +LV+ AALH
Subjt:  GKG-NGQPTLKVKSLVSSEYNELVELAALH

Arabidopsis top hitse value%identityAlignment
AT1G15660.1 centromere protein C3.8e-4426.99Show/hide
Query:  DPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEENP----QERRPA
        DPL AYSG+SLFP    +L  P  P     DL   H  L+SM     S+  EQA++IL                     E+   +V+ NP    +ERRP 
Subjt:  DPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEENP----QERRPA

Query:  LNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVARFTYGCRCL
        L+RKR  FSL     QPP  + P+FD  +    E+FF AY++ E A R          +      +  P    P   G P                    
Subjt:  LNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVARFTYGCRCL

Query:  LFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDD-QNVEPSQVTFESGSIS
                    GR+   ++ SF             F++ +                    L AS++ +        I SE   ++   + VT     + 
Subjt:  LFYARYASVRNYGRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDD-QNVEPSQVTFESGSIS

Query:  PSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFVENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQTMNLRSSRGNLPER
         S + T+KD                             +N +L +LL+ + E+LEGD AI  L+E LQIK  N+EK  +P+ + ++ MNL++S  N P R
Subjt:  PSILGTEKDASPPIICSEMKTNEEVPLEEEEEAFVENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQTMNLRSSRGNLPER

Query:  SLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFS--------AHDLDQSSARNPSLFELSNHLSDAVGIAEKLGV
          +S    + +  N  + + + +S +P  T     SP   +   +     +  PGD           A D+  +S  N    ++++  +D+V   ++ G 
Subjt:  SLISVDSQLQRIENLKSKQDDENSVNPISTAFSMRSPLASLSALTRRISLSNSPGDPFS--------AHDLDQSSARNPSLFELSNHLSDAVGIAEKLGV

Query:  SRLMSLLTKDDGTVAKGIK--------SPKILLGDVDSISKISSS----NV-LNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVA
                +DD  +  GI         +P I +  +DSIS  SS+    NV +       +  +SE+ AN    D      + E+NE+   LE  A+  +
Subjt:  SRLMSLLTKDDGTVAKGIK--------SPKILLGDVDSISKISSS----NV-LNVPQAGAEAALSETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVA

Query:  AT-----NVLDDEMEDHEGSTSEQPNTS----------------------KVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQK----
                V +D +   +G++S+ PN +                      + +V      G+Q +    +   +    +   +   +D++ K + K    
Subjt:  AT-----NVLDDEMEDHEGSTSEQPNTS----------------------KVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQK----

Query:  --------------SRAGNQ------------------REGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
                      SRA  Q                   EGK  S RKSLA AGT  +GGVRRSTR K+RPLEYW+GER LYGR+HESL TVIG+KY SP
Subjt:  --------------SRAGNQ------------------REGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  GKG-NGQPTLKVKSLVSSEYNELVELAALH
        G+G       KVKS VS EY +LV+ AALH
Subjt:  GKG-NGQPTLKVKSLVSSEYNELVELAALH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAACGAAGATGCTCGACACTCCGATGTGATTGATCCTCTTGCTGCTTATTCTGGAATTAGTCTCTTTCCGAGCGCATTTGGCACTTTGCCGGTTCCGTCGAAGCC
ACATGATATTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTCTCAAGAAATCCCAGTAAACTTATAGAGCAGGCTAGATCGATTTTGAACGGTAACT
CTAATTTGATGCAATCCAAAGCTGCCACATTTCTCGTAAAGAATGAGAAAAAGGAGGAAGCTGCAGCAAACGTGGAGGAAAATCCACAAGAAAGAAGGCCAGCCTTAAAC
CGAAAACGGGCCAGATTTTCTTTGAAGCCTGATGCTAGGCAACCTCCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGACCCTGAGGAGTTCTTTTTGGC
GTACGAAAGGCTTGAAACTGCAACACGCATTCCGGATCATGCTGTATTCGAGATGTGTCTTCATACATATGCTTGTGAGAAGACTCTACCTGTCCGCGCGGCCCCGTATA
GTTCCGGGCCCCCAATCTGGATAGGAATCACTAGCAATGCTGTGGCCAGATTCACTTATGGGTGTAGATGCTTGTTATTCTACGCCCGATACGCTTCTGTTAGGAATTAT
GGTAGACGTATGAGCATTTATGAACTTAGCTTTCGAGTTCGAACGCTGTCCGTTCTAATGGCTAGTCAGTGTTTTTCAGAAACCGTTGGTGCGCACGGCCATATTCTCTT
ATCGTCTATGAAAGTGTTAACGCCTAAGAACCTTACTCTCTGCGCCTCCCAGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACATCTGAAGATGATCAGAATG
TAGAACCCTCTCAAGTGACATTTGAATCAGGTAGTATCAGTCCATCGATATTGGGCACAGAAAAAGATGCAAGTCCACCTATAATTTGCTCAGAAATGAAAACTAATGAA
GAGGTACCCCTTGAGGAGGAGGAGGAGGCGTTTGTTGAGAACAAAGTGAATAAAATTTTGGATGAATTACTCTCTGCTAATTGTGAAGATCTAGAAGGTGATCGAGCCAT
CAACAAATTACAGGAGTGTTTGCAGATTAAACCAATTAATTTAGAGAAATTATGCCTTCCTGATTTAGAAGCCATTCAAACAATGAATCTGAGATCTTCAAGGGGTAATC
TACCTGAGCGTAGTTTGATCAGTGTGGACAGTCAGTTACAAAGGATAGAAAATTTGAAATCTAAGCAGGATGATGAAAATTCGGTTAATCCAATTTCTACAGCATTCTCA
ATGAGAAGTCCGTTGGCTTCATTATCAGCCCTAACTAGAAGAATTTCGCTTTCAAATTCACCAGGTGATCCATTTTCAGCTCATGACCTTGACCAATCATCAGCAAGAAA
TCCTTCCCTTTTTGAACTCAGTAATCACTTGTCTGATGCAGTTGGTATTGCAGAGAAGTTGGGTGTTTCTAGATTGATGTCACTTTTAACCAAGGATGACGGGACTGTAG
CTAAGGGAATTAAGTCACCCAAAATTCTTCTTGGGGATGTTGATTCCATATCTAAAATATCTTCAAGTAATGTTTTAAATGTACCCCAAGCTGGTGCCGAAGCTGCCTTA
AGTGAAACTCATGCCAACATGGAAGCTAAGGATATAAGTGGCAGCAGCACAGAAGTGGAAGTGAATGAGAAATTGAGTTTTCTTGAAGCCCAAGCAGATGCTGTGGCTGC
AACTAATGTTTTGGATGATGAGATGGAAGATCACGAAGGATCCACTTCTGAGCAACCAAACACATCCAAGGTGGATGTGATCGAAGAGTACCCGATTGGCATTCAGACTC
AGCTGGATCAATCAATTGCTACATGTACTGAGAATATTGTCGATGGGCCATCAAGAAGCAGTGGAACAGATAACCACGATAAGGTCAAGCAAAAATCTCGTGCAGGCAAT
CAACGCGAAGGCAAAAGGGTGTCTGGGAGGAAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAGGCGGGGTGAGACGAAGTACCAGGTTCAAAACCCGACCGTTGGAGTA
CTGGAAAGGTGAACGTCTGTTGTACGGACGAGTACATGAGAGCCTGGCAACGGTAATTGGGTTGAAGTATGTGTCTCCTGGTAAAGGTAATGGCCAACCAACTCTGAAGG
TGAAGTCTTTGGTCTCCAGTGAGTACAACGAACTTGTTGAGTTAGCAGCTCTGCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGAACGAAGATGCTCGACACTCCGATGTGATTGATCCTCTTGCTGCTTATTCTGGAATTAGTCTCTTTCCGAGCGCATTTGGCACTTTGCCGGTTCCGTCGAAGCC
ACATGATATTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTCTCAAGAAATCCCAGTAAACTTATAGAGCAGGCTAGATCGATTTTGAACGGTAACT
CTAATTTGATGCAATCCAAAGCTGCCACATTTCTCGTAAAGAATGAGAAAAAGGAGGAAGCTGCAGCAAACGTGGAGGAAAATCCACAAGAAAGAAGGCCAGCCTTAAAC
CGAAAACGGGCCAGATTTTCTTTGAAGCCTGATGCTAGGCAACCTCCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGACCCTGAGGAGTTCTTTTTGGC
GTACGAAAGGCTTGAAACTGCAACACGCATTCCGGATCATGCTGTATTCGAGATGTGTCTTCATACATATGCTTGTGAGAAGACTCTACCTGTCCGCGCGGCCCCGTATA
GTTCCGGGCCCCCAATCTGGATAGGAATCACTAGCAATGCTGTGGCCAGATTCACTTATGGGTGTAGATGCTTGTTATTCTACGCCCGATACGCTTCTGTTAGGAATTAT
GGTAGACGTATGAGCATTTATGAACTTAGCTTTCGAGTTCGAACGCTGTCCGTTCTAATGGCTAGTCAGTGTTTTTCAGAAACCGTTGGTGCGCACGGCCATATTCTCTT
ATCGTCTATGAAAGTGTTAACGCCTAAGAACCTTACTCTCTGCGCCTCCCAGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACATCTGAAGATGATCAGAATG
TAGAACCCTCTCAAGTGACATTTGAATCAGGTAGTATCAGTCCATCGATATTGGGCACAGAAAAAGATGCAAGTCCACCTATAATTTGCTCAGAAATGAAAACTAATGAA
GAGGTACCCCTTGAGGAGGAGGAGGAGGCGTTTGTTGAGAACAAAGTGAATAAAATTTTGGATGAATTACTCTCTGCTAATTGTGAAGATCTAGAAGGTGATCGAGCCAT
CAACAAATTACAGGAGTGTTTGCAGATTAAACCAATTAATTTAGAGAAATTATGCCTTCCTGATTTAGAAGCCATTCAAACAATGAATCTGAGATCTTCAAGGGGTAATC
TACCTGAGCGTAGTTTGATCAGTGTGGACAGTCAGTTACAAAGGATAGAAAATTTGAAATCTAAGCAGGATGATGAAAATTCGGTTAATCCAATTTCTACAGCATTCTCA
ATGAGAAGTCCGTTGGCTTCATTATCAGCCCTAACTAGAAGAATTTCGCTTTCAAATTCACCAGGTGATCCATTTTCAGCTCATGACCTTGACCAATCATCAGCAAGAAA
TCCTTCCCTTTTTGAACTCAGTAATCACTTGTCTGATGCAGTTGGTATTGCAGAGAAGTTGGGTGTTTCTAGATTGATGTCACTTTTAACCAAGGATGACGGGACTGTAG
CTAAGGGAATTAAGTCACCCAAAATTCTTCTTGGGGATGTTGATTCCATATCTAAAATATCTTCAAGTAATGTTTTAAATGTACCCCAAGCTGGTGCCGAAGCTGCCTTA
AGTGAAACTCATGCCAACATGGAAGCTAAGGATATAAGTGGCAGCAGCACAGAAGTGGAAGTGAATGAGAAATTGAGTTTTCTTGAAGCCCAAGCAGATGCTGTGGCTGC
AACTAATGTTTTGGATGATGAGATGGAAGATCACGAAGGATCCACTTCTGAGCAACCAAACACATCCAAGGTGGATGTGATCGAAGAGTACCCGATTGGCATTCAGACTC
AGCTGGATCAATCAATTGCTACATGTACTGAGAATATTGTCGATGGGCCATCAAGAAGCAGTGGAACAGATAACCACGATAAGGTCAAGCAAAAATCTCGTGCAGGCAAT
CAACGCGAAGGCAAAAGGGTGTCTGGGAGGAAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAGGCGGGGTGAGACGAAGTACCAGGTTCAAAACCCGACCGTTGGAGTA
CTGGAAAGGTGAACGTCTGTTGTACGGACGAGTACATGAGAGCCTGGCAACGGTAATTGGGTTGAAGTATGTGTCTCCTGGTAAAGGTAATGGCCAACCAACTCTGAAGG
TGAAGTCTTTGGTCTCCAGTGAGTACAACGAACTTGTTGAGTTAGCAGCTCTGCACTGAGGGTCGTGTACAAAAAGGAGCAAACAGCCTCGAAGCTTTTCGGATTCGGTT
TCTTGAATATAAATAGCATCTGTTACGCTCACGCCATTGCCTTGTAAACTTCTGCGCCCTTTCTTCTATATCATATATATATCTATCAAGCTGTCTCGCTTGTGTCGCTC
GTGTACACGTGTCATGTGATTTCATAATTTGAACCTTTCATACCGATGTGCAAATTTGAACCTTTCGTGCCAATGTATTTGGATTTGGATATTCAAGCGAATCAGAATGC
TAGCAATTTGACTAAGTCTTGTCAGGTCAAAGGCCATCTAAAACATTCTCTGAATTCATCTTCTAAGATTGAAAATGCTAAAGGATTACAAAATTAAGTTGTAGCTTAAA
AATGGAGTAATTTATGGGATTAGCTCTACATGTTTACATTCTTCAACATCAGTACAGTAAATCACAACATATTTACCAATCTTTGTTTTTCCTTTCTGACTCAAAACAAT
CAGACACTCCCACTTAATAAATTACAAAGAAAAAGGAGAGGAAGAAAAGAAAAGATAAGGAACATCCTTCCAAATTTTCAAATCATCTACCATTGAGATTAGAGAAATGA
GCAAACATTGCACTTCCCGAGTCATCATCGACGTCTTCGACCTCTACTGGTTGCCTACTCCGTTCTGCCTGCACGACAGGGCGCTCGACCGAGTTCGGTGCATTTGTAGA
AGCGTCGACAGCGGTCGCGGGGGCAACAGCTGCCACACTTGCGACCGTGCTTTCATCTTCA
Protein sequenceShow/hide protein sequence
MVNEDARHSDVIDPLAAYSGISLFPSAFGTLPVPSKPHDIGTDLDGIHKHLKSMVSRNPSKLIEQARSILNGNSNLMQSKAATFLVKNEKKEEAAANVEENPQERRPALN
RKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLETATRIPDHAVFEMCLHTYACEKTLPVRAAPYSSGPPIWIGITSNAVARFTYGCRCLLFYARYASVRNY
GRRMSIYELSFRVRTLSVLMASQCFSETVGAHGHILLSSMKVLTPKNLTLCASQRSVRYKHQYSSITSEDDQNVEPSQVTFESGSISPSILGTEKDASPPIICSEMKTNE
EVPLEEEEEAFVENKVNKILDELLSANCEDLEGDRAINKLQECLQIKPINLEKLCLPDLEAIQTMNLRSSRGNLPERSLISVDSQLQRIENLKSKQDDENSVNPISTAFS
MRSPLASLSALTRRISLSNSPGDPFSAHDLDQSSARNPSLFELSNHLSDAVGIAEKLGVSRLMSLLTKDDGTVAKGIKSPKILLGDVDSISKISSSNVLNVPQAGAEAAL
SETHANMEAKDISGSSTEVEVNEKLSFLEAQADAVAATNVLDDEMEDHEGSTSEQPNTSKVDVIEEYPIGIQTQLDQSIATCTENIVDGPSRSSGTDNHDKVKQKSRAGN
QREGKRVSGRKSLAGAGTTWQGGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPGKGNGQPTLKVKSLVSSEYNELVELAALH