; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G027650 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G027650
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptioncentromere protein C-like isoform X1
Genome locationchr02:33919881..33925893
RNA-Seq ExpressionLsi02G027650
SyntenyLsi02G027650
Gene Ontology termsGO:0051315 - attachment of mitotic spindle microtubules to kinetochore (biological process)
GO:0051382 - kinetochore assembly (biological process)
GO:0051455 - attachment of spindle microtubules to kinetochore involved in homologous chromosome segregation (biological process)
GO:0000776 - kinetochore (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0019237 - centromeric DNA binding (molecular function)
InterPro domainsIPR028386 - Centromere protein C/Mif2/cnp3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011659552.1 centromere protein C isoform X3 [Cucumis sativus]0.0e+0085.19Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        NL ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSI 
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAF EEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQE LQIKP
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP

Query:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP
        + LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDP
Subjt:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP

Query:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL
        YLFEL N+LSDAVG  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E K+VS S T+VE+NEKLSCL
Subjt:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL

Query:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD
        E   D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV IQSQLD
Subjt:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD

Query:  QS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
        QS T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSP
Subjt:  QS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  AKGNGQPTMKVKSLVSNEYKDLVELAALH
        AKGNG+PTMKVKSLVSNEYKDLVELAALH
Subjt:  AKGNGQPTMKVKSLVSNEYKDLVELAALH

XP_031745135.1 centromere protein C isoform X1 [Cucumis sativus]0.0e+0084.26Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILG--------RSVRY
        NL ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILG        RSVRY
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILG--------RSVRY

Query:  KHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINIL
        KHQYSSI TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAF EEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINIL
Subjt:  KHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINIL

Query:  QECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI
        QE LQIKP+ LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I
Subjt:  QECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI

Query:  GQSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVE
         QSP+RDPYLFEL N+LSDAVG  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E K+VS S T+VE
Subjt:  GQSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVE

Query:  VNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYP
        +NEKLSCLE   D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYP
Subjt:  VNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYP

Query:  VGIQSQLDQS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATV
        V IQSQLDQS T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TV
Subjt:  VGIQSQLDQS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATV

Query:  IGLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH
        IGLKYVSPAKGNG+PTMKVKSLVSNEYKDLVELAALH
Subjt:  IGLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH

XP_031745136.1 centromere protein C isoform X2 [Cucumis sativus]0.0e+0083.99Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILG--------RSVRY
        NL ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILG        RSVRY
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILG--------RSVRY

Query:  KHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINIL
        KHQYSSI TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAF EEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINIL
Subjt:  KHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINIL

Query:  QECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI
        QE LQIKP+ LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I
Subjt:  QECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI

Query:  GQSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVE
         QSP+RDPYLFEL N+LSDAVG  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E K+VS S T+VE
Subjt:  GQSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVE

Query:  VNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYP
        +NEKLSCLE   D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYP
Subjt:  VNEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYP

Query:  VGIQSQLDQS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATV
        V IQSQLDQS T TC ENI +G SRSSGTDHHD   VKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TV
Subjt:  VGIQSQLDQS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATV

Query:  IGLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH
        IGLKYVSPAKGNG+PTMKVKSLVSNEYKDLVELAALH
Subjt:  IGLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH

XP_031745137.1 centromere protein C isoform X4 [Cucumis sativus]0.0e+0085.05Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        NL ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILG SVRYKHQYSSI 
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAF EEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQE LQIKP
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP

Query:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP
        + LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDP
Subjt:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP

Query:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL
        YLFEL N+LSDAVG  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E K+VS S T+VE+NEKLSCL
Subjt:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL

Query:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD
        E   D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV IQSQLD
Subjt:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD

Query:  QS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
        QS T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSP
Subjt:  QS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  AKGNGQPTMKVKSLVSNEYKDLVELAALH
        AKGNG+PTMKVKSLVSNEYKDLVELAALH
Subjt:  AKGNGQPTMKVKSLVSNEYKDLVELAALH

XP_038896841.1 centromere protein C isoform X2 [Benincasa hispida]0.0e+0087.91Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        MVT+EARHSD IDPLAAYSGINLFS+AF TL DPSKPHDLG DLDGIHKHLKSMVSRSPSKL+EQARSILDGNSNLMQSEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        N  ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYER ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP
        TEDD+NVDPSQVT ESG ISP ++GTETHPSPHIIDS  KTDEDVAF    EEEEEFV SVTKAENKVNKILDELLS NC DLEGDRAINILQECLQIKP
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP

Query:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP
         NLEKLCLPDLEAI TM LKSSS NLSKRS ISV NQLQRIETLKSKQDDE LVNP+S PSSIRSPLAS+SALNRRISLSNSSGDPFSAH I QSPARDP
Subjt:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP

Query:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL
        YLF L+N LSDA GIAEQSS+SKLKSLLTKD GTVANGIKPSKILF DVDSMSK+SSS VLNVP+VG +T LSGTH SME KDVSG   EVEVNEKLSCL
Subjt:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL

Query:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD
        E   D VANMQMEDHEGSASEQPNSSKVD+IKEYPVGIQSQLDQ+TA C ENI DGPSR SG DH  EMEDH+G A EQPNSS VDVIKEYPVG+Q QLD
Subjt:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD

Query:  QSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
        Q TATCTENI +GPSRSSGTDH +EEQ KPKSRANKQ +GKKISGRQSLAGAGTTWQ GVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
Subjt:  QSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTMKVKSLVSNEYKDLVELAALH
        KGNGQP MKVKSLVSNEYKDLVELAALH
Subjt:  KGNGQPTMKVKSLVSNEYKDLVELAALH

TrEMBL top hitse value%identityAlignment
A0A0A0K774 Uncharacterized protein0.0e+0085.19Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        NL ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSI 
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAF EEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQE LQIKP
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP

Query:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP
        + LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDP
Subjt:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP

Query:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL
        YLFEL N+LSDAVG  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E K+VS S T+VE+NEKLSCL
Subjt:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL

Query:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD
        E   D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV IQSQLD
Subjt:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD

Query:  QS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
        QS T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSP
Subjt:  QS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  AKGNGQPTMKVKSLVSNEYKDLVELAALH
        AKGNG+PTMKVKSLVSNEYKDLVELAALH
Subjt:  AKGNGQPTMKVKSLVSNEYKDLVELAALH

A0A1S3CDU5 uncharacterized protein LOC103499749 isoform X20.0e+0083.29Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        MV EE R SDVIDPLAAYSGINLF  AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        N  ERRPALNRKRARFSLKPDA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAF EEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP

Query:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP
        + LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPSS+RSPLAS+SALNRRISLSNSSGD FSAH I +SPARDP
Subjt:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP

Query:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL
        YLFEL N+LSDAVGI E SS+SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + K+VSGS T+VE+NEKLSCL
Subjt:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL

Query:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD
        E   DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLD
Subjt:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD

Query:  QS--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS
        QS  T TC E IV+G SRSSGTDHHDE  VKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVS
Subjt:  QS--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS

Query:  PAKGNGQPTMKVKSLVSNEYKDLVELAALH
        P KGNG+PTMKVKSLVSNEYKDLV+LAALH
Subjt:  PAKGNGQPTMKVKSLVSNEYKDLVELAALH

A0A1S3CDU7 uncharacterized protein LOC103499749 isoform X10.0e+0083.56Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        MV EE R SDVIDPLAAYSGINLF  AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        N  ERRPALNRKRARFSLKPDA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAF EEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP

Query:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP
        + LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPSS+RSPLAS+SALNRRISLSNSSGD FSAH I +SPARDP
Subjt:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP

Query:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL
        YLFEL N+LSDAVGI E SS+SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + K+VSGS T+VE+NEKLSCL
Subjt:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL

Query:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD
        E   DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLD
Subjt:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD

Query:  QS--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS
        QS  T TC E IV+G SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVS
Subjt:  QS--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS

Query:  PAKGNGQPTMKVKSLVSNEYKDLVELAALH
        P KGNG+PTMKVKSLVSNEYKDLV+LAALH
Subjt:  PAKGNGQPTMKVKSLVSNEYKDLVELAALH

A0A1S4E341 uncharacterized protein LOC103499749 isoform X32.4e-30980.41Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        MV EE R SDVIDPLAAYSGINLF  AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        N  ERRPALNRKRARFSLKPDA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAF EEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP

Query:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP
        + LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPSS+RSPLAS+SALNRRISLSNSS                 
Subjt:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP

Query:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL
                    VGI E SS+SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + K+VSGS T+VE+NEKLSCL
Subjt:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL

Query:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD
        E   DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLD
Subjt:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD

Query:  QS--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS
        QS  T TC E IV+G SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVS
Subjt:  QS--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVS

Query:  PAKGNGQPTMKVKSLVSNEYKDLVELAALH
        P KGNG+PTMKVKSLVSNEYKDLV+LAALH
Subjt:  PAKGNGQPTMKVKSLVSNEYKDLVELAALH

A0A5A7UUE4 Uncharacterized protein0.0e+0083.54Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        MV EE R SDVIDPLAAYSGINLF  AF TL D SKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        N  ERRPALNRKRARFSLKPDA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAF EEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKP

Query:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP
        + LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPSS+RSPLAS+SALNRRISLSNSSGD FSAH I +SPARDP
Subjt:  INLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDP

Query:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL
        YLFEL N+LSDAVGI E SS+SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + K+VSGS T+VE+NEKLSCL
Subjt:  YLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCL

Query:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD
        E   DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLD
Subjt:  E---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLD

Query:  QS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
        QS T TC E IV+G SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP
Subjt:  QS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  AKGNGQPTMKVKSLVSNEYKDLVELAALH
         KGNG+PTMKVKSLVSNEYKDLV+LAALH
Subjt:  AKGNGQPTMKVKSLVSNEYKDLVELAALH

SwissProt top hitse value%identityAlignment
Q66LG9 Centromere protein C1.5e-5830.42Show/hide
Query:  DPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRK
        DPL AYSG++LF    ++L +P  P     DL   H  L+SM     S+  EQA++IL+                 + D +  +    N  ERRP L+RK
Subjt:  DPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRK

Query:  RARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVR-YKHQYSSITTEDDENVDPSQ
        R  FSL     QP   + P+FD  +    E+FF AY++ E A +E QKQTG+ + D+ +  PS  +R RRPGI GR  R +K  ++     D  N++ S+
Subjt:  RARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVR-YKHQYSSITTEDDENVDPSQ

Query:  VTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDL
          +   S        E+  + H+   +++ D+                S    +  +N +L +LL+ + E+LEGD AI +L+E LQIK  N+EK  +P+ 
Subjt:  VTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDL

Query:  EAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI------GQSPAR---DPYL
        + +  MNLK+S  N   R  +S    +Q I  LK         N V+   +  SP        +  S  N   D FS  DI       Q P+     P  
Subjt:  EAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI------GQSPAR---DPYL

Query:  FELSNYLSDAVGIAEQSS--ISKLKSLLTKDSGTVANGIKPSKILFGD------VDSMSKMSSS----NV-LNVPQVGVDTALSGTHASMETKDVSGSRT
         ++ N     VG  + +S     +     +D   + +GI  S +          +DS+S  SS+    NV +      VD  +S + A+  T D      
Subjt:  FELSNYLSDAVGIAEQSS--ISKLKSLLTKDSGTVANGIKPSKILFGD------VDSMSKMSSS----NV-LNVPQVGVDTALSGTHASMETKDVSGSRT

Query:  EVEVNEKLSCLE--------DVVANMQMED-----HEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATC----TENIVDGPSRCSGMDHADEME--DHE
        + E+NE+   LE        +V     +E+      +G++S+ PN +     ++Y   +   L+ A         EN+  G +    +++A E+    H+
Subjt:  EVEVNEKLSCLE--------DVVANMQMED-----HEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATC----TENIVDGPSRCSGMDHADEME--DHE

Query:  GLAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEY
             +   S    +K+    +  +        T    +   + +    ++ E+ KPK       +GK  S R+SLA AGT  + GVRRSTR K+RPLEY
Subjt:  GLAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEY

Query:  WKGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTMKVKSLVSNEYKDLVELAALH
        W+GER LYGR+HESL TVIG+KY SP +G       KVKS VS+EYK LV+ AALH
Subjt:  WKGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTMKVKSLVSNEYKDLVELAALH

Arabidopsis top hitse value%identityAlignment
AT1G15660.1 centromere protein C1.1e-5930.42Show/hide
Query:  DPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRK
        DPL AYSG++LF    ++L +P  P     DL   H  L+SM     S+  EQA++IL+                 + D +  +    N  ERRP L+RK
Subjt:  DPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRK

Query:  RARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVR-YKHQYSSITTEDDENVDPSQ
        R  FSL     QP   + P+FD  +    E+FF AY++ E A +E QKQTG+ + D+ +  PS  +R RRPGI GR  R +K  ++     D  N++ S+
Subjt:  RARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVR-YKHQYSSITTEDDENVDPSQ

Query:  VTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDL
          +   S        E+  + H+   +++ D+                S    +  +N +L +LL+ + E+LEGD AI +L+E LQIK  N+EK  +P+ 
Subjt:  VTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDL

Query:  EAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI------GQSPAR---DPYL
        + +  MNLK+S  N   R  +S    +Q I  LK         N V+   +  SP        +  S  N   D FS  DI       Q P+     P  
Subjt:  EAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI------GQSPAR---DPYL

Query:  FELSNYLSDAVGIAEQSS--ISKLKSLLTKDSGTVANGIKPSKILFGD------VDSMSKMSSS----NV-LNVPQVGVDTALSGTHASMETKDVSGSRT
         ++ N     VG  + +S     +     +D   + +GI  S +          +DS+S  SS+    NV +      VD  +S + A+  T D      
Subjt:  FELSNYLSDAVGIAEQSS--ISKLKSLLTKDSGTVANGIKPSKILFGD------VDSMSKMSSS----NV-LNVPQVGVDTALSGTHASMETKDVSGSRT

Query:  EVEVNEKLSCLE--------DVVANMQMED-----HEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATC----TENIVDGPSRCSGMDHADEME--DHE
        + E+NE+   LE        +V     +E+      +G++S+ PN +     ++Y   +   L+ A         EN+  G +    +++A E+    H+
Subjt:  EVEVNEKLSCLE--------DVVANMQMED-----HEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATC----TENIVDGPSRCSGMDHADEME--DHE

Query:  GLAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEY
             +   S    +K+    +  +        T    +   + +    ++ E+ KPK       +GK  S R+SLA AGT  + GVRRSTR K+RPLEY
Subjt:  GLAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEY

Query:  WKGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTMKVKSLVSNEYKDLVELAALH
        W+GER LYGR+HESL TVIG+KY SP +G       KVKS VS+EYK LV+ AALH
Subjt:  WKGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTMKVKSLVSNEYKDLVELAALH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGACCGAAGAGGCTCGACACTCCGATGTGATCGATCCACTTGCTGCTTATTCTGGTATCAATCTCTTTTCGAACGCATTTCGTACTTTGCGGGATCCGTCAAAGCC
ACATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTGTCAAGAAGTCCCAGTAAACTTGTAGAGCAGGCCAGATCAATTTTAGACGGGAACT
CAAATTTGATGCAATCTGAAGCTGCCACATTTCTTGTAAAGAATGAGAAAGATGAGGAAGCTACAGTGAAGGTGGAGGAAAATCTTCATGAAAGAAGGCCGGCCTTAAAC
CGAAAGCGGGCTAGGTTCTCTTTAAAACCTGATGCTAGACAACCTTCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGATCCCGAGGAGTTCTTTTTGGC
CTACGAAAGGCTTGAAAATGCCAAAAAAGAAATTCAAAAACAGACAGGAGCAGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAAACGCCAGCGTAGACCAG
GGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATGAGAATGTAGATCCTTCTCAAGTGACGCTTGAATCAGGTAGCATCAGT
CCATCGATATTGGGCACAGAGACACACCCAAGTCCACATATAATTGACTCGGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGGAGGAGTT
CGTTGGTTCAGTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGATGAATTACTCTCTGCCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGG
AGTGCTTGCAGATTAAACCCATTAATTTAGAGAAATTATGCCTTCCAGATTTAGAAGCCATTCCAACAATGAATTTGAAATCTTCAAGTTGCAATCTGTCAAAGCGTAGT
TTCATCAGTGTGGACAATCAGTTACAAAGGATAGAAACTTTGAAATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGTCCATT
GGCATCTGTATCAGCCCTAAATAGACGAATTTCGCTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGACATTGGCCAATCTCCAGCAAGAGATCCTTACCTTTTTG
AACTCAGTAATTACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTATTTCTAAATTGAAGTCACTTTTAACCAAAGATAGCGGGACTGTAGCAAATGGAATTAAG
CCATCCAAAATTCTTTTTGGAGATGTTGATTCCATGTCTAAAATGTCTTCAAGTAATGTTTTAAATGTCCCCCAAGTTGGTGTCGATACTGCCTTAAGTGGAACTCACGC
CAGCATGGAAACTAAAGATGTTAGTGGCAGCCGCACAGAAGTGGAAGTAAATGAGAAATTGAGTTGCCTTGAAGATGTTGTGGCTAATATGCAGATGGAAGATCACGAAG
GATCAGCTTCTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTACCCAGTTGGCATTCAGAGTCAGTTGGATCAAGCAACTGCTACTTGTACTGAAAATATT
GTCGATGGGCCATCTAGATGCAGTGGAATGGATCACGCCGATGAGATGGAAGATCACGAAGGATTAGCTATTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGA
GTACCCGGTTGGCATTCAGAGTCAGTTGGATCAATCAACTGCTACTTGTACTGAAAATATTGTCAACGGGCCGTCTAGAAGCAGTGGAACGGATCACCACGATGAGGAAC
AGGTCAAGCCAAAATCTCGTGCAAACAAACAACGGAAAGGCAAAAAGATTTCTGGGAGGCAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGT
ACCAGGTTCAAAACACGACCGTTGGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGCGTACATGAGAGCCTAGCAACAGTAATCGGGTTGAAGTATGTATCTCCTGC
AAAAGGAAATGGCCAACCAACTATGAAGGTGAAGTCTTTAGTCTCCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTTCACTAA
mRNA sequenceShow/hide mRNA sequence
GTTTCATTGAAATAAGTCGGCTAAAAGGAAAAAGAAGAAGGTTTTGAAGGACCTCATCTGAAAAACAAAAAAAATTTAAACGGAAAAAAAAAAAGAACACGAAAGTTTTC
GCGCTTTGGTTCACTCTTTCGAATTTGCAGTGGTAGGGTTCGGTTCAAACATTGGATAGGGCAACAATGGTGACCGAAGAGGCTCGACACTCCGATGTGATCGATCCACT
TGCTGCTTATTCTGGTATCAATCTCTTTTCGAACGCATTTCGTACTTTGCGGGATCCGTCAAAGCCACATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCA
AATCCATGGTGTCAAGAAGTCCCAGTAAACTTGTAGAGCAGGCCAGATCAATTTTAGACGGGAACTCAAATTTGATGCAATCTGAAGCTGCCACATTTCTTGTAAAGAAT
GAGAAAGATGAGGAAGCTACAGTGAAGGTGGAGGAAAATCTTCATGAAAGAAGGCCGGCCTTAAACCGAAAGCGGGCTAGGTTCTCTTTAAAACCTGATGCTAGACAACC
TTCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGATCCCGAGGAGTTCTTTTTGGCCTACGAAAGGCTTGAAAATGCCAAAAAAGAAATTCAAAAACAGA
CAGGAGCAGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAAACGCCAGCGTAGACCAGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCA
ATAACAACTGAAGATGATGAGAATGTAGATCCTTCTCAAGTGACGCTTGAATCAGGTAGCATCAGTCCATCGATATTGGGCACAGAGACACACCCAAGTCCACATATAAT
TGACTCGGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGGAGGAGTTCGTTGGTTCAGTTACCAAGGCAGAGAACAAAGTGAATAAAATTT
TGGATGAATTACTCTCTGCCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGGAGTGCTTGCAGATTAAACCCATTAATTTAGAGAAATTATGCCTT
CCAGATTTAGAAGCCATTCCAACAATGAATTTGAAATCTTCAAGTTGCAATCTGTCAAAGCGTAGTTTCATCAGTGTGGACAATCAGTTACAAAGGATAGAAACTTTGAA
ATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGTCCATTGGCATCTGTATCAGCCCTAAATAGACGAATTTCGCTTTCAAATT
CATCAGGTGATCCATTTTCTGCTCATGACATTGGCCAATCTCCAGCAAGAGATCCTTACCTTTTTGAACTCAGTAATTACTTGTCTGATGCAGTTGGTATTGCAGAGCAG
TCAAGTATTTCTAAATTGAAGTCACTTTTAACCAAAGATAGCGGGACTGTAGCAAATGGAATTAAGCCATCCAAAATTCTTTTTGGAGATGTTGATTCCATGTCTAAAAT
GTCTTCAAGTAATGTTTTAAATGTCCCCCAAGTTGGTGTCGATACTGCCTTAAGTGGAACTCACGCCAGCATGGAAACTAAAGATGTTAGTGGCAGCCGCACAGAAGTGG
AAGTAAATGAGAAATTGAGTTGCCTTGAAGATGTTGTGGCTAATATGCAGATGGAAGATCACGAAGGATCAGCTTCTGAGCAACCAAACTCATCCAAGGTGGATGTGATC
AAAGAGTACCCAGTTGGCATTCAGAGTCAGTTGGATCAAGCAACTGCTACTTGTACTGAAAATATTGTCGATGGGCCATCTAGATGCAGTGGAATGGATCACGCCGATGA
GATGGAAGATCACGAAGGATTAGCTATTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTACCCGGTTGGCATTCAGAGTCAGTTGGATCAATCAACTGCTA
CTTGTACTGAAAATATTGTCAACGGGCCGTCTAGAAGCAGTGGAACGGATCACCACGATGAGGAACAGGTCAAGCCAAAATCTCGTGCAAACAAACAACGGAAAGGCAAA
AAGATTTCTGGGAGGCAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACCAGGTTCAAAACACGACCGTTGGAGTACTGGAAAGGTGAAAG
GCTGTTGTACGGACGCGTACATGAGAGCCTAGCAACAGTAATCGGGTTGAAGTATGTATCTCCTGCAAAAGGAAATGGCCAACCAACTATGAAGGTGAAGTCTTTAGTCT
CCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTTCACTAAGGTTCGTGTACCAAAAGGAACAAAAATCCTTGAAGCTTTCCGGATTTTGGATGTATAACTAGCAAT
TCTCTTTGAATATAAATAGCATCTAGTCTCTGTCTGTGGAAAGACTGTAGAGGAGAATTAGGCTTATGCCTTTGCATTGTATATTTCTTCGCCCTTCTCTATCATATATA
TATCTATCAAGCTGTTTCGCTTGTCTGTTTTTGCTCATGTACTTGTGTCGTATGATTTCTTATTTTAATTTTTACCCATTGACATCTAACTTTTTGTACCAATGTTCCAG
AATGAATTGTAATCCTTTGGCCAACTTGTTTTTTACTCACTTTTTCTTTTTGGCTGGGGGCTAAG
Protein sequenceShow/hide protein sequence
MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALN
RKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSIS
PSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRS
FISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIK
PSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLEDVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENI
VDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRS
TRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH