; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10023321 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10023321
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptioncentromere protein C-like isoform X1
Genome locationChr05:33042487..33047969
RNA-Seq ExpressionHG10023321
SyntenyHG10023321
Gene Ontology termsGO:0051315 - attachment of mitotic spindle microtubules to kinetochore (biological process)
GO:0051382 - kinetochore assembly (biological process)
GO:0051455 - attachment of spindle microtubules to kinetochore involved in homologous chromosome segregation (biological process)
GO:0000776 - kinetochore (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0019237 - centromeric DNA binding (molecular function)
InterPro domainsIPR028386 - Centromere protein C/Mif2/cnp3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011659552.1 centromere protein C isoform X3 [Cucumis sativus]0.0e+0085.3Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        NL ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSI 
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQE LQIKP+
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI

Query:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY
         LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDPY
Subjt:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY

Query:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE
        LFEL N+LSDAVG  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E K+VS S T+VE+NEKLSCLE
Subjt:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE

Query:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ
           D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV IQSQLDQ
Subjt:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ

Query:  S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
        S T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPA
Subjt:  S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTMKVKSLVSNEYKDLVELAALH
        KGNG+PTMKVKSLVSNEYKDLVELAALH
Subjt:  KGNGQPTMKVKSLVSNEYKDLVELAALH

XP_031745135.1 centromere protein C isoform X1 [Cucumis sativus]0.0e+0084.38Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILG--------RSVRY
        NL ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILG        RSVRY
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILG--------RSVRY

Query:  KHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQ
        KHQYSSI TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQ
Subjt:  KHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQ

Query:  ECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIG
        E LQIKP+ LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I 
Subjt:  ECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIG

Query:  QSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEV
        QSP+RDPYLFEL N+LSDAVG  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E K+VS S T+VE+
Subjt:  QSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEV

Query:  NEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPV
        NEKLSCLE   D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV
Subjt:  NEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPV

Query:  GIQSQLDQS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVI
         IQSQLDQS T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVI
Subjt:  GIQSQLDQS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVI

Query:  GLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH
        GLKYVSPAKGNG+PTMKVKSLVSNEYKDLVELAALH
Subjt:  GLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH

XP_031745136.1 centromere protein C isoform X2 [Cucumis sativus]0.0e+0084.1Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILG--------RSVRY
        NL ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILG        RSVRY
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILG--------RSVRY

Query:  KHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQ
        KHQYSSI TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQ
Subjt:  KHQYSSITTEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQ

Query:  ECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIG
        E LQIKP+ LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I 
Subjt:  ECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIG

Query:  QSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEV
        QSP+RDPYLFEL N+LSDAVG  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E K+VS S T+VE+
Subjt:  QSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEV

Query:  NEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPV
        NEKLSCLE   D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV
Subjt:  NEKLSCLE---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPV

Query:  GIQSQLDQS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVI
         IQSQLDQS T TC ENI +G SRSSGTDHHD   VKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVI
Subjt:  GIQSQLDQS-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVI

Query:  GLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH
        GLKYVSPAKGNG+PTMKVKSLVSNEYKDLVELAALH
Subjt:  GLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH

XP_031745137.1 centromere protein C isoform X4 [Cucumis sativus]0.0e+0085.16Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        NL ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILG SVRYKHQYSSI 
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQE LQIKP+
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI

Query:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY
         LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDPY
Subjt:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY

Query:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE
        LFEL N+LSDAVG  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E K+VS S T+VE+NEKLSCLE
Subjt:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE

Query:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ
           D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV IQSQLDQ
Subjt:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ

Query:  S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
        S T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPA
Subjt:  S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTMKVKSLVSNEYKDLVELAALH
        KGNG+PTMKVKSLVSNEYKDLVELAALH
Subjt:  KGNGQPTMKVKSLVSNEYKDLVELAALH

XP_038896841.1 centromere protein C isoform X2 [Benincasa hispida]0.0e+0088.03Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        MVT+EARHSD IDPLAAYSGINLFS+AF TL DPSKPHDLG DLDGIHKHLKSMVSRSPSKL+EQARSILDGNSNLMQSEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        N  ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYER ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI
        TEDD+NVDPSQVT ESG ISP ++GTETHPSPHIIDS  KTDEDVAF   EEEEEFV SVTKAENKVNKILDELLS NC DLEGDRAINILQECLQIKP 
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI

Query:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY
        NLEKLCLPDLEAI TM LKSSS NLSKRS ISV NQLQRIETLKSKQDDE LVNP+S PSSIRSPLAS+SALNRRISLSNSSGDPFSAH I QSPARDPY
Subjt:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY

Query:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE
        LF L+N LSDA GIAEQSS+SKLKSLLTKD GTVANGIKPSKILF DVDSMSK+SSS VLNVP+VG +T LSGTH SME KDVSG   EVEVNEKLSCLE
Subjt:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE

Query:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ
           D VANMQMEDHEGSASEQPNSSKVD+IKEYPVGIQSQLDQ+TA C ENI DGPSR SG DH  EMEDH+G A EQPNSS VDVIKEYPVG+Q QLDQ
Subjt:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ

Query:  STATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAK
         TATCTENI +GPSRSSGTDH +EEQ KPKSRANKQ +GKKISGRQSLAGAGTTWQ GVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAK
Subjt:  STATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAK

Query:  GNGQPTMKVKSLVSNEYKDLVELAALH
        GNGQP MKVKSLVSNEYKDLVELAALH
Subjt:  GNGQPTMKVKSLVSNEYKDLVELAALH

TrEMBL top hitse value%identityAlignment
A0A0A0K774 Uncharacterized protein0.0e+0085.3Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        M  EEARHSDVIDPLAAYSGINLFS AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNSN M SEAATFLVKNEK+EEATVK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        NL ERRPALNRKRARFSLKPDARQP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTGAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSI 
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE V S TKAEN++N IL+E LS NCEDLEGDRAINILQE LQIKP+
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI

Query:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY
         LEKLCLPDLEAIPTMNLKSS  NLSKRS ISVDNQLQ+IE LKSKQD+  LVNPVSTPSS+RSPLAS+SALNRRISLSNSS D FSAH I QSP+RDPY
Subjt:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY

Query:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE
        LFEL N+LSDAVG  EQSS+SKLK LLT+D GTVANGIKPSKIL GD DSMS +SSSN+LNVPQVG +TALSGT+AS E K+VS S T+VE+NEKLSCLE
Subjt:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE

Query:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ
           D VANMQ+EDHEGSASEQP  S+VD+IKEYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQP SSKVDVIKEYPV IQSQLDQ
Subjt:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ

Query:  S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
        S T TC ENI +G SRSSGTDHHD EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPA
Subjt:  S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTMKVKSLVSNEYKDLVELAALH
        KGNG+PTMKVKSLVSNEYKDLVELAALH
Subjt:  KGNGQPTMKVKSLVSNEYKDLVELAALH

A0A1S3CDU5 uncharacterized protein LOC103499749 isoform X20.0e+0083.4Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        MV EE R SDVIDPLAAYSGINLF  AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        N  ERRPALNRKRARFSLKPDA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP+
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI

Query:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY
         LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPSS+RSPLAS+SALNRRISLSNSSGD FSAH I +SPARDPY
Subjt:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY

Query:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE
        LFEL N+LSDAVGI E SS+SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + K+VSGS T+VE+NEKLSCLE
Subjt:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE

Query:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ
           DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLDQ
Subjt:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ

Query:  S--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
        S  T TC E IV+G SRSSGTDHHDE  VKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP
Subjt:  S--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  AKGNGQPTMKVKSLVSNEYKDLVELAALH
         KGNG+PTMKVKSLVSNEYKDLV+LAALH
Subjt:  AKGNGQPTMKVKSLVSNEYKDLVELAALH

A0A1S3CDU7 uncharacterized protein LOC103499749 isoform X10.0e+0083.68Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        MV EE R SDVIDPLAAYSGINLF  AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        N  ERRPALNRKRARFSLKPDA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP+
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI

Query:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY
         LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPSS+RSPLAS+SALNRRISLSNSSGD FSAH I +SPARDPY
Subjt:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY

Query:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE
        LFEL N+LSDAVGI E SS+SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + K+VSGS T+VE+NEKLSCLE
Subjt:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE

Query:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ
           DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLDQ
Subjt:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ

Query:  S--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
        S  T TC E IV+G SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP
Subjt:  S--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  AKGNGQPTMKVKSLVSNEYKDLVELAALH
         KGNG+PTMKVKSLVSNEYKDLV+LAALH
Subjt:  AKGNGQPTMKVKSLVSNEYKDLVELAALH

A0A1S4E341 uncharacterized protein LOC103499749 isoform X30.0e+0080.52Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        MV EE R SDVIDPLAAYSGINLF  AF TL DPSKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        N  ERRPALNRKRARFSLKPDA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP+
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI

Query:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY
         LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPSS+RSPLAS+SALNRRISLSNSS                  
Subjt:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY

Query:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE
                   VGI E SS+SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + K+VSGS T+VE+NEKLSCLE
Subjt:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE

Query:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ
           DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLDQ
Subjt:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ

Query:  S--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP
        S  T TC E IV+G SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP
Subjt:  S--TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSP

Query:  AKGNGQPTMKVKSLVSNEYKDLVELAALH
         KGNG+PTMKVKSLVSNEYKDLV+LAALH
Subjt:  AKGNGQPTMKVKSLVSNEYKDLVELAALH

A0A5A7UUE4 Uncharacterized protein0.0e+0083.65Show/hide
Query:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE
        MV EE R SDVIDPLAAYSGINLF  AF TL D SKPHDLGTDLDGIHK LKSMV RSPSKL+EQARSILDGNS  M SEAATFLVKNEK+E A+VK EE
Subjt:  MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEE

Query:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT
        N  ERRPALNRKRARFSLKPDA QP VNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ GAVLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT
Subjt:  NLHERRPALNRKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSIT

Query:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI
        TEDD+NVDPSQVT +SG  SP  LGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE V S TKAEN+VN ILDE LS NCEDLEGDRAINILQE LQIKP+
Subjt:  TEDDENVDPSQVTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPI

Query:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY
         LEKLCLPDLEAIPTMNLKS+  NLSKRS ISVDNQLQ+ ETLKSK+D+E LVN VSTPSS+RSPLAS+SALNRRISLSNSSGD FSAH I +SPARDPY
Subjt:  NLEKLCLPDLEAIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPY

Query:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE
        LFEL N+LSDAVGI E SS+SKLK LLT+D GT+ANGI+PSKIL GD DSMSK+SSSN+LNV QVG +TALSGT+AS + K+VSGS T+VE+NEKLSCLE
Subjt:  LFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKPSKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLE

Query:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ
           DVVANMQ+ DH+GSASEQP  S+VD+I+EYPVGI+SQLDQ+ ATCTENIVDG SR SG +H DEMEDHEG A EQPNSSKVD+IKEYPVGIQ QLDQ
Subjt:  ---DVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIVDGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQ

Query:  S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
        S T TC E IV+G SRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP 
Subjt:  S-TATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTMKVKSLVSNEYKDLVELAALH
        KGNG+PTMKVKSLVSNEYKDLV+LAALH
Subjt:  KGNGQPTMKVKSLVSNEYKDLVELAALH

SwissProt top hitse value%identityAlignment
Q66LG9 Centromere protein C1.2e-5830.46Show/hide
Query:  DPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRK
        DPL AYSG++LF    ++L +P  P     DL   H  L+SM     S+  EQA++IL+                 + D +  +    N  ERRP L+RK
Subjt:  DPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRK

Query:  RARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVR-YKHQYSSITTEDDENVDPSQ
        R  FSL     QP   + P+FD  +    E+FF AY++ E A +E QKQTG+ + D+ +  PS  +R RRPGI GR  R +K  ++     D  N++ S+
Subjt:  RARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVR-YKHQYSSITTEDDENVDPSQ

Query:  VTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDLE
          +   S        E+  + H+   +++ D+               S    +  +N +L +LL+ + E+LEGD AI +L+E LQIK  N+EK  +P+ +
Subjt:  VTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDLE

Query:  AIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI------GQSPAR---DPYLF
         +  MNLK+S  N   R  +S    +Q I  LK         N V+   +  SP        +  S  N   D FS  DI       Q P+     P   
Subjt:  AIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI------GQSPAR---DPYLF

Query:  ELSNYLSDAVGIAEQSS--ISKLKSLLTKDSGTVANGIKPSKILFGD------VDSMSKMSSS----NV-LNVPQVGVDTALSGTHASMETKDVSGSRTE
        ++ N     VG  + +S     +     +D   + +GI  S +          +DS+S  SS+    NV +      VD  +S + A+  T D      +
Subjt:  ELSNYLSDAVGIAEQSS--ISKLKSLLTKDSGTVANGIKPSKILFGD------VDSMSKMSSS----NV-LNVPQVGVDTALSGTHASMETKDVSGSRTE

Query:  VEVNEKLSCLE--------DVVANMQMED-----HEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATC----TENIVDGPSRCSGMDHADEME--DHEG
         E+NE+   LE        +V     +E+      +G++S+ PN +     ++Y   +   L+ A         EN+  G +    +++A E+    H+ 
Subjt:  VEVNEKLSCLE--------DVVANMQMED-----HEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATC----TENIVDGPSRCSGMDHADEME--DHEG

Query:  LAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYW
            +   S    +K+    +  +        T    +   + +    ++ E+ KPK       +GK  S R+SLA AGT  + GVRRSTR K+RPLEYW
Subjt:  LAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYW

Query:  KGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTMKVKSLVSNEYKDLVELAALH
        +GER LYGR+HESL TVIG+KY SP +G       KVKS VS+EYK LV+ AALH
Subjt:  KGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTMKVKSLVSNEYKDLVELAALH

Arabidopsis top hitse value%identityAlignment
AT1G15660.1 centromere protein C8.2e-6030.46Show/hide
Query:  DPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRK
        DPL AYSG++LF    ++L +P  P     DL   H  L+SM     S+  EQA++IL+                 + D +  +    N  ERRP L+RK
Subjt:  DPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALNRK

Query:  RARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVR-YKHQYSSITTEDDENVDPSQ
        R  FSL     QP   + P+FD  +    E+FF AY++ E A +E QKQTG+ + D+ +  PS  +R RRPGI GR  R +K  ++     D  N++ S+
Subjt:  RARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVR-YKHQYSSITTEDDENVDPSQ

Query:  VTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDLE
          +   S        E+  + H+   +++ D+               S    +  +N +L +LL+ + E+LEGD AI +L+E LQIK  N+EK  +P+ +
Subjt:  VTLESGSISPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDLE

Query:  AIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI------GQSPAR---DPYLF
         +  MNLK+S  N   R  +S    +Q I  LK         N V+   +  SP        +  S  N   D FS  DI       Q P+     P   
Subjt:  AIPTMNLKSSSCNLSKRSFISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDI------GQSPAR---DPYLF

Query:  ELSNYLSDAVGIAEQSS--ISKLKSLLTKDSGTVANGIKPSKILFGD------VDSMSKMSSS----NV-LNVPQVGVDTALSGTHASMETKDVSGSRTE
        ++ N     VG  + +S     +     +D   + +GI  S +          +DS+S  SS+    NV +      VD  +S + A+  T D      +
Subjt:  ELSNYLSDAVGIAEQSS--ISKLKSLLTKDSGTVANGIKPSKILFGD------VDSMSKMSSS----NV-LNVPQVGVDTALSGTHASMETKDVSGSRTE

Query:  VEVNEKLSCLE--------DVVANMQMED-----HEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATC----TENIVDGPSRCSGMDHADEME--DHEG
         E+NE+   LE        +V     +E+      +G++S+ PN +     ++Y   +   L+ A         EN+  G +    +++A E+    H+ 
Subjt:  VEVNEKLSCLE--------DVVANMQMED-----HEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATC----TENIVDGPSRCSGMDHADEME--DHEG

Query:  LAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYW
            +   S    +K+    +  +        T    +   + +    ++ E+ KPK       +GK  S R+SLA AGT  + GVRRSTR K+RPLEYW
Subjt:  LAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYW

Query:  KGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTMKVKSLVSNEYKDLVELAALH
        +GER LYGR+HESL TVIG+KY SP +G       KVKS VS+EYK LV+ AALH
Subjt:  KGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTMKVKSLVSNEYKDLVELAALH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGACCGAAGAGGCTCGACACTCCGATGTGATCGATCCACTTGCTGCTTATTCTGGTATCAATCTCTTTTCGAACGCATTTCGTACTTTGCGGGATCCGTCAAAGCC
ACATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTGTCAAGAAGTCCCAGTAAACTTGTAGAGCAGGCCAGATCAATTTTAGACGGGAACT
CAAATTTGATGCAATCTGAAGCTGCCACATTTCTTGTAAAGAATGAGAAAGATGAGGAAGCTACAGTGAAGGTGGAGGAAAATCTTCATGAAAGAAGGCCGGCCTTAAAC
CGAAAGCGGGCTAGGTTCTCTTTAAAACCTGATGCTAGACAACCTTCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGATCCCGAGGAGTTCTTTTTGGC
CTACGAAAGGCTTGAAAATGCCAAAAAAGAAATTCAAAAACAGACAGGAGCAGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAAACGCCAGCGTAGACCAG
GGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATGAGAATGTAGATCCTTCTCAAGTGACGCTTGAATCAGGTAGCATCAGT
CCATCGATATTGGGCACAGAGACACACCCAAGTCCACATATAATTGACTCGGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGGAGTTCGT
TGGTTCAGTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGATGAATTACTCTCTGCCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGGAGT
GCTTGCAGATTAAACCCATTAATTTAGAGAAATTATGCCTTCCAGATTTAGAAGCCATTCCAACAATGAATTTGAAATCTTCAAGTTGCAATCTGTCAAAGCGTAGTTTC
ATCAGTGTGGACAATCAGTTACAAAGGATAGAAACTTTGAAATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGTCCATTGGC
ATCTGTATCAGCCCTAAATAGACGAATTTCGCTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGACATTGGCCAATCTCCAGCAAGAGATCCTTACCTTTTTGAAC
TCAGTAATTACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTATTTCTAAATTGAAGTCACTTTTAACCAAAGATAGCGGGACTGTAGCAAATGGAATTAAGCCA
TCCAAAATTCTTTTTGGAGATGTTGATTCCATGTCTAAAATGTCTTCAAGTAATGTTTTAAATGTCCCCCAAGTTGGTGTCGATACTGCCTTAAGTGGAACTCACGCCAG
CATGGAAACTAAAGATGTTAGTGGCAGCCGCACAGAAGTGGAAGTAAATGAGAAATTGAGTTGCCTTGAAGATGTTGTGGCTAATATGCAGATGGAAGATCACGAAGGAT
CAGCTTCTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTACCCAGTTGGCATTCAGAGTCAGTTGGATCAAGCAACTGCTACTTGTACTGAAAATATTGTC
GATGGGCCATCTAGATGCAGTGGAATGGATCACGCCGATGAGATGGAAGATCACGAAGGATTAGCTATTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTA
CCCGGTTGGCATTCAGAGTCAGTTGGATCAATCAACTGCTACTTGTACTGAAAATATTGTCAACGGGCCGTCTAGAAGCAGTGGAACGGATCACCACGATGAGGAACAGG
TCAAGCCAAAATCTCGTGCAAACAAACAACGGAAAGGCAAAAAGATTTCTGGGAGGCAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACC
AGGTTCAAAACACGACCGTTGGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGCGTACATGAGAGCCTAGCAACAGTAATCGGGTTGAAGTATGTATCTCCTGCAAA
AGGAAATGGCCAACCAACTATGAAGGTGAAGTCTTTAGTCTCCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTTCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGACCGAAGAGGCTCGACACTCCGATGTGATCGATCCACTTGCTGCTTATTCTGGTATCAATCTCTTTTCGAACGCATTTCGTACTTTGCGGGATCCGTCAAAGCC
ACATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTGTCAAGAAGTCCCAGTAAACTTGTAGAGCAGGCCAGATCAATTTTAGACGGGAACT
CAAATTTGATGCAATCTGAAGCTGCCACATTTCTTGTAAAGAATGAGAAAGATGAGGAAGCTACAGTGAAGGTGGAGGAAAATCTTCATGAAAGAAGGCCGGCCTTAAAC
CGAAAGCGGGCTAGGTTCTCTTTAAAACCTGATGCTAGACAACCTTCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGATCCCGAGGAGTTCTTTTTGGC
CTACGAAAGGCTTGAAAATGCCAAAAAAGAAATTCAAAAACAGACAGGAGCAGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAAACGCCAGCGTAGACCAG
GGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATGAGAATGTAGATCCTTCTCAAGTGACGCTTGAATCAGGTAGCATCAGT
CCATCGATATTGGGCACAGAGACACACCCAAGTCCACATATAATTGACTCGGAAAAGAAAACTGATGAAGATGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGGAGTTCGT
TGGTTCAGTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGATGAATTACTCTCTGCCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGGAGT
GCTTGCAGATTAAACCCATTAATTTAGAGAAATTATGCCTTCCAGATTTAGAAGCCATTCCAACAATGAATTTGAAATCTTCAAGTTGCAATCTGTCAAAGCGTAGTTTC
ATCAGTGTGGACAATCAGTTACAAAGGATAGAAACTTTGAAATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGTCCATTGGC
ATCTGTATCAGCCCTAAATAGACGAATTTCGCTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGACATTGGCCAATCTCCAGCAAGAGATCCTTACCTTTTTGAAC
TCAGTAATTACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTATTTCTAAATTGAAGTCACTTTTAACCAAAGATAGCGGGACTGTAGCAAATGGAATTAAGCCA
TCCAAAATTCTTTTTGGAGATGTTGATTCCATGTCTAAAATGTCTTCAAGTAATGTTTTAAATGTCCCCCAAGTTGGTGTCGATACTGCCTTAAGTGGAACTCACGCCAG
CATGGAAACTAAAGATGTTAGTGGCAGCCGCACAGAAGTGGAAGTAAATGAGAAATTGAGTTGCCTTGAAGATGTTGTGGCTAATATGCAGATGGAAGATCACGAAGGAT
CAGCTTCTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTACCCAGTTGGCATTCAGAGTCAGTTGGATCAAGCAACTGCTACTTGTACTGAAAATATTGTC
GATGGGCCATCTAGATGCAGTGGAATGGATCACGCCGATGAGATGGAAGATCACGAAGGATTAGCTATTGAGCAACCAAACTCATCCAAGGTGGATGTGATCAAAGAGTA
CCCGGTTGGCATTCAGAGTCAGTTGGATCAATCAACTGCTACTTGTACTGAAAATATTGTCAACGGGCCGTCTAGAAGCAGTGGAACGGATCACCACGATGAGGAACAGG
TCAAGCCAAAATCTCGTGCAAACAAACAACGGAAAGGCAAAAAGATTTCTGGGAGGCAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACC
AGGTTCAAAACACGACCGTTGGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGCGTACATGAGAGCCTAGCAACAGTAATCGGGTTGAAGTATGTATCTCCTGCAAA
AGGAAATGGCCAACCAACTATGAAGGTGAAGTCTTTAGTCTCCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTTCACTAA
Protein sequenceShow/hide protein sequence
MVTEEARHSDVIDPLAAYSGINLFSNAFRTLRDPSKPHDLGTDLDGIHKHLKSMVSRSPSKLVEQARSILDGNSNLMQSEAATFLVKNEKDEEATVKVEENLHERRPALN
RKRARFSLKPDARQPSVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGAVLKDLNQQNPSTNKRQRRPGILGRSVRYKHQYSSITTEDDENVDPSQVTLESGSIS
PSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEFVGSVTKAENKVNKILDELLSANCEDLEGDRAINILQECLQIKPINLEKLCLPDLEAIPTMNLKSSSCNLSKRSF
ISVDNQLQRIETLKSKQDDETLVNPVSTPSSIRSPLASVSALNRRISLSNSSGDPFSAHDIGQSPARDPYLFELSNYLSDAVGIAEQSSISKLKSLLTKDSGTVANGIKP
SKILFGDVDSMSKMSSSNVLNVPQVGVDTALSGTHASMETKDVSGSRTEVEVNEKLSCLEDVVANMQMEDHEGSASEQPNSSKVDVIKEYPVGIQSQLDQATATCTENIV
DGPSRCSGMDHADEMEDHEGLAIEQPNSSKVDVIKEYPVGIQSQLDQSTATCTENIVNGPSRSSGTDHHDEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRST
RFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTMKVKSLVSNEYKDLVELAALH