; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC09G164860 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC09G164860
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptioncentromere protein C-like isoform X1
Genome locationCicolChr09:2001420..2007322
RNA-Seq ExpressionCcUC09G164860
SyntenyCcUC09G164860
Gene Ontology termsGO:0051315 - attachment of mitotic spindle microtubules to kinetochore (biological process)
GO:0051382 - kinetochore assembly (biological process)
GO:0051455 - attachment of spindle microtubules to kinetochore involved in homologous chromosome segregation (biological process)
GO:0000776 - kinetochore (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0019237 - centromeric DNA binding (molecular function)
InterPro domainsIPR028386 - Centromere protein C/Mif2/cnp3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058804.1 uncharacterized protein E6C27_scaffold339G002780 [Cucumis melo var. makuwa]6.4e-31073.04Show/hide
Query:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV+EE R SDVIDPLAAYSGINLFP+AFGTL D SKP+DLGTDLDGIHK LKSMV                                             
Subjt:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                       R+PSKL+EQARSILDGNS  M SEAATFLVKNEKNE A+VK EENP ERRPALNRKRARFSLKPDA QP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S    P  LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR
        TETHPSPHIIDSEKKTDEDVAFEEEEEEEE       TKAEN+VN IL E LS +CEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR

Query:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL
        NLSKRSLISVDN LQ+TETLKSK+D+E LVN VSTPSS+RSPL SLSA NRR+SLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKL
Subjt:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL

Query:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN
        K LLT+DGGT+ANG + SKIL GD DSMSKISSSN+LNV QVGG+TALSGT+AS +AK+ SG ST+VE+N+KLSCLEAQADVVANM++ D +GSASEQP 
Subjt:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN

Query:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL
         S VD+I+EYPVG +SQLDQS ATCTENIVD  SRS GT HH EME+H+G ASEQPN SKVD+IKEYPVGIQ QL                         
Subjt:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL

Query:  TDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYV
         DQS T TC E IVDG+SRSSGT+HH+EEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYV
Subjt:  TDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYV

Query:  SPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        SP KGNG+PT+KVKSLVSNEYKDLV+LAALH
Subjt:  SPAKGNGQPTLKVKSLVSNEYKDLVELAALH

XP_011659552.1 centromere protein C isoform X3 [Cucumis sativus]0.0e+0073.77Show/hide
Query:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF
        M +EEARHSDVIDPLAAYSGINLF +AFGTLPD SKP+DLGTDLDGIHK LKSMV                                             
Subjt:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                       R+PSKL+EQARSILDGNSN M SEAATFLVKNEKNEEATVK EEN  ERRPALNRKRARFSLKPDARQP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSI TEDDQNVDPSQVTF+S    P  LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR
        TETHPSPHIIDSEKKTDEDVAFEEEEEEEE       TKAEN++N IL E LS +CEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKSS  
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR

Query:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL
        NLSKRSLISVDN LQ+ E LKSKQD+  LVNPVSTPSS+RSPL SLSA NRR+SLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  EQSSVSKL
Subjt:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL

Query:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN
        K LLT+DGGTVANG K SKIL GD DSMS ISSSN+LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+N+KLSCLEAQAD VANM++ED EGSASEQP 
Subjt:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN

Query:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL
         S VD+IKEYPVG +SQLDQS ATCTENIVD  SRS GT H  EME+H+G ASEQP  SKVDVIKEYPV IQSQL                         
Subjt:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL

Query:  TDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYV
         DQS T TC ENI DG+SRSSGT+HH+ EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYV
Subjt:  TDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYV

Query:  SPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        SPAKGNG+PT+KVKSLVSNEYKDLVELAALH
Subjt:  SPAKGNGQPTLKVKSLVSNEYKDLVELAALH

XP_031745135.1 centromere protein C isoform X1 [Cucumis sativus]0.0e+0073.06Show/hide
Query:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF
        M +EEARHSDVIDPLAAYSGINLF +AFGTLPD SKP+DLGTDLDGIHK LKSMV                                             
Subjt:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                       R+PSKL+EQARSILDGNSN M SEAATFLVKNEKNEEATVK EEN  ERRPALNRKRARFSLKPDARQP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILG--------RSVRYKHQYSSITTEDDQNVDPSQVTFESD
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILG        RSVRYKHQYSSI TEDDQNVDPSQVTF+S 
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILG--------RSVRYKHQYSSITTEDDQNVDPSQVTFESD

Query:  SIGPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPT
           P  LGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE       TKAEN++N IL E LS +CEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPT
Subjt:  SIGPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPT

Query:  MNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIA
        MNLKSS  NLSKRSLISVDN LQ+ E LKSKQD+  LVNPVSTPSS+RSPL SLSA NRR+SLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  
Subjt:  MNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIA

Query:  EQSSVSKLKSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNDKLSCLEAQADVVANMRMEDLE
        EQSSVSKLK LLT+DGGTVANG K SKIL GD DSMS ISSSN+LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+N+KLSCLEAQAD VANM++ED E
Subjt:  EQSSVSKLKSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNDKLSCLEAQADVVANMRMEDLE

Query:  GSASEQPNSSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDF
        GSASEQP  S VD+IKEYPVG +SQLDQS ATCTENIVD  SRS GT H  EME+H+G ASEQP  SKVDVIKEYPV IQSQL                 
Subjt:  GSASEQPNSSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDF

Query:  GLCVVCFLTDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLA
                 DQS T TC ENI DG+SRSSGT+HH+ EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL 
Subjt:  GLCVVCFLTDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLA

Query:  TVIGLKYVSPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        TVIGLKYVSPAKGNG+PT+KVKSLVSNEYKDLVELAALH
Subjt:  TVIGLKYVSPAKGNGQPTLKVKSLVSNEYKDLVELAALH

XP_031745137.1 centromere protein C isoform X4 [Cucumis sativus]0.0e+0073.65Show/hide
Query:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF
        M +EEARHSDVIDPLAAYSGINLF +AFGTLPD SKP+DLGTDLDGIHK LKSMV                                             
Subjt:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                       R+PSKL+EQARSILDGNSN M SEAATFLVKNEKNEEATVK EEN  ERRPALNRKRARFSLKPDARQP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILG SVRYKHQYSSI TEDDQNVDPSQVTF+S    P  LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR
        TETHPSPHIIDSEKKTDEDVAFEEEEEEEE       TKAEN++N IL E LS +CEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKSS  
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR

Query:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL
        NLSKRSLISVDN LQ+ E LKSKQD+  LVNPVSTPSS+RSPL SLSA NRR+SLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  EQSSVSKL
Subjt:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL

Query:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN
        K LLT+DGGTVANG K SKIL GD DSMS ISSSN+LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+N+KLSCLEAQAD VANM++ED EGSASEQP 
Subjt:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN

Query:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL
         S VD+IKEYPVG +SQLDQS ATCTENIVD  SRS GT H  EME+H+G ASEQP  SKVDVIKEYPV IQSQL                         
Subjt:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL

Query:  TDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYV
         DQS T TC ENI DG+SRSSGT+HH+ EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYV
Subjt:  TDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYV

Query:  SPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        SPAKGNG+PT+KVKSLVSNEYKDLVELAALH
Subjt:  SPAKGNGQPTLKVKSLVSNEYKDLVELAALH

XP_038896841.1 centromere protein C isoform X2 [Benincasa hispida]0.0e+0075.72Show/hide
Query:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV++EARHSD IDPLAAYSGINLF SAFGTLPD SKP+DLG DLDGIHKHLKSM                                              
Subjt:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                     VSR+PSKLIEQARSILDGNSN MQSEAATFLVKNEKNEEATVK EENP ERRPALNRKRARFSLKPDARQP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG
        PVNLEPTFDIKQLKDPEEFFLAYER ENAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFES  I P ++G
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFV--VTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSS
        TETHPSPHIIDS  KTDEDVAF       EEEEEFV  VTKAENKVNKIL ELLS +C DLEGDRAINILQE LQIKP NLEKLCLPDLEAI TM LKSS
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFV--VTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSS

Query:  SRNLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVS
        S NLSKRSLISV N LQR ETLKSKQDDE LVNP+S PSSIRSPL SLSA NRR+SLSNSSGDPFSAHGIDQSPAR PYLF L+N+LSDA GIAEQSSVS
Subjt:  SRNLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVS

Query:  KLKSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQ
        KLKSLLTKDGGTVANG K SKILF D DSMSKISSS VLNVP+VG +T LSGTH SMEAKD S GS EVEVN+KLSCLE Q D VANM+MED EGSASEQ
Subjt:  KLKSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQ

Query:  PNSSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVC
        PNSS VD+IKEYPVG QSQLDQSTA C ENI D  SRS GT+HH EME+HKG ASEQPN S VDVIKEYPVG+Q QL                       
Subjt:  PNSSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVC

Query:  FLTDQSTATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKY
           DQ TATCTENI DG SRSSGT+H NEEQ KPKSRANKQ +GKKISGRQSLAGAGTTWQ GVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKY
Subjt:  FLTDQSTATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKY

Query:  VSPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        VSPAKGNGQP +KVKSLVSNEYKDLVELAALH
Subjt:  VSPAKGNGQPTLKVKSLVSNEYKDLVELAALH

TrEMBL top hitse value%identityAlignment
A0A0A0K774 Uncharacterized protein0.0e+0073.77Show/hide
Query:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF
        M +EEARHSDVIDPLAAYSGINLF +AFGTLPD SKP+DLGTDLDGIHK LKSMV                                             
Subjt:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                       R+PSKL+EQARSILDGNSN M SEAATFLVKNEKNEEATVK EEN  ERRPALNRKRARFSLKPDARQP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSI TEDDQNVDPSQVTF+S    P  LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR
        TETHPSPHIIDSEKKTDEDVAFEEEEEEEE       TKAEN++N IL E LS +CEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKSS  
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR

Query:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL
        NLSKRSLISVDN LQ+ E LKSKQD+  LVNPVSTPSS+RSPL SLSA NRR+SLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  EQSSVSKL
Subjt:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL

Query:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN
        K LLT+DGGTVANG K SKIL GD DSMS ISSSN+LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+N+KLSCLEAQAD VANM++ED EGSASEQP 
Subjt:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN

Query:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL
         S VD+IKEYPVG +SQLDQS ATCTENIVD  SRS GT H  EME+H+G ASEQP  SKVDVIKEYPV IQSQL                         
Subjt:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL

Query:  TDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYV
         DQS T TC ENI DG+SRSSGT+HH+ EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYV
Subjt:  TDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYV

Query:  SPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        SPAKGNG+PT+KVKSLVSNEYKDLVELAALH
Subjt:  SPAKGNGQPTLKVKSLVSNEYKDLVELAALH

A0A1S3CDU5 uncharacterized protein LOC103499749 isoform X21.3e-30672.6Show/hide
Query:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV+EE R SDVIDPLAAYSGINLFP+AFGTL D SKP+DLGTDLDGIHK LKSMV                                             
Subjt:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                       R+PSKL+EQARSILDGNS  M SEAATFLVKNEKNE A+VK EENP ERRPALNRKRARFSLKPDA QP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S    P  LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR
        TETHPSPHIIDSEKKTDEDVAFEEEEEEEE       TKAEN+VN IL E LS +CEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR

Query:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL
        NLSKRSLISVDN LQ+TETLKSK+D+E LVN VSTPSS+RSPL SLSA NRR+SLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKL
Subjt:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL

Query:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN
        K LLT+DGGT+ANG + SKIL GD DSMSKISSSN+LNV QVG +TALSGT+AS +AK+ SG ST+VE+N+KLSCLEAQADVVANM++ D +GSASEQP 
Subjt:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN

Query:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL
         S VD+I+EYPVG +SQLDQS ATCTENIVD  SRS GT HH EME+H+G ASEQPN SKVD+IKEYPVGIQ QL                         
Subjt:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL

Query:  TDQS--TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKY
         DQS  T TC E IVDG+SRSSGT+HH+E  VKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKY
Subjt:  TDQS--TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKY

Query:  VSPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        VSP KGNG+PT+KVKSLVSNEYKDLV+LAALH
Subjt:  VSPAKGNGQPTLKVKSLVSNEYKDLVELAALH

A0A1S3CDU7 uncharacterized protein LOC103499749 isoform X13.7e-30972.84Show/hide
Query:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV+EE R SDVIDPLAAYSGINLFP+AFGTL D SKP+DLGTDLDGIHK LKSMV                                             
Subjt:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                       R+PSKL+EQARSILDGNS  M SEAATFLVKNEKNE A+VK EENP ERRPALNRKRARFSLKPDA QP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S    P  LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR
        TETHPSPHIIDSEKKTDEDVAFEEEEEEEE       TKAEN+VN IL E LS +CEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR

Query:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL
        NLSKRSLISVDN LQ+TETLKSK+D+E LVN VSTPSS+RSPL SLSA NRR+SLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKL
Subjt:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL

Query:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN
        K LLT+DGGT+ANG + SKIL GD DSMSKISSSN+LNV QVG +TALSGT+AS +AK+ SG ST+VE+N+KLSCLEAQADVVANM++ D +GSASEQP 
Subjt:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN

Query:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL
         S VD+I+EYPVG +SQLDQS ATCTENIVD  SRS GT HH EME+H+G ASEQPN SKVD+IKEYPVGIQ QL                         
Subjt:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL

Query:  TDQS--TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKY
         DQS  T TC E IVDG+SRSSGT+HH+EEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKY
Subjt:  TDQS--TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKY

Query:  VSPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        VSP KGNG+PT+KVKSLVSNEYKDLV+LAALH
Subjt:  VSPAKGNGQPTLKVKSLVSNEYKDLVELAALH

A0A1S4E341 uncharacterized protein LOC103499749 isoform X34.1e-28969.83Show/hide
Query:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV+EE R SDVIDPLAAYSGINLFP+AFGTL D SKP+DLGTDLDGIHK LKSMV                                             
Subjt:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                       R+PSKL+EQARSILDGNS  M SEAATFLVKNEKNE A+VK EENP ERRPALNRKRARFSLKPDA QP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S    P  LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR
        TETHPSPHIIDSEKKTDEDVAFEEEEEEEE       TKAEN+VN IL E LS +CEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR

Query:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL
        NLSKRSLISVDN LQ+TETLKSK+D+E LVN VSTPSS+RSPL SLSA NRR+SLSNSS                             VGI E SSVSKL
Subjt:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL

Query:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN
        K LLT+DGGT+ANG + SKIL GD DSMSKISSSN+LNV QVG +TALSGT+AS +AK+ SG ST+VE+N+KLSCLEAQADVVANM++ D +GSASEQP 
Subjt:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN

Query:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL
         S VD+I+EYPVG +SQLDQS ATCTENIVD  SRS GT HH EME+H+G ASEQPN SKVD+IKEYPVGIQ QL                         
Subjt:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL

Query:  TDQS--TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKY
         DQS  T TC E IVDG+SRSSGT+HH+EEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKY
Subjt:  TDQS--TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKY

Query:  VSPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        VSP KGNG+PT+KVKSLVSNEYKDLV+LAALH
Subjt:  VSPAKGNGQPTLKVKSLVSNEYKDLVELAALH

A0A5A7UUE4 Uncharacterized protein3.1e-31073.04Show/hide
Query:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV+EE R SDVIDPLAAYSGINLFP+AFGTL D SKP+DLGTDLDGIHK LKSMV                                             
Subjt:  MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                       R+PSKL+EQARSILDGNS  M SEAATFLVKNEKNE A+VK EENP ERRPALNRKRARFSLKPDA QP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S    P  LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR
        TETHPSPHIIDSEKKTDEDVAFEEEEEEEE       TKAEN+VN IL E LS +CEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSR

Query:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL
        NLSKRSLISVDN LQ+TETLKSK+D+E LVN VSTPSS+RSPL SLSA NRR+SLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKL
Subjt:  NLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKL

Query:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN
        K LLT+DGGT+ANG + SKIL GD DSMSKISSSN+LNV QVGG+TALSGT+AS +AK+ SG ST+VE+N+KLSCLEAQADVVANM++ D +GSASEQP 
Subjt:  KSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPN

Query:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL
         S VD+I+EYPVG +SQLDQS ATCTENIVD  SRS GT HH EME+H+G ASEQPN SKVD+IKEYPVGIQ QL                         
Subjt:  SSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFL

Query:  TDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYV
         DQS T TC E IVDG+SRSSGT+HH+EEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYV
Subjt:  TDQS-TATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYV

Query:  SPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        SP KGNG+PT+KVKSLVSNEYKDLV+LAALH
Subjt:  SPAKGNGQPTLKVKSLVSNEYKDLVELAALH

SwissProt top hitse value%identityAlignment
Q66LG9 Centromere protein C1.4e-4428.79Show/hide
Query:  EAATFLVKNEKNEEATVKVEE-----------NPHERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKD
        ++  F +++E  E+A   +E+           N  ERRP L+RKR  FSL     QPP  + P+FD  +    E+FF AY++ E A +E QKQTGS + D
Subjt:  EAATFLVKNEKNEEATVKVEE-----------NPHERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKD

Query:  LNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENK
        + +  PS   R RRPGI GR  R                 P + +F +DS    ++  E       I SE+  +   A      + E ++  V T  +  
Subjt:  LNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENK

Query:  VNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSP
        +N +L +LL+ S E+LEGD AI +L+ERLQIK  N+EK  +P+ + +  MNLK+S  N  +++SL  + N L+ T  +  +++  +      +P +I   
Subjt:  VNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSP

Query:  LGSLSAFNRRVSLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGTKLSKILFGD------A
                +  S  N   D FS   I      DQ P+     P   ++ N     VG  + +S     +     +D   + +G   S +           
Subjt:  LGSLSAFNRRVSLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGTKLSKILFGD------A

Query:  DSMSKISSSNV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNDKLSCLE-----AQADVVANMRMED-----LEGSASEQPNSSMVDVIKEYPVG
        DS+S  SS+ +   +++   G +  +  + +           + E+N++   LE     A  +V     +E+      +G++S+ PN +     ++Y   
Subjt:  DSMSKISSSNV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNDKLSCLE-----AQADVVANMRMED-----LEGSASEQPNSSMVDVIKEYPVG

Query:  TQSQLDQSTATCTENIVDELSRSRGTNHHIEMEN--------HKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFLTDQST
          S      A   + + +E + + G+   +++EN        HK     +   S    +K+    +  + G        P +                  
Subjt:  TQSQLDQSTATCTENIVDELSRSRGTNHHIEMEN--------HKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFLTDQST

Query:  ATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKG-
                    ++ G ++  EE+ KPK       +GK  S R+SLA AGT  + GVRRSTR K+RPLEYW+GER LYGR+HESL TVIG+KY SP +G 
Subjt:  ATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKG-

Query:  NGQPTLKVKSLVSNEYKDLVELAALH
              KVKS VS+EYK LV+ AALH
Subjt:  NGQPTLKVKSLVSNEYKDLVELAALH

Arabidopsis top hitse value%identityAlignment
AT1G15660.1 centromere protein C1.0e-4528.79Show/hide
Query:  EAATFLVKNEKNEEATVKVEE-----------NPHERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKD
        ++  F +++E  E+A   +E+           N  ERRP L+RKR  FSL     QPP  + P+FD  +    E+FF AY++ E A +E QKQTGS + D
Subjt:  EAATFLVKNEKNEEATVKVEE-----------NPHERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKD

Query:  LNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENK
        + +  PS   R RRPGI GR  R                 P + +F +DS    ++  E       I SE+  +   A      + E ++  V T  +  
Subjt:  LNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEEEEEEFVVTKAENK

Query:  VNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSP
        +N +L +LL+ S E+LEGD AI +L+ERLQIK  N+EK  +P+ + +  MNLK+S  N  +++SL  + N L+ T  +  +++  +      +P +I   
Subjt:  VNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSP

Query:  LGSLSAFNRRVSLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGTKLSKILFGD------A
                +  S  N   D FS   I      DQ P+     P   ++ N     VG  + +S     +     +D   + +G   S +           
Subjt:  LGSLSAFNRRVSLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGTKLSKILFGD------A

Query:  DSMSKISSSNV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNDKLSCLE-----AQADVVANMRMED-----LEGSASEQPNSSMVDVIKEYPVG
        DS+S  SS+ +   +++   G +  +  + +           + E+N++   LE     A  +V     +E+      +G++S+ PN +     ++Y   
Subjt:  DSMSKISSSNV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNDKLSCLE-----AQADVVANMRMED-----LEGSASEQPNSSMVDVIKEYPVG

Query:  TQSQLDQSTATCTENIVDELSRSRGTNHHIEMEN--------HKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFLTDQST
          S      A   + + +E + + G+   +++EN        HK     +   S    +K+    +  + G        P +                  
Subjt:  TQSQLDQSTATCTENIVDELSRSRGTNHHIEMEN--------HKGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFLTDQST

Query:  ATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKG-
                    ++ G ++  EE+ KPK       +GK  S R+SLA AGT  + GVRRSTR K+RPLEYW+GER LYGR+HESL TVIG+KY SP +G 
Subjt:  ATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKG-

Query:  NGQPTLKVKSLVSNEYKDLVELAALH
              KVKS VS+EYK LV+ AALH
Subjt:  NGQPTLKVKSLVSNEYKDLVELAALH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAGCGAAGAAGCTCGACACTCCGATGTCATCGATCCTCTTGCTGCTTATTCTGGTATCAATCTCTTTCCGAGCGCATTTGGTACTTTGCCGGATCAGTCAAAGCC
ATATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTATTTGCATCTTTCGATACTTGTTTAACTGCGCCAGAGTGTGGCGTGGTTTTCCTTC
TTAGTCCTAAAAGATTTAGCTCAATGGTTGAATTGGAGACTCGCCTTGCTTCTCATTTCTCCCATCTAATTTCCATTTTCTCATCATCTCGGCACCTCAGGAAGCATTTT
GTATCTATTTCTAGTTTTGGTTTAATTGACGATCAACGACTCTCCCAAATTTGTCAGGTGTCAAGAACTCCCAGTAAACTTATAGAGCAGGCCAGATCAATTTTAGACGG
GAACTCAAATTGGATGCAATCTGAAGCTGCCACATTTCTTGTGAAGAATGAGAAAAATGAGGAAGCTACGGTGAAGGTGGAGGAAAATCCACATGAAAGAAGGCCGGCCT
TAAACCGAAAGCGGGCTAGGTTCTCTTTAAAACCTGATGCTAGACAACCTCCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGACCCCGAGGAGTTCTTT
TTGGCCTATGAAAGGCTTGAAAATGCCAAAAAAGAAATCCAAAAACAGACAGGATCTGTTTTGAAGGACTTGAACCAACAAAATCCATCCACAAATAACCGTCAGCGTAG
ACCAGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATCAGAATGTAGATCCTTCTCAAGTGACATTTGAATCAGATAGCA
TCGGTCCATCGATTTTGGGCACAGAAACACACCCAAGTCCACATATAATTGACTCAGAAAAGAAAACTGATGAAGACGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGGAG
GAGGAGGAGGAGTTCGTTGTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGGTGAATTACTCTCTGCTAGTTGTGAAGATCTAGAAGGTGATCGAGCCATCAACAT
ATTACAGGAGCGCTTGCAGATTAAACCCATTAATTTAGAGAAATTATGTCTTCCAGATTTGGAAGCCATTCCGACAATGAATTTGAAATCTTCAAGTCGCAATCTTTCAA
AGCGTAGTTTGATCAGCGTGGACAATCATTTACAAAGGACGGAAACTTTGAAATCCAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGA
AGCCCATTGGGCTCATTATCAGCCTTCAATAGACGAGTTTCACTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGGAATTGACCAATCTCCAGCAAGAGGTCCTTA
CCTTTTTGAACTCAGTAATCACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTGTTTCTAAATTGAAGTCACTTTTAACCAAAGACGGCGGGACTGTAGCAAATG
GAACTAAGCTATCCAAAATTCTTTTTGGAGATGCTGATTCCATGTCTAAAATATCTTCAAGTAATGTTTTAAATGTACCCCAAGTTGGTGGCGATACTGCCTTAAGTGGA
ACTCACGCCAGCATGGAAGCTAAAGATGATAGTGGCAGCACAGAAGTGGAAGTAAATGACAAATTAAGTTGTCTTGAAGCCCAAGCAGATGTTGTGGCTAATATGCGGAT
GGAAGATCTCGAAGGATCAGCTTCTGAGCAACCAAACTCATCCATGGTGGACGTGATCAAAGAGTACCCAGTTGGCACTCAGAGTCAGTTGGATCAATCAACTGCTACCT
GTACTGAAAATATTGTCGATGAGCTGTCTAGAAGCAGAGGAACAAATCACCACATTGAGATGGAAAATCACAAAGGATTAGCTTCTGAGCAACCAAACTTATCCAAGGTG
GATGTGATCAAAGAGTACCCGGTTGGCATTCAGAGTCAGTTGGGTATGATCTTCAATGCCAGTATTTTCCCAGTAGATGGATTAGATGATTTTGGGCTGTGTGTTGTTTG
TTTTCTAACAGATCAATCAACTGCTACTTGTACTGAAAATATTGTCGATGGGTCGTCTAGAAGCAGTGGAACGAATCACCACAACGAGGAACAGGTCAAGCCAAAATCTC
GTGCAAACAAACAACGCAAAGGCAAAAAGATTTCTGGGAGGCAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACCAGGTTCAAAACACGA
CCGTTGGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGCGTACATGAGAGCCTAGCAACAGTAATCGGGTTGAAGTATGTGTCTCCTGCAAAAGGAAATGGCCAACC
AACTCTGAAGGTGAAGTCTTTAGTCTCCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTGCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGAGCGAAGAAGCTCGACACTCCGATGTCATCGATCCTCTTGCTGCTTATTCTGGTATCAATCTCTTTCCGAGCGCATTTGGTACTTTGCCGGATCAGTCAAAGCC
ATATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTATTTGCATCTTTCGATACTTGTTTAACTGCGCCAGAGTGTGGCGTGGTTTTCCTTC
TTAGTCCTAAAAGATTTAGCTCAATGGTTGAATTGGAGACTCGCCTTGCTTCTCATTTCTCCCATCTAATTTCCATTTTCTCATCATCTCGGCACCTCAGGAAGCATTTT
GTATCTATTTCTAGTTTTGGTTTAATTGACGATCAACGACTCTCCCAAATTTGTCAGGTGTCAAGAACTCCCAGTAAACTTATAGAGCAGGCCAGATCAATTTTAGACGG
GAACTCAAATTGGATGCAATCTGAAGCTGCCACATTTCTTGTGAAGAATGAGAAAAATGAGGAAGCTACGGTGAAGGTGGAGGAAAATCCACATGAAAGAAGGCCGGCCT
TAAACCGAAAGCGGGCTAGGTTCTCTTTAAAACCTGATGCTAGACAACCTCCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGACCCCGAGGAGTTCTTT
TTGGCCTATGAAAGGCTTGAAAATGCCAAAAAAGAAATCCAAAAACAGACAGGATCTGTTTTGAAGGACTTGAACCAACAAAATCCATCCACAAATAACCGTCAGCGTAG
ACCAGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATCAGAATGTAGATCCTTCTCAAGTGACATTTGAATCAGATAGCA
TCGGTCCATCGATTTTGGGCACAGAAACACACCCAAGTCCACATATAATTGACTCAGAAAAGAAAACTGATGAAGACGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGGAG
GAGGAGGAGGAGTTCGTTGTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGGTGAATTACTCTCTGCTAGTTGTGAAGATCTAGAAGGTGATCGAGCCATCAACAT
ATTACAGGAGCGCTTGCAGATTAAACCCATTAATTTAGAGAAATTATGTCTTCCAGATTTGGAAGCCATTCCGACAATGAATTTGAAATCTTCAAGTCGCAATCTTTCAA
AGCGTAGTTTGATCAGCGTGGACAATCATTTACAAAGGACGGAAACTTTGAAATCCAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGA
AGCCCATTGGGCTCATTATCAGCCTTCAATAGACGAGTTTCACTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGGAATTGACCAATCTCCAGCAAGAGGTCCTTA
CCTTTTTGAACTCAGTAATCACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTGTTTCTAAATTGAAGTCACTTTTAACCAAAGACGGCGGGACTGTAGCAAATG
GAACTAAGCTATCCAAAATTCTTTTTGGAGATGCTGATTCCATGTCTAAAATATCTTCAAGTAATGTTTTAAATGTACCCCAAGTTGGTGGCGATACTGCCTTAAGTGGA
ACTCACGCCAGCATGGAAGCTAAAGATGATAGTGGCAGCACAGAAGTGGAAGTAAATGACAAATTAAGTTGTCTTGAAGCCCAAGCAGATGTTGTGGCTAATATGCGGAT
GGAAGATCTCGAAGGATCAGCTTCTGAGCAACCAAACTCATCCATGGTGGACGTGATCAAAGAGTACCCAGTTGGCACTCAGAGTCAGTTGGATCAATCAACTGCTACCT
GTACTGAAAATATTGTCGATGAGCTGTCTAGAAGCAGAGGAACAAATCACCACATTGAGATGGAAAATCACAAAGGATTAGCTTCTGAGCAACCAAACTTATCCAAGGTG
GATGTGATCAAAGAGTACCCGGTTGGCATTCAGAGTCAGTTGGGTATGATCTTCAATGCCAGTATTTTCCCAGTAGATGGATTAGATGATTTTGGGCTGTGTGTTGTTTG
TTTTCTAACAGATCAATCAACTGCTACTTGTACTGAAAATATTGTCGATGGGTCGTCTAGAAGCAGTGGAACGAATCACCACAACGAGGAACAGGTCAAGCCAAAATCTC
GTGCAAACAAACAACGCAAAGGCAAAAAGATTTCTGGGAGGCAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACCAGGTTCAAAACACGA
CCGTTGGAGTACTGGAAAGGTGAAAGGCTGTTGTACGGACGCGTACATGAGAGCCTAGCAACAGTAATCGGGTTGAAGTATGTGTCTCCTGCAAAAGGAAATGGCCAACC
AACTCTGAAGGTGAAGTCTTTAGTCTCCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTGCACTAAGGGTCGTGTACAAAAAGGAAGAAAAAGCCTTGAAACCTTT
TGGATTTTGCATGTATAACTAGCAATTCTCTTTGAATATAAATAGCATCTAGTCTCTGTGGAAAGACTATGGAGGAGAATTAGGCTAATGCCATTACATTGTATATTTCT
TCGCCCTTCTCTATCATATATATATATCTATCAAGCTGTTTCGCTTGTGTGTTTTTGCTCATATACTTGTGCATGTGATTTCATATTTTACCCACTGACATTTATCTGAT
TGTACCAATGTTCTAGAATGAGCTGTAAACATTTGGCCAACTTGTTTTTGACTCACTCACTCCTTTTTGGCTGGGGGCCAAGGCAGCTAGTTTTATAGCAAAATATTGCA
GTTTGGCTTGTGTGTGTGTCTATATTGAGTTTAACAATATTATGGGCGGGAGGATTTGA
Protein sequenceShow/hide protein sequence
MVSEEARHSDVIDPLAAYSGINLFPSAFGTLPDQSKPYDLGTDLDGIHKHLKSMVFASFDTCLTAPECGVVFLLSPKRFSSMVELETRLASHFSHLISIFSSSRHLRKHF
VSISSFGLIDDQRLSQICQVSRTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFF
LAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSIGPSILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEE
EEEEFVVTKAENKVNKILGELLSASCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIR
SPLGSLSAFNRRVSLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGTKLSKILFGDADSMSKISSSNVLNVPQVGGDTALSG
THASMEAKDDSGSTEVEVNDKLSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQSQLDQSTATCTENIVDELSRSRGTNHHIEMENHKGLASEQPNLSKV
DVIKEYPVGIQSQLGMIFNASIFPVDGLDDFGLCVVCFLTDQSTATCTENIVDGSSRSSGTNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTR
PLEYWKGERLLYGRVHESLATVIGLKYVSPAKGNGQPTLKVKSLVSNEYKDLVELAALH