; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G02490 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G02490
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptioncentromere protein C-like isoform X1
Genome locationClcChr09:1979837..1985679
RNA-Seq ExpressionClc09G02490
SyntenyClc09G02490
Gene Ontology termsGO:0051315 - attachment of mitotic spindle microtubules to kinetochore (biological process)
GO:0051382 - kinetochore assembly (biological process)
GO:0051455 - attachment of spindle microtubules to kinetochore involved in homologous chromosome segregation (biological process)
GO:0000776 - kinetochore (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0019237 - centromeric DNA binding (molecular function)
InterPro domainsIPR028386 - Centromere protein C/Mif2/cnp3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058804.1 uncharacterized protein E6C27_scaffold339G002780 [Cucumis melo var. makuwa]0.0e+0073.67Show/hide
Query:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV+EE R SDV+DPLAAYSGINLFP+AFGTL D SKP+DLGTDLDGIHK LKSMVL S                                          
Subjt:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                         PSKL+EQARSILDGNS  M SEAATFLVKNEKNE A+VK EENP ERRPALNRKRARFSLKPDA QP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S   SPL LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL
        TETHPSPHIIDSEKKTDEDVAFEEEEEEE +V   TKAEN+VN IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  NL
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL

Query:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS
        SKRSLISVDN LQ+TETLKSK+D+E LVN VSTPSS+RSPL SLSALNRRISLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKLK 
Subjt:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS

Query:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS
        LLT+DGGT+ANGI+ SKIL GD DSMSKISSS++LNV QVGG+TALSGT+AS +AK+ SG ST+VE+NEK SCLEAQADVVANM++ D +GSASEQP  S
Subjt:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS

Query:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD
         VD+I+EYPVG + QLDQS ATCTENIVDG SRSSGT HH EME+HEG ASEQPN SKVD+IKEYPVGIQ Q                         L  
Subjt:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD

Query:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
         +T TC E IVDG+SRSS T+HH+EEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP 
Subjt:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTLKVKSLVSNEYKDLVELAALH
        KGNG+PT+KVKSLVSNEYKDLV+LAALH
Subjt:  KGNGQPTLKVKSLVSNEYKDLVELAALH

XP_011659552.1 centromere protein C isoform X3 [Cucumis sativus]0.0e+0074.4Show/hide
Query:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF
        M +EEARHSDV+DPLAAYSGINLF +AFGTLPD SKP+DLGTDLDGIHK LKSMVL S                                          
Subjt:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                         PSKL+EQARSILDGNSN M SEAATFLVKNEKNEEATVK EEN  ERRPALNRKRARFSLKPDARQP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSI TEDDQNVDPSQVTF+S   SPL LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL
        TETHPSPHIIDSEKKTDEDVAFEEEEEEE +V   TKAEN++N IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKSS  NL
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL

Query:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS
        SKRSLISVDN LQ+ E LKSKQD+  LVNPVSTPSS+RSPL SLSALNRRISLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  EQSSVSKLK 
Subjt:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS

Query:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS
        LLT+DGGTVANGIK SKIL GD DSMS ISSS++LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+NEK SCLEAQAD VANM++ED EGSASEQP  S
Subjt:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS

Query:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD
         VD+IKEYPVG + QLDQS ATCTENIVDG SRSSGT H  EME+HEG ASEQP  SKVDVIKEYPV IQSQ                         L  
Subjt:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD

Query:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
         +T TC ENI DG+SRSS T+HH+ EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPA
Subjt:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTLKVKSLVSNEYKDLVELAALH
        KGNG+PT+KVKSLVSNEYKDLVELAALH
Subjt:  KGNGQPTLKVKSLVSNEYKDLVELAALH

XP_031745135.1 centromere protein C isoform X1 [Cucumis sativus]0.0e+0073.68Show/hide
Query:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF
        M +EEARHSDV+DPLAAYSGINLF +AFGTLPD SKP+DLGTDLDGIHK LKSMVL S                                          
Subjt:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                         PSKL+EQARSILDGNSN M SEAATFLVKNEKNEEATVK EEN  ERRPALNRKRARFSLKPDARQP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILG--------RSVRYKHQYSSITTEDDQNVDPSQVTFESD
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILG        RSVRYKHQYSSI TEDDQNVDPSQVTF+S 
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILG--------RSVRYKHQYSSITTEDDQNVDPSQVTFESD

Query:  SISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMN
          SPL LGTETHPSPHIIDSEKKTDEDVAFEEEEEEE +V   TKAEN++N IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMN
Subjt:  SISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMN

Query:  LKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQ
        LKSS  NLSKRSLISVDN LQ+ E LKSKQD+  LVNPVSTPSS+RSPL SLSALNRRISLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  EQ
Subjt:  LKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQ

Query:  SSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGS
        SSVSKLK LLT+DGGTVANGIK SKIL GD DSMS ISSS++LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+NEK SCLEAQAD VANM++ED EGS
Subjt:  SSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGS

Query:  ASEQPNSSMVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRL
        ASEQP  S VD+IKEYPVG + QLDQS ATCTENIVDG SRSSGT H  EME+HEG ASEQP  SKVDVIKEYPV IQSQ                    
Subjt:  ASEQPNSSMVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRL

Query:  CVVCFLTDESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVI
             L   +T TC ENI DG+SRSS T+HH+ EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVI
Subjt:  CVVCFLTDESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVI

Query:  GLKYVSPAKGNGQPTLKVKSLVSNEYKDLVELAALH
        GLKYVSPAKGNG+PT+KVKSLVSNEYKDLVELAALH
Subjt:  GLKYVSPAKGNGQPTLKVKSLVSNEYKDLVELAALH

XP_031745137.1 centromere protein C isoform X4 [Cucumis sativus]0.0e+0074.28Show/hide
Query:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF
        M +EEARHSDV+DPLAAYSGINLF +AFGTLPD SKP+DLGTDLDGIHK LKSMVL S                                          
Subjt:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                         PSKL+EQARSILDGNSN M SEAATFLVKNEKNEEATVK EEN  ERRPALNRKRARFSLKPDARQP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILG SVRYKHQYSSI TEDDQNVDPSQVTF+S   SPL LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL
        TETHPSPHIIDSEKKTDEDVAFEEEEEEE +V   TKAEN++N IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKSS  NL
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL

Query:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS
        SKRSLISVDN LQ+ E LKSKQD+  LVNPVSTPSS+RSPL SLSALNRRISLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  EQSSVSKLK 
Subjt:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS

Query:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS
        LLT+DGGTVANGIK SKIL GD DSMS ISSS++LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+NEK SCLEAQAD VANM++ED EGSASEQP  S
Subjt:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS

Query:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD
         VD+IKEYPVG + QLDQS ATCTENIVDG SRSSGT H  EME+HEG ASEQP  SKVDVIKEYPV IQSQ                         L  
Subjt:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD

Query:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
         +T TC ENI DG+SRSS T+HH+ EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPA
Subjt:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTLKVKSLVSNEYKDLVELAALH
        KGNG+PT+KVKSLVSNEYKDLVELAALH
Subjt:  KGNGQPTLKVKSLVSNEYKDLVELAALH

XP_038896841.1 centromere protein C isoform X2 [Benincasa hispida]0.0e+0076.24Show/hide
Query:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV++EARHSD +DPLAAYSGINLF SAFGTLPD SKP+DLG DLDGIHKHLKSM                                              
Subjt:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                     VS +PSKLIEQARSILDGNSN MQSEAATFLVKNEKNEEATVK EENP ERRPALNRKRARFSLKPDARQP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG
        PVNLEPTFDIKQLKDPEEFFLAYER ENAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFES  ISP ++G
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKR
        TETHPSPHIIDS  KTDEDVAFEEEEE    VTKAENKVNKIL ELLS NC DLEGDRAINILQE LQIKP NLEKLCLPDLEAI TM LKSSS NLSKR
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKR

Query:  SLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLT
        SLISV N LQR ETLKSKQDDE LVNP+S PSSIRSPL SLSALNRRISLSNSSGDPFSAHGIDQSPAR PYLF L+N+LSDA GIAEQSSVSKLKSLLT
Subjt:  SLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLT

Query:  KDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVD
        KDGGTVANGIK SKILF D DSMSKISSS VLNVP+VG +T LSGTH SMEAKD S GS EVEVNEK SCLE Q D VANM+MED EGSASEQPNSS VD
Subjt:  KDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVD

Query:  VIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTDEST
        +IKEYPVG Q QLDQSTA C ENI DG SRSSGT+HH EME+H+G ASEQPN S VDVIKEYPVG+Q QL                          D+ T
Subjt:  VIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTDEST

Query:  ATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGN
        ATCTENI DG SRSS T+H NEEQ KPKSRANKQ +GKKISGRQSLAGAGTTWQ GVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGN
Subjt:  ATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKGN

Query:  GQPTLKVKSLVSNEYKDLVELAALH
        GQP +KVKSLVSNEYKDLVELAALH
Subjt:  GQPTLKVKSLVSNEYKDLVELAALH

TrEMBL top hitse value%identityAlignment
A0A0A0K774 Uncharacterized protein0.0e+0074.4Show/hide
Query:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF
        M +EEARHSDV+DPLAAYSGINLF +AFGTLPD SKP+DLGTDLDGIHK LKSMVL S                                          
Subjt:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                         PSKL+EQARSILDGNSN M SEAATFLVKNEKNEEATVK EEN  ERRPALNRKRARFSLKPDARQP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSI TEDDQNVDPSQVTF+S   SPL LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL
        TETHPSPHIIDSEKKTDEDVAFEEEEEEE +V   TKAEN++N IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKSS  NL
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL

Query:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS
        SKRSLISVDN LQ+ E LKSKQD+  LVNPVSTPSS+RSPL SLSALNRRISLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  EQSSVSKLK 
Subjt:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS

Query:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS
        LLT+DGGTVANGIK SKIL GD DSMS ISSS++LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+NEK SCLEAQAD VANM++ED EGSASEQP  S
Subjt:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS

Query:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD
         VD+IKEYPVG + QLDQS ATCTENIVDG SRSSGT H  EME+HEG ASEQP  SKVDVIKEYPV IQSQ                         L  
Subjt:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD

Query:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
         +T TC ENI DG+SRSS T+HH+ EQVKPKSRANKQ KGKKIS RQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESL TVIGLKYVSPA
Subjt:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTLKVKSLVSNEYKDLVELAALH
        KGNG+PT+KVKSLVSNEYKDLVELAALH
Subjt:  KGNGQPTLKVKSLVSNEYKDLVELAALH

A0A1S3CDU5 uncharacterized protein LOC103499749 isoform X20.0e+0073.31Show/hide
Query:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV+EE R SDV+DPLAAYSGINLFP+AFGTL D SKP+DLGTDLDGIHK LKSMVL S                                          
Subjt:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                         PSKL+EQARSILDGNS  M SEAATFLVKNEKNE A+VK EENP ERRPALNRKRARFSLKPDA QP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S   SPL LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL
        TETHPSPHIIDSEKKTDEDVAFEEEEEEE +V   TKAEN+VN IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  NL
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL

Query:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS
        SKRSLISVDN LQ+TETLKSK+D+E LVN VSTPSS+RSPL SLSALNRRISLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKLK 
Subjt:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS

Query:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS
        LLT+DGGT+ANGI+ SKIL GD DSMSKISSS++LNV QVG +TALSGT+AS +AK+ SG ST+VE+NEK SCLEAQADVVANM++ D +GSASEQP  S
Subjt:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS

Query:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD
         VD+I+EYPVG + QLDQS ATCTENIVDG SRSSGT HH EME+HEG ASEQPN SKVD+IKEYPVGIQ QL                           
Subjt:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD

Query:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
         +T TC E IVDG+SRSS T+HH+E  VKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP 
Subjt:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTLKVKSLVSNEYKDLVELAALH
        KGNG+PT+KVKSLVSNEYKDLV+LAALH
Subjt:  KGNGQPTLKVKSLVSNEYKDLVELAALH

A0A1S3CDU7 uncharacterized protein LOC103499749 isoform X10.0e+0073.55Show/hide
Query:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV+EE R SDV+DPLAAYSGINLFP+AFGTL D SKP+DLGTDLDGIHK LKSMVL S                                          
Subjt:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                         PSKL+EQARSILDGNS  M SEAATFLVKNEKNE A+VK EENP ERRPALNRKRARFSLKPDA QP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S   SPL LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL
        TETHPSPHIIDSEKKTDEDVAFEEEEEEE +V   TKAEN+VN IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  NL
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL

Query:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS
        SKRSLISVDN LQ+TETLKSK+D+E LVN VSTPSS+RSPL SLSALNRRISLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKLK 
Subjt:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS

Query:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS
        LLT+DGGT+ANGI+ SKIL GD DSMSKISSS++LNV QVG +TALSGT+AS +AK+ SG ST+VE+NEK SCLEAQADVVANM++ D +GSASEQP  S
Subjt:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS

Query:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD
         VD+I+EYPVG + QLDQS ATCTENIVDG SRSSGT HH EME+HEG ASEQPN SKVD+IKEYPVGIQ QL                           
Subjt:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD

Query:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
         +T TC E IVDG+SRSS T+HH+EEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP 
Subjt:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTLKVKSLVSNEYKDLVELAALH
        KGNG+PT+KVKSLVSNEYKDLV+LAALH
Subjt:  KGNGQPTLKVKSLVSNEYKDLVELAALH

A0A1S4E341 uncharacterized protein LOC103499749 isoform X37.2e-29470.53Show/hide
Query:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV+EE R SDV+DPLAAYSGINLFP+AFGTL D SKP+DLGTDLDGIHK LKSMVL S                                          
Subjt:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                         PSKL+EQARSILDGNS  M SEAATFLVKNEKNE A+VK EENP ERRPALNRKRARFSLKPDA QP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S   SPL LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL
        TETHPSPHIIDSEKKTDEDVAFEEEEEEE +V   TKAEN+VN IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  NL
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL

Query:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS
        SKRSLISVDN LQ+TETLKSK+D+E LVN VSTPSS+RSPL SLSALNRRISLSNSS                             VGI E SSVSKLK 
Subjt:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS

Query:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS
        LLT+DGGT+ANGI+ SKIL GD DSMSKISSS++LNV QVG +TALSGT+AS +AK+ SG ST+VE+NEK SCLEAQADVVANM++ D +GSASEQP  S
Subjt:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS

Query:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD
         VD+I+EYPVG + QLDQS ATCTENIVDG SRSSGT HH EME+HEG ASEQPN SKVD+IKEYPVGIQ QL                           
Subjt:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD

Query:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
         +T TC E IVDG+SRSS T+HH+EEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP 
Subjt:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTLKVKSLVSNEYKDLVELAALH
        KGNG+PT+KVKSLVSNEYKDLV+LAALH
Subjt:  KGNGQPTLKVKSLVSNEYKDLVELAALH

A0A5A7UUE4 Uncharacterized protein0.0e+0073.67Show/hide
Query:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF
        MV+EE R SDV+DPLAAYSGINLFP+AFGTL D SKP+DLGTDLDGIHK LKSMVL S                                          
Subjt:  MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIF

Query:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP
                                         PSKL+EQARSILDGNS  M SEAATFLVKNEKNE A+VK EENP ERRPALNRKRARFSLKPDA QP
Subjt:  SSSRHLRKHFVSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQP

Query:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG
        PVNLEPTFDIKQLKDPEEFFLAYE+ ENAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S   SPL LG
Subjt:  PVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILG

Query:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL
        TETHPSPHIIDSEKKTDEDVAFEEEEEEE +V   TKAEN+VN IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  NL
Subjt:  TETHPSPHIIDSEKKTDEDVAFEEEEEEEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNL

Query:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS
        SKRSLISVDN LQ+TETLKSK+D+E LVN VSTPSS+RSPL SLSALNRRISLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKLK 
Subjt:  SKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKS

Query:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS
        LLT+DGGT+ANGI+ SKIL GD DSMSKISSS++LNV QVGG+TALSGT+AS +AK+ SG ST+VE+NEK SCLEAQADVVANM++ D +GSASEQP  S
Subjt:  LLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSS

Query:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD
         VD+I+EYPVG + QLDQS ATCTENIVDG SRSSGT HH EME+HEG ASEQPN SKVD+IKEYPVGIQ Q                         L  
Subjt:  MVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTD

Query:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA
         +T TC E IVDG+SRSS T+HH+EEQVKPKSRANKQRKGKKISGRQSLAGAGTTW+SGVRRSTRFK RPLEYWKGER+LYGRVHESLATVIGLKYVSP 
Subjt:  ESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPA

Query:  KGNGQPTLKVKSLVSNEYKDLVELAALH
        KGNG+PT+KVKSLVSNEYKDLV+LAALH
Subjt:  KGNGQPTLKVKSLVSNEYKDLVELAALH

SwissProt top hitse value%identityAlignment
Q66LG9 Centromere protein C1.4e-4729.27Show/hide
Query:  EAATFLVKNEKNEEATVKVEE-----------NPHERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKD
        ++  F +++E  E+A   +E+           N  ERRP L+RKR  FSL     QPP  + P+FD  +    E+FF AY++ E A +E QKQTGS + D
Subjt:  EAATFLVKNEKNEEATVKVEE-----------NPHERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKD

Query:  LNQQNPSTNNRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEFVVTKAENKVNKI
        + +  PS   R RRPGI GR  R +K  ++     D  N++ S+      S   L    E+  + H+   +++ D+     +++            +N +
Subjt:  LNQQNPSTNNRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEFVVTKAENKVNKI

Query:  LGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSL
        L +LL+ + E+LEGD AI +L+ERLQIK  N+EK  +P+ + +  MNLK+S  N  +++SL  + N L+ T  +  +++  +      +P +I       
Subjt:  LGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSL

Query:  SALNRRISLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGIKSSKILFGD------ADSMS
            +  S  N   D FS   I      DQ P+     P   ++ N     VG  + +S     +     +D   + +GI  S +           DS+S
Subjt:  SALNRRISLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGIKSSKILFGD------ADSMS

Query:  KISSSSV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNEKFSCLE-----AQADVVANMRMED-----LEGSASEQPNSSMVDVIKEYPVGTQGQ
          SS+ +   +++   G +  +  + +           + E+NE+   LE     A  +V     +E+      +G++S+ PN +      E      G 
Subjt:  KISSSSV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNEKFSCLE-----AQADVVANMRMED-----LEGSASEQPNSSMVDVIKEYPVGTQGQ

Query:  LDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTDESTATCTENIVDGSS
        L+   A   + + +  + ++G+   +++EN    A E    S     K    G         + ++    G D      +  L  ES A           
Subjt:  LDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTDESTATCTENIVDGSS

Query:  RSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTLKVKSLV
           +T   + E+ + K +     +GK  S R+SLA AGT  + GVRRSTR K+RPLEYW+GER LYGR+HESL TVIG+KY SP +G       KVKS V
Subjt:  RSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTLKVKSLV

Query:  SNEYKDLVELAALH
        S+EYK LV+ AALH
Subjt:  SNEYKDLVELAALH

Arabidopsis top hitse value%identityAlignment
AT1G15660.1 centromere protein C9.7e-4929.27Show/hide
Query:  EAATFLVKNEKNEEATVKVEE-----------NPHERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKD
        ++  F +++E  E+A   +E+           N  ERRP L+RKR  FSL     QPP  + P+FD  +    E+FF AY++ E A +E QKQTGS + D
Subjt:  EAATFLVKNEKNEEATVKVEE-----------NPHERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFFLAYERLENAKKEIQKQTGSVLKD

Query:  LNQQNPSTNNRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEFVVTKAENKVNKI
        + +  PS   R RRPGI GR  R +K  ++     D  N++ S+      S   L    E+  + H+   +++ D+     +++            +N +
Subjt:  LNQQNPSTNNRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEFVVTKAENKVNKI

Query:  LGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSL
        L +LL+ + E+LEGD AI +L+ERLQIK  N+EK  +P+ + +  MNLK+S  N  +++SL  + N L+ T  +  +++  +      +P +I       
Subjt:  LGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSL

Query:  SALNRRISLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGIKSSKILFGD------ADSMS
            +  S  N   D FS   I      DQ P+     P   ++ N     VG  + +S     +     +D   + +GI  S +           DS+S
Subjt:  SALNRRISLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGIKSSKILFGD------ADSMS

Query:  KISSSSV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNEKFSCLE-----AQADVVANMRMED-----LEGSASEQPNSSMVDVIKEYPVGTQGQ
          SS+ +   +++   G +  +  + +           + E+NE+   LE     A  +V     +E+      +G++S+ PN +      E      G 
Subjt:  KISSSSV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNEKFSCLE-----AQADVVANMRMED-----LEGSASEQPNSSMVDVIKEYPVGTQGQ

Query:  LDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTDESTATCTENIVDGSS
        L+   A   + + +  + ++G+   +++EN    A E    S     K    G         + ++    G D      +  L  ES A           
Subjt:  LDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKEYPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTDESTATCTENIVDGSS

Query:  RSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTLKVKSLV
           +T   + E+ + K +     +GK  S R+SLA AGT  + GVRRSTR K+RPLEYW+GER LYGR+HESL TVIG+KY SP +G       KVKS V
Subjt:  RSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYWKGERLLYGRVHESLATVIGLKYVSPAKG-NGQPTLKVKSLV

Query:  SNEYKDLVELAALH
        S+EYK LV+ AALH
Subjt:  SNEYKDLVELAALH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAGCGAAGAAGCTCGACACTCCGATGTCGTCGATCCTCTTGCTGCTTATTCTGGTATCAATCTCTTTCCGAGCGCATTTGGTACTTTGCCGGATCGGTCAAAGCC
ATATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTACTTGCATCTTTTGATACTTTTTTAACTGCGCCAGAGTTTGATGTGGTTTTCCTTC
TTAGTCCTAAAAGATTTAGCTCAATGGTTGAATTGGAGACTCGCCTTGCTTCTCATTTCTCCCATCTAATTTCCATTTTCTCATCATCTCGGCACCTCAGGAAGCATTTT
GTATCTATTTCTAGTTTTGGTTTAATTGACGATCAACGACTCCCCCAAATTTGTCAGGTTTCAGGAACTCCCAGTAAACTTATAGAGCAGGCCAGATCAATTTTAGACGG
GAACTCAAATTGGATGCAATCTGAAGCTGCCACATTTCTTGTGAAGAATGAGAAAAATGAGGAAGCTACAGTGAAGGTGGAGGAAAATCCACATGAAAGAAGGCCGGCCT
TAAACCGAAAGCGGGCCAGGTTCTCTTTAAAACCTGATGCTAGACAACCTCCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGACCCCGAGGAGTTCTTT
TTGGCCTATGAAAGGCTTGAAAATGCCAAAAAAGAAATCCAAAAACAGACAGGATCTGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAACCGTCAGCGTAG
ACCAGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATCAGAATGTAGATCCTTCTCAAGTGACATTTGAATCAGATAGCA
TCAGTCCATTGATTTTGGGCACAGAAACACACCCAAGTCCACATATAATTGACTCAGAAAAGAAAACTGATGAAGACGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGTTC
GTTGTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGGTGAATTACTCTCTGCCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGGAGCGCTT
GCAGATTAAACCCATTAATTTAGAGAAATTATGTCTTCCAGATTTGGAAGCCATTCCGACAATGAATTTGAAATCTTCAAGTCGCAATCTTTCAAAGCGTAGTTTGATCA
GTGTGGACAATCATTTACAAAGGACAGAAACTTTGAAATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGCCCATTGGGCTCA
TTATCAGCCTTAAATAGACGAATTTCACTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGGAATTGACCAATCTCCAGCAAGAGGTCCTTACCTTTTTGAACTCAG
TAATCACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTGTTTCTAAATTGAAGTCACTTTTAACCAAAGACGGCGGGACTGTAGCAAATGGAATTAAGTCATCCA
AAATTCTTTTTGGAGATGCTGATTCCATGTCTAAAATATCTTCAAGTAGTGTTTTAAATGTACCCCAAGTTGGTGGCGATACTGCCTTAAGTGGAACTCACGCCAGCATG
GAAGCTAAAGATGATAGTGGCAGCACAGAAGTGGAAGTAAATGAGAAATTCAGTTGTCTTGAAGCCCAAGCAGATGTTGTGGCTAATATGCGGATGGAAGATCTCGAAGG
ATCAGCTTCCGAGCAACCAAACTCATCCATGGTGGACGTGATCAAAGAGTACCCAGTTGGCACTCAGGGTCAGTTGGATCAATCAACTGCTACCTGTACTGAAAATATTG
TCGATGGGCTGTCTAGAAGCAGTGGAACAAATCACCACGTTGAGATGGAAAATCACGAAGGATTAGCTTCTGAGCAACCAAACTTATCCAAGGTGGATGTGATCAAAGAG
TACCCGGTTGGCATTCAGAGTCAGTTGGGTATGATCTTCAATGCCAGTATTTTCCCAGTAGATGGATTAGATGATTTCAGGCTGTGTGTTGTTTGTTTTCTAACAGATGA
ATCAACTGCTACTTGTACCGAAAATATTGTCGATGGGTCGTCTAGAAGCAGTGAAACGAATCACCACAACGAGGAACAGGTCAAGCCAAAATCTCGTGCAAACAAACAAC
GCAAAGGCAAAAAGATTTCTGGGAGGCAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACCAGGTTCAAAACACGACCGTTAGAGTACTGG
AAAGGTGAAAGGCTGTTGTACGGACGCGTACATGAGAGCCTAGCAACAGTAATCGGGTTGAAGTATGTGTCTCCTGCAAAAGGAAATGGCCAACCAACTCTGAAGGTGAA
GTCTCTAGTCTCCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTGCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGAGCGAAGAAGCTCGACACTCCGATGTCGTCGATCCTCTTGCTGCTTATTCTGGTATCAATCTCTTTCCGAGCGCATTTGGTACTTTGCCGGATCGGTCAAAGCC
ATATGATCTTGGAACCGACCTTGACGGCATCCACAAGCACCTCAAATCCATGGTACTTGCATCTTTTGATACTTTTTTAACTGCGCCAGAGTTTGATGTGGTTTTCCTTC
TTAGTCCTAAAAGATTTAGCTCAATGGTTGAATTGGAGACTCGCCTTGCTTCTCATTTCTCCCATCTAATTTCCATTTTCTCATCATCTCGGCACCTCAGGAAGCATTTT
GTATCTATTTCTAGTTTTGGTTTAATTGACGATCAACGACTCCCCCAAATTTGTCAGGTTTCAGGAACTCCCAGTAAACTTATAGAGCAGGCCAGATCAATTTTAGACGG
GAACTCAAATTGGATGCAATCTGAAGCTGCCACATTTCTTGTGAAGAATGAGAAAAATGAGGAAGCTACAGTGAAGGTGGAGGAAAATCCACATGAAAGAAGGCCGGCCT
TAAACCGAAAGCGGGCCAGGTTCTCTTTAAAACCTGATGCTAGACAACCTCCTGTGAACTTGGAACCAACATTTGACATCAAACAATTGAAAGACCCCGAGGAGTTCTTT
TTGGCCTATGAAAGGCTTGAAAATGCCAAAAAAGAAATCCAAAAACAGACAGGATCTGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAACCGTCAGCGTAG
ACCAGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATCAGAATGTAGATCCTTCTCAAGTGACATTTGAATCAGATAGCA
TCAGTCCATTGATTTTGGGCACAGAAACACACCCAAGTCCACATATAATTGACTCAGAAAAGAAAACTGATGAAGACGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGTTC
GTTGTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGGTGAATTACTCTCTGCCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGGAGCGCTT
GCAGATTAAACCCATTAATTTAGAGAAATTATGTCTTCCAGATTTGGAAGCCATTCCGACAATGAATTTGAAATCTTCAAGTCGCAATCTTTCAAAGCGTAGTTTGATCA
GTGTGGACAATCATTTACAAAGGACAGAAACTTTGAAATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGCCCATTGGGCTCA
TTATCAGCCTTAAATAGACGAATTTCACTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGGAATTGACCAATCTCCAGCAAGAGGTCCTTACCTTTTTGAACTCAG
TAATCACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTGTTTCTAAATTGAAGTCACTTTTAACCAAAGACGGCGGGACTGTAGCAAATGGAATTAAGTCATCCA
AAATTCTTTTTGGAGATGCTGATTCCATGTCTAAAATATCTTCAAGTAGTGTTTTAAATGTACCCCAAGTTGGTGGCGATACTGCCTTAAGTGGAACTCACGCCAGCATG
GAAGCTAAAGATGATAGTGGCAGCACAGAAGTGGAAGTAAATGAGAAATTCAGTTGTCTTGAAGCCCAAGCAGATGTTGTGGCTAATATGCGGATGGAAGATCTCGAAGG
ATCAGCTTCCGAGCAACCAAACTCATCCATGGTGGACGTGATCAAAGAGTACCCAGTTGGCACTCAGGGTCAGTTGGATCAATCAACTGCTACCTGTACTGAAAATATTG
TCGATGGGCTGTCTAGAAGCAGTGGAACAAATCACCACGTTGAGATGGAAAATCACGAAGGATTAGCTTCTGAGCAACCAAACTTATCCAAGGTGGATGTGATCAAAGAG
TACCCGGTTGGCATTCAGAGTCAGTTGGGTATGATCTTCAATGCCAGTATTTTCCCAGTAGATGGATTAGATGATTTCAGGCTGTGTGTTGTTTGTTTTCTAACAGATGA
ATCAACTGCTACTTGTACCGAAAATATTGTCGATGGGTCGTCTAGAAGCAGTGAAACGAATCACCACAACGAGGAACAGGTCAAGCCAAAATCTCGTGCAAACAAACAAC
GCAAAGGCAAAAAGATTTCTGGGAGGCAAAGCCTTGCAGGGGCTGGTACAACGTGGCAAAGTGGGGTGAGAAGAAGTACCAGGTTCAAAACACGACCGTTAGAGTACTGG
AAAGGTGAAAGGCTGTTGTACGGACGCGTACATGAGAGCCTAGCAACAGTAATCGGGTTGAAGTATGTGTCTCCTGCAAAAGGAAATGGCCAACCAACTCTGAAGGTGAA
GTCTCTAGTCTCCAATGAGTACAAAGATCTCGTTGAGTTAGCAGCTCTGCACTAAGGGTCGTGTACAAAAAGGGAGAAAAAGCCTTGAAACCTTTTGGATTTTGCATGTA
TAACTAGCAATTCTCTTTGAATATAAATAGCACCTAGTTTCTGTGGAAAGACTATGGAGGAGAATTAGGCTAATGCCATTGCATTGTATATTTCTTCGCCCTTCTCTATC
ATATATATATCTATCAAGCTGTTTCGCTTGTGTGTTTTTGCTCATGTACTTGTGTCATATGATTTCATATTTTACCCACTGACATTTATCTGATTGTACAAATGTTCCAG
AATGAGTTGTAAACATTTGGCCAACTTGTTTTTGACTCACTCACTCCTTTTTGGCTGGGGGCCAAGGCAGCTAGTTTATAGCAAAATATTGCAGTTTGGCTTTGTGTGTT
GAGTTTAACAATATTATGGGTTGGAGGATTTGA
Protein sequenceShow/hide protein sequence
MVSEEARHSDVVDPLAAYSGINLFPSAFGTLPDRSKPYDLGTDLDGIHKHLKSMVLASFDTFLTAPEFDVVFLLSPKRFSSMVELETRLASHFSHLISIFSSSRHLRKHF
VSISSFGLIDDQRLPQICQVSGTPSKLIEQARSILDGNSNWMQSEAATFLVKNEKNEEATVKVEENPHERRPALNRKRARFSLKPDARQPPVNLEPTFDIKQLKDPEEFF
LAYERLENAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEF
VVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGS
LSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASM
EAKDDSGSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQLDQSTATCTENIVDGLSRSSGTNHHVEMENHEGLASEQPNLSKVDVIKE
YPVGIQSQLGMIFNASIFPVDGLDDFRLCVVCFLTDESTATCTENIVDGSSRSSETNHHNEEQVKPKSRANKQRKGKKISGRQSLAGAGTTWQSGVRRSTRFKTRPLEYW
KGERLLYGRVHESLATVIGLKYVSPAKGNGQPTLKVKSLVSNEYKDLVELAALH