CuGenDBv2

ClCG05G013830 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G013830
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: centromere protein C-like isoform X1
Genome location: CG_Chr05:21538564..21542194
RNA-Seq Expression: ClCG05G013830
Synteny: ClCG05G013830
Gene Ontology terms:
GO:0051315 - attachment of mitotic spindle microtubules to kinetochore (biological process)
GO:0051382 - kinetochore assembly (biological process)
GO:0051455 - attachment of spindle microtubules to kinetochore involved in homologous chromosome segregation (biological process)
GO:0000776 - kinetochore (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0019237 - centromeric DNA binding (molecular function)
InterPro domains: IPR028386 - Centromere protein C/Mif2/cnp3
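The genome location string above can be split into its sequence name and coordinates with a few lines of Python. This is a minimal sketch: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not confirmed by the page).
    return end - start + 1

seqid, start, end = parse_location("CG_Chr05:21538564..21542194")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 3631 bp on CG_Chr05.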


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0058804.1 uncharacterized protein E6C27_scaffold339G002780 [Cucumis melo var. makuwa]; e-value: 6.4e-160; identity: 82.01%
Query:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        NAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S   SPL LGTETHPSPHIIDSEKKTDEDVAFEEEEE
Subjt:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL
        EE +V   TKAEN+VN IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  NLSKRSLISVDN LQ+TETLKSK+D+E L
Subjt:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL

Query:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS
        VN VSTPSS+RSPL SLSALNRRISLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKLK LLT+DGGT+ANGI+ SKIL GD DSMS
Subjt:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS

Query:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL
        KISSS++LNV QVGG+TALSGT+AS +AK+ SG ST+VE+NEK SCLEAQADVVANM++ D +GSASEQP  S VD+I+EYPVG + QL
Subjt:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL

XP_011659552.1 centromere protein C isoform X3 [Cucumis sativus]; e-value: 5.3e-162; identity: 82.52%
Query:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        NAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSI TEDDQNVDPSQVTF+S   SPL LGTETHPSPHIIDSEKKTDEDVAFEEEEE
Subjt:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL
        EE +V   TKAEN++N IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKSS  NLSKRSLISVDN LQ+ E LKSKQD+  L
Subjt:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL

Query:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS
        VNPVSTPSS+RSPL SLSALNRRISLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  EQSSVSKLK LLT+DGGTVANGIK SKIL GD DSMS
Subjt:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS

Query:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL
         ISSS++LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+NEK SCLEAQAD VANM++ED EGSASEQP  S VD+IKEYPVG + QL
Subjt:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL

XP_031745137.1 centromere protein C isoform X4 [Cucumis sativus]; e-value: 3.8e-160; identity: 82.26%
Query:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        NAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILG SVRYKHQYSSI TEDDQNVDPSQVTF+S   SPL LGTETHPSPHIIDSEKKTDEDVAFEEEEE
Subjt:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL
        EE +V   TKAEN++N IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKSS  NLSKRSLISVDN LQ+ E LKSKQD+  L
Subjt:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL

Query:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS
        VNPVSTPSS+RSPL SLSALNRRISLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  EQSSVSKLK LLT+DGGTVANGIK SKIL GD DSMS
Subjt:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS

Query:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL
         ISSS++LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+NEK SCLEAQAD VANM++ED EGSASEQP  S VD+IKEYPVG + QL
Subjt:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL

XP_038896840.1 centromere protein C isoform X1 [Benincasa hispida]; e-value: 7.1e-167; identity: 85.75%
Query:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        NAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFES  ISP ++GTETHPSPHIIDS  KTDEDVAFEEEEE
Subjt:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNP
            VTKAENKVNKIL ELLS NC DLEGDRAINILQE LQIKP NLEKLCLPDLEAI TM LKSSS NLSKRSLISV N LQR ETLKSKQDDE LVNP
Subjt:  EEFVVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNP

Query:  VSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMSKIS
        +S PSSIRSPL SLSALNRRISLSNSSGDPFSAHGIDQSPAR PYLF L+N+LSDA GIAEQSSVSKLKSLLTKDGGTVANGIK SKILF D DSMSKIS
Subjt:  VSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMSKIS

Query:  SSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL
        SS VLNVP+VG +T LSGTH SMEAKD S GS EVEVNEK SCLE Q D VANM+MED EGSASEQPNSS VD+IKEYPVG Q QL
Subjt:  SSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL

XP_038896841.1 centromere protein C isoform X2 [Benincasa hispida]; e-value: 7.1e-167; identity: 85.75%
Query:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        NAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFES  ISP ++GTETHPSPHIIDS  KTDEDVAFEEEEE
Subjt:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNP
            VTKAENKVNKIL ELLS NC DLEGDRAINILQE LQIKP NLEKLCLPDLEAI TM LKSSS NLSKRSLISV N LQR ETLKSKQDDE LVNP
Subjt:  EEFVVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNP

Query:  VSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMSKIS
        +S PSSIRSPL SLSALNRRISLSNSSGDPFSAHGIDQSPAR PYLF L+N+LSDA GIAEQSSVSKLKSLLTKDGGTVANGIK SKILF D DSMSKIS
Subjt:  VSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMSKIS

Query:  SSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL
        SS VLNVP+VG +T LSGTH SMEAKD S GS EVEVNEK SCLE Q D VANM+MED EGSASEQPNSS VD+IKEYPVG Q QL
Subjt:  SSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K774 Uncharacterized protein; e-value: 2.5e-162; identity: 82.52%
Query:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        NAKKEIQKQTG+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSI TEDDQNVDPSQVTF+S   SPL LGTETHPSPHIIDSEKKTDEDVAFEEEEE
Subjt:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL
        EE +V   TKAEN++N IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKSS  NLSKRSLISVDN LQ+ E LKSKQD+  L
Subjt:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL

Query:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS
        VNPVSTPSS+RSPL SLSALNRRISLSNSS D FSAHGIDQSP+R PYLFEL NHLSDAVG  EQSSVSKLK LLT+DGGTVANGIK SKIL GD DSMS
Subjt:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS

Query:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL
         ISSS++LNVPQVGG+TALSGT+AS EAK+ S  ST+VE+NEK SCLEAQAD VANM++ED EGSASEQP  S VD+IKEYPVG + QL
Subjt:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDS-GSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL

A0A1S3CDU5 uncharacterized protein LOC103499749 isoform X2; e-value: 1.5e-159; identity: 81.75%
Query:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        NAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S   SPL LGTETHPSPHIIDSEKKTDEDVAFEEEEE
Subjt:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL
        EE +V   TKAEN+VN IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  NLSKRSLISVDN LQ+TETLKSK+D+E L
Subjt:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL

Query:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS
        VN VSTPSS+RSPL SLSALNRRISLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKLK LLT+DGGT+ANGI+ SKIL GD DSMS
Subjt:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS

Query:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL
        KISSS++LNV QVG +TALSGT+AS +AK+ SG ST+VE+NEK SCLEAQADVVANM++ D +GSASEQP  S VD+I+EYPVG + QL
Subjt:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL

A0A1S3CDU7 uncharacterized protein LOC103499749 isoform X1; e-value: 1.5e-159; identity: 81.75%
Query:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        NAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S   SPL LGTETHPSPHIIDSEKKTDEDVAFEEEEE
Subjt:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL
        EE +V   TKAEN+VN IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  NLSKRSLISVDN LQ+TETLKSK+D+E L
Subjt:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL

Query:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS
        VN VSTPSS+RSPL SLSALNRRISLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKLK LLT+DGGT+ANGI+ SKIL GD DSMS
Subjt:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS

Query:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL
        KISSS++LNV QVG +TALSGT+AS +AK+ SG ST+VE+NEK SCLEAQADVVANM++ D +GSASEQP  S VD+I+EYPVG + QL
Subjt:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL

A0A5A7UUE4 Uncharacterized protein; e-value: 3.1e-160; identity: 82.01%
Query:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        NAKKEIQKQ G+VLKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTF+S   SPL LGTETHPSPHIIDSEKKTDEDVAFEEEEE
Subjt:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL
        EE +V   TKAEN+VN IL E LS NCEDLEGDRAINILQERLQIKP+ LEKLCLPDLEAIPTMNLKS+  NLSKRSLISVDN LQ+TETLKSK+D+E L
Subjt:  EEFVV---TKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETL

Query:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS
        VN VSTPSS+RSPL SLSALNRRISLSNSSGD FSAHGID+SPAR PYLFEL NHLSDAVGI E SSVSKLK LLT+DGGT+ANGI+ SKIL GD DSMS
Subjt:  VNPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMS

Query:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL
        KISSS++LNV QVGG+TALSGT+AS +AK+ SG ST+VE+NEK SCLEAQADVVANM++ D +GSASEQP  S VD+I+EYPVG + QL
Subjt:  KISSSSVLNVPQVGGDTALSGTHASMEAKDDSG-STEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQL

A0A6J1JYG6 centromere protein C-like isoform X1; e-value: 1.5e-146; identity: 77.18%
Query:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        NAKKEIQKQTG++LKDLNQQNPSTN RQRRPGILGRSVRYKHQYSSIT+EDDQ V+PSQVTFES SISP  LGTE   SP II SE KT+E+V FEEEEE
Subjt:  NAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFV--VTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLV
        E FV  +T AENKVNKIL ELLSANCEDLEGD+AIN LQE LQIKPINLEKLCLPDLEAI TMNL+SS  NL +RSLISVD+ LQR E LKSKQDDE  V
Subjt:  EEFV--VTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLV

Query:  NPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMSK
        NP+STP S+RSPL SLSAL RRISLSNS GDPFSAH +DQS AR P LFELSNHLSDAVGIAE+  VS+L SLLTKD GTVA GIKS KIL GD +S+SK
Subjt:  NPVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMSK

Query:  ISSSSVLNVPQVGGDTALSGTHASMEAKDDSGST-EVEVNEKFSCLEAQADVVA-----NMRMEDLEGSASEQPNSSMVDVIKEYPVGTQ
        ISSS+VLNVPQ G D ALS THA+MEAKD SGS+ EVEVNEK S LEAQAD VA     +  MED EGS SEQPN+S VD IKEYP+G Q
Subjt:  ISSSSVLNVPQVGGDTALSGTHASMEAKDDSGST-EVEVNEKFSCLEAQADVVA-----NMRMEDLEGSASEQPNSSMVDVIKEYPVGTQ

SwissProt top hits (e-value, % identity, alignment)
Q66LG9 Centromere protein C; e-value: 2.3e-11; identity: 25.13%
Query:  AKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        A +E QKQTGS + D+ +  PS   R RRPGI GR  R +K  ++     D  N++ S+      S   L    E+  + H+   +++ D+     +++ 
Subjt:  AKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVN
                   +N +L +LL+ + E+LEGD AI +L+ERLQIK  N+EK  +P+ + +  MNLK+S  N  +++SL  + N L+ T  +  +++  +   
Subjt:  EEFVVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVN

Query:  PVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGIKSSKI
           +P +I           +  S  N   D FS   I      DQ P+     P   ++ N     VG  + +S     +     +D   + +GI  S +
Subjt:  PVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGIKSSKI

Query:  LFGD------ADSMSKISSSSV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNEKFSCLE-----AQADVVANMRMED-----LEGSASEQPN
                   DS+S  SS+ +   +++   G +  +  + +           + E+NE+   LE     A  +V     +E+      +G++S+ PN
Subjt:  LFGD------ADSMSKISSSSV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNEKFSCLE-----AQADVVANMRMED-----LEGSASEQPN

Arabidopsis top hits (e-value, % identity, alignment)
AT1G15660.1 centromere protein C; e-value: 1.6e-12; identity: 25.13%
Query:  AKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE
        A +E QKQTGS + D+ +  PS   R RRPGI GR  R +K  ++     D  N++ S+      S   L    E+  + H+   +++ D+     +++ 
Subjt:  AKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVR-YKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEE

Query:  EEFVVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVN
                   +N +L +LL+ + E+LEGD AI +L+ERLQIK  N+EK  +P+ + +  MNLK+S  N  +++SL  + N L+ T  +  +++  +   
Subjt:  EEFVVTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRN-LSKRSLISVDNHLQRTETLKSKQDDETLVN

Query:  PVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGIKSSKI
           +P +I           +  S  N   D FS   I      DQ P+     P   ++ N     VG  + +S     +     +D   + +GI  S +
Subjt:  PVSTPSSIRSPLGSLSALNRRISLSNSSGDPFSAHGI------DQSPAR---GPYLFELSNHLSDAVGIAEQSSV--SKLKSLLTKDGGTVANGIKSSKI

Query:  LFGD------ADSMSKISSSSV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNEKFSCLE-----AQADVVANMRMED-----LEGSASEQPN
                   DS+S  SS+ +   +++   G +  +  + +           + E+NE+   LE     A  +V     +E+      +G++S+ PN
Subjt:  LFGD------ADSMSKISSSSV---LNVPQVGGDTALSGTHASMEAKDDSGSTEVEVNEKFSCLE-----AQADVVANMRMED-----LEGSASEQPN
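The % identity values listed for the hits above can be related to the middle line of each alignment block, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those columns for a short fragment. The function name `alignment_stats` is hypothetical, and treating every column of the middle line as part of the alignment length is an assumption about how the page reports identity.

```python
def alignment_stats(match_line):
    """Count identities, positives and columns in a BLAST-style middle line."""
    columns = len(match_line)
    # Letters in the middle line mark identical residues.
    identities = sum(ch.isalpha() for ch in match_line)
    # '+' marks a conservative (positive-scoring) substitution.
    positives = identities + match_line.count("+")
    return identities, positives, columns

# First 20 columns of the middle line for the KAA0058804.1 hit.
fragment = "NAKKEIQKQ G+VLKDLNQQ"
identities, positives, columns = alignment_stats(fragment)
print(f"{identities}/{columns} identical ({100 * identities / columns:.1f}%)")
```

For this fragment the count is 18 identities and 19 positives over 20 columns. Applying the same count to a full middle line gives a figure that can be compared with the % identity reported for that hit.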


Sequences
CDS sequence
ATGAAAGGCTTGAAAGTAAATGCCAAAAAAGAAATCCAAAAACAGACAGGATCTGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAACCGTCAGCGTAGACC
AGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATCAGAATGTAGATCCTTCTCAAGTGACATTTGAATCAGATAGCATCA
GTCCATTGATTTTGGGCACAGAAACACACCCAAGTCCACATATAATTGACTCAGAAAAGAAAACTGATGAAGACGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGTTCGTT
GTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGGTGAATTACTCTCTGCCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGGAGCGCTTGCA
GATTAAACCCATTAATTTAGAGAAATTATGTCTTCCAGATTTGGAAGCCATTCCGACAATGAATTTGAAATCTTCAAGTCGCAATCTTTCAAAGCGTAGTTTGATCAGTG
TGGACAATCATTTACAAAGGACAGAAACTTTGAAATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGCCCATTGGGCTCATTA
TCAGCCTTAAATAGACGAATTTCACTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGGAATTGACCAATCTCCAGCAAGAGGTCCTTACCTTTTTGAACTCAGTAA
TCACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTGTTTCTAAATTGAAGTCACTTTTAACCAAAGACGGCGGGACTGTAGCAAATGGAATTAAGTCATCCAAAA
TTCTTTTTGGAGATGCTGATTCCATGTCTAAAATATCTTCAAGTAGTGTTTTAAATGTACCCCAAGTTGGTGGCGATACTGCCTTAAGTGGAACTCACGCCAGCATGGAA
GCTAAAGATGATAGTGGCAGCACAGAAGTGGAAGTAAATGAGAAATTCAGTTGTCTTGAAGCCCAAGCAGATGTTGTGGCTAATATGCGGATGGAAGATCTCGAAGGATC
AGCTTCCGAGCAACCAAACTCATCCATGGTGGACGTGATCAAAGAGTACCCAGTTGGCACTCAGGGTCAGTTGGGTATGATCTTCAACCCCAGTATCGGAAAACCGAGAT
TGGGGAAGCTTACTACCCGAATTGTTCTTGTCCTTGACTAG
mRNA sequence
ATGAAAGGCTTGAAAGTAAATGCCAAAAAAGAAATCCAAAAACAGACAGGATCTGTTTTGAAGGACTTGAACCAACAAAATCCATCCACGAATAACCGTCAGCGTAGACC
AGGGATTCTTGGGAGATCTGTTAGATACAAGCATCAATATTCATCAATAACAACTGAAGATGATCAGAATGTAGATCCTTCTCAAGTGACATTTGAATCAGATAGCATCA
GTCCATTGATTTTGGGCACAGAAACACACCCAAGTCCACATATAATTGACTCAGAAAAGAAAACTGATGAAGACGTAGCCTTTGAGGAGGAGGAGGAGGAGGAGTTCGTT
GTTACCAAGGCAGAGAACAAAGTGAATAAAATTTTGGGTGAATTACTCTCTGCCAATTGTGAAGATCTAGAAGGTGATCGAGCCATCAACATATTACAGGAGCGCTTGCA
GATTAAACCCATTAATTTAGAGAAATTATGTCTTCCAGATTTGGAAGCCATTCCGACAATGAATTTGAAATCTTCAAGTCGCAATCTTTCAAAGCGTAGTTTGATCAGTG
TGGACAATCATTTACAAAGGACAGAAACTTTGAAATCTAAGCAGGACGATGAAACTTTGGTTAATCCTGTTTCTACACCATCCTCAATCAGAAGCCCATTGGGCTCATTA
TCAGCCTTAAATAGACGAATTTCACTTTCAAATTCATCAGGTGATCCATTTTCTGCTCATGGAATTGACCAATCTCCAGCAAGAGGTCCTTACCTTTTTGAACTCAGTAA
TCACTTGTCTGATGCAGTTGGTATTGCAGAGCAGTCAAGTGTTTCTAAATTGAAGTCACTTTTAACCAAAGACGGCGGGACTGTAGCAAATGGAATTAAGTCATCCAAAA
TTCTTTTTGGAGATGCTGATTCCATGTCTAAAATATCTTCAAGTAGTGTTTTAAATGTACCCCAAGTTGGTGGCGATACTGCCTTAAGTGGAACTCACGCCAGCATGGAA
GCTAAAGATGATAGTGGCAGCACAGAAGTGGAAGTAAATGAGAAATTCAGTTGTCTTGAAGCCCAAGCAGATGTTGTGGCTAATATGCGGATGGAAGATCTCGAAGGATC
AGCTTCCGAGCAACCAAACTCATCCATGGTGGACGTGATCAAAGAGTACCCAGTTGGCACTCAGGGTCAGTTGGGTATGATCTTCAACCCCAGTATCGGAAAACCGAGAT
TGGGGAAGCTTACTACCCGAATTGTTCTTGTCCTTGACTAGCGAATTGACATCAAGAAAGCAAATCTTGGAGAATCAAGATCCAAATCTATATCGAGATAG
Protein sequence
MKGLKVNAKKEIQKQTGSVLKDLNQQNPSTNNRQRRPGILGRSVRYKHQYSSITTEDDQNVDPSQVTFESDSISPLILGTETHPSPHIIDSEKKTDEDVAFEEEEEEEFV
VTKAENKVNKILGELLSANCEDLEGDRAINILQERLQIKPINLEKLCLPDLEAIPTMNLKSSSRNLSKRSLISVDNHLQRTETLKSKQDDETLVNPVSTPSSIRSPLGSL
SALNRRISLSNSSGDPFSAHGIDQSPARGPYLFELSNHLSDAVGIAEQSSVSKLKSLLTKDGGTVANGIKSSKILFGDADSMSKISSSSVLNVPQVGGDTALSGTHASME
AKDDSGSTEVEVNEKFSCLEAQADVVANMRMEDLEGSASEQPNSSMVDVIKEYPVGTQGQLGMIFNPSIGKPRLGKLTTRIVLVLD
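The protein sequence above should correspond to a translation of the CDS sequence. The following sketch checks this for the first and last codons using the standard genetic code. The helper name `translate` is hypothetical, and the check covers only short fragments copied from the CDS, not the full sequence.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGAAAGGCTTGAAAGTAAATGCCAAAAAA"  # first 30 nt of the CDS
cds_end = "GTCCTTGACTAG"                      # last 12 nt of the CDS
print(translate(cds_start))  # MKGLKVNAKK
print(translate(cds_end))    # VLD
```

Both fragments agree with the start (MKGLKVNAKK) and end (VLD) of the listed protein sequence. Passing the full CDS to `translate` would extend the check to all residues.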