CuGenDBv2

CcUC06G116880 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC06G116880
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: CCT domain-containing protein
Genome location: CicolChr06:8876216..8877950
RNA-Seq Expression: CcUC06G116880
Synteny: CcUC06G116880
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR010402 - CCT domain
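The genome location follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention), the string can be parsed and the span length computed as below. The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("CicolChr06:8876216..8877950")
# Assumes 1-based inclusive coordinates, hence the +1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the locus spans 1,735 bp on CicolChr06.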


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031932.1 CCT domain-containing protein [Cucumis melo var. makuwa] (e value: 1.8e-229; % identity: 93.67)
Query:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS
        MKN   CL+ETK+DFCAAV KGIS  DEISSPINAQIFDFCDPELF ETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDN    NGN NGNT+A AAS
Subjt:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS

Query:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV
        FIPANDASAATNITTNSTSNL+AI DSQEELDNDISASI+FSPSASFS+PQYLTIQSG FDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDEDLPSIYV
Subjt:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV

Query:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
        DDCLSSLTSYMP+NP+SPSCSFVGA+MATYLPTTSMNPATSTVESCGMFSLLG ELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
Subjt:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM

Query:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE
        NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE
Subjt:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE

Query:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
        S+SNWENRVVVKEEDSM+DSSDIFAHISGVNSFKCNYPIQSW
Subjt:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

XP_008460686.1 PREDICTED: uncharacterized protein LOC103499454 [Cucumis melo] (e value: 4.7e-217; % identity: 90.05)
Query:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS
        MKN   CL+ETK+DFCAAV KGISV DEISSPINAQIFDFCDPELF ETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDN    NGN NGNT+A AAS
Subjt:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS

Query:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV
        FIPANDASAATNITTNSTSNL+AI DSQEELDNDISASI+FSPSASFS+PQYLTIQSG FDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDEDLPSIYV
Subjt:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV

Query:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
        DDCLSSLTSYMP+NP+SPSCSFVGA+MATYLPTTSMNPATSTVESCGMFSLLG ELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
Subjt:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM

Query:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE
        NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE          
Subjt:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE

Query:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
                VVVKEEDSM+DSSDIFAHISGVNSFKCNYPIQSW
Subjt:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

XP_011649131.1 uncharacterized protein LOC101214336 isoform X1 [Cucumis sativus] (e value: 1.8e-213; % identity: 89.82)
Query:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS
        MKN   CL+ETKV FCA V K ISV+DEISSPINAQIFDFCDPELF ETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDN  NGNGN NGNT+A AAS
Subjt:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS

Query:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV
        FIP NDASAATNITTNS SNLTAI DSQEELDNDISASIDFSPSASFSIPQYLTIQSG FDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDEDLPSIYV
Subjt:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV

Query:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
        DDCLSSLTSYMPLNP+SPSCSFVG TMATYLPTTSMNPATSTVESCGMFSLLG +L  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
Subjt:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM

Query:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE
        NCTSLASDLSSLKDSTFKVGKLS EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE          
Subjt:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE

Query:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
                VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
Subjt:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

XP_011649132.1 uncharacterized protein LOC101214336 isoform X2 [Cucumis sativus] (e value: 2.9e-211; % identity: 89.59)
Query:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS
        MKN   CL+ETKV FCA V K IS  DEISSPINAQIFDFCDPELF ETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDN  NGNGN NGNT+A AAS
Subjt:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS

Query:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV
        FIP NDASAATNITTNS SNLTAI DSQEELDNDISASIDFSPSASFSIPQYLTIQSG FDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDEDLPSIYV
Subjt:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV

Query:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
        DDCLSSLTSYMPLNP+SPSCSFVG TMATYLPTTSMNPATSTVESCGMFSLLG +L  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
Subjt:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM

Query:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE
        NCTSLASDLSSLKDSTFKVGKLS EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE          
Subjt:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE

Query:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
                VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
Subjt:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

XP_038875443.1 uncharacterized protein LOC120067896 isoform X1 [Benincasa hispida] (e value: 8.8e-216; % identity: 90.29)
Query:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS
        MKNG SCL+ETKVDF  AV KGI   DEISSPINAQIFDFCDPELF ETLQ+SEFNSCSNCCYDKNSPY TNLSNSPDQTDN    NGN NGNT+AAAAS
Subjt:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS

Query:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV
        F+PANDASAATNITTNSTSNLTAI DSQEELDNDISASIDFSPSASFSIPQYLTIQSG FDVSQVQSQMPL+DPMIEGLVQCPMAPVGTLIDEDLPSIYV
Subjt:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV

Query:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
        DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSM PATSTVESCGMFSLLG ELQPQDLDYQGDNCGLY+QDCMQGTFNPADLQVLNNENLQL AGAM
Subjt:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM

Query:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE
        NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE          
Subjt:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE

Query:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
                VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
Subjt:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LKJ8 CCT domain-containing protein (e value: 8.3e-204; % identity: 91.35)
Query:  DEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAASFIPANDASAATNITTNSTSNLTAILD
        DEISSPINAQIFDFCDPELF ETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDN  NGNGN NGNT+A AASFIP NDASAATNITTNS SNLTAI D
Subjt:  DEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAASFIPANDASAATNITTNSTSNLTAILD

Query:  SQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYVDDCLSSLTSYMPLNPSSPSCSFVGAT
        SQEELDNDISASIDFSPSASFSIPQYLTIQSG FDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDEDLPSIYVDDCLSSLTSYMPLNP+SPSCSFVG T
Subjt:  SQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYVDDCLSSLTSYMPLNPSSPSCSFVGAT

Query:  MATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEE
        MATYLPTTSMNPATSTVESCGMFSLLG +L  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLS EE
Subjt:  MATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEE

Query:  RKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFESFSNWENRVVVKEEDSMVDSSDIFAH
        RKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE                  VVVKEEDSMVDSSDIFAH
Subjt:  RKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFESFSNWENRVVVKEEDSMVDSSDIFAH

Query:  ISGVNSFKCNYPIQSW
        ISGVNSFKCNYPIQSW
Subjt:  ISGVNSFKCNYPIQSW

A0A1S3CCK1 uncharacterized protein LOC103499454 (e value: 2.3e-217; % identity: 90.05)
Query:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS
        MKN   CL+ETK+DFCAAV KGISV DEISSPINAQIFDFCDPELF ETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDN    NGN NGNT+A AAS
Subjt:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS

Query:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV
        FIPANDASAATNITTNSTSNL+AI DSQEELDNDISASI+FSPSASFS+PQYLTIQSG FDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDEDLPSIYV
Subjt:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV

Query:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
        DDCLSSLTSYMP+NP+SPSCSFVGA+MATYLPTTSMNPATSTVESCGMFSLLG ELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
Subjt:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM

Query:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE
        NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE          
Subjt:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE

Query:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
                VVVKEEDSM+DSSDIFAHISGVNSFKCNYPIQSW
Subjt:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

A0A5A7SLG3 CCT domain-containing protein (e value: 8.8e-230; % identity: 93.67)
Query:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS
        MKN   CL+ETK+DFCAAV KGIS  DEISSPINAQIFDFCDPELF ETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDN    NGN NGNT+A AAS
Subjt:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS

Query:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV
        FIPANDASAATNITTNSTSNL+AI DSQEELDNDISASI+FSPSASFS+PQYLTIQSG FDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDEDLPSIYV
Subjt:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV

Query:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
        DDCLSSLTSYMP+NP+SPSCSFVGA+MATYLPTTSMNPATSTVESCGMFSLLG ELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
Subjt:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM

Query:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE
        NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE
Subjt:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE

Query:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
        S+SNWENRVVVKEEDSM+DSSDIFAHISGVNSFKCNYPIQSW
Subjt:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

A0A6J1H4B0 uncharacterized protein LOC111460284 isoform X2 (e value: 2.0e-189; % identity: 82.29)
Query:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNS-EFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAA
        MKNG SCL+E KVDFC  V KGIS  +EISSPINAQI+DFCD ELF+E LQNS EFNS SNC YD NS YATNL +SPDQ DN    NGN NGNT+ AA 
Subjt:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNS-EFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAA

Query:  SFIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIY
        SF+PANDASA TNITTN  SNLT I D QEELDNDISASIDFSPS SFSI QYLTIQSG FD+SQVQSQMPL+DPMI+GL+QCPMAP GTLIDEDLPSIY
Subjt:  SFIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIY

Query:  VDDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMN--PATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAA
        VDDCLSS TSYMPLNPSSPSCSFVGATM TYLPT  MN   ++S+VE+CGMF LL  ELQPQDLDYQGDNCGLYSQD MQGTFNPADLQVL++ENLQLAA
Subjt:  VDDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMN--PATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAA

Query:  GAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEP
        GAMNCTSLASDLSSLKDSTFKVGKLS+EERK+KIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDEL ENHRAACS HE EEEEEVKLL +P
Subjt:  GAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEP

Query:  TFESFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        TF+  S     VVVKEEDSMVDSSDIFAHISGVNS K +YPIQSWI
Subjt:  TFESFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

B0F827 Zinc finger-like protein (e value: 1.4e-211; % identity: 89.59)
Query:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS
        MKN   CL+ETKV FCA V K IS  DEISSPINAQIFDFCDPELF ETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDN  NGNGN NGNT+A AAS
Subjt:  MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAAS

Query:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV
        FIP NDASAATNITTNS SNLTAI DSQEELDNDISASIDFSPSASFSIPQYLTIQSG FDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDEDLPSIYV
Subjt:  FIPANDASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYV

Query:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
        DDCLSSLTSYMPLNP+SPSCSFVG TMATYLPTTSMNPATSTVESCGMFSLLG +L  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
Subjt:  DDCLSSLTSYMPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM

Query:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE
        NCTSLASDLSSLKDSTFKVGKLS EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE          
Subjt:  NCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFE

Query:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
                VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
Subjt:  SFSNWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

SwissProt top hits (e value, % identity, alignment)
E5RQA1 Transcription factor GHD7 (e value: 2.6e-05; % identity: 48.53)
Query:  SLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE
        ++A D  SL  +T  VG  +M ER+ K+ RY +KR +R + K+I+YA RK  A+ RPRVRGRFAK  +
Subjt:  SLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE

O50055 Zinc finger protein CONSTANS-LIKE 1 (e value: 1.3e-04; % identity: 43.33)
Query:  LSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACS
        LS  +R+ ++ RY +K+  R F K I+YA RK  A+ RPR++GRFAK  ++ E    A S
Subjt:  LSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACS

O82117 Zinc finger protein CO3 (e value: 4.5e-05; % identity: 50)
Query:  ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE
        +R+ ++HRY +KR  R F K I+YA RK  A++RPR++GRFAK  +
Subjt:  ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE

Q9FHH8 Zinc finger protein CONSTANS-LIKE 5 (e value: 4.5e-05; % identity: 43.94)
Query:  SMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAK-----NDELAENHRAACSNH
        S  +R+ ++ RY +KR  R F K I+YA RK  A+SRPR++GRFAK     ND++  +H  A + H
Subjt:  SMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAK-----NDELAENHRAACSNH

Q9SK53 Zinc finger protein CONSTANS-LIKE 3 (e value: 2.0e-05; % identity: 49.09)
Query:  KLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAEN
        +LS  ER+ ++ RY +KR  R F K I+YA RK  A+ RPR++GRFAK  +  EN
Subjt:  KLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAEN
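The % identity column can be related to the alignment midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch (the usual BLAST pairwise notation). The following is a minimal sketch, assuming identity is the number of midline letters divided by the alignment length. `percent_identity` is a hypothetical helper, and the two strings are copied from the O82117 hit above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    A letter in the midline marks an identical residue;
    '+' (conservative substitution) and ' ' (mismatch) are not counted.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)

# Query and midline of the O82117 hit (46 aligned positions).
QUERY = "ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE"
MIDLINE = "+R+ ++HRY +KR  R F K I+YA RK  A++RPR++GRFAK  +"

print(percent_identity(QUERY, MIDLINE))
```

For this hit the midline carries 23 letters over 46 positions, which agrees with the 50 listed in the % identity column.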

Arabidopsis top hits (e value, % identity, alignment)
AT1G04500.1 CCT motif family protein (e value: 2.6e-72; % identity: 44.5)
Query:  DEISSPINAQIFDFCDPELFTETL-QNSEFNSCSN-CCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAASFIPANDASAATNITTNSTSNLTAI
        DEI+SP+ AQIFDFCD +LF ET  Q SE  S SN C Y +N+    N +N PD++++  N +   N                        N  ++L+ I
Subjt:  DEISSPINAQIFDFCDPELFTETL-QNSEFNSCSN-CCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAASFIPANDASAATNITTNSTSNLTAI

Query:  LDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLP---SIYVDDCLSSLTSYM--PLNPSSPS
         DSQ++ DNDI+ASIDFS S  F     L  Q   FD + +Q   P            P     +   + LP   S++ +DCLSS+ SY    +NPSSPS
Subjt:  LDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLP---SIYVDDCLSSLTSYM--PLNPSSPS

Query:  CSFVGAT-MATYLPTTS--MNPATSTVESCGMFSLLGTELQP---QDLDYQGDNCGLYSQDCMQGTFNPAD-----LQVLNNENLQLAAGAMNCTSLASD
        CSF+G T + TY+  T   MN    +    G    LG++ +P   Q ++ Q DN GL+  D ++  FNP D     L  + N+N  +A   +    L ++
Subjt:  CSFVGAT-MATYLPTTS--MNPATSTVESCGMFSLLGTELQP---QDLDYQGDNCGLYSQDCMQGTFNPAD-----LQVLNNENLQLAAGAMNCTSLASD

Query:  LSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFESFSNWEN
        ++ L D +F KVGKLS E+RKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE  E +R ACS+H  +++++V                
Subjt:  LSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFESFSNWEN

Query:  RVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
           VKEE+ +VDSSDIF+HISGVNSFKCNYPIQSWI
Subjt:  RVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

AT1G63820.1 CCT motif family protein (e value: 6.2e-18; % identity: 45.31)
Query:  YSQDCMQGTFNPADLQVLNNE-------NLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVR
        +  D M+  ++  DLQ L  +       +  LAA +   T  + D  SL     +VG+ S EERKEKI +Y  KR +RNF+K IKYACRKTLAD+RPRVR
Subjt:  YSQDCMQGTFNPADLQVLNNE-------NLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVR

Query:  GRFAKNDELAENHRAACSNHEGEEEEEV
        GRFA+NDE+ EN + A S    E ++++
Subjt:  GRFAKNDELAENHRAACSNHEGEEEEEV

AT2G33350.1 CCT motif family protein (e value: 9.5e-67; % identity: 43.41)
Query:  DEISSPINAQIFDFCDPELFTETL-QNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAASFIPANDASAATNITTNSTSNLTAIL
        D+I+SP++AQIFDFCDP+LF ET  Q+SE  S SN   +K+  + +N +N+   T+N+ N N N N N                  +   N+ ++L+ I 
Subjt:  DEISSPINAQIFDFCDPELFTETL-QNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAASFIPANDASAATNITTNSTSNLTAIL

Query:  DSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGHFDVS---QVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPS-IYVDDCLSSLTSY-MPLNPSS
        DSQE+ +NDI+ASIDFS S+  + +  +L   I    FD S   QV  Q P +    + L    ++ + +L    L S ++ +DCLSS+ SY + LN   
Subjt:  DSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGHFDVS---QVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPS-IYVDDCLSSLTSY-MPLNPSS

Query:  PSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTEL-QPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLAS-------
        PSCSF  ++      +T +  A S +        +G+E+ +P D  +D+Q DN G +  D ++  FNP DLQ        L  GA N + L +       
Subjt:  PSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTEL-QPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLAS-------

Query:  ---DLSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFESFS
           D++ L+DST  KVGKLS E+RKEKI RYMKKRNERNF+KKIKYACRKTLADSRPRVRGRFAKNDE  E +R A S+H  +E+E+             
Subjt:  ---DLSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFESFS

Query:  NWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
             + VK+E+ +VDSSDIFAHISG NSFKCNYPIQSWI
Subjt:  NWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

AT2G33350.2 CCT motif family protein (e value: 5.5e-67; % identity: 43.41)
Query:  DEISSPINAQIFDFCDPELFTETL-QNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAASFIPANDASAATNITTNSTSNLTAIL
        D+I+SP++AQIFDFCDP+LF ET  Q+SE  S SN   +K+  + +N +N+   T+N+ N N N N N                  +   N+ ++L+ I 
Subjt:  DEISSPINAQIFDFCDPELFTETL-QNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAASFIPANDASAATNITTNSTSNLTAIL

Query:  DSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGHFDVS---QVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPS-IYVDDCLSSLTSY-MPLNPSS
        DSQE+ +NDI+ASIDFS S+  + +  +L   I    FD S   QV  Q P +    + L    ++ + +L    L S ++ +DCLSS+ SY + LN   
Subjt:  DSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGHFDVS---QVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPS-IYVDDCLSSLTSY-MPLNPSS

Query:  PSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTEL-QPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLAS-------
        PSCSF  ++      +T +  A S +        +G+E+ +P D  +D+Q DN G +  D ++  FNP DLQ        L  GA N + L +       
Subjt:  PSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTEL-QPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLAS-------

Query:  ---DLSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFESFS
           D++ L+DST  KVGKLS E+RKEKI RYMKKRNERNF+KKIKYACRKTLADSRPRVRGRFAKNDE  E +R A S+H  +E+E+             
Subjt:  ---DLSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFESFS

Query:  NWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
             + VK+E+ +VDSSDIFAHISG NSFKCNYPIQSWI
Subjt:  NWENRVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

AT5G41380.1 CCT motif family protein (e value: 2.8e-18; % identity: 48.31)
Query:  MQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAEN
        M+  ++  DLQ     N+++   + N T   S+     +  FKVG+ S EERKEKI +Y  KRN+RNF+K IKYACRKTLADSRPR+RGRFA+NDE+ E 
Subjt:  MQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAEN

Query:  HRAACSNHEGEEEEEVKL
              N E ++ E  KL
Subjt:  HRAACSNHEGEEEEEVKL


Sequences
CDS sequence
ATGAAGAATGGTATTTCTTGCTTGTATGAAACAAAGGTTGATTTCTGTGCGGCCGTGCGTAAAGGCATATCGGTAGAAGATGAAATCTCGAGTCCGATCAATGCG
CAGATATTTGATTTCTGTGATCCTGAGCTGTTCACGGAGACGCTTCAGAACTCCGAGTTCAATTCTTGCTCGAATTGTTGTTATGACAAGAATTCGCCATATGCT
ACAAATCTGTCTAATTCCCCGGATCAAACAGATAACAATTGCAATGGCAATGGCAATAGCAATGGCAATACCATTGCCGCTGCTGCATCGTTTATACCTGCTAAT
GACGCATCGGCTGCAACTAACATAACGACCAACAGTACTAGTAATCTGACTGCTATCTTGGATTCCCAAGAAGAACTTGACAATGATATCTCTGCTTCCATAGAC
TTCTCTCCATCCGCTTCGTTTTCGATCCCTCAATATCTCACCATCCAGTCGGGGCATTTTGACGTTTCTCAAGTGCAGTCTCAAATGCCATTAGTAGATCCCATG
ATTGAGGGGCTTGTGCAGTGTCCTATGGCTCCAGTTGGGACTCTCATCGACGAAGATCTACCGTCGATTTACGTCGACGATTGCTTATCTTCCTTGACTTCCTAC
ATGCCACTGAATCCTTCTTCCCCTTCGTGCTCGTTTGTTGGAGCAACCATGGCAACTTACCTGCCTACTACGTCAATGAATCCTGCTACATCGACTGTCGAAAGT
TGTGGAATGTTTTCTCTCCTCGGCACAGAATTGCAACCGCAAGACCTCGACTATCAAGGAGACAACTGTGGACTCTACAGCCAAGACTGTATGCAGGGGACTTTC
AATCCAGCAGACCTTCAGGTGCTTAACAATGAGAATCTACAACTGGCTGCTGGGGCAATGAACTGCACTTCTTTAGCATCAGATCTCTCAAGCTTAAAGGACAGT
ACTTTCAAAGTAGGAAAACTCTCCATGGAAGAGAGAAAGGAGAAGATTCATAGGTACATGAAGAAAAGAAATGAGAGGAACTTCAGCAAGAAAATCAAGTATGCC
TGCCGAAAAACGCTAGCAGATAGCCGGCCAAGGGTTCGAGGACGTTTCGCGAAGAATGACGAATTGGCAGAGAATCACAGGGCTGCTTGTAGCAACCATGAAGGA
GAAGAAGAAGAAGAAGTAAAGCTCCTTTTTGAGCCAACTTTTGAAAGTTTCTCCAATTGGGAGAACAGAGTAGTTGTGAAGGAAGAAGATAGCATGGTTGATTCC
TCAGATATCTTTGCTCATATCAGTGGAGTGAACTCCTTCAAGTGCAACTATCCAATCCAGTCTTGGATTTGA
mRNA sequence
ATGAAGAATGGTATTTCTTGCTTGTATGAAACAAAGGTTGATTTCTGTGCGGCCGTGCGTAAAGGCATATCGGTAGAAGATGAAATCTCGAGTCCGATCAATGCG
CAGATATTTGATTTCTGTGATCCTGAGCTGTTCACGGAGACGCTTCAGAACTCCGAGTTCAATTCTTGCTCGAATTGTTGTTATGACAAGAATTCGCCATATGCT
ACAAATCTGTCTAATTCCCCGGATCAAACAGATAACAATTGCAATGGCAATGGCAATAGCAATGGCAATACCATTGCCGCTGCTGCATCGTTTATACCTGCTAAT
GACGCATCGGCTGCAACTAACATAACGACCAACAGTACTAGTAATCTGACTGCTATCTTGGATTCCCAAGAAGAACTTGACAATGATATCTCTGCTTCCATAGAC
TTCTCTCCATCCGCTTCGTTTTCGATCCCTCAATATCTCACCATCCAGTCGGGGCATTTTGACGTTTCTCAAGTGCAGTCTCAAATGCCATTAGTAGATCCCATG
ATTGAGGGGCTTGTGCAGTGTCCTATGGCTCCAGTTGGGACTCTCATCGACGAAGATCTACCGTCGATTTACGTCGACGATTGCTTATCTTCCTTGACTTCCTAC
ATGCCACTGAATCCTTCTTCCCCTTCGTGCTCGTTTGTTGGAGCAACCATGGCAACTTACCTGCCTACTACGTCAATGAATCCTGCTACATCGACTGTCGAAAGT
TGTGGAATGTTTTCTCTCCTCGGCACAGAATTGCAACCGCAAGACCTCGACTATCAAGGAGACAACTGTGGACTCTACAGCCAAGACTGTATGCAGGGGACTTTC
AATCCAGCAGACCTTCAGGTGCTTAACAATGAGAATCTACAACTGGCTGCTGGGGCAATGAACTGCACTTCTTTAGCATCAGATCTCTCAAGCTTAAAGGACAGT
ACTTTCAAAGTAGGAAAACTCTCCATGGAAGAGAGAAAGGAGAAGATTCATAGGTACATGAAGAAAAGAAATGAGAGGAACTTCAGCAAGAAAATCAAGTATGCC
TGCCGAAAAACGCTAGCAGATAGCCGGCCAAGGGTTCGAGGACGTTTCGCGAAGAATGACGAATTGGCAGAGAATCACAGGGCTGCTTGTAGCAACCATGAAGGA
GAAGAAGAAGAAGAAGTAAAGCTCCTTTTTGAGCCAACTTTTGAAAGTTTCTCCAATTGGGAGAACAGAGTAGTTGTGAAGGAAGAAGATAGCATGGTTGATTCC
TCAGATATCTTTGCTCATATCAGTGGAGTGAACTCCTTCAAGTGCAACTATCCAATCCAGTCTTGGATTTGA
Protein sequence
MKNGISCLYETKVDFCAAVRKGISVEDEISSPINAQIFDFCDPELFTETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNCNGNGNSNGNTIAAAASFIPAN
DASAATNITTNSTSNLTAILDSQEELDNDISASIDFSPSASFSIPQYLTIQSGHFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEDLPSIYVDDCLSSLTSY
MPLNPSSPSCSFVGATMATYLPTTSMNPATSTVESCGMFSLLGTELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDS
TFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVKLLFEPTFESFSNWENRVVVKEEDSMVDS
SDIFAHISGVNSFKCNYPIQSWI
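
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). `translate` is a hypothetical helper, and only the first 30 nt and the last line of the CDS are used, for brevity.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; amino acids listed in TCAG codon order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS.
print(translate("ATGAAGAATGGTATTTCTTGCTTGTATGAA"))
# Last line of the CDS, which ends in the TGA stop codon.
print(translate("TCAGATATCTTTGCTCATATCAGTGGAGTGAACTCCTTCAAGTGCAACTATCCAATCCAGTCTTGGATTTGA"))
```

The two fragments translate to `MKNGISCLYE` and `SDIFAHISGVNSFKCNYPIQSWI`, matching the start and end of the protein sequence listed above.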