CuGenDBv2

Chy5G093380 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy5G093380
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: CCT domain-containing protein
Genome location: chrH05:2853735..2865564
RNA-Seq Expression: Chy5G093380
Synteny: Chy5G093380
Gene Ontology terms:
    GO:0005634 - nucleus (cellular component)
    GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR006571 - TLDc domain
    IPR010402 - CCT domain
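
The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, the snippet below splits such a string and computes the length of the locus. It assumes the coordinates are 1-based and inclusive, which this record does not state; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str):
    """Split 'chrom:start..end' into its parts and compute the span.

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. Adjust if the source uses another convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chrH05:2853735..2865564")
print(chrom, start, end, span)  # chrH05 2853735 2865564 11830
```

Under that assumption the locus spans 11830 bp, which is longer than the CDS listed further down and would be consistent with the gene containing introns.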


Homology
GenBank top hits (columns: e-value, % identity, alignment)
XP_004147186.1 uncharacterized protein LOC101214336 isoform X3 [Cucumis sativus] (e-value: 1.22e-274; identity: 98.05%)
Query:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATN
        MLQDVMSSASDQML+IDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGN NTVAGAASFIP NDASAATN
Subjt:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATN

Query:  ITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMP
        ITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQ+QSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMP
Subjt:  ITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMP

Query:  LNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKD
        LNPASPSCSFVGTTMATYLPTT+MNPATSTVESCGMFSLLG +LQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKD
Subjt:  LNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKD

Query:  STFKVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-VVVKEEDSMVDSSDIFAHISGVNS
        STFKVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE VVVKEEDSMVDSSDIFAHISGVNS
Subjt:  STFKVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-VVVKEEDSMVDSSDIFAHISGVNS

Query:  FKCNYPIQSWT
        FKCNYPIQSWT
Subjt:  FKCNYPIQSWT

XP_011649131.1 uncharacterized protein LOC101214336 isoform X1 [Cucumis sativus] (e-value: 1.82e-264; identity: 98.23%)
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ
        DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGN NTVAGAASFIP NDASAATNITTNSASNLTAIFDSQ
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ

Query:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA
        EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQ+QSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA
Subjt:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA

Query:  TYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK
        TYLPTT+MNPATSTVESCGMFSLLG +LQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK
Subjt:  TYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK

Query:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT
        IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT
Subjt:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT

XP_011649132.1 uncharacterized protein LOC101214336 isoform X2 [Cucumis sativus] (e-value: 1.69e-264; identity: 98.23%)
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ
        DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGN NTVAGAASFIP NDASAATNITTNSASNLTAIFDSQ
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ

Query:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA
        EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQ+QSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA
Subjt:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA

Query:  TYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK
        TYLPTT+MNPATSTVESCGMFSLLG +LQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK
Subjt:  TYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK

Query:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT
        IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT
Subjt:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT

XP_031736479.1 uncharacterized protein LOC101214336 isoform X4 [Cucumis sativus] (e-value: 1.07e-264; identity: 98.23%)
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ
        DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGN NTVAGAASFIP NDASAATNITTNSASNLTAIFDSQ
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ

Query:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA
        EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQ+QSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA
Subjt:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA

Query:  TYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK
        TYLPTT+MNPATSTVESCGMFSLLG +LQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK
Subjt:  TYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK

Query:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT
        IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT
Subjt:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT

XP_038875445.1 uncharacterized protein LOC120067896 isoform X3 [Benincasa hispida] (e-value: 2.05e-258; identity: 93.45%)
Query:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATN
        MLQDV+ SA +QML IDEISSPINAQIFDFCDPELFAETLQ+SEFNSCSNCCYDKNSPY TNLSNSPDQTDNNGN NGN  TVA AASF+PANDASAATN
Subjt:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATN

Query:  ITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMP
        ITTNS SNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQ+QSQMPL+DPMIEGLVQCPMAPVG LIDEDLPSIYVDDCLSSLTSYMP
Subjt:  ITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMP

Query:  LNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQ--DLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSL
        LNP+SPSCSFVG TMATYLPTT+M PATSTVESCGMFSLLG ELQ  DLDYQGDNCGLY+QDCMQGTFNPADLQVLNNENLQL AGAMNCTSLASDLSSL
Subjt:  LNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQ--DLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSL

Query:  KDSTFKVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-VVVKEEDSMVDSSDIFAHISGV
        KDSTFKVGKLS EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE VVVKEEDSMVDSSDIFAHISGV
Subjt:  KDSTFKVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-VVVKEEDSMVDSSDIFAHISGV

Query:  NSFKCNYPIQSW
        NSFKCNYPIQSW
Subjt:  NSFKCNYPIQSW

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A0A0LIG4 TLDc domain-containing protein (e-value: 1.3e-181; identity: 96.95%)
Query:  MQAMKEKVSERFSSLFSNSTSSESSKPPPPDPRTQARPKSKGRKSLSSYLSLIIPSIHGSKPSASRQDTDAVQSPSVRYCDANNDFQEEGSDTSLGCSIP
        MQAMKEKVSERFS LFSNSTSSESSKPPPPDPRTQARPKSKGRKSLSSYLSLIIPSIHGSKPSASRQDTDAVQSPSVRYCDANNDFQEEGSDTSLGCSIP
Subjt:  MQAMKEKVSERFSSLFSNSTSSESSKPPPPDPRTQARPKSKGRKSLSSYLSLIIPSIHGSKPSASRQDTDAVQSPSVRYCDANNDFQEEGSDTSLGCSIP

Query:  FKTEEIPRNQGENKDCGSAYDEEKLNKLRDEYDSACRKSTCSSDGFEEAMERPTPRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHG
        FKTEEIPR+QGENKDCGSAYDE KLNKL  EYDSACRKSTCSSDGFEEAMERPTPRNPLSDLM ESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHG
Subjt:  FKTEEIPRNQGENKDCGSAYDEEKLNKLRDEYDSACRKSTCSSDGFEEAMERPTPRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHG

Query:  ISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVFTTKYGDPRLFRATGANHYYYICLNDLLALGGGGSFALCLDGDLLSG
        ISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLK TAKRKYQGTHQTFVFTTKYGDPRLFRATGANHYYYICL DLLALGGGGSFALCLDGDLLSG
Subjt:  ISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVFTTKYGDPRLFRATGANHYYYICLNDLLALGGGGSFALCLDGDLLSG

Query:  TSGPCDTFGSLCLAHDPEFELKNVETTG
        TSGPCDTFGSLCLAHDPEFELKNVE  G
Subjt:  TSGPCDTFGSLCLAHDPEFELKNVETTG

A0A0A0LKJ8 CCT domain-containing protein (e-value: 2.5e-220; identity: 98.05%)
Query:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATN
        MLQDVMSSASDQML+IDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGN NTVAGAASFIP NDASAATN
Subjt:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATN

Query:  ITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMP
        ITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQ+QSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMP
Subjt:  ITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMP

Query:  LNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKD
        LNPASPSCSFVGTTMATYLPTT+MNPATSTVESCGMFSLLG +LQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKD
Subjt:  LNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKD

Query:  STFKVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEG-EEEEVVVKEEDSMVDSSDIFAHISGVNS
        STFKVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEG EEEEVVVKEEDSMVDSSDIFAHISGVNS
Subjt:  STFKVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEG-EEEEVVVKEEDSMVDSSDIFAHISGVNS

Query:  FKCNYPIQSWT
        FKCNYPIQSWT
Subjt:  FKCNYPIQSWT

A0A1S3CCK1 uncharacterized protein LOC103499454 (e-value: 6.0e-206; identity: 95.72%)
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ
        DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNG  NTVAGAASFIPANDASAATNITTNS SNL+AIFDSQ
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ

Query:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA
        EELDNDISASI+FSPSASFS+PQYLTIQSGQFDVSQ+QSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMP+NPASPSCSFVG +MA
Subjt:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA

Query:  TYLPTTTMNPATSTVESCGMFSLLGHEL--QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERK
        TYLPTT+MNPATSTVESCGMFSLLG EL  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLS EERK
Subjt:  TYLPTTTMNPATSTVESCGMFSLLGHEL--QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERK

Query:  EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEG-EEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT
        EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEG EEEEVVVKEEDSM+DSSDIFAHISGVNSFKCNYPIQSWT
Subjt:  EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEG-EEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT

A0A5A7SLG3 CCT domain-containing protein (e-value: 7.4e-204; identity: 91.57%)
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ
        DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNG  NTVAGAASFIPANDASAATNITTNS SNL+AIFDSQ
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ

Query:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA
        EELDNDISASI+FSPSASFS+PQYLTIQSGQFDVSQ+QSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMP+NPASPSCSFVG +MA
Subjt:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA

Query:  TYLPTTTMNPATSTVESCGMFSLLGHEL--QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERK
        TYLPTT+MNPATSTVESCGMFSLLG EL  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLS EERK
Subjt:  TYLPTTTMNPATSTVESCGMFSLLGHEL--QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERK

Query:  EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-------------------VVVKEEDSMVDSSDIFAHIS
        EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE                   VVVKEEDSM+DSSDIFAHIS
Subjt:  EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE-------------------VVVKEEDSMVDSSDIFAHIS

Query:  GVNSFKCNYPIQSWT
        GVNSFKCNYPIQSWT
Subjt:  GVNSFKCNYPIQSWT

B0F827 Zinc finger-like protein (e-value: 1.5e-212; identity: 98.23%)
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ
        DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGN NTVAGAASFIP NDASAATNITTNSASNLTAIFDSQ
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAATNITTNSASNLTAIFDSQ

Query:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA
        EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQ+QSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA
Subjt:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYMPLNPASPSCSFVGTTMA

Query:  TYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK
        TYLPTT+MNPATSTVESCGMFSLLG +LQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK
Subjt:  TYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSTEERKEK

Query:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEG-EEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT
        IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEG EEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT
Subjt:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEG-EEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT

SwissProt top hits (columns: e-value, % identity, alignment)
A8KBE0 Oxidation resistance protein 1 (e-value: 1.0e-21; identity: 39.53%)
Query:  PRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVF
        P +   +L D S+ + +   E L   LP    G  W L+YST KHG+SL+TL R    L  P LL++ D+   IFG L   P K      + GT +TF+F
Subjt:  PRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVF

Query:  TTKYGDPRLFRATGANHYYYICLNDLLAL-GGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVE
        T    D  +F+ TG N ++     D LA  GGGG FAL LDGDL  G S  C TFG+  L+   +F ++++E
Subjt:  TTKYGDPRLFRATGANHYYYICLNDLLAL-GGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVE

Q4KMM3 Oxidation resistance protein 1 (e-value: 8.8e-21; identity: 38.37%)
Query:  PRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVF
        P +   +L D S  +     E L   LP    G  W L+Y T KHG SL+TL R    L  P L+++ D+ G +FG L   P K      + GT +TFVF
Subjt:  PRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVF

Query:  TTKYGDPRLFRATGANHYYYICLNDLLAL-GGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVE
        T    +  +F+ TG N ++     D LA  GGGG FAL LDGDL  G S  C TFG+  L+   +F ++++E
Subjt:  TTKYGDPRLFRATGANHYYYICLNDLLAL-GGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVE

Q4V8B0 Oxidation resistance protein 1 (e-value: 8.8e-21; identity: 38.37%)
Query:  PRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVF
        P +   +L D S  +     E L   LP    G  W L+Y T KHG SL+TL R    L  P L+++ D+ G +FG L   P K      + GT +TFVF
Subjt:  PRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVF

Query:  TTKYGDPRLFRATGANHYYYICLNDLLAL-GGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVE
        T    +  +F+ TG N ++     D LA  GGGG FAL LDGDL  G S  C TFG+  L+   +F ++++E
Subjt:  TTKYGDPRLFRATGANHYYYICLNDLLAL-GGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVE

Q6DFV7 Nuclear receptor coactivator 7 (e-value: 6.8e-21; identity: 37.37%)
Query:  RKSTCSSDGFEEAMERPTPRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECP
        RKSTCS    EE  E   P      L   SA + +   E L   LP  V+G  W L YST++HG SL+TL R S +L  P LL++ D    IFG     P
Subjt:  RKSTCSSDGFEEAMERPTPRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECP

Query:  LKPTAKRKYQGTHQTFVFTTKYGDPRLFRATGANHYYYI-CLNDLLALGGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVE
         K      Y GT +TF++T    + ++F+ +G N Y+    ++ L   GGGG F L LD DL  G S  C TF +  L+   +F ++++E
Subjt:  LKPTAKRKYQGTHQTFVFTTKYGDPRLFRATGANHYYYI-CLNDLLALGGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVE

Q8N573 Oxidation resistance protein 1 (e-value: 3.0e-21; identity: 39.76%)
Query:  DLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVFTTKYGD
        +L D S  +     E L   LP    G  W L+Y T KHG SL+TL R    L  P L+++ D+ G +FG L   PLK      + GT +TFVFT    +
Subjt:  DLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVFTTKYGD

Query:  PRLFRATGANHYYYICLNDLLAL-GGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVE
          +F+ TG N ++     D LA  GGGG FAL LDGDL  G S  C TFG+  L+   +F ++++E
Subjt:  PRLFRATGANHYYYICLNDLLAL-GGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVE

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT1G04500.1 CCT motif family protein (e-value: 7.5e-76; identity: 46.99%)
Query:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSN-CCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAA
        M QDV+S  S + LS+DEI+SP+ AQIFDFCD +LF ET  Q SE  S SN C Y +N+    N +N PD++++  N +   N                 
Subjt:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSN-CCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAA

Query:  TNITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLP---SIYVDDCLSSL
             N  ++L+ IFDSQ++ DNDI+ASIDFS S  F     L     QFD + +Q   P            P     +   + LP   S++ +DCLSS+
Subjt:  TNITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLP---SIYVDDCLSSL

Query:  TSYM--PLNPASPSCSFVGTT-MATYLPTTTMNPATSTVESCGMFS---LLGHEL-----QDLDYQGDNCGLYSQDCMQGTFNPAD-----LQVLNNENL
         SY    +NP+SPSCSF+G T + TY+ T T N   + + S G +S    LG +      Q ++ Q DN GL+  D ++  FNP D     L  + N+N 
Subjt:  TSYM--PLNPASPSCSFVGTT-MATYLPTTTMNPATSTVESCGMFS---LLGHEL-----QDLDYQGDNCGLYSQDCMQGTFNPAD-----LQVLNNENL

Query:  QLAAGAMNCTSLASDLSSLKDSTF-KVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAAC-SNHEGEEEEVV
         +A   +    L ++++ L D +F KVGKLS E+RKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE  E +R AC S+HE ++++V 
Subjt:  QLAAGAMNCTSLASDLSSLKDSTF-KVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAAC-SNHEGEEEEVV

Query:  VKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
        VKEE+ +VDSSDIF+HISGVNSFKCNYPIQSW
Subjt:  VKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

AT2G05590.1 TLD-domain containing nucleolar protein (e-value: 9.0e-61; identity: 44%)
Query:  MQAMKEKVSERFSSLFSNSTSSESSKPPPPDPRTQARPKSKGRKSLSSYLSLIIPSIHGSKPSASRQDTDAVQSPS---VRYCDANNDFQEEGSDTSLGC
        M A+K+KVS++ S+LF++S S  +S         +AR  S   KSLSSY S ++P   G++  +       +++ S   +  C + N   + G+  S+  
Subjt:  MQAMKEKVSERFSSLFSNSTSSESSKPPPPDPRTQARPKSKGRKSLSSYLSLIIPSIHGSKPSASRQDTDAVQSPS---VRYCDANNDFQEEGSDTSLGC

Query:  SIPFKTEEIPRNQGENKDCGSAYDEEKLNKLRDEYDSACRKSTCSSDGFEEAMERPTPRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTM
                     GE+KDC              E   + +     +D F+   +       + +L + S FIT++L+EFL   LPNIV+GCKW+LLYST+
Subjt:  SIPFKTEEIPRNQGENKDCGSAYDEEKLNKLRDEYDSACRKSTCSSDGFEEAMERPTPRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTM

Query:  KHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVFTTKYGDPRLFRATGANHYYYICLNDLLALGGGGSFALCLDGDL
        KHGISL+TL+R S  LPGPCLL+ GD +GA+FG LLECPL+PT KRKYQGT QTF+FTT YG+PR+FR TGAN YY +C+N+ LA GGGG+FALCLD DL
Subjt:  KHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVFTTKYGDPRLFRATGANHYYYICLNDLLALGGGGSFALCLDGDL

AT2G05590.2 TLD-domain containing nucleolar protein (e-value: 1.0e-72; identity: 45.92%)
Query:  MQAMKEKVSERFSSLFSNSTSSESSKPPPPDPRTQARPKSKGRKSLSSYLSLIIPSIHGSKPSASRQDTDAVQSPS---VRYCDANNDFQEEGSDTSLGC
        M A+K+KVS++ S+LF++S S  +S         +AR  S   KSLSSY S ++P   G++  +       +++ S   +  C + N   + G+  S+  
Subjt:  MQAMKEKVSERFSSLFSNSTSSESSKPPPPDPRTQARPKSKGRKSLSSYLSLIIPSIHGSKPSASRQDTDAVQSPS---VRYCDANNDFQEEGSDTSLGC

Query:  SIPFKTEEIPRNQGENKDCGSAYDEEKLNKLRDEYDSACRKSTCSSDGFEEAMERPTPRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTM
                     GE+KDC              E   + +     +D F+   +       + +L + S FIT++L+EFL   LPNIV+GCKW+LLYST+
Subjt:  SIPFKTEEIPRNQGENKDCGSAYDEEKLNKLRDEYDSACRKSTCSSDGFEEAMERPTPRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTM

Query:  KHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVFTTKYGDPRLFRATGANHYYYICLNDLLALGGGGSFALCLDGDL
        KHGISL+TL+R S  LPGPCLL+ GD +GA+FG LLECPL+PT KRKYQGT QTF+FTT YG+PR+FR TGAN YY +C+N+ LA GGGG+FALCLD DL
Subjt:  KHGISLQTLIRNSHNLPGPCLLIVGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVFTTKYGDPRLFRATGANHYYYICLNDLLALGGGGSFALCLDGDL

Query:  LSGTSGPCDTFGSLCLAHDPEFELKNVETTG
        L  TSGP +TFG+ CLA   EFELKNVE  G
Subjt:  LSGTSGPCDTFGSLCLAHDPEFELKNVETTG

AT2G33350.1 CCT motif family protein (e-value: 1.3e-72; identity: 45.64%)
Query:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAAT
        MLQD++SS S   LSID+I+SP++AQIFDFCDP+LF ET  Q+SE  S SN   +K+  + +N +N+   T+N+ N N N NT             +   
Subjt:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAAT

Query:  NITTNSASNLTAIFDSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGQFDVS---QLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPS-IYVDDCL
        +   N+ ++L+ IFDSQE+ +NDI+ASIDFS S+  + +  +L   I   QFD S   Q+  Q P +    + L    ++ + +L    L S ++ +DCL
Subjt:  NITTNSASNLTAIFDSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGQFDVS---QLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPS-IYVDDCL

Query:  SSLTSY-MPLNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQD-----LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
        SS+ SY + LN   PSCSF  ++      +T +  A S +        +G E+       +D+Q DN G +  D ++  FNP DLQ        L  GA 
Subjt:  SSLTSY-MPLNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQD-----LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM

Query:  NCTSLAS----------DLSSLKDSTF-KVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE
        N + L +          D++ L+DST  KVGKLS E+RKEKI RYMKKRNERNF+KKIKYACRKTLADSRPRVRGRFAKNDE  E +R A S+H  +E+E
Subjt:  NCTSLAS----------DLSSLKDSTF-KVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE

Query:  --VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
          + VK+E+ +VDSSDIFAHISG NSFKCNYPIQSW
Subjt:  --VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

AT2G33350.2 CCT motif family protein (e-value: 7.8e-73; identity: 45.64%)
Query:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAAT
        MLQD++SS S   LSID+I+SP++AQIFDFCDP+LF ET  Q+SE  S SN   +K+  + +N +N+   T+N+ N N N NT             +   
Subjt:  MLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASFIPANDASAAT

Query:  NITTNSASNLTAIFDSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGQFDVS---QLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPS-IYVDDCL
        +   N+ ++L+ IFDSQE+ +NDI+ASIDFS S+  + +  +L   I   QFD S   Q+  Q P +    + L    ++ + +L    L S ++ +DCL
Subjt:  NITTNSASNLTAIFDSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGQFDVS---QLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPS-IYVDDCL

Query:  SSLTSY-MPLNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQD-----LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM
        SS+ SY + LN   PSCSF  ++      +T +  A S +        +G E+       +D+Q DN G +  D ++  FNP DLQ        L  GA 
Subjt:  SSLTSY-MPLNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQD-----LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAM

Query:  NCTSLAS----------DLSSLKDSTF-KVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE
        N + L +          D++ L+DST  KVGKLS E+RKEKI RYMKKRNERNF+KKIKYACRKTLADSRPRVRGRFAKNDE  E +R A S+H  +E+E
Subjt:  NCTSLAS----------DLSSLKDSTF-KVGKLSTEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEE

Query:  --VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
          + VK+E+ +VDSSDIFAHISG NSFKCNYPIQSW
Subjt:  --VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
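
In the alignment blocks above, the unlabelled middle line follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how a percent-identity figure relates to that line, the snippet below counts those symbols for one block. `midline_stats` is a hypothetical helper, and the input is a toy example rather than data from this record; a reported identity would normally be computed over all blocks of a hit joined together.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style midline.

    The query line fixes the alignment length, because trailing spaces
    in the midline are often stripped when a page is copied as text.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    percent = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": percent,
    }


# Toy block: 3 identical residues, 1 conservative change, 1 mismatch.
stats = midline_stats("ACDEF", "AC+ F")
print(stats["identities"], stats["positives"], stats["percent_identity"])
```

Gap columns (`-` in the query) are counted in the length here, which is one common convention but an assumption about how the page derived its values.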


Sequences
CDS sequence
ATGCAGGCTATGAAGGAGAAAGTCTCTGAGAGGTTCTCCAGCCTCTTCTCTAATTCCACCAGTTCCGAATCCTCCAAACCACCTCCCCCTGATCCTCGTACTCAGGCCAG
GCCAAAATCGAAAGGAAGAAAATCTCTATCTTCATATTTATCCTTAATAATCCCTTCCATACATGGGTCTAAACCTTCTGCTTCTCGTCAAGACACTGATGCAGTTCAAT
CTCCCTCAGTTCGATATTGTGATGCAAACAATGATTTCCAGGAGGAAGGCTCAGATACTTCTTTAGGATGTAGTATACCATTCAAGACGGAAGAAATACCTAGAAATCAG
GGTGAAAATAAGGATTGTGGTTCAGCATATGATGAGGAAAAACTGAATAAACTAAGAGATGAGTATGACTCGGCATGTAGAAAGAGCACTTGTAGTTCAGATGGGTTTGA
AGAAGCTATGGAGCGACCCACTCCAAGAAACCCTTTATCAGACCTCATGGATGAGTCAGCTTTTATCACTTCACACTTGTATGAATTCCTTGGGTGTTGTCTTCCCAACA
TTGTGAAAGGGTGCAAATGGGTCTTGCTGTACAGTACGATGAAGCATGGTATATCTCTTCAAACTCTTATTCGCAACAGCCACAATCTTCCTGGCCCATGTTTACTGATT
GTTGGAGATACTCGAGGTGCTATATTTGGTGGTCTTCTAGAATGCCCGTTGAAGCCTACAGCCAAAAGAAAATATCAAGGAACTCACCAGACATTTGTTTTTACAACGAA
ATATGGTGACCCAAGGCTTTTTCGAGCAACTGGAGCCAACCACTATTATTATATTTGTTTGAACGATTTACTGGCACTTGGAGGCGGTGGTAGCTTTGCCTTATGTTTGG
ATGGTGACTTATTAAGTGGAACTAGTGGACCGTGTGACACATTTGGTAGCTTATGCTTGGCGCATGACCCAGAGTTTGAGCTAAAAAATGTCGAGACGACCGGCATCCAT
TTCATTCAGATTCTCATTAACAATGCAGCCATTTATTACTTACACCACTTGCCAATAGCAGATGGACGGGTAAGAACTCTCTCCTCTCTTGATCATATTGAACTAATATC
TGCTGTCTTTTATTTATTGATTCTGTTGTACTGTATGCATGCTAATAGAAATCCTGAACACAAATTAAGAAGAATGGACCCCCGCCCCCCCAATCAGGAAAAAGAGAGAG
AAACAGGGGAGACAGTTGGATATGACAAGCTTGCAGTTATGGGAAAAGTAGCTGCAAGTTTGTCTGAATTAGAAACGATTGGTCGCTCCTTTAGAGGGAATTTTGTTGCC
AAATGTGGCAAATGCTTGAAGATCTACTCATTAATAACAATATTAGAGCTTGGTTTCGAAGTTATGTTGCAGGACGTCATGAGCTCTGCGTCAGACCAAATGCTTTCCAT
TGATGAAATCTCTAGTCCGATCAATGCGCAAATATTTGATTTCTGTGACCCCGAGCTGTTCGCGGAGACGCTTCAGAATTCTGAGTTCAATTCTTGCTCAAATTGTTGTT
ACGACAAGAATTCGCCATATGCTACAAATCTGTCTAATTCCCCAGATCAAACAGATAACAATGGCAATGGCAATGGCAATAGCAATACTGTTGCCGGTGCTGCATCGTTT
ATACCTGCTAACGATGCATCAGCTGCAACTAACATAACGACCAACAGTGCTAGTAATCTGACTGCTATCTTTGATTCCCAAGAAGAACTTGACAACGATATCTCTGCTTC
CATAGACTTCTCTCCATCGGCTTCGTTTTCGATCCCTCAATATCTCACTATCCAGTCGGGGCAGTTCGATGTTTCTCAATTGCAATCTCAAATGCCATTGGTAGATCCCA
TGATTGAAGGGCTTGTGCAGTGTCCTATGGCTCCAGTTGGAGCTCTCATTGACGAAGATCTACCGTCGATTTACGTCGATGATTGCTTATCTTCCTTGACTTCCTACATG
CCACTGAATCCTGCTTCCCCTTCATGCTCGTTTGTCGGAACTACCATGGCAACTTACCTGCCTACTACAACAATGAATCCTGCTACATCGACTGTTGAAAGTTGTGGAAT
GTTTTCTCTCCTCGGCCATGAATTGCAAGACCTTGACTATCAAGGAGACAACTGTGGACTCTACAGCCAAGACTGTATGCAGGGGACTTTCAATCCAGCAGATCTTCAGG
TGCTTAACAATGAAAATCTACAACTGGCTGCTGGGGCAATGAACTGCACTTCTTTAGCTTCCGATCTCTCAAGCTTAAAGGACAGTACTTTCAAAGTAGGAAAACTCTCC
ACGGAAGAGAGAAAGGAGAAGATTCATAGGTACATGAAGAAAAGAAATGAGAGAAACTTCAGCAAGAAAATTAAGTATGCCTGCCGAAAAACACTAGCAGATAGCCGACC
ACGGGTTCGAGGACGGTTCGCAAAGAACGACGAATTGGCAGAGAATCACAGGGCTGCTTGTAGCAACCATGAAGGAGAAGAAGAAGAAGTAGTTGTGAAGGAAGAAGATA
GCATGGTTGATTCCTCAGATATCTTTGCTCATATCAGTGGAGTGAACTCCTTCAAGTGCAACTATCCAATTCAGTCTTGGACTTGA
mRNA sequence
ATGCAGGCTATGAAGGAGAAAGTCTCTGAGAGGTTCTCCAGCCTCTTCTCTAATTCCACCAGTTCCGAATCCTCCAAACCACCTCCCCCTGATCCTCGTACTCAGGCCAG
GCCAAAATCGAAAGGAAGAAAATCTCTATCTTCATATTTATCCTTAATAATCCCTTCCATACATGGGTCTAAACCTTCTGCTTCTCGTCAAGACACTGATGCAGTTCAAT
CTCCCTCAGTTCGATATTGTGATGCAAACAATGATTTCCAGGAGGAAGGCTCAGATACTTCTTTAGGATGTAGTATACCATTCAAGACGGAAGAAATACCTAGAAATCAG
GGTGAAAATAAGGATTGTGGTTCAGCATATGATGAGGAAAAACTGAATAAACTAAGAGATGAGTATGACTCGGCATGTAGAAAGAGCACTTGTAGTTCAGATGGGTTTGA
AGAAGCTATGGAGCGACCCACTCCAAGAAACCCTTTATCAGACCTCATGGATGAGTCAGCTTTTATCACTTCACACTTGTATGAATTCCTTGGGTGTTGTCTTCCCAACA
TTGTGAAAGGGTGCAAATGGGTCTTGCTGTACAGTACGATGAAGCATGGTATATCTCTTCAAACTCTTATTCGCAACAGCCACAATCTTCCTGGCCCATGTTTACTGATT
GTTGGAGATACTCGAGGTGCTATATTTGGTGGTCTTCTAGAATGCCCGTTGAAGCCTACAGCCAAAAGAAAATATCAAGGAACTCACCAGACATTTGTTTTTACAACGAA
ATATGGTGACCCAAGGCTTTTTCGAGCAACTGGAGCCAACCACTATTATTATATTTGTTTGAACGATTTACTGGCACTTGGAGGCGGTGGTAGCTTTGCCTTATGTTTGG
ATGGTGACTTATTAAGTGGAACTAGTGGACCGTGTGACACATTTGGTAGCTTATGCTTGGCGCATGACCCAGAGTTTGAGCTAAAAAATGTCGAGACGACCGGCATCCAT
TTCATTCAGATTCTCATTAACAATGCAGCCATTTATTACTTACACCACTTGCCAATAGCAGATGGACGGGTAAGAACTCTCTCCTCTCTTGATCATATTGAACTAATATC
TGCTGTCTTTTATTTATTGATTCTGTTGTACTGTATGCATGCTAATAGAAATCCTGAACACAAATTAAGAAGAATGGACCCCCGCCCCCCCAATCAGGAAAAAGAGAGAG
AAACAGGGGAGACAGTTGGATATGACAAGCTTGCAGTTATGGGAAAAGTAGCTGCAAGTTTGTCTGAATTAGAAACGATTGGTCGCTCCTTTAGAGGGAATTTTGTTGCC
AAATGTGGCAAATGCTTGAAGATCTACTCATTAATAACAATATTAGAGCTTGGTTTCGAAGTTATGTTGCAGGACGTCATGAGCTCTGCGTCAGACCAAATGCTTTCCAT
TGATGAAATCTCTAGTCCGATCAATGCGCAAATATTTGATTTCTGTGACCCCGAGCTGTTCGCGGAGACGCTTCAGAATTCTGAGTTCAATTCTTGCTCAAATTGTTGTT
ACGACAAGAATTCGCCATATGCTACAAATCTGTCTAATTCCCCAGATCAAACAGATAACAATGGCAATGGCAATGGCAATAGCAATACTGTTGCCGGTGCTGCATCGTTT
ATACCTGCTAACGATGCATCAGCTGCAACTAACATAACGACCAACAGTGCTAGTAATCTGACTGCTATCTTTGATTCCCAAGAAGAACTTGACAACGATATCTCTGCTTC
CATAGACTTCTCTCCATCGGCTTCGTTTTCGATCCCTCAATATCTCACTATCCAGTCGGGGCAGTTCGATGTTTCTCAATTGCAATCTCAAATGCCATTGGTAGATCCCA
TGATTGAAGGGCTTGTGCAGTGTCCTATGGCTCCAGTTGGAGCTCTCATTGACGAAGATCTACCGTCGATTTACGTCGATGATTGCTTATCTTCCTTGACTTCCTACATG
CCACTGAATCCTGCTTCCCCTTCATGCTCGTTTGTCGGAACTACCATGGCAACTTACCTGCCTACTACAACAATGAATCCTGCTACATCGACTGTTGAAAGTTGTGGAAT
GTTTTCTCTCCTCGGCCATGAATTGCAAGACCTTGACTATCAAGGAGACAACTGTGGACTCTACAGCCAAGACTGTATGCAGGGGACTTTCAATCCAGCAGATCTTCAGG
TGCTTAACAATGAAAATCTACAACTGGCTGCTGGGGCAATGAACTGCACTTCTTTAGCTTCCGATCTCTCAAGCTTAAAGGACAGTACTTTCAAAGTAGGAAAACTCTCC
ACGGAAGAGAGAAAGGAGAAGATTCATAGGTACATGAAGAAAAGAAATGAGAGAAACTTCAGCAAGAAAATTAAGTATGCCTGCCGAAAAACACTAGCAGATAGCCGACC
ACGGGTTCGAGGACGGTTCGCAAAGAACGACGAATTGGCAGAGAATCACAGGGCTGCTTGTAGCAACCATGAAGGAGAAGAAGAAGAAGTAGTTGTGAAGGAAGAAGATA
GCATGGTTGATTCCTCAGATATCTTTGCTCATATCAGTGGAGTGAACTCCTTCAAGTGCAACTATCCAATTCAGTCTTGGACTTGA
Protein sequence
MQAMKEKVSERFSSLFSNSTSSESSKPPPPDPRTQARPKSKGRKSLSSYLSLIIPSIHGSKPSASRQDTDAVQSPSVRYCDANNDFQEEGSDTSLGCSIPFKTEEIPRNQ
GENKDCGSAYDEEKLNKLRDEYDSACRKSTCSSDGFEEAMERPTPRNPLSDLMDESAFITSHLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLPGPCLLI
VGDTRGAIFGGLLECPLKPTAKRKYQGTHQTFVFTTKYGDPRLFRATGANHYYYICLNDLLALGGGGSFALCLDGDLLSGTSGPCDTFGSLCLAHDPEFELKNVETTGIH
FIQILINNAAIYYLHHLPIADGRVRTLSSLDHIELISAVFYLLILLYCMHANRNPEHKLRRMDPRPPNQEKERETGETVGYDKLAVMGKVAASLSELETIGRSFRGNFVA
KCGKCLKIYSLITILELGFEVMLQDVMSSASDQMLSIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDKNSPYATNLSNSPDQTDNNGNGNGNSNTVAGAASF
IPANDASAATNITTNSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQLQSQMPLVDPMIEGLVQCPMAPVGALIDEDLPSIYVDDCLSSLTSYM
PLNPASPSCSFVGTTMATYLPTTTMNPATSTVESCGMFSLLGHELQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLAAGAMNCTSLASDLSSLKDSTFKVGKLS
TEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWT
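
The protein sequence should correspond to a codon-by-codon translation of the CDS. The sketch below illustrates that relationship with the standard genetic code (NCBI translation table 1), assuming that is the table that applies to this nuclear gene. It translates only the first ten and last four codons of the CDS, copied from the sequence above; `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGCAGGCTATGAAGGAGAAAGTCTCTGAG"))  # MQAMKEKVSE
print(translate("TCTTGGACTTGA"))                    # SWT
```

Both fragments agree with the start (`MQAMKEKVSE`) and end (`SWT`) of the protein sequence. To check the full record, the wrapped CDS lines would need to be joined into one string before calling `translate`.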