; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G011970 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G011970
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionCCT domain-containing protein
Genome locationchr09:19217158..19220175
RNA-Seq ExpressionLsi09G011970
SyntenyLsi09G011970
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR010402 - CCT domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147186.1 uncharacterized protein LOC101214336 isoform X3 [Cucumis sativus]3.5e-21194.17Show/hide
Query:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDN--NGNGNGHTVAAAASFIPANDASAATN
        MLQDV+ SAS+QML IDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYD NSPYATNLSNSPDQTDN  NGNGNG+TVA AASFIP NDASAATN
Subjt:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDN--NGNGNGHTVAAAASFIPANDASAATN

Query:  LTTNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMP
        +TTNS SNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDE+LPSIYVDDCLSSLTSYMP
Subjt:  LTTNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMP

Query:  LNPSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSL
        LNP+SPSCSFVG TMA+YLPTTSMNPATSTVESCGMFSLLG +L  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNN+NLQLAAGAMNCTSLASDLSSL
Subjt:  LNPSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSL

Query:  KDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGV
        KDSTFKVGKLS EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVK+EDSMVDSSDIFAHISGV
Subjt:  KDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGV

Query:  NSFKCNYPIQSW
        NSFKCNYPIQSW
Subjt:  NSFKCNYPIQSW

XP_038875443.1 uncharacterized protein LOC120067896 isoform X1 [Benincasa hispida]7.9e-21195.96Show/hide
Query:  IDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQE
        +DEISSPINAQIFDFCDPELFAETLQ+SEFNSCSNCCYD NSPY TNLSNSPDQTDNNGN NG+TVAAAASF+PANDASAATN+TTNSTSNLTAIFDSQE
Subjt:  IDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQE

Query:  ELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMAS
        ELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPL+DPMIEGLVQCPMAPVGTLIDE+LPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMA+
Subjt:  ELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMAS

Query:  YLPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKE
        YLPTTSM PATSTVESCGMFSLLGAELQPQDLDYQGDNCGLY+QDCMQGTFNPADLQVLNN+NLQL AGAMNCTSLASDLSSLKDSTFKVGKLSMEERKE
Subjt:  YLPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKE

Query:  KIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        KIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVK+EDSMVDSSDIFAHISGVNSFKCNYPIQSWI
Subjt:  KIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

XP_038875444.1 uncharacterized protein LOC120067896 isoform X2 [Benincasa hispida]1.3e-21096.2Show/hide
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQEE
        DEISSPINAQIFDFCDPELFAETLQ+SEFNSCSNCCYD NSPY TNLSNSPDQTDNNGN NG+TVAAAASF+PANDASAATN+TTNSTSNLTAIFDSQEE
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQEE

Query:  LDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMASY
        LDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPL+DPMIEGLVQCPMAPVGTLIDE+LPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMA+Y
Subjt:  LDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMASY

Query:  LPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK
        LPTTSM PATSTVESCGMFSLLGAELQPQDLDYQGDNCGLY+QDCMQGTFNPADLQVLNN+NLQL AGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK
Subjt:  LPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK

Query:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVK+EDSMVDSSDIFAHISGVNSFKCNYPIQSWI
Subjt:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

XP_038875445.1 uncharacterized protein LOC120067896 isoform X3 [Benincasa hispida]9.6e-21795.62Show/hide
Query:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLT
        MLQDVV SA EQML IDEISSPINAQIFDFCDPELFAETLQ+SEFNSCSNCCYD NSPY TNLSNSPDQTDNNGN NG+TVAAAASF+PANDASAATN+T
Subjt:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLT

Query:  TNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLN
        TNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPL+DPMIEGLVQCPMAPVGTLIDE+LPSIYVDDCLSSLTSYMPLN
Subjt:  TNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLN

Query:  PSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKD
        PSSPSCSFVGATMA+YLPTTSM PATSTVESCGMFSLLGAELQPQDLDYQGDNCGLY+QDCMQGTFNPADLQVLNN+NLQL AGAMNCTSLASDLSSLKD
Subjt:  PSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKD

Query:  STFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNS
        STFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVK+EDSMVDSSDIFAHISGVNS
Subjt:  STFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNS

Query:  FKCNYPIQSWI
        FKCNYPIQSWI
Subjt:  FKCNYPIQSWI

XP_038875446.1 uncharacterized protein LOC120067896 isoform X4 [Benincasa hispida]1.3e-21096.2Show/hide
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQEE
        DEISSPINAQIFDFCDPELFAETLQ+SEFNSCSNCCYD NSPY TNLSNSPDQTDNNGN NG+TVAAAASF+PANDASAATN+TTNSTSNLTAIFDSQEE
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQEE

Query:  LDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMASY
        LDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPL+DPMIEGLVQCPMAPVGTLIDE+LPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMA+Y
Subjt:  LDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMASY

Query:  LPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK
        LPTTSM PATSTVESCGMFSLLGAELQPQDLDYQGDNCGLY+QDCMQGTFNPADLQVLNN+NLQL AGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK
Subjt:  LPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK

Query:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVK+EDSMVDSSDIFAHISGVNSFKCNYPIQSWI
Subjt:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

TrEMBL top hitse value%identityAlignment
A0A0A0LKJ8 CCT domain-containing protein1.7e-21194.17Show/hide
Query:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDN--NGNGNGHTVAAAASFIPANDASAATN
        MLQDV+ SAS+QML IDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYD NSPYATNLSNSPDQTDN  NGNGNG+TVA AASFIP NDASAATN
Subjt:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDN--NGNGNGHTVAAAASFIPANDASAATN

Query:  LTTNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMP
        +TTNS SNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDE+LPSIYVDDCLSSLTSYMP
Subjt:  LTTNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMP

Query:  LNPSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSL
        LNP+SPSCSFVG TMA+YLPTTSMNPATSTVESCGMFSLLG +L  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNN+NLQLAAGAMNCTSLASDLSSL
Subjt:  LNPSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSL

Query:  KDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGV
        KDSTFKVGKLS EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVK+EDSMVDSSDIFAHISGV
Subjt:  KDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGV

Query:  NSFKCNYPIQSW
        NSFKCNYPIQSW
Subjt:  NSFKCNYPIQSW

A0A1S3CCK1 uncharacterized protein LOC1034994544.2e-21095.69Show/hide
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQEE
        DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYD NSPYATNLSNSPDQTDNNGNGNG+TVA AASFIPANDASAATN+TTNSTSNL+AIFDSQEE
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQEE

Query:  LDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMASY
        LDNDISASI+FSPSASFS+PQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDE+LPSIYVDDCLSSLTSYMP+NP+SPSCSFVGA+MA+Y
Subjt:  LDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMASY

Query:  LPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK
        LPTTSMNPATSTVESCGMFSLLG ELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNN+NLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK
Subjt:  LPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK

Query:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSW
        IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVK+EDSM+DSSDIFAHISGVNSFKCNYPIQSW
Subjt:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSW

A0A5A7SLG3 CCT domain-containing protein9.7e-20791.5Show/hide
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQEE
        DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYD NSPYATNLSNSPDQTDNNGNGNG+TVA AASFIPANDASAATN+TTNSTSNL+AIFDSQEE
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQEE

Query:  LDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMASY
        LDNDISASI+FSPSASFS+PQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDE+LPSIYVDDCLSSLTSYMP+NP+SPSCSFVGA+MA+Y
Subjt:  LDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMASY

Query:  LPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK
        LPTTSMNPATSTVESCGMFSLLG ELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNN+NLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK
Subjt:  LPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEK

Query:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE------------------VVVKKEDSMVDSSDIFAHISGV
        IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE                  VVVK+EDSM+DSSDIFAHISGV
Subjt:  IHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE------------------VVVKKEDSMVDSSDIFAHISGV

Query:  NSFKCNYPIQSW
        NSFKCNYPIQSW
Subjt:  NSFKCNYPIQSW

A0A6J1KWS5 uncharacterized protein LOC111498929 isoform X55.0e-17983.29Show/hide
Query:  MLQDVV----HSASEQMLFIDEISSPINAQIFDFCDPELFAETLQNS-EFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASA
        ML +V+     S SEQML  +EISSPINAQI+DFCD ELF+E LQNS EFNS SNC YDNNS YATNL +SPDQ DNNGN NG+TV AA SF+PANDASA
Subjt:  MLQDVV----HSASEQMLFIDEISSPINAQIFDFCDPELFAETLQNS-EFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASA

Query:  ATNLTTNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTS
         TN+TTN  SNLT IFD QEELDNDISASIDFSPS SFSI QYLTIQSGQFDVSQVQSQMPL+DPMI+GL+QCPMAP GT IDE+LPSIYVDDCLSS TS
Subjt:  ATNLTTNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTS

Query:  YMPLNPSSPSCSFVGATMASYLPTTSMN--PATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQ-VLNNDNLQLAAGAMNCTSLA
        YMPLNPSSPSCSFVGATM +YLPT  MN   ++S+VE+CGMF LL AELQPQDLDYQGDNCGLYSQD MQGTFNPADLQ VL+++NLQLAAGAMNCTSLA
Subjt:  YMPLNPSSPSCSFVGATMASYLPTTSMN--PATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQ-VLNNDNLQLAAGAMNCTSLA

Query:  SDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIF
        SDLSSLKDSTFKVGKLS+EERK+KIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDEL ENHRAACS HEG EEEEVVVK+EDSMVDSSDIF
Subjt:  SDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIF

Query:  AHISGVNSFKCNYPIQSWI
        AHISGVNS K +YPIQSWI
Subjt:  AHISGVNSFKCNYPIQSWI

B0F827 Zinc finger-like protein2.4e-20594.95Show/hide
Query:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDN--NGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQ
        DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYD NSPYATNLSNSPDQTDN  NGNGNG+TVA AASFIP NDASAATN+TTNS SNLTAIFDSQ
Subjt:  DEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDN--NGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAIFDSQ

Query:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMA
        EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVG LIDE+LPSIYVDDCLSSLTSYMPLNP+SPSCSFVG TMA
Subjt:  EELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMA

Query:  SYLPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERK
        +YLPTTSMNPATSTVESCGMFSLLG +L  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNN+NLQLAAGAMNCTSLASDLSSLKDSTFKVGKLS EERK
Subjt:  SYLPTTSMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERK

Query:  EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSW
        EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVK+EDSMVDSSDIFAHISGVNSFKCNYPIQSW
Subjt:  EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSW

SwissProt top hitse value%identityAlignment
E5RQA1 Transcription factor GHD72.5e-0548.53Show/hide
Query:  SLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE
        ++A D  SL  +T  VG  +M ER+ K+ RY +KR +R + K+I+YA RK  A+ RPRVRGRFAK  +
Subjt:  SLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE

O82117 Zinc finger protein CO34.2e-0550Show/hide
Query:  ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE
        +R+ ++HRY +KR  R F K I+YA RK  A++RPR++GRFAK  +
Subjt:  ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE

Q940T9 Zinc finger protein CONSTANS-LIKE 41.2e-0440.3Show/hide
Query:  SSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENH
        S     T +   L+  ER+ ++ RY +KR  R F K I+YA RK  A+ RPR++GRFAK  +  E++
Subjt:  SSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENH

Q9FHH8 Zinc finger protein CONSTANS-LIKE 55.5e-0543.94Show/hide
Query:  SMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAK-----NDELAENHRAACSNH
        S  +R+ ++ RY +KR  R F K I+YA RK  A+SRPR++GRFAK     ND++  +H  A + H
Subjt:  SMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAK-----NDELAENHRAACSNH

Q9SK53 Zinc finger protein CONSTANS-LIKE 32.5e-0549.09Show/hide
Query:  KLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAEN
        +LS  ER+ ++ RY +KR  R F K I+YA RK  A+ RPR++GRFAK  +  EN
Subjt:  KLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAEN

Arabidopsis top hitse value%identityAlignment
AT1G04500.1 CCT motif family protein1.9e-7746.51Show/hide
Query:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSN-CCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATN
        M QDV+  +S + L +DEI+SP+ AQIFDFCD +LF ET  Q SE  S SN C Y  N+    N +N PD++ N+G+   H                   
Subjt:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSN-CCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATN

Query:  LTTNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELP---SIYVDDCLSSLTS
           N  ++L+ IFDSQ++ DNDI+ASIDFS S  F     L     QFD + +Q   P            P     +   + LP   S++ +DCLSS+ S
Subjt:  LTTNSTSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELP---SIYVDDCLSSLTS

Query:  YM--PLNPSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFS---LLGAELQP---QDLDYQGDNCGLYSQDCMQGTFNPAD-----LQVLNNDNLQLA
        Y    +NPSSPSCSF+G T      T + N   + + S G +S    LG++ +P   Q ++ Q DN GL+  D ++  FNP D     L  + N N  +A
Subjt:  YM--PLNPSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFS---LLGAELQP---QDLDYQGDNCGLYSQDCMQGTFNPAD-----LQVLNNDNLQLA

Query:  AGAMNCTSLASDLSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKK
           +    L ++++ L D +F KVGKLS E+RKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE  E +R ACS+H  +++++V VK+
Subjt:  AGAMNCTSLASDLSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKK

Query:  EDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        E+ +VDSSDIF+HISGVNSFKCNYPIQSWI
Subjt:  EDSMVDSSDIFAHISGVNSFKCNYPIQSWI

AT1G63820.1 CCT motif family protein3.4e-1846.09Show/hide
Query:  YSQDCMQGTFNPADLQVLNND-------NLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVR
        +  D M+  ++  DLQ L  D       +  LAA +   T  + D  SL     +VG+ S EERKEKI +Y  KR +RNF+K IKYACRKTLAD+RPRVR
Subjt:  YSQDCMQGTFNPADLQVLNND-------NLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVR

Query:  GRFAKNDELAENHRAACSNHEGEEEEEV
        GRFA+NDE+ EN + A S    E ++++
Subjt:  GRFAKNDELAENHRAACSNHEGEEEEEV

AT2G33350.1 CCT motif family protein4.3e-7445.29Show/hide
Query:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNL
        MLQD++ S S   L ID+I+SP++AQIFDFCDP+LF ET  Q+SE  S SN    + S ++   + +  +  NN N N +T             +   + 
Subjt:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNL

Query:  TTNSTSNLTAIFDSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGQFDVS---QVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPS-IYVDDCLSS
          N+ ++L+ IFDSQE+ +NDI+ASIDFS S+  + +  +L   I   QFD S   QV  Q P +    + L    ++ + +L    L S ++ +DCLSS
Subjt:  TTNSTSNLTAIFDSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGQFDVS---QVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPS-IYVDDCLSS

Query:  LTSY-MPLNPSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFSLLGAEL-QPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNC
        + SY + LN   PSCSF  ++      +T +  A S +        +G+E+ +P D  +D+Q DN G +  D ++  FNP DLQ        L  GA N 
Subjt:  LTSY-MPLNPSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFSLLGAEL-QPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNC

Query:  TSLAS----------DLSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAE-NHRAACSNHEGEEEEE
        + L +          D++ L+DST  KVGKLS E+RKEKI RYMKKRNERNF+KKIKYACRKTLADSRPRVRGRFAKNDE  E N +A  S+H+ E+E++
Subjt:  TSLAS----------DLSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAE-NHRAACSNHEGEEEEE

Query:  VVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        + VK E+ +VDSSDIFAHISG NSFKCNYPIQSWI
Subjt:  VVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

AT2G33350.2 CCT motif family protein2.5e-7445.29Show/hide
Query:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNL
        MLQD++ S S   L ID+I+SP++AQIFDFCDP+LF ET  Q+SE  S SN    + S ++   + +  +  NN N N +T             +   + 
Subjt:  MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETL-QNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNL

Query:  TTNSTSNLTAIFDSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGQFDVS---QVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPS-IYVDDCLSS
          N+ ++L+ IFDSQE+ +NDI+ASIDFS S+  + +  +L   I   QFD S   QV  Q P +    + L    ++ + +L    L S ++ +DCLSS
Subjt:  TTNSTSNLTAIFDSQEELDNDISASIDFSPSA-SFSIPQYL--TIQSGQFDVS---QVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPS-IYVDDCLSS

Query:  LTSY-MPLNPSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFSLLGAEL-QPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNC
        + SY + LN   PSCSF  ++      +T +  A S +        +G+E+ +P D  +D+Q DN G +  D ++  FNP DLQ        L  GA N 
Subjt:  LTSY-MPLNPSSPSCSFVGATMASYLPTTSMNPATSTVESCGMFSLLGAEL-QPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNC

Query:  TSLAS----------DLSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAE-NHRAACSNHEGEEEEE
        + L +          D++ L+DST  KVGKLS E+RKEKI RYMKKRNERNF+KKIKYACRKTLADSRPRVRGRFAKNDE  E N +A  S+H+ E+E++
Subjt:  TSLAS----------DLSSLKDSTF-KVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAE-NHRAACSNHEGEEEEE

Query:  VVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        + VK E+ +VDSSDIFAHISG NSFKCNYPIQSWI
Subjt:  VVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

AT5G41380.1 CCT motif family protein4.4e-1845.04Show/hide
Query:  MQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAEN
        M+  ++  DLQ    +N+++   + N T   S+     +  FKVG+ S EERKEKI +Y  KRN+RNF+K IKYACRKTLADSRPR+RGRFA+NDE+ E 
Subjt:  MQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAEN

Query:  HRAACSNHEGEEEE----EVVVKKEDSMVDS
              N E ++ E    + + +KE++ V S
Subjt:  HRAACSNHEGEEEE----EVVVKKEDSMVDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCAAGACGTCGTCCACTCCGCGTCGGAGCAAATGCTTTTCATTGATGAAATCTCGAGTCCGATCAATGCGCAAATCTTTGATTTCTGTGATCCCGAGCTGTTCGC
GGAGACGCTTCAGAATTCCGAGTTCAATTCTTGCTCGAATTGTTGTTATGACAACAATTCGCCATATGCTACAAATCTGTCTAATTCCCCGGATCAAACAGATAACAATG
GCAATGGGAATGGCCATACCGTCGCTGCTGCTGCATCGTTTATACCTGCTAACGACGCATCCGCTGCAACTAACTTAACGACCAACAGTACTAGTAATCTGACTGCTATC
TTTGATTCCCAAGAAGAACTTGACAATGATATCTCTGCTTCCATAGACTTCTCTCCATCGGCTTCGTTTTCGATCCCTCAATATCTCACCATCCAGTCAGGGCAGTTCGA
CGTTTCTCAAGTGCAGTCTCAAATGCCATTAGTAGATCCCATGATTGAGGGGCTTGTCCAGTGTCCTATGGCTCCAGTTGGAACTCTCATCGACGAAGAACTACCATCAA
TTTACGTCGACGATTGCTTGTCTTCCTTGACTTCCTACATGCCACTCAACCCTTCTTCCCCCTCGTGCTCGTTTGTCGGAGCAACCATGGCAAGTTACCTGCCTACTACA
TCAATGAATCCTGCTACATCGACTGTCGAAAGTTGCGGAATGTTTTCTCTCCTTGGCGCAGAATTGCAACCGCAAGACCTTGACTATCAAGGAGACAACTGTGGACTCTA
CAGCCAAGACTGTATGCAGGGGACTTTCAATCCAGCAGACCTTCAGGTGCTTAACAATGATAATCTACAACTGGCTGCTGGGGCAATGAACTGCACTTCTTTAGCATCAG
ATCTCTCAAGCTTAAAGGACAGTACTTTCAAAGTAGGAAAACTCTCCATGGAAGAGAGAAAGGAGAAGATTCATAGGTACATGAAGAAAAGAAATGAGAGAAACTTCAGC
AAGAAAATCAAGTATGCTTGCCGAAAAACGCTAGCGGATAGCCGGCCACGTGTTCGGGGACGGTTCGCAAAGAACGACGAATTAGCAGAGAATCACAGAGCTGCTTGTAG
CAACCATGAAGGAGAAGAAGAAGAAGAAGTAGTTGTGAAGAAAGAAGATAGCATGGTTGATTCCTCAGATATCTTTGCGCATATCAGTGGAGTGAACTCTTTCAAGTGCA
ACTATCCAATCCAGTCTTGGATTTGA
mRNA sequenceShow/hide mRNA sequence
TGTCATTGCTTCTTCCATTTCACTCAAACTTCTTCTTTGCTTCACCAAAATCCATTCTTCTTCCATTCTCTTTCATTTTAAAACAACCCCTTTTGATTTTTCTTCTTAAT
CTTTAGAGTTATGTTGCAAGACGTCGTCCACTCCGCGTCGGAGCAAATGCTTTTCATTGATGAAATCTCGAGTCCGATCAATGCGCAAATCTTTGATTTCTGTGATCCCG
AGCTGTTCGCGGAGACGCTTCAGAATTCCGAGTTCAATTCTTGCTCGAATTGTTGTTATGACAACAATTCGCCATATGCTACAAATCTGTCTAATTCCCCGGATCAAACA
GATAACAATGGCAATGGGAATGGCCATACCGTCGCTGCTGCTGCATCGTTTATACCTGCTAACGACGCATCCGCTGCAACTAACTTAACGACCAACAGTACTAGTAATCT
GACTGCTATCTTTGATTCCCAAGAAGAACTTGACAATGATATCTCTGCTTCCATAGACTTCTCTCCATCGGCTTCGTTTTCGATCCCTCAATATCTCACCATCCAGTCAG
GGCAGTTCGACGTTTCTCAAGTGCAGTCTCAAATGCCATTAGTAGATCCCATGATTGAGGGGCTTGTCCAGTGTCCTATGGCTCCAGTTGGAACTCTCATCGACGAAGAA
CTACCATCAATTTACGTCGACGATTGCTTGTCTTCCTTGACTTCCTACATGCCACTCAACCCTTCTTCCCCCTCGTGCTCGTTTGTCGGAGCAACCATGGCAAGTTACCT
GCCTACTACATCAATGAATCCTGCTACATCGACTGTCGAAAGTTGCGGAATGTTTTCTCTCCTTGGCGCAGAATTGCAACCGCAAGACCTTGACTATCAAGGAGACAACT
GTGGACTCTACAGCCAAGACTGTATGCAGGGGACTTTCAATCCAGCAGACCTTCAGGTGCTTAACAATGATAATCTACAACTGGCTGCTGGGGCAATGAACTGCACTTCT
TTAGCATCAGATCTCTCAAGCTTAAAGGACAGTACTTTCAAAGTAGGAAAACTCTCCATGGAAGAGAGAAAGGAGAAGATTCATAGGTACATGAAGAAAAGAAATGAGAG
AAACTTCAGCAAGAAAATCAAGTATGCTTGCCGAAAAACGCTAGCGGATAGCCGGCCACGTGTTCGGGGACGGTTCGCAAAGAACGACGAATTAGCAGAGAATCACAGAG
CTGCTTGTAGCAACCATGAAGGAGAAGAAGAAGAAGAAGTAGTTGTGAAGAAAGAAGATAGCATGGTTGATTCCTCAGATATCTTTGCGCATATCAGTGGAGTGAACTCT
TTCAAGTGCAACTATCCAATCCAGTCTTGGATTTGAATTTTTTGATGTTGTTTATTAAAAAAATAAATCACAAAAAAGAAAAAGAAAAAAAAAAGAAAAGAAAAAGAGAG
AGGAAAAAACTACAATTTTGCAGGACCCAAATACAGATATGAACATAAGAATAATAATTATGATAATTAGTAGAAAGAGATAGTGATAAAGGGTCATAAAATAGAAATAA
GTTTTGAGTCAGGCCACCTAGCAATCAAACTTTAGTAGTTTTCAATTTGTAGATATCTCATCAACAAATTCAAAATCCTTAATCTGTACATGTTAGGGGTTTTTGCCCTC
AAAGTTTCAATAATTTAATGTATAATTGTTCAGTTTTAATTATAAGATTGTTACTACAATTCTCTCCTCTTC
Protein sequenceShow/hide protein sequence
MLQDVVHSASEQMLFIDEISSPINAQIFDFCDPELFAETLQNSEFNSCSNCCYDNNSPYATNLSNSPDQTDNNGNGNGHTVAAAASFIPANDASAATNLTTNSTSNLTAI
FDSQEELDNDISASIDFSPSASFSIPQYLTIQSGQFDVSQVQSQMPLVDPMIEGLVQCPMAPVGTLIDEELPSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMASYLPTT
SMNPATSTVESCGMFSLLGAELQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNDNLQLAAGAMNCTSLASDLSSLKDSTFKVGKLSMEERKEKIHRYMKKRNERNFS
KKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKKEDSMVDSSDIFAHISGVNSFKCNYPIQSWI