; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC07g0562 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC07g0562
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionCCT domain-containing protein
Genome locationMC07:13674195..13687575
RNA-Seq ExpressionMC07g0562
SyntenyMC07g0562
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR006571 - TLDc domain
IPR010402 - CCT domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138731.1 uncharacterized protein LOC111009824 [Momordica charantia]1.14e-261100Show/hide
Query:  EEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNSSDHSPETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASIDFSPSASFS
        EEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNSSDHSPETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASIDFSPSASFS
Subjt:  EEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNSSDHSPETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASIDFSPSASFS

Query:  IPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGATMATYLPASAASVESCGGIFPLMGS
        IPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGATMATYLPASAASVESCGGIFPLMGS
Subjt:  IPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGATMATYLPASAASVESCGGIFPLMGS

Query:  ELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKT
        ELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKT
Subjt:  ELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKT

Query:  LADSRPRVRGRFAKNDELTENHRTASSNHEGDEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        LADSRPRVRGRFAKNDELTENHRTASSNHEGDEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
Subjt:  LADSRPRVRGRFAKNDELTENHRTASSNHEGDEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

XP_038875443.1 uncharacterized protein LOC120067896 isoform X1 [Benincasa hispida]2.78e-20378.61Show/hide
Query:  GFQEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDS
        G  +E+SSPIN Q+FDFCDPELF+ETLQ+ EFNS SNCCY+KNS      S+   +T+N+        A  A F+PA DASAATN      SNLTAIFDS
Subjt:  GFQEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDS

Query:  QEELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVG
        QEELDNDISASIDFSPSASFSIPQYLTIQS GQFDV Q+Q+QMPL+DP  ++EGLVQCPMAPV     EDLPSIYVDDCLSSLTS +PLNPSSPSCSFVG
Subjt:  QEELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVG

Query:  ATMATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLS
        ATMATYLP ++     ++VESCG +F L+G+ELQPQDLD+QGDNCGLY+QDC+QGTFNPADLQVLNNE LQL  GAMNCTSLASDLSSLKDSTFKVGKLS
Subjt:  ATMATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLS

Query:  VEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQS
        +EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDEL ENHR A SNHEG+EEE VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQS
Subjt:  VEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQS

Query:  WI
        WI
Subjt:  WI

XP_038875444.1 uncharacterized protein LOC120067896 isoform X2 [Benincasa hispida]4.66e-20378.75Show/hide
Query:  QEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDSQE
        ++E+SSPIN Q+FDFCDPELF+ETLQ+ EFNS SNCCY+KNS      S+   +T+N+        A  A F+PA DASAATN      SNLTAIFDSQE
Subjt:  QEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDSQE

Query:  ELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGAT
        ELDNDISASIDFSPSASFSIPQYLTIQS GQFDV Q+Q+QMPL+DP  ++EGLVQCPMAPV     EDLPSIYVDDCLSSLTS +PLNPSSPSCSFVGAT
Subjt:  ELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGAT

Query:  MATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVE
        MATYLP ++     ++VESCG +F L+G+ELQPQDLD+QGDNCGLY+QDC+QGTFNPADLQVLNNE LQL  GAMNCTSLASDLSSLKDSTFKVGKLS+E
Subjt:  MATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVE

Query:  ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDEL ENHR A SNHEG+EEE VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
Subjt:  ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

XP_038875445.1 uncharacterized protein LOC120067896 isoform X3 [Benincasa hispida]8.40e-20378.95Show/hide
Query:  EEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDSQEE
        +E+SSPIN Q+FDFCDPELF+ETLQ+ EFNS SNCCY+KNS      S+   +T+N+        A  A F+PA DASAATN      SNLTAIFDSQEE
Subjt:  EEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDSQEE

Query:  LDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGATM
        LDNDISASIDFSPSASFSIPQYLTIQS GQFDV Q+Q+QMPL+DP  ++EGLVQCPMAPV     EDLPSIYVDDCLSSLTS +PLNPSSPSCSFVGATM
Subjt:  LDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGATM

Query:  ATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVEE
        ATYLP ++     ++VESCG +F L+G+ELQPQDLD+QGDNCGLY+QDC+QGTFNPADLQVLNNE LQL  GAMNCTSLASDLSSLKDSTFKVGKLS+EE
Subjt:  ATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVEE

Query:  RKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        RKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDEL ENHR A SNHEG+EEE VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
Subjt:  RKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

XP_038875446.1 uncharacterized protein LOC120067896 isoform X4 [Benincasa hispida]3.62e-20378.75Show/hide
Query:  QEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDSQE
        ++E+SSPIN Q+FDFCDPELF+ETLQ+ EFNS SNCCY+KNS      S+   +T+N+        A  A F+PA DASAATN      SNLTAIFDSQE
Subjt:  QEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDSQE

Query:  ELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGAT
        ELDNDISASIDFSPSASFSIPQYLTIQS GQFDV Q+Q+QMPL+DP  ++EGLVQCPMAPV     EDLPSIYVDDCLSSLTS +PLNPSSPSCSFVGAT
Subjt:  ELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGAT

Query:  MATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVE
        MATYLP ++     ++VESCG +F L+G+ELQPQDLD+QGDNCGLY+QDC+QGTFNPADLQVLNNE LQL  GAMNCTSLASDLSSLKDSTFKVGKLS+E
Subjt:  MATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVE

Query:  ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDEL ENHR A SNHEG+EEE VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
Subjt:  ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

TrEMBL top hitse value%identityAlignment
A0A0A0LKJ8 CCT domain-containing protein5.04e-19978.5Show/hide
Query:  EEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH--------IAAGGAVFIPATDASAATN------SNLTAIFDSQ
        +E+SSPIN Q+FDFCDPELF+ETLQN EFNS SNCCY+KNS      S+   +T+N+          AG A FIP  DASAATN      SNLTAIFDSQ
Subjt:  EEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH--------IAAGGAVFIPATDASAATN------SNLTAIFDSQ

Query:  EELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGA
        EELDNDISASIDFSPSASFSIPQYLTIQS GQFDV Q+Q+QMPLVDP  ++EGLVQCPMAPV     EDLPSIYVDDCLSSLTS +PLNP+SPSCSFVG 
Subjt:  EELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGA

Query:  TMATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSV
        TMATYLP ++     ++VESCG +F L+G +LQ  DLD+QGDNCGLYSQDC+QGTFNPADLQVLNNE LQLA GAMNCTSLASDLSSLKDSTFKVGKLS 
Subjt:  TMATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSV

Query:  EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
        EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDEL ENHR A SNHEG+EEE VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
Subjt:  EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

A0A1S3CCK1 uncharacterized protein LOC1034994544.75e-20278.45Show/hide
Query:  QEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDSQE
         +E+SSPIN Q+FDFCDPELF+ETLQN EFNS SNCCY+KNS      S+   +T+N+        AG A FIPA DASAATN      SNL+AIFDSQE
Subjt:  QEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDSQE

Query:  ELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGAT
        ELDNDISASI+FSPSASFS+PQYLTIQS GQFDV Q+Q+QMPLVDP  ++EGLVQCPMAPV     EDLPSIYVDDCLSSLTS +P+NP+SPSCSFVGA+
Subjt:  ELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGAT

Query:  MATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVE
        MATYLP ++     ++VESCG +F L+G ELQPQDLD+QGDNCGLYSQDC+QGTFNPADLQVLNNE LQLA GAMNCTSLASDLSSLKDSTFKVGKLS+E
Subjt:  MATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVE

Query:  ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW
        ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDEL ENHR A SNHEG+EEE VVVKEEDSM+DSSDIFAHISGVNSFKCNYPIQSW
Subjt:  ERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSW

A0A5A7SLG3 CCT domain-containing protein7.90e-20074.94Show/hide
Query:  GFQEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDS
        G  +E+SSPIN Q+FDFCDPELF+ETLQN EFNS SNCCY+KNS      S+   +T+N+        AG A FIPA DASAATN      SNL+AIFDS
Subjt:  GFQEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH------IAAGGAVFIPATDASAATN------SNLTAIFDS

Query:  QEELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVG
        QEELDNDISASI+FSPSASFS+PQYLTIQS GQFDV Q+Q+QMPLVDP  ++EGLVQCPMAPV     EDLPSIYVDDCLSSLTS +P+NP+SPSCSFVG
Subjt:  QEELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVG

Query:  ATMATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLS
        A+MATYLP ++     ++VESCG +F L+G ELQPQDLD+QGDNCGLYSQDC+QGTFNPADLQVLNNE LQLA GAMNCTSLASDLSSLKDSTFKVGKLS
Subjt:  ATMATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLS

Query:  VEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-------------------VVVKEEDSMVDSSDI
        +EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDEL ENHR A SNHEG+EEE                   VVVKEEDSM+DSSDI
Subjt:  VEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-------------------VVVKEEDSMVDSSDI

Query:  FAHISGVNSFKCNYPIQSW
        FAHISGVNSFKCNYPIQSW
Subjt:  FAHISGVNSFKCNYPIQSW

A0A6J1CAJ8 uncharacterized protein LOC1110098245.50e-262100Show/hide
Query:  EEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNSSDHSPETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASIDFSPSASFS
        EEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNSSDHSPETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASIDFSPSASFS
Subjt:  EEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNSSDHSPETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASIDFSPSASFS

Query:  IPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGATMATYLPASAASVESCGGIFPLMGS
        IPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGATMATYLPASAASVESCGGIFPLMGS
Subjt:  IPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVGATMATYLPASAASVESCGGIFPLMGS

Query:  ELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKT
        ELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKT
Subjt:  ELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKT

Query:  LADSRPRVRGRFAKNDELTENHRTASSNHEGDEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        LADSRPRVRGRFAKNDELTENHRTASSNHEGDEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
Subjt:  LADSRPRVRGRFAKNDELTENHRTASSNHEGDEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

B0F827 Zinc finger-like protein4.75e-19978.3Show/hide
Query:  QEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH--------IAAGGAVFIPATDASAATN------SNLTAIFDS
         +E+SSPIN Q+FDFCDPELF+ETLQN EFNS SNCCY+KNS      S+   +T+N+          AG A FIP  DASAATN      SNLTAIFDS
Subjt:  QEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNS------SDHSPETENH--------IAAGGAVFIPATDASAATN------SNLTAIFDS

Query:  QEELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVG
        QEELDNDISASIDFSPSASFSIPQYLTIQS GQFDV Q+Q+QMPLVDP  ++EGLVQCPMAPV     EDLPSIYVDDCLSSLTS +PLNP+SPSCSFVG
Subjt:  QEELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPV-----EDLPSIYVDDCLSSLTSNLPLNPSSPSCSFVG

Query:  ATMATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLS
         TMATYLP ++     ++VESCG +F L+G +LQ  DLD+QGDNCGLYSQDC+QGTFNPADLQVLNNE LQLA GAMNCTSLASDLSSLKDSTFKVGKLS
Subjt:  ATMATYLPASA-----ASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLS

Query:  VEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQS
         EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDEL ENHR A SNHEG+EEE VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQS
Subjt:  VEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE-VVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQS

Query:  W
        W
Subjt:  W

SwissProt top hitse value%identityAlignment
A8KBE0 Oxidation resistance protein 13.1e-0730.81Show/hide
Query:  QEEGSDTLTERSLVCKTKEVPSNQGENEDCGSAHEKKVKLNIPGVDDDPASGKSTCSSDV-FEEAMERPSPRKPLS---DLTDESCFISEDLYEFLGCCL
        + EGSD        C T +  +   E       H +   LN   +       K T   DV  ++A    + ++P S   +L+D S  +  D  E L   L
Subjt:  QEEGSDTLTERSLVCKTKEVPSNQGENEDCGSAHEKKVKLNIPGVDDDPASGKSTCSSDV-FEEAMERPSPRKPLS---DLTDESCFISEDLYEFLGCCL

Query:  PNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPLEPTP-KRKYQKPLLFLFSPRFRFLYFFSW
        P    G  W L+YSTAKHG+SL+TL R    L  P LL++ D+   IFG L   P + +       +  LF F P F     F W
Subjt:  PNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPLEPTP-KRKYQKPLLFLFSPRFRFLYFFSW

B4F6Q9 Oxidation resistance protein 12.4e-0735.34Show/hide
Query:  ERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPLEPTP-KRKYQKPL
        + P   +P  +L+D S  +  +  E L   LP    G  W L+YSTAKHG+SL+TL R    L  P LL++ D+   IFG L   P + +       +  
Subjt:  ERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPLEPTP-KRKYQKPL

Query:  LFLFSPRFRFLYFFSW
        LF F P F     F W
Subjt:  LFLFSPRFRFLYFFSW

Q6DFV7 Nuclear receptor coactivator 71.3e-0836.43Show/hide
Query:  KSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPL
        KSTCS   +EE  E     + L  L   S  +     E L   LP  V+G  W L YST +HG SL+TL RKS  L  P LL++ D    IFG     P 
Subjt:  KSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPL

Query:  EPTPK-RKYQKPLLFLFSPRFRFLYFFSW
        + +       +  L+ FSP F+    F W
Subjt:  EPTPK-RKYQKPLLFLFSPRFRFLYFFSW

Q8N573 Oxidation resistance protein 14.1e-0731.47Show/hide
Query:  KLNIPGVDDDPASGKSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGD
        +L I   +D  +   +T  +D+  E+  RP       +L+D S  +  D  E L   LP    G  W L+Y T KHG SL+TL R    L  P L+++ D
Subjt:  KLNIPGVDDDPASGKSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGD

Query:  TRGAIFGGLLECPLEPTPK-RKYQKPLLFLFSPRFRFLYFFSW
        + G +FG L   PL+ +       +  +F F P F     F W
Subjt:  TRGAIFGGLLECPLEPTPK-RKYQKPLLFLFSPRFRFLYFFSW

Q8NI08 Nuclear receptor coactivator 76.3e-0835.66Show/hide
Query:  KSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPL
        KSTCS   +E+  E   P      L   S  +     E L   LP  V+G  W L YST +HG SL+TL RKS  L  P LL++ D    IFG     P 
Subjt:  KSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPL

Query:  EPTPK-RKYQKPLLFLFSPRFRFLYFFSW
        + +       +  L+ FSP F+    F W
Subjt:  EPTPK-RKYQKPLLFLFSPRFRFLYFFSW

Arabidopsis top hitse value%identityAlignment
AT1G04500.1 CCT motif family protein5.7e-7347.33Show/hide
Query:  EEMSSPINGQMFDFCDPELFSETL-QNLEFNSGSNCC--YEKNSSDHSPETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASIDFSPSA
        +E++SP+  Q+FDFCD +LF ET  Q  E  S SN C   E N++++ P+  N     G+            N++L+ IFDSQ++ DNDI+ASIDFS S 
Subjt:  EEMSSPINGQMFDFCDPELFSETL-QNLEFNSGSNCC--YEKNSSDHSPETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASIDFSPSA

Query:  SFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTS-NL-PLNPSSPSCSFVGAT-MATYLPASAASVES----
         F        Q   QFD   +Q   P   PN +        + P     S++ +DCLSS+ S NL  +NPSSPSCSF+G T + TY+  +   + +    
Subjt:  SFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTS-NL-PLNPSSPSCSFVGAT-MATYLPASAASVES----

Query:  ---CGGIFPLMGSELQP---QDLDFQGDNCGLYSQDCLQGTFNPAD-----LQVLNNETLQLAGGAMNCTSLASDLSSLKDSTF-KVGKLSVEERKEKIH
            G I   +GS+ +P   Q ++ Q DN GL+  D ++  FNP D     L  + N+   +A   +    L ++++ L D +F KVGKLS E+RKEKIH
Subjt:  ---CGGIFPLMGSELQP---QDLDFQGDNCGLYSQDCLQGTFNPAD-----LQVLNNETLQLAGGAMNCTSLASDLSSLKDSTF-KVGKLSVEERKEKIH

Query:  RYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTE-NHRTASSNHEGDEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI
        RYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE  E N +  SS+HE D+++V VKEE+ +VDSSDIF+HISGVNSFKCNYPIQSWI
Subjt:  RYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTE-NHRTASSNHEGDEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI

AT2G05590.1 TLD-domain containing nucleolar protein1.1e-3440.16Show/hide
Query:  MHSLKEKVFGLFSNSTSSSSSKSSPP----DPRSQARPKSKGRKSLSSYFSLVI-HSIHGSKSASCHHDDNAVQS-PSVQFCDANNDFQEEGSDTLTERS
        MH+LK+KV    SN  + S S+S+ P        +AR  S   KSLSSYFS V+  S +   S  C       +S   ++ C + N   + G+       
Subjt:  MHSLKEKVFGLFSNSTSSSSSKSSPP----DPRSQARPKSKGRKSLSSYFSLVI-HSIHGSKSASCHHDDNAVQS-PSVQFCDANNDFQEEGSDTLTERS

Query:  LVCKTKEVPSNQGENEDCGSAHEKKVKLNIPGVDDDPASGKSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTA
                  + GE++DC      KV+                  +D F+         K + +LT+ S FI+ +L+EFL   LPNIV+GCKWIL+YST 
Subjt:  LVCKTKEVPSNQGENEDCGSAHEKKVKLNIPGVDDDPASGKSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTA

Query:  KHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPLEPTPKRKYQ
        KHGISLRTL+R+S +L GPCLL+ GD +GA+FG LLECPL+PTPKRKYQ
Subjt:  KHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPLEPTPKRKYQ

AT2G05590.2 TLD-domain containing nucleolar protein1.1e-3440.16Show/hide
Query:  MHSLKEKVFGLFSNSTSSSSSKSSPP----DPRSQARPKSKGRKSLSSYFSLVI-HSIHGSKSASCHHDDNAVQS-PSVQFCDANNDFQEEGSDTLTERS
        MH+LK+KV    SN  + S S+S+ P        +AR  S   KSLSSYFS V+  S +   S  C       +S   ++ C + N   + G+       
Subjt:  MHSLKEKVFGLFSNSTSSSSSKSSPP----DPRSQARPKSKGRKSLSSYFSLVI-HSIHGSKSASCHHDDNAVQS-PSVQFCDANNDFQEEGSDTLTERS

Query:  LVCKTKEVPSNQGENEDCGSAHEKKVKLNIPGVDDDPASGKSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTA
                  + GE++DC      KV+                  +D F+         K + +LT+ S FI+ +L+EFL   LPNIV+GCKWIL+YST 
Subjt:  LVCKTKEVPSNQGENEDCGSAHEKKVKLNIPGVDDDPASGKSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTA

Query:  KHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPLEPTPKRKYQ
        KHGISLRTL+R+S +L GPCLL+ GD +GA+FG LLECPL+PTPKRKYQ
Subjt:  KHGISLRTLIRKSHDLSGPCLLIVGDTRGAIFGGLLECPLEPTPKRKYQ

AT2G33350.1 CCT motif family protein1.2e-7047.67Show/hide
Query:  EEMSSPINGQMFDFCDPELFSETL-QNLEFNSGSNCCYEKNSSDHS-------PETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASID
        ++++SP++ Q+FDFCDP+LF ET  Q+ E  S SN   EK+ S HS        E  N+        +   D     N++L+ IFDSQE+ +NDI+ASID
Subjt:  EEMSSPINGQMFDFCDPELFSETL-QNLEFNSGSNCCYEKNSSDHS-------PETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASID

Query:  FSPSASFSIP---QYLTIQSAGQFDV---CQMQAQMPLV--DPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTS-NLPLNPSSPSCSFVGAT-MATYLP
        FS S+S   P     LT  S  QFD     Q+  Q P +    +P+    +   +AP      ++ +DCLSS+ S NL LN   PSCSF  ++ +  Y+ 
Subjt:  FSPSASFSIP---QYLTIQSAGQFDV---CQMQAQMPLV--DPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTS-NLPLNPSSPSCSFVGAT-MATYLP

Query:  ASAASVESCGGIFP---LMGSEL-QPQD--LDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLAS----------DLSSLKDSTF-KV
            S ES  G  P    +GSE+ +P D  +DFQ DN G +  D ++  FNP DLQ        L GGA N + L +          D++ L+DST  KV
Subjt:  ASAASVESCGGIFP---LMGSEL-QPQD--LDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLAS----------DLSSLKDSTF-KV

Query:  GKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE--VVVKEEDSMVDSSDIFAHISGVNSFKCN
        GKLS E+RKEKI RYMKKRNERNF+KKIKYACRKTLADSRPRVRGRFAKNDE  E +R A S+H  DE+E  + VK+E+ +VDSSDIFAHISG NSFKCN
Subjt:  GKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE--VVVKEEDSMVDSSDIFAHISGVNSFKCN

Query:  YPIQSWI
        YPIQSWI
Subjt:  YPIQSWI

AT2G33350.2 CCT motif family protein5.4e-7147.67Show/hide
Query:  EEMSSPINGQMFDFCDPELFSETL-QNLEFNSGSNCCYEKNSSDHS-------PETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASID
        ++++SP++ Q+FDFCDP+LF ET  Q+ E  S SN   EK+ S HS        E  N+        +   D     N++L+ IFDSQE+ +NDI+ASID
Subjt:  EEMSSPINGQMFDFCDPELFSETL-QNLEFNSGSNCCYEKNSSDHS-------PETENHIAAGGAVFIPATDASAATNSNLTAIFDSQEELDNDISASID

Query:  FSPSASFSIP---QYLTIQSAGQFDV---CQMQAQMPLV--DPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTS-NLPLNPSSPSCSFVGAT-MATYLP
        FS S+S   P     LT  S  QFD     Q+  Q P +    +P+    +   +AP      ++ +DCLSS+ S NL LN   PSCSF  ++ +  Y+ 
Subjt:  FSPSASFSIP---QYLTIQSAGQFDV---CQMQAQMPLV--DPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTS-NLPLNPSSPSCSFVGAT-MATYLP

Query:  ASAASVESCGGIFP---LMGSEL-QPQD--LDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLAS----------DLSSLKDSTF-KV
            S ES  G  P    +GSE+ +P D  +DFQ DN G +  D ++  FNP DLQ        L GGA N + L +          D++ L+DST  KV
Subjt:  ASAASVESCGGIFP---LMGSEL-QPQD--LDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLAS----------DLSSLKDSTF-KV

Query:  GKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE--VVVKEEDSMVDSSDIFAHISGVNSFKCN
        GKLS E+RKEKI RYMKKRNERNF+KKIKYACRKTLADSRPRVRGRFAKNDE  E +R A S+H  DE+E  + VK+E+ +VDSSDIFAHISG NSFKCN
Subjt:  GKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEE--VVVKEEDSMVDSSDIFAHISGVNSFKCN

Query:  YPIQSWI
        YPIQSWI
Subjt:  YPIQSWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTCCCTGAAGGAGAAAGTCTTCGGCCTCTTCTCCAATTCAACTAGCTCGTCGTCCTCTAAATCGTCGCCCCCTGATCCTCGTAGTCAGGCCAGGCCAAAATCGAA
AGGAAGAAAATCACTGTCTTCGTATTTTTCGCTGGTAATCCATTCCATACATGGATCTAAATCAGCTAGTTGTCACCATGACGACAATGCAGTTCAATCTCCCTCGGTTC
AATTTTGTGATGCAAACAATGATTTCCAGGAGGAAGGCTCAGATACTCTTACAGAACGTAGTCTAGTATGCAAGACGAAAGAAGTACCTAGTAATCAGGGAGAAAATGAG
GATTGTGGTTCAGCGCATGAGAAAAAGGTAAAACTGAATATACCAGGAGTTGACGATGACCCAGCATCTGGAAAGAGCACTTGTAGTTCAGATGTATTTGAAGAAGCTAT
GGAGCGGCCCTCTCCAAGAAAGCCCTTATCAGACCTCACAGATGAGTCGTGTTTTATTTCTGAAGACTTGTATGAATTCCTGGGATGTTGTCTGCCCAACATTGTGAAAG
GGTGCAAATGGATCCTAATGTACAGTACGGCGAAGCATGGTATATCTCTTCGAACACTTATTCGCAAGAGCCATGATCTTTCTGGCCCTTGTTTACTGATTGTTGGAGAT
ACACGAGGTGCTATATTTGGTGGTCTTCTAGAATGCCCGTTGGAGCCTACACCCAAGAGAAAGTATCAAAAACCCCTATTATTCTTGTTTAGTCCCCGGTTCCGGTTTTT
GTATTTCTTCTCATGGTTTTTGATATTGGGTTTTCAGGAAGAGATGTCGAGTCCAATCAATGGTCAGATGTTTGATTTCTGTGATCCCGAGCTGTTCTCGGAGACTCTCC
AGAACTTGGAGTTCAATTCGGGCTCGAATTGCTGTTACGAAAAGAACTCCTCTGACCATTCCCCAGAAACAGAGAACCACATCGCAGCCGGTGGTGCGGTGTTTATACCT
GCCACCGACGCATCGGCTGCAACAAACAGTAATCTCACAGCTATCTTCGATTCCCAAGAAGAACTCGACAATGACATCTCTGCTTCCATTGACTTCTCCCCGTCCGCTTC
ATTTTCCATCCCTCAATACCTGACCATCCAGTCAGCTGGGCAGTTTGATGTTTGTCAAATGCAGGCCCAAATGCCATTAGTAGACCCCAATCCCATCATGGAGGGGCTGG
TGCAGTGCCCCATGGCTCCAGTTGAAGACTTGCCTTCCATTTATGTTGACGATTGCTTGTCTTCCTTGACTTCCAACTTGCCACTGAATCCTTCCTCCCCTTCCTGCTCC
TTCGTGGGAGCAACCATGGCAACTTACCTGCCAGCATCAGCAGCGTCGGTCGAAAGTTGCGGCGGAATCTTCCCTCTGATGGGCTCTGAATTGCAGCCACAAGACCTGGA
CTTTCAGGGCGACAACTGTGGACTCTACAGCCAAGACTGTTTGCAGGGGACTTTCAACCCAGCAGATCTTCAAGTGCTCAACAATGAGACCCTGCAACTGGCGGGTGGGG
CAATGAACTGCACTTCTTTAGCATCGGATCTCTCGAGCTTGAAGGACAGTACTTTCAAAGTAGGGAAACTGTCCGTAGAAGAGAGGAAGGAGAAGATTCATAGGTACATG
AAGAAGAGAAATGAGAGGAACTTCAGCAAGAAAATCAAGTATGCCTGCCGAAAAACACTAGCAGATAGCCGCCCACGGGTTCGGGGACGGTTCGCCAAGAACGACGAATT
GACAGAGAATCATAGGACAGCTTCTAGCAACCACGAAGGAGATGAAGAAGAAGTAGTTGTGAAGGAAGAAGATAGTATGGTTGATTCCTCAGATATCTTTGCTCATATCA
GTGGAGTGAACTCCTTCAAGTGCAACTATCCAATCCAATCTTGGATTTGA
mRNA sequenceShow/hide mRNA sequence
GCCAACATGAGCTTAGCTCAATGTAATTCACACCGGCTCTTGTCTTTGAAGTCAAAGATCTGAACCCCGAACTCTCGATTTTTGTACTAAAAAAAGTTGAGTAGTTTGTT
ACAATTTTTTTTAATTAAATAGATAAATAAAAACTAAAGATACATAGATTTGATGGCAACAAAAAAACTAAAAAAAAAATGTTGAAAATTTATCCTGAAATGGAAGTTTC
AATTTCAGCGCCAAATGTGACAATGTCTGATAGAAATCGTCGCCACCGCTCTCTTTCTACAATTTGTTCTGCAGCAGATTGTGATTCGCCACACGCCATCCCTCCCTCTC
TTCCACCGACCGCCTTCGCTTTCTTCTTGTCTCTTTGATTTCTTCATCCGCGGGGTACCCAAGGCCAGAGCAGAAGAAGAGAAGCATGGAGGTCGAAGCAACCCGAGCGT
GAGAGATTACTGAATTTCCCGTTTTCTTTTTGATCTTCTCACCGGAGCCAGATGCATTCCCTGAAGGAGAAAGTCTTCGGCCTCTTCTCCAATTCAACTAGCTCGTCGTC
CTCTAAATCGTCGCCCCCTGATCCTCGTAGTCAGGCCAGGCCAAAATCGAAAGGAAGAAAATCACTGTCTTCGTATTTTTCGCTGGTAATCCATTCCATACATGGATCTA
AATCAGCTAGTTGTCACCATGACGACAATGCAGTTCAATCTCCCTCGGTTCAATTTTGTGATGCAAACAATGATTTCCAGGAGGAAGGCTCAGATACTCTTACAGAACGT
AGTCTAGTATGCAAGACGAAAGAAGTACCTAGTAATCAGGGAGAAAATGAGGATTGTGGTTCAGCGCATGAGAAAAAGGTAAAACTGAATATACCAGGAGTTGACGATGA
CCCAGCATCTGGAAAGAGCACTTGTAGTTCAGATGTATTTGAAGAAGCTATGGAGCGGCCCTCTCCAAGAAAGCCCTTATCAGACCTCACAGATGAGTCGTGTTTTATTT
CTGAAGACTTGTATGAATTCCTGGGATGTTGTCTGCCCAACATTGTGAAAGGGTGCAAATGGATCCTAATGTACAGTACGGCGAAGCATGGTATATCTCTTCGAACACTT
ATTCGCAAGAGCCATGATCTTTCTGGCCCTTGTTTACTGATTGTTGGAGATACACGAGGTGCTATATTTGGTGGTCTTCTAGAATGCCCGTTGGAGCCTACACCCAAGAG
AAAGTATCAAAAACCCCTATTATTCTTGTTTAGTCCCCGGTTCCGGTTTTTGTATTTCTTCTCATGGTTTTTGATATTGGGTTTTCAGGAAGAGATGTCGAGTCCAATCA
ATGGTCAGATGTTTGATTTCTGTGATCCCGAGCTGTTCTCGGAGACTCTCCAGAACTTGGAGTTCAATTCGGGCTCGAATTGCTGTTACGAAAAGAACTCCTCTGACCAT
TCCCCAGAAACAGAGAACCACATCGCAGCCGGTGGTGCGGTGTTTATACCTGCCACCGACGCATCGGCTGCAACAAACAGTAATCTCACAGCTATCTTCGATTCCCAAGA
AGAACTCGACAATGACATCTCTGCTTCCATTGACTTCTCCCCGTCCGCTTCATTTTCCATCCCTCAATACCTGACCATCCAGTCAGCTGGGCAGTTTGATGTTTGTCAAA
TGCAGGCCCAAATGCCATTAGTAGACCCCAATCCCATCATGGAGGGGCTGGTGCAGTGCCCCATGGCTCCAGTTGAAGACTTGCCTTCCATTTATGTTGACGATTGCTTG
TCTTCCTTGACTTCCAACTTGCCACTGAATCCTTCCTCCCCTTCCTGCTCCTTCGTGGGAGCAACCATGGCAACTTACCTGCCAGCATCAGCAGCGTCGGTCGAAAGTTG
CGGCGGAATCTTCCCTCTGATGGGCTCTGAATTGCAGCCACAAGACCTGGACTTTCAGGGCGACAACTGTGGACTCTACAGCCAAGACTGTTTGCAGGGGACTTTCAACC
CAGCAGATCTTCAAGTGCTCAACAATGAGACCCTGCAACTGGCGGGTGGGGCAATGAACTGCACTTCTTTAGCATCGGATCTCTCGAGCTTGAAGGACAGTACTTTCAAA
GTAGGGAAACTGTCCGTAGAAGAGAGGAAGGAGAAGATTCATAGGTACATGAAGAAGAGAAATGAGAGGAACTTCAGCAAGAAAATCAAGTATGCCTGCCGAAAAACACT
AGCAGATAGCCGCCCACGGGTTCGGGGACGGTTCGCCAAGAACGACGAATTGACAGAGAATCATAGGACAGCTTCTAGCAACCACGAAGGAGATGAAGAAGAAGTAGTTG
TGAAGGAAGAAGATAGTATGGTTGATTCCTCAGATATCTTTGCTCATATCAGTGGAGTGAACTCCTTCAAGTGCAACTATCCAATCCAATCTTGGATTTGATTTTTTTTT
TTTTATGTTGCTTTCTTTAAATAAATAACCAAAAAAGAAGGAGAACAAAACAAAAAAAAGGAAAAAACTACAATTTTGCAGGACCCAACTTGAGTTATGAATATTAGAAA
AACTTTTCTGATGTGCAGTAGAAAGAGAGTGATGGAAGGGTCATAAAAATAGAAATAAGTTTGAGTGAGGTCACCTAGCAATCAAACTTAGCACTTTTCAATTTGTAGAT
ATTTCATCAACAAATTCAAAATCTTTTATTTGTGTACATGTTAGGGGTTTATGC
Protein sequenceShow/hide protein sequence
MHSLKEKVFGLFSNSTSSSSSKSSPPDPRSQARPKSKGRKSLSSYFSLVIHSIHGSKSASCHHDDNAVQSPSVQFCDANNDFQEEGSDTLTERSLVCKTKEVPSNQGENE
DCGSAHEKKVKLNIPGVDDDPASGKSTCSSDVFEEAMERPSPRKPLSDLTDESCFISEDLYEFLGCCLPNIVKGCKWILMYSTAKHGISLRTLIRKSHDLSGPCLLIVGD
TRGAIFGGLLECPLEPTPKRKYQKPLLFLFSPRFRFLYFFSWFLILGFQEEMSSPINGQMFDFCDPELFSETLQNLEFNSGSNCCYEKNSSDHSPETENHIAAGGAVFIP
ATDASAATNSNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLTIQSAGQFDVCQMQAQMPLVDPNPIMEGLVQCPMAPVEDLPSIYVDDCLSSLTSNLPLNPSSPSCS
FVGATMATYLPASAASVESCGGIFPLMGSELQPQDLDFQGDNCGLYSQDCLQGTFNPADLQVLNNETLQLAGGAMNCTSLASDLSSLKDSTFKVGKLSVEERKEKIHRYM
KKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELTENHRTASSNHEGDEEEVVVKEEDSMVDSSDIFAHISGVNSFKCNYPIQSWI