CuGenDBv2

Tan0003593 (gene) of Snake gourd v1 genome

Gene ID: Tan0003593
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Zinc finger-like protein
Genome location: LG06:75769483..75782065
RNA-Seq Expression: Tan0003593
Synteny: Tan0003593
Gene Ontology terms:
GO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR006571 - TLDc domain
IPR010402 - CCT domain
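
The genome location above uses a `seqid:start..end` notation. As a minimal sketch (assuming 1-based, inclusive coordinates, which the page does not state; the function name `parse_location` is hypothetical), the span can be parsed and its length computed as follows:

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("LG06:75769483..75782065")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the locus spans 12,583 bp on LG06.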


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0031932.1 CCT domain-containing protein [Cucumis melo var. makuwa] | e-value: 2.7e-200 | identity: 84.44%
Query:  QNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPAN
        +N A CLFE  +DFC  VPK IS+EISSPINAQIFDF DPELF+ETLQNSEFNSCSNCCYDKNS YATNLS+SP +TD   NGNGNGNTVA  ASFIPAN
Subjt:  QNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPAN

Query:  DASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLS
        DAS ATNIT NS SNL+AIFDSQEELDNDISASI+FSPSASFS+PQYL IQS QFDVSQ+QSQMPLVDPMIEGLVQCPMAPV  L+D+DL SIYVDDCLS
Subjt:  DASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLS

Query:  SLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSL
        SLTSYMP+NP+SPSCSFVGA+M TYLP  SMNPA S+VESCGMFSLLGP+LQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQL  G MNCTSL
Subjt:  SLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSL

Query:  ASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE---------------
        ASDLSSLKDSTFKVGKLS+EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE               
Subjt:  ASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE---------------

Query:  ---VVVKEEDSMVDASDIFAHISGVNSFKCNYPIQSW
           VVVKEEDSM+D+SDIFAHISGVNSFKCNYPIQSW
Subjt:  ---VVVKEEDSMVDASDIFAHISGVNSFKCNYPIQSW

XP_008460686.1 PREDICTED: uncharacterized protein LOC103499454 [Cucumis melo] | e-value: 3.7e-202 | identity: 87.65%
Query:  QNCASCLFEANVDFCTIVPKDIS--EEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIP
        +N A CLFE  +DFC  VPK IS  +EISSPINAQIFDF DPELF+ETLQNSEFNSCSNCCYDKNS YATNLS+SP +TD   NGNGNGNTVA  ASFIP
Subjt:  QNCASCLFEANVDFCTIVPKDIS--EEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIP

Query:  ANDASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDC
        ANDAS ATNIT NS SNL+AIFDSQEELDNDISASI+FSPSASFS+PQYL IQS QFDVSQ+QSQMPLVDPMIEGLVQCPMAPV  L+D+DL SIYVDDC
Subjt:  ANDASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDC

Query:  LSSLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCT
        LSSLTSYMP+NP+SPSCSFVGA+M TYLP  SMNPA S+VESCGMFSLLGP+LQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQL  G MNCT
Subjt:  LSSLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCT

Query:  SLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDAS
        SLASDLSSLKDSTFKVGKLS+EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSM+D+S
Subjt:  SLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDAS

Query:  DIFAHISGVNSFKCNYPIQSW
        DIFAHISGVNSFKCNYPIQSW
Subjt:  DIFAHISGVNSFKCNYPIQSW

XP_011649131.1 uncharacterized protein LOC101214336 isoform X1 [Cucumis sativus] | e-value: 6.3e-202 | identity: 88.6%
Query:  QNCASCLFEANVDFCTIVPKDIS--EEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIP
        +N   CLFE  V FC  VPK IS  +EISSPINAQIFDF DPELF+ETLQNSEFNSCSNCCYDKNS YATNLS+SP +TD  GNGNGNGNTVA  ASFIP
Subjt:  QNCASCLFEANVDFCTIVPKDIS--EEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIP

Query:  ANDASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDC
         NDAS ATNIT NSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYL IQS QFDVSQ+QSQMPLVDPMIEGLVQCPMAPV  L+D+DL SIYVDDC
Subjt:  ANDASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDC

Query:  LSSLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCT
        LSSLTSYMPLNP+SPSCSFVG TM TYLP  SMNPA S+VESCGMFSLLGPDL  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQL  G MNCT
Subjt:  LSSLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCT

Query:  SLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDAS
        SLASDLSSLKDSTFKVGKLS EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVD+S
Subjt:  SLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDAS

Query:  DIFAHISGVNSFKCNYPIQSW
        DIFAHISGVNSFKCNYPIQSW
Subjt:  DIFAHISGVNSFKCNYPIQSW

XP_011649132.1 uncharacterized protein LOC101214336 isoform X2 [Cucumis sativus] | e-value: 2.0e-203 | identity: 89.02%
Query:  QNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPAN
        +N   CLFE  V FC  VPK IS+EISSPINAQIFDF DPELF+ETLQNSEFNSCSNCCYDKNS YATNLS+SP +TD  GNGNGNGNTVA  ASFIP N
Subjt:  QNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPAN

Query:  DASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLS
        DAS ATNIT NSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYL IQS QFDVSQ+QSQMPLVDPMIEGLVQCPMAPV  L+D+DL SIYVDDCLS
Subjt:  DASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLS

Query:  SLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSL
        SLTSYMPLNP+SPSCSFVG TM TYLP  SMNPA S+VESCGMFSLLGPDL  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQL  G MNCTSL
Subjt:  SLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSL

Query:  ASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDI
        ASDLSSLKDSTFKVGKLS EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVD+SDI
Subjt:  ASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDI

Query:  FAHISGVNSFKCNYPIQSW
        FAHISGVNSFKCNYPIQSW
Subjt:  FAHISGVNSFKCNYPIQSW

XP_038875443.1 uncharacterized protein LOC120067896 isoform X1 [Benincasa hispida] | e-value: 3.7e-202 | identity: 88.33%
Query:  QNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPAN
        +N ASCLFE  VDF   VPK I +EISSPINAQIFDF DPELF+ETLQ+SEFNSCSNCCYDKNS Y TNLS+SP +TD   NGN NGNTVAA ASF+PAN
Subjt:  QNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPAN

Query:  DASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLS
        DAS ATNIT NS SNLTAIFDSQEELDNDISASIDFSPSASFSIPQYL IQS QFDVSQ+QSQMPL+DPMIEGLVQCPMAPV TL+D+DL SIYVDDCLS
Subjt:  DASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLS

Query:  SLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSL
        SLTSYMPLNPSSPSCSFVGATM TYLP  SM PA S+VESCGMFSLLG +LQPQDLDYQGDNCGLY+QDCMQGTFNPADLQVLNNENLQL  G MNCTSL
Subjt:  SLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSL

Query:  ASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDI
        ASDLSSLKDSTFKVGKLS+EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVD+SDI
Subjt:  ASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDI

Query:  FAHISGVNSFKCNYPIQSWI
        FAHISGVNSFKCNYPIQSWI
Subjt:  FAHISGVNSFKCNYPIQSWI

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LKJ8 CCT domain-containing protein | e-value: 5.6e-196 | identity: 90.91%
Query:  EEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPANDASGATNITANSASNLTAIFDSQ
        +EISSPINAQIFDF DPELF+ETLQNSEFNSCSNCCYDKNS YATNLS+SP +TD  GNGNGNGNTVA  ASFIP NDAS ATNIT NSASNLTAIFDSQ
Subjt:  EEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPANDASGATNITANSASNLTAIFDSQ

Query:  EELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMG
        EELDNDISASIDFSPSASFSIPQYL IQS QFDVSQ+QSQMPLVDPMIEGLVQCPMAPV  L+D+DL SIYVDDCLSSLTSYMPLNP+SPSCSFVG TM 
Subjt:  EELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLSSLTSYMPLNPSSPSCSFVGATMG

Query:  TYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSLASDLSSLKDSTFKVGKLSVEERK
        TYLP  SMNPA S+VESCGMFSLLGPDL  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQL  G MNCTSLASDLSSLKDSTFKVGKLS EERK
Subjt:  TYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSLASDLSSLKDSTFKVGKLSVEERK

Query:  EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDIFAHISGVNSFKCNYPIQSW
        EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVD+SDIFAHISGVNSFKCNYPIQSW
Subjt:  EKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDIFAHISGVNSFKCNYPIQSW

A0A1S3CCK1 uncharacterized protein LOC103499454 | e-value: 1.8e-202 | identity: 87.65%
Query:  QNCASCLFEANVDFCTIVPKDIS--EEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIP
        +N A CLFE  +DFC  VPK IS  +EISSPINAQIFDF DPELF+ETLQNSEFNSCSNCCYDKNS YATNLS+SP +TD   NGNGNGNTVA  ASFIP
Subjt:  QNCASCLFEANVDFCTIVPKDIS--EEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIP

Query:  ANDASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDC
        ANDAS ATNIT NS SNL+AIFDSQEELDNDISASI+FSPSASFS+PQYL IQS QFDVSQ+QSQMPLVDPMIEGLVQCPMAPV  L+D+DL SIYVDDC
Subjt:  ANDASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDC

Query:  LSSLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCT
        LSSLTSYMP+NP+SPSCSFVGA+M TYLP  SMNPA S+VESCGMFSLLGP+LQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQL  G MNCT
Subjt:  LSSLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCT

Query:  SLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDAS
        SLASDLSSLKDSTFKVGKLS+EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSM+D+S
Subjt:  SLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDAS

Query:  DIFAHISGVNSFKCNYPIQSW
        DIFAHISGVNSFKCNYPIQSW
Subjt:  DIFAHISGVNSFKCNYPIQSW

A0A5A7SLG3 CCT domain-containing protein | e-value: 1.3e-200 | identity: 84.44%
Query:  QNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPAN
        +N A CLFE  +DFC  VPK IS+EISSPINAQIFDF DPELF+ETLQNSEFNSCSNCCYDKNS YATNLS+SP +TD   NGNGNGNTVA  ASFIPAN
Subjt:  QNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPAN

Query:  DASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLS
        DAS ATNIT NS SNL+AIFDSQEELDNDISASI+FSPSASFS+PQYL IQS QFDVSQ+QSQMPLVDPMIEGLVQCPMAPV  L+D+DL SIYVDDCLS
Subjt:  DASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLS

Query:  SLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSL
        SLTSYMP+NP+SPSCSFVGA+M TYLP  SMNPA S+VESCGMFSLLGP+LQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQL  G MNCTSL
Subjt:  SLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSL

Query:  ASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE---------------
        ASDLSSLKDSTFKVGKLS+EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE               
Subjt:  ASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEE---------------

Query:  ---VVVKEEDSMVDASDIFAHISGVNSFKCNYPIQSW
           VVVKEEDSM+D+SDIFAHISGVNSFKCNYPIQSW
Subjt:  ---VVVKEEDSMVDASDIFAHISGVNSFKCNYPIQSW

A0A6J1HJ57 uncharacterized protein LOC111464531 isoform X1 | e-value: 1.3e-179 | identity: 82.31%
Query:  QNCASCLFEANVDFCTI-VPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSPETDYTGNGNGNGNTVAAVASFIPAN
        +N ASC FEANVDF TI VPK ISEEISSPINAQ+FDF DPELFSETLQNSEFNS SNCCYDKNSS  TN+SHSPETD  GNGNGN  T AA ASFIP N
Subjt:  QNCASCLFEANVDFCTI-VPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSPETDYTGNGNGNGNTVAAVASFIPAN

Query:  DASGA-TNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYL-GIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDC
        DAS A TNI AN       IFDSQEELDNDISASIDFSPSASFSIPQYL  I   QFDV Q+QSQ+PLVDPM EGL+QCPM PVATLLDDDL SIYVDDC
Subjt:  DASGA-TNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYL-GIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDC

Query:  LSSLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGG--MN
        LSSLTSYMPLNPSSPSCSFVGATM  YLPAASMNPAASSVESCGMFSLLG +LQP DLDYQGDNCG+YSQD MQGTF+P DLQV+NNE +Q+ GGG  M 
Subjt:  LSSLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGG--MN

Query:  CTSLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVD
        CT +AS+LSSLKDS+FKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAE  RA  SNH  E EEEVVVK+EDS+VD
Subjt:  CTSLASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVD

Query:  ASDIFAHISGVNSFKCNYPIQSWI
        +S IFAHI+GVNSFKCNY IQSWI
Subjt:  ASDIFAHISGVNSFKCNYPIQSWI

B0F827 Zinc finger-like protein | e-value: 9.6e-204 | identity: 89.02%
Query:  QNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPAN
        +N   CLFE  V FC  VPK IS+EISSPINAQIFDF DPELF+ETLQNSEFNSCSNCCYDKNS YATNLS+SP +TD  GNGNGNGNTVA  ASFIP N
Subjt:  QNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSP-ETDYTGNGNGNGNTVAAVASFIPAN

Query:  DASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLS
        DAS ATNIT NSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYL IQS QFDVSQ+QSQMPLVDPMIEGLVQCPMAPV  L+D+DL SIYVDDCLS
Subjt:  DASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLS

Query:  SLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSL
        SLTSYMPLNP+SPSCSFVG TM TYLP  SMNPA S+VESCGMFSLLGPDL  QDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQL  G MNCTSL
Subjt:  SLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSL

Query:  ASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDI
        ASDLSSLKDSTFKVGKLS EERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVD+SDI
Subjt:  ASDLSSLKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDI

Query:  FAHISGVNSFKCNYPIQSW
        FAHISGVNSFKCNYPIQSW
Subjt:  FAHISGVNSFKCNYPIQSW

SwissProt top hits (e-value, % identity, alignment)
A0PJX2 TLD domain-containing protein 2 | e-value: 2.5e-07 | identity: 36.79%
Query:  PGVDDDPASGKSTSSSEVFEEAMERP--TPRNP-LSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLIVGDI
        P   +D  SG+  +  E  EEA   P   P +P +  LT+ S  +SA     L    P  V G  W L++ T + G SLQ+L R     SGP LL++ D 
Subjt:  PGVDDDPASGKSTSSSEVFEEAMERP--TPRNP-LSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLIVGDI

Query:  RGAIFG
         G IFG
Subjt:  RGAIFG

A8KBE0 Oxidation resistance protein 1 | e-value: 3.8e-08 | identity: 40.48%
Query:  PRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPLK
        P +   +L+D S+ +  D  E L   LP    G  W L+YST KHG+SL+TL R    L  P LL++ D    IFG L   P K
Subjt:  PRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPLK

B4F6Q9 Oxidation resistance protein 1 | e-value: 2.5e-07 | identity: 39.29%
Query:  PRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPLK
        P +   +L+D S+ +  +  E L   LP    G  W L+YST KHG+SL+TL R    L  P LL++ D    IFG L   P K
Subjt:  PRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPLK

Q6DFV7 Nuclear receptor coactivator 7 | e-value: 3.2e-07 | identity: 39.6%
Query:  KSTSSSEVFEEAMERPTPRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPL
        KST S    EE  E   P      L   SA +     E L   LP  V+G  W L YST++HG SL+TL R S +L  P LL++ D+   IFG     P 
Subjt:  KSTSSSEVFEEAMERPTPRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPL

Query:  K
        K
Subjt:  K

Q8N573 Oxidation resistance protein 1 | e-value: 1.5e-07 | identity: 41.03%
Query:  DLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPLK
        +L+D S  +  D  E L   LP    G  W L+Y T KHG SL+TL R    L  P L+++ D  G +FG L   PLK
Subjt:  DLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPLK
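
In BLAST-style alignments such as the one above, the middle line marks identical residues with the residue letter, conservative substitutions with `+`, and other mismatches with a space. As a rough sketch (the function name `percent_identity` is hypothetical, and the calculation assumes the whole alignment is the single block shown), the reported % identity can be recomputed from that middle line:

```python
def percent_identity(midline):
    """Percentage of alignment columns whose midline character is a letter."""
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(midline)

# Middle line of the Q8N573 alignment, without the leading padding spaces.
midline = "+L+D S  +  D  E L   LP    G  W L+Y T KHG SL+TL R    L  P L+++ D  G +FG L   PLK"

# Alignment length and number of identical columns.
print(len(midline), sum(ch.isalpha() for ch in midline))
print(round(percent_identity(midline), 2))
```

For this hit, 32 identical columns out of 78 give 41.03%, which matches the value listed for Q8N573.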

Arabidopsis top hits (e-value, % identity, alignment)
AT1G04500.1 CCT motif family protein | e-value: 6.6e-72 | identity: 46.25%
Query:  EEISSPINAQIFDFYDPELFSETL-QNSEFNSCSN-CCYDKNSSYATNLSHSPETDYTGNGNGNGNTVAAVASFIPANDASGATNITANSASNLTAIFDS
        +EI+SP+ AQIFDF D +LF ET  Q SE  S SN C Y +N+    N ++ P+   +G+   + +                      N  ++L+ IFDS
Subjt:  EEISSPINAQIFDFYDPELFSETL-QNSEFNSCSN-CCYDKNSSYATNLSHSPETDYTGNGNGNGNTVAAVASFIPANDASGATNITANSASNLTAIFDS

Query:  QEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMP---LVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLSSLTSYM--PLNPSSPSCSF
        Q++ DNDI+ASIDFS S  F     L     QFD + +Q   P   L       L+  P+            S++ +DCLSS+ SY    +NPSSPSCSF
Subjt:  QEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMP---LVDPMIEGLVQCPMAPVATLLDDDLQSIYVDDCLSSLTSYM--PLNPSSPSCSF

Query:  VGAT-MGTYLPAAS--MNPAASSVESCGMFSLLGPDLQP---QDLDYQGDNCGLYSQDCMQGTFNPAD--LQVLNN-ENLQLTGGGMNCTSLASDLSSLK
        +G T + TY+      MN    S    G    LG D +P   Q ++ Q DN GL+  D ++  FNP D  LQ L+  EN            L ++++ L 
Subjt:  VGAT-MGTYLPAAS--MNPAASSVESCGMFSLLGPDLQP---QDLDYQGDNCGLYSQDCMQGTFNPAD--LQVLNN-ENLQLTGGGMNCTSLASDLSSLK

Query:  DSTF-KVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDIFAHISGV
        D +F KVGKLS E+RKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDE  E +R ACS+H  +++++V VKEE+ +VD+SDIF+HISGV
Subjt:  DSTF-KVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDIFAHISGV

Query:  NSFKCNYPIQSWI
        NSFKCNYPIQSWI
Subjt:  NSFKCNYPIQSWI

AT2G05590.1 TLD-domain containing nucleolar protein | e-value: 1.7e-35 | identity: 37.7%
Query:  MYSLKDKVSGRFSSLFSNSTSSESPKPPAPDPHTQARPKSKGRKSLSSYLSLVIPSIHGSKSTSSHQDADVVQSPSVRYCDANNDFQEEGSDTFLECSIP
        M++LKDKVS + S+LF++S S  +    +     +AR  S   KSLSSY S V+P       + + +D+++     +R            ++++ EC   
Subjt:  MYSLKDKVSGRFSSLFSNSTSSESPKPPAPDPHTQARPKSKGRKSLSSYLSLVIPSIHGSKSTSSHQDADVVQSPSVRYCDANNDFQEEGSDTFLECSIP

Query:  CKTEEMASNQ------EENKDCGSAYAKVKLNKPGVDDDPASGKSTSSSEVFEEAMERPTPRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLY
        CK+    +         E+KDC               +   S K   S   + + +++      + +LT+ S FI+A+L+EFL   LPNIV+GCKW+LLY
Subjt:  CKTEEMASNQ------EENKDCGSAYAKVKLNKPGVDDDPASGKSTSSSEVFEEAMERPTPRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLY

Query:  STMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPLKPTAQRKYQ
        ST+KHGISL+TL+R S  L GPCLL+ GD +GA+FG LLECPL+PT +RKYQ
Subjt:  STMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPLKPTAQRKYQ

AT2G05590.2 TLD-domain containing nucleolar protein | e-value: 1.7e-35 | identity: 37.7%
Query:  MYSLKDKVSGRFSSLFSNSTSSESPKPPAPDPHTQARPKSKGRKSLSSYLSLVIPSIHGSKSTSSHQDADVVQSPSVRYCDANNDFQEEGSDTFLECSIP
        M++LKDKVS + S+LF++S S  +    +     +AR  S   KSLSSY S V+P       + + +D+++     +R            ++++ EC   
Subjt:  MYSLKDKVSGRFSSLFSNSTSSESPKPPAPDPHTQARPKSKGRKSLSSYLSLVIPSIHGSKSTSSHQDADVVQSPSVRYCDANNDFQEEGSDTFLECSIP

Query:  CKTEEMASNQ------EENKDCGSAYAKVKLNKPGVDDDPASGKSTSSSEVFEEAMERPTPRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLY
        CK+    +         E+KDC               +   S K   S   + + +++      + +LT+ S FI+A+L+EFL   LPNIV+GCKW+LLY
Subjt:  CKTEEMASNQ------EENKDCGSAYAKVKLNKPGVDDDPASGKSTSSSEVFEEAMERPTPRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLY

Query:  STMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPLKPTAQRKYQ
        ST+KHGISL+TL+R S  L GPCLL+ GD +GA+FG LLECPL+PT +RKYQ
Subjt:  STMKHGISLQTLIRNSHNLSGPCLLIVGDIRGAIFGGLLECPLKPTAQRKYQ

AT2G33350.1 CCT motif family protein | e-value: 1.2e-68 | identity: 44.29%
Query:  EEISSPINAQIFDFYDPELFSETL-QNSEFNSCSNCCYDKNSSYATNLSHSPETDYTGNGNGNGNTVAAVASFIPANDASGATNITANSASNLTAIFDSQ
        ++I+SP++AQIFDF DP+LF ET  Q+SE  S SN   +K+ S+ +N + +  T+ + N N N NT             +   +   N+ ++L+ IFDSQ
Subjt:  EEISSPINAQIFDFYDPELFSETL-QNSEFNSCSNCCYDKNSSYATNLSHSPETDYTGNGNGNGNTVAAVASFIPANDASGATNITANSASNLTAIFDSQ

Query:  EELDNDISASIDFSPSA-SFSIPQYL--GIQSCQFDVS---QLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQS-IYVDDCLSSLTSY-MPLNPSSPSC
        E+ +NDI+ASIDFS S+  + +  +L   I   QFD S   Q+  Q P +    + L    ++ +++L    LQS ++ +DCLSS+ SY + LN   PSC
Subjt:  EELDNDISASIDFSPSA-SFSIPQYL--GIQSCQFDVS---QLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQS-IYVDDCLSSLTSY-MPLNPSSPSC

Query:  SFVGAT-MGTYLPAASMNPAASSVESCGMFSLLGPDLQPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSLAS----------
        SF  ++ +  Y+    ++  ++     G   +     +P D  +D+Q DN G +  D ++  FNP DLQ        L GG  N + L +          
Subjt:  SFVGAT-MGTYLPAASMNPAASSVESCGMFSLLGPDLQPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSLAS----------

Query:  DLSSLKDSTF-KVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAE-NHRAACSNHEGEEEEEVVVKEEDSMVDASDI
        D++ L+DST  KVGKLS E+RKEKI RYMKKRNERNF+KKIKYACRKTLADSRPRVRGRFAKNDE  E N +A  S+H+ E+E+++ VK+E+ +VD+SDI
Subjt:  DLSSLKDSTF-KVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAE-NHRAACSNHEGEEEEEVVVKEEDSMVDASDI

Query:  FAHISGVNSFKCNYPIQSWI
        FAHISG NSFKCNYPIQSWI
Subjt:  FAHISGVNSFKCNYPIQSWI

AT2G33350.2 CCT motif family protein | e-value: 4.0e-69 | identity: 44.29%
Query:  EEISSPINAQIFDFYDPELFSETL-QNSEFNSCSNCCYDKNSSYATNLSHSPETDYTGNGNGNGNTVAAVASFIPANDASGATNITANSASNLTAIFDSQ
        ++I+SP++AQIFDF DP+LF ET  Q+SE  S SN   +K+ S+ +N + +  T+ + N N N NT             +   +   N+ ++L+ IFDSQ
Subjt:  EEISSPINAQIFDFYDPELFSETL-QNSEFNSCSNCCYDKNSSYATNLSHSPETDYTGNGNGNGNTVAAVASFIPANDASGATNITANSASNLTAIFDSQ

Query:  EELDNDISASIDFSPSA-SFSIPQYL--GIQSCQFDVS---QLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQS-IYVDDCLSSLTSY-MPLNPSSPSC
        E+ +NDI+ASIDFS S+  + +  +L   I   QFD S   Q+  Q P +    + L    ++ +++L    LQS ++ +DCLSS+ SY + LN   PSC
Subjt:  EELDNDISASIDFSPSA-SFSIPQYL--GIQSCQFDVS---QLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQS-IYVDDCLSSLTSY-MPLNPSSPSC

Query:  SFVGAT-MGTYLPAASMNPAASSVESCGMFSLLGPDLQPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSLAS----------
        SF  ++ +  Y+    ++  ++     G   +     +P D  +D+Q DN G +  D ++  FNP DLQ L        GG  N + L +          
Subjt:  SFVGAT-MGTYLPAASMNPAASSVESCGMFSLLGPDLQPQD--LDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSLAS----------

Query:  DLSSLKDSTF-KVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAE-NHRAACSNHEGEEEEEVVVKEEDSMVDASDI
        D++ L+DST  KVGKLS E+RKEKI RYMKKRNERNF+KKIKYACRKTLADSRPRVRGRFAKNDE  E N +A  S+H+ E+E+++ VK+E+ +VD+SDI
Subjt:  DLSSLKDSTF-KVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAE-NHRAACSNHEGEEEEEVVVKEEDSMVDASDI

Query:  FAHISGVNSFKCNYPIQSWI
        FAHISG NSFKCNYPIQSWI
Subjt:  FAHISGVNSFKCNYPIQSWI


Sequences
CDS sequence
ATGTATTCTCTCAAGGATAAAGTCTCCGGGAGGTTCTCTAGCCTCTTCTCTAATTCCACTAGCTCGGAATCCCCAAAACCGCCGGCGCCTGATCCTCATACTCAGGCCAG
ACCGAAATCGAAAGGGAGAAAATCACTGTCTTCATATCTATCCTTAGTAATCCCTTCCATACATGGATCTAAATCAACCAGTTCTCACCAAGACGCCGATGTAGTTCAGT
CTCCCTCAGTTCGATATTGTGATGCAAATAATGATTTCCAGGAGGAAGGATCTGATACATTTTTAGAATGTAGTATACCATGCAAGACGGAAGAAATGGCTAGTAATCAG
GAAGAAAATAAGGATTGTGGTTCAGCATATGCGAAGGTAAAACTGAATAAACCAGGGGTTGATGATGACCCAGCATCTGGAAAGAGCACTAGTAGTTCAGAAGTATTTGA
AGAAGCTATGGAGCGGCCCACTCCAAGGAACCCCTTATCAGACCTCACAGACGAGTCTGCTTTTATCTCTGCAGACTTGTATGAATTCCTGGGATGTTGTCTACCCAACA
TTGTGAAAGGGTGCAAATGGGTCTTACTATACAGTACGATGAAGCATGGTATATCTCTTCAAACTCTTATTCGCAATAGCCACAATCTTTCTGGCCCTTGTTTACTGATT
GTTGGAGATATACGAGGTGCTATATTTGGTGGTCTTCTAGAATGCCCGTTGAAGCCTACAGCCCAAAGAAAATATCAAAATTGTGCTTCTTGCTTGTTCGAAGCAAACGT
CGATTTCTGCACGATCGTCCCTAAGGACATATCGGAAGAAATCTCGAGTCCGATCAATGCGCAAATATTTGATTTCTATGATCCCGAGCTGTTCTCGGAGACGCTTCAGA
ACTCCGAGTTCAATTCTTGCTCGAATTGTTGTTACGACAAGAATTCATCATATGCTACAAATCTGTCTCATTCTCCAGAAACAGATTACACTGGCAATGGCAATGGCAAT
GGCAATACCGTTGCGGCTGTTGCGTCGTTTATACCTGCTAACGACGCATCGGGTGCAACTAACATAACAGCCAATAGTGCTAGTAATCTGACTGCTATCTTTGATTCCCA
AGAAGAACTTGACAATGACATCTCTGCTTCCATAGACTTCTCTCCATCGGCTTCGTTTTCGATCCCTCAATATCTCGGCATCCAGTCATGCCAGTTTGATGTTTCTCAAC
TGCAGTCTCAAATGCCATTAGTAGATCCCATGATTGAGGGGCTCGTGCAGTGTCCTATGGCTCCAGTTGCAACTCTCCTCGACGACGATCTACAGTCAATTTACGTCGAT
GATTGCCTGTCTTCCTTGACTTCCTACATGCCCCTGAATCCTTCTTCCCCTTCGTGCTCGTTTGTCGGAGCAACCATGGGAACTTACCTGCCTGCTGCATCAATGAATCC
CGCTGCATCATCTGTCGAAAGCTGTGGAATGTTCTCTCTCCTTGGCCCGGATTTGCAACCGCAAGACCTCGACTATCAGGGAGATAACTGTGGACTCTACAGCCAAGACT
GTATGCAGGGGACTTTCAATCCAGCAGACCTTCAGGTGCTTAACAATGAGAACCTGCAACTGACTGGTGGGGGAATGAACTGCACTTCATTAGCATCAGATCTCTCAAGC
TTGAAGGACAGTACTTTCAAAGTAGGAAAACTCTCTGTGGAAGAGAGGAAGGAGAAGATTCATAGGTACATGAAGAAAAGAAATGAGAGGAACTTCAGCAAGAAAATCAA
GTATGCCTGCCGAAAAACGCTAGCAGATAGCCGGCCACGAGTTCGGGGACGGTTCGCAAAGAACGACGAATTGGCAGAGAATCACAGGGCTGCTTGTAGCAACCATGAAG
GGGAAGAAGAAGAAGAGGTAGTTGTGAAGGAAGAAGATAGCATGGTTGATGCCTCAGATATCTTTGCTCATATCAGTGGAGTGAACTCCTTCAAATGCAACTATCCAATC
CAGTCTTGGATTTGA
mRNA sequence
ATGTATTCTCTCAAGGATAAAGTCTCCGGGAGGTTCTCTAGCCTCTTCTCTAATTCCACTAGCTCGGAATCCCCAAAACCGCCGGCGCCTGATCCTCATACTCAGGCCAG
ACCGAAATCGAAAGGGAGAAAATCACTGTCTTCATATCTATCCTTAGTAATCCCTTCCATACATGGATCTAAATCAACCAGTTCTCACCAAGACGCCGATGTAGTTCAGT
CTCCCTCAGTTCGATATTGTGATGCAAATAATGATTTCCAGGAGGAAGGATCTGATACATTTTTAGAATGTAGTATACCATGCAAGACGGAAGAAATGGCTAGTAATCAG
GAAGAAAATAAGGATTGTGGTTCAGCATATGCGAAGGTAAAACTGAATAAACCAGGGGTTGATGATGACCCAGCATCTGGAAAGAGCACTAGTAGTTCAGAAGTATTTGA
AGAAGCTATGGAGCGGCCCACTCCAAGGAACCCCTTATCAGACCTCACAGACGAGTCTGCTTTTATCTCTGCAGACTTGTATGAATTCCTGGGATGTTGTCTACCCAACA
TTGTGAAAGGGTGCAAATGGGTCTTACTATACAGTACGATGAAGCATGGTATATCTCTTCAAACTCTTATTCGCAATAGCCACAATCTTTCTGGCCCTTGTTTACTGATT
GTTGGAGATATACGAGGTGCTATATTTGGTGGTCTTCTAGAATGCCCGTTGAAGCCTACAGCCCAAAGAAAATATCAAAATTGTGCTTCTTGCTTGTTCGAAGCAAACGT
CGATTTCTGCACGATCGTCCCTAAGGACATATCGGAAGAAATCTCGAGTCCGATCAATGCGCAAATATTTGATTTCTATGATCCCGAGCTGTTCTCGGAGACGCTTCAGA
ACTCCGAGTTCAATTCTTGCTCGAATTGTTGTTACGACAAGAATTCATCATATGCTACAAATCTGTCTCATTCTCCAGAAACAGATTACACTGGCAATGGCAATGGCAAT
GGCAATACCGTTGCGGCTGTTGCGTCGTTTATACCTGCTAACGACGCATCGGGTGCAACTAACATAACAGCCAATAGTGCTAGTAATCTGACTGCTATCTTTGATTCCCA
AGAAGAACTTGACAATGACATCTCTGCTTCCATAGACTTCTCTCCATCGGCTTCGTTTTCGATCCCTCAATATCTCGGCATCCAGTCATGCCAGTTTGATGTTTCTCAAC
TGCAGTCTCAAATGCCATTAGTAGATCCCATGATTGAGGGGCTCGTGCAGTGTCCTATGGCTCCAGTTGCAACTCTCCTCGACGACGATCTACAGTCAATTTACGTCGAT
GATTGCCTGTCTTCCTTGACTTCCTACATGCCCCTGAATCCTTCTTCCCCTTCGTGCTCGTTTGTCGGAGCAACCATGGGAACTTACCTGCCTGCTGCATCAATGAATCC
CGCTGCATCATCTGTCGAAAGCTGTGGAATGTTCTCTCTCCTTGGCCCGGATTTGCAACCGCAAGACCTCGACTATCAGGGAGATAACTGTGGACTCTACAGCCAAGACT
GTATGCAGGGGACTTTCAATCCAGCAGACCTTCAGGTGCTTAACAATGAGAACCTGCAACTGACTGGTGGGGGAATGAACTGCACTTCATTAGCATCAGATCTCTCAAGC
TTGAAGGACAGTACTTTCAAAGTAGGAAAACTCTCTGTGGAAGAGAGGAAGGAGAAGATTCATAGGTACATGAAGAAAAGAAATGAGAGGAACTTCAGCAAGAAAATCAA
GTATGCCTGCCGAAAAACGCTAGCAGATAGCCGGCCACGAGTTCGGGGACGGTTCGCAAAGAACGACGAATTGGCAGAGAATCACAGGGCTGCTTGTAGCAACCATGAAG
GGGAAGAAGAAGAAGAGGTAGTTGTGAAGGAAGAAGATAGCATGGTTGATGCCTCAGATATCTTTGCTCATATCAGTGGAGTGAACTCCTTCAAATGCAACTATCCAATC
CAGTCTTGGATTTGA
Protein sequence
MYSLKDKVSGRFSSLFSNSTSSESPKPPAPDPHTQARPKSKGRKSLSSYLSLVIPSIHGSKSTSSHQDADVVQSPSVRYCDANNDFQEEGSDTFLECSIPCKTEEMASNQ
EENKDCGSAYAKVKLNKPGVDDDPASGKSTSSSEVFEEAMERPTPRNPLSDLTDESAFISADLYEFLGCCLPNIVKGCKWVLLYSTMKHGISLQTLIRNSHNLSGPCLLI
VGDIRGAIFGGLLECPLKPTAQRKYQNCASCLFEANVDFCTIVPKDISEEISSPINAQIFDFYDPELFSETLQNSEFNSCSNCCYDKNSSYATNLSHSPETDYTGNGNGN
GNTVAAVASFIPANDASGATNITANSASNLTAIFDSQEELDNDISASIDFSPSASFSIPQYLGIQSCQFDVSQLQSQMPLVDPMIEGLVQCPMAPVATLLDDDLQSIYVD
DCLSSLTSYMPLNPSSPSCSFVGATMGTYLPAASMNPAASSVESCGMFSLLGPDLQPQDLDYQGDNCGLYSQDCMQGTFNPADLQVLNNENLQLTGGGMNCTSLASDLSS
LKDSTFKVGKLSVEERKEKIHRYMKKRNERNFSKKIKYACRKTLADSRPRVRGRFAKNDELAENHRAACSNHEGEEEEEVVVKEEDSMVDASDIFAHISGVNSFKCNYPI
QSWI
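
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation, assuming the standard genetic code (the page does not name a translation table) and using a hypothetical helper `translate`, applied to the first and last codons of the CDS:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTATTCTCTCAAGGATAAAGTCTCCGGG"))  # first 10 codons of the CDS
print(translate("CAGTCTTGGATTTGA"))                 # last 5 codons, ending in the stop
```

These calls print `MYSLKDKVSG` and `QSWI`, the start and end of the protein sequence listed above.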