CuGenDBv2

Sed0013045 (gene) of Chayote v1 genome

Gene ID: Sed0013045
Organism: Sechium edule (Chayote v1)
Description: CCHC-type domain-containing protein
Genome location: LG12:524791..528733
RNA-Seq Expression: Sed0013045
Synteny: Sed0013045
Gene Ontology terms: GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
TYK20834.1 Zinc knuckle family protein, putative isoform 2 [Cucumis melo var. makuwa] | e value: 2.4e-149 | %identity: 61.35
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY
        M+ ETR +IEE V+++LK+SN+ED TE+KVR + E R+G DLS+KQCK LVRNVVE FL+S+ E                + +  E +I PKKE NDDG 
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY

Query:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR
         +IC+LSNNR+VT+H FKG  +VSIRQ+Y KDGKQLP  KGIS+ T+QWS F+S++PAI EAI+QMKR  KRSEHDADKIGA+ +P     P FPIETIR
Subjt:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR

Query:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE-
        FDGKNY  WA QME  L+ LKIAYVL ++CP AV   ESSS N  +SKVAEQ+W SDDH CH  ILNSL D LF+++++K M+A ELWKELKLLY  +E 
Subjt:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE-

Query:  STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASF
         TKRSQVKKY+EF+MVEEKSILEQVEELN IADSI SAG  ID DFHVS IISKLP SW+N+ + LMHE  L    L DRLR EEQLRTQ+NS LSR S 
Subjt:  STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASF

Query:  NPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK
         P   GQH  ++H  KMGD  P ++PLR    +++VKT+LCL+CGKEGH   +CP+ K
Subjt:  NPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK

XP_008437880.1 PREDICTED: uncharacterized protein LOC103483179 [Cucumis melo] | e value: 2.1e-148 | %identity: 61.14
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY
        M+ ETR +IEE V+++LK+SN+ED TE+KVR + E R+G DLS+KQCK LVRNVVE FL+S+ E                + +  E +I PKKE NDDG 
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY

Query:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR
         +IC+LSNNR+VT+H FKG  +VSIRQ+Y KDGKQLP  KGIS+ T+QWS F+S++PAI EAI+QMKR  KRSEHDADKIGA+ +P     P FPIETIR
Subjt:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR

Query:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE-
        FDGKNY  WA QME  L+ LKIAYVL ++CP AV   ESSS N  +SKVAEQ+W SDDH CH  ILNSL D LF+++++K M+A ELWKELKLLY  +E 
Subjt:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE-

Query:  STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASF
         TKRSQVKKY+EF+MVEEKSILEQVEELN IADSI SAG  ID DFHVS IISKLP SW+N+ + LM E  L    L DRLR EEQLRTQ+NS LSR S 
Subjt:  STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASF

Query:  NPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK
         P   GQH  ++H  KMGD  P ++PLR    +++VKT+LCL+CGKEGH   +CP+ K
Subjt:  NPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK

XP_022945450.1 uncharacterized protein LOC111449676 [Cucurbita moschata] | e value: 9.8e-167 | %identity: 68.35
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREGD---------------GQLPEPEISPKKELNDDGY
        MD+ETR RI+ETV+D+LK SN+E+MTEYK+R EAE RLG DLSD QCKCLVR+VVE FL S  E D                +  E EI  KKE+N D  
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREGD---------------GQLPEPEISPKKELNDDGY

Query:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR
         VICQLSNNRNVTVH+FKG+ALVSIRQ+YEKDGKQLPG KGISLTT+QWSAFRS++PAIEEAI+QMKRK+KRSEHDA+  GAV  P  G  P FP ETIR
Subjt:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR

Query:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDES
        FDGKNYRVWA QMEF LR LKIAYVL D  P ++   ESSS N   SK +EQEW SDDH C HIILNSL DSLFH++T++TM+ARELWKEL  LY  D  
Subjt:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDES

Query:  TKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASFN
        T+RSQVKKY+EFRMVEEKSILEQVEELN+IA+SI+SAGM ID DFHVS IISKLPPSW N+ VKLM EE L  V L+DRLR EE+LRTQQNSH S     
Subjt:  TKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASFN

Query:  PGGQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK
         GG+ P  +H  KMGD   QSLP R   WK DVKT+LCLNCGKEGH+ RDCPS K
Subjt:  PGGQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK

XP_023539029.1 uncharacterized protein LOC111799782 [Cucurbita pepo subsp. pepo] | e value: 1.4e-165 | %identity: 68.43
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREGD---------------GQLPEPEISPKKELNDDGY
        MD+ETR RIEETV+D+LK SN+E+MTEYK+R EAE +LG DLSD QCKCLVRNVVE FL S  E D                +  E EI  KKE+N D  
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREGD---------------GQLPEPEISPKKELNDDGY

Query:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR
         VICQLSNNRNVTVH+FKG+ALVSIRQ+YEKDGKQLPG KGISLTT+QWSAFRS++PAIEEAI+QMKRK++RSEHDA+  GA   P  G  P FP ETIR
Subjt:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR

Query:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDES
        FDGKNYRVWA QMEF LR LKIAYVL D  P A+   ESSS N   SK +EQEW SDDH C HIILNSL DSLFH++T++TM+ARELWKEL  LY  D  
Subjt:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDES

Query:  TKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASFN
        T+RSQVKKY+EFRMVEEKSILEQVEELN+IA+SI+SAGM ID DFHVS IISKLPPSW N+ VKLM EE L  V L+DRLR EE+LRTQQNSH S     
Subjt:  TKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASFN

Query:  PGGQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPS
         GG+ P  +H  KMGD   QSLP R   WK DVKT+LCLNCGKEGH+ RDCPS
Subjt:  PGGQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPS

XP_038878142.1 uncharacterized protein LOC120070296 [Benincasa hispida] | e value: 1.2e-148 | %identity: 62.75
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIRE----GDGQLPEP-----------EISPKKELNDDGY
        MD ET+ +IEETV+D+LKKSN+E+ TE+KVR + E RLG DLS+++ K LVRNVVE FL+S+ E    G    P P           EI+P KE NDDG 
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIRE----GDGQLPEP-----------EISPKKELNDDGY

Query:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVD-PNIGIPPGFPIETI
         VIC+LSNNR+VT+H F+G+ +VSIRQF+EKDGKQLP  KGIS++T+QWSAF+S++PAIE+AI+QMK K KRSEHDADKIGAV +  N   PP FP ETI
Subjt:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVD-PNIGIPPGFPIETI

Query:  RFDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE
        RFDGKNY +WA QME  L++LKIAYVL ++CP  V   ESSS N  ++K AEQ+W SDD  C   ILNSL D LF+++  KTM+A ELW ELKLLY  +E
Subjt:  RFDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE

Query:  -STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRAS
          TKRSQVKKY+EFRMVEEKSILEQVE+LN+IADSIVSAG  ID DFHVS IISKLP SW ++ V LMHE+ LS   L+DRLR EEQLRTQ+NSHLS  S
Subjt:  -STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRAS

Query:  FNPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK
          P   GQH  ++H+ KM D K  SLPLR   W+ DVKT+LCLNCGKEGH   +CPS K
Subjt:  FNPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK

TrEMBL top hits (e value, %identity, alignment)
A0A0A0L3U5 CCHC-type domain-containing protein | e value: 2.4e-147 | %identity: 60.78
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY
        M+ ETR RIEE V+++LKKS++ED TE+KVR + E RLG DLS+KQCK LVRNVVE FL+S+ E                + +  E +I PKKE NDDG 
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY

Query:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGI-PPGFPIETI
         +IC+LSNNR+VT+H FKG+ +VS+RQ+YEKDGKQLP  KGIS+ T+QWS F+S++PAI EAI+QMKR  KRSEHDA+KIGA  +P   +  P +PIETI
Subjt:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGI-PPGFPIETI

Query:  RFDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE
        RFDGKNY  WA QME  L+ LKIAYVL ++CP AV   ESSS N  +SK AEQ+W  DDH C   ILNSL D LF+++++KTM+A ELWKELKLLY  +E
Subjt:  RFDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE

Query:  -STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRAS
          TKRSQVKKY+EF+MVEEKSILEQVEELN IADSI S+G  ID DFHVS IISKLP SW+N+ V LMHE+ L    L DRLR EEQLRTQ+NS LS  S
Subjt:  -STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRAS

Query:  FN--PGGQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK
         +  P GQH  ++H  KMGD KP ++PLR    +++VKT+LCL+CGKEGH   +CP+ K
Subjt:  FN--PGGQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK

A0A1S3AV18 uncharacterized protein LOC103483179 | e value: 9.9e-149 | %identity: 61.14
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY
        M+ ETR +IEE V+++LK+SN+ED TE+KVR + E R+G DLS+KQCK LVRNVVE FL+S+ E                + +  E +I PKKE NDDG 
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY

Query:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR
         +IC+LSNNR+VT+H FKG  +VSIRQ+Y KDGKQLP  KGIS+ T+QWS F+S++PAI EAI+QMKR  KRSEHDADKIGA+ +P     P FPIETIR
Subjt:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR

Query:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE-
        FDGKNY  WA QME  L+ LKIAYVL ++CP AV   ESSS N  +SKVAEQ+W SDDH CH  ILNSL D LF+++++K M+A ELWKELKLLY  +E 
Subjt:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE-

Query:  STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASF
         TKRSQVKKY+EF+MVEEKSILEQVEELN IADSI SAG  ID DFHVS IISKLP SW+N+ + LM E  L    L DRLR EEQLRTQ+NS LSR S 
Subjt:  STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASF

Query:  NPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK
         P   GQH  ++H  KMGD  P ++PLR    +++VKT+LCL+CGKEGH   +CP+ K
Subjt:  NPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK

A0A5A7TZ44 Zinc knuckle family protein, putative isoform 2 | e value: 9.9e-149 | %identity: 61.14
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY
        M+ ETR +IEE V+++LK+SN+ED TE+KVR + E R+G DLS+KQCK LVRNVVE FL+S+ E                + +  E +I PKKE NDDG 
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY

Query:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR
         +IC+LSNNR+VT+H FKG  +VSIRQ+Y KDGKQLP  KGIS+ T+QWS F+S++PAI EAI+QMKR  KRSEHDADKIGA+ +P     P FPIETIR
Subjt:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR

Query:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE-
        FDGKNY  WA QME  L+ LKIAYVL ++CP AV   ESSS N  +SKVAEQ+W SDDH CH  ILNSL D LF+++++K M+A ELWKELKLLY  +E 
Subjt:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE-

Query:  STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASF
         TKRSQVKKY+EF+MVEEKSILEQVEELN IADSI SAG  ID DFHVS IISKLP SW+N+ + LM E  L    L DRLR EEQLRTQ+NS LSR S 
Subjt:  STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASF

Query:  NPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK
         P   GQH  ++H  KMGD  P ++PLR    +++VKT+LCL+CGKEGH   +CP+ K
Subjt:  NPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK

A0A5D3DBA1 Zinc knuckle family protein, putative isoform 2 | e value: 1.2e-149 | %identity: 61.35
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY
        M+ ETR +IEE V+++LK+SN+ED TE+KVR + E R+G DLS+KQCK LVRNVVE FL+S+ E                + +  E +I PKKE NDDG 
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREG---------------DGQLPEPEISPKKELNDDGY

Query:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR
         +IC+LSNNR+VT+H FKG  +VSIRQ+Y KDGKQLP  KGIS+ T+QWS F+S++PAI EAI+QMKR  KRSEHDADKIGA+ +P     P FPIETIR
Subjt:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR

Query:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE-
        FDGKNY  WA QME  L+ LKIAYVL ++CP AV   ESSS N  +SKVAEQ+W SDDH CH  ILNSL D LF+++++K M+A ELWKELKLLY  +E 
Subjt:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDE-

Query:  STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASF
         TKRSQVKKY+EF+MVEEKSILEQVEELN IADSI SAG  ID DFHVS IISKLP SW+N+ + LMHE  L    L DRLR EEQLRTQ+NS LSR S 
Subjt:  STKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASF

Query:  NPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK
         P   GQH  ++H  KMGD  P ++PLR    +++VKT+LCL+CGKEGH   +CP+ K
Subjt:  NPG--GQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK

A0A6J1G0Z2 uncharacterized protein LOC111449676 | e value: 4.7e-167 | %identity: 68.35
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREGD---------------GQLPEPEISPKKELNDDGY
        MD+ETR RI+ETV+D+LK SN+E+MTEYK+R EAE RLG DLSD QCKCLVR+VVE FL S  E D                +  E EI  KKE+N D  
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREGD---------------GQLPEPEISPKKELNDDGY

Query:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR
         VICQLSNNRNVTVH+FKG+ALVSIRQ+YEKDGKQLPG KGISLTT+QWSAFRS++PAIEEAI+QMKRK+KRSEHDA+  GAV  P  G  P FP ETIR
Subjt:  TVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIR

Query:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDES
        FDGKNYRVWA QMEF LR LKIAYVL D  P ++   ESSS N   SK +EQEW SDDH C HIILNSL DSLFH++T++TM+ARELWKEL  LY  D  
Subjt:  FDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDES

Query:  TKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASFN
        T+RSQVKKY+EFRMVEEKSILEQVEELN+IA+SI+SAGM ID DFHVS IISKLPPSW N+ VKLM EE L  V L+DRLR EE+LRTQQNSH S     
Subjt:  TKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASFN

Query:  PGGQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK
         GG+ P  +H  KMGD   QSLP R   WK DVKT+LCLNCGKEGH+ RDCPS K
Subjt:  PGGQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK

SwissProt top hits (e value, %identity, alignment)
O65154 RNA polymerase II transcriptional coactivator KIWI | e value: 4.9e-12 | %identity: 38.2
Query:  EGDGQLPEPEIS-PKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQM
        E +   P  +++ P  + +     V+C +S NR V+V ++ G   + IR+FY KDGK LPG KGISL+ DQW+  R+    IE+A+  +
Subjt:  EGDGQLPEPEIS-PKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQM

O65155 RNA polymerase II transcriptional coactivator KELP | e value: 4.6e-34 | %identity: 46.67
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIR----EGDGQLPEPEISPK------KELNDDGYTVICQ
        M+ ET+ +IE+TV++IL +S+++++TE+KVR  A  +L  DLS+K  K  VR+VVE FL   R    E      E E   K      KE +DDG  +IC+
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIR----EGDGQLPEPEISPK------KELNDDGYTVICQ

Query:  LSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKM
        LS+ R VT+ +FKG +LVSIR++Y+KDGK+LP  KGISLT +QWS F+ ++PAIE A+ +M+ ++
Subjt:  LSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKM

P53999 Activated RNA polymerase II transcriptional coactivator p15 | e value: 6.2e-07 | %identity: 32.91
Query:  ISPKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFY-EKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQM
        +S  K+ +      + Q+   R V+V DFKG  L+ IR+++ + +G+  PG KGISL  +QWS  +  +  I++A+ ++
Subjt:  ISPKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFY-EKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQM

Q4R947 Activated RNA polymerase II transcriptional coactivator p15 | e value: 6.2e-07 | %identity: 32.91
Query:  ISPKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFY-EKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQM
        +S  K+ +      + Q+   R V+V DFKG  L+ IR+++ + +G+  PG KGISL  +QWS  +  +  I++A+ ++
Subjt:  ISPKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFY-EKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQM

Q5R6D0 Activated RNA polymerase II transcriptional coactivator p15 | e value: 6.2e-07 | %identity: 32.91
Query:  ISPKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFY-EKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQM
        +S  K+ +      + Q+   R V+V DFKG  L+ IR+++ + +G+  PG KGISL  +QWS  +  +  I++A+ ++
Subjt:  ISPKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFY-EKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQM

Arabidopsis top hits (e value, %identity, alignment)
AT4G00980.1 zinc knuckle (CCHC-type) family protein | e value: 4.8e-79 | %identity: 39.51
Query:  RIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREGDGQLPEPEISPKKE--------LNDDGYTVICQLSNNRNVTV
        +IEETV  IL +S+++ MTE+K+R +A  +LG DLS    K LVR+V+E FL+S   G+  +PE     K E        +  +    IC+LS  +N TV
Subjt:  RIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREGDGQLPEPEISPKKE--------LNDDGYTVICQLSNNRNVTV

Query:  HDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETI-RFDGKNYRVWACQM
          ++G   +SI    ++ GK    F+G  L+T+QWS  + +  AIE+ I Q + K+K            VD +     GF +  I RFDGK+Y  WA QM
Subjt:  HDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETI-RFDGKNYRVWACQM

Query:  EFFLRYLKIAYVLFDRCPI--AVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDES-TKRSQVKKYI
        E FL+ LK+ YVL + CP   +    E++ R +  +    ++W  DD+ C+  ++NSL D L+ ++++K   A+ELW ELK +Y  DES +KRSQV+KYI
Subjt:  EFFLRYLKIAYVLFDRCPI--AVPELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDES-TKRSQVKKYI

Query:  EFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASFNPGGQHPTSHH
        EFRMVEE+ ILEQV+  N IADSIVSAGM +D  FHVS IISK PPSWR    +LM EE L    L++R++ EE+L     +     ++ P         
Subjt:  EFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDFHVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASFNPGGQHPTSHH

Query:  MPKMGDLK--PQSLPLRNNVWKRDVKTVL-CLNCGKEGHVCRDCPSGK
         P +G      QS+  +    +RD + ++ C NCG++GH+ + C   K
Subjt:  MPKMGDLK--PQSLPLRNNVWKRDVKTVL-CLNCGKEGHVCRDCPSGK

AT4G10920.1 transcriptional coactivator p15 (PC4) family protein (KELP) | e value: 3.3e-35 | %identity: 46.67
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIR----EGDGQLPEPEISPK------KELNDDGYTVICQ
        M+ ET+ +IE+TV++IL +S+++++TE+KVR  A  +L  DLS+K  K  VR+VVE FL   R    E      E E   K      KE +DDG  +IC+
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIR----EGDGQLPEPEISPK------KELNDDGYTVICQ

Query:  LSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKM
        LS+ R VT+ +FKG +LVSIR++Y+KDGK+LP  KGISLT +QWS F+ ++PAIE A+ +M+ ++
Subjt:  LSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKM

AT4G10920.2 transcriptional coactivator p15 (PC4) family protein (KELP) | e value: 3.3e-35 | %identity: 46.67
Query:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIR----EGDGQLPEPEISPK------KELNDDGYTVICQ
        M+ ET+ +IE+TV++IL +S+++++TE+KVR  A  +L  DLS+K  K  VR+VVE FL   R    E      E E   K      KE +DDG  +IC+
Subjt:  MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIR----EGDGQLPEPEISPK------KELNDDGYTVICQ

Query:  LSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKM
        LS+ R VT+ +FKG +LVSIR++Y+KDGK+LP  KGISLT +QWS F+ ++PAIE A+ +M+ ++
Subjt:  LSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKM

AT5G09240.1 ssDNA-binding transcriptional regulator | e value: 2.2e-07 | %identity: 35.71
Query:  PEPEISPKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLP--GFKGISLTTDQWSAFRSSVPAIEEAIMQM
        P+    P  E+ D     IC L  NR V V +  G   ++IRQF+ KDG  LP    +GISL+ +QW+  R+    I++A+ ++
Subjt:  PEPEISPKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLP--GFKGISLTTDQWSAFRSSVPAIEEAIMQM

AT5G09250.1 ssDNA-binding transcriptional regulator | e value: 3.5e-13 | %identity: 38.2
Query:  EGDGQLPEPEIS-PKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQM
        E +   P  +++ P  + +     V+C +S NR V+V ++ G   + IR+FY KDGK LPG KGISL+ DQW+  R+    IE+A+  +
Subjt:  EGDGQLPEPEIS-PKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSIRQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQM


Sequences
CDS sequence
ATGGACAGCGAAACGCGGTGGAGAATCGAGGAAACGGTGATGGACATATTGAAGAAATCGAACTTGGAAGACATGACGGAGTACAAAGTCCGATTCGAGGCCGAGAACCG
CCTCGGATTTGATCTCTCCGACAAGCAATGCAAGTGCCTGGTCAGGAACGTGGTCGAGGGTTTTCTAGTTTCAATCAGGGAGGGTGACGGACAGCTGCCGGAGCCGGAGA
TTTCCCCCAAGAAGGAGCTTAACGATGATGGCTACACTGTGATTTGCCAGCTATCTAATAACAGGAATGTGACAGTTCATGATTTTAAAGGGAGCGCTCTGGTATCTATT
AGGCAGTTTTATGAAAAAGATGGAAAACAGCTTCCTGGGTTTAAAGGAATCAGCTTGACAACTGATCAATGGTCTGCATTTAGGAGTAGTGTTCCTGCTATAGAGGAAGC
TATTATGCAGATGAAAAGAAAAATGAAAAGATCTGAACATGATGCTGATAAAATTGGTGCTGTCGTCGATCCCAATATTGGTATTCCTCCAGGATTTCCCATTGAAACTA
TACGATTTGATGGAAAAAATTACCGTGTGTGGGCATGTCAGATGGAGTTTTTTCTGCGGTACTTGAAGATTGCTTATGTACTTTTTGATCGTTGTCCTATTGCCGTGCCT
GAACTAGAATCAAGCTCTAGAAATGTTGTTGAATCGAAGGTAGCTGAACAAGAGTGGAGGAGCGATGACCACACATGTCACCACATCATTTTGAACTCCCTCTGTGATAG
TCTGTTTCATCAGTTTACAGAGAAAACAATGACTGCTAGAGAACTCTGGAAGGAACTAAAATTGCTTTATAATTTTGACGAAAGCACCAAGAGATCTCAAGTTAAAAAAT
ATATTGAATTCAGGATGGTTGAGGAGAAGTCAATCTTAGAACAAGTTGAAGAACTTAATAGCATTGCTGATTCCATTGTTTCTGCTGGAATGCATATTGATAGGGATTTT
CATGTTAGTGTCATTATTTCAAAACTTCCACCCTCTTGGAGGAATATCTCTGTGAAGTTGATGCATGAGGAGCGTCTTTCCTGTGTGGCATTGTTAGATCGATTGAGGTT
TGAAGAACAATTACGTACACAACAAAACTCGCATCTCTCGAGAGCGTCTTTCAATCCAGGAGGCCAACATCCTACCTCACATCATATGCCAAAGATGGGAGACTTGAAGC
CCCAAAGCCTACCGTTGAGGAACAATGTATGGAAAAGGGATGTGAAGACTGTACTCTGTTTGAATTGTGGCAAGGAAGGGCACGTATGTCGAGATTGTCCAAGTGGTAAG
TAG
mRNA sequence
GTTTGTTTTTCATCTTCTACAAATCACAGCAATCGGAATCGGCTGCAACAGTTTTCATTCTTCTTCTTCCAACGCTTCTCCTCTATGTTCTAGATTCATTCTGCCCCATT
ATCCATTCTTGTTCTTCGTATTCCAGAGGCATGGACAGCGAAACGCGGTGGAGAATCGAGGAAACGGTGATGGACATATTGAAGAAATCGAACTTGGAAGACATGACGGA
GTACAAAGTCCGATTCGAGGCCGAGAACCGCCTCGGATTTGATCTCTCCGACAAGCAATGCAAGTGCCTGGTCAGGAACGTGGTCGAGGGTTTTCTAGTTTCAATCAGGG
AGGGTGACGGACAGCTGCCGGAGCCGGAGATTTCCCCCAAGAAGGAGCTTAACGATGATGGCTACACTGTGATTTGCCAGCTATCTAATAACAGGAATGTGACAGTTCAT
GATTTTAAAGGGAGCGCTCTGGTATCTATTAGGCAGTTTTATGAAAAAGATGGAAAACAGCTTCCTGGGTTTAAAGGAATCAGCTTGACAACTGATCAATGGTCTGCATT
TAGGAGTAGTGTTCCTGCTATAGAGGAAGCTATTATGCAGATGAAAAGAAAAATGAAAAGATCTGAACATGATGCTGATAAAATTGGTGCTGTCGTCGATCCCAATATTG
GTATTCCTCCAGGATTTCCCATTGAAACTATACGATTTGATGGAAAAAATTACCGTGTGTGGGCATGTCAGATGGAGTTTTTTCTGCGGTACTTGAAGATTGCTTATGTA
CTTTTTGATCGTTGTCCTATTGCCGTGCCTGAACTAGAATCAAGCTCTAGAAATGTTGTTGAATCGAAGGTAGCTGAACAAGAGTGGAGGAGCGATGACCACACATGTCA
CCACATCATTTTGAACTCCCTCTGTGATAGTCTGTTTCATCAGTTTACAGAGAAAACAATGACTGCTAGAGAACTCTGGAAGGAACTAAAATTGCTTTATAATTTTGACG
AAAGCACCAAGAGATCTCAAGTTAAAAAATATATTGAATTCAGGATGGTTGAGGAGAAGTCAATCTTAGAACAAGTTGAAGAACTTAATAGCATTGCTGATTCCATTGTT
TCTGCTGGAATGCATATTGATAGGGATTTTCATGTTAGTGTCATTATTTCAAAACTTCCACCCTCTTGGAGGAATATCTCTGTGAAGTTGATGCATGAGGAGCGTCTTTC
CTGTGTGGCATTGTTAGATCGATTGAGGTTTGAAGAACAATTACGTACACAACAAAACTCGCATCTCTCGAGAGCGTCTTTCAATCCAGGAGGCCAACATCCTACCTCAC
ATCATATGCCAAAGATGGGAGACTTGAAGCCCCAAAGCCTACCGTTGAGGAACAATGTATGGAAAAGGGATGTGAAGACTGTACTCTGTTTGAATTGTGGCAAGGAAGGG
CACGTATGTCGAGATTGTCCAAGTGGTAAGTAGGAATGTCGATGTCGGTATCTCAGAAAGAACATAGCAGAATCTTACTGAGAATCTGTCATGCGCTCTGGGGCTTTGAA
AGTGCAAAGTCAAGGTTCTAAGCGATAACTTTTCTGCAGGTTTTGTTTTTCACAGGAGCCACTTTAGTGAAGATCATTGGAGGTCTAATTATAATTGGCTCTAAGTTGAT
AGCTTCCTGCTTGATTGTATTATGAGCAGATGATGAATTCATAAGCTCCTTGTCGATTTTCACCACACTGATGTGTATATAAATGTTGCTATGGTTAGTATCAGGATAGA
TGTTAGAAACATTACCAGAAGTTTCTTTTTTTTCCTTCTAATTAGGAGCTAGTTATAACTTAGAAATAGAGGTGGTGGAAGAGATGTAGAAGGTAGTTGTTTTGTTTATA
TGTTTCGACTTTGAGTTTCCTTTTGCTCAAGAGTAGGAAGTGTCCAAGTACCTTGAATCACTTACCGCCAGAAAAGTACTGGCTACTAATTTATTTATTTTTCTCTGCCT
TCTCGGGAGAATTTATTTCTTAAAGGCCGATAAATTCCTGGTATTTTGATCCTAAATTTACTTTTCCTTCTTATCGAAAGTTCAGATTCAAAATCAGTGTCATTACATAA
GCACACATTAGAAAAATCTATATTCATTCTTCAATTCTATCCTTTTCTTCTCTTTAGTCTTCTCAGCAGTAGGGAAACCTTTTTGTTAAAGAATCCCCTTCTTACCAAAA
GGAAAGTGCCTCTCTTTTTTACTAAAAAGAGAAATCCTTTAACTAAAAGGAAAAGAAAGAATAAAATTGACAAGAAAAGATGAAATGGAAG
Protein sequence
MDSETRWRIEETVMDILKKSNLEDMTEYKVRFEAENRLGFDLSDKQCKCLVRNVVEGFLVSIREGDGQLPEPEISPKKELNDDGYTVICQLSNNRNVTVHDFKGSALVSI
RQFYEKDGKQLPGFKGISLTTDQWSAFRSSVPAIEEAIMQMKRKMKRSEHDADKIGAVVDPNIGIPPGFPIETIRFDGKNYRVWACQMEFFLRYLKIAYVLFDRCPIAVP
ELESSSRNVVESKVAEQEWRSDDHTCHHIILNSLCDSLFHQFTEKTMTARELWKELKLLYNFDESTKRSQVKKYIEFRMVEEKSILEQVEELNSIADSIVSAGMHIDRDF
HVSVIISKLPPSWRNISVKLMHEERLSCVALLDRLRFEEQLRTQQNSHLSRASFNPGGQHPTSHHMPKMGDLKPQSLPLRNNVWKRDVKTVLCLNCGKEGHVCRDCPSGK
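
The CDS and protein sequences listed for Sed0013045 can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the `translate` helper and its names are illustrative and not part of CuGenDB.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted in wrapped form. Assumes only A, C, G, T characters.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt and last 15 nt of the CDS sequence listed above.
print(translate("ATGGACAGCGAAACGCGG"))  # prints MDSETR
print(translate("CCAAGTGGTAAGTAG"))     # prints PSGK
```

Passing the full CDS to `translate` and comparing the result with the protein sequence is a quick consistency check; the two fragments shown match the start (MDSETR) and end (PSGK) of the listed protein.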