CuGenDBv2

Tan0008641 (gene) of Snake gourd v1 genome

Gene ID: Tan0008641
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CCHC-type domain-containing protein
Genome location: LG10:1322377..1328810
RNA-Seq Expression: Tan0008641
Synteny: Tan0008641
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR003173 - Transcriptional coactivator p15 (PC4), C-terminal
IPR009044 - ssDNA-binding transcriptional regulator
IPR014876 - DEK, C-terminal
IPR036875 - Zinc finger, CCHC-type superfamily
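
The genome location string above can be split into a linkage group and a coordinate pair. The following is a minimal Python sketch, not part of CuGenDB itself; the helper name `parse_location` is hypothetical, and the span calculation assumes that both coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("LG10:1322377..1328810")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 6434 bp on LG10, which is considerably longer than the mRNA listed further down and so is consistent with the gene containing introns.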


Homology
GenBank top hits (E-value, % identity, alignment)
TYK20834.1 Zinc knuckle family protein, putative isoform 2 [Cucumis melo var. makuwa] | E-value: 1.4e-176 | Identity: 69.65%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        M+ ETR +IEE VI+VLK+SN+ED TE++VR + E+R+G+DLS+ QCK LVRNVVE FLL   E    GKE EPGP+VRYEN+A EQ ++PK+E NDDG 
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR
         +ICRLS NR V +H+FKG  +VSIRQYY KDGKQ+P  KGIS+ T+QWS F+S+IPAI EAILQMKR KRSE DADKIG +SNP T VT PKFP ETIR
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR

Query:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF
        FDGKNYH WA QME LLQ LKI YVLS++CPTAVLG ESSSGNAA+SKVAE++WMSDDHMC   I+NSLSD LF++Y+KK MSA ELWKEL LLY +E+F
Subjt:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF

Query:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF
        GTKRSQVKKYLEF MVEEKSILEQVEELN++AD+I SAG  IDEDFHVSAIISKLP SW NV+++LMHE YLP  +L DRLRIEEQLR +KNS  SRVS 
Subjt:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF

Query:  NPG--GQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK
         P   GQ  AANH SKMG+P   ++P R++E Q EVKT+LCL+CGKEGH S +CP+ K
Subjt:  NPG--GQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK

XP_004134299.1 uncharacterized protein LOC101205072 [Cucumis sativus] | E-value: 4.4e-178 | Identity: 70.31%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        M+ ETR RIEE VI+VLKKS+MED TE++VR + E+RLG+DLS+ QCK LVRNVVE FLL   E    GKE EPGP+VRYENKA EQ +VPK+E NDDG 
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR
         +ICRLS NR V +H+FKG  +VS+RQYYEKDGKQ+P  KGIS+ T+QWS F+S+IPAI EAILQMKR KRSE DA+KIG  SNP T VT PK+P ETIR
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR

Query:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF
        FDGKNY+ WA QME LLQ LKI YVLS++CPTAVLG ESSSGNAA+SK AE++WM DDHMCR  I+NSLSD LF++Y+KKTMSA ELWKEL LLYL+E+F
Subjt:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF

Query:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF
        GTKRSQVKKYLEF MVEEKSILEQVEELN++AD+I S+G  IDEDFHVSAIISKLP SW NV+V LMHE+YLP  +L DRLRIEEQLR +KNS  S VS 
Subjt:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF

Query:  N--PGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK
        +  P GQ  AANH SKMG+PK  ++P R++E Q EVKT+LCL+CGKEGH S +CP+ K
Subjt:  N--PGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK

XP_008437880.1 PREDICTED: uncharacterized protein LOC103483179 [Cucumis melo] | E-value: 1.2e-175 | Identity: 69.43%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        M+ ETR +IEE VI+VLK+SN+ED TE++VR + E+R+G+DLS+ QCK LVRNVVE FLL   E    GKE EPGP+VRYEN+A EQ ++PK+E NDDG 
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR
         +ICRLS NR V +H+FKG  +VSIRQYY KDGKQ+P  KGIS+ T+QWS F+S+IPAI EAILQMKR KRSE DADKIG +SNP T VT PKFP ETIR
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR

Query:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF
        FDGKNYH WA QME LLQ LKI YVLS++CPTAVLG ESSSGNAA+SKVAE++WMSDDHMC   I+NSLSD LF++Y+KK MSA ELWKEL LLY +E+F
Subjt:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF

Query:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF
        GTKRSQVKKYLEF MVEEKSILEQVEELN++AD+I SAG  IDEDFHVSAIISKLP SW NV+++LM E YLP  +L DRLRIEEQLR +KNS  SRVS 
Subjt:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF

Query:  NPG--GQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK
         P   GQ  AANH SKMG+P   ++P R++E Q EVKT+LCL+CGKEGH S +CP+ K
Subjt:  NPG--GQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK

XP_022945450.1 uncharacterized protein LOC111449676 [Cucurbita moschata] | E-value: 2.2e-190 | Identity: 76.37%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        MD+ETR RI+ETVID+LK SNME+MTEY++R EAEKRLGMDLSD QCK LVR+VVE FL    E +DKGKEGEPGP+ RYENKA EQ +V K+EIN D  
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK-KRSEPDADKIGGVSNPATGVTPPKFPNETI
        RVIC+LS NR V VHEFKGN LVSIRQYYEKDGKQ+PG KGISLTT+QWSAFRS+IPAIEEAILQMKRK KRSE DA+  G VS PATG + PKFP+ETI
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK-KRSEPDADKIGGVSNPATGVTPPKFPNETI

Query:  RFDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQ
        RFDGKNY VWARQMEFLL+ LKI YVLSD  PT++LGPESSSGN +RSK +E+EWMSDDHMCRHII+NSLSD+LFH+YTK+TMSA+ELWKELN LYL + 
Subjt:  RFDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQ

Query:  FGTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVS
        +GT+RSQVKKYLEF MVEEKSILEQVEELNN+A++IISAGMRIDEDFHVSAIISKLPPSW NVFV LM EE+LP V LIDRLR EE+LR ++NSH S   
Subjt:  FGTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVS

Query:  FNPGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK
           GG+RP  NH  KMG+  SQSLPSR+RE +M+VKT+LCLNCGKEGHISRDCPSSK
Subjt:  FNPGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK

XP_023539029.1 uncharacterized protein LOC111799782 [Cucurbita pepo subsp. pepo] | E-value: 2.5e-189 | Identity: 76.32%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        MD+ETR RIEETVID+LK SNME+MTEY++R EAEK+LGMDLSD QCK LVRNVVE FL    E +DKGKEGEPGP+ RYENKA EQ +V K+EIN D  
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK-KRSEPDADKIGGVSNPATGVTPPKFPNETI
        RVIC+LS NR V VHEFKGN LVSIRQYYEKDGKQ+PG KGISLTT+QWSAFRS+IPAIEEAILQMKRK +RSE DA+  G  S PATG + PKFP+ETI
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK-KRSEPDADKIGGVSNPATGVTPPKFPNETI

Query:  RFDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQ
        RFDGKNY VWARQMEFLL+ LKI YVLSD  PTA+LGPESSSGN +RSK +E+EWMSDDHMCRHII+NSLSD+LFH+YTK+TMSA+ELWKELN LYL + 
Subjt:  RFDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQ

Query:  FGTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVS
        +GT+RSQVKKYLEF MVEEKSILEQVEELNN+A++IISAGMRIDEDFHVSAIISKLPPSW NVFV LM EE+LP V LIDRLR EE+LR ++NSH S   
Subjt:  FGTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVS

Query:  FNPGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSS
           GG+RP  NH  KMG+  SQSLPSR+RE +M+VKT+LCLNCGKEGHISRDCPSS
Subjt:  FNPGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSS

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0L3U5 CCHC-type domain-containing protein | E-value: 2.1e-178 | Identity: 70.31%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        M+ ETR RIEE VI+VLKKS+MED TE++VR + E+RLG+DLS+ QCK LVRNVVE FLL   E    GKE EPGP+VRYENKA EQ +VPK+E NDDG 
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR
         +ICRLS NR V +H+FKG  +VS+RQYYEKDGKQ+P  KGIS+ T+QWS F+S+IPAI EAILQMKR KRSE DA+KIG  SNP T VT PK+P ETIR
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR

Query:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF
        FDGKNY+ WA QME LLQ LKI YVLS++CPTAVLG ESSSGNAA+SK AE++WM DDHMCR  I+NSLSD LF++Y+KKTMSA ELWKEL LLYL+E+F
Subjt:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF

Query:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF
        GTKRSQVKKYLEF MVEEKSILEQVEELN++AD+I S+G  IDEDFHVSAIISKLP SW NV+V LMHE+YLP  +L DRLRIEEQLR +KNS  S VS 
Subjt:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF

Query:  N--PGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK
        +  P GQ  AANH SKMG+PK  ++P R++E Q EVKT+LCL+CGKEGH S +CP+ K
Subjt:  N--PGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK

A0A1S3AV18 uncharacterized protein LOC103483179 | E-value: 5.8e-176 | Identity: 69.43%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        M+ ETR +IEE VI+VLK+SN+ED TE++VR + E+R+G+DLS+ QCK LVRNVVE FLL   E    GKE EPGP+VRYEN+A EQ ++PK+E NDDG 
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR
         +ICRLS NR V +H+FKG  +VSIRQYY KDGKQ+P  KGIS+ T+QWS F+S+IPAI EAILQMKR KRSE DADKIG +SNP T VT PKFP ETIR
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR

Query:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF
        FDGKNYH WA QME LLQ LKI YVLS++CPTAVLG ESSSGNAA+SKVAE++WMSDDHMC   I+NSLSD LF++Y+KK MSA ELWKEL LLY +E+F
Subjt:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF

Query:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF
        GTKRSQVKKYLEF MVEEKSILEQVEELN++AD+I SAG  IDEDFHVSAIISKLP SW NV+++LM E YLP  +L DRLRIEEQLR +KNS  SRVS 
Subjt:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF

Query:  NPG--GQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK
         P   GQ  AANH SKMG+P   ++P R++E Q EVKT+LCL+CGKEGH S +CP+ K
Subjt:  NPG--GQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK

A0A5A7TZ44 Zinc knuckle family protein, putative isoform 2 | E-value: 5.8e-176 | Identity: 69.43%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        M+ ETR +IEE VI+VLK+SN+ED TE++VR + E+R+G+DLS+ QCK LVRNVVE FLL   E    GKE EPGP+VRYEN+A EQ ++PK+E NDDG 
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR
         +ICRLS NR V +H+FKG  +VSIRQYY KDGKQ+P  KGIS+ T+QWS F+S+IPAI EAILQMKR KRSE DADKIG +SNP T VT PKFP ETIR
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR

Query:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF
        FDGKNYH WA QME LLQ LKI YVLS++CPTAVLG ESSSGNAA+SKVAE++WMSDDHMC   I+NSLSD LF++Y+KK MSA ELWKEL LLY +E+F
Subjt:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF

Query:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF
        GTKRSQVKKYLEF MVEEKSILEQVEELN++AD+I SAG  IDEDFHVSAIISKLP SW NV+++LM E YLP  +L DRLRIEEQLR +KNS  SRVS 
Subjt:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF

Query:  NPG--GQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK
         P   GQ  AANH SKMG+P   ++P R++E Q EVKT+LCL+CGKEGH S +CP+ K
Subjt:  NPG--GQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK

A0A5D3DBA1 Zinc knuckle family protein, putative isoform 2 | E-value: 6.8e-177 | Identity: 69.65%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        M+ ETR +IEE VI+VLK+SN+ED TE++VR + E+R+G+DLS+ QCK LVRNVVE FLL   E    GKE EPGP+VRYEN+A EQ ++PK+E NDDG 
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR
         +ICRLS NR V +H+FKG  +VSIRQYY KDGKQ+P  KGIS+ T+QWS F+S+IPAI EAILQMKR KRSE DADKIG +SNP T VT PKFP ETIR
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIR

Query:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF
        FDGKNYH WA QME LLQ LKI YVLS++CPTAVLG ESSSGNAA+SKVAE++WMSDDHMC   I+NSLSD LF++Y+KK MSA ELWKEL LLY +E+F
Subjt:  FDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQF

Query:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF
        GTKRSQVKKYLEF MVEEKSILEQVEELN++AD+I SAG  IDEDFHVSAIISKLP SW NV+++LMHE YLP  +L DRLRIEEQLR +KNS  SRVS 
Subjt:  GTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSF

Query:  NPG--GQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK
         P   GQ  AANH SKMG+P   ++P R++E Q EVKT+LCL+CGKEGH S +CP+ K
Subjt:  NPG--GQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK

A0A6J1G0Z2 uncharacterized protein LOC111449676 | E-value: 1.1e-190 | Identity: 76.37%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        MD+ETR RI+ETVID+LK SNME+MTEY++R EAEKRLGMDLSD QCK LVR+VVE FL    E +DKGKEGEPGP+ RYENKA EQ +V K+EIN D  
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK-KRSEPDADKIGGVSNPATGVTPPKFPNETI
        RVIC+LS NR V VHEFKGN LVSIRQYYEKDGKQ+PG KGISLTT+QWSAFRS+IPAIEEAILQMKRK KRSE DA+  G VS PATG + PKFP+ETI
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK-KRSEPDADKIGGVSNPATGVTPPKFPNETI

Query:  RFDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQ
        RFDGKNY VWARQMEFLL+ LKI YVLSD  PT++LGPESSSGN +RSK +E+EWMSDDHMCRHII+NSLSD+LFH+YTK+TMSA+ELWKELN LYL + 
Subjt:  RFDGKNYHVWARQMEFLLQHLKIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQ

Query:  FGTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVS
        +GT+RSQVKKYLEF MVEEKSILEQVEELNN+A++IISAGMRIDEDFHVSAIISKLPPSW NVFV LM EE+LP V LIDRLR EE+LR ++NSH S   
Subjt:  FGTKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVS

Query:  FNPGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK
           GG+RP  NH  KMG+  SQSLPSR+RE +M+VKT+LCLNCGKEGHISRDCPSSK
Subjt:  FNPGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCLNCGKEGHISRDCPSSK

SwissProt top hits (E-value, % identity, alignment)
O65154 RNA polymerase II transcriptional coactivator KIWI | E-value: 2.5e-11 | Identity: 40.51%
Query:  AEQNVVPKREINDDGGRVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAI
        A++   P  + +     V+C +SKNR V+V  + G   + IR++Y KDGK +PG KGISL+ DQW+  R+    IE+A+
Subjt:  AEQNVVPKREINDDGGRVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAI

O65155 RNA polymerase II transcriptional coactivator KELP | E-value: 1.3e-34 | Identity: 44.38%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        M+ ET+ +IE+TVI++L +S+M+++TE++VR  A ++L +DLS+   K  VR+VVE FL      E++ +E E     + E    +      +E +DDG 
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK
         +ICRLS  R V + EFKG +LVSIR+YY+KDGK++P +KGISLT +QWS F+ ++PAIE A+ +M+ +
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK

P53999 Activated RNA polymerase II transcriptional coactivator p15 | E-value: 6.5e-07 | Identity: 39.68%
Query:  RLSKNRGVAVHEFKGNTLVSIRQYY-EKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQM
        ++ K R V+V +FKG  L+ IR+Y+ + +G+  PG KGISL  +QWS  +  I  I++A+ ++
Subjt:  RLSKNRGVAVHEFKGNTLVSIRQYY-EKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQM

Q4R947 Activated RNA polymerase II transcriptional coactivator p15 | E-value: 6.5e-07 | Identity: 39.68%
Query:  RLSKNRGVAVHEFKGNTLVSIRQYY-EKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQM
        ++ K R V+V +FKG  L+ IR+Y+ + +G+  PG KGISL  +QWS  +  I  I++A+ ++
Subjt:  RLSKNRGVAVHEFKGNTLVSIRQYY-EKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQM

Q9VLR5 RNA polymerase II transcriptional coactivator | E-value: 3.8e-07 | Identity: 32.35%
Query:  EDKGKEGEPGPTVRYE--NKAAEQNVVPKREINDDG--GRVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEE
        +D   + + GP  R +  +K A+++  P  +  D G  G     L   R V ++EF+G   V IR++Y+K G+ +PG KGISL+  QW         +  
Subjt:  EDKGKEGEPGPTVRYE--NKAAEQNVVPKREINDDG--GRVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEE

Query:  AI
        AI
Subjt:  AI

Arabidopsis top hits (E-value, % identity, alignment)
AT4G00980.1 zinc knuckle (CCHC-type) family protein | E-value: 1.0e-79 | Identity: 39.3%
Query:  RIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQN---VVPKREINDDGGRVIC
        +IEETV  +L +S+M+ MTE+++R++A  +LG+DLS T  K LVR+V+E FLL             PG  +  E  A  +N    V    +  +  R IC
Subjt:  RIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQN---VVPKREINDDGGRVIC

Query:  RLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETI-RFDG
        +LS+ +   V  ++G   +SI    ++ GK   GA    L+T+QWS  + +  AIE+ I Q + K +SE  A + G  S      +   F    I RFDG
Subjt:  RLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETI-RFDG

Query:  KNYHVWARQMEFLLQHLKIFYVLSDRCPT--AVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQFG
        K+Y  WA QME  L+ LK+ YVLS+ CP+  +  GPE++     R+    ++W+ DD++C   +MNSLSD+L+ +Y++K   AKELW EL  +Y  ++  
Subjt:  KNYHVWARQMEFLLQHLKIFYVLSDRCPT--AVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQFG

Query:  TKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSFN
        +KRSQV+KY+EF MVEE+ ILEQV+  N +AD+I+SAGM +DE FHVS IISK PPSW      LM EEYLP   L++R++ EE+L +R  +    V++ 
Subjt:  TKRSQVKKYLEFSMVEEKSILEQVEELNNMADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSFN

Query:  PGGQRPAANHTSKMGEPK--SQSLPSRERERQMEVKTVL-CLNCGKEGHISRDCPSSK
        P         T  +G     SQS+  + +E + + + ++ C NCG++GH+++ C  SK
Subjt:  PGGQRPAANHTSKMGEPK--SQSLPSRERERQMEVKTVL-CLNCGKEGHISRDCPSSK

AT4G10920.1 transcriptional coactivator p15 (PC4) family protein (KELP) | E-value: 8.9e-36 | Identity: 44.38%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        M+ ET+ +IE+TVI++L +S+M+++TE++VR  A ++L +DLS+   K  VR+VVE FL      E++ +E E     + E    +      +E +DDG 
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK
         +ICRLS  R V + EFKG +LVSIR+YY+KDGK++P +KGISLT +QWS F+ ++PAIE A+ +M+ +
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK

AT4G10920.2 transcriptional coactivator p15 (PC4) family protein (KELP) | E-value: 8.9e-36 | Identity: 44.38%
Query:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG
        M+ ET+ +IE+TVI++L +S+M+++TE++VR  A ++L +DLS+   K  VR+VVE FL      E++ +E E     + E    +      +E +DDG 
Subjt:  MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGG

Query:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK
         +ICRLS  R V + EFKG +LVSIR+YY+KDGK++P +KGISLT +QWS F+ ++PAIE A+ +M+ +
Subjt:  RVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRK

AT5G09240.1 ssDNA-binding transcriptional regulator | E-value: 1.9e-06 | Identity: 30.56%
Query:  DKGKEGEPGPTVRYENKAAEQNVVPKR------EINDDGGRVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIP--GAKGISLTTDQWSAFRSSIPA
        + GK  +       +   +E +  PK+      EI D     IC L KNR V V    G   ++IRQ++ KDG  +P    +GISL+ +QW+  R+    
Subjt:  DKGKEGEPGPTVRYENKAAEQNVVPKR------EINDDGGRVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIP--GAKGISLTTDQWSAFRSSIPA

Query:  IEEAILQM
        I++A+ ++
Subjt:  IEEAILQM

AT5G09250.1 ssDNA-binding transcriptional regulator | E-value: 1.8e-12 | Identity: 40.51%
Query:  AEQNVVPKREINDDGGRVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAI
        A++   P  + +     V+C +SKNR V+V  + G   + IR++Y KDGK +PG KGISL+ DQW+  R+    IE+A+
Subjt:  AEQNVVPKREINDDGGRVICRLSKNRGVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAI


Sequences
CDS sequence
ATGGACAGTGAAACCCGATGGAGAATCGAGGAAACGGTGATTGACGTATTGAAGAAATCGAACATGGAAGACATGACGGAGTACAGAGTTCGAGTCGAGGCCGAAAAACG
ACTCGGAATGGATCTCTCCGATACGCAATGCAAGTGGCTGGTGAGGAACGTGGTCGAGGGCTTTTTACTTTGGTCAATGGAGCATGAGGATAAGGGCAAAGAGGGAGAAC
CGGGACCTACTGTTCGTTATGAAAATAAAGCAGCGGAGCAGAATGTAGTCCCGAAGCGGGAGATTAACGATGATGGTGGCCGTGTGATTTGCCGGCTATCCAAGAACAGG
GGTGTGGCAGTTCATGAATTTAAAGGGAACACTCTGGTATCAATTAGGCAGTATTATGAAAAAGATGGAAAACAGATTCCTGGTGCTAAAGGAATTAGCTTGACAACTGA
TCAATGGTCTGCCTTTAGGAGTAGTATTCCTGCTATTGAGGAAGCTATTTTGCAGATGAAAAGAAAAAAAAGATCTGAACCTGATGCTGATAAAATTGGTGGTGTCTCCA
ATCCTGCTACTGGGGTTACTCCTCCAAAATTTCCAAATGAAACTATTCGGTTTGATGGAAAAAACTATCATGTATGGGCACGTCAGATGGAGTTTTTGCTGCAGCACTTA
AAGATTTTTTATGTACTTTCTGATCGCTGTCCTACTGCCGTGCTTGGACCAGAATCAAGTTCTGGAAATGCTGCTCGATCCAAGGTGGCTGAACGGGAATGGATGAGTGA
TGACCACATGTGTCGCCACATCATTATGAACTCCCTCTCCGATAATCTTTTTCATCAATATACAAAGAAAACAATGAGTGCCAAAGAACTCTGGAAGGAGCTAAACTTGC
TTTATCTTATTGAGCAATTTGGCACCAAGAGATCTCAAGTTAAAAAATATCTGGAATTCAGCATGGTTGAGGAGAAGTCAATATTAGAACAAGTTGAAGAACTTAATAAC
ATGGCTGATACCATTATTTCTGCTGGAATGCGGATTGATGAGGATTTTCATGTTAGTGCCATTATTTCGAAGCTTCCACCTTCTTGGATGAATGTCTTTGTGACGTTAAT
GCATGAGGAGTATCTTCCCTATGTGGAGTTGATAGATCGATTGAGGATTGAAGAACAATTACGTATACGGAAAAACTCACATTTCTCGAGAGTGTCTTTTAATCCTGGAG
GCCAACGTCCAGCTGCAAATCACACATCAAAGATGGGAGAACCGAAGTCCCAAAGCCTACCGTCGAGGGAAAGGGAACGGCAAATGGAGGTCAAGACTGTACTCTGCTTG
AATTGTGGCAAGGAAGGGCACATATCTCGAGATTGTCCAAGTAGTAAGTAG
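
As a quick consistency check, the CDS can be translated and compared with the protein sequence listed in this record. The sketch below is illustrative only: it assumes the standard nuclear genetic code, uses just the final CDS line as input, and assumes that this line starts in frame (the matching protein tail supports this). The names `CODON_TABLE` and `translate` are hypothetical helpers, not CuGenDB tools.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Final line of the CDS sequence in this record (17 codons including the stop).
cds_tail = "AATTGTGGCAAGGAAGGGCACATATCTCGAGATTGTCCAAGTAGTAAGTAG"
protein_tail = translate(cds_tail)
print(protein_tail)
```

The translated tail ends in a stop codon and otherwise matches the last 16 residues of the protein sequence in this record (NCGKEGHISRDCPSSK).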
mRNA sequence
AACAGCCGTAAGCCGAAGCCAATGGAGTTATCTCTACAAATGACAACACTTCCAATCGGAGGCGGCCGTTCTCATTTTTCCACCGATCTGAACTGAACCAAGAAACGGCC
GGAACACTTTCCATCTCTCTCTCAACTCCTTCTTCATTTCTGAGATTGATGCCGCTGTATTTTCTTCTTTTCAAGAAACATGGACAGTGAAACCCGATGGAGAATCGAGG
AAACGGTGATTGACGTATTGAAGAAATCGAACATGGAAGACATGACGGAGTACAGAGTTCGAGTCGAGGCCGAAAAACGACTCGGAATGGATCTCTCCGATACGCAATGC
AAGTGGCTGGTGAGGAACGTGGTCGAGGGCTTTTTACTTTGGTCAATGGAGCATGAGGATAAGGGCAAAGAGGGAGAACCGGGACCTACTGTTCGTTATGAAAATAAAGC
AGCGGAGCAGAATGTAGTCCCGAAGCGGGAGATTAACGATGATGGTGGCCGTGTGATTTGCCGGCTATCCAAGAACAGGGGTGTGGCAGTTCATGAATTTAAAGGGAACA
CTCTGGTATCAATTAGGCAGTATTATGAAAAAGATGGAAAACAGATTCCTGGTGCTAAAGGAATTAGCTTGACAACTGATCAATGGTCTGCCTTTAGGAGTAGTATTCCT
GCTATTGAGGAAGCTATTTTGCAGATGAAAAGAAAAAAAAGATCTGAACCTGATGCTGATAAAATTGGTGGTGTCTCCAATCCTGCTACTGGGGTTACTCCTCCAAAATT
TCCAAATGAAACTATTCGGTTTGATGGAAAAAACTATCATGTATGGGCACGTCAGATGGAGTTTTTGCTGCAGCACTTAAAGATTTTTTATGTACTTTCTGATCGCTGTC
CTACTGCCGTGCTTGGACCAGAATCAAGTTCTGGAAATGCTGCTCGATCCAAGGTGGCTGAACGGGAATGGATGAGTGATGACCACATGTGTCGCCACATCATTATGAAC
TCCCTCTCCGATAATCTTTTTCATCAATATACAAAGAAAACAATGAGTGCCAAAGAACTCTGGAAGGAGCTAAACTTGCTTTATCTTATTGAGCAATTTGGCACCAAGAG
ATCTCAAGTTAAAAAATATCTGGAATTCAGCATGGTTGAGGAGAAGTCAATATTAGAACAAGTTGAAGAACTTAATAACATGGCTGATACCATTATTTCTGCTGGAATGC
GGATTGATGAGGATTTTCATGTTAGTGCCATTATTTCGAAGCTTCCACCTTCTTGGATGAATGTCTTTGTGACGTTAATGCATGAGGAGTATCTTCCCTATGTGGAGTTG
ATAGATCGATTGAGGATTGAAGAACAATTACGTATACGGAAAAACTCACATTTCTCGAGAGTGTCTTTTAATCCTGGAGGCCAACGTCCAGCTGCAAATCACACATCAAA
GATGGGAGAACCGAAGTCCCAAAGCCTACCGTCGAGGGAAAGGGAACGGCAAATGGAGGTCAAGACTGTACTCTGCTTGAATTGTGGCAAGGAAGGGCACATATCTCGAG
ATTGTCCAAGTAGTAAGTAGGAAAGTCGCTAATGAAGTAACGCGGGAGAGAACATAGCACAATCTTACTGAGGTAAATATGTCCGAGGATAAAAATAGTAGATTCACATT
TAGATGCCATCGCTTTTGATTCATATGTTCTTCTAAAGCATGGATACAATACTCAATTTGAAATTTCTAACGATTTGCATTGACTCAGAGTTGTCATGCGCTTAGGAGCT
TTCAAAGTGCAAAGTCAAGGCTCTAAGCGATTATGTGCATAACTAACTTATAATCTTGTTGATTTTAACTTAGCTCCTTAGGATAGTGATGTATTCTTATCCTCTTTCTT
GAAACTTATGTGAAGCTTAGCATGTTCGGAACTTTTCTGAAGGGCCCTTTTTTTATAGGAAACACTTTAGTGAAGATCATTGGATAGGTCTATTACGATTGACTCTAAGT
TGATAGCTTCCTACTGGATTGTATTATGAGCAGATGACGAATTCATAACATAAGCTCCTAGTTAATTTACACTATTCTGCTGACATGTAAATGTTGTTGGGGTTATATCA
AGATAGATGATAGAAATATTATTAGGACGTAGTAGGGGTCGTTTGGATTGAGGAGTTCTGGGGAGTAGGAGTTGTAGTGAGTAGGAGTAAAGAAGTCTGTGGGACCTACA
AGAACGAGTTAAAAATATGTGGGGCCCATAAAAAAGAGTTGGGAAAAGAGTTAATAAAGCTGTGGGACCTACAAGGAAGAGATAATAACTCCTTGGGCCAAACAAGGTGT
TGGGGGGAGTTATTAACTCCTCCCAACTCCACTCTCCCAACTCTTCAATCAAGCTGCTTAGATAAGGAGTTATTCAATAGGAGGTAGTTATAACTTATAAATAGAGGTAA
TGGAAGGGATGTAGGATGTATTTGTTTACCATGTTTATACTTGAGTTTGCTTTTGGTCGAGAG
Protein sequence
MDSETRWRIEETVIDVLKKSNMEDMTEYRVRVEAEKRLGMDLSDTQCKWLVRNVVEGFLLWSMEHEDKGKEGEPGPTVRYENKAAEQNVVPKREINDDGGRVICRLSKNR
GVAVHEFKGNTLVSIRQYYEKDGKQIPGAKGISLTTDQWSAFRSSIPAIEEAILQMKRKKRSEPDADKIGGVSNPATGVTPPKFPNETIRFDGKNYHVWARQMEFLLQHL
KIFYVLSDRCPTAVLGPESSSGNAARSKVAEREWMSDDHMCRHIIMNSLSDNLFHQYTKKTMSAKELWKELNLLYLIEQFGTKRSQVKKYLEFSMVEEKSILEQVEELNN
MADTIISAGMRIDEDFHVSAIISKLPPSWMNVFVTLMHEEYLPYVELIDRLRIEEQLRIRKNSHFSRVSFNPGGQRPAANHTSKMGEPKSQSLPSRERERQMEVKTVLCL
NCGKEGHISRDCPSSK
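
The CCHC-type zinc finger annotation (IPR001878) can be illustrated on the C-terminal end of this protein. The sketch below searches for the commonly cited zinc knuckle spacing C-X2-C-X4-H-X4-C with a regular expression. This is an assumption-laden shortcut: InterPro assigns domains with profile-based models, not with this regex, so a hit here is only suggestive. The variable names are hypothetical.

```python
import re

# Commonly cited zinc knuckle spacing: C-X2-C-X4-H-X4-C.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

# Last 28 residues of the protein sequence in this record
# (end of the fourth line joined to the fifth line).
c_terminus = "ERQMEVKTVLCL" + "NCGKEGHISRDCPSSK"

match = ZINC_KNUCKLE.search(c_terminus)
if match is not None:
    # Report the matched motif and its 0-based offset within c_terminus.
    print(match.group(), match.start())
else:
    print("no C-X2-C-X4-H-X4-C motif found")
```

In this fragment the pattern matches CLNCGKEGHISRDC, the same stretch that is largely conserved in the cucurbit and Arabidopsis alignments above.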