; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G01160 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G01160
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionCCHC-type domain-containing protein
Genome locationClcChr05:802769..807808
RNA-Seq ExpressionClc05G01160
SyntenyClc05G01160
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR003173 - Transcriptional coactivator p15 (PC4), C-terminal
IPR009044 - ssDNA-binding transcriptional regulator
IPR014876 - DEK, C-terminal
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK20834.1 Zinc knuckle family protein, putative isoform 2 [Cucumis melo var. makuwa]8.9e-20381.13Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD
        M+ ET+R+IEE VI+VLK+SN+EDTTE+KVR QVEER+GIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        EN+AVEQ+I+PKKE NDD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR
        L+ICRLSNNRSVTIHKF+G  MVSIRQ++ K GKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSE DA+KIGA+S PT  VT PKFP ETIR
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR

Query:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKI YVLS++CPTAVL  ESSSGNA QSK AEQKWMSDDHMC RNILNSLSDRLFNEY+ K MSASELWKELKLLYFLEEF
Subjt:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS
        GTKRSQVKKYLEFKMV EKSILEQVEELN+IADSI SAGT IDEDFHVSAIISKLPLSWKNVW++LMHE YLPLSKL DRLRIEEQLRTQKNSRLS VS 
Subjt:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS

Query:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDD
         PN  GQHHAANH SKMGDP   ++PLRK+E QK+VKTLLCL+CGKEGHTSPNCP++KVD+
Subjt:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDD

XP_004134299.1 uncharacterized protein LOC101205072 [Cucumis sativus]2.7e-21283.12Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD
        M+ ET+RRIEE VI+VLKKS+MEDTTE+KVR QVEERLGIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        ENKAVEQ+IVPKKE NDD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR
        L+ICRLSNNRSVTIHKF+GA MVS+RQ++EK GKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSE DAEKIGA S PTT VT PK+P ETIR
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR

Query:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKI YVLS++CPTAVL  ESSSGNA QSKAAEQKWM DDHMCRRNILNSLSDRLFNEY+ KTMSASELWKELKLLY LEEF
Subjt:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS
        GTKRSQVKKYLEFKMV EKSILEQVEELN+IADSI S+GT IDEDFHVSAIISKLPLSWKNVW+NLMHEQYLPL KL DRLRIEEQLRTQKNSRLSGVSS
Subjt:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS

Query:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDDEVTRGRT
        +P P GQHHAANH SKMGDPK  ++PLRK+E QK+VKTLLCL+CGKEGHTSPNCP++KV++EV R RT
Subjt:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDDEVTRGRT

XP_008437880.1 PREDICTED: uncharacterized protein LOC103483179 [Cucumis melo]7.5e-20280.91Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD
        M+ ET+R+IEE VI+VLK+SN+EDTTE+KVR QVEER+GIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        EN+AVEQ+I+PKKE NDD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR
        L+ICRLSNNRSVTIHKF+G  MVSIRQ++ K GKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSE DA+KIGA+S PT  VT PKFP ETIR
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR

Query:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKI YVLS++CPTAVL  ESSSGNA QSK AEQKWMSDDHMC RNILNSLSDRLFNEY+ K MSASELWKELKLLYFLEEF
Subjt:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS
        GTKRSQVKKYLEFKMV EKSILEQVEELN+IADSI SAGT IDEDFHVSAIISKLPLSWKNVW++LM E YLPLSKL DRLRIEEQLRTQKNSRLS VS 
Subjt:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS

Query:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDD
         PN  GQHHAANH SKMGDP   ++PLRK+E QK+VKTLLCL+CGKEGHTSPNCP++KVD+
Subjt:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDD

XP_022945450.1 uncharacterized protein LOC111449676 [Cucurbita moschata]2.4e-16068.19Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD
        MD ET+RRI+ETVID+LK SNME+ TEYK+R + E+RLG+DLS+ Q K LVR+VVE FL S +ER   GKE         ENKA EQEIV KKEIN DVD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRR-KRSELDAEKIGAVSEPTTVVTPPKFPNETI
         VIC+LSNNR+VT+H+F+G A+VSIRQ++EK GKQLP +KGIS++TEQWS F+SNIPAIEEAILQMKR+ KRSE DA   GAVS P T  + PKFP+ETI
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRR-KRSELDAEKIGAVSEPTTVVTPPKFPNETI

Query:  RFDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEE
        RFDGKNY  WARQMEFLL+ LKI YVLSD  PT++L PESSSGN  +SKA+EQ+WMSDDHMCR  ILNSLSD LF++Y  +TMSA ELWKEL  LY L +
Subjt:  RFDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEE

Query:  FGTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVS
        +GT+RSQVKKYLEF+MV EKSILEQVEELNNIA+SI+SAG  IDEDFHVSAIISKLP SW NV++ LM E++LP   LIDRLR EE+LRTQ+NS  S   
Subjt:  FGTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVS

Query:  SAPNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRK
             GG+    NH  KMGD    SLP RKREW+ DVKTLLCLNCGKEGH S +CPS K
Subjt:  SAPNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRK

XP_038878142.1 uncharacterized protein LOC120070296 [Benincasa hispida]1.9e-21385.81Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD
        MD+ETQ +IEETVIDVLKKSNME+TTE+KVRGQVEERLGIDLSNR+YKLLVRNVVESFLLSMSE+VCMGKED        ENK ++QEI P KE NDD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR
        LVICRLSNNRSVTIHKFRGA MVSIRQFFEK GKQLPTLKGISMSTEQWS FKSNIPAIE+AILQMK +KRSE DA+KIGAVSE T  VTPP FPNETIR
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR

Query:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF
        FDGKNY  WA QME LLQHLKI YVLS++CPT VL PESSSGN  Q+KAAEQKWMSDD MC RNILNSLSDRLFNEYATKTMSASELW ELKLLYFLEEF
Subjt:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS
        GTKRSQVKKYLEF+MV EKSILEQVE+LNNIADSIVSAGT IDEDFHVSAIISKLPLSW +VW+NLMHEQYL L+KLIDRLRIEEQLRTQKNS LSGVSS
Subjt:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS

Query:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRK
        AP+  GQHH ANHLSKM DPKLSSLPLRKREWQ DVKTLLCLNCGKEGHTSPNCPSRK
Subjt:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRK

TrEMBL top hitse value%identityAlignment
A0A0A0L3U5 CCHC-type domain-containing protein1.3e-21283.12Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD
        M+ ET+RRIEE VI+VLKKS+MEDTTE+KVR QVEERLGIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        ENKAVEQ+IVPKKE NDD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR
        L+ICRLSNNRSVTIHKF+GA MVS+RQ++EK GKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSE DAEKIGA S PTT VT PK+P ETIR
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR

Query:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKI YVLS++CPTAVL  ESSSGNA QSKAAEQKWM DDHMCRRNILNSLSDRLFNEY+ KTMSASELWKELKLLY LEEF
Subjt:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS
        GTKRSQVKKYLEFKMV EKSILEQVEELN+IADSI S+GT IDEDFHVSAIISKLPLSWKNVW+NLMHEQYLPL KL DRLRIEEQLRTQKNSRLSGVSS
Subjt:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS

Query:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDDEVTRGRT
        +P P GQHHAANH SKMGDPK  ++PLRK+E QK+VKTLLCL+CGKEGHTSPNCP++KV++EV R RT
Subjt:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDDEVTRGRT

A0A1S3AV18 uncharacterized protein LOC1034831793.6e-20280.91Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD
        M+ ET+R+IEE VI+VLK+SN+EDTTE+KVR QVEER+GIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        EN+AVEQ+I+PKKE NDD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR
        L+ICRLSNNRSVTIHKF+G  MVSIRQ++ K GKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSE DA+KIGA+S PT  VT PKFP ETIR
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR

Query:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKI YVLS++CPTAVL  ESSSGNA QSK AEQKWMSDDHMC RNILNSLSDRLFNEY+ K MSASELWKELKLLYFLEEF
Subjt:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS
        GTKRSQVKKYLEFKMV EKSILEQVEELN+IADSI SAGT IDEDFHVSAIISKLPLSWKNVW++LM E YLPLSKL DRLRIEEQLRTQKNSRLS VS 
Subjt:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS

Query:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDD
         PN  GQHHAANH SKMGDP   ++PLRK+E QK+VKTLLCL+CGKEGHTSPNCP++KVD+
Subjt:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDD

A0A5A7TZ44 Zinc knuckle family protein, putative isoform 23.6e-20280.91Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD
        M+ ET+R+IEE VI+VLK+SN+EDTTE+KVR QVEER+GIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        EN+AVEQ+I+PKKE NDD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR
        L+ICRLSNNRSVTIHKF+G  MVSIRQ++ K GKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSE DA+KIGA+S PT  VT PKFP ETIR
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR

Query:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKI YVLS++CPTAVL  ESSSGNA QSK AEQKWMSDDHMC RNILNSLSDRLFNEY+ K MSASELWKELKLLYFLEEF
Subjt:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS
        GTKRSQVKKYLEFKMV EKSILEQVEELN+IADSI SAGT IDEDFHVSAIISKLPLSWKNVW++LM E YLPLSKL DRLRIEEQLRTQKNSRLS VS 
Subjt:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS

Query:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDD
         PN  GQHHAANH SKMGDP   ++PLRK+E QK+VKTLLCL+CGKEGHTSPNCP++KVD+
Subjt:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDD

A0A5D3DBA1 Zinc knuckle family protein, putative isoform 24.3e-20381.13Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD
        M+ ET+R+IEE VI+VLK+SN+EDTTE+KVR QVEER+GIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        EN+AVEQ+I+PKKE NDD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR
        L+ICRLSNNRSVTIHKF+G  MVSIRQ++ K GKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSE DA+KIGA+S PT  VT PKFP ETIR
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIR

Query:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKI YVLS++CPTAVL  ESSSGNA QSK AEQKWMSDDHMC RNILNSLSDRLFNEY+ K MSASELWKELKLLYFLEEF
Subjt:  FDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS
        GTKRSQVKKYLEFKMV EKSILEQVEELN+IADSI SAGT IDEDFHVSAIISKLPLSWKNVW++LMHE YLPLSKL DRLRIEEQLRTQKNSRLS VS 
Subjt:  GTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSS

Query:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDD
         PN  GQHHAANH SKMGDP   ++PLRK+E QK+VKTLLCL+CGKEGHTSPNCP++KVD+
Subjt:  APNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRKVDD

A0A6J1G0Z2 uncharacterized protein LOC1114496761.2e-16068.19Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD
        MD ET+RRI+ETVID+LK SNME+ TEYK+R + E+RLG+DLS+ Q K LVR+VVE FL S +ER   GKE         ENKA EQEIV KKEIN DVD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVPKKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRR-KRSELDAEKIGAVSEPTTVVTPPKFPNETI
         VIC+LSNNR+VT+H+F+G A+VSIRQ++EK GKQLP +KGIS++TEQWS F+SNIPAIEEAILQMKR+ KRSE DA   GAVS P T  + PKFP+ETI
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRR-KRSELDAEKIGAVSEPTTVVTPPKFPNETI

Query:  RFDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEE
        RFDGKNY  WARQMEFLL+ LKI YVLSD  PT++L PESSSGN  +SKA+EQ+WMSDDHMCR  ILNSLSD LF++Y  +TMSA ELWKEL  LY L +
Subjt:  RFDGKNYIAWARQMEFLLQHLKIDYVLSDRCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEE

Query:  FGTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVS
        +GT+RSQVKKYLEF+MV EKSILEQVEELNNIA+SI+SAG  IDEDFHVSAIISKLP SW NV++ LM E++LP   LIDRLR EE+LRTQ+NS  S   
Subjt:  FGTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVS

Query:  SAPNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRK
             GG+    NH  KMGD    SLP RKREW+ DVKTLLCLNCGKEGH S +CPS K
Subjt:  SAPNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEGHTSPNCPSRK

SwissProt top hitse value%identityAlignment
O65154 RNA polymerase II transcriptional coactivator KIWI6.3e-1036.05Show/hide
Query:  EDENKAVEQEIV-PKKEINDDVDLVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAI
        E E  A  +++  P  + +   D+V+C +S NR V++  + G   + IR+F+ K GK LP  KGIS+S +QW+T +++   IE+A+
Subjt:  EDENKAVEQEIV-PKKEINDDVDLVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAI

O65155 RNA polymerase II transcriptional coactivator KELP3.3e-3549.11Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVP--------KKEINDDVD
        M++ET+ +IE+TVI++L +S+M++ TE+KVR    E+L IDLS + +K  VR+VVE FL    ER    +E EN  V +E            KE +DD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVP--------KKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRR
        L+ICRLS+ R VTI +F+G ++VSIR++++K GK+LPT KGIS++ EQWSTFK N+PAIE A+ +M+ R
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRR

P87294 Putative RNA polymerase II transcriptional coactivator3.0e-0431.08Show/hide
Query:  PKKEINDDVDL-VICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAI
        PK E   D +L      +  + +T+ +FRG   V IR+++EK G  LP  KGI+++  +W   K  I  +++++
Subjt:  PKKEINDDVDL-VICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAI

Q94045 Putative RNA polymerase II transcriptional coactivator1.0e-0433.33Show/hide
Query:  SERVCMGKEDENKAVEQEIVPKKEINDDVDLVICRLSNNRSVTIHKFRGAAMVSIRQFF--EKGGKQLPTLKGISMSTEQWSTFKSNIPAIEE
        SE V   K++  KA  +E V  + + D     +  + N R  T+ KF+G   V+IR+++      K +P+ KGIS+S  QW+  K  IP I++
Subjt:  SERVCMGKEDENKAVEQEIVPKKEINDDVDLVICRLSNNRSVTIHKFRGAAMVSIRQFF--EKGGKQLPTLKGISMSTEQWSTFKSNIPAIEE

Q9VLR5 RNA polymerase II transcriptional coactivator1.0e-0443.1Show/hide
Query:  LSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAI
        L   R V I++FRG   V IR+F++KGG+ LP  KGIS+S  QW         +  AI
Subjt:  LSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAI

Arabidopsis top hitse value%identityAlignment
AT4G00980.1 zinc knuckle (CCHC-type) family protein8.0e-7738.04Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVPKKEINDDVDLVICRLSN
        M+    ++IEETV  +L +S+M+  TE+K+R     +LGIDLS   +K LVR+V+E FLLS      + +       E   V    +  + +  IC+LS 
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVPKKEINDDVDLVICRLSN

Query:  NRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETI-RFDGKNYI
         ++ T+ ++RG   +SI    ++ GK     +G  +ST QWS  K N  AIE+ I Q + + +SE  A + G  SE     +   F    I RFDGK+Y+
Subjt:  NRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETI-RFDGKNYI

Query:  AWARQMEFLLQHLKIDYVLSDRCPT--AVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEFGTKRS
         WA QME  L+ LK+ YVLS+ CP+  +   PE++     ++ A  +KW+ DD++C  +++NSLSD L+  Y+ K   A ELW ELK +Y  +E  +KRS
Subjt:  AWARQMEFLLQHLKIDYVLSDRCPT--AVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEFGTKRS

Query:  QVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSSAPNPG
        QV+KY+EF+MV E+ ILEQV+  N IADSIVSAG  +DE FHVS IISK P SW+     LM E+YLP+  L++R++ EE+L     +   GV+  P  G
Subjt:  QVKKYLEFKMVAEKSILEQVEELNNIADSIVSAGTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSSAPNPG

Query:  GQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLL-CLNCGKEGHTSPNCPSRKVDDEVT
                          S+  +++E ++D + ++ C NCG++GH + +C   K D+  +
Subjt:  GQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLL-CLNCGKEGHTSPNCPSRKVDDEVT

AT4G10920.1 transcriptional coactivator p15 (PC4) family protein (KELP)2.4e-3649.11Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVP--------KKEINDDVD
        M++ET+ +IE+TVI++L +S+M++ TE+KVR    E+L IDLS + +K  VR+VVE FL    ER    +E EN  V +E            KE +DD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVP--------KKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRR
        L+ICRLS+ R VTI +F+G ++VSIR++++K GK+LPT KGIS++ EQWSTFK N+PAIE A+ +M+ R
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRR

AT4G10920.2 transcriptional coactivator p15 (PC4) family protein (KELP)2.4e-3649.11Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVP--------KKEINDDVD
        M++ET+ +IE+TVI++L +S+M++ TE+KVR    E+L IDLS + +K  VR+VVE FL    ER    +E EN  V +E            KE +DD D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVP--------KKEINDDVD

Query:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRR
        L+ICRLS+ R VTI +F+G ++VSIR++++K GK+LPT KGIS++ EQWSTFK N+PAIE A+ +M+ R
Subjt:  LVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRR

AT5G09240.1 ssDNA-binding transcriptional regulator3.9e-0734.78Show/hide
Query:  DENKAVEQEIVPKKEIN--DDV-DLVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLP--TLKGISMSTEQWSTFKSNIPAIEEAILQM
        D+    E    PKK     D++ D+ IC L  NR V +    G   ++IRQFF K G  LP  + +GIS+S EQW+  +++   I++A+ ++
Subjt:  DENKAVEQEIVPKKEIN--DDV-DLVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLP--TLKGISMSTEQWSTFKSNIPAIEEAILQM

AT5G09250.1 ssDNA-binding transcriptional regulator4.5e-1136.05Show/hide
Query:  EDENKAVEQEIV-PKKEINDDVDLVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAI
        E E  A  +++  P  + +   D+V+C +S NR V++  + G   + IR+F+ K GK LP  KGIS+S +QW+T +++   IE+A+
Subjt:  EDENKAVEQEIV-PKKEINDDVDLVICRLSNNRSVTIHKFRGAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCAGGAGACCCAACGGAGAATCGAGGAAACCGTGATTGACGTATTGAAGAAATCGAACATGGAAGACACAACGGAGTATAAAGTTCGAGGCCAGGTCGAAGAGCG
GCTCGGAATTGATCTCTCAAATAGACAATACAAGTTGCTGGTGAGAAACGTGGTCGAGAGCTTTTTACTTTCAATGTCGGAGCGGGTGTGTATGGGGAAAGAGGATGAGA
ATAAAGCGGTGGAGCAGGAGATAGTCCCGAAGAAGGAGATTAACGATGATGTCGACCTTGTGATTTGCCGGCTATCTAATAACAGGAGTGTGACAATTCATAAATTTAGA
GGGGCAGCTATGGTATCAATTAGGCAGTTTTTTGAAAAAGGTGGAAAACAGCTACCTACTCTTAAAGGAATCAGCATGTCAACTGAACAATGGTCAACCTTTAAGAGTAA
TATTCCTGCTATAGAGGAAGCTATTTTGCAGATGAAAAGACGAAAAAGATCTGAACTTGATGCTGAAAAAATTGGTGCTGTCTCGGAACCAACGACTGTGGTAACTCCTC
CAAAATTTCCAAATGAAACTATTCGATTTGATGGAAAAAACTACATTGCATGGGCACGTCAGATGGAGTTTTTGCTGCAGCACTTAAAGATTGATTATGTATTATCTGAT
CGATGTCCTACTGCGGTGCTTGAGCCAGAATCAAGTTCTGGAAATGCTGATCAATCCAAGGCTGCTGAACAAAAATGGATGAGTGATGACCACATGTGTCGCCGCAACAT
TCTGAACTCCCTCTCCGATAGGCTTTTTAATGAATACGCAACCAAAACAATGAGTGCTAGTGAACTTTGGAAGGAGCTAAAATTACTATACTTTTTGGAGGAGTTTGGCA
CTAAGAGGTCTCAAGTCAAAAAGTATCTGGAATTCAAGATGGTTGCGGAAAAGTCAATATTAGAACAAGTTGAAGAACTTAATAACATTGCTGACTCCATTGTTTCTGCT
GGAACGTCGATTGATGAGGATTTTCATGTTAGTGCCATTATTTCAAAGCTTCCACTTTCTTGGAAGAATGTCTGGATTAACTTAATGCATGAGCAGTATCTTCCCCTTTC
AAAGTTGATAGATCGGTTGAGGATTGAAGAACAATTACGTACACAAAAAAACTCACGTCTCTCAGGAGTGTCTTCTGCTCCTAATCCAGGAGGCCAACATCATGCTGCGA
ATCACTTGTCAAAGATGGGAGATCCGAAGCTCTCAAGTCTACCACTGAGGAAAAGGGAATGGCAAAAGGATGTCAAAACTTTACTCTGCTTGAATTGTGGCAAGGAAGGG
CACACATCTCCGAATTGCCCGAGTAGGAAAGTCGATGATGAAGTAACTCGGGGAAGAACATAG
mRNA sequenceShow/hide mRNA sequence
CCCGAATCTCATTTGTTGTATGTAAACAAATAATGGTAATGGATAAACCTTTGCAATAAATTTGTATGATTTAACGCTACAAGTATCAACAAATGGTTCCATGGTCTAGT
GGTCAGGACATTGGACTCTGAATCCAGTAACCCGAGTTCAAATCTCGGTGGAACCTTCCTTTTTCTATTTTCCCTTTTTTTTTTTTGGCCCCTTCAGAATTTCTTCAACC
GCCGTGCAAAGCCGAAGCCAATGGAATTCCTTCTACAAATTACAATACTTCCAATCGGAAACGGCCGGCGTTTCCCTTTCAACTCCTTATTTTCTGGTATTTTCCATCTT
GTTTGCAAGAAAGATGGACCAGGAGACCCAACGGAGAATCGAGGAAACCGTGATTGACGTATTGAAGAAATCGAACATGGAAGACACAACGGAGTATAAAGTTCGAGGCC
AGGTCGAAGAGCGGCTCGGAATTGATCTCTCAAATAGACAATACAAGTTGCTGGTGAGAAACGTGGTCGAGAGCTTTTTACTTTCAATGTCGGAGCGGGTGTGTATGGGG
AAAGAGGATGAGAATAAAGCGGTGGAGCAGGAGATAGTCCCGAAGAAGGAGATTAACGATGATGTCGACCTTGTGATTTGCCGGCTATCTAATAACAGGAGTGTGACAAT
TCATAAATTTAGAGGGGCAGCTATGGTATCAATTAGGCAGTTTTTTGAAAAAGGTGGAAAACAGCTACCTACTCTTAAAGGAATCAGCATGTCAACTGAACAATGGTCAA
CCTTTAAGAGTAATATTCCTGCTATAGAGGAAGCTATTTTGCAGATGAAAAGACGAAAAAGATCTGAACTTGATGCTGAAAAAATTGGTGCTGTCTCGGAACCAACGACT
GTGGTAACTCCTCCAAAATTTCCAAATGAAACTATTCGATTTGATGGAAAAAACTACATTGCATGGGCACGTCAGATGGAGTTTTTGCTGCAGCACTTAAAGATTGATTA
TGTATTATCTGATCGATGTCCTACTGCGGTGCTTGAGCCAGAATCAAGTTCTGGAAATGCTGATCAATCCAAGGCTGCTGAACAAAAATGGATGAGTGATGACCACATGT
GTCGCCGCAACATTCTGAACTCCCTCTCCGATAGGCTTTTTAATGAATACGCAACCAAAACAATGAGTGCTAGTGAACTTTGGAAGGAGCTAAAATTACTATACTTTTTG
GAGGAGTTTGGCACTAAGAGGTCTCAAGTCAAAAAGTATCTGGAATTCAAGATGGTTGCGGAAAAGTCAATATTAGAACAAGTTGAAGAACTTAATAACATTGCTGACTC
CATTGTTTCTGCTGGAACGTCGATTGATGAGGATTTTCATGTTAGTGCCATTATTTCAAAGCTTCCACTTTCTTGGAAGAATGTCTGGATTAACTTAATGCATGAGCAGT
ATCTTCCCCTTTCAAAGTTGATAGATCGGTTGAGGATTGAAGAACAATTACGTACACAAAAAAACTCACGTCTCTCAGGAGTGTCTTCTGCTCCTAATCCAGGAGGCCAA
CATCATGCTGCGAATCACTTGTCAAAGATGGGAGATCCGAAGCTCTCAAGTCTACCACTGAGGAAAAGGGAATGGCAAAAGGATGTCAAAACTTTACTCTGCTTGAATTG
TGGCAAGGAAGGGCACACATCTCCGAATTGCCCGAGTAGGAAAGTCGATGATGAAGTAACTCGGGGAAGAACATAGTAGTTTCTTACCGAGGTAAATATTTCTGACTTCT
GAGGATGAAAATAGTGGATTCACATTTAGATGTCACACTTATTGATTCATATGTTCTTGCAAAGCATGGAACACAATAGTCAATTTGAAATTTTGAACCATTTTCAAGAA
CTCTGAAACTATCATGCGCTTAGGAGCCTTCAAAGAGCAGAGTCAAGGCTCTACGCCATGTATGTGCATAACTGATATTCTTGCTGATTTTAACTCAGTTACCTTAGAAT
AGTTATATGTTCTCATTCTTTTTCTTGAAGCTTAAGTGAATTGCAATATGTTCGGCACCTTTGTGTAGGATTGTAATTAACTCGATAGCTTCCTGTTATATAGTATTATG
GGGATATGATAAGCTCATAGTTAGATATACGAGGAACTTGTTAATTTACACCATTCTGATGTGCATGTCAATGTGGTTAGTAGGATTATTATCAGGATAAAATACTTTTA
GGATTTATTAGTCATTAGCTAAGCTCAGGAGTGTTTCAATAAGAGGTAGTTATGAATAGTGGGGGTGTAGTAAAAAAATTATAAAATGAGGGAGTGAGAGTGGGAGGGAT
GAAAGATGTATTTGTTTAGATGTTTGACTTTAGTTTGTTTGAGGGGAGGTTGTGTTCATAGTACTTCAAATTCGCAAGTTGGAATAGTCACCAAGGCCTAAGTTTTAGAA
CTCGACTTGAATATGGTATTTTGTGCCTAAATTTACTTGTACATTTCTTTTTTGTCGTAACCAAATTCCATATTAAAAGTTCAGATTCCCTTTT
Protein sequenceShow/hide protein sequence
MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVPKKEINDDVDLVICRLSNNRSVTIHKFR
GAAMVSIRQFFEKGGKQLPTLKGISMSTEQWSTFKSNIPAIEEAILQMKRRKRSELDAEKIGAVSEPTTVVTPPKFPNETIRFDGKNYIAWARQMEFLLQHLKIDYVLSD
RCPTAVLEPESSSGNADQSKAAEQKWMSDDHMCRRNILNSLSDRLFNEYATKTMSASELWKELKLLYFLEEFGTKRSQVKKYLEFKMVAEKSILEQVEELNNIADSIVSA
GTSIDEDFHVSAIISKLPLSWKNVWINLMHEQYLPLSKLIDRLRIEEQLRTQKNSRLSGVSSAPNPGGQHHAANHLSKMGDPKLSSLPLRKREWQKDVKTLLCLNCGKEG
HTSPNCPSRKVDDEVTRGRT