; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011181 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011181
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptiontranscription repressor KAN1-like isoform X1
Genome locationChr01:3172811..3180078
RNA-Seq ExpressionHG10011181
SyntenyHG10011181
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0010158 - abaxial cell fate specification (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR044847 - Transcription repressor KANADI


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140729.1 transcription repressor KAN1 isoform X1 [Cucumis sativus]8.2e-17288.35Show/hide
Query:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHH
        +GTNNNNN  N+ F GATLHPPP LPHLRGLS FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNY NTN N +GGGGV +MGYHH
Subjt:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHH

Query:  HPGGGVGGNNNNNNCSARFNNGVVSVEAI-KCLNNNINSNNSNAA-----SSSSDVC-SHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEH
          GGGVGG  NNNNCSARFNNGVV VEAI KCLNNN NSN++NAA     SSSSDVC SHGMMMMRSRFLQKLP KRSMRAPRMRWTTSLHARFVHAVEH
Subjt:  HPGGGVGGNNNNNNCSARFNNGVVSVEAI-KCLNNNINSNNSNAA-----SSSSDVC-SHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEH

Query:  LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLW
        LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT M TTRGDH GSKQFSDQRA PDRSGQ  PD EFGCSTLW
Subjt:  LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLW

Query:  SNSSRDVWPQTNSNEMD--VIRPTLSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKED
        SNSSRDVWPQTNSNEMD  V  PTLSTQQK+MHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVK+D
Subjt:  SNSSRDVWPQTNSNEMD--VIRPTLSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKED

XP_008456199.1 PREDICTED: transcription repressor KAN1-like isoform X1 [Cucumis melo]2.0e-15488.39Show/hide
Query:  FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGNNNNNNCSARFNNGVVSVEAI-KCL
        FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNY NTN N SGGGGV +MGY+H  GGGVGG  NNNNCSARFNNGVV  EAI KCL
Subjt:  FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGNNNNNNCSARFNNGVVSVEAI-KCL

Query:  NN--NINSNNSNAA----SSSSDVC-SHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQM
        NN  N NSNN+NAA    SSSSDVC SHGMMMMRSRFLQKLP KRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQM
Subjt:  NN--NINSNNSNAA----SSSSDVC-SHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQM

Query:  YRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTNSNEMD-VIRPTLSTQQKSMH
        YRTVKTTDKPAASSGQSDGSGEDDVSPT M TTRGDH GSKQF DQRA PDRS QP  D EFGCSTLWSNSSRDVWPQTNSNEMD V  PTLSTQ+K MH
Subjt:  YRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTNSNEMD-VIRPTLSTQQKSMH

Query:  QIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKED
        QIQECDSGA+KRYNSECKKPSLEFRLGRAEWDVK+D
Subjt:  QIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKED

XP_011651216.1 transcription repressor KAN1 isoform X2 [Cucumis sativus]2.8e-15187.5Show/hide
Query:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHH
        +GTNNNNN  N+ F GATLHPPP LPHLRGLS FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNY NTN N +GGGGV +MGYHH
Subjt:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHH

Query:  HPGGGVGGNNNNNNCSARFNNGVVSVEAI-KCLNNNINSNNSNAA-----SSSSDVC-SHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEH
          GGGVGG  NNNNCSARFNNGVV VEAI KCLNNN NSN++NAA     SSSSDVC SHGMMMMRSRFLQKLP KRSMRAPRMRWTTSLHARFVHAVEH
Subjt:  HPGGGVGGNNNNNNCSARFNNGVVSVEAI-KCLNNNINSNNSNAA-----SSSSDVC-SHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEH

Query:  LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLW
        LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT M TTRGDH GSKQFSDQRA PDRSGQ  PD EFGCSTLW
Subjt:  LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLW

Query:  SNSSRDVWPQTNSNEMD--VIRPTLSTQQKSMHQIQ
        SNSSRDVWPQTNSNEMD  V  PTLSTQQK+MHQIQ
Subjt:  SNSSRDVWPQTNSNEMD--VIRPTLSTQQKSMHQIQ

XP_011651217.1 transcription repressor KAN1 isoform X3 [Cucumis sativus]2.3e-15079.83Show/hide
Query:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHH
        +GTNNNNN  N+ F GATLHPPP LPHLRGLS FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNY NTN N +GGGGV +MGYHH
Subjt:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHH

Query:  HPGGGVGGNNNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERA
          GGGVGG  NNNNCSARFNNGV                                         KLP KRSMRAPRMRWTTSLHARFVHAVEHLGGHERA
Subjt:  HPGGGVGGNNNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERA

Query:  TPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDV
        TPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT M TTRGDH GSKQFSDQRA PDRSGQ  PD EFGCSTLWSNSSRDV
Subjt:  TPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDV

Query:  WPQTNSNEMD--VIRPTLSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKED
        WPQTNSNEMD  V  PTLSTQQK+MHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVK+D
Subjt:  WPQTNSNEMD--VIRPTLSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKED

XP_038891138.1 uncharacterized protein LOC120080522 isoform X1 [Benincasa hispida]5.7e-15789.7Show/hide
Query:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSD-GLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYH
        +GTNN NN  NNSF GATLHPP PLPH+RGLS FDVS+D GLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNY NT+ NG   GGVVEMGYH
Subjt:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSD-GLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYH

Query:  HH--PGGGVGGNNNNNNCSARFNNGVVSVEAIKCL-NNNINSNN-SNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLG
        HH   GG VGGNN NNNCSARFNNGVVSVEAIKCL NNN NSNN SNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLG
Subjt:  HH--PGGGVGGNNNNNNCSARFNNGVVSVEAIKCL-NNNINSNN-SNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLG

Query:  GHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSS
        GHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTM TTR DHG KQFSDQRA  DRS QP PD EFGCSTLWSNSS
Subjt:  GHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSS

Query:  RDVWPQTNSNEMDVIRPTLSTQQKSMHQIQ
        RDVWP TNSNEMDV RPTLSTQQKSMHQIQ
Subjt:  RDVWPQTNSNEMDVIRPTLSTQQKSMHQIQ

TrEMBL top hitse value%identityAlignment
A0A0A0LBR3 SANT domain-containing protein4.0e-17288.35Show/hide
Query:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHH
        +GTNNNNN  N+ F GATLHPPP LPHLRGLS FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNY NTN N +GGGGV +MGYHH
Subjt:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHH

Query:  HPGGGVGGNNNNNNCSARFNNGVVSVEAI-KCLNNNINSNNSNAA-----SSSSDVC-SHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEH
          GGGVGG  NNNNCSARFNNGVV VEAI KCLNNN NSN++NAA     SSSSDVC SHGMMMMRSRFLQKLP KRSMRAPRMRWTTSLHARFVHAVEH
Subjt:  HPGGGVGGNNNNNNCSARFNNGVVSVEAI-KCLNNNINSNNSNAA-----SSSSDVC-SHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEH

Query:  LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLW
        LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT M TTRGDH GSKQFSDQRA PDRSGQ  PD EFGCSTLW
Subjt:  LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLW

Query:  SNSSRDVWPQTNSNEMD--VIRPTLSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKED
        SNSSRDVWPQTNSNEMD  V  PTLSTQQK+MHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVK+D
Subjt:  SNSSRDVWPQTNSNEMD--VIRPTLSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKED

A0A1S3C2S7 transcription repressor KAN1-like isoform X19.9e-15588.39Show/hide
Query:  FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGNNNNNNCSARFNNGVVSVEAI-KCL
        FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNY NTN N SGGGGV +MGY+H  GGGVGG  NNNNCSARFNNGVV  EAI KCL
Subjt:  FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGNNNNNNCSARFNNGVVSVEAI-KCL

Query:  NN--NINSNNSNAA----SSSSDVC-SHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQM
        NN  N NSNN+NAA    SSSSDVC SHGMMMMRSRFLQKLP KRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQM
Subjt:  NN--NINSNNSNAA----SSSSDVC-SHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQM

Query:  YRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTNSNEMD-VIRPTLSTQQKSMH
        YRTVKTTDKPAASSGQSDGSGEDDVSPT M TTRGDH GSKQF DQRA PDRS QP  D EFGCSTLWSNSSRDVWPQTNSNEMD V  PTLSTQ+K MH
Subjt:  YRTVKTTDKPAASSGQSDGSGEDDVSPT-MCTTRGDH-GSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTNSNEMD-VIRPTLSTQQKSMH

Query:  QIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKED
        QIQECDSGA+KRYNSECKKPSLEFRLGRAEWDVK+D
Subjt:  QIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKED

A0A6J1F3E1 transcription repressor KAN1-like isoform X65.3e-14081.35Show/hide
Query:  FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGNNNNNNCSARFNNGVVSVEAIKCLN
        FD+SSDGL+PIKGIPVYHNRPF FLGVDQKDH++ HQFPS+CFFPNY N+N       GVV+MGYHHH   G+GGNN+    S RFNNGVVS+EA+K LN
Subjt:  FDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGNNNNNNCSARFNNGVVSVEAIKCLN

Query:  NNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTD
         N +SN SNAASSSS VCSHG MMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTD
Subjt:  NNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTD

Query:  KPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTNSNEMDVIRPTL--STQQKSMHQIQECDSGA
        KPAASSGQSDGSGED+VSPT+ T  G HG KQFSDQRA PDRSGQ  PD EFG STLWSNSSRDVW +TN NEMDVIRP+L    QQK MHQIQECDSGA
Subjt:  KPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTNSNEMDVIRPTL--STQQKSMHQIQECDSGA

Query:  MKRYNSE-CKKPSLEFRLGRAEWDVKE
        MK+YNSE C KPSLEFRLGRAEWDV+E
Subjt:  MKRYNSE-CKKPSLEFRLGRAEWDVKE

A0A6J1F8G0 transcription repressor KAN1-like isoform X17.3e-15079.44Show/hide
Query:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHH
        +GTNNNNN  NNS +G  L+ P P P LRGLS FD+SSDGL+PIKGIPVYHNRPF FLGVDQKDH++ HQFPS+CFFPNY N+N       GVV+MGYHH
Subjt:  MGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHH

Query:  HPGGGVGGNNNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERA
        H   G+GGNN+    S RFNNGVVS+EA+K LN N +SN SNAASSSS VCSHG MMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERA
Subjt:  HPGGGVGGNNNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERA

Query:  TPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWP
        TPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGED+VSPT+ T  G HG KQFSDQRA PDRSGQ  PD EFG STLWSNSSRDVW 
Subjt:  TPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWP

Query:  QTNSNEMDVIRPTL--STQQKSMHQIQECDSGAMKRYNSE-CKKPSLEFRLGRAEWDVKE
        +TN NEMDVIRP+L    QQK MHQIQECDSGAMK+YNSE C KPSLEFRLGRAEWDV+E
Subjt:  QTNSNEMDVIRPTL--STQQKSMHQIQECDSGAMKRYNSE-CKKPSLEFRLGRAEWDVKE

A0A6J1J6F7 transcription repressor KAN1-like isoform X14.7e-14979.1Show/hide
Query:  NNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVG
        +N++NS +G  L  P   PHLRGLS FDVSSDGL+PIKGIPVY+NRPFPFLGVDQKD ++ HQFPS CFFPNY N+N       G+V+MGYHH    G+G
Subjt:  NNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVG

Query:  GNNNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLE
        GNN+    S+RFNNGVVS+EA+K LN N +SN SNAASSSSDVCSHG MMMRSRFLQKLPAKRSMR+PRMRWTTSLHARFVHAVEHLGGHERATPKSVLE
Subjt:  GNNNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLE

Query:  LMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTNSNEM
        LMDVKDLTLAHVKSHLQMYRTV+TTDKPAASSGQSDGSGED+VSPT+ TT GDHG KQFSDQRA PDRS Q  PD EFG STLWSNSSRDVW QTN+NEM
Subjt:  LMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTNSNEM

Query:  DVIRPTLST----QQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKE
        DVIRP+L T    QQK MHQIQECDSGAMKRY+SEC KPSLEFRLGRAEWDVKE
Subjt:  DVIRPTLST----QQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVKE

SwissProt top hitse value%identityAlignment
Q0J235 Probable transcription factor RL91.7e-4241.05Show/hide
Query:  PIKGIPVYHNRP--FPFL-----GVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVV-----EMGYHHHPGGGVGGNNNNNNCSARFN--NGVVSVEA
        PI+GIP+Y N P  FPFL       D   HH+ H  P   F+ +Y + +T  S     +            P      + ++   SA     NG++SV  
Subjt:  PIKGIPVYHNRP--FPFL-----GVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVV-----EMGYHHHPGGGVGGNNNNNNCSARFN--NGVVSVEA

Query:  IKCLNNNINSNNSNAASSSSDVCS--------HG----MMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTL
            ++ + S     A++   +          HG    +  + SRF+ KLPAKRSMRAPRMRWT++LHARFVHAVE LGGHERATPKSVLELMDVKDLTL
Subjt:  IKCLNNNINSNNSNAASSSSDVCS--------HG----MMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTL

Query:  AHVKSHLQMYRTVKTTDKPAASSGQSD-GSGEDDVS----------PTMCTTRGDHGSKQFSDQRAQPDRS-------------GQPLPDSEFG--CSTL
        AHVKSHLQMYRTVK+TDKPAASSG +D GSG+++ +           +MC  RG  G    +   A+  RS             G  +  S  G   +T 
Subjt:  AHVKSHLQMYRTVKTTDKPAASSGQSD-GSGEDDVS----------PTMCTTRGDHGSKQFSDQRAQPDRS-------------GQPLPDSEFG--CSTL

Query:  WSNSSRDVWPQTNSNEMDVIRPTLSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRAEW
        WSNSSRD W  +NS  MD  R         +  ++ C S + +  N E   PSLEF LGR +W
Subjt:  WSNSSRDVWPQTNSNEMDVIRPTLSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRAEW

Q93WJ9 Transcription repressor KAN16.1e-5345.37Show/hide
Query:  RPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGNNNNNNCSARFNNGVVSVEAIKCLN----NNIN
        RPI+GIPVYHNR FP           FHQ           N++    GGG + ++                 N S+ +NN   S+++   L     ++ +
Subjt:  RPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGNNNNNNCSARFNNGVVSVEAIKCLN----NNIN

Query:  SNNSNAASSSSDVCS------HGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKT
         +N      SSD  S      H   M+RSRFL K+P KRSMRAPRMRWT+SLHARFVHAVE LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKT
Subjt:  SNNSNAASSSSDVCS------HGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKT

Query:  TDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFS-DQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTN--SNEMDVIRPTLSTQQKS-------M
        T+KPAAS   SDGSGE+++        G+    Q S DQRAQ D +              WSNSSR+ WP +N  S+++D +  T ST   S        
Subjt:  TDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFS-DQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTN--SNEMDVIRPTLSTQQKS-------M

Query:  HQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVK
        +Q Q  +  A +  N  C+ PSLEF LGR +W  K
Subjt:  HQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVK

Q941I2 Probable transcription factor KAN31.4e-2840.18Show/hide
Query:  NNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLP----AKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTV
        + +N  +++    ++  CS  +     R  Q  P    AKR +RAPRMRWTT+LHA FVHAV+ LGGHERATPKSVLELMDV+DLTLAHVKSHLQMYRT+
Subjt:  NNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLP----AKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTV

Query:  KTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTN---SNEMDVI--RPTLSTQQKSMHQI
        K+T+KP  SSGQSD           C    ++GS+  S++ A+               + LW+NSS +   Q     S+ +D+         ++   ++ 
Subjt:  KTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTN---SNEMDVI--RPTLSTQQKSMHQI

Query:  QECDSGAMKRYNSECKKPSLEFRL
           DS ++     E + P+L+F L
Subjt:  QECDSGAMKRYNSECKKPSLEFRL

Q9C616 Probable transcription factor KAN28.3e-4240Show/hide
Query:  NNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSAC--FFPNYLNTNTNGSGGGGVVEMGYHHH
        N N++N   SF   T        HL+G    D+++  LRPI+GIP+YHN P          HH+ H+ P  C  F P+ L  +++ S             
Subjt:  NNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSAC--FFPNYLNTNTNGSGGGGVVEMGYHHH

Query:  PGGGVGGNNNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERAT
            + GNNN+      FN   VS           N N  N          H   + R+RF+ + PAKRSMRAPRMRWTT+LHARFVHAVE LGGHERAT
Subjt:  PGGGVGGNNNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERAT

Query:  PKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRD--VW
        PKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDK AASSGQSD             + GD+ S  +     +  R  + L +     + LW+NSS +  + 
Subjt:  PKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRD--VW

Query:  PQTNSNEMDVIRPT---LSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRA
         +   N  +++ P+   L  +  S  +I   +  +     +   KP+LEF LGR+
Subjt:  PQTNSNEMDVIRPT---LSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRA

Q9FJV5 Probable transcription factor KAN41.3e-2665.26Show/hide
Query:  KRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGS
        KRS+RAPRMRWT++LHA FVHAV+ LGGHERATPKSVLELM+VKDLTLAHVKSHLQMYRTVK TDK +   G+ +   E  +         D G+
Subjt:  KRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGS

Arabidopsis top hitse value%identityAlignment
AT1G32240.1 Homeodomain-like superfamily protein5.9e-4340Show/hide
Query:  NNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSAC--FFPNYLNTNTNGSGGGGVVEMGYHHH
        N N++N   SF   T        HL+G    D+++  LRPI+GIP+YHN P          HH+ H+ P  C  F P+ L  +++ S             
Subjt:  NNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSAC--FFPNYLNTNTNGSGGGGVVEMGYHHH

Query:  PGGGVGGNNNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERAT
            + GNNN+      FN   VS           N N  N          H   + R+RF+ + PAKRSMRAPRMRWTT+LHARFVHAVE LGGHERAT
Subjt:  PGGGVGGNNNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERAT

Query:  PKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRD--VW
        PKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDK AASSGQSD             + GD+ S  +     +  R  + L +     + LW+NSS +  + 
Subjt:  PKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRD--VW

Query:  PQTNSNEMDVIRPT---LSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRA
         +   N  +++ P+   L  +  S  +I   +  +     +   KP+LEF LGR+
Subjt:  PQTNSNEMDVIRPT---LSTQQKSMHQIQECDSGAMKRYNSECKKPSLEFRLGRA

AT4G17695.1 Homeodomain-like superfamily protein9.8e-3040.18Show/hide
Query:  NNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLP----AKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTV
        + +N  +++    ++  CS  +     R  Q  P    AKR +RAPRMRWTT+LHA FVHAV+ LGGHERATPKSVLELMDV+DLTLAHVKSHLQMYRT+
Subjt:  NNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLP----AKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTV

Query:  KTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTN---SNEMDVI--RPTLSTQQKSMHQI
        K+T+KP  SSGQSD           C    ++GS+  S++ A+               + LW+NSS +   Q     S+ +D+         ++   ++ 
Subjt:  KTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTN---SNEMDVI--RPTLSTQQKSMHQI

Query:  QECDSGAMKRYNSECKKPSLEFRL
           DS ++     E + P+L+F L
Subjt:  QECDSGAMKRYNSECKKPSLEFRL

AT5G16560.1 Homeodomain-like superfamily protein4.4e-5445.37Show/hide
Query:  RPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGNNNNNNCSARFNNGVVSVEAIKCLN----NNIN
        RPI+GIPVYHNR FP           FHQ           N++    GGG + ++                 N S+ +NN   S+++   L     ++ +
Subjt:  RPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGNNNNNNCSARFNNGVVSVEAIKCLN----NNIN

Query:  SNNSNAASSSSDVCS------HGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKT
         +N      SSD  S      H   M+RSRFL K+P KRSMRAPRMRWT+SLHARFVHAVE LGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKT
Subjt:  SNNSNAASSSSDVCS------HGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKT

Query:  TDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFS-DQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTN--SNEMDVIRPTLSTQQKS-------M
        T+KPAAS   SDGSGE+++        G+    Q S DQRAQ D +              WSNSSR+ WP +N  S+++D +  T ST   S        
Subjt:  TDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFS-DQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTN--SNEMDVIRPTLSTQQKS-------M

Query:  HQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVK
        +Q Q  +  A +  N  C+ PSLEF LGR +W  K
Subjt:  HQIQECDSGAMKRYNSECKKPSLEFRLGRAEWDVK

AT5G42630.1 Homeodomain-like superfamily protein9.2e-2865.26Show/hide
Query:  KRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGS
        KRS+RAPRMRWT++LHA FVHAV+ LGGHERATPKSVLELM+VKDLTLAHVKSHLQMYRTVK TDK +   G+ +   E  +         D G+
Subjt:  KRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGS

AT5G42630.2 Homeodomain-like superfamily protein9.2e-2865.26Show/hide
Query:  KRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGS
        KRS+RAPRMRWT++LHA FVHAV+ LGGHERATPKSVLELM+VKDLTLAHVKSHLQMYRTVK TDK +   G+ +   E  +         D G+
Subjt:  KRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGGAACCAACAACAACAATAACAACCACAACAATTCCTTCGCTGGAGCAACACTGCATCCACCGCCACCGCTCCCGCATCTCCGGGGGCTTTCGACGTTTGATGT
ATCATCGGACGGTTTACGACCGATCAAAGGGATTCCGGTGTATCACAACCGTCCATTTCCATTTTTGGGTGTGGATCAAAAAGATCACCATTATTTTCACCAATTTCCAT
CAGCTTGTTTCTTCCCAAATTACTTGAACACAAACACAAACGGCAGCGGCGGAGGAGGAGTCGTTGAAATGGGTTATCATCATCATCCAGGCGGCGGAGTTGGAGGAAAT
AATAATAATAATAATTGTTCAGCGAGGTTCAACAATGGGGTTGTTTCTGTTGAAGCAATTAAATGTTTGAATAATAATATTAATAGTAATAATAGTAATGCAGCTTCTTC
TTCTTCTGATGTTTGTTCTCATGGGATGATGATGATGAGATCCAGATTTTTGCAGAAGCTTCCTGCAAAACGAAGCATGAGAGCTCCTAGAATGAGATGGACTACCTCCT
TACATGCTCGTTTCGTTCATGCTGTCGAACATCTCGGCGGCCATGAAAGAGCAACACCAAAGTCAGTTCTTGAGCTGATGGATGTTAAGGATCTAACACTAGCTCATGTC
AAAAGCCATTTACAGATGTATCGAACTGTTAAGACCACTGACAAACCTGCAGCTTCCTCAGGCCAATCGGATGGTTCGGGAGAGGATGATGTATCTCCTACAATGTGCAC
GACCAGAGGGGATCATGGCTCCAAGCAATTTTCCGATCAAAGAGCTCAGCCCGACCGGTCCGGCCAGCCACTGCCAGATTCGGAATTTGGCTGCTCCACACTGTGGTCTA
ATTCTTCAAGAGATGTTTGGCCACAAACAAACTCTAATGAAATGGATGTCATTAGACCAACTTTGTCTACACAACAAAAATCCATGCACCAAATTCAGGAATGTGATTCA
GGTGCAATGAAGAGATATAATTCAGAATGCAAGAAACCAAGCTTGGAGTTCAGATTAGGCAGAGCAGAGTGGGATGTGAAAGAAGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGGGAACCAACAACAACAATAACAACCACAACAATTCCTTCGCTGGAGCAACACTGCATCCACCGCCACCGCTCCCGCATCTCCGGGGGCTTTCGACGTTTGATGT
ATCATCGGACGGTTTACGACCGATCAAAGGGATTCCGGTGTATCACAACCGTCCATTTCCATTTTTGGGTGTGGATCAAAAAGATCACCATTATTTTCACCAATTTCCAT
CAGCTTGTTTCTTCCCAAATTACTTGAACACAAACACAAACGGCAGCGGCGGAGGAGGAGTCGTTGAAATGGGTTATCATCATCATCCAGGCGGCGGAGTTGGAGGAAAT
AATAATAATAATAATTGTTCAGCGAGGTTCAACAATGGGGTTGTTTCTGTTGAAGCAATTAAATGTTTGAATAATAATATTAATAGTAATAATAGTAATGCAGCTTCTTC
TTCTTCTGATGTTTGTTCTCATGGGATGATGATGATGAGATCCAGATTTTTGCAGAAGCTTCCTGCAAAACGAAGCATGAGAGCTCCTAGAATGAGATGGACTACCTCCT
TACATGCTCGTTTCGTTCATGCTGTCGAACATCTCGGCGGCCATGAAAGAGCAACACCAAAGTCAGTTCTTGAGCTGATGGATGTTAAGGATCTAACACTAGCTCATGTC
AAAAGCCATTTACAGATGTATCGAACTGTTAAGACCACTGACAAACCTGCAGCTTCCTCAGGCCAATCGGATGGTTCGGGAGAGGATGATGTATCTCCTACAATGTGCAC
GACCAGAGGGGATCATGGCTCCAAGCAATTTTCCGATCAAAGAGCTCAGCCCGACCGGTCCGGCCAGCCACTGCCAGATTCGGAATTTGGCTGCTCCACACTGTGGTCTA
ATTCTTCAAGAGATGTTTGGCCACAAACAAACTCTAATGAAATGGATGTCATTAGACCAACTTTGTCTACACAACAAAAATCCATGCACCAAATTCAGGAATGTGATTCA
GGTGCAATGAAGAGATATAATTCAGAATGCAAGAAACCAAGCTTGGAGTTCAGATTAGGCAGAGCAGAGTGGGATGTGAAAGAAGACTAA
Protein sequenceShow/hide protein sequence
MMGTNNNNNNHNNSFAGATLHPPPPLPHLRGLSTFDVSSDGLRPIKGIPVYHNRPFPFLGVDQKDHHYFHQFPSACFFPNYLNTNTNGSGGGGVVEMGYHHHPGGGVGGN
NNNNNCSARFNNGVVSVEAIKCLNNNINSNNSNAASSSSDVCSHGMMMMRSRFLQKLPAKRSMRAPRMRWTTSLHARFVHAVEHLGGHERATPKSVLELMDVKDLTLAHV
KSHLQMYRTVKTTDKPAASSGQSDGSGEDDVSPTMCTTRGDHGSKQFSDQRAQPDRSGQPLPDSEFGCSTLWSNSSRDVWPQTNSNEMDVIRPTLSTQQKSMHQIQECDS
GAMKRYNSECKKPSLEFRLGRAEWDVKED