CuGenDBv2

IVF0008571 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0008571
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Plastid envelope DNA binding protein
Genome location: tig00000281:24340..29735
RNA-Seq Expression: IVF0008571
Synteny: IVF0008571
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
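The genome location field packs a contig name and a coordinate range into one string. As a minimal sketch, the snippet below splits it into parts and derives the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A contig name plus a start..end range (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'contig:start..end' string such as the one shown on this page."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


loc = parse_location("tig00000281:24340..29735")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 5396 bases of contig tig00000281, introns included.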


Homology
GenBank top hits | e-value | % identity | Alignment
KAA0057131.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa] | 0.0 | 99.78
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
        KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
Subjt:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA

Query:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
        GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG

Query:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWS+
Subjt:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

TYJ97112.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa] | 2.35e-300 | 95.77
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
        KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
Subjt:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA

Query:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
        GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMK                  ASKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG

Query:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWS+
Subjt:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

XP_004138835.1 uncharacterized protein LOC101202832 [Cucumis sativus] | 8.79e-289 | 92
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHATPLATDV-LVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHATPLATDV-LVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEK

Query:  AGENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AGENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTG

Query:  GSQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

XP_008456554.1 PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo] | 0.0 | 100
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
        KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
Subjt:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA

Query:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
        GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG

Query:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
Subjt:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

XP_008456557.1 PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo] | 2.03e-301 | 95.99
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
        KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
Subjt:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA

Query:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
        GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMK                  ASKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG

Query:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
Subjt:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

TrEMBL top hits | e-value | % identity | Alignment
A0A0A0LML9 Uncharacterized protein | 1.0e-225 | 92
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLL EEHN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHATPLATD-VLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEK
        KNH T LATD VLVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHATPLATD-VLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEK

Query:  AGENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AGENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTG

Query:  GSQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

A0A1S3C344 uncharacterized protein LOC103496473 isoform X2 | 7.1e-235 | 95.99
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
        KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
Subjt:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA

Query:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
        GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMK                  ASKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG

Query:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
Subjt:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

A0A1S3C473 uncharacterized protein LOC103496473 isoform X1 | 1.9e-248 | 100
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
        KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
Subjt:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA

Query:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
        GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG

Query:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
Subjt:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein | 4.3e-248 | 99.78
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
        KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
Subjt:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA

Query:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
        GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG

Query:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWS+
Subjt:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

A0A5D3BB97 Plastid envelope DNA binding protein | 1.6e-234 | 95.77
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
        KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA
Subjt:  KNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKA

Query:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG
        GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMK                  ASKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKSDLVEVAQIVEISNGSTVKEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGG

Query:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWS+
Subjt:  SQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

SwissProt top hits | e-value | % identity | Alignment
No hits found

Arabidopsis top hits | e-value | % identity | Alignment
AT3G52170.1 DNA binding | 6.0e-45 | 32.55
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MH++K    G+  ALAK +++ G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENR+LGPG LLLE   +   
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHF---------------PLNYNKYINEEPIFVSDEQCTATNI----QGSQNESIINGSLVDVSNEDSD-------
          DQ+   SI ++P  PL+LS    H                 +N ++   +    VS  Q    +I    Q   +  I    L    +ED+D       
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHF---------------PLNYNKYINEEPIFVSDEQCTATNI----QGSQNESIINGSLVDVSNEDSD-------

Query:  ----EFIQSELLVNEHKEVEEVVEKESG------MPKNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEI
            E +   +      +  +V  K+ G      M  +   P+  D  VN+       +K   G        ++ + VVETFPL  V   ++  D +   
Subjt:  ----EFIQSELLVNEHKEVEEVVEKESG------MPKNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEI

Query:  LISTNASEKQVSQSIELES------DVGLSNITASDSVVEKAG-ENFAGPLSETKSDLVEVAQIVEISNGSTV-------KE--------GSMHEV---G
        L       K     +E +       D+G  + + S +V+E  G E   G +    S  +E     EI N ++V       KE        G++HE     
Subjt:  LISTNASEKQVSQSIELES------DVGLSNITASDSVVEKAG-ENFAGPLSETKSDLVEVAQIVEISNGSTV-------KE--------GSMHEV---G

Query:  GPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGGSQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSF
           L      P S      +K+ + K    S       ++  ++ + +E + K+D   +  SQKE+  TLNRI  ESW+G S N  + E NPLL ++KSF
Subjt:  GPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGGSQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSF

Query:  IAAFVKFWSE
        + AFVKFWSE
Subjt:  IAAFVKFWSE

AT3G52170.2 DNA binding | 6.0e-45 | 32.55
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH
        MH++K    G+  ALAK +++ G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENR+LGPG LLLE   +   
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDH

Query:  SLDQNPLHSIAIEPQSPLTLSSKEVHF---------------PLNYNKYINEEPIFVSDEQCTATNI----QGSQNESIINGSLVDVSNEDSD-------
          DQ+   SI ++P  PL+LS    H                 +N ++   +    VS  Q    +I    Q   +  I    L    +ED+D       
Subjt:  SLDQNPLHSIAIEPQSPLTLSSKEVHF---------------PLNYNKYINEEPIFVSDEQCTATNI----QGSQNESIINGSLVDVSNEDSD-------

Query:  ----EFIQSELLVNEHKEVEEVVEKESG------MPKNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEI
            E +   +      +  +V  K+ G      M  +   P+  D  VN+       +K   G        ++ + VVETFPL  V   ++  D +   
Subjt:  ----EFIQSELLVNEHKEVEEVVEKESG------MPKNHATPLATDVLVNEHNKVEEVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEI

Query:  LISTNASEKQVSQSIELES------DVGLSNITASDSVVEKAG-ENFAGPLSETKSDLVEVAQIVEISNGSTV-------KE--------GSMHEV---G
        L       K     +E +       D+G  + + S +V+E  G E   G +    S  +E     EI N ++V       KE        G++HE     
Subjt:  LISTNASEKQVSQSIELES------DVGLSNITASDSVVEKAG-ENFAGPLSETKSDLVEVAQIVEISNGSTV-------KE--------GSMHEV---G

Query:  GPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGGSQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSF
           L      P S      +K+ + K    S       ++  ++ + +E + K+D   +  SQKE+  TLNRI  ESW+G S N  + E NPLL ++KSF
Subjt:  GPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGGSQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSF

Query:  IAAFVKFWSE
        + AFVKFWSE
Subjt:  IAAFVKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein | 1.7e-07 | 23.41
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEEHNT--DHSLDQNPLHSIAIEPQSPLTLSSKE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E   +   D S   +P     +E ++   LS   
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEEHNT--DHSLDQNPLHSIAIEPQSPLTLSSKE

Query:  VHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVK
           P + + +++  P+ + + +              ++     V  + S  F  + + + E + +  V         +H +P    ++  E   + EV  
Subjt:  VHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVK

Query:  EESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVS
               +H +P+   VV   F    V  ++ GS+ R +I+ +++++    S
Subjt:  EESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVS

AT5G58210.2 hydroxyproline-rich glycoprotein family protein | 1.7e-07 | 23.41
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEEHNT--DHSLDQNPLHSIAIEPQSPLTLSSKE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E   +   D S   +P     +E ++   LS   
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEEHNT--DHSLDQNPLHSIAIEPQSPLTLSSKE

Query:  VHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVK
           P + + +++  P+ + + +              ++     V  + S  F  + + + E + +  V         +H +P    ++  E   + EV  
Subjt:  VHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVK

Query:  EESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVS
               +H +P+   VV   F    V  ++ GS+ R +I+ +++++    S
Subjt:  EESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVS

AT5G58210.3 hydroxyproline-rich glycoprotein family protein | 1.7e-07 | 23.41
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEEHNT--DHSLDQNPLHSIAIEPQSPLTLSSKE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E   +   D S   +P     +E ++   LS   
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEEHNT--DHSLDQNPLHSIAIEPQSPLTLSSKE

Query:  VHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVK
           P + + +++  P+ + + +              ++     V  + S  F  + + + E + +  V         +H +P    ++  E   + EV  
Subjt:  VHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVEEVVK

Query:  EESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVS
               +H +P+   VV   F    V  ++ GS+ R +I+ +++++    S
Subjt:  EESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVS
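In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the helper below counts those classes for a single block. `midline_stats` is a hypothetical name, and the figure it returns covers only the block passed in, so it will generally differ from the whole-alignment % identity listed for each hit.

```python
def midline_stats(midline: str, aligned_length: int) -> dict:
    """Summarise one BLAST-style midline (hypothetical helper).

    aligned_length is the number of alignment columns in the block, normally
    the length of the Query string including gap characters. Trailing spaces
    are often stripped from scraped text, so the midline is padded back out.
    """
    padded = midline.ljust(aligned_length)[:aligned_length]
    identical = sum(ch.isalpha() for ch in padded)   # letters = identical residues
    positives = identical + padded.count("+")        # '+' = conservative substitution
    return {
        "columns": aligned_length,
        "identical": identical,
        "positives": positives,
        "percent_identity": round(100.0 * identical / aligned_length, 2),
    }


# Last block of the XP_004138835.1 alignment, copied from this page.
query = "GSQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE"
midline = "GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE"
print(midline_stats(midline, len(query)))
```

Summing `identical` and `columns` over every block of one hit, then dividing, would give a value comparable to the reported % identity, assuming the page computes identity over the full aligned length.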


Sequences
CDS sequence
ATGCATGCTATAAAGGGTGGGTGGACGGGACGTCCTCTTGCCCTAGCCAAGAACAATGAGGCTGAAGGGAGGAAGACTAGAATTCGGCGTTCGAAGGAGGAAAGGAAGGC
TATGGTTGAAGTTTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCGCTCAACCTCACTCACAAGGAGGTTGGGGGATCTTTCTATACTGTTCGAGAGA
TTGTACGTGATATAATCCAAGAAAATAGAATCCTTGGTCCAGGAAAGTTGTTATTAGAAGAAGAGCACAATACAGATCATTCACTTGACCAGAATCCACTCCATTCAATT
GCTATTGAACCTCAATCTCCTTTAACCTTATCGTCTAAGGAAGTCCATTTTCCACTCAACTACAACAAATATATAAATGAAGAACCAATCTTTGTGTCAGATGAGCAATG
CACAGCAACAAATATTCAGGGATCACAGAATGAGTCAATAATTAATGGAAGCCTTGTGGATGTAAGCAACGAGGATTCTGATGAATTTATTCAGTCAGAGTTGCTAGTAA
ATGAACACAAGGAAGTAGAGGAAGTGGTTGAAAAAGAATCAGGAATGCCAAAAAATCACGCAACTCCTTTAGCAACAGATGTGCTAGTAAATGAACACAATAAAGTAGAG
GAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATCATGTAACTCCTCTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATCCAGTTCCTTGGGATGTTAATGG
TTCAGATGTAAGATCTGAGATATTGATTTCAACTAATGCCTCAGAGAAGCAAGTTAGTCAATCCATTGAGTTAGAATCAGATGTTGGCTTGTCTAACATTACAGCTTCTG
ACTCTGTAGTTGAGAAAGCAGGGGAAAACTTTGCAGGGCCATTATCAGAAACGAAGTCTGATTTGGTGGAGGTAGCACAAATTGTTGAAATTTCTAATGGGTCTACTGTT
AAAGAAGGTAGCATGCATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACCCCAATATCTGTGAACTTTGAACAAGGCCAGAAATCTAGTAAAATGAAGTCTCC
AATTGCTTCTGAGAATCTCAATAAGACATTCAGCAATGACTTTGATCAGGCCTCAAAAATCGAGATAGAAAATAAAGTAGATCCTGGACAGACTGGCGGCTCCCAGAAAG
AAAGCGTTCCAACATTAAATAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCTTCAAAACCTGAAAACAACCCACTTTTGGAAATCATCAAGTCATTCATC
GCTGCCTTCGTGAAGTTTTGGTCTGAGTAA
mRNA sequence
CCATTTCTGTTGGAAGTTTGTACAATGTCTCTCACTGGCTCCCTCCCTATAACTCTTCAGTAGGGTTTTAGCCTTCTTTTTATCTTCAATTCTCCTTCTACCCTTTCATT
TCGCTCTCTCTCCGCTTCTAGATCTGACTCTAAGACAAGTTGAACTTTGTGGATTTCATGCATGCTATAAAGGGTGGGTGGACGGGACGTCCTCTTGCCCTAGCCAAGAA
CAATGAGGCTGAAGGGAGGAAGACTAGAATTCGGCGTTCGAAGGAGGAAAGGAAGGCTATGGTTGAAGTTTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCC
CCTCGCTCAACCTCACTCACAAGGAGGTTGGGGGATCTTTCTATACTGTTCGAGAGATTGTACGTGATATAATCCAAGAAAATAGAATCCTTGGTCCAGGAAAGTTGTTA
TTAGAAGAAGAGCACAATACAGATCATTCACTTGACCAGAATCCACTCCATTCAATTGCTATTGAACCTCAATCTCCTTTAACCTTATCGTCTAAGGAAGTCCATTTTCC
ACTCAACTACAACAAATATATAAATGAAGAACCAATCTTTGTGTCAGATGAGCAATGCACAGCAACAAATATTCAGGGATCACAGAATGAGTCAATAATTAATGGAAGCC
TTGTGGATGTAAGCAACGAGGATTCTGATGAATTTATTCAGTCAGAGTTGCTAGTAAATGAACACAAGGAAGTAGAGGAAGTGGTTGAAAAAGAATCAGGAATGCCAAAA
AATCACGCAACTCCTTTAGCAACAGATGTGCTAGTAAATGAACACAATAAAGTAGAGGAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATCATGTAACTCCTCTGGC
AACAGATGTTGTGGTAGAGACATTCCCATTGGATCCAGTTCCTTGGGATGTTAATGGTTCAGATGTAAGATCTGAGATATTGATTTCAACTAATGCCTCAGAGAAGCAAG
TTAGTCAATCCATTGAGTTAGAATCAGATGTTGGCTTGTCTAACATTACAGCTTCTGACTCTGTAGTTGAGAAAGCAGGGGAAAACTTTGCAGGGCCATTATCAGAAACG
AAGTCTGATTTGGTGGAGGTAGCACAAATTGTTGAAATTTCTAATGGGTCTACTGTTAAAGAAGGTAGCATGCATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGA
TACCCCAATATCTGTGAACTTTGAACAAGGCCAGAAATCTAGTAAAATGAAGTCTCCAATTGCTTCTGAGAATCTCAATAAGACATTCAGCAATGACTTTGATCAGGCCT
CAAAAATCGAGATAGAAAATAAAGTAGATCCTGGACAGACTGGCGGCTCCCAGAAAGAAAGCGTTCCAACATTAAATAGAATTAATCTTGAATCATGGGAAGGGATGTCC
AAAAACTCTTCAAAACCTGAAAACAACCCACTTTTGGAAATCATCAAGTCATTCATCGCTGCCTTCGTGAAGTTTTGGTCTGAGTAAGTAATATGATTGTTAAGTAGACG
AATAGAGAGTAGTAGCTAAATTTTCTGCCACAGAACCTGTCTGTCTTGGTACCAAAGTTGCAGTCGGGTACCCCGTTCACTCCAGTCGGTCCCATACACGATATTTATGA
GGACAAAACTGGATTCTGGATGTGGGTTGGCATTTGTGTACTCTGCAGTAGGGAAGAAAGTTCAATAGTAGGCATTTCCCACCCCTCCCCTCGGTAAAAAAGATAAGCAA
TAGCTTGTACAAGGGATATTTCCTCATCTTTTTACTATTAATTACAATGACATAATATGTCCCACCCTTCATGATTTTATTATAGATACAAAGGTTTCCTATTATTTC
Protein sequence
MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEEHNTDHSLDQNPLHSI
AIEPQSPLTLSSKEVHFPLNYNKYINEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHATPLATDVLVNEHNKVE
EVVKEESGMPINHVTPLATDVVVETFPLDPVPWDVNGSDVRSEILISTNASEKQVSQSIELESDVGLSNITASDSVVEKAGENFAGPLSETKSDLVEVAQIVEISNGSTV
KEGSMHEVGGPELEVCSDTPISVNFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIENKVDPGQTGGSQKESVPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFI
AAFVKFWSE
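The protein sequence should be the conceptual translation of the CDS sequence listed earlier. The sketch below shows one way to check that relationship. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly, and `translate` is a hypothetical helper written for illustration, not a CuGenDB tool.

```python
BASES = "TCAG"
# Amino acids for the standard genetic code, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_at_terminator: bool = True) -> str:
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop line breaks from pasted text
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*" and stop_at_terminator:
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 12 bases of the CDS sequence on this page.
print(translate("ATGCATGCTATAAAGGGTGGGTGGACGGGA"))
print(translate("TGGTCTGAGTAA"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check when copying records out of the page.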