; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0018740 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0018740
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPlastid envelope DNA binding protein
Genome locationchr03:7376510..7380837
RNA-Seq ExpressionPI0018740
SyntenyPI0018740
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057131.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa]1.5e-22091.31Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVE+FIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSSK+VHFP                 QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA
        KNH TPLATDVLVNEH   EEVVK ESGMPINHVTPLATDVVVETFPLD VPWDVNGSDVRSEILIST+ASEKQVSQSIELESDVGL NITASD VVEKA
Subjt:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA

Query:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG
        GENFAGPLSETK DLVEVAQIVEISNGSTVKEGS+HEVGGPEL VCSDTP+SVNFEQGQKSSKMKS IASENLNKTFSNDFDQ SKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG

Query:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKES+PTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWS+
Subjt:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

TYJ97112.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa]5.0e-20887.53Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVE+FIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSSK+VHFP                 QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA
        KNH TPLATDVLVNEH   EEVVK ESGMPINHVTPLATDVVVETFPLD VPWDVNGSDVRSEILIST+ASEKQVSQSIELESDVGL NITASD VVEKA
Subjt:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA

Query:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG
        GENFAGPLSETK DLVEVAQIVEISNGSTVKEGS+HEVGGPEL VCSDTP+SVNFEQGQKSSKMK+                  SKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG

Query:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKES+PTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWS+
Subjt:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

XP_004138835.1 uncharacterized protein LOC101202832 [Cucumis sativus]6.3e-21187.97Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEHNTDHS
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVE+FIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEEHN DHS
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEHNTDHS

Query:  LEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPK
        LEQNPLHSIAIEP SPLTLSS +VHFP                 QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMPK
Subjt:  LEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPK

Query:  NHVTPLATD-VLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA
        NHVT LATD VLVNEH   EEVVK ESGMPIN+VTPLATDVVVETFPLDSVPWDVNG DVRSEILISTSASEKQVSQSIELESDVGLFNIT SDCVVEKA
Subjt:  NHVTPLATD-VLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA

Query:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG
         EN   PL++TK DLV+ AQIVEISNGSTVKEGSIHEVGGPEL VCSDTPVSV+FEQGQKSSKMKS IASENLNKTFSNDFDQ SKIEI+NKVDPGQTGG
Subjt:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG

Query:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKES+PTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

XP_008456554.1 PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo]6.7e-22191.54Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVE+FIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSSK+VHFP                 QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA
        KNH TPLATDVLVNEH   EEVVK ESGMPINHVTPLATDVVVETFPLD VPWDVNGSDVRSEILIST+ASEKQVSQSIELESDVGL NITASD VVEKA
Subjt:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA

Query:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG
        GENFAGPLSETK DLVEVAQIVEISNGSTVKEGS+HEVGGPEL VCSDTP+SVNFEQGQKSSKMKS IASENLNKTFSNDFDQ SKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG

Query:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKES+PTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
Subjt:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

XP_008456557.1 PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo]2.2e-20887.75Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVE+FIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSSK+VHFP                 QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA
        KNH TPLATDVLVNEH   EEVVK ESGMPINHVTPLATDVVVETFPLD VPWDVNGSDVRSEILIST+ASEKQVSQSIELESDVGL NITASD VVEKA
Subjt:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA

Query:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG
        GENFAGPLSETK DLVEVAQIVEISNGSTVKEGS+HEVGGPEL VCSDTP+SVNFEQGQKSSKMK+                  SKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG

Query:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKES+PTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
Subjt:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

TrEMBL top hitse value%identityAlignment
A0A0A0LML9 Uncharacterized protein3.1e-21187.97Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEHNTDHS
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVE+FIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEEHN DHS
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEHNTDHS

Query:  LEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPK
        LEQNPLHSIAIEP SPLTLSS +VHFP                 QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMPK
Subjt:  LEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPK

Query:  NHVTPLATD-VLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA
        NHVT LATD VLVNEH   EEVVK ESGMPIN+VTPLATDVVVETFPLDSVPWDVNG DVRSEILISTSASEKQVSQSIELESDVGLFNIT SDCVVEKA
Subjt:  NHVTPLATD-VLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA

Query:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG
         EN   PL++TK DLV+ AQIVEISNGSTVKEGSIHEVGGPEL VCSDTPVSV+FEQGQKSSKMKS IASENLNKTFSNDFDQ SKIEI+NKVDPGQTGG
Subjt:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG

Query:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKES+PTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

A0A1S3C344 uncharacterized protein LOC103496473 isoform X21.1e-20887.75Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVE+FIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSSK+VHFP                 QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA
        KNH TPLATDVLVNEH   EEVVK ESGMPINHVTPLATDVVVETFPLD VPWDVNGSDVRSEILIST+ASEKQVSQSIELESDVGL NITASD VVEKA
Subjt:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA

Query:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG
        GENFAGPLSETK DLVEVAQIVEISNGSTVKEGS+HEVGGPEL VCSDTP+SVNFEQGQKSSKMK+                  SKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG

Query:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKES+PTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
Subjt:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

A0A1S3C473 uncharacterized protein LOC103496473 isoform X13.3e-22191.54Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVE+FIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSSK+VHFP                 QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA
        KNH TPLATDVLVNEH   EEVVK ESGMPINHVTPLATDVVVETFPLD VPWDVNGSDVRSEILIST+ASEKQVSQSIELESDVGL NITASD VVEKA
Subjt:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA

Query:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG
        GENFAGPLSETK DLVEVAQIVEISNGSTVKEGS+HEVGGPEL VCSDTP+SVNFEQGQKSSKMKS IASENLNKTFSNDFDQ SKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG

Query:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKES+PTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
Subjt:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein7.3e-22191.31Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVE+FIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSSK+VHFP                 QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA
        KNH TPLATDVLVNEH   EEVVK ESGMPINHVTPLATDVVVETFPLD VPWDVNGSDVRSEILIST+ASEKQVSQSIELESDVGL NITASD VVEKA
Subjt:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA

Query:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG
        GENFAGPLSETK DLVEVAQIVEISNGSTVKEGS+HEVGGPEL VCSDTP+SVNFEQGQKSSKMKS IASENLNKTFSNDFDQ SKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG

Query:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKES+PTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWS+
Subjt:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

A0A5D3BB97 Plastid envelope DNA binding protein2.4e-20887.53Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVE+FIKK+QESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSSK+VHFP                 QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSKQVHFP-----------------QCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMP

Query:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA
        KNH TPLATDVLVNEH   EEVVK ESGMPINHVTPLATDVVVETFPLD VPWDVNGSDVRSEILIST+ASEKQVSQSIELESDVGL NITASD VVEKA
Subjt:  KNHVTPLATDVLVNEHK--EEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKA

Query:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG
        GENFAGPLSETK DLVEVAQIVEISNGSTVKEGS+HEVGGPEL VCSDTP+SVNFEQGQKSSKMK+                  SKIEIENKVDPGQTGG
Subjt:  GENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTPVSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGG

Query:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE
        SQKES+PTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWS+
Subjt:  SQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding4.3e-4832.48Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEHNTDHS
        MH++K    G+  ALAK +++ G++TR R  KEERK +VE FIKK Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENR+LGPG LLLE + +   
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEHNTDHS

Query:  LEQNPLHSIAIEPQSPLTLSSKQVHFPQCTATNIQGSQNESIINGSLVDVSN--------------------EDSDEFIQSELLV--NEHKEVE------
         +Q+   SI ++P  PL+LS    H     + +      E  +NGS V + N                     DS +   ++L    +E  +++      
Subjt:  LEQNPLHSIAIEPQSPLTLSSKQVHFPQCTATNIQGSQNESIINGSLVDVSN--------------------EDSDEFIQSELLV--NEHKEVE------

Query:  -------------------EVVEKESG------MPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILIS
                           +V  K+ G      M  +   P+  D  VN+    + ++++G+       ++ + VVETFPL SV   ++  D +   L  
Subjt:  -------------------EVVEKESG------MPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILIS

Query:  TSASEKQVSQSIELES------DVGLFNITASDCVVEKAG-ENFAGPLSETKLDLVEVAQIVEISNGSTV-------KE--------GSIHEV---GGPE
             K     +E +       D+G  + + S  V+E  G E   G +       +E     EI N ++V       KE        G++HE        
Subjt:  TSASEKQVSQSIELES------DVGLFNITASDCVVEKAG-ENFAGPLSETKLDLVEVAQIVEISNGSTV-------KE--------GSIHEV---GGPE

Query:  LAVCSDTPVSVNFEQGQKSSKMK-STIASENLNKTFSNDFDQPSKIEIENKVDPGQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIA
        L      P S      +K+ + K  T++S   N+  S   ++ + +E + K+D   +  SQKE+  TLNRI  ESW+G S N  + E NPLL ++KSF+ 
Subjt:  LAVCSDTPVSVNFEQGQKSSKMK-STIASENLNKTFSNDFDQPSKIEIENKVDPGQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIA

Query:  AFVKFWSE
        AFVKFWSE
Subjt:  AFVKFWSE

AT3G52170.2 DNA binding4.3e-4832.48Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEHNTDHS
        MH++K    G+  ALAK +++ G++TR R  KEERK +VE FIKK Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENR+LGPG LLLE + +   
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEHNTDHS

Query:  LEQNPLHSIAIEPQSPLTLSSKQVHFPQCTATNIQGSQNESIINGSLVDVSN--------------------EDSDEFIQSELLV--NEHKEVE------
         +Q+   SI ++P  PL+LS    H     + +      E  +NGS V + N                     DS +   ++L    +E  +++      
Subjt:  LEQNPLHSIAIEPQSPLTLSSKQVHFPQCTATNIQGSQNESIINGSLVDVSN--------------------EDSDEFIQSELLV--NEHKEVE------

Query:  -------------------EVVEKESG------MPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILIS
                           +V  K+ G      M  +   P+  D  VN+    + ++++G+       ++ + VVETFPL SV   ++  D +   L  
Subjt:  -------------------EVVEKESG------MPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLATDVVVETFPLDSVPWDVNGSDVRSEILIS

Query:  TSASEKQVSQSIELES------DVGLFNITASDCVVEKAG-ENFAGPLSETKLDLVEVAQIVEISNGSTV-------KE--------GSIHEV---GGPE
             K     +E +       D+G  + + S  V+E  G E   G +       +E     EI N ++V       KE        G++HE        
Subjt:  TSASEKQVSQSIELES------DVGLFNITASDCVVEKAG-ENFAGPLSETKLDLVEVAQIVEISNGSTV-------KE--------GSIHEV---GGPE

Query:  LAVCSDTPVSVNFEQGQKSSKMK-STIASENLNKTFSNDFDQPSKIEIENKVDPGQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIA
        L      P S      +K+ + K  T++S   N+  S   ++ + +E + K+D   +  SQKE+  TLNRI  ESW+G S N  + E NPLL ++KSF+ 
Subjt:  LAVCSDTPVSVNFEQGQKSSKMK-STIASENLNKTFSNDFDQPSKIEIENKVDPGQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIA

Query:  AFVKFWSE
        AFVKFWSE
Subjt:  AFVKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein7.4e-0826.67Show/hide
Query:  RRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEHNT---DHSLEQNPLHSIAIEPQSPLTLSSKQ
        R SK++R+A+VE F+ +++ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E  ++   D S   +P     +E ++   LS   
Subjt:  RRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEHNT---DHSLEQNPLHSIAIEPQSPLTLSSKQ

Query:  VHFPQCTATNIQGSQNESIINGSLVDVS----NEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLAT
           P   ++++  S    +   +L +VS     + S  F  + + + E + +  V         +H +P    ++  +   EV         +H +P+  
Subjt:  VHFPQCTATNIQGSQNESIINGSLVDVS----NEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLAT

Query:  DVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASD
         VV   F   SV  ++ GS+ R +I I TS S      + +  + V    ITA D
Subjt:  DVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASD

AT5G58210.2 hydroxyproline-rich glycoprotein family protein7.4e-0826.67Show/hide
Query:  RRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEHNT---DHSLEQNPLHSIAIEPQSPLTLSSKQ
        R SK++R+A+VE F+ +++ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E  ++   D S   +P     +E ++   LS   
Subjt:  RRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEHNT---DHSLEQNPLHSIAIEPQSPLTLSSKQ

Query:  VHFPQCTATNIQGSQNESIINGSLVDVS----NEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLAT
           P   ++++  S    +   +L +VS     + S  F  + + + E + +  V         +H +P    ++  +   EV         +H +P+  
Subjt:  VHFPQCTATNIQGSQNESIINGSLVDVS----NEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLAT

Query:  DVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASD
         VV   F   SV  ++ GS+ R +I I TS S      + +  + V    ITA D
Subjt:  DVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASD

AT5G58210.3 hydroxyproline-rich glycoprotein family protein7.4e-0826.67Show/hide
Query:  RRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEHNT---DHSLEQNPLHSIAIEPQSPLTLSSKQ
        R SK++R+A+VE F+ +++ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E  ++   D S   +P     +E ++   LS   
Subjt:  RRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGP--GKLLLEEHNT---DHSLEQNPLHSIAIEPQSPLTLSSKQ

Query:  VHFPQCTATNIQGSQNESIINGSLVDVS----NEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLAT
           P   ++++  S    +   +L +VS     + S  F  + + + E + +  V         +H +P    ++  +   EV         +H +P+  
Subjt:  VHFPQCTATNIQGSQNESIINGSLVDVS----NEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLAT

Query:  DVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASD
         VV   F   SV  ++ GS+ R +I I TS S      + +  + V    ITA D
Subjt:  DVVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGCTATAAAGGGTGGGTGGACGGGGCGTCCTCTCGCCCTAGCCAAGAACAATGAGGCTGAAGGGAGGAAGACTAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGC
AATGGTTGAAATTTTCATAAAAAAGTTTCAGGAATCAAATAATGGGAGTTTCCCCTCGCTCAACCTCACTCACAAGGAAGTTGGGGGATCTTTCTATACAGTGCGAGAGA
TTGTACGTGATATAATCCAAGAAAATAGAATCCTTGGTCCAGGAAAGTTGTTATTAGAAGAGCACAATACGGATCATTCACTTGAACAGAATCCACTCCATTCTATTGCT
ATTGAACCTCAGTCTCCTTTAACCTTATCATCTAAGCAAGTCCATTTTCCACAATGCACAGCAACAAATATTCAGGGATCACAGAATGAGTCAATAATTAATGGAAGCCT
TGTGGATGTAAGTAACGAGGATTCTGATGAATTTATCCAGTCAGAGTTGCTAGTAAATGAACACAAGGAAGTAGAGGAAGTGGTTGAAAAAGAATCAGGAATGCCAAAAA
ATCACGTAACTCCTTTGGCAACGGATGTGCTAGTAAATGAACACAAGGAAGAAGTGGTGAAAGTGGAATCAGGAATGCCAATTAATCATGTAACTCCTTTGGCAACAGAT
GTTGTCGTAGAGACATTCCCATTGGATTCAGTTCCTTGGGATGTTAATGGTTCAGATGTAAGATCTGAGATATTGATTTCAACTAGTGCCTCAGAGAAGCAAGTTAGTCA
ATCCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTACAGCTTCTGACTGTGTAGTTGAGAAAGCAGGGGAAAACTTTGCAGGTCCATTATCAGAAACGAAGTTGG
ATTTGGTGGAGGTAGCACAAATTGTTGAAATTTCTAATGGGTCTACTGTTAAAGAAGGTAGCATACATGAAGTTGGGGGTCCTGAGTTGGCAGTTTGCAGTGATACTCCA
GTATCTGTGAACTTTGAACAAGGTCAGAAATCTAGTAAAATGAAGTCTACAATTGCTTCCGAGAATCTCAATAAGACATTCAGCAATGACTTTGATCAGCCCTCAAAAAT
CGAGATAGAAAATAAAGTAGATCCTGGACAGACTGGCGGCTCCCAGAAAGAAAGCATTCCAACATTAAATAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACT
CATCAAAACCTGAAAACAACCCGCTTTTGGAAATCATCAAGTCATTCATCGCTGCCTTCGTGAAGTTTTGGTCCGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATGCTATAAAGGGTGGGTGGACGGGGCGTCCTCTCGCCCTAGCCAAGAACAATGAGGCTGAAGGGAGGAAGACTAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGC
AATGGTTGAAATTTTCATAAAAAAGTTTCAGGAATCAAATAATGGGAGTTTCCCCTCGCTCAACCTCACTCACAAGGAAGTTGGGGGATCTTTCTATACAGTGCGAGAGA
TTGTACGTGATATAATCCAAGAAAATAGAATCCTTGGTCCAGGAAAGTTGTTATTAGAAGAGCACAATACGGATCATTCACTTGAACAGAATCCACTCCATTCTATTGCT
ATTGAACCTCAGTCTCCTTTAACCTTATCATCTAAGCAAGTCCATTTTCCACAATGCACAGCAACAAATATTCAGGGATCACAGAATGAGTCAATAATTAATGGAAGCCT
TGTGGATGTAAGTAACGAGGATTCTGATGAATTTATCCAGTCAGAGTTGCTAGTAAATGAACACAAGGAAGTAGAGGAAGTGGTTGAAAAAGAATCAGGAATGCCAAAAA
ATCACGTAACTCCTTTGGCAACGGATGTGCTAGTAAATGAACACAAGGAAGAAGTGGTGAAAGTGGAATCAGGAATGCCAATTAATCATGTAACTCCTTTGGCAACAGAT
GTTGTCGTAGAGACATTCCCATTGGATTCAGTTCCTTGGGATGTTAATGGTTCAGATGTAAGATCTGAGATATTGATTTCAACTAGTGCCTCAGAGAAGCAAGTTAGTCA
ATCCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTACAGCTTCTGACTGTGTAGTTGAGAAAGCAGGGGAAAACTTTGCAGGTCCATTATCAGAAACGAAGTTGG
ATTTGGTGGAGGTAGCACAAATTGTTGAAATTTCTAATGGGTCTACTGTTAAAGAAGGTAGCATACATGAAGTTGGGGGTCCTGAGTTGGCAGTTTGCAGTGATACTCCA
GTATCTGTGAACTTTGAACAAGGTCAGAAATCTAGTAAAATGAAGTCTACAATTGCTTCCGAGAATCTCAATAAGACATTCAGCAATGACTTTGATCAGCCCTCAAAAAT
CGAGATAGAAAATAAAGTAGATCCTGGACAGACTGGCGGCTCCCAGAAAGAAAGCATTCCAACATTAAATAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACT
CATCAAAACCTGAAAACAACCCGCTTTTGGAAATCATCAAGTCATTCATCGCTGCCTTCGTGAAGTTTTGGTCCGAGTAAGTAATATGATTGTCAAGTAGACGGATAGAG
AGTAGTAGCTAAATTTTCTGCCACAGAACCTGTCTGTCTTTGTACCGAAGTTGCAGTCGGTTACCCCGTTCACTCCAGTTGGTCCCATACACGATATTTATGAGGACAAA
ACTGGATTCTGGATGTGGGTTGGCATTTGTGTACTCTGCAGTAGGAAAGAAAGTTTAATAGTAGGCATTTCCCACCCCTCCCCTCAGTAAAAATAGATAAGCAATAGCTT
GTACAAGGGATATTTTCTTATCTTTTTACTATTGCCATGAGATAATATGTCCCACCCTCCATGATTTTATTATAGATACGAAGGTTTCCTATTA
Protein sequenceShow/hide protein sequence
MHAIKGGWTGRPLALAKNNEAEGRKTRIRRSKEERKAMVEIFIKKFQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGKLLLEEHNTDHSLEQNPLHSIA
IEPQSPLTLSSKQVHFPQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNEHKEVEEVVEKESGMPKNHVTPLATDVLVNEHKEEVVKVESGMPINHVTPLATD
VVVETFPLDSVPWDVNGSDVRSEILISTSASEKQVSQSIELESDVGLFNITASDCVVEKAGENFAGPLSETKLDLVEVAQIVEISNGSTVKEGSIHEVGGPELAVCSDTP
VSVNFEQGQKSSKMKSTIASENLNKTFSNDFDQPSKIEIENKVDPGQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNPLLEIIKSFIAAFVKFWSE