; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G24210 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G24210
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPlastid envelope DNA binding protein
Genome locationChr2:20755301..20760959
RNA-Seq ExpressionCSPI02G24210
SyntenyCSPI02G24210
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057131.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa]1.5e-22692Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSN+DSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATD VLVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

TYJ97112.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa]5.4e-21388Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSN+DSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATD VLVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

XP_004138835.1 uncharacterized protein LOC101202832 [Cucumis sativus]2.6e-24799.33Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNTDHS
        MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHN DHS
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNTDHS

Query:  LEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMPK
        LEQNPLHSIAIEP SPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSN+DSDEFIQSELLVNGHKEVEEMVEKESGMPK
Subjt:  LEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMPK

Query:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
        NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
Subjt:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA

Query:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
        EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
Subjt:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG

Query:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
Subjt:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

XP_008456554.1 PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo]6.6e-22792.22Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSN+DSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATD VLVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

XP_008456557.1 PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo]2.4e-21388.22Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSN+DSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATD VLVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

TrEMBL top hitse value%identityAlignment
A0A0A0LML9 Uncharacterized protein1.2e-24799.33Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNTDHS
        MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHN DHS
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNTDHS

Query:  LEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMPK
        LEQNPLHSIAIEP SPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSN+DSDEFIQSELLVNGHKEVEEMVEKESGMPK
Subjt:  LEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMPK

Query:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
        NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
Subjt:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA

Query:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
        EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
Subjt:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG

Query:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
Subjt:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A1S3C344 uncharacterized protein LOC103496473 isoform X21.2e-21388.22Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSN+DSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATD VLVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A1S3C473 uncharacterized protein LOC103496473 isoform X13.2e-22792.22Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSN+DSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATD VLVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein7.1e-22792Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSN+DSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATD VLVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A5D3BB97 Plastid envelope DNA binding protein2.6e-21388Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLL EEHNTDH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLL-EEHNTDH

Query:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEPQSPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSN+DSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATD VLVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding3.2e-4631.1Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNTDHS
        MH++K    G+  ALAK +++ G+RTR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENR+LGPG+LLLE + +   
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNTDHS

Query:  LEQNPLHSIAIEPQSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNKDSDEFIQSELLVNGHKEVEEMVEKESG
         +Q+   SI ++P  PL+LS N  H     +  + SE P   V+  Q    N +      ++   +  V  + DS +   ++L  +  ++ +  ++  +G
Subjt:  LEQNPLHSIAIEPQSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNKDSDEFIQSELLVNGHKEVEEMVEKESG

Query:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI
        +     T    D V     +K  +V  ++ G             P+N                       ++ + VVETFPL SV   ++  D +   L 
Subjt:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI

Query:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP
              K     +E +       D+G  + +TS  V+E         +   +++ P+ K   + +  +  V++      +        G++HE       
Subjt:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP

Query:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT
         L      P S +    +K+ + K    S       ++  ++ + +E K K+D   +  SQKE+  TLNRI  +SW+G S N  +   NPLL ++KSF+T
Subjt:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT

Query:  AFVKFWSE
        AFVKFWSE
Subjt:  AFVKFWSE

AT3G52170.2 DNA binding3.2e-4631.1Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNTDHS
        MH++K    G+  ALAK +++ G+RTR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENR+LGPG+LLLE + +   
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNTDHS

Query:  LEQNPLHSIAIEPQSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNKDSDEFIQSELLVNGHKEVEEMVEKESG
         +Q+   SI ++P  PL+LS N  H     +  + SE P   V+  Q    N +      ++   +  V  + DS +   ++L  +  ++ +  ++  +G
Subjt:  LEQNPLHSIAIEPQSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNKDSDEFIQSELLVNGHKEVEEMVEKESG

Query:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI
        +     T    D V     +K  +V  ++ G             P+N                       ++ + VVETFPL SV   ++  D +   L 
Subjt:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI

Query:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP
              K     +E +       D+G  + +TS  V+E         +   +++ P+ K   + +  +  V++      +        G++HE       
Subjt:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP

Query:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT
         L      P S +    +K+ + K    S       ++  ++ + +E K K+D   +  SQKE+  TLNRI  +SW+G S N  +   NPLL ++KSF+T
Subjt:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT

Query:  AFVKFWSE
        AFVKFWSE
Subjt:  AFVKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein1.4e-0953.57Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI

AT5G58210.2 hydroxyproline-rich glycoprotein family protein1.4e-0953.57Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI

AT5G58210.3 hydroxyproline-rich glycoprotein family protein1.4e-0953.57Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGCCATAAAGGGTGGGTGGACGGGGCGTCCTCTCGCCCTAGCCAAGAACAATGAGGCTGAAGGGAGGAGGACTAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGC
AATGGTTGAAGTTTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTCACTCACAAGGAAGTTGGGGGATCTTTCTATACAGTGCGAGAGA
TTGTACGTGATATAATCCAAGAAAATAGAATCCTTGGTCCAGGAAATTTGTTATTAGAAGAGCACAATACGGATCATTCACTTGAACAGAATCCACTCCACTCAATTGCT
ATTGAACCTCAATCTCCTTTAACCTTATCGTCTAATGAAGTCCATTTTCCAGTCAACTACAACAAATATATAAGTGAAGAACCAATCTTTGTGTCAGATGAGCAATGCAC
AGCAACAAATATTCAGGGATCACAGAATGAGTCAATAATTAATGGAAGCCTTGTGGATGTAAGCAACAAGGATTCTGATGAATTTATCCAGTCAGAGTTGCTAGTAAATG
GACACAAGGAAGTGGAGGAAATGGTTGAAAAAGAATCAGGAATGCCAAAAAATCACGTAACTTCTTTGGCAACAGATGTTGTGCTAGTAAATGAACACAATAAAGTAGAG
GAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATTATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTCCTTGGGATGTTAATGG
TTTTGATGTAAGATCTGAGATATTGATTTCAACTAGTGCCTCAGAGAAGCAAGTTAGTCAATCCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTACAACTTCCG
ACTGTGTAGTTGAGAAAGCAGAGGAAAACCTTACAGAGCCATTAACAAAAACGAAGTCTGATTTGGTGGACGAAGCACAAATTGTTGAAATTTCTAATGGGTCTACTGTT
AAAGAAGGTAGCATACATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAGTATCTGTGAGCTTTGAACAAGGCCAGAAATCTAGTAAAATGAAGTCTCC
AATTGCTTCTGAGAATCTCAACAAGACATTCAGCAATGACTTTGATCAGGCCTCAAAAATCGAGATAAAAAATAAAGTAGATCCTGGACAGACTGGCGGCTCCCAGAAAG
AAAGCGTTCCAACATTAAATAGAATTAATCTTGACTCATGGGAAGGGATGTCCAAAAACTCTTCAAAACCCGGAAACAACCCGCTTTTGGAAATCATCAAGTCATTCATC
ACTGCCTTCGTGAAGTTTTGGTCCGAGTAA
mRNA sequenceShow/hide mRNA sequence
CGCTCCGCCATTTCCATCTCCCCACTGCCCATTTCTGTTGGAAGTTTGTACATTGTCTCTCACTAGTTCCCTCCTATAACTCTTCAGTAGGGTTTTAGCCTTCTTTTATC
TTCAATTCTTCTTCCACTCTTTCATTTCGCGCTCTCTCCCTTTCTAGATCTATACTCTAAGACGAGCTGAACTTTCTGGATTTCATGCATGCCATAAAGGGTGGGTGGAC
GGGGCGTCCTCTCGCCCTAGCCAAGAACAATGAGGCTGAAGGGAGGAGGACTAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTTTTCATAAAAAAGT
ATCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTCACTCACAAGGAAGTTGGGGGATCTTTCTATACAGTGCGAGAGATTGTACGTGATATAATCCAAGAAAAT
AGAATCCTTGGTCCAGGAAATTTGTTATTAGAAGAGCACAATACGGATCATTCACTTGAACAGAATCCACTCCACTCAATTGCTATTGAACCTCAATCTCCTTTAACCTT
ATCGTCTAATGAAGTCCATTTTCCAGTCAACTACAACAAATATATAAGTGAAGAACCAATCTTTGTGTCAGATGAGCAATGCACAGCAACAAATATTCAGGGATCACAGA
ATGAGTCAATAATTAATGGAAGCCTTGTGGATGTAAGCAACAAGGATTCTGATGAATTTATCCAGTCAGAGTTGCTAGTAAATGGACACAAGGAAGTGGAGGAAATGGTT
GAAAAAGAATCAGGAATGCCAAAAAATCACGTAACTTCTTTGGCAACAGATGTTGTGCTAGTAAATGAACACAATAAAGTAGAGGAAGTGGTGAAAGAGGAATCAGGAAT
GCCAATTAATTATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTCCTTGGGATGTTAATGGTTTTGATGTAAGATCTGAGATATTGA
TTTCAACTAGTGCCTCAGAGAAGCAAGTTAGTCAATCCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTACAACTTCCGACTGTGTAGTTGAGAAAGCAGAGGAA
AACCTTACAGAGCCATTAACAAAAACGAAGTCTGATTTGGTGGACGAAGCACAAATTGTTGAAATTTCTAATGGGTCTACTGTTAAAGAAGGTAGCATACATGAAGTTGG
GGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAGTATCTGTGAGCTTTGAACAAGGCCAGAAATCTAGTAAAATGAAGTCTCCAATTGCTTCTGAGAATCTCAACAAGA
CATTCAGCAATGACTTTGATCAGGCCTCAAAAATCGAGATAAAAAATAAAGTAGATCCTGGACAGACTGGCGGCTCCCAGAAAGAAAGCGTTCCAACATTAAATAGAATT
AATCTTGACTCATGGGAAGGGATGTCCAAAAACTCTTCAAAACCCGGAAACAACCCGCTTTTGGAAATCATCAAGTCATTCATCACTGCCTTCGTGAAGTTTTGGTCCGA
GTAAGTAGTATGATTGTCAAGTCAAGTAGACGAATAGAGAGTAGTAGCTAAATTTTCTGCCACAGAACCTGTCTGTCTTTGTACCAAAGTTGCAGTCGGTTGCCCCGTTC
ACTCCAGTCGGTCCCATACACGATATTTATGAGGACAAAACTAGATTCTGGATGTGGGTTGGCATTTGTGTACTCTGCAGTAGGAAAGAAAGTTTAATAGTAGGCATTTC
CCATCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCAGTAAAAAAAAAAGATAAAAAAGATAAGCAGTAGCTTGTACAAGGGATATTTTTCTTATCTTTTTACTATT
AATTAGCATGGCATAATATGTCCCACCGTCCATGATTTTATTATAGATACAAGGGTTTCCTATTATTTCTTCATTACTATATGTTTAAAAAAGGGGGAGGGGGGATGAAG
AAGAAGAAGAAGAAGAAGGAAGAAGAAGAAAAAGGTTGGAGTAACGTGGGAAGGAGGCTTTTGATGGTAAGGTGTTTGTGCGTTAGAAGGAGGACAAGGAAACAAGAGAG
AAAGGCATATAACAGATTGGAGTTTTAATAGGCATCTTCTTTCCCCCCTTTTTGCTCTGACCCGACATCTTATATCATATGGATCATGTAGTCACATGATCTTCTGGATT
CCCCAATTCTTTTATTAGTACTTTTTCACCCC
Protein sequenceShow/hide protein sequence
MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNTDHSLEQNPLHSIA
IEPQSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNKDSDEFIQSELLVNGHKEVEEMVEKESGMPKNHVTSLATDVVLVNEHNKVE
EVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKAEENLTEPLTKTKSDLVDEAQIVEISNGSTV
KEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFI
TAFVKFWSE