; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy2G024010 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy2G024010
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPlastid envelope DNA binding protein
Genome locationGy14Chr2:31792621..31798609
RNA-Seq ExpressionCsGy2G024010
SyntenyCsGy2G024010
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057131.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa]7.39e-28891.78Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

TYJ97112.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa]1.75e-27087.78Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

XP_004138835.1 uncharacterized protein LOC101202832 [Cucumis sativus]0.0100Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
        MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS

Query:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK
        LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK
Subjt:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK

Query:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
        NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
Subjt:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA

Query:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
        EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
Subjt:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG

Query:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
Subjt:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

XP_008456554.1 PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo]6.44e-28992Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

XP_008456557.1 PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo]1.53e-27188Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

TrEMBL top hitse value%identityAlignment
A0A0A0LML9 Uncharacterized protein0.0100Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
        MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS

Query:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK
        LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK
Subjt:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK

Query:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
        NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
Subjt:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA

Query:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
        EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
Subjt:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG

Query:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
Subjt:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A1S3C344 uncharacterized protein LOC103496473 isoform X27.41e-27288Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A1S3C473 uncharacterized protein LOC103496473 isoform X13.12e-28992Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein3.58e-28891.78Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A5D3BB97 Plastid envelope DNA binding protein8.45e-27187.78Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding2.1e-4531.1Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
        MH++K    G+  ALAK +++ G+RTR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENR+LGPG+LLLE +     
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS

Query:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNEDSDEFIQSELLVNGHKEVEEMVEKESG
         +Q+   SI ++P  PL+LS N  H     +  + SE P   V+  Q    N +      ++   +  V  + DS +   ++L  +  ++ +  ++  +G
Subjt:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNEDSDEFIQSELLVNGHKEVEEMVEKESG

Query:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI
        +     T    D V     +K  +V  ++ G             P+N                       ++ + VVETFPL SV   ++  D +   L 
Subjt:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI

Query:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP
              K     +E +       D+G  + +TS  V+E         +   +++ P+ K   + +  +  V++      +        G++HE       
Subjt:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP

Query:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT
         L      P S +    +K+ + K    S       ++  ++ + +E K K+D   +  SQKE+  TLNRI  +SW+G S N  +   NPLL ++KSF+T
Subjt:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT

Query:  AFVKFWSE
        AFVKFWSE
Subjt:  AFVKFWSE

AT3G52170.2 DNA binding2.1e-4531.1Show/hide
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
        MH++K    G+  ALAK +++ G+RTR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENR+LGPG+LLLE +     
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS

Query:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNEDSDEFIQSELLVNGHKEVEEMVEKESG
         +Q+   SI ++P  PL+LS N  H     +  + SE P   V+  Q    N +      ++   +  V  + DS +   ++L  +  ++ +  ++  +G
Subjt:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNEDSDEFIQSELLVNGHKEVEEMVEKESG

Query:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI
        +     T    D V     +K  +V  ++ G             P+N                       ++ + VVETFPL SV   ++  D +   L 
Subjt:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI

Query:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP
              K     +E +       D+G  + +TS  V+E         +   +++ P+ K   + +  +  V++      +        G++HE       
Subjt:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP

Query:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT
         L      P S +    +K+ + K    S       ++  ++ + +E K K+D   +  SQKE+  TLNRI  +SW+G S N  +   NPLL ++KSF+T
Subjt:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT

Query:  AFVKFWSE
        AFVKFWSE
Subjt:  AFVKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein1.4e-0953.57Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI

AT5G58210.2 hydroxyproline-rich glycoprotein family protein1.4e-0953.57Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI

AT5G58210.3 hydroxyproline-rich glycoprotein family protein1.4e-0953.57Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGCCATAAAGGGTGGGTGGACGGGGCGTCCTCTCGCCCTAGCCAAGAACAATGAGGCTGAAGGGAGAAGGACTAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGC
AATGGTTGAAGTTTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTCACTCACAAGGAAGTTGGGGGATCTTTCTATACGGTGCGAGAGA
TTGTACGTGATATAATCCAAGAAAATAGAATCCTTGGTCCAGGAAATTTGTTATTAGAAGAGCACAATCCGGATCATTCACTTGAACAGAATCCACTCCACTCAATTGCT
ATTGAACCTCACTCTCCTTTAACCTTATCGTCTAATGAAGTCCATTTTCCAGTCAACTACAACAAATATATAAGTGAAGAACCAATCTTTGTGTCAGATGAGCAATGCAC
AGCAACAAATATTCAGGGATCACAGAATGAGTCAATAATTAATGGAAGCCTTGTGGATGTAAGCAACGAGGATTCTGATGAATTTATCCAGTCAGAGTTGCTAGTAAATG
GACACAAGGAAGTGGAGGAAATGGTTGAAAAAGAATCAGGAATGCCAAAAAATCACGTAACTTCTTTGGCAACAGATGTTGTGCTAGTAAATGAACACAATAAAGTAGAG
GAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATTATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTCCTTGGGATGTTAATGG
TTTTGATGTAAGATCTGAGATATTGATTTCAACTAGTGCCTCAGAGAAGCAAGTTAGTCAATCCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTACAACTTCCG
ACTGTGTAGTTGAGAAAGCAGAGGAAAACCTTACAGAGCCATTAACAAAAACGAAGTCTGATTTGGTGGACGAAGCACAAATTGTTGAAATTTCTAATGGGTCTACTGTT
AAAGAAGGTAGCATACATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAGTATCTGTGAGCTTTGAACAAGGCCAGAAATCTAGTAAAATGAAGTCTCC
AATTGCTTCTGAGAATCTCAACAAGACATTCAGCAATGACTTTGATCAGGCCTCAAAAATCGAGATAAAAAATAAAGTAGATCCTGGACAGACTGGCGGCTCCCAGAAAG
AAAGCGTTCCAACATTAAATAGAATTAATCTTGACTCATGGGAAGGGATGTCCAAAAACTCTTCAAAACCCGGAAACAACCCGCTTTTGGAAATCATCAAGTCATTCATC
ACTGCCTTCGTGAAGTTTTGGTCCGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTAATGGTTCAAAGGTCCATTAAACTTGGACGGCCCAATCCATCCAGTATAACAATAAGACATCAATTTCAACATAATTACATAAATTTCCCACGCTCCGCCATTT
CCATCTCCCCACTGCCCATTTCTGTTGGAAGTTTGTACATTGTCTCTCACTAGTTCCCTCCTATAACTCTTCAGTAGGGTTTTAGCCTTCTTTTATCTTCAATTCTTCTT
CTACTCTTTCATTTCGCGCTCTCTCCCCTTCTAGCGGCAATTTGTTCATGCTCATTTCTGGTCTTTCAACTTCAACTAATTGGTAGCGCAAAAAGTGGATATTTGGATGT
GCTTTATCAGAAGTTGTGACTTGTGATAAGGACTAAAGTTAGCTTCCAGTATACTTATGCGATGCAGACTTGTTAAAATTCAGGGTTGAGCGAGATTAAACAATCGCTCC
ACTGGTGCAGGTTGTTCAGAATAATTTATTATTTTGTCTAAAAGGGAAGTCATCTGTTCCTGTATGATCTGTTAGATGAACTCTTCAATCCATTTGTGAATTTCCGATGG
ATCTTATCTCATCCAATTCATTCTGAAACACAAATATTATATGGGCGGACTCATCTTATTACTGCTCTGTTATATTGTAGATCTATACTCTAAGACGAGCTGAACTTTCT
GGATTTCATGCATGCCATAAAGGGTGGGTGGACGGGGCGTCCTCTCGCCCTAGCCAAGAACAATGAGGCTGAAGGGAGAAGGACTAGAATTCGGCGTTCAAAGGAGGAAA
GGAAGGCAATGGTTGAAGTTTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTCACTCACAAGGAAGTTGGGGGATCTTTCTATACGGTG
CGAGAGATTGTACGTGATATAATCCAAGAAAATAGAATCCTTGGTCCAGGAAATTTGTTATTAGAAGAGCACAATCCGGATCATTCACTTGAACAGAATCCACTCCACTC
AATTGCTATTGAACCTCACTCTCCTTTAACCTTATCGTCTAATGAAGTCCATTTTCCAGTCAACTACAACAAATATATAAGTGAAGAACCAATCTTTGTGTCAGATGAGC
AATGCACAGCAACAAATATTCAGGGATCACAGAATGAGTCAATAATTAATGGAAGCCTTGTGGATGTAAGCAACGAGGATTCTGATGAATTTATCCAGTCAGAGTTGCTA
GTAAATGGACACAAGGAAGTGGAGGAAATGGTTGAAAAAGAATCAGGAATGCCAAAAAATCACGTAACTTCTTTGGCAACAGATGTTGTGCTAGTAAATGAACACAATAA
AGTAGAGGAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATTATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTCCTTGGGATG
TTAATGGTTTTGATGTAAGATCTGAGATATTGATTTCAACTAGTGCCTCAGAGAAGCAAGTTAGTCAATCCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTACA
ACTTCCGACTGTGTAGTTGAGAAAGCAGAGGAAAACCTTACAGAGCCATTAACAAAAACGAAGTCTGATTTGGTGGACGAAGCACAAATTGTTGAAATTTCTAATGGGTC
TACTGTTAAAGAAGGTAGCATACATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAGTATCTGTGAGCTTTGAACAAGGCCAGAAATCTAGTAAAATGA
AGTCTCCAATTGCTTCTGAGAATCTCAACAAGACATTCAGCAATGACTTTGATCAGGCCTCAAAAATCGAGATAAAAAATAAAGTAGATCCTGGACAGACTGGCGGCTCC
CAGAAAGAAAGCGTTCCAACATTAAATAGAATTAATCTTGACTCATGGGAAGGGATGTCCAAAAACTCTTCAAAACCCGGAAACAACCCGCTTTTGGAAATCATCAAGTC
ATTCATCACTGCCTTCGTGAAGTTTTGGTCCGAGTAAGTAGTATGATTGTCAAGTCAAGTAGACGAATAGAGAGTAGTAGCTAAATTTTCTGCCACAGAACCTGTCTGTC
TTTGTACCAAAGTTGCAGTCGGTTGCCCCGTTCACTCCAGTCGGTCCCATACACGATATTTAGGAGGACAAAACTAGATTCTGGATGTGGGTTGGCATTTGTGTACTCTG
CAGTAGGAAAGAAAGTTTTAATAGTAGGCATTTCCCACCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCAGTAAAAAAAATGATAAAAAAGATAAGCAGTAGCTTGTACAA
GGGATATTTTTCTCATCTTTTTACTATTAATTAGCATGGCATAATATGTCCCACCGTCCATGATTTTATTATAGATACAAGGGTTTCCTATTATTTCTTCATTACTATAT
GTTAAAAAAAGGGGGAGGGGGGATGAAGAAGAAGAAGAAGAAGAAGAAGAAGGAAGAAGAAAAAGGTTGGAGTAACGTGGGAAGGAGGCTTTTGATGGTAAGGTGTTTGT
GCGTTAGAAGGAGGACAAGGAAACAAGAGAGAAAGGCATATAACAGATTGGAGTTTCAATAGGCATCTTCTTTCCCCCCTTTTTGCTCTGACCCGACATCTTATATCATA
TGGATCATGTAGTCACATGATCTTCTGGATTCCCCAATTCTTTTATTAGTACTTTTTCACCCCTTTTTTTCTTTTAAAAATATCAATTTTTTCTTTTTAATAAAACGACC
TTCCAATTAATATACCATCTTGTTTTGGTCAGTTTCAAAATTACTCATTTTGTTCTCATGCCTTATTGTGTCTATACAACCCTTCAAGTTTGTTTCAGTCGAATTCAACC
CATAAACTTAACATTGGATCACATCTGATTTTTGGTTCTTCAAGTTTTCAAATTTCAAATTTTGTTTTATCTTTTCTACTAGA
Protein sequenceShow/hide protein sequence
MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIA
IEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPKNHVTSLATDVVLVNEHNKVE
EVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKAEENLTEPLTKTKSDLVDEAQIVEISNGSTV
KEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFI
TAFVKFWSE