CuGenDBv2

Cucsat.G5078 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G5078
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Plastid envelope DNA binding protein
Genome location: ctg1227:3274481..3282043
RNA-Seq Expression: Cucsat.G5078
Synteny: Cucsat.G5078
Gene Ontology terms: GO:0016020 - membrane (cellular component)
                     GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
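The genome location in this record uses a `contig:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its coordinate convention), the genomic span of the gene can be computed as below. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("ctg1227:3274481..3282043")
# Assumption: 1-based inclusive coordinates, hence the +1.
span = end - start + 1
print(contig, span)  # ctg1227 7563
```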


Homology
GenBank top hits | e value | % identity
KAA0057131.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa] | 1.43e-287 | 91.78
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

TYJ97112.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa] | 3.38e-270 | 87.78
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

XP_004138835.1 uncharacterized protein LOC101202832 [Cucumis sativus] | 0.0 | 100
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
        MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS

Query:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK
        LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK
Subjt:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK

Query:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
        NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
Subjt:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA

Query:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
        EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
Subjt:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG

Query:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
Subjt:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

XP_008456554.1 PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo] | 1.25e-288 | 92
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

XP_008456557.1 PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo] | 2.97e-271 | 88
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

TrEMBL top hits | e value | % identity
A0A0A0LML9 Uncharacterized protein | 0.0 | 100
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
        MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS

Query:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK
        LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK
Subjt:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPK

Query:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
        NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA
Subjt:  NHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKA

Query:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
        EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG
Subjt:  EENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGG

Query:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
Subjt:  SQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A1S3C344 uncharacterized protein LOC103496473 isoform X2 | 1.44e-271 | 88
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A1S3C473 uncharacterized protein LOC103496473 isoform X1 | 6.04e-289 | 92
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWSE
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein | 6.93e-288 | 91.78
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

A0A5D3BB97 Plastid envelope DNA binding protein | 1.64e-270 | 87.78
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH
        MHAIKGGWTGRPLALAKNNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPG LLLEE HN DH
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEE-HNPDH

Query:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP
        SL+QNPLHSIAIEP SPLTLSS EVHFP+NYNKYI+EEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVN HKEVEE+VEKESGMP
Subjt:  SLEQNPLHSIAIEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMP

Query:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK
        KNH T LATDV LVNEHNKVEEVVKEESGMPIN+VTPLATDVVVETFPLD VPWDVNG DVRSEILIST+ASEKQVSQSIELESDVGL NIT SD VVEK
Subjt:  KNHVTSLATDVVLVNEHNKVEEVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEK

Query:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG
        A EN   PL++TKSDLV+ AQIVEISNGSTVKEGS+HEVGGPELEVCSDTP+SV+FEQGQKSSKMK                  ASKIEI+NKVDPGQTG
Subjt:  AEENLTEPLTKTKSDLVDEAQIVEISNGSTVKEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTG

Query:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE
        GSQKESVPTLNRINL+SWEGMSKNSSKP NNPLLEIIKSFI AFVKFWS+
Subjt:  GSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFITAFVKFWSE

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT3G52170.1 DNA binding | 3.5e-45 | 31.1
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
        MH++K    G+  ALAK +++ G+RTR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENR+LGPG+LLLE +     
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS

Query:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNEDSDEFIQSELLVNGHKEVEEMVEKESG
         +Q+   SI ++P  PL+LS N  H     +  + SE P   V+  Q    N +      ++   +  V  + DS +   ++L  +  ++ +  ++  +G
Subjt:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNEDSDEFIQSELLVNGHKEVEEMVEKESG

Query:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI
        +     T    D V     +K  +V  ++ G             P+N                       ++ + VVETFPL SV   ++  D +   L 
Subjt:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI

Query:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP
              K     +E +       D+G  + +TS  V+E         +   +++ P+ K   + +  +  V++      +        G++HE       
Subjt:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP

Query:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT
         L      P S +    +K+ + K    S       ++  ++ + +E K K+D   +  SQKE+  TLNRI  +SW+G S N  +   NPLL ++KSF+T
Subjt:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT

Query:  AFVKFWSE
        AFVKFWSE
Subjt:  AFVKFWSE

AT3G52170.2 DNA binding | 3.5e-45 | 31.1
Query:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS
        MH++K    G+  ALAK +++ G+RTR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENR+LGPG+LLLE +     
Subjt:  MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHS

Query:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNEDSDEFIQSELLVNGHKEVEEMVEKESG
         +Q+   SI ++P  PL+LS N  H     +  + SE P   V+  Q    N +      ++   +  V  + DS +   ++L  +  ++ +  ++  +G
Subjt:  LEQNPLHSIAIEPHSPLTLSSNEVHFPVNYN-KYISEEPI-FVSDEQCTATNIQGSQNESIINGSLVDV-SNEDSDEFIQSELLVNGHKEVEEMVEKESG

Query:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI
        +     T    D V     +K  +V  ++ G             P+N                       ++ + VVETFPL SV   ++  D +   L 
Subjt:  MPKNHVTSLATDVVLVNEHNKVEEVVKEESGM------------PINY-------------------VTPLATDVVVETFPLDSVPWDVNGFDVRSEILI

Query:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP
              K     +E +       D+G  + +TS  V+E         +   +++ P+ K   + +  +  V++      +        G++HE       
Subjt:  STSASEKQVSQSIELES------DVGLFNITTSDCVVE---------KAEENLTEPLTKTKSDLVDEAQIVEISNGSTVKE-------GSIHEV---GGP

Query:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT
         L      P S +    +K+ + K    S       ++  ++ + +E K K+D   +  SQKE+  TLNRI  +SW+G S N  +   NPLL ++KSF+T
Subjt:  ELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFIT

Query:  AFVKFWSE
        AFVKFWSE
Subjt:  AFVKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein | 1.4e-09 | 53.57
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI

AT5G58210.2 hydroxyproline-rich glycoprotein family protein | 1.4e-09 | 53.57
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI

AT5G58210.3 hydroxyproline-rich glycoprotein family protein | 1.4e-09 | 53.57
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRI


Sequences
CDS sequence
ATGCATGCCATAAAGGGTGGGTGGACGGGGCGTCCTCTCGCCCTAGCCAAGAACAATGAGGCTGAAGGGAGAAGGACTAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGC
AATGGTTGAAGTTTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTCACTCACAAGGAAGTTGGGGGATCTTTCTATACGGTGCGAGAGA
TTGTACGTGATATAATCCAAGAAAATAGAATCCTTGGTCCAGGAAATTTGTTATTAGAAGAGCACAATCCGGATCATTCACTTGAACAGAATCCACTCCACTCAATTGCT
ATTGAACCTCACTCTCCTTTAACCTTATCGTCTAATGAAGTCCATTTTCCAGTCAACTACAACAAATATATAAGTGAAGAACCAATCTTTGTGTCAGATGAGCAATGCAC
AGCAACAAATATTCAGGGATCACAGAATGAGTCAATAATTAATGGAAGCCTTGTGGATGTAAGCAACGAGGATTCTGATGAATTTATCCAGTCAGAGTTGCTAGTAAATG
GACACAAGGAAGTGGAGGAAATGGTTGAAAAAGAATCAGGAATGCCAAAAAATCACGTAACTTCTTTGGCAACAGATGTTGTGCTAGTAAATGAACACAATAAAGTAGAG
GAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATTATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTCCTTGGGATGTTAATGG
TTTTGATGTAAGATCTGAGATATTGATTTCAACTAGTGCCTCAGAGAAGCAAGTTAGTCAATCCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTACAACTTCCG
ACTGTGTAGTTGAGAAAGCAGAGGAAAACCTTACAGAGCCATTAACAAAAACGAAGTCTGATTTGGTGGACGAAGCACAAATTGTTGAAATTTCTAATGGGTCTACTGTT
AAAGAAGGTAGCATACATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAGTATCTGTGAGCTTTGAACAAGGCCAGAAATCTAGTAAAATGAAGTCTCC
AATTGCTTCTGAGAATCTCAACAAGACATTCAGCAATGACTTTGATCAGGCCTCAAAAATCGAGATAAAAAATAAAGTAGATCCTGGACAGACTGGCGGCTCCCAGAAAG
AAAGCGTTCCAACATTAAATAGAATTAATCTTGACTCATGGGAAGGGATGTCCAAAAACTCTTCAAAACCCGGAAACAACCCGCTTTTGGAAATCATCAAGTCATTCATC
ACTGCCTTCGTGAAGTTTTGGTCCGAGTAA
mRNA sequence
ATGCATGCCATAAAGGGTGGGTGGACGGGGCGTCCTCTCGCCCTAGCCAAGAACAATGAGGCTGAAGGGAGAAGGACTAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGC
AATGGTTGAAGTTTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTCACTCACAAGGAAGTTGGGGGATCTTTCTATACGGTGCGAGAGA
TTGTACGTGATATAATCCAAGAAAATAGAATCCTTGGTCCAGGAAATTTGTTATTAGAAGAGCACAATCCGGATCATTCACTTGAACAGAATCCACTCCACTCAATTGCT
ATTGAACCTCACTCTCCTTTAACCTTATCGTCTAATGAAGTCCATTTTCCAGTCAACTACAACAAATATATAAGTGAAGAACCAATCTTTGTGTCAGATGAGCAATGCAC
AGCAACAAATATTCAGGGATCACAGAATGAGTCAATAATTAATGGAAGCCTTGTGGATGTAAGCAACGAGGATTCTGATGAATTTATCCAGTCAGAGTTGCTAGTAAATG
GACACAAGGAAGTGGAGGAAATGGTTGAAAAAGAATCAGGAATGCCAAAAAATCACGTAACTTCTTTGGCAACAGATGTTGTGCTAGTAAATGAACACAATAAAGTAGAG
GAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATTATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTCCTTGGGATGTTAATGG
TTTTGATGTAAGATCTGAGATATTGATTTCAACTAGTGCCTCAGAGAAGCAAGTTAGTCAATCCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTACAACTTCCG
ACTGTGTAGTTGAGAAAGCAGAGGAAAACCTTACAGAGCCATTAACAAAAACGAAGTCTGATTTGGTGGACGAAGCACAAATTGTTGAAATTTCTAATGGGTCTACTGTT
AAAGAAGGTAGCATACATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAGTATCTGTGAGCTTTGAACAAGGCCAGAAATCTAGTAAAATGAAGTCTCC
AATTGCTTCTGAGAATCTCAACAAGACATTCAGCAATGACTTTGATCAGGCCTCAAAAATCGAGATAAAAAATAAAGTAGATCCTGGACAGACTGGCGGCTCCCAGAAAG
AAAGCGTTCCAACATTAAATAGAATTAATCTTGACTCATGGGAAGGGATGTCCAAAAACTCTTCAAAACCCGGAAACAACCCGCTTTTGGAAATCATCAAGTCATTCATC
ACTGCCTTCGTGAAGTTTTGGTCCGAGTAA
Protein sequence
MHAIKGGWTGRPLALAKNNEAEGRRTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRILGPGNLLLEEHNPDHSLEQNPLHSIA
IEPHSPLTLSSNEVHFPVNYNKYISEEPIFVSDEQCTATNIQGSQNESIINGSLVDVSNEDSDEFIQSELLVNGHKEVEEMVEKESGMPKNHVTSLATDVVLVNEHNKVE
EVVKEESGMPINYVTPLATDVVVETFPLDSVPWDVNGFDVRSEILISTSASEKQVSQSIELESDVGLFNITTSDCVVEKAEENLTEPLTKTKSDLVDEAQIVEISNGSTV
KEGSIHEVGGPELEVCSDTPVSVSFEQGQKSSKMKSPIASENLNKTFSNDFDQASKIEIKNKVDPGQTGGSQKESVPTLNRINLDSWEGMSKNSSKPGNNPLLEIIKSFI
TAFVKFWSE
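
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not taken from the database. Only the first and last 30 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 30 nucleotides of the CDS listed in this record.
cds_start = "ATGCATGCCATAAAGGGTGGGTGGACGGGG"
cds_end = "ACTGCCTTCGTGAAGTTTTGGTCCGAGTAA"

print(translate(cds_start))  # MHAIKGGWTG
print(translate(cds_end))    # TAFVKFWSE*
```

Both fragments agree with the start (`MHAIKGGWTG`) and end (`TAFVKFWSE`) of the protein sequence, with the final `TAA` codon read as the stop.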