; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi04G001041 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi04G001041
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionPlastid envelope DNA binding protein
Genome locationchr4:33026477..33031909
RNA-Seq ExpressionBhi04G001041
SyntenyBhi04G001041
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057131.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa]6.1e-19079.43Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
        MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DH
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH

Query:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------
        SL++NPLHSIAIEPQSPLTLS+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SELLVNEHK             
Subjt:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA
                         KVEEVVKEESGMPINHVTPLATDVVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Subjt:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA

Query:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK
         ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E+ENK
Subjt:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK

Query:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE
        VD GQTGGSQKES+PTLNRINLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWS+
Subjt:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE

XP_004138835.1 uncharacterized protein LOC101202832 [Cucumis sativus]3.6e-18276.42Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
        MHAIKGGWTG PLALA+NNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPG LL  EEH  DH
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH

Query:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESEL-------------------
        SLE+NPLHSIAIEP SPLTLS+ EVHFP+NY++ I+EEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SEL                   
Subjt:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESEL-------------------

Query:  ------------LVNEHKKVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                    LVNEH KVEEVVKEESGMPIN+VTPLATDVVVETFPLDS  W VNG DVRSEILISTSASEKQVSQ+IELESDVGLFNI  S CVVEK
Subjt:  ------------LVNEHKKVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEENFAGPL--SESDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMEN
        AEEN   PL  ++SD+V+ AQIVE SNGSTVKEG I+EVGGPELEVCSDTP+SV+FEQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E++N
Subjt:  AEENFAGPL--SESDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE
        KVD GQTGGSQKES+PTLNRINL+SWEGMSKNSSK  NNP+LEI K+FI AFVKFWSE
Subjt:  KVDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE

XP_008456554.1 PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo]2.7e-19079.65Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
        MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DH
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH

Query:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------
        SL++NPLHSIAIEPQSPLTLS+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SELLVNEHK             
Subjt:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA
                         KVEEVVKEESGMPINHVTPLATDVVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Subjt:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA

Query:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK
         ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E+ENK
Subjt:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK

Query:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE
        VD GQTGGSQKES+PTLNRINLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWSE
Subjt:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE

XP_008456557.1 PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo]5.7e-18076.37Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
        MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DH
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH

Query:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------
        SL++NPLHSIAIEPQSPLTLS+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SELLVNEHK             
Subjt:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA
                         KVEEVVKEESGMPINHVTPLATDVVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Subjt:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA

Query:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK
         ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQKSS+MK                      ASKI    E+ENK
Subjt:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK

Query:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE
        VD GQTGGSQKES+PTLNRINLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWSE
Subjt:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE

XP_038886590.1 uncharacterized protein LOC120076760 [Benincasa hispida]9.6e-236100Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
        MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH

Query:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHKKVEEVVKEESGMP
        SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHKKVEEVVKEESGMP
Subjt:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHKKVEEVVKEESGMP

Query:  INHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAEENFAGPLSESDMVEAAQIVETSNGSTVKE
        INHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAEENFAGPLSESDMVEAAQIVETSNGSTVKE
Subjt:  INHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAEENFAGPLSESDMVEAAQIVETSNGSTVKE

Query:  GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLESWEGMSKNS
        GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLESWEGMSKNS
Subjt:  GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLESWEGMSKNS

Query:  SKAENNPVLEIFKAFIAAFVKFWSE
        SKAENNPVLEIFKAFIAAFVKFWSE
Subjt:  SKAENNPVLEIFKAFIAAFVKFWSE

TrEMBL top hitse value%identityAlignment
A0A0A0LML9 Uncharacterized protein1.7e-18276.42Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
        MHAIKGGWTG PLALA+NNEAEGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPG LL  EEH  DH
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH

Query:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESEL-------------------
        SLE+NPLHSIAIEP SPLTLS+ EVHFP+NY++ I+EEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SEL                   
Subjt:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESEL-------------------

Query:  ------------LVNEHKKVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                    LVNEH KVEEVVKEESGMPIN+VTPLATDVVVETFPLDS  W VNG DVRSEILISTSASEKQVSQ+IELESDVGLFNI  S CVVEK
Subjt:  ------------LVNEHKKVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEENFAGPL--SESDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMEN
        AEEN   PL  ++SD+V+ AQIVE SNGSTVKEG I+EVGGPELEVCSDTP+SV+FEQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E++N
Subjt:  AEENFAGPL--SESDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE
        KVD GQTGGSQKES+PTLNRINL+SWEGMSKNSSK  NNP+LEI K+FI AFVKFWSE
Subjt:  KVDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE

A0A1S3C344 uncharacterized protein LOC103496473 isoform X22.8e-18076.37Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
        MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DH
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH

Query:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------
        SL++NPLHSIAIEPQSPLTLS+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SELLVNEHK             
Subjt:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA
                         KVEEVVKEESGMPINHVTPLATDVVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Subjt:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA

Query:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK
         ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQKSS+MK                      ASKI    E+ENK
Subjt:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK

Query:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE
        VD GQTGGSQKES+PTLNRINLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWSE
Subjt:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE

A0A1S3C473 uncharacterized protein LOC103496473 isoform X11.3e-19079.65Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
        MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DH
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH

Query:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------
        SL++NPLHSIAIEPQSPLTLS+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SELLVNEHK             
Subjt:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA
                         KVEEVVKEESGMPINHVTPLATDVVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Subjt:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA

Query:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK
         ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E+ENK
Subjt:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK

Query:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE
        VD GQTGGSQKES+PTLNRINLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWSE
Subjt:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein2.9e-19079.43Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
        MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DH
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH

Query:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------
        SL++NPLHSIAIEPQSPLTLS+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SELLVNEHK             
Subjt:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA
                         KVEEVVKEESGMPINHVTPLATDVVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Subjt:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA

Query:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK
         ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E+ENK
Subjt:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK

Query:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE
        VD GQTGGSQKES+PTLNRINLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWS+
Subjt:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE

A0A5D3BB97 Plastid envelope DNA binding protein6.2e-18076.15Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH
        MHAIKGGWTG PLALA+NNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLL EEEH  DH
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDH

Query:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------
        SL++NPLHSIAIEPQSPLTLS+KEVHFP+NY++ INEEPIFVSDEQCT TNIQGSQN  IINGSLVD+++++  EFI+SELLVNEHK             
Subjt:  SLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA
                         KVEEVVKEESGMPINHVTPLATDVVVETFPLD   W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEKA
Subjt:  -----------------KVEEVVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKA

Query:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK
         ENFAGPLSE  SD+VE AQIVE SNGSTVKEG ++EVGGPELEVCSDTPISV FEQGQKSS+MK                      ASKI    E+ENK
Subjt:  EENFAGPLSE--SDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENK

Query:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE
        VD GQTGGSQKES+PTLNRINLESWEGMSKNSSK ENNP+LEI K+FIAAFVKFWS+
Subjt:  VDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding5.4e-4332.55Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRI-D
        MH++K    G   ALA+ +++ G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG LL E    + D
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRI-D

Query:  HSLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDIN---------------------------EEPIFVSDEQCTTTNIQGSQ--------NGPIINGS
         SL      SI ++P  PL+LS    H       D +                           +E I +  +   +T+I  +Q        N    N  
Subjt:  HSLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDIN---------------------------EEPIFVSDEQCTTTNIQGSQ--------NGPIINGS

Query:  L-------------------VDINDKEPG----EFIESE--LLVNEHKKVEE----VVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEIL
        L                   +D+++K+ G     F+ES+    VN   +V +    + + ++G+       ++ + VVETFPL S +  ++  D +   L
Subjt:  L-------------------VDINDKEPG----EFIESE--LLVNEHKKVEE----VVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEIL

Query:  ISTSASEKQVSQTIELES------DVGLFNIKASGCVVEK-AEENFAGPLSESDMVEAAQIV--ETSNGSTV-------KEGIIYEVGGPELEVCSDTPI
               K     +E +       D+G  +   S  V+E    E   G +     V   + V  E  N ++V       KE ++  V G    V  +   
Subjt:  ISTSASEKQVSQTIELES------DVGLFNIKASGCVVEK-AEENFAGPLSESDMVEAAQIV--ETSNGSTV-------KEGIIYEVGGPELEVCSDTPI

Query:  SVTFEQGQKSSEMKAPNASPSTIENLN-----KTFSN--GFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIF
        +  F  G  ++E K P +S  +    N      T S+  G + AS  K+ T  + K+DA  +  SQKE+  TLNRI  ESW+G S N  + E NP+L + 
Subjt:  SVTFEQGQKSSEMKAPNASPSTIENLN-----KTFSN--GFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIF

Query:  KAFIAAFVKFWSE
        K+F+ AFVKFWSE
Subjt:  KAFIAAFVKFWSE

AT3G52170.2 DNA binding5.4e-4332.55Show/hide
Query:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRI-D
        MH++K    G   ALA+ +++ G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG LL E    + D
Subjt:  MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRI-D

Query:  HSLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDIN---------------------------EEPIFVSDEQCTTTNIQGSQ--------NGPIINGS
         SL      SI ++P  PL+LS    H       D +                           +E I +  +   +T+I  +Q        N    N  
Subjt:  HSLEENPLHSIAIEPQSPLTLSAKEVHFPINYDQDIN---------------------------EEPIFVSDEQCTTTNIQGSQ--------NGPIINGS

Query:  L-------------------VDINDKEPG----EFIESE--LLVNEHKKVEE----VVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEIL
        L                   +D+++K+ G     F+ES+    VN   +V +    + + ++G+       ++ + VVETFPL S +  ++  D +   L
Subjt:  L-------------------VDINDKEPG----EFIESE--LLVNEHKKVEE----VVKEESGMPINHVTPLATDVVVETFPLDSGSWGVNGSDVRSEIL

Query:  ISTSASEKQVSQTIELES------DVGLFNIKASGCVVEK-AEENFAGPLSESDMVEAAQIV--ETSNGSTV-------KEGIIYEVGGPELEVCSDTPI
               K     +E +       D+G  +   S  V+E    E   G +     V   + V  E  N ++V       KE ++  V G    V  +   
Subjt:  ISTSASEKQVSQTIELES------DVGLFNIKASGCVVEK-AEENFAGPLSESDMVEAAQIV--ETSNGSTV-------KEGIIYEVGGPELEVCSDTPI

Query:  SVTFEQGQKSSEMKAPNASPSTIENLN-----KTFSN--GFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIF
        +  F  G  ++E K P +S  +    N      T S+  G + AS  K+ T  + K+DA  +  SQKE+  TLNRI  ESW+G S N  + E NP+L + 
Subjt:  SVTFEQGQKSSEMKAPNASPSTIENLN-----KTFSN--GFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIF

Query:  KAFIAAFVKFWSE
        K+F+ AFVKFWSE
Subjt:  KAFIAAFVKFWSE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein8.1e-0723.01Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLEEEEHRI--DHSLEENPLHSIAIEPQSPLTLSAKE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E    +  D S   +P     +E ++   LS   
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLEEEEHRI--DHSLEENPLHSIAIEPQSPLTLSAKE

Query:  VHFPINYDQDINEEPIFVSDEQ----------CTTTNIQGSQNGPIING-SLVDINDKEPGE----FIESELLVNEHKKVEEVVKEESGMPINHVTPLAT
           P +    ++  P+ + + +            T++       PI+   +L  ++   P +    F  + + + E K + EV         +H +P   
Subjt:  VHFPINYDQDINEEPIFVSDEQ----------CTTTNIQGSQNGPIING-SLVDINDKEPGE----FIESELLVNEHKKVEEVVKEESGMPINHVTPLAT

Query:  DVVVETFPLDSGSWGVN-GSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVV----EKAEENFAGPLSESDMVEAAQIVETSNGSTVKE----
         VVVE     +GS  +  GS+ R +I+ ++ ++    S            N + +  V+    +K E      +  S+  E   +    N  +  +    
Subjt:  DVVVETFPLDSGSWGVN-GSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVV----EKAEENFAGPLSESDMVEAAQIVETSNGSTVKE----

Query:  -GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSN
          +  +V   E    S+T      E   ++ E+K  ++S S I +  K F+N
Subjt:  -GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSN

AT5G58210.2 hydroxyproline-rich glycoprotein family protein8.1e-0723.01Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLEEEEHRI--DHSLEENPLHSIAIEPQSPLTLSAKE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E    +  D S   +P     +E ++   LS   
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLEEEEHRI--DHSLEENPLHSIAIEPQSPLTLSAKE

Query:  VHFPINYDQDINEEPIFVSDEQ----------CTTTNIQGSQNGPIING-SLVDINDKEPGE----FIESELLVNEHKKVEEVVKEESGMPINHVTPLAT
           P +    ++  P+ + + +            T++       PI+   +L  ++   P +    F  + + + E K + EV         +H +P   
Subjt:  VHFPINYDQDINEEPIFVSDEQ----------CTTTNIQGSQNGPIING-SLVDINDKEPGE----FIESELLVNEHKKVEEVVKEESGMPINHVTPLAT

Query:  DVVVETFPLDSGSWGVN-GSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVV----EKAEENFAGPLSESDMVEAAQIVETSNGSTVKE----
         VVVE     +GS  +  GS+ R +I+ ++ ++    S            N + +  V+    +K E      +  S+  E   +    N  +  +    
Subjt:  DVVVETFPLDSGSWGVN-GSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVV----EKAEENFAGPLSESDMVEAAQIVETSNGSTVKE----

Query:  -GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSN
          +  +V   E    S+T      E   ++ E+K  ++S S I +  K F+N
Subjt:  -GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSN

AT5G58210.3 hydroxyproline-rich glycoprotein family protein8.1e-0723.01Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLEEEEHRI--DHSLEENPLHSIAIEPQSPLTLSAKE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E    +  D S   +P     +E ++   LS   
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKLLEEEEHRI--DHSLEENPLHSIAIEPQSPLTLSAKE

Query:  VHFPINYDQDINEEPIFVSDEQ----------CTTTNIQGSQNGPIING-SLVDINDKEPGE----FIESELLVNEHKKVEEVVKEESGMPINHVTPLAT
           P +    ++  P+ + + +            T++       PI+   +L  ++   P +    F  + + + E K + EV         +H +P   
Subjt:  VHFPINYDQDINEEPIFVSDEQ----------CTTTNIQGSQNGPIING-SLVDINDKEPGE----FIESELLVNEHKKVEEVVKEESGMPINHVTPLAT

Query:  DVVVETFPLDSGSWGVN-GSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVV----EKAEENFAGPLSESDMVEAAQIVETSNGSTVKE----
         VVVE     +GS  +  GS+ R +I+ ++ ++    S            N + +  V+    +K E      +  S+  E   +    N  +  +    
Subjt:  DVVVETFPLDSGSWGVN-GSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVV----EKAEENFAGPLSESDMVEAAQIVETSNGSTVKE----

Query:  -GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSN
          +  +V   E    S+T      E   ++ E+K  ++S S I +  K F+N
Subjt:  -GIIYEVGGPELEVCSDTPISVTFEQGQKSSEMKAPNASPSTIENLNKTFSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGCTATAAAGGGTGGGTGGACGGGGCATCCTCTTGCCCTAGCCCAGAACAATGAGGCTGAAGGGAGGAAGACCAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGC
AATGGTTGAAGTCTTCATAAAAAAGTACCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTGCGAGAGA
TTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTATTAGAAGAGGAAGAACACAGAATTGATCATTCACTTGAAGAGAATCCACTCCACTCAATT
GCTATTGAACCTCAATCTCCTTTAACGTTATCGGCTAAGGAAGTCCATTTTCCAATCAACTACGACCAAGATATAAATGAAGAACCAATCTTTGTTTCAGATGAGCAATG
CACTACAACAAATATTCAGGGATCACAGAATGGGCCAATAATTAATGGTAGCCTGGTGGACATAAACGACAAGGAACCTGGGGAATTTATCGAGTCAGAGTTGCTAGTAA
ATGAACACAAGAAAGTAGAGGAAGTGGTAAAAGAGGAATCAGGAATGCCAATTAATCATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCA
GGTTCTTGGGGTGTTAATGGTTCAGATGTAAGATCTGAGATATTGATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTT
GTTTAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGCAGAGGAAAACTTTGCAGGTCCATTATCAGAATCTGATATGGTGGAGGCAGCACAAATTGTTGAAACATCTA
ATGGATCTACTGTGAAAGAAGGTATCATTTATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGT
GAAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAATCTCAACAAGACATTCAGCAATGGGTTTGATCAGGCCTCAAAAATCAAAGAGGAGACAGAGATGGAAAA
TAAAGTAGATGCTGGACAAACTGGTGGCTCCCAGAAAGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCATCAAAAGCCG
AAAACAACCCGGTTTTGGAAATCTTCAAGGCATTTATCGCTGCCTTCGTGAAGTTTTGGTCTGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATTCATTCTCCCACACTCCGCCATTCCCATCCCCCACTGCCCATTTCTGTTGGAATTTTACAGAATGTATCACTGCTTCGCTCTCTCTAATCCATCGGTAGGGTTTTAGG
CTTCTTTATCTTCAATTCTCCTTCCACCCTTTCATTTCGCGCTCTCTCTGCTTCTAGGCTTGTACTCTTGGAAAAGCTGAACTTTGTGGATTTCATGCATGCTATAAAGG
GTGGGTGGACGGGGCATCCTCTTGCCCTAGCCCAGAACAATGAGGCTGAAGGGAGGAAGACCAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTC
ATAAAAAAGTACCAGGAATCAAATAATGGGAGTTTCCCCTCACTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTGCGAGAGATTGTACGTGATATAAT
CCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTATTAGAAGAGGAAGAACACAGAATTGATCATTCACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAACCTCAAT
CTCCTTTAACGTTATCGGCTAAGGAAGTCCATTTTCCAATCAACTACGACCAAGATATAAATGAAGAACCAATCTTTGTTTCAGATGAGCAATGCACTACAACAAATATT
CAGGGATCACAGAATGGGCCAATAATTAATGGTAGCCTGGTGGACATAAACGACAAGGAACCTGGGGAATTTATCGAGTCAGAGTTGCTAGTAAATGAACACAAGAAAGT
AGAGGAAGTGGTAAAAGAGGAATCAGGAATGCCAATTAATCATGTAACTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGGTTCTTGGGGTGTTA
ATGGTTCAGATGTAAGATCTGAGATATTGATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAACATTAAAGCT
TCTGGCTGTGTAGTTGAGAAAGCAGAGGAAAACTTTGCAGGTCCATTATCAGAATCTGATATGGTGGAGGCAGCACAAATTGTTGAAACATCTAATGGATCTACTGTGAA
AGAAGGTATCATTTATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATACTCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGTGAAATGAAGGCTCCAA
ATGCTTCTCCAAGTACTATTGAGAATCTCAACAAGACATTCAGCAATGGGTTTGATCAGGCCTCAAAAATCAAAGAGGAGACAGAGATGGAAAATAAAGTAGATGCTGGA
CAAACTGGTGGCTCCCAGAAAGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCATCAAAAGCCGAAAACAACCCGGTTTT
GGAAATCTTCAAGGCATTTATCGCTGCCTTCGTGAAGTTTTGGTCTGAGTAAATAATATGATTGTCGAGTATAAACGAATAGAGAGTAGTAGTAGTTAAATTTTCTGCCA
CAGAACCTGTCTGTCTTTGTACCAAGTTGCAATCGGTTACCCCGTTCACTCGAGTCGGTCCCATCGTCGATAATTACAAGGACAAAACTGGATTCTGGATGTGGGTTGGC
ATTTCTGTACGCTGCAGTAAGAAAGAAAGTTTAGGAGTAGGCATTTCCCACCCCTCTCAGTAGCTTGTAGAAGGGGTATTTTTCTTTTTAATCTTTTTACTGTAACCGTG
AGATATGTCCCACCCTTATGATTTTATTCCAGATATGAATGGTTTCCTATTCATTTTTCATTACCATATGATAAAAAAGAGGAAGAAGAAAAAGTTGGAGTAACGTGAGA
AGGAGCCTTTGATGGTAAGGTGTTTGTGTGTTAGGAGGAGGACAGGGAAACAAGAGAGAAAGGCATATACAGATTGGAGTTTTAATAGGCATCTTCTTTCCCCCTTTTTG
CTCTGACCCGACATCTTATATCATATGGATCATGTAGTCACATGATTTTCTGGATTCCCCAATTTTCTAATACTACTTTTCTACCCTTTTTTTTTTTTTTTTTTTTAAAT
AAAAAAAATTACATTCCAAATAATGCAC
Protein sequenceShow/hide protein sequence
MHAIKGGWTGHPLALAQNNEAEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLEEEEHRIDHSLEENPLHSI
AIEPQSPLTLSAKEVHFPINYDQDINEEPIFVSDEQCTTTNIQGSQNGPIINGSLVDINDKEPGEFIESELLVNEHKKVEEVVKEESGMPINHVTPLATDVVVETFPLDS
GSWGVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAEENFAGPLSESDMVEAAQIVETSNGSTVKEGIIYEVGGPELEVCSDTPISVTFEQGQKSS
EMKAPNASPSTIENLNKTFSNGFDQASKIKEETEMENKVDAGQTGGSQKESIPTLNRINLESWEGMSKNSSKAENNPVLEIFKAFIAAFVKFWSE