; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G07710 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G07710
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPlastid envelope DNA binding protein
Genome locationClcChr08:19288462..19291495
RNA-Seq ExpressionClc08G07710
SyntenyClc08G07710
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057131.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa]2.0e-18878.77Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH
        MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DH
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH

Query:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------
        SL++NPLHSIAIEP SPLTLSSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL VNEHK             
Subjt:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE
                         KVEEVVKEESGMPINH+TPLATDVVVETFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Subjt:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE

Query:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK
         ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E+E K
Subjt:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK

Query:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE
        VD GQTGGSQ ES+PTLNRINLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWS+
Subjt:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE

XP_004138835.1 uncharacterized protein LOC101202832 [Cucumis sativus]4.5e-18576.81Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHS
        MHA+KGGWTGRPLALAKNNE EGR+TRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DHS
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHS

Query:  LEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSEL--------------------
        LE+NPLHSIAIEP SPLTLSS EVHFPVNYN+YI+EE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL                    
Subjt:  LEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSEL--------------------

Query:  -----------PVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE
                    VNEH KVEEVVKEESGMPIN++TPLATDVVVETFPLDSV W VNG DVRSEILISTSASEKQVSQ+IELESDVGLFNI  S CVVEK 
Subjt:  -----------PVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE

Query:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK
        EEN   PL++TKSDLV+ AQIVE SNGST+KEG I+EVGG ELEVCSDTP+SV+ EQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E++ K
Subjt:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK

Query:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE
        VD GQTGGSQ ES+PTLNRINL+SWEGMSKNS KP NNPLLEI+K+FITAFVKFWSE
Subjt:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE

XP_008456554.1 PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo]8.8e-18978.99Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH
        MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DH
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH

Query:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------
        SL++NPLHSIAIEP SPLTLSSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL VNEHK             
Subjt:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE
                         KVEEVVKEESGMPINH+TPLATDVVVETFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Subjt:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE

Query:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK
         ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E+E K
Subjt:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK

Query:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE
        VD GQTGGSQ ES+PTLNRINLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWSE
Subjt:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE

XP_008456557.1 PREDICTED: uncharacterized protein LOC103496473 isoform X2 [Cucumis melo]1.8e-17875.71Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH
        MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DH
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH

Query:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------
        SL++NPLHSIAIEP SPLTLSSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL VNEHK             
Subjt:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE
                         KVEEVVKEESGMPINH+TPLATDVVVETFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Subjt:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE

Query:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK
         ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQKSS+MK                      ASKI    E+E K
Subjt:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK

Query:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE
        VD GQTGGSQ ES+PTLNRINLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWSE
Subjt:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE

XP_038886590.1 uncharacterized protein LOC120076760 [Benincasa hispida]2.7e-20688.99Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLL-LEEHSIDH
        MHA+KGGWTG PLALA+NNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLL  EEH IDH
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLL-LEEHSIDH

Query:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHKKVEEVVKEESGMP
        SLEENPLHSIAIEP SPLTLS+KEVHFP+NY+Q INEE IFVSDE+CT T++QGSQNGPI+NGSLVD ++K+  +FI+SEL VNEHKKVEEVVKEESGMP
Subjt:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHKKVEEVVKEESGMP

Query:  INHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEENFAGPLSETKSDLVEVAQIVETSNGSTL
        INH+TPLATDVVVETFPLDS SW VNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEK EENFAGPLSE  SD+VE AQIVETSNGST+
Subjt:  INHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEENFAGPLSETKSDLVEVAQIVETSNGSTL

Query:  KEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRINLESWEGMSK
        KEGIIYEVGG ELEVCSDTPISVT EQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKE TEME KVDAGQTGGSQ ESIPTLNRINLESWEGMSK
Subjt:  KEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRINLESWEGMSK

Query:  NSLKPENNPLLEILKAFITAFVKFWSE
        NS K ENNP+LEI KAFI AFVKFWSE
Subjt:  NSLKPENNPLLEILKAFITAFVKFWSE

TrEMBL top hitse value%identityAlignment
A0A0A0LML9 Uncharacterized protein2.2e-18576.81Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHS
        MHA+KGGWTGRPLALAKNNE EGR+TRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DHS
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHS

Query:  LEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSEL--------------------
        LE+NPLHSIAIEP SPLTLSS EVHFPVNYN+YI+EE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL                    
Subjt:  LEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSEL--------------------

Query:  -----------PVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE
                    VNEH KVEEVVKEESGMPIN++TPLATDVVVETFPLDSV W VNG DVRSEILISTSASEKQVSQ+IELESDVGLFNI  S CVVEK 
Subjt:  -----------PVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE

Query:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK
        EEN   PL++TKSDLV+ AQIVE SNGST+KEG I+EVGG ELEVCSDTP+SV+ EQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E++ K
Subjt:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK

Query:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE
        VD GQTGGSQ ES+PTLNRINL+SWEGMSKNS KP NNPLLEI+K+FITAFVKFWSE
Subjt:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE

A0A1S3C344 uncharacterized protein LOC103496473 isoform X28.9e-17975.71Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH
        MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DH
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH

Query:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------
        SL++NPLHSIAIEP SPLTLSSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL VNEHK             
Subjt:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE
                         KVEEVVKEESGMPINH+TPLATDVVVETFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Subjt:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE

Query:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK
         ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQKSS+MK                      ASKI    E+E K
Subjt:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK

Query:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE
        VD GQTGGSQ ES+PTLNRINLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWSE
Subjt:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE

A0A1S3C473 uncharacterized protein LOC103496473 isoform X14.3e-18978.99Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH
        MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DH
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH

Query:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------
        SL++NPLHSIAIEP SPLTLSSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL VNEHK             
Subjt:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE
                         KVEEVVKEESGMPINH+TPLATDVVVETFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Subjt:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE

Query:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK
         ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E+E K
Subjt:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK

Query:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE
        VD GQTGGSQ ES+PTLNRINLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWSE
Subjt:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein9.5e-18978.77Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH
        MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DH
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH

Query:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------
        SL++NPLHSIAIEP SPLTLSSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL VNEHK             
Subjt:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE
                         KVEEVVKEESGMPINH+TPLATDVVVETFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Subjt:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE

Query:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK
         ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQKSS+MK+P AS    ENLNKTFSN FDQASKI    E+E K
Subjt:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK

Query:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE
        VD GQTGGSQ ES+PTLNRINLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWS+
Subjt:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE

A0A5D3BB97 Plastid envelope DNA binding protein2.0e-17875.49Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH
        MHA+KGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+ DH
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSIDH

Query:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------
        SL++NPLHSIAIEP SPLTLSSKEVHFP+NYN+YINEE IFVSDE+CTAT++QGSQN  I+NGSLVD S +DSD+FI+SEL VNEHK             
Subjt:  SLEENPLHSIAIEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHK-------------

Query:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE
                         KVEEVVKEESGMPINH+TPLATDVVVETFPLD V W VNGSDVRSEILIST+ASEKQVSQ+IELESDVGL NI AS  VVEK 
Subjt:  -----------------KVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKE

Query:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK
         ENFAGPLSETKSDLVEVAQIVE SNGST+KEG ++EVGG ELEVCSDTPISV  EQGQKSS+MK                      ASKI    E+E K
Subjt:  EENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKK

Query:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE
        VD GQTGGSQ ES+PTLNRINLESWEGMSKNS KPENNPLLEI+K+FI AFVKFWS+
Subjt:  VDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding2.2e-4430.88Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLE------E
        MH++K    G+  ALAK ++  G++TR R  KEERK +VE FIKK+Q+ N GSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG LLLE      +
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLE------E

Query:  HSIDHSLEENPLHSIAIEPP-------SPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSL----QGSQNGPIVNGSLVDASEKDSDDFIKSELPV-
         S+  S+  +P+  +++ P          L  SS+     VN +Q   +    VS  +     +    Q   +  I    L  +  +D+D  IKS   + 
Subjt:  HSIDHSLEENPLHSIAIEPP-------SPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSL----QGSQNGPIVNGSLVDASEKDSDDFIKSELPV-

Query:  ----------------------NEHKKVEEV--VKEESGMPINH-------------------LTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSA
                              N+ +  EE+  ++ +   P+N+                      ++ + VVETFPL SV+ +++  D +   L     
Subjt:  ----------------------NEHKKVEEV--VKEESGMPINH-------------------LTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSA

Query:  SEKQVSQTIELES------DVGLFNIKASGCVVE---------KEEENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTS
          K     +E +       D+G  +   S  V+E         +   + + P+ +   + +  +  V+       +  ++  V G+  E    +  ++T+
Subjt:  SEKQVSQTIELES------DVGLFNIKASGCVVE---------KEEENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTS

Query:  EQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFW
        EQ   +S  ++ +      +    +   G + AS  K+AT  + K+DA  +  SQ E+  TLNRI  ESW+G S N  + E NPLL +LK+F+TAFVKFW
Subjt:  EQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFW

Query:  SE
        SE
Subjt:  SE

AT3G52170.2 DNA binding2.2e-4430.88Show/hide
Query:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLE------E
        MH++K    G+  ALAK ++  G++TR R  KEERK +VE FIKK+Q+ N GSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG LLLE      +
Subjt:  MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLE------E

Query:  HSIDHSLEENPLHSIAIEPP-------SPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSL----QGSQNGPIVNGSLVDASEKDSDDFIKSELPV-
         S+  S+  +P+  +++ P          L  SS+     VN +Q   +    VS  +     +    Q   +  I    L  +  +D+D  IKS   + 
Subjt:  HSIDHSLEENPLHSIAIEPP-------SPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSL----QGSQNGPIVNGSLVDASEKDSDDFIKSELPV-

Query:  ----------------------NEHKKVEEV--VKEESGMPINH-------------------LTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSA
                              N+ +  EE+  ++ +   P+N+                      ++ + VVETFPL SV+ +++  D +   L     
Subjt:  ----------------------NEHKKVEEV--VKEESGMPINH-------------------LTPLATDVVVETFPLDSVSWSVNGSDVRSEILISTSA

Query:  SEKQVSQTIELES------DVGLFNIKASGCVVE---------KEEENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTS
          K     +E +       D+G  +   S  V+E         +   + + P+ +   + +  +  V+       +  ++  V G+  E    +  ++T+
Subjt:  SEKQVSQTIELES------DVGLFNIKASGCVVE---------KEEENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTS

Query:  EQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFW
        EQ   +S  ++ +      +    +   G + AS  K+AT  + K+DA  +  SQ E+  TLNRI  ESW+G S N  + E NPLL +LK+F+TAFVKFW
Subjt:  EQGQKSSEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFW

Query:  SE
        SE
Subjt:  SE

AT5G58210.1 hydroxyproline-rich glycoprotein family protein6.6e-0924.5Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPL------TLSSK
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++     + +   ++       P  + +   P+P+       LS  
Subjt:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPL------TLSSK

Query:  EVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSW
            P + + +++   + + + +        S+  P V           S  F  + +P+ E + +  V         +H +P A   +VE   L  VS 
Subjt:  EVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSW

Query:  SV--NGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEENFAGP--LSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSE-------
        SV  + S   S +++        V   I  E    + +   S      +E NF G   +   K D  E        N  T +E  +  +G  E       
Subjt:  SV--NGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEENFAGP--LSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSE-------

Query:  -LEVCSD---TPISVTSEQG------QKSSEMKAPNASPSTIENLNKTFSN
          +V +D   T     SE G       ++ E+K  ++S S I +  K F+N
Subjt:  -LEVCSD---TPISVTSEQG------QKSSEMKAPNASPSTIENLNKTFSN

AT5G58210.2 hydroxyproline-rich glycoprotein family protein6.6e-0924.5Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPL------TLSSK
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++     + +   ++       P  + +   P+P+       LS  
Subjt:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPL------TLSSK

Query:  EVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSW
            P + + +++   + + + +        S+  P V           S  F  + +P+ E + +  V         +H +P A   +VE   L  VS 
Subjt:  EVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSW

Query:  SV--NGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEENFAGP--LSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSE-------
        SV  + S   S +++        V   I  E    + +   S      +E NF G   +   K D  E        N  T +E  +  +G  E       
Subjt:  SV--NGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEENFAGP--LSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSE-------

Query:  -LEVCSD---TPISVTSEQG------QKSSEMKAPNASPSTIENLNKTFSN
          +V +D   T     SE G       ++ E+K  ++S S I +  K F+N
Subjt:  -LEVCSD---TPISVTSEQG------QKSSEMKAPNASPSTIENLNKTFSN

AT5G58210.3 hydroxyproline-rich glycoprotein family protein6.6e-0924.5Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPL------TLSSK
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE ++     + +   ++       P  + +   P+P+       LS  
Subjt:  RRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIAIEPPSPL------TLSSK

Query:  EVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSW
            P + + +++   + + + +        S+  P V           S  F  + +P+ E + +  V         +H +P A   +VE   L  VS 
Subjt:  EVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSVSW

Query:  SV--NGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEENFAGP--LSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSE-------
        SV  + S   S +++        V   I  E    + +   S      +E NF G   +   K D  E        N  T +E  +  +G  E       
Subjt:  SV--NGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEENFAGP--LSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSE-------

Query:  -LEVCSD---TPISVTSEQG------QKSSEMKAPNASPSTIENLNKTFSN
          +V +D   T     SE G       ++ E+K  ++S S I +  K F+N
Subjt:  -LEVCSD---TPISVTSEQG------QKSSEMKAPNASPSTIENLNKTFSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGCTGTAAAAGGTGGGTGGACGGGGCGTCCTCTTGCCCTAGCCAAGAACAATGAGCCTGAAGGGAGGAAGACAAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGC
AATGGTTGAAGTTTTCATAAAAAAGTACCAGGAATCAAATAAGGGGAGCTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACAGTACGAGAGA
TTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTGTTATTAGAAGAGCACAGCATTGATCATTCACTTGAAGAGAATCCACTTCACTCAATTGCT
ATTGAACCTCCATCTCCTTTAACCTTATCGTCTAAGGAAGTCCATTTTCCAGTCAACTACAACCAATATATAAATGAAGAAGCAATCTTTGTTTCAGATGAGCGCTGCAC
TGCAACAAGTCTTCAGGGATCACAGAATGGGCCAATAGTTAATGGCAGCCTGGTGGACGCAAGCGAAAAGGATTCTGATGATTTTATCAAGTCAGAGTTGCCAGTAAATG
AACACAAGAAAGTAGAGGAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATCATTTAACTCCTTTGGCAACAGATGTTGTGGTTGAGACATTCCCATTGGATTCAGTT
TCTTGGTCTGTTAATGGTTCAGATGTAAGATCTGAGATATTAATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTT
TAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGAAGAGGAAAACTTTGCAGGTCCATTATCAGAAACAAAGTCTGACTTGGTGGAGGTAGCACAAATTGTTGAAACAT
CTAATGGATCTACTCTGAAAGAAGGTATCATATATGAAGTTGGGGGTTCTGAGTTGGAAGTTTGCAGTGATACTCCAATATCTGTGACCTCTGAACAAGGCCAGAAATCT
AGTGAAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAATCTCAACAAGACATTTAGCAATGGCTTTGATCAGGCCTCAAAAATCAAAGAGGCGACAGAGATGGA
AAAGAAAGTAGATGCTGGACAAACTGGTGGCTCCCAGAATGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCATTAAAAC
CCGAGAACAACCCGCTTTTGGAAATCTTGAAGGCATTCATCACTGCCTTTGTGAAGTTTTGGTCCGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATGCTGTAAAAGGTGGGTGGACGGGGCGTCCTCTTGCCCTAGCCAAGAACAATGAGCCTGAAGGGAGGAAGACAAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGC
AATGGTTGAAGTTTTCATAAAAAAGTACCAGGAATCAAATAAGGGGAGCTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACAGTACGAGAGA
TTGTACGTGATATAATCCAAGAAAATAGAGTCCTTGGTCCAGGAAAGTTGTTATTAGAAGAGCACAGCATTGATCATTCACTTGAAGAGAATCCACTTCACTCAATTGCT
ATTGAACCTCCATCTCCTTTAACCTTATCGTCTAAGGAAGTCCATTTTCCAGTCAACTACAACCAATATATAAATGAAGAAGCAATCTTTGTTTCAGATGAGCGCTGCAC
TGCAACAAGTCTTCAGGGATCACAGAATGGGCCAATAGTTAATGGCAGCCTGGTGGACGCAAGCGAAAAGGATTCTGATGATTTTATCAAGTCAGAGTTGCCAGTAAATG
AACACAAGAAAGTAGAGGAAGTGGTGAAAGAGGAATCAGGAATGCCAATTAATCATTTAACTCCTTTGGCAACAGATGTTGTGGTTGAGACATTCCCATTGGATTCAGTT
TCTTGGTCTGTTAATGGTTCAGATGTAAGATCTGAGATATTAATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTT
TAACATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGAAGAGGAAAACTTTGCAGGTCCATTATCAGAAACAAAGTCTGACTTGGTGGAGGTAGCACAAATTGTTGAAACAT
CTAATGGATCTACTCTGAAAGAAGGTATCATATATGAAGTTGGGGGTTCTGAGTTGGAAGTTTGCAGTGATACTCCAATATCTGTGACCTCTGAACAAGGCCAGAAATCT
AGTGAAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAATCTCAACAAGACATTTAGCAATGGCTTTGATCAGGCCTCAAAAATCAAAGAGGCGACAGAGATGGA
AAAGAAAGTAGATGCTGGACAAACTGGTGGCTCCCAGAATGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCATTAAAAC
CCGAGAACAACCCGCTTTTGGAAATCTTGAAGGCATTCATCACTGCCTTTGTGAAGTTTTGGTCCGAGTAA
Protein sequenceShow/hide protein sequence
MHAVKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNKGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSIDHSLEENPLHSIA
IEPPSPLTLSSKEVHFPVNYNQYINEEAIFVSDERCTATSLQGSQNGPIVNGSLVDASEKDSDDFIKSELPVNEHKKVEEVVKEESGMPINHLTPLATDVVVETFPLDSV
SWSVNGSDVRSEILISTSASEKQVSQTIELESDVGLFNIKASGCVVEKEEENFAGPLSETKSDLVEVAQIVETSNGSTLKEGIIYEVGGSELEVCSDTPISVTSEQGQKS
SEMKAPNASPSTIENLNKTFSNGFDQASKIKEATEMEKKVDAGQTGGSQNESIPTLNRINLESWEGMSKNSLKPENNPLLEILKAFITAFVKFWSE