; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004157 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004157
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPlastid envelope DNA binding protein
Genome locationChr08:14278928..14289129
RNA-Seq ExpressionHG10004157
SyntenyHG10004157
Gene Ontology termsGO:0006351 - transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000268 - DNA-directed RNA polymerase, subunit N/Rpb10
IPR020789 - RNA polymerases, subunit N, zinc binding site
IPR023580 - RNA polymerase subunit RPB10


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057131.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa]2.6e-18779.13Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH
        MHAIKGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+TDH
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH

Query:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------
         L++NPLHSIAIEPQSPLT+SSKE + P++ YN+YINEE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ ELLVNEHK            
Subjt:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------

Query:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                          KVEEV+KEESGMPINHV+PLATDVVVETFPLD V W VNGSDVRSE+LIST+ASEKQVSQ+IELESDVGL NI AS  VVEK
Subjt:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN
        A +NFAGPLSETKSDLVEVAQIVE SNGSTVKEG ++EVGGPELEVCSD PISV FEQGQKSS+MK+P AS    E+LNKTFSN FDQASKI    E+EN
Subjt:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSDPV
        KVD  QTGGSQKES+PTLNRINLESWEGMSKNSSKPENN LLEI K+FIAAFVKFWSD V
Subjt:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSDPV

TYJ97112.1 plastid envelope DNA binding protein [Cucumis melo var. makuwa]1.4e-17776.09Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH
        MHAIKGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+TDH
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH

Query:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------
         L++NPLHSIAIEPQSPLT+SSKE + P++ YN+YINEE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ ELLVNEHK            
Subjt:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------

Query:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                          KVEEV+KEESGMPINHV+PLATDVVVETFPLD V W VNGSDVRSE+LIST+ASEKQVSQ+IELESDVGL NI AS  VVEK
Subjt:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN
        A +NFAGPLSETKSDLVEVAQIVE SNGSTVKEG ++EVGGPELEVCSD PISV FEQGQKSS+MK                      ASKI    E+EN
Subjt:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSDPV
        KVD  QTGGSQKES+PTLNRINLESWEGMSKNSSKPENN LLEI K+FIAAFVKFWSD V
Subjt:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSDPV

XP_004138835.1 uncharacterized protein LOC101202832 [Cucumis sativus]6.2e-18175.98Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MHAIKGGWTGRPLALAKNNE EGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DH 
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPEL-------------------
        LE+NPLHSIAIEP SPLT+SS E + PV+ YN+YI+EE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ EL                   
Subjt:  LEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPEL-------------------

Query:  ------------LVNEHKKVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                    LVNEH KVEEV+KEESGMPIN+V+PLATDVVVETFPLDSV W VNG DVRSE+LISTSASEKQVSQ+IELESDVGLFNI  S CVVEK
Subjt:  ------------LVNEHKKVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN
        AE+N   PL++TKSDLV+ AQIVE SNGSTVKEG I+EVGGPELEVCSD P+SV+FEQGQKSS+MK+P AS    E+LNKTFSN FDQASKI    E++N
Subjt:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD
        KVD  QTGGSQKES+PTLNRINL+SWEGMSKNSSKP NN LLEI K+FI AFVKFWS+
Subjt:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD

XP_008456554.1 PREDICTED: uncharacterized protein LOC103496473 isoform X1 [Cucumis melo]1.3e-18679.04Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH
        MHAIKGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+TDH
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH

Query:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------
         L++NPLHSIAIEPQSPLT+SSKE + P++ YN+YINEE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ ELLVNEHK            
Subjt:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------

Query:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                          KVEEV+KEESGMPINHV+PLATDVVVETFPLD V W VNGSDVRSE+LIST+ASEKQVSQ+IELESDVGL NI AS  VVEK
Subjt:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN
        A +NFAGPLSETKSDLVEVAQIVE SNGSTVKEG ++EVGGPELEVCSD PISV FEQGQKSS+MK+P AS    E+LNKTFSN FDQASKI    E+EN
Subjt:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD
        KVD  QTGGSQKES+PTLNRINLESWEGMSKNSSKPENN LLEI K+FIAAFVKFWS+
Subjt:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD

XP_038886590.1 uncharacterized protein LOC120076760 [Benincasa hispida]8.6e-20789.72Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLL-LEEHSTDH
        MHAIKGGWTG PLALA+NNE EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLL  EEH  DH
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLL-LEEHSTDH

Query:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHKKVEEVMKEESGM
         LEENPLHSIAIEPQSPLT+S+KE + P++ Y+Q INEE I VSDEQCT TNIQGSQNGPIINGSLVD+ DK+  EFI+ ELLVNEHKKVEEV+KEESGM
Subjt:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHKKVEEVMKEESGM

Query:  PINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAEKNFAGPLSETKSDLVEVAQIVETSNGST
        PINHV+PLATDVVVETFPLDS SWGVNGSDVRSE+LISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAE+NFAGPLSE  SD+VE AQIVETSNGST
Subjt:  PINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEKAEKNFAGPLSETKSDLVEVAQIVETSNGST

Query:  VKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNRINLESWEGMS
        VKEGIIYEVGGPELEVCSD PISVTFEQGQKSSEMKAPNASPSTIE+LNKTFSNGFDQASKIKEETEMENKVDA QTGGSQKESIPTLNRINLESWEGMS
Subjt:  VKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNRINLESWEGMS

Query:  KNSSKPENNLLLEIFKAFIAAFVKFWSD
        KNSSK ENN +LEIFKAFIAAFVKFWS+
Subjt:  KNSSKPENNLLLEIFKAFIAAFVKFWSD

TrEMBL top hitse value%identityAlignment
A0A0A0LML9 Uncharacterized protein3.0e-18175.98Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MHAIKGGWTGRPLALAKNNE EGR+TRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPG LLLEEH+ DH 
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPEL-------------------
        LE+NPLHSIAIEP SPLT+SS E + PV+ YN+YI+EE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ EL                   
Subjt:  LEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPEL-------------------

Query:  ------------LVNEHKKVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                    LVNEH KVEEV+KEESGMPIN+V+PLATDVVVETFPLDSV W VNG DVRSE+LISTSASEKQVSQ+IELESDVGLFNI  S CVVEK
Subjt:  ------------LVNEHKKVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN
        AE+N   PL++TKSDLV+ AQIVE SNGSTVKEG I+EVGGPELEVCSD P+SV+FEQGQKSS+MK+P AS    E+LNKTFSN FDQASKI    E++N
Subjt:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD
        KVD  QTGGSQKES+PTLNRINL+SWEGMSKNSSKP NN LLEI K+FI AFVKFWS+
Subjt:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD

A0A1S3C344 uncharacterized protein LOC103496473 isoform X23.5e-17775.98Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH
        MHAIKGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+TDH
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH

Query:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------
         L++NPLHSIAIEPQSPLT+SSKE + P++ YN+YINEE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ ELLVNEHK            
Subjt:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------

Query:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                          KVEEV+KEESGMPINHV+PLATDVVVETFPLD V W VNGSDVRSE+LIST+ASEKQVSQ+IELESDVGL NI AS  VVEK
Subjt:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN
        A +NFAGPLSETKSDLVEVAQIVE SNGSTVKEG ++EVGGPELEVCSD PISV FEQGQKSS+MK                      ASKI    E+EN
Subjt:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD
        KVD  QTGGSQKES+PTLNRINLESWEGMSKNSSKPENN LLEI K+FIAAFVKFWS+
Subjt:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD

A0A1S3C473 uncharacterized protein LOC103496473 isoform X16.3e-18779.04Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH
        MHAIKGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+TDH
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH

Query:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------
         L++NPLHSIAIEPQSPLT+SSKE + P++ YN+YINEE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ ELLVNEHK            
Subjt:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------

Query:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                          KVEEV+KEESGMPINHV+PLATDVVVETFPLD V W VNGSDVRSE+LIST+ASEKQVSQ+IELESDVGL NI AS  VVEK
Subjt:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN
        A +NFAGPLSETKSDLVEVAQIVE SNGSTVKEG ++EVGGPELEVCSD PISV FEQGQKSS+MK+P AS    E+LNKTFSN FDQASKI    E+EN
Subjt:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD
        KVD  QTGGSQKES+PTLNRINLESWEGMSKNSSKPENN LLEI K+FIAAFVKFWS+
Subjt:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSD

A0A5A7UUF2 Plastid envelope DNA binding protein1.3e-18779.13Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH
        MHAIKGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+TDH
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH

Query:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------
         L++NPLHSIAIEPQSPLT+SSKE + P++ YN+YINEE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ ELLVNEHK            
Subjt:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------

Query:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                          KVEEV+KEESGMPINHV+PLATDVVVETFPLD V W VNGSDVRSE+LIST+ASEKQVSQ+IELESDVGL NI AS  VVEK
Subjt:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN
        A +NFAGPLSETKSDLVEVAQIVE SNGSTVKEG ++EVGGPELEVCSD PISV FEQGQKSS+MK+P AS    E+LNKTFSN FDQASKI    E+EN
Subjt:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSDPV
        KVD  QTGGSQKES+PTLNRINLESWEGMSKNSSKPENN LLEI K+FIAAFVKFWSD V
Subjt:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSDPV

A0A5D3BB97 Plastid envelope DNA binding protein7.0e-17876.09Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH
        MHAIKGGWTGRPLALAKNNE EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGKLLL EEH+TDH
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLL-EEHSTDH

Query:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------
         L++NPLHSIAIEPQSPLT+SSKE + P++ YN+YINEE I VSDEQCTATNIQGSQN  IINGSLVDV ++DS EFI+ ELLVNEHK            
Subjt:  LLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTATNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHK------------

Query:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK
                          KVEEV+KEESGMPINHV+PLATDVVVETFPLD V W VNGSDVRSE+LIST+ASEKQVSQ+IELESDVGL NI AS  VVEK
Subjt:  ------------------KVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFNIKASGCVVEK

Query:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN
        A +NFAGPLSETKSDLVEVAQIVE SNGSTVKEG ++EVGGPELEVCSD PISV FEQGQKSS+MK                      ASKI    E+EN
Subjt:  AEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN

Query:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSDPV
        KVD  QTGGSQKES+PTLNRINLESWEGMSKNSSKPENN LLEI K+FIAAFVKFWSD V
Subjt:  KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSDPV

SwissProt top hitse value%identityAlignment
Q32P78 DNA-directed RNA polymerases I, II, and III subunit RPABC53.1e-2681.54Show/hide
Query:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL
        MIIPVRCFTCGK++GNKW+ YL LLQA+Y+EGDALDALGL RYCCRRML+ HVDLIEKLLNY  L
Subjt:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL

Q39290 DNA-directed RNA polymerases I, II, and III subunit RPABC51.0e-2992.31Show/hide
Query:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL
        MIIPVRCFTCGKVIGNKWD YLDLLQ DY+EGDALDAL LVRYCCRRMLMTHVDLIEKLLNYN+L
Subjt:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL

Q55AB6 DNA-directed RNA polymerases I, II, and III subunit rpabc51.1e-2686.15Show/hide
Query:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL
        MIIPVRCFTCGKVIGNKWD YL LLQ DY+EGDALDAL L RYCCRRML+THVDLIEKLLNY  L
Subjt:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL

Q8LFJ6 DNA-directed RNA polymerases II, IV and V subunit 101.0e-2992.31Show/hide
Query:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL
        MIIPVRCFTCGKVIGNKWD YLDLLQ DY+EGDALDAL LVRYCCRRMLMTHVDLIEKLLNYN+L
Subjt:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL

Q9SYA6 DNA-directed RNA polymerase subunit 10-like protein3.6e-3090.77Show/hide
Query:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL
        MI+PVRCFTCGKVIGNKWD YL+LLQADY+EGDALDALGLVRYCCRRMLMTHVDLIEKLLNYN++
Subjt:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL

Arabidopsis top hitse value%identityAlignment
AT1G11475.1 RNA polymerases N / 8 kDa subunit7.4e-3192.31Show/hide
Query:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL
        MIIPVRCFTCGKVIGNKWD YLDLLQ DY+EGDALDAL LVRYCCRRMLMTHVDLIEKLLNYN+L
Subjt:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL

AT1G61700.1 RNA polymerases N / 8 kDa subunit2.5e-3190.77Show/hide
Query:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL
        MI+PVRCFTCGKVIGNKWD YL+LLQADY+EGDALDALGLVRYCCRRMLMTHVDLIEKLLNYN++
Subjt:  MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSL

AT3G52170.1 DNA binding3.4e-4432.24Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MH++K    G+  ALAK ++  G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG LLLE + +  +
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTVSSKEF----YSPVDYYNQ----------------------YINEESIIVSDEQCTATNIQGSQ--------NGPIINGSLV
         +++   SI ++P  PL++S   F    Y  +D+ ++                       + +E I +  +   +T+I  +Q        N    N  L 
Subjt:  LEENPLHSIAIEPQSPLTVSSKEF----YSPVDYYNQ----------------------YINEESIIVSDEQCTATNIQGSQ--------NGPIINGSLV

Query:  DVRDK--DSVEFIKPE----LLVNEHKKVEEV--MKEESGMPINH-------------------VSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTS
        +  +   DSV+  KP+     + N+ +  EE+  M+ +   P+N+                      ++ + VVETFPL SV+  ++  D +   L    
Subjt:  DVRDK--DSVEFIKPE----LLVNEHKKVEEV--MKEESGMPINH-------------------VSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTS

Query:  ASEKQVSQTIELES------DVGLFNIKASGCVVE-----------------KAEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVC
           K     +E +       D+G  +   S  V+E                   EK     +  + S  VE A   ET     V  G+I  V   +    
Subjt:  ASEKQVSQTIELES------DVGLFNIKASGCVVE-----------------KAEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVC

Query:  SDNPISVTFEQGQKSSEMKAPNASPSTIESLN-----KTFSN--GFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNL
                F  G  ++E K P +S  +    N      T S+  G + AS  K+ T  + K+DA  +  SQKE+  TLNRI  ESW+G S N  + E N 
Subjt:  SDNPISVTFEQGQKSSEMKAPNASPSTIESLN-----KTFSN--GFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNL

Query:  LLEIFKAFIAAFVKFWSD
        LL + K+F+ AFVKFWS+
Subjt:  LLEIFKAFIAAFVKFWSD

AT3G52170.2 DNA binding3.4e-4432.24Show/hide
Query:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL
        MH++K    G+  ALAK ++  G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG LLLE + +  +
Subjt:  MHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHL

Query:  LEENPLHSIAIEPQSPLTVSSKEF----YSPVDYYNQ----------------------YINEESIIVSDEQCTATNIQGSQ--------NGPIINGSLV
         +++   SI ++P  PL++S   F    Y  +D+ ++                       + +E I +  +   +T+I  +Q        N    N  L 
Subjt:  LEENPLHSIAIEPQSPLTVSSKEF----YSPVDYYNQ----------------------YINEESIIVSDEQCTATNIQGSQ--------NGPIINGSLV

Query:  DVRDK--DSVEFIKPE----LLVNEHKKVEEV--MKEESGMPINH-------------------VSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTS
        +  +   DSV+  KP+     + N+ +  EE+  M+ +   P+N+                      ++ + VVETFPL SV+  ++  D +   L    
Subjt:  DVRDK--DSVEFIKPE----LLVNEHKKVEEV--MKEESGMPINH-------------------VSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTS

Query:  ASEKQVSQTIELES------DVGLFNIKASGCVVE-----------------KAEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVC
           K     +E +       D+G  +   S  V+E                   EK     +  + S  VE A   ET     V  G+I  V   +    
Subjt:  ASEKQVSQTIELES------DVGLFNIKASGCVVE-----------------KAEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVC

Query:  SDNPISVTFEQGQKSSEMKAPNASPSTIESLN-----KTFSN--GFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNL
                F  G  ++E K P +S  +    N      T S+  G + AS  K+ T  + K+DA  +  SQKE+  TLNRI  ESW+G S N  + E N 
Subjt:  SDNPISVTFEQGQKSSEMKAPNASPSTIESLN-----KTFSN--GFDQASKIKEETEMENKVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNL

Query:  LLEIFKAFIAAFVKFWSD
        LL + K+F+ AFVKFWS+
Subjt:  LLEIFKAFIAAFVKFWSD

AT5G58210.3 hydroxyproline-rich glycoprotein family protein2.1e-0956.6Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y    IVRDI QE
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCATTCCAGTGCGTTGCTTCACTTGCGGAAAGGTGATTGGAAACAAATGGGATCATTACCTTGATCTTCTACAGGCAGACTATTCTGAAGGGGATGCACTTGATGC
TCTGGGCTTGGTTCGCTACTGCTGCAGGCGAATGCTCATGACTCATGTTGATCTTATTGAGAAGCTTCTCAATTACAACAGTTTGTTGAACTTTCTCGATTTCATGCATG
CTATAAAAGGTGGGTGGACGGGGCGTCCTCTTGCCCTAGCCAAGAACAATGAGCCTGAAGGGAGGAAGACAAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTT
GAAGTCTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTACGAGAGATTGTACG
TGATATAATCCAAGAAAACAGAGTCCTTGGTCCGGGAAAGTTGTTATTAGAAGAGCACAGCACTGATCATTTACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAAC
CTCAATCTCCTTTAACCGTATCGTCTAAGGAATTCTACTCTCCAGTTGACTACTACAACCAATATATAAATGAAGAATCAATCATTGTTTCAGATGAGCAATGCACTGCA
ACAAATATTCAGGGATCACAGAATGGGCCAATAATTAATGGCAGCCTGGTGGATGTAAGGGACAAGGATTCTGTTGAATTTATCAAGCCAGAGTTGCTAGTAAATGAACA
CAAGAAAGTAGAGGAAGTGATGAAAGAGGAATCAGGAATGCCAATTAATCATGTAAGTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTTCTT
GGGGTGTTAATGGTTCAGATGTAAGATCTGAAATGTTGATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAAC
ATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGCAGAGAAAAACTTTGCAGGTCCATTATCAGAAACAAAGTCTGACTTGGTGGAGGTAGCACAAATTGTTGAAACATCTAA
TGGATCTACTGTGAAAGAAGGTATCATATATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATAATCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGTG
AAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAGTCTCAACAAGACATTCAGCAATGGCTTTGATCAGGCCTCAAAAATCAAAGAGGAGACAGAGATGGAAAAT
AAAGTAGATGCTGAACAAACTGGTGGCTCCCAGAAAGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCGTCAAAACCCGA
AAACAACCTGCTTTTGGAAATCTTCAAGGCATTCATTGCTGCCTTCGTGAAGTTTTGGTCCGATCCAGTCGGTCCCATCATCGATATTTATGAGGACAAAACTGGATTCT
GA
mRNA sequenceShow/hide mRNA sequence
ATGATCATTCCAGTGCGTTGCTTCACTTGCGGAAAGGTGATTGGAAACAAATGGGATCATTACCTTGATCTTCTACAGGCAGACTATTCTGAAGGGGATGCACTTGATGC
TCTGGGCTTGGTTCGCTACTGCTGCAGGCGAATGCTCATGACTCATGTTGATCTTATTGAGAAGCTTCTCAATTACAACAGTTTGTTGAACTTTCTCGATTTCATGCATG
CTATAAAAGGTGGGTGGACGGGGCGTCCTCTTGCCCTAGCCAAGAACAATGAGCCTGAAGGGAGGAAGACAAGAATTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTT
GAAGTCTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGTTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTGGATCTTTCTATACGGTACGAGAGATTGTACG
TGATATAATCCAAGAAAACAGAGTCCTTGGTCCGGGAAAGTTGTTATTAGAAGAGCACAGCACTGATCATTTACTTGAAGAGAATCCACTCCACTCAATTGCTATTGAAC
CTCAATCTCCTTTAACCGTATCGTCTAAGGAATTCTACTCTCCAGTTGACTACTACAACCAATATATAAATGAAGAATCAATCATTGTTTCAGATGAGCAATGCACTGCA
ACAAATATTCAGGGATCACAGAATGGGCCAATAATTAATGGCAGCCTGGTGGATGTAAGGGACAAGGATTCTGTTGAATTTATCAAGCCAGAGTTGCTAGTAAATGAACA
CAAGAAAGTAGAGGAAGTGATGAAAGAGGAATCAGGAATGCCAATTAATCATGTAAGTCCTTTGGCAACAGATGTTGTGGTAGAGACATTCCCATTGGATTCAGTTTCTT
GGGGTGTTAATGGTTCAGATGTAAGATCTGAAATGTTGATTTCAACCAGTGCCTCAGAAAAGCAAGTTAGTCAAACCATTGAGTTAGAATCAGATGTTGGCTTGTTTAAC
ATTAAAGCTTCTGGCTGTGTAGTTGAGAAAGCAGAGAAAAACTTTGCAGGTCCATTATCAGAAACAAAGTCTGACTTGGTGGAGGTAGCACAAATTGTTGAAACATCTAA
TGGATCTACTGTGAAAGAAGGTATCATATATGAAGTTGGGGGTCCTGAGTTGGAAGTTTGCAGTGATAATCCAATATCTGTGACCTTTGAACAAGGCCAGAAATCTAGTG
AAATGAAGGCTCCAAATGCTTCTCCAAGTACTATTGAGAGTCTCAACAAGACATTCAGCAATGGCTTTGATCAGGCCTCAAAAATCAAAGAGGAGACAGAGATGGAAAAT
AAAGTAGATGCTGAACAAACTGGTGGCTCCCAGAAAGAAAGCATTCCAACTTTAAACAGAATTAATCTTGAATCATGGGAAGGGATGTCCAAAAACTCGTCAAAACCCGA
AAACAACCTGCTTTTGGAAATCTTCAAGGCATTCATTGCTGCCTTCGTGAAGTTTTGGTCCGATCCAGTCGGTCCCATCATCGATATTTATGAGGACAAAACTGGATTCT
GA
Protein sequenceShow/hide protein sequence
MIIPVRCFTCGKVIGNKWDHYLDLLQADYSEGDALDALGLVRYCCRRMLMTHVDLIEKLLNYNSLLNFLDFMHAIKGGWTGRPLALAKNNEPEGRKTRIRRSKEERKAMV
EVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKLLLEEHSTDHLLEENPLHSIAIEPQSPLTVSSKEFYSPVDYYNQYINEESIIVSDEQCTA
TNIQGSQNGPIINGSLVDVRDKDSVEFIKPELLVNEHKKVEEVMKEESGMPINHVSPLATDVVVETFPLDSVSWGVNGSDVRSEMLISTSASEKQVSQTIELESDVGLFN
IKASGCVVEKAEKNFAGPLSETKSDLVEVAQIVETSNGSTVKEGIIYEVGGPELEVCSDNPISVTFEQGQKSSEMKAPNASPSTIESLNKTFSNGFDQASKIKEETEMEN
KVDAEQTGGSQKESIPTLNRINLESWEGMSKNSSKPENNLLLEIFKAFIAAFVKFWSDPVGPIIDIYEDKTGF