; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014918 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014918
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein gar2
Genome locationChr02:21819903..21822589
RNA-Seq ExpressionHG10014918
SyntenyHG10014918
Gene Ontology termsGO:0003723 - RNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR034361 - PHIP1, RNA recognition motif 1
IPR034362 - PHIP1, RNA recognition motif 2
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461639.1 PREDICTED: RNA-binding protein CP31B, chloroplastic [Cucumis melo]7.8e-17185.28Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRK
        MVLSNKKLKQKLREKLAESLISSVAGKD+N DVSGE DTES R+SLKELLGTAS HGPRLSKREKRRESLILT  DGN +EKKE+EN+GLGE KREKKRK
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRK

Query:  RDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQ--KKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDI
        R+EGVKEKNAVDGLE+D EKAKKLK KKKQQ  KKKKKSNKK  NGEEEKEK+  GDV+GN+VKGQVEET+YNIGSEFNENLATKVYVGGIPYYSTEDDI
Subjt:  RDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQ--KKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDI

Query:  CSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFE
        CSFFESCGTITE+DCMKFPESGKFRGIAILSFKTEAAAKRALA DGADMGGLFLKVQPYKGTR NRA DFSPG+VEGYNRIY+GNLSWDVTEDDLKKLF 
Subjt:  CSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFE

Query:  NCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEA
        NC I SIRFGMDKETGEFRGYAHVDFSDS+SLKTALKLDQ+ IH RPVKIRCAVPKKGTEK G  AA  A       E HPE + EP PE KEA
Subjt:  NCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEA

XP_022139342.1 protein gar2 [Momordica charantia]1.4e-15680.6Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWD-GNTDEKKEDENRGLGETKREKKR
        MVLSNKKLKQKLREKLAESLI++VAGKDS GDVSGEPD+ESR QSL+ELLG  S HGPRLSKREKRRESL L A D  N +EKKEDEN+ LG TKRE KR
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWD-GNTDEKKEDENRGLGETKREKKR

Query:  KRDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQKKK-KKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDI
        KRDE   EK+A+DG  E+NEKAKKLKKKKK++KKK KKSNKKA NGEEEKEK+  G   GNEV+G +EET+ N+GS   EN+ATKVYVGGIPYYSTEDDI
Subjt:  KRDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQKKK-KKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDI

Query:  CSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFE
        CSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALA DGADMGGLFLKVQPYKGTR N+AVDFSPGI+EGYNRIYLGNLSWDVTEDDLKKLF 
Subjt:  CSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFE

Query:  NCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHP--------ETHPEPKPEMK
        NC IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQ+TIH RPVKIRCAVPKKGTE RGA     AAAA+TH +  P        ETHP+P+P MK
Subjt:  NCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHP--------ETHPEPKPEMK

Query:  EA
        EA
Subjt:  EA

XP_023517053.1 protein gar2 [Cucurbita pepo subsp. pepo]1.6e-15578.55Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDG-NTDEKKEDENRGLGETKREKKR
        MVLSNKKLKQK REKLAESLISS+AGK SNGDVSGEP+++SR QSLKELLG   RHG RLSKREKRRES+I  A DG N +EKKE EN+ LGE KREKKR
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDG-NTDEKKEDENRGLGETKREKKR

Query:  KRDEGVKEKNAVDGLEEDNEKAKKL--------KKKKKQQKKKKKS--NKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGI
        KRDE VKEK+AVDGLEED+EKAKKL        KKKKK+QKK+KK   N+KAKNGEEEKEK+  G   GNEV GQVEET+ NIGS+ NEN+ATKVYVGGI
Subjt:  KRDEGVKEKNAVDGLEEDNEKAKKL--------KKKKKQQKKKKKS--NKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGI

Query:  PYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVT
        PYYSTEDDICSFFESCGTITEVDCM FPESGKFRGIAILSFKTEAAAKRALA DGADMGGLFLKVQPYKGTR+N+AVDFSPGIVEGYNRIYLGNLSWDVT
Subjt:  PYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVT

Query:  EDDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAA-------VAAAAAKTHPETHP---
        EDDLKKLF NC IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQ+TIH RPVKIRCAVPK+GT + G AAA        AAAA    P   P   
Subjt:  EDDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAA-------VAAAAAKTHPETHP---

Query:  --ETHPEPKPEMKEA
          ET P+P PEM  A
Subjt:  --ETHPEPKPEMKEA

XP_031742307.1 protein gar2 [Cucumis sativus]2.4e-16782.88Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRK
        MVLSNKKLKQKLREKLAESLISSVAG+D+N  VSGE DTES R+SLKELLGTAS +GPRLSKREKRRESL+LT  DGN  EKKEDEN+GLG    EKKRK
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRK

Query:  RDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQK----KKKKSNKKAKNGEE------EKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIP
        R+EGVKEKNAVDGLEEDNEKAKKLK KKKQQ+    KKKKSNKK  NGEE      EKEKEK+GDV+GN+VKGQVEET+YNIGSEF+ENLATKVYVGGIP
Subjt:  RDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQK----KKKKSNKKAKNGEE------EKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIP

Query:  YYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTE
        YYSTEDDICSFFESCGTITE+DCMKFPESGKFRGIAILSFKTEAAAKRALA DGADMGGLFLKVQPYKGTR NRA DFSPG+VEGYNRIY+GNLSWDVTE
Subjt:  YYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTE

Query:  DDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMK
        DDLKKLF NC IASIRFGMDKETGEFRGYAHVDFSD +SLKTALKLDQK IH RPVKIRCAVPKKGTE  G  A   A       ETHPE +PEP PE K
Subjt:  DDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMK

Query:  EAA
        EAA
Subjt:  EAA

XP_038891335.1 protein gar2 [Benincasa hispida]1.3e-18189.14Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRK
        MVLSNKKLK+KLREKLAESLISSVAGKDSNGDVSGEPDTESRR+SLKELLGT  RHGPRLSKREKRRESLILTA DGN DEKKED NRGLGETKREKKRK
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRK

Query:  RDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQ-KKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDIC
        RDEG KEKN VDGLEE+NEKAKKLKKKKKQQ KKKKKSNKKAKNGEEE+EK+  G+V+GNEVK QVEET+YNIGSEFNENLATKVYVGGIPYYSTEDDIC
Subjt:  RDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQ-KKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDIC

Query:  SFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFEN
         FFESCGTITE+DCMKFPESGKFRGIAIL+FKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSP IVEGYNRIYLGNLSWDVTEDDLKKLF N
Subjt:  SFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFEN

Query:  CNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEAAET
        C IASIRFGMDKETGEFRGYAHVDFSD+VSLKTALKLDQ  IH RPVKIRCAVPKKGTE+RG  AA AAA A    ETH E HP+P PE KEAAET
Subjt:  CNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEAAET

TrEMBL top hitse value%identityAlignment
A0A0A0KLT8 Uncharacterized protein7.2e-16273.73Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRK
        MVLSNKKLKQKLREKLAESLISSVAG+D+N  VSGE DTES R+SLKELLGTAS +GPRLSKREKRRESL+LT  DGN  EKKEDEN+GLG    EKKRK
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRK

Query:  RDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQK----KKKKSNKKAKNGEE--------------------------------------------------
        R+EGVKEKNAVDGLEEDNEKAKKLK KKKQQ+    KKKKSNKK  NGEE                                                  
Subjt:  RDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQK----KKKKSNKKAKNGEE--------------------------------------------------

Query:  ------EKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRAL
              EKEKEK+GDV+GN+VKGQVEET+YNIGSEF+ENLATKVYVGGIPYYSTEDDICSFFESCGTITE+DCMKFPESGKFRGIAILSFKTEAAAKRAL
Subjt:  ------EKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRAL

Query:  AWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKT
        A DGADMGGLFLKVQPYKGTR NRA DFSPG+VEGYNRIY+GNLSWDVTEDDLKKLF NC IASIRFGMDKETGEFRGYAHVDFSD +SLKTALKLDQK 
Subjt:  AWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKT

Query:  IHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEAA
        IH RPVKIRCAVPKKGTE  G  A   A       ETHPE +PEP PE KEAA
Subjt:  IHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEAA

A0A1S3CF29 RNA-binding protein CP31B, chloroplastic3.8e-17185.28Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRK
        MVLSNKKLKQKLREKLAESLISSVAGKD+N DVSGE DTES R+SLKELLGTAS HGPRLSKREKRRESLILT  DGN +EKKE+EN+GLGE KREKKRK
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRK

Query:  RDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQ--KKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDI
        R+EGVKEKNAVDGLE+D EKAKKLK KKKQQ  KKKKKSNKK  NGEEEKEK+  GDV+GN+VKGQVEET+YNIGSEFNENLATKVYVGGIPYYSTEDDI
Subjt:  RDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQ--KKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDI

Query:  CSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFE
        CSFFESCGTITE+DCMKFPESGKFRGIAILSFKTEAAAKRALA DGADMGGLFLKVQPYKGTR NRA DFSPG+VEGYNRIY+GNLSWDVTEDDLKKLF 
Subjt:  CSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFE

Query:  NCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEA
        NC I SIRFGMDKETGEFRGYAHVDFSDS+SLKTALKLDQ+ IH RPVKIRCAVPKKGTEK G  AA  A       E HPE + EP PE KEA
Subjt:  NCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEA

A0A6J1CDP4 protein gar26.9e-15780.6Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWD-GNTDEKKEDENRGLGETKREKKR
        MVLSNKKLKQKLREKLAESLI++VAGKDS GDVSGEPD+ESR QSL+ELLG  S HGPRLSKREKRRESL L A D  N +EKKEDEN+ LG TKRE KR
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWD-GNTDEKKEDENRGLGETKREKKR

Query:  KRDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQKKK-KKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDI
        KRDE   EK+A+DG  E+NEKAKKLKKKKK++KKK KKSNKKA NGEEEKEK+  G   GNEV+G +EET+ N+GS   EN+ATKVYVGGIPYYSTEDDI
Subjt:  KRDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQKKK-KKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDI

Query:  CSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFE
        CSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALA DGADMGGLFLKVQPYKGTR N+AVDFSPGI+EGYNRIYLGNLSWDVTEDDLKKLF 
Subjt:  CSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFE

Query:  NCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHP--------ETHPEPKPEMK
        NC IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQ+TIH RPVKIRCAVPKKGTE RGA     AAAA+TH +  P        ETHP+P+P MK
Subjt:  NCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHP--------ETHPEPKPEMK

Query:  EA
        EA
Subjt:  EA

A0A6J1FSC3 protein gar27.2e-15479.29Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDG-NTDEKKEDENRGLGETKREKKR
        MVLSNKKLKQK REKLAESLISS+AGK SNGDVSGEP+++SR QSLKELLG   RHG RLSKREKRRES+I TA DG N +EKKE EN+ LGE K EKKR
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDG-NTDEKKEDENRGLGETKREKKR

Query:  KRDEGVKEKNAVDGLEEDNEKAK--KLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDD
        KR+E VKEK+AVDGLEED+EKAK  KLKK K  +KKKKK  K+ K    E+EKEK+G   GNEV GQVEET+ NIGS+ NEN+ATKVYVGGIPYYSTEDD
Subjt:  KRDEGVKEKNAVDGLEEDNEKAK--KLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDD

Query:  ICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF
        ICSFFESCGTITEVDCM FPESGKFRGIAILSFKTEAAAKRALA DGADMGGLFLKVQPYKGTR+N+AVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF
Subjt:  ICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF

Query:  ENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEAA
         NC IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQ+TIH RPVKIRCAVPK+GT + G AAA AA  A T     P   P P P    AA
Subjt:  ENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEAA

A0A6J1JA04 protein gar2-like1.6e-15377.59Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDG-NTDEKKEDENRGLGETKREKKR
        MVLSNKKLKQK REKLAESLISS+AG+ SNGDVSGEP+++SR QSLKELLG   RHG RLSKREKRRES I TA DG N +EKKE EN+ LGE K EKKR
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDG-NTDEKKEDENRGLGETKREKKR

Query:  KRDEGVKEKNAVDGLEEDNEKAKKL-------KKKKKQQK-KKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPY
        KR+E VKEK+AVDGLEED+EKAKKL       KKKKKQ+K KK+KSN+KAKN EEEKEK+  G   GNEV GQVEET+ NIGS+ NEN+ATKVYVGGIPY
Subjt:  KRDEGVKEKNAVDGLEEDNEKAKKL-------KKKKKQQK-KKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPY

Query:  YSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTED
        YSTEDDICSFFESCGTITEVDCM FPESGKFRGIAILSFKTEAAAKRALA DGADMGGLFLKVQPYKGTR+N+AVDFSPGIVEGYNRIYLGNLSWDVTED
Subjt:  YSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTED

Query:  DLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVA------AAAAKTHPETHP------
        DLKKLF +C IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQ+TIH RPVKIRCAVPK+GT + G AAA          AA T P   P      
Subjt:  DLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVA------AAAAKTHPETHP------

Query:  --ETHPEPKPEMKEA
          ET P+P PEM  A
Subjt:  --ETHPEPKPEMKEA

SwissProt top hitse value%identityAlignment
P07909 Heterogeneous nuclear ribonucleoprotein A14.1e-1327.03Show/hide
Query:  KVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFK-----TEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYN
        K+++GG+ Y +T++++ + FE  G I +V  MK P + + RG   +++       EA   R    DG        +V   K     + +D SP       
Subjt:  KVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFK-----TEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYN

Query:  RIYLGNLSWDVTEDDLKKLFENC-NIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRG
        ++++G L  D  E  ++  F++  NI  I   +DKETG+ RG+A V+F D   +   +   Q  ++ + V ++ A+PK+  ++ G
Subjt:  RIYLGNLSWDVTEDDLKKLFENC-NIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAVPKKGTEKRG

P27476 Nuclear localization sequence-binding protein4.5e-0421.51Show/hide
Query:  KLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRKRDEGVK
        K  ++ +E+ A+++ SS +   S+   S E ++ES  +S      ++S                     D  +      ++    ETK+E+ +       
Subjt:  KLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRKRDEGVK

Query:  EKNAVDGLEEDNEKAKKLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDICSFFESCG
        + ++ +  EE+ E+ KK + K+        S+      E+E+  +KK      E +   E +     +E  E  AT ++VG + +   ++ +   FE  G
Subjt:  EKNAVDGLEEDNEKAKKLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDICSFFESCG

Query:  TITEVDCMKFPESGKFRGIAILSFKTEAAAKRAL-AWDGADMGG--LFLKVQPYKGTRTN-RAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCN
         +     +    + + RG   + F+ ++ A++A+    G ++ G  +   +   K    N RA  F     E  + ++LGNLS++   D + +LF ++  
Subjt:  TITEVDCMKFPESGKFRGIAILSFKTEAAAKRAL-AWDGADMGG--LFLKVQPYKGTRTN-RAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCN

Query:  IASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLDQKTIHDRPVKIRCAVPKKGTE
        + S+R     ET + +G+ +V FS+    K AL  L  + I +RPV++  + P+   +
Subjt:  IASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLDQKTIHDRPVKIRCAVPKKGTE

Q04836 31 kDa ribonucleoprotein, chloroplastic5.9e-1227.85Show/hide
Query:  QQKKKKKSNKKAKNGEEEKEKEKKGDVA-GNEVKGQVEETYYNIGSEFNE-NLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAI
        ++ +    ++    G+E +    +GDV+ G+E +G V E   +  +EF E +   K++VG + Y      +   FE  GT+   + +   E+ + RG   
Subjt:  QQKKKKKSNKKAKNGEEEKEKEKKGDVA-GNEVKGQVEETYYNIGSEFNE-NLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAI

Query:  LSFKTEAAAKRAL-AWDGADMGGLFLKVQPY--KGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCNIASIRFGMDKETGEFRGYAHVD
        ++  +   A+ A+  ++  D+ G  L V     +G+R  RA    P + E   R+Y+GNL WDV    L++LF E+  +   R   D+ETG  RG+  V 
Subjt:  LSFKTEAAAKRAL-AWDGADMGGLFLKVQPY--KGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCNIASIRFGMDKETGEFRGYAHVD

Query:  FSDSVSLKTALK-LDQKTIHDRPVKIRCA---VPKKG
         SD   L  A+  LD + +  R +++  A    P++G
Subjt:  FSDSVSLKTALK-LDQKTIHDRPVKIRCA---VPKKG

Q9FGS0 RNA-binding protein CP31B, chloroplastic2.6e-1228.3Show/hide
Query:  EKEKEKKGDVAGNEVKGQVEETYYN-IGSEFNE-NLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKT-EAAAKRALAWD
        E+E+ + G + G  V   V+E++ +  G  F E     K++VG +PY      +   FE  GT+   + +   ++ + RG   ++  T E A K    ++
Subjt:  EKEKEKKGDVAGNEVKGQVEETYYN-IGSEFNE-NLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKT-EAAAKRALAWD

Query:  GADMGGLFLKVQ--PYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLDQK
          ++ G  L V     +G+R  R     P + +   RIY+GNL WDV    L++LF E+  +   R   D+ETG  RG+  V  S+   +  A+  LD +
Subjt:  GADMGGLFLKVQ--PYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLDQK

Query:  TIHDRPVKIRCA
         +  R +K+  A
Subjt:  TIHDRPVKIRCA

Q9M3B8 Phragmoplastin interacting protein 12.4e-6950Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDE--NRGLGE---TKR
        MVLSNKKLKQ++R+ LAESL  SV+            +T  + QSLK LL ++S H PRLSKREKRR        D   DE +E+E  N G  E   TK 
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDE--NRGLGE---TKR

Query:  EKKRKRDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLA-TKVYVGGIPYYST
        +KKRKRD+ V E + ++G E   E+ K  KKK K++KKK+K NK  K  EE       G+V   E K +VEE   N  ++  + +   K+YVGGIPY ST
Subjt:  EKKRKRDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLA-TKVYVGGIPYYST

Query:  EDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGT------RTNRAVDFSPGIVEGYNRIYLGNLSWDV
        ED+I S+F SCG I +VDC   PE G F GIA ++F TE  AKRALA+D A MG  +L +Q Y  T      R   +  F+P +V+GYNR+Y+GNL+WD 
Subjt:  EDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGT------RTNRAVDFSPGIVEGYNRIYLGNLSWDV

Query:  TEDDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAV
        TE D++KLF +C I S+R G +KETGEF+GYAHVDF DSVS+  ALKLDQ+ I  RPVKI CA+
Subjt:  TEDDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAV

Arabidopsis top hitse value%identityAlignment
AT2G16940.1 Splicing factor, CC1-like4.6e-0422.68Show/hide
Query:  NKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRKRDEG
        ++K+K + +E+   S       K+ + D  G     SR         +  R   R  +R++ R S      D   D  KE+ N    E  R+K R     
Subjt:  NKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRKRDEG

Query:  VKEKNAVDGLEEDNEKAKKLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDICSFFES
          EK+        +E+ +  +++K  + + K+   K ++ +  + K+KK D    E   + ++                V+   I   +TE D+  FF  
Subjt:  VKEKNAVDGLEEDNEKAKKLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDICSFFES

Query:  CGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKG----TRTNRAVDFSPGIVEGYN----RIYLGNLSWDVTEDDLKKL
         G + +V  +    S + RGI  + F    +   A+A  G  + G  + V+P +      ++  A   + G++  Y+    R+Y+GNL  +++EDDL+K+
Subjt:  CGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKG----TRTNRAVDFSPGIVEGYN----RIYLGNLSWDVTEDDLKKL

Query:  FENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLD-QKTIHDRPVKIRCAVPKKGTEKRG
        FE+     +      ETG  +G+  V F+     + AL L+ Q  I  R +K+     +    + G
Subjt:  FENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLD-QKTIHDRPVKIRCAVPKKGTEKRG

AT3G52380.1 chloroplast RNA-binding protein 336.7e-1123.72Show/hide
Query:  EEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRAL-AWD
        EEE+E E++GD    EV+ + + T  + G E       ++YVG +PY  T  ++   F   GT+ +V  +    + + RG   ++  +   AK A+  ++
Subjt:  EEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRAL-AWD

Query:  GADMGGLFLKVQ----PYKG---TRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALK
         + +GG  +KV     P  G       +  D +   V+  +++Y GNL W++T   LK  F +   +   +   ++ TG  RG+  + F  + ++++AL 
Subjt:  GADMGGLFLKVQ----PYKG---TRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALK

Query:  LDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPE
                            G E  G A  +  A+ +  P   P +  E + E
Subjt:  LDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPE

AT3G55340.1 phragmoplastin interacting protein 11.7e-7050Show/hide
Query:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDE--NRGLGE---TKR
        MVLSNKKLKQ++R+ LAESL  SV+            +T  + QSLK LL ++S H PRLSKREKRR        D   DE +E+E  N G  E   TK 
Subjt:  MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDE--NRGLGE---TKR

Query:  EKKRKRDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLA-TKVYVGGIPYYST
        +KKRKRD+ V E + ++G E   E+ K  KKK K++KKK+K NK  K  EE       G+V   E K +VEE   N  ++  + +   K+YVGGIPY ST
Subjt:  EKKRKRDEGVKEKNAVDGLEEDNEKAKKLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLA-TKVYVGGIPYYST

Query:  EDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGT------RTNRAVDFSPGIVEGYNRIYLGNLSWDV
        ED+I S+F SCG I +VDC   PE G F GIA ++F TE  AKRALA+D A MG  +L +Q Y  T      R   +  F+P +V+GYNR+Y+GNL+WD 
Subjt:  EDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGT------RTNRAVDFSPGIVEGYNRIYLGNLSWDV

Query:  TEDDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAV
        TE D++KLF +C I S+R G +KETGEF+GYAHVDF DSVS+  ALKLDQ+ I  RPVKI CA+
Subjt:  TEDDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQKTIHDRPVKIRCAV

AT4G24770.1 31-kDa RNA binding protein4.2e-1327.85Show/hide
Query:  QQKKKKKSNKKAKNGEEEKEKEKKGDVA-GNEVKGQVEETYYNIGSEFNE-NLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAI
        ++ +    ++    G+E +    +GDV+ G+E +G V E   +  +EF E +   K++VG + Y      +   FE  GT+   + +   E+ + RG   
Subjt:  QQKKKKKSNKKAKNGEEEKEKEKKGDVA-GNEVKGQVEETYYNIGSEFNE-NLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAI

Query:  LSFKTEAAAKRAL-AWDGADMGGLFLKVQPY--KGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCNIASIRFGMDKETGEFRGYAHVD
        ++  +   A+ A+  ++  D+ G  L V     +G+R  RA    P + E   R+Y+GNL WDV    L++LF E+  +   R   D+ETG  RG+  V 
Subjt:  LSFKTEAAAKRAL-AWDGADMGGLFLKVQPY--KGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCNIASIRFGMDKETGEFRGYAHVD

Query:  FSDSVSLKTALK-LDQKTIHDRPVKIRCA---VPKKG
         SD   L  A+  LD + +  R +++  A    P++G
Subjt:  FSDSVSLKTALK-LDQKTIHDRPVKIRCA---VPKKG

AT5G50250.1 chloroplast RNA-binding protein 31B1.9e-1328.3Show/hide
Query:  EKEKEKKGDVAGNEVKGQVEETYYN-IGSEFNE-NLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKT-EAAAKRALAWD
        E+E+ + G + G  V   V+E++ +  G  F E     K++VG +PY      +   FE  GT+   + +   ++ + RG   ++  T E A K    ++
Subjt:  EKEKEKKGDVAGNEVKGQVEETYYN-IGSEFNE-NLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKT-EAAAKRALAWD

Query:  GADMGGLFLKVQ--PYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLDQK
          ++ G  L V     +G+R  R     P + +   RIY+GNL WDV    L++LF E+  +   R   D+ETG  RG+  V  S+   +  A+  LD +
Subjt:  GADMGGLFLKVQ--PYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLF-ENCNIASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLDQK

Query:  TIHDRPVKIRCA
         +  R +K+  A
Subjt:  TIHDRPVKIRCA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTGTCAAACAAAAAGTTGAAGCAGAAATTGAGAGAAAAGCTAGCCGAATCGTTAATTTCATCAGTCGCCGGAAAAGACTCCAACGGTGATGTTTCCGGTGAACC
GGATACAGAATCTCGGCGACAATCCCTCAAAGAGCTATTAGGCACCGCTAGTCGCCATGGACCCAGATTGTCCAAGCGAGAGAAACGAAGAGAATCGCTCATTTTGACAG
CTTGGGATGGGAATACGGATGAGAAGAAGGAAGATGAGAATCGGGGATTGGGAGAGACGAAGAGAGAGAAGAAGAGGAAGAGAGATGAAGGGGTGAAGGAGAAGAATGCG
GTTGATGGATTGGAAGAGGATAATGAGAAGGCCAAAAAGTTGAAGAAGAAAAAGAAGCAGCAGAAGAAGAAGAAGAAGAGTAATAAGAAGGCAAAGAATGGCGAGGAGGA
GAAGGAGAAGGAGAAGAAGGGAGATGTAGCTGGGAATGAAGTGAAAGGGCAGGTTGAAGAAACTTATTATAATATAGGCAGTGAATTTAATGAAAATTTGGCTACAAAAG
TGTATGTTGGAGGCATTCCATATTATTCAACCGAGGACGATATTTGTAGCTTTTTTGAAAGCTGTGGCACTATTACTGAAGTTGATTGTATGAAGTTTCCAGAGAGTGGG
AAGTTCAGAGGCATTGCAATATTGAGTTTCAAGACAGAAGCTGCAGCGAAACGAGCACTAGCCTGGGATGGGGCTGACATGGGGGGACTCTTCCTTAAAGTACAGCCCTA
CAAAGGAACTCGAACAAATAGAGCAGTTGATTTCTCTCCAGGAATTGTGGAAGGCTACAACAGAATCTATCTGGGGAATCTGTCGTGGGATGTAACTGAGGATGATCTGA
AGAAACTCTTTGAAAACTGTAACATTGCGTCGATACGTTTTGGCATGGACAAGGAAACAGGGGAGTTTCGTGGCTATGCCCATGTTGATTTCTCTGACAGTGTCTCGTTA
AAAACGGCTCTGAAGTTAGACCAGAAGACAATTCATGACAGACCCGTCAAGATAAGATGTGCAGTTCCAAAGAAAGGAACAGAAAAAAGAGGGGCAGCAGCTGCAGTAGC
AGCAGCAGCAGCAAAAACACATCCAGAAACACATCCAGAAACGCATCCAGAACCTAAGCCTGAAATGAAGGAAGCTGCTGAAACTGGAGCAGCAAAATTGGACTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTGTCAAACAAAAAGTTGAAGCAGAAATTGAGAGAAAAGCTAGCCGAATCGTTAATTTCATCAGTCGCCGGAAAAGACTCCAACGGTGATGTTTCCGGTGAACC
GGATACAGAATCTCGGCGACAATCCCTCAAAGAGCTATTAGGCACCGCTAGTCGCCATGGACCCAGATTGTCCAAGCGAGAGAAACGAAGAGAATCGCTCATTTTGACAG
CTTGGGATGGGAATACGGATGAGAAGAAGGAAGATGAGAATCGGGGATTGGGAGAGACGAAGAGAGAGAAGAAGAGGAAGAGAGATGAAGGGGTGAAGGAGAAGAATGCG
GTTGATGGATTGGAAGAGGATAATGAGAAGGCCAAAAAGTTGAAGAAGAAAAAGAAGCAGCAGAAGAAGAAGAAGAAGAGTAATAAGAAGGCAAAGAATGGCGAGGAGGA
GAAGGAGAAGGAGAAGAAGGGAGATGTAGCTGGGAATGAAGTGAAAGGGCAGGTTGAAGAAACTTATTATAATATAGGCAGTGAATTTAATGAAAATTTGGCTACAAAAG
TGTATGTTGGAGGCATTCCATATTATTCAACCGAGGACGATATTTGTAGCTTTTTTGAAAGCTGTGGCACTATTACTGAAGTTGATTGTATGAAGTTTCCAGAGAGTGGG
AAGTTCAGAGGCATTGCAATATTGAGTTTCAAGACAGAAGCTGCAGCGAAACGAGCACTAGCCTGGGATGGGGCTGACATGGGGGGACTCTTCCTTAAAGTACAGCCCTA
CAAAGGAACTCGAACAAATAGAGCAGTTGATTTCTCTCCAGGAATTGTGGAAGGCTACAACAGAATCTATCTGGGGAATCTGTCGTGGGATGTAACTGAGGATGATCTGA
AGAAACTCTTTGAAAACTGTAACATTGCGTCGATACGTTTTGGCATGGACAAGGAAACAGGGGAGTTTCGTGGCTATGCCCATGTTGATTTCTCTGACAGTGTCTCGTTA
AAAACGGCTCTGAAGTTAGACCAGAAGACAATTCATGACAGACCCGTCAAGATAAGATGTGCAGTTCCAAAGAAAGGAACAGAAAAAAGAGGGGCAGCAGCTGCAGTAGC
AGCAGCAGCAGCAAAAACACATCCAGAAACACATCCAGAAACGCATCCAGAACCTAAGCCTGAAATGAAGGAAGCTGCTGAAACTGGAGCAGCAAAATTGGACTGGTGA
Protein sequenceShow/hide protein sequence
MVLSNKKLKQKLREKLAESLISSVAGKDSNGDVSGEPDTESRRQSLKELLGTASRHGPRLSKREKRRESLILTAWDGNTDEKKEDENRGLGETKREKKRKRDEGVKEKNA
VDGLEEDNEKAKKLKKKKKQQKKKKKSNKKAKNGEEEKEKEKKGDVAGNEVKGQVEETYYNIGSEFNENLATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESG
KFRGIAILSFKTEAAAKRALAWDGADMGGLFLKVQPYKGTRTNRAVDFSPGIVEGYNRIYLGNLSWDVTEDDLKKLFENCNIASIRFGMDKETGEFRGYAHVDFSDSVSL
KTALKLDQKTIHDRPVKIRCAVPKKGTEKRGAAAAVAAAAAKTHPETHPETHPEPKPEMKEAAETGAAKLDW