CuGenDBv2

Moc09g34080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g34080
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein gar2
Genome location: chr9:26045205..26048004
RNA-Seq Expression: Moc09g34080
Synteny: Moc09g34080
Gene Ontology terms:
GO:0003723 - RNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000504 - RNA recognition motif domain
IPR001878 - Zinc finger, CCHC-type
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR034361 - PHIP1, RNA recognition motif 1
IPR034362 - PHIP1, RNA recognition motif 2
IPR035979 - RNA-binding domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
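
The genome location above is given as a chromosome name followed by a start..end range. A minimal Python sketch of how such a string could be split into its parts is shown here. It assumes the coordinates are 1-based and inclusive, which the page does not state; `Location` and `parse_location` are hypothetical names used only for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a string such as 'chr9:26045205..26048004' into its fields."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr9:26045205..26048004")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene span printed as the last value is 2800 bp.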


Homology
GenBank top hits (e value, % identity, alignment)
KAG7030736.1 hypothetical protein SDJN02_04773, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.5e-178; % identity: 78.94)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR
        MVLSNKKLKQK REKLAE+LI+++AGK S GDVSGEP+S+SRPQSL+ELLG    HG RLSKREKRRES+   ASD KN+EEKKE E+QRLG  K EKKR
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR

Query:  KRDEEGAEKSALDGSAENNEKAKKL---------KKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPY
        KR+EE  EKSA+DG  E++EKAKKL         KKKKK+KK+K +KSN+KA N EEEKEKQGGE GNEV G +EETHDN+GS V ENVATKVYVGGIPY
Subjt:  KRDEEGAEKSALDGSAENNEKAKKL---------KKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPY

Query:  YSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTED
        YSTEDDICSFFESCGTITEVDCM FPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTR NKAVDFSPGI+EGYNRIYLGNLSWDVTED
Subjt:  YSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTED

Query:  DLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAA------AETHSKPEPVGGGAAAAET
        DLKKLF+NCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPK+GT RG G A A          +   P+    AAAET
Subjt:  DLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAA------AETHSKPEPVGGGAAAAET

Query:  HPKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
         PKP P M  ADSGLS+VSGKIRRRTCYECGEKGHLSSNCP KQ+ +SVAS
Subjt:  HPKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS

XP_022139342.1 protein gar2 [Momordica charantia] (e value: 3.0e-232; % identity: 99.77)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR
        MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKRE KR
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR

Query:  KRDEEGAEKSALDGSAENNEKAKKLKKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSF
        KRDEEGAEKSALDGSAENNEKAKKLKKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSF
Subjt:  KRDEEGAEKSALDGSAENNEKAKKLKKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSF

Query:  FESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCK
        FESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCK
Subjt:  FESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCK

Query:  IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHSKPEPVGGGAAAAETHPKPEPVMKEADSGLS
        IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHSKPEPVGGGAAAAETHPKPEPVMKEADSGLS
Subjt:  IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHSKPEPVGGGAAAAETHPKPEPVMKEADSGLS

Query:  SVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
        SVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
Subjt:  SVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS

XP_022943192.1 protein gar2 [Cucurbita moschata] (e value: 2.5e-178; % identity: 79.56)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR
        MVLSNKKLKQK REKLAESLI+++AGK S GDVSGEP+S+SRPQSL+ELLG    HG RLSKREKRRES+   ASD KN+EEKKE ENQRLG  K EKKR
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR

Query:  KRDEEGAEKSALDGSAENNEKAKKLKKKK---KKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYSTEDD
        KR+EE  EKSA+DG  E++EKAKKLK KK    KKKKK +K  KK  + EEEKEKQGGE GNEV G +EETHDN+GS V ENVATKVYVGGIPYYSTEDD
Subjt:  KRDEEGAEKSALDGSAENNEKAKKLKKKK---KKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYSTEDD

Query:  ICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLF
        ICSFFESCGTITEVDCM FPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTR NKAVDFSPGI+EGYNRIYLGNLSWDVTEDDLKKLF
Subjt:  ICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLF

Query:  SNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHS-----------KPEPVGGGAAAAETH
        +NCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPK+GT RG G AAAA   +            P P     AAAET 
Subjt:  SNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHS-----------KPEPVGGGAAAAETH

Query:  PKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
        PKP P M  ADSGLS+VSGKIRRRTCYECGEKGHLSSNCP KQ+ +SVAS
Subjt:  PKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS

XP_022986056.1 protein gar2-like [Cucurbita maxima] (e value: 1.3e-179; % identity: 79.47)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR
        MVLSNKKLKQK REKLAESLI+++AG+ S GDVSGEP+S+SRPQSL+ELLG    HG RLSKREKRRES    ASD KN+EEKKE ENQRLG  K EKKR
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR

Query:  KRDEEGAEKSALDGSAENNEKAKKL-------KKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYS
        KR+EE  EKSA+DG  E++EKAKKL       KKKKK+KK+K +KSN+KA N EEEKEKQGGE GNEV G +EETHDN+GS V ENVATKVYVGGIPYYS
Subjt:  KRDEEGAEKSALDGSAENNEKAKKL-------KKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYS

Query:  TEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDL
        TEDDICSFFESCGTITEVDCM FPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTR NKAVDFSPGI+EGYNRIYLGNLSWDVTEDDL
Subjt:  TEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDL

Query:  KKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAA----------AAETHSKPEPVGGGAAAA
        KKLF++CKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPK+GT RG G AA          AA T   P P     AAA
Subjt:  KKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAA----------AAETHSKPEPVGGGAAAA

Query:  ETHPKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
        ET PKP P M  ADSGLS+VSGKIRRRTCYECGEKGHLSSNCP KQ+ +SVAS
Subjt:  ETHPKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS

XP_023517053.1 protein gar2 [Cucurbita pepo subsp. pepo] (e value: 9.0e-181; % identity: 79.69)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR
        MVLSNKKLKQK REKLAESLI+++AGK S GDVSGEP+S+SRPQSL+ELLG    HG RLSKREKRRES+   ASD KN+EEKKE ENQRLG  KREKKR
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR

Query:  KRDEEGAEKSALDGSAENNEKAKKL---------KKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPY
        KRDEE  EKSA+DG  E++EKAKKL         KKKKK+KK+K +K N+KA NGEEEKEKQGGE GNEV G +EETHDN+GS V ENVATKVYVGGIPY
Subjt:  KRDEEGAEKSALDGSAENNEKAKKL---------KKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPY

Query:  YSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTED
        YSTEDDICSFFESCGTITEVDCM FPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTR NKAVDFSPGI+EGYNRIYLGNLSWDVTED
Subjt:  YSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTED

Query:  DLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAA--------AAETHSKPEPVGGGAAAA
        DLKKLF+NC+IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPK+GT RG G AA        AA    +P P     A A
Subjt:  DLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAA--------AAETHSKPEPVGGGAAAA

Query:  ETHPKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
        ET PKP P M  ADSGLS+VSGKIRRRTCYECGEKGHLSSNCP KQ+ +SVAS
Subjt:  ETHPKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
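
The alignment blocks above follow the usual BLAST pairwise layout, in which the middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. As a rough sketch under that assumption, the identical positions in one block can be counted as below. `block_identity` is a hypothetical helper, and the reported % identity values are calculated over the whole alignment, so a single block will not reproduce them.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block.

    Assumes the BLAST convention that a letter in the middle line marks an
    identical residue; '+' and blanks are not counted as identities.
    """
    columns = len(query)
    # Trailing blanks are often stripped from the middle line, so pad it.
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    return identical, columns


# First 12 columns of the first GenBank alignment block on this page.
query_excerpt = "MVLSNKKLKQKL"
midline_excerpt = "MVLSNKKLKQK "

identical, columns = block_identity(query_excerpt, midline_excerpt)
print(f"{identical}/{columns} identical ({100.0 * identical / columns:.2f}%)")
```

To approximate a figure for a whole hit, the counts from every block of that hit would be summed before dividing.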

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLT8 Uncharacterized protein (e value: 1.3e-156; % identity: 67.54)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR
        MVLSNKKLKQKLREKLAESLI++VAG+D+   VSGE D+ES  +SL+ELLG  S +GPRLSKREKRRESL L  SD  N +EKKEDENQ LG    EKKR
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR

Query:  KRDEEGAEKSALDGSAENNEKAKKLKKKKK---KKKKKTKKSNKKADNGEE-------------------------------------------------
        KR+E   EK+A+DG  E+NEKAKKLK KKK   KKK K KKSNKK +NGEE                                                 
Subjt:  KRDEEGAEKSALDGSAENNEKAKKLKKKKK---KKKKKTKKSNKKADNGEE-------------------------------------------------

Query:  ---------EKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRA
                 EKEKQG   GN+V+G +EETH N+GS   EN+ATKVYVGGIPYYSTEDDICSFFESCGTITE+DCMKFPESGKFRGIAILSFKTEAAAKRA
Subjt:  ---------EKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRA

Query:  LAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQE
        LA DGADMGGLFLKVQPYKGTR N+A DFSPG++EGYNRIY+GNLSWDVTEDDLKKLF NCKIASIRFGMDKETGEFRGYAHVDFSD +SLKTALKLDQ+
Subjt:  LAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQE

Query:  TIHGRPVKIRCAVPKKGTERGAGGA-AAAETHSKPEPVGGGAAAAETHPKPEPVMKEAD-SGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
         IHGRPVKIRCAVPKKGTE G GGA A AETH             E +P+P P  KEA  S +S+VSGKIRRRTCYECGEKGHLSSNCP KQL +SV S
Subjt:  TIHGRPVKIRCAVPKKGTERGAGGA-AAAETHSKPEPVGGGAAAAETHPKPEPVMKEAD-SGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS

A0A1S3CF29 RNA-binding protein CP31B, chloroplastic (e value: 2.3e-166; % identity: 76.13)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR
        MVLSNKKLKQKLREKLAESLI++VAGKD+  DVSGE D+ES  +SL+ELLG  S HGPRLSKREKRRESL L  SD  N  EKKE+ENQ LG  KREKKR
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR

Query:  KRDEEGAEKSALDGSAENNEKAKKLK-KKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYSTEDDIC
        KR+E   EK+A+DG  ++ EKAKKLK KKK+++KKK KKSNKK +NGEEEKEKQG   GN+V+G +EETH N+GS   EN+ATKVYVGGIPYYSTEDDIC
Subjt:  KRDEEGAEKSALDGSAENNEKAKKLK-KKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYSTEDDIC

Query:  SFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSN
        SFFESCGTITE+DCMKFPESGKFRGIAILSFKTEAAAKRALA DGADMGGLFLKVQPYKGTR N+A DFSPG++EGYNRIY+GNLSWDVTEDDLKKLFSN
Subjt:  SFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSN

Query:  CKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHSKPEPVGGGAAAAETHP----KPEPVMKE
        CKI SIRFGMDKETGEFRGYAHVDFSDS+SLKTALKLDQE IHGRPVKIRCAVPKKGTE+G GGAAA              A  E HP    +P P  KE
Subjt:  CKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHSKPEPVGGGAAAAETHP----KPEPVMKE

Query:  ---ADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
           A+S ++SVSGKIRRRTCYECGEKGHLSSNCP KQ  +SV S
Subjt:  ---ADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS

A0A6J1CDP4 protein gar2 (e value: 1.4e-232; % identity: 99.77)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR
        MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKRE KR
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR

Query:  KRDEEGAEKSALDGSAENNEKAKKLKKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSF
        KRDEEGAEKSALDGSAENNEKAKKLKKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSF
Subjt:  KRDEEGAEKSALDGSAENNEKAKKLKKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSF

Query:  FESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCK
        FESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCK
Subjt:  FESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCK

Query:  IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHSKPEPVGGGAAAAETHPKPEPVMKEADSGLS
        IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHSKPEPVGGGAAAAETHPKPEPVMKEADSGLS
Subjt:  IASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHSKPEPVGGGAAAAETHPKPEPVMKEADSGLS

Query:  SVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
        SVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
Subjt:  SVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS

A0A6J1FSC3 protein gar2 (e value: 1.2e-178; % identity: 79.56)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR
        MVLSNKKLKQK REKLAESLI+++AGK S GDVSGEP+S+SRPQSL+ELLG    HG RLSKREKRRES+   ASD KN+EEKKE ENQRLG  K EKKR
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR

Query:  KRDEEGAEKSALDGSAENNEKAKKLKKKK---KKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYSTEDD
        KR+EE  EKSA+DG  E++EKAKKLK KK    KKKKK +K  KK  + EEEKEKQGGE GNEV G +EETHDN+GS V ENVATKVYVGGIPYYSTEDD
Subjt:  KRDEEGAEKSALDGSAENNEKAKKLKKKK---KKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYSTEDD

Query:  ICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLF
        ICSFFESCGTITEVDCM FPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTR NKAVDFSPGI+EGYNRIYLGNLSWDVTEDDLKKLF
Subjt:  ICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLF

Query:  SNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHS-----------KPEPVGGGAAAAETH
        +NCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPK+GT RG G AAAA   +            P P     AAAET 
Subjt:  SNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHS-----------KPEPVGGGAAAAETH

Query:  PKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
        PKP P M  ADSGLS+VSGKIRRRTCYECGEKGHLSSNCP KQ+ +SVAS
Subjt:  PKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS

A0A6J1JA04 protein gar2-like (e value: 6.3e-180; % identity: 79.47)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR
        MVLSNKKLKQK REKLAESLI+++AG+ S GDVSGEP+S+SRPQSL+ELLG    HG RLSKREKRRES    ASD KN+EEKKE ENQRLG  K EKKR
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKR

Query:  KRDEEGAEKSALDGSAENNEKAKKL-------KKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYS
        KR+EE  EKSA+DG  E++EKAKKL       KKKKK+KK+K +KSN+KA N EEEKEKQGGE GNEV G +EETHDN+GS V ENVATKVYVGGIPYYS
Subjt:  KRDEEGAEKSALDGSAENNEKAKKL-------KKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEG-LEETHDNVGSVVIENVATKVYVGGIPYYS

Query:  TEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDL
        TEDDICSFFESCGTITEVDCM FPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTR NKAVDFSPGI+EGYNRIYLGNLSWDVTEDDL
Subjt:  TEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDL

Query:  KKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAA----------AAETHSKPEPVGGGAAAA
        KKLF++CKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPK+GT RG G AA          AA T   P P     AAA
Subjt:  KKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGGAA----------AAETHSKPEPVGGGAAAA

Query:  ETHPKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
        ET PKP P M  ADSGLS+VSGKIRRRTCYECGEKGHLSSNCP KQ+ +SVAS
Subjt:  ETHPKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS

SwissProt top hits (e value, % identity, alignment)
P07909 Heterogeneous nuclear ribonucleoprotein A1 (e value: 1.1e-14; % identity: 27.81)
Query:  KVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFK-----TEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYN
        K+++GG+ Y +T++++ + FE  G I +V  MK P + + RG   +++       EA   R   +DG        +V   K     + +D SP       
Subjt:  KVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFK-----TEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYN

Query:  RIYLGNLSWDVTEDDLKKLFSNC-KIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGG
        ++++G L  D  E  ++  F +   I  I   +DKETG+ RG+A V+F D   +   +   Q  ++G+ V ++ A+PK+  ++G GG
Subjt:  RIYLGNLSWDVTEDDLKKLFSNC-KIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKKGTERGAGG

Q04836 31 kDa ribonucleoprotein, chloroplastic (e value: 3.2e-11; % identity: 27.43)
Query:  KKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENV-------ATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAI
        ++T+ S +  D  E ++ +     G+  EG E   D     V E           K++VG + Y      +   FE  GT+   + +   E+ + RG   
Subjt:  KKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENV-------ATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAI

Query:  LSFKTEAAAKRAL-AMDGADMGGLFLKVQPY--KGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFS-NCKIASIRFGMDKETGEFRGYAHVD
        ++  +   A+ A+   +  D+ G  L V     +G+R  +A    P + E   R+Y+GNL WDV    L++LFS + K+   R   D+ETG  RG+  V 
Subjt:  LSFKTEAAAKRAL-AMDGADMGGLFLKVQPY--KGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFS-NCKIASIRFGMDKETGEFRGYAHVD

Query:  FSDSVSLKTALK-LDQETIHGRPVKIRCA---VPKKG
         SD   L  A+  LD + + GR +++  A    P++G
Subjt:  FSDSVSLKTALK-LDQETIHGRPVKIRCA---VPKKG

Q39061 RNA-binding protein CP33, chloroplastic (e value: 5.8e-13; % identity: 25.7)
Query:  EEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAM-DGAD
        EEE E++G EG  EVE  ++T    G         ++YVG +PY  T  ++   F   GT+ +V  +    + + RG   ++  +   AK A+ M + + 
Subjt:  EEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAM-DGAD

Query:  MGGLFLKVQ----PYKG---TRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCK-IASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLD
        +GG  +KV     P  G       K  D +   ++  +++Y GNL W++T   LK  F +   +   +   ++ TG  RG+  + F  + ++++AL  ++
Subjt:  MGGLFLKVQ----PYKG---TRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCK-IASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLD

Query:  QETIHGRPVKIRCA
           + GR +++  A
Subjt:  QETIHGRPVKIRCA

Q9FGS0 RNA-binding protein CP31B, chloroplastic (e value: 4.0e-14; % identity: 29.67)
Query:  EEEKEKQGGEGGNEVEGLE--ETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKT-EAAAKRALAMDG
        EEE+ + G  GG  V   E  E+ D VG       A K++VG +PY      +   FE  GT+   + +   ++ + RG   ++  T E A K     + 
Subjt:  EEEKEKQGGEGGNEVEGLE--ETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKT-EAAAKRALAMDG

Query:  ADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFS-NCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLDQETIH
         ++ G  L V   +        +  P + +   RIY+GNL WDV    L++LFS + K+   R   D+ETG  RG+  V  S+   +  A+  LD + + 
Subjt:  ADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFS-NCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLDQETIH

Query:  GRPVKIRCA
        GR +K+  A
Subjt:  GRPVKIRCA

Q9M3B8 Phragmoplastin interacting protein 1 (e value: 3.7e-76; % identity: 45.84)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNE-EKKEDENQRLGGTKREKK
        MVLSNKKLKQ++R+ LAESL  +V+            ++  + QSL+ LL + S H PRLSKREKRR   T    D++  E E     +     TK +KK
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNE-EKKEDENQRLGGTKREKK

Query:  RKRDEEGAEKSALDG--SAENNEKAKKLKKKKKKKKKKTKKSNKKADNGE-EEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDD
        RKRD +  E   L+G    +  +K +K K KKKKKK+K  K+ KKA+ G  EEK K        VE +E   DN     +  V  K+YVGGIPY STED+
Subjt:  RKRDEEGAEKSALDG--SAENNEKAKKLKKKKKKKKKKTKKSNKKADNGE-EEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDD

Query:  ICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGT------RLNKAVDFSPGIMEGYNRIYLGNLSWDVTED
        I S+F SCG I +VDC   PE G F GIA ++F TE  AKRALA D A MG  +L +Q Y  T      R   +  F+P +++GYNR+Y+GNL+WD TE 
Subjt:  ICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGT------RLNKAVDFSPGIMEGYNRIYLGNLSWDVTED

Query:  DLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKK-------GTERGAGGAAAAETHSKPEPVGGGAAAAE
        D++KLFS+C I S+R G +KETGEF+GYAHVDF DSVS+  ALKLDQ+ I GRPVKI CA+  +       G    AG     +T++  +PV   A  +E
Subjt:  DLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKK-------GTERGAGGAAAAETHSKPEPVGGGAAAAE

Query:  THPKPEPVMKEADSGLSSV-SGKIRRRTCYECGEKGHLSSNCPKK
                + + +   ++V S K++RR CYECGEKGHLS+ CP K
Subjt:  THPKPEPVMKEADSGLSSV-SGKIRRRTCYECGEKGHLSSNCPKK

Arabidopsis top hits (e value, % identity, alignment)
AT3G52380.1 chloroplast RNA-binding protein 33 (e value: 4.1e-14; % identity: 25.7)
Query:  EEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAM-DGAD
        EEE E++G EG  EVE  ++T    G         ++YVG +PY  T  ++   F   GT+ +V  +    + + RG   ++  +   AK A+ M + + 
Subjt:  EEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAM-DGAD

Query:  MGGLFLKVQ----PYKG---TRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCK-IASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLD
        +GG  +KV     P  G       K  D +   ++  +++Y GNL W++T   LK  F +   +   +   ++ TG  RG+  + F  + ++++AL  ++
Subjt:  MGGLFLKVQ----PYKG---TRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCK-IASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLD

Query:  QETIHGRPVKIRCA
           + GR +++  A
Subjt:  QETIHGRPVKIRCA

AT3G55340.1 phragmoplastin interacting protein 1 (e value: 2.6e-77; % identity: 45.84)
Query:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNE-EKKEDENQRLGGTKREKK
        MVLSNKKLKQ++R+ LAESL  +V+            ++  + QSL+ LL + S H PRLSKREKRR   T    D++  E E     +     TK +KK
Subjt:  MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNE-EKKEDENQRLGGTKREKK

Query:  RKRDEEGAEKSALDG--SAENNEKAKKLKKKKKKKKKKTKKSNKKADNGE-EEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDD
        RKRD +  E   L+G    +  +K +K K KKKKKK+K  K+ KKA+ G  EEK K        VE +E   DN     +  V  K+YVGGIPY STED+
Subjt:  RKRDEEGAEKSALDG--SAENNEKAKKLKKKKKKKKKKTKKSNKKADNGE-EEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDD

Query:  ICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGT------RLNKAVDFSPGIMEGYNRIYLGNLSWDVTED
        I S+F SCG I +VDC   PE G F GIA ++F TE  AKRALA D A MG  +L +Q Y  T      R   +  F+P +++GYNR+Y+GNL+WD TE 
Subjt:  ICSFFESCGTITEVDCMKFPESGKFRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGT------RLNKAVDFSPGIMEGYNRIYLGNLSWDVTED

Query:  DLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKK-------GTERGAGGAAAAETHSKPEPVGGGAAAAE
        D++KLFS+C I S+R G +KETGEF+GYAHVDF DSVS+  ALKLDQ+ I GRPVKI CA+  +       G    AG     +T++  +PV   A  +E
Subjt:  DLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTALKLDQETIHGRPVKIRCAVPKK-------GTERGAGGAAAAETHSKPEPVGGGAAAAE

Query:  THPKPEPVMKEADSGLSSV-SGKIRRRTCYECGEKGHLSSNCPKK
                + + +   ++V S K++RR CYECGEKGHLS+ CP K
Subjt:  THPKPEPVMKEADSGLSSV-SGKIRRRTCYECGEKGHLSSNCPKK

AT4G24770.1 31-kDa RNA binding protein (e value: 2.2e-12; % identity: 27.43)
Query:  KKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENV-------ATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAI
        ++T+ S +  D  E ++ +     G+  EG E   D     V E           K++VG + Y      +   FE  GT+   + +   E+ + RG   
Subjt:  KKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENV-------ATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAI

Query:  LSFKTEAAAKRAL-AMDGADMGGLFLKVQPY--KGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFS-NCKIASIRFGMDKETGEFRGYAHVD
        ++  +   A+ A+   +  D+ G  L V     +G+R  +A    P + E   R+Y+GNL WDV    L++LFS + K+   R   D+ETG  RG+  V 
Subjt:  LSFKTEAAAKRAL-AMDGADMGGLFLKVQPY--KGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFS-NCKIASIRFGMDKETGEFRGYAHVD

Query:  FSDSVSLKTALK-LDQETIHGRPVKIRCA---VPKKG
         SD   L  A+  LD + + GR +++  A    P++G
Subjt:  FSDSVSLKTALK-LDQETIHGRPVKIRCA---VPKKG

AT5G50250.1 chloroplast RNA-binding protein 31B (e value: 2.8e-15; % identity: 29.67)
Query:  EEEKEKQGGEGGNEVEGLE--ETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKT-EAAAKRALAMDG
        EEE+ + G  GG  V   E  E+ D VG       A K++VG +PY      +   FE  GT+   + +   ++ + RG   ++  T E A K     + 
Subjt:  EEEKEKQGGEGGNEVEGLE--ETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGKFRGIAILSFKT-EAAAKRALAMDG

Query:  ADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFS-NCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLDQETIH
         ++ G  L V   +        +  P + +   RIY+GNL WDV    L++LFS + K+   R   D+ETG  RG+  V  S+   +  A+  LD + + 
Subjt:  ADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFS-NCKIASIRFGMDKETGEFRGYAHVDFSDSVSLKTAL-KLDQETIH

Query:  GRPVKIRCA
        GR +K+  A
Subjt:  GRPVKIRCA


Sequences
CDS sequence
ATGGTTCTATCAAACAAAAAGTTGAAGCAGAAGCTGAGAGAAAAGCTAGCCGAATCGTTAATTGCCACAGTCGCCGGAAAAGATTCCGTCGGGGATGTTTCCGGTGAACC
GGATTCGGAGTCTAGGCCACAATCCCTGAGAGAGCTTTTAGGCGCCGTGAGTCACCATGGACCTAGATTGTCCAAGCGAGAGAAACGTAGAGAGTCGCTTACTTTGGTAG
CTTCGGATGAGAAGAACAATGAGGAGAAGAAGGAAGATGAGAATCAGAGATTGGGAGGGACCAAGAGAGAGAAGAAAAGGAAGAGAGATGAAGAGGGGGCGGAAAAGAGT
GCGTTAGATGGGTCGGCAGAGAATAATGAGAAGGCCAAGAAGTTGAAGAAGAAAAAGAAGAAGAAGAAGAAGAAGACGAAGAAGAGTAATAAGAAGGCAGACAATGGCGA
GGAGGAGAAGGAAAAGCAGGGAGGTGAAGGTGGGAATGAAGTGGAAGGGCTTGAAGAAACTCATGATAATGTAGGCAGTGTAGTAATTGAAAATGTGGCTACAAAAGTAT
ATGTTGGAGGAATTCCATATTATTCAACAGAGGATGATATTTGTAGCTTTTTTGAAAGCTGTGGCACTATCACCGAAGTTGATTGTATGAAGTTTCCAGAGAGTGGGAAG
TTCAGAGGCATTGCAATATTAAGTTTCAAGACAGAAGCTGCAGCGAAACGAGCACTAGCCATGGACGGAGCTGACATGGGTGGACTCTTCCTTAAAGTACAGCCCTACAA
GGGAACTCGATTGAATAAAGCAGTTGATTTTTCTCCCGGAATCATGGAAGGCTACAACAGAATCTATCTGGGCAATCTGTCATGGGATGTAACTGAGGATGATCTGAAGA
AACTCTTCTCAAACTGTAAGATTGCATCGATACGTTTCGGCATGGACAAGGAAACAGGGGAGTTTCGTGGCTATGCCCACGTTGATTTCTCTGACAGTGTCTCGTTAAAA
ACAGCTTTGAAGTTAGACCAGGAGACGATTCACGGCAGACCCGTCAAGATAAGATGTGCAGTTCCTAAGAAAGGAACCGAAAGAGGAGCAGGAGGAGCAGCAGCAGCAGA
AACACATTCAAAACCTGAACCTGTAGGAGGAGGAGCAGCAGCAGCAGAAACGCATCCAAAACCTGAACCTGTAATGAAGGAAGCGGATAGTGGATTAAGCTCTGTAAGTG
GTAAAATAAGAAGAAGAACATGCTATGAATGTGGTGAAAAAGGCCATCTTTCTTCCAACTGTCCTAAGAAGCAGCTTGAGAATTCAGTTGCAAGTTGA
mRNA sequence
ATGGTTCTATCAAACAAAAAGTTGAAGCAGAAGCTGAGAGAAAAGCTAGCCGAATCGTTAATTGCCACAGTCGCCGGAAAAGATTCCGTCGGGGATGTTTCCGGTGAACC
GGATTCGGAGTCTAGGCCACAATCCCTGAGAGAGCTTTTAGGCGCCGTGAGTCACCATGGACCTAGATTGTCCAAGCGAGAGAAACGTAGAGAGTCGCTTACTTTGGTAG
CTTCGGATGAGAAGAACAATGAGGAGAAGAAGGAAGATGAGAATCAGAGATTGGGAGGGACCAAGAGAGAGAAGAAAAGGAAGAGAGATGAAGAGGGGGCGGAAAAGAGT
GCGTTAGATGGGTCGGCAGAGAATAATGAGAAGGCCAAGAAGTTGAAGAAGAAAAAGAAGAAGAAGAAGAAGAAGACGAAGAAGAGTAATAAGAAGGCAGACAATGGCGA
GGAGGAGAAGGAAAAGCAGGGAGGTGAAGGTGGGAATGAAGTGGAAGGGCTTGAAGAAACTCATGATAATGTAGGCAGTGTAGTAATTGAAAATGTGGCTACAAAAGTAT
ATGTTGGAGGAATTCCATATTATTCAACAGAGGATGATATTTGTAGCTTTTTTGAAAGCTGTGGCACTATCACCGAAGTTGATTGTATGAAGTTTCCAGAGAGTGGGAAG
TTCAGAGGCATTGCAATATTAAGTTTCAAGACAGAAGCTGCAGCGAAACGAGCACTAGCCATGGACGGAGCTGACATGGGTGGACTCTTCCTTAAAGTACAGCCCTACAA
GGGAACTCGATTGAATAAAGCAGTTGATTTTTCTCCCGGAATCATGGAAGGCTACAACAGAATCTATCTGGGCAATCTGTCATGGGATGTAACTGAGGATGATCTGAAGA
AACTCTTCTCAAACTGTAAGATTGCATCGATACGTTTCGGCATGGACAAGGAAACAGGGGAGTTTCGTGGCTATGCCCACGTTGATTTCTCTGACAGTGTCTCGTTAAAA
ACAGCTTTGAAGTTAGACCAGGAGACGATTCACGGCAGACCCGTCAAGATAAGATGTGCAGTTCCTAAGAAAGGAACCGAAAGAGGAGCAGGAGGAGCAGCAGCAGCAGA
AACACATTCAAAACCTGAACCTGTAGGAGGAGGAGCAGCAGCAGCAGAAACGCATCCAAAACCTGAACCTGTAATGAAGGAAGCGGATAGTGGATTAAGCTCTGTAAGTG
GTAAAATAAGAAGAAGAACATGCTATGAATGTGGTGAAAAAGGCCATCTTTCTTCCAACTGTCCTAAGAAGCAGCTTGAGAATTCAGTTGCAAGTTGA
Protein sequence
MVLSNKKLKQKLREKLAESLIATVAGKDSVGDVSGEPDSESRPQSLRELLGAVSHHGPRLSKREKRRESLTLVASDEKNNEEKKEDENQRLGGTKREKKRKRDEEGAEKS
ALDGSAENNEKAKKLKKKKKKKKKKTKKSNKKADNGEEEKEKQGGEGGNEVEGLEETHDNVGSVVIENVATKVYVGGIPYYSTEDDICSFFESCGTITEVDCMKFPESGK
FRGIAILSFKTEAAAKRALAMDGADMGGLFLKVQPYKGTRLNKAVDFSPGIMEGYNRIYLGNLSWDVTEDDLKKLFSNCKIASIRFGMDKETGEFRGYAHVDFSDSVSLK
TALKLDQETIHGRPVKIRCAVPKKGTERGAGGAAAAETHSKPEPVGGGAAAAETHPKPEPVMKEADSGLSSVSGKIRRRTCYECGEKGHLSSNCPKKQLENSVAS
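
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code (the page does not say which table was used). `translate` is a hypothetical helper written for this illustration, and only short excerpts from the start and end of the CDS are translated.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


cds_start = "ATGGTTCTATCAAACAAAAAG"  # first 21 nt of the CDS above
cds_end = "GTTGCAAGTTGA"             # last 12 nt of the CDS above

print(translate(cds_start))  # MVLSNKK
print(translate(cds_end))    # VAS*
```

The two excerpts translate to the first seven and last three residues of the protein sequence, followed by the stop codon. Passing the full CDS, with line breaks left in, would translate the whole sequence.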