CuGenDBv2

HG10013023 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10013023
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr01:26212269..26214184
RNA-Seq Expression: HG10013023
Synteny: HG10013023
Gene Ontology terms:
  GO:0009451 - RNA modification (biological process)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
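The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'Chrom:start..end' string into (chromosome, start, end).

    Hypothetical helper for illustration only.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr01:26212269..26214184")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # Chr01 26212269 26214184 1916
```

Under that assumption the gene spans 1,916 bp on Chr01, which is longer than the 963-nt CDS listed further down the page.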


Homology
GenBank top hits (e value, % identity, alignment)
KAG7028669.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 7.7e-137; % identity: 62.88)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        ML+ +NSCH WPSTP TLLLLR CETQNDVNQ+HARIIKTG FK+SSL T+IILNSISSPH+PLVEFARYVFFTRYA+QRIR    RNH  DDPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKSYSHGN P+RALVLFCMMLENGF VDKFSFSLILKACARV LVEQGKQIHG LMKLEIGSNLFLLNCLI +Y+RCG+IE ARQVFDRMPMQDSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------
        MIDGYVKCGL+DLARELFDSMPL++KNLISWNSM+                                                                 
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------

Query:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                               E LEIF+ +QRQ NLSP+ETTL +A SA+SQLG +EKA SMHSYLVENG S+ GKVGV LI
Subjt:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQKG
        DMYSKCGSIENAM IF GID+KG
Subjt:  DMYSKCGSIENAMLIFEGIDQKG

XP_004133915.1 pentatricopeptide repeat-containing protein At2g45350, chloroplastic [Cucumis sativus] (e value: 3.5e-150; % identity: 68.79)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        MLV ANSCH WPSTPPTLLLLRHCETQNDVNQVHARIIKTGY KNSSLTT+IILNSISSPHKPLVEFARYVFFTRYA+QRIR    RNH DDDPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKSYSHGN+PVRALVLFCMMLENGF VDKFSFSLILKACARVCLVE+GKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMP+QDSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------
        MIDGYVK G IDLARELFDSMPLEDKNLISWNSML                                                                 
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------

Query:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                               E LEIFH++QRQSNLSPDETTLVVALSAISQLG +EKAASMH+Y +ENGISVTGKV VALI
Subjt:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQKG
        DMYSKCGSIENA+LIF+G+DQKG
Subjt:  DMYSKCGSIENAMLIFEGIDQKG

XP_008438168.1 PREDICTED: pentatricopeptide repeat-containing protein At2g45350, chloroplastic [Cucumis melo] (e value: 3.0e-149; % identity: 69.03)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        MLV ANSCH WPSTPPTLLLLRHCETQNDVNQVHARIIKTGY KNSSLTT+IILNSISSPHKPLVEFARYVFFTRYA+QRIR    R+H DDDPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKSYSHGNDPVRALVLFCMMLENGF VDKFSFSLILKACARVCLVE+GKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFD MP+QDSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------
        MIDGYVK G IDLARELFDSMPL DKNLISWNSML                                                                 
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------

Query:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                               E LEIFHK+QRQSNLSPDETTLVVALSAISQLG IEKAASMH+Y VENGISVTGKV VALI
Subjt:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQKG
        DMYSKCGSIENA LIF+G+DQKG
Subjt:  DMYSKCGSIENAMLIFEGIDQKG

XP_022146972.1 pentatricopeptide repeat-containing protein At2g45350, chloroplastic [Momordica charantia] (e value: 4.5e-137; % identity: 63.59)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        M+V ANS   WPS PPT LLLRHCETQ++VNQ+HARIIK+G  KNSSLTTRIILNSISSPH+PLVEFARYVFFTRYA++RIRRNR     DDDPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKSYSHGNDP+RALVLFCMMLENGF VDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLI MYLRCGDIEFARQVFDRMP QDSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------
        MIDGYVK G++D ARELFDSMPLEDKNLISWNSM+                                                                 
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------

Query:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                               + L+IF+ +Q QSNLSPD+TTLV+ALSA+SQLGRI+KAAS+HSYL+ENG S+ GKVGVALI
Subjt:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQKG
        DMYSKCGSI NAM IFEGI++KG
Subjt:  DMYSKCGSIENAMLIFEGIDQKG

XP_038877671.1 pentatricopeptide repeat-containing protein At2g45350, chloroplastic [Benincasa hispida] (e value: 2.4e-154; % identity: 70.45)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        MLV ANSCHSWPSTP TLLLLRHCETQNDVNQ+HARIIKTGYFKNSSLTT+IILNSISS  KPLVEFARYVFFTR+A+QR R  RNR+HRDDDPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------
        MIDGYVKCG+IDLARELFDSMP EDKNLISWNSML                                                                 
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------

Query:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                               E LEIFHK+QRQ+NLSPDETTLVVALSA+SQLGRIEKAASMHSYLVENG+SV GKVGVALI
Subjt:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQKG
        DMYSKCGSIENAMLIFEGI+QKG
Subjt:  DMYSKCGSIENAMLIFEGIDQKG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L4E1 Uncharacterized protein (e value: 1.7e-150; % identity: 68.79)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        MLV ANSCH WPSTPPTLLLLRHCETQNDVNQVHARIIKTGY KNSSLTT+IILNSISSPHKPLVEFARYVFFTRYA+QRIR    RNH DDDPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKSYSHGN+PVRALVLFCMMLENGF VDKFSFSLILKACARVCLVE+GKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMP+QDSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------
        MIDGYVK G IDLARELFDSMPLEDKNLISWNSML                                                                 
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------

Query:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                               E LEIFH++QRQSNLSPDETTLVVALSAISQLG +EKAASMH+Y +ENGISVTGKV VALI
Subjt:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQKG
        DMYSKCGSIENA+LIF+G+DQKG
Subjt:  DMYSKCGSIENAMLIFEGIDQKG

A0A1S3AWC2 pentatricopeptide repeat-containing protein At2g45350, chloroplastic (e value: 1.5e-149; % identity: 69.03)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        MLV ANSCH WPSTPPTLLLLRHCETQNDVNQVHARIIKTGY KNSSLTT+IILNSISSPHKPLVEFARYVFFTRYA+QRIR    R+H DDDPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKSYSHGNDPVRALVLFCMMLENGF VDKFSFSLILKACARVCLVE+GKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFD MP+QDSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------
        MIDGYVK G IDLARELFDSMPL DKNLISWNSML                                                                 
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------

Query:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                               E LEIFHK+QRQSNLSPDETTLVVALSAISQLG IEKAASMH+Y VENGISVTGKV VALI
Subjt:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQKG
        DMYSKCGSIENA LIF+G+DQKG
Subjt:  DMYSKCGSIENAMLIFEGIDQKG

A0A5A7TZK8 Pentatricopeptide repeat-containing protein (e value: 1.5e-149; % identity: 69.03)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        MLV ANSCH WPSTPPTLLLLRHCETQNDVNQVHARIIKTGY KNSSLTT+IILNSISSPHKPLVEFARYVFFTRYA+QRIR    R+H DDDPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKSYSHGNDPVRALVLFCMMLENGF VDKFSFSLILKACARVCLVE+GKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFD MP+QDSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------
        MIDGYVK G IDLARELFDSMPL DKNLISWNSML                                                                 
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------

Query:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                               E LEIFHK+QRQSNLSPDETTLVVALSAISQLG IEKAASMH+Y VENGISVTGKV VALI
Subjt:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQKG
        DMYSKCGSIENA LIF+G+DQKG
Subjt:  DMYSKCGSIENAMLIFEGIDQKG

A0A6J1D128 pentatricopeptide repeat-containing protein At2g45350, chloroplastic (e value: 2.2e-137; % identity: 63.59)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        M+V ANS   WPS PPT LLLRHCETQ++VNQ+HARIIK+G  KNSSLTTRIILNSISSPH+PLVEFARYVFFTRYA++RIRRNR     DDDPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKSYSHGNDP+RALVLFCMMLENGF VDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLI MYLRCGDIEFARQVFDRMP QDSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------
        MIDGYVK G++D ARELFDSMPLEDKNLISWNSM+                                                                 
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML-----------------------------------------------------------------

Query:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                               + L+IF+ +Q QSNLSPD+TTLV+ALSA+SQLGRI+KAAS+HSYL+ENG S+ GKVGVALI
Subjt:  ---------------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQKG
        DMYSKCGSI NAM IFEGI++KG
Subjt:  DMYSKCGSIENAMLIFEGIDQKG

A0A6J1ID28 pentatricopeptide repeat-containing protein At2g45350, chloroplastic (e value: 7.0e-136; % identity: 63.16)
Query:  NSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYS
        NSCH WPSTP TLLLLR CETQNDVNQ+HARIIKTG FK+SSL T+IILNSISSP++PLVEFARYVFFT YA+QRIR    RNH  DDPFLWNAVIKSYS
Subjt:  NSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYS

Query:  HGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGY
        HGNDP+RALVLFCMMLENGF VDKFSFSLILKACARV LVEQGKQIHG LMKLEIGSNLFLLNCLIA+Y+RCG+IE ARQVFDRMPMQDSVSYNSMIDGY
Subjt:  HGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGY

Query:  VKCGLIDLARELFDSMPLEDKNLISWNSML----------------------------------------------------------------------
        VKCGL+DLARELFDSMPL++KNLISWNSM+                                                                      
Subjt:  VKCGLIDLARELFDSMPLEDKNLISWNSML----------------------------------------------------------------------

Query:  ----------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSK
                                          E LEIF+ +QRQ NLSP+ETTL +A SA+SQLG +EKA SMHSYLVENG S+ GKVGV LIDMYSK
Subjt:  ----------------------------------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSK

Query:  CGSIENAMLIFEGIDQKG
        CGSIENAM IF GID+KG
Subjt:  CGSIENAMLIFEGIDQKG

SwissProt top hits (e value, % identity, alignment)
O22137 Pentatricopeptide repeat-containing protein At2g45350, chloroplastic (e value: 1.9e-85; % identity: 44.08)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        MLV+ ++     S   T+ +L  C+T +DVNQ+H R+IKTG  KNS+LTTRI+L   SS    L +FAR VF   +               +DPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKS+SHG DP +AL+L C+MLENG  VDKFS SL+LKAC+R+  V+ G QIHG L K  + S+LFL NCLI +YL+CG +  +RQ+FDRMP +DSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSM------------------------------------------------------------------
        MIDGYVKCGLI  ARELFD MP+E KNLISWNSM                                                                  
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSM------------------------------------------------------------------

Query:  --------------------------------------LETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                              +E LEIF  ++++S+L PD+TTLV+ L AI+QLGR+ KA  MH Y+VE    + GK+GVALI
Subjt:  --------------------------------------LETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQK
        DMYSKCGSI++AML+FEGI+ K
Subjt:  DMYSKCGSIENAMLIFEGIDQK

O49399 Pentatricopeptide repeat-containing protein At4g18840 (e value: 3.7e-49; % identity: 33.91)
Query:  STP-PTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPV
        STP P L      ++  ++ Q HA ++KTG F ++   ++++  + ++P    V +A  +      L RI       H        N+VI++Y++ + P 
Subjt:  STP-PTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPV

Query:  RALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS-----------
         AL +F  ML    F DK+SF+ +LKACA  C  E+G+QIHGL +K  + +++F+ N L+ +Y R G  E AR+V DRMP++D+VS+NS           
Subjt:  RALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS-----------

Query:  --------------------MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRI
                            MI GY   GL+  A+E+FDSMP+ D  ++SWN+M+          E LE+F+K+   S   PD  TLV  LSA + LG +
Subjt:  --------------------MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRI

Query:  EKAASMHSYLVENGISVTGKVGVALIDMYSKCGSIENAMLIFEGIDQK
         +   +H Y+ ++GI + G +  AL+DMYSKCG I+ A+ +F    ++
Subjt:  EKAASMHSYLVENGISVTGKVGVALIDMYSKCGSIENAMLIFEGIDQK

Q9FJY7 Pentatricopeptide repeat-containing protein At5g66520 (e value: 2.8e-49; % identity: 35.99)
Query:  TLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVF--FTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRAL
        T+  L+ C  Q ++ Q+HAR++KTG  ++S   T+ +   ISS     + +A+ VF  F R                 D FLWN +I+ +S  ++P R+L
Subjt:  TLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVF--FTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRAL

Query:  VLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLA
        +L+  ML +    + ++F  +LKAC+ +   E+  QIH  + KL   ++++ +N LI  Y   G+ + A  +FDR+P  D VS+NS+I GYVK G +D+A
Subjt:  VLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLA

Query:  RELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGS
          LF  M   +KN ISW +M+          E L++FH++Q  S++ PD  +L  ALSA +QLG +E+   +HSYL +  I +   +G  LIDMY+KCG 
Subjt:  RELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGS

Query:  IENAMLIFEGIDQK
        +E A+ +F+ I +K
Subjt:  IENAMLIFEGIDQK

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic (e value: 3.4e-47; % identity: 33.55)
Query:  PTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRALV
        P+L LL +C+T   +  +HA++IK G    +   +++I   I SPH   + +A  VF              +  ++ +  +WN + + ++  +DPV AL 
Subjt:  PTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRALV

Query:  LFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLAR
        L+  M+  G   + ++F  +LK+CA+    ++G+QIHG ++KL    +L++   LI+MY++ G +E A +VFD+ P +D VSY ++I GY   G I+ A+
Subjt:  LFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLAR

Query:  ELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGSI
        +LFD +P++D  ++SWN+M+          E LE+F  + + +N+ PDE+T+V  +SA +Q G IE    +H ++ ++G     K+  ALID+YSKCG +
Subjt:  ELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGSI

Query:  ENAMLIFEGIDQK
        E A  +FE +  K
Subjt:  ENAMLIFEGIDQK

Q9SJG6 Pentatricopeptide repeat-containing protein At2g42920, chloroplastic (e value: 2.5e-42; % identity: 33.65)
Query:  LLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRALVLFC
        L+   C T  ++ Q+HA +IKTG   ++   +R++    +SP    + +A Y+ FTR            NH+  +PF+WN +I+ +S  + P  A+ +F 
Subjt:  LLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRALVLFC

Query:  MMLENGFFV--DKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLARE
         ML +   V   + ++  + KA  R+     G+Q+HG+++K  +  + F+ N ++ MY+ CG +  A ++F  M   D V++NSMI G+ KCGLID A+ 
Subjt:  MMLENGFFV--DKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLARE

Query:  LFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGSIE
        LFD MP   +N +SWNSM+          + L++F ++Q + ++ PD  T+V  L+A + LG  E+   +H Y+V N   +   V  ALIDMY KCG IE
Subjt:  LFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGSIE

Query:  NAMLIFEGIDQK
          + +FE   +K
Subjt:  NAMLIFEGIDQK

Arabidopsis top hits (e value, % identity, alignment)
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.4e-48; % identity: 33.55)
Query:  PTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRALV
        P+L LL +C+T   +  +HA++IK G    +   +++I   I SPH   + +A  VF              +  ++ +  +WN + + ++  +DPV AL 
Subjt:  PTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRALV

Query:  LFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLAR
        L+  M+  G   + ++F  +LK+CA+    ++G+QIHG ++KL    +L++   LI+MY++ G +E A +VFD+ P +D VSY ++I GY   G I+ A+
Subjt:  LFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLAR

Query:  ELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGSI
        +LFD +P++D  ++SWN+M+          E LE+F  + + +N+ PDE+T+V  +SA +Q G IE    +H ++ ++G     K+  ALID+YSKCG +
Subjt:  ELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGSI

Query:  ENAMLIFEGIDQK
        E A  +FE +  K
Subjt:  ENAMLIFEGIDQK

AT2G42920.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 1.8e-43; % identity: 33.65)
Query:  LLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRALVLFC
        L+   C T  ++ Q+HA +IKTG   ++   +R++    +SP    + +A Y+ FTR            NH+  +PF+WN +I+ +S  + P  A+ +F 
Subjt:  LLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRALVLFC

Query:  MMLENGFFV--DKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLARE
         ML +   V   + ++  + KA  R+     G+Q+HG+++K  +  + F+ N ++ MY+ CG +  A ++F  M   D V++NSMI G+ KCGLID A+ 
Subjt:  MMLENGFFV--DKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLARE

Query:  LFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGSIE
        LFD MP   +N +SWNSM+          + L++F ++Q + ++ PD  T+V  L+A + LG  E+   +H Y+V N   +   V  ALIDMY KCG IE
Subjt:  LFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGSIE

Query:  NAMLIFEGIDQK
          + +FE   +K
Subjt:  NAMLIFEGIDQK

AT2G45350.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.3e-86; % identity: 44.08)
Query:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV
        MLV+ ++     S   T+ +L  C+T +DVNQ+H R+IKTG  KNS+LTTRI+L   SS    L +FAR VF   +               +DPFLWNAV
Subjt:  MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAV

Query:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS
        IKS+SHG DP +AL+L C+MLENG  VDKFS SL+LKAC+R+  V+ G QIHG L K  + S+LFL NCLI +YL+CG +  +RQ+FDRMP +DSVSYNS
Subjt:  IKSYSHGNDPVRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS

Query:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSM------------------------------------------------------------------
        MIDGYVKCGLI  ARELFD MP+E KNLISWNSM                                                                  
Subjt:  MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSM------------------------------------------------------------------

Query:  --------------------------------------LETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI
                                              +E LEIF  ++++S+L PD+TTLV+ L AI+QLGR+ KA  MH Y+VE    + GK+GVALI
Subjt:  --------------------------------------LETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALI

Query:  DMYSKCGSIENAMLIFEGIDQK
        DMYSKCGSI++AML+FEGI+ K
Subjt:  DMYSKCGSIENAMLIFEGIDQK

AT4G18840.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 2.6e-50; % identity: 33.91)
Query:  STP-PTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPV
        STP P L      ++  ++ Q HA ++KTG F ++   ++++  + ++P    V +A  +      L RI       H        N+VI++Y++ + P 
Subjt:  STP-PTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPV

Query:  RALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS-----------
         AL +F  ML    F DK+SF+ +LKACA  C  E+G+QIHGL +K  + +++F+ N L+ +Y R G  E AR+V DRMP++D+VS+NS           
Subjt:  RALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNS-----------

Query:  --------------------MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRI
                            MI GY   GL+  A+E+FDSMP+ D  ++SWN+M+          E LE+F+K+   S   PD  TLV  LSA + LG +
Subjt:  --------------------MIDGYVKCGLIDLARELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRI

Query:  EKAASMHSYLVENGISVTGKVGVALIDMYSKCGSIENAMLIFEGIDQK
         +   +H Y+ ++GI + G +  AL+DMYSKCG I+ A+ +F    ++
Subjt:  EKAASMHSYLVENGISVTGKVGVALIDMYSKCGSIENAMLIFEGIDQK

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.0e-50; % identity: 35.99)
Query:  TLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVF--FTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRAL
        T+  L+ C  Q ++ Q+HAR++KTG  ++S   T+ +   ISS     + +A+ VF  F R                 D FLWN +I+ +S  ++P R+L
Subjt:  TLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVF--FTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDPVRAL

Query:  VLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLA
        +L+  ML +    + ++F  +LKAC+ +   E+  QIH  + KL   ++++ +N LI  Y   G+ + A  +FDR+P  D VS+NS+I GYVK G +D+A
Subjt:  VLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLA

Query:  RELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGS
          LF  M   +KN ISW +M+          E L++FH++Q  S++ PD  +L  ALSA +QLG +E+   +HSYL +  I +   +G  LIDMY+KCG 
Subjt:  RELFDSMPLEDKNLISWNSML----------ETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGS

Query:  IENAMLIFEGIDQK
        +E A+ +F+ I +K
Subjt:  IENAMLIFEGIDQK


Sequences
CDS sequence
ATGCTTGTTTATGCAAATTCATGCCATTCATGGCCCTCAACACCGCCGACCCTTCTTCTCCTCCGCCACTGCGAGACTCAGAACGACGTGAATCAAGTCCACGCCAGAAT
CATCAAAACCGGCTACTTCAAAAATTCGTCACTCACAACCAGAATCATTCTCAATTCCATTTCTTCTCCACACAAACCGCTTGTTGAATTCGCTCGTTACGTCTTCTTCA
CTCGCTATGCTCTTCAAAGAATTCGTCGGAATCGGAATCGGAATCACCGCGATGACGATCCGTTTCTCTGGAATGCTGTGATTAAATCGTATTCGCACGGGAATGACCCT
GTTCGAGCTCTGGTATTGTTCTGTATGATGCTTGAGAATGGGTTTTTTGTTGATAAGTTTTCGTTTTCTTTGATTCTGAAAGCATGTGCTCGTGTGTGTTTGGTGGAGCA
AGGGAAGCAGATTCATGGGTTGTTGATGAAGTTAGAAATTGGATCGAATTTGTTTCTGCTTAATTGTTTGATTGCGATGTATTTGAGATGTGGGGATATTGAGTTTGCAC
GACAGGTGTTCGACAGAATGCCAATGCAGGACTCTGTTTCGTATAATTCGATGATTGATGGTTATGTGAAATGTGGGTTGATTGATTTGGCTCGTGAATTGTTTGATTCT
ATGCCATTGGAAGATAAGAATTTAATCTCTTGGAACTCAATGCTTGAAACATTAGAAATTTTTCATAAGCTGCAGAGACAAAGCAACTTATCTCCCGATGAAACGACATT
GGTGGTCGCGCTTTCAGCCATTTCTCAGTTAGGACGCATTGAGAAGGCTGCAAGTATGCATAGCTATTTAGTAGAAAATGGCATTTCGGTGACGGGAAAGGTTGGCGTTG
CCCTTATTGACATGTATTCTAAATGTGGTAGCATCGAGAACGCCATGTTGATATTTGAAGGCATTGACCAAAAAGGTTATTGA
mRNA sequence
ATGCTTGTTTATGCAAATTCATGCCATTCATGGCCCTCAACACCGCCGACCCTTCTTCTCCTCCGCCACTGCGAGACTCAGAACGACGTGAATCAAGTCCACGCCAGAAT
CATCAAAACCGGCTACTTCAAAAATTCGTCACTCACAACCAGAATCATTCTCAATTCCATTTCTTCTCCACACAAACCGCTTGTTGAATTCGCTCGTTACGTCTTCTTCA
CTCGCTATGCTCTTCAAAGAATTCGTCGGAATCGGAATCGGAATCACCGCGATGACGATCCGTTTCTCTGGAATGCTGTGATTAAATCGTATTCGCACGGGAATGACCCT
GTTCGAGCTCTGGTATTGTTCTGTATGATGCTTGAGAATGGGTTTTTTGTTGATAAGTTTTCGTTTTCTTTGATTCTGAAAGCATGTGCTCGTGTGTGTTTGGTGGAGCA
AGGGAAGCAGATTCATGGGTTGTTGATGAAGTTAGAAATTGGATCGAATTTGTTTCTGCTTAATTGTTTGATTGCGATGTATTTGAGATGTGGGGATATTGAGTTTGCAC
GACAGGTGTTCGACAGAATGCCAATGCAGGACTCTGTTTCGTATAATTCGATGATTGATGGTTATGTGAAATGTGGGTTGATTGATTTGGCTCGTGAATTGTTTGATTCT
ATGCCATTGGAAGATAAGAATTTAATCTCTTGGAACTCAATGCTTGAAACATTAGAAATTTTTCATAAGCTGCAGAGACAAAGCAACTTATCTCCCGATGAAACGACATT
GGTGGTCGCGCTTTCAGCCATTTCTCAGTTAGGACGCATTGAGAAGGCTGCAAGTATGCATAGCTATTTAGTAGAAAATGGCATTTCGGTGACGGGAAAGGTTGGCGTTG
CCCTTATTGACATGTATTCTAAATGTGGTAGCATCGAGAACGCCATGTTGATATTTGAAGGCATTGACCAAAAAGGTTATTGA
Protein sequence
MLVYANSCHSWPSTPPTLLLLRHCETQNDVNQVHARIIKTGYFKNSSLTTRIILNSISSPHKPLVEFARYVFFTRYALQRIRRNRNRNHRDDDPFLWNAVIKSYSHGNDP
VRALVLFCMMLENGFFVDKFSFSLILKACARVCLVEQGKQIHGLLMKLEIGSNLFLLNCLIAMYLRCGDIEFARQVFDRMPMQDSVSYNSMIDGYVKCGLIDLARELFDS
MPLEDKNLISWNSMLETLEIFHKLQRQSNLSPDETTLVVALSAISQLGRIEKAASMHSYLVENGISVTGKVGVALIDMYSKCGSIENAMLIFEGIDQKGY
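As a consistency check between the CDS and protein sequences listed above, the CDS can be translated with the standard genetic code. The sketch below is illustrative only: the `translate` helper and the compact codon-table construction are assumptions of this example rather than anything provided by the database, and only the first 30 and last 15 nucleotides of the CDS are used to keep it short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest) and '*' marking stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS shown on this page.
cds_start = "ATGCTTGTTTATGCAAATTCATGCCATTCA"
cds_end = "CAAAAAGGTTATTGA"

print(translate(cds_start))  # MLVYANSCHS
print(translate(cds_end))    # QKGY
```

Both fragments agree with the start (MLVYANSCHS) and end (QKGY) of the protein sequence above. Passing the full 963-nt CDS to the same function would be the natural way to check the whole sequence.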