CuGenDBv2

Sgr021731 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021731
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153823:367692..369271
RNA-Seq Expression: Sgr021731
Synteny: Sgr021731
Gene Ontology terms:
GO:0032544 - plastid translation (biological process)
GO:0043489 - RNA stabilization (biological process)
GO:0009536 - plastid (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAG6571877.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value 4.9e-215 | identity 74.62%
Query:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV
        M PKSI FV  F                LSN I+SSF TISSILTYST+PNL+       SHEAI SA +QSQ  EQSL+ FKLM+ +GHCPSS +FN V
Subjt:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------
        LGLL KSGD+H+ W FFTEFLGRTHFD YSFG+ IKAFC++GNVSKGFELLAQMER+GLSPNVVIYTILIDACCKNGDIEQ                   
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------

Query:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
                   GYKKDGFEL+EKMKLVGV PS+YTYN+LINE+CRDGKLS+AFK+FDEMST GVSCNVVTY ILIGGLCRKRQ+SKAERL E+MKQ  IN
Subjt:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT T+NLLMDGFCNIGKL+KA+GYFDELKLIG +PTSV+YNILIAGFSKAGNS+VVSELVREMEDRG+SPSKVTYTILMDAFVRSDDVEKASQ+FHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQ+TYGVLMHGLCMKGNMV+ASKLY SM+EM++EPNDVIYN MINGYCKECNSYKALKFLQEMV KG+TPS+ SY STIEVL  + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGLNPSESL  ++G
Subjt:  LKEMIEAGLNPSESLYTRVG

XP_022136943.1 pentatricopeptide repeat-containing protein At4g11690 [Momordica charantia] | e value 3.7e-239 | identity 81.61%
Query:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV
        M PK  GFV  FLA+++R   HSIPVQVLSN I S FFTISSILT+STR NL++ P+CG  HEAIISA+VQSQLPEQSL+HFKLMV +G CPSSN+FN V
Subjt:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------
        LGLLTKSGD+++AWCFF EFLGRTHFDVYSFG+MIKAFCE GNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ                   
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------

Query:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
                   G KKDGFELYEKM L+GVFPSVYTYNSLINE+CRDG L LAFKLFDEM TRGVSCNVVTYNILIGGLCR RQV KAE LLEQMK AHIN
Subjt:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        P+TITFNLLMDGFCN+GK DKA+ YFDELKLIG SPTSVTYNILIAGFSKAGNSAVV ELVREMEDRGISPSKVTYTILMDAFVRSDDV KASQMFHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIE HL+PNDVIYNTMINGYCKECNSYKALKFLQEMV+KGMTPS ASYSSTIEVLCKD KS EAK+L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVGLA
        LKEMIE GL+PSESL  RVG A
Subjt:  LKEMIEAGLNPSESLYTRVGLA

XP_022971940.1 pentatricopeptide repeat-containing protein At4g11690 [Cucurbita maxima] | e value 1.5e-224 | identity 77.12%
Query:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV
        M PKSIGF+  F                LSN I+SSFFTISSILTYST+PNL+       SHEAII+A +QSQL EQSLH FKLM+ +GHCPSS +FN V
Subjt:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------
        LGLL KSGD+H+ WCFFTEFLGRT FD YSFG+ IKAFC++GNVSKGFELLAQMER+GLSPNVVIYTILIDACCKNGDIEQ                   
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------

Query:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
                   GYKKDGFELYEKMKLVGV PS+YTYN+LINE+CRDGKLS+AFK+FDEMSTRGVSCNVVTY ILIGGLCRKRQ+SKAERL E+MKQ HIN
Subjt:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT T+NLLMDGFCNIGKL+KA+GYFD+LKLIG +PTSV+YNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQ+FHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQ+TYGVLMHGLCMKGNMVEASKLYKSM+EM++EPNDVIYN MINGYCKECNSYKALKFLQEMV KG+TPS+ASYSSTIEVLC + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGLNPSESLY ++G
Subjt:  LKEMIEAGLNPSESLYTRVG

XP_023511558.1 pentatricopeptide repeat-containing protein At4g11690 [Cucurbita pepo subsp. pepo] | e value 3.7e-215 | identity 75.19%
Query:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV
        M PKSIG V  F                LSN  +SSFFTISSILTYST+PNL+       SHEAI SA +QSQ  EQSL+ FKLM+ +G CPSS +FN V
Subjt:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------
        LGLL KSGD+H+ W FFTEFLGRTHFD YSFG+ IKAFCE+GNVSKGFELLAQMER+GLSPNVVIYTILIDACCKNGDIEQ                   
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------

Query:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
                   GYKKDGFE +EKMKLVGV PS+YTYN+LINE+CRDGKLS+AFK+FDEMST GVSCNVVTY ILIGGLCRKRQ+SKAERL E+MKQ  IN
Subjt:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT T+NLLMDGFCNIGKL+KA+GYFDELKLIG +PTSV+YNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQ+ HLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQ+TYGVLMHGLCMKGNMVEASKLY SM+EM++EPNDVIYN MINGYCKECNSYKALKFLQEMV KG+TPS+ASYSSTIEVL  + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGLNPSESL  ++G
Subjt:  LKEMIEAGLNPSESLYTRVG

XP_038887006.1 pentatricopeptide repeat-containing protein At4g11690 [Benincasa hispida] | e value 6.2e-218 | identity 75.58%
Query:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV
        M  KSIGFV  F                L N I+SSFFT SSILTYST+PNL+ D  CG SH AII+A +QSQ  EQSLH+FKLMV +G+CPSS++FN V
Subjt:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------
        LG L KSG++HR W FF+EFL RT FDVYSFG+ IKAFCE+GN+SKGF+LLAQMERMGLS NVVIYTILIDACCKNGDIEQ                   
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------

Query:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
                   GY+KDGFELYEKMKLVGV P++YTYNSLINE+CRDGKLSLAFKLFDEMSTRGVSCNV+TYNILIGGLCRKRQVSKAE LLEQMKQAHIN
Subjt:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT TFNLL+DG CN GKLDKA+ YFD++KLIGQSPTSVTYNILIAGFSK GNS+VVSELVREMEDRGISPSKVTYTILM AFVRSDDVEKA +MF LMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        K+GSVPDQ+TYGVL+HGLCMKGNMVEASKLYKSM+EMHLEPNDVIYNTMINGYCKECNSYKALKFL+EMVKKG+TPS+ASYS TI VLC + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGL PSESL  +VG
Subjt:  LKEMIEAGLNPSESLYTRVG

TrEMBL top hits | e value | % identity | Alignment
A0A1S3C0B2 pentatricopeptide repeat-containing protein At4g11690 | e value 6.7e-210 | identity 72.31%
Query:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV
        M PKSIGFV  F                LSN I+SSFFTISS+LTYST+ NL+S+ VCGR H+A+I+A +QS   EQSL  FKLMV +GH PSS +FN V
Subjt:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------
        L LL KSG++ R W FFTE+LGRT FDVYSFG+ IKAFCE+GNVSKGFELLAQME MG+SPNV IYTILI+ACCKNGDI+Q                   
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------

Query:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
                   GYKKDGFELYEKMKL+GV P++YTYNSLI E+CRDGKLSLAFKLFDE+S RGV+CN VTYNILIGGLCRK QV KAE LLE+MK+AHIN
Subjt:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT TFNLLMDG CN GKLDKA+ Y D+LKLIGQSPT VTYNILI+GFSK GNS+VVSELVREMEDRGISPSKVTYTILMDAFVRSDD+EKA +MFHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        ++G VPDQ+TYGVL+HGLC++GNMVEASKLYKSM+EMHLEPNDVIYNTMINGYCKECNSYKALKFL+EMVK G+TP++ASY STI+VLCKD KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEM EAGL P ESL ++VG
Subjt:  LKEMIEAGLNPSESLYTRVG

A0A5D3C6B4 Pentatricopeptide repeat-containing protein | e value 6.7e-210 | identity 72.31%
Query:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV
        M PKSIGFV  F                LSN I+SSFFTISS+LTYST+ NL+S+ VCGR H+A+I+A +QS   EQSL  FKLMV +GH PSS +FN V
Subjt:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------
        L LL KSG++ R W FFTE+LGRT FDVYSFG+ IKAFCE+GNVSKGFELLAQME MG+SPNV IYTILI+ACCKNGDI+Q                   
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------

Query:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
                   GYKKDGFELYEKMKL+GV P++YTYNSLI E+CRDGKLSLAFKLFDE+S RGV+CN VTYNILIGGLCRK QV KAE LLE+MK+AHIN
Subjt:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT TFNLLMDG CN GKLDKA+ Y D+LKLIGQSPT VTYNILI+GFSK GNS+VVSELVREMEDRGISPSKVTYTILMDAFVRSDD+EKA +MFHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        ++G VPDQ+TYGVL+HGLC++GNMVEASKLYKSM+EMHLEPNDVIYNTMINGYCKECNSYKALKFL+EMVK G+TP++ASY STI+VLCKD KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEM EAGL P ESL ++VG
Subjt:  LKEMIEAGLNPSESLYTRVG

A0A6J1C8X2 pentatricopeptide repeat-containing protein At4g11690 | e value 1.8e-239 | identity 81.61%
Query:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV
        M PK  GFV  FLA+++R   HSIPVQVLSN I S FFTISSILT+STR NL++ P+CG  HEAIISA+VQSQLPEQSL+HFKLMV +G CPSSN+FN V
Subjt:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------
        LGLLTKSGD+++AWCFF EFLGRTHFDVYSFG+MIKAFCE GNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ                   
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------

Query:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
                   G KKDGFELYEKM L+GVFPSVYTYNSLINE+CRDG L LAFKLFDEM TRGVSCNVVTYNILIGGLCR RQV KAE LLEQMK AHIN
Subjt:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        P+TITFNLLMDGFCN+GK DKA+ YFDELKLIG SPTSVTYNILIAGFSKAGNSAVV ELVREMEDRGISPSKVTYTILMDAFVRSDDV KASQMFHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIE HL+PNDVIYNTMINGYCKECNSYKALKFLQEMV+KGMTPS ASYSSTIEVLCKD KS EAK+L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVGLA
        LKEMIE GL+PSESL  RVG A
Subjt:  LKEMIEAGLNPSESLYTRVGLA

A0A6J1GM09 pentatricopeptide repeat-containing protein At4g11690 | e value 5.3e-215 | identity 74.42%
Query:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV
        M PKSI FV  F                LSN I+SSF TISSILTYST+PNL+       SHEAI SA +QSQ  EQSL+ FKLM+ +GHCPS+ +FN V
Subjt:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------
        LGLL KSGD+H+ W FFTEFLGRTHFD YSFG+ IKAFC++GNVSKGFELLAQMER+GLSPNVVIYTILIDACCKNGDIEQ                   
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------

Query:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
                   GYKKDGFEL+EKMKLVGV PS+YTYN+LINE+CRDGKLS+AFK+FDEMST GVSCNVVTY ILIGGLCRKRQ++KAERL E+MKQ  IN
Subjt:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT T+NLLMDGFCNIGKL+KA+GYFDELKLIG +PTSV+YNILIAGFSKAGNS+VVSELVREMEDRG+SPSKVTYTILMDAFVRSDDVEKASQ+FHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQ+TYGVLMHGLCMKGNMVEASKLY SM+EM++EPNDVIYN MINGYCKECNSYKALKFLQEMV KG+TPS+ SY STIEVL  + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGLNPSESL  ++G
Subjt:  LKEMIEAGLNPSESLYTRVG

A0A6J1I748 pentatricopeptide repeat-containing protein At4g11690 | e value 7.3e-225 | identity 77.12%
Query:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV
        M PKSIGF+  F                LSN I+SSFFTISSILTYST+PNL+       SHEAII+A +QSQL EQSLH FKLM+ +GHCPSS +FN V
Subjt:  MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------
        LGLL KSGD+H+ WCFFTEFLGRT FD YSFG+ IKAFC++GNVSKGFELLAQMER+GLSPNVVIYTILIDACCKNGDIEQ                   
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ-------------------

Query:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
                   GYKKDGFELYEKMKLVGV PS+YTYN+LINE+CRDGKLS+AFK+FDEMSTRGVSCNVVTY ILIGGLCRKRQ+SKAERL E+MKQ HIN
Subjt:  -----------GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT T+NLLMDGFCNIGKL+KA+GYFD+LKLIG +PTSV+YNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQ+FHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQ+TYGVLMHGLCMKGNMVEASKLYKSM+EM++EPNDVIYN MINGYCKECNSYKALKFLQEMV KG+TPS+ASYSSTIEVLC + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGLNPSESLY ++G
Subjt:  LKEMIEAGLNPSESLYTRVG

SwissProt top hits | e value | % identity | Alignment
Q9FIX3 Pentatricopeptide repeat-containing protein At5g39710 | e value 1.1e-60 | identity 30.21%
Query:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKS-GDVHRAWCFFTEFL-GRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS
        + ++ +Y +  L +++L    L    G  P   ++N VL    +S  ++  A   F E L  +   +V+++ ++I+ FC  GN+     L  +ME  G  
Subjt:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKS-GDVHRAWCFFTEFL-GRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS

Query:  PNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERL
        PNVV Y  LID  CK   I+     DGF+L   M L G+ P++ +YN +IN  CR+G++     +  EM+ RG S + VTYN LI G C++    +A  +
Subjt:  PNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERL

Query:  LEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVE
          +M +  + P+ IT+  L+   C  G +++A+ + D++++ G  P   TY  L+ GFS+ G       ++REM D G SPS VTY  L++    +  +E
Subjt:  LEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVE

Query:  KASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCK
         A  +   MK+ G  PD  +Y  ++ G C   ++ EA ++ + M+E  ++P+ + Y+++I G+C++  + +A    +EM++ G+ P   +Y++ I   C 
Subjt:  KASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCK

Query:  DRKSAEAKDLLKEMIEAGLNPSESLYT
        +    +A  L  EM+E G+ P    Y+
Subjt:  DRKSAEAKDLLKEMIEAGLNPSESLYT

Q9LFC5 Pentatricopeptide repeat-containing protein At5g01110 | e value 7.9e-59 | identity 31.59%
Query:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEF-LGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSP
        + +I  YVQ++   ++   F L+  +G   S +  N ++G L + G V  AW  + E        +VY+  +M+ A C+DG + K    L+Q++  G+ P
Subjt:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEF-LGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSP

Query:  NVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLL
        ++V Y  LI A    G +E     + FEL   M   G  P VYTYN++IN  C+ GK   A ++F EM   G+S +  TY  L+   C+K  V + E++ 
Subjt:  NVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLL

Query:  EQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEK
          M+   + P  + F+ +M  F   G LDKA+ YF+ +K  G  P +V Y ILI G+ + G  +V   L  EM  +G +   VTY  ++    +   + +
Subjt:  EQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEK

Query:  ASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKD
        A ++F+ M +    PD YT  +L+ G C  GN+  A +L++ M E  +  + V YNT+++G+ K  +   A +   +MV K + P+  SYS  +  LC  
Subjt:  ASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKD

Query:  RKSAEAKDLLKEMIEAGLNPS
           AEA  +  EMI   + P+
Subjt:  RKSAEAKDLLKEMIEAGLNPS

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic | e value 3.2e-60 | identity 30.59%
Query:  RSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF--DVYSFGVMIKAFCEDGNVSKGFELLAQMERM
        ++   ++  Y++    + +L   + MV  G   S+ + N+++    K G V  A  F  E   +  F  D Y+F  ++   C+ G+V    E++  M + 
Subjt:  RSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF--DVYSFGVMIKAFCEDGNVSKGFELLAQMERM

Query:  GLSPNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKA
        G  P+V  Y  +I   CK G++     K+  E+ ++M      P+  TYN+LI+  C++ ++  A +L   ++++G+  +V T+N LI GLC  R    A
Subjt:  GLSPNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKA

Query:  ERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSD
          L E+M+     P   T+N+L+D  C+ GKLD+A+    +++L G + + +TYN LI GF KA  +    E+  EME  G+S + VTY  L+D   +S 
Subjt:  ERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSD

Query:  DVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEV
         VE A+Q+   M   G  PD+YTY  L+   C  G++ +A+ + ++M     EP+ V Y T+I+G CK      A K L+ +  KG+  +  +Y+  I+ 
Subjt:  DVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEV

Query:  LCKDRKSAEAKDLLKEMIEAGLNPSESLYTRVGLAKSC
        L + RK+ EA +L +EM+E    P +++  R+     C
Subjt:  LCKDRKSAEAKDLLKEMIEAGLNPSESLYTRVGLAKSC

Q9LVQ5 Pentatricopeptide repeat-containing protein At5g55840 | e value 5.1e-58 | identity 30.33%
Query:  HEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF-DVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS
        ++ +I  Y++  + + SL  F+LM   G  PS  T N +LG + KSG+    W F  E L R    DV +F ++I   C +G+  K   L+ +ME+ G +
Subjt:  HEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF-DVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS

Query:  PNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERL
        P +V Y  ++   CK G       K   EL + MK  GV   V TYN LI++ CR  +++  + L  +M  R +  N VTYN LI G   + +V  A +L
Subjt:  PNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERL

Query:  LEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVE
        L +M    ++P  +TFN L+DG  + G   +A+  F  ++  G +P+ V+Y +L+ G  K     +       M+  G+   ++TYT ++D   ++  ++
Subjt:  LEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVE

Query:  KASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCK
        +A  + + M K G  PD  TY  L++G C  G    A ++   +  + L PN +IY+T+I   C+     +A++  + M+ +G T    +++  +  LCK
Subjt:  KASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCK

Query:  DRKSAEAKDLLKEMIEAGLNPS
          K AEA++ ++ M   G+ P+
Subjt:  DRKSAEAKDLLKEMIEAGLNPS

Q9T0D6 Pentatricopeptide repeat-containing protein At4g11690 | e value 2.8e-141 | identity 49.22%
Query:  VRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSG
        +R  L+ NL  HA S+ +QV+S  I S FFT SS+L Y T    S      R +E II++YVQSQ    S+ +F  MV  G  P SN FN +L  +  S 
Subjt:  VRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSG

Query:  DVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ---------------------------
          ++ W FF E   +   DVYSFG++IK  CE G + K F+LL ++   G SPNVVIYT LID CCK G+IE+                           
Subjt:  DVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ---------------------------

Query:  ---GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNL
           G KK GFE+YEKM+  GVFP++YTYN ++N+ C+DG+   AF++FDEM  RGVSCN+VTYN LIGGLCR+ ++++A ++++QMK   INP  IT+N 
Subjt:  ---GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNL

Query:  LMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQ
        L+DGFC +GKL KA+    +LK  G SP+ VTYNIL++GF + G+++  +++V+EME+RGI PSKVTYTIL+D F RSD++EKA Q+   M+++G VPD 
Subjt:  LMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQ

Query:  YTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG
        +TY VL+HG C+KG M EAS+L+KSM+E + EPN+VIYNTMI GYCKE +SY+ALK L+EM +K + P++ASY   IEVLCK+RKS EA+ L+++MI++G
Subjt:  YTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG

Query:  LNPSESLYTRVGLAKS
        ++PS S+ + +  AK+
Subjt:  LNPSESLYTRVGLAKS

Arabidopsis top hits | e value | % identity | Alignment
AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein | e value 2.3e-61 | identity 30.59%
Query:  RSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF--DVYSFGVMIKAFCEDGNVSKGFELLAQMERM
        ++   ++  Y++    + +L   + MV  G   S+ + N+++    K G V  A  F  E   +  F  D Y+F  ++   C+ G+V    E++  M + 
Subjt:  RSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF--DVYSFGVMIKAFCEDGNVSKGFELLAQMERM

Query:  GLSPNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKA
        G  P+V  Y  +I   CK G++     K+  E+ ++M      P+  TYN+LI+  C++ ++  A +L   ++++G+  +V T+N LI GLC  R    A
Subjt:  GLSPNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKA

Query:  ERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSD
          L E+M+     P   T+N+L+D  C+ GKLD+A+    +++L G + + +TYN LI GF KA  +    E+  EME  G+S + VTY  L+D   +S 
Subjt:  ERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSD

Query:  DVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEV
         VE A+Q+   M   G  PD+YTY  L+   C  G++ +A+ + ++M     EP+ V Y T+I+G CK      A K L+ +  KG+  +  +Y+  I+ 
Subjt:  DVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEV

Query:  LCKDRKSAEAKDLLKEMIEAGLNPSESLYTRVGLAKSC
        L + RK+ EA +L +EM+E    P +++  R+     C
Subjt:  LCKDRKSAEAKDLLKEMIEAGLNPSESLYTRVGLAKSC

AT4G11690.1 Pentatricopeptide repeat (PPR-like) superfamily protein | e value 2.0e-142 | identity 49.22%
Query:  VRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSG
        +R  L+ NL  HA S+ +QV+S  I S FFT SS+L Y T    S      R +E II++YVQSQ    S+ +F  MV  G  P SN FN +L  +  S 
Subjt:  VRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSG

Query:  DVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ---------------------------
          ++ W FF E   +   DVYSFG++IK  CE G + K F+LL ++   G SPNVVIYT LID CCK G+IE+                           
Subjt:  DVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ---------------------------

Query:  ---GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNL
           G KK GFE+YEKM+  GVFP++YTYN ++N+ C+DG+   AF++FDEM  RGVSCN+VTYN LIGGLCR+ ++++A ++++QMK   INP  IT+N 
Subjt:  ---GYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNL

Query:  LMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQ
        L+DGFC +GKL KA+    +LK  G SP+ VTYNIL++GF + G+++  +++V+EME+RGI PSKVTYTIL+D F RSD++EKA Q+   M+++G VPD 
Subjt:  LMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQ

Query:  YTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG
        +TY VL+HG C+KG M EAS+L+KSM+E + EPN+VIYNTMI GYCKE +SY+ALK L+EM +K + P++ASY   IEVLCK+RKS EA+ L+++MI++G
Subjt:  YTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG

Query:  LNPSESLYTRVGLAKS
        ++PS S+ + +  AK+
Subjt:  LNPSESLYTRVGLAKS

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value 5.6e-60 | identity 31.59%
Query:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEF-LGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSP
        + +I  YVQ++   ++   F L+  +G   S +  N ++G L + G V  AW  + E        +VY+  +M+ A C+DG + K    L+Q++  G+ P
Subjt:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEF-LGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSP

Query:  NVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLL
        ++V Y  LI A    G +E     + FEL   M   G  P VYTYN++IN  C+ GK   A ++F EM   G+S +  TY  L+   C+K  V + E++ 
Subjt:  NVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLL

Query:  EQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEK
          M+   + P  + F+ +M  F   G LDKA+ YF+ +K  G  P +V Y ILI G+ + G  +V   L  EM  +G +   VTY  ++    +   + +
Subjt:  EQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEK

Query:  ASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKD
        A ++F+ M +    PD YT  +L+ G C  GN+  A +L++ M E  +  + V YNT+++G+ K  +   A +   +MV K + P+  SYS  +  LC  
Subjt:  ASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKD

Query:  RKSAEAKDLLKEMIEAGLNPS
           AEA  +  EMI   + P+
Subjt:  RKSAEAKDLLKEMIEAGLNPS

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value 7.9e-62 | identity 30.21%
Query:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKS-GDVHRAWCFFTEFL-GRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS
        + ++ +Y +  L +++L    L    G  P   ++N VL    +S  ++  A   F E L  +   +V+++ ++I+ FC  GN+     L  +ME  G  
Subjt:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKS-GDVHRAWCFFTEFL-GRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS

Query:  PNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERL
        PNVV Y  LID  CK   I+     DGF+L   M L G+ P++ +YN +IN  CR+G++     +  EM+ RG S + VTYN LI G C++    +A  +
Subjt:  PNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERL

Query:  LEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVE
          +M +  + P+ IT+  L+   C  G +++A+ + D++++ G  P   TY  L+ GFS+ G       ++REM D G SPS VTY  L++    +  +E
Subjt:  LEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVE

Query:  KASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCK
         A  +   MK+ G  PD  +Y  ++ G C   ++ EA ++ + M+E  ++P+ + Y+++I G+C++  + +A    +EM++ G+ P   +Y++ I   C 
Subjt:  KASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCK

Query:  DRKSAEAKDLLKEMIEAGLNPSESLYT
        +    +A  L  EM+E G+ P    Y+
Subjt:  DRKSAEAKDLLKEMIEAGLNPSESLYT

AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.6e-59; % identity: 30.33)
Query:  HEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF-DVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS
        ++ +I  Y++  + + SL  F+LM   G  PS  T N +LG + KSG+    W F  E L R    DV +F ++I   C +G+  K   L+ +ME+ G +
Subjt:  HEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF-DVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS

Query:  PNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERL
        P +V Y  ++   CK G       K   EL + MK  GV   V TYN LI++ CR  +++  + L  +M  R +  N VTYN LI G   + +V  A +L
Subjt:  PNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERL

Query:  LEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVE
        L +M    ++P  +TFN L+DG  + G   +A+  F  ++  G +P+ V+Y +L+ G  K     +       M+  G+   ++TYT ++D   ++  ++
Subjt:  LEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVE

Query:  KASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCK
        +A  + + M K G  PD  TY  L++G C  G    A ++   +  + L PN +IY+T+I   C+     +A++  + M+ +G T    +++  +  LCK
Subjt:  KASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCK

Query:  DRKSAEAKDLLKEMIEAGLNPS
          K AEA++ ++ M   G+ P+
Subjt:  DRKSAEAKDLLKEMIEAGLNPS


Sequences
CDS sequence
ATGGCACCCAAATCCATTGGCTTCGTGCGCCAATTTCTTGCCGCCAATTTGCGATGCCACGCTCACTCCATTCCCGTACAAGTTCTTTCTAATGGAATCTCCTCGTCTTT
CTTCACCATATCTTCCATTCTAACCTATTCAACACGACCAAATCTGAGTTCTGATCCAGTCTGTGGCCGTTCTCATGAAGCAATTATCAGTGCTTATGTCCAATCTCAGT
TACCAGAACAATCCCTTCACCATTTCAAACTAATGGTCCATGAAGGGCATTGCCCGAGTTCAAACACTTTCAATATTGTGTTGGGTTTACTTACTAAGTCAGGTGATGTA
CATAGAGCTTGGTGTTTTTTCACTGAATTTTTGGGGAGGACTCACTTTGATGTGTATAGTTTTGGGGTAATGATTAAAGCCTTTTGTGAAGATGGGAATGTAAGTAAAGG
TTTTGAGCTTTTGGCTCAAATGGAGAGGATGGGTTTGTCTCCTAATGTTGTTATATACACTATTTTGATTGATGCTTGTTGCAAAAATGGTGACATTGAGCAGGGATATA
AAAAGGATGGTTTTGAGCTTTATGAAAAGATGAAGCTTGTTGGGGTGTTTCCTAGTGTATACACTTACAACAGTCTTATTAATGAGCATTGCAGGGATGGGAAGTTGAGT
CTTGCATTTAAGCTGTTCGATGAAATGTCTACTAGAGGGGTGTCGTGTAACGTAGTCACGTACAATATTCTGATTGGTGGGTTATGTCGTAAGAGACAAGTTTCGAAAGC
AGAACGGTTGTTAGAACAAATGAAACAAGCTCATATAAATCCAACTACAATAACATTTAACTTGTTGATGGATGGGTTCTGTAACATTGGAAAGTTGGATAAGGCTATAG
GTTATTTCGATGAGTTGAAGTTGATTGGTCAGTCCCCAACTTCAGTGACCTACAACATTTTAATTGCAGGTTTCTCTAAAGCAGGAAATTCTGCTGTAGTTTCTGAATTA
GTGAGAGAGATGGAGGATAGAGGCATTTCTCCCTCTAAAGTAACATATACAATTCTGATGGATGCCTTCGTTCGATCAGATGATGTGGAGAAAGCCTCTCAAATGTTTCA
TCTCATGAAGAAAGTCGGTTCGGTCCCGGATCAGTATACGTATGGTGTCCTGATGCATGGTTTATGTATGAAAGGTAACATGGTAGAGGCATCTAAGCTGTACAAATCAA
TGATTGAGATGCATCTTGAGCCTAATGATGTAATCTATAATACAATGATAAATGGGTACTGCAAAGAGTGCAATTCTTACAAGGCCTTGAAGTTTCTTCAAGAAATGGTT
AAGAAAGGAATGACTCCAAGTATGGCTAGTTATAGTTCTACCATTGAAGTCCTCTGCAAGGACAGGAAGTCGGCCGAGGCGAAAGATTTACTTAAAGAGATGATCGAGGC
CGGTTTGAACCCGTCAGAATCTCTCTATACTAGGGTTGGTTTAGCCAAGTCTTGTGCATAA
mRNA sequence
ATGGCACCCAAATCCATTGGCTTCGTGCGCCAATTTCTTGCCGCCAATTTGCGATGCCACGCTCACTCCATTCCCGTACAAGTTCTTTCTAATGGAATCTCCTCGTCTTT
CTTCACCATATCTTCCATTCTAACCTATTCAACACGACCAAATCTGAGTTCTGATCCAGTCTGTGGCCGTTCTCATGAAGCAATTATCAGTGCTTATGTCCAATCTCAGT
TACCAGAACAATCCCTTCACCATTTCAAACTAATGGTCCATGAAGGGCATTGCCCGAGTTCAAACACTTTCAATATTGTGTTGGGTTTACTTACTAAGTCAGGTGATGTA
CATAGAGCTTGGTGTTTTTTCACTGAATTTTTGGGGAGGACTCACTTTGATGTGTATAGTTTTGGGGTAATGATTAAAGCCTTTTGTGAAGATGGGAATGTAAGTAAAGG
TTTTGAGCTTTTGGCTCAAATGGAGAGGATGGGTTTGTCTCCTAATGTTGTTATATACACTATTTTGATTGATGCTTGTTGCAAAAATGGTGACATTGAGCAGGGATATA
AAAAGGATGGTTTTGAGCTTTATGAAAAGATGAAGCTTGTTGGGGTGTTTCCTAGTGTATACACTTACAACAGTCTTATTAATGAGCATTGCAGGGATGGGAAGTTGAGT
CTTGCATTTAAGCTGTTCGATGAAATGTCTACTAGAGGGGTGTCGTGTAACGTAGTCACGTACAATATTCTGATTGGTGGGTTATGTCGTAAGAGACAAGTTTCGAAAGC
AGAACGGTTGTTAGAACAAATGAAACAAGCTCATATAAATCCAACTACAATAACATTTAACTTGTTGATGGATGGGTTCTGTAACATTGGAAAGTTGGATAAGGCTATAG
GTTATTTCGATGAGTTGAAGTTGATTGGTCAGTCCCCAACTTCAGTGACCTACAACATTTTAATTGCAGGTTTCTCTAAAGCAGGAAATTCTGCTGTAGTTTCTGAATTA
GTGAGAGAGATGGAGGATAGAGGCATTTCTCCCTCTAAAGTAACATATACAATTCTGATGGATGCCTTCGTTCGATCAGATGATGTGGAGAAAGCCTCTCAAATGTTTCA
TCTCATGAAGAAAGTCGGTTCGGTCCCGGATCAGTATACGTATGGTGTCCTGATGCATGGTTTATGTATGAAAGGTAACATGGTAGAGGCATCTAAGCTGTACAAATCAA
TGATTGAGATGCATCTTGAGCCTAATGATGTAATCTATAATACAATGATAAATGGGTACTGCAAAGAGTGCAATTCTTACAAGGCCTTGAAGTTTCTTCAAGAAATGGTT
AAGAAAGGAATGACTCCAAGTATGGCTAGTTATAGTTCTACCATTGAAGTCCTCTGCAAGGACAGGAAGTCGGCCGAGGCGAAAGATTTACTTAAAGAGATGATCGAGGC
CGGTTTGAACCCGTCAGAATCTCTCTATACTAGGGTTGGTTTAGCCAAGTCTTGTGCATAA
Protein sequence
MAPKSIGFVRQFLAANLRCHAHSIPVQVLSNGISSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNTFNIVLGLLTKSGDV
HRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLS
LAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSEL
VREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMV
KKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLYTRVGLAKSCA