; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr009713 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr009713
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00008248:12346..14230
RNA-Seq ExpressionSgr009713
SyntenySgr009713
Gene Ontology termsGO:0032544 - plastid translation (biological process)
GO:0043489 - RNA stabilization (biological process)
GO:0009536 - plastid (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136943.1 pentatricopeptide repeat-containing protein At4g11690 [Momordica charantia]2.3e-25986.59Show/hide
Query:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV
        M P   GFV  FLA+++R   HSIPVQVLSN IFS FFTISSILT+STR NL++ P+CG  HEAIISA+VQSQLPEQSL+HFKLMV +G CPSSNSFN V
Subjt:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT
        LGLLTKSGD+++AWCFF EFLGRTHFDVYSFG+MIKAFCE GNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ K+LFS+M DLGLVANQYT
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
        YTVMING FKKG KKDGFELYEKM L+GVFPSVYTYNSLINE+CRDG L LAFKLFDEM TRGVSCNVVTYNILIGGLCR RQV KAE LLEQMK AHIN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        P+TITFNLLMDGFCN+GK DKA+ YFDELKLIG SPTSVTYNILIAGFSKAGNSAVV ELVREMEDRGISPSKVTYTILMDAFVRSDDV KASQMFHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIE HL+PNDVIYNTMINGYCKECNSYKALKFLQEMV+KGMTPS ASYSSTIEVLCKD KS EAK+L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVGLA
        LKEMIE GL+PSESL  RVG A
Subjt:  LKEMIEAGLNPSESLYTRVGLA

XP_022953086.1 pentatricopeptide repeat-containing protein At4g11690 [Cucurbita moschata]4.4e-23479.23Show/hide
Query:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV
        M P SI FV  F                LSN I SSF TISSILTYST+PNL+       SHEAI SA +QSQ  EQSL+ FKLM+ +GHCPS+ SFN V
Subjt:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT
        LGLL KSGD+H+ W FFTEFLGRTHFD YSFG+ IKAFC++GNVSKGFELLAQMER+GLSPNVVIYTILIDACCKNGDIEQ K+LFS+M+DLGLVANQYT
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
        YT MINGFFKKGYKKDGFEL+EKMKLVGV PS+YTYN+LINE+CRDGKLS+AFK+FDEMST GVSCNVVTY ILIGGLCRKRQ++KAERL E+MKQ  IN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT T+NLLMDGFCNIGKL+KA+GYFDELKLIG +PTSV+YNILIAGFSKAGNS+VVSELVREMEDRG+SPSKVTYTILMDAFVRSDDVEKASQ+FHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQ+TYGVLMHGLCMKGNMVEASKLY SM+EM++EPNDVIYN MINGYCKECNSYKALKFLQEMV KG+TPS+ SY STIEVL  + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGLNPSESL  ++G
Subjt:  LKEMIEAGLNPSESLYTRVG

XP_022971940.1 pentatricopeptide repeat-containing protein At4g11690 [Cucurbita maxima]1.8e-24381.73Show/hide
Query:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV
        M P SIGF+  F                LSN I SSFFTISSILTYST+PNL+       SHEAII+A +QSQL EQSLH FKLM+ +GHCPSS SFN V
Subjt:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT
        LGLL KSGD+H+ WCFFTEFLGRT FD YSFG+ IKAFC++GNVSKGFELLAQMER+GLSPNVVIYTILIDACCKNGDIEQ K+LFS+M+DLG VANQYT
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
        YT MINGFFKKGYKKDGFELYEKMKLVGV PS+YTYN+LINE+CRDGKLS+AFK+FDEMSTRGVSCNVVTY ILIGGLCRKRQ+SKAERL E+MKQ HIN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT T+NLLMDGFCNIGKL+KA+GYFD+LKLIG +PTSV+YNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQ+FHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQ+TYGVLMHGLCMKGNMVEASKLYKSM+EM++EPNDVIYN MINGYCKECNSYKALKFLQEMV KG+TPS+ASYSSTIEVLC + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGLNPSESLY ++G
Subjt:  LKEMIEAGLNPSESLYTRVG

XP_023511558.1 pentatricopeptide repeat-containing protein At4g11690 [Cucurbita pepo subsp. pepo]1.5e-23480Show/hide
Query:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV
        M P SIG V  F                LSN   SSFFTISSILTYST+PNL+       SHEAI SA +QSQ  EQSL+ FKLM+ +G CPSS SFN V
Subjt:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT
        LGLL KSGD+H+ W FFTEFLGRTHFD YSFG+ IKAFCE+GNVSKGFELLAQMER+GLSPNVVIYTILIDACCKNGDIEQ K+LFS+M+DLGLVANQYT
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
        YT MINGFFKKGYKKDGFE +EKMKLVGV PS+YTYN+LINE+CRDGKLS+AFK+FDEMST GVSCNVVTY ILIGGLCRKRQ+SKAERL E+MKQ  IN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT T+NLLMDGFCNIGKL+KA+GYFDELKLIG +PTSV+YNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQ+ HLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQ+TYGVLMHGLCMKGNMVEASKLY SM+EM++EPNDVIYN MINGYCKECNSYKALKFLQEMV KG+TPS+ASYSSTIEVL  + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGLNPSESL  ++G
Subjt:  LKEMIEAGLNPSESLYTRVG

XP_038887006.1 pentatricopeptide repeat-containing protein At4g11690 [Benincasa hispida]2.9e-23880.77Show/hide
Query:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV
        M   SIGFV  F                L N I SSFFT SSILTYST+PNL+ D  CG SH AII+A +QSQ  EQSLH+FKLMV +G+CPSS+SFN V
Subjt:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT
        LG L KSG++HR W FF+EFL RT FDVYSFG+ IKAFCE+GN+SKGF+LLAQMERMGLS NVVIYTILIDACCKNGDIEQ K+LFSRMDDLGLVAN YT
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
        YTVMINGFFKKGY+KDGFELYEKMKLVGV P++YTYNSLINE+CRDGKLSLAFKLFDEMSTRGVSCNV+TYNILIGGLCRKRQVSKAE LLEQMKQAHIN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT TFNLL+DG CN GKLDKA+ YFD++KLIGQSPTSVTYNILIAGFSK GNS+VVSELVREMEDRGISPSKVTYTILM AFVRSDDVEKA +MF LMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        K+GSVPDQ+TYGVL+HGLCMKGNMVEASKLYKSM+EMHLEPNDVIYNTMINGYCKECNSYKALKFL+EMVKKG+TPS+ASYS TI VLC + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGL PSESL  +VG
Subjt:  LKEMIEAGLNPSESLYTRVG

TrEMBL top hitse value%identityAlignment
A0A1S3C0B2 pentatricopeptide repeat-containing protein At4g116907.8e-22976.92Show/hide
Query:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV
        M P SIGFV  F                LSN I SSFFTISS+LTYST+ NL+S+ VCGR H+A+I+A +QS   EQSL  FKLMV +GH PSS SFN V
Subjt:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT
        L LL KSG++ R W FFTE+LGRT FDVYSFG+ IKAFCE+GNVSKGFELLAQME MG+SPNV IYTILI+ACCKNGDI+Q K++FSRMDDLGL A+QY 
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
        YTVMINGFFKKGYKKDGFELYEKMKL+GV P++YTYNSLI E+CRDGKLSLAFKLFDE+S RGV+CN VTYNILIGGLCRK QV KAE LLE+MK+AHIN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT TFNLLMDG CN GKLDKA+ Y D+LKLIGQSPT VTYNILI+GFSK GNS+VVSELVREMEDRGISPSKVTYTILMDAFVRSDD+EKA +MFHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        ++G VPDQ+TYGVL+HGLC++GNMVEASKLYKSM+EMHLEPNDVIYNTMINGYCKECNSYKALKFL+EMVK G+TP++ASY STI+VLCKD KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEM EAGL P ESL ++VG
Subjt:  LKEMIEAGLNPSESLYTRVG

A0A5D3C6B4 Pentatricopeptide repeat-containing protein7.8e-22976.92Show/hide
Query:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV
        M P SIGFV  F                LSN I SSFFTISS+LTYST+ NL+S+ VCGR H+A+I+A +QS   EQSL  FKLMV +GH PSS SFN V
Subjt:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT
        L LL KSG++ R W FFTE+LGRT FDVYSFG+ IKAFCE+GNVSKGFELLAQME MG+SPNV IYTILI+ACCKNGDI+Q K++FSRMDDLGL A+QY 
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
        YTVMINGFFKKGYKKDGFELYEKMKL+GV P++YTYNSLI E+CRDGKLSLAFKLFDE+S RGV+CN VTYNILIGGLCRK QV KAE LLE+MK+AHIN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT TFNLLMDG CN GKLDKA+ Y D+LKLIGQSPT VTYNILI+GFSK GNS+VVSELVREMEDRGISPSKVTYTILMDAFVRSDD+EKA +MFHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        ++G VPDQ+TYGVL+HGLC++GNMVEASKLYKSM+EMHLEPNDVIYNTMINGYCKECNSYKALKFL+EMVK G+TP++ASY STI+VLCKD KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEM EAGL P ESL ++VG
Subjt:  LKEMIEAGLNPSESLYTRVG

A0A6J1C8X2 pentatricopeptide repeat-containing protein At4g116901.1e-25986.59Show/hide
Query:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV
        M P   GFV  FLA+++R   HSIPVQVLSN IFS FFTISSILT+STR NL++ P+CG  HEAIISA+VQSQLPEQSL+HFKLMV +G CPSSNSFN V
Subjt:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT
        LGLLTKSGD+++AWCFF EFLGRTHFDVYSFG+MIKAFCE GNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQ K+LFS+M DLGLVANQYT
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
        YTVMING FKKG KKDGFELYEKM L+GVFPSVYTYNSLINE+CRDG L LAFKLFDEM TRGVSCNVVTYNILIGGLCR RQV KAE LLEQMK AHIN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        P+TITFNLLMDGFCN+GK DKA+ YFDELKLIG SPTSVTYNILIAGFSKAGNSAVV ELVREMEDRGISPSKVTYTILMDAFVRSDDV KASQMFHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIE HL+PNDVIYNTMINGYCKECNSYKALKFLQEMV+KGMTPS ASYSSTIEVLCKD KS EAK+L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVGLA
        LKEMIE GL+PSESL  RVG A
Subjt:  LKEMIEAGLNPSESLYTRVGLA

A0A6J1GM09 pentatricopeptide repeat-containing protein At4g116902.1e-23479.23Show/hide
Query:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV
        M P SI FV  F                LSN I SSF TISSILTYST+PNL+       SHEAI SA +QSQ  EQSL+ FKLM+ +GHCPS+ SFN V
Subjt:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT
        LGLL KSGD+H+ W FFTEFLGRTHFD YSFG+ IKAFC++GNVSKGFELLAQMER+GLSPNVVIYTILIDACCKNGDIEQ K+LFS+M+DLGLVANQYT
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
        YT MINGFFKKGYKKDGFEL+EKMKLVGV PS+YTYN+LINE+CRDGKLS+AFK+FDEMST GVSCNVVTY ILIGGLCRKRQ++KAERL E+MKQ  IN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT T+NLLMDGFCNIGKL+KA+GYFDELKLIG +PTSV+YNILIAGFSKAGNS+VVSELVREMEDRG+SPSKVTYTILMDAFVRSDDVEKASQ+FHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQ+TYGVLMHGLCMKGNMVEASKLY SM+EM++EPNDVIYN MINGYCKECNSYKALKFLQEMV KG+TPS+ SY STIEVL  + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGLNPSESL  ++G
Subjt:  LKEMIEAGLNPSESLYTRVG

A0A6J1I748 pentatricopeptide repeat-containing protein At4g116908.6e-24481.73Show/hide
Query:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV
        M P SIGF+  F                LSN I SSFFTISSILTYST+PNL+       SHEAII+A +QSQL EQSLH FKLM+ +GHCPSS SFN V
Subjt:  MAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIV

Query:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT
        LGLL KSGD+H+ WCFFTEFLGRT FD YSFG+ IKAFC++GNVSKGFELLAQMER+GLSPNVVIYTILIDACCKNGDIEQ K+LFS+M+DLG VANQYT
Subjt:  LGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN
        YT MINGFFKKGYKKDGFELYEKMKLVGV PS+YTYN+LINE+CRDGKLS+AFK+FDEMSTRGVSCNVVTY ILIGGLCRKRQ+SKAERL E+MKQ HIN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHIN

Query:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK
        PTT T+NLLMDGFCNIGKL+KA+GYFD+LKLIG +PTSV+YNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQ+FHLMK
Subjt:  PTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMK

Query:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL
        KVGSVPDQ+TYGVLMHGLCMKGNMVEASKLYKSM+EM++EPNDVIYN MINGYCKECNSYKALKFLQEMV KG+TPS+ASYSSTIEVLC + KS EAK L
Subjt:  KVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDL

Query:  LKEMIEAGLNPSESLYTRVG
        LKEMIEAGLNPSESLY ++G
Subjt:  LKEMIEAGLNPSESLYTRVG

SwissProt top hitse value%identityAlignment
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial2.4e-6228.42Show/hide
Query:  TISSILTYSTRPNLSSDPVCGR--SHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF--DVYSFGVM
        T ++I+ +   P +    VC    S+  +I    Q    +++ H   LM  +G+ P   S++ V+    + G++ + W    E + R     + Y +G +
Subjt:  TISSILTYSTRPNLSSDPVCGR--SHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF--DVYSFGVM

Query:  IKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVY
        I   C    +++  E  ++M R G+ P+ V+YT LID  CK GDI      F  M    +  +  TYT +I+GF + G   +  +L+ +M   G+ P   
Subjt:  IKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVY

Query:  TYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQ
        T+  LIN +C+ G +  AF++ + M   G S NVVTY  LI GLC++  +  A  LL +M +  + P   T+N +++G C  G +++A+    E +  G 
Subjt:  TYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQ

Query:  SPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSM
        +  +VTY  L+  + K+G      E+++EM  +G+ P+ VT+ +LM+ F     +E   ++ + M   G  P+  T+  L+   C++ N+  A+ +YK M
Subjt:  SPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSM

Query:  IEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLY
            + P+   Y  ++ G+CK  N  +A    QEM  KG + S+++YS  I+   K +K  EA+++  +M   GL   + ++
Subjt:  IEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLY

Q3EDF8 Pentatricopeptide repeat-containing protein At1g099004.2e-6230.09Show/hide
Query:  KLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQG
        +++   G  P   ++N+++    K+G+++ A             DV ++  ++++ C+ G + +  E+L +M +    P+V+ YTILI+A C++  +   
Subjt:  KLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQG

Query:  KILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKR
          L   M D G   +  TY V++NG  K+G   +  +    M   G  P+V T+N ++   C  G+   A KL  +M  +G S +VVT+NILI  LCRK 
Subjt:  KILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKR

Query:  QVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDA
         + +A  +LE+M Q    P ++++N L+ GFC   K+D+AI Y + +   G  P  VTYN ++    K G      E++ ++  +G SP  +TY  ++D 
Subjt:  QVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDA

Query:  FVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYS
          ++    KA ++   M+     PD  TY  L+ GL  +G + EA K +     M + PN V +N+++ G CK   + +A+ FL  M+ +G  P+  SY+
Subjt:  FVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYS

Query:  STIEVLCKDRKSAEAKDLLKEMIEAGLNPSES
          IE L  +  + EA +LL E+   GL    S
Subjt:  STIEVLCKDRKSAEAKDLLKEMIEAGLNPSES

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397102.0e-6432.46Show/hide
Query:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKS-GDVHRAWCFFTEFL-GRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS
        + ++ +Y +  L +++L    L    G  P   S+N VL    +S  ++  A   F E L  +   +V+++ ++I+ FC  GN+     L  +ME  G  
Subjt:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKS-GDVHRAWCFFTEFL-GRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS

Query:  PNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMS
        PNVV Y  LID  CK   I+ G  L   M   GL  N  +Y V+ING  ++G  K+   +  +M   G      TYN+LI  +C++G    A  +  EM 
Subjt:  PNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMS

Query:  TRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSEL
          G++ +V+TY  LI  +C+   +++A   L+QM+   + P   T+  L+DGF   G +++A     E+   G SP+ VTYN LI G    G       +
Subjt:  TRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSEL

Query:  VREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSY
        + +M+++G+SP  V+Y+ ++  F RS DV++A ++   M + G  PD  TY  L+ G C +    EA  LY+ M+ + L P++  Y  +IN YC E +  
Subjt:  VREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSY

Query:  KALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLY
        KAL+   EMV+KG+ P + +YS  I  L K  ++ EAK LL ++      PS+  Y
Subjt:  KALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLY

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial4.2e-6228.92Show/hide
Query:  AIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGR-THFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPN
        ++++ Y  S+   +++     M   G+ P++ +FN ++  L        A       + +    D+ ++GV++   C+ G+    F LL +ME+  L P 
Subjt:  AIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGR-THFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPN

Query:  VVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTR
        V+IY  +ID  CK   ++    LF  M+  G+  N  TY+ +I+     G   D   L   M    + P V+T+++LI+   ++GKL  A KL+DEM  R
Subjt:  VVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTR

Query:  GVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVR
         +  ++VTY+ LI G C   ++ +A+++ E M   H  P  +T+N L+ GFC   ++++ +  F E+   G    +VTYNILI G  +AG+  +  E+ +
Subjt:  GVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVR

Query:  EMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKA
        EM   G+ P+ +TY  L+D   ++  +EKA  +F  +++    P  YTY +++ G+C  G + +   L+ ++    ++P+ V YNTMI+G+C++ +  +A
Subjt:  EMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKA

Query:  LKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG
            +EM + G  P+   Y++ I    +D     + +L+KEM   G
Subjt:  LKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG

Q9T0D6 Pentatricopeptide repeat-containing protein At4g116901.7e-15652.91Show/hide
Query:  VRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSG
        +R  L+ NL  HA S+ +QV+S  I S FFT SS+L Y T    S      R +E II++YVQSQ    S+ +F  MV  G  P SN FN +L  +  S 
Subjt:  VRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSG

Query:  DVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGF
          ++ W FF E   +   DVYSFG++IK  CE G + K F+LL ++   G SPNVVIYT LID CCK G+IE+ K LF  M  LGLVAN+ TYTV+ING 
Subjt:  DVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGF

Query:  FKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNL
        FK G KK GFE+YEKM+  GVFP++YTYN ++N+ C+DG+   AF++FDEM  RGVSCN+VTYN LIGGLCR+ ++++A ++++QMK   INP  IT+N 
Subjt:  FKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNL

Query:  LMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQ
        L+DGFC +GKL KA+    +LK  G SP+ VTYNIL++GF + G+++  +++V+EME+RGI PSKVTYTIL+D F RSD++EKA Q+   M+++G VPD 
Subjt:  LMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQ

Query:  YTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG
        +TY VL+HG C+KG M EAS+L+KSM+E + EPN+VIYNTMI GYCKE +SY+ALK L+EM +K + P++ASY   IEVLCK+RKS EA+ L+++MI++G
Subjt:  YTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG

Query:  LNPSESLYTRVGLAKS
        ++PS S+ + +  AK+
Subjt:  LNPSESLYTRVGLAKS

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.7e-6328.42Show/hide
Query:  TISSILTYSTRPNLSSDPVCGR--SHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF--DVYSFGVM
        T ++I+ +   P +    VC    S+  +I    Q    +++ H   LM  +G+ P   S++ V+    + G++ + W    E + R     + Y +G +
Subjt:  TISSILTYSTRPNLSSDPVCGR--SHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF--DVYSFGVM

Query:  IKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVY
        I   C    +++  E  ++M R G+ P+ V+YT LID  CK GDI      F  M    +  +  TYT +I+GF + G   +  +L+ +M   G+ P   
Subjt:  IKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVY

Query:  TYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQ
        T+  LIN +C+ G +  AF++ + M   G S NVVTY  LI GLC++  +  A  LL +M +  + P   T+N +++G C  G +++A+    E +  G 
Subjt:  TYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQ

Query:  SPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSM
        +  +VTY  L+  + K+G      E+++EM  +G+ P+ VT+ +LM+ F     +E   ++ + M   G  P+  T+  L+   C++ N+  A+ +YK M
Subjt:  SPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSM

Query:  IEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLY
            + P+   Y  ++ G+CK  N  +A    QEM  KG + S+++YS  I+   K +K  EA+++  +M   GL   + ++
Subjt:  IEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLY

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein1.7e-6328.42Show/hide
Query:  TISSILTYSTRPNLSSDPVCGR--SHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF--DVYSFGVM
        T ++I+ +   P +    VC    S+  +I    Q    +++ H   LM  +G+ P   S++ V+    + G++ + W    E + R     + Y +G +
Subjt:  TISSILTYSTRPNLSSDPVCGR--SHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGRTHF--DVYSFGVM

Query:  IKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVY
        I   C    +++  E  ++M R G+ P+ V+YT LID  CK GDI      F  M    +  +  TYT +I+GF + G   +  +L+ +M   G+ P   
Subjt:  IKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVY

Query:  TYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQ
        T+  LIN +C+ G +  AF++ + M   G S NVVTY  LI GLC++  +  A  LL +M +  + P   T+N +++G C  G +++A+    E +  G 
Subjt:  TYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQ

Query:  SPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSM
        +  +VTY  L+  + K+G      E+++EM  +G+ P+ VT+ +LM+ F     +E   ++ + M   G  P+  T+  L+   C++ N+  A+ +YK M
Subjt:  SPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSM

Query:  IEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLY
            + P+   Y  ++ G+CK  N  +A    QEM  KG + S+++YS  I+   K +K  EA+++  +M   GL   + ++
Subjt:  IEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLY

AT1G62670.1 rna processing factor 23.0e-6328.92Show/hide
Query:  AIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGR-THFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPN
        ++++ Y  S+   +++     M   G+ P++ +FN ++  L        A       + +    D+ ++GV++   C+ G+    F LL +ME+  L P 
Subjt:  AIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGR-THFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPN

Query:  VVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTR
        V+IY  +ID  CK   ++    LF  M+  G+  N  TY+ +I+     G   D   L   M    + P V+T+++LI+   ++GKL  A KL+DEM  R
Subjt:  VVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTR

Query:  GVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVR
         +  ++VTY+ LI G C   ++ +A+++ E M   H  P  +T+N L+ GFC   ++++ +  F E+   G    +VTYNILI G  +AG+  +  E+ +
Subjt:  GVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVR

Query:  EMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKA
        EM   G+ P+ +TY  L+D   ++  +EKA  +F  +++    P  YTY +++ G+C  G + +   L+ ++    ++P+ V YNTMI+G+C++ +  +A
Subjt:  EMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKA

Query:  LKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG
            +EM + G  P+   Y++ I    +D     + +L+KEM   G
Subjt:  LKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG

AT4G11690.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.2e-15752.91Show/hide
Query:  VRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSG
        +R  L+ NL  HA S+ +QV+S  I S FFT SS+L Y T    S      R +E II++YVQSQ    S+ +F  MV  G  P SN FN +L  +  S 
Subjt:  VRQFLAANLRCHAHSIPVQVLSNGIFSSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSG

Query:  DVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGF
          ++ W FF E   +   DVYSFG++IK  CE G + K F+LL ++   G SPNVVIYT LID CCK G+IE+ K LF  M  LGLVAN+ TYTV+ING 
Subjt:  DVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGF

Query:  FKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNL
        FK G KK GFE+YEKM+  GVFP++YTYN ++N+ C+DG+   AF++FDEM  RGVSCN+VTYN LIGGLCR+ ++++A ++++QMK   INP  IT+N 
Subjt:  FKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNL

Query:  LMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQ
        L+DGFC +GKL KA+    +LK  G SP+ VTYNIL++GF + G+++  +++V+EME+RGI PSKVTYTIL+D F RSD++EKA Q+   M+++G VPD 
Subjt:  LMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQ

Query:  YTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG
        +TY VL+HG C+KG M EAS+L+KSM+E + EPN+VIYNTMI GYCKE +SY+ALK L+EM +K + P++ASY   IEVLCK+RKS EA+ L+++MI++G
Subjt:  YTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAG

Query:  LNPSESLYTRVGLAKS
        ++PS S+ + +  AK+
Subjt:  LNPSESLYTRVGLAKS

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-6532.46Show/hide
Query:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKS-GDVHRAWCFFTEFL-GRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS
        + ++ +Y +  L +++L    L    G  P   S+N VL    +S  ++  A   F E L  +   +V+++ ++I+ FC  GN+     L  +ME  G  
Subjt:  EAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKS-GDVHRAWCFFTEFL-GRTHFDVYSFGVMIKAFCEDGNVSKGFELLAQMERMGLS

Query:  PNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMS
        PNVV Y  LID  CK   I+ G  L   M   GL  N  +Y V+ING  ++G  K+   +  +M   G      TYN+LI  +C++G    A  +  EM 
Subjt:  PNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFKLFDEMS

Query:  TRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSEL
          G++ +V+TY  LI  +C+   +++A   L+QM+   + P   T+  L+DGF   G +++A     E+   G SP+ VTYN LI G    G       +
Subjt:  TRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSEL

Query:  VREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSY
        + +M+++G+SP  V+Y+ ++  F RS DV++A ++   M + G  PD  TY  L+ G C +    EA  LY+ M+ + L P++  Y  +IN YC E +  
Subjt:  VREMEDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSY

Query:  KALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLY
        KAL+   EMV+KG+ P + +YS  I  L K  ++ EAK LL ++      PS+  Y
Subjt:  KALKFLQEMVKKGMTPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATATTTTTCTGTTTCTCCCGAATCCTGATTGAAGTTCGACATCAAATTTCCTCTAATTCTTGGTCGCAATCGGCCTCAAAGTTCTCGGACAATCTCAGAATCAA
CATTATCAATATCCAACTCAACTTCATCGCCATTTTCGAAGAGCCTCTGATTCCTCTTCTTCATAATCTTTCACAAAATTGTTCTGCTTATCTGAAGTTCTTAATTAGTC
TCGTCTTTATGGCACCCATGTCCATTGGCTTCGTGCGCCAATTTCTTGCCGCCAATTTGCGATGCCACGCTCACTCCATTCCCGTACAAGTTCTTTCTAATGGAATCTTC
TCGTCTTTCTTCACCATATCTTCCATTCTAACCTATTCAACACGACCAAATCTGAGTTCTGATCCAGTCTGTGGCCGTTCTCATGAAGCAATTATCAGTGCTTATGTCCA
ATCTCAGTTACCAGAACAATCCCTTCACCATTTCAAACTAATGGTCCATGAAGGGCATTGCCCGAGTTCAAACTCTTTCAATATTGTGTTGGGTTTACTTACTAAGTCAG
GTGATGTACATAGAGCTTGGTGTTTTTTCACTGAATTTTTGGGGAGGACTCACTTTGATGTGTATAGTTTTGGGGTAATGATTAAAGCCTTTTGTGAAGATGGGAATGTA
AGTAAAGGTTTTGAGCTTTTGGCTCAAATGGAGAGGATGGGTTTGTCTCCTAATGTTGTTATATACACTATTTTGATTGATGCTTGTTGCAAAAATGGTGACATTGAGCA
GGGTAAAATCTTATTTTCTAGGATGGATGACCTTGGTTTGGTTGCAAACCAATATACTTATACTGTGATGATCAATGGATTTTTCAAGAAAGGATATAAAAAGGATGGTT
TTGAGCTTTATGAAAAGATGAAGCTTGTTGGGGTGTTTCCTAGTGTATACACTTACAACAGTCTTATTAATGAGCATTGCAGGGATGGGAAGTTGAGTCTTGCATTTAAG
CTGTTCGATGAAATGTCTACTAGAGGGGTGTCGTGTAACGTAGTCACGTACAATATTCTGATTGGTGGGTTATGTCGTAAGAGACAAGTTTCGAAAGCAGAACGGTTGTT
AGAACAAATGAAACAAGCTCATATAAATCCAACTACAATAACATTTAACTTGTTGATGGATGGGTTCTGTAACATTGGAAAGTTGGATAAGGCTATAGGTTATTTCGATG
AGTTGAAGTTGATTGGTCAGTCCCCAACTTCAGTGACCTACAACATTTTAATTGCAGGTTTCTCTAAAGCAGGAAATTCTGCTGTAGTTTCTGAATTAGTGAGAGAGATG
GAGGATAGAGGCATTTCTCCCTCTAAAGTAACATATACAATTCTGATGGATGCCTTCGTTCGATCAGATGATGTGGAGAAAGCCTCTCAAATGTTTCATCTCATGAAGAA
AGTCGGTTCGGTCCCGGATCAGTATACGTATGGTGTCCTGATGCATGGTTTATGTATGAAAGGTAACATGGTAGAGGCATCTAAGCTGTACAAATCAATGATTGAGATGC
ATCTTGAGCCTAATGATGTAATCTATAATACAATGATAAATGGGTACTGCAAAGAGTGCAATTCTTACAAGGCCTTGAAGTTTCTTCAAGAAATGGTTAAGAAAGGAATG
ACTCCAAGTATGGCTAGTTATAGTTCTACCATTGAAGTCCTCTGCAAGGACAGGAAGTCGGCCGAGGCGAAAGATTTACTTAAAGAGATGATCGAGGCCGGTTTGAACCC
GTCAGAATCTCTCTATACTAGGGTTGGTTTAGCCAAGTCTTGTGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATATTTTTCTGTTTCTCCCGAATCCTGATTGAAGTTCGACATCAAATTTCCTCTAATTCTTGGTCGCAATCGGCCTCAAAGTTCTCGGACAATCTCAGAATCAA
CATTATCAATATCCAACTCAACTTCATCGCCATTTTCGAAGAGCCTCTGATTCCTCTTCTTCATAATCTTTCACAAAATTGTTCTGCTTATCTGAAGTTCTTAATTAGTC
TCGTCTTTATGGCACCCATGTCCATTGGCTTCGTGCGCCAATTTCTTGCCGCCAATTTGCGATGCCACGCTCACTCCATTCCCGTACAAGTTCTTTCTAATGGAATCTTC
TCGTCTTTCTTCACCATATCTTCCATTCTAACCTATTCAACACGACCAAATCTGAGTTCTGATCCAGTCTGTGGCCGTTCTCATGAAGCAATTATCAGTGCTTATGTCCA
ATCTCAGTTACCAGAACAATCCCTTCACCATTTCAAACTAATGGTCCATGAAGGGCATTGCCCGAGTTCAAACTCTTTCAATATTGTGTTGGGTTTACTTACTAAGTCAG
GTGATGTACATAGAGCTTGGTGTTTTTTCACTGAATTTTTGGGGAGGACTCACTTTGATGTGTATAGTTTTGGGGTAATGATTAAAGCCTTTTGTGAAGATGGGAATGTA
AGTAAAGGTTTTGAGCTTTTGGCTCAAATGGAGAGGATGGGTTTGTCTCCTAATGTTGTTATATACACTATTTTGATTGATGCTTGTTGCAAAAATGGTGACATTGAGCA
GGGTAAAATCTTATTTTCTAGGATGGATGACCTTGGTTTGGTTGCAAACCAATATACTTATACTGTGATGATCAATGGATTTTTCAAGAAAGGATATAAAAAGGATGGTT
TTGAGCTTTATGAAAAGATGAAGCTTGTTGGGGTGTTTCCTAGTGTATACACTTACAACAGTCTTATTAATGAGCATTGCAGGGATGGGAAGTTGAGTCTTGCATTTAAG
CTGTTCGATGAAATGTCTACTAGAGGGGTGTCGTGTAACGTAGTCACGTACAATATTCTGATTGGTGGGTTATGTCGTAAGAGACAAGTTTCGAAAGCAGAACGGTTGTT
AGAACAAATGAAACAAGCTCATATAAATCCAACTACAATAACATTTAACTTGTTGATGGATGGGTTCTGTAACATTGGAAAGTTGGATAAGGCTATAGGTTATTTCGATG
AGTTGAAGTTGATTGGTCAGTCCCCAACTTCAGTGACCTACAACATTTTAATTGCAGGTTTCTCTAAAGCAGGAAATTCTGCTGTAGTTTCTGAATTAGTGAGAGAGATG
GAGGATAGAGGCATTTCTCCCTCTAAAGTAACATATACAATTCTGATGGATGCCTTCGTTCGATCAGATGATGTGGAGAAAGCCTCTCAAATGTTTCATCTCATGAAGAA
AGTCGGTTCGGTCCCGGATCAGTATACGTATGGTGTCCTGATGCATGGTTTATGTATGAAAGGTAACATGGTAGAGGCATCTAAGCTGTACAAATCAATGATTGAGATGC
ATCTTGAGCCTAATGATGTAATCTATAATACAATGATAAATGGGTACTGCAAAGAGTGCAATTCTTACAAGGCCTTGAAGTTTCTTCAAGAAATGGTTAAGAAAGGAATG
ACTCCAAGTATGGCTAGTTATAGTTCTACCATTGAAGTCCTCTGCAAGGACAGGAAGTCGGCCGAGGCGAAAGATTTACTTAAAGAGATGATCGAGGCCGGTTTGAACCC
GTCAGAATCTCTCTATACTAGGGTTGGTTTAGCCAAGTCTTGTGCATAA
Protein sequenceShow/hide protein sequence
MAIFFCFSRILIEVRHQISSNSWSQSASKFSDNLRINIINIQLNFIAIFEEPLIPLLHNLSQNCSAYLKFLISLVFMAPMSIGFVRQFLAANLRCHAHSIPVQVLSNGIF
SSFFTISSILTYSTRPNLSSDPVCGRSHEAIISAYVQSQLPEQSLHHFKLMVHEGHCPSSNSFNIVLGLLTKSGDVHRAWCFFTEFLGRTHFDVYSFGVMIKAFCEDGNV
SKGFELLAQMERMGLSPNVVIYTILIDACCKNGDIEQGKILFSRMDDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEHCRDGKLSLAFK
LFDEMSTRGVSCNVVTYNILIGGLCRKRQVSKAERLLEQMKQAHINPTTITFNLLMDGFCNIGKLDKAIGYFDELKLIGQSPTSVTYNILIAGFSKAGNSAVVSELVREM
EDRGISPSKVTYTILMDAFVRSDDVEKASQMFHLMKKVGSVPDQYTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLQEMVKKGM
TPSMASYSSTIEVLCKDRKSAEAKDLLKEMIEAGLNPSESLYTRVGLAKSCA