; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G001890 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G001890
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr06:2055069..2056520
RNA-Seq ExpressionLsi06G001890
SyntenyLsi06G001890
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008450741.1 PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis melo]8.9e-24688.15Show/hide
Query:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH
        MHLAPRL CI+HL+N R SS QL QIHAQ ITNGFKSPSPYAKLI H CKKSS E+IA+A LIF+H QH PNLFLFNT IRCAPPQ+SISIFA WVSTPH
Subjt:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH

Query:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL
        FEFDDFTFIFVLGACAR PS STLMIG+QIHTHILKRGIVSNIW QTTMIHFY+ NKDVG ARKVFDEMSVRNSVTWNAMIAGYCSQS +V+QKYARD L
Subjt:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL

Query:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH
        ELFR M VESTN EVKPTDTTMVC+LSAAS LG+LETG CVHAYI+KTIDSPE D+FIGTGLVNMYSKCG L+SASSVFKQMKQRNVLTWT+MATGLAVH
Subjt:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH

Query:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC
        G+GKEALELLDAMGA GVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELIL MP++PDGVLWRSLLSSC
Subjt:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC

Query:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGL
        M+HGDVEMGERVGKLLVERQGGESFDDEWCVGS DFVALSNVYASA+RW+DVE LREEMKIKGIENKAG SSVQTTG QGL
Subjt:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGL

XP_011659935.1 pentatricopeptide repeat-containing protein At3g18970 [Cucumis sativus]1.0e-24687.42Show/hide
Query:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH
        MHLAPRL CIHHL+N R SS QL QIHAQLITNGFKSPSPYAKLI H CKKSS E+IA+A LIF+H Q+ PNLFLFNT IRCAPP HSISIFATWVST H
Subjt:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH

Query:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL
        FEFDDFTFIFVLGACAR PS STLMIG+QIHTHILKRGIVSNIWVQTTMIHFY+INKDVG ARK+FDEMS+RNSVTWNAMIAGYCSQ  +V+QKYARD L
Subjt:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL

Query:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH
        ELFR M VESTN EVKPTDTTMVC+LSAASQLG+LETG+CVHAYI+KT+DSPE D+FIGTGLVNMYSKCG LNSASSVFKQMKQ+NVLTWT+MATGLAVH
Subjt:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH

Query:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC
        G+GKEALELLDAMGA GVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQHYGCIVDLLGRSGHLREAY+LIL MP++PDGVLWRSLLSSC
Subjt:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC

Query:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGL-EAL
        M+HGDVEMGERVGKLLVERQGGESFDDEWCVGS DFVALSNVYAS +RW+DVE LR+EMKIKGIENKAGCSS+QTTG QGL EAL
Subjt:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGL-EAL

XP_022988094.1 pentatricopeptide repeat-containing protein At3g18970 [Cucurbita maxima]8.1e-23184.73Show/hide
Query:  MHLAPRLRCIHHLNNTRNSS-HQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTP
        MHL+PRL CIHHLNN  NSS  QLKQI AQLITN FKSPSPYAKLIAHFC K SP+A AYA LI  H +HP NLFLFNT IRCAPPQHSISIFA  +ST 
Subjt:  MHLAPRLRCIHHLNNTRNSS-HQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTP

Query:  HFEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDG
        HFEFDDFTFIF+LGACAR PS  TL  GKQIHTH+LKRG+VSNIWVQTTMIHFYAINKDVG ARKVFDEMSVRN+VTWNAMIAGYCSQS RVAQKY R+ 
Subjt:  HFEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDG

Query:  LELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV
        LELFR M VESTNS+V PTDTTMVCLLSAASQLGVLETG CVHAYIEKT DSPEND+FIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLA+
Subjt:  LELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV

Query:  HGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSS
        HGKGKEALELL+AMG  GVKPNAVTFTSLLS CCHGGLIEEGLHLF VMERKFGVVPQMQHYGCIVDLLGR GHL+EAYELILGMPV PDGVLWRSL+SS
Subjt:  HGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSS

Query:  CMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG
        CMVH DVEMGE+VGK LVE   G    DEWC GS DFVALSNVYASAQRWE+V+ +REEMKIKGI+NKAG SSVQT G
Subjt:  CMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG

XP_023515866.1 pentatricopeptide repeat-containing protein At3g18970 [Cucurbita pepo subsp. pepo]9.9e-22984.1Show/hide
Query:  MHLAPRLRCIHHLNNTRNSS-HQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTP
        M L+PRL CIHHLNN  +SS  QLKQIHAQLITN FKSP PYAKLIAHFC   SPEA AYA LI  H +HP NLFLFNT IRCAPPQHSISIFA  VST 
Subjt:  MHLAPRLRCIHHLNNTRNSS-HQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTP

Query:  HFEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDG
        HFEFDDFTFIF+LGACAR PS  TL  GKQIHTHILKRG+VSNIWVQTTMIHFYAINKDVG ARKVFDEMSVRN+VTWNAMIAGYCSQS RVAQKY R+ 
Subjt:  HFEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDG

Query:  LELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV
        LELFR M VESTNS+V PTDTTMVCLLSAASQLGVLETG CVHAYIEKTIDSPEND+FIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLA+
Subjt:  LELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV

Query:  HGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSS
        HGKGKEALELL+AMG  GVKPNAVTFTSLLS CCHGGLIEEGLHLF VMERKFGVVPQMQHYGCIVDLLGR GHL+EAYE+ILGMP+ PDGVLWR L+SS
Subjt:  HGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSS

Query:  CMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG
        CMVH DVEMGE+VGK LVE   G    DEWC GS DFVALSNVYASAQRWE+V+ +REEMKIKGI+NK G SSVQT G
Subjt:  CMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG

XP_038877553.1 pentatricopeptide repeat-containing protein At3g18970 [Benincasa hispida]1.5e-25391.32Show/hide
Query:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH
        MHLAPR+RCIHHLNN RNSS QLKQIHAQLIT+GFK PSPYAKLI HFC KSSPEAIAYA LIF+H QHPPNLFLFNT IRCAPPQHSISIFA+WVS PH
Subjt:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH

Query:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL
        FEFD+FTFIFVLGACAR PS STLMIGKQIHT ILKRGIVSNI VQTTMIHFYAINK+VG ARKVFDEMSVRNSVTWNAMIAGYCSQ    A KYARD L
Subjt:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL

Query:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH
        ELFR M VESTNSEVKPTDTTMVCLLSAASQLG+LETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH
Subjt:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH

Query:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC
        G+GKEALELLDAMG  GVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMP++ DGVLWRSLLSSC
Subjt:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC

Query:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGLEAL
        MVHG+VEMGERVGKLLVERQGGESFDDE CVGS DFVALSNVYASA+RWEDVE +REEMKIKGIENKAGCSSVQTTG QGLEAL
Subjt:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGLEAL

TrEMBL top hitse value%identityAlignment
A0A0A0LZ63 Uncharacterized protein5.1e-24787.42Show/hide
Query:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH
        MHLAPRL CIHHL+N R SS QL QIHAQLITNGFKSPSPYAKLI H CKKSS E+IA+A LIF+H Q+ PNLFLFNT IRCAPP HSISIFATWVST H
Subjt:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH

Query:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL
        FEFDDFTFIFVLGACAR PS STLMIG+QIHTHILKRGIVSNIWVQTTMIHFY+INKDVG ARK+FDEMS+RNSVTWNAMIAGYCSQ  +V+QKYARD L
Subjt:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL

Query:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH
        ELFR M VESTN EVKPTDTTMVC+LSAASQLG+LETG+CVHAYI+KT+DSPE D+FIGTGLVNMYSKCG LNSASSVFKQMKQ+NVLTWT+MATGLAVH
Subjt:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH

Query:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC
        G+GKEALELLDAMGA GVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQHYGCIVDLLGRSGHLREAY+LIL MP++PDGVLWRSLLSSC
Subjt:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC

Query:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGL-EAL
        M+HGDVEMGERVGKLLVERQGGESFDDEWCVGS DFVALSNVYAS +RW+DVE LR+EMKIKGIENKAGCSS+QTTG QGL EAL
Subjt:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGL-EAL

A0A1S3BPA5 pentatricopeptide repeat-containing protein At3g189704.3e-24688.15Show/hide
Query:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH
        MHLAPRL CI+HL+N R SS QL QIHAQ ITNGFKSPSPYAKLI H CKKSS E+IA+A LIF+H QH PNLFLFNT IRCAPPQ+SISIFA WVSTPH
Subjt:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH

Query:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL
        FEFDDFTFIFVLGACAR PS STLMIG+QIHTHILKRGIVSNIW QTTMIHFY+ NKDVG ARKVFDEMSVRNSVTWNAMIAGYCSQS +V+QKYARD L
Subjt:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL

Query:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH
        ELFR M VESTN EVKPTDTTMVC+LSAAS LG+LETG CVHAYI+KTIDSPE D+FIGTGLVNMYSKCG L+SASSVFKQMKQRNVLTWT+MATGLAVH
Subjt:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH

Query:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC
        G+GKEALELLDAMGA GVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELIL MP++PDGVLWRSLLSSC
Subjt:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC

Query:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGL
        M+HGDVEMGERVGKLLVERQGGESFDDEWCVGS DFVALSNVYASA+RW+DVE LREEMKIKGIENKAG SSVQTTG QGL
Subjt:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGL

A0A5D3CFP0 Pentatricopeptide repeat-containing protein4.3e-24688.15Show/hide
Query:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH
        MHLAPRL CI+HL+N R SS QL QIHAQ ITNGFKSPSPYAKLI H CKKSS E+IA+A LIF+H QH PNLFLFNT IRCAPPQ+SISIFA WVSTPH
Subjt:  MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPH

Query:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL
        FEFDDFTFIFVLGACAR PS STLMIG+QIHTHILKRGIVSNIW QTTMIHFY+ NKDVG ARKVFDEMSVRNSVTWNAMIAGYCSQS +V+QKYARD L
Subjt:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL

Query:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH
        ELFR M VESTN EVKPTDTTMVC+LSAAS LG+LETG CVHAYI+KTIDSPE D+FIGTGLVNMYSKCG L+SASSVFKQMKQRNVLTWT+MATGLAVH
Subjt:  ELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVH

Query:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC
        G+GKEALELLDAMGA GVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELIL MP++PDGVLWRSLLSSC
Subjt:  GKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSC

Query:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGL
        M+HGDVEMGERVGKLLVERQGGESFDDEWCVGS DFVALSNVYASA+RW+DVE LREEMKIKGIENKAG SSVQTTG QGL
Subjt:  MVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGL

A0A6J1HCU6 pentatricopeptide repeat-containing protein At3g18970 isoform X19.0e-22883.89Show/hide
Query:  MHLAPRLRCIHHLNNTRNSS-HQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTP
        MHL+PRL CIH LNN  NSS  QLKQIHAQLITN FKSPSPYAKLIAHFC K SPEA AYA  I  H +HP NLFLFNT IRCAPPQHSISIFA  VST 
Subjt:  MHLAPRLRCIHHLNNTRNSS-HQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTP

Query:  HFEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDG
        HF+FDDFTFIF+LGACAR PS  TL  GKQIHTHILKRG+VSNIWVQTTMIHFYAINKDVG ARKVFDEM VRN+VTWNAMIAGYCSQS RVA KY  + 
Subjt:  HFEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDG

Query:  LELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV
        L+LFR M VESTNS+V PTDTTMVCLLSAASQLGVLETG CVHAYIEKTIDSPEND+FIGTGLVNMYSKCGCLNSAS VFKQMKQRNVLTWTAMATGLA+
Subjt:  LELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV

Query:  HGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSS
        HGKGKEALELL+AMG  GVKPNAVTFTSLLS CCHGGLIEEGLHLF VMERKFGVVPQMQHYGCIVDLLGR GHL+EAYELILGMP+ PDGVLWRSL+SS
Subjt:  HGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSS

Query:  CMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG
        CMVH DVEMGE+VGK LVE   G    DEWC GS DFVALSNVYASAQRWE+V+ +REEMKIKGI+NKAG SSVQT G
Subjt:  CMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG

A0A6J1JKN4 pentatricopeptide repeat-containing protein At3g189703.9e-23184.73Show/hide
Query:  MHLAPRLRCIHHLNNTRNSS-HQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTP
        MHL+PRL CIHHLNN  NSS  QLKQI AQLITN FKSPSPYAKLIAHFC K SP+A AYA LI  H +HP NLFLFNT IRCAPPQHSISIFA  +ST 
Subjt:  MHLAPRLRCIHHLNNTRNSS-HQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTP

Query:  HFEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDG
        HFEFDDFTFIF+LGACAR PS  TL  GKQIHTH+LKRG+VSNIWVQTTMIHFYAINKDVG ARKVFDEMSVRN+VTWNAMIAGYCSQS RVAQKY R+ 
Subjt:  HFEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDG

Query:  LELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV
        LELFR M VESTNS+V PTDTTMVCLLSAASQLGVLETG CVHAYIEKT DSPEND+FIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLA+
Subjt:  LELFREM-VESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV

Query:  HGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSS
        HGKGKEALELL+AMG  GVKPNAVTFTSLLS CCHGGLIEEGLHLF VMERKFGVVPQMQHYGCIVDLLGR GHL+EAYELILGMPV PDGVLWRSL+SS
Subjt:  HGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSS

Query:  CMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG
        CMVH DVEMGE+VGK LVE   G    DEWC GS DFVALSNVYASAQRWE+V+ +REEMKIKGI+NKAG SSVQT G
Subjt:  CMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG

SwissProt top hitse value%identityAlignment
Q0WQW5 Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial7.2e-7336.38Show/hide
Query:  IHHLNNTRNSSHQLKQIHA-QLITNGFKSPSP---YAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIR-CA----PPQHSISIFATWVSTPH
        I  L  T +   QLKQ+HA  L T   + P+    Y K++      SS   + YA  +F   ++  + F++NT IR CA      + +  ++   +    
Subjt:  IHHLNNTRNSSHQLKQIHA-QLITNGFKSPSP---YAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIR-CA----PPQHSISIFATWVSTPH

Query:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARD-G
           D  TF FVL ACA     S    GKQ+H  I+K G   +++V   +IH Y     + +ARKVFDEM  R+ V+WN+MI       D + +    D  
Subjt:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARD-G

Query:  LELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTID-SPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV
        L+LFREM  S     +P   TM  +LSA + LG L  G   HA++ +  D     D+ +   L+ MY KCG L  A  VF+ M++R++ +W AM  G A 
Subjt:  LELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTID-SPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV

Query:  HGKGKEALELLDAM--GALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLL
        HG+ +EA+   D M      V+PN+VTF  LL AC H G + +G   F +M R + + P ++HYGCIVDL+ R+G++ EA ++++ MP+KPD V+WRSLL
Subjt:  HGKGKEALELLDAM--GALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLL

Query:  SSCMVHG-DVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG
         +C   G  VE+ E + + ++  +      +  C G+  +V LS VYASA RW DV ++R+ M   GI  + GCSS++  G
Subjt:  SSCMVHG-DVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG

Q9FI80 Pentatricopeptide repeat-containing protein At5g489105.9e-7535.78Show/hide
Query:  LNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSS--PEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQH------SISIFATWVSTPHFEFD
        +NN R +   L QIHA  I +G    +  A  I  FC  S      + YA  IF +Q    N F +NT IR            +I++F   +S    E +
Subjt:  LNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSS--PEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQH------SISIFATWVSTPHFEFD

Query:  DFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYA----------------INKDVGI---------------------------
         FTF  VL ACA+T     +  GKQIH   LK G   + +V + ++  Y                 I KD+ +                           
Subjt:  DFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYA----------------INKDVGI---------------------------

Query:  --ARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGLELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGT
          AR +FD+M  R+ V+WN MI+GY          + +D +E+FREM      +++P   T+V +L A S+LG LE G  +H Y E +      D  +G+
Subjt:  --ARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGLELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGT

Query:  GLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH
         L++MYSKCG +  A  VF+++ + NV+TW+AM  G A+HG+  +A++    M   GV+P+ V + +LL+AC HGGL+EEG   F  M    G+ P+++H
Subjt:  GLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH

Query:  YGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMK
        YGC+VDLLGRSG L EA E IL MP+KPD V+W++LL +C + G+VEMG+RV  +L++    +S       G+  +VALSN+YAS   W +V  +R  MK
Subjt:  YGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMK

Query:  IKGIENKAGCSSVQTTG
         K I    GCS +   G
Subjt:  IKGIENKAGCSSVQTTG

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665202.0e-7533.6Show/hide
Query:  NSSHQLKQIHAQLITNGFKSPSPYAKLIAHFC-KKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIR---CA-PPQHSISIFATWV--STPHFEFDDFTFIF
        +   +LKQIHA+++  G    S        FC   +S + + YAQ++F      P+ FL+N  IR   C+  P+ S+ ++   +  S PH   + +TF  
Subjt:  NSSHQLKQIHAQLITNGFKSPSPYAKLIAHFC-KKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIR---CA-PPQHSISIFATWV--STPHFEFDDFTFIF

Query:  VLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSD---------RVAQKYA-----
        +L AC+   + S      QIH  I K G  ++++   ++I+ YA+  +  +A  +FD +   + V+WN++I GY              ++A+K A     
Subjt:  VLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSD---------RVAQKYA-----

Query:  -----------RDGLELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRN
                   ++ L+LF EM    NS+V+P + ++   LSA +QLG LE G  +H+Y+ KT      D  +G  L++MY+KCG +  A  VFK +K+++
Subjt:  -----------RDGLELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRN

Query:  VLTWTAMATGLAVHGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPV
        V  WTA+ +G A HG G+EA+     M  +G+KPN +TFT++L+AC + GL+EEG  +F+ MER + + P ++HYGCIVDLLGR+G L EA   I  MP+
Subjt:  VLTWTAMATGLAVHGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPV

Query:  KPDGVLWRSLLSSCMVHGDVEMGERVGKLLV---ERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG
        KP+ V+W +LL +C +H ++E+GE +G++L+      GG             +V  +N++A  ++W+     R  MK +G+    GCS++   G
Subjt:  KPDGVLWRSLLSSCMVHGDVEMGERVGKLLV---ERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG

Q9LJ69 Pentatricopeptide repeat-containing protein At3g189702.6e-12350.63Show/hide
Query:  PRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQ--LIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPHFE
        P  R +  L     +  Q KQIHAQL+ NG    S + KLI H+C K S E+ +     L+F    H P+ FLFNT ++C+ P+ SI IFA + S     
Subjt:  PRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQ--LIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPHFE

Query:  F-DDFTFIFVLGACARTPSASTLMIGKQIHTHILKRG-IVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL
        + ++ TF+FVLGACAR+ S+S L +G+ +H  + K G +  +  + TT++HFYA N D+  ARKVFDEM  R SVTWNAMI GYCS  D+     AR  +
Subjt:  F-DDFTFIFVLGACARTPSASTLMIGKQIHTHILKRG-IVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL

Query:  ELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHG
         LFR       S V+PTDTTMVC+LSA SQ G+LE G+ VH YIEK   +PE D+FIGT LV+MYSKCGCLN+A SVF+ MK +NV TWT+MATGLA++G
Subjt:  ELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHG

Query:  KGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCM
        +G E   LL+ M   G+KPN +TFTSLLSA  H GL+EEG+ LF  M+ +FGV P ++HYGCIVDLLG++G ++EAY+ IL MP+KPD +L RSL ++C 
Subjt:  KGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCM

Query:  VHGDVEMGERVGKLLVERQGGESFDDEWCVGS--VDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSV
        ++G+  MGE +GK L+E +     +DE   GS   D+VALSNV A   +W +VE LR+EMK + I+ + G S V
Subjt:  VHGDVEMGERVGKLLVERQGGESFDDEWCVGS--VDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSV

Q9SN85 Pentatricopeptide repeat-containing protein At3g475302.1e-7236.3Show/hide
Query:  LKQIHAQLI-TNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRC----APPQHSISIFATWVSTPHFEFDDFTFIFVLGACAR
        L+QIHA L+ T+  ++   +   ++       P  I Y+  +F  Q+  P L   NT IR       P     +F +         +  +  F L  C +
Subjt:  LKQIHAQLI-TNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRC----APPQHSISIFATWVSTPHFEFDDFTFIFVLGACAR

Query:  TPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGLELFREMVESTNSEVKPT
           +  L+ G QIH  I   G +S+  + TT++  Y+  ++   A KVFDE+  R++V+WN + + Y      +  K  RD L LF +M    +  VKP 
Subjt:  TPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGLELFREMVESTNSEVKPT

Query:  DTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHGKGKEALELLDAMGALGV
          T +  L A + LG L+ G  VH +I++  +     L +   LV+MYS+CG ++ A  VF  M++RNV++WTA+ +GLA++G GKEA+E  + M   G+
Subjt:  DTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHGKGKEALELLDAMGALGV

Query:  KPNAVTFTSLLSACCHGGLIEEGLHLFHVMER-KFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCMVHGDVEMGERVGKLLV
         P   T T LLSAC H GL+ EG+  F  M   +F + P + HYGC+VDLLGR+  L +AY LI  M +KPD  +WR+LL +C VHGDVE+GERV   L+
Subjt:  KPNAVTFTSLLSACCHGGLIEEGLHLFHVMER-KFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCMVHGDVEMGERVGKLLV

Query:  ERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG
        E +  E         + D+V L N Y++  +WE V  LR  MK K I  K GCS+++  G
Subjt:  ERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG

Arabidopsis top hitse value%identityAlignment
AT1G59720.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.1e-7436.38Show/hide
Query:  IHHLNNTRNSSHQLKQIHA-QLITNGFKSPSP---YAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIR-CA----PPQHSISIFATWVSTPH
        I  L  T +   QLKQ+HA  L T   + P+    Y K++      SS   + YA  +F   ++  + F++NT IR CA      + +  ++   +    
Subjt:  IHHLNNTRNSSHQLKQIHA-QLITNGFKSPSP---YAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIR-CA----PPQHSISIFATWVSTPH

Query:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARD-G
           D  TF FVL ACA     S    GKQ+H  I+K G   +++V   +IH Y     + +ARKVFDEM  R+ V+WN+MI       D + +    D  
Subjt:  FEFDDFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARD-G

Query:  LELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTID-SPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV
        L+LFREM  S     +P   TM  +LSA + LG L  G   HA++ +  D     D+ +   L+ MY KCG L  A  VF+ M++R++ +W AM  G A 
Subjt:  LELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTID-SPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAV

Query:  HGKGKEALELLDAM--GALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLL
        HG+ +EA+   D M      V+PN+VTF  LL AC H G + +G   F +M R + + P ++HYGCIVDL+ R+G++ EA ++++ MP+KPD V+WRSLL
Subjt:  HGKGKEALELLDAM--GALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLL

Query:  SSCMVHG-DVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG
         +C   G  VE+ E + + ++  +      +  C G+  +V LS VYASA RW DV ++R+ M   GI  + GCSS++  G
Subjt:  SSCMVHG-DVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG

AT3G18970.1 mitochondrial editing factor 201.8e-12450.63Show/hide
Query:  PRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQ--LIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPHFE
        P  R +  L     +  Q KQIHAQL+ NG    S + KLI H+C K S E+ +     L+F    H P+ FLFNT ++C+ P+ SI IFA + S     
Subjt:  PRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQ--LIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPHFE

Query:  F-DDFTFIFVLGACARTPSASTLMIGKQIHTHILKRG-IVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL
        + ++ TF+FVLGACAR+ S+S L +G+ +H  + K G +  +  + TT++HFYA N D+  ARKVFDEM  R SVTWNAMI GYCS  D+     AR  +
Subjt:  F-DDFTFIFVLGACARTPSASTLMIGKQIHTHILKRG-IVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGL

Query:  ELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHG
         LFR       S V+PTDTTMVC+LSA SQ G+LE G+ VH YIEK   +PE D+FIGT LV+MYSKCGCLN+A SVF+ MK +NV TWT+MATGLA++G
Subjt:  ELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHG

Query:  KGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCM
        +G E   LL+ M   G+KPN +TFTSLLSA  H GL+EEG+ LF  M+ +FGV P ++HYGCIVDLLG++G ++EAY+ IL MP+KPD +L RSL ++C 
Subjt:  KGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCM

Query:  VHGDVEMGERVGKLLVERQGGESFDDEWCVGS--VDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSV
        ++G+  MGE +GK L+E +     +DE   GS   D+VALSNV A   +W +VE LR+EMK + I+ + G S V
Subjt:  VHGDVEMGERVGKLLVERQGGESFDDEWCVGS--VDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSV

AT3G47530.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-7336.3Show/hide
Query:  LKQIHAQLI-TNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRC----APPQHSISIFATWVSTPHFEFDDFTFIFVLGACAR
        L+QIHA L+ T+  ++   +   ++       P  I Y+  +F  Q+  P L   NT IR       P     +F +         +  +  F L  C +
Subjt:  LKQIHAQLI-TNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRC----APPQHSISIFATWVSTPHFEFDDFTFIFVLGACAR

Query:  TPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGLELFREMVESTNSEVKPT
           +  L+ G QIH  I   G +S+  + TT++  Y+  ++   A KVFDE+  R++V+WN + + Y      +  K  RD L LF +M    +  VKP 
Subjt:  TPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGLELFREMVESTNSEVKPT

Query:  DTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHGKGKEALELLDAMGALGV
          T +  L A + LG L+ G  VH +I++  +     L +   LV+MYS+CG ++ A  VF  M++RNV++WTA+ +GLA++G GKEA+E  + M   G+
Subjt:  DTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHGKGKEALELLDAMGALGV

Query:  KPNAVTFTSLLSACCHGGLIEEGLHLFHVMER-KFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCMVHGDVEMGERVGKLLV
         P   T T LLSAC H GL+ EG+  F  M   +F + P + HYGC+VDLLGR+  L +AY LI  M +KPD  +WR+LL +C VHGDVE+GERV   L+
Subjt:  KPNAVTFTSLLSACCHGGLIEEGLHLFHVMER-KFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCMVHGDVEMGERVGKLLV

Query:  ERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG
        E +  E         + D+V L N Y++  +WE V  LR  MK K I  K GCS+++  G
Subjt:  ERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein4.2e-7635.78Show/hide
Query:  LNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSS--PEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQH------SISIFATWVSTPHFEFD
        +NN R +   L QIHA  I +G    +  A  I  FC  S      + YA  IF +Q    N F +NT IR            +I++F   +S    E +
Subjt:  LNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSS--PEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQH------SISIFATWVSTPHFEFD

Query:  DFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYA----------------INKDVGI---------------------------
         FTF  VL ACA+T     +  GKQIH   LK G   + +V + ++  Y                 I KD+ +                           
Subjt:  DFTFIFVLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYA----------------INKDVGI---------------------------

Query:  --ARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGLELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGT
          AR +FD+M  R+ V+WN MI+GY          + +D +E+FREM      +++P   T+V +L A S+LG LE G  +H Y E +      D  +G+
Subjt:  --ARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGLELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGT

Query:  GLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH
         L++MYSKCG +  A  VF+++ + NV+TW+AM  G A+HG+  +A++    M   GV+P+ V + +LL+AC HGGL+EEG   F  M    G+ P+++H
Subjt:  GLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH

Query:  YGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMK
        YGC+VDLLGRSG L EA E IL MP+KPD V+W++LL +C + G+VEMG+RV  +L++    +S       G+  +VALSN+YAS   W +V  +R  MK
Subjt:  YGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMK

Query:  IKGIENKAGCSSVQTTG
         K I    GCS +   G
Subjt:  IKGIENKAGCSSVQTTG

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-7633.6Show/hide
Query:  NSSHQLKQIHAQLITNGFKSPSPYAKLIAHFC-KKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIR---CA-PPQHSISIFATWV--STPHFEFDDFTFIF
        +   +LKQIHA+++  G    S        FC   +S + + YAQ++F      P+ FL+N  IR   C+  P+ S+ ++   +  S PH   + +TF  
Subjt:  NSSHQLKQIHAQLITNGFKSPSPYAKLIAHFC-KKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIR---CA-PPQHSISIFATWV--STPHFEFDDFTFIF

Query:  VLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSD---------RVAQKYA-----
        +L AC+   + S      QIH  I K G  ++++   ++I+ YA+  +  +A  +FD +   + V+WN++I GY              ++A+K A     
Subjt:  VLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSD---------RVAQKYA-----

Query:  -----------RDGLELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRN
                   ++ L+LF EM    NS+V+P + ++   LSA +QLG LE G  +H+Y+ KT      D  +G  L++MY+KCG +  A  VFK +K+++
Subjt:  -----------RDGLELFREMVESTNSEVKPTDTTMVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRN

Query:  VLTWTAMATGLAVHGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPV
        V  WTA+ +G A HG G+EA+     M  +G+KPN +TFT++L+AC + GL+EEG  +F+ MER + + P ++HYGCIVDLLGR+G L EA   I  MP+
Subjt:  VLTWTAMATGLAVHGKGKEALELLDAMGALGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPV

Query:  KPDGVLWRSLLSSCMVHGDVEMGERVGKLLV---ERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG
        KP+ V+W +LL +C +H ++E+GE +G++L+      GG             +V  +N++A  ++W+     R  MK +G+    GCS++   G
Subjt:  KPDGVLWRSLLSSCMVHGDVEMGERVGKLLV---ERQGGESFDDEWCVGSVDFVALSNVYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCTAGCCCCAAGACTGAGGTGCATCCACCACCTAAATAACACACGAAATTCTTCCCATCAACTCAAACAAATTCACGCCCAATTGATAACCAATGGCTTCAAATC
TCCCTCCCCTTACGCCAAACTAATCGCCCATTTCTGTAAGAAATCTTCCCCAGAAGCCATTGCCTACGCCCAATTGATCTTCCAGCACCAACAACACCCTCCAAATCTCT
TCCTCTTCAACACTTCCATAAGATGCGCTCCACCTCAACATTCCATCTCCATTTTCGCCACTTGGGTCTCCACCCCCCACTTCGAATTCGACGACTTCACTTTCATTTTC
GTGCTCGGAGCTTGCGCGCGAACCCCATCGGCGTCTACGTTAATGATCGGTAAGCAAATTCATACTCATATTCTCAAACGTGGGATTGTTTCGAACATTTGGGTGCAGAC
TACGATGATACATTTTTATGCGATTAATAAAGATGTGGGTATTGCGCGGAAGGTGTTTGATGAAATGTCTGTGAGAAATAGTGTTACCTGGAATGCGATGATTGCAGGGT
ACTGCTCACAAAGTGATAGGGTTGCTCAGAAATATGCCCGAGATGGGTTGGAATTGTTTCGGGAGATGGTTGAATCAACGAATTCTGAGGTGAAACCAACGGATACTACA
ATGGTTTGCCTTCTTTCAGCTGCATCTCAACTGGGTGTGCTTGAAACCGGCGCTTGTGTTCATGCATATATCGAGAAGACAATTGATTCTCCTGAAAATGATCTGTTTAT
TGGCACTGGTTTGGTTAATATGTACTCAAAATGTGGGTGTCTTAACAGTGCTTCATCAGTTTTTAAGCAGATGAAGCAGAGGAACGTTTTGACGTGGACAGCCATGGCGA
CAGGGCTGGCCGTTCATGGAAAGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAGCTCTTGGTGTAAAGCCAAACGCAGTAACATTCACAAGTCTGCTTTCTGCT
TGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCATGTCATGGAGAGGAAGTTTGGGGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCT
TGGGCGCTCTGGGCACTTGAGAGAGGCATATGAGTTGATACTTGGAATGCCAGTGAAACCCGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTATGGTACATGGTG
ATGTTGAAATGGGAGAGAGGGTGGGTAAGTTGCTTGTGGAGAGACAGGGAGGGGAGAGTTTTGATGATGAGTGGTGTGTTGGAAGTGTGGACTTTGTAGCTTTGTCAAAT
GTTTATGCTTCTGCTCAAAGGTGGGAGGATGTGGAGGTTTTAAGAGAGGAGATGAAGATAAAAGGGATTGAAAACAAAGCTGGGTGTAGTTCAGTTCAAACTACGGGTTA
TCAAGGCTTGGAGGCTTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGCACCTAGCCCCAAGACTGAGGTGCATCCACCACCTAAATAACACACGAAATTCTTCCCATCAACTCAAACAAATTCACGCCCAATTGATAACCAATGGCTTCAAATC
TCCCTCCCCTTACGCCAAACTAATCGCCCATTTCTGTAAGAAATCTTCCCCAGAAGCCATTGCCTACGCCCAATTGATCTTCCAGCACCAACAACACCCTCCAAATCTCT
TCCTCTTCAACACTTCCATAAGATGCGCTCCACCTCAACATTCCATCTCCATTTTCGCCACTTGGGTCTCCACCCCCCACTTCGAATTCGACGACTTCACTTTCATTTTC
GTGCTCGGAGCTTGCGCGCGAACCCCATCGGCGTCTACGTTAATGATCGGTAAGCAAATTCATACTCATATTCTCAAACGTGGGATTGTTTCGAACATTTGGGTGCAGAC
TACGATGATACATTTTTATGCGATTAATAAAGATGTGGGTATTGCGCGGAAGGTGTTTGATGAAATGTCTGTGAGAAATAGTGTTACCTGGAATGCGATGATTGCAGGGT
ACTGCTCACAAAGTGATAGGGTTGCTCAGAAATATGCCCGAGATGGGTTGGAATTGTTTCGGGAGATGGTTGAATCAACGAATTCTGAGGTGAAACCAACGGATACTACA
ATGGTTTGCCTTCTTTCAGCTGCATCTCAACTGGGTGTGCTTGAAACCGGCGCTTGTGTTCATGCATATATCGAGAAGACAATTGATTCTCCTGAAAATGATCTGTTTAT
TGGCACTGGTTTGGTTAATATGTACTCAAAATGTGGGTGTCTTAACAGTGCTTCATCAGTTTTTAAGCAGATGAAGCAGAGGAACGTTTTGACGTGGACAGCCATGGCGA
CAGGGCTGGCCGTTCATGGAAAGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAGCTCTTGGTGTAAAGCCAAACGCAGTAACATTCACAAGTCTGCTTTCTGCT
TGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCATGTCATGGAGAGGAAGTTTGGGGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCT
TGGGCGCTCTGGGCACTTGAGAGAGGCATATGAGTTGATACTTGGAATGCCAGTGAAACCCGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTATGGTACATGGTG
ATGTTGAAATGGGAGAGAGGGTGGGTAAGTTGCTTGTGGAGAGACAGGGAGGGGAGAGTTTTGATGATGAGTGGTGTGTTGGAAGTGTGGACTTTGTAGCTTTGTCAAAT
GTTTATGCTTCTGCTCAAAGGTGGGAGGATGTGGAGGTTTTAAGAGAGGAGATGAAGATAAAAGGGATTGAAAACAAAGCTGGGTGTAGTTCAGTTCAAACTACGGGTTA
TCAAGGCTTGGAGGCTTTATAG
Protein sequenceShow/hide protein sequence
MHLAPRLRCIHHLNNTRNSSHQLKQIHAQLITNGFKSPSPYAKLIAHFCKKSSPEAIAYAQLIFQHQQHPPNLFLFNTSIRCAPPQHSISIFATWVSTPHFEFDDFTFIF
VLGACARTPSASTLMIGKQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMSVRNSVTWNAMIAGYCSQSDRVAQKYARDGLELFREMVESTNSEVKPTDTT
MVCLLSAASQLGVLETGACVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAVHGKGKEALELLDAMGALGVKPNAVTFTSLLSA
CCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYELILGMPVKPDGVLWRSLLSSCMVHGDVEMGERVGKLLVERQGGESFDDEWCVGSVDFVALSN
VYASAQRWEDVEVLREEMKIKGIENKAGCSSVQTTGYQGLEAL