; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017046 (gene) of Snake gourd v1 genome

Gene IDTan0017046
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG07:70595816..70598570
RNA-Seq ExpressionTan0017046
SyntenyTan0017046
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606214.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.77Show/hide
Query:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ
        MA++LARERLLQSR+  GFSLKS L SA QS ASI ACSEKL FIKKFVNP FAEL  I+      RCV TSV+TT LCWGGSS+AVLLGKLEIAL+DHQ
Subjt:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ

Query:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV
        ID+AWELFNDF+RLYGFPKDNVLLMLISQLSYT DCNWLQKACNLVLQIWKEKPVVLQL+ALTKL LGLAR QMP+PAS++LRLML +KRLP+MELLQ+V
Subjt:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV

Query:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL
        IMHMVKTEVGTYLA+NILVQICDCF QQ ASRNDQAKSMKPDTVIFNL+L ACVGFRLSFKGQQLVELMS+TGVVADAQT+VLIARIYDMNGQRDDLKN 
Subjt:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL

Query:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN
        K+HIDQV PSL CHYC FYDSLLSL FKFD+FDS  NLVLEICRFG+S SIQK   D QKSSLVPIGS HLKDGLKIKIM ELLQRD V+N EVKPE IN
Subjt:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN

Query:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA
         KNGKLVASNKTLAKLIVEFKRLG+TS+LSKLLL+VQKGLASV+G NLCSDVVKACIYLGWLETAHDILDD+E AGSAMDS+VYFLLL+AYYK+EMLREA
Subjt:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA

Query:  DVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFG
        DVLQKQMAKAGL TA  EDMANRS+LHENE  THDTSL ES+VQEM+ETS  SPRVYKFNSSIYFFCKAKM+EDALQAYKRMQQTGIQPTA+TFA+LAFG
Subjt:  DVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFG

Query:  FSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYV
        FSSLQ YRDIT LWGDMKRNMQS+ LVLSRDLYEFLLLCFLQGGYFERVME+VGHMEEQKMFTDKGMYK+Q+LKLHKNLYRSLKPSEARTEAQKNRL++V
Subjt:  FSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYV

Query:  RAFKKWVGIY
        RAFKKWVGIY
Subjt:  RAFKKWVGIY

KAG7036161.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0086.06Show/hide
Query:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ
        MA++LARERLLQSR+  GFSLKS L SA QS ASI ACSEKL FIKKFVNP FAEL  I+     CRCV TSV+TT LCWGGSS+AVLLGKLEIAL+DHQ
Subjt:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ

Query:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV
        ID+AWELFNDF+RLYGFPKDNVLLMLISQLSYT DCNWLQKACNLVLQIWKEKPVVLQL+ALTKL LGLAR QMP+PAS++LRLML +KRLP+MELLQ+V
Subjt:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV

Query:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL
        IMHMVKTEVGTYLA+NILVQICDCF QQ ASRNDQAKSMKPDTVIFNL+L ACVGFRLSFKGQQLVELMS+TGVVADAQT+VLIARIYDMNGQRDDLKN 
Subjt:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL

Query:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN
        K+HIDQV  SL CHYC FYDSLLSL FKFD+FDS  NLVLEICRFG+S SIQK   D QKSSLVPIGS HLKDGLKIKIM ELLQRD V+N EVKPE IN
Subjt:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN

Query:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA
         KNGKLVASNKTLAKLIVEFKRLG+TS+LSKLLL+VQKGLASV+G NLCSDVVKACIYLGWLETAHDILDD+E AGSAMDS+VYFLLL+AYYK+EMLREA
Subjt:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA

Query:  DVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFG
        DVLQKQMAKAGL TA AEDMANRS+LHENE  THDTSL ES+VQEM+ETS  SPRVYKFNSSIYFFCKAKM+EDALQAYKRMQQTGIQPTA+TFA+LAFG
Subjt:  DVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFG

Query:  FSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYV
        FSSLQ YRDIT LWGDMKRNMQS+ LVLSRDLYEFLLLCFLQGGYFERVME+VGHMEEQKMFTDKGMYK+Q+LKLHKNLYRSLKPSEARTEAQKNRLE+V
Subjt:  FSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYV

Query:  RAFKKWVGIY
        RAFKKWVGIY
Subjt:  RAFKKWVGIY

XP_022157932.1 pentatricopeptide repeat-containing protein At4g17616 [Momordica charantia]0.0e+0080.5Show/hide
Query:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ
        MA++LARERL+QSRL  GFSLKS L SAL+        SE LIFI+ F +PR  EL Y+K Q  L RC+STSVHTTKL WGGSSY VLLGKLE ALKDHQ
Subjt:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ

Query:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV
         DEAWELF+DFRRLYGFPKDNVLLML+SQLSYTSDCN L+KACNLV QIWKEKP+VLQLD LTKLAL LAR QMP+ AS ILRLML  KRLPRMELLQLV
Subjt:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV

Query:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL
        I+H VKTEVGTYLA+NILVQICDCFLQQ A+RNDQAK MKPDT++FNL+  ACV F+LSFKGQQLVELMSQTGVVADA T+VLIA+IYDMNGQRDD+ N 
Subjt:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL

Query:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN
        KIHIDQV PSL CHYCQFYDSLLSL FKF+DF+S ANLVLE CRFGESP IQKH RD QKSSL+PIGSHHLK GLKIKIM ELLQ+D V+NVE KPEFIN
Subjt:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN

Query:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA
          NGKLV+S KTL+KL++EFKRLGKTS+LSKLLLQVQKGLAS EG NLCS VVK CIYLGWLE AHDILDDVE AGSA+DS+VYFLLLKAYY +EMLREA
Subjt:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA

Query:  DVLQKQMAKAGLSTATAEDM---------ANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTA
        DVLQKQMAK GLST T  DM         ANR+   +    TH+TSLVESLVQEMKETSA SPRVYK NSSIYFFCKAKMIEDALQAYKRMQQ  IQPT 
Subjt:  DVLQKQMAKAGLSTATAEDM---------ANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTA

Query:  QTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTE
        QTFANLAFGFSSLQMYRDITILWGDMKRN+ SR+ V+SR+LYEFLLLCFLQGGYFERVMEI GHMEEQKMFTDKGMYK++FLKLHKNLYRSLKPSEARTE
Subjt:  QTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTE

Query:  AQKNRLEYVRAFKKWVGI
        AQK RLEYVRAFKKWVGI
Subjt:  AQKNRLEYVRAFKKWVGI

XP_023534668.1 pentatricopeptide repeat-containing protein At4g17616 [Cucurbita pepo subsp. pepo]0.0e+0086.06Show/hide
Query:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ
        MA++LARERLLQSR+  GFSLKS L SALQS ASI ACSEKL FIKK VNP FAEL  I+     CRCV TSV+TT LCWGGSS+ VLLGKLEIAL+DHQ
Subjt:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ

Query:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV
        IDEAWELFNDFRRLYGFPKDN+LLMLISQLSYT DCNWLQKACNLVLQIWKEKPVVLQL+ALTKL LGLAR QMP+PAS++LRLML +KRLP+MELLQ++
Subjt:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV

Query:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL
        IMHMVKTEVGTYLA+NILVQICDCF QQAASRNDQAKSMKPDT IFNL+L ACV FRLSFKGQQLVELMS+TGVVA+AQT+VLIARIYDMNGQRDDLKN 
Subjt:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL

Query:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN
        K+HIDQV P L CHYC FYDSLLSL FKFD+FDS  NLVLEICRFG+S SIQK   D QKSSLVPIGS HLKDGLKIKIM ELLQRD V+NVEVKPEFIN
Subjt:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN

Query:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA
         KNGKLVASNKTLAKLIVEFKRLGKTS+LSKLLLQVQKGLASV+G NLCSDVVKACIYLGWLETAHDILDD+E AGSAM S+VYFLLL+AYYK+EMLREA
Subjt:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA

Query:  DVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFG
        DVLQKQMAKAGLSTA AEDMANRSLLHENE  THDT L ES+VQEM+ETS  S RVYKFNSSIYFFCKAKM+EDALQAYKRMQQTGIQPTA+TFA+LAFG
Subjt:  DVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFG

Query:  FSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYV
        FSSLQ YRDIT LWGDMKRNMQS+ LVLSRDLYEFLLLCFLQGGYFERVME+VGHMEEQKMFTDKGMYK+Q+LKLHKNLYRSLKPSEARTEAQKNRLE+V
Subjt:  FSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYV

Query:  RAFKKWVGIY
        RAFKKWVGIY
Subjt:  RAFKKWVGIY

XP_031738206.1 pentatricopeptide repeat-containing protein At4g17616 [Cucumis sativus]0.0e+0080.73Show/hide
Query:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ
        MA++LARERLL SRLS  F LKS+L SALQS+A I AC EKLIF+K F N R  ELWY K Q+P  RCVST VH TKLCWGGSSY VLLGKLEIALKDHQ
Subjt:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ

Query:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV
        IDEAWELF+DFR+LYGFP DN LLML+SQLSYTSDC  L KA NLVLQ WKEKPVVLQLD LTKL LGLAR QMP+PAS+ILRLML  +RLPRMELLQLV
Subjt:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV

Query:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL
        I+HMVK+EVGTYLA+NILVQICDCFLQQA SRNDQAKSMKPDT++FNL+L ACV F+LSFKGQQLVELMSQT VVADA T+VLIARIY+MN QRD+LKNL
Subjt:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL

Query:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN
        K HIDQV PSL CHYCQFYD+LLSL FK+DDFDS ANL+LEICRFGES SIQKH R+LQKSS +PIGS HLKDGLKIKIM ELLQRD V+NVEVKPEFIN
Subjt:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN

Query:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA
         KNGKLVASNKT+AK IVE +R+G+TS+LSKLLLQVQKGLASVEG NLCSDVVKACI LGWLETAHDILDDVEA GS +DS+VYFLLLKAYYK++MLREA
Subjt:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA

Query:  DVLQKQMAKAGLSTATAEDMA------NRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTF
        DVLQKQM K GLS +T EDMA      +R LL   E  TH TSLVESL+QEMKETS+ S RV KFNSSIYFFCKAKMIEDALQAYKRMQQ GIQPTAQTF
Subjt:  DVLQKQMAKAGLSTATAEDMA------NRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTF

Query:  ANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQK
        ANL FGFS LQMYR+ITILWGD+KR MQS  LVLSRDLYE LLLCF++GGYFERVMEIVG MEEQ M+TDK MYK +FL LHKNLYRSLKPSEA+TEAQK
Subjt:  ANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQK

Query:  NRLEYVRAFKKWVGIY
         RLE VRAFKKWVGIY
Subjt:  NRLEYVRAFKKWVGIY

TrEMBL top hitse value%identityAlignment
A0A0A0LKC3 Uncharacterized protein0.0e+0080.73Show/hide
Query:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ
        MA++LARERLL SRLS  F LKS+L SALQS+A I AC EKLIF+K F N R  ELWY K Q+P  RCVST VH TKLCWGGSSY VLLGKLEIALKDHQ
Subjt:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ

Query:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV
        IDEAWELF+DFR+LYGFP DN LLML+SQLSYTSDC  L KA NLVLQ WKEKPVVLQLD LTKL LGLAR QMP+PAS+ILRLML  +RLPRMELLQLV
Subjt:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV

Query:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL
        I+HMVK+EVGTYLA+NILVQICDCFLQQA SRNDQAKSMKPDT++FNL+L ACV F+LSFKGQQLVELMSQT VVADA T+VLIARIY+MN QRD+LKNL
Subjt:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL

Query:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN
        K HIDQV PSL CHYCQFYD+LLSL FK+DDFDS ANL+LEICRFGES SIQKH R+LQKSS +PIGS HLKDGLKIKIM ELLQRD V+NVEVKPEFIN
Subjt:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN

Query:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA
         KNGKLVASNKT+AK IVE +R+G+TS+LSKLLLQVQKGLASVEG NLCSDVVKACI LGWLETAHDILDDVEA GS +DS+VYFLLLKAYYK++MLREA
Subjt:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA

Query:  DVLQKQMAKAGLSTATAEDMA------NRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTF
        DVLQKQM K GLS +T EDMA      +R LL   E  TH TSLVESL+QEMKETS+ S RV KFNSSIYFFCKAKMIEDALQAYKRMQQ GIQPTAQTF
Subjt:  DVLQKQMAKAGLSTATAEDMA------NRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTF

Query:  ANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQK
        ANL FGFS LQMYR+ITILWGD+KR MQS  LVLSRDLYE LLLCF++GGYFERVMEIVG MEEQ M+TDK MYK +FL LHKNLYRSLKPSEA+TEAQK
Subjt:  ANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQK

Query:  NRLEYVRAFKKWVGIY
         RLE VRAFKKWVGIY
Subjt:  NRLEYVRAFKKWVGIY

A0A1S3BZV2 pentatricopeptide repeat-containing protein At4g176160.0e+0079.14Show/hide
Query:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASI---LACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALK
        MA++LARE LL SRLS  F L+S L SALQS+A I    ACS+KLI +  F N    ELW  K QIP  RCVSTSVH TKLCWGGSSY VLLGKLEIALK
Subjt:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASI---LACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALK

Query:  DHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELL
        DHQIDEAWELF+DFRRLYGFP D  LLML+SQLSYTSDC  L KA NLVLQ WKEKPVVLQLD LTKL LGLAR QMP+PAS+ILRLMLH +RLPRMELL
Subjt:  DHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELL

Query:  QLVIMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDL
        QLVI+HMVK+EVGTYLA+NILVQICDCFLQQAASR+DQAKSM+PDT++FNL+L ACV F+LS KGQQLVELMSQT VVADA T+VLIARIY+MNGQRD+L
Subjt:  QLVIMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDL

Query:  KNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPE
        KNLK HIDQV PSL CHY QFYD+LLSL FK+DDFDS ANL+LEICRFGES SIQKH R+LQKSS +PIGS HLKDGLKIK+M ELLQ+D V+NVEVKPE
Subjt:  KNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPE

Query:  FINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEML
        FIN KNGKLVASNKT+AK IVE +R+G+TS+LSKLLLQVQKGLASVEG NLCSDVVKACI LGWLETAHD+LDDVEA GS MDS+VYFLLLKAYYK++ML
Subjt:  FINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEML

Query:  READVLQKQMAKAGLSTATAEDM------ANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTA
        READVLQKQM K GLS +T +DM      ++R LL   E  TH TSLVESL+QEMKETS+ S  V KFNSSIYFFCKAKMIEDALQAYKRMQQ GIQPTA
Subjt:  READVLQKQMAKAGLSTATAEDM------ANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTA

Query:  QTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTE
        QTFANL FGFSSLQMY +ITILWGDMKR MQS  LVLSRDLYE LLLCFL+GGYFERVMEIVG MEEQ M+TDKGMYK +FL LHKNLYRSLKPSEA++E
Subjt:  QTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTE

Query:  AQKNRLEYVRAFKKWVGIY
        AQK RLE VRAFKKWVG+Y
Subjt:  AQKNRLEYVRAFKKWVGIY

A0A5D3BA19 Pentatricopeptide repeat-containing protein0.0e+0079.14Show/hide
Query:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASI---LACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALK
        MA++LARE LL SRLS  F L+S L SALQS+A I    ACS+KLI +  F N    ELW  K QIP  RCVSTSVH TKLCWGGSSY VLLGKLEIALK
Subjt:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASI---LACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALK

Query:  DHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELL
        DHQIDEAWELF+DFRRLYGFP D  LLML+SQLSYTSDC  L KA NLVLQ WKEKPVVLQLD LTKL LGLAR QMP+PAS+ILRLMLH +RLPRMELL
Subjt:  DHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELL

Query:  QLVIMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDL
        QLVI+HMVK+EVGTYLA+NILVQICDCFLQQAASR+DQAKSM+PDT++FNL+L ACV F+LS KGQQLVELMSQT VVADA T+VLIARIY+MNGQRD+L
Subjt:  QLVIMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDL

Query:  KNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPE
        KNLK HIDQV PSL CHY QFYD+LLSL FK+DDFDS ANL+LEICRFGES SIQKH R+LQKSS +PIGS HLKDGLKIK+M ELLQ+D V+NVEVKPE
Subjt:  KNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPE

Query:  FINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEML
        FIN KNGKLVASNKT+AK IVE +R+G+TS+LSKLLLQVQKGLASVEG NLCSDVVKACI LGWLETAHD+LDDVEA GS MDS+VYFLLLKAYYK++ML
Subjt:  FINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEML

Query:  READVLQKQMAKAGLSTATAEDM------ANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTA
        READVLQKQM K GLS +T +DM      ++R LL   E  TH TSLVESL+QEMKETS+ S  V KFNSSIYFFCKAKMIEDALQAYKRMQQ GIQPTA
Subjt:  READVLQKQMAKAGLSTATAEDM------ANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTA

Query:  QTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTE
        QTFANL FGFSSLQMY +ITILWGDMKR MQS  LVLSRDLYE LLLCFL+GGYFERVMEIVG MEEQ M+TDKGMYK +FL LHKNLYRSLKPSEA++E
Subjt:  QTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTE

Query:  AQKNRLEYVRAFKKWVGIY
        AQK RLE VRAFKKWVG+Y
Subjt:  AQKNRLEYVRAFKKWVGIY

A0A6J1DVU2 pentatricopeptide repeat-containing protein At4g176160.0e+0080.5Show/hide
Query:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ
        MA++LARERL+QSRL  GFSLKS L SAL+        SE LIFI+ F +PR  EL Y+K Q  L RC+STSVHTTKL WGGSSY VLLGKLE ALKDHQ
Subjt:  MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQ

Query:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV
         DEAWELF+DFRRLYGFPKDNVLLML+SQLSYTSDCN L+KACNLV QIWKEKP+VLQLD LTKLAL LAR QMP+ AS ILRLML  KRLPRMELLQLV
Subjt:  IDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLV

Query:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL
        I+H VKTEVGTYLA+NILVQICDCFLQQ A+RNDQAK MKPDT++FNL+  ACV F+LSFKGQQLVELMSQTGVVADA T+VLIA+IYDMNGQRDD+ N 
Subjt:  IMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNL

Query:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN
        KIHIDQV PSL CHYCQFYDSLLSL FKF+DF+S ANLVLE CRFGESP IQKH RD QKSSL+PIGSHHLK GLKIKIM ELLQ+D V+NVE KPEFIN
Subjt:  KIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFIN

Query:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA
          NGKLV+S KTL+KL++EFKRLGKTS+LSKLLLQVQKGLAS EG NLCS VVK CIYLGWLE AHDILDDVE AGSA+DS+VYFLLLKAYY +EMLREA
Subjt:  CKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREA

Query:  DVLQKQMAKAGLSTATAEDM---------ANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTA
        DVLQKQMAK GLST T  DM         ANR+   +    TH+TSLVESLVQEMKETSA SPRVYK NSSIYFFCKAKMIEDALQAYKRMQQ  IQPT 
Subjt:  DVLQKQMAKAGLSTATAEDM---------ANRSLLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTA

Query:  QTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTE
        QTFANLAFGFSSLQMYRDITILWGDMKRN+ SR+ V+SR+LYEFLLLCFLQGGYFERVMEI GHMEEQKMFTDKGMYK++FLKLHKNLYRSLKPSEARTE
Subjt:  QTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTE

Query:  AQKNRLEYVRAFKKWVGI
        AQK RLEYVRAFKKWVGI
Subjt:  AQKNRLEYVRAFKKWVGI

A0A6J1H461 pentatricopeptide repeat-containing protein At4g176166.5e-29187.97Show/hide
Query:  MLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDC
        MLISQLSYT DCNWLQKACNLVLQIWKEKPVVLQL+ALTKL LGLAR QMP+PAS++LRLML +KRLP+MELLQ+VIMHMVKTEVGTYLA+NILVQICDC
Subjt:  MLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDC

Query:  FLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLS
        F QQ ASRNDQAKSMKPDTVIFNL+L ACVGFRLSFKGQQLVELMS+TGVVADAQT+VLIARIYDMNGQRDDLKN K+HIDQV PSL CHYC FYDSLLS
Subjt:  FLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLS

Query:  LDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLG
        L FKFD+FDS  NLVLEICRFG+S SIQK   D QKSSLVPIGS HLKDGLKIKIM ELLQRD V+NVEVKPEFIN KNGKLVASNKTLAKLIVEFKRLG
Subjt:  LDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLG

Query:  KTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRS
        KTS+LSKLLLQVQKGLASV+G NLCSDVVKACIYLGWLETAHDILDD+E AGSAMDS+VYFLLL+AYYK+EMLREADVLQKQMAKAGLSTA AEDMAN+S
Subjt:  KTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRS

Query:  LLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSR
        +LHENE  THDTSL ES+VQEM+ETS  SPRVYKFNSSIYFFCKAKM+EDAL AYKRMQQTGIQPTA+TFA+LAFGFSSLQ YRDIT LWGDMKRNMQS+
Subjt:  LLHENESITHDTSLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSR

Query:  SLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKW
         LVLSRDLYEFLLLCFL+GGYFERVME+VGHMEEQKMFTDKGMYK+Q+LKLHKNLYRSLKPSEARTEAQKNRLE+VRAFK+W
Subjt:  SLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKW

SwissProt top hitse value%identityAlignment
B3H672 Pentatricopeptide repeat-containing protein At4g176163.7e-16649.07Show/hide
Query:  TSVHTTKLCWGGSSYAVLLGKLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLA
        TSV   +L W  SS  +L  KLE ALKDH++D+AW++F DF+RLYGFP+  ++   ++ LSY+SD  WL KA +L     K+ P +L  D LTKL+L LA
Subjt:  TSVHTTKLCWGGSSYAVLLGKLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLA

Query:  RFQMPVPASDILRLMLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCFLQ-QAASRNDQ-AKSMKPDTVIFNLILQACVGFRLSFKGQQLVEL
        R QM   A  ILR+ML    +   ++L+LV+MHMVKTE+GT LA+N LVQ+CD F++     RN      +KPDTV+FNL+L +CV F  S KGQ+L+EL
Subjt:  RFQMPVPASDILRLMLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCFLQ-QAASRNDQ-AKSMKPDTVIFNLILQACVGFRLSFKGQQLVEL

Query:  MSQTGVVADAQTVVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGS
        M++  VVADA ++V+++ IY+MNG RD+L+  K HI QV P L  HY  F+D+LLSL+FKFDD  S   L L++C+     S++    D +K  ++P+GS
Subjt:  MSQTGVVADAQTVVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGS

Query:  HHLKDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDI
        HH++ GLKI I  +LLQRD  + V+ +  F+N  N KL  +NKTLAKL+  +KR     +LSKLL        S+ G  LC+DV+ AC+ +GWLE AHDI
Subjt:  HHLKDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDI

Query:  LDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEM---KETSATSPRVYKFNSSIYF
        LDD+ +AG  M+ + Y ++L  YYK +MLR A+VL KQM KAGL T  + ++       E +S   +T L + LVQE+   K+  A S  +Y+ NSS+Y+
Subjt:  LDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEM---KETSATSPRVYKFNSSIYF

Query:  FCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDK
        FCKAKM  DAL  Y+++ +  I PT Q+F  L   +SSL MYR+ITI+WGD+KRN+ S++L  ++DL E L++ FL+GGYFERVME++ +M+E  M+ D 
Subjt:  FCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDK

Query:  GMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI
         MYKN++LKLHKNLYR+LK S+A TEAQ  RLE+V+ F+K VGI
Subjt:  GMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI

P0C7R4 Pentatricopeptide repeat-containing protein At1g692908.2e-3324.09Show/hide
Query:  LEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYT-----SDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLML
        L  +L  H  DEAW+ F         P+  ++  LI+ LS       S  + L++A      + ++ P++L+ + +  L   +   +   PA  +++ M 
Subjt:  LEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYT-----SDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLML

Query:  HMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSF-KGQQLVELMSQTGVVADAQTVVLIA
          +     +L   +++ + +          +  + C        S +++ + MKPD V  N  L+AC     S    + ++E M+  GV  D  +   +A
Subjt:  HMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSF-KGQQLVELMSQTGVVADAQTVVLIA

Query:  RIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQ
         +Y   G R+ +  L+  +D      A      Y +++S   K  D DSV++++L                            H LK+G       E   
Subjt:  RIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQ

Query:  RDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEA-AGSAMDSSVY
               E+   FI  K      S K+LAK+I+E ++L  +            G+ S  G      ++ AC+ LG+ + AH IL+++ A  G ++   VY
Subjt:  RDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEA-AGSAMDSSVY

Query:  FLLLKAYYKEEMLREADVLQKQMAKAGL------------STATAEDM-----------ANRSLLHENESITHDTSLVES----LVQEMKETSATSPRV-
          +LKAY KE    EA  L  +++ +GL            ++ T +D             NR +  +   +T  T L+E+    L+    +     PRV 
Subjt:  FLLLKAYYKEEMLREADVLQKQMAKAGL------------STATAEDM-----------ANRSLLHENESITHDTSLVES----LVQEMKETSATSPRV-

Query:  ---YKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQS----RSLVLSRDLYEFLLLCFLQGGYFERV
           + +NS I+ FCK+  +EDA + ++RM     +P  QT+ +L  G+ S + Y ++ +LW ++K  + S    +   L   L +  L   ++GG+F+  
Subjt:  ---YKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQS----RSLVLSRDLYEFLLLCFLQGGYFERV

Query:  MEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI
        M++V   +E K+F DK  YK  F++ HK     L+  + R    K ++E + AFK W G+
Subjt:  MEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI

Q9ASZ8 Pentatricopeptide repeat-containing protein At1g126203.1e-0823.96Show/hide
Query:  VVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETSA
        +++   Y G  +    +L D+       D   +  L+  + KE  LREA+ L K+M + G+S  T    +      +   +     +++ +V     +  
Subjt:  VVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETSA

Query:  TSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVME
          P +  FN  I  +CKA +I+D L+ +++M   G+     T+  L  GF  L        L+ +M        +V     Y+ LL      G  E+ +E
Subjt:  TSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVME

Query:  IVGHMEEQKMFTDKGMY
        I   +E+ KM  D G+Y
Subjt:  IVGHMEEQKMFTDKGMY

Q9SA60 Pentatricopeptide repeat-containing protein At1g03100, mitochondrial5.6e-8230.06Show/hide
Query:  AVLLGKLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKE-KPVVLQLDALTKLALGLARFQMPVPASDILRL
        A L  +++IA+ +H+ DEAW LF    ++ GFP+ +V+  ++   + + D NWLQK  +LV Q ++E K  +L+ + L  L+L LA+  M VPAS ILR 
Subjt:  AVLLGKLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKE-KPVVLQLDALTKLALGLARFQMPVPASDILRL

Query:  MLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCF----LQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQT
        ++  +  P +     V+ HM     G+YL+  ++++I   F    +      N    +MKP+T + N+ L  C+ F  + K +QL++++ + GV ADA  
Subjt:  MLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCF----LQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQT

Query:  VVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICR-----------------------------FGESPSI
        +V++A IY+ NG+R++L+ L+ HID+        + QFY+ LL    KF D +S + +VLE+ R                              G+   +
Subjt:  VVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICR-----------------------------FGESPSI

Query:  QKH----SRDLQKSSLVPIGSHHL-KDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGC
        ++H    +R +   S++P       +  LK++  A+ +    +  + V+ E I  + G L  + +   KL   F   GK  +L+K LL+ +   + V   
Subjt:  QKH----SRDLQKSSLVPIGSHHL-KDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGC

Query:  N-LCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDT---------
        N +  +V+ ACI LG L+ AHD+LD++  AG    SSVY  LLKAY      RE   L +   KAG+      D +    L +++ I +DT         
Subjt:  N-LCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDT---------

Query:  ---------------------------SLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSL-QMYR
                                    L+  L++E++E  +    V+ +N+ I+FF K  +++DA +A KRM+  G  P AQTF ++  G++++   Y 
Subjt:  ---------------------------SLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSL-QMYR

Query:  DITILWGDMKR-NMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWV
        ++T LWG+MK     + S+   ++L + +L  F++GG+F R  E+V  ME++ MF DK  Y+  FLK HK  Y+   P + ++E+Q  + E    FKKW+
Subjt:  DITILWGDMKR-NMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWV

Query:  GI
        G+
Subjt:  GI

Q9SF38 Pentatricopeptide repeat-containing protein At3g09650, chloroplastic1.5e-1820.12Show/hide
Query:  KLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPV-VLQLDALTKLALGLARFQMPVPASDILRLMLHMK
        +L   L++ + DEAW  +     L   P    L  L+SQLSY S    L +A +++ ++  E+ +  L  ++L  LA+  A+    + A  +++ M+   
Subjt:  KLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPV-VLQLDALTKLALGLARFQMPVPASDILRLMLHMK

Query:  RLPRMELLQLVIMHM-VKTEVGTYLATNILVQICDCFLQQAASRNDQA--KSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIAR
         LP ++     +  +    + G   +  + + I     ++     DQ+     +PDT  FN +L AC     + K  +L E MS+     D  T  ++ +
Subjt:  RLPRMELLQLVIMHM-VKTEVGTYLATNILVQICDCFLQQAASRNDQA--KSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIAR

Query:  IYDMNGQRDDL---------KNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANL--VLEICRFGESPSIQKHSRDLQKSSLV--PIGSHHLKD
        +    G+++ +         K +K+ +   M SL   Y  F D   +        +   +L  VL  C   +    ++   +  + +        +  +D
Subjt:  IYDMNGQRDDL---------KNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANL--VLEICRFGESPSIQKHSRDLQKSSLV--PIGSHHLKD

Query:  GLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVA-SNKTLAKLIVEFKRLGKTSDLSKLL--LQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILD
         +  + + ++ ++  ++   V P        K+ A  ++    L+  + + G+ +D +++L  ++ Q    S       + VV A +  G ++ A  +L 
Subjt:  GLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVA-SNKTLAKLIVEFKRLGKTSDLSKLL--LQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILD

Query:  DVEAAGSAMDSSVYFLLLKAYYKE-EMLREADVLQKQMAKAGL----------------------STATAEDMANRSL---------LHENESITHDTSL
        ++   G   +   Y +LLK Y K+ ++ R  D+L++    AG+                      + A   +M  R +         L +  +++    L
Subjt:  DVEAAGSAMDSSVYFLLLKAYYKE-EMLREADVLQKQMAKAGL----------------------STATAEDMANRSL---------LHENESITHDTSL

Query:  VESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRD-------
           +  EM         +  +N  +  +C+  +IEDA +   RM++ G  P   T+ +LA G S  +   D  +LW ++K     +      D       
Subjt:  VESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRD-------

Query:  --------LYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI
                L + L    ++  +F++ +EI+  MEE  +  +K  YK  ++++H  ++ S   S+AR + +  R     AFK W+G+
Subjt:  --------LYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI

Arabidopsis top hitse value%identityAlignment
AT1G03100.1 Pentatricopeptide repeat (PPR) superfamily protein4.0e-8330.06Show/hide
Query:  AVLLGKLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKE-KPVVLQLDALTKLALGLARFQMPVPASDILRL
        A L  +++IA+ +H+ DEAW LF    ++ GFP+ +V+  ++   + + D NWLQK  +LV Q ++E K  +L+ + L  L+L LA+  M VPAS ILR 
Subjt:  AVLLGKLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKE-KPVVLQLDALTKLALGLARFQMPVPASDILRL

Query:  MLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCF----LQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQT
        ++  +  P +     V+ HM     G+YL+  ++++I   F    +      N    +MKP+T + N+ L  C+ F  + K +QL++++ + GV ADA  
Subjt:  MLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCF----LQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQT

Query:  VVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICR-----------------------------FGESPSI
        +V++A IY+ NG+R++L+ L+ HID+        + QFY+ LL    KF D +S + +VLE+ R                              G+   +
Subjt:  VVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICR-----------------------------FGESPSI

Query:  QKH----SRDLQKSSLVPIGSHHL-KDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGC
        ++H    +R +   S++P       +  LK++  A+ +    +  + V+ E I  + G L  + +   KL   F   GK  +L+K LL+ +   + V   
Subjt:  QKH----SRDLQKSSLVPIGSHHL-KDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGC

Query:  N-LCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDT---------
        N +  +V+ ACI LG L+ AHD+LD++  AG    SSVY  LLKAY      RE   L +   KAG+      D +    L +++ I +DT         
Subjt:  N-LCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDT---------

Query:  ---------------------------SLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSL-QMYR
                                    L+  L++E++E  +    V+ +N+ I+FF K  +++DA +A KRM+  G  P AQTF ++  G++++   Y 
Subjt:  ---------------------------SLVESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSL-QMYR

Query:  DITILWGDMKR-NMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWV
        ++T LWG+MK     + S+   ++L + +L  F++GG+F R  E+V  ME++ MF DK  Y+  FLK HK  Y+   P + ++E+Q  + E    FKKW+
Subjt:  DITILWGDMKR-NMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWV

Query:  GI
        G+
Subjt:  GI

AT1G12620.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-0923.96Show/hide
Query:  VVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETSA
        +++   Y G  +    +L D+       D   +  L+  + KE  LREA+ L K+M + G+S  T    +      +   +     +++ +V     +  
Subjt:  VVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETSA

Query:  TSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVME
          P +  FN  I  +CKA +I+D L+ +++M   G+     T+  L  GF  L        L+ +M        +V     Y+ LL      G  E+ +E
Subjt:  TSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVME

Query:  IVGHMEEQKMFTDKGMY
        I   +E+ KM  D G+Y
Subjt:  IVGHMEEQKMFTDKGMY

AT1G69290.1 Pentatricopeptide repeat (PPR) superfamily protein5.8e-3424.09Show/hide
Query:  LEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYT-----SDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLML
        L  +L  H  DEAW+ F         P+  ++  LI+ LS       S  + L++A      + ++ P++L+ + +  L   +   +   PA  +++ M 
Subjt:  LEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYT-----SDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLML

Query:  HMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSF-KGQQLVELMSQTGVVADAQTVVLIA
          +     +L   +++ + +          +  + C        S +++ + MKPD V  N  L+AC     S    + ++E M+  GV  D  +   +A
Subjt:  HMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSF-KGQQLVELMSQTGVVADAQTVVLIA

Query:  RIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQ
         +Y   G R+ +  L+  +D      A      Y +++S   K  D DSV++++L                            H LK+G       E   
Subjt:  RIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQ

Query:  RDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEA-AGSAMDSSVY
               E+   FI  K      S K+LAK+I+E ++L  +            G+ S  G      ++ AC+ LG+ + AH IL+++ A  G ++   VY
Subjt:  RDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILDDVEA-AGSAMDSSVY

Query:  FLLLKAYYKEEMLREADVLQKQMAKAGL------------STATAEDM-----------ANRSLLHENESITHDTSLVES----LVQEMKETSATSPRV-
          +LKAY KE    EA  L  +++ +GL            ++ T +D             NR +  +   +T  T L+E+    L+    +     PRV 
Subjt:  FLLLKAYYKEEMLREADVLQKQMAKAGL------------STATAEDM-----------ANRSLLHENESITHDTSLVES----LVQEMKETSATSPRV-

Query:  ---YKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQS----RSLVLSRDLYEFLLLCFLQGGYFERV
           + +NS I+ FCK+  +EDA + ++RM     +P  QT+ +L  G+ S + Y ++ +LW ++K  + S    +   L   L +  L   ++GG+F+  
Subjt:  ---YKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQS----RSLVLSRDLYEFLLLCFLQGGYFERV

Query:  MEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI
        M++V   +E K+F DK  YK  F++ HK     L+  + R    K ++E + AFK W G+
Subjt:  MEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI

AT3G09650.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-1920.12Show/hide
Query:  KLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPV-VLQLDALTKLALGLARFQMPVPASDILRLMLHMK
        +L   L++ + DEAW  +     L   P    L  L+SQLSY S    L +A +++ ++  E+ +  L  ++L  LA+  A+    + A  +++ M+   
Subjt:  KLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPV-VLQLDALTKLALGLARFQMPVPASDILRLMLHMK

Query:  RLPRMELLQLVIMHM-VKTEVGTYLATNILVQICDCFLQQAASRNDQA--KSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIAR
         LP ++     +  +    + G   +  + + I     ++     DQ+     +PDT  FN +L AC     + K  +L E MS+     D  T  ++ +
Subjt:  RLPRMELLQLVIMHM-VKTEVGTYLATNILVQICDCFLQQAASRNDQA--KSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIAR

Query:  IYDMNGQRDDL---------KNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANL--VLEICRFGESPSIQKHSRDLQKSSLV--PIGSHHLKD
        +    G+++ +         K +K+ +   M SL   Y  F D   +        +   +L  VL  C   +    ++   +  + +        +  +D
Subjt:  IYDMNGQRDDL---------KNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANL--VLEICRFGESPSIQKHSRDLQKSSLV--PIGSHHLKD

Query:  GLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVA-SNKTLAKLIVEFKRLGKTSDLSKLL--LQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILD
         +  + + ++ ++  ++   V P        K+ A  ++    L+  + + G+ +D +++L  ++ Q    S       + VV A +  G ++ A  +L 
Subjt:  GLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVA-SNKTLAKLIVEFKRLGKTSDLSKLL--LQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDILD

Query:  DVEAAGSAMDSSVYFLLLKAYYKE-EMLREADVLQKQMAKAGL----------------------STATAEDMANRSL---------LHENESITHDTSL
        ++   G   +   Y +LLK Y K+ ++ R  D+L++    AG+                      + A   +M  R +         L +  +++    L
Subjt:  DVEAAGSAMDSSVYFLLLKAYYKE-EMLREADVLQKQMAKAGL----------------------STATAEDMANRSL---------LHENESITHDTSL

Query:  VESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRD-------
           +  EM         +  +N  +  +C+  +IEDA +   RM++ G  P   T+ +LA G S  +   D  +LW ++K     +      D       
Subjt:  VESLVQEMKETSATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRD-------

Query:  --------LYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI
                L + L    ++  +F++ +EI+  MEE  +  +K  YK  ++++H  ++ S   S+AR + +  R     AFK W+G+
Subjt:  --------LYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI

AT4G17616.1 Pentatricopeptide repeat (PPR) superfamily protein2.6e-16749.07Show/hide
Query:  TSVHTTKLCWGGSSYAVLLGKLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLA
        TSV   +L W  SS  +L  KLE ALKDH++D+AW++F DF+RLYGFP+  ++   ++ LSY+SD  WL KA +L     K+ P +L  D LTKL+L LA
Subjt:  TSVHTTKLCWGGSSYAVLLGKLEIALKDHQIDEAWELFNDFRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLA

Query:  RFQMPVPASDILRLMLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCFLQ-QAASRNDQ-AKSMKPDTVIFNLILQACVGFRLSFKGQQLVEL
        R QM   A  ILR+ML    +   ++L+LV+MHMVKTE+GT LA+N LVQ+CD F++     RN      +KPDTV+FNL+L +CV F  S KGQ+L+EL
Subjt:  RFQMPVPASDILRLMLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQICDCFLQ-QAASRNDQ-AKSMKPDTVIFNLILQACVGFRLSFKGQQLVEL

Query:  MSQTGVVADAQTVVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGS
        M++  VVADA ++V+++ IY+MNG RD+L+  K HI QV P L  HY  F+D+LLSL+FKFDD  S   L L++C+     S++    D +K  ++P+GS
Subjt:  MSQTGVVADAQTVVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFDDFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGS

Query:  HHLKDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDI
        HH++ GLKI I  +LLQRD  + V+ +  F+N  N KL  +NKTLAKL+  +KR     +LSKLL        S+ G  LC+DV+ AC+ +GWLE AHDI
Subjt:  HHLKDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGLASVEGCNLCSDVVKACIYLGWLETAHDI

Query:  LDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEM---KETSATSPRVYKFNSSIYF
        LDD+ +AG  M+ + Y ++L  YYK +MLR A+VL KQM KAGL T  + ++       E +S   +T L + LVQE+   K+  A S  +Y+ NSS+Y+
Subjt:  LDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEM---KETSATSPRVYKFNSSIYF

Query:  FCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDK
        FCKAKM  DAL  Y+++ +  I PT Q+F  L   +SSL MYR+ITI+WGD+KRN+ S++L  ++DL E L++ FL+GGYFERVME++ +M+E  M+ D 
Subjt:  FCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQKMFTDK

Query:  GMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI
         MYKN++LKLHKNLYR+LK S+A TEAQ  RLE+V+ F+K VGI
Subjt:  GMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTGGTCTTAGCAAGAGAAAGACTACTGCAATCCCGTTTATCAATGGGCTTTTCTTTGAAATCTAAATTAGATTCAGCTCTGCAGAGCTATGCTTCGATATTAGC
TTGTAGTGAGAAGCTGATCTTCATAAAGAAATTTGTAAACCCTAGATTTGCAGAGCTATGGTATATAAAACCTCAGATTCCGCTTTGTCGTTGTGTTTCTACTTCTGTAC
ACACTACAAAATTATGTTGGGGAGGTTCCTCTTATGCAGTGCTTTTGGGAAAGCTAGAAATTGCTTTGAAAGATCATCAAATTGATGAAGCATGGGAGTTGTTTAATGAT
TTCAGAAGGCTTTATGGTTTTCCAAAGGATAATGTTTTGCTCATGTTGATTTCTCAATTGTCCTATACTTCTGATTGCAATTGGCTACAAAAGGCATGTAACTTGGTTCT
TCAAATTTGGAAAGAGAAACCAGTTGTATTGCAACTTGATGCCTTAACTAAACTTGCCCTCGGATTGGCAAGATTCCAAATGCCGGTTCCTGCTTCGGATATTCTTAGAT
TGATGCTGCATATGAAGAGATTACCGCGAATGGAACTGTTGCAGCTGGTTATTATGCACATGGTGAAGACAGAGGTTGGAACATATCTTGCTACTAATATATTGGTCCAG
ATTTGTGATTGTTTTTTACAACAGGCTGCAAGTAGAAATGACCAAGCGAAGTCGATGAAACCAGATACTGTGATCTTTAACCTGATACTCCAAGCCTGTGTCGGGTTTAG
ATTATCTTTTAAGGGTCAGCAGCTTGTGGAATTGATGTCTCAAACTGGAGTTGTTGCTGATGCACAGACAGTTGTTCTAATTGCTCGGATTTATGACATGAATGGTCAAA
GAGATGATCTAAAGAATCTCAAAATCCACATTGACCAAGTTATGCCTTCATTGGCTTGTCATTATTGTCAGTTCTATGATAGCTTGTTGAGCTTGGACTTTAAGTTTGAT
GATTTTGATTCCGTTGCTAACCTTGTGCTGGAAATATGTCGATTTGGTGAGTCTCCTAGCATTCAAAAACATTCGAGGGACTTGCAAAAGTCTAGCCTTGTTCCAATTGG
ATCACACCATCTAAAGGATGGATTAAAGATAAAGATTATGGCAGAACTACTGCAAAGAGATTATGTTGTCAATGTGGAAGTCAAACCAGAGTTTATAAATTGTAAGAATG
GGAAACTTGTTGCCAGTAACAAGACCCTTGCTAAACTCATTGTTGAATTCAAGAGACTTGGAAAAACTTCTGATCTCTCGAAACTTTTACTTCAGGTTCAAAAAGGGTTG
GCCTCAGTTGAAGGTTGTAATTTATGTTCTGATGTAGTTAAAGCTTGCATTTATTTAGGTTGGCTCGAAACTGCTCATGATATTTTGGACGATGTTGAAGCAGCTGGTTC
TGCAATGGACTCCTCTGTATATTTCTTGCTCTTGAAAGCATATTACAAAGAGGAAATGCTCAGAGAAGCAGATGTACTGCAAAAACAAATGGCAAAGGCTGGCCTGTCCA
CTGCTACCGCAGAAGACATGGCTAACCGAAGTTTGTTGCACGAAAATGAATCAATTACCCATGACACATCTCTGGTTGAATCTCTAGTTCAAGAAATGAAAGAGACCAGT
GCGACATCTCCTAGAGTTTACAAGTTCAATTCTTCCATTTACTTTTTCTGCAAGGCCAAAATGATTGAGGATGCCTTACAGGCATACAAAAGAATGCAGCAGACAGGCAT
TCAACCCACTGCGCAAACTTTCGCCAATCTAGCTTTCGGGTTTTCTTCGTTGCAAATGTATCGTGACATCACGATCCTATGGGGAGACATGAAGAGGAATATGCAGAGCA
GGAGTTTGGTGCTGAGCAGAGATCTCTATGAGTTCTTGTTGCTGTGCTTTCTTCAAGGTGGTTACTTTGAGAGAGTGATGGAAATTGTTGGACATATGGAGGAGCAGAAG
ATGTTCACTGACAAGGGAATGTACAAAAATCAGTTTCTAAAGCTTCACAAGAATCTTTATAGGAGTTTAAAGCCATCAGAAGCCAGAACTGAGGCACAGAAGAATAGATT
AGAGTATGTTAGAGCATTCAAAAAATGGGTTGGTATTTACTGA
mRNA sequenceShow/hide mRNA sequence
CTGCAATTCGATTGTTCTTGAAAATTTTCGGGTTAATTTGTTGGTTCGTGTGCTTCTTCACCGATTCAGAATTATTTATCCTCGTCTTCGATGTTATGAATCATCCTTCT
CTTCTTCTTTGCCATTTCTTGTTTCCATATGAAATTTCATGGTTCCGACCGCACCCTGTATCTTCGTTTTCAAGCCAGGTTAATTTTCAATCTTAGTAAGCAATCTCTCT
TTCGGTTTATTAGTTACAATCCAATAAACTTGTTTTACCGAAGATATTTGCAGCAGTCATCTTCAATTATTTTGTAAATTTCAAGCATCATATCTTATATGTGGATAATT
TGTAGTGTAATCTCTTAATTTTTATGGCCGTGGTCTTAGCAAGAGAAAGACTACTGCAATCCCGTTTATCAATGGGCTTTTCTTTGAAATCTAAATTAGATTCAGCTCTG
CAGAGCTATGCTTCGATATTAGCTTGTAGTGAGAAGCTGATCTTCATAAAGAAATTTGTAAACCCTAGATTTGCAGAGCTATGGTATATAAAACCTCAGATTCCGCTTTG
TCGTTGTGTTTCTACTTCTGTACACACTACAAAATTATGTTGGGGAGGTTCCTCTTATGCAGTGCTTTTGGGAAAGCTAGAAATTGCTTTGAAAGATCATCAAATTGATG
AAGCATGGGAGTTGTTTAATGATTTCAGAAGGCTTTATGGTTTTCCAAAGGATAATGTTTTGCTCATGTTGATTTCTCAATTGTCCTATACTTCTGATTGCAATTGGCTA
CAAAAGGCATGTAACTTGGTTCTTCAAATTTGGAAAGAGAAACCAGTTGTATTGCAACTTGATGCCTTAACTAAACTTGCCCTCGGATTGGCAAGATTCCAAATGCCGGT
TCCTGCTTCGGATATTCTTAGATTGATGCTGCATATGAAGAGATTACCGCGAATGGAACTGTTGCAGCTGGTTATTATGCACATGGTGAAGACAGAGGTTGGAACATATC
TTGCTACTAATATATTGGTCCAGATTTGTGATTGTTTTTTACAACAGGCTGCAAGTAGAAATGACCAAGCGAAGTCGATGAAACCAGATACTGTGATCTTTAACCTGATA
CTCCAAGCCTGTGTCGGGTTTAGATTATCTTTTAAGGGTCAGCAGCTTGTGGAATTGATGTCTCAAACTGGAGTTGTTGCTGATGCACAGACAGTTGTTCTAATTGCTCG
GATTTATGACATGAATGGTCAAAGAGATGATCTAAAGAATCTCAAAATCCACATTGACCAAGTTATGCCTTCATTGGCTTGTCATTATTGTCAGTTCTATGATAGCTTGT
TGAGCTTGGACTTTAAGTTTGATGATTTTGATTCCGTTGCTAACCTTGTGCTGGAAATATGTCGATTTGGTGAGTCTCCTAGCATTCAAAAACATTCGAGGGACTTGCAA
AAGTCTAGCCTTGTTCCAATTGGATCACACCATCTAAAGGATGGATTAAAGATAAAGATTATGGCAGAACTACTGCAAAGAGATTATGTTGTCAATGTGGAAGTCAAACC
AGAGTTTATAAATTGTAAGAATGGGAAACTTGTTGCCAGTAACAAGACCCTTGCTAAACTCATTGTTGAATTCAAGAGACTTGGAAAAACTTCTGATCTCTCGAAACTTT
TACTTCAGGTTCAAAAAGGGTTGGCCTCAGTTGAAGGTTGTAATTTATGTTCTGATGTAGTTAAAGCTTGCATTTATTTAGGTTGGCTCGAAACTGCTCATGATATTTTG
GACGATGTTGAAGCAGCTGGTTCTGCAATGGACTCCTCTGTATATTTCTTGCTCTTGAAAGCATATTACAAAGAGGAAATGCTCAGAGAAGCAGATGTACTGCAAAAACA
AATGGCAAAGGCTGGCCTGTCCACTGCTACCGCAGAAGACATGGCTAACCGAAGTTTGTTGCACGAAAATGAATCAATTACCCATGACACATCTCTGGTTGAATCTCTAG
TTCAAGAAATGAAAGAGACCAGTGCGACATCTCCTAGAGTTTACAAGTTCAATTCTTCCATTTACTTTTTCTGCAAGGCCAAAATGATTGAGGATGCCTTACAGGCATAC
AAAAGAATGCAGCAGACAGGCATTCAACCCACTGCGCAAACTTTCGCCAATCTAGCTTTCGGGTTTTCTTCGTTGCAAATGTATCGTGACATCACGATCCTATGGGGAGA
CATGAAGAGGAATATGCAGAGCAGGAGTTTGGTGCTGAGCAGAGATCTCTATGAGTTCTTGTTGCTGTGCTTTCTTCAAGGTGGTTACTTTGAGAGAGTGATGGAAATTG
TTGGACATATGGAGGAGCAGAAGATGTTCACTGACAAGGGAATGTACAAAAATCAGTTTCTAAAGCTTCACAAGAATCTTTATAGGAGTTTAAAGCCATCAGAAGCCAGA
ACTGAGGCACAGAAGAATAGATTAGAGTATGTTAGAGCATTCAAAAAATGGGTTGGTATTTACTGAAATTATACTTGAACCTTGAAACTAGTTTGACAAAGATTTATTTT
GTTTAGATTTTTGTAGAACCATCTCCAAGGATATAGTTGAAACCCATTAAAGAAGATTGTTTGAACTAGGGATCAAAACAAAGATATTATGTTTAGCAAAGTGGAAAAAT
AATGAAACAAGTCTACATTTTTTAGATGTGATCGAGTATGTACGGTGAAAACTTCCAATCTATGTCTCATTATCTAGCCAGTGTTGAACGTTATGACTCGATTGATAACT
ATTTA
Protein sequenceShow/hide protein sequence
MAVVLARERLLQSRLSMGFSLKSKLDSALQSYASILACSEKLIFIKKFVNPRFAELWYIKPQIPLCRCVSTSVHTTKLCWGGSSYAVLLGKLEIALKDHQIDEAWELFND
FRRLYGFPKDNVLLMLISQLSYTSDCNWLQKACNLVLQIWKEKPVVLQLDALTKLALGLARFQMPVPASDILRLMLHMKRLPRMELLQLVIMHMVKTEVGTYLATNILVQ
ICDCFLQQAASRNDQAKSMKPDTVIFNLILQACVGFRLSFKGQQLVELMSQTGVVADAQTVVLIARIYDMNGQRDDLKNLKIHIDQVMPSLACHYCQFYDSLLSLDFKFD
DFDSVANLVLEICRFGESPSIQKHSRDLQKSSLVPIGSHHLKDGLKIKIMAELLQRDYVVNVEVKPEFINCKNGKLVASNKTLAKLIVEFKRLGKTSDLSKLLLQVQKGL
ASVEGCNLCSDVVKACIYLGWLETAHDILDDVEAAGSAMDSSVYFLLLKAYYKEEMLREADVLQKQMAKAGLSTATAEDMANRSLLHENESITHDTSLVESLVQEMKETS
ATSPRVYKFNSSIYFFCKAKMIEDALQAYKRMQQTGIQPTAQTFANLAFGFSSLQMYRDITILWGDMKRNMQSRSLVLSRDLYEFLLLCFLQGGYFERVMEIVGHMEEQK
MFTDKGMYKNQFLKLHKNLYRSLKPSEARTEAQKNRLEYVRAFKKWVGIY