; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020558 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020558
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr05:556327..561938
RNA-Seq ExpressionHG10020558
SyntenyHG10020558
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR009349 - Zinc finger, C2HC5-type
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR039128 - Activating signal cointegrator 1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036077.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0061.57Show/hide
Query:  MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDF
        MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVS  +L                                A   +A                       
Subjt:  MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDF

Query:  VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSV
                                             Y  SNAMDHALKLF+T+L+PNVISWN II+GF++ FLHLDS R FC MH+LGF+P+E+T GSV
Subjt:  VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSV

Query:  LSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSS
        LSACAAIQA MFGKQVYSLAVRNGFF NGYVR  MIDLFAKDS FLDALRVF DVDC NVVCWNAIVSAAV NGE LMALDLFN MCS  LEPNSFTFSS
Subjt:  LSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSS

Query:  VLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTS
        VLTAC+AL+DLE GK VQGRVIKCGG DVFVETAL+ LY KCGDMD AVK F +MPIRNVVSWT I+SGFVQNNDYLM +K FED+RK+GEEINSYTVT+
Subjt:  VLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTS

Query:  VLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS------------------
        +L ACANP M KEATQLH+WILKAGFSS A V AALI MYSK GAIDLSLMVFREMDN RNLSSWTAMI S A+NNDKE+AS                  
Subjt:  VLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSIS
                      REIHGYS+R GL ++V+ GSSLVTMYSKCGNL LARRVFETLPQKD I CSSLVSGYAQQK  +EALLLF  LLVAGLAID FSIS
Subjt:  --------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSIS

Query:  SVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI-------------------------------
        S+LG IALL RPAIG QIHALI+KV LEKDVSVGSSLVMVYS+CGSIEDCCKAFGQIGKPDLI WT+MI                               
Subjt:  SVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------NIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQ
                         NIIGQEVGKSVI+EYLRLRGHSDLCSKTLDVP+STLH YVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNV++Q
Subjt:  -----------------NIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQ

Query:  VSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD
        VS D RNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD
Subjt:  VSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD

Query:  AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDE
        AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQ EGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDD+SELES  NILR  DE
Subjt:  AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDE

Query:  REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMENDLKTSF
        REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHD +ELKH M+EN+L+TSF
Subjt:  REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMENDLKTSF

KAE8652506.1 hypothetical protein Csa_013256 [Cucumis sativus]0.0e+0075.3Show/hide
Query:  MNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MNFIAIQ FVNKTL+SPRRLVSSVA VDN SNFSFTKI TF  F+P++LLNDFVK  K SLRNTKVLHAKLLR T L  +IYVSNSLL  YSKSNAMDHA
Subjt:  MNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD
        +KLF+T+L+PNVISWN II+G ++ FLHLDS RTFC MHFLGF+P+E+T GSVLSACAAIQA MFGKQVYSLAVRNGFF NGYVR  MIDLFAKDS FLD
Subjt:  LKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD

Query:  ALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDG
        ALRVF DVDC NVVCWNAIVSAAV NGENLMALDLFN MCS  LEPNSFTFSSVLTAC+AL+DLE GK+VQGRVIKCGG DVFVETAL+ LY KCGDMD 
Subjt:  ALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDG

Query:  AVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAID
        AVKTFL+MPIRNVVSWT I+SGFVQ+NDYLM +KFFED+RKVGEEINSYTVT++L ACANPAM KEATQLH+WILKAGFSSH+ VAAALI MYSK GA+D
Subjt:  AVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAID

Query:  LSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS-------------------------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLEL
        LSLM+FREMDN RNLSSWTAMI SFA+NNDKE+AS                               R+IH Y+++  L  +V VGSSL+TMYSKCG+L+ 
Subjt:  LSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS-------------------------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLEL

Query:  ARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIE
        A +VFE +P+KD+++ + ++S +++    ++A+ LF ++L+  +  D  S+S+VL A   L    +G +IH   ++V L ++V++GSSLV +YSKCG++ 
Subjt:  ARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIE

Query:  DCCKAFGQIGKPD--------------------LIAWTAMINIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTP
           + F  + + D                    L+ + +++NIIGQEVGKSVI+EYLRLRGHSDLCSKTLDVP+STLH YVKPPSHE SFGGSKKPVKTP
Subjt:  DCCKAFGQIGKPD--------------------LIAWTAMINIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTP

Query:  KTISISSKEIEPKKATSSSNVENQVSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSF
        KTISISSKEIEPKKAT+SSNVE+QVSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSF
Subjt:  KTISISSKEIEPKKATSSSNVENQVSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSF

Query:  CGSLVLREGSTYAGMDEGFTPLSDAEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGR
        CGSLVLREGSTYAGMDEGFTPLSDAEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQ EGNSWLSNEEKELL+KKQEEIEEAERAKRNKVVVTFDLVGR
Subjt:  CGSLVLREGSTYAGMDEGFTPLSDAEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGR

Query:  KVLLNEDDASELESRNNILRPPDEREVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMEN
        KVLLNEDD+SELES  NI+RP DEREVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAV KKGICLEITGRVQHD +ELKHLMME+
Subjt:  KVLLNEDDASELESRNNILRPPDEREVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMEN

TYJ98884.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0067.17Show/hide
Query:  MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDF
        MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVS          NQM+FIAIQT VNKTLLSPRRLVSSVATVDN SNFSFTKIETF  F+P+ LLNDF
Subjt:  MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDF

Query:  VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSV
        VK    SLRNTKVLHAK LR T    +IYVSNSLL CYSKSNAMDHALKLF+T+L+PNVISWN II+GF++ FLHLDS R FC MH+LGF+P+E+T GSV
Subjt:  VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSV

Query:  LSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSS
        LSACAAIQA MFGKQVYSLAVRNGFF NGYVR  MIDLFAKDS FLDALRVF DVDC NVVCWNAIVSAAV NGE LMALDLFN MCS  LEPNSFTFSS
Subjt:  LSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSS

Query:  VLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTS
        VLTAC+AL+DLE GK VQGRVIKCGG DVFVETAL+ LY KCGDMD AVK F +MPIRNVVSWT I+SGFVQNNDYLM +K FED+RK+GEEINSYTVT+
Subjt:  VLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTS

Query:  VLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS------------------
        +L ACANP M KEATQLH+WILKAGFSS A V AALI MYSK GAIDLSLMVFREMDN RNLSSWTAMI S A+NNDKE+AS                  
Subjt:  VLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSIS
                      REIHGYS+R GL ++V+ GSSLVTMYSKCGNL LARRVFETLPQKD I CSSLVSGYAQQK  +EALLLF  LLVAGLAID FSIS
Subjt:  --------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSIS

Query:  SVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI-------------------------------
        S+LG IALL RPAIG QIHALI+KV LEKDVSVGSSLVMVYS+CGSIEDCCKAFGQIGKPDLI WT+MI                               
Subjt:  SVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------NIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQ
                         NIIGQEVGKSVI+EYLRLRGHSDLCSKTLDVP+STLH YVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNV++Q
Subjt:  -----------------NIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQ

Query:  VSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD
        VS D RNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD
Subjt:  VSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD

Query:  AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDE
        AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQ EGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDD+SELES  NILR  DE
Subjt:  AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDE

Query:  REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMENDLKTSF
        REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHD +ELKH M+EN+L+TSF
Subjt:  REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMENDLKTSF

XP_023519257.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita pepo subsp. pepo]1.8e-28071.88Show/hide
Query:  MNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MNF  I TFVNKTLLS RRL+SSVATVDNAS+FSFTKIET+PLFDP +LL+D+VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN++DHA
Subjt:  MNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD
        LKLF+TMLHPNVISWNI+IS F+H F++LDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD
Subjt:  LKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD

Query:  ALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDG
        ALRVF DVDCENVVCWNAIVSAAVRNGEN MALDL+NTMC G LEPNSFTFSSVLTACAALE  E GKRVQG+VIKCGGEDVFVETALIDLY+KCG+MD 
Subjt:  ALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDG

Query:  AVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAID
        AVK FLRMPIRNVVSWTAIISGFVQ NDYLMALKFF+DMRK+GEEINSYTVTSVLTACANPAMTKEA QLH+WIL+AGFSSHAVV AALINMYSK GAID
Subjt:  AVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAID

Query:  LSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS-----------------------------------------------------------------
        LS+ VF EMDN+RNLSSWTAMITSFAQNNDKEKAS                                                                 
Subjt:  LSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS-----------------------------------------------------------------

Query:  -------------------------------------------------------------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLE
                                                                           REIH YSVR GL KDVA+G SLVTMYSKCGNLE
Subjt:  -------------------------------------------------------------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLE

Query:  LARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSI
        +ARRVFETLP+KD+IACSSLVSGYAQ K  +E +LLF DLL AGLAID FSISS+LGAIALLNRP IG Q+HA+I KV LEKDVSVGSSLVMVYSKCGSI
Subjt:  LARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSI

Query:  EDCCKAFGQIGKPDLIAWTAMI
        EDCCKAF QIGKPDLI WTAMI
Subjt:  EDCCKAFGQIGKPDLIAWTAMI

XP_038893557.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Benincasa hispida]1.6e-29775.14Show/hide
Query:  VSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNS
        ++Q   + LLQMNQMNFIAIQTFVNKTLLSPR LVSSVATVD+ SNFSFTKI TFP  DPL+ LNDFVKSRKCSLRNTKVLHAKLLRA LLHSNIYVSNS
Subjt:  VSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNS

Query:  LLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRA
        LLDCYSKSNAMDHALKLF+TMLHPNVISWNIIISGF++KFLHLD+ RTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRA
Subjt:  LLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRA

Query:  GMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVET
        GMIDLFAKDSSFLDALRVF DVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSG LEPNSFTFSSVLTACAALEDLE GKRVQGRVIKCGGEDVFVET
Subjt:  GMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVET

Query:  ALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVA
        ALID Y KCGD D AVK FLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRK GEEINSYTVTSVLTACANPAMTKEATQLH+WILKAGFSSHAVVA
Subjt:  ALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVA

Query:  AALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS---------------------------------------------------
        AALINMYSK GAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKE AS                                                   
Subjt:  AALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS---------------------------------------------------

Query:  ---------------------------------------------------------------------------------REIHGYSVREGLGKDVAVG
                                                                                         REIHGYSVR GLGKDVAVG
Subjt:  ---------------------------------------------------------------------------------REIHGYSVREGLGKDVAVG

Query:  SSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSV
        +SLV MYSKCGNL LARR+FE LPQKDHIACSSL+SGYAQQK NE+A LLF DLLVAGLAID FSISS+LGAIALLNRPAIG QIHA+IMKV LEKDVSV
Subjt:  SSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSV

Query:  GSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI
        GSSLVMVYSKCGS+EDCCKAFGQIGKPDLI WTAMI
Subjt:  GSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI

TrEMBL top hitse value%identityAlignment
A0A1S3B4I2 pentatricopeptide repeat-containing protein At1g74600, chloroplastic3.8e-25266.76Show/hide
Query:  MNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        M+FIAIQT VNKTLLSPRRLVSSVATVDN SNFSFTKIETF  F+P+ LLNDFVK    SLRNTKVLHAK LR T    +IYVSNSLL CYSKSNAMDHA
Subjt:  MNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD
        LKLF+T+L+PNVISWN II+GF++ FLHLDS R FC MH+LGF+P+E+T GSVLSACAAIQA MFGKQVYSLAVRNGFF NGYVR  MIDLFAKDS FLD
Subjt:  LKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD

Query:  ALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDG
        ALRVF DVDC NVVCWNAIVSAAV NGE LMALDLFN MCS  LEPNSFTFSSVLTAC+AL+DLE GK VQGRVIKCGG DVFVETAL+ LY KCGDMD 
Subjt:  ALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDG

Query:  AVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAID
        AVK F +MPIRNVVSWT I+SGFVQNNDYLM +K FED+RK+GEEINSYTVT++L ACANP M KEATQLH+WILKAGFSS A V AALI MYSK GAID
Subjt:  AVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAID

Query:  LSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS-----------------------------------------------------------------
        LSLMVFREMDN RNLSSWTAMI S A+NNDKE+AS                                                                 
Subjt:  LSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS-----------------------------------------------------------------

Query:  -------------------------------------------------------------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLE
                                                                           REIHGYS+R GL ++V+ GSSLVTMYSKCGNL 
Subjt:  -------------------------------------------------------------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLE

Query:  LARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSI
        LARRVFETLPQKD I CSSLVSGYAQQK  +EALLLF  LLVAGLAID FSISS+LG IALL RPAIG QIHALI+KV LEKDVSVGSSLVMVYS+CGSI
Subjt:  LARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSI

Query:  EDCCKAFGQIGKPDLIAWTAMI
        EDCCKAFGQIGKPDLI WT+MI
Subjt:  EDCCKAFGQIGKPDLIAWTAMI

A0A5A7T3B5 Pentatricopeptide repeat-containing protein0.0e+0061.57Show/hide
Query:  MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDF
        MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVS  +L                                A   +A                       
Subjt:  MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDF

Query:  VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSV
                                             Y  SNAMDHALKLF+T+L+PNVISWN II+GF++ FLHLDS R FC MH+LGF+P+E+T GSV
Subjt:  VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSV

Query:  LSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSS
        LSACAAIQA MFGKQVYSLAVRNGFF NGYVR  MIDLFAKDS FLDALRVF DVDC NVVCWNAIVSAAV NGE LMALDLFN MCS  LEPNSFTFSS
Subjt:  LSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSS

Query:  VLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTS
        VLTAC+AL+DLE GK VQGRVIKCGG DVFVETAL+ LY KCGDMD AVK F +MPIRNVVSWT I+SGFVQNNDYLM +K FED+RK+GEEINSYTVT+
Subjt:  VLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTS

Query:  VLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS------------------
        +L ACANP M KEATQLH+WILKAGFSS A V AALI MYSK GAIDLSLMVFREMDN RNLSSWTAMI S A+NNDKE+AS                  
Subjt:  VLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSIS
                      REIHGYS+R GL ++V+ GSSLVTMYSKCGNL LARRVFETLPQKD I CSSLVSGYAQQK  +EALLLF  LLVAGLAID FSIS
Subjt:  --------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSIS

Query:  SVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI-------------------------------
        S+LG IALL RPAIG QIHALI+KV LEKDVSVGSSLVMVYS+CGSIEDCCKAFGQIGKPDLI WT+MI                               
Subjt:  SVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------NIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQ
                         NIIGQEVGKSVI+EYLRLRGHSDLCSKTLDVP+STLH YVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNV++Q
Subjt:  -----------------NIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQ

Query:  VSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD
        VS D RNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD
Subjt:  VSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD

Query:  AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDE
        AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQ EGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDD+SELES  NILR  DE
Subjt:  AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDE

Query:  REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMENDLKTSF
        REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHD +ELKH M+EN+L+TSF
Subjt:  REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMENDLKTSF

A0A5D3BIJ5 Pentatricopeptide repeat-containing protein0.0e+0067.17Show/hide
Query:  MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDF
        MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVS          NQM+FIAIQT VNKTLLSPRRLVSSVATVDN SNFSFTKIETF  F+P+ LLNDF
Subjt:  MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDF

Query:  VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSV
        VK    SLRNTKVLHAK LR T    +IYVSNSLL CYSKSNAMDHALKLF+T+L+PNVISWN II+GF++ FLHLDS R FC MH+LGF+P+E+T GSV
Subjt:  VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSV

Query:  LSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSS
        LSACAAIQA MFGKQVYSLAVRNGFF NGYVR  MIDLFAKDS FLDALRVF DVDC NVVCWNAIVSAAV NGE LMALDLFN MCS  LEPNSFTFSS
Subjt:  LSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSS

Query:  VLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTS
        VLTAC+AL+DLE GK VQGRVIKCGG DVFVETAL+ LY KCGDMD AVK F +MPIRNVVSWT I+SGFVQNNDYLM +K FED+RK+GEEINSYTVT+
Subjt:  VLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTS

Query:  VLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS------------------
        +L ACANP M KEATQLH+WILKAGFSS A V AALI MYSK GAIDLSLMVFREMDN RNLSSWTAMI S A+NNDKE+AS                  
Subjt:  VLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSIS
                      REIHGYS+R GL ++V+ GSSLVTMYSKCGNL LARRVFETLPQKD I CSSLVSGYAQQK  +EALLLF  LLVAGLAID FSIS
Subjt:  --------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSIS

Query:  SVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI-------------------------------
        S+LG IALL RPAIG QIHALI+KV LEKDVSVGSSLVMVYS+CGSIEDCCKAFGQIGKPDLI WT+MI                               
Subjt:  SVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------NIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQ
                         NIIGQEVGKSVI+EYLRLRGHSDLCSKTLDVP+STLH YVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNV++Q
Subjt:  -----------------NIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQ

Query:  VSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD
        VS D RNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD
Subjt:  VSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSD

Query:  AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDE
        AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQ EGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDD+SELES  NILR  DE
Subjt:  AEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDE

Query:  REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMENDLKTSF
        REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHD +ELKH M+EN+L+TSF
Subjt:  REVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMENDLKTSF

A0A6J1E7L2 pentatricopeptide repeat-containing protein At1g74600, chloroplastic1.5e-28071.88Show/hide
Query:  MNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MNF  I TFVNKTLLS RRL+SSVATVDNAS+FSFTKIET+PLFDP +LL+D+VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN++DHA
Subjt:  MNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD
        LKLF+TMLHPNVISWNI+IS F+H FL+LDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD
Subjt:  LKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD

Query:  ALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDG
        ALRVF D+ CENVVCWNAIVSAAVRNGEN MALDL+NTMC GLLEPNSFTFSSVLTACAALE  E GKRVQG+VIKCGGEDVFVETALIDLY+KCG+MD 
Subjt:  ALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDG

Query:  AVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAID
        AVK FLRMPIRNVVSWTAIISGFVQ NDYLMALKFF+DMRK+GEEINSYTVTSVLTACANPAMTKEA QLH+WIL+AG+SSHAVV AALINMYSK GAID
Subjt:  AVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAID

Query:  LSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS-----------------------------------------------------------------
        LS+ VF EMDNQRNLSSWTAMITSFAQNNDKEKAS                                                                 
Subjt:  LSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS-----------------------------------------------------------------

Query:  -------------------------------------------------------------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLE
                                                                           REIH YSVR GL KDVA+G SLVTMYSKCGNLE
Subjt:  -------------------------------------------------------------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLE

Query:  LARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSI
        +ARRVFETLP+KD+IACSSLVSGYAQ K  +E +LLF DLL AGLAID FSISS+LGAIALLNRP IG Q+HA+I KV LEKDVSVGSSLVMVYSKCGSI
Subjt:  LARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSI

Query:  EDCCKAFGQIGKPDLIAWTAMI
        EDCCKAF QIGKPDLI WTAMI
Subjt:  EDCCKAFGQIGKPDLIAWTAMI

A0A6J1KIC5 pentatricopeptide repeat-containing protein At1g74600, chloroplastic8.6e-28171.75Show/hide
Query:  MNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA
        MNF  I TFVNKTLLS RRL+SSVATVDNAS+FSFTKIET+PLFDP +LL+D+VKSRKCSLR+TKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN++DHA
Subjt:  MNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHA

Query:  LKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD
        LKLF+TMLHPNVISWNI+IS F+H FL+LDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAK+SSFLD
Subjt:  LKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLD

Query:  ALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDG
        ALRVF+DVDCENVVCWNAIVSAAVRNGEN MALDL+NTMC G LEPNSFTFSSVLTACAALE  E GKRVQG+VIKCGGEDVFVETALIDLY+KCG+MD 
Subjt:  ALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCGDMDG

Query:  AVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAID
        AVK FLRMPIRNVVSWTAIISGFVQ NDYLMALKFF+DMRK+GEEINSYTVTSVLTACANPAMTKEA QLH+WIL+AGFSSHAVV AALINMYSK GAID
Subjt:  AVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKTGAID

Query:  LSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS-----------------------------------------------------------------
        LS+ VF EMDNQRNLSSWTAMITSFAQNNDKEKAS                                                                 
Subjt:  LSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKAS-----------------------------------------------------------------

Query:  -------------------------------------------------------------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLE
                                                                           REIH YSVR GL KDVA+G SLVTMYSKCGNLE
Subjt:  -------------------------------------------------------------------REIHGYSVREGLGKDVAVGSSLVTMYSKCGNLE

Query:  LARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSI
        +ARRVFETLP+KD+IACSSLVSGYAQ K  +E +LLF DLL AGLAID FSISS+LGAIALLNRP IG Q+HA+I KV LEKDVS+GSSLVMVYSKCGSI
Subjt:  LARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSI

Query:  EDCCKAFGQIGKPDLIAWTAMI
        EDCCKAF QIGKPDLI WTAMI
Subjt:  EDCCKAFGQIGKPDLIAWTAMI

SwissProt top hitse value%identityAlignment
Q9CA56 Pentatricopeptide repeat-containing protein At1g74600, chloroplastic5.2e-14240.77Show/hide
Query:  MNFIAIQTFVNKTLLSP---RRLVSSVATVDNASNFSFTKIETFPL-FDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNA
        MN +A ++ +N   +SP    RL+SSV    N  +FS     +    F+P    ND   SR C+LR TK+L A LLR  LL  +++++ SLL  YS S +
Subjt:  MNFIAIQTFVNKTLLSP---RRLVSSVATVDNASNFSFTKIETFPL-FDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNA

Query:  MDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDS
        M  A KLF+T+  P+V+S NI+ISG+    L  +S R F +MHFLGFE +EI+YGSV+SAC+A+QAP+F + V    ++ G+F    V + +ID+F+K+ 
Subjt:  MDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDS

Query:  SFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCG
         F DA +VFRD    NV CWN I++ A+RN       DLF+ MC G  +P+S+T+SSVL ACA+LE L  GK VQ RVIKCG EDVFV TA++DLY KCG
Subjt:  SFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCG

Query:  DMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKT
         M  A++ F R+P  +VVSWT ++SG+ ++ND   AL+ F++MR  G EIN+ TVTSV++AC  P+M  EA+Q+HAW+ K+GF   + VAAALI+MYSK+
Subjt:  DMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKT

Query:  GAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKASR------------------------------------------------------------
        G IDLS  VF ++D+ +  +    MITSF+Q+    KA R                                                            
Subjt:  GAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKASR------------------------------------------------------------

Query:  ------------------------------------------------------------------------EIHGYSVREGLGKDVAVGSSLVTMYSKC
                                                                                EIHGY++R G+ K + +GS+LV MYSKC
Subjt:  ------------------------------------------------------------------------EIHGYSVREGLGKDVAVGSSLVTMYSKC

Query:  GNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSK
        G+L+LAR+V++ LP+ D ++CSSL+SGY+Q    ++  LLF D++++G  +DSF+ISS+L A AL +  ++G Q+HA I K+ L  + SVGSSL+ +YSK
Subjt:  GNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSK

Query:  CGSIEDCCKAFGQIGKPDLIAWTAMI
         GSI+DCCKAF QI  PDLIAWTA+I
Subjt:  CGSIEDCCKAFGQIGKPDLIAWTAMI

Q9FLX6 Pentatricopeptide repeat-containing protein At5g52850, chloroplastic3.5e-6127.81Show/hide
Query:  LHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFG
        +H  +++  LL  N+ + N+LL  Y K++ + +A KLF+ M H  V +W ++IS F        +   F  M   G  P+E T+ SV+ +CA ++   +G
Subjt:  LHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFG

Query:  KQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLES
         +V+   ++ GF  N  V + + DL++K   F +A  +F  +   + + W  ++S+ V   +   AL  ++ M    + PN FTF  +L A + L  LE 
Subjt:  KQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLES

Query:  GKRVQGRVIKCG-GEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTK
        GK +   +I  G   +V ++T+L+D Y++   M+ AV+       ++V  WT+++SGFV+N     A+  F +MR +G + N++T +++L+ C+      
Subjt:  GKRVQGRVIKCG-GEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTK

Query:  EATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDL-SLMVFREMDNQRNLSSWTAMITSFAQNNDKE--------------------------------
           Q+H+  +K GF     V  AL++MY K  A ++ +  VF  M +  N+ SWT +I     +   +                                
Subjt:  EATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDL-SLMVFREMDNQRNLSSWTAMITSFAQNNDKE--------------------------------

Query:  --KASREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALL
          +   EIH Y +R  +  ++ VG+SLV  Y+    ++ A  V  ++ ++D+I  +SLV+ + +  ++E AL + + +   G+ +D  S+   + A A L
Subjt:  --KASREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALL

Query:  NRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMIN
             G  +H   +K       SV +SLV +YSKCGS+ED  K F +I  PD+++W  +++
Subjt:  NRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMIN

Q9FWA6 Pentatricopeptide repeat-containing protein At3g02330, mitochondrial4.8e-6329.29Show/hide
Query:  NSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYV
        N +++ YSKSN M  A   F  M   +V+SWN ++SG+      L S   F  M   G E    T+  +L  C+ ++    G Q++ + VR G   +   
Subjt:  NSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYV

Query:  RAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCG-GEDVF
         + ++D++AK   F+++LRVF+ +  +N V W+AI++  V+N    +AL  F  M       +   ++SVL +CAAL +L  G ++    +K     D  
Subjt:  RAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCG-GEDVF

Query:  VETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHA
        V TA +D+Y KC +M  A   F      N  S+ A+I+G+ Q      AL  F  +   G   +  +++ V  ACA      E  Q++   +K+  S   
Subjt:  VETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHA

Query:  VVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKA---------------------------------SREIHGYSVREGLGKD
         VA A I+MY K  A+  +  VF EM  +R+  SW A+I +  QN    +                                    EIH   V+ G+  +
Subjt:  VVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKA---------------------------------SREIHGYSVREGLGKD

Query:  VAVGSSLVTMYSKCGNLELARRVFETLPQKDH--------------------IACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALL
         +VG SL+ MYSKCG +E A ++     Q+ +                    ++ +S++SGY  ++++E+A +LF  ++  G+  D F+ ++VL   A L
Subjt:  VAVGSSLVTMYSKCGNLELARRVFETLPQKDH--------------------IACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALL

Query:  NRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI
            +G QIHA ++K  L+ DV + S+LV +YSKCG + D    F +  + D + W AMI
Subjt:  NRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI

Q9SVA5 Pentatricopeptide repeat-containing protein At4g395301.5e-6729.76Show/hide
Query:  LRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLH-------LDSWRTFCRMHFLGFEPSEITYGSV
        L    V+H +++    L  + Y+SN L++ YS++  M +A K+FE M   N++SW+ ++S  +H  ++       L+ WRT          P+E    S 
Subjt:  LRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLH-------LDSWRTFCRMHFLGFEPSEITYGSV

Query:  LSACAAI--QAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTF
        + AC+ +  +      Q+ S  V++GF  + YV   +ID + KD +   A  VF  +  ++ V W  ++S  V+ G + ++L LF  +    + P+ +  
Subjt:  LSACAAI--QAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTF

Query:  SSVLTACAALEDLESGKRVQGRVIKCGGE-DVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYT
        S+VL+AC+ L  LE GK++   +++ G E D  +   LID Y KCG +  A K F  MP +N++SWT ++SG+ QN  +  A++ F  M K G + + Y 
Subjt:  SSVLTACAALEDLESGKRVQGRVIKCGGE-DVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYT

Query:  VTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKT-------------GAIDL---------------------SLMVFREMD---NQR
         +S+LT+CA+       TQ+HA+ +KA   + + V  +LI+MY+K               A D+                     +L +FR+M     + 
Subjt:  VTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKT-------------GAIDL---------------------SLMVFREMD---NQR

Query:  NLSSWTAMITSFAQNNDKEKASREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGL
        +L ++ +++ + A        S++IHG   + GL  D+  GS+L+ +YS C  L+ +R VF+ +  KD +  +S+ +GY QQ  NEEAL LF +L ++  
Subjt:  NLSSWTAMITSFAQNNDKEKASREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGL

Query:  AIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMIN
          D F+ ++++ A   L    +G + H  ++K  LE +  + ++L+ +Y+KCGS ED  KAF      D++ W ++I+
Subjt:  AIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMIN

Q9SVP7 Pentatricopeptide repeat-containing protein At4g136502.2e-6828.7Show/hide
Query:  LHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFG
        +HA++L   L  S + V N L+D YS++  +D A ++F+ +   +  SW  +ISG        ++ R FC M+ LG  P+   + SVLSAC  I++   G
Subjt:  LHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFG

Query:  KQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLES
        +Q++ L ++ GF  + YV   ++ L+    + + A  +F ++   + V +N +++   + G    A++LF  M    LEP+S T +S++ AC+A   L  
Subjt:  KQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLES

Query:  GKRVQGRVIKCG-GEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTK
        G+++     K G   +  +E AL++LY KC D++ A+  FL   + NVV W  ++  +   +D   + + F  M+      N YT  S+L  C      +
Subjt:  GKRVQGRVIKCG-GEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTK

Query:  EATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKA-------------------------------
           Q+H+ I+K  F  +A V + LI+MY+K G +D +  +       +++ SWT MI  + Q N  +KA                               
Subjt:  EATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKA-------------------------------

Query:  ---SREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLN
            ++IH  +   G   D+   ++LVT+YS+CG +E +   FE     D+IA ++LVSG+ Q   NEEAL +F  +   G+  ++F+  S + A +   
Subjt:  ---SREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLN

Query:  RPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMINIIGQE-VGKSVIDEY
            G Q+HA+I K   + +  V ++L+ +Y+KCGSI D  K F ++   + ++W A+IN   +   G   +D +
Subjt:  RPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMINIIGQE-VGKSVIDEY

Arabidopsis top hitse value%identityAlignment
AT1G74600.1 pentatricopeptide (PPR) repeat-containing protein3.7e-14340.77Show/hide
Query:  MNFIAIQTFVNKTLLSP---RRLVSSVATVDNASNFSFTKIETFPL-FDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNA
        MN +A ++ +N   +SP    RL+SSV    N  +FS     +    F+P    ND   SR C+LR TK+L A LLR  LL  +++++ SLL  YS S +
Subjt:  MNFIAIQTFVNKTLLSP---RRLVSSVATVDNASNFSFTKIETFPL-FDPLELLNDFVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNA

Query:  MDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDS
        M  A KLF+T+  P+V+S NI+ISG+    L  +S R F +MHFLGFE +EI+YGSV+SAC+A+QAP+F + V    ++ G+F    V + +ID+F+K+ 
Subjt:  MDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDS

Query:  SFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCG
         F DA +VFRD    NV CWN I++ A+RN       DLF+ MC G  +P+S+T+SSVL ACA+LE L  GK VQ RVIKCG EDVFV TA++DLY KCG
Subjt:  SFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVFVETALIDLYTKCG

Query:  DMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKT
         M  A++ F R+P  +VVSWT ++SG+ ++ND   AL+ F++MR  G EIN+ TVTSV++AC  P+M  EA+Q+HAW+ K+GF   + VAAALI+MYSK+
Subjt:  DMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKT

Query:  GAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKASR------------------------------------------------------------
        G IDLS  VF ++D+ +  +    MITSF+Q+    KA R                                                            
Subjt:  GAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKASR------------------------------------------------------------

Query:  ------------------------------------------------------------------------EIHGYSVREGLGKDVAVGSSLVTMYSKC
                                                                                EIHGY++R G+ K + +GS+LV MYSKC
Subjt:  ------------------------------------------------------------------------EIHGYSVREGLGKDVAVGSSLVTMYSKC

Query:  GNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSK
        G+L+LAR+V++ LP+ D ++CSSL+SGY+Q    ++  LLF D++++G  +DSF+ISS+L A AL +  ++G Q+HA I K+ L  + SVGSSL+ +YSK
Subjt:  GNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSK

Query:  CGSIEDCCKAFGQIGKPDLIAWTAMI
         GSI+DCCKAF QI  PDLIAWTA+I
Subjt:  CGSIEDCCKAFGQIGKPDLIAWTAMI

AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein3.4e-6429.29Show/hide
Query:  NSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYV
        N +++ YSKSN M  A   F  M   +V+SWN ++SG+      L S   F  M   G E    T+  +L  C+ ++    G Q++ + VR G   +   
Subjt:  NSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYV

Query:  RAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCG-GEDVF
         + ++D++AK   F+++LRVF+ +  +N V W+AI++  V+N    +AL  F  M       +   ++SVL +CAAL +L  G ++    +K     D  
Subjt:  RAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCG-GEDVF

Query:  VETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHA
        V TA +D+Y KC +M  A   F      N  S+ A+I+G+ Q      AL  F  +   G   +  +++ V  ACA      E  Q++   +K+  S   
Subjt:  VETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHA

Query:  VVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKA---------------------------------SREIHGYSVREGLGKD
         VA A I+MY K  A+  +  VF EM  +R+  SW A+I +  QN    +                                    EIH   V+ G+  +
Subjt:  VVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKA---------------------------------SREIHGYSVREGLGKD

Query:  VAVGSSLVTMYSKCGNLELARRVFETLPQKDH--------------------IACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALL
         +VG SL+ MYSKCG +E A ++     Q+ +                    ++ +S++SGY  ++++E+A +LF  ++  G+  D F+ ++VL   A L
Subjt:  VAVGSSLVTMYSKCGNLELARRVFETLPQKDH--------------------IACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALL

Query:  NRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI
            +G QIHA ++K  L+ DV + S+LV +YSKCG + D    F +  + D + W AMI
Subjt:  NRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMI

AT3G47610.1 transcription regulators;zinc ion binding8.9e-11362.75Show/hide
Query:  NIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQVSSDTRNSSSGKGNQSS
        NIIG+E GKS+I EYL+ RG+ D  S         L  YVKP    G+  G+KKP KTPK  + S+++    K T+ +                   Q +
Subjt:  NIIGQEVGKSVIDEYLRLRGHSDLCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQVSSDTRNSSSGKGNQSS

Query:  SRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSDAEAAAEAYAKRLVEYDR
         +KKK  KV+SLAEAAKGSIVFQQGKPC+CQARRH LVSNCLSCGKIVCEQEGEGPCSFCG+LVL+EGSTYAG++ G+TP+SDA+ AAEAYAKRLVEYDR
Subjt:  SRKKKATKVVSLAEAAKGSIVFQQGKPCSCQARRHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSDAEAAAEAYAKRLVEYDR

Query:  NSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDEREVNRIKPNPSLQIHPV
        NSAART+VIDDQSDYY+ E ++WLS EEKEL+RKK+EEIEEAER K++KVV+TFDL+GRKVLLNEDD SELES N IL PP+ + VNRIKPNP+ ++ P+
Subjt:  NSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAERAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDEREVNRIKPNPSLQIHPV

Query:  FLDPGPREK---STKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMME
        FLDPGP EK   ST  +  NK  ++ G+CLEITGRVQHDR ELK+L  +
Subjt:  FLDPGPREK---STKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMME

AT3G47610.1 transcription regulators;zinc ion binding1.6e-0570.97Show/hide
Query:  GQWLEKALDDLCKKMETGWGLDKDMISGLVS
        GQWLE+AL DLC+K ETG   D+D+ISGLVS
Subjt:  GQWLEKALDDLCKKMETGWGLDKDMISGLVS

AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-6928.7Show/hide
Query:  LHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFG
        +HA++L   L  S + V N L+D YS++  +D A ++F+ +   +  SW  +ISG        ++ R FC M+ LG  P+   + SVLSAC  I++   G
Subjt:  LHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFG

Query:  KQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLES
        +Q++ L ++ GF  + YV   ++ L+    + + A  +F ++   + V +N +++   + G    A++LF  M    LEP+S T +S++ AC+A   L  
Subjt:  KQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLES

Query:  GKRVQGRVIKCG-GEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTK
        G+++     K G   +  +E AL++LY KC D++ A+  FL   + NVV W  ++  +   +D   + + F  M+      N YT  S+L  C      +
Subjt:  GKRVQGRVIKCG-GEDVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTK

Query:  EATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKA-------------------------------
           Q+H+ I+K  F  +A V + LI+MY+K G +D +  +       +++ SWT MI  + Q N  +KA                               
Subjt:  EATQLHAWILKAGFSSHAVVAAALINMYSKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKA-------------------------------

Query:  ---SREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLN
            ++IH  +   G   D+   ++LVT+YS+CG +E +   FE     D+IA ++LVSG+ Q   NEEAL +F  +   G+  ++F+  S + A +   
Subjt:  ---SREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGLAIDSFSISSVLGAIALLN

Query:  RPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMINIIGQE-VGKSVIDEY
            G Q+HA+I K   + +  V ++L+ +Y+KCGSI D  K F ++   + ++W A+IN   +   G   +D +
Subjt:  RPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMINIIGQE-VGKSVIDEY

AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.0e-6829.76Show/hide
Query:  LRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLH-------LDSWRTFCRMHFLGFEPSEITYGSV
        L    V+H +++    L  + Y+SN L++ YS++  M +A K+FE M   N++SW+ ++S  +H  ++       L+ WRT          P+E    S 
Subjt:  LRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLH-------LDSWRTFCRMHFLGFEPSEITYGSV

Query:  LSACAAI--QAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTF
        + AC+ +  +      Q+ S  V++GF  + YV   +ID + KD +   A  VF  +  ++ V W  ++S  V+ G + ++L LF  +    + P+ +  
Subjt:  LSACAAI--QAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTF

Query:  SSVLTACAALEDLESGKRVQGRVIKCGGE-DVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYT
        S+VL+AC+ L  LE GK++   +++ G E D  +   LID Y KCG +  A K F  MP +N++SWT ++SG+ QN  +  A++ F  M K G + + Y 
Subjt:  SSVLTACAALEDLESGKRVQGRVIKCGGE-DVFVETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYT

Query:  VTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKT-------------GAIDL---------------------SLMVFREMD---NQR
         +S+LT+CA+       TQ+HA+ +KA   + + V  +LI+MY+K               A D+                     +L +FR+M     + 
Subjt:  VTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMYSKT-------------GAIDL---------------------SLMVFREMD---NQR

Query:  NLSSWTAMITSFAQNNDKEKASREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGL
        +L ++ +++ + A        S++IHG   + GL  D+  GS+L+ +YS C  L+ +R VF+ +  KD +  +S+ +GY QQ  NEEAL LF +L ++  
Subjt:  NLSSWTAMITSFAQNNDKEKASREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALLLFHDLLVAGL

Query:  AIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMIN
          D F+ ++++ A   L    +G + H  ++K  LE +  + ++L+ +Y+KCGS ED  KAF      D++ W ++I+
Subjt:  AIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGTCAGGACAGTGGCTGGAGAAGGCGTTGGACGATCTCTGCAAGAAGATGGAAACTGGTTGGGGTCTCGATAAGGATATGATTTCGGGCTTGGTCTCGCAAAA
TCAACTCCTTCCCCTTCTTCAAATGAATCAAATGAATTTTATTGCGATTCAAACCTTCGTAAACAAGACATTATTATCCCCACGTAGATTGGTTTCCTCTGTCGCGACTG
TAGACAATGCGTCCAATTTTTCCTTCACCAAAATTGAAACTTTCCCTCTTTTCGATCCTTTAGAGTTGCTCAATGATTTTGTAAAATCGAGAAAATGCTCTTTGAGAAAC
ACGAAAGTTCTACACGCAAAGTTACTCCGAGCAACTCTTCTTCATTCCAATATCTATGTTTCAAATTCTTTGCTAGATTGTTATTCAAAGTCTAACGCTATGGACCATGC
ACTCAAACTGTTCGAGACAATGCTCCACCCAAATGTCATTTCTTGGAATATCATTATCTCGGGTTTCCACCACAAGTTCTTACATTTGGACTCGTGGAGAACATTTTGTA
GGATGCATTTCCTGGGTTTTGAACCTAGTGAGATAACATATGGGAGCGTTTTATCTGCTTGTGCTGCCATTCAAGCCCCAATGTTTGGTAAGCAGGTTTATTCTCTTGCG
GTGAGAAATGGGTTCTTTGTTAATGGTTATGTTCGAGCCGGAATGATTGATTTATTTGCAAAAGATTCTAGTTTTCTGGATGCTCTAAGGGTATTTCGTGATGTTGATTG
TGAGAATGTGGTGTGTTGGAATGCTATTGTCTCTGCAGCTGTAAGAAATGGTGAGAACTTGATGGCTCTGGATCTTTTCAACACAATGTGTAGTGGATTACTGGAGCCTA
ATAGTTTCACCTTTTCCAGTGTTCTAACTGCGTGTGCTGCACTTGAAGATCTTGAATCTGGGAAAAGAGTTCAAGGGAGAGTGATTAAATGTGGTGGAGAAGATGTTTTT
GTTGAGACAGCCCTTATTGATTTGTACACCAAGTGTGGAGATATGGATGGAGCTGTTAAGACCTTCTTGCGGATGCCCATTCGCAATGTGGTCTCTTGGACTGCTATAAT
ATCTGGCTTTGTGCAAAATAATGATTATTTAATGGCCCTCAAGTTTTTTGAAGATATGAGAAAAGTGGGAGAGGAAATTAATAGCTACACAGTTACTAGCGTGTTAACTG
CATGTGCTAATCCAGCCATGACAAAAGAAGCAACCCAACTTCATGCCTGGATTCTAAAAGCTGGTTTTTCTTCACATGCAGTGGTGGCGGCTGCTTTAATTAACATGTAT
TCAAAAACAGGAGCAATTGATCTTTCATTGATGGTTTTCAGAGAGATGGACAACCAAAGGAATCTTAGTTCTTGGACAGCTATGATAACTTCATTTGCACAGAATAATGA
TAAAGAGAAAGCAAGCAGAGAAATTCATGGTTACTCTGTTCGTGAGGGACTTGGCAAAGACGTAGCTGTTGGAAGTTCGCTTGTGACTATGTACTCGAAATGTGGCAACC
TGGAATTGGCTAGGAGGGTGTTTGAAACATTGCCCCAGAAAGATCATATTGCATGCTCTTCATTGGTTTCAGGATATGCTCAACAGAAGCGCAATGAAGAGGCTCTTTTG
CTATTCCACGATCTGCTGGTGGCTGGCTTAGCCATCGATTCCTTCTCAATCTCATCCGTACTGGGAGCTATTGCGCTTTTAAATAGGCCTGCAATTGGGATTCAAATCCA
TGCACTCATTATGAAAGTAGCCTTGGAGAAAGATGTTTCTGTTGGGAGTTCGCTAGTAATGGTATACTCCAAATGTGGAAGTATAGAAGACTGCTGCAAAGCATTTGGGC
AGATTGGAAAGCCTGATTTGATAGCCTGGACGGCCATGATTAACATCATAGGTCAGGAAGTTGGGAAAAGTGTGATAGATGAGTATTTGCGGCTGCGAGGCCACTCTGAC
CTCTGTAGCAAAACGTTGGATGTTCCATCTTCAACCTTACATGCCTATGTCAAGCCACCCTCCCATGAAGGTTCTTTTGGCGGATCCAAGAAACCTGTTAAAACACCAAA
GACCATTTCTATCTCCAGTAAAGAGATAGAACCAAAGAAGGCTACTAGCTCTAGTAACGTGGAAAATCAGGTTTCATCGGACACTCGCAATTCATCGTCTGGCAAAGGGA
ATCAAAGTTCATCTAGAAAGAAGAAGGCTACCAAAGTTGTTTCTTTGGCTGAAGCTGCCAAAGGATCAATTGTGTTCCAGCAAGGAAAACCATGTTCATGCCAAGCTCGT
CGTCATAGACTAGTCAGCAATTGTTTATCATGCGGCAAGATTGTATGTGAACAAGAGGGAGAAGGGCCATGCAGTTTTTGCGGTTCCCTTGTGCTGAGAGAAGGGAGCAC
CTATGCTGGTATGGATGAAGGTTTTACCCCACTTTCAGATGCTGAAGCAGCAGCTGAAGCTTATGCAAAAAGATTAGTTGAATATGACAGAAACTCTGCTGCAAGAACAT
CTGTAATCGATGATCAAAGTGATTATTACCAGTTCGAGGGTAATAGCTGGTTGTCTAACGAGGAAAAGGAACTTTTGAGAAAGAAACAAGAGGAGATTGAAGAGGCTGAA
CGAGCTAAACGAAACAAAGTGGTTGTAACCTTTGACTTGGTTGGCCGCAAGGTTCTTTTGAATGAAGATGATGCTTCTGAACTTGAATCACGCAACAATATCTTGCGGCC
ACCAGATGAAAGAGAAGTGAACAGGATTAAACCAAACCCATCTCTTCAAATACATCCTGTGTTTTTAGATCCAGGACCCAGAGAGAAATCCACCAAAGACAGAAACTCAA
ACAAAGCCGTAAGCAAAAAAGGCATTTGTTTGGAAATTACTGGAAGGGTGCAGCATGATCGCGATGAATTGAAGCATCTTATGATGGAAAATGATTTGAAAACGTCATTC
AATAGGAAAGCTTGGGAAGGGCTTTCTGTGAATCACCAAGCAGCAATTGCAGGACAATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACGTCAGGACAGTGGCTGGAGAAGGCGTTGGACGATCTCTGCAAGAAGATGGAAACTGGTTGGGGTCTCGATAAGGATATGATTTCGGGCTTGGTCTCGCAAAA
TCAACTCCTTCCCCTTCTTCAAATGAATCAAATGAATTTTATTGCGATTCAAACCTTCGTAAACAAGACATTATTATCCCCACGTAGATTGGTTTCCTCTGTCGCGACTG
TAGACAATGCGTCCAATTTTTCCTTCACCAAAATTGAAACTTTCCCTCTTTTCGATCCTTTAGAGTTGCTCAATGATTTTGTAAAATCGAGAAAATGCTCTTTGAGAAAC
ACGAAAGTTCTACACGCAAAGTTACTCCGAGCAACTCTTCTTCATTCCAATATCTATGTTTCAAATTCTTTGCTAGATTGTTATTCAAAGTCTAACGCTATGGACCATGC
ACTCAAACTGTTCGAGACAATGCTCCACCCAAATGTCATTTCTTGGAATATCATTATCTCGGGTTTCCACCACAAGTTCTTACATTTGGACTCGTGGAGAACATTTTGTA
GGATGCATTTCCTGGGTTTTGAACCTAGTGAGATAACATATGGGAGCGTTTTATCTGCTTGTGCTGCCATTCAAGCCCCAATGTTTGGTAAGCAGGTTTATTCTCTTGCG
GTGAGAAATGGGTTCTTTGTTAATGGTTATGTTCGAGCCGGAATGATTGATTTATTTGCAAAAGATTCTAGTTTTCTGGATGCTCTAAGGGTATTTCGTGATGTTGATTG
TGAGAATGTGGTGTGTTGGAATGCTATTGTCTCTGCAGCTGTAAGAAATGGTGAGAACTTGATGGCTCTGGATCTTTTCAACACAATGTGTAGTGGATTACTGGAGCCTA
ATAGTTTCACCTTTTCCAGTGTTCTAACTGCGTGTGCTGCACTTGAAGATCTTGAATCTGGGAAAAGAGTTCAAGGGAGAGTGATTAAATGTGGTGGAGAAGATGTTTTT
GTTGAGACAGCCCTTATTGATTTGTACACCAAGTGTGGAGATATGGATGGAGCTGTTAAGACCTTCTTGCGGATGCCCATTCGCAATGTGGTCTCTTGGACTGCTATAAT
ATCTGGCTTTGTGCAAAATAATGATTATTTAATGGCCCTCAAGTTTTTTGAAGATATGAGAAAAGTGGGAGAGGAAATTAATAGCTACACAGTTACTAGCGTGTTAACTG
CATGTGCTAATCCAGCCATGACAAAAGAAGCAACCCAACTTCATGCCTGGATTCTAAAAGCTGGTTTTTCTTCACATGCAGTGGTGGCGGCTGCTTTAATTAACATGTAT
TCAAAAACAGGAGCAATTGATCTTTCATTGATGGTTTTCAGAGAGATGGACAACCAAAGGAATCTTAGTTCTTGGACAGCTATGATAACTTCATTTGCACAGAATAATGA
TAAAGAGAAAGCAAGCAGAGAAATTCATGGTTACTCTGTTCGTGAGGGACTTGGCAAAGACGTAGCTGTTGGAAGTTCGCTTGTGACTATGTACTCGAAATGTGGCAACC
TGGAATTGGCTAGGAGGGTGTTTGAAACATTGCCCCAGAAAGATCATATTGCATGCTCTTCATTGGTTTCAGGATATGCTCAACAGAAGCGCAATGAAGAGGCTCTTTTG
CTATTCCACGATCTGCTGGTGGCTGGCTTAGCCATCGATTCCTTCTCAATCTCATCCGTACTGGGAGCTATTGCGCTTTTAAATAGGCCTGCAATTGGGATTCAAATCCA
TGCACTCATTATGAAAGTAGCCTTGGAGAAAGATGTTTCTGTTGGGAGTTCGCTAGTAATGGTATACTCCAAATGTGGAAGTATAGAAGACTGCTGCAAAGCATTTGGGC
AGATTGGAAAGCCTGATTTGATAGCCTGGACGGCCATGATTAACATCATAGGTCAGGAAGTTGGGAAAAGTGTGATAGATGAGTATTTGCGGCTGCGAGGCCACTCTGAC
CTCTGTAGCAAAACGTTGGATGTTCCATCTTCAACCTTACATGCCTATGTCAAGCCACCCTCCCATGAAGGTTCTTTTGGCGGATCCAAGAAACCTGTTAAAACACCAAA
GACCATTTCTATCTCCAGTAAAGAGATAGAACCAAAGAAGGCTACTAGCTCTAGTAACGTGGAAAATCAGGTTTCATCGGACACTCGCAATTCATCGTCTGGCAAAGGGA
ATCAAAGTTCATCTAGAAAGAAGAAGGCTACCAAAGTTGTTTCTTTGGCTGAAGCTGCCAAAGGATCAATTGTGTTCCAGCAAGGAAAACCATGTTCATGCCAAGCTCGT
CGTCATAGACTAGTCAGCAATTGTTTATCATGCGGCAAGATTGTATGTGAACAAGAGGGAGAAGGGCCATGCAGTTTTTGCGGTTCCCTTGTGCTGAGAGAAGGGAGCAC
CTATGCTGGTATGGATGAAGGTTTTACCCCACTTTCAGATGCTGAAGCAGCAGCTGAAGCTTATGCAAAAAGATTAGTTGAATATGACAGAAACTCTGCTGCAAGAACAT
CTGTAATCGATGATCAAAGTGATTATTACCAGTTCGAGGGTAATAGCTGGTTGTCTAACGAGGAAAAGGAACTTTTGAGAAAGAAACAAGAGGAGATTGAAGAGGCTGAA
CGAGCTAAACGAAACAAAGTGGTTGTAACCTTTGACTTGGTTGGCCGCAAGGTTCTTTTGAATGAAGATGATGCTTCTGAACTTGAATCACGCAACAATATCTTGCGGCC
ACCAGATGAAAGAGAAGTGAACAGGATTAAACCAAACCCATCTCTTCAAATACATCCTGTGTTTTTAGATCCAGGACCCAGAGAGAAATCCACCAAAGACAGAAACTCAA
ACAAAGCCGTAAGCAAAAAAGGCATTTGTTTGGAAATTACTGGAAGGGTGCAGCATGATCGCGATGAATTGAAGCATCTTATGATGGAAAATGATTTGAAAACGTCATTC
AATAGGAAAGCTTGGGAAGGGCTTTCTGTGAATCACCAAGCAGCAATTGCAGGACAATTATGA
Protein sequenceShow/hide protein sequence
MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVSQNQLLPLLQMNQMNFIAIQTFVNKTLLSPRRLVSSVATVDNASNFSFTKIETFPLFDPLELLNDFVKSRKCSLRN
TKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNAMDHALKLFETMLHPNVISWNIIISGFHHKFLHLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLA
VRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFRDVDCENVVCWNAIVSAAVRNGENLMALDLFNTMCSGLLEPNSFTFSSVLTACAALEDLESGKRVQGRVIKCGGEDVF
VETALIDLYTKCGDMDGAVKTFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKVGEEINSYTVTSVLTACANPAMTKEATQLHAWILKAGFSSHAVVAAALINMY
SKTGAIDLSLMVFREMDNQRNLSSWTAMITSFAQNNDKEKASREIHGYSVREGLGKDVAVGSSLVTMYSKCGNLELARRVFETLPQKDHIACSSLVSGYAQQKRNEEALL
LFHDLLVAGLAIDSFSISSVLGAIALLNRPAIGIQIHALIMKVALEKDVSVGSSLVMVYSKCGSIEDCCKAFGQIGKPDLIAWTAMINIIGQEVGKSVIDEYLRLRGHSD
LCSKTLDVPSSTLHAYVKPPSHEGSFGGSKKPVKTPKTISISSKEIEPKKATSSSNVENQVSSDTRNSSSGKGNQSSSRKKKATKVVSLAEAAKGSIVFQQGKPCSCQAR
RHRLVSNCLSCGKIVCEQEGEGPCSFCGSLVLREGSTYAGMDEGFTPLSDAEAAAEAYAKRLVEYDRNSAARTSVIDDQSDYYQFEGNSWLSNEEKELLRKKQEEIEEAE
RAKRNKVVVTFDLVGRKVLLNEDDASELESRNNILRPPDEREVNRIKPNPSLQIHPVFLDPGPREKSTKDRNSNKAVSKKGICLEITGRVQHDRDELKHLMMENDLKTSF
NRKAWEGLSVNHQAAIAGQL