CuGenDBv2

Lsi01G019470 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G019470
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr01:25095131..25098842
RNA-Seq Expression: Lsi01G019470
Synteny: Lsi01G019470
Gene Ontology terms:
    GO:0005515 - protein binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR032867 - DYW domain


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_008442211.1 PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X1 [Cucumis melo] (e value: 0.0e+00; % identity: 80.95)
Query:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS
        TSN   KC FHFK  +FIRCI +I +YSSN VSNQLLSELSK+GRVD+ARKLFDQMP RDKY+WNIMISAYANLGNLVEAR+LF+ETP KNSITWS+LVS
Subjt:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS

Query:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------
        GYCKNGCEVEGLRLFSQMWS+GQKPSQYTLGSVLRACSTL LLHSGK+IHCY                                                
Subjt:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------

Query:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS
                                                          VHGCIIWSGFG NVYVQSALVDMYAKCGDLASAR+ILN MEIDDVVCWNS
Subjt:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS

Query:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT
        MIVGCV HG+MEEALVLFHKMHNRDIRIDDFTYPS LKSLAS K+LK G+SVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCAL+VFN+I DKDVISWT
Subjt:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT

Query:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW
        SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSME RNVISW
Subjt:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW

Query:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA
        TAIIVGYAQNGRGKDSLHFYDQMI++GI PD VTFIGLLFACSHAGLVETG+SYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAE LLNRM+VEPDA
Subjt:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA

Query:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI
        TIWKSLLSACRVHGNLELGERAG+NLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIR +MKT+GINKEPGYSWIE KSQVH FISEDRSHPL+AEIYSKI
Subjt:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI

Query:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG
        D +MILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTV KGAPIRIFKNLRVCGDCHSAMKYISS+FKRHIILRDLNCFHHFIEGKCSCG
Subjt:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG

Query:  DFW
        DFW
Subjt:  DFW

XP_011653924.1 pentatricopeptide repeat-containing protein At2g03880, mitochondrial isoform X1 [Cucumis sativus] (e value: 0.0e+00; % identity: 81.69)
Query:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS
        TSN FTKC FHFKHP+FIRCI  I +YSSN  SNQLLSELSK+GRVD+ARKLFDQMP RDKY+WNIMISAYANLGNLVEARKLFNETP KNSITWSSLVS
Subjt:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS

Query:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------
        GYCKNGCEVEGLR FSQMWS+GQKPSQYTLGSVLRACSTL LLH+GK+IHCY                                                
Subjt:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------

Query:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS
                                                          VHGCIIWSGFG NVYVQSALVDMYAKCGDLASARMIL+ MEIDDVVCWNS
Subjt:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS

Query:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT
        MIVGCV HG+MEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCK+LK GESVHSL IKTGFDACKTVSNALVDMYAKQGNLSCAL+VFNKI DKDVISWT
Subjt:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT

Query:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW
        SLVTGYVHNGFHEKAL+LFCDMR ARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSS GSLLSAENSLITMYAKCGCLEDAIRVFDSME RNVISW
Subjt:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW

Query:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA
        TAIIVGYAQNGRGKDSLHFY+QMI+DGI PD VTFIGLLFACSHAGLVETG+SYFESMEKVYGIKPASDHYACMIDLLGRAGK+NEAE LLNRM+VEPDA
Subjt:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA

Query:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI
        TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRR+MKT+GINKEPGYSWIE KSQVHTFISEDRSHPL+AEIYSKI
Subjt:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI

Query:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG
        D +MILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTV KGAPIRIFKNLRVCGDCHSAMKYISS+FKRHIILRDLNCFHHFIEGKCSCG
Subjt:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG

Query:  DFW
        DFW
Subjt:  DFW

XP_022147252.1 pentatricopeptide repeat-containing protein At2g03880, mitochondrial [Momordica charantia] (e value: 0.0e+00; % identity: 78.33)
Query:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS
        TSNSFT+ YFHF++PVFIR I NIV YSSN VSNQLLSELSKDGRVDDARKLFD+MP RDKYSWNIMISAYAN GNLVEARKLF+ETPTKNSITWSSLVS
Subjt:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS

Query:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------
        GYC++GCEVEGLRLFSQMWS+GQKPSQYTLGSVLRACST+GLLH GK+IHCY                                                
Subjt:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------

Query:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS
                                                          VHGCI+WSGFGANV+VQSALVDMYAKCGDL SARM+L+IMEIDDVVCWNS
Subjt:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS

Query:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT
        MIVGCV  G+ EEALVLFHKMH+RD+RIDDFT+PS+L SLASC DLK GESVHSLIIKTGFDAC+TVSNALVDMY+KQGNL CA EVFNKI DKDVISWT
Subjt:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT

Query:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW
        SLVTGYVHNGFHEKALKLFCDMRIA V LDQFVVACVFSACAELTVIEFGRQVHA+FIK+SVGSLLSAENSL+TMYAKCGCLEDA RVFDSM NRNVISW
Subjt:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW

Query:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA
        TAIIVGYAQNGRGKDSLHFYDQMI++GI PDPVTFIGLLFACSHAGLVETGRSYF+SMEKVYGIKPA DHYACMIDLLGRAGKL+EAEDLLN+MEVEPDA
Subjt:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA

Query:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI
        T+WKSLLSACRVHGNLELGERAG+NLIKLEP NSLPYVLLSNMFSVAGRWED A IR+ MKT+GINKEPG SWIE KSQVHTFISEDRSHP++ EIYSKI
Subjt:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI

Query:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG
        D +MILIKEAG+VPDMNFALRDMDEE KERSLA+HSEKLA+AFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYIS VF RHIILRDLNCFHHF EGKCSCG
Subjt:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG

Query:  DFW
        D+W
Subjt:  DFW

XP_022967715.1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g03880, mitochondrial-like [Cucurbita maxima] (e value: 0.0e+00; % identity: 79.2)
Query:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS
        TSNSFTKC+         RCI N+VY SSN VSNQ LSELSKDGRVD+ARKLFD M  RD Y+WNIMISAYAN  N+VEARKLF+ETPTKNSITWSSLVS
Subjt:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS

Query:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------
        GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGK+IH Y                                                
Subjt:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------

Query:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS
                                                          VHGCII SGFGANVYVQSALVDMYAKCGDL SARM+LNIMEIDDVVCWNS
Subjt:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS

Query:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT
        MIVGCV HGHMEEALVLFHKMHNRDI IDDFTYPSVLKSL +C+DLKNGESVHSLI+KTGFDACKTVSNALVDMYAKQGNL+CALEVFNKISDKDVISWT
Subjt:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT

Query:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW
        SLVTGYVHNGFHEKALKLFCDMRIA VDLDQFV+ACVFSACAELT+IEFGRQVH NFIKSSVGSLLSAENSLITMYAKCGCLEDA RVFDSME RNVISW
Subjt:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW

Query:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA
        TAIIVGYAQNGRGKDSLHFYD+MI+DG+ PDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKP SDHYACMIDLLGRAGKLNEAE+LLNRM+VEPDA
Subjt:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA

Query:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI
        T+WKSLLSACRVHGNLELGERAGKNLIKLEP NSLPYVLLSNMFSVAGRWEDA HIR SMK +GINKEPGYSWIE KSQVH+FISEDRSHP++AEIYSKI
Subjt:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI

Query:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG
        D +MILIKEAG+VPDMNFALRDMDEEAKERSL YHSEKLAVAFGLL VP  APIRIFKNLRVCGDCHSAMKYISSVFKRH+ILRDLNCFHHF EGKCSCG
Subjt:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG

Query:  DFW
        DFW
Subjt:  DFW

XP_038883141.1 putative pentatricopeptide repeat-containing protein At3g15130 [Benincasa hispida] (e value: 0.0e+00; % identity: 83.81)
Query:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS
        TSNSFTKCYFHFKHPVF+RCI N+VYYSSNPVSNQLLSEL KDGRVD+ARK+FDQMP RDKY+WNIMISAYANLG+LVEARKLFN+TP KNSITWSSLVS
Subjt:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS

Query:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------
        GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGK+IHCY                                                
Subjt:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------

Query:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS
                                                          VHGCII SGFG NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS
Subjt:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS

Query:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT
        MIVGCV HG++EEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT
Subjt:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT

Query:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW
        SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSME RNVISW
Subjt:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW

Query:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA
        TAIIVGYAQNGRGKDSLHFY+QMI DGI PDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIK A DHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA
Subjt:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA

Query:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI
        TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKT+GINKEPGYSWIE KSQVHTFISEDRSHPL+AEIY+KI
Subjt:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI

Query:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG
        D +MILIKEAGHVPDMNFALRDMDEEAKERSL YHSEKLAVAFGLLT+ KGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG
Subjt:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG

Query:  DFW
        DFW
Subjt:  DFW

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3B568 putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X1 (e value: 0.0e+00; % identity: 80.95)
Query:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS
        TSN   KC FHFK  +FIRCI +I +YSSN VSNQLLSELSK+GRVD+ARKLFDQMP RDKY+WNIMISAYANLGNLVEAR+LF+ETP KNSITWS+LVS
Subjt:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS

Query:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------
        GYCKNGCEVEGLRLFSQMWS+GQKPSQYTLGSVLRACSTL LLHSGK+IHCY                                                
Subjt:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------

Query:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS
                                                          VHGCIIWSGFG NVYVQSALVDMYAKCGDLASAR+ILN MEIDDVVCWNS
Subjt:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS

Query:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT
        MIVGCV HG+MEEALVLFHKMHNRDIRIDDFTYPS LKSLAS K+LK G+SVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCAL+VFN+I DKDVISWT
Subjt:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT

Query:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW
        SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSME RNVISW
Subjt:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW

Query:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA
        TAIIVGYAQNGRGKDSLHFYDQMI++GI PD VTFIGLLFACSHAGLVETG+SYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAE LLNRM+VEPDA
Subjt:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA

Query:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI
        TIWKSLLSACRVHGNLELGERAG+NLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIR +MKT+GINKEPGYSWIE KSQVH FISEDRSHPL+AEIYSKI
Subjt:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI

Query:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG
        D +MILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTV KGAPIRIFKNLRVCGDCHSAMKYISS+FKRHIILRDLNCFHHFIEGKCSCG
Subjt:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG

Query:  DFW
        DFW
Subjt:  DFW

A0A6J1D1V5 pentatricopeptide repeat-containing protein At2g03880, mitochondrial (e value: 0.0e+00; % identity: 78.33)
Query:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS
        TSNSFT+ YFHF++PVFIR I NIV YSSN VSNQLLSELSKDGRVDDARKLFD+MP RDKYSWNIMISAYAN GNLVEARKLF+ETPTKNSITWSSLVS
Subjt:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS

Query:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------
        GYC++GCEVEGLRLFSQMWS+GQKPSQYTLGSVLRACST+GLLH GK+IHCY                                                
Subjt:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------

Query:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS
                                                          VHGCI+WSGFGANV+VQSALVDMYAKCGDL SARM+L+IMEIDDVVCWNS
Subjt:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS

Query:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT
        MIVGCV  G+ EEALVLFHKMH+RD+RIDDFT+PS+L SLASC DLK GESVHSLIIKTGFDAC+TVSNALVDMY+KQGNL CA EVFNKI DKDVISWT
Subjt:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT

Query:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW
        SLVTGYVHNGFHEKALKLFCDMRIA V LDQFVVACVFSACAELTVIEFGRQVHA+FIK+SVGSLLSAENSL+TMYAKCGCLEDA RVFDSM NRNVISW
Subjt:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW

Query:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA
        TAIIVGYAQNGRGKDSLHFYDQMI++GI PDPVTFIGLLFACSHAGLVETGRSYF+SMEKVYGIKPA DHYACMIDLLGRAGKL+EAEDLLN+MEVEPDA
Subjt:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA

Query:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI
        T+WKSLLSACRVHGNLELGERAG+NLIKLEP NSLPYVLLSNMFSVAGRWED A IR+ MKT+GINKEPG SWIE KSQVHTFISEDRSHP++ EIYSKI
Subjt:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI

Query:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG
        D +MILIKEAG+VPDMNFALRDMDEE KERSLA+HSEKLA+AFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYIS VF RHIILRDLNCFHHF EGKCSCG
Subjt:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG

Query:  DFW
        D+W
Subjt:  DFW

A0A6J1EL19 pentatricopeptide repeat-containing protein At2g03880, mitochondrial isoform X2 (e value: 0.0e+00; % identity: 79.79)
Query:  LSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA
        LSELSKDGRVD+ARKLFD MP RD Y+WNIMISAYAN  N+VEARKLF+ETPTKNSITWSSLVSGYC+NGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA
Subjt:  LSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRA

Query:  CSTLGLLHSGKIIHCY------------------------------------------------------------------------------------
        CSTLGLLHSGK+IH Y                                                                                    
Subjt:  CSTLGLLHSGKIIHCY------------------------------------------------------------------------------------

Query:  --------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSV
                      VHGCII SGFGANVYVQSALVDMYAKCGDL SARM+LNIMEIDDVVCWNSMIVGCV HGHMEEALVLFHKMHNRD  IDDFTYPSV
Subjt:  --------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSV

Query:  LKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVAC
        LKSL +C+DLKNGESVHSLI+KTGFDACKTVSNALVDMYAKQGNL+CALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIA VDLDQFV+AC
Subjt:  LKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVAC

Query:  VFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFI
        VFSACAELT+IEFGRQVH NFIKSSVGSLLSAENSLITMYAKCGCLEDA RVFDSM  RNVISWTAIIVGYAQNGRGKDSL FYD+MI+DG+ PDPVTFI
Subjt:  VFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFI

Query:  GLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLP
        GLLFACSHAGLVETGRSYFESMEKVYGIKP SDHYACMIDLLGRAGKLNEAE+LLNRM+VEPDAT+WKSLLSACRVHGNLELGERAGKNLIKLEP NSLP
Subjt:  GLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLP

Query:  YVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHS
        YVLLSNMFSVAGRWEDA HIR SMK +GINKEPGYSWIE KSQVH+FISEDRSHPL+AEIYSKID +MILIKEAG+VP MNFAL DMDEEAKERSL YHS
Subjt:  YVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHS

Query:  EKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
        EKLAVAFGLL VP GAPIRIFKNLRVCGDCHSAMKYISSVFKRH+ILRDLNCFHHF EGKCSCGDFW
Subjt:  EKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW

A0A6J1EPT9 pentatricopeptide repeat-containing protein At2g03880, mitochondrial isoform X1 (e value: 0.0e+00; % identity: 79.49)
Query:  RCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQM
        R I N VY SSN VSNQ LSELSKDGRVD+ARKLFD MP RD Y+WNIMISAYAN  N+VEARKLF+ETPTKNSITWSSLVSGYC+NGCEVEGLRLFSQM
Subjt:  RCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQM

Query:  WSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------------------------
        WSEGQKPSQYTLGSVLRACSTLGLLHSGK+IH Y                                                                  
Subjt:  WSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------------------------

Query:  --------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLF
                                        VHGCII SGFGANVYVQSALVDMYAKCGDL SARM+LNIMEIDDVVCWNSMIVGCV HGHMEEALVLF
Subjt:  --------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLF

Query:  HKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKL
        HKMHNRD  IDDFTYPSVLKSL +C+DLKNGESVHSLI+KTGFDACKTVSNALVDMYAKQGNL+CALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKL
Subjt:  HKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKL

Query:  FCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLH
        FCDMRIA VDLDQFV+ACVFSACAELT+IEFGRQVH NFIKSSVGSLLSAENSLITMYAKCGCLEDA RVFDSM  RNVISWTAIIVGYAQNGRGKDSL 
Subjt:  FCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLH

Query:  FYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLEL
        FYD+MI+DG+ PDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKP SDHYACMIDLLGRAGKLNEAE+LLNRM+VEPDAT+WKSLLSACRVHGNLEL
Subjt:  FYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLEL

Query:  GERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNF
        GERAGKNLIKLEP NSLPYVLLSNMFSVAGRWEDA HIR SMK +GINKEPGYSWIE KSQVH+FISEDRSHPL+AEIYSKID +MILIKEAG+VP MNF
Subjt:  GERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNF

Query:  ALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
        AL DMDEEAKERSL YHSEKLAVAFGLL VP GAPIRIFKNLRVCGDCHSAMKYISSVFKRH+ILRDLNCFHHF EGKCSCGDFW
Subjt:  ALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW

A0A6J1HV89 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g03880, mitochondrial-like (e value: 0.0e+00; % identity: 79.2)
Query:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS
        TSNSFTKC+         RCI N+VY SSN VSNQ LSELSKDGRVD+ARKLFD M  RD Y+WNIMISAYAN  N+VEARKLF+ETPTKNSITWSSLVS
Subjt:  TSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS

Query:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------
        GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGK+IH Y                                                
Subjt:  GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCY------------------------------------------------

Query:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS
                                                          VHGCII SGFGANVYVQSALVDMYAKCGDL SARM+LNIMEIDDVVCWNS
Subjt:  --------------------------------------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNS

Query:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT
        MIVGCV HGHMEEALVLFHKMHNRDI IDDFTYPSVLKSL +C+DLKNGESVHSLI+KTGFDACKTVSNALVDMYAKQGNL+CALEVFNKISDKDVISWT
Subjt:  MIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWT

Query:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW
        SLVTGYVHNGFHEKALKLFCDMRIA VDLDQFV+ACVFSACAELT+IEFGRQVH NFIKSSVGSLLSAENSLITMYAKCGCLEDA RVFDSME RNVISW
Subjt:  SLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISW

Query:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA
        TAIIVGYAQNGRGKDSLHFYD+MI+DG+ PDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKP SDHYACMIDLLGRAGKLNEAE+LLNRM+VEPDA
Subjt:  TAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDA

Query:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI
        T+WKSLLSACRVHGNLELGERAGKNLIKLEP NSLPYVLLSNMFSVAGRWEDA HIR SMK +GINKEPGYSWIE KSQVH+FISEDRSHP++AEIYSKI
Subjt:  TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKI

Query:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG
        D +MILIKEAG+VPDMNFALRDMDEEAKERSL YHSEKLAVAFGLL VP  APIRIFKNLRVCGDCHSAMKYISSVFKRH+ILRDLNCFHHF EGKCSCG
Subjt:  DGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCG

Query:  DFW
        DFW
Subjt:  DFW

SwissProt top hits (columns: e value, % identity, alignment)
Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic (e value: 5.2e-148; % identity: 39.3)
Query:  NIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGA
        N +++ Y  LG L  ++ L      ++ +TW++++S  C+N   +E L    +M  EG +P ++T+ SVL ACS L +L +GK +H Y    +       
Subjt:  NIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGA

Query:  NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKM-HNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGF
        N +V SALVDMY  C  + S R + + M    +  WN+MI G   + H +EAL+LF  M  +  +  +  T   V+ +          E++H  ++K G 
Subjt:  NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKM-HNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGF

Query:  DACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMR---------IARVDL--DQFVVACVFSACAELTVIEFG
        D  + V N L+DMY++ G +  A+ +F K+ D+D+++W +++TGYV +  HE AL L   M+          +RV L  +   +  +  +CA L+ +  G
Subjt:  DACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMR---------IARVDL--DQFVVACVFSACAELTVIEFG

Query:  RQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVET
        +++HA  IK+++ + ++  ++L+ MYAKCGCL+ + +VFD +  +NVI+W  II+ Y  +G G++++     M+V G+ P+ VTFI +  ACSH+G+V+ 
Subjt:  RQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVET

Query:  GRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVE-PDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGR
        G   F  M+  YG++P+SDHYAC++DLLGRAG++ EA  L+N M  +   A  W SLL A R+H NLE+GE A +NLI+LEP+ +  YVLL+N++S AG 
Subjt:  GRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVE-PDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGR

Query:  WEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVP
        W+ A  +RR+MK  G+ KEPG SWIE   +VH F++ D SHP S ++   ++ +   +++ G+VPD +  L +++E+ KE  L  HSEKLA+AFG+L   
Subjt:  WEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVP

Query:  KGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
         G  IR+ KNLRVC DCH A K+IS +  R IILRD+  FH F  G CSCGD+W
Subjt:  KGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW

Q9CAA8 Putative pentatricopeptide repeat-containing protein At1g68930 (e value: 6.8e-148; % identity: 38.23)
Query:  ARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSE-GQKPSQYTLGSVLRACSTLGLLHSGK
        AR++FD++P  + +SWN ++ AY+  G + E    F + P ++ +TW+ L+ GY  +G     ++ ++ M  +     ++ TL ++L+  S+ G +  GK
Subjt:  ARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSE-GQKPSQYTLGSVLRACSTLGLLHSGK

Query:  IIHCYVHGCIIWSGFGANVYVQSALVDMYAKCGDLASAR-------------------------MILNIMEI-----DDVVCWNSMIVGCVVHGHMEEAL
             +HG +I  GF + + V S L+ MYA  G ++ A+                         MI + +++      D V W +MI G   +G  +EA+
Subjt:  IIHCYVHGCIIWSGFGANVYVQSALVDMYAKCGDLASAR-------------------------MILNIMEI-----DDVVCWNSMIVGCVVHGHMEEAL

Query:  VLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKA
          F +M  + +++D + + SVL +      +  G+ +H+ II+T F     V +AL+DMY K   L  A  VF+++  K+V+SWT++V GY   G  E+A
Subjt:  VLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKA

Query:  LKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKD
        +K+F DM+ + +D D + +    SACA ++ +E G Q H   I S +   ++  NSL+T+Y KCG ++D+ R+F+ M  R+ +SWTA++  YAQ GR  +
Subjt:  LKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKD

Query:  SLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGN
        ++  +D+M+  G+ PD VT  G++ ACS AGLVE G+ YF+ M   YGI P+  HY+CMIDL  R+G+L EA   +N M   PDA  W +LLSACR  GN
Subjt:  SLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGN

Query:  LELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPD
        LE+G+ A ++LI+L+P +   Y LLS++++  G+W+  A +RR M+   + KEPG SWI+ K ++H+F ++D S P   +IY+K++ +   I + G+ PD
Subjt:  LELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPD

Query:  MNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
         +F   D++E  K + L YHSE+LA+AFGL+ VP G PIR+ KNLRVC DCH+A K+ISSV  R I++RD   FH F +G CSCGDFW
Subjt:  MNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW

Q9FIB2 Putative pentatricopeptide repeat-containing protein At5g09950 | 6.4e-146 | 40.9
Query:  NIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGA
        N +++ YA  G++ +AR++F     K+S++W+S+++G  +NGC +E +  +  M      P  +TL S L +C++L     G+     +HG  +  G   
Subjt:  NIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGA

Query:  NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGH--MEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTG
        NV V +AL+ +YA+ G L   R I + M   D V WNS I+G +      + EA+V F        +++  T+ SVL +++S    + G+ +H L +K  
Subjt:  NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGH--MEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTG

Query:  FDACKTVSNALVDMYAKQGNLSCALEVFNKISD-KDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIK
             T  NAL+  Y K G +    ++F+++++ +D ++W S+++GY+HN    KAL L   M      LD F+ A V SA A +  +E G +VHA  ++
Subjt:  FDACKTVSNALVDMYAKQGNLSCALEVFNKISD-KDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIK

Query:  SSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGIT-PDPVTFIGLLFACSHAGLVETGRSYFESM
        + + S +   ++L+ MY+KCG L+ A+R F++M  RN  SW ++I GYA++G+G+++L  ++ M +DG T PD VTF+G+L ACSHAGL+E G  +FESM
Subjt:  SSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGIT-PDPVTFIGLLFACSHAGLVETGRSYFESM

Query:  EKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSA-CRVHG-NLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHI
           YG+ P  +H++CM D+LGRAG+L++ ED + +M ++P+  IW+++L A CR +G   ELG++A + L +LEP N++ YVLL NM++  GRWED    
Subjt:  EKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSA-CRVHG-NLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHI

Query:  RRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGA-PIR
        R+ MK   + KE GYSW+  K  VH F++ D+SHP +  IY K+  +   +++AG+VP   FAL D+++E KE  L+YHSEKLAVAF L        PIR
Subjt:  RRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGA-PIR

Query:  IFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
        I KNLRVCGDCHSA KYIS +  R IILRD N FHHF +G CSC DFW
Subjt:  IFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070 | 1.6e-157 | 38.48
Query:  NQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSV
        N L++  SK G    ARKLFD+MP R  +SWN ++SAY+  G++    + F++ P ++S++W++++ GY   G   + +R+   M  EG +P+Q+TL +V
Subjt:  NQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSV

Query:  LRACSTLGLLHSGKIIHCYVHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDI-----
        L + +    + +GK +H +    I+  G   NV V ++L++MYAKCGD   A+ + + M + D+  WN+MI   +  G M+ A+  F +M  RDI     
Subjt:  LRACSTLGLLHSGKIIHCYVHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDI-----

Query:  ---------------------------RIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYA-----------------------
                                     D FT  SVL + A+ + L  G+ +HS I+ TGFD    V NAL+ MY+                       
Subjt:  ---------------------------RIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYA-----------------------

Query:  ----------KQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLL
                  K G+++ A  +F  + D+DV++WT+++ GY  +G + +A+ LF  M       + + +A + S  + L  +  G+Q+H + +KS     +
Subjt:  ----------KQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLL

Query:  SAENSLITMYAKCGCLEDAIRVFDSME-NRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIK
        S  N+LITMYAK G +  A R FD +   R+ +SWT++I+  AQ+G  +++L  ++ M+++G+ PD +T++G+  AC+HAGLV  GR YF+ M+ V  I 
Subjt:  SAENSLITMYAKCGCLEDAIRVFDSME-NRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIK

Query:  PASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGI
        P   HYACM+DL GRAG L EA++ + +M +EPD   W SLLSACRVH N++LG+ A + L+ LEP NS  Y  L+N++S  G+WE+AA IR+SMK   +
Subjt:  PASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGI

Query:  NKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGD
         KE G+SWIE K +VH F  ED +HP   EIY  +  I   IK+ G+VPD    L D++EE KE+ L +HSEKLA+AFGL++ P    +RI KNLRVC D
Subjt:  NKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGD

Query:  CHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
        CH+A+K+IS +  R II+RD   FHHF +G CSC D+W
Subjt:  CHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g33170 | 1.3e-154 | 42.59
Query:  NIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGA
        N +I+ Y  L     AR +F+    ++ I+W+S+++G  +NG EVE + LF Q+   G KP QYT+ SVL+A S+   L  G  +   VH   I     +
Subjt:  NIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGA

Query:  NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFD
        + +V +AL+D Y++   +  A ++       D+V WN+M+ G        + L LF  MH +  R DDFT  +V K+      +  G+ VH+  IK+G+D
Subjt:  NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFD

Query:  ACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSV
            VS+ ++DMY K G++S A   F+ I   D ++WT++++G + NG  E+A  +F  MR+  V  D+F +A +  A + LT +E GRQ+HAN +K + 
Subjt:  ACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSV

Query:  GSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVY
         +      SL+ MYAKCG ++DA  +F  +E  N+ +W A++VG AQ+G GK++L  + QM   GI PD VTFIG+L ACSH+GLV     +  SM   Y
Subjt:  GSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVY

Query:  GIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKT
        GIKP  +HY+C+ D LGRAG + +AE+L+  M +E  A+++++LL+ACRV G+ E G+R    L++LEP +S  YVLLSNM++ A +W++    R  MK 
Subjt:  GIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKT

Query:  LGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRV
          + K+PG+SWIE K+++H F+ +DRS+  +  IY K+  ++  IK+ G+VP+ +F L D++EE KER+L YHSEKLAVAFGLL+ P   PIR+ KNLRV
Subjt:  LGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRV

Query:  CGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
        CGDCH+AMKYI+ V+ R I+LRD N FH F +G CSCGD+W
Subjt:  CGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW

Arabidopsis top hits | e value | % identity | Alignment
AT1G68930.1 pentatricopeptide (PPR) repeat-containing protein | 4.8e-149 | 38.23
Query:  ARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSE-GQKPSQYTLGSVLRACSTLGLLHSGK
        AR++FD++P  + +SWN ++ AY+  G + E    F + P ++ +TW+ L+ GY  +G     ++ ++ M  +     ++ TL ++L+  S+ G +  GK
Subjt:  ARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSE-GQKPSQYTLGSVLRACSTLGLLHSGK

Query:  IIHCYVHGCIIWSGFGANVYVQSALVDMYAKCGDLASAR-------------------------MILNIMEI-----DDVVCWNSMIVGCVVHGHMEEAL
             +HG +I  GF + + V S L+ MYA  G ++ A+                         MI + +++      D V W +MI G   +G  +EA+
Subjt:  IIHCYVHGCIIWSGFGANVYVQSALVDMYAKCGDLASAR-------------------------MILNIMEI-----DDVVCWNSMIVGCVVHGHMEEAL

Query:  VLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKA
          F +M  + +++D + + SVL +      +  G+ +H+ II+T F     V +AL+DMY K   L  A  VF+++  K+V+SWT++V GY   G  E+A
Subjt:  VLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKA

Query:  LKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKD
        +K+F DM+ + +D D + +    SACA ++ +E G Q H   I S +   ++  NSL+T+Y KCG ++D+ R+F+ M  R+ +SWTA++  YAQ GR  +
Subjt:  LKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKD

Query:  SLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGN
        ++  +D+M+  G+ PD VT  G++ ACS AGLVE G+ YF+ M   YGI P+  HY+CMIDL  R+G+L EA   +N M   PDA  W +LLSACR  GN
Subjt:  SLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGN

Query:  LELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPD
        LE+G+ A ++LI+L+P +   Y LLS++++  G+W+  A +RR M+   + KEPG SWI+ K ++H+F ++D S P   +IY+K++ +   I + G+ PD
Subjt:  LELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPD

Query:  MNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
         +F   D++E  K + L YHSE+LA+AFGL+ VP G PIR+ KNLRVC DCH+A K+ISSV  R I++RD   FH F +G CSCGDFW
Subjt:  MNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein | 1.1e-158 | 38.48
Query:  NQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSV
        N L++  SK G    ARKLFD+MP R  +SWN ++SAY+  G++    + F++ P ++S++W++++ GY   G   + +R+   M  EG +P+Q+TL +V
Subjt:  NQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSV

Query:  LRACSTLGLLHSGKIIHCYVHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDI-----
        L + +    + +GK +H +    I+  G   NV V ++L++MYAKCGD   A+ + + M + D+  WN+MI   +  G M+ A+  F +M  RDI     
Subjt:  LRACSTLGLLHSGKIIHCYVHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDI-----

Query:  ---------------------------RIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYA-----------------------
                                     D FT  SVL + A+ + L  G+ +HS I+ TGFD    V NAL+ MY+                       
Subjt:  ---------------------------RIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYA-----------------------

Query:  ----------KQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLL
                  K G+++ A  +F  + D+DV++WT+++ GY  +G + +A+ LF  M       + + +A + S  + L  +  G+Q+H + +KS     +
Subjt:  ----------KQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLL

Query:  SAENSLITMYAKCGCLEDAIRVFDSME-NRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIK
        S  N+LITMYAK G +  A R FD +   R+ +SWT++I+  AQ+G  +++L  ++ M+++G+ PD +T++G+  AC+HAGLV  GR YF+ M+ V  I 
Subjt:  SAENSLITMYAKCGCLEDAIRVFDSME-NRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIK

Query:  PASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGI
        P   HYACM+DL GRAG L EA++ + +M +EPD   W SLLSACRVH N++LG+ A + L+ LEP NS  Y  L+N++S  G+WE+AA IR+SMK   +
Subjt:  PASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGI

Query:  NKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGD
         KE G+SWIE K +VH F  ED +HP   EIY  +  I   IK+ G+VPD    L D++EE KE+ L +HSEKLA+AFGL++ P    +RI KNLRVC D
Subjt:  NKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGD

Query:  CHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
        CH+A+K+IS +  R II+RD   FHHF +G CSC D+W
Subjt:  CHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 3.7e-149 | 39.3
Query:  NIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGA
        N +++ Y  LG L  ++ L      ++ +TW++++S  C+N   +E L    +M  EG +P ++T+ SVL ACS L +L +GK +H Y    +       
Subjt:  NIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGA

Query:  NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKM-HNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGF
        N +V SALVDMY  C  + S R + + M    +  WN+MI G   + H +EAL+LF  M  +  +  +  T   V+ +          E++H  ++K G 
Subjt:  NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKM-HNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGF

Query:  DACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMR---------IARVDL--DQFVVACVFSACAELTVIEFG
        D  + V N L+DMY++ G +  A+ +F K+ D+D+++W +++TGYV +  HE AL L   M+          +RV L  +   +  +  +CA L+ +  G
Subjt:  DACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMR---------IARVDL--DQFVVACVFSACAELTVIEFG

Query:  RQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVET
        +++HA  IK+++ + ++  ++L+ MYAKCGCL+ + +VFD +  +NVI+W  II+ Y  +G G++++     M+V G+ P+ VTFI +  ACSH+G+V+ 
Subjt:  RQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVET

Query:  GRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVE-PDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGR
        G   F  M+  YG++P+SDHYAC++DLLGRAG++ EA  L+N M  +   A  W SLL A R+H NLE+GE A +NLI+LEP+ +  YVLL+N++S AG 
Subjt:  GRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVE-PDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGR

Query:  WEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVP
        W+ A  +RR+MK  G+ KEPG SWIE   +VH F++ D SHP S ++   ++ +   +++ G+VPD +  L +++E+ KE  L  HSEKLA+AFG+L   
Subjt:  WEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVP

Query:  KGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
         G  IR+ KNLRVC DCH A K+IS +  R IILRD+  FH F  G CSCGD+W
Subjt:  KGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW

AT3G61170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.6e-205 | 49.87
Query:  SNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGS
        SN LL +LSK GRVD+AR++FD+MP+RD+++WN MI AY+N   L +A KLF   P KN+I+W++L+SGYCK+G +VE   LF +M S+G KP++YTLGS
Subjt:  SNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGS

Query:  VLRACSTLGLLHSGKIIH-----------------------------------------------------------------CY---------------
        VLR C++L LL  G+ IH                                                                 C+               
Subjt:  VLRACSTLGLLHSGKIIH-----------------------------------------------------------------CY---------------

Query:  ------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDIRIDDFT
                          VH CI+ SGF  N+YVQSAL+DMYAKC ++ SAR +L  ME+DDVV WNSMIVGCV  G + EAL +F +MH RD++IDDFT
Subjt:  ------------------VHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDIRIDDFT

Query:  YPSVLKSLA-SCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQ
         PS+L   A S  ++K   S H LI+KTG+   K V+NALVDMYAK+G +  AL+VF  + +KDVISWT+LVTG  HNG +++ALKLFC+MR+  +  D+
Subjt:  YPSVLKSLA-SCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQ

Query:  FVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPD
         V A V SA AELT++EFG+QVH N+IKS   S LS  NSL+TMY KCG LEDA  +F+SME R++I+WT +IVGYA+N                     
Subjt:  FVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPD

Query:  PVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEP
                      GL+E  + YF+SM  VYGI P  +HYACMIDL GR+G   + E LL++MEVEPDAT+WK++L+A R HGN+E GERA K L++LEP
Subjt:  PVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEP

Query:  SNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERS
        +N++PYV LSNM+S AGR ++AA++RR MK+  I+KEPG SW+E K +VH+F+SEDR HP   EIYSK+D +M+LIKEAG+  DM+FAL D+D+E KE  
Subjt:  SNSLPYVLLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERS

Query:  LAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYI
        LAYHSEKLAVAFGLL VP GAPIRI KNLRVCGDCHSAMK +
Subjt:  LAYHSEKLAVAFGLLTVPKGAPIRIFKNLRVCGDCHSAMKYI

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 9.1e-156 | 42.59
Query:  NIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGA
        N +I+ Y  L     AR +F+    ++ I+W+S+++G  +NG EVE + LF Q+   G KP QYT+ SVL+A S+   L  G  +   VH   I     +
Subjt:  NIMISAYANLGNLVEARKLFNETPTKNSITWSSLVSGYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGA

Query:  NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFD
        + +V +AL+D Y++   +  A ++       D+V WN+M+ G        + L LF  MH +  R DDFT  +V K+      +  G+ VH+  IK+G+D
Subjt:  NVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVHGHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFD

Query:  ACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSV
            VS+ ++DMY K G++S A   F+ I   D ++WT++++G + NG  E+A  +F  MR+  V  D+F +A +  A + LT +E GRQ+HAN +K + 
Subjt:  ACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSV

Query:  GSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVY
         +      SL+ MYAKCG ++DA  +F  +E  N+ +W A++VG AQ+G GK++L  + QM   GI PD VTFIG+L ACSH+GLV     +  SM   Y
Subjt:  GSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGITPDPVTFIGLLFACSHAGLVETGRSYFESMEKVY

Query:  GIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKT
        GIKP  +HY+C+ D LGRAG + +AE+L+  M +E  A+++++LL+ACRV G+ E G+R    L++LEP +S  YVLLSNM++ A +W++    R  MK 
Subjt:  GIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRSMKT

Query:  LGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRV
          + K+PG+SWIE K+++H F+ +DRS+  +  IY K+  ++  IK+ G+VP+ +F L D++EE KER+L YHSEKLAVAFGLL+ P   PIR+ KNLRV
Subjt:  LGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVPKGAPIRIFKNLRV

Query:  CGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW
        CGDCH+AMKYI+ V+ R I+LRD N FH F +G CSCGD+W
Subjt:  CGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW


Sequences
CDS sequence
ATGTCAAAATCAGTGTGTCCATATTTCAAATCGAAAAATCTCCAACTCTTCTTCTACCGCCTTCATACTGCAGTGCAGCCAGCCGCCACCTCCTTTGCCGACTCTCCAGC
TGAAAATCCACGACGACGAGCGGCTTCTCCCTCTCTTCCGGAAAATTTAGCCCTGCGACCACGAGTGTATTTCTTCTTCGGCGAACTCCCCTATTGCAGCACTCCCTTCT
GCTCGACGTTTCTGTCGTTACCCACGCAAGTCCCACAACATCCACTGCTGTGGACGACGTCGAAAACCTCCGACGGCTGGCGGCACGGTTCTCTGCGAAAACCCATCTGT
GCGACGGCGACGAGATTCTTGACGGTGGTAACTTCAAACTCTTTCACCAAGTGCTACTTTCACTTCAAGCACCCTGTTTTTATTCGTTGCATCGGCAACATTGTGTATTA
TTCATCGAATCCTGTCTCCAATCAGCTTCTGAGTGAGTTATCTAAAGATGGTCGAGTTGATGATGCGCGTAAGTTATTTGATCAAATGCCTGATCGGGACAAGTACTCAT
GGAATATTATGATTTCTGCTTATGCCAATTTAGGAAATTTAGTTGAAGCTCGCAAGCTTTTCAATGAAACTCCAACTAAAAATTCTATCACTTGGTCATCCCTAGTATCC
GGATATTGCAAAAATGGTTGTGAAGTTGAAGGCTTGAGGCTGTTCAGCCAAATGTGGAGTGAAGGGCAGAAGCCAAGCCAATACACATTGGGCAGTGTTTTAAGAGCATG
TTCAACTTTAGGTTTGCTCCACAGTGGCAAAATTATTCATTGCTATGTACATGGATGTATTATTTGGAGTGGTTTTGGTGCTAATGTATATGTTCAAAGTGCATTAGTTG
ATATGTATGCCAAATGTGGAGACTTGGCTAGTGCAAGAATGATACTGAATATCATGGAAATTGATGATGTTGTGTGCTGGAACTCGATGATTGTTGGGTGTGTGGTACAT
GGACATATGGAGGAAGCTCTAGTTTTGTTCCATAAGATGCATAATCGGGATATAAGAATTGATGATTTCACATATCCATCTGTTTTGAAATCTCTGGCTTCTTGTAAGGA
CCTGAAAAATGGAGAATCAGTTCATTCGCTGATTATTAAAACTGGTTTTGATGCTTGCAAAACGGTGAGCAATGCGCTTGTTGACATGTATGCTAAACAGGGAAACTTGA
GTTGTGCATTAGAGGTTTTCAATAAGATATCAGATAAAGATGTAATATCGTGGACCTCCTTGGTCACAGGATATGTTCACAATGGCTTCCATGAAAAAGCTCTCAAGTTA
TTTTGTGACATGAGAATTGCAAGGGTTGATCTCGACCAATTCGTAGTTGCCTGTGTTTTTAGTGCATGTGCTGAACTAACAGTTATAGAGTTTGGTCGACAGGTTCATGC
AAACTTTATCAAATCTAGTGTTGGTTCATTGTTATCTGCGGAGAACTCTCTCATAACAATGTATGCGAAATGTGGATGCTTAGAAGATGCAATTAGAGTCTTTGACTCGA
TGGAAAATCGAAATGTCATATCATGGACTGCCATAATAGTTGGTTATGCACAAAATGGGAGAGGGAAGGACTCTCTTCATTTTTATGATCAAATGATAGTTGATGGCATA
ACGCCAGATCCTGTTACTTTTATTGGTTTGTTGTTTGCTTGCAGCCATGCAGGTCTTGTGGAAACTGGCCGATCTTACTTTGAATCAATGGAAAAAGTTTATGGAATAAA
GCCAGCTTCTGATCATTATGCTTGCATGATTGATCTACTGGGACGTGCAGGAAAACTCAATGAGGCAGAGGATTTATTGAACCGAATGGAGGTTGAACCCGATGCAACCA
TATGGAAGTCATTACTTTCAGCATGTAGGGTTCATGGGAACTTAGAACTTGGAGAAAGGGCTGGAAAAAACCTCATTAAATTGGAACCTTCAAATTCTCTGCCTTATGTA
TTATTGTCCAATATGTTCTCTGTTGCTGGTAGATGGGAAGATGCAGCACATATTCGTAGATCAATGAAAACATTGGGTATTAACAAGGAGCCCGGATATAGTTGGATTGA
AACGAAGAGCCAAGTGCATACTTTTATATCTGAAGATAGGAGCCATCCTTTGTCGGCTGAAATATATTCAAAGATTGATGGAATTATGATCTTAATAAAGGAAGCTGGGC
ATGTACCAGATATGAACTTTGCATTACGTGACATGGATGAAGAGGCTAAGGAACGTAGTCTAGCATATCATAGCGAGAAGTTGGCTGTTGCATTTGGACTTCTCACAGTC
CCGAAAGGAGCACCGATTCGAATTTTCAAGAATCTTAGAGTATGCGGGGACTGCCACTCAGCAATGAAATATATATCTAGCGTTTTTAAGCGGCATATTATTTTGAGAGA
TTTAAATTGTTTCCATCACTTCATCGAGGGAAAATGTTCTTGTGGAGACTTCTGGTAG
mRNA sequence
ATGTCAAAATCAGTGTGTCCATATTTCAAATCGAAAAATCTCCAACTCTTCTTCTACCGCCTTCATACTGCAGTGCAGCCAGCCGCCACCTCCTTTGCCGACTCTCCAGC
TGAAAATCCACGACGACGAGCGGCTTCTCCCTCTCTTCCGGAAAATTTAGCCCTGCGACCACGAGTGTATTTCTTCTTCGGCGAACTCCCCTATTGCAGCACTCCCTTCT
GCTCGACGTTTCTGTCGTTACCCACGCAAGTCCCACAACATCCACTGCTGTGGACGACGTCGAAAACCTCCGACGGCTGGCGGCACGGTTCTCTGCGAAAACCCATCTGT
GCGACGGCGACGAGATTCTTGACGGTGGTAACTTCAAACTCTTTCACCAAGTGCTACTTTCACTTCAAGCACCCTGTTTTTATTCGTTGCATCGGCAACATTGTGTATTA
TTCATCGAATCCTGTCTCCAATCAGCTTCTGAGTGAGTTATCTAAAGATGGTCGAGTTGATGATGCGCGTAAGTTATTTGATCAAATGCCTGATCGGGACAAGTACTCAT
GGAATATTATGATTTCTGCTTATGCCAATTTAGGAAATTTAGTTGAAGCTCGCAAGCTTTTCAATGAAACTCCAACTAAAAATTCTATCACTTGGTCATCCCTAGTATCC
GGATATTGCAAAAATGGTTGTGAAGTTGAAGGCTTGAGGCTGTTCAGCCAAATGTGGAGTGAAGGGCAGAAGCCAAGCCAATACACATTGGGCAGTGTTTTAAGAGCATG
TTCAACTTTAGGTTTGCTCCACAGTGGCAAAATTATTCATTGCTATGTACATGGATGTATTATTTGGAGTGGTTTTGGTGCTAATGTATATGTTCAAAGTGCATTAGTTG
ATATGTATGCCAAATGTGGAGACTTGGCTAGTGCAAGAATGATACTGAATATCATGGAAATTGATGATGTTGTGTGCTGGAACTCGATGATTGTTGGGTGTGTGGTACAT
GGACATATGGAGGAAGCTCTAGTTTTGTTCCATAAGATGCATAATCGGGATATAAGAATTGATGATTTCACATATCCATCTGTTTTGAAATCTCTGGCTTCTTGTAAGGA
CCTGAAAAATGGAGAATCAGTTCATTCGCTGATTATTAAAACTGGTTTTGATGCTTGCAAAACGGTGAGCAATGCGCTTGTTGACATGTATGCTAAACAGGGAAACTTGA
GTTGTGCATTAGAGGTTTTCAATAAGATATCAGATAAAGATGTAATATCGTGGACCTCCTTGGTCACAGGATATGTTCACAATGGCTTCCATGAAAAAGCTCTCAAGTTA
TTTTGTGACATGAGAATTGCAAGGGTTGATCTCGACCAATTCGTAGTTGCCTGTGTTTTTAGTGCATGTGCTGAACTAACAGTTATAGAGTTTGGTCGACAGGTTCATGC
AAACTTTATCAAATCTAGTGTTGGTTCATTGTTATCTGCGGAGAACTCTCTCATAACAATGTATGCGAAATGTGGATGCTTAGAAGATGCAATTAGAGTCTTTGACTCGA
TGGAAAATCGAAATGTCATATCATGGACTGCCATAATAGTTGGTTATGCACAAAATGGGAGAGGGAAGGACTCTCTTCATTTTTATGATCAAATGATAGTTGATGGCATA
ACGCCAGATCCTGTTACTTTTATTGGTTTGTTGTTTGCTTGCAGCCATGCAGGTCTTGTGGAAACTGGCCGATCTTACTTTGAATCAATGGAAAAAGTTTATGGAATAAA
GCCAGCTTCTGATCATTATGCTTGCATGATTGATCTACTGGGACGTGCAGGAAAACTCAATGAGGCAGAGGATTTATTGAACCGAATGGAGGTTGAACCCGATGCAACCA
TATGGAAGTCATTACTTTCAGCATGTAGGGTTCATGGGAACTTAGAACTTGGAGAAAGGGCTGGAAAAAACCTCATTAAATTGGAACCTTCAAATTCTCTGCCTTATGTA
TTATTGTCCAATATGTTCTCTGTTGCTGGTAGATGGGAAGATGCAGCACATATTCGTAGATCAATGAAAACATTGGGTATTAACAAGGAGCCCGGATATAGTTGGATTGA
AACGAAGAGCCAAGTGCATACTTTTATATCTGAAGATAGGAGCCATCCTTTGTCGGCTGAAATATATTCAAAGATTGATGGAATTATGATCTTAATAAAGGAAGCTGGGC
ATGTACCAGATATGAACTTTGCATTACGTGACATGGATGAAGAGGCTAAGGAACGTAGTCTAGCATATCATAGCGAGAAGTTGGCTGTTGCATTTGGACTTCTCACAGTC
CCGAAAGGAGCACCGATTCGAATTTTCAAGAATCTTAGAGTATGCGGGGACTGCCACTCAGCAATGAAATATATATCTAGCGTTTTTAAGCGGCATATTATTTTGAGAGA
TTTAAATTGTTTCCATCACTTCATCGAGGGAAAATGTTCTTGTGGAGACTTCTGGTAG
Protein sequence
MSKSVCPYFKSKNLQLFFYRLHTAVQPAATSFADSPAENPRRRAASPSLPENLALRPRVYFFFGELPYCSTPFCSTFLSLPTQVPQHPLLWTTSKTSDGWRHGSLRKPIC
ATATRFLTVVTSNSFTKCYFHFKHPVFIRCIGNIVYYSSNPVSNQLLSELSKDGRVDDARKLFDQMPDRDKYSWNIMISAYANLGNLVEARKLFNETPTKNSITWSSLVS
GYCKNGCEVEGLRLFSQMWSEGQKPSQYTLGSVLRACSTLGLLHSGKIIHCYVHGCIIWSGFGANVYVQSALVDMYAKCGDLASARMILNIMEIDDVVCWNSMIVGCVVH
GHMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKDLKNGESVHSLIIKTGFDACKTVSNALVDMYAKQGNLSCALEVFNKISDKDVISWTSLVTGYVHNGFHEKALKL
FCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYAKCGCLEDAIRVFDSMENRNVISWTAIIVGYAQNGRGKDSLHFYDQMIVDGI
TPDPVTFIGLLFACSHAGLVETGRSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEDLLNRMEVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYV
LLSNMFSVAGRWEDAAHIRRSMKTLGINKEPGYSWIETKSQVHTFISEDRSHPLSAEIYSKIDGIMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTV
PKGAPIRIFKNLRVCGDCHSAMKYISSVFKRHIILRDLNCFHHFIEGKCSCGDFW