; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002844 (gene) of Snake gourd v1 genome

Gene IDTan0002844
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG02:65203481..65207796
RNA-Seq ExpressionTan0002844
SyntenyTan0002844
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0010239 - chloroplast mRNA processing (biological process)
GO:0042644 - chloroplast nucleoid (cellular component)
GO:0042651 - thylakoid membrane (cellular component)
GO:0003727 - single-stranded RNA binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464858.1 PREDICTED: pentatricopeptide repeat-containing protein At5g42310, mitochondrial [Cucumis melo]0.0e+0091.42Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHI-ATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPK
        MLLLSPLPL TRFPAT   SP VFLH  NPHI  THLSF FISAAA A   + SSVVTCYTSS++LE DVFE+D VSLQS RYDFTPLLDFLS SSAYPK
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHI-ATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPK

Query:  SDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE
        SD D+DSEVEFD  L+S S+SD ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE
Subjt:  SDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE

Query:  AFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDP
        AFILSQ+QTLTPLTYNALIGACARNNDLEKALNLMSRMRQDG+QSDF+NYSLIIQSLTRTNKID+PILQKLYEEIESDKIELDG LLNDIILGFAKAGDP
Subjt:  AFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDP

Query:  NRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYA
        NRALYFLSMVQASGLNPKTSTFVA+ISALGN+GRTEEAEAIFEEMKEGGL+PRIKA NALLKGY +KGSLKEAESIVSEMEKSGLSPDEHTYGLL+DAYA
Subjt:  NRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYA

Query:  NVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL
        NVGRWESAR LLK+MEA+NVQPN+FIFSRILASYRDRGEWQ+TFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL
Subjt:  NVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL

Query:  IDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSST
        IDCH KHGYH+RA+ELFEEMQERGYLPC TTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNV+TYTTLVDIYG SGRFNDAI+CLEAMKSAGLKPS+T
Subjt:  IDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSST

Query:  MYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGC
        MYNALINAFAQRGLSEQAVNAYRVM SDGL+PSLLALNSLINAFGEDRRD+EAF++LQYMKENDVKPDVVTYTTLMKALIRVDKF+KVPAVYEEMILSGC
Subjt:  MYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGC

Query:  TPDGKARAMLRSALRYMKRTLSL
        TPDGKARAMLRSALRYMKRTLSL
Subjt:  TPDGKARAMLRSALRYMKRTLSL

XP_022946456.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic [Cucurbita moschata]0.0e+0092.66Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKS
        MLLLSPLPL TRFPATQ PSPTVFLH QNP I THLSFP ISAAA A T+S SSVVTC TSS++LELDVFENDHVS QS RYDFTPLLDFLS S AYPKS
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKS

Query:  DYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA
          DSDSEVEFDSVLDS+SESD ASPTSLDPTEFQLAE YRAVPAPLWHSLLKSLC+SSSSIGLGYAVV WLQKHNLCFSYELLYSILIHALGRSEKLYEA
Subjt:  DYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA

Query:  FILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN
        FILSQ QTLTPLTYNALIGACARNND EKALNL+SRMRQDGYQSDF+NYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN
Subjt:  FILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN

Query:  RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYAN
        RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGL+PRIKA NALLKGY KKGSLKEAESIVSEMEKSGLSPDEHTYGLL+DAYAN
Subjt:  RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYAN

Query:  VGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLI
        VG W+SAR LLK+MEA+NVQPN+FIFSRILASYRDRGEWQ+TFEVLREMKN NVKPDRHFYNVMIDTFGKFNC+DHAMETY+RMLSEGIEPDVVTWNTLI
Subjt:  VGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLI

Query:  DCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM
        DCH KHGYHERA+ELFEEMQERGY PC TTYNIMINSLGEQEKWDEVK+LLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM
Subjt:  DCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM

Query:  YNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCT
        YNALINAFAQ+GLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAF +LQYMKENDVKPDVVTYTTLMKALIRV+KF+KVPAVYEEMILSGCT
Subjt:  YNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCT

Query:  PDGKARAMLRSALRYMKRTLSL
        PDGKARAMLRSAL+YMKRTLSL
Subjt:  PDGKARAMLRSALRYMKRTLSL

XP_022999638.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic [Cucurbita maxima]0.0e+0092.8Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKS
        MLLLSPLPL TRFPATQ PSPTVFLH QNP I THLSFP ISAAA A T+S SSVVTC TSS++LELDVFENDHVS QS RYDFTPLLDFLS S AYPKS
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKS

Query:  DYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA
          DSDSEVEFDSVLDS+SESD ASPTSLDPTEFQLAE YRAVPAPLWHSLLKSLC+SSSSIGLGYAVV WLQKHNLCFSYELLYSILIHALGRSEKLYEA
Subjt:  DYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA

Query:  FILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN
        FILSQ QTLTPLTYNALIGACARNND EKALNL+SRMRQDGYQSDF+NYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN
Subjt:  FILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN

Query:  RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYAN
        RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGL+PRIKA NALLKGY KKGSLKEAESIVSEMEKSGLSPDEHTYGLL+DAYAN
Subjt:  RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYAN

Query:  VGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLI
        VG W+SAR LLK+MEA+NVQPN+FIFSRILASYRDRGEWQ+TFEVLREMKN NVKPDRHFYNVMIDTFGKFNC+DHAMETY+RMLSEGIEPDVVTWNTLI
Subjt:  VGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLI

Query:  DCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM
        DCH KHGYHERA+ELFEEMQERGY PC TTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM
Subjt:  DCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM

Query:  YNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCT
        YNALINAFAQ+GLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAF +LQYMKENDVKPDVVTYTTLMKALIRV+KF+KVPAVYEEMILSGCT
Subjt:  YNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCT

Query:  PDGKARAMLRSALRYMKRTLSL
        PDGKARAMLRSAL+YMKRTLSL
Subjt:  PDGKARAMLRSALRYMKRTLSL

XP_023546092.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0092.8Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKS
        MLLLSPLPL TRFPATQ PSPTVFLH QNP I THLSFP ISAAA A T+S SSVVTC TSS++LELDVFENDHVS QS RYDFTPLLDFLS S AYPKS
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKS

Query:  DYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA
          DSDSEVEFDSVLDS+SESD ASPTSLDPTEFQLAE YRAVPAPLWHSLLKSLC+SSSSIGLGYAVV WLQKHNLCFSYELLYSILIHALGRSEKLYEA
Subjt:  DYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA

Query:  FILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN
        FILSQ QTLTPLTYNALIGACARNND EKALNL+SRMRQDGYQSDF+NYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN
Subjt:  FILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN

Query:  RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYAN
        RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGL+PRIKA NALLKGY KKGSLKEAESIVSEMEKSGLSPDEHTYGLL+DAYAN
Subjt:  RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYAN

Query:  VGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLI
        VG W+SAR LLK+MEA+NVQPN+FIFSRILASYRDRGEWQ+TFEVLREMKN NVKPDRHFYNVMIDTFGKFNC+DHAMETY+RMLSEGIEPDVVTWNTLI
Subjt:  VGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLI

Query:  DCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM
        DCH KHGYHERA+ELFEEMQERGY PC TTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM
Subjt:  DCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM

Query:  YNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCT
        YNALINAFAQ+GLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAF +LQYMKENDVKPDVVTYTTLMKALIRV+KF+KVPAVYEEMILSGCT
Subjt:  YNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCT

Query:  PDGKARAMLRSALRYMKRTLSL
        PDGKARAMLRSAL+YMKRTLSL
Subjt:  PDGKARAMLRSALRYMKRTLSL

XP_038883964.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic isoform X1 [Benincasa hispida]0.0e+0093.5Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHI-ATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPK
        MLLLSPLPL TRFP  QFPSPTVFLH QNPHI  THL FPFISA AT    + SSVVTCYTSS++LELDVFEND VSLQS  YDFTPLLDFLS SS YPK
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHI-ATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPK

Query:  SDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE
        S  DSDSEVEF+S L+S S+SD ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE
Subjt:  SDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE

Query:  AFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDP
        AFILSQ+QTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDF+NYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDP
Subjt:  AFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDP

Query:  NRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYA
        NRALYFLSMVQASGLNPKTSTFVAIISALGN+GRTEEAEAIFEEMKEGGL+PRIKA NALLKGY +KGSLKEAESIVSEMEKSGLSPDEHTYGLL+DAYA
Subjt:  NRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYA

Query:  NVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL
        NVGRWESAR LLK+MEA+NVQPNSFIFSRILASYRDRGEWQ+TFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL
Subjt:  NVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL

Query:  IDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSST
        IDCHCKHGYH+RA+ELFEEMQERGYLPC TTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSST
Subjt:  IDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSST

Query:  MYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGC
        MYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAF++LQYMKENDVKPDVVTYTTLMKALIRVDKF+KVPAVYEEMILSGC
Subjt:  MYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGC

Query:  TPDGKARAMLRSALRYMKRTLSL
        TPDGKARAMLRSALRYMKRTLSL
Subjt:  TPDGKARAMLRSALRYMKRTLSL

TrEMBL top hitse value%identityAlignment
A0A0A0KCZ6 Uncharacterized protein0.0e+0091.16Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFL-HQQNPHIA-THLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYP
        MLLLSPLPL TRFPAT   SP VFL H  NPHIA THLSF F SA AT    S SS+VTCYTSS++LE DVFEND VSLQS RYDFTPLLDFLS SSAYP
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFL-HQQNPHIA-THLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYP

Query:  KSDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLY
        K D DSDSEVEFDS  +S S+SD ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQ+HNLCFSYELLYSILIHALGRSEKLY
Subjt:  KSDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLY

Query:  EAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGD
        EAFILSQ+QTLTPLTYNALIGACARNNDLEKALNLMSRMRQDG+QSDFINYSLIIQSLTRTNKID+P+LQKLYEEIESDKIELDG LLNDIILGFAKAGD
Subjt:  EAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGD

Query:  PNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAY
        PNRALYFLSMVQASGLNPKTSTFVA+ISALGN+GRTEEAEAIFEEMKEGGL+PRIKA NALLKGY +KGSLKEAESI+SEMEKSGLSPDEHTYGLL+DAY
Subjt:  PNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAY

Query:  ANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNT
        ANVGRWESAR LLK+MEA+NVQPN+FIFSRILASYRDRGEWQ+TFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNT
Subjt:  ANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNT

Query:  LIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSS
        LIDCH KHGYH+RA+ELFEEMQERGYLPC TTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNV+TYTTLVDIYG SGRFNDAI+CLEAMKSAGLKPS+
Subjt:  LIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSS

Query:  TMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSG
        TMYNALINAFAQRGLSEQAVNAYRVM SDGL+PSLLALNSLINAFGEDRRDIEAF++LQYMKENDVKPDVVTYTTLMKALIRVDKF+KVPAVYEEMILSG
Subjt:  TMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSG

Query:  CTPDGKARAMLRSALRYMKRTLSL
        CTPDGKARAMLRSALRYMKRTLSL
Subjt:  CTPDGKARAMLRSALRYMKRTLSL

A0A1S3CMJ2 pentatricopeptide repeat-containing protein At5g42310, mitochondrial0.0e+0091.42Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHI-ATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPK
        MLLLSPLPL TRFPAT   SP VFLH  NPHI  THLSF FISAAA A   + SSVVTCYTSS++LE DVFE+D VSLQS RYDFTPLLDFLS SSAYPK
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHI-ATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPK

Query:  SDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE
        SD D+DSEVEFD  L+S S+SD ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE
Subjt:  SDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE

Query:  AFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDP
        AFILSQ+QTLTPLTYNALIGACARNNDLEKALNLMSRMRQDG+QSDF+NYSLIIQSLTRTNKID+PILQKLYEEIESDKIELDG LLNDIILGFAKAGDP
Subjt:  AFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDP

Query:  NRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYA
        NRALYFLSMVQASGLNPKTSTFVA+ISALGN+GRTEEAEAIFEEMKEGGL+PRIKA NALLKGY +KGSLKEAESIVSEMEKSGLSPDEHTYGLL+DAYA
Subjt:  NRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYA

Query:  NVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL
        NVGRWESAR LLK+MEA+NVQPN+FIFSRILASYRDRGEWQ+TFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL
Subjt:  NVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL

Query:  IDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSST
        IDCH KHGYH+RA+ELFEEMQERGYLPC TTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNV+TYTTLVDIYG SGRFNDAI+CLEAMKSAGLKPS+T
Subjt:  IDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSST

Query:  MYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGC
        MYNALINAFAQRGLSEQAVNAYRVM SDGL+PSLLALNSLINAFGEDRRD+EAF++LQYMKENDVKPDVVTYTTLMKALIRVDKF+KVPAVYEEMILSGC
Subjt:  MYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGC

Query:  TPDGKARAMLRSALRYMKRTLSL
        TPDGKARAMLRSALRYMKRTLSL
Subjt:  TPDGKARAMLRSALRYMKRTLSL

A0A5A7T6H7 Pentatricopeptide repeat-containing protein0.0e+0091.42Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHI-ATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPK
        MLLLSPLPL TRFPAT   SP VFLH  NPHI  THLSF FISAAA A   + SSVVTCYTSS++LE DVFE+D VSLQS RYDFTPLLDFLS SSAYPK
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHI-ATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPK

Query:  SDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE
        SD D+DSEVEFD  L+S S+SD ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE
Subjt:  SDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYE

Query:  AFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDP
        AFILSQ+QTLTPLTYNALIGACARNNDLEKALNLMSRMRQDG+QSDF+NYSLIIQSLTRTNKID+PILQKLYEEIESDKIELDG LLNDIILGFAKAGDP
Subjt:  AFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDP

Query:  NRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYA
        NRALYFLSMVQASGLNPKTSTFVA+ISALGN+GRTEEAEAIFEEMKEGGL+PRIKA NALLKGY +KGSLKEAESIVSEMEKSGLSPDEHTYGLL+DAYA
Subjt:  NRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYA

Query:  NVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL
        NVGRWESAR LLK+MEA+NVQPN+FIFSRILASYRDRGEWQ+TFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL
Subjt:  NVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTL

Query:  IDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSST
        IDCH KHGYH+RA+ELFEEMQERGYLPC TTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNV+TYTTLVDIYG SGRFNDAI+CLEAMKSAGLKPS+T
Subjt:  IDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSST

Query:  MYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGC
        MYNALINAFAQRGLSEQAVNAYRVM SDGL+PSLLALNSLINAFGEDRRD+EAF++LQYMKENDVKPDVVTYTTLMKALIRVDKF+KVPAVYEEMILSGC
Subjt:  MYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGC

Query:  TPDGKARAMLRSALRYMKRTLSL
        TPDGKARAMLRSALRYMKRTLSL
Subjt:  TPDGKARAMLRSALRYMKRTLSL

A0A6J1G3R4 pentatricopeptide repeat-containing protein At5g42310, chloroplastic0.0e+0092.66Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKS
        MLLLSPLPL TRFPATQ PSPTVFLH QNP I THLSFP ISAAA A T+S SSVVTC TSS++LELDVFENDHVS QS RYDFTPLLDFLS S AYPKS
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKS

Query:  DYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA
          DSDSEVEFDSVLDS+SESD ASPTSLDPTEFQLAE YRAVPAPLWHSLLKSLC+SSSSIGLGYAVV WLQKHNLCFSYELLYSILIHALGRSEKLYEA
Subjt:  DYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA

Query:  FILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN
        FILSQ QTLTPLTYNALIGACARNND EKALNL+SRMRQDGYQSDF+NYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN
Subjt:  FILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN

Query:  RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYAN
        RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGL+PRIKA NALLKGY KKGSLKEAESIVSEMEKSGLSPDEHTYGLL+DAYAN
Subjt:  RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYAN

Query:  VGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLI
        VG W+SAR LLK+MEA+NVQPN+FIFSRILASYRDRGEWQ+TFEVLREMKN NVKPDRHFYNVMIDTFGKFNC+DHAMETY+RMLSEGIEPDVVTWNTLI
Subjt:  VGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLI

Query:  DCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM
        DCH KHGYHERA+ELFEEMQERGY PC TTYNIMINSLGEQEKWDEVK+LLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM
Subjt:  DCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM

Query:  YNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCT
        YNALINAFAQ+GLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAF +LQYMKENDVKPDVVTYTTLMKALIRV+KF+KVPAVYEEMILSGCT
Subjt:  YNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCT

Query:  PDGKARAMLRSALRYMKRTLSL
        PDGKARAMLRSAL+YMKRTLSL
Subjt:  PDGKARAMLRSALRYMKRTLSL

A0A6J1KKB7 pentatricopeptide repeat-containing protein At5g42310, chloroplastic0.0e+0092.8Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKS
        MLLLSPLPL TRFPATQ PSPTVFLH QNP I THLSFP ISAAA A T+S SSVVTC TSS++LELDVFENDHVS QS RYDFTPLLDFLS S AYPKS
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKS

Query:  DYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA
          DSDSEVEFDSVLDS+SESD ASPTSLDPTEFQLAE YRAVPAPLWHSLLKSLC+SSSSIGLGYAVV WLQKHNLCFSYELLYSILIHALGRSEKLYEA
Subjt:  DYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEA

Query:  FILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN
        FILSQ QTLTPLTYNALIGACARNND EKALNL+SRMRQDGYQSDF+NYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN
Subjt:  FILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPN

Query:  RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYAN
        RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGL+PRIKA NALLKGY KKGSLKEAESIVSEMEKSGLSPDEHTYGLL+DAYAN
Subjt:  RALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYAN

Query:  VGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLI
        VG W+SAR LLK+MEA+NVQPN+FIFSRILASYRDRGEWQ+TFEVLREMKN NVKPDRHFYNVMIDTFGKFNC+DHAMETY+RMLSEGIEPDVVTWNTLI
Subjt:  VGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLI

Query:  DCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM
        DCH KHGYHERA+ELFEEMQERGY PC TTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM
Subjt:  DCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTM

Query:  YNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCT
        YNALINAFAQ+GLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAF +LQYMKENDVKPDVVTYTTLMKALIRV+KF+KVPAVYEEMILSGCT
Subjt:  YNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCT

Query:  PDGKARAMLRSALRYMKRTLSL
        PDGKARAMLRSAL+YMKRTLSL
Subjt:  PDGKARAMLRSALRYMKRTLSL

SwissProt top hitse value%identityAlignment
A0A1D6IEG9 Pentatricopeptide repeat-containing protein CRP1, chloroplastic1.6e-19353.38Show/hide
Query:  VSLQSHRYDFTPLLDFLSHSSAYPKSDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKH
        VS+ + RYDF PLL +LS  S                    S S +  + P S+   E +LA +Y AVP+  WH+LL+ L +S +S+ L +A++ +L +H
Subjt:  VSLQSHRYDFTPLLDFLSHSSAYPKSDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKH

Query:  NLCFSYELLYSILIHALGRSEKLY-EAFILSQRQTL----TPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRT-NKIDVPILQ
         LCF  +LL S L+H+L  S +L   + +LS   +L    +PL  N+L+ A A  +    AL L+S +R+  +  D  +YS ++ SL  T +  D  +L+
Subjt:  NLCFSYELLYSILIHALGRSEKLY-EAFILSQRQTL----TPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRT-NKIDVPILQ

Query:  KLYEEIESDKIELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIF-EEMKEGGLRPRIKALNALLKGYVKKG
        +L  ++   ++E D  L +D+I  FA+A  P+ AL  L+  QA GL P+++   A+ISALG  GR  EAEA+F E    G ++PR +A NALLKGYV+  
Subjt:  KLYEEIESDKIELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIF-EEMKEGGLRPRIKALNALLKGYVKKG

Query:  SLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDT
        SLK AE ++ EM + G++PDE TY LL+DAY   GRWESAR LLKEMEA  V+P+S++FSRILA +RDRG+WQ+ F VLREM+ S V+PDRHFYNVMIDT
Subjt:  SLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDT

Query:  FGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYT
        FGK+NCL HAM+ +++M  EGIEPDVVTWNTLID HCK G H+RA+ELFEEM+E    P +TTYNIMIN LGEQE W+ V+ +L +M+ QGL+PN+ITYT
Subjt:  FGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYT

Query:  TLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPD
        TLVD+YG+SGR+ +AI+C+EAMK+ GLKPS TMY+AL+NA+AQRGL++ A+N  + M++DGL+ S+L LNSLINAFGEDRR +EAF+VLQ+M+EN ++PD
Subjt:  TLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPD

Query:  VVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSALRYMK
        V+TYTTLMKALIRV++F+KVP +YEEMI SGC PD KARAMLRS L+Y+K
Subjt:  VVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSALRYMK

B8Y6I0 Pentatricopeptide repeat-containing protein 10, chloroplastic6.3e-5725.17Show/hide
Query:  SLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTP------LTYNALIGACARNNDLEKALNLMSRMRQDGY
        SLLK+L   S       A++ W  K     +  L   +++ ALGR  +      L     L P        Y  ++ A +R    E+AL L + +R+ G 
Subjt:  SLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTP------LTYNALIGACARNNDLEKALNLMSRMRQDGY

Query:  QSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFE
            + Y++++    R  +   P +  L +E+ +  +E DG   + +I    + G  + A+ F   ++A G  P   T+ A++   G  G   EA  +  
Subjt:  QSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFE

Query:  EMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRT
        EM++ G +P     N L   Y + G  +EA   +  M   GL P+  TY  ++ AY NVG+ + A  L  +M+     PN   ++ +L     +  +   
Subjt:  EMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRT

Query:  FEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQE
         E+L EM  S   P+R  +N M+   GK    D+     + M S G+E    T+NTLI  + + G    A +++ EM   G+ PC TTYN ++N L  Q 
Subjt:  FEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQE

Query:  KWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSG------------------------------------RFNDAIECLEAMKSAGLKPSSTMYNALIN
         W   + ++ KM+++G  PN  +Y+ L+  Y + G                                    R +      + +K+ G  P   ++N++++
Subjt:  KWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSG------------------------------------RFNDAIECLEAMKSAGLKPSSTMYNALIN

Query:  AFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMK-ENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTP
         +A+ G+  +A   +  ++  GL P L+  NSL++ + +     EA  +L  +K    +KPDVV+Y T++    +     +   V  EM+  G  P
Subjt:  AFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMK-ENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTP

Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial4.8e-5727.95Show/hide
Query:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKI--DVPILQKLYEEIESDKIELDGQLLNDIILGFA
        L+E+ I S R   TP+ +N L  A AR    +  L     M  +G + D    +++I    R  K+     +L + ++       E D    + ++ GF 
Subjt:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKI--DVPILQKLYEEIESDKIELDGQLLNDIILGFA

Query:  KAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLL
          G  + A+  +  +      P   T   +I+ L   GR  EA  + + M E G +P       +L    K G+   A  +  +ME+  +      Y ++
Subjt:  KAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLL

Query:  IDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVV
        ID+    G ++ A +L  EME K ++ +   +S ++    + G+W    ++LREM   N+ PD   ++ +ID F K   L  A E Y+ M++ GI PD +
Subjt:  IDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVV

Query:  TWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGL
        T+N+LID  CK      A+++F+ M  +G  P   TY+I+INS  + ++ D+   L  ++ S+GL+PN ITY TLV  + QSG+ N A E  + M S G+
Subjt:  TWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGL

Query:  KPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEM
         PS   Y  L++     G   +A+  +  M+   +   +   N +I+      +  +A+++   + +  VKPDVVTY  ++  L +    ++   ++ +M
Subjt:  KPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEM

Query:  ILSGCTPD
           GCTPD
Subjt:  ILSGCTPD

Q84ZD2 Pentatricopeptide repeat-containing protein CRP1 homolog, chloroplastic1.8e-19254.11Show/hide
Query:  RYDFTPLLDFLSHSSAYPKSDYDSDSEVEFDSVLDSNSESDAASPTSLDP-TEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFS
        RYDF PLL +LS +S+ P                       +  PTS+ P TE +LA +Y AVPA  WH+LL+ L ++ +S+ L +A++ +L +H LCF 
Subjt:  RYDFTPLLDFLSHSSAYPKSDYDSDSEVEFDSVLDSNSESDAASPTSLDP-TEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFS

Query:  YELLYSILIHALGRSEKLY-EAFILSQRQTL----TPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRT-NKIDVPILQKLYEE
         +LL S L+H+L  S +L   + +LS   +L    +PL  N+L+ A A  +    AL L+  +R+  +  D  +YS ++ SL  T +  D  +L +L  +
Subjt:  YELLYSILIHALGRSEKLY-EAFILSQRQTL----TPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRT-NKIDVPILQKLYEE

Query:  IESDKIELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIF-EEMKEGGLRPRIKALNALLKGYVKKGSLKEA
        +   ++E D  L +D+I  FA+A  P+ AL  L+  QA GL P+++   A+IS+LG+  R  EAEA+F E    G ++PR +A NALLKGYVK GSLK A
Subjt:  IESDKIELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIF-EEMKEGGLRPRIKALNALLKGYVKKGSLKEA

Query:  ESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFN
        E ++ EM + G++PDE TY LL+DAY   GRWESAR LLKEMEA  V+P+S++FSRILA +RDRGEWQ+ F VLREM  S V+PDRHFYNVMIDTFGK+N
Subjt:  ESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFN

Query:  CLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDI
        CL HAM+ +DRM  EGIEPDVVTWNTLID HCK G H+RA ELF+EM+E      +TTYNIMIN LGE+++W+ V+ +L +M+ QGL+PN+ITYTTLVD+
Subjt:  CLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDI

Query:  YGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYT
        YG+SGRF +A++C+EAMK+ GLKPS TMY+AL+NA+AQRGL++ A+N  + MR+DGL+ S + LNSLINAFGEDRR  EAF+VLQ+MKEN ++PDV+TYT
Subjt:  YGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYT

Query:  TLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSALRYMK
        TLMKALIRV++F KVP +YEEMI SGC PD KARAMLRSALRYMK
Subjt:  TLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSALRYMK

Q8L844 Pentatricopeptide repeat-containing protein At5g42310, chloroplastic7.7e-28971.11Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSF-PFISAAATAFTSSI-----SSVVTCYTSSNSLELDVFENDHVSLQSH-RYDFTPLLDFLSH
        MLLL   PL     +T+F S     H  + H   H  F P ISA +   ++S+     SS  + ++S N L+ +  E++  S + H RYDF+PLL FLS 
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSF-PFISAAATAFTSSI-----SSVVTCYTSSNSLELDVFENDHVSLQSH-RYDFTPLLDFLSH

Query:  SSAYPKSDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGR
                         +  LDS SES+ ASP SL+P EF L E+YRAVPAP WHSL+KSL SS+SS+GL YAVVSWLQKHNLCFSYELLYSILIHALGR
Subjt:  SSAYPKSDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGR

Query:  SEKLYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGF
        SEKLYEAF+LSQ+QTLTPLTYNALIGACARNND+EKALNL+++MRQDGYQSDF+NYSL+IQSLTR+NKID  +L +LY+EIE DK+ELD QL+NDII+GF
Subjt:  SEKLYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGF

Query:  AKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGL
        AK+GDP++AL  L M QA+GL+ KT+T V+IISAL + GRT EAEA+FEE+++ G++PR +A NALLKGYVK G LK+AES+VSEMEK G+SPDEHTY L
Subjt:  AKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGL

Query:  LIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDV
        LIDAY N GRWESAR +LKEMEA +VQPNSF+FSR+LA +RDRGEWQ+TF+VL+EMK+  VKPDR FYNV+IDTFGKFNCLDHAM T+DRMLSEGIEPD 
Subjt:  LIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDV

Query:  VTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAG
        VTWNTLIDCHCKHG H  A E+FE M+ RG LPC+TTYNIMINS G+QE+WD++K LLGKM+SQG+LPNV+T+TTLVD+YG+SGRFNDAIECLE MKS G
Subjt:  VTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAG

Query:  LKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEE
        LKPSSTMYNALINA+AQRGLSEQAVNA+RVM SDGLKPSLLALNSLINAFGEDRRD EAFAVLQYMKEN VKPDVVTYTTLMKALIRVDKF KVP VYEE
Subjt:  LKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEE

Query:  MILSGCTPDGKARAMLRSALRYMKRTL
        MI+SGC PD KAR+MLRSALRYMK+TL
Subjt:  MILSGCTPDGKARAMLRSALRYMKRTL

Arabidopsis top hitse value%identityAlignment
AT1G62670.1 rna processing factor 25.9e-5826.73Show/hide
Query:  YSILIHALGRSEKLYEAFIL-SQRQTL----TPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDK
        +S L+ A+ +  K      L  Q Q L       TY+ LI    R + L  AL ++ +M + GY+ + +  S ++     + +I   +   L +++    
Subjt:  YSILIHALGRSEKLYEAFIL-SQRQTL----TPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDK

Query:  IELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSE
         + +    N +I G       + A+  +  + A G  P   T+  +++ L   G T+ A  +  +M++G L P +   N ++ G  K   + +A ++  E
Subjt:  IELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSE

Query:  MEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAM
        ME  G+ P+  TY  LI    N GRW  A  LL +M  + + P+ F FS ++ ++   G+     ++  EM   ++ P    Y+ +I+ F   + LD A 
Subjt:  MEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAM

Query:  ETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGR
        + ++ M+S+   PDVVT+NTLI   CK+   E   E+F EM +RG +  + TYNI+I  L +    D  + +  +M S G+ PN++TY TL+D   ++G+
Subjt:  ETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGR

Query:  FNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKAL
           A+   E ++ + ++P+   YN +I    + G  E   + +  +   G+KP ++A N++I+ F       EA A+ + MKE+   P+   Y TL++A 
Subjt:  FNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKAL

Query:  IRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSAL
        +R         + +EM   G   D     ++ + L
Subjt:  IRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSAL

AT2G31400.1 genomes uncoupled 12.2e-5728.39Show/hide
Query:  NALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGL
        +AL  A   + D E   +LM         SD   Y  II+ L   N+ D  +    +      +    G+L + +I    + G    A        A G 
Subjt:  NALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGL

Query:  NPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKG-SLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKE
              F A+ISA G  G  EEA ++F  MKE GLRP +   NA++    K G   K+      EM+++G+ PD  T+  L+   +  G WE+AR L  E
Subjt:  NPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKG-SLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKE

Query:  MEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAS
        M  + ++ + F ++ +L +    G+    FE+L +M    + P+   Y+ +ID F K    D A+  +  M   GI  D V++NTL+  + K G  E A 
Subjt:  MEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAS

Query:  ELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGL
        ++  EM   G      TYN ++   G+Q K+DEVK +  +M+ + +LPN++TY+TL+D Y + G + +A+E     KSAGL+    +Y+ALI+A  + GL
Subjt:  ELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGL

Query:  SEQAVNAYRVMRSDGLKPSLLALNSLINAFG------------------------------EDRRDIEAFA--------------------------VLQ
           AV+    M  +G+ P+++  NS+I+AFG                              E  R I+ F                           V +
Subjt:  SEQAVNAYRVMRSDGLKPSLLALNSLINAFG------------------------------EDRRDIEAFA--------------------------VLQ

Query:  YMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL
         M + ++KP+VVT++ ++ A  R + F     + EE+ L
Subjt:  YMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL

AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein3.4e-5827.95Show/hide
Query:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKI--DVPILQKLYEEIESDKIELDGQLLNDIILGFA
        L+E+ I S R   TP+ +N L  A AR    +  L     M  +G + D    +++I    R  K+     +L + ++       E D    + ++ GF 
Subjt:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKI--DVPILQKLYEEIESDKIELDGQLLNDIILGFA

Query:  KAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLL
          G  + A+  +  +      P   T   +I+ L   GR  EA  + + M E G +P       +L    K G+   A  +  +ME+  +      Y ++
Subjt:  KAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLL

Query:  IDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVV
        ID+    G ++ A +L  EME K ++ +   +S ++    + G+W    ++LREM   N+ PD   ++ +ID F K   L  A E Y+ M++ GI PD +
Subjt:  IDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVV

Query:  TWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGL
        T+N+LID  CK      A+++F+ M  +G  P   TY+I+INS  + ++ D+   L  ++ S+GL+PN ITY TLV  + QSG+ N A E  + M S G+
Subjt:  TWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGL

Query:  KPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEM
         PS   Y  L++     G   +A+  +  M+   +   +   N +I+      +  +A+++   + +  VKPDVVTY  ++  L +    ++   ++ +M
Subjt:  KPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEM

Query:  ILSGCTPD
           GCTPD
Subjt:  ILSGCTPD

AT5G42310.1 Pentatricopeptide repeat (PPR-like) superfamily protein5.5e-29071.11Show/hide
Query:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSF-PFISAAATAFTSSI-----SSVVTCYTSSNSLELDVFENDHVSLQSH-RYDFTPLLDFLSH
        MLLL   PL     +T+F S     H  + H   H  F P ISA +   ++S+     SS  + ++S N L+ +  E++  S + H RYDF+PLL FLS 
Subjt:  MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSF-PFISAAATAFTSSI-----SSVVTCYTSSNSLELDVFENDHVSLQSH-RYDFTPLLDFLSH

Query:  SSAYPKSDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGR
                         +  LDS SES+ ASP SL+P EF L E+YRAVPAP WHSL+KSL SS+SS+GL YAVVSWLQKHNLCFSYELLYSILIHALGR
Subjt:  SSAYPKSDYDSDSEVEFDSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGR

Query:  SEKLYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGF
        SEKLYEAF+LSQ+QTLTPLTYNALIGACARNND+EKALNL+++MRQDGYQSDF+NYSL+IQSLTR+NKID  +L +LY+EIE DK+ELD QL+NDII+GF
Subjt:  SEKLYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGF

Query:  AKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGL
        AK+GDP++AL  L M QA+GL+ KT+T V+IISAL + GRT EAEA+FEE+++ G++PR +A NALLKGYVK G LK+AES+VSEMEK G+SPDEHTY L
Subjt:  AKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGL

Query:  LIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDV
        LIDAY N GRWESAR +LKEMEA +VQPNSF+FSR+LA +RDRGEWQ+TF+VL+EMK+  VKPDR FYNV+IDTFGKFNCLDHAM T+DRMLSEGIEPD 
Subjt:  LIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDV

Query:  VTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAG
        VTWNTLIDCHCKHG H  A E+FE M+ RG LPC+TTYNIMINS G+QE+WD++K LLGKM+SQG+LPNV+T+TTLVD+YG+SGRFNDAIECLE MKS G
Subjt:  VTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAG

Query:  LKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEE
        LKPSSTMYNALINA+AQRGLSEQAVNA+RVM SDGLKPSLLALNSLINAFGEDRRD EAFAVLQYMKEN VKPDVVTYTTLMKALIRVDKF KVP VYEE
Subjt:  LKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEE

Query:  MILSGCTPDGKARAMLRSALRYMKRTL
        MI+SGC PD KAR+MLRSALRYMK+TL
Subjt:  MILSGCTPDGKARAMLRSALRYMKRTL

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein2.1e-5527.52Show/hide
Query:  YEAFILSQRQTLTPLTYNALIG-----------ACARNNDLEKALNLMSRMRQD-----GYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIEL
        Y+  + S    LT L  N  +G           +C    D    L+L  +M +D      Y+     Y+ ++ SL R   +D   ++++Y E+  DK+  
Subjt:  YEAFILSQRQTLTPLTYNALIG-----------ACARNNDLEKALNLMSRMRQD-----GYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIEL

Query:  DGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAII----------SALGNYG-------------------------RTEEAEAIFEEMKE
        +    N ++ G+ K G+   A  ++S +  +GL+P   T+ ++I          SA   +                          R +EA  +F +MK+
Subjt:  DGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAII----------SALGNYG-------------------------RTEEAEAIFEEMKE

Query:  GGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVL
            P ++    L+K         EA ++V EME++G+ P+ HTY +LID+  +  ++E AR LL +M  K + PN   ++ ++  Y  RG  +   +V+
Subjt:  GGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQRTFEVL

Query:  REMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDE
          M++  + P+   YN +I  + K N +  AM   ++ML   + PDVVT+N+LID  C+ G  + A  L   M +RG +P   TY  MI+SL + ++ +E
Subjt:  REMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDE

Query:  VKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGED
           L   ++ +G+ PNV+ YT L+D Y ++G+ ++A   LE M S    P+S  +NALI+     G  ++A      M   GL+P++     LI+   +D
Subjt:  VKILLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGED

Query:  RRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPD
             A++  Q M  +  KPD  TYTT ++   R  +      +  +M  +G +PD
Subjt:  RRDIEAFAVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCTTCTGTCACCATTGCCACTCCCAACTCGCTTTCCTGCAACTCAATTCCCTTCTCCAACCGTCTTCCTCCACCAGCAGAATCCCCATATAGCTACCCATCTCTC
ATTTCCCTTCATTTCCGCCGCCGCCACCGCCTTCACCTCCTCCATCTCCTCCGTCGTCACCTGTTACACTTCCTCCAACTCGCTCGAGCTCGACGTCTTCGAAAACGACC
ACGTTTCCCTTCAGAGTCACCGCTACGACTTCACTCCCCTCCTCGACTTCCTTTCCCACTCTTCAGCCTATCCCAAGTCCGATTACGATTCGGATTCCGAGGTTGAATTT
GACTCTGTATTGGACTCCAATTCTGAGTCGGATGCCGCTTCTCCGACCTCGCTCGACCCCACCGAGTTTCAGCTTGCCGAGGCCTACAGGGCCGTGCCGGCGCCTCTCTG
GCACTCTCTGCTTAAGTCTCTTTGCTCTTCTTCCTCTTCGATTGGGCTAGGTTATGCGGTTGTTTCGTGGCTTCAGAAGCATAATCTTTGTTTCTCTTACGAATTGCTTT
ACTCGATTCTCATTCATGCGCTTGGCCGCTCTGAGAAGCTTTATGAGGCTTTCATTCTCTCCCAGAGGCAAACACTAACCCCCTTAACGTATAATGCTCTCATTGGTGCC
TGTGCTCGCAATAATGATTTGGAGAAGGCCCTCAATTTGATGTCTAGGATGCGGCAAGATGGTTACCAATCTGATTTTATCAATTATAGTTTGATAATTCAGTCGCTAAC
TCGGACGAATAAGATTGATGTTCCAATCTTGCAAAAGCTTTACGAAGAGATTGAGTCTGATAAAATTGAACTCGATGGGCAGCTCCTTAATGATATAATATTGGGTTTTG
CAAAAGCTGGAGACCCTAACCGAGCTCTGTATTTCTTGTCGATGGTACAGGCGAGTGGTTTAAACCCCAAAACTTCTACTTTTGTTGCGATTATTTCTGCATTGGGAAAT
TATGGGCGGACAGAGGAAGCTGAGGCTATCTTTGAGGAAATGAAAGAAGGTGGATTGAGACCAAGAATTAAGGCTTTGAATGCACTTCTTAAAGGTTATGTTAAAAAGGG
TTCTCTAAAAGAAGCTGAATCTATTGTCTCAGAGATGGAAAAGAGTGGATTATCACCGGATGAGCACACATACGGTCTTCTCATTGATGCTTATGCAAATGTGGGCAGAT
GGGAAAGTGCAAGAACTTTGTTGAAAGAAATGGAAGCTAAAAATGTACAGCCCAACTCCTTCATCTTCAGTAGGATTTTAGCTAGTTATCGTGACCGGGGGGAATGGCAG
AGGACATTTGAAGTTTTGAGGGAAATGAAGAACAGCAATGTCAAACCTGACAGGCATTTTTACAATGTCATGATTGATACTTTCGGGAAGTTCAATTGCCTTGATCATGC
CATGGAAACATATGACCGGATGCTCTCTGAGGGGATTGAACCGGACGTTGTTACTTGGAACACACTTATAGATTGTCATTGTAAACACGGATACCATGAAAGGGCTTCAG
AGTTGTTCGAGGAAATGCAGGAGCGTGGTTATTTACCTTGTTCCACAACATATAATATTATGATCAATTCATTAGGAGAGCAGGAAAAATGGGATGAGGTGAAAATCTTG
TTAGGGAAGATGCAGAGCCAGGGCTTACTTCCCAATGTGATAACATACACTACCCTTGTTGATATATATGGACAGTCGGGGAGGTTTAACGACGCAATTGAGTGCTTGGA
GGCCATGAAGTCTGCTGGGCTGAAACCATCCTCAACTATGTATAATGCTTTAATCAATGCCTTTGCTCAAAGAGGTTTGTCCGAGCAGGCAGTAAATGCATATAGAGTTA
TGAGATCAGATGGATTAAAGCCCAGTCTCTTGGCTCTTAATTCATTGATCAATGCATTTGGCGAGGATAGGAGAGACATTGAAGCCTTTGCAGTCTTGCAGTACATGAAG
GAAAATGATGTGAAGCCTGATGTTGTTACATATACAACGCTTATGAAAGCTTTGATTCGTGTTGATAAATTCAACAAGGTTCCAGCTGTGTATGAAGAGATGATTCTGTC
TGGATGTACTCCTGATGGAAAGGCCAGAGCAATGTTGCGGTCTGCCCTCAGATACATGAAGCGTACACTAAGTTTATAG
mRNA sequenceShow/hide mRNA sequence
CCGATCATTAATCCCACTTCTCTTCCTCTTCGCGCAATTATCCGATAACATGCTTCTTCTGTCACCATTGCCACTCCCAACTCGCTTTCCTGCAACTCAATTCCCTTCTC
CAACCGTCTTCCTCCACCAGCAGAATCCCCATATAGCTACCCATCTCTCATTTCCCTTCATTTCCGCCGCCGCCACCGCCTTCACCTCCTCCATCTCCTCCGTCGTCACC
TGTTACACTTCCTCCAACTCGCTCGAGCTCGACGTCTTCGAAAACGACCACGTTTCCCTTCAGAGTCACCGCTACGACTTCACTCCCCTCCTCGACTTCCTTTCCCACTC
TTCAGCCTATCCCAAGTCCGATTACGATTCGGATTCCGAGGTTGAATTTGACTCTGTATTGGACTCCAATTCTGAGTCGGATGCCGCTTCTCCGACCTCGCTCGACCCCA
CCGAGTTTCAGCTTGCCGAGGCCTACAGGGCCGTGCCGGCGCCTCTCTGGCACTCTCTGCTTAAGTCTCTTTGCTCTTCTTCCTCTTCGATTGGGCTAGGTTATGCGGTT
GTTTCGTGGCTTCAGAAGCATAATCTTTGTTTCTCTTACGAATTGCTTTACTCGATTCTCATTCATGCGCTTGGCCGCTCTGAGAAGCTTTATGAGGCTTTCATTCTCTC
CCAGAGGCAAACACTAACCCCCTTAACGTATAATGCTCTCATTGGTGCCTGTGCTCGCAATAATGATTTGGAGAAGGCCCTCAATTTGATGTCTAGGATGCGGCAAGATG
GTTACCAATCTGATTTTATCAATTATAGTTTGATAATTCAGTCGCTAACTCGGACGAATAAGATTGATGTTCCAATCTTGCAAAAGCTTTACGAAGAGATTGAGTCTGAT
AAAATTGAACTCGATGGGCAGCTCCTTAATGATATAATATTGGGTTTTGCAAAAGCTGGAGACCCTAACCGAGCTCTGTATTTCTTGTCGATGGTACAGGCGAGTGGTTT
AAACCCCAAAACTTCTACTTTTGTTGCGATTATTTCTGCATTGGGAAATTATGGGCGGACAGAGGAAGCTGAGGCTATCTTTGAGGAAATGAAAGAAGGTGGATTGAGAC
CAAGAATTAAGGCTTTGAATGCACTTCTTAAAGGTTATGTTAAAAAGGGTTCTCTAAAAGAAGCTGAATCTATTGTCTCAGAGATGGAAAAGAGTGGATTATCACCGGAT
GAGCACACATACGGTCTTCTCATTGATGCTTATGCAAATGTGGGCAGATGGGAAAGTGCAAGAACTTTGTTGAAAGAAATGGAAGCTAAAAATGTACAGCCCAACTCCTT
CATCTTCAGTAGGATTTTAGCTAGTTATCGTGACCGGGGGGAATGGCAGAGGACATTTGAAGTTTTGAGGGAAATGAAGAACAGCAATGTCAAACCTGACAGGCATTTTT
ACAATGTCATGATTGATACTTTCGGGAAGTTCAATTGCCTTGATCATGCCATGGAAACATATGACCGGATGCTCTCTGAGGGGATTGAACCGGACGTTGTTACTTGGAAC
ACACTTATAGATTGTCATTGTAAACACGGATACCATGAAAGGGCTTCAGAGTTGTTCGAGGAAATGCAGGAGCGTGGTTATTTACCTTGTTCCACAACATATAATATTAT
GATCAATTCATTAGGAGAGCAGGAAAAATGGGATGAGGTGAAAATCTTGTTAGGGAAGATGCAGAGCCAGGGCTTACTTCCCAATGTGATAACATACACTACCCTTGTTG
ATATATATGGACAGTCGGGGAGGTTTAACGACGCAATTGAGTGCTTGGAGGCCATGAAGTCTGCTGGGCTGAAACCATCCTCAACTATGTATAATGCTTTAATCAATGCC
TTTGCTCAAAGAGGTTTGTCCGAGCAGGCAGTAAATGCATATAGAGTTATGAGATCAGATGGATTAAAGCCCAGTCTCTTGGCTCTTAATTCATTGATCAATGCATTTGG
CGAGGATAGGAGAGACATTGAAGCCTTTGCAGTCTTGCAGTACATGAAGGAAAATGATGTGAAGCCTGATGTTGTTACATATACAACGCTTATGAAAGCTTTGATTCGTG
TTGATAAATTCAACAAGGTTCCAGCTGTGTATGAAGAGATGATTCTGTCTGGATGTACTCCTGATGGAAAGGCCAGAGCAATGTTGCGGTCTGCCCTCAGATACATGAAG
CGTACACTAAGTTTATAGTTGCCCGCCCGATCATGTAGCCATGCACCATGGAGAGTAAATTTGCATGATTTAATCCAATCAATACTCATAGGAATTGCCAAATTGAAGTT
AGAGTCATATCCTTATGACTTGATCTTACACAAATAGTATGGCATTATACCAGCAAAAGATTAACATTTTTAATGTAGAACTCTCTGTGCCCGCCCCTTCTGCAGCTCCC
TTAAAGGATACAGCACCACTCCATCTGGCCGTCCGGTGGTGGGTCTTAGAAACACCAGGCAAATTCCCACTGTTCATCATTCTTGTATTATTGTTCATATAACATTCACC
ATTATTACAGAAAAGAAATAATTGATCCAAGGAACTCAATGCGTGTGTTCTTTTCTTTCCTTTCATGATGTTGTTCTAAATCA
Protein sequenceShow/hide protein sequence
MLLLSPLPLPTRFPATQFPSPTVFLHQQNPHIATHLSFPFISAAATAFTSSISSVVTCYTSSNSLELDVFENDHVSLQSHRYDFTPLLDFLSHSSAYPKSDYDSDSEVEF
DSVLDSNSESDAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSSSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTPLTYNALIGA
CARNNDLEKALNLMSRMRQDGYQSDFINYSLIIQSLTRTNKIDVPILQKLYEEIESDKIELDGQLLNDIILGFAKAGDPNRALYFLSMVQASGLNPKTSTFVAIISALGN
YGRTEEAEAIFEEMKEGGLRPRIKALNALLKGYVKKGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARTLLKEMEAKNVQPNSFIFSRILASYRDRGEWQ
RTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERASELFEEMQERGYLPCSTTYNIMINSLGEQEKWDEVKIL
LGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFAVLQYMK
ENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSALRYMKRTLSL