; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS023471 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS023471
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold787:110072..113519
RNA-Seq ExpressionMS023471
SyntenyMS023471
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464858.1 PREDICTED: pentatricopeptide repeat-containing protein At5g42310, mitochondrial [Cucumis melo]0.0e+0087.6Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL--PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSA
        MLLLSP PLS RFPAT   SP +FLHHH+ H+   HLS  FISAAAAA    +S S VTCYTSSD L+      D  SLQSRRYDF PLL FLSRS  SA
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL--PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSA

Query:  TAGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEK
           + SD+DSEVEFD         + ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSS SSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEK
Subjt:  TAGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEK

Query:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKA
        LYEAFILSQ+QTLTPLTYNALIGACARNNDLEKALNLMSRMRQDG+QSDFVNYSLIIQSLTRTNKID+PILQKLY EIESDKIELDG LLNDIILGFAKA
Subjt:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKA

Query:  GDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLID
        GDP+ ALYFLSMVQASGLNPKTSTFVA+ISALGN+GRTEEAEAIFEEMKEGGL+PRIKAFNALLKGY ++GSLKEAESIVSEMEKSGLSPDEHTYGLL+D
Subjt:  GDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLID

Query:  AYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW
        AYANVGRWESAR+LLK+MEAR+V+PN+FIFSR+LASYRDRGEWQ+TFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW
Subjt:  AYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW

Query:  NTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP
        NTLIDCH KHGYH+RAAELF+EMQERGYLPC TTYNIMINSLGEQEKWDEVK LLGKMQSQGLLPNV+TYTTLVDIYG SGRFNDAI+CLEAMKSAGLKP
Subjt:  NTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP

Query:  SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL
        S+TMYNALINAFAQRGLSEQAVNAYRVM SDGL+PSLLALNSLINAFGEDRRD+EAFS+LQYMKENDVKPDVVTYTTLMKALIRVDKF+KVPAVYEEMIL
Subjt:  SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL

Query:  SGCTPDGKARAMLRSALRYMKRTLSI
        SGCTPDGKARAMLRSALRYMKRTLS+
Subjt:  SGCTPDGKARAMLRSALRYMKRTLSI

XP_022153134.1 pentatricopeptide repeat-containing protein At5g42310, mitochondrial isoform X1 [Momordica charantia]0.0e+0099.44Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHLPHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSD
        MLLLSPPPLSGRFPATQFP PTIFLHHHHHHLPHLSLPFIS AAAAATLTSSPSAVTCYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSD
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHLPHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSD

Query:  SDSEVEFDEAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTPL
        SDSEVEFDE ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTPL
Subjt:  SDSEVEFDEAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTPL

Query:  TYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQAS
        TYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQAS
Subjt:  TYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQAS

Query:  GLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLK
        GLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLK
Subjt:  GLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLK

Query:  EMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERA
        EMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERA
Subjt:  EMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERA

Query:  AELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRG
        AELFQEMQERGYLPCATTYNIMINSLGEQ KWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRG
Subjt:  AELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRG

Query:  LSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSA
        LSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSA
Subjt:  LSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSA

Query:  LRYMKRTLSI
        LRYMKRTLSI
Subjt:  LRYMKRTLSI

XP_022946456.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic [Cucurbita moschata]0.0e+0088Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL-PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSAT
        MLLLSP PLS RFPATQ PSPT+FLHH +  +  HLS P IS  AAAAT TS  S VTC TSSDAL+L     DH S QSRRYDF PLL FLSRS     
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL-PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSAT

Query:  AGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL
        A   SDSDSEVEFD         + ASPTSLDPTEFQLAE YRAVPAPLWHSLLKSLC+S SSIGLGYAVV WLQKHNLCFSYELLYSILIHALGRSEKL
Subjt:  AGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL

Query:  YEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAG
        YEAFILSQ QTLTPLTYNALIGACARNND EKALNL+SRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLY EIESDKIELDGQLLNDIILGFAKAG
Subjt:  YEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAG

Query:  DPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDA
        DP+ ALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGL+PRIKAFNALLKGY K+GSLKEAESIVSEMEKSGLSPDEHTYGLL+DA
Subjt:  DPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDA

Query:  YANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN
        YANVG W+SAR LLK+MEAR+V+PN+FIFSR+LASYRDRGEWQ+TFEVLREMKN NVKPDRHFYNVMIDTFGKFNC+DHAMETY+RMLSEGIEPDVVTWN
Subjt:  YANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN

Query:  TLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS
        TLIDCH KHGYHERAAELF+EMQERGY PC TTYNIMINSLGEQEKWDEVK LLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS
Subjt:  TLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS

Query:  STMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILS
        STMYNALINAFAQ+GLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAF++LQYMKENDVKPDVVTYTTLMKALIRV+KF+KVPAVYEEMILS
Subjt:  STMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILS

Query:  GCTPDGKARAMLRSALRYMKRTLSI
        GCTPDGKARAMLRSAL+YMKRTLS+
Subjt:  GCTPDGKARAMLRSALRYMKRTLSI

XP_023546092.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0087.86Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL-PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSAT
        MLLLSP PLS RFPATQ PSPT+FLHH +  +  HLS P ISAAAA+   TS  S VTC TSSDAL+L     DH S QSRRYDF PLL FLSRS     
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL-PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSAT

Query:  AGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL
        A   SDSDSEVEFD         + ASPTSLDPTEFQLAE YRAVPAPLWHSLLKSLC+S SSIGLGYAVV WLQKHNLCFSYELLYSILIHALGRSEKL
Subjt:  AGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL

Query:  YEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAG
        YEAFILSQ QTLTPLTYNALIGACARNND EKALNL+SRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLY EIESDKIELDGQLLNDIILGFAKAG
Subjt:  YEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAG

Query:  DPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDA
        DP+ ALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGL+PRIKAFNALLKGY K+GSLKEAESIVSEMEKSGLSPDEHTYGLL+DA
Subjt:  DPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDA

Query:  YANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN
        YANVG W+SAR LLK+MEAR+V+PN+FIFSR+LASYRDRGEWQ+TFEVLREMKN NVKPDRHFYNVMIDTFGKFNC+DHAMETY+RMLSEGIEPDVVTWN
Subjt:  YANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN

Query:  TLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS
        TLIDCH KHGYHERAAELF+EMQERGY PC TTYNIMINSLGEQEKWDEVK LLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS
Subjt:  TLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS

Query:  STMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILS
        STMYNALINAFAQ+GLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAF++LQYMKENDVKPDVVTYTTLMKALIRV+KF+KVPAVYEEMILS
Subjt:  STMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILS

Query:  GCTPDGKARAMLRSALRYMKRTLSI
        GCTPDGKARAMLRSAL+YMKRTLS+
Subjt:  GCTPDGKARAMLRSALRYMKRTLSI

XP_038883964.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic isoform X1 [Benincasa hispida]0.0e+0089.26Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL--PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSA
        MLLLSP PLS RFP  QFPSPT+FLHH + H+   HL  PFIS     AT T+S S VTCYTSSDAL+L     D  SLQSR YDF PLL FLSRSS   
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL--PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSA

Query:  TAGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEK
             SDSDSEVEF+         + ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSS SSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEK
Subjt:  TAGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEK

Query:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKA
        LYEAFILSQ+QTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLY EIESDKIELDGQLLNDIILGFAKA
Subjt:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKA

Query:  GDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLID
        GDP+ ALYFLSMVQASGLNPKTSTFVAIISALGN+GRTEEAEAIFEEMKEGGL+PRIKAFNALLKGY ++GSLKEAESIVSEMEKSGLSPDEHTYGLL+D
Subjt:  GDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLID

Query:  AYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW
        AYANVGRWESAR+LLK+MEAR+V+PNSFIFSR+LASYRDRGEWQ+TFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW
Subjt:  AYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW

Query:  NTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP
        NTLIDCHCKHGYH+RAAELF+EMQERGYLPC TTYNIMINSLGEQEKWDEVK LLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP
Subjt:  NTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP

Query:  SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL
        SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFS+LQYMKENDVKPDVVTYTTLMKALIRVDKF+KVPAVYEEMIL
Subjt:  SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL

Query:  SGCTPDGKARAMLRSALRYMKRTLSI
        SGCTPDGKARAMLRSALRYMKRTLS+
Subjt:  SGCTPDGKARAMLRSALRYMKRTLSI

TrEMBL top hitse value%identityAlignment
A0A1S3CMJ2 pentatricopeptide repeat-containing protein At5g42310, mitochondrial0.0e+0087.6Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL--PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSA
        MLLLSP PLS RFPAT   SP +FLHHH+ H+   HLS  FISAAAAA    +S S VTCYTSSD L+      D  SLQSRRYDF PLL FLSRS  SA
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL--PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSA

Query:  TAGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEK
           + SD+DSEVEFD         + ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSS SSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEK
Subjt:  TAGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEK

Query:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKA
        LYEAFILSQ+QTLTPLTYNALIGACARNNDLEKALNLMSRMRQDG+QSDFVNYSLIIQSLTRTNKID+PILQKLY EIESDKIELDG LLNDIILGFAKA
Subjt:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKA

Query:  GDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLID
        GDP+ ALYFLSMVQASGLNPKTSTFVA+ISALGN+GRTEEAEAIFEEMKEGGL+PRIKAFNALLKGY ++GSLKEAESIVSEMEKSGLSPDEHTYGLL+D
Subjt:  GDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLID

Query:  AYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW
        AYANVGRWESAR+LLK+MEAR+V+PN+FIFSR+LASYRDRGEWQ+TFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW
Subjt:  AYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW

Query:  NTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP
        NTLIDCH KHGYH+RAAELF+EMQERGYLPC TTYNIMINSLGEQEKWDEVK LLGKMQSQGLLPNV+TYTTLVDIYG SGRFNDAI+CLEAMKSAGLKP
Subjt:  NTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP

Query:  SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL
        S+TMYNALINAFAQRGLSEQAVNAYRVM SDGL+PSLLALNSLINAFGEDRRD+EAFS+LQYMKENDVKPDVVTYTTLMKALIRVDKF+KVPAVYEEMIL
Subjt:  SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL

Query:  SGCTPDGKARAMLRSALRYMKRTLSI
        SGCTPDGKARAMLRSALRYMKRTLS+
Subjt:  SGCTPDGKARAMLRSALRYMKRTLSI

A0A5A7T6H7 Pentatricopeptide repeat-containing protein0.0e+0087.59Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHH-HLPHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSAT
        MLLLSP PLS RFPAT   SP +FLHH+ H    HLS  FISAAAAA    +S S VTCYTSSD L+      D  SLQSRRYDF PLL FLSRS  SA 
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHH-HLPHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSAT

Query:  AGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL
          + SD+DSEVEFD         + ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSS SSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL
Subjt:  AGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL

Query:  YEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAG
        YEAFILSQ+QTLTPLTYNALIGACARNNDLEKALNLMSRMRQDG+QSDFVNYSLIIQSLTRTNKID+PILQKLY EIESDKIELDG LLNDIILGFAKAG
Subjt:  YEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAG

Query:  DPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDA
        DP+ ALYFLSMVQASGLNPKTSTFVA+ISALGN+GRTEEAEAIFEEMKEGGL+PRIKAFNALLKGY ++GSLKEAESIVSEMEKSGLSPDEHTYGLL+DA
Subjt:  DPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDA

Query:  YANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN
        YANVGRWESAR+LLK+MEAR+V+PN+FIFSR+LASYRDRGEWQ+TFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN
Subjt:  YANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN

Query:  TLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS
        TLIDCH KHGYH+RAAELF+EMQERGYLPC TTYNIMINSLGEQEKWDEVK LLGKMQSQGLLPNV+TYTTLVDIYG SGRFNDAI+CLEAMKSAGLKPS
Subjt:  TLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS

Query:  STMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILS
        +TMYNALINAFAQRGLSEQAVNAYRVM SDGL+PSLLALNSLINAFGEDRRD+EAFS+LQYMKENDVKPDVVTYTTLMKALIRVDKF+KVPAVYEEMILS
Subjt:  STMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILS

Query:  GCTPDGKARAMLRSALRYMKRTLSI
        GCTPDGKARAMLRSALRYMKRTLS+
Subjt:  GCTPDGKARAMLRSALRYMKRTLSI

A0A6J1DI55 pentatricopeptide repeat-containing protein At5g42310, mitochondrial isoform X10.0e+0099.44Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHLPHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSD
        MLLLSPPPLSGRFPATQFP PTIFLHHHHHHLPHLSLPFIS AAAAATLTSSPSAVTCYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSD
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHLPHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSD

Query:  SDSEVEFDEAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTPL
        SDSEVEFDE ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTPL
Subjt:  SDSEVEFDEAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTPL

Query:  TYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQAS
        TYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQAS
Subjt:  TYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQAS

Query:  GLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLK
        GLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLK
Subjt:  GLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLK

Query:  EMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERA
        EMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERA
Subjt:  EMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERA

Query:  AELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRG
        AELFQEMQERGYLPCATTYNIMINSLGEQ KWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRG
Subjt:  AELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRG

Query:  LSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSA
        LSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSA
Subjt:  LSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSA

Query:  LRYMKRTLSI
        LRYMKRTLSI
Subjt:  LRYMKRTLSI

A0A6J1G3R4 pentatricopeptide repeat-containing protein At5g42310, chloroplastic0.0e+0088Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL-PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSAT
        MLLLSP PLS RFPATQ PSPT+FLHH +  +  HLS P IS  AAAAT TS  S VTC TSSDAL+L     DH S QSRRYDF PLL FLSRS     
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL-PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSAT

Query:  AGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL
        A   SDSDSEVEFD         + ASPTSLDPTEFQLAE YRAVPAPLWHSLLKSLC+S SSIGLGYAVV WLQKHNLCFSYELLYSILIHALGRSEKL
Subjt:  AGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL

Query:  YEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAG
        YEAFILSQ QTLTPLTYNALIGACARNND EKALNL+SRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLY EIESDKIELDGQLLNDIILGFAKAG
Subjt:  YEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAG

Query:  DPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDA
        DP+ ALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGL+PRIKAFNALLKGY K+GSLKEAESIVSEMEKSGLSPDEHTYGLL+DA
Subjt:  DPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDA

Query:  YANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN
        YANVG W+SAR LLK+MEAR+V+PN+FIFSR+LASYRDRGEWQ+TFEVLREMKN NVKPDRHFYNVMIDTFGKFNC+DHAMETY+RMLSEGIEPDVVTWN
Subjt:  YANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN

Query:  TLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS
        TLIDCH KHGYHERAAELF+EMQERGY PC TTYNIMINSLGEQEKWDEVK LLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS
Subjt:  TLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS

Query:  STMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILS
        STMYNALINAFAQ+GLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAF++LQYMKENDVKPDVVTYTTLMKALIRV+KF+KVPAVYEEMILS
Subjt:  STMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILS

Query:  GCTPDGKARAMLRSALRYMKRTLSI
        GCTPDGKARAMLRSAL+YMKRTLS+
Subjt:  GCTPDGKARAMLRSALRYMKRTLSI

A0A6J1KKB7 pentatricopeptide repeat-containing protein At5g42310, chloroplastic0.0e+0087.86Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL-PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSAT
        MLLLSP PLS RFPATQ PSPT+FLHH +  +  HLS P IS  AAAAT  S  S VTC TSSDAL+L     DH S QSRRYDF PLL FLSRS     
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHL-PHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDL-----DHASLQSRRYDFAPLLHFLSRSSTSAT

Query:  AGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL
        A   SDSDSEVEFD         + ASPTSLDPTEFQLAE YRAVPAPLWHSLLKSLC+S SSIGLGYAVV WLQKHNLCFSYELLYSILIHALGRSEKL
Subjt:  AGAVSDSDSEVEFD---------EAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKL

Query:  YEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAG
        YEAFILSQ QTLTPLTYNALIGACARNND EKALNL+SRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLY EIESDKIELDGQLLNDIILGFAKAG
Subjt:  YEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAG

Query:  DPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDA
        DP+ ALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGL+PRIKAFNALLKGY K+GSLKEAESIVSEMEKSGLSPDEHTYGLL+DA
Subjt:  DPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDA

Query:  YANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN
        YANVG W+SAR LLK+MEAR+V+PN+FIFSR+LASYRDRGEWQ+TFEVLREMKN NVKPDRHFYNVMIDTFGKFNC+DHAMETY+RMLSEGIEPDVVTWN
Subjt:  YANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWN

Query:  TLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS
        TLIDCH KHGYHERAAELF+EMQERGY PC TTYNIMINSLGEQEKWDEVK LLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS
Subjt:  TLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPS

Query:  STMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILS
        STMYNALINAFAQ+GLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAF++LQYMKENDVKPDVVTYTTLMKALIRV+KF+KVPAVYEEMILS
Subjt:  STMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILS

Query:  GCTPDGKARAMLRSALRYMKRTLSI
        GCTPDGKARAMLRSAL+YMKRTLS+
Subjt:  GCTPDGKARAMLRSALRYMKRTLSI

SwissProt top hitse value%identityAlignment
A0A1D6IEG9 Pentatricopeptide repeat-containing protein CRP1, chloroplastic4.7e-19854.05Show/hide
Query:  CYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSDSDSEVEFDEAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVS
        C TSS  +     S+ + RYDF PLL +LS  S SA+  + S             P S+   E +LA +Y AVP+  WH+LL+ L +S +S+ L +A++ 
Subjt:  CYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSDSDSEVEFDEAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVS

Query:  WLQKHNLCFSYELLYSILIHALGRSEKLY-EAFILSQRQTL----TPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRT-NKID
        +L +H LCF  +LL S L+H+L  S +L   + +LS   +L    +PL  N+L+ A A  +    AL L+S +R+  +  D  +YS ++ SL  T +  D
Subjt:  WLQKHNLCFSYELLYSILIHALGRSEKLY-EAFILSQRQTL----TPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRT-NKID

Query:  VPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIF-EEMKEGGLRPRIKAFNALLKG
          +L++L G++   ++E D  L +D+I  FA+A  PD AL  L+  QA GL P+++   A+ISALG  GR  EAEA+F E    G ++PR +A+NALLKG
Subjt:  VPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIF-EEMKEGGLRPRIKAFNALLKG

Query:  YVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYN
        YV+  SLK AE ++ EM + G++PDE TY LL+DAY   GRWESAR LLKEMEA  VKP+S++FSR+LA +RDRG+WQ+ F VLREM+ S V+PDRHFYN
Subjt:  YVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYN

Query:  VMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPN
        VMIDTFGK+NCL HAM+ +++M  EGIEPDVVTWNTLID HCK G H+RAAELF+EM+E    P  TTYNIMIN LGEQE W+ V+++L +M+ QGL+PN
Subjt:  VMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPN

Query:  VITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKEN
        +ITYTTLVD+YG+SGR+ +AI+C+EAMK+ GLKPS TMY+AL+NA+AQRGL++ A+N  + M++DGL+ S+L LNSLINAFGEDRR +EAFSVLQ+M+EN
Subjt:  VITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKEN

Query:  DVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSALRYMK
         ++PDV+TYTTLMKALIRV++F+KVP +YEEMI SGC PD KARAMLRS L+Y+K
Subjt:  DVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSALRYMK

B8Y6I0 Pentatricopeptide repeat-containing protein 10, chloroplastic1.3e-5925.73Show/hide
Query:  AVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTP------LTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRT
        A++ W  K     +  L   +++ ALGR  +      L     L P        Y  ++ A +R    E+AL L + +R+ G     V Y++++    R 
Subjt:  AVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTP------LTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRT

Query:  NKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNAL
         +   P +  L  E+ +  +E DG   + +I    + G  DEA+ F   ++A G  P   T+ A++   G  G   EA  +  EM++ G +P    +N L
Subjt:  NKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNAL

Query:  LKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRH
           Y + G  +EA   +  M   GL P+  TY  ++ AY NVG+ + A  L  +M+     PN   ++ VL     +  +    E+L EM  S   P+R 
Subjt:  LKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRH

Query:  FYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGL
         +N M+   GK    D+     + M S G+E    T+NTLI  + + G    A +++ EM   G+ PC TTYN ++N L  Q  W   +S++ KM+++G 
Subjt:  FYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGL

Query:  LPNVITYTTLVDIYGQSG------------------------------------RFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRV
         PN  +Y+ L+  Y + G                                    R +      + +K+ G  P   ++N++++ +A+ G+  +A   +  
Subjt:  LPNVITYTTLVDIYGQSG------------------------------------RFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRV

Query:  MRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMK-ENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTP
        ++  GL P L+  NSL++ + +     EA  +L  +K    +KPDVV+Y T++    +     +   V  EM+  G  P
Subjt:  MRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMK-ENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTP

Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial5.1e-5928.46Show/hide
Query:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKA
        L+E+ I S R   TP+ +N L  A AR    +  L     M  +G + D    +++I    R  K+       + G       E D    + ++ GF   
Subjt:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKA

Query:  GDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLID
        G   EA+  +  +      P   T   +I+ L   GR  EA  + + M E G +P    +  +L    K G+   A  +  +ME+  +      Y ++ID
Subjt:  GDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLID

Query:  AYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW
        +    G ++ A +L  EME + +K +   +S ++    + G+W    ++LREM   N+ PD   ++ +ID F K   L  A E Y+ M++ GI PD +T+
Subjt:  AYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW

Query:  NTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP
        N+LID  CK      A ++F  M  +G  P   TY+I+INS  + ++ D+   L  ++ S+GL+PN ITY TLV  + QSG+ N A E  + M S G+ P
Subjt:  NTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP

Query:  SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL
        S   Y  L++     G   +A+  +  M+   +   +   N +I+      +  +A+S+   + +  VKPDVVTY  ++  L +    ++   ++ +M  
Subjt:  SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL

Query:  SGCTPD
         GCTPD
Subjt:  SGCTPD

Q84ZD2 Pentatricopeptide repeat-containing protein CRP1 homolog, chloroplastic1.5e-19652.29Show/hide
Query:  PATQFPSPTIFLHHHHHHLPHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSDSDSEVEFDEAASP
        PA+    PT+  H H   LP           A  + +SSPSA                    RYDF PLL +LS +S+S                 +  P
Subjt:  PATQFPSPTIFLHHHHHHLPHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSDSDSEVEFDEAASP

Query:  TSLDP-TEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLY-EAFILSQRQTL----TPLTYNALIG
        TS+ P TE +LA +Y AVPA  WH+LL+ L ++ +S+ L +A++ +L +H LCF  +LL S L+H+L  S +L   + +LS   +L    +PL  N+L+ 
Subjt:  TSLDP-TEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLY-EAFILSQRQTL----TPLTYNALIG

Query:  ACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRT-NKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKT
        A A  +    AL L+  +R+  +  D  +YS ++ SL  T +  D  +L +L G++   ++E D  L +D+I  FA+A  PD AL  L+  QA GL P++
Subjt:  ACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRT-NKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKT

Query:  STFVAIISALGNYGRTEEAEAIF-EEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEAR
        +   A+IS+LG+  R  EAEA+F E    G ++PR +A+NALLKGYVK GSLK AE ++ EM + G++PDE TY LL+DAY   GRWESAR LLKEMEA 
Subjt:  STFVAIISALGNYGRTEEAEAIF-EEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEAR

Query:  DVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQ
         VKP+S++FSR+LA +RDRGEWQ+ F VLREM  S V+PDRHFYNVMIDTFGK+NCL HAM+ +DRM  EGIEPDVVTWNTLID HCK G H+RA ELF 
Subjt:  DVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQ

Query:  EMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQA
        EM+E       TTYNIMIN LGE+++W+ V+++L +M+ QGL+PN+ITYTTLVD+YG+SGRF +A++C+EAMK+ GLKPS TMY+AL+NA+AQRGL++ A
Subjt:  EMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQA

Query:  VNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSALRYMK
        +N  + MR+DGL+ S + LNSLINAFGEDRR  EAFSVLQ+MKEN ++PDV+TYTTLMKALIRV++F KVP +YEEMI SGC PD KARAMLRSALRYMK
Subjt:  VNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSALRYMK

Q8L844 Pentatricopeptide repeat-containing protein At5g42310, chloroplastic7.1e-29572.66Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHLPHLSLPF--ISAAAAAATLTSSPSAVTCYTSS-------DALDLDHASLQSRRYDFAPLLHFLSRSST
        MLLL  PPL     +T+F S     HHHHHH      P    SA  +A+  + SPS+ + Y SS       +  D + +S   RRYDF+PLL FLSR   
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHLPHLSLPF--ISAAAAAATLTSSPSAVTCYTSS-------DALDLDHASLQSRRYDFAPLLHFLSRSST

Query:  SATAGAVSDSDSEVEFDEAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFIL
           A    DS+SE E    ASP SL+P EF L E+YRAVPAP WHSL+KSL SS SS+GL YAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAF+L
Subjt:  SATAGAVSDSDSEVEFDEAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFIL

Query:  SQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEAL
        SQ+QTLTPLTYNALIGACARNND+EKALNL+++MRQDGYQSDFVNYSL+IQSLTR+NKID  +L +LY EIE DK+ELD QL+NDII+GFAK+GDP +AL
Subjt:  SQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEAL

Query:  YFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGR
          L M QA+GL+ KT+T V+IISAL + GRT EAEA+FEE+++ G++PR +A+NALLKGYVK G LK+AES+VSEMEK G+SPDEHTY LLIDAY N GR
Subjt:  YFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGR

Query:  WESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCH
        WESAR +LKEMEA DV+PNSF+FSR+LA +RDRGEWQ+TF+VL+EMK+  VKPDR FYNV+IDTFGKFNCLDHAM T+DRMLSEGIEPD VTWNTLIDCH
Subjt:  WESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCH

Query:  CKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNA
        CKHG H  A E+F+ M+ RG LPCATTYNIMINS G+QE+WD++K LLGKM+SQG+LPNV+T+TTLVD+YG+SGRFNDAIECLE MKS GLKPSSTMYNA
Subjt:  CKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNA

Query:  LINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDG
        LINA+AQRGLSEQAVNA+RVM SDGLKPSLLALNSLINAFGEDRRD EAF+VLQYMKEN VKPDVVTYTTLMKALIRVDKF KVP VYEEMI+SGC PD 
Subjt:  LINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDG

Query:  KARAMLRSALRYMKRTL
        KAR+MLRSALRYMK+TL
Subjt:  KARAMLRSALRYMKRTL

Arabidopsis top hitse value%identityAlignment
AT1G62670.1 rna processing factor 21.8e-5927.1Show/hide
Query:  YSILIHALGRSEKLYEAFIL-SQRQTL----TPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDK
        +S L+ A+ +  K      L  Q Q L       TY+ LI    R + L  AL ++ +M + GY+ + V  S ++     + +I   +   L  ++    
Subjt:  YSILIHALGRSEKLYEAFIL-SQRQTL----TPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDK

Query:  IELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSE
         + +    N +I G        EA+  +  + A G  P   T+  +++ L   G T+ A  +  +M++G L P +  +N ++ G  K   + +A ++  E
Subjt:  IELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSE

Query:  MEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAM
        ME  G+ P+  TY  LI    N GRW  A  LL +M  R + P+ F FS ++ ++   G+     ++  EM   ++ P    Y+ +I+ F   + LD A 
Subjt:  MEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAM

Query:  ETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGR
        + ++ M+S+   PDVVT+NTLI   CK+   E   E+F+EM +RG +    TYNI+I  L +    D  + +  +M S G+ PN++TY TL+D   ++G+
Subjt:  ETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGR

Query:  FNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKAL
           A+   E ++ + ++P+   YN +I    + G  E   + +  +   G+KP ++A N++I+ F       EA ++ + MKE+   P+   Y TL++A 
Subjt:  FNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKAL

Query:  IRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSAL
        +R         + +EM   G   D     ++ + L
Subjt:  IRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSAL

AT2G31400.1 genomes uncoupled 18.1e-6028.76Show/hide
Query:  NALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGL
        +AL  A   + D E   +LM         SD   Y  II+ L   N+ D  +    +      +    G+L + +I    + G    A        A G 
Subjt:  NALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGL

Query:  NPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRG-SLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKE
              F A+ISA G  G  EEA ++F  MKE GLRP +  +NA++    K G   K+      EM+++G+ PD  T+  L+   +  G WE+ARNL  E
Subjt:  NPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRG-SLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKE

Query:  MEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAA
        M  R ++ + F ++ +L +    G+    FE+L +M    + P+   Y+ +ID F K    D A+  +  M   GI  D V++NTL+  + K G  E A 
Subjt:  MEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAA

Query:  ELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGL
        ++ +EM   G      TYN ++   G+Q K+DEVK +  +M+ + +LPN++TY+TL+D Y + G + +A+E     KSAGL+    +Y+ALI+A  + GL
Subjt:  ELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGL

Query:  SEQAVNAYRVMRSDGLKPSLLALNSLINAFG------------------------------EDRRDIEAF--------------------------SVLQ
           AV+    M  +G+ P+++  NS+I+AFG                              E  R I+ F                           V +
Subjt:  SEQAVNAYRVMRSDGLKPSLLALNSLINAFG------------------------------EDRRDIEAF--------------------------SVLQ

Query:  YMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL
         M + ++KP+VVT++ ++ A  R + F     + EE+ L
Subjt:  YMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL

AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein3.6e-6028.46Show/hide
Query:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKA
        L+E+ I S R   TP+ +N L  A AR    +  L     M  +G + D    +++I    R  K+       + G       E D    + ++ GF   
Subjt:  LYEAFILSQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKA

Query:  GDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLID
        G   EA+  +  +      P   T   +I+ L   GR  EA  + + M E G +P    +  +L    K G+   A  +  +ME+  +      Y ++ID
Subjt:  GDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLID

Query:  AYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW
        +    G ++ A +L  EME + +K +   +S ++    + G+W    ++LREM   N+ PD   ++ +ID F K   L  A E Y+ M++ GI PD +T+
Subjt:  AYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTW

Query:  NTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP
        N+LID  CK      A ++F  M  +G  P   TY+I+INS  + ++ D+   L  ++ S+GL+PN ITY TLV  + QSG+ N A E  + M S G+ P
Subjt:  NTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKP

Query:  SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL
        S   Y  L++     G   +A+  +  M+   +   +   N +I+      +  +A+S+   + +  VKPDVVTY  ++  L +    ++   ++ +M  
Subjt:  SSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMIL

Query:  SGCTPD
         GCTPD
Subjt:  SGCTPD

AT5G42310.1 Pentatricopeptide repeat (PPR-like) superfamily protein5.0e-29672.66Show/hide
Query:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHLPHLSLPF--ISAAAAAATLTSSPSAVTCYTSS-------DALDLDHASLQSRRYDFAPLLHFLSRSST
        MLLL  PPL     +T+F S     HHHHHH      P    SA  +A+  + SPS+ + Y SS       +  D + +S   RRYDF+PLL FLSR   
Subjt:  MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHLPHLSLPF--ISAAAAAATLTSSPSAVTCYTSS-------DALDLDHASLQSRRYDFAPLLHFLSRSST

Query:  SATAGAVSDSDSEVEFDEAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFIL
           A    DS+SE E    ASP SL+P EF L E+YRAVPAP WHSL+KSL SS SS+GL YAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAF+L
Subjt:  SATAGAVSDSDSEVEFDEAASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFIL

Query:  SQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEAL
        SQ+QTLTPLTYNALIGACARNND+EKALNL+++MRQDGYQSDFVNYSL+IQSLTR+NKID  +L +LY EIE DK+ELD QL+NDII+GFAK+GDP +AL
Subjt:  SQRQTLTPLTYNALIGACARNNDLEKALNLMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEAL

Query:  YFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGR
          L M QA+GL+ KT+T V+IISAL + GRT EAEA+FEE+++ G++PR +A+NALLKGYVK G LK+AES+VSEMEK G+SPDEHTY LLIDAY N GR
Subjt:  YFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFEEMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGR

Query:  WESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCH
        WESAR +LKEMEA DV+PNSF+FSR+LA +RDRGEWQ+TF+VL+EMK+  VKPDR FYNV+IDTFGKFNCLDHAM T+DRMLSEGIEPD VTWNTLIDCH
Subjt:  WESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCH

Query:  CKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNA
        CKHG H  A E+F+ M+ RG LPCATTYNIMINS G+QE+WD++K LLGKM+SQG+LPNV+T+TTLVD+YG+SGRFNDAIECLE MKS GLKPSSTMYNA
Subjt:  CKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNA

Query:  LINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDG
        LINA+AQRGLSEQAVNA+RVM SDGLKPSLLALNSLINAFGEDRRD EAF+VLQYMKEN VKPDVVTYTTLMKALIRVDKF KVP VYEEMI+SGC PD 
Subjt:  LINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPDG

Query:  KARAMLRSALRYMKRTL
        KAR+MLRSALRYMK+TL
Subjt:  KARAMLRSALRYMKRTL

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein3.7e-5727.7Show/hide
Query:  YEAFILSQRQTLTPLTYNALIG-----------ACARNNDLEKALNLMSRMRQD-----GYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIEL
        Y+  + S    LT L  N  +G           +C    D    L+L  +M +D      Y+     Y+ ++ SL R   +D   ++++Y E+  DK+  
Subjt:  YEAFILSQRQTLTPLTYNALIG-----------ACARNNDLEKALNLMSRMRQD-----GYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIEL

Query:  DGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKTSTFVAII----------SALGNYG-------------------------RTEEAEAIFEEMKE
        +    N ++ G+ K G+ +EA  ++S +  +GL+P   T+ ++I          SA   +                          R +EA  +F +MK+
Subjt:  DGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKTSTFVAII----------SALGNYG-------------------------RTEEAEAIFEEMKE

Query:  GGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVL
            P ++ +  L+K         EA ++V EME++G+ P+ HTY +LID+  +  ++E AR LL +M  + + PN   ++ ++  Y  RG  +   +V+
Subjt:  GGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVL

Query:  REMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDE
          M++  + P+   YN +I  + K N +  AM   ++ML   + PDVVT+N+LID  C+ G  + A  L   M +RG +P   TY  MI+SL + ++ +E
Subjt:  REMKNSNVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDE

Query:  VKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGED
           L   ++ +G+ PNV+ YT L+D Y ++G+ ++A   LE M S    P+S  +NALI+     G  ++A      M   GL+P++     LI+   +D
Subjt:  VKSLLGKMQSQGLLPNVITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGED

Query:  RRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPD
             A+S  Q M  +  KPD  TYTT ++   R  +      +  +M  +G +PD
Subjt:  RRDIEAFSVLQYMKENDVKPDVVTYTTLMKALIRVDKFNKVPAVYEEMILSGCTPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTACTTCTCTCACCGCCGCCACTCTCCGGTCGCTTCCCTGCAACTCAATTCCCTTCTCCGACCATCTTCCTCCACCACCACCACCACCATTTACCCCACCTCTCTCT
TCCCTTCATTTCCGCCGCCGCCGCCGCCGCCACCTTGACCTCCTCCCCCTCCGCCGTTACCTGTTACACTTCCTCCGACGCACTTGACCTCGACCACGCCTCCCTTCAGA
GTCGCCGCTACGACTTCGCTCCCCTCCTCCACTTCCTTTCCCGTTCTTCCACGTCTGCCACCGCCGGAGCCGTGTCCGATTCCGATTCCGAGGTGGAATTCGACGAGGCA
GCTTCTCCAACTTCGCTCGACCCTACGGAGTTCCAGCTTGCGGAGGCGTATAGGGCTGTGCCGGCGCCTCTCTGGCACTCGCTCCTCAAATCTCTCTGCTCTTCTCCCTC
TTCGATTGGGCTGGGTTATGCGGTTGTTTCGTGGCTTCAGAAGCATAATTTGTGTTTCTCTTATGAATTGCTTTACTCGATTCTTATTCATGCGCTTGGCCGCTCTGAGA
AGCTCTATGAGGCTTTCATTCTCTCCCAGAGACAAACCCTAACCCCCTTAACGTATAATGCTCTGATTGGTGCTTGTGCTCGCAATAATGATTTGGAGAAGGCCCTCAAT
TTGATGTCTAGGATGCGGCAAGATGGTTACCAATCTGATTTTGTCAACTATAGTTTGATAATTCAGTCGCTTACTCGAACGAATAAGATTGATGTTCCAATCTTGCAGAA
GCTCTATGGAGAGATTGAGTCTGATAAAATTGAACTCGATGGGCAGCTCCTTAATGATATCATTTTGGGGTTTGCAAAAGCTGGAGATCCTGACGAAGCTCTGTATTTCT
TGTCCATGGTTCAGGCCAGTGGTTTGAACCCCAAAACTTCTACTTTTGTTGCGATAATTTCCGCATTGGGGAATTATGGGCGGACAGAGGAAGCTGAGGCTATCTTTGAG
GAAATGAAAGAAGGCGGATTGAGACCGAGGATCAAGGCTTTCAATGCTCTTCTTAAAGGTTATGTTAAAAGGGGTTCTCTGAAAGAAGCAGAATCTATTGTTTCAGAGAT
GGAAAAGAGTGGATTATCACCGGATGAGCACACATATGGTCTTCTCATTGATGCTTATGCAAATGTGGGCAGATGGGAAAGTGCAAGAAATTTGTTGAAAGAAATGGAAG
CTAGAGATGTAAAACCTAACTCCTTCATTTTCAGTAGGGTTTTAGCTAGTTATCGCGACCGGGGAGAATGGCAGAGAACATTTGAAGTTTTGAGGGAAATGAAGAACAGC
AACGTCAAACCTGATAGGCATTTTTACAATGTCATGATCGATACTTTTGGGAAGTTCAATTGCCTTGATCATGCCATGGAAACATATGACCGGATGCTTTCTGAGGGGAT
TGAACCAGACGTCGTTACTTGGAACACACTTATAGATTGTCATTGTAAGCACGGATACCATGAACGGGCTGCAGAGTTGTTCCAAGAAATGCAGGAGCGTGGTTACTTAC
CTTGTGCCACAACATATAATATTATGATCAATTCATTAGGAGAGCAGGAAAAATGGGATGAGGTAAAAAGCTTGTTAGGGAAGATGCAGAGTCAGGGCTTACTTCCCAAT
GTTATAACATACACTACCCTTGTCGATATATACGGACAGTCGGGGAGGTTTAATGACGCTATTGAGTGCTTGGAGGCCATGAAGTCTGCTGGGCTGAAACCATCCTCAAC
TATGTATAATGCTTTAATCAATGCCTTCGCTCAAAGAGGTTTGTCGGAGCAGGCAGTAAATGCATATAGAGTTATGAGATCAGATGGACTAAAGCCCAGTCTCTTGGCTC
TTAATTCATTGATCAACGCATTTGGCGAGGATAGGAGGGATATTGAAGCCTTTTCAGTCTTGCAGTACATGAAGGAAAATGATGTGAAGCCTGATGTTGTGACATATACA
ACACTTATGAAAGCTTTGATTCGTGTCGATAAATTTAACAAGGTTCCAGCTGTATATGAAGAGATGATTTTGTCTGGATGTACTCCTGACGGAAAGGCCAGAGCAATGTT
GCGGTCCGCCCTCAGATACATGAAGCGTACACTAAGTATA
mRNA sequenceShow/hide mRNA sequence
ATGCTACTTCTCTCACCGCCGCCACTCTCCGGTCGCTTCCCTGCAACTCAATTCCCTTCTCCGACCATCTTCCTCCACCACCACCACCACCATTTACCCCACCTCTCTCT
TCCCTTCATTTCCGCCGCCGCCGCCGCCGCCACCTTGACCTCCTCCCCCTCCGCCGTTACCTGTTACACTTCCTCCGACGCACTTGACCTCGACCACGCCTCCCTTCAGA
GTCGCCGCTACGACTTCGCTCCCCTCCTCCACTTCCTTTCCCGTTCTTCCACGTCTGCCACCGCCGGAGCCGTGTCCGATTCCGATTCCGAGGTGGAATTCGACGAGGCA
GCTTCTCCAACTTCGCTCGACCCTACGGAGTTCCAGCTTGCGGAGGCGTATAGGGCTGTGCCGGCGCCTCTCTGGCACTCGCTCCTCAAATCTCTCTGCTCTTCTCCCTC
TTCGATTGGGCTGGGTTATGCGGTTGTTTCGTGGCTTCAGAAGCATAATTTGTGTTTCTCTTATGAATTGCTTTACTCGATTCTTATTCATGCGCTTGGCCGCTCTGAGA
AGCTCTATGAGGCTTTCATTCTCTCCCAGAGACAAACCCTAACCCCCTTAACGTATAATGCTCTGATTGGTGCTTGTGCTCGCAATAATGATTTGGAGAAGGCCCTCAAT
TTGATGTCTAGGATGCGGCAAGATGGTTACCAATCTGATTTTGTCAACTATAGTTTGATAATTCAGTCGCTTACTCGAACGAATAAGATTGATGTTCCAATCTTGCAGAA
GCTCTATGGAGAGATTGAGTCTGATAAAATTGAACTCGATGGGCAGCTCCTTAATGATATCATTTTGGGGTTTGCAAAAGCTGGAGATCCTGACGAAGCTCTGTATTTCT
TGTCCATGGTTCAGGCCAGTGGTTTGAACCCCAAAACTTCTACTTTTGTTGCGATAATTTCCGCATTGGGGAATTATGGGCGGACAGAGGAAGCTGAGGCTATCTTTGAG
GAAATGAAAGAAGGCGGATTGAGACCGAGGATCAAGGCTTTCAATGCTCTTCTTAAAGGTTATGTTAAAAGGGGTTCTCTGAAAGAAGCAGAATCTATTGTTTCAGAGAT
GGAAAAGAGTGGATTATCACCGGATGAGCACACATATGGTCTTCTCATTGATGCTTATGCAAATGTGGGCAGATGGGAAAGTGCAAGAAATTTGTTGAAAGAAATGGAAG
CTAGAGATGTAAAACCTAACTCCTTCATTTTCAGTAGGGTTTTAGCTAGTTATCGCGACCGGGGAGAATGGCAGAGAACATTTGAAGTTTTGAGGGAAATGAAGAACAGC
AACGTCAAACCTGATAGGCATTTTTACAATGTCATGATCGATACTTTTGGGAAGTTCAATTGCCTTGATCATGCCATGGAAACATATGACCGGATGCTTTCTGAGGGGAT
TGAACCAGACGTCGTTACTTGGAACACACTTATAGATTGTCATTGTAAGCACGGATACCATGAACGGGCTGCAGAGTTGTTCCAAGAAATGCAGGAGCGTGGTTACTTAC
CTTGTGCCACAACATATAATATTATGATCAATTCATTAGGAGAGCAGGAAAAATGGGATGAGGTAAAAAGCTTGTTAGGGAAGATGCAGAGTCAGGGCTTACTTCCCAAT
GTTATAACATACACTACCCTTGTCGATATATACGGACAGTCGGGGAGGTTTAATGACGCTATTGAGTGCTTGGAGGCCATGAAGTCTGCTGGGCTGAAACCATCCTCAAC
TATGTATAATGCTTTAATCAATGCCTTCGCTCAAAGAGGTTTGTCGGAGCAGGCAGTAAATGCATATAGAGTTATGAGATCAGATGGACTAAAGCCCAGTCTCTTGGCTC
TTAATTCATTGATCAACGCATTTGGCGAGGATAGGAGGGATATTGAAGCCTTTTCAGTCTTGCAGTACATGAAGGAAAATGATGTGAAGCCTGATGTTGTGACATATACA
ACACTTATGAAAGCTTTGATTCGTGTCGATAAATTTAACAAGGTTCCAGCTGTATATGAAGAGATGATTTTGTCTGGATGTACTCCTGACGGAAAGGCCAGAGCAATGTT
GCGGTCCGCCCTCAGATACATGAAGCGTACACTAAGTATA
Protein sequenceShow/hide protein sequence
MLLLSPPPLSGRFPATQFPSPTIFLHHHHHHLPHLSLPFISAAAAAATLTSSPSAVTCYTSSDALDLDHASLQSRRYDFAPLLHFLSRSSTSATAGAVSDSDSEVEFDEA
ASPTSLDPTEFQLAEAYRAVPAPLWHSLLKSLCSSPSSIGLGYAVVSWLQKHNLCFSYELLYSILIHALGRSEKLYEAFILSQRQTLTPLTYNALIGACARNNDLEKALN
LMSRMRQDGYQSDFVNYSLIIQSLTRTNKIDVPILQKLYGEIESDKIELDGQLLNDIILGFAKAGDPDEALYFLSMVQASGLNPKTSTFVAIISALGNYGRTEEAEAIFE
EMKEGGLRPRIKAFNALLKGYVKRGSLKEAESIVSEMEKSGLSPDEHTYGLLIDAYANVGRWESARNLLKEMEARDVKPNSFIFSRVLASYRDRGEWQRTFEVLREMKNS
NVKPDRHFYNVMIDTFGKFNCLDHAMETYDRMLSEGIEPDVVTWNTLIDCHCKHGYHERAAELFQEMQERGYLPCATTYNIMINSLGEQEKWDEVKSLLGKMQSQGLLPN
VITYTTLVDIYGQSGRFNDAIECLEAMKSAGLKPSSTMYNALINAFAQRGLSEQAVNAYRVMRSDGLKPSLLALNSLINAFGEDRRDIEAFSVLQYMKENDVKPDVVTYT
TLMKALIRVDKFNKVPAVYEEMILSGCTPDGKARAMLRSALRYMKRTLSI