CuGenDBv2

Sed0025996 (gene) of Chayote v1 genome

Gene ID: Sed0025996
Organism: Sechium edule (Chayote v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG08:27162170..27166420
RNA-Seq Expression: Sed0025996
Synteny: Sed0025996
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR032867 - DYW domain


Homology
GenBank top hits    e value    %identity    Alignment
KAG6588362.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]    0.0e+00    87.34
Query:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN
        M MLKL  PI+ LAPVKFTPFL  SN L SPLLD +KLLK+AADAKNLKFGR IHAHLIITN +  DCRVN +NSLINLYVKCDEL +AR+MFDRMS+RN
Subjt:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN

Query:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG
        VVSWCALMAGYM+NGSPLEVF LFK M+VKDNIFPNEYVI+ V+SSC D QMYVEG+QCHGF+LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPG
Subjt:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG

Query:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ
        YDIFCYNLVL GLLEHSH+REAIEVL+L+IGEG +WNNAT+VTIF +CA+LKDLK GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVL GRAFFDQLQ
Subjt:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ

Query:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM
        ++NVVSWTAIMAAY QNGFFEEALNL SKME DHIPPNEYTLAVLLNSAAGLSA  HG+QLHARAEKSGLKGN+IVGNALIIMY+KSGDILAAQ +FSNM
Subjt:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM

Query:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
        KCCDS+TWNAIITG+SHH +GKEAL++F DMLTARECPNYVTFIGVLSACAHLS VDEG YYFNHLMK FGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
Subjt:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR

Query:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE
        SNPINWDVVAWRTLLNACY+HRNYDKGKQIA+YLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKH E
Subjt:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE

Query:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF
        S QIY+ VRDLL+KI+ LGY+PDIAGVLHDIEDEQKL NLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMC DCHTAIKLISK+ NR IIVRDANRFHHF
Subjt:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF

Query:  QDGVCSCGDYW
        QDG CSCGDYW
Subjt:  QDGVCSCGDYW

KAG7022211.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]    0.0e+00    87.32
Query:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN
        M MLKL  PI+ LAPVKFTPFL  SN L SPLLD +KLLK+AADAKNLKFGR IHAHLIITN +  DCRVN +NSLINLYVKCDEL +AR+MFDRMS+RN
Subjt:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN

Query:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG
        VVSWCALMAGYM+NGSPLEVF LFK M+VKDNIFPNEYVI+ V+SSC D QMYVEG+QCHGF+LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPG
Subjt:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG

Query:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ
        YDIFCYNLVL GLLEHSH+REAIEVL+L+IGEG +WNNAT+VTIF +CA+LKDLK GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVL GRAFFDQLQ
Subjt:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ

Query:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM
        ++NVVSWTAIMAAY QNGFFEEALNL SKME DHIPPNEYTLAVLLNSAAGLSA  HG+QLHARAEKSGLKGN+IVGNALIIMY+KSGDILAAQ +FSNM
Subjt:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM

Query:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
        KCCDS+TWNAIITG+SHH +GKEAL++F DMLTARECPNYVTFIGVLSACAHLS VDEG YYFNHLMK FGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
Subjt:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR

Query:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE
        SNPINWDVVAWRTLLNACY+HRNYDKGKQIA+YLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKH E
Subjt:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE

Query:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF
        S QIY+ VRDLL+KI+ LGY+PDIAGVLHDIEDEQKL+NLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMC DCHTAIKLISK+ NR IIVRDANRFHHF
Subjt:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF

Query:  QD
        QD
Subjt:  QD

XP_022933883.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita moschata]    0.0e+00    87.34
Query:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN
        M MLKL  PI+ LAPVKFTPFL  SN L SPLLD +KLLK+AADAKNLKFGR IHAHLIITN +  DCRVN +NSLINLYVKCDEL +AR+MFDRMS+RN
Subjt:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN

Query:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG
        VVSWCALMAGYM+NGSPLEVF LFK M+VKDNIFPNEYVI+ V+SSC D QMYVEG+QCHGF+LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPG
Subjt:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG

Query:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ
        YD+FCYNLVL GLLEHSH+REAIEVL+L+IGEG +WNNAT+VTIF +CA+LKDLK GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVL GRAFFDQLQ
Subjt:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ

Query:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM
        ++NVVSWTAIMAAY QNGFFEEALNL SKME DHIPPNEYTLAVLLNSAAGLSA  HG+QLHARAEKSGLKGN+IVGNALIIMY+KSGDILAAQ +FSNM
Subjt:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM

Query:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
        KCCDS+TWNAIITG+SHH +GKEAL++F DMLTARECPNYVTFIGVLSACAHLS VDEG YYFNHLMK FGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
Subjt:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR

Query:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE
        SNPINWDVVAWRTLLNACY+HRNYDKGKQIA+YLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKH E
Subjt:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE

Query:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF
        S QIY+ VRDLL+KI+ LGYVPDIAGVLHDIEDEQKL+NLSYHSEKLAVAYGLMK PSGAPIRVIKNLRMC DCHTAIKLISK+ NR IIVRDANRFHHF
Subjt:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF

Query:  QDGVCSCGDYW
        QDG CSCGDYW
Subjt:  QDGVCSCGDYW

XP_023002421.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita maxima]    0.0e+00    86.92
Query:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN
        M MLKL  PI+ LAPVKFTPFL  SN L SPLLD +KLLK+AADAKNLKFGR+IHAHL+ITN I RDCRVN +NSLINLYVKCDEL +AR+MFDRMS+RN
Subjt:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN

Query:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG
        VVSWCALMAGYM+NGSPL+VF LFK M+VKDNIFPNEYVI+ V+SSC D QMYVEGKQCHGF+LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPG
Subjt:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG

Query:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ
        YD+FCYNLVL GLLEHSH+ EAIEVL+L+I EG +WNNAT+VTIF +CA+LKDLKLGK VHA+MLKSDID DVYIGSSIIDMYGKCGNVL GRAFFDQLQ
Subjt:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ

Query:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM
        ++NVVSWTAIMAAY QNGFFEEALNL SKME DHIPPNEYTLAVLLNSAAGLSA  HG+QLHARAEKSGLKGN+IVGNALIIMY+KSGDILAAQ +FSNM
Subjt:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM

Query:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
        KCCDS+TWNAIITG+SHH +GKEAL++F DMLTARECPNYVTFIGVLSACAHLS VDEG YYFNHLMK  GIVPGLEHYTCIVGLLSRSGRLDEAENFMR
Subjt:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR

Query:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE
        SNPINWDVVAWRTLLNACY+HRNYDKGKQIA+YLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKH E
Subjt:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE

Query:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF
        S QIY+ +RDLL+KI+ LGYVPDIAGVLHDIEDEQK++NLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMC DCHTAIKLISKV NR IIVRDANRFHHF
Subjt:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF

Query:  QDGVCSCGDYW
        QDG CSCGDYW
Subjt:  QDGVCSCGDYW

XP_023529544.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita pepo subsp. pepo]    0.0e+00    86.78
Query:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN
        M MLKL  PI+ LAPVKFTPFL  SN L SPLLD +KLLK+AADAKNLKFGR IHAHLIITN +  DCRVN +NSLINLYVKCDEL +AR+MFDRMS+RN
Subjt:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN

Query:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG
        VVSWCALMAGYM+NGSPL VF LFK M+VKDNIFPNEYVI+ V+SSC D QMYVEG+QCHGF+LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPG
Subjt:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG

Query:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ
        YD+FCYNLVL GLLEHSH+REAIEVL+L+I EG +WNNAT+VTIF +CA+LKDL+ GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVL GRAFFDQLQ
Subjt:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ

Query:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM
        ++NVVSWTAIMAAY QNGFFEEALNL SKME DHIPPNEYTLAVLLNSAAGLSA  HG+QLHARAEKSGLKGN+IVGNALIIMY+KSGDILAAQ +FSNM
Subjt:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM

Query:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
         CCDS+TWNAIITG+SHH +GKEAL++F DMLTARECPNYVTFIGVLSACAHLS VDEG YYFNHLMK FGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
Subjt:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR

Query:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE
        SNPINWDVVAWRTLLNACY+HRNYDKGKQIA+YLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKH E
Subjt:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE

Query:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF
        S QIY+ VRDLL+KI+ LGYVPDIAGVLHDIEDEQKL+NLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMC DCHTAIKLISK+ NR IIVRDANRFHHF
Subjt:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF

Query:  QDGVCSCGDYW
        QDG CSCGDYW
Subjt:  QDGVCSCGDYW

TrEMBL top hits    e value    %identity    Alignment
A0A0A0KR26 DYW_deaminase domain-containing protein    0.0e+00    80.82
Query:  MPMLKLPITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVV
        M +LKLPIT + PVKFTPFL  SN+L SP  D IKLLK+AADAKNLKFGR IHAHL ITNH +RD +VN +NSLINLYVKCDE+ +AR++FD M RRNVV
Subjt:  MPMLKLPITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVV

Query:  SWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD
        SW ALMAGYM+NG+PLEVF LFK MVVKDNIFPNEYVI+  +SS CD QMYVEGKQCHG+ALKSGLE HQYVKNALIQ+YSKCSDV AA+QIL TVPG D
Subjt:  SWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD

Query:  IFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQ
        IFCYNLV+ GLL+H+H+ EA++VL+LII EGIEWNNATYVTIF LCA+LKD+ LGKQVHAQMLKSDID DVYIGSSIIDMYGKCGNVL GR FFD+LQS+
Subjt:  IFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQ

Query:  NVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKC
        NVVSWT+I+AAY QN FFEEALNL SKME D IPPNEYT+AVL NSAAGLSA C G+QLHARAEKSGLKGN++VGNALIIMY KSGDILAAQ +FSNM C
Subjt:  NVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKC

Query:  CDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSN
        C+ +TWNAIITG+SHHGLGKEALSMF DM+   E PNYVTFIGV+ ACAHL  VDEG+YYFNHLMK F IVPGLEHYTCIVGLLSRSGRLDEAENFMRS+
Subjt:  CDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSN

Query:  PINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESR
         INWDVV+WRTLLNACY+H++YDKG++IA+YLLQ++  DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KH E+ 
Subjt:  PINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESR

Query:  QIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQD
         IY+ V+DLLSKI+ LGYVPDI  VLHDIEDEQK++NLSYHSEKLAVAYGLMKTPSGAPI VIKNLRMC DCHTAIKLISKV NRVI+VRDANRFHHFQ+
Subjt:  QIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQD

Query:  GVCSCGDYW
        G CSCGDYW
Subjt:  GVCSCGDYW

A0A1S4E243 pentatricopeptide repeat-containing protein At5g39680 isoform X2    0.0e+00    79.41
Query:  MPMLKLPITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVV
        M +LKLPI+ + PVKFTPFL  S++  SP  D IKLLK+AADAKNL FGR I AHL ITNH +RD +VN +NSLINLYVKC E+ +AR++FD M RRNVV
Subjt:  MPMLKLPITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVV

Query:  SWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD
        SW  LMAGYM+NG+P EVF LFK MV+KDNI PN+YVI+ V+SS C+ QMYVEGKQCHG+ALKSGLE HQYVKNALIQ+YSKCSDV AA+QIL TVPG D
Subjt:  SWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD

Query:  IFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQ
        IFCYNLV+ GLL+H+H+REA++VL+LII +GIEWN+ATYVTIF LCA+LKD+ LGKQVHAQMLKSDID DVYIGSSIIDMYGKCGNVL GR FFD+LQS+
Subjt:  IFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQ

Query:  NVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKC
        NVVSWT+IMAAY QN FFEEAL+L SKME D IPPNEYT+AVL NSAAGLSA C G+QLHARAEKSGLKGN++VGNALIIMY KSGDILAAQ +FSNM C
Subjt:  NVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKC

Query:  CDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSN
        CD +TWNAIITG+SHHGLGKEALSMF DM+T  E PNYVTFIGV+SACAHL  VDEG+YYFNHLMK FGIVPGLEHYTCIVGLLSRSGRLDEAENFMRS+
Subjt:  CDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSN

Query:  PINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESR
         INWDVV+WRTLLNACY+H++YDKGKQIA+YLLQ++  DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KH ++ 
Subjt:  PINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESR

Query:  QIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQD
         IY+ V++LLSKI+ LGYVPDI  VLHDIEDEQK+ NLSYHSEKLAVAYGLMKT SG PIRVIKNLRMC DCHTAIKLIS+V NRVIIVRD NRFHHFQ+
Subjt:  QIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQD

Query:  GVCSCGDYW
        G CSCGDYW
Subjt:  GVCSCGDYW

A0A6J1CRJ5 pentatricopeptide repeat-containing protein At5g39680    0.0e+00    86.04
Query:  MPMLKLPITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVV
        MP LKLP +GL      PFLF SNY  SP  + IKLLKLAADAKNLKFGRIIHAHLIITNH   DCRVN +NSLIN Y KCDELLVAR+MFDRM +RNVV
Subjt:  MPMLKLPITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVV

Query:  SWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD
        SW ALMAGYM+NGS LEVF L K MVV+D+I PNEYVI+ +VSSCC  QMYVEGKQCHG+ALKSGLELHQYVKNALIQMYSKCSDVRAA+QILDTVPGYD
Subjt:  SWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYD

Query:  IFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQ
        IFCYNLVL GLLEHSH+REAIEVL L+IGE IEWNNATYVTIF LCA+LKDL+LGKQVHAQML++DIDYDVYIGSSIIDMYGKCG VL GR FFD+LQSQ
Subjt:  IFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQ

Query:  NVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKC
        NVVSWT IMAAY QNGFFEEALNL SKME D IPPNEYTLAV LNSAAGLSA  HG+QLHARAEKSGLKGN+IVGNALIIMY+KSGDILAAQH+FSNM C
Subjt:  NVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKC

Query:  CDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSN
        CDS+TWNAIITG+SHHGLGKEALSMF DML   ECPNYVTFIGVLSACAHLS V EG+YYFNHLMK FGIVPGLEHYTCI+GLLSRSG+LDEAENFMRSN
Subjt:  CDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSN

Query:  PINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESR
        PINWDVVAWRTLL ACY+HRNYDKGKQIA+YLLQMD EDVG+YILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKH ES 
Subjt:  PINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESR

Query:  QIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQD
        QIY+KVRDLLS+IQ LGYVPDIAGVLHDI+DEQKL+NLSYHSEKLAVAYGLMKTP GAPIRVIKNLRMC DCHTA+KLISKV NRVIIVRDANRFHHF+D
Subjt:  QIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQD

Query:  GVCSCGDYW
        G CSCGDYW
Subjt:  GVCSCGDYW

A0A6J1F637 pentatricopeptide repeat-containing protein At5g39680    0.0e+00    87.34
Query:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN
        M MLKL  PI+ LAPVKFTPFL  SN L SPLLD +KLLK+AADAKNLKFGR IHAHLIITN +  DCRVN +NSLINLYVKCDEL +AR+MFDRMS+RN
Subjt:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN

Query:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG
        VVSWCALMAGYM+NGSPLEVF LFK M+VKDNIFPNEYVI+ V+SSC D QMYVEG+QCHGF+LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPG
Subjt:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG

Query:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ
        YD+FCYNLVL GLLEHSH+REAIEVL+L+IGEG +WNNAT+VTIF +CA+LKDLK GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVL GRAFFDQLQ
Subjt:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ

Query:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM
        ++NVVSWTAIMAAY QNGFFEEALNL SKME DHIPPNEYTLAVLLNSAAGLSA  HG+QLHARAEKSGLKGN+IVGNALIIMY+KSGDILAAQ +FSNM
Subjt:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM

Query:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
        KCCDS+TWNAIITG+SHH +GKEAL++F DMLTARECPNYVTFIGVLSACAHLS VDEG YYFNHLMK FGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
Subjt:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR

Query:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE
        SNPINWDVVAWRTLLNACY+HRNYDKGKQIA+YLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKH E
Subjt:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE

Query:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF
        S QIY+ VRDLL+KI+ LGYVPDIAGVLHDIEDEQKL+NLSYHSEKLAVAYGLMK PSGAPIRVIKNLRMC DCHTAIKLISK+ NR IIVRDANRFHHF
Subjt:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF

Query:  QDGVCSCGDYW
        QDG CSCGDYW
Subjt:  QDGVCSCGDYW

A0A6J1KL92 pentatricopeptide repeat-containing protein At5g39680    0.0e+00    86.92
Query:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN
        M MLKL  PI+ LAPVKFTPFL  SN L SPLLD +KLLK+AADAKNLKFGR+IHAHL+ITN I RDCRVN +NSLINLYVKCDEL +AR+MFDRMS+RN
Subjt:  MPMLKL--PITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRN

Query:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG
        VVSWCALMAGYM+NGSPL+VF LFK M+VKDNIFPNEYVI+ V+SSC D QMYVEGKQCHGF+LKSGLELHQYVKNALIQMYSKCSDVRAAL+ILDTVPG
Subjt:  VVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPG

Query:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ
        YD+FCYNLVL GLLEHSH+ EAIEVL+L+I EG +WNNAT+VTIF +CA+LKDLKLGK VHA+MLKSDID DVYIGSSIIDMYGKCGNVL GRAFFDQLQ
Subjt:  YDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQ

Query:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM
        ++NVVSWTAIMAAY QNGFFEEALNL SKME DHIPPNEYTLAVLLNSAAGLSA  HG+QLHARAEKSGLKGN+IVGNALIIMY+KSGDILAAQ +FSNM
Subjt:  SQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNM

Query:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR
        KCCDS+TWNAIITG+SHH +GKEAL++F DMLTARECPNYVTFIGVLSACAHLS VDEG YYFNHLMK  GIVPGLEHYTCIVGLLSRSGRLDEAENFMR
Subjt:  KCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMR

Query:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE
        SNPINWDVVAWRTLLNACY+HRNYDKGKQIA+YLLQMDHEDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKH E
Subjt:  SNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSE

Query:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF
        S QIY+ +RDLL+KI+ LGYVPDIAGVLHDIEDEQK++NLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMC DCHTAIKLISKV NR IIVRDANRFHHF
Subjt:  SRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF

Query:  QDGVCSCGDYW
        QDG CSCGDYW
Subjt:  QDGVCSCGDYW

SwissProt top hits    e value    %identity    Alignment
Q9FK93 Pentatricopeptide repeat-containing protein At5g39680    1.4e-213    52.4
Query:  SNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLF
        S   P P+  L +LLK+ A++  L+ G  IHAHLI+TN   R      +NSLINLYVKC E + AR++FD M  RNVVSWCA+M GY  +G   EV  LF
Subjt:  SNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLF

Query:  KNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIE
        K+M       PNE+V +VV  SC +     EGKQ HG  LK GL  H++V+N L+ MYS CS    A+++LD +P  D+  ++  L G LE    +E ++
Subjt:  KNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIE

Query:  VLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEAL
        VL     E   WNN TY++   L + L+DL L  QVH++M++   + +V    ++I+MYGKCG VL  +  FD   +QN+   T IM AY Q+  FEEAL
Subjt:  VLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEAL

Query:  NLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEA
        NL SKM+   +PPNEYT A+LLNS A LS    G+ LH    KSG + +++VGNAL+ MYAKSG I  A+  FS M   D VTWN +I+G SHHGLG+EA
Subjt:  NLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEA

Query:  LSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNY
        L  F  M+   E PN +TFIGVL AC+H+  V++G +YFN LMK F + P ++HYTCIVGLLS++G   +AE+FMR+ PI WDVVAWRTLLNACY+ RNY
Subjt:  LSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNY

Query:  DKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDI
          GK++A+Y ++    D G Y+LLSN+HA+ R W+GV K+R LM  R VKKEPGVSW+ IRN  HVF +EDN+H E   IY KV++++SKI+ LGY PD+
Subjt:  DKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDI

Query:  AGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW
        AG  HD+++EQ+ +NLSYHSEKLAVAYGL+KTP  +P+ V KN+R+C DCH+AIKLISK+  R I++RD+NRFHHF DG CSC DYW
Subjt:  AGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070    3.7e-134    35.51
Query:  NSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQY
        N++++ Y K  ++    E FD++ +R+ VSW  ++ GY   G   +   +  +M VK+ I P ++ ++ V++S    +    GK+ H F +K GL  +  
Subjt:  NSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQY

Query:  VKNALIQMYSKCSDVRAALQILD-------------------------------TVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGEG-IEWNNATY
        V N+L+ MY+KC D   A  + D                                +   DI  +N ++ G  +  +   A+++   ++ +  +  +  T 
Subjt:  VKNALIQMYSKCSDVRAALQILD-------------------------------TVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGEG-IEWNNATY

Query:  VTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQ---------------------------------LQSQNVVSWT
         ++   CA L+ L +GKQ+H+ ++ +  D    + +++I MY +CG V   R   +Q                                 L+ ++VV+WT
Subjt:  VTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQ---------------------------------LQSQNVVSWT

Query:  AIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKC-CDSVT
        A++  Y Q+G + EA+NL   M      PN YTLA +L+ A+ L++  HG+Q+H  A KSG   ++ V NALI MYAK+G+I +A   F  ++C  D+V+
Subjt:  AIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKC-CDSVT

Query:  WNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWD
        W ++I   + HG  +EAL +F  ML     P+++T++GV SAC H   V++G  YF+ +     I+P L HY C+V L  R+G L EA+ F+   PI  D
Subjt:  WNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWD

Query:  VVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKK
        VV W +LL+AC +H+N D GK  A+ LL ++ E+ G Y  L+N+++   +W+   KIRK M++  VKKE G SW+E+++  HVF  ED  H E  +IY  
Subjt:  VVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKK

Query:  VRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSC
        ++ +  +I+ +GYVPD A VLHD+E+E K + L +HSEKLA+A+GL+ TP    +R++KNLR+C DCHTAIK ISK+  R IIVRD  RFHHF+DG CSC
Subjt:  VRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSC

Query:  GDYW
         DYW
Subjt:  GDYW

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g33170    1.6e-137    37.85
Query:  IKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFP
        I +L  A    +L  G+ +H   +    +  D  +   NSLIN+Y K  +   AR +FD MS R+++SW +++AG  +NG  +E   LF  + ++  + P
Subjt:  IKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFP

Query:  NEYVISVVV---SSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGE
        ++Y ++ V+   SS  +G      KQ H  A+K       +V  ALI  YS+   ++ A +IL     +D+  +N ++ G  +     + +++  L+  +
Subjt:  NEYVISVVV---SSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGE

Query:  GIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEA
        G   ++ T  T+F  C  L  +  GKQVHA  +KS  D D+++ S I+DMY KCG++   +  FD +   + V+WT +++  ++NG  E A ++ S+M  
Subjt:  GIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEA

Query:  DHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDML
          + P+E+T+A L  +++ L+A   G Q+HA A K     +  VG +L+ MYAK G I  A  LF  ++  +   WNA++ G + HG GKE L +F  M 
Subjt:  DHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDML

Query:  TARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIAD
        +    P+ VTFIGVLSAC+H   V E Y +   +   +GI P +EHY+C+   L R+G + +AEN + S  +      +RTLL AC +  + + GK++A 
Subjt:  TARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIAD

Query:  YLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIE
         LL+++  D   Y+LLSNM+A   +WD +   R +M+   VKK+PG SW+E++N  H+F  +D  + ++  IY+KV+D++  I+  GYVP+    L D+E
Subjt:  YLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIE

Query:  DEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW
        +E+K   L YHSEKLAVA+GL+ TP   PIRVIKNLR+C DCH A+K I+KV NR I++RDANRFH F+DG+CSCGDYW
Subjt:  DEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW

Q9SVP7 Pentatricopeptide repeat-containing protein At4g13650; e value 1.8e-133; % identity 37.05
Query:  SLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYV
        +L+NLY KC ++  A + F      NVV W  ++  Y         F +F+ M +++ I PN+Y    ++ +C        G+Q H   +K+  +L+ YV
Subjt:  SLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYV

Query:  KNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVY
         + LI MY+K   +  A  IL    G D+  +  ++ G  +++   +A+     ++  GI  +          CA L+ LK G+Q+HAQ   S    D+ 
Subjt:  KNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVY

Query:  IGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNI
          ++++ +Y +CG +      F+Q ++ + ++W A+++ + Q+G  EEAL +  +M  + I  N +T    + +A+  +    G+Q+HA   K+G     
Subjt:  IGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNI

Query:  IVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVP
         V NALI MYAK G I  A+  F  +   + V+WNAII  YS HG G EAL  F  M+ +   PN+VT +GVLSAC+H+  VD+G  YF  +   +G+ P
Subjt:  IVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVP

Query:  GLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVK
          EHY C+V +L+R+G L  A+ F++  PI  D + WRTLL+AC +H+N + G+  A +LL+++ ED  TY+LLSN++A  ++WD     R+ M+E+ VK
Subjt:  GLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVK

Query:  KEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDC
        KEPG SW+E++N  H F   D  H  + +I++  +DL  +   +GYV D   +L++++ EQK   +  HSEKLA+++GL+  P+  PI V+KNLR+C DC
Subjt:  KEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDC

Query:  HTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW
        H  IK +SKV NR IIVRDA RFHHF+ G CSC DYW
Subjt:  HTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g27610; e value 2.2e-134; % identity 37.5
Query:  GRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDG
        GR +  H ++  +   D  +   NSLINLY+KC  +  AR +FD+   ++VV+W ++++GY  NG  LE  G+F +M + + +  +E   + V+  C + 
Subjt:  GRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDG

Query:  QMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGY-DIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCA
        +     +Q H   +K G    Q ++ AL+  YSKC+ +  AL++   +    ++  +  ++ G L++    EA+++   +  +G+  N  TY  I     
Subjt:  QMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGY-DIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCA

Query:  TLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSA
         +       +VHAQ++K++ +    +G++++D Y K G V      F  +  +++V+W+A++A Y Q G  E A+ +  ++    I PNE+T + +LN  
Subjt:  TLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSA

Query:  AGLSAKC-HGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLS
        A  +A    G+Q H  A KS L  ++ V +AL+ MYAK G+I +A+ +F   +  D V+WN++I+GY+ HG   +AL +F +M   +   + VTFIGV +
Subjt:  AGLSAKC-HGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLS

Query:  ACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILL
        AC H   V+EG  YF+ +++   I P  EH +C+V L SR+G+L++A   + + P       WRT+L AC +H+  + G+  A+ ++ M  ED   Y+LL
Subjt:  ACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILL

Query:  SNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLA
        SNM+A    W    K+RKLM ERNVKKEPG SW+E++N  + F + D  H    QIY K+ DL ++++ LGY PD + VL DI+DE K   L+ HSE+LA
Subjt:  SNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLA

Query:  VAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF-QDGVCSCGDYW
        +A+GL+ TP G+P+ +IKNLR+C DCH  IKLI+K+E R I+VRD+NRFHHF  DGVCSCGD+W
Subjt:  VAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF-QDGVCSCGDYW

Arabidopsis top hits (e value, % identity, alignment)
AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein; e value 2.6e-135; % identity 35.51
Query:  NSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQY
        N++++ Y K  ++    E FD++ +R+ VSW  ++ GY   G   +   +  +M VK+ I P ++ ++ V++S    +    GK+ H F +K GL  +  
Subjt:  NSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQY

Query:  VKNALIQMYSKCSDVRAALQILD-------------------------------TVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGEG-IEWNNATY
        V N+L+ MY+KC D   A  + D                                +   DI  +N ++ G  +  +   A+++   ++ +  +  +  T 
Subjt:  VKNALIQMYSKCSDVRAALQILD-------------------------------TVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGEG-IEWNNATY

Query:  VTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQ---------------------------------LQSQNVVSWT
         ++   CA L+ L +GKQ+H+ ++ +  D    + +++I MY +CG V   R   +Q                                 L+ ++VV+WT
Subjt:  VTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQ---------------------------------LQSQNVVSWT

Query:  AIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKC-CDSVT
        A++  Y Q+G + EA+NL   M      PN YTLA +L+ A+ L++  HG+Q+H  A KSG   ++ V NALI MYAK+G+I +A   F  ++C  D+V+
Subjt:  AIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKC-CDSVT

Query:  WNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWD
        W ++I   + HG  +EAL +F  ML     P+++T++GV SAC H   V++G  YF+ +     I+P L HY C+V L  R+G L EA+ F+   PI  D
Subjt:  WNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWD

Query:  VVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKK
        VV W +LL+AC +H+N D GK  A+ LL ++ E+ G Y  L+N+++   +W+   KIRK M++  VKKE G SW+E+++  HVF  ED  H E  +IY  
Subjt:  VVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKK

Query:  VRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSC
        ++ +  +I+ +GYVPD A VLHD+E+E K + L +HSEKLA+A+GL+ TP    +R++KNLR+C DCHTAIK ISK+  R IIVRD  RFHHF+DG CSC
Subjt:  VRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSC

Query:  GDYW
         DYW
Subjt:  GDYW

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 1.5e-135; % identity 37.5
Query:  GRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDG
        GR +  H ++  +   D  +   NSLINLY+KC  +  AR +FD+   ++VV+W ++++GY  NG  LE  G+F +M + + +  +E   + V+  C + 
Subjt:  GRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDG

Query:  QMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGY-DIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCA
        +     +Q H   +K G    Q ++ AL+  YSKC+ +  AL++   +    ++  +  ++ G L++    EA+++   +  +G+  N  TY  I     
Subjt:  QMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGY-DIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCA

Query:  TLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSA
         +       +VHAQ++K++ +    +G++++D Y K G V      F  +  +++V+W+A++A Y Q G  E A+ +  ++    I PNE+T + +LN  
Subjt:  TLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSA

Query:  AGLSAKC-HGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLS
        A  +A    G+Q H  A KS L  ++ V +AL+ MYAK G+I +A+ +F   +  D V+WN++I+GY+ HG   +AL +F +M   +   + VTFIGV +
Subjt:  AGLSAKC-HGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLS

Query:  ACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILL
        AC H   V+EG  YF+ +++   I P  EH +C+V L SR+G+L++A   + + P       WRT+L AC +H+  + G+  A+ ++ M  ED   Y+LL
Subjt:  ACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILL

Query:  SNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLA
        SNM+A    W    K+RKLM ERNVKKEPG SW+E++N  + F + D  H    QIY K+ DL ++++ LGY PD + VL DI+DE K   L+ HSE+LA
Subjt:  SNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLA

Query:  VAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF-QDGVCSCGDYW
        +A+GL+ TP G+P+ +IKNLR+C DCH  IKLI+K+E R I+VRD+NRFHHF  DGVCSCGD+W
Subjt:  VAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHF-QDGVCSCGDYW

AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 1.3e-134; % identity 37.05
Query:  SLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYV
        +L+NLY KC ++  A + F      NVV W  ++  Y         F +F+ M +++ I PN+Y    ++ +C        G+Q H   +K+  +L+ YV
Subjt:  SLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYV

Query:  KNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVY
         + LI MY+K   +  A  IL    G D+  +  ++ G  +++   +A+     ++  GI  +          CA L+ LK G+Q+HAQ   S    D+ 
Subjt:  KNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVY

Query:  IGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNI
          ++++ +Y +CG +      F+Q ++ + ++W A+++ + Q+G  EEAL +  +M  + I  N +T    + +A+  +    G+Q+HA   K+G     
Subjt:  IGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNI

Query:  IVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVP
         V NALI MYAK G I  A+  F  +   + V+WNAII  YS HG G EAL  F  M+ +   PN+VT +GVLSAC+H+  VD+G  YF  +   +G+ P
Subjt:  IVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVP

Query:  GLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVK
          EHY C+V +L+R+G L  A+ F++  PI  D + WRTLL+AC +H+N + G+  A +LL+++ ED  TY+LLSN++A  ++WD     R+ M+E+ VK
Subjt:  GLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVK

Query:  KEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDC
        KEPG SW+E++N  H F   D  H  + +I++  +DL  +   +GYV D   +L++++ EQK   +  HSEKLA+++GL+  P+  PI V+KNLR+C DC
Subjt:  KEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDC

Query:  HTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW
        H  IK +SKV NR IIVRDA RFHHF+ G CSC DYW
Subjt:  HTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 1.1e-138; % identity 37.85
Query:  IKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFP
        I +L  A    +L  G+ +H   +    +  D  +   NSLIN+Y K  +   AR +FD MS R+++SW +++AG  +NG  +E   LF  + ++  + P
Subjt:  IKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLFKNMVVKDNIFP

Query:  NEYVISVVV---SSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGE
        ++Y ++ V+   SS  +G      KQ H  A+K       +V  ALI  YS+   ++ A +IL     +D+  +N ++ G  +     + +++  L+  +
Subjt:  NEYVISVVV---SSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIEVLELIIGE

Query:  GIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEA
        G   ++ T  T+F  C  L  +  GKQVHA  +KS  D D+++ S I+DMY KCG++   +  FD +   + V+WT +++  ++NG  E A ++ S+M  
Subjt:  GIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEA

Query:  DHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDML
          + P+E+T+A L  +++ L+A   G Q+HA A K     +  VG +L+ MYAK G I  A  LF  ++  +   WNA++ G + HG GKE L +F  M 
Subjt:  DHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDML

Query:  TARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIAD
        +    P+ VTFIGVLSAC+H   V E Y +   +   +GI P +EHY+C+   L R+G + +AEN + S  +      +RTLL AC +  + + GK++A 
Subjt:  TARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIAD

Query:  YLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIE
         LL+++  D   Y+LLSNM+A   +WD +   R +M+   VKK+PG SW+E++N  H+F  +D  + ++  IY+KV+D++  I+  GYVP+    L D+E
Subjt:  YLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIE

Query:  DEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW
        +E+K   L YHSEKLAVA+GL+ TP   PIRVIKNLR+C DCH A+K I+KV NR I++RDANRFH F+DG+CSCGDYW
Subjt:  DEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW

AT5G39680.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 9.7e-215; % identity 52.4
Query:  SNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLF
        S   P P+  L +LLK+ A++  L+ G  IHAHLI+TN   R      +NSLINLYVKC E + AR++FD M  RNVVSWCA+M GY  +G   EV  LF
Subjt:  SNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYMKNGSPLEVFGLF

Query:  KNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIE
        K+M       PNE+V +VV  SC +     EGKQ HG  LK GL  H++V+N L+ MYS CS    A+++LD +P  D+  ++  L G LE    +E ++
Subjt:  KNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREAIE

Query:  VLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEAL
        VL     E   WNN TY++   L + L+DL L  QVH++M++   + +V    ++I+MYGKCG VL  +  FD   +QN+   T IM AY Q+  FEEAL
Subjt:  VLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEAL

Query:  NLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEA
        NL SKM+   +PPNEYT A+LLNS A LS    G+ LH    KSG + +++VGNAL+ MYAKSG I  A+  FS M   D VTWN +I+G SHHGLG+EA
Subjt:  NLLSKMEADHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEA

Query:  LSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNY
        L  F  M+   E PN +TFIGVL AC+H+  V++G +YFN LMK F + P ++HYTCIVGLLS++G   +AE+FMR+ PI WDVVAWRTLLNACY+ RNY
Subjt:  LSMFWDMLTARECPNYVTFIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNY

Query:  DKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDI
          GK++A+Y ++    D G Y+LLSN+HA+ R W+GV K+R LM  R VKKEPGVSW+ IRN  HVF +EDN+H E   IY KV++++SKI+ LGY PD+
Subjt:  DKGKQIADYLLQMDHEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDI

Query:  AGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW
        AG  HD+++EQ+ +NLSYHSEKLAVAYGL+KTP  +P+ V KN+R+C DCH+AIKLISK+  R I++RD+NRFHHF DG CSC DYW
Subjt:  AGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW
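
The % identity reported for each hit can be roughly cross-checked against the alignment blocks above. In BLAST-style output the middle row of each block shows a letter where query and subject residues are identical, `+` where they are similar, and a space otherwise. The sketch below is a minimal illustration, assuming the `Query:` label and the leading indentation have already been stripped from each row; the function name `blast_identity` is hypothetical and not part of any database tooling.

```python
from typing import Iterable, Tuple


def blast_identity(blocks: Iterable[Tuple[str, str]]) -> Tuple[int, int, float]:
    """Estimate identity from (query_row, midline_row) pairs.

    Each row is assumed to have its label and leading indentation removed,
    so that column i of the midline lines up with column i of the query.
    Returns (identities, aligned_columns, percent_identity).
    """
    identities = 0
    columns = 0
    for query_row, midline_row in blocks:
        width = len(query_row)  # the midline may have lost trailing spaces
        columns += width
        # Letters mark identical residues; '+' and ' ' are not identities.
        identities += sum(ch.isalpha() for ch in midline_row[:width])
    if columns == 0:
        raise ValueError("no aligned columns supplied")
    return identities, columns, 100.0 * identities / columns


# Toy block built from the first columns of the top GenBank hit.
ident, cols, pct = blast_identity([("MPMLKL--PITGL", "M MLKL  PI+ L")])
print(ident, cols, round(pct, 2))
```

Gap columns (`-`) count toward the aligned length, which is the usual BLAST convention, so values computed this way over a full alignment should land close to the figure in the hit table.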


Sequences
CDS sequence
ATGCCAATGTTAAAGCTACCCATTACTGGCCTTGCCCCTGTGAAGTTCACCCCATTTCTATTCACCTCCAATTACTTGCCTTCCCCACTCCTAGACCTAATAAAGCTCTT
GAAACTAGCTGCTGATGCCAAGAACTTAAAATTTGGTAGAATTATCCATGCCCATTTGATCATTACCAATCACATCCATAGAGACTGCAGAGTAAATCACATGAACTCCC
TTATTAATTTGTATGTGAAATGTGATGAACTACTCGTTGCTCGCGAGATGTTCGATAGAATGTCTAGAAGAAATGTGGTGTCTTGGTGTGCTTTAATGGCTGGCTACATG
AAAAATGGGAGTCCCTTGGAAGTTTTTGGGCTGTTCAAAAACATGGTTGTGAAGGATAATATTTTCCCCAATGAATATGTGATTTCCGTTGTTGTATCTTCTTGTTGTGA
TGGTCAAATGTATGTTGAGGGAAAACAGTGTCATGGGTTTGCGTTGAAGTCTGGGTTGGAGCTTCATCAATATGTTAAGAATGCACTTATTCAGATGTACTCTAAATGTT
CTGATGTAAGAGCAGCATTGCAGATATTAGATACTGTGCCAGGTTATGACATATTTTGTTATAATTTGGTTCTAAAAGGCCTTCTAGAGCACTCACATATTAGAGAAGCT
ATAGAAGTTTTGGAGTTAATTATTGGTGAAGGCATAGAGTGGAATAATGCCACTTATGTTACAATTTTTCACCTTTGTGCTACTCTTAAAGATTTAAAATTAGGAAAGCA
AGTTCATGCTCAAATGTTGAAAAGCGATATCGACTATGATGTCTATATTGGAAGTTCTATCATAGATATGTATGGGAAATGTGGTAATGTGTTGGGTGGAAGAGCCTTTT
TTGATCAGTTACAAAGCCAAAATGTTGTTTCTTGGACAGCAATCATGGCAGCTTATTTACAGAATGGATTCTTCGAAGAAGCATTGAATCTGTTATCAAAGATGGAAGCG
GATCATATTCCTCCTAACGAATATACACTGGCAGTGTTGTTAAACTCTGCTGCTGGTTTGTCTGCAAAATGCCATGGCGAGCAGTTACATGCTCGTGCCGAGAAATCAGG
TCTTAAAGGCAATATTATAGTAGGGAATGCCTTGATCATAATGTATGCCAAGAGTGGGGACATTTTAGCAGCACAACATTTGTTCTCAAATATGAAATGCTGTGATTCCG
TTACCTGGAATGCGATAATAACTGGTTACTCCCACCATGGTCTTGGCAAGGAAGCTTTAAGCATGTTTTGGGACATGTTGACTGCTAGAGAATGTCCAAATTATGTCACC
TTTATTGGTGTTCTCTCTGCATGCGCCCATTTAAGCCGGGTAGACGAAGGATACTACTATTTTAATCATTTGATGAAACATTTTGGTATTGTTCCTGGGTTGGAGCACTA
TACCTGCATTGTTGGACTCCTAAGTAGATCTGGACGACTTGATGAAGCTGAGAATTTTATGAGGTCAAATCCAATCAATTGGGATGTTGTTGCGTGGCGTACCCTTCTCA
ATGCTTGTTACATTCATAGAAATTATGATAAAGGGAAACAAATAGCAGATTACTTACTACAGATGGACCATGAGGATGTAGGAACTTATATTCTATTATCAAACATGCAT
GCGAGAGTTAGGAGGTGGGATGGCGTTGTTAAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAAGAACCTGGAGTAAGCTGGTTAGAAATAAGAAATATTGCCCA
TGTTTTTACATCTGAAGATAATAAACACTCTGAGTCCAGACAAATTTATAAAAAGGTAAGAGACTTGTTATCTAAGATTCAACTATTGGGGTATGTTCCTGATATTGCTG
GGGTATTGCACGATATCGAGGATGAGCAAAAACTAGAAAATCTTAGCTATCACAGTGAGAAGCTTGCCGTAGCATATGGCCTGATGAAAACACCATCAGGTGCACCAATC
CGGGTGATTAAGAACCTTAGAATGTGCTATGATTGTCACACTGCTATCAAACTTATTTCAAAGGTTGAAAATAGGGTTATAATTGTTAGAGATGCCAATCGTTTCCATCA
TTTTCAAGATGGTGTTTGCTCGTGTGGAGATTATTGGTGA
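
As a sanity check, the CDS above should translate to the protein sequence listed for this gene, starting with `MPMLKL` and ending in the `DYW` motif followed by a stop codon. The following is a minimal sketch using the standard genetic code; the helper name `translate` is illustrative, and the example calls use only the first and last few codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


print(translate("ATGCCAATGTTAAAGCTA"))  # first six codons of the CDS
print(translate("GATTATTGGTGA"))        # last four codons of the CDS
```

Pasting the full CDS (line breaks included) into `translate` and dropping the trailing `*` should reproduce the protein sequence given further down, assuming the record is internally consistent.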
mRNA sequence
CCCGGTTTTTCCAAAATCGGTTTAGGTACGACAAAACCGAACCCCTAGTCGAGGAGACGGCGACTCGTTGAATGTAATTGTCAGCCATTGATGGTGCGGAAACTTTGGTT
GCTGTGCGAGTTATGATTTTGCGATTGTAAGATTTTACCATTATACTTCGCTTGCTAAATTTACAATTCTTTCTTCTGGTGTAATCAATTGGGGATTTAGCTTGTTATTT
AGGATGAGCGGCTGGTGATGGAATCTGCCCAAAAGCTGGTTCTGGCTCTTCCATTTTTTAGTTTTCTTTGTTCATTTCGTTTCTAACTACAAATTCATAGCATTGTAATG
CCAATGTTAAAGCTACCCATTACTGGCCTTGCCCCTGTGAAGTTCACCCCATTTCTATTCACCTCCAATTACTTGCCTTCCCCACTCCTAGACCTAATAAAGCTCTTGAA
ACTAGCTGCTGATGCCAAGAACTTAAAATTTGGTAGAATTATCCATGCCCATTTGATCATTACCAATCACATCCATAGAGACTGCAGAGTAAATCACATGAACTCCCTTA
TTAATTTGTATGTGAAATGTGATGAACTACTCGTTGCTCGCGAGATGTTCGATAGAATGTCTAGAAGAAATGTGGTGTCTTGGTGTGCTTTAATGGCTGGCTACATGAAA
AATGGGAGTCCCTTGGAAGTTTTTGGGCTGTTCAAAAACATGGTTGTGAAGGATAATATTTTCCCCAATGAATATGTGATTTCCGTTGTTGTATCTTCTTGTTGTGATGG
TCAAATGTATGTTGAGGGAAAACAGTGTCATGGGTTTGCGTTGAAGTCTGGGTTGGAGCTTCATCAATATGTTAAGAATGCACTTATTCAGATGTACTCTAAATGTTCTG
ATGTAAGAGCAGCATTGCAGATATTAGATACTGTGCCAGGTTATGACATATTTTGTTATAATTTGGTTCTAAAAGGCCTTCTAGAGCACTCACATATTAGAGAAGCTATA
GAAGTTTTGGAGTTAATTATTGGTGAAGGCATAGAGTGGAATAATGCCACTTATGTTACAATTTTTCACCTTTGTGCTACTCTTAAAGATTTAAAATTAGGAAAGCAAGT
TCATGCTCAAATGTTGAAAAGCGATATCGACTATGATGTCTATATTGGAAGTTCTATCATAGATATGTATGGGAAATGTGGTAATGTGTTGGGTGGAAGAGCCTTTTTTG
ATCAGTTACAAAGCCAAAATGTTGTTTCTTGGACAGCAATCATGGCAGCTTATTTACAGAATGGATTCTTCGAAGAAGCATTGAATCTGTTATCAAAGATGGAAGCGGAT
CATATTCCTCCTAACGAATATACACTGGCAGTGTTGTTAAACTCTGCTGCTGGTTTGTCTGCAAAATGCCATGGCGAGCAGTTACATGCTCGTGCCGAGAAATCAGGTCT
TAAAGGCAATATTATAGTAGGGAATGCCTTGATCATAATGTATGCCAAGAGTGGGGACATTTTAGCAGCACAACATTTGTTCTCAAATATGAAATGCTGTGATTCCGTTA
CCTGGAATGCGATAATAACTGGTTACTCCCACCATGGTCTTGGCAAGGAAGCTTTAAGCATGTTTTGGGACATGTTGACTGCTAGAGAATGTCCAAATTATGTCACCTTT
ATTGGTGTTCTCTCTGCATGCGCCCATTTAAGCCGGGTAGACGAAGGATACTACTATTTTAATCATTTGATGAAACATTTTGGTATTGTTCCTGGGTTGGAGCACTATAC
CTGCATTGTTGGACTCCTAAGTAGATCTGGACGACTTGATGAAGCTGAGAATTTTATGAGGTCAAATCCAATCAATTGGGATGTTGTTGCGTGGCGTACCCTTCTCAATG
CTTGTTACATTCATAGAAATTATGATAAAGGGAAACAAATAGCAGATTACTTACTACAGATGGACCATGAGGATGTAGGAACTTATATTCTATTATCAAACATGCATGCG
AGAGTTAGGAGGTGGGATGGCGTTGTTAAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAAGAACCTGGAGTAAGCTGGTTAGAAATAAGAAATATTGCCCATGT
TTTTACATCTGAAGATAATAAACACTCTGAGTCCAGACAAATTTATAAAAAGGTAAGAGACTTGTTATCTAAGATTCAACTATTGGGGTATGTTCCTGATATTGCTGGGG
TATTGCACGATATCGAGGATGAGCAAAAACTAGAAAATCTTAGCTATCACAGTGAGAAGCTTGCCGTAGCATATGGCCTGATGAAAACACCATCAGGTGCACCAATCCGG
GTGATTAAGAACCTTAGAATGTGCTATGATTGTCACACTGCTATCAAACTTATTTCAAAGGTTGAAAATAGGGTTATAATTGTTAGAGATGCCAATCGTTTCCATCATTT
TCAAGATGGTGTTTGCTCGTGTGGAGATTATTGGTGACAATTTTGTTGAGTTTCTCAATGGCTTGGATCTTTTGAAGATCAAAAAATTTCAACCTAGATTCTTTGGTTGC
GAAGTAAATAGACTCTACTATTTCAATCTGGTATTAGGCACGAATGGCACAAGTTGTTGAGCTGATCGAGTTCGCACATTGGGTTTTCCTGGTTATAATTTATAAACTCA
TTCTCTTAGAGTAGTCTTTTCTGGATTCAGATATTATAAGCAGCATGTGTAGATATTGGCGTTACAAGGCTTACATATATATGTCCCTTACCTTGGATTTCATGTTGGGG
GAAAGCAAAGGAGATGAATGTGGTAGGAACTTATGATTGATAAACTCAGAATGAAATTATATAAATGGAGATGTGATGAGGGATTACGATTGGTGCGGTGGTACGTATAA
ACTCAGTACAACTGATTAATTGGAAGATTTCTTTTCTGCCTACAGTCTATGGTGGACTTGGTGTGAGTTCCGTCTGCCGAAGAAATACTTCACTCTTACTACGGTGGCTT
GGCTGTTCGAGGGAGAAAAGGGTCCTTTTGGAAGGAGGGTGATTTTCGCAATTTATGGATGTGACTCAAATGGCTGCTGACAAAAGAGGTAAAAGAGAAAGCTGGTGAGC
ACATGATTTAGGGGATCCTTTAGGGCTGTGCAGTTAAAGCTATATTATGGATGACTTGGAAGGAGCATGATCTTAGTATCTTTGAGAATAGAAGAAATGTGGAAGATTCT
TTTGTTTATAGTATACAACACTTCTCCTTGGTGGTGTGCAACACTTCTCTTAGCCTTTGCATTAACTATAGGAAATTGGAGATTCTTTTGTGTTATTGTTTCAAGAAGGG
GT
Protein sequence
MPMLKLPITGLAPVKFTPFLFTSNYLPSPLLDLIKLLKLAADAKNLKFGRIIHAHLIITNHIHRDCRVNHMNSLINLYVKCDELLVAREMFDRMSRRNVVSWCALMAGYM
KNGSPLEVFGLFKNMVVKDNIFPNEYVISVVVSSCCDGQMYVEGKQCHGFALKSGLELHQYVKNALIQMYSKCSDVRAALQILDTVPGYDIFCYNLVLKGLLEHSHIREA
IEVLELIIGEGIEWNNATYVTIFHLCATLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLGGRAFFDQLQSQNVVSWTAIMAAYLQNGFFEEALNLLSKMEA
DHIPPNEYTLAVLLNSAAGLSAKCHGEQLHARAEKSGLKGNIIVGNALIIMYAKSGDILAAQHLFSNMKCCDSVTWNAIITGYSHHGLGKEALSMFWDMLTARECPNYVT
FIGVLSACAHLSRVDEGYYYFNHLMKHFGIVPGLEHYTCIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYIHRNYDKGKQIADYLLQMDHEDVGTYILLSNMH
ARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHSESRQIYKKVRDLLSKIQLLGYVPDIAGVLHDIEDEQKLENLSYHSEKLAVAYGLMKTPSGAPI
RVIKNLRMCYDCHTAIKLISKVENRVIIVRDANRFHHFQDGVCSCGDYW