; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011649 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011649
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153017:67451..71125
RNA-Seq ExpressionSgr011649
SyntenySgr011649
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016643.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0084.68Show/hide
Query:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF
        PPQIP+KP+K YF+YGHRHRNPKQHRPTVYGGLF+NRQSL+PPNP KPISPK QPF L+ WDPDR P +      PSPSEAFFSS+LRLSPIARFIVDAF
Subjt:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF

Query:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI
        RKNQNQWG PVISELNKLRRVTPDLVAE+LKAS R DSNPILASKFF+WAGKQKGFHHN+ASYNAFAY LNRHNRFRAADQIPELMDSQGKPPSEKQFEI
Subjt:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI

Query:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV
        LIRMH DANRGLRVYYVYEKMKKFGV+PRVFLYNRILDALVKTG+LDLALSVY DFQ+NGLVEE+ITFMILIKGLCKAGRV EMLELLGRMRAN CKPDV
Subjt:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV

Query:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG
        FAYTAMVKVLV+EENLEGCLRVWDEM AD VEPDVMAYGTLI+GLCK GR +K FELFQEMK KRILIDR IYGSLIEAFVQDEKVGLA DLLKDLVDSG
Subjt:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG

Query:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG
        YRADL IY+SLIKGLCNVNQVD+AYKLF++TIREDLKPDF TVKP+++  VETRRMEDLWKLLSL+QKLELSMDD+LSKF+S MVE+EDKIS ALD+F G
Subjt:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG

Query:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM
        L  KGYGS+A+YNI+IGALH+HGQ KKALEIYNDM++SN EPD STYSI VSC+VEV +IQEACASHNKIVELGSVPSVAAY SL+EGLFKICEIDAV+M
Subjt:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM

Query:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK
        LVRDCLAN ESGP EFKYALTIL  CKSGK E VI+VL+EM+QQ CP SAV YSAI+SGM KYGT+D AKKVFLHLKES  + EANCI+ EE+L+EHMKK
Subjt:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK

Query:  KTADL
        KTADL
Subjt:  KTADL

XP_022146449.1 pentatricopeptide repeat-containing protein At4g20740 [Momordica charantia]0.0e+0086.42Show/hide
Query:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSP--SEAFFSSSLRLSPIARFIVD
        PPQIPTKPHK YFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPP+PHKPISPKSQPFDL KWDPD S   SKLT PP P  SEAFFSSS+RLSPIARFI+D
Subjt:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSP--SEAFFSSSLRLSPIARFIVD

Query:  AFRKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQF
        AFRKNQN+WG+ VISELNKLRRVTPDLVAE+LKA  R DSNPILAS FFHWAGKQKGFHHNYASYNAFAY LNRHNRFRAADQIPELMDSQGKPPSEKQF
Subjt:  AFRKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQF

Query:  EILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKP
        EILIRMHSD NRGLRVYYVYEKMKKFGV+PRVFLYNRILDALVKTGHLDLALSVYGDFQ++GLVEESITFMILIKGLCKAGRV EMLELL RMRANLCKP
Subjt:  EILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKP

Query:  DVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVD
        DVFAYTAMVKVLV+EEN+EGCLRVWDEMRADGVEPDVMAY TLI GLCK G+A KG+ELFQ MKEKRILI RTIYGSLI+AFVQDEKVGLACDL KDLVD
Subjt:  DVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVD

Query:  SGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIF
        SGYRADLG+YNSLIKGLCN+NQVDKAYKLFQLTIREDLKP+FETVKP+L M V+++RMEDLW LLSL++KLELS+D +LS+F+SFMVEKEDKIS AL++F
Subjt:  SGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIF

Query:  HGLNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAV
         GLN KGYGS+A+YNI+IGALHQHGQ KKALEIY+DM+SSNF+PDSSTYSI VSC VE+GKIQEAC  HN+IVELGSVPS++AY SLAEGLFK CEIDAV
Subjt:  HGLNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAV

Query:  LMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHM
        LMLVRDCLAN+ESGP EFKYALTI+HVCKSGK ETVI+VL+EM+Q+DCPPS V YSAII GMSKYGT+D  KKVFLHLKES HLTEANCIM EELLI+HM
Subjt:  LMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHM

Query:  KKKTADL
        KKKTADL
Subjt:  KKKTADL

XP_022939585.1 pentatricopeptide repeat-containing protein At4g20740 [Cucurbita moschata]0.0e+0084.11Show/hide
Query:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF
        PPQIP+KP+K YF+YGHRHRNPKQHRPTVYGGLF+NRQSL+PPNP KPI PK QPF L+ WDPDR P +      PSPSEAFFSS+LRLSPIARFIVDAF
Subjt:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF

Query:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI
        RKNQNQWG PVISELNKLRRVTPDLVAE+LKAS R DSNPILASKFF+WAGKQKGFHHN+ASYNAFAY LNRHNRFRAADQIPELMDSQGKPPSEKQFEI
Subjt:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI

Query:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV
        LIRMH DANRGLRVYYVYEKMKKFGV+PRVFLYNRILDALVKTG++DLALSVY DFQ+NGLVEE+ITFMILIKGLCKAGRV EMLELLGRMRAN CKPDV
Subjt:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV

Query:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG
        FAYTAMVKVLV+EENLEGCLRVWDEM AD VEPDVMAYGTLI+GLCK GR +K FELFQEMK KRILIDR IYGSLIEAFVQDEKVGLA DLLKDLVDSG
Subjt:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG

Query:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG
        YRADL IY+SLIKGLCNVNQVD+AYKLF++TIREDLKPDF TVKP+++  VETRRMEDLWKLLSL+QKLELSMDD+LSKF+S MVE+EDKIS ALD+F G
Subjt:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG

Query:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM
        L  KGYGS+A+YNI+IGALH+HGQ KKALEIYNDM++SN +PD STYSI VSC+VEV +IQEACASHNKIVELGSVPSVAAY SL+EGLFKICEIDAV+M
Subjt:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM

Query:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK
        LVRDCLAN ESGP EFKYALTIL  CKSGK E VI+VL+EM+QQ CP SAV YSAI+SGM KYGT+D AKK FLHLKES  + EANCI+ EE+L+EHMKK
Subjt:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK

Query:  KTADL
        KTADL
Subjt:  KTADL

XP_023551127.1 pentatricopeptide repeat-containing protein At4g20740 [Cucurbita pepo subsp. pepo]0.0e+0084.82Show/hide
Query:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF
        PPQIP+KP+K YF+YGHRHRNPKQHRPTVYGGLF+NRQSL+PPNP KPISPK QPF L+ WDPDR P +      PSPSEAFFSS+LRLSPIARFIVDAF
Subjt:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF

Query:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI
        RKNQNQWG PVISELNKLRRVTPDLVAE+LKAS R DSNPILASKFF+WAGKQKGFHHN+ASYNAFAY LNRHNRFRAADQIPELMDSQGKPPSEKQFEI
Subjt:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI

Query:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV
        LIRMH DANRGLRVYYVYEKMKKFGV+PRVFLYNRILDALVKTG++DLALSVY DFQ+NGLVEE+ITFMILIKGLCKAGRV EMLELLGRMRAN CKPDV
Subjt:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV

Query:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG
        FAYTAMVKVLV+EENLEGCLRVWDEM AD VEPDVMAYGTLI GLCK GR QK FELFQEMK KRILIDR IYGSLIEAFVQDEKVGLA DLLKDLVDSG
Subjt:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG

Query:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG
        YRADLGIY+SLI+GLCN NQVD+AYKLF++TIREDL+PDF TVKP+++  VETRRMEDLWKLLSL+QKLELSMDD+LSKF+S MVE+EDK SAALD+F G
Subjt:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG

Query:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM
        LN KGYGS+A+YNI+IGALH+HGQ KKALEIYNDM+SSN EPDSSTYSI VSC+VEV +IQEACASHNKIVELGSVPSVAAY SL+EGLFKICEIDAV+M
Subjt:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM

Query:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK
        LVRDCLAN ESGP EFKYALTIL  CKSGK E VI+VL+EM+QQ CP SAVAYSAI+SGM KYGT+D AKKVFLHLKE   + EANCI+ EE+L+EHMKK
Subjt:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK

Query:  KTADL
        KTADL
Subjt:  KTADL

XP_038906442.1 pentatricopeptide repeat-containing protein At4g20740 [Benincasa hispida]0.0e+0086.67Show/hide
Query:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF
        PPQIP+K HK YFYYGHRHRNP QHRPTVYGGLFSNRQS+SPPNPHKPISPK +PF L  WDPDR P+       PS SEAFFSSSLRLSPIARFIVDAF
Subjt:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF

Query:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI
        RKNQNQWG PVISELNKLRRVTPDLVAE+LKAS R DSNPILASKFFHWAGKQKGFHHN+ASYNAFAY LNRHNRFRAADQIPELMDSQGKPPSEKQFEI
Subjt:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI

Query:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV
        LIRMH DANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKT HLDLALSVY DFQ+NGLVEESITFMILIKGLCKAGR+ EMLELL RMRANLCKPDV
Subjt:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV

Query:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG
        FAYTAMVKVLV+EENL+GCLRVWDEMRAD VEPDVMAYGTLI+GLCK GRAQKGFEL QEMK KRILIDR IYG+LIEAFVQDEKVGLACDL KDLVDSG
Subjt:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG

Query:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG
        YRADLGIY+SLIKGLCNVNQVDKAYKLFQLTIREDLKPD ETVKP++MM VETRRMEDLWKLL+L+QKLE S DD+LSKF+SFMVE+EDKIS ALD+F G
Subjt:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG

Query:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM
        +  KGYG++A+YNI++GALHQ+GQ KKALEIY+DM+SSN +PDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPS AAY SL+EGLFKICEIDAV+M
Subjt:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM

Query:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK
        LVRDCLAN+E+GP EFKYAL ILHVCK GK E V +V++EM+QQDCPPSAVAYSAIISGMSKYGTLD AKKVFLHL+ES  LTEANCI+ EELLIEHMKK
Subjt:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK

Query:  KTADL
        KTADL
Subjt:  KTADL

TrEMBL top hitse value%identityAlignment
A0A0A0KRS5 Uncharacterized protein0.0e+0083.74Show/hide
Query:  PTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAFRKNQ
        P KPHK YFYYGHRHRNP QHRPTVYGG F+NR+SL PP+PH+P SPK QPF L  WDPD  P+Q +   P S S+AFFS+SLRLSPIARFIVD FRKNQ
Subjt:  PTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAFRKNQ

Query:  NQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRM
        NQWG PVISELNKLRRVTPDLVAE+LKAS R DSN ILASKFF+WAGKQKGFHH +ASYNAFAY LNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRM
Subjt:  NQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRM

Query:  HSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDVFAYT
        H DANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKT HLDLAL+VY DFQ+NGLVEES+TFMILIKGLCKAGRV EMLELL RMRANLCKPDVFAYT
Subjt:  HSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDVFAYT

Query:  AMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSGYRAD
        AMVKVL +++NLEGCLRVWDEMRAD VEPDVMAYGTLI+GLCK GRAQKG+ELFQEMK KRILIDR IYG+LIEAFVQDEKVGLACDL KDLVDSGYRAD
Subjt:  AMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSGYRAD

Query:  LGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHGLNAK
        LGIY+SLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKP++MM VET RM+D WKL+SL+QKLE S+DD+LSKF+SFMVE+EDKIS ALD+FHG+  K
Subjt:  LGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHGLNAK

Query:  GYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRD
        GYGS+ALYN+M+GALH++GQ  KALEIYNDM++SN EP+S+TYSIA+ CFVE+GKIQEACASHNKIVELGSVPS+AAY SL+EGLFKICEI+AV+MLVRD
Subjt:  GYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRD

Query:  CLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKKKTAD
        CLAN+ESGP EFKYALTI+H CKSGK E VI+VL EM+ QDC PS+VAYSAIISGMSKYGTLD AKKVFLHL+E   LTEANCI+ EELLIEHMKKKTAD
Subjt:  CLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKKKTAD

Query:  L
        L
Subjt:  L

A0A5D3BGR3 Pentatricopeptide repeat-containing protein0.0e+0083.59Show/hide
Query:  PTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAFRKNQ
        P KPHK YFYYGHRHRNP QHRPTVYGG F+NR+SL PP+PH+PISPK QPF L  WDPD  P+Q +   P S S+AFFS+SLRLSPIARFIVD FRKNQ
Subjt:  PTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAFRKNQ

Query:  NQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRM
        NQWG PVISELNKLRRVTPDLVAE+LKAS R DSNPILASKFF+WAGKQKGFHH +ASYNAFAY LNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRM
Subjt:  NQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRM

Query:  HSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDVFAYT
        H DANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKT HLDLAL+VY DFQ+NGLVEESITFMILIKGLCKAGRV EMLELL RMRA LCKPDVFAYT
Subjt:  HSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDVFAYT

Query:  AMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSGYRAD
        AMVKV V++ENLEGCLRVWDEMRAD VEPDVMAYGTLI+GLCK GRAQKG+ELFQEMK KRILIDR IYG+LIEAFVQDEKVGLACDL KDLVDSGYRAD
Subjt:  AMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSGYRAD

Query:  LGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHGLNAK
        LGIY+SLIKGLCN+NQV KAYKLFQLTIREDLKPDFETVKP++MM VE  RM+DLWKL++L+QKLE S+DD+LSKF+ FMVE+EDKIS ALD+FHG+  K
Subjt:  LGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHGLNAK

Query:  GYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRD
        GYGS+ALYN+++GALH++GQ  KALEIYNDM++SN EPDS+TYSIAV CFVE+GKI+EACASHNKI+ELGSVPS+AAY SL+EGLFKICEIDAV+MLVRD
Subjt:  GYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRD

Query:  CLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKKKTAD
        CLANVESGP EFKYALTI+H CKSGK E VI+VL EM+ QDC PS+VAYSAIISGMSKYGT + AKKVFLHL+E S LTEANCI+ EELLIEHMKKKTAD
Subjt:  CLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKKKTAD

Query:  L
        L
Subjt:  L

A0A6J1CY62 pentatricopeptide repeat-containing protein At4g207400.0e+0086.42Show/hide
Query:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSP--SEAFFSSSLRLSPIARFIVD
        PPQIPTKPHK YFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPP+PHKPISPKSQPFDL KWDPD S   SKLT PP P  SEAFFSSS+RLSPIARFI+D
Subjt:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSP--SEAFFSSSLRLSPIARFIVD

Query:  AFRKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQF
        AFRKNQN+WG+ VISELNKLRRVTPDLVAE+LKA  R DSNPILAS FFHWAGKQKGFHHNYASYNAFAY LNRHNRFRAADQIPELMDSQGKPPSEKQF
Subjt:  AFRKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQF

Query:  EILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKP
        EILIRMHSD NRGLRVYYVYEKMKKFGV+PRVFLYNRILDALVKTGHLDLALSVYGDFQ++GLVEESITFMILIKGLCKAGRV EMLELL RMRANLCKP
Subjt:  EILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKP

Query:  DVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVD
        DVFAYTAMVKVLV+EEN+EGCLRVWDEMRADGVEPDVMAY TLI GLCK G+A KG+ELFQ MKEKRILI RTIYGSLI+AFVQDEKVGLACDL KDLVD
Subjt:  DVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVD

Query:  SGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIF
        SGYRADLG+YNSLIKGLCN+NQVDKAYKLFQLTIREDLKP+FETVKP+L M V+++RMEDLW LLSL++KLELS+D +LS+F+SFMVEKEDKIS AL++F
Subjt:  SGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIF

Query:  HGLNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAV
         GLN KGYGS+A+YNI+IGALHQHGQ KKALEIY+DM+SSNF+PDSSTYSI VSC VE+GKIQEAC  HN+IVELGSVPS++AY SLAEGLFK CEIDAV
Subjt:  HGLNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAV

Query:  LMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHM
        LMLVRDCLAN+ESGP EFKYALTI+HVCKSGK ETVI+VL+EM+Q+DCPPS V YSAII GMSKYGT+D  KKVFLHLKES HLTEANCIM EELLI+HM
Subjt:  LMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHM

Query:  KKKTADL
        KKKTADL
Subjt:  KKKTADL

A0A6J1FN51 pentatricopeptide repeat-containing protein At4g207400.0e+0084.11Show/hide
Query:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF
        PPQIP+KP+K YF+YGHRHRNPKQHRPTVYGGLF+NRQSL+PPNP KPI PK QPF L+ WDPDR P +      PSPSEAFFSS+LRLSPIARFIVDAF
Subjt:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF

Query:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI
        RKNQNQWG PVISELNKLRRVTPDLVAE+LKAS R DSNPILASKFF+WAGKQKGFHHN+ASYNAFAY LNRHNRFRAADQIPELMDSQGKPPSEKQFEI
Subjt:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI

Query:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV
        LIRMH DANRGLRVYYVYEKMKKFGV+PRVFLYNRILDALVKTG++DLALSVY DFQ+NGLVEE+ITFMILIKGLCKAGRV EMLELLGRMRAN CKPDV
Subjt:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV

Query:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG
        FAYTAMVKVLV+EENLEGCLRVWDEM AD VEPDVMAYGTLI+GLCK GR +K FELFQEMK KRILIDR IYGSLIEAFVQDEKVGLA DLLKDLVDSG
Subjt:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG

Query:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG
        YRADL IY+SLIKGLCNVNQVD+AYKLF++TIREDLKPDF TVKP+++  VETRRMEDLWKLLSL+QKLELSMDD+LSKF+S MVE+EDKIS ALD+F G
Subjt:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG

Query:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM
        L  KGYGS+A+YNI+IGALH+HGQ KKALEIYNDM++SN +PD STYSI VSC+VEV +IQEACASHNKIVELGSVPSVAAY SL+EGLFKICEIDAV+M
Subjt:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM

Query:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK
        LVRDCLAN ESGP EFKYALTIL  CKSGK E VI+VL+EM+QQ CP SAV YSAI+SGM KYGT+D AKK FLHLKES  + EANCI+ EE+L+EHMKK
Subjt:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK

Query:  KTADL
        KTADL
Subjt:  KTADL

A0A6J1K1S7 pentatricopeptide repeat-containing protein At4g207400.0e+0083.83Show/hide
Query:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF
        PPQIP+KP+K YF+YGHRHRNPKQHRPTVYGGLF+NRQSL+PPNP KPI+PK QPF L+ WDPDR P +      PSPSEAFFSS+LRLSPIARFIVDAF
Subjt:  PPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAF

Query:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI
        RKNQNQWG PVISELNKLRRVTPDLVAE+LKAS R DSNPILASKFF+WAGKQKGFHHN+ASYNAFAY LNRHNRFRAADQIPELMDSQGKPPSEKQFEI
Subjt:  RKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEI

Query:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV
        LIRMH DANRGLRVYYVYEKMKKFGV+PRVFLYNRILDALVKTG++DLALSVY DFQ+NGLVEE+ITFMILIKGLCKAGRV EMLE LGRMRAN CKPDV
Subjt:  LIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDV

Query:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG
        FAYTAMVKVL++EENLEGCLRVWDEMRAD VEPDVMAYGTLI GLCK G  QK FELFQEMK KRILIDR IYGSLIEAFVQDEKVGLA DLLKDL+DSG
Subjt:  FAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSG

Query:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG
        YRADLGIY+SLIKGLCN NQVD+AYKLF++TIREDLKPDF TVKP+++  VETRRMEDLWKLLSL+QKLELSMDD+L K +S MVE+EDKIS ALD+F G
Subjt:  YRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHG

Query:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM
        L  KGYGS+A+YNI+IGALH+HGQ KKALEIYNDM+SSN EPD STYSI VSC+VEV +IQEACASHNKIVELGSVPS+AAY SL+EGLFKICEIDAV+M
Subjt:  LNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLM

Query:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK
        LVRDCLAN ESGP EFKYALTIL  CKSGK E VI+VL+EM+QQ CP SAVAYSAI+SGM KYGT++ AK VFLHLKES  + EANCI+ EELL+EHMKK
Subjt:  LVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKK

Query:  KTADL
        KTADL
Subjt:  KTADL

SwissProt top hitse value%identityAlignment
Q9FJE6 Putative pentatricopeptide repeat-containing protein At5g599005.4e-4324.3Show/hide
Query:  VDAFR---KNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPP
        VDA +   + +  W   + SEL   RR+    V EIL  +     +P L  +FF++ G  +GF H+ AS+    + L + N F  A  + + +  +   P
Subjt:  VDAFR---KNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPP

Query:  SE-----------------KQFEILIRMHSDANRGLRVYYVYEKM-KKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGL
        S+                   F++LI+ +  + R L    V++ M  K  ++P V   + +L  LVK  H  LA+ ++ D    G+  +   +  +I+ L
Subjt:  SE-----------------KQFEILIRMHSDANRGLRVYYVYEKM-KKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGL

Query:  CKAGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGS
        C+   +    E++  M A  C  ++  Y  ++  L  ++ +   + +  ++    ++PDV+ Y TL+ GLCK    + G E+  EM   R         S
Subjt:  CKAGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGS

Query:  LIEAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMD-
        L+E   +  K+  A +L+K +VD G   +L +YN+LI  LC   +  +A  LF    +  L+P+  T   L+ M     +++     L  M    L +  
Subjt:  LIEAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMD-

Query:  --------------DILSK--FVSFMVEK------------------EDKISAALDIFHGLNAKGYG-SIALYNIMIGALHQHGQGKKALEIYNDMRSSN
                      DI +   F++ M+ K                  + KI+ AL ++H +  KG   SI  +  ++  L + G  + A++++N+M   N
Subjt:  --------------DILSK--FVSFMVEK------------------EDKISAALDIFHGLNAKGYG-SIALYNIMIGALHQHGQGKKALEIYNDMRSSN

Query:  FEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLD
         +P+  TY++ +  + E G + +A     ++ E G VP   +Y  L  GL    +     + V D L       +E  Y   +   C+ GK+E  ++V  
Subjt:  FEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLD

Query:  EMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSH
        EM+Q+      V Y  +I G  K+      +K+F  L +  H
Subjt:  EMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSH

Q9LQ14 Pentatricopeptide repeat-containing protein At1g62930, chloroplastic7.1e-4324.51Show/hide
Query:  AADQIPELMDSQGKPPSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCK
        A D   E++ S+   PS  +F  L+   +  N+   V  + E+M+   +   ++ YN +++   +   L LAL+V G   + G   + +T   L+ G C 
Subjt:  AADQIPELMDSQGKPPSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCK

Query:  AGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLI
          R+ E + L+ +M     +P+   +  ++  L         + + D M A G +PD+  YGT++ GLCK G       L ++M++ +I  D  IY ++I
Subjt:  AGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLI

Query:  EAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDIL
        +A    + V  A +L  ++ + G R ++  YNSLI+ LCN  +   A +L    I   + P+  T   L+   V+  ++ +  KL   M K  +  D   
Subjt:  EAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDIL

Query:  SKFVSFMVEKEDKISAALDIFHGLNAKG-YGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSV
           +       D++  A  +F  + +K  + ++  YN +I    +  + ++ +E++ +M       ++ TY+  +    + G    A     K+V  G  
Subjt:  SKFVSFMVEKEDKISAALDIFHGLNAKG-YGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSV

Query:  PSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHL
        P +  YS L +GL K  +++  L +V + L   +  P  + Y + I  +CK+GKVE   ++   +  +   P+ + Y+ +ISG  + G  + A  +F  +
Subjt:  PSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHL

Query:  KESSHL
        KE   L
Subjt:  KESSHL

Q9SVH3 Pentatricopeptide repeat-containing protein At4g207402.1e-24460.39Show/hide
Query:  MPPQSPPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLS--PPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIA
        M    PP +  K  K  F++G  HR P Q+RPTVYGGLFSNRQS+    P P         PFDL+KWDP+   T     SPPS S    ++S RLSPIA
Subjt:  MPPQSPPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLS--PPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIA

Query:  RFIVDAFRKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPP
        RF++DAFRKN+N WG  V+SELNKLRRVTP +VAE+LK      ++  +A+KFFHWAGKQKG+ H++A+YNAFAY LNR+  FRAADQ+PELMDSQG+PP
Subjt:  RFIVDAFRKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPP

Query:  SEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRA
        SEKQFEILIRMH+D  RGLRVYYVYEKMKKFG  PRVFLYNRI+DALVK G+ DLAL+VY DF+++GLVEES TFMIL+KGLCKAGR+ EMLE+L RMR 
Subjt:  SEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRA

Query:  NLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLL
        NLCKPDVFAYTAM+K LV+E NL+  LRVWDEMR D ++PDVMAYGTL++GLCK GR ++G+ELF EMK K+ILIDR IY  LIE FV D KV  AC+L 
Subjt:  NLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLL

Query:  KDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISA
        +DLVDSGY AD+GIYN++IKGLC+VNQVDKAYKLFQ+ I E+L+PDFET+ P+++  V   R+ D   +L  + +L   + D L++F   +   E+K + 
Subjt:  KDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISA

Query:  ALDIFHGLNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKIC
        ALD+F+ L  KG+GS+++YNI++ AL++ G  +K+L ++ +MR   FEPDSS+YSIA+ CFVE G ++ AC+ H KI+E+  VPS+AAY SL +GL +I 
Subjt:  ALDIFHGLNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKIC

Query:  EIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEEL
        EIDAV++LVR+CL NVESGP EFKYALT+ HVCK    E V+ V+DEM Q+    + V Y AIISGMSK+GT+ VA++VF  LK+   +TEA+ +++EE+
Subjt:  EIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEEL

Query:  LIEHMKKKTADL
        LIE  KKKTADL
Subjt:  LIEHMKKKTADL

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial8.4e-4423.79Show/hide
Query:  PSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMR
        PS  +F  L+   +  N+   V  + E+M+  G+    + Y+ +++   +   L LAL+V G   + G     +T   L+ G C + R+ E + L+ +M 
Subjt:  PSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMR

Query:  ANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDL
            +P+   +  ++  L         + + D M A G +PD++ YG ++ GLCK G     F L  +M++ ++     IY ++I+   + + +  A +L
Subjt:  ANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDL

Query:  LKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKIS
         K++   G R ++  Y+SLI  LCN  +   A +L    I   + PD  T   L+   V+  ++ +  KL   M K  +    +    +       D++ 
Subjt:  LKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKIS

Query:  AALDIFHGLNAKG-YGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFK
         A  +F  + +K  +  +  YN +I    ++ + ++ +E++ +M       ++ TY+I +    + G    A     ++V  G  P++  Y++L +GL K
Subjt:  AALDIFHGLNAKG-YGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFK

Query:  ICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANC
          +++   M+V + L   +  P+ + Y + I  +CK+GKVE   ++   +  +   P  VAY+ +ISG  + G+ + A  +F  +KE   L  + C
Subjt:  ICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANC

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic1.3e-4424.38Show/hide
Query:  KQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALS
        + +G   N  +YN     L R +R   A ++   M+S G  P+   + + I  +  +   +     +EKMK  G+ P +   N  L +L K G    A  
Subjt:  KQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALS

Query:  VYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRA
        ++   +  GLV +S+T+ +++K   K G + E ++LL  M  N C+PDV    +++  L   + ++   +++  M+   ++P V+ Y TL+ GL K G+ 
Subjt:  VYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRA

Query:  QKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLV
        Q+  ELF+ M +K    +   + +L +   ++++V LA  +L  ++D G   D+  YN++I GL    QV +A   F   +++ + PDF T+  LL  +V
Subjt:  QKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLV

Query:  ETRRMEDLWKLLSLM------QKLELSMDDILSKF---------VSFMVE----------------------KEDKISAALDIFHGLNAKGYG---SIAL
        +   +ED +K+++        Q   L  +D++            VSF                         K + +S A  +F     K  G    +  
Subjt:  ETRRMEDLWKLLSLM------QKLELSMDDILSKF---------VSFMVE----------------------KEDKISAALDIFHGLNAKGYG---SIAL

Query:  YNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVES
        YN++IG L +    + A +++  ++S+   PD +TY+  +  + + GKI E    + ++       +   ++ +  GL K   +D  L L  D +++ + 
Subjt:  YNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVES

Query:  GPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVF
         P+   Y   I  + KSG++     + + M+   C P+   Y+ +I+G  K G  D A  +F
Subjt:  GPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVF

Arabidopsis top hitse value%identityAlignment
AT1G62670.1 rna processing factor 25.9e-4523.79Show/hide
Query:  PSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMR
        PS  +F  L+   +  N+   V  + E+M+  G+    + Y+ +++   +   L LAL+V G   + G     +T   L+ G C + R+ E + L+ +M 
Subjt:  PSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMR

Query:  ANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDL
            +P+   +  ++  L         + + D M A G +PD++ YG ++ GLCK G     F L  +M++ ++     IY ++I+   + + +  A +L
Subjt:  ANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDL

Query:  LKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKIS
         K++   G R ++  Y+SLI  LCN  +   A +L    I   + PD  T   L+   V+  ++ +  KL   M K  +    +    +       D++ 
Subjt:  LKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKIS

Query:  AALDIFHGLNAKG-YGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFK
         A  +F  + +K  +  +  YN +I    ++ + ++ +E++ +M       ++ TY+I +    + G    A     ++V  G  P++  Y++L +GL K
Subjt:  AALDIFHGLNAKG-YGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFK

Query:  ICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANC
          +++   M+V + L   +  P+ + Y + I  +CK+GKVE   ++   +  +   P  VAY+ +ISG  + G+ + A  +F  +KE   L  + C
Subjt:  ICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANC

AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.0e-4424.51Show/hide
Query:  AADQIPELMDSQGKPPSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCK
        A D   E++ S+   PS  +F  L+   +  N+   V  + E+M+   +   ++ YN +++   +   L LAL+V G   + G   + +T   L+ G C 
Subjt:  AADQIPELMDSQGKPPSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCK

Query:  AGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLI
          R+ E + L+ +M     +P+   +  ++  L         + + D M A G +PD+  YGT++ GLCK G       L ++M++ +I  D  IY ++I
Subjt:  AGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLI

Query:  EAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDIL
        +A    + V  A +L  ++ + G R ++  YNSLI+ LCN  +   A +L    I   + P+  T   L+   V+  ++ +  KL   M K  +  D   
Subjt:  EAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDIL

Query:  SKFVSFMVEKEDKISAALDIFHGLNAKG-YGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSV
           +       D++  A  +F  + +K  + ++  YN +I    +  + ++ +E++ +M       ++ TY+  +    + G    A     K+V  G  
Subjt:  SKFVSFMVEKEDKISAALDIFHGLNAKG-YGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSV

Query:  PSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHL
        P +  YS L +GL K  +++  L +V + L   +  P  + Y + I  +CK+GKVE   ++   +  +   P+ + Y+ +ISG  + G  + A  +F  +
Subjt:  PSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHL

Query:  KESSHL
        KE   L
Subjt:  KESSHL

AT4G20740.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.5e-24560.39Show/hide
Query:  MPPQSPPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLS--PPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIA
        M    PP +  K  K  F++G  HR P Q+RPTVYGGLFSNRQS+    P P         PFDL+KWDP+   T     SPPS S    ++S RLSPIA
Subjt:  MPPQSPPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLS--PPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIA

Query:  RFIVDAFRKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPP
        RF++DAFRKN+N WG  V+SELNKLRRVTP +VAE+LK      ++  +A+KFFHWAGKQKG+ H++A+YNAFAY LNR+  FRAADQ+PELMDSQG+PP
Subjt:  RFIVDAFRKNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPP

Query:  SEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRA
        SEKQFEILIRMH+D  RGLRVYYVYEKMKKFG  PRVFLYNRI+DALVK G+ DLAL+VY DF+++GLVEES TFMIL+KGLCKAGR+ EMLE+L RMR 
Subjt:  SEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRA

Query:  NLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLL
        NLCKPDVFAYTAM+K LV+E NL+  LRVWDEMR D ++PDVMAYGTL++GLCK GR ++G+ELF EMK K+ILIDR IY  LIE FV D KV  AC+L 
Subjt:  NLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLL

Query:  KDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISA
        +DLVDSGY AD+GIYN++IKGLC+VNQVDKAYKLFQ+ I E+L+PDFET+ P+++  V   R+ D   +L  + +L   + D L++F   +   E+K + 
Subjt:  KDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISA

Query:  ALDIFHGLNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKIC
        ALD+F+ L  KG+GS+++YNI++ AL++ G  +K+L ++ +MR   FEPDSS+YSIA+ CFVE G ++ AC+ H KI+E+  VPS+AAY SL +GL +I 
Subjt:  ALDIFHGLNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKIC

Query:  EIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEEL
        EIDAV++LVR+CL NVESGP EFKYALT+ HVCK    E V+ V+DEM Q+    + V Y AIISGMSK+GT+ VA++VF  LK+   +TEA+ +++EE+
Subjt:  EIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEEL

Query:  LIEHMKKKTADL
        LIE  KKKTADL
Subjt:  LIEHMKKKTADL

AT4G31850.1 proton gradient regulation 39.2e-4624.38Show/hide
Query:  KQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALS
        + +G   N  +YN     L R +R   A ++   M+S G  P+   + + I  +  +   +     +EKMK  G+ P +   N  L +L K G    A  
Subjt:  KQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRMHSDANRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALS

Query:  VYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRA
        ++   +  GLV +S+T+ +++K   K G + E ++LL  M  N C+PDV    +++  L   + ++   +++  M+   ++P V+ Y TL+ GL K G+ 
Subjt:  VYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRA

Query:  QKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLV
        Q+  ELF+ M +K    +   + +L +   ++++V LA  +L  ++D G   D+  YN++I GL    QV +A   F   +++ + PDF T+  LL  +V
Subjt:  QKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLV

Query:  ETRRMEDLWKLLSLM------QKLELSMDDILSKF---------VSFMVE----------------------KEDKISAALDIFHGLNAKGYG---SIAL
        +   +ED +K+++        Q   L  +D++            VSF                         K + +S A  +F     K  G    +  
Subjt:  ETRRMEDLWKLLSLM------QKLELSMDDILSKF---------VSFMVE----------------------KEDKISAALDIFHGLNAKGYG---SIAL

Query:  YNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVES
        YN++IG L +    + A +++  ++S+   PD +TY+  +  + + GKI E    + ++       +   ++ +  GL K   +D  L L  D +++ + 
Subjt:  YNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVES

Query:  GPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVF
         P+   Y   I  + KSG++     + + M+   C P+   Y+ +I+G  K G  D A  +F
Subjt:  GPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVF

AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein3.8e-4424.3Show/hide
Query:  VDAFR---KNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPP
        VDA +   + +  W   + SEL   RR+    V EIL  +     +P L  +FF++ G  +GF H+ AS+    + L + N F  A  + + +  +   P
Subjt:  VDAFR---KNQNQWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPP

Query:  SE-----------------KQFEILIRMHSDANRGLRVYYVYEKM-KKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGL
        S+                   F++LI+ +  + R L    V++ M  K  ++P V   + +L  LVK  H  LA+ ++ D    G+  +   +  +I+ L
Subjt:  SE-----------------KQFEILIRMHSDANRGLRVYYVYEKM-KKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGL

Query:  CKAGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGS
        C+   +    E++  M A  C  ++  Y  ++  L  ++ +   + +  ++    ++PDV+ Y TL+ GLCK    + G E+  EM   R         S
Subjt:  CKAGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDEMRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGS

Query:  LIEAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMD-
        L+E   +  K+  A +L+K +VD G   +L +YN+LI  LC   +  +A  LF    +  L+P+  T   L+ M     +++     L  M    L +  
Subjt:  LIEAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIREDLKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMD-

Query:  --------------DILSK--FVSFMVEK------------------EDKISAALDIFHGLNAKGYG-SIALYNIMIGALHQHGQGKKALEIYNDMRSSN
                      DI +   F++ M+ K                  + KI+ AL ++H +  KG   SI  +  ++  L + G  + A++++N+M   N
Subjt:  --------------DILSK--FVSFMVEK------------------EDKISAALDIFHGLNAKGYG-SIALYNIMIGALHQHGQGKKALEIYNDMRSSN

Query:  FEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLD
         +P+  TY++ +  + E G + +A     ++ E G VP   +Y  L  GL    +     + V D L       +E  Y   +   C+ GK+E  ++V  
Subjt:  FEPDSSTYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLD

Query:  EMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSH
        EM+Q+      V Y  +I G  K+      +K+F  L +  H
Subjt:  EMMQQDCPPSAVAYSAIISGMSKYGTLDVAKKVFLHLKESSH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCCTCAATCCCCACCTCAAATCCCAACCAAACCCCACAAGTTATACTTCTACTACGGCCACCGCCACCGCAACCCCAAGCAGCACCGCCCCACCGTCTATGGTGG
CCTCTTCTCCAACCGCCAATCCCTCTCTCCACCCAACCCCCACAAACCCATTTCCCCGAAATCCCAACCGTTCGATCTTCAGAAGTGGGATCCTGATCGCTCACCTACTC
AGTCCAAGCTGACGTCGCCGCCGTCACCCTCGGAGGCCTTCTTCTCCAGCTCCCTCCGCCTTTCGCCCATTGCTCGCTTCATCGTCGACGCCTTCCGTAAGAATCAGAAC
CAGTGGGGCGCACCGGTGATCTCTGAACTCAACAAGCTCCGCCGAGTTACTCCGGACCTTGTGGCGGAGATTCTCAAGGCTTCTCAGCGGCATGATTCTAACCCAATCTT
AGCCTCCAAGTTCTTCCACTGGGCAGGGAAGCAAAAGGGCTTTCATCATAATTACGCCTCCTACAATGCTTTTGCTTATTTCTTGAATCGCCACAATCGCTTCAGGGCCG
CTGATCAAATCCCTGAGCTCATGGATTCACAAGGTAAGCCTCCAAGCGAAAAACAATTTGAGATTCTGATTAGGATGCACTCTGATGCCAATAGGGGTCTCAGAGTTTAC
TACGTATATGAAAAAATGAAGAAATTTGGGGTTGTTCCCCGCGTCTTCTTGTATAACAGGATTCTTGATGCCTTAGTCAAAACAGGTCATTTAGATTTAGCTTTATCTGT
TTATGGGGACTTCCAGCAAAATGGGTTGGTGGAAGAAAGCATCACTTTTATGATTTTGATTAAAGGGTTGTGCAAGGCAGGGAGGGTAGGGGAAATGCTGGAGCTTTTGG
GTCGGATGAGGGCGAATTTGTGTAAGCCAGATGTGTTTGCTTATACAGCAATGGTGAAGGTGTTGGTTGCTGAGGAGAATTTGGAGGGATGTTTAAGAGTTTGGGATGAA
ATGAGAGCAGATGGAGTAGAGCCTGATGTCATGGCATATGGAACTCTGATTATGGGATTGTGCAAAACCGGGCGGGCGCAAAAAGGGTTTGAATTGTTTCAGGAGATGAA
AGAGAAGAGGATTTTGATAGATAGAACAATTTATGGGTCCTTGATTGAGGCATTTGTGCAGGATGAGAAAGTTGGATTGGCTTGTGATTTGTTAAAGGATTTGGTAGATT
CAGGGTATAGAGCTGATTTGGGGATATATAATTCTCTCATTAAAGGTCTTTGTAATGTAAATCAAGTTGATAAGGCTTACAAACTCTTTCAACTAACCATACGAGAGGAT
CTTAAGCCAGATTTTGAAACCGTGAAGCCTTTGTTGATGATGTTGGTGGAGACGAGAAGAATGGAAGACTTGTGGAAGTTATTATCCTTGATGCAGAAGTTGGAACTTTC
CATGGATGATATTCTTTCAAAATTTGTCTCTTTTATGGTAGAAAAGGAGGACAAAATAAGCGCGGCTCTAGATATATTTCACGGCTTGAATGCGAAGGGCTATGGCAGCA
TTGCCTTGTACAATATCATGATAGGGGCTCTTCATCAGCATGGGCAGGGAAAGAAGGCATTAGAGATCTACAATGACATGAGGAGCTCGAATTTTGAACCAGACTCGTCA
ACTTACTCGATTGCAGTTTCGTGCTTCGTTGAAGTAGGGAAAATCCAAGAGGCTTGTGCATCTCATAACAAAATAGTTGAGTTGGGTTCGGTTCCTTCCGTTGCTGCCTA
CAGTTCTCTTGCCGAGGGGCTCTTTAAAATCTGCGAGATCGATGCAGTTTTGATGCTTGTTCGAGACTGCCTAGCGAATGTCGAGAGTGGACCTTCAGAGTTTAAATATG
CTCTTACAATTCTACATGTATGTAAATCTGGTAAAGTGGAAACGGTGATCAATGTTCTTGATGAGATGATGCAACAGGATTGCCCTCCGAGCGCGGTTGCCTACTCGGCT
ATCATATCTGGAATGTCCAAGTATGGGACACTTGATGTGGCAAAGAAAGTGTTTTTGCATCTGAAGGAGAGCAGCCATTTGACAGAAGCTAACTGCATCATGTTTGAGGA
GTTGTTAATTGAACACATGAAAAAGAAGACAGCAGACTTGCAGAAGAAATCTCAGTTCTCAGAACATGGATTGAAGAACAAGAACAAGCTGCAAGCCCCAGCTGTCTGCA
ACGCATATATAAGAAGCCAATTGTCCGAAAATTTGAAAAAGGCACTTAATTCTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCCCTCAATCCCCACCTCAAATCCCAACCAAACCCCACAAGTTATACTTCTACTACGGCCACCGCCACCGCAACCCCAAGCAGCACCGCCCCACCGTCTATGGTGG
CCTCTTCTCCAACCGCCAATCCCTCTCTCCACCCAACCCCCACAAACCCATTTCCCCGAAATCCCAACCGTTCGATCTTCAGAAGTGGGATCCTGATCGCTCACCTACTC
AGTCCAAGCTGACGTCGCCGCCGTCACCCTCGGAGGCCTTCTTCTCCAGCTCCCTCCGCCTTTCGCCCATTGCTCGCTTCATCGTCGACGCCTTCCGTAAGAATCAGAAC
CAGTGGGGCGCACCGGTGATCTCTGAACTCAACAAGCTCCGCCGAGTTACTCCGGACCTTGTGGCGGAGATTCTCAAGGCTTCTCAGCGGCATGATTCTAACCCAATCTT
AGCCTCCAAGTTCTTCCACTGGGCAGGGAAGCAAAAGGGCTTTCATCATAATTACGCCTCCTACAATGCTTTTGCTTATTTCTTGAATCGCCACAATCGCTTCAGGGCCG
CTGATCAAATCCCTGAGCTCATGGATTCACAAGGTAAGCCTCCAAGCGAAAAACAATTTGAGATTCTGATTAGGATGCACTCTGATGCCAATAGGGGTCTCAGAGTTTAC
TACGTATATGAAAAAATGAAGAAATTTGGGGTTGTTCCCCGCGTCTTCTTGTATAACAGGATTCTTGATGCCTTAGTCAAAACAGGTCATTTAGATTTAGCTTTATCTGT
TTATGGGGACTTCCAGCAAAATGGGTTGGTGGAAGAAAGCATCACTTTTATGATTTTGATTAAAGGGTTGTGCAAGGCAGGGAGGGTAGGGGAAATGCTGGAGCTTTTGG
GTCGGATGAGGGCGAATTTGTGTAAGCCAGATGTGTTTGCTTATACAGCAATGGTGAAGGTGTTGGTTGCTGAGGAGAATTTGGAGGGATGTTTAAGAGTTTGGGATGAA
ATGAGAGCAGATGGAGTAGAGCCTGATGTCATGGCATATGGAACTCTGATTATGGGATTGTGCAAAACCGGGCGGGCGCAAAAAGGGTTTGAATTGTTTCAGGAGATGAA
AGAGAAGAGGATTTTGATAGATAGAACAATTTATGGGTCCTTGATTGAGGCATTTGTGCAGGATGAGAAAGTTGGATTGGCTTGTGATTTGTTAAAGGATTTGGTAGATT
CAGGGTATAGAGCTGATTTGGGGATATATAATTCTCTCATTAAAGGTCTTTGTAATGTAAATCAAGTTGATAAGGCTTACAAACTCTTTCAACTAACCATACGAGAGGAT
CTTAAGCCAGATTTTGAAACCGTGAAGCCTTTGTTGATGATGTTGGTGGAGACGAGAAGAATGGAAGACTTGTGGAAGTTATTATCCTTGATGCAGAAGTTGGAACTTTC
CATGGATGATATTCTTTCAAAATTTGTCTCTTTTATGGTAGAAAAGGAGGACAAAATAAGCGCGGCTCTAGATATATTTCACGGCTTGAATGCGAAGGGCTATGGCAGCA
TTGCCTTGTACAATATCATGATAGGGGCTCTTCATCAGCATGGGCAGGGAAAGAAGGCATTAGAGATCTACAATGACATGAGGAGCTCGAATTTTGAACCAGACTCGTCA
ACTTACTCGATTGCAGTTTCGTGCTTCGTTGAAGTAGGGAAAATCCAAGAGGCTTGTGCATCTCATAACAAAATAGTTGAGTTGGGTTCGGTTCCTTCCGTTGCTGCCTA
CAGTTCTCTTGCCGAGGGGCTCTTTAAAATCTGCGAGATCGATGCAGTTTTGATGCTTGTTCGAGACTGCCTAGCGAATGTCGAGAGTGGACCTTCAGAGTTTAAATATG
CTCTTACAATTCTACATGTATGTAAATCTGGTAAAGTGGAAACGGTGATCAATGTTCTTGATGAGATGATGCAACAGGATTGCCCTCCGAGCGCGGTTGCCTACTCGGCT
ATCATATCTGGAATGTCCAAGTATGGGACACTTGATGTGGCAAAGAAAGTGTTTTTGCATCTGAAGGAGAGCAGCCATTTGACAGAAGCTAACTGCATCATGTTTGAGGA
GTTGTTAATTGAACACATGAAAAAGAAGACAGCAGACTTGCAGAAGAAATCTCAGTTCTCAGAACATGGATTGAAGAACAAGAACAAGCTGCAAGCCCCAGCTGTCTGCA
ACGCATATATAAGAAGCCAATTGTCCGAAAATTTGAAAAAGGCACTTAATTCTTCTTGA
Protein sequenceShow/hide protein sequence
MPPQSPPQIPTKPHKLYFYYGHRHRNPKQHRPTVYGGLFSNRQSLSPPNPHKPISPKSQPFDLQKWDPDRSPTQSKLTSPPSPSEAFFSSSLRLSPIARFIVDAFRKNQN
QWGAPVISELNKLRRVTPDLVAEILKASQRHDSNPILASKFFHWAGKQKGFHHNYASYNAFAYFLNRHNRFRAADQIPELMDSQGKPPSEKQFEILIRMHSDANRGLRVY
YVYEKMKKFGVVPRVFLYNRILDALVKTGHLDLALSVYGDFQQNGLVEESITFMILIKGLCKAGRVGEMLELLGRMRANLCKPDVFAYTAMVKVLVAEENLEGCLRVWDE
MRADGVEPDVMAYGTLIMGLCKTGRAQKGFELFQEMKEKRILIDRTIYGSLIEAFVQDEKVGLACDLLKDLVDSGYRADLGIYNSLIKGLCNVNQVDKAYKLFQLTIRED
LKPDFETVKPLLMMLVETRRMEDLWKLLSLMQKLELSMDDILSKFVSFMVEKEDKISAALDIFHGLNAKGYGSIALYNIMIGALHQHGQGKKALEIYNDMRSSNFEPDSS
TYSIAVSCFVEVGKIQEACASHNKIVELGSVPSVAAYSSLAEGLFKICEIDAVLMLVRDCLANVESGPSEFKYALTILHVCKSGKVETVINVLDEMMQQDCPPSAVAYSA
IISGMSKYGTLDVAKKVFLHLKESSHLTEANCIMFEELLIEHMKKKTADLQKKSQFSEHGLKNKNKLQAPAVCNAYIRSQLSENLKKALNSS