; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G036330 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G036330
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCicolChr02:32119339..32127963
RNA-Seq ExpressionCcUC02G036330
SyntenyCcUC02G036330
Gene Ontology termsGO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012674 - Calycin
IPR014878 - THAP4-like, heme-binding beta-barrel domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035334.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0084.21Show/hide
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+ TDPIA+SNEVSTIIETVDP ED LE+I+PH+SSDVITSVI+EQ N RLGFRLFIWSLRR+ LCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ
         HDCFP+NHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIG+KP LFLRLSQGANK+L +  LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PDIRTYNILING CK N+I+G F LFK MQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS
        SIMTWSCR+KKVSLAFSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDFDLAPYTIFL+GLCQA RVSEAFAIFSVLKDF   +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS

Query:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT
        ASCVMLIG LC+EG+LDLAV+VFLYTLE G MLMPRICNQLL  LLHLEDRKDHA VLI RMEAFGYDMN  LH   K+
Subjt:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT

XP_022948073.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita moschata]0.0e+0084.34Show/hide
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+ TDPIATSNEVSTIIETVDP ED LE+I+PH+SSDVITSVI+EQ N RLGFRLFIWSLRR+ LCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNI+A+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ
         HDCFP+NHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIG+KPSLFLRLSQGANK+L +  LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PDIRTYNILING CK N+I+G FKLFK MQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS
        SIMTWSCR+KKVSL FSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDF+LAPYTIFLIGLCQA RVSEAFAIFSVLKDF   +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS

Query:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT
        ASCVMLIG LC+EG+LDLAV+VFLYTLE G MLMPRICNQLL  LLHLEDRKDHA VLI RMEAFGYDMN  LH   K+
Subjt:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT

XP_023007126.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita maxima]0.0e+0085.11Show/hide
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+PTDPIATSNEVSTIIETVDP ED LE I+PHISSDVITSVI+EQ N RLGFRLFIWSLRR+ LCC+ASQ+ II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM+EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ
         HDCFPDNHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIG+KPSLFLRL QGANKVL +  LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PDIRTYNILING CK N+I+G FKLFK MQLKGRLPDS+TYGTLIDGL+RVGRDEDALGIFEQMVKNGCKP+SS+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS
        SIMTWSCR+KKVSLAFSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDFDLAPYTIFLIGLCQA RVSEAFAIFSVLKDF   +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS

Query:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT
        ASCVMLIG LC+EG+LDLAV+VFLYTLE G MLMPRICNQLL R LHLEDRKDHA VLI RMEAFGYDMN  LH   K+
Subjt:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT

XP_023534570.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita pepo subsp. pepo]0.0e+0084.6Show/hide
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+PTD IATSNEVSTIIETVDP ED LE+I+PHISSDVITSVI+EQ N RLGFR+FIWSLRR+ LCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ
         HDCFPDNHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIG+KPSLFLRLSQGANKVL +  LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PDIRTYNILING CK N+I+G FKLFK MQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS
        SIMTWSCR+KKVSLAFSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDFDLAPYTIFLIGLCQA R SEAFAIFSVLKDF   +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS

Query:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT
        ASCVMLIG LC+EG+LDLAV+VFLYTLE G MLMPRICNQLL   LHLE+RKDHA VLI RMEAFGYDMN  LH   K+
Subjt:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT

XP_038901213.1 pentatricopeptide repeat-containing protein At1g79540 [Benincasa hispida]0.0e+0085.57Show/hide
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK+RP   RPII YVVPKPPWF  FHSPTDPIATSNEVSTIIETVD FEDGLEVISPHISSD+ITSVI+EQ NPRLGFRLFIWSLRRKRLCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDSAIEI SDAFSVLIEAY KAGM+EKAVESFGLMRDFDCKPN+FAFNLILH+LVRKEAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSK--TQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDG
        Y ILIHGFCKTSK  TQDAL LFDEMT RGILPNEITYSIVLSGLC+AKKI DAQRLFSKMRA G SPDV+TYNVLLNGFCKLGYL+EAF+LLQSFEKDG
Subjt:  YSILIHGFCKTSK--TQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDG

Query:  HILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLE
        HILGVNGYSCLINGLFRARRYDEAHMWYQK+LRENIKPDVILYTIMIQGLSQEGRVTDALALL EMTERG SPDTACYN LIKGFCD+G+LDKAQSLRLE
Subjt:  HILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLE

Query:  ISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMI
        IS H+CFPDNHTYSILICGMCKNGL+SEAQ +FNEMEKLGC+PS+VTFNSLIDGLCKAG+L+EA+LLF KMEIG+KPSLFLRLSQG NKVLD+ASLQVM+
Subjt:  ISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMI

Query:  QQLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSI
        +QLCESGL+LKAYKLLMQLVESGVLPDIRTYNILING CKNN+ING FKL K M+LKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVK GCKPDSSI
Subjt:  QQLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSI

Query:  YKSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNV
        YKSIMTW CRKK +SLAF+VWMKYLRNFRGWEDEKV +V ESF+KGEL+TTI RL++MDM SKDFDLAPYTIFLIGLCQA+RVSEAFAIFSVLKDF MN+
Subjt:  YKSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNV

Query:  SSASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKTQTRVK-TAIEMDGDLPPAP
        SSASCVMLIGRLC+  +LDLAVDVFLYTLEEG MLMPRICN+LLS LLH+ED+KDHALVL+++MEAFGYDMNT LH   K   R    +++    +P   
Subjt:  SSASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKTQTRVK-TAIEMDGDLPPAP

Query:  APLSIHPAVAP
          LS H   +P
Subjt:  APLSIHPAVAP

TrEMBL top hitse value%identityAlignment
A0A0A0KD52 Uncharacterized protein0.0e+0078.92Show/hide
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MKLRP   RPII +VVPKP  FH +HS T+PIATS EVSTIIET+DP EDGL+VIS  I S  ITSV++EQ + RLGFRLFIWSL+   L C   Q+ II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
         +L+K+NAFELYWK LQELK+SAI+I S+AFSVLIEAYS+AGMDEKAVESFGLMRDFDCKP++FAFNLILH LVRKEAFLLALAVYNQMLKCNLNP+VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        Y ILIHG CKT KTQDALVLFDEMT RGILPN+I YSIVLSGLCQAKKI DAQRLFSKMRA GC+ D+ITYNVLLNGFCK GYLD+AF+LLQ   KDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GY CLINGLFRARRY+EAHMWYQKMLRENIKPDV+LYTIMI+GLSQEGRVT+AL LL EMTERGL PDT CYNALIKGFCDMG+LD+A+SLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ
        KHDCFP+NHTYSILICGMCKNGLI++AQHIF EMEKLGC+PS+VTFNSLI+GLCKA +L+EA LLFY+MEI +KPSLFLRLSQG +KV D ASLQVM+++
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESG+ILKAYKLLMQLV+SGVLPDIRTYNILING CK  +ING FKLFK MQLKG +PDSVTYGTLIDGLYR GR+EDAL IFEQMVK GC P+SS YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS
        +IMTWSCR+  +SLA SVWMKYLR+FRGWEDEKV VVAESF+  EL+T I+RL+EMD+ SK+FDLAPYTIFLIGL QA+R  EAFAIFSVLKDF MN+SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS

Query:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREK
        ASCVMLIGRLCM   LD+A+DVFL+TLE GF LMP ICNQLL  LLHL DRKD AL L +RMEA GYD+   LH R K
Subjt:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREK

A0A5D3B9M5 Pentatricopeptide repeat-containing protein0.0e+0076.22Show/hide
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MKLRP+  RPII +VVPKPP F  +HS T+PI TS EVSTIIETVDP EDGL+VIS  I+S +ITSV+ +Q N  LGFRLFIWSL        A ++ II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        D+L+KDNAFELYWK LQELK+SAIEI SDAFSVLIEAYS+AGM+EKAVESFGLMRDFDCKPN+FAFNLIL  LVRKEAFLLALAVYNQMLKCNLNP+V T
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        Y ILIHGFC+T KTQDALVLFDEMT RGILPN+I Y+IVLSGLC+AKKI DAQRLFS M A     D+ TYNVLLNGFCKLGYLDEAF+LLQ   KDGH 
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        L V+GY CLINGLFRARRY+EAH WY+KMLRENIKPDVILYTIMIQGLSQEGRVT+A+ LL EM ERGL PDT CYNALIKGFCD+G+LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ
         H CFP NHTYSILICGMCK+GLI+EAQHIF EMEKLGC+PS+VTFNSLI+GLCKA +L+EA LLFY+MEI +KPSLFLRLSQG +KVLD ASLQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLILKAYKLLMQLV+SGVLPDIRTYNILING CK  +ING FKLFK MQ +G +PDSVTYGTLIDGLYRVGR+EDALGIF QM K GC PDSS Y+
Subjt:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS
        +IMTW CR+K + L  SVWMKYLRNFRGWEDEKV VV ESF+  EL+T I+RL+EMD+ SK+FD+APYTIFLIGLC+A+RVSEAFAIFSV KDF MN+SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS

Query:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREK
        ASCV LI  LC   +L+LAVDVFL+TLE  F +MP ICN+LL  LL L DRKD AL L +R+EA GYD+   L+ R K
Subjt:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREK

A0A6J1D6A9 pentatricopeptide repeat-containing protein At1g79540 isoform X10.0e+0082.28Show/hide
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK RP F+RPII  +VPKPPWFHL+HSPTDPIATSNEV TI+ETV+PFED LE I+PH+S DVITSVIEEQ NPRLGFRLFIWSL+ KRLCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLV+DNAFELYWKTLQELKDSA+ I SDAFSVLIEAYS AGMDEKAVESFGLM+DFDCKPNIF +NLIL+VLVRKEAF LAL+VYNQML+CN  PNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHG CKTSKTQDALVLFDEM  RGI PNEITYSIVLSGLCQA KIDDAQRLF KMRA GCSPD ITYNVLLNGFCK GY DEAF+LLQ+FEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGVN YSCLI+GLFRARRYDEA  WYQKMLRENIKPDVILYTIMIQGLSQEG++ DALALL EMTERG SPDT CYNALIKGFCDM  LDKA+SLRL IS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ
         HDC PDNHTYSILICGMC+NGLI EAQ++FNEMEKLGC+PS+ TFNSLIDGLCK G++ EA LLFYKMEIG+KPS+FLRL+QG NKVLD+A LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESG+ILKAYKLLMQL ESGVLPDIRTYNILING CK N ING FKLFK MQLKGRLPDSVTYGTLI+GL+RVGRD+DAL +F+QMVK GCKPDSS+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS
        +IMTWSCRKK VSLAFSVWMKYL NFRGW+DE V VV  SF+KGELE  I+RLIEMD  SKDFD +PYTIFLIGLCQA+RVSEAFAIFSVLKDF MN + 
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS

Query:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT
        ASCVMLIG LC+E +LDLA+DVFLYTLE GF+LMPRICNQLL  LL  EDRKDHALVLI RME FGYDM+  LH   K+
Subjt:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT

A0A6J1G8C6 pentatricopeptide repeat-containing protein At1g795400.0e+0084.34Show/hide
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+ TDPIATSNEVSTIIETVDP ED LE+I+PH+SSDVITSVI+EQ N RLGFRLFIWSLRR+ LCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNI+A+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ
         HDCFP+NHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIG+KPSLFLRLSQGANK+L +  LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PDIRTYNILING CK N+I+G FKLFK MQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS
        SIMTWSCR+KKVSL FSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDF+LAPYTIFLIGLCQA RVSEAFAIFSVLKDF   +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS

Query:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT
        ASCVMLIG LC+EG+LDLAV+VFLYTLE G MLMPRICNQLL  LLHLEDRKDHA VLI RMEAFGYDMN  LH   K+
Subjt:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT

A0A6J1KZN2 pentatricopeptide repeat-containing protein At1g795400.0e+0085.11Show/hide
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+PTDPIATSNEVSTIIETVDP ED LE I+PHISSDVITSVI+EQ N RLGFRLFIWSLRR+ LCC+ASQ+ II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM+EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ
         HDCFPDNHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIG+KPSLFLRL QGANKVL +  LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PDIRTYNILING CK N+I+G FKLFK MQLKGRLPDS+TYGTLIDGL+RVGRDEDALGIFEQMVKNGCKP+SS+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS
        SIMTWSCR+KKVSLAFSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDFDLAPYTIFLIGLCQA RVSEAFAIFSVLKDF   +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSS

Query:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT
        ASCVMLIG LC+EG+LDLAV+VFLYTLE G MLMPRICNQLL R LHLEDRKDHA VLI RMEAFGYDMN  LH   K+
Subjt:  ASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLLHDREKT

SwissProt top hitse value%identityAlignment
Q9FIX3 Pentatricopeptide repeat-containing protein At5g397102.8e-7527.46Show/hide
Query:  RLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVR-KEAFLLALAVYNQMLKCNLNPNVVT
        + + D    L +K+LQE  D      S  F +++++YS+  + +KA+    L +     P + ++N +L   +R K     A  V+ +ML+  ++PNV T
Subjt:  RLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVR-KEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        Y+ILI GFC       AL LFD+M  +G LPN +TY+ ++ G C+ +KIDD  +L   M   G  P++I+YNV++NG C+ G + E   +L    + G+ 
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        L    Y+ LI G  +   + +A + + +MLR  + P VI YT +I  + + G +  A+  LD+M  RGL P+   Y  L+ GF   G++++A  +  E++
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKM-EIGKKPSLFLRLSQGANKVLDSASLQVMIQ
         +   P   TY+ LI G C  G + +A  +  +M++ G  P +V++++++ G C++  + EA  +  +M E G KP              D+ +   +IQ
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKM-EIGKKPSLFLRLSQGANKVLDSASLQVMIQ

Query:  QLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIY
          CE     +A  L  +++  G+ PD  TY  LIN  C   D+    +L   M  KG LPD VTY  LI+GL +  R  +A  +  ++      P    Y
Subjt:  QLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIY

Query:  KSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDF--DLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMN
         ++                    + N    E + V  + + F    + T   ++ E  M+ K+   D   Y I + G C+A  + +A+ ++  +      
Subjt:  KSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDF--DLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMN

Query:  VSSASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGY
        + + + + L+  L  EG+++    V ++ L     L      ++L  + H E   D  L ++  M   G+
Subjt:  VSSASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGY

Q9LQ14 Pentatricopeptide repeat-containing protein At1g62930, chloroplastic8.0e-7531.77Show/hide
Query:  SQNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNL
        S+N ++D L  D+A +L+ + +Q     +I      F+ L+ A +K    +  +     M++     +++++N++++   R+    LALAV  +M+K   
Subjt:  SQNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNL

Query:  NPNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSF
         P++VT S L++G+C   +  +A+ L D+M      PN +T++ ++ GL    K  +A  L  +M A GC PD+ TY  ++NG CK G +D A SLL+  
Subjt:  NPNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSF

Query:  EKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQS
        EK      V  Y+ +I+ L   +  ++A   + +M  + I+P+V+ Y  +I+ L   GR +DA  LL +M ER ++P+   ++ALI  F   G L +A+ 
Subjt:  EKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQS

Query:  LRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGKKPSLFLRLSQG------
        L  E+ K    PD  TYS LI G C +  + EA+H+F  M    C P++VT+N+LI G CKA +++E   LF +M     +G   + +  L QG      
Subjt:  LRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGKKPSLFLRLSQG------

Query:  ---ANKVL----------DSASLQVMIQQLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGL
           A K+           D  +  +++  LC+ G + KA  +   L +S + PDI TYNI+I G+CK   +  G+ LF ++ LKG  P+ + Y T+I G 
Subjt:  ---ANKVL----------DSASLQVMIQQLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGL

Query:  YRVGRDEDALGIFEQMVKNGCKPDSSIYKSIM
         R G  E+A  +F +M ++G  P+S  Y +++
Subjt:  YRVGRDEDALGIFEQMVKNGCKPDSSIYKSIM

Q9LQ16 Pentatricopeptide repeat-containing protein At1g629102.3e-7431.26Show/hide
Query:  QNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLN
        +N + D +  D+A +L+   ++     +I      F+ L+ A +K    E  +     M+      +++ +++ ++   R+    LALAV  +M+K    
Subjt:  QNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLN

Query:  PNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFE
        P++VT S L++G+C + +  DA+ L D+M   G  P+  T++ ++ GL    K  +A  L  +M   GC PD++TY  ++NG CK G +D A SLL+  E
Subjt:  PNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFE

Query:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSL
        K      V  Y+ +I+GL + +  D+A   + +M  + I+PDV  Y+ +I  L   GR +DA  LL +M ER ++P+   ++ALI  F   G L +A+ L
Subjt:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSL

Query:  RLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGKKPSLFLRLSQGANKVLDS
          E+ K    PD  TYS LI G C +  + EA+H+F  M    C P++VT+++LI G CKA +++E   LF +M     +G   + +  L  G  +  D 
Subjt:  RLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGKKPSLFLRLSQGANKVLDS

Query:  ASLQVMIQQ-------------------LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLY
         + Q++ +Q                   LC++G + KA  +   L  S + PDI TYNI+I G+CK   +  G++LF  + LKG  P+ + Y T+I G  
Subjt:  ASLQVMIQQ-------------------LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLY

Query:  RVGRDEDALGIFEQMVKNGCKPDSSIYKSIM
        R G  E+A  + ++M ++G  P+S  Y +++
Subjt:  RVGRDEDALGIFEQMVKNGCKPDSSIYKSIM

Q9SAJ5 Pentatricopeptide repeat-containing protein At1g795402.7e-22450.59Show/hide
Query:  FLRPIIAYVVPKPPWF-HLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFIIDRLVK
        F R +I +   KP W    + S       S EV +I+    P E  LE + P +S ++ITSVI+++ N +LGFR FIW+ RR+RL    S   +ID L +
Subjt:  FLRPIIAYVVPKPPWF-HLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFIIDRLVK

Query:  DNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEA-FLLALAVYNQMLKCNLNPNVVTYSIL
        DN  +LYW+TL+ELK   + + S  F VLI AY+K GM EKAVESFG M++FDC+P++F +N+IL V++R+E  F+LA AVYN+MLKCN +PN+ T+ IL
Subjt:  DNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEA-FLLALAVYNQMLKCNLNPNVVTYSIL

Query:  IHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVN
        + G  K  +T DA  +FD+MT RGI PN +TY+I++SGLCQ    DDA++LF +M+  G  PD + +N LL+GFCKLG + EAF LL+ FEKDG +LG+ 
Subjt:  IHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVN

Query:  GYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDC
        GYS LI+GLFRARRY +A   Y  ML++NIKPD+ILYTI+IQGLS+ G++ DAL LL  M  +G+SPDT CYNA+IK  C  G L++ +SL+LE+S+ + 
Subjt:  GYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDC

Query:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQLCES
        FPD  T++ILIC MC+NGL+ EA+ IF E+EK GC PS+ TFN+LIDGLCK+G+L+EA LL +KME+G+  SLFLRLS   N+  D+         + ES
Subjt:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQLCES

Query:  GLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT
        G ILKAY+ L    ++G  PDI +YN+LING C+  DI+G  KL   +QLKG  PDSVTY TLI+GL+RVGR+E+A  +F    K+  +   ++Y+S+MT
Subjt:  GLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT

Query:  WSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSSASCV
        WSCRK+KV +AF++WMKYL+     +DE    + + F +GE E  ++RLIE+D    +  L PYTI+LIGLCQ+ R  EA  +FSVL++  + V+  SCV
Subjt:  WSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSSASCV

Query:  MLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLL
         LI  LC   +LD A++VFLYTL+  F LMPR+CN LLS LL   ++ +    L +RME  GY+++++L
Subjt:  MLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLL

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial4.7e-7533.19Show/hide
Query:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGIL
        FS L+ A +K    +  +     M++     N + ++++++   R+    LALAV  +M+K    PN+VT S L++G+C + +  +A+ L D+M   G  
Subjt:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGIL

Query:  PNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKML
        PN +T++ ++ GL    K  +A  L  +M A GC PD++TY V++NG CK G  D AF+LL   E+     GV  Y+ +I+GL + +  D+A   +++M 
Subjt:  PNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKML

Query:  RENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHI
         + I+P+V+ Y+ +I  L   GR +DA  LL +M ER ++PD   ++ALI  F   G L +A+ L  E+ K    P   TYS LI G C +  + EA+ +
Subjt:  RENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHI

Query:  FNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQLCESGLILKAYKLLMQLVESGVLPDIRTYN
        F  M    C P +VT+N+LI G CK  +++E       ME+      F  +SQ    V ++ +  ++IQ L ++G    A ++  ++V  GV P+I TYN
Subjt:  FNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQLCESGLILKAYKLLMQLVESGVLPDIRTYN

Query:  ILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRK
         L++GLCKN  +     +F+ +Q     P   TY  +I+G+ + G+ ED   +F  +   G KPD   Y ++++  CRK
Subjt:  ILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRK

Arabidopsis top hitse value%identityAlignment
AT1G62670.1 rna processing factor 23.4e-7633.19Show/hide
Query:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGIL
        FS L+ A +K    +  +     M++     N + ++++++   R+    LALAV  +M+K    PN+VT S L++G+C + +  +A+ L D+M   G  
Subjt:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGIL

Query:  PNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKML
        PN +T++ ++ GL    K  +A  L  +M A GC PD++TY V++NG CK G  D AF+LL   E+     GV  Y+ +I+GL + +  D+A   +++M 
Subjt:  PNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKML

Query:  RENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHI
         + I+P+V+ Y+ +I  L   GR +DA  LL +M ER ++PD   ++ALI  F   G L +A+ L  E+ K    P   TYS LI G C +  + EA+ +
Subjt:  RENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHI

Query:  FNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQLCESGLILKAYKLLMQLVESGVLPDIRTYN
        F  M    C P +VT+N+LI G CK  +++E       ME+      F  +SQ    V ++ +  ++IQ L ++G    A ++  ++V  GV P+I TYN
Subjt:  FNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQLCESGLILKAYKLLMQLVESGVLPDIRTYN

Query:  ILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRK
         L++GLCKN  +     +F+ +Q     P   TY  +I+G+ + G+ ED   +F  +   G KPD   Y ++++  CRK
Subjt:  ILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRK

AT1G62910.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-7531.26Show/hide
Query:  QNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLN
        +N + D +  D+A +L+   ++     +I      F+ L+ A +K    E  +     M+      +++ +++ ++   R+    LALAV  +M+K    
Subjt:  QNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLN

Query:  PNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFE
        P++VT S L++G+C + +  DA+ L D+M   G  P+  T++ ++ GL    K  +A  L  +M   GC PD++TY  ++NG CK G +D A SLL+  E
Subjt:  PNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFE

Query:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSL
        K      V  Y+ +I+GL + +  D+A   + +M  + I+PDV  Y+ +I  L   GR +DA  LL +M ER ++P+   ++ALI  F   G L +A+ L
Subjt:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSL

Query:  RLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGKKPSLFLRLSQGANKVLDS
          E+ K    PD  TYS LI G C +  + EA+H+F  M    C P++VT+++LI G CKA +++E   LF +M     +G   + +  L  G  +  D 
Subjt:  RLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGKKPSLFLRLSQGANKVLDS

Query:  ASLQVMIQQ-------------------LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLY
         + Q++ +Q                   LC++G + KA  +   L  S + PDI TYNI+I G+CK   +  G++LF  + LKG  P+ + Y T+I G  
Subjt:  ASLQVMIQQ-------------------LCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLY

Query:  RVGRDEDALGIFEQMVKNGCKPDSSIYKSIM
        R G  E+A  + ++M ++G  P+S  Y +++
Subjt:  RVGRDEDALGIFEQMVKNGCKPDSSIYKSIM

AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.7e-7631.77Show/hide
Query:  SQNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNL
        S+N ++D L  D+A +L+ + +Q     +I      F+ L+ A +K    +  +     M++     +++++N++++   R+    LALAV  +M+K   
Subjt:  SQNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNL

Query:  NPNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSF
         P++VT S L++G+C   +  +A+ L D+M      PN +T++ ++ GL    K  +A  L  +M A GC PD+ TY  ++NG CK G +D A SLL+  
Subjt:  NPNVVTYSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSF

Query:  EKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQS
        EK      V  Y+ +I+ L   +  ++A   + +M  + I+P+V+ Y  +I+ L   GR +DA  LL +M ER ++P+   ++ALI  F   G L +A+ 
Subjt:  EKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQS

Query:  LRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGKKPSLFLRLSQG------
        L  E+ K    PD  TYS LI G C +  + EA+H+F  M    C P++VT+N+LI G CKA +++E   LF +M     +G   + +  L QG      
Subjt:  LRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGKKPSLFLRLSQG------

Query:  ---ANKVL----------DSASLQVMIQQLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGL
           A K+           D  +  +++  LC+ G + KA  +   L +S + PDI TYNI+I G+CK   +  G+ LF ++ LKG  P+ + Y T+I G 
Subjt:  ---ANKVL----------DSASLQVMIQQLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGL

Query:  YRVGRDEDALGIFEQMVKNGCKPDSSIYKSIM
         R G  E+A  +F +M ++G  P+S  Y +++
Subjt:  YRVGRDEDALGIFEQMVKNGCKPDSSIYKSIM

AT1G79540.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-22550.59Show/hide
Query:  FLRPIIAYVVPKPPWF-HLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFIIDRLVK
        F R +I +   KP W    + S       S EV +I+    P E  LE + P +S ++ITSVI+++ N +LGFR FIW+ RR+RL    S   +ID L +
Subjt:  FLRPIIAYVVPKPPWF-HLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFIIDRLVK

Query:  DNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEA-FLLALAVYNQMLKCNLNPNVVTYSIL
        DN  +LYW+TL+ELK   + + S  F VLI AY+K GM EKAVESFG M++FDC+P++F +N+IL V++R+E  F+LA AVYN+MLKCN +PN+ T+ IL
Subjt:  DNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEA-FLLALAVYNQMLKCNLNPNVVTYSIL

Query:  IHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVN
        + G  K  +T DA  +FD+MT RGI PN +TY+I++SGLCQ    DDA++LF +M+  G  PD + +N LL+GFCKLG + EAF LL+ FEKDG +LG+ 
Subjt:  IHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVN

Query:  GYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDC
        GYS LI+GLFRARRY +A   Y  ML++NIKPD+ILYTI+IQGLS+ G++ DAL LL  M  +G+SPDT CYNA+IK  C  G L++ +SL+LE+S+ + 
Subjt:  GYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDC

Query:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQLCES
        FPD  T++ILIC MC+NGL+ EA+ IF E+EK GC PS+ TFN+LIDGLCK+G+L+EA LL +KME+G+  SLFLRLS   N+  D+         + ES
Subjt:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQLCES

Query:  GLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT
        G ILKAY+ L    ++G  PDI +YN+LING C+  DI+G  KL   +QLKG  PDSVTY TLI+GL+RVGR+E+A  +F    K+  +   ++Y+S+MT
Subjt:  GLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT

Query:  WSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSSASCV
        WSCRK+KV +AF++WMKYL+     +DE    + + F +GE E  ++RLIE+D    +  L PYTI+LIGLCQ+ R  EA  +FSVL++  + V+  SCV
Subjt:  WSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSSASCV

Query:  MLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLL
         LI  LC   +LD A++VFLYTL+  F LMPR+CN LLS LL   ++ +    L +RME  GY+++++L
Subjt:  MLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMNTLL

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-7627.46Show/hide
Query:  RLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVR-KEAFLLALAVYNQMLKCNLNPNVVT
        + + D    L +K+LQE  D      S  F +++++YS+  + +KA+    L +     P + ++N +L   +R K     A  V+ +ML+  ++PNV T
Subjt:  RLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVR-KEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        Y+ILI GFC       AL LFD+M  +G LPN +TY+ ++ G C+ +KIDD  +L   M   G  P++I+YNV++NG C+ G + E   +L    + G+ 
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        L    Y+ LI G  +   + +A + + +MLR  + P VI YT +I  + + G +  A+  LD+M  RGL P+   Y  L+ GF   G++++A  +  E++
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKM-EIGKKPSLFLRLSQGANKVLDSASLQVMIQ
         +   P   TY+ LI G C  G + +A  +  +M++ G  P +V++++++ G C++  + EA  +  +M E G KP              D+ +   +IQ
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKM-EIGKKPSLFLRLSQGANKVLDSASLQVMIQ

Query:  QLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIY
          CE     +A  L  +++  G+ PD  TY  LIN  C   D+    +L   M  KG LPD VTY  LI+GL +  R  +A  +  ++      P    Y
Subjt:  QLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIY

Query:  KSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDF--DLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMN
         ++                    + N    E + V  + + F    + T   ++ E  M+ K+   D   Y I + G C+A  + +A+ ++  +      
Subjt:  KSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDF--DLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMN

Query:  VSSASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGY
        + + + + L+  L  EG+++    V ++ L     L      ++L  + H E   D  L ++  M   G+
Subjt:  VSSASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTCCGACCAGATTTTCTTCGACCCATAATCGCCTATGTAGTCCCAAAACCTCCATGGTTCCATTTATTTCATTCGCCCACTGACCCAATCGCCACTTCCAATGA
GGTCTCCACCATAATCGAAACTGTTGATCCTTTCGAAGATGGATTGGAAGTCATATCGCCCCATATTTCGTCTGATGTAATTACTTCCGTCATTGAAGAACAATCGAATC
CCCGACTTGGATTTCGACTTTTTATCTGGTCGTTAAGGAGAAAGCGTCTCTGCTGCAATGCCTCGCAGAATTTTATCATCGACAGGTTAGTAAAGGACAATGCCTTCGAA
TTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCAGCAATTGAAATTCCATCGGATGCTTTCTCTGTGTTGATTGAGGCATACTCAAAAGCGGGCATGGATGAGAAGGC
CGTTGAATCATTTGGTCTGATGCGGGATTTTGACTGTAAGCCCAACATTTTTGCTTTCAATTTGATTTTGCATGTTTTGGTGCGAAAAGAAGCATTTTTGTTAGCTTTAG
CAGTGTATAATCAGATGCTGAAATGTAATTTGAATCCGAATGTGGTTACCTACAGCATATTGATTCATGGATTCTGTAAAACTAGTAAAACTCAAGATGCCCTTGTACTT
TTTGACGAAATGACCTATAGAGGAATATTGCCCAACGAGATAACTTATTCGATTGTTCTTTCTGGACTGTGTCAAGCTAAGAAAATTGATGATGCCCAGAGATTGTTCAG
TAAGATGAGAGCTATTGGGTGTAGTCCAGATGTAATAACTTATAATGTTTTGCTTAATGGATTTTGTAAGTTAGGTTATTTGGATGAAGCTTTTTCATTGTTGCAATCTT
TTGAAAAGGATGGCCATATTCTTGGAGTTAATGGGTATAGTTGTTTGATTAATGGCTTGTTTAGGGCTAGGAGATATGATGAAGCACATATGTGGTACCAGAAAATGTTG
AGGGAAAACATCAAGCCCGATGTTATCTTGTATACTATTATGATCCAAGGTTTATCACAAGAAGGTCGGGTTACTGATGCATTGGCACTGTTGGATGAGATGACAGAAAG
AGGGCTTAGTCCAGATACTGCTTGTTACAATGCTTTAATTAAAGGGTTTTGTGATATGGGTCATTTGGATAAGGCTCAGTCCCTTCGACTTGAGATTTCAAAACACGACT
GTTTCCCTGATAATCACACATACTCCATTCTCATTTGTGGTATGTGCAAGAATGGGCTAATAAGTGAGGCACAACATATATTTAATGAAATGGAGAAGCTTGGATGCGTT
CCTTCTATAGTGACCTTCAATTCTCTCATTGATGGACTTTGCAAAGCCGGTAAGCTTCAGGAAGCTTACCTATTATTTTACAAAATGGAGATAGGAAAAAAACCTTCTTT
GTTTCTTCGGCTTTCTCAGGGCGCCAATAAGGTTCTTGACAGTGCCAGTCTCCAAGTTATGATCCAGCAATTATGTGAGTCAGGATTGATTCTTAAGGCCTACAAGCTTC
TTATGCAGCTAGTTGAGAGTGGGGTTTTGCCAGATATTAGGACTTATAACATCCTAATCAATGGATTATGCAAGAATAACGATATTAATGGTGGTTTTAAGCTCTTCAAG
GCCATGCAACTCAAAGGACGCTTGCCAGATTCGGTTACATACGGGACTCTAATAGATGGGCTCTATAGAGTTGGTAGGGATGAGGATGCACTAGGAATTTTTGAACAAAT
GGTAAAGAATGGGTGCAAGCCTGATTCTTCTATTTACAAGTCCATCATGACTTGGTCGTGTCGAAAAAAGAAGGTTTCACTAGCTTTTAGTGTTTGGATGAAGTATCTGA
GGAATTTTCGTGGCTGGGAAGATGAAAAGGTAGCAGTAGTAGCGGAAAGTTTCAATAAAGGAGAGCTTGAGACAACAATTCAGAGATTAATTGAAATGGACATGATATCA
AAAGATTTCGACTTAGCTCCATACACCATTTTTCTCATTGGATTGTGTCAAGCCGAGAGGGTTTCCGAAGCCTTTGCTATATTTTCTGTTCTCAAGGACTTCAATATGAA
TGTAAGTTCAGCGAGCTGTGTGATGTTGATTGGCAGGTTGTGCATGGAAGGAGAACTTGACCTAGCCGTGGATGTTTTTCTTTATACACTAGAAGAAGGCTTTATGTTGA
TGCCTCGAATTTGTAATCAGCTGCTGAGCCGCCTTCTTCATTTGGAGGACAGAAAAGACCATGCTCTTGTTCTTATACATAGAATGGAGGCTTTTGGATATGATATGAAT
ACTCTTCTCCACGACAGAGAGAAAACACAAACCAGAGTCAAAACAGCCATAGAGATGGACGGAGATCTGCCGCCGGCACCGGCTCCGTTGTCCATCCACCCGGCCGTAGC
ACCGCTATCATTCTTACTCGGAACATGGAGAGGCAAAGGCGACGGTGGATTCCCCACCATTAATTCCTTCTCTTACGGCGAGGAGCTTCACTTCTCCCATTCCGGCAAGC
CAGTGATTTCCTACACTCAAAAGACTTGGAAACTCGATTCTAGAGAGCCAATGCACGCTGAGAGTGGCTATTGGCGTCCCAAGCCCGATGGTACCATTGAGGTAGTCATC
GCTCAAAGTACTGGTATCGTTGAAGTTCAGTCAAATGCAAGGTCAGATTCTGCAGAGTTGCAACCGCGTACCAACTGTTCAATGAAATGCTCAACTCAAATGTTGTCTCA
TGGACCTCACTCATGGCAGTTATATCGAGATGGTCGGCTGAGTACAGCTCTTTCACTGTTTGGGGCAATGCCGAGAAGTCCAGTTGTTCTCAACGACTTCACTTTTGTGA
ATGCAATCAAGGCCTGTTTGATCCTTTCAAATTTAAGAAATGGTGAAAAGTTCCATGCCTATGTTGAGATTTTGGTTTTGGAGCTAATATTATGGTCTGTTCCTCGCTTA
TTGATATTGATTTCTGCTTATGCTCAGAATGCACATGGCAACGATGTGTTAACAGCTGGTTTCAGGAAAAGTCGTAGTGCAGTGATCCATCTTGGCTGTGATTCAAGCCA
CATAGCTGCAAATGTGCTGGTTAATATGTATGCTAAATGTGGCCGTAACTTCAGAACATCTGCTGATCTTACAGTCGACATTTTTAATTGTGTTACAGATTTGATGGCTT
GTGATAAAGGAACATATGATGCAGAAGAGAAAGTGATAAAGCTCCAAAGTGGATTTGTGGGGAATGCTTCTAAGGTGAAAGAAATAAGCAGAATATTTAAACTGGTGGAT
GGAGAGCTTTCCTACGTGGTTCAGATGGCTACTGGCTTAACCAGTCTTCAACCTCACTTAAAAGCCTTTCTTACTAAAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCTCCGACCAGATTTTCTTCGACCCATAATCGCCTATGTAGTCCCAAAACCTCCATGGTTCCATTTATTTCATTCGCCCACTGACCCAATCGCCACTTCCAATGA
GGTCTCCACCATAATCGAAACTGTTGATCCTTTCGAAGATGGATTGGAAGTCATATCGCCCCATATTTCGTCTGATGTAATTACTTCCGTCATTGAAGAACAATCGAATC
CCCGACTTGGATTTCGACTTTTTATCTGGTCGTTAAGGAGAAAGCGTCTCTGCTGCAATGCCTCGCAGAATTTTATCATCGACAGGTTAGTAAAGGACAATGCCTTCGAA
TTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCAGCAATTGAAATTCCATCGGATGCTTTCTCTGTGTTGATTGAGGCATACTCAAAAGCGGGCATGGATGAGAAGGC
CGTTGAATCATTTGGTCTGATGCGGGATTTTGACTGTAAGCCCAACATTTTTGCTTTCAATTTGATTTTGCATGTTTTGGTGCGAAAAGAAGCATTTTTGTTAGCTTTAG
CAGTGTATAATCAGATGCTGAAATGTAATTTGAATCCGAATGTGGTTACCTACAGCATATTGATTCATGGATTCTGTAAAACTAGTAAAACTCAAGATGCCCTTGTACTT
TTTGACGAAATGACCTATAGAGGAATATTGCCCAACGAGATAACTTATTCGATTGTTCTTTCTGGACTGTGTCAAGCTAAGAAAATTGATGATGCCCAGAGATTGTTCAG
TAAGATGAGAGCTATTGGGTGTAGTCCAGATGTAATAACTTATAATGTTTTGCTTAATGGATTTTGTAAGTTAGGTTATTTGGATGAAGCTTTTTCATTGTTGCAATCTT
TTGAAAAGGATGGCCATATTCTTGGAGTTAATGGGTATAGTTGTTTGATTAATGGCTTGTTTAGGGCTAGGAGATATGATGAAGCACATATGTGGTACCAGAAAATGTTG
AGGGAAAACATCAAGCCCGATGTTATCTTGTATACTATTATGATCCAAGGTTTATCACAAGAAGGTCGGGTTACTGATGCATTGGCACTGTTGGATGAGATGACAGAAAG
AGGGCTTAGTCCAGATACTGCTTGTTACAATGCTTTAATTAAAGGGTTTTGTGATATGGGTCATTTGGATAAGGCTCAGTCCCTTCGACTTGAGATTTCAAAACACGACT
GTTTCCCTGATAATCACACATACTCCATTCTCATTTGTGGTATGTGCAAGAATGGGCTAATAAGTGAGGCACAACATATATTTAATGAAATGGAGAAGCTTGGATGCGTT
CCTTCTATAGTGACCTTCAATTCTCTCATTGATGGACTTTGCAAAGCCGGTAAGCTTCAGGAAGCTTACCTATTATTTTACAAAATGGAGATAGGAAAAAAACCTTCTTT
GTTTCTTCGGCTTTCTCAGGGCGCCAATAAGGTTCTTGACAGTGCCAGTCTCCAAGTTATGATCCAGCAATTATGTGAGTCAGGATTGATTCTTAAGGCCTACAAGCTTC
TTATGCAGCTAGTTGAGAGTGGGGTTTTGCCAGATATTAGGACTTATAACATCCTAATCAATGGATTATGCAAGAATAACGATATTAATGGTGGTTTTAAGCTCTTCAAG
GCCATGCAACTCAAAGGACGCTTGCCAGATTCGGTTACATACGGGACTCTAATAGATGGGCTCTATAGAGTTGGTAGGGATGAGGATGCACTAGGAATTTTTGAACAAAT
GGTAAAGAATGGGTGCAAGCCTGATTCTTCTATTTACAAGTCCATCATGACTTGGTCGTGTCGAAAAAAGAAGGTTTCACTAGCTTTTAGTGTTTGGATGAAGTATCTGA
GGAATTTTCGTGGCTGGGAAGATGAAAAGGTAGCAGTAGTAGCGGAAAGTTTCAATAAAGGAGAGCTTGAGACAACAATTCAGAGATTAATTGAAATGGACATGATATCA
AAAGATTTCGACTTAGCTCCATACACCATTTTTCTCATTGGATTGTGTCAAGCCGAGAGGGTTTCCGAAGCCTTTGCTATATTTTCTGTTCTCAAGGACTTCAATATGAA
TGTAAGTTCAGCGAGCTGTGTGATGTTGATTGGCAGGTTGTGCATGGAAGGAGAACTTGACCTAGCCGTGGATGTTTTTCTTTATACACTAGAAGAAGGCTTTATGTTGA
TGCCTCGAATTTGTAATCAGCTGCTGAGCCGCCTTCTTCATTTGGAGGACAGAAAAGACCATGCTCTTGTTCTTATACATAGAATGGAGGCTTTTGGATATGATATGAAT
ACTCTTCTCCACGACAGAGAGAAAACACAAACCAGAGTCAAAACAGCCATAGAGATGGACGGAGATCTGCCGCCGGCACCGGCTCCGTTGTCCATCCACCCGGCCGTAGC
ACCGCTATCATTCTTACTCGGAACATGGAGAGGCAAAGGCGACGGTGGATTCCCCACCATTAATTCCTTCTCTTACGGCGAGGAGCTTCACTTCTCCCATTCCGGCAAGC
CAGTGATTTCCTACACTCAAAAGACTTGGAAACTCGATTCTAGAGAGCCAATGCACGCTGAGAGTGGCTATTGGCGTCCCAAGCCCGATGGTACCATTGAGGTAGTCATC
GCTCAAAGTACTGGTATCGTTGAAGTTCAGTCAAATGCAAGGTCAGATTCTGCAGAGTTGCAACCGCGTACCAACTGTTCAATGAAATGCTCAACTCAAATGTTGTCTCA
TGGACCTCACTCATGGCAGTTATATCGAGATGGTCGGCTGAGTACAGCTCTTTCACTGTTTGGGGCAATGCCGAGAAGTCCAGTTGTTCTCAACGACTTCACTTTTGTGA
ATGCAATCAAGGCCTGTTTGATCCTTTCAAATTTAAGAAATGGTGAAAAGTTCCATGCCTATGTTGAGATTTTGGTTTTGGAGCTAATATTATGGTCTGTTCCTCGCTTA
TTGATATTGATTTCTGCTTATGCTCAGAATGCACATGGCAACGATGTGTTAACAGCTGGTTTCAGGAAAAGTCGTAGTGCAGTGATCCATCTTGGCTGTGATTCAAGCCA
CATAGCTGCAAATGTGCTGGTTAATATGTATGCTAAATGTGGCCGTAACTTCAGAACATCTGCTGATCTTACAGTCGACATTTTTAATTGTGTTACAGATTTGATGGCTT
GTGATAAAGGAACATATGATGCAGAAGAGAAAGTGATAAAGCTCCAAAGTGGATTTGTGGGGAATGCTTCTAAGGTGAAAGAAATAAGCAGAATATTTAAACTGGTGGAT
GGAGAGCTTTCCTACGTGGTTCAGATGGCTACTGGCTTAACCAGTCTTCAACCTCACTTAAAAGCCTTTCTTACTAAAGTTTGACAATAAAGAAAAAGAAAGAGTTTAAC
AAATCCTTTTATTTTCTCTTAATGGACTGAAGTTATCTAATGACTTAACGACTTTATTAGATGAGGATATGTTCTATAGGTGTACAACTGCGTCCTTAACTTGTAAGAAG
TTATCAAGAATACTGAAGTCCACAATATAGATATATTGAATTGCAATTAGCATATTTCTATGATTCTGTATAGGGTTTGCAATGGGGCTGGAGGAGTTCTGGTCCCTGCT
CTCATGGGGGAGCGAGCCAGGGAGAGCGAATCTCTGTGGGAGCAAAGGAGAGCGAGCCAGGGAGAGCGAGTGTGAATTTAATTAGGGTGGTTGAGAGAGAGAAAGAGGAA
AGGATGCAAGAGTGAGAACACGAGATTTAACAATAACATCGGATTGGCGTCAAGGAATGGGGTTGGGGTCAGGGATACTCCTCATCCTCGATTAGGATCCCCGGTTAAAC
TGGGAATTCCCCAACCCGATCGGATCGGGCTCCTCAGATTTTTTTGCCAACCCTAATTATGTAGCTTTTATTCTCCACATCTCTCTCTAGAGTAACATCATCTAATAGTA
ATTTTCTGGTATGTTTCATAAATGTATGTTAATTTTTTGTTTTTGTTCTTTAAAA
Protein sequenceShow/hide protein sequence
MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEEQSNPRLGFRLFIWSLRRKRLCCNASQNFIIDRLVKDNAFE
LYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQDALVL
FDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKML
RENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCV
PSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGKKPSLFLRLSQGANKVLDSASLQVMIQQLCESGLILKAYKLLMQLVESGVLPDIRTYNILINGLCKNNDINGGFKLFK
AMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMIS
KDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFNMNVSSASCVMLIGRLCMEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDMN
TLLHDREKTQTRVKTAIEMDGDLPPAPAPLSIHPAVAPLSFLLGTWRGKGDGGFPTINSFSYGEELHFSHSGKPVISYTQKTWKLDSREPMHAESGYWRPKPDGTIEVVI
AQSTGIVEVQSNARSDSAELQPRTNCSMKCSTQMLSHGPHSWQLYRDGRLSTALSLFGAMPRSPVVLNDFTFVNAIKACLILSNLRNGEKFHAYVEILVLELILWSVPRL
LILISAYAQNAHGNDVLTAGFRKSRSAVIHLGCDSSHIAANVLVNMYAKCGRNFRTSADLTVDIFNCVTDLMACDKGTYDAEEKVIKLQSGFVGNASKVKEISRIFKLVD
GELSYVVQMATGLTSLQPHLKAFLTKV