CuGenDBv2

CaUC02G044320 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC02G044320
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Ciama_Chr02:32146419..32155218
RNA-Seq Expression: CaUC02G044320
Synteny: CaUC02G044320
Gene Ontology terms:
  GO:0003729 - mRNA binding (molecular function)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR012674 - Calycin
  IPR014878 - THAP4-like, heme-binding beta-barrel domain
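
The genome location above uses a "seqid:start..end" notation. As a minimal sketch, the string can be split into its parts and the span length computed as follows. The function names parse_location and span_length are hypothetical helpers, not part of any CuGenDB tool, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(
        r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location
    )
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Ciama_Chr02:32146419..32155218")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span for this gene is 8800 bp; with half-open coordinates it would be one less.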


Homology

GenBank top hits (e value, % identity, alignment)
KAG7035334.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 84.32
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+ TDPIA+SNEVSTIIETVDP ED LE+I+PH+SSDVITSVI++Q N RLGFRLFIWSLRR+ LCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCK+SKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ
         HDCFP+NHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIGRKP LFLRLSQGANK+L    LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVE GV PDIRTYNILING CK NNI+G F LFK MQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS
        SIMTWSCR+KKVSLAFSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDFDLAPYTIFL+GLCQA RVSEAFAIFSVLKDFK  +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS

Query:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER
        ASCVMLIG LCVEG+LDLAV+VFLYTLE G MLMPRICNQLL  LLHLEDRKDHA VLI RMEAFGYD+N  LH S +
Subjt:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER

XP_022948073.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 84.45
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+ TDPIATSNEVSTIIETVDP ED LE+I+PH+SSDVITSVI++Q N RLGFRLFIWSLRR+ LCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNI+A+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCK+SKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ
         HDCFP+NHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIGRKPSLFLRLSQGANK+L    LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVE GV PDIRTYNILING CK NNI+G FKLFK MQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS
        SIMTWSCR+KKVSL FSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDF+LAPYTIFLIGLCQA RVSEAFAIFSVLKDFK  +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS

Query:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER
        ASCVMLIG LCVEG+LDLAV+VFLYTLE G MLMPRICNQLL  LLHLEDRKDHA VLI RMEAFGYD+N  LH S +
Subjt:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER

XP_023007126.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 85.22
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+PTDPIATSNEVSTIIETVDP ED LE I+PHISSDVITSVI++Q N RLGFRLFIWSLRR+ LCC+ASQ+ II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM+EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCK+SKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ
         HDCFPDNHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIGRKPSLFLRL QGANKVL    LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVE GV PDIRTYNILING CK NNI+G FKLFK MQLKGRLPDS+TYGTLIDGL+RVGRDEDALGIFEQMVKNGCKP+SS+YK
Subjt:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS
        SIMTWSCR+KKVSLAFSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDFDLAPYTIFLIGLCQA RVSEAFAIFSVLKDFK  +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS

Query:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER
        ASCVMLIG LCVEG+LDLAV+VFLYTLE G MLMPRICNQLL R LHLEDRKDHA VLI RMEAFGYD+N  LH S +
Subjt:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER

XP_023534570.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 84.7
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+PTD IATSNEVSTIIETVDP ED LE+I+PHISSDVITSVI++Q N RLGFR+FIWSLRR+ LCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCK+SKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ
         HDCFPDNHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIGRKPSLFLRLSQGANKVL    LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVE GV PDIRTYNILING CK NNI+G FKLFK MQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS
        SIMTWSCR+KKVSLAFSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDFDLAPYTIFLIGLCQA R SEAFAIFSVLKDFK  +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS

Query:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER
        ASCVMLIG LCVEG+LDLAV+VFLYTLE G MLMPRICNQLL   LHLE+RKDHA VLI RMEAFGYD+N  LH S +
Subjt:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER

XP_038901213.1 pentatricopeptide repeat-containing protein At1g79540 [Benincasa hispida] | e value: 0.0e+00 | % identity: 88.08
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK+RP   RPII YVVPKPPWF  FHSPTDPIATSNEVSTIIETVD FEDGLEVISPHISSD+ITSVI++Q NPRLGFRLFIWSLRRKRLCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDSAIEI SDAFSVLIEAY KAGM+EKAVESFGLMRDFDCKPN+FAFNLILH+LVRKEAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSK--TQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDG
        Y ILIHGFCK+SK  TQDAL LFDEMT RGILPNEITYSIVLSGLC+AKKI DAQRLFSKMRA G SPDV+TYNVLLNGFCKLGYL+EAF+LLQSFEKDG
Subjt:  YSILIHGFCKSSK--TQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDG

Query:  HILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLE
        HILGVNGYSCLINGLFRARRYDEAHMWYQK+LRENIKPDVILYTIMIQGLSQEGRVTDALALL EMTERG SPDTACYN LIKGFCD+G+LDKAQSLRLE
Subjt:  HILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLE

Query:  ISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMI
        IS H+CFPDNHTYSILICGMCKNGL+SEAQ +FNEMEKLGC+PS+VTFNSLIDGLCKAG+L+EA+LLF KMEIGRKPSLFLRLSQG NKVLD ASLQVM+
Subjt:  ISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMI

Query:  QQLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSI
        +QLCESGL+LKAYKLLMQLVE GVLPDIRTYNILING CKNNNING FKL K M+LKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVK GCKPDSSI
Subjt:  QQLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSI

Query:  YKSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNV
        YKSIMTW CRKK +SLAF+VWMKYLRNFRGWEDEKV +V ESF+KGEL+TTI RL++MDM SKDFDLAPYTIFLIGLCQA+RVSEAFAIFSVLKDFKMN+
Subjt:  YKSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNV

Query:  SSASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER
        SSASCVMLIG LCV  +LDLAVDVFLYTLEEG MLMPRICN+LLS LLH+ED+KDHALVL+++MEAFGYD+NT LH S +
Subjt:  SSASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KD52 Uncharacterized protein | e value: 0.0e+00 | % identity: 78.81
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MKLRP   RPII +VVPKP  FH +HS T+PIATS EVSTIIET+DP EDGL+VIS  I S  ITSV+++Q + RLGFRLFIWSL+   L C   Q+ II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
         +L+K+NAFELYWK LQELK+SAI+I S+AFSVLIEAYS+AGMDEKAVESFGLMRDFDCKP++FAFNLILH LVRKEAFLLALAVYNQMLKCNLNP+VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        Y ILIHG CK+ KTQDALVLFDEMT RGILPN+I YSIVLSGLCQAKKI DAQRLFSKMRA GC+ D+ITYNVLLNGFCK GYLD+AF+LLQ   KDGHI
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GY CLINGLFRARRY+EAHMWYQKMLRENIKPDV+LYTIMI+GLSQEGRVT+AL LL EMTERGL PDT CYNALIKGFCDMG+LD+A+SLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ
        KHDCFP+NHTYSILICGMCKNGLI++AQHIF EMEKLGC+PS+VTFNSLI+GLCKA +L+EA LLFY+MEI RKPSLFLRLSQG +KV D ASLQVM+++
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESG+ILKAYKLLMQLV+ GVLPDIRTYNILING CK  NING FKLFK MQLKG +PDSVTYGTLIDGLYR GR+EDAL IFEQMVK GC P+SS YK
Subjt:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS
        +IMTWSCR+  +SLA SVWMKYLR+FRGWEDEKV VVAESF+  EL+T I+RL+EMD+ SK+FDLAPYTIFLIGL QA+R  EAFAIFSVLKDFKMN+SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS

Query:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLH
        ASCVMLIG LC+   LD+A+DVFL+TLE GF LMP ICNQLL  LLHL DRKD AL L +RMEA GYD+   LH
Subjt:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLH

A0A5D3B9M5 Pentatricopeptide repeat-containing protein | e value: 0.0e+00 | % identity: 76.49
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MKLRP+  RPII +VVPKPP F  +HS T+PI TS EVSTIIETVDP EDGL+VIS  I+S +ITSV+  Q N  LGFRLFIWSL        A ++ II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        D+L+KDNAFELYWK LQELK+SAIEI SDAFSVLIEAYS+AGM+EKAVESFGLMRDFDCKPN+FAFNLIL  LVRKEAFLLALAVYNQMLKCNLNP+V T
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        Y ILIHGFC++ KTQDALVLFDEMT RGILPN+I Y+IVLSGLC+AKKI DAQRLFS M A     D+ TYNVLLNGFCKLGYLDEAF+LLQ   KDGH 
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        L V+GY CLINGLFRARRY+EAH WY+KMLRENIKPDVILYTIMIQGLSQEGRVT+A+ LL EM ERGL PDT CYNALIKGFCD+G+LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ
         H CFP NHTYSILICGMCK+GLI+EAQHIF EMEKLGC+PS+VTFNSLI+GLCKA +L+EA LLFY+MEI RKPSLFLRLSQG +KVLD ASLQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLILKAYKLLMQLV+ GVLPDIRTYNILING CK  NING FKLFK MQ +G +PDSVTYGTLIDGLYRVGR+EDALGIF QM K GC PDSS Y+
Subjt:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS
        +IMTW CR+K + L  SVWMKYLRNFRGWEDEKV VV ESF+  EL+T I+RL+EMD+ SK+FD+APYTIFLIGLC+A+RVSEAFAIFSV KDFKMN+SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS

Query:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLH
        ASCV LI  LC   +L+LAVDVFL+TLE  F +MP ICN+LL  LL L DRKD AL L +R+EA GYD+   L+
Subjt:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLH

A0A6J1D6A9 pentatricopeptide repeat-containing protein At1g79540 isoform X1 | e value: 0.0e+00 | % identity: 82.13
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK RP F+RPII  +VPKPPWFHL+HSPTDPIATSNEV TI+ETV+PFED LE I+PH+S DVITSVIE+Q NPRLGFRLFIWSL+ KRLCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLV+DNAFELYWKTLQELKDSA+ I SDAFSVLIEAYS AGMDEKAVESFGLM+DFDCKPNIF +NLIL+VLVRKEAF LAL+VYNQML+CN  PNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHG CK+SKTQDALVLFDEM  RGI PNEITYSIVLSGLCQA KIDDAQRLF KMRA GCSPD ITYNVLLNGFCK GY DEAF+LLQ+FEKDGHI
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGVN YSCLI+GLFRARRYDEA  WYQKMLRENIKPDVILYTIMIQGLSQEG++ DALALL EMTERG SPDT CYNALIKGFCDM  LDKA+SLRL IS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ
         HDC PDNHTYSILICGMC+NGLI EAQ++FNEMEKLGC+PS+ TFNSLIDGLCK G++ EA LLFYKMEIGRKPS+FLRL+QG NKVLD A LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESG+ILKAYKLLMQL E GVLPDIRTYNILING CK N ING FKLFK MQLKGRLPDSVTYGTLI+GL+RVGRD+DAL +F+QMVK GCKPDSS+YK
Subjt:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS
        +IMTWSCRKK VSLAFSVWMKYL NFRGW+DE V VV  SF+KGELE  I+RLIEMD  SKDFD +PYTIFLIGLCQA+RVSEAFAIFSVLKDFKMN + 
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS

Query:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER
        ASCVMLIG LC+E +LDLA+DVFLYTLE GF+LMPRICNQLL  LL  EDRKDHALVLI RME FGYD++  LH S +
Subjt:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER

A0A6J1G8C6 pentatricopeptide repeat-containing protein At1g79540 | e value: 0.0e+00 | % identity: 84.45
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+ TDPIATSNEVSTIIETVDP ED LE+I+PH+SSDVITSVI++Q N RLGFRLFIWSLRR+ LCC+ASQN II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNI+A+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCK+SKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ
         HDCFP+NHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIGRKPSLFLRLSQGANK+L    LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVE GV PDIRTYNILING CK NNI+G FKLFK MQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS
        SIMTWSCR+KKVSL FSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDF+LAPYTIFLIGLCQA RVSEAFAIFSVLKDFK  +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS

Query:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER
        ASCVMLIG LCVEG+LDLAV+VFLYTLE G MLMPRICNQLL  LLHLEDRKDHA VLI RMEAFGYD+N  LH S +
Subjt:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER

A0A6J1KZN2 pentatricopeptide repeat-containing protein At1g79540 | e value: 0.0e+00 | % identity: 85.22
Query:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII
        MK R  FLRP++ Y+VPKPPWFHLFH+PTDPIATSNEVSTIIETVDP ED LE I+PHISSDVITSVI++Q N RLGFRLFIWSLRR+ LCC+ASQ+ II
Subjt:  MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFII

Query:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT
        DRLVKDNAFELYWKTLQELKDS+ EI SDAFSVLIEAYSKAGM+EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR+EAFLLALAVYNQMLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        YSILIHGFCK+SKTQ+ALVLFDEMT R +LPNEITYSI+LSGLCQAKKIDDAQRLF KMRA GCSPDVITYNVLLNGFCKLGY DEAF+LL+SFEKDGHI
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWYQK  R+N++PDVILYTIMIQGL QEGRV +ALALLDEMTERG SPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ
         HDCFPDNHTYSILICGMCKNGLI EAQH+FNEMEKLGC+PS+VTFNSLIDG CKAGKL+EA+LLFYKMEIGRKPSLFLRL QGANKVL    LQVM++Q
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQ

Query:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVE GV PDIRTYNILING CK NNI+G FKLFK MQLKGRLPDS+TYGTLIDGL+RVGRDEDALGIFEQMVKNGCKP+SS+YK
Subjt:  LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS
        SIMTWSCR+KKVSLAFSVWMKYLRNFRGW+DEKV VV ESF+KG+LE  I R+IEMD+ SKDFDLAPYTIFLIGLCQA RVSEAFAIFSVLKDFK  +SS
Subjt:  SIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSS

Query:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER
        ASCVMLIG LCVEG+LDLAV+VFLYTLE G MLMPRICNQLL R LHLEDRKDHA VLI RMEAFGYD+N  LH S +
Subjt:  ASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLLHDSER

SwissProt top hits (e value, % identity, alignment)
Q9FIX3 Pentatricopeptide repeat-containing protein At5g39710 | e value: 6.2e-75 | % identity: 27.31
Query:  RLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVR-KEAFLLALAVYNQMLKCNLNPNVVT
        + + D    L +K+LQE  D      S  F +++++YS+  + +KA+    L +     P + ++N +L   +R K     A  V+ +ML+  ++PNV T
Subjt:  RLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVR-KEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        Y+ILI GFC +     AL LFD+M  +G LPN +TY+ ++ G C+ +KIDD  +L   M   G  P++I+YNV++NG C+ G + E   +L    + G+ 
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        L    Y+ LI G  +   + +A + + +MLR  + P VI YT +I  + + G +  A+  LD+M  RGL P+   Y  L+ GF   G++++A  +  E++
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKM-EIGRKPSLFLRLSQGANKVLDNASLQVMIQ
         +   P   TY+ LI G C  G + +A  +  +M++ G  P +V++++++ G C++  + EA  +  +M E G KP              D  +   +IQ
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKM-EIGRKPSLFLRLSQGANKVLDNASLQVMIQ

Query:  QLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIY
          CE     +A  L  +++ +G+ PD  TY  LIN  C   ++    +L   M  KG LPD VTY  LI+GL +  R  +A  +  ++      P    Y
Subjt:  QLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIY

Query:  KSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDF--DLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMN
         ++                    + N    E + V  + + F    + T   ++ E  M+ K+   D   Y I + G C+A  + +A+ ++  +      
Subjt:  KSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDF--DLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMN

Query:  VSSASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGY
        + + + + L+ +L  EG+++    V ++ L     L      ++L  + H E   D  L ++  M   G+
Subjt:  VSSASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGY

Q9LQ14 Pentatricopeptide repeat-containing protein At1g62930, chloroplastic | e value: 1.8e-74 | % identity: 31.84
Query:  SQNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNL
        S+N ++D L  D+A +L+ + +Q     +I      F+ L+ A +K    +  +     M++     +++++N++++   R+    LALAV  +M+K   
Subjt:  SQNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNL

Query:  NPNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSF
         P++VT S L++G+C   +  +A+ L D+M      PN +T++ ++ GL    K  +A  L  +M A GC PD+ TY  ++NG CK G +D A SLL+  
Subjt:  NPNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSF

Query:  EKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQS
        EK      V  Y+ +I+ L   +  ++A   + +M  + I+P+V+ Y  +I+ L   GR +DA  LL +M ER ++P+   ++ALI  F   G L +A+ 
Subjt:  EKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQS

Query:  LRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASL
        L  E+ K    PD  TYS LI G C +  + EA+H+F  M    C P++VT+N+LI G CKA +++E   LF +M                  V +  + 
Subjt:  LRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASL

Query:  QVMIQQLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKP
          +IQ L ++G    A K+  ++V  GV PDI TY+IL++GLCK   +     +F+ +Q     PD  TY  +I+G+ + G+ ED   +F  +   G KP
Subjt:  QVMIQQLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKP

Query:  DSSIYKSIMTWSCRK
        +  IY ++++  CRK
Subjt:  DSSIYKSIMTWSCRK

Q9LQ16 Pentatricopeptide repeat-containing protein At1g62910 | e value: 8.9e-74 | % identity: 31.26
Query:  QNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLN
        +N + D +  D+A +L+   ++     +I      F+ L+ A +K    E  +     M+      +++ +++ ++   R+    LALAV  +M+K    
Subjt:  QNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLN

Query:  PNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFE
        P++VT S L++G+C S +  DA+ L D+M   G  P+  T++ ++ GL    K  +A  L  +M   GC PD++TY  ++NG CK G +D A SLL+  E
Subjt:  PNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFE

Query:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSL
        K      V  Y+ +I+GL + +  D+A   + +M  + I+PDV  Y+ +I  L   GR +DA  LL +M ER ++P+   ++ALI  F   G L +A+ L
Subjt:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSL

Query:  RLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGRKPSLFLRLSQGANKVLDN
          E+ K    PD  TYS LI G C +  + EA+H+F  M    C P++VT+++LI G CKA +++E   LF +M     +G   + +  L  G  +  D 
Subjt:  RLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGRKPSLFLRLSQGANKVLDN

Query:  ASLQVMIQQ-------------------LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLY
         + Q++ +Q                   LC++G + KA  +   L    + PDI TYNI+I G+CK   +  G++LF  + LKG  P+ + Y T+I G  
Subjt:  ASLQVMIQQ-------------------LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLY

Query:  RVGRDEDALGIFEQMVKNGCKPDSSIYKSIM
        R G  E+A  + ++M ++G  P+S  Y +++
Subjt:  RVGRDEDALGIFEQMVKNGCKPDSSIYKSIM

Q9SAJ5 Pentatricopeptide repeat-containing protein At1g79540 (e value: 5.5e-225; % identity: 50.85)
Query:  FLRPIIAYVVPKPPWF-HLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFIIDRLVK
        F R +I +   KP W    + S       S EV +I+    P E  LE + P +S ++ITSVI+D+ N +LGFR FIW+ RR+RL    S   +ID L +
Subjt:  FLRPIIAYVVPKPPWF-HLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFIIDRLVK

Query:  DNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEA-FLLALAVYNQMLKCNLNPNVVTYSIL
        DN  +LYW+TL+ELK   + + S  F VLI AY+K GM EKAVESFG M++FDC+P++F +N+IL V++R+E  F+LA AVYN+MLKCN +PN+ T+ IL
Subjt:  DNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEA-FLLALAVYNQMLKCNLNPNVVTYSIL

Query:  IHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVN
        + G  K  +T DA  +FD+MT RGI PN +TY+I++SGLCQ    DDA++LF +M+  G  PD + +N LL+GFCKLG + EAF LL+ FEKDG +LG+ 
Subjt:  IHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVN

Query:  GYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDC
        GYS LI+GLFRARRY +A   Y  ML++NIKPD+ILYTI+IQGLS+ G++ DAL LL  M  +G+SPDT CYNA+IK  C  G L++ +SL+LE+S+ + 
Subjt:  GYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDC

Query:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQLCES
        FPD  T++ILIC MC+NGL+ EA+ IF E+EK GC PS+ TFN+LIDGLCK+G+L+EA LL +KME+GR  SLFLRLS   N+  D          + ES
Subjt:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQLCES

Query:  GLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT
        G ILKAY+ L    + G  PDI +YN+LING C+  +I+G  KL   +QLKG  PDSVTY TLI+GL+RVGR+E+A  +F    K+  +   ++Y+S+MT
Subjt:  GLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT

Query:  WSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSSASCV
        WSCRK+KV +AF++WMKYL+     +DE    + + F +GE E  ++RLIE+D    +  L PYTI+LIGLCQ+ R  EA  +FSVL++ K+ V+  SCV
Subjt:  WSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSSASCV

Query:  MLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLL
         LI  LC   +LD A++VFLYTL+  F LMPR+CN LLS LL   ++ +    L +RME  GY+++++L
Subjt:  MLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLL

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial (e value: 6.2e-75; % identity: 33.4)
Query:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGIL
        FS L+ A +K    +  +     M++     N + ++++++   R+    LALAV  +M+K    PN+VT S L++G+C S +  +A+ L D+M   G  
Subjt:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGIL

Query:  PNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKML
        PN +T++ ++ GL    K  +A  L  +M A GC PD++TY V++NG CK G  D AF+LL   E+     GV  Y+ +I+GL + +  D+A   +++M 
Subjt:  PNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKML

Query:  RENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHI
         + I+P+V+ Y+ +I  L   GR +DA  LL +M ER ++PD   ++ALI  F   G L +A+ L  E+ K    P   TYS LI G C +  + EA+ +
Subjt:  RENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHI

Query:  FNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQLCESGLILKAYKLLMQLVEIGVLPDIRTYN
        F  M    C P +VT+N+LI G CK  +++E       ME+      F  +SQ    V +  +  ++IQ L ++G    A ++  ++V  GV P+I TYN
Subjt:  FNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQLCESGLILKAYKLLMQLVEIGVLPDIRTYN

Query:  ILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRK
         L++GLCKN  +     +F+ +Q     P   TY  +I+G+ + G+ ED   +F  +   G KPD   Y ++++  CRK
Subjt:  ILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRK

Arabidopsis top hits (e value, % identity, alignment)
AT1G62670.1 rna processing factor 2 (e value: 4.4e-76; % identity: 33.4)
Query:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGIL
        FS L+ A +K    +  +     M++     N + ++++++   R+    LALAV  +M+K    PN+VT S L++G+C S +  +A+ L D+M   G  
Subjt:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGIL

Query:  PNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKML
        PN +T++ ++ GL    K  +A  L  +M A GC PD++TY V++NG CK G  D AF+LL   E+     GV  Y+ +I+GL + +  D+A   +++M 
Subjt:  PNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKML

Query:  RENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHI
         + I+P+V+ Y+ +I  L   GR +DA  LL +M ER ++PD   ++ALI  F   G L +A+ L  E+ K    P   TYS LI G C +  + EA+ +
Subjt:  RENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHI

Query:  FNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQLCESGLILKAYKLLMQLVEIGVLPDIRTYN
        F  M    C P +VT+N+LI G CK  +++E       ME+      F  +SQ    V +  +  ++IQ L ++G    A ++  ++V  GV P+I TYN
Subjt:  FNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQLCESGLILKAYKLLMQLVEIGVLPDIRTYN

Query:  ILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRK
         L++GLCKN  +     +F+ +Q     P   TY  +I+G+ + G+ ED   +F  +   G KPD   Y ++++  CRK
Subjt:  ILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRK

AT1G62910.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 6.3e-75; % identity: 31.26)
Query:  QNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLN
        +N + D +  D+A +L+   ++     +I      F+ L+ A +K    E  +     M+      +++ +++ ++   R+    LALAV  +M+K    
Subjt:  QNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLN

Query:  PNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFE
        P++VT S L++G+C S +  DA+ L D+M   G  P+  T++ ++ GL    K  +A  L  +M   GC PD++TY  ++NG CK G +D A SLL+  E
Subjt:  PNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFE

Query:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSL
        K      V  Y+ +I+GL + +  D+A   + +M  + I+PDV  Y+ +I  L   GR +DA  LL +M ER ++P+   ++ALI  F   G L +A+ L
Subjt:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSL

Query:  RLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGRKPSLFLRLSQGANKVLDN
          E+ K    PD  TYS LI G C +  + EA+H+F  M    C P++VT+++LI G CKA +++E   LF +M     +G   + +  L  G  +  D 
Subjt:  RLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKME----IGRKPSLFLRLSQGANKVLDN

Query:  ASLQVMIQQ-------------------LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLY
         + Q++ +Q                   LC++G + KA  +   L    + PDI TYNI+I G+CK   +  G++LF  + LKG  P+ + Y T+I G  
Subjt:  ASLQVMIQQ-------------------LCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLY

Query:  RVGRDEDALGIFEQMVKNGCKPDSSIYKSIM
        R G  E+A  + ++M ++G  P+S  Y +++
Subjt:  RVGRDEDALGIFEQMVKNGCKPDSSIYKSIM

AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.3e-75; % identity: 31.84)
Query:  SQNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNL
        S+N ++D L  D+A +L+ + +Q     +I      F+ L+ A +K    +  +     M++     +++++N++++   R+    LALAV  +M+K   
Subjt:  SQNFIIDRLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNL

Query:  NPNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSF
         P++VT S L++G+C   +  +A+ L D+M      PN +T++ ++ GL    K  +A  L  +M A GC PD+ TY  ++NG CK G +D A SLL+  
Subjt:  NPNVVTYSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSF

Query:  EKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQS
        EK      V  Y+ +I+ L   +  ++A   + +M  + I+P+V+ Y  +I+ L   GR +DA  LL +M ER ++P+   ++ALI  F   G L +A+ 
Subjt:  EKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQS

Query:  LRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASL
        L  E+ K    PD  TYS LI G C +  + EA+H+F  M    C P++VT+N+LI G CKA +++E   LF +M                  V +  + 
Subjt:  LRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASL

Query:  QVMIQQLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKP
          +IQ L ++G    A K+  ++V  GV PDI TY+IL++GLCK   +     +F+ +Q     PD  TY  +I+G+ + G+ ED   +F  +   G KP
Subjt:  QVMIQQLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKP

Query:  DSSIYKSIMTWSCRK
        +  IY ++++  CRK
Subjt:  DSSIYKSIMTWSCRK

AT1G79540.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.9e-226; % identity: 50.85)
Query:  FLRPIIAYVVPKPPWF-HLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFIIDRLVK
        F R +I +   KP W    + S       S EV +I+    P E  LE + P +S ++ITSVI+D+ N +LGFR FIW+ RR+RL    S   +ID L +
Subjt:  FLRPIIAYVVPKPPWF-HLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFIIDRLVK

Query:  DNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEA-FLLALAVYNQMLKCNLNPNVVTYSIL
        DN  +LYW+TL+ELK   + + S  F VLI AY+K GM EKAVESFG M++FDC+P++F +N+IL V++R+E  F+LA AVYN+MLKCN +PN+ T+ IL
Subjt:  DNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEA-FLLALAVYNQMLKCNLNPNVVTYSIL

Query:  IHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVN
        + G  K  +T DA  +FD+MT RGI PN +TY+I++SGLCQ    DDA++LF +M+  G  PD + +N LL+GFCKLG + EAF LL+ FEKDG +LG+ 
Subjt:  IHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVN

Query:  GYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDC
        GYS LI+GLFRARRY +A   Y  ML++NIKPD+ILYTI+IQGLS+ G++ DAL LL  M  +G+SPDT CYNA+IK  C  G L++ +SL+LE+S+ + 
Subjt:  GYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDC

Query:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQLCES
        FPD  T++ILIC MC+NGL+ EA+ IF E+EK GC PS+ TFN+LIDGLCK+G+L+EA LL +KME+GR  SLFLRLS   N+  D          + ES
Subjt:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQLCES

Query:  GLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT
        G ILKAY+ L    + G  PDI +YN+LING C+  +I+G  KL   +QLKG  PDSVTY TLI+GL+RVGR+E+A  +F    K+  +   ++Y+S+MT
Subjt:  GLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT

Query:  WSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSSASCV
        WSCRK+KV +AF++WMKYL+     +DE    + + F +GE E  ++RLIE+D    +  L PYTI+LIGLCQ+ R  EA  +FSVL++ K+ V+  SCV
Subjt:  WSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSSASCV

Query:  MLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLL
         LI  LC   +LD A++VFLYTL+  F LMPR+CN LLS LL   ++ +    L +RME  GY+++++L
Subjt:  MLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDINTLL

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 4.4e-76; % identity: 27.31)
Query:  RLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVR-KEAFLLALAVYNQMLKCNLNPNVVT
        + + D    L +K+LQE  D      S  F +++++YS+  + +KA+    L +     P + ++N +L   +R K     A  V+ +ML+  ++PNV T
Subjt:  RLVKDNAFELYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVR-KEAFLLALAVYNQMLKCNLNPNVVT

Query:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI
        Y+ILI GFC +     AL LFD+M  +G LPN +TY+ ++ G C+ +KIDD  +L   M   G  P++I+YNV++NG C+ G + E   +L    + G+ 
Subjt:  YSILIHGFCKSSKTQDALVLFDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        L    Y+ LI G  +   + +A + + +MLR  + P VI YT +I  + + G +  A+  LD+M  RGL P+   Y  L+ GF   G++++A  +  E++
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYQKMLRENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKM-EIGRKPSLFLRLSQGANKVLDNASLQVMIQ
         +   P   TY+ LI G C  G + +A  +  +M++ G  P +V++++++ G C++  + EA  +  +M E G KP              D  +   +IQ
Subjt:  KHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCVPSIVTFNSLIDGLCKAGKLQEAYLLFYKM-EIGRKPSLFLRLSQGANKVLDNASLQVMIQ

Query:  QLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIY
          CE     +A  L  +++ +G+ PD  TY  LIN  C   ++    +L   M  KG LPD VTY  LI+GL +  R  +A  +  ++      P    Y
Subjt:  QLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFKAMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIY

Query:  KSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDF--DLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMN
         ++                    + N    E + V  + + F    + T   ++ E  M+ K+   D   Y I + G C+A  + +A+ ++  +      
Subjt:  KSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMISKDF--DLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMN

Query:  VSSASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGY
        + + + + L+ +L  EG+++    V ++ L     L      ++L  + H E   D  L ++  M   G+
Subjt:  VSSASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGY


Sequences
CDS sequence
ATGAAGCTCCGACCAGATTTTCTTCGACCCATAATCGCCTATGTAGTCCCAAAACCTCCATGGTTCCATTTATTTCATTCGCCCACTGACCCAATCGCCACTTCCAATGA
GGTCTCCACCATAATCGAAACTGTTGATCCTTTCGAAGATGGATTGGAAGTCATATCGCCCCATATTTCGTCTGATGTAATTACCTCCGTCATTGAAGATCAATCGAATC
CCCGACTTGGATTTCGACTTTTTATCTGGTCGTTAAGGAGAAAGCGTCTCTGCTGCAATGCCTCGCAGAATTTTATCATCGACAGGTTAGTAAAGGACAATGCCTTCGAA
TTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCAGCAATTGAAATTCCATCGGATGCTTTCTCTGTGTTGATTGAGGCATACTCAAAAGCGGGCATGGATGAGAAGGC
CGTTGAATCATTTGGTCTGATGCGGGATTTTGACTGTAAGCCCAACATTTTTGCTTTCAATTTGATTTTGCATGTTTTGGTGCGAAAAGAAGCATTTTTGTTAGCTTTAG
CAGTGTATAATCAGATGCTGAAATGTAATTTGAATCCGAATGTGGTTACCTACAGCATATTGATTCATGGATTCTGTAAATCTAGTAAAACTCAAGATGCCCTTGTACTT
TTTGATGAAATGACCTATAGAGGAATATTGCCCAACGAGATAACTTATTCGATTGTTCTTTCTGGACTGTGTCAAGCTAAGAAAATTGATGATGCCCAGAGATTGTTCAG
TAAGATGAGAGCGATTGGGTGTAGTCCAGATGTAATAACTTATAATGTTTTGCTTAATGGATTTTGTAAGTTAGGTTATTTGGATGAAGCTTTTTCATTGTTGCAATCAT
TTGAAAAGGATGGCCATATTCTTGGAGTTAATGGGTATAGTTGTTTGATTAATGGCTTGTTTAGGGCTAGGAGATATGACGAAGCACATATGTGGTACCAGAAAATGTTG
AGGGAAAACATCAAGCCCGATGTTATCTTGTATACTATTATGATCCAAGGTTTATCACAAGAAGGTCGGGTTACTGATGCATTGGCACTGTTGGATGAGATGACAGAAAG
AGGGCTTAGTCCAGATACTGCTTGTTACAATGCTTTAATTAAAGGGTTTTGTGATATGGGTCATTTGGATAAGGCTCAGTCCCTTCGACTTGAGATTTCAAAGCACGACT
GTTTCCCTGATAATCACACATACTCCATTCTCATTTGTGGTATGTGCAAGAATGGGCTAATAAGTGAGGCACAACATATATTTAATGAAATGGAGAAGCTTGGATGCGTT
CCTTCTATAGTGACCTTCAATTCTCTCATTGATGGACTTTGCAAAGCCGGTAAGCTTCAGGAAGCTTACCTATTATTTTACAAAATGGAGATAGGAAGAAAACCTTCTTT
GTTTCTTCGGCTTTCTCAGGGCGCCAATAAGGTTCTTGACAATGCCAGTCTCCAAGTTATGATCCAGCAATTATGTGAGTCAGGATTGATTCTTAAGGCCTACAAGCTTC
TTATGCAGCTAGTTGAGATTGGGGTTTTGCCAGATATTAGGACTTATAACATCCTAATCAATGGATTATGCAAGAATAACAATATTAATGGTGGTTTTAAGCTCTTCAAG
GCCATGCAGCTCAAAGGACGCTTGCCAGATTCGGTTACATACGGGACTCTAATAGATGGGCTCTATAGAGTTGGTAGGGATGAGGATGCACTAGGAATTTTTGAACAAAT
GGTAAAGAATGGGTGCAAGCCTGATTCTTCTATTTACAAGTCCATCATGACTTGGTCGTGTCGAAAAAAGAAGGTTTCACTAGCTTTTAGTGTTTGGATGAAGTATCTGA
GGAATTTTCGTGGCTGGGAAGATGAAAAGGTAGCAGTAGTAGCGGAAAGTTTCAATAAAGGAGAGCTTGAGACAACAATTCAGAGATTAATTGAAATGGACATGATATCA
AAAGATTTCGACTTAGCTCCATACACCATTTTTCTCATTGGATTGTGTCAAGCCGAGAGGGTTTCCGAAGCCTTTGCTATATTTTCTGTTCTCAAGGACTTCAAGATGAA
TGTAAGTTCAGCGAGCTGTGTGATGTTGATTGGCAGTTTGTGCGTGGAGGGAGAACTTGACCTAGCCGTGGATGTTTTTCTTTATACACTAGAAGAAGGCTTTATGTTGA
TGCCTCGAATTTGTAATCAGCTGCTGAGCCGCCTTCTTCATTTGGAGGACAGAAAAGACCATGCTCTTGTTCTTATACATAGAATGGAGGCTTTTGGATATGATATTAAT
ACTCTTCTCCACGACAGTGAACGAGAGAAAACACAAACCAGAGTCAAAACAGCCATAGAGATGGACGGAGATCTGCCGCCGGCACCGGCTCCGTTGTCCATCCACCCGGC
CGTAGCACCGCTATCATTCTTACTCGGAACATGGAGAGGCAAAGGCGACGGGGGATTCCCCACCATTAATTCCTTCTCTTACGGCGAGGAGCTTCACTTCTCCCATTCCG
GCAAGCCGGTGATTTCCTACACTCAAAAGACTTGGAAACTCGATTCTAGAGAGCCAATGCACGCTGAGAGTGGCTATTGGCGTCCCAAGCCCGATGGTACCATTGAGGTA
GTCATCGCTCAAAGTACTGGTATCGTTGAAGTTCAGTCAATTGCAAGGTCAGATTCTGCAGAGTTGCAACCGCGTACCAACTGTTCAATGAAATGCTCAACTCAAATGTT
GTCTCATGGACCTCACTCATGGCGGTTATATCGAGATGGTGGGCTGAGTACAGCTCTTTCACTGTTTGGGGCAATGCCGAGAAGTCCAGTTGTTCTCAACGACTTCACTT
TTGTGAATGCAATCAAGGCCTGTTTGATCCTTTCAAATTTAAGAAATGGTGAAAAGTTTCATGCCTATGTTGAGATTTTGGTTTTGGAGCTAATATTATGGTCTATTCCT
CGCTTATTGATATTGATTTCTGCTTATGCTCAGAATGCACATGGCAACGATGTGTTAACAGCTGGTTTCAGGAAAAGTCGTAGTGCAGTGATCCATCTTGGCTCTGATTC
AAGCCACATAGCTGCAAATGTGCTGGTTAATATGTATGCTAAATGCGGCCGTAACTTCAGAACATCTGCTGATCTTACAGTCGACATTTTTAATTGTGTTACGGATTTGA
TAGCTTGTGATAAAGGAACATATGATGCAGAAGAGAAAGTGATAAAGCTCCAAAGTGAACTTGTGGGGAATGCTTCTAAGGTGAAAGAAATAAGCAGAATATTTAAACTG
GTGGATGGAGAGCTTTCCTACGTGGTTCAGATGGCTACTGGCTTAACCAGTCTACAACCACACTTAAAAGCCTTTCTTACTAAAGTTTGA
mRNA sequence
ATGAAGCTCCGACCAGATTTTCTTCGACCCATAATCGCCTATGTAGTCCCAAAACCTCCATGGTTCCATTTATTTCATTCGCCCACTGACCCAATCGCCACTTCCAATGA
GGTCTCCACCATAATCGAAACTGTTGATCCTTTCGAAGATGGATTGGAAGTCATATCGCCCCATATTTCGTCTGATGTAATTACCTCCGTCATTGAAGATCAATCGAATC
CCCGACTTGGATTTCGACTTTTTATCTGGTCGTTAAGGAGAAAGCGTCTCTGCTGCAATGCCTCGCAGAATTTTATCATCGACAGGTTAGTAAAGGACAATGCCTTCGAA
TTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCAGCAATTGAAATTCCATCGGATGCTTTCTCTGTGTTGATTGAGGCATACTCAAAAGCGGGCATGGATGAGAAGGC
CGTTGAATCATTTGGTCTGATGCGGGATTTTGACTGTAAGCCCAACATTTTTGCTTTCAATTTGATTTTGCATGTTTTGGTGCGAAAAGAAGCATTTTTGTTAGCTTTAG
CAGTGTATAATCAGATGCTGAAATGTAATTTGAATCCGAATGTGGTTACCTACAGCATATTGATTCATGGATTCTGTAAATCTAGTAAAACTCAAGATGCCCTTGTACTT
TTTGATGAAATGACCTATAGAGGAATATTGCCCAACGAGATAACTTATTCGATTGTTCTTTCTGGACTGTGTCAAGCTAAGAAAATTGATGATGCCCAGAGATTGTTCAG
TAAGATGAGAGCGATTGGGTGTAGTCCAGATGTAATAACTTATAATGTTTTGCTTAATGGATTTTGTAAGTTAGGTTATTTGGATGAAGCTTTTTCATTGTTGCAATCAT
TTGAAAAGGATGGCCATATTCTTGGAGTTAATGGGTATAGTTGTTTGATTAATGGCTTGTTTAGGGCTAGGAGATATGACGAAGCACATATGTGGTACCAGAAAATGTTG
AGGGAAAACATCAAGCCCGATGTTATCTTGTATACTATTATGATCCAAGGTTTATCACAAGAAGGTCGGGTTACTGATGCATTGGCACTGTTGGATGAGATGACAGAAAG
AGGGCTTAGTCCAGATACTGCTTGTTACAATGCTTTAATTAAAGGGTTTTGTGATATGGGTCATTTGGATAAGGCTCAGTCCCTTCGACTTGAGATTTCAAAGCACGACT
GTTTCCCTGATAATCACACATACTCCATTCTCATTTGTGGTATGTGCAAGAATGGGCTAATAAGTGAGGCACAACATATATTTAATGAAATGGAGAAGCTTGGATGCGTT
CCTTCTATAGTGACCTTCAATTCTCTCATTGATGGACTTTGCAAAGCCGGTAAGCTTCAGGAAGCTTACCTATTATTTTACAAAATGGAGATAGGAAGAAAACCTTCTTT
GTTTCTTCGGCTTTCTCAGGGCGCCAATAAGGTTCTTGACAATGCCAGTCTCCAAGTTATGATCCAGCAATTATGTGAGTCAGGATTGATTCTTAAGGCCTACAAGCTTC
TTATGCAGCTAGTTGAGATTGGGGTTTTGCCAGATATTAGGACTTATAACATCCTAATCAATGGATTATGCAAGAATAACAATATTAATGGTGGTTTTAAGCTCTTCAAG
GCCATGCAGCTCAAAGGACGCTTGCCAGATTCGGTTACATACGGGACTCTAATAGATGGGCTCTATAGAGTTGGTAGGGATGAGGATGCACTAGGAATTTTTGAACAAAT
GGTAAAGAATGGGTGCAAGCCTGATTCTTCTATTTACAAGTCCATCATGACTTGGTCGTGTCGAAAAAAGAAGGTTTCACTAGCTTTTAGTGTTTGGATGAAGTATCTGA
GGAATTTTCGTGGCTGGGAAGATGAAAAGGTAGCAGTAGTAGCGGAAAGTTTCAATAAAGGAGAGCTTGAGACAACAATTCAGAGATTAATTGAAATGGACATGATATCA
AAAGATTTCGACTTAGCTCCATACACCATTTTTCTCATTGGATTGTGTCAAGCCGAGAGGGTTTCCGAAGCCTTTGCTATATTTTCTGTTCTCAAGGACTTCAAGATGAA
TGTAAGTTCAGCGAGCTGTGTGATGTTGATTGGCAGTTTGTGCGTGGAGGGAGAACTTGACCTAGCCGTGGATGTTTTTCTTTATACACTAGAAGAAGGCTTTATGTTGA
TGCCTCGAATTTGTAATCAGCTGCTGAGCCGCCTTCTTCATTTGGAGGACAGAAAAGACCATGCTCTTGTTCTTATACATAGAATGGAGGCTTTTGGATATGATATTAAT
ACTCTTCTCCACGACAGTGAACGAGAGAAAACACAAACCAGAGTCAAAACAGCCATAGAGATGGACGGAGATCTGCCGCCGGCACCGGCTCCGTTGTCCATCCACCCGGC
CGTAGCACCGCTATCATTCTTACTCGGAACATGGAGAGGCAAAGGCGACGGGGGATTCCCCACCATTAATTCCTTCTCTTACGGCGAGGAGCTTCACTTCTCCCATTCCG
GCAAGCCGGTGATTTCCTACACTCAAAAGACTTGGAAACTCGATTCTAGAGAGCCAATGCACGCTGAGAGTGGCTATTGGCGTCCCAAGCCCGATGGTACCATTGAGGTA
GTCATCGCTCAAAGTACTGGTATCGTTGAAGTTCAGTCAATTGCAAGGTCAGATTCTGCAGAGTTGCAACCGCGTACCAACTGTTCAATGAAATGCTCAACTCAAATGTT
GTCTCATGGACCTCACTCATGGCGGTTATATCGAGATGGTGGGCTGAGTACAGCTCTTTCACTGTTTGGGGCAATGCCGAGAAGTCCAGTTGTTCTCAACGACTTCACTT
TTGTGAATGCAATCAAGGCCTGTTTGATCCTTTCAAATTTAAGAAATGGTGAAAAGTTTCATGCCTATGTTGAGATTTTGGTTTTGGAGCTAATATTATGGTCTATTCCT
CGCTTATTGATATTGATTTCTGCTTATGCTCAGAATGCACATGGCAACGATGTGTTAACAGCTGGTTTCAGGAAAAGTCGTAGTGCAGTGATCCATCTTGGCTCTGATTC
AAGCCACATAGCTGCAAATGTGCTGGTTAATATGTATGCTAAATGCGGCCGTAACTTCAGAACATCTGCTGATCTTACAGTCGACATTTTTAATTGTGTTACGGATTTGA
TAGCTTGTGATAAAGGAACATATGATGCAGAAGAGAAAGTGATAAAGCTCCAAAGTGAACTTGTGGGGAATGCTTCTAAGGTGAAAGAAATAAGCAGAATATTTAAACTG
GTGGATGGAGAGCTTTCCTACGTGGTTCAGATGGCTACTGGCTTAACCAGTCTACAACCACACTTAAAAGCCTTTCTTACTAAAGTTTGACAAAAAAGAAAAAGAAAAAG
AGTTTAACAAATCCTTTTATTTTCTCTTAATGGACTGAAGTTATCTAATGACTTAAACGACTTTATTGGAAGAGGATATGTTCTATAGGTGTACAACTGCGTCCTTGAGT
TCTAAGAAGTTATCAAGAATACTGAAGTCCACAATATAGATATATTGAATTGCAATTAGCATATTTCTATGATTCTGTACAGGGTTGGCAATGGGGCTGGGGGAGTTTCG
GTCCCCACCCTCGTGGGAGAATTTACCAGTTAATCTCCGCAGGAGCAAAGGAGAGCGAGCCAGCGAGAGTGAGTTTAATTAGGGTTTGGTTGGGTTTGAGTGAGCGAGAG
TATGCGAGAGTGAGAGAGAAAGAGAGAGTGATTGTGACTTTGAGTGAGGAAAGGATGCAAGAGCGAGAGCAAGAGAGTCAACAATAACATCGGGTTGGCATCGAGGAATG
GAGTTGGGTTGGGGATACTCTCCTCGTCCTCGAACTAGGATCTCCGGTTAAATTGGGGATTCCCCAACCCGATTTGGTCAGGCCCCTCAGATATTTTTGCCAACCCTAAT
TCTGTAGCTTTTATTCTCCACATCTCTCTCTAGAGTAACATCATCTAATAGTAATTTTTTGGTATGTTTCATAACTGTATGTTAATTTTTTGTTTTTGTGCTTTAAAA
Protein sequence
MKLRPDFLRPIIAYVVPKPPWFHLFHSPTDPIATSNEVSTIIETVDPFEDGLEVISPHISSDVITSVIEDQSNPRLGFRLFIWSLRRKRLCCNASQNFIIDRLVKDNAFE
LYWKTLQELKDSAIEIPSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRKEAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKSSKTQDALVL
FDEMTYRGILPNEITYSIVLSGLCQAKKIDDAQRLFSKMRAIGCSPDVITYNVLLNGFCKLGYLDEAFSLLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYQKML
RENIKPDVILYTIMIQGLSQEGRVTDALALLDEMTERGLSPDTACYNALIKGFCDMGHLDKAQSLRLEISKHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCV
PSIVTFNSLIDGLCKAGKLQEAYLLFYKMEIGRKPSLFLRLSQGANKVLDNASLQVMIQQLCESGLILKAYKLLMQLVEIGVLPDIRTYNILINGLCKNNNINGGFKLFK
AMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRKKKVSLAFSVWMKYLRNFRGWEDEKVAVVAESFNKGELETTIQRLIEMDMIS
KDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNVSSASCVMLIGSLCVEGELDLAVDVFLYTLEEGFMLMPRICNQLLSRLLHLEDRKDHALVLIHRMEAFGYDIN
TLLHDSEREKTQTRVKTAIEMDGDLPPAPAPLSIHPAVAPLSFLLGTWRGKGDGGFPTINSFSYGEELHFSHSGKPVISYTQKTWKLDSREPMHAESGYWRPKPDGTIEV
VIAQSTGIVEVQSIARSDSAELQPRTNCSMKCSTQMLSHGPHSWRLYRDGGLSTALSLFGAMPRSPVVLNDFTFVNAIKACLILSNLRNGEKFHAYVEILVLELILWSIP
RLLILISAYAQNAHGNDVLTAGFRKSRSAVIHLGSDSSHIAANVLVNMYAKCGRNFRTSADLTVDIFNCVTDLIACDKGTYDAEEKVIKLQSELVGNASKVKEISRIFKL
VDGELSYVVQMATGLTSLQPHLKAFLTKV