CuGenDBv2

HG10001607 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10001607
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr09:18624345..18626699
RNA-Seq Expression: HG10001607
Synteny: HG10001607
Gene Ontology terms: GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAG7035334.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 84.69
Query:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII
        MK R  FLRP++TY+VPKPPWFHLFH+ T+PIA+SNEVSTII+TVDP ED LE+IAPH+SSDVITSVI+EQ N +LGFRLFIWSLRR+ LCCSA QNLII
Subjt:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII

Query:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT
        DRLVKDNAFELYW TLQELKDS+ EISSDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR EAFLLALAVYN+MLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT+R +LPNEITYSI+LSGLCQAKKI DAQRLF KMRASGCSPDVI+YNVLLNGFCKLGY DEAFALL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWY+K   +N+EPDVILYTIMIQGL QEGRV +ALALLDEMTERGFSPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ
        NHDCFP+NHTYSILICGMCKNGLI EAQH+FNEMEKLGC PSVVTFNSLIDG CK G+L+EAHLLFYKMEIGRKP LFLRLSQG+NK+L +  LQVM+EQ
Subjt:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ

Query:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PD+RTYNILING CK NNI+G F LFKDMQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS
        SIMTWSCR+KK+S AFS WMKYLRNFRGW+DEKV +V ESFDKG+LE  I R+IEMD+ SKDFDLAPYTIFL+GLCQA RVSEAFAIFSVLKDFK  ISS
Subjt:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS

Query:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH
        ASCVMLIG LC+E KLDLA++VFLYTLE G MLMPRICNQLL HLL LEDRKDHA VLI RMEAFGYDMNA+LH STK LLHDH
Subjt:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH

XP_022948073.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 84.82
Query:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII
        MK R  FLRP++TY+VPKPPWFHLFH+ T+PIATSNEVSTII+TVDP ED LE+IAPH+SSDVITSVI+EQ N +LGFRLFIWSLRR+ LCCSA QNLII
Subjt:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII

Query:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT
        DRLVKDNAFELYW TLQELKDS+ EISSDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNI+A+NLILHVLVR EAFLLALAVYN+MLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT+R +LPNEITYSI+LSGLCQAKKI DAQRLF KMRASGCSPDVI+YNVLLNGFCKLGY DEAFALL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWY+K   +N+EPDVILYTIMIQGL QEGRV +ALALLDEMTERGFSPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ
        NHDCFP+NHTYSILICGMCKNGLI EAQH+FNEMEKLGC PSVVTFNSLIDG CK G+L+EAHLLFYKMEIGRKPSLFLRLSQG+NK+L +  LQVM+EQ
Subjt:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ

Query:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PD+RTYNILING CK NNI+G FKLFKDMQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS
        SIMTWSCR+KK+S  FS WMKYLRNFRGW+DEKV +V ESFDKG+LE  I R+IEMD+ SKDF+LAPYTIFLIGLCQA RVSEAFAIFSVLKDFK  ISS
Subjt:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS

Query:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH
        ASCVMLIG LC+E KLDLA++VFLYTLE G MLMPRICNQLL HLL LEDRKDHA VLI RMEAFGYDMNA+LH STK LLHDH
Subjt:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH

XP_023007126.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 85.46
Query:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII
        MK R  FLRP++TY+VPKPPWFHLFH+PT+PIATSNEVSTII+TVDP ED LE IAPHISSDVITSVI+EQ N +LGFRLFIWSLRR+ LCCSA Q+LII
Subjt:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII

Query:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT
        DRLVKDNAFELYW TLQELKDS+ EISSDAFSVLIEAYSKAGM+EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR EAFLLALAVYN+MLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT+R +LPNEITYSI+LSGLCQAKKI DAQRLF KMRASGCSPDVI+YNVLLNGFCKLGY DEAFALL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWY+K   +N+EPDVILYTIMIQGL QEGRV +ALALLDEMTERGFSPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ
        NHDCFPDNHTYSILICGMCKNGLI EAQH+FNEMEKLGC PSVVTFNSLIDG CK G+L+EAHLLFYKMEIGRKPSLFLRL QG+NKVL +  LQVM+EQ
Subjt:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ

Query:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PD+RTYNILING CK NNI+G FKLFKDMQLKGRLPDS+TYGTLIDGL+RVGRDEDALGIFEQMVKNGCKP+SS+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS
        SIMTWSCR+KK+S AFS WMKYLRNFRGW+DEKV +V ESFDKG+LE  I R+IEMD+ SKDFDLAPYTIFLIGLCQA RVSEAFAIFSVLKDFK  ISS
Subjt:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS

Query:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH
        ASCVMLIG LC+E KLDLA++VFLYTLE G MLMPRICNQLL H L LEDRKDHA VLI RMEAFGYDMNA+LH STK LLHDH
Subjt:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH

XP_023534570.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 85.2
Query:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII
        MK R  FLRP++TY+VPKPPWFHLFH+PT+ IATSNEVSTII+TVDP ED LE+IAPHISSDVITSVI+EQ N +LGFR+FIWSLRR+ LCCSA QNLII
Subjt:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII

Query:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT
        DRLVKDNAFELYW TLQELKDS+ EISSDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR EAFLLALAVYN+MLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT+R +LPNEITYSI+LSGLCQAKKI DAQRLF KMRASGCSPDVI+YNVLLNGFCKLGY DEAFALL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWY+K   +N+EPDVILYTIMIQGL QEGRV +ALALLDEMTERGFSPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ
        NHDCFPDNHTYSILICGMCKNGLI EAQH+FNEMEKLGC PSVVTFNSLIDG CK G+L+EAHLLFYKMEIGRKPSLFLRLSQG+NKVL +  LQVM+EQ
Subjt:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ

Query:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PD+RTYNILING CK NNI+G FKLFKDMQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS
        SIMTWSCR+KK+S AFS WMKYLRNFRGW+DEKV +V ESFDKG+LE  I R+IEMD+ SKDFDLAPYTIFLIGLCQA R SEAFAIFSVLKDFK  ISS
Subjt:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS

Query:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH
        ASCVMLIG LC+E KLDLA++VFLYTLE G MLMPRICNQLL H L LE+RKDHA VLI RMEAFGYDMNAHLH STK LLHDH
Subjt:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH

XP_038901213.1 pentatricopeptide repeat-containing protein At1g79540 [Benincasa hispida] | e value: 0.0e+00 | % identity: 88.8
Query:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII
        MK+RP   RPIITYVVPKPPWF  FHSPT+PIATSNEVSTII+TVD FEDGLEVI+PHISSD+ITSVI+EQ NP+LGFRLFIWSLRRKRLCCSA QNLII
Subjt:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII

Query:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT
        DRLVKDNAFELYW TLQELKDSAIEISSDAFSVLIEAY KAGM+EKAVESFGLMRDFDCKPN+FAFNLILH+LVR EAFLLALAVYN+MLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT

Query:  YSILIHGFCKTSK--TQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDG
        Y ILIHGFCKTSK  TQDAL LFDEMT+RGILPNEITYSIVLSGLC+AKKIHDAQRLFSKMRASG SPDV++YNVLLNGFCKLGYL+EAFALLQSFEKDG
Subjt:  YSILIHGFCKTSK--TQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDG

Query:  HILGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLE
        HILGVNGYSCLINGLFRARRYDEAHMWY+K+L ENI+PDVILYTIMIQGLSQEGRVTDALALL EMTERGFSPDTACYN LIKGFCD+G+LDKAQSLRLE
Subjt:  HILGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLE

Query:  ISNHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMM
        ISNH+CFPDNHTYSILICGMCKNGL+SEAQ +FNEMEKLGC PSVVTFNSLIDGLCK GRL+EAHLLF KMEIGRKPSLFLRLSQG+NKVLD+A LQVMM
Subjt:  ISNHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMM

Query:  EQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSI
        EQLCESGL+LKAYKLLMQLVESGVLPD+RTYNILING CKNNNING FKL K+M+LKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVK GCKPDSSI
Subjt:  EQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSI

Query:  YKSIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNI
        YKSIMTW CRKK +S AF+ WMKYLRNFRGWEDEKV IV ESFDKGEL+TTI RL++MDM+SKDFDLAPYTIFLIGLCQA+RVSEAFAIFSVLKDFKMNI
Subjt:  YKSIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNI

Query:  SSASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH
        SSASCVMLIG+LC+ EKLDLA+DVFLYTLEEG MLMPRICN+LLSHLL +ED+KDHALVL+++MEAFGYDMN HLH STKLLL DH
Subjt:  SSASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KD52 Uncharacterized protein | e value: 0.0e+00 | % identity: 78.7
Query:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII
        MKLRP   RPII +VVPKP  FH +HS TNPIATS EVSTII+T+DP EDGL+VI+  I S  ITSV++EQ + +LGFRLFIWSL+   L C   Q+LII
Subjt:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII

Query:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT
         +L+K+NAFELYW  LQELK+SAI+ISS+AFSVLIEAYS+AGMDEKAVESFGLMRDFDCKP++FAFNLILH LVR EAFLLALAVYN+MLKCNLNP+VVT
Subjt:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI
        Y ILIHG CKT KTQDALVLFDEMT+RGILPN+I YSIVLSGLCQAKKI DAQRLFSKMRASGC+ D+I+YNVLLNGFCK GYLD+AF LLQ   KDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GY CLINGLFRARRY+EAHMWY+KML ENI+PDV+LYTIMI+GLSQEGRVT+AL LL EMTERG  PDT CYNALIKGFCDMG+LD+A+SLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ
         HDCFP+NHTYSILICGMCKNGLI++AQHIF EMEKLGC PSVVTFNSLI+GLCK  RL+EA LLFY+MEI RKPSLFLRLSQG++KV D A LQVMME+
Subjt:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ

Query:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESG+ILKAYKLLMQLV+SGVLPD+RTYNILING CK  NING FKLFK+MQLKG +PDSVTYGTLIDGLYR GR+EDAL IFEQMVK GC P+SS YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS
        +IMTWSCR+  +S A S WMKYLR+FRGWEDEKV +V ESFD  EL+T IRRL+EMD+KSK+FDLAPYTIFLIGL QA+R  EAFAIFSVLKDFKMNISS
Subjt:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS

Query:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH
        ASCVMLIG+LCM E LD+AMDVFL+TLE GF LMP ICNQLL +LL L DRKD AL L +RMEA GYD+ AHLH  TKL LHDH
Subjt:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH

A0A5D3B9M5 Pentatricopeptide repeat-containing protein | e value: 0.0e+00 | % identity: 77.04
Query:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII
        MKLRPN  RPII +VVPKPP F  +HS TNPI TS EVSTII+TVDP EDGL+VI+  I+S +ITSV+ +Q N  LGFRLFIWSL        A ++LII
Subjt:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII

Query:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT
        D+L+KDNAFELYW  LQELK+SAIEISSDAFSVLIEAYS+AGM+EKAVESFGLMRDFDCKPN+FAFNLIL  LVR EAFLLALAVYN+MLKCNLNP+V T
Subjt:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI
        Y ILIHGFC+T KTQDALVLFDEMT RGILPN+I Y+IVLSGLC+AKKI DAQRLFS M A     D+ +YNVLLNGFCKLGYLDEAF LLQ   KDGH 
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        L V+GY CLINGLFRARRY+EAH WY+KML ENI+PDVILYTIMIQGLSQEGRVT+A+ LL EM ERG  PDT CYNALIKGFCD+G+LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ
        NH CFP NHTYSILICGMCK+GLI+EAQHIF EMEKLGC PSVVTFNSLI+GLCK  RL+EA LLFY+MEI RKPSLFLRLSQG++KVLD A LQVMMEQ
Subjt:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ

Query:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLILKAYKLLMQLV+SGVLPD+RTYNILING CK  NING FKLFK+MQ +G +PDSVTYGTLIDGLYRVGR+EDALGIF QM K GC PDSS Y+
Subjt:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS
        +IMTW CR+K +    S WMKYLRNFRGWEDEKV +V ESFD  EL+T IRRL+EMD+KSK+FD+APYTIFLIGLC+A+RVSEAFAIFSV KDFKMNISS
Subjt:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS

Query:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH
        ASCV LI  LC  EKL+LA+DVFL+TLE  F +MP ICN+LL HLL L DRKD AL L +R+EA GYD+ AHL+  TKLLLHDH
Subjt:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH

A0A6J1D6A9 pentatricopeptide repeat-containing protein At1g79540 isoform X1 | e value: 0.0e+00 | % identity: 83.16
Query:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII
        MK RP F+RPII  +VPKPPWFHL+HSPT+PIATSNEV TI++TV+PFED LE IAPH+S DVITSVIEEQ NP+LGFRLFIWSL+ KRLCCSA QNLII
Subjt:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII

Query:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT
        DRLV+DNAFELYW TLQELKDSA+ I SDAFSVLIEAYS AGMDEKAVESFGLM+DFDCKPNIF +NLIL+VLVR EAF LAL+VYN+ML+CN  PNVVT
Subjt:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI
        YSILIHG CKTSKTQDALVLFDEM NRGI PNEITYSIVLSGLCQA KI DAQRLF KMRASGCSPD I+YNVLLNGFCK GY DEAFALLQ+FEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGVN YSCLI+GLFRARRYDEA  WY+KML ENI+PDVILYTIMIQGLSQEG++ DALALL EMTERGFSPDT CYNALIKGFCDM  LDKA+SLRL IS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ
        NHDC PDNHTYSILICGMC+NGLI EAQ++FNEMEKLGC PSV TFNSLIDGLCK GR+ EA LLFYKMEIGRKPS+FLRL+QG NKVLD+AGLQVM+EQ
Subjt:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ

Query:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESG+ILKAYKLLMQL ESGVLPD+RTYNILING CK N ING FKLFKDMQLKGRLPDSVTYGTLI+GL+RVGRD+DAL +F+QMVK GCKPDSS+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS
        +IMTWSCRKK +S AFS WMKYL NFRGW+DE V +V  SFDKGELE  I+RLIEMD KSKDFD +PYTIFLIGLCQA+RVSEAFAIFSVLKDFKMN + 
Subjt:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS

Query:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH
        ASCVMLIG LC+EEKLDLA+DVFLYTLE GF+LMPRICNQLL HLL  EDRKDHALVLI RME FGYDM+A+LH STK LLHDH
Subjt:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH

A0A6J1G8C6 pentatricopeptide repeat-containing protein At1g79540 | e value: 0.0e+00 | % identity: 84.82
Query:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII
        MK R  FLRP++TY+VPKPPWFHLFH+ T+PIATSNEVSTII+TVDP ED LE+IAPH+SSDVITSVI+EQ N +LGFRLFIWSLRR+ LCCSA QNLII
Subjt:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII

Query:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT
        DRLVKDNAFELYW TLQELKDS+ EISSDAFSVLIEAYSKAGM EKAV+SFG+M+DF+CKPNI+A+NLILHVLVR EAFLLALAVYN+MLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT+R +LPNEITYSI+LSGLCQAKKI DAQRLF KMRASGCSPDVI+YNVLLNGFCKLGY DEAFALL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWY+K   +N+EPDVILYTIMIQGL QEGRV +ALALLDEMTERGFSPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ
        NHDCFP+NHTYSILICGMCKNGLI EAQH+FNEMEKLGC PSVVTFNSLIDG CK G+L+EAHLLFYKMEIGRKPSLFLRLSQG+NK+L +  LQVM+EQ
Subjt:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ

Query:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PD+RTYNILING CK NNI+G FKLFKDMQLKGRLPDSVTYGTLIDGL+RVGRDEDALGIFEQMVK+GCKP+ S+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS
        SIMTWSCR+KK+S  FS WMKYLRNFRGW+DEKV +V ESFDKG+LE  I R+IEMD+ SKDF+LAPYTIFLIGLCQA RVSEAFAIFSVLKDFK  ISS
Subjt:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS

Query:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH
        ASCVMLIG LC+E KLDLA++VFLYTLE G MLMPRICNQLL HLL LEDRKDHA VLI RMEAFGYDMNA+LH STK LLHDH
Subjt:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH

A0A6J1KZN2 pentatricopeptide repeat-containing protein At1g79540 | e value: 0.0e+00 | % identity: 85.46
Query:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII
        MK R  FLRP++TY+VPKPPWFHLFH+PT+PIATSNEVSTII+TVDP ED LE IAPHISSDVITSVI+EQ N +LGFRLFIWSLRR+ LCCSA Q+LII
Subjt:  MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLII

Query:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT
        DRLVKDNAFELYW TLQELKDS+ EISSDAFSVLIEAYSKAGM+EKAV+SFG+M+DF+CKPNIFA+NLILHVLVR EAFLLALAVYN+MLKCNLNPNVVT
Subjt:  DRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVT

Query:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI
        YSILIHGFCKTSKTQ+ALVLFDEMT+R +LPNEITYSI+LSGLCQAKKI DAQRLF KMRASGCSPDVI+YNVLLNGFCKLGY DEAFALL+SFEKDGHI
Subjt:  YSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHI

Query:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS
        LGV GYSCLI+GLFRARRYDEAHMWY+K   +N+EPDVILYTIMIQGL QEGRV +ALALLDEMTERGFSPDT CYNA+I+GFCDMG LDKAQSLRLEIS
Subjt:  LGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEIS

Query:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ
        NHDCFPDNHTYSILICGMCKNGLI EAQH+FNEMEKLGC PSVVTFNSLIDG CK G+L+EAHLLFYKMEIGRKPSLFLRL QG+NKVL +  LQVM+EQ
Subjt:  NHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ

Query:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK
        LCESGLI KAYKLLMQLVESGV PD+RTYNILING CK NNI+G FKLFKDMQLKGRLPDS+TYGTLIDGL+RVGRDEDALGIFEQMVKNGCKP+SS+YK
Subjt:  LCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYK

Query:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS
        SIMTWSCR+KK+S AFS WMKYLRNFRGW+DEKV +V ESFDKG+LE  I R+IEMD+ SKDFDLAPYTIFLIGLCQA RVSEAFAIFSVLKDFK  ISS
Subjt:  SIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISS

Query:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH
        ASCVMLIG LC+E KLDLA++VFLYTLE G MLMPRICNQLL H L LEDRKDHA VLI RMEAFGYDMNA+LH STK LLHDH
Subjt:  ASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHLHDSTKLLLHDH

SwissProt top hits | e value | % identity | Alignment
Q9CAN0 Pentatricopeptide repeat-containing protein At1g63130, mitochondrial | e value: 2.4e-73 | % identity: 31.3
Query:  NTLQELK-DSAIEISSD-----------AFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTY
        N L +LK D A+ +  D            FS L+ A +K    +  +     M++     N++ ++++++   R     LALAV  +M+K    P++VT 
Subjt:  NTLQELK-DSAIEISSD-----------AFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTY

Query:  SILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHIL
        + L++GFC  ++  DA+ L  +M   G  P+  T++ ++ GL +  +  +A  L  +M   GC PD+++Y +++NG CK G +D A +LL+  E+     
Subjt:  SILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHIL

Query:  GVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISN
        GV  Y+ +I+ L   +  ++A   + +M  + I P+V+ Y  +I+ L   GR +DA  LL +M ER  +P+   ++ALI  F   G L +A+ L  E+  
Subjt:  GVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISN

Query:  HDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSL------FLRLSQGSN-----
            PD  TYS LI G C +  + EA+H+F  M    CFP+VVT+N+LI G CK  R+ E   LF +M     +G   +       F +  +  N     
Subjt:  HDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSL------FLRLSQGSN-----

Query:  KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDED
        K + S G+        ++++ LC +G +  A  +   L  S + PD+ TYNI+I G+CK   +  G+ LF  + LKG  P+ VTY T++ G  R G  E+
Subjt:  KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDED

Query:  ALGIFEQMVKNGCKPDSSIYKSIM
        A  +F +M + G  PDS  Y +++
Subjt:  ALGIFEQMVKNGCKPDSSIYKSIM

Q9LQ16 Pentatricopeptide repeat-containing protein At1g62910 | e value: 1.6e-74 | % identity: 30.94
Query:  QNLIIDRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLN
        +N + D +  D+A +L+ + ++     +I      F+ L+ A +K    E  +     M+      +++ +++ ++   R     LALAV  +M+K    
Subjt:  QNLIIDRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLN

Query:  PNVVTYSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFE
        P++VT S L++G+C + +  DA+ L D+M   G  P+  T++ ++ GL    K  +A  L  +M   GC PD+++Y  ++NG CK G +D A +LL+  E
Subjt:  PNVVTYSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFE

Query:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSL
        K      V  Y+ +I+GL + +  D+A   + +M  + I PDV  Y+ +I  L   GR +DA  LL +M ER  +P+   ++ALI  F   G L +A+ L
Subjt:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSL

Query:  RLEISNHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSL------FLRLSQGS
          E+      PD  TYS LI G C +  + EA+H+F  M    CFP+VVT+++LI G CK  R++E   LF +M     +G   +       F +     
Subjt:  RLEISNHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSL------FLRLSQGS

Query:  N-----KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYR
        N     K + S G+        ++++ LC++G + KA  +   L  S + PD+ TYNI+I G+CK   +  G++LF ++ LKG  P+ + Y T+I G  R
Subjt:  N-----KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYR

Query:  VGRDEDALGIFEQMVKNGCKPDSSIYKSIM
         G  E+A  + ++M ++G  P+S  Y +++
Subjt:  VGRDEDALGIFEQMVKNGCKPDSSIYKSIM

Q9SAJ5 Pentatricopeptide repeat-containing protein At1g79540 | e value: 2.7e-223 | % identity: 50.85
Query:  FLRPIITYVVPKPPWFHLFHSPTN-PIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLIIDRLVK
        F R +I +   KP W    +S  N     S EV +I+    P E  LE + P +S ++ITSVI+++ N QLGFR FIW+ RR+RL       L+ID L +
Subjt:  FLRPIITYVVPKPPWFHLFHSPTN-PIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLIIDRLVK

Query:  DNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEA-FLLALAVYNRMLKCNLNPNVVTYSIL
        DN  +LYW TL+ELK   + + S  F VLI AY+K GM EKAVESFG M++FDC+P++F +N+IL V++R E  F+LA AVYN MLKCN +PN+ T+ IL
Subjt:  DNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEA-FLLALAVYNRMLKCNLNPNVVTYSIL

Query:  IHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHILGVN
        + G  K  +T DA  +FD+MT RGI PN +TY+I++SGLCQ     DA++LF +M+ SG  PD +++N LL+GFCKLG + EAF LL+ FEKDG +LG+ 
Subjt:  IHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHILGVN

Query:  GYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISNHDC
        GYS LI+GLFRARRY +A   Y  ML +NI+PD+ILYTI+IQGLS+ G++ DAL LL  M  +G SPDT CYNA+IK  C  G L++ +SL+LE+S  + 
Subjt:  GYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISNHDC

Query:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQLCES
        FPD  T++ILIC MC+NGL+ EA+ IF E+EK GC PSV TFN+LIDGLCK G L+EA LL +KME+GR  SLFLRLS   N+  D+         + ES
Subjt:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQLCES

Query:  GLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT
        G ILKAY+ L    ++G  PD+ +YN+LING C+  +I+G  KL   +QLKG  PDSVTY TLI+GL+RVGR+E+A  +F    K+  +   ++Y+S+MT
Subjt:  GLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT

Query:  WSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISSASCV
        WSCRK+K+  AF+ WMKYL+     +DE    + + F +GE E  +RRLIE+D +  +  L PYTI+LIGLCQ+ R  EA  +FSVL++ K+ ++  SCV
Subjt:  WSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISSASCV

Query:  MLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHL
         LI  LC  E+LD A++VFLYTL+  F LMPR+CN LLS LL   ++ +    L +RME  GY++++ L
Subjt:  MLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHL

Q9SH26 Pentatricopeptide repeat-containing protein At1g63400    6.8e-73    32.92
Query:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTYSILIHGFCKTSKTQDALVLFDEMTNRGIL
        F+ L+ A +K    +  +     M+      N++ +N++++   R     LALA+  +M+K    P++VT S L++G+C   +  DA+ L D+M   G  
Subjt:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTYSILIHGFCKTSKTQDALVLFDEMTNRGIL

Query:  PNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYKKML
        P+ IT++ ++ GL    K  +A  L  +M   GC P++++Y V++NG CK G +D AF LL   E       V  YS +I+ L + R  D+A   + +M 
Subjt:  PNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYKKML

Query:  GENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLISEAQHI
         + + P+VI Y+ +I  L    R +DA  LL +M ER  +P+   +NALI  F   G L +A+ L  E+      PD  TYS LI G C +  + EA+H+
Subjt:  GENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLISEAQHI

Query:  FNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ-------------------LCESGLI
        F  M    CFP+VVT+N+LI+G CK  R+ E   LF +M     +G   + +  L  G  +  D    Q++ +Q                   LC++G +
Subjt:  FNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ-------------------LCESGLI

Query:  LKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDS
         KA  +   L  S + P + TYNI+I G+CK   +  G+ LF  + LKG  PD + Y T+I G  R G  E+A  +F +M ++G  PDS
Subjt:  LKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDS

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial    3.3e-75    32.82
Query:  NTLQELK-DSAIEISSD-----------AFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTY
        N L ELK D A+ +  +            FS L+ A +K    +  +     M++     N + ++++++   R     LALAV  +M+K    PN+VT 
Subjt:  NTLQELK-DSAIEISSD-----------AFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTY

Query:  SILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHIL
        S L++G+C + +  +A+ L D+M   G  PN +T++ ++ GL    K  +A  L  +M A GC PD+++Y V++NG CK G  D AF LL   E+     
Subjt:  SILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHIL

Query:  GVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISN
        GV  Y+ +I+GL + +  D+A   +K+M  + I P+V+ Y+ +I  L   GR +DA  LL +M ER  +PD   ++ALI  F   G L +A+ L  E+  
Subjt:  GVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISN

Query:  HDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRK-------PSLF----LRLSQGSN
            P   TYS LI G C +  + EA+ +F  M    CFP VVT+N+LI G CK  R++E   +F +M     +G           LF      ++Q   
Subjt:  HDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRK-------PSLF----LRLSQGSN

Query:  KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDED
        K + S G+         +++ LC++G + KA  +   L  S + P + TYNI+I G+CK   +  G+ LF ++ LKG  PD V Y T+I G  R G  E+
Subjt:  KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDED

Query:  ALGIFEQMVKNGCKPDSSIYKSIM
        A  +F++M ++G  P+S  Y +++
Subjt:  ALGIFEQMVKNGCKPDSSIYKSIM

Arabidopsis top hits    e value    % identity    Alignment
AT1G62670.1 rna processing factor 2    2.3e-76    32.82
Query:  NTLQELK-DSAIEISSD-----------AFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTY
        N L ELK D A+ +  +            FS L+ A +K    +  +     M++     N + ++++++   R     LALAV  +M+K    PN+VT 
Subjt:  NTLQELK-DSAIEISSD-----------AFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTY

Query:  SILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHIL
        S L++G+C + +  +A+ L D+M   G  PN +T++ ++ GL    K  +A  L  +M A GC PD+++Y V++NG CK G  D AF LL   E+     
Subjt:  SILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHIL

Query:  GVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISN
        GV  Y+ +I+GL + +  D+A   +K+M  + I P+V+ Y+ +I  L   GR +DA  LL +M ER  +PD   ++ALI  F   G L +A+ L  E+  
Subjt:  GVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISN

Query:  HDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRK-------PSLF----LRLSQGSN
            P   TYS LI G C +  + EA+ +F  M    CFP VVT+N+LI G CK  R++E   +F +M     +G           LF      ++Q   
Subjt:  HDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRK-------PSLF----LRLSQGSN

Query:  KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDED
        K + S G+         +++ LC++G + KA  +   L  S + P + TYNI+I G+CK   +  G+ LF ++ LKG  PD V Y T+I G  R G  E+
Subjt:  KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDED

Query:  ALGIFEQMVKNGCKPDSSIYKSIM
        A  +F++M ++G  P+S  Y +++
Subjt:  ALGIFEQMVKNGCKPDSSIYKSIM

AT1G62910.1 Pentatricopeptide repeat (PPR) superfamily protein    1.2e-75    30.94
Query:  QNLIIDRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLN
        +N + D +  D+A +L+ + ++     +I      F+ L+ A +K    E  +     M+      +++ +++ ++   R     LALAV  +M+K    
Subjt:  QNLIIDRLVKDNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLN

Query:  PNVVTYSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFE
        P++VT S L++G+C + +  DA+ L D+M   G  P+  T++ ++ GL    K  +A  L  +M   GC PD+++Y  ++NG CK G +D A +LL+  E
Subjt:  PNVVTYSILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFE

Query:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSL
        K      V  Y+ +I+GL + +  D+A   + +M  + I PDV  Y+ +I  L   GR +DA  LL +M ER  +P+   ++ALI  F   G L +A+ L
Subjt:  KDGHILGVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSL

Query:  RLEISNHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSL------FLRLSQGS
          E+      PD  TYS LI G C +  + EA+H+F  M    CFP+VVT+++LI G CK  R++E   LF +M     +G   +       F +     
Subjt:  RLEISNHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSL------FLRLSQGS

Query:  N-----KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYR
        N     K + S G+        ++++ LC++G + KA  +   L  S + PD+ TYNI+I G+CK   +  G++LF ++ LKG  P+ + Y T+I G  R
Subjt:  N-----KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYR

Query:  VGRDEDALGIFEQMVKNGCKPDSSIYKSIM
         G  E+A  + ++M ++G  P+S  Y +++
Subjt:  VGRDEDALGIFEQMVKNGCKPDSSIYKSIM

AT1G63130.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.7e-74    31.3
Query:  NTLQELK-DSAIEISSD-----------AFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTY
        N L +LK D A+ +  D            FS L+ A +K    +  +     M++     N++ ++++++   R     LALAV  +M+K    P++VT 
Subjt:  NTLQELK-DSAIEISSD-----------AFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTY

Query:  SILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHIL
        + L++GFC  ++  DA+ L  +M   G  P+  T++ ++ GL +  +  +A  L  +M   GC PD+++Y +++NG CK G +D A +LL+  E+     
Subjt:  SILIHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHIL

Query:  GVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISN
        GV  Y+ +I+ L   +  ++A   + +M  + I P+V+ Y  +I+ L   GR +DA  LL +M ER  +P+   ++ALI  F   G L +A+ L  E+  
Subjt:  GVNGYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISN

Query:  HDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSL------FLRLSQGSN-----
            PD  TYS LI G C +  + EA+H+F  M    CFP+VVT+N+LI G CK  R+ E   LF +M     +G   +       F +  +  N     
Subjt:  HDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSL------FLRLSQGSN-----

Query:  KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDED
        K + S G+        ++++ LC +G +  A  +   L  S + PD+ TYNI+I G+CK   +  G+ LF  + LKG  P+ VTY T++ G  R G  E+
Subjt:  KVLDSAGL-------QVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDED

Query:  ALGIFEQMVKNGCKPDSSIYKSIM
        A  +F +M + G  PDS  Y +++
Subjt:  ALGIFEQMVKNGCKPDSSIYKSIM

AT1G63400.1 Pentatricopeptide repeat (PPR) superfamily protein    4.9e-74    32.92
Query:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTYSILIHGFCKTSKTQDALVLFDEMTNRGIL
        F+ L+ A +K    +  +     M+      N++ +N++++   R     LALA+  +M+K    P++VT S L++G+C   +  DA+ L D+M   G  
Subjt:  FSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTYSILIHGFCKTSKTQDALVLFDEMTNRGIL

Query:  PNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYKKML
        P+ IT++ ++ GL    K  +A  L  +M   GC P++++Y V++NG CK G +D AF LL   E       V  YS +I+ L + R  D+A   + +M 
Subjt:  PNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYKKML

Query:  GENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLISEAQHI
         + + P+VI Y+ +I  L    R +DA  LL +M ER  +P+   +NALI  F   G L +A+ L  E+      PD  TYS LI G C +  + EA+H+
Subjt:  GENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLISEAQHI

Query:  FNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ-------------------LCESGLI
        F  M    CFP+VVT+N+LI+G CK  R+ E   LF +M     +G   + +  L  G  +  D    Q++ +Q                   LC++G +
Subjt:  FNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKME----IGRKPSLFLRLSQGSNKVLDSAGLQVMMEQ-------------------LCESGLI

Query:  LKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDS
         KA  +   L  S + P + TYNI+I G+CK   +  G+ LF  + LKG  PD + Y T+I G  R G  E+A  +F +M ++G  PDS
Subjt:  LKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDS

AT1G79540.1 Pentatricopeptide repeat (PPR) superfamily protein    2.0e-224    50.85
Query:  FLRPIITYVVPKPPWFHLFHSPTN-PIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLIIDRLVK
        F R +I +   KP W    +S  N     S EV +I+    P E  LE + P +S ++ITSVI+++ N QLGFR FIW+ RR+RL       L+ID L +
Subjt:  FLRPIITYVVPKPPWFHLFHSPTN-PIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLIIDRLVK

Query:  DNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEA-FLLALAVYNRMLKCNLNPNVVTYSIL
        DN  +LYW TL+ELK   + + S  F VLI AY+K GM EKAVESFG M++FDC+P++F +N+IL V++R E  F+LA AVYN MLKCN +PN+ T+ IL
Subjt:  DNAFELYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEA-FLLALAVYNRMLKCNLNPNVVTYSIL

Query:  IHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHILGVN
        + G  K  +T DA  +FD+MT RGI PN +TY+I++SGLCQ     DA++LF +M+ SG  PD +++N LL+GFCKLG + EAF LL+ FEKDG +LG+ 
Subjt:  IHGFCKTSKTQDALVLFDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHILGVN

Query:  GYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISNHDC
        GYS LI+GLFRARRY +A   Y  ML +NI+PD+ILYTI+IQGLS+ G++ DAL LL  M  +G SPDT CYNA+IK  C  G L++ +SL+LE+S  + 
Subjt:  GYSCLINGLFRARRYDEAHMWYKKMLGENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISNHDC

Query:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQLCES
        FPD  T++ILIC MC+NGL+ EA+ IF E+EK GC PSV TFN+LIDGLCK G L+EA LL +KME+GR  SLFLRLS   N+  D+         + ES
Subjt:  FPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCFPSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQLCES

Query:  GLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT
        G ILKAY+ L    ++G  PD+ +YN+LING C+  +I+G  KL   +QLKG  PDSVTY TLI+GL+RVGR+E+A  +F    K+  +   ++Y+S+MT
Subjt:  GLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFKDMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMT

Query:  WSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISSASCV
        WSCRK+K+  AF+ WMKYL+     +DE    + + F +GE E  +RRLIE+D +  +  L PYTI+LIGLCQ+ R  EA  +FSVL++ K+ ++  SCV
Subjt:  WSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKSKDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISSASCV

Query:  MLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHL
         LI  LC  E+LD A++VFLYTL+  F LMPR+CN LLS LL   ++ +    L +RME  GY++++ L
Subjt:  MLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMNAHL


Sequences
CDS sequence
ATGAAGCTCCGGCCAAATTTTCTTCGACCCATAATCACCTATGTAGTCCCAAAACCTCCATGGTTCCACTTATTTCATTCGCCGACTAACCCAATCGCCACTTCCAATGA
GGTTTCCACCATAATCAAAACTGTTGACCCTTTCGAAGATGGATTGGAAGTCATAGCGCCCCATATTTCGTCTGATGTAATTACCTCCGTCATTGAAGAACAATCGAATC
CCCAACTTGGATTTCGACTTTTTATCTGGTCGTTAAGGAGAAAGCGCCTGTGCTGCAGCGCCTTTCAGAATCTGATCATCGACAGGTTAGTAAAGGACAATGCCTTCGAA
TTATATTGGAACACTCTTCAAGAGCTAAAGGATTCAGCAATTGAAATTTCATCGGATGCTTTCTCTGTGTTGATTGAGGCATACTCAAAAGCGGGCATGGACGAGAAGGC
CGTTGAATCATTTGGTTTGATGCGGGATTTTGACTGTAAGCCCAACATTTTTGCTTTTAATTTGATTTTGCATGTTTTGGTGCGAAACGAAGCATTTCTGTTAGCTTTAG
CTGTGTATAATCGGATGCTGAAGTGTAATTTGAATCCGAATGTGGTTACCTACAGCATATTGATACATGGATTCTGTAAAACTAGTAAAACTCAAGATGCCCTTGTACTT
TTTGATGAAATGACCAATAGAGGAATATTGCCCAACGAGATAACTTATTCGATTGTTCTTTCTGGATTGTGTCAAGCTAAGAAAATTCATGATGCACAGAGATTGTTCAG
TAAGATGAGAGCTAGTGGGTGTAGTCCAGATGTAATAAGTTATAATGTTTTGCTTAATGGATTTTGTAAGTTAGGTTATTTGGATGAAGCTTTTGCATTGTTGCAATCAT
TTGAAAAGGATGGCCATATTCTTGGAGTCAATGGGTATAGTTGTTTAATTAATGGCTTGTTTAGGGCTAGGAGATATGATGAAGCACATATGTGGTACAAAAAAATGTTG
GGGGAAAACATCGAGCCCGATGTTATCTTGTATACTATTATGATCCAAGGTTTATCACAAGAAGGTCGGGTTACTGATGCATTGGCACTGTTGGATGAGATGACAGAAAG
AGGGTTTAGTCCAGATACTGCTTGTTACAATGCTTTAATTAAAGGGTTTTGTGATATGGGTCATTTGGATAAGGCTCAGTCTCTTAGACTCGAGATTTCAAACCACGACT
GTTTCCCTGATAATCACACATACTCCATTCTCATTTGTGGTATGTGCAAGAATGGGCTAATAAGTGAGGCACAACATATATTCAATGAAATGGAGAAGCTTGGATGCTTT
CCTTCTGTTGTGACCTTCAACTCTCTCATTGATGGACTTTGCAAAGTTGGTAGGCTTCAGGAAGCTCACCTATTATTTTACAAAATGGAGATAGGAAGAAAACCTTCTTT
GTTTCTTCGTCTTTCTCAGGGCTCCAATAAGGTTCTTGATAGTGCCGGTCTCCAAGTTATGATGGAGCAATTATGTGAGTCAGGATTGATTCTTAAGGCCTACAAGCTTC
TTATGCAGCTAGTTGAGAGTGGGGTTTTGCCAGATGTTAGGACTTATAACATCCTAATCAATGGATTATGCAAGAATAACAATATTAATGGTGGTTTCAAGCTCTTCAAG
GACATGCAGCTCAAAGGACGCTTGCCAGATTCGGTTACATACGGGACTCTAATAGATGGGCTTTATAGAGTTGGTAGGGATGAGGATGCACTAGGGATTTTTGAACAAAT
GGTAAAGAATGGGTGCAAGCCTGATTCTTCTATTTACAAGTCCATCATGACTTGGTCGTGTCGGAAAAAGAAGCTTTCACAAGCTTTTAGTTTCTGGATGAAGTATTTGA
GGAATTTCCGTGGCTGGGAAGACGAAAAGGTCGCAATAGTAGGGGAAAGCTTTGATAAAGGAGAGCTTGAGACAACAATCCGGAGATTAATCGAAATGGACATGAAATCA
AAAGATTTTGACTTAGCTCCATACACCATTTTTCTCATTGGATTGTGTCAAGCCGAGAGGGTTTCTGAAGCTTTTGCTATCTTTTCTGTTCTCAAGGACTTCAAAATGAA
TATAAGTTCAGCGAGTTGTGTGATGTTGATTGGCAAGTTGTGCATGGAAGAAAAACTCGACCTGGCCATGGATGTTTTTCTTTATACACTAGAAGAAGGCTTTATGTTGA
TGCCTCGAATTTGTAATCAGCTGCTGAGCCATCTTCTTCGTTTGGAGGACAGAAAAGACCATGCTCTTGTTCTTATACATAGAATGGAGGCTTTTGGATATGATATGAAT
GCTCATCTCCACGACAGTACTAAGTTGCTTCTTCATGATCATTGA
mRNA sequence
ATGAAGCTCCGGCCAAATTTTCTTCGACCCATAATCACCTATGTAGTCCCAAAACCTCCATGGTTCCACTTATTTCATTCGCCGACTAACCCAATCGCCACTTCCAATGA
GGTTTCCACCATAATCAAAACTGTTGACCCTTTCGAAGATGGATTGGAAGTCATAGCGCCCCATATTTCGTCTGATGTAATTACCTCCGTCATTGAAGAACAATCGAATC
CCCAACTTGGATTTCGACTTTTTATCTGGTCGTTAAGGAGAAAGCGCCTGTGCTGCAGCGCCTTTCAGAATCTGATCATCGACAGGTTAGTAAAGGACAATGCCTTCGAA
TTATATTGGAACACTCTTCAAGAGCTAAAGGATTCAGCAATTGAAATTTCATCGGATGCTTTCTCTGTGTTGATTGAGGCATACTCAAAAGCGGGCATGGACGAGAAGGC
CGTTGAATCATTTGGTTTGATGCGGGATTTTGACTGTAAGCCCAACATTTTTGCTTTTAATTTGATTTTGCATGTTTTGGTGCGAAACGAAGCATTTCTGTTAGCTTTAG
CTGTGTATAATCGGATGCTGAAGTGTAATTTGAATCCGAATGTGGTTACCTACAGCATATTGATACATGGATTCTGTAAAACTAGTAAAACTCAAGATGCCCTTGTACTT
TTTGATGAAATGACCAATAGAGGAATATTGCCCAACGAGATAACTTATTCGATTGTTCTTTCTGGATTGTGTCAAGCTAAGAAAATTCATGATGCACAGAGATTGTTCAG
TAAGATGAGAGCTAGTGGGTGTAGTCCAGATGTAATAAGTTATAATGTTTTGCTTAATGGATTTTGTAAGTTAGGTTATTTGGATGAAGCTTTTGCATTGTTGCAATCAT
TTGAAAAGGATGGCCATATTCTTGGAGTCAATGGGTATAGTTGTTTAATTAATGGCTTGTTTAGGGCTAGGAGATATGATGAAGCACATATGTGGTACAAAAAAATGTTG
GGGGAAAACATCGAGCCCGATGTTATCTTGTATACTATTATGATCCAAGGTTTATCACAAGAAGGTCGGGTTACTGATGCATTGGCACTGTTGGATGAGATGACAGAAAG
AGGGTTTAGTCCAGATACTGCTTGTTACAATGCTTTAATTAAAGGGTTTTGTGATATGGGTCATTTGGATAAGGCTCAGTCTCTTAGACTCGAGATTTCAAACCACGACT
GTTTCCCTGATAATCACACATACTCCATTCTCATTTGTGGTATGTGCAAGAATGGGCTAATAAGTGAGGCACAACATATATTCAATGAAATGGAGAAGCTTGGATGCTTT
CCTTCTGTTGTGACCTTCAACTCTCTCATTGATGGACTTTGCAAAGTTGGTAGGCTTCAGGAAGCTCACCTATTATTTTACAAAATGGAGATAGGAAGAAAACCTTCTTT
GTTTCTTCGTCTTTCTCAGGGCTCCAATAAGGTTCTTGATAGTGCCGGTCTCCAAGTTATGATGGAGCAATTATGTGAGTCAGGATTGATTCTTAAGGCCTACAAGCTTC
TTATGCAGCTAGTTGAGAGTGGGGTTTTGCCAGATGTTAGGACTTATAACATCCTAATCAATGGATTATGCAAGAATAACAATATTAATGGTGGTTTCAAGCTCTTCAAG
GACATGCAGCTCAAAGGACGCTTGCCAGATTCGGTTACATACGGGACTCTAATAGATGGGCTTTATAGAGTTGGTAGGGATGAGGATGCACTAGGGATTTTTGAACAAAT
GGTAAAGAATGGGTGCAAGCCTGATTCTTCTATTTACAAGTCCATCATGACTTGGTCGTGTCGGAAAAAGAAGCTTTCACAAGCTTTTAGTTTCTGGATGAAGTATTTGA
GGAATTTCCGTGGCTGGGAAGACGAAAAGGTCGCAATAGTAGGGGAAAGCTTTGATAAAGGAGAGCTTGAGACAACAATCCGGAGATTAATCGAAATGGACATGAAATCA
AAAGATTTTGACTTAGCTCCATACACCATTTTTCTCATTGGATTGTGTCAAGCCGAGAGGGTTTCTGAAGCTTTTGCTATCTTTTCTGTTCTCAAGGACTTCAAAATGAA
TATAAGTTCAGCGAGTTGTGTGATGTTGATTGGCAAGTTGTGCATGGAAGAAAAACTCGACCTGGCCATGGATGTTTTTCTTTATACACTAGAAGAAGGCTTTATGTTGA
TGCCTCGAATTTGTAATCAGCTGCTGAGCCATCTTCTTCGTTTGGAGGACAGAAAAGACCATGCTCTTGTTCTTATACATAGAATGGAGGCTTTTGGATATGATATGAAT
GCTCATCTCCACGACAGTACTAAGTTGCTTCTTCATGATCATTGA
Protein sequence
MKLRPNFLRPIITYVVPKPPWFHLFHSPTNPIATSNEVSTIIKTVDPFEDGLEVIAPHISSDVITSVIEEQSNPQLGFRLFIWSLRRKRLCCSAFQNLIIDRLVKDNAFE
LYWNTLQELKDSAIEISSDAFSVLIEAYSKAGMDEKAVESFGLMRDFDCKPNIFAFNLILHVLVRNEAFLLALAVYNRMLKCNLNPNVVTYSILIHGFCKTSKTQDALVL
FDEMTNRGILPNEITYSIVLSGLCQAKKIHDAQRLFSKMRASGCSPDVISYNVLLNGFCKLGYLDEAFALLQSFEKDGHILGVNGYSCLINGLFRARRYDEAHMWYKKML
GENIEPDVILYTIMIQGLSQEGRVTDALALLDEMTERGFSPDTACYNALIKGFCDMGHLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLISEAQHIFNEMEKLGCF
PSVVTFNSLIDGLCKVGRLQEAHLLFYKMEIGRKPSLFLRLSQGSNKVLDSAGLQVMMEQLCESGLILKAYKLLMQLVESGVLPDVRTYNILINGLCKNNNINGGFKLFK
DMQLKGRLPDSVTYGTLIDGLYRVGRDEDALGIFEQMVKNGCKPDSSIYKSIMTWSCRKKKLSQAFSFWMKYLRNFRGWEDEKVAIVGESFDKGELETTIRRLIEMDMKS
KDFDLAPYTIFLIGLCQAERVSEAFAIFSVLKDFKMNISSASCVMLIGKLCMEEKLDLAMDVFLYTLEEGFMLMPRICNQLLSHLLRLEDRKDHALVLIHRMEAFGYDMN
AHLHDSTKLLLHDH