; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0013522 (gene) of Chayote v1 genome

Gene IDSed0013522
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG05:42582359..42587587
RNA-Seq ExpressionSed0013522
SyntenySed0013522
Gene Ontology termsGO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605380.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0079.95Show/hide
Query:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV
        MK R TF+RP + Y  PKPPWFH FH+ T+ IA+SNEV+ IIETV P E ALE I PHLS  VITSVI+EQP+ RLGFRLFIWSLRR HLCC ASQ+LI+
Subjt:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV

Query:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT
        DRLVKDNAFELYWKTLQELKDS+  ISSDAFSVLIEAYSKA M EKA++SFG + DFECKPN FAYNLILHVLVR+EAF+LALA+YNQMLKCNLNP+VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT

Query:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI
        YSILIHGFCKTSK +EAL LFDEMTDR + PN+ITYS+ILSGLC+AKKIDDA RLF  MRASGCSPDVITYNV+LNGFCKLGY DEAFA L+SFEKDGHI
Subjt:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI

Query:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS
        LG+ GYSCLID LF+ARRYDEAHMWYQ+  +KNV+PDVILYTIMIQGL QEG  NEALALL EM E G SPDT CYNAVI+GFCD+GLLDKAQSL+LEIS
Subjt:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS

Query:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ
         HDCFP++HTYSILICGMCKNGL+ EAQH+FNEMEKLGCLPSVVTFNSLIDG CKAG+LKEAHLLFYKMEIGRKP LFLRLSQG +K+L +  LQV +EQ
Subjt:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ

Query:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN
        L ESG+I KAY LLMQLVESGV PDIRTYNILINGFCK NN++GAFKLFKDMQLKGRLPDSVTYGT+IDGLHRVGRD+DALGIFE MVK+GCKP   VY 
Subjt:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN

Query:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA
         IMTWSCR+KKVS AFS+WMKYLRNFRGWK+E V VVEESF KG++ KAI R+IEMDL SKDFDL PYTIFLVGLCQAGRVSEA  +F VLKDFK  I +
Subjt:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA

Query:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL
         SCVMLIGGLCVE K  LA+EVFLYTLETG MLMPRICNQLL HLL+ ED KDHAF L+RRMEAFGYDMNA+L +STK LL+DHWKSLK K R+ Q+L
Subjt:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL

KAG7035334.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0079.95Show/hide
Query:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV
        MK R TF+RP + Y  PKPPWFH FH+ T+ IA+SNEV+ IIETV P E ALE I PHLS  VITSVI+EQP+ RLGFRLFIWSLRR HLCC ASQ+LI+
Subjt:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV

Query:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT
        DRLVKDNAFELYWKTLQELKDS+  ISSDAFSVLIEAYSKA M EKA++SFG + DFECKPN FAYNLILHVLVR+EAF+LALA+YNQMLKCNLNP+VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT

Query:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI
        YSILIHGFCKTSK +EAL LFDEMTDR + PN+ITYS+ILSGLC+AKKIDDA RLF  MRASGCSPDVITYNV+LNGFCKLGY DEAFA L+SFEKDGHI
Subjt:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI

Query:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS
        LG+ GYSCLID LF+ARRYDEAHMWYQ+  +KNV+PDVILYTIMIQGL QEG  NEALALL EM E G SPDT CYNAVI+GFCD+GLLDKAQSL+LEIS
Subjt:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS

Query:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ
         HDCFP++HTYSILICGMCKNGL+ EAQH+FNEMEKLGCLPSVVTFNSLIDG CKAG+LKEAHLLFYKMEIGRKP LFLRLSQG +K+L +  LQV +EQ
Subjt:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ

Query:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN
        L ESG+I KAY LLMQLVESGV PDIRTYNILINGFCK NN++GAF LFKDMQLKGRLPDSVTYGT+IDGLHRVGRDEDALGIFE MVK+GCKP   VY 
Subjt:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN

Query:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA
         IMTWSCR+KKVS AFS+WMKYLRNFRGWK+E V VVEESF KG++ KAI R+IEMDL SKDFDL PYTIFLVGLCQAGRVSEA  +F VLKDFK  I +
Subjt:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA

Query:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL
         SCVMLIGGLCVE K  LA+EVFLYTLETG MLMPRICNQLL HLL+ ED KDHAF L+RRMEAFGYDMNA+L +STK LL+DHWKSLK K R+ Q+L
Subjt:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL

XP_022948073.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita moschata]0.0e+0080.08Show/hide
Query:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV
        MK R TF+RP + Y  PKPPWFH FH+ T+ IATSNEV+ IIETV P E ALE I PHLS  VITSVI+EQP+ RLGFRLFIWSLRR HLCC ASQ+LI+
Subjt:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV

Query:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT
        DRLVKDNAFELYWKTLQELKDS+  ISSDAFSVLIEAYSKA M EKA++SFG + DFECKPN +AYNLILHVLVR+EAF+LALA+YNQMLKCNLNP+VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT

Query:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI
        YSILIHGFCKTSK +EAL LFDEMTDR + PN+ITYS+ILSGLC+AKKIDDA RLF  MRASGCSPDVITYNV+LNGFCKLGY DEAFA L+SFEKDGHI
Subjt:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI

Query:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS
        LG+ GYSCLID LF+ARRYDEAHMWYQ+  +KNV+PDVILYTIMIQGL QEG  NEALALL EM E G SPDTTCYNAVI+GFCD+GLLDKAQSL+LEIS
Subjt:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS

Query:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ
         HDCFP++HTYSILICGMCKNGL+ EAQH+FNEMEKLGCLPSVVTFNSLIDG CKAG+LKEAHLLFYKMEIGRKPSLFLRLSQG +K+L +  LQV +EQ
Subjt:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ

Query:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN
        L ESG+I KAY LLMQLVESGV PDIRTYNILINGFCK NN++GAFKLFKDMQLKGRLPDSVTYGT+IDGLHRVGRDEDALGIFE MVK+GCKP   VY 
Subjt:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN

Query:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA
         IMTWSCR+KKVS  FS+WMKYLRNFRGWK+E V VVEESF KG++ KAI R+IEMDL SKDF+L PYTIFL+GLCQAGRVSEA  +F VLKDFK  I +
Subjt:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA

Query:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL
         SCVMLIGGLCVE K  LA+EVFLYTLETG MLMPRICNQLL HLL+ ED KDHAF L+RRMEAFGYDMNA+L +STK LL+DHWKSLK K R+ QRL
Subjt:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL

XP_023007126.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita maxima]0.0e+0080.7Show/hide
Query:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV
        MK R TF+RP + Y  PKPPWFH FH+PT+ IATSNEV+ IIETV P E ALE I PH+S  VITSVI+EQP+ RLGFRLFIWSLRR HLCC ASQDLI+
Subjt:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV

Query:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT
        DRLVKDNAFELYWKTLQELKDS+  ISSDAFSVLIEAYSKA M+EKA++SFG + DFECKPN FAYNLILHVLVR+EAF+LALA+YNQMLKCNLNP+VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT

Query:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI
        YSILIHGFCKTSK +EAL LFDEMTDR + PN+ITYS+ILSGLC+AKKIDDA RLF  MRASGCSPDVITYNV+LNGFCKLGY DEAFA LRSFEKDGHI
Subjt:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI

Query:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS
        LG+ GYSCLID LF+ARRYDEAHMWYQ+  +KNV+PDVILYTIMIQGL QEG  NEALALL EM E G SPDTTCYNAVI+GFCD+GLLDKAQSL+LEIS
Subjt:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS

Query:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ
         HDCFPD+HTYSILICGMCKNGL+ EAQH+FNEMEKLGCLPSVVTFNSLIDG CKAG+LKEAHLLFYKMEIGRKPSLFLRL QG +KVL +  LQV +EQ
Subjt:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ

Query:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN
        L ESG+I KAY LLMQLVESGV PDIRTYNILINGFCK NN++GAFKLFKDMQLKGRLPDS+TYGT+IDGLHRVGRDEDALGIFE MVKNGCKP S VY 
Subjt:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN

Query:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA
         IMTWSCR+KKVS AFS+WMKYLRNFRGWK+E V VVEESF KG++ KAI R+IEMDL SKDFDL PYTIFL+GLCQAGRVSEA  +F VLKDFK  I +
Subjt:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA

Query:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL
         SCVMLIGGLCVE K  LA+EVFLYTLETG MLMPRICNQLL H L+ ED KDHAF L+RRMEAFGYDMNA+L +STK LL+DHWKSLK K R+ Q L
Subjt:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL

XP_023534570.1 pentatricopeptide repeat-containing protein At1g79540 [Cucurbita pepo subsp. pepo]0.0e+0080.2Show/hide
Query:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV
        MK R TF+RP + Y  PKPPWFH FH+PT+ IATSNEV+ IIETV P E ALE I PH+S  VITSVI+EQP+ RLGFR+FIWSLRR HLCC ASQ+LI+
Subjt:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV

Query:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT
        DRLVKDNAFELYWKTLQELKDS+  ISSDAFSVLIEAYSKA M EKA++SFG + DFECKPN FAYNLILHVLVR+EAF+LALA+YNQMLKCNLNP+VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT

Query:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI
        YSILIHGFCKTSK +EAL LFDEMTDR + PN+ITYS+ILSGLC+AKKIDDA RLF  MRASGCSPDVITYNV+LNGFCKLGY DEAFA L+SFEKDGHI
Subjt:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI

Query:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS
        LG+ GYSCLID LF+ARRYDEAHMWYQ+  +KNV+PDVILYTIMIQGL QEG  NEALALL EM E G SPDTTCYNAVI+GFCD+GLLDKAQSL+LEIS
Subjt:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS

Query:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ
         HDCFPD+HTYSILICGMCKNGL+ EAQH+FNEMEKLGCLPSVVTFNSLIDG CKAG+LKEAHLLFYKMEIGRKPSLFLRLSQG +KVL +  LQV +EQ
Subjt:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ

Query:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN
        L ESG+I KAY LLMQLVESGV PDIRTYNILINGFCK NN++GAFKLFKDMQLKGRLPDSVTYGT+IDGLHRVGRDEDALGIFE MVK+GCKP   VY 
Subjt:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN

Query:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA
         IMTWSCR+KKVS AFS+WMKYLRNFRGWK+E V VVEESF KG++ KAI R+IEMDL SKDFDL PYTIFL+GLCQAGR SEA  +F VLKDFK  I +
Subjt:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA

Query:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL
         SCVMLIGGLCVE K  LA+EVFLYTLETG MLMPRICNQLL H L+ E+ KDHAF L+RRMEAFGYDMNAHL +STK LL+DHWKSLK K R+ Q+L
Subjt:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL

TrEMBL top hitse value%identityAlignment
A0A0A0KD52 Uncharacterized protein0.0e+0071.17Show/hide
Query:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV
        MKLRP   RP I +  PKP  FH +HS TN IATS EV+ IIET+ P E  L+ I   +    ITSV++EQPD RLGFRLFIWSL+  HL CR  QDLI+
Subjt:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV

Query:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT
         +L+K+NAFELYWK LQELK+SAI ISS+AFSVLIEAYS+A MDEKA+ESFG + DF+CKP+ FA+NLILH LVRKEAF+LALA+YNQMLKCNLNP VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT

Query:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI
        Y ILIHG CKT K ++AL LFDEMTDRGI PNQI YS++LSGLC+AKKI DA RLF  MRASGC+ D+ITYNV+LNGFCK GYLD+AF  L+   KDGHI
Subjt:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI

Query:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS
        LG+ GY CLI+ LF+ARRY+EAHMWYQ+ML++N+KPDV+LYTIMI+GLSQEG   EAL LLGEM E GL PDT CYNA+IKGFCD+G LD+A+SL+LEIS
Subjt:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS

Query:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ
        KHDCFP++HTYSILICGMCKNGL+++AQHIF EMEKLGCLPSVVTFNSLI+GLCKA RL+EA LLFY+MEI RKPSLFLRLSQG DKV D A LQV ME+
Subjt:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ

Query:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN
        L ESGMILKAY LLMQLV+SGVLPDIRTYNILINGFCK  N+NGAFKLFK+MQLKG +PDSVTYGT+IDGL+R GR+EDAL IFE MVK GC P S  Y 
Subjt:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN

Query:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA
         IMTWSCR+  +S A S+WMKYLR+FRGW++E V VV ESF   E+  AIRRL+EMD++SK+FDL PYTIFL+GL QA R  EA  +F VLKDFKMNI +
Subjt:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA

Query:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDH
         SCVMLIG LC+ E   +AM+VFL+TLE G  LMP ICNQLL +LL+  D KD A  L  RMEA GYD+ AHL Y TKL L+DH
Subjt:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDH

A0A5D3B9M5 Pentatricopeptide repeat-containing protein0.0e+0068.03Show/hide
Query:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV
        MKLRP   RP I +  PKPP F  +HS TN I TS EV+ IIETV P E  L+ I   ++  +ITSV+ +QP+  LGFRLFIWSL  +H   RA + LI+
Subjt:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV

Query:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT
        D+L+KDNAFELYWK LQELK+SAI ISSDAFSVLIEAYS+A M+EKA+ESFG + DF+CKPN FA+NLIL  LVRKEAF+LALA+YNQMLKCNLNP V T
Subjt:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT

Query:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI
        Y ILIHGFC+T K ++AL LFDEMT RGI PN+I Y+++LSGLC AKKI DA RLF  M A     D+ TYNV+LNGFCKLGYLDEAF  L+   KDGH 
Subjt:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI

Query:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS
        L ++GY CLI+ LF+ARRY+EAH WY++ML++N+KPDVILYTIMIQGLSQEG    A+ LLGEM E GL PDT CYNA+IKGFCDIG LDKAQSL+LEIS
Subjt:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS

Query:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ
         H CFP +HTYSILICGMCK+GL++EAQHIF EMEKLGCLPSVVTFNSLI+GLCKA RL+EA LLFY+MEI RKPSLFLRLSQG DKVLD A LQV MEQ
Subjt:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ

Query:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN
        L ESG+ILKAY LLMQLV+SGVLPDIRTYNILINGFCK  N+NGAFKLFK+MQ +G +PDSVTYGT+IDGL+RVGR+EDALGIF  M K GC P S  Y 
Subjt:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN

Query:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA
         IMTW CR+K +    S+WMKYLRNFRGW++E V VVEESF   E+  AIRRL+EMD++SK+FD+ PYTIFL+GLC+A RVSEA  +F V KDFKMNI +
Subjt:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA

Query:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKG---RYVQR
         SCV LI GLC  EK  LA++VFL+TLE    +MP ICN+LL HLL+  D KD A  L  R+EA GYD+ AHL Y TKLLL+DH +SL+ K    +Y+  
Subjt:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKG---RYVQR

Query:  LPIHSRE
        L  HS+E
Subjt:  LPIHSRE

A0A6J1D6A9 pentatricopeptide repeat-containing protein At1g79540 isoform X10.0e+0077.35Show/hide
Query:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV
        MK RPTFIRP I    PKPPWFH +HSPT+ IATSNEV  I+ETV+PFE ALE I PH+SP VITSVIEEQP+PRLGFRLFIWSL+   LCC ASQ+LI+
Subjt:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV

Query:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT
        DRLV+DNAFELYWKTLQELKDSA++I SDAFSVLIEAYS A MDEKA+ESFG + DF+CKPN F YNLIL+VLVRKEAF LAL++YNQML+CN  P+VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT

Query:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI
        YSILIHG CKTSK ++AL LFDEM +RGI PN+ITYS++LSGLC+A KIDDA RLF  MRASGCSPD ITYNV+LNGFCK GY DEAFA L++FEKDGHI
Subjt:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI

Query:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS
        LG+N YSCLID LF+ARRYDEA  WYQ+ML++N+KPDVILYTIMIQGLSQEG  N+ALALLGEM E G SPDTTCYNA+IKGFCD+ LLDKA+SL+L IS
Subjt:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS

Query:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ
         HDC PD+HTYSILICGMC+NGL+ EAQ++FNEMEKLGCLPSV TFNSLIDGLCK GR+ EA LLFYKMEIGRKPS+FLRL+QGV+KVLD+AGLQV +EQ
Subjt:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ

Query:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN
        L ESGMILKAY LLMQL ESGVLPDIRTYNILINGFCK N +NGAFKLFKDMQLKGRLPDSVTYGT+I+GLHRVGRD+DAL +F+ MVK GCKP S VY 
Subjt:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN

Query:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA
         IMTWSCRKK VS AFS+WMKYL NFRGWK+E+V VVE SF KGE+ KAI+RLIEMD +SKDFD +PYTIFL+GLCQA RVSEA  +F VLKDFKMN   
Subjt:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA

Query:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNK---GRYVQR
         SCVMLIGGLC+EEK  LA++VFLYTLETG +LMPRICNQLL HLL SED KDHA  L+RRME FGYDM+A+L YSTK LL+DHWKSL  K    RY QR
Subjt:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNK---GRYVQR

Query:  LPIHSRES
        LPIHS+ES
Subjt:  LPIHSRES

A0A6J1G8C6 pentatricopeptide repeat-containing protein At1g795400.0e+0080.08Show/hide
Query:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV
        MK R TF+RP + Y  PKPPWFH FH+ T+ IATSNEV+ IIETV P E ALE I PHLS  VITSVI+EQP+ RLGFRLFIWSLRR HLCC ASQ+LI+
Subjt:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV

Query:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT
        DRLVKDNAFELYWKTLQELKDS+  ISSDAFSVLIEAYSKA M EKA++SFG + DFECKPN +AYNLILHVLVR+EAF+LALA+YNQMLKCNLNP+VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT

Query:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI
        YSILIHGFCKTSK +EAL LFDEMTDR + PN+ITYS+ILSGLC+AKKIDDA RLF  MRASGCSPDVITYNV+LNGFCKLGY DEAFA L+SFEKDGHI
Subjt:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI

Query:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS
        LG+ GYSCLID LF+ARRYDEAHMWYQ+  +KNV+PDVILYTIMIQGL QEG  NEALALL EM E G SPDTTCYNAVI+GFCD+GLLDKAQSL+LEIS
Subjt:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS

Query:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ
         HDCFP++HTYSILICGMCKNGL+ EAQH+FNEMEKLGCLPSVVTFNSLIDG CKAG+LKEAHLLFYKMEIGRKPSLFLRLSQG +K+L +  LQV +EQ
Subjt:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ

Query:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN
        L ESG+I KAY LLMQLVESGV PDIRTYNILINGFCK NN++GAFKLFKDMQLKGRLPDSVTYGT+IDGLHRVGRDEDALGIFE MVK+GCKP   VY 
Subjt:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN

Query:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA
         IMTWSCR+KKVS  FS+WMKYLRNFRGWK+E V VVEESF KG++ KAI R+IEMDL SKDF+L PYTIFL+GLCQAGRVSEA  +F VLKDFK  I +
Subjt:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA

Query:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL
         SCVMLIGGLCVE K  LA+EVFLYTLETG MLMPRICNQLL HLL+ ED KDHAF L+RRMEAFGYDMNA+L +STK LL+DHWKSLK K R+ QRL
Subjt:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL

A0A6J1KZN2 pentatricopeptide repeat-containing protein At1g795400.0e+0080.7Show/hide
Query:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV
        MK R TF+RP + Y  PKPPWFH FH+PT+ IATSNEV+ IIETV P E ALE I PH+S  VITSVI+EQP+ RLGFRLFIWSLRR HLCC ASQDLI+
Subjt:  MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIV

Query:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT
        DRLVKDNAFELYWKTLQELKDS+  ISSDAFSVLIEAYSKA M+EKA++SFG + DFECKPN FAYNLILHVLVR+EAF+LALA+YNQMLKCNLNP+VVT
Subjt:  DRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVT

Query:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI
        YSILIHGFCKTSK +EAL LFDEMTDR + PN+ITYS+ILSGLC+AKKIDDA RLF  MRASGCSPDVITYNV+LNGFCKLGY DEAFA LRSFEKDGHI
Subjt:  YSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHI

Query:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS
        LG+ GYSCLID LF+ARRYDEAHMWYQ+  +KNV+PDVILYTIMIQGL QEG  NEALALL EM E G SPDTTCYNAVI+GFCD+GLLDKAQSL+LEIS
Subjt:  LGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEIS

Query:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ
         HDCFPD+HTYSILICGMCKNGL+ EAQH+FNEMEKLGCLPSVVTFNSLIDG CKAG+LKEAHLLFYKMEIGRKPSLFLRL QG +KVL +  LQV +EQ
Subjt:  KHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQ

Query:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN
        L ESG+I KAY LLMQLVESGV PDIRTYNILINGFCK NN++GAFKLFKDMQLKGRLPDS+TYGT+IDGLHRVGRDEDALGIFE MVKNGCKP S VY 
Subjt:  LIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYN

Query:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA
         IMTWSCR+KKVS AFS+WMKYLRNFRGWK+E V VVEESF KG++ KAI R+IEMDL SKDFDL PYTIFL+GLCQAGRVSEA  +F VLKDFK  I +
Subjt:  CIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPA

Query:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL
         SCVMLIGGLCVE K  LA+EVFLYTLETG MLMPRICNQLL H L+ ED KDHAF L+RRMEAFGYDMNA+L +STK LL+DHWKSLK K R+ Q L
Subjt:  TSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLYSTKLLLYDHWKSLKNKGRYVQRL

SwissProt top hitse value%identityAlignment
Q9FIX3 Pentatricopeptide repeat-containing protein At5g397102.7e-7229.41Show/hide
Query:  IVDRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVR-KEAFILALALYNQMLKCNLNPH
        +  + + D    L +K+LQE  D   S SS  F +++++YS+ S+ +KA+            P   +YN +L   +R K     A  ++ +ML+  ++P+
Subjt:  IVDRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVR-KEAFILALALYNQMLKCNLNPH

Query:  VVTYSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKD
        V TY+ILI GFC    I  AL LFD+M  +G  PN +TY+ ++ G C+ +KIDD  +L  +M   G  P++I+YNV++NG C+ G + E    L    + 
Subjt:  VVTYSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKD

Query:  GHILGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVI-----------------------------------LYTIMIQGLSQEGYANEALALLG
        G+ L    Y+ LI    K   + +A + +  ML+  + P VI                                    YT ++ G SQ+GY NEA  +L 
Subjt:  GHILGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVI-----------------------------------LYTIMIQGLSQEGYANEALALLG

Query:  EMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEA
        EM ++G SP    YNA+I G C  G ++ A ++  ++ +    PD  +YS ++ G C++  V EA  +  EM + G  P  +T++SLI G C+  R KEA
Subjt:  EMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEA

Query:  HLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSV
          L+ +M         LR+    D+   +A +     +    G + KA  L  ++VE GVLPD+ TY++LING  K +    A +L   +  +  +P  V
Subjt:  HLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSV

Query:  TYGTIIDGLHRV---------------GRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIWMKYLRN
        TY T+I+    +               G   +A  +FE M+    KP    YN ++   CR   + +A++++ + +++
Subjt:  TYGTIIDGLHRV---------------GRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIWMKYLRN

Q9LQ14 Pentatricopeptide repeat-containing protein At1g62930, chloroplastic1.7e-7130.39Show/hide
Query:  RASQDLIVDRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKC
        + S+++++D L  D+A +L+ + +Q     +I      F+ L+ A +K +  +  I    R+ +     + ++YN++++   R+    LALA+  +M+K 
Subjt:  RASQDLIVDRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKC

Query:  NLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLR
           P +VT S L++G+C   +I EA+ L D+M      PN +T++ ++ GL    K  +A  L   M A GC PD+ TY  ++NG CK G +D A + L+
Subjt:  NLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLR

Query:  SFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKA
          EK      +  Y+ +IDAL   +  ++A   +  M  K ++P+V+ Y  +I+ L   G  ++A  LL +M E  ++P+   ++A+I  F   G L +A
Subjt:  SFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKA

Query:  QSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSA
        + L  E+ K    PD  TYS LI G C +  + EA+H+F  M    C P+VVT+N+LI G CKA R++E   LF +M             +G+  V ++ 
Subjt:  QSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSA

Query:  GLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGC
             ++ L ++G    A  +  ++V  GV PDI TY+IL++G CK   +  A  +F+ +Q     PD  TY  +I+G+ + G+ ED   +F  +   G 
Subjt:  GLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGC

Query:  KPLSPVYNCIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVI
        KP   +Y  +++  CRK    +A ++       FR  KE+  +
Subjt:  KPLSPVYNCIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVI

Q9SAJ5 Pentatricopeptide repeat-containing protein At1g795401.4e-22550.58Show/hide
Query:  FIRPTINYSGPKPPWFHCFHSPTNL-IATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIVDRLVK
        F R  I +   KP W    +S  N     S EV +I+    P E ALE +VP LS  +ITSVI+++ + +LGFR FIW+ RR  L  R S  L++D L +
Subjt:  FIRPTINYSGPKPPWFHCFHSPTNL-IATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIVDRLVK

Query:  DNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEA-FILALALYNQMLKCNLNPHVVTYSIL
        DN  +LYW+TL+ELK   +S+ S  F VLI AY+K  M EKA+ESFGR+ +F+C+P+ F YN+IL V++R+E  F+LA A+YN+MLKCN +P++ T+ IL
Subjt:  DNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEA-FILALALYNQMLKCNLNPHVVTYSIL

Query:  IHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGIN
        + G  K  +  +A ++FD+MT RGI PN++TY++++SGLC+    DDA +LF  M+ SG  PD + +N +L+GFCKLG + EAF  LR FEKDG +LG+ 
Subjt:  IHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGIN

Query:  GYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDC
        GYS LID LF+ARRY +A   Y  ML+KN+KPD+ILYTI+IQGLS+ G   +AL LL  MP  G+SPDT CYNAVIK  C  GLL++ +SLQLE+S+ + 
Subjt:  GYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDC

Query:  FPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIES
        FPD+ T++ILIC MC+NGLV EA+ IF E+EK GC PSV TFN+LIDGLCK+G LKEA LL +KME+GR  SLFLRLS   ++  D+         ++ES
Subjt:  FPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIES

Query:  GMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMT
        G ILKAY  L    ++G  PDI +YN+LINGFC+  +++GA KL   +QLKG  PDSVTY T+I+GLHRVGR+E+A  +F    K+  +    VY  +MT
Subjt:  GMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMT

Query:  WSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPATSCV
        WSCRK+KV  AF++WMKYL+      +E    +E+ F +GE  +A+RRLIE+D    +  L PYTI+L+GLCQ+GR  EA+ +F VL++ K+ +   SCV
Subjt:  WSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPATSCV

Query:  MLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLY
         LI GLC  E+   A+EVFLYTL+    LMPR+CN LL  LL S +  +    L  RME  GY++++ L +
Subjt:  MLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLY

Q9SH26 Pentatricopeptide repeat-containing protein At1g634003.0e-7132.73Show/hide
Query:  FSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIF
        F+ L+ A +K    +  I    ++       N + YN++++   R+    LALAL  +M+K    P +VT S L++G+C   +I +A+ L D+M + G  
Subjt:  FSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIF

Query:  PNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRML
        P+ IT++ ++ GL    K  +A  L   M   GC P+++TY V++NG CK G +D AF  L   E       +  YS +ID+L K R  D+A   +  M 
Subjt:  PNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRML

Query:  QKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHI
         K V+P+VI Y+ +I  L      ++A  LL +M E  ++P+   +NA+I  F   G L +A+ L  E+ K    PD  TYS LI G C +  + EA+H+
Subjt:  QKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHI

Query:  FNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKME----IGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDI
        F  M    C P+VVT+N+LI+G CKA R+ E   LF +M     +G   + +  L  G  +  D    Q          M+ K      Q+V  GV P+I
Subjt:  FNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKME----IGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDI

Query:  RTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIWMK
         TYN L++G CKN  +  A  +F+ +Q     P   TY  +I+G+ + G+ ED   +F  +   G KP   +YN +++  CRK    +A +++ K
Subjt:  RTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIWMK

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial2.3e-7131.9Show/hide
Query:  FSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIF
        FS L+ A +K +  +  I    ++ +     N + Y+++++   R+    LALA+  +M+K    P++VT S L++G+C + +I EA+ L D+M   G  
Subjt:  FSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIF

Query:  PNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRML
        PN +T++ ++ GL    K  +A  L   M A GC PD++TY V++NG CK G  D AF  L   E+     G+  Y+ +ID L K +  D+A   ++ M 
Subjt:  PNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRML

Query:  QKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHI
         K ++P+V+ Y+ +I  L   G  ++A  LL +M E  ++PD   ++A+I  F   G L +A+ L  E+ K    P   TYS LI G C +  + EA+ +
Subjt:  QKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHI

Query:  FNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYN
        F  M    C P VVT+N+LI G CK  R++E       ME+ R+ S      +G+  V ++    + ++ L ++G    A  +  ++V  GV P+I TYN
Subjt:  FNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYN

Query:  ILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIW
         L++G CKN  +  A  +F+ +Q     P   TY  +I+G+ + G+ ED   +F  +   G KP    YN +++  CRK    +A +++
Subjt:  ILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIW

Arabidopsis top hitse value%identityAlignment
AT1G62670.1 rna processing factor 21.6e-7231.9Show/hide
Query:  FSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIF
        FS L+ A +K +  +  I    ++ +     N + Y+++++   R+    LALA+  +M+K    P++VT S L++G+C + +I EA+ L D+M   G  
Subjt:  FSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIF

Query:  PNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRML
        PN +T++ ++ GL    K  +A  L   M A GC PD++TY V++NG CK G  D AF  L   E+     G+  Y+ +ID L K +  D+A   ++ M 
Subjt:  PNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRML

Query:  QKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHI
         K ++P+V+ Y+ +I  L   G  ++A  LL +M E  ++PD   ++A+I  F   G L +A+ L  E+ K    P   TYS LI G C +  + EA+ +
Subjt:  QKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHI

Query:  FNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYN
        F  M    C P VVT+N+LI G CK  R++E       ME+ R+ S      +G+  V ++    + ++ L ++G    A  +  ++V  GV P+I TYN
Subjt:  FNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYN

Query:  ILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIW
         L++G CKN  +  A  +F+ +Q     P   TY  +I+G+ + G+ ED   +F  +   G KP    YN +++  CRK    +A +++
Subjt:  ILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIW

AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-7230.39Show/hide
Query:  RASQDLIVDRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKC
        + S+++++D L  D+A +L+ + +Q     +I      F+ L+ A +K +  +  I    R+ +     + ++YN++++   R+    LALA+  +M+K 
Subjt:  RASQDLIVDRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKC

Query:  NLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLR
           P +VT S L++G+C   +I EA+ L D+M      PN +T++ ++ GL    K  +A  L   M A GC PD+ TY  ++NG CK G +D A + L+
Subjt:  NLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLR

Query:  SFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKA
          EK      +  Y+ +IDAL   +  ++A   +  M  K ++P+V+ Y  +I+ L   G  ++A  LL +M E  ++P+   ++A+I  F   G L +A
Subjt:  SFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKA

Query:  QSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSA
        + L  E+ K    PD  TYS LI G C +  + EA+H+F  M    C P+VVT+N+LI G CKA R++E   LF +M             +G+  V ++ 
Subjt:  QSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSA

Query:  GLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGC
             ++ L ++G    A  +  ++V  GV PDI TY+IL++G CK   +  A  +F+ +Q     PD  TY  +I+G+ + G+ ED   +F  +   G 
Subjt:  GLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGC

Query:  KPLSPVYNCIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVI
        KP   +Y  +++  CRK    +A ++       FR  KE+  +
Subjt:  KPLSPVYNCIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVI

AT1G63400.1 Pentatricopeptide repeat (PPR) superfamily protein2.1e-7232.73Show/hide
Query:  FSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIF
        F+ L+ A +K    +  I    ++       N + YN++++   R+    LALAL  +M+K    P +VT S L++G+C   +I +A+ L D+M + G  
Subjt:  FSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVTYSILIHGFCKTSKIKEALELFDEMTDRGIF

Query:  PNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRML
        P+ IT++ ++ GL    K  +A  L   M   GC P+++TY V++NG CK G +D AF  L   E       +  YS +ID+L K R  D+A   +  M 
Subjt:  PNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRML

Query:  QKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHI
         K V+P+VI Y+ +I  L      ++A  LL +M E  ++P+   +NA+I  F   G L +A+ L  E+ K    PD  TYS LI G C +  + EA+H+
Subjt:  QKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHI

Query:  FNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKME----IGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDI
        F  M    C P+VVT+N+LI+G CKA R+ E   LF +M     +G   + +  L  G  +  D    Q          M+ K      Q+V  GV P+I
Subjt:  FNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKME----IGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDI

Query:  RTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIWMK
         TYN L++G CKN  +  A  +F+ +Q     P   TY  +I+G+ + G+ ED   +F  +   G KP   +YN +++  CRK    +A +++ K
Subjt:  RTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIWMK

AT1G79540.1 Pentatricopeptide repeat (PPR) superfamily protein9.6e-22750.58Show/hide
Query:  FIRPTINYSGPKPPWFHCFHSPTNL-IATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIVDRLVK
        F R  I +   KP W    +S  N     S EV +I+    P E ALE +VP LS  +ITSVI+++ + +LGFR FIW+ RR  L  R S  L++D L +
Subjt:  FIRPTINYSGPKPPWFHCFHSPTNL-IATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIVDRLVK

Query:  DNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEA-FILALALYNQMLKCNLNPHVVTYSIL
        DN  +LYW+TL+ELK   +S+ S  F VLI AY+K  M EKA+ESFGR+ +F+C+P+ F YN+IL V++R+E  F+LA A+YN+MLKCN +P++ T+ IL
Subjt:  DNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEA-FILALALYNQMLKCNLNPHVVTYSIL

Query:  IHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGIN
        + G  K  +  +A ++FD+MT RGI PN++TY++++SGLC+    DDA +LF  M+ SG  PD + +N +L+GFCKLG + EAF  LR FEKDG +LG+ 
Subjt:  IHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGIN

Query:  GYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDC
        GYS LID LF+ARRY +A   Y  ML+KN+KPD+ILYTI+IQGLS+ G   +AL LL  MP  G+SPDT CYNAVIK  C  GLL++ +SLQLE+S+ + 
Subjt:  GYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDC

Query:  FPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIES
        FPD+ T++ILIC MC+NGLV EA+ IF E+EK GC PSV TFN+LIDGLCK+G LKEA LL +KME+GR  SLFLRLS   ++  D+         ++ES
Subjt:  FPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIES

Query:  GMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMT
        G ILKAY  L    ++G  PDI +YN+LINGFC+  +++GA KL   +QLKG  PDSVTY T+I+GLHRVGR+E+A  +F    K+  +    VY  +MT
Subjt:  GMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMT

Query:  WSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPATSCV
        WSCRK+KV  AF++WMKYL+      +E    +E+ F +GE  +A+RRLIE+D    +  L PYTI+L+GLCQ+GR  EA+ +F VL++ K+ +   SCV
Subjt:  WSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLESKDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPATSCV

Query:  MLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLY
         LI GLC  E+   A+EVFLYTL+    LMPR+CN LL  LL S +  +    L  RME  GY++++ L +
Subjt:  MLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMNAHLLY

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.9e-7329.41Show/hide
Query:  IVDRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVR-KEAFILALALYNQMLKCNLNPH
        +  + + D    L +K+LQE  D   S SS  F +++++YS+ S+ +KA+            P   +YN +L   +R K     A  ++ +ML+  ++P+
Subjt:  IVDRLVKDNAFELYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVR-KEAFILALALYNQMLKCNLNPH

Query:  VVTYSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKD
        V TY+ILI GFC    I  AL LFD+M  +G  PN +TY+ ++ G C+ +KIDD  +L  +M   G  P++I+YNV++NG C+ G + E    L    + 
Subjt:  VVTYSILIHGFCKTSKIKEALELFDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKD

Query:  GHILGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVI-----------------------------------LYTIMIQGLSQEGYANEALALLG
        G+ L    Y+ LI    K   + +A + +  ML+  + P VI                                    YT ++ G SQ+GY NEA  +L 
Subjt:  GHILGINGYSCLIDALFKARRYDEAHMWYQRMLQKNVKPDVI-----------------------------------LYTIMIQGLSQEGYANEALALLG

Query:  EMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEA
        EM ++G SP    YNA+I G C  G ++ A ++  ++ +    PD  +YS ++ G C++  V EA  +  EM + G  P  +T++SLI G C+  R KEA
Subjt:  EMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCLPSVVTFNSLIDGLCKAGRLKEA

Query:  HLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSV
          L+ +M         LR+    D+   +A +     +    G + KA  L  ++VE GVLPD+ TY++LING  K +    A +L   +  +  +P  V
Subjt:  HLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFKDMQLKGRLPDSV

Query:  TYGTIIDGLHRV---------------GRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIWMKYLRN
        TY T+I+    +               G   +A  +FE M+    KP    YN ++   CR   + +A++++ + +++
Subjt:  TYGTIIDGLHRV---------------GRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIWMKYLRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTCCGACCAACATTTATCCGGCCCACCATCAACTATTCAGGTCCAAAACCTCCATGGTTCCATTGTTTTCATTCGCCCACTAACCTAATCGCCACTTCCAATGA
GGTCGCCGCCATTATCGAAACTGTTTCCCCCTTCGAAGGTGCATTGGAGGCCATAGTCCCGCATCTATCCCCTGTTGTAATTACCTCCGTTATCGAAGAACAGCCGGATC
CCCGACTTGGATTTCGACTGTTCATTTGGTCGTTGAGAAGAAATCACCTGTGCTGCCGCGCCTCGCAGGATTTGATCGTCGACAGGTTAGTGAAGGACAATGCCTTTGAA
TTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCTGCTATTTCGATTTCATCGGACGCTTTCTCCGTGTTGATAGAGGCGTACTCGAAAGCGAGTATGGATGAGAAAGC
CATTGAATCGTTTGGCCGGATTAGTGATTTTGAATGTAAGCCCAATAGTTTTGCTTACAATTTGATTTTGCATGTTTTAGTTCGAAAAGAAGCGTTTATTTTAGCATTAG
CTCTGTATAATCAGATGCTGAAATGTAATTTGAATCCTCATGTGGTTACTTACAGCATTTTGATTCATGGATTTTGTAAAACTAGTAAGATTAAAGAGGCCCTTGAACTG
TTTGATGAAATGACTGATAGAGGAATATTTCCCAACCAGATAACTTATTCGGTTATTCTTTCTGGATTGTGTGAAGCTAAGAAGATTGATGATGCACATAGATTGTTTTG
TAATATGAGAGCTAGTGGATGTAGTCCAGATGTAATCACCTACAATGTCATGCTTAATGGATTTTGTAAGTTGGGTTATCTTGATGAAGCTTTTGCATTCTTGAGATCAT
TTGAAAAGGATGGCCATATTCTTGGAATCAATGGGTACAGTTGTTTGATTGATGCCTTGTTTAAGGCTAGGAGATATGATGAAGCACATATGTGGTACCAAAGAATGTTG
CAGAAAAATGTAAAGCCTGATGTTATCTTGTATACTATAATGATCCAAGGTTTATCCCAAGAAGGCTACGCCAACGAGGCGCTGGCGTTGTTGGGTGAGATGCCAGAAAG
TGGGTTAAGCCCGGATACTACTTGTTACAATGCTGTAATAAAAGGATTTTGTGATATTGGTCTTTTGGATAAGGCTCAGTCTCTTCAACTCGAGATTTCGAAACACGACT
GTTTCCCTGACAGCCACACATACTCTATTCTCATTTGTGGTATGTGCAAGAATGGGCTAGTCAGTGAGGCACAACATATATTCAATGAAATGGAGAAGCTTGGATGCCTT
CCTTCTGTTGTGACCTTCAATTCGCTCATTGATGGACTTTGCAAGGCTGGTAGGCTTAAGGAAGCTCATCTTCTATTTTACAAAATGGAGATTGGAAGAAAACCTTCTCT
GTTTCTTCGGCTTTCTCAGGGTGTCGATAAGGTTCTTGATAGTGCCGGGCTACAAGTTAGGATGGAACAATTAATTGAGTCAGGGATGATTCTTAAGGCATATAACCTTC
TTATGCAGCTTGTCGAGAGTGGAGTTTTGCCAGACATTAGAACTTACAACATCCTGATCAATGGATTTTGCAAGAACAACAACGTCAATGGTGCTTTCAAGCTCTTCAAA
GATATGCAACTTAAAGGGCGCTTGCCAGATTCGGTTACGTACGGGACTATAATAGATGGGCTCCATAGAGTTGGTAGGGATGAGGATGCACTAGGGATTTTTGAACTAAT
GGTAAAGAATGGGTGCAAGCCGTTGTCTCCTGTTTACAACTGCATCATGACGTGGTCATGTCGAAAAAAGAAGGTTTCACAAGCTTTTAGTATTTGGATGAAGTATTTGA
GGAATTTTCGTGGCTGGAAAGAAGAAAATGTCATAGTAGTAGAGGAAAGTTTTGGTAAAGGAGAGATTGGAAAGGCAATCCGGAGATTAATCGAAATGGACTTGGAATCA
AAAGACTTCGACTTAACTCCATACACCATTTTTCTCGTTGGATTGTGTCAAGCAGGGAGGGTTTCTGAAGCCATTGAATTATTTTATGTCCTCAAGGACTTCAAAATGAA
TATACCTGCAACAAGTTGTGTGATGTTGATTGGTGGGCTTTGCGTGGAAGAAAAATTTGGTCTAGCTATGGAAGTTTTCCTTTATACACTAGAAACGGGCCTTATGTTGA
TGCCTCGAATTTGTAACCAACTGCTAGAGCATCTTCTTAATTCGGAGGACGGAAAAGACCATGCTTTTGCTCTTGTACGTAGAATGGAGGCTTTTGGATATGATATGAAT
GCTCATCTCCTCTACAGTACTAAGTTACTTCTTTACGATCATTGGAAGTCATTGAAAAATAAAGGCAGATATGTTCAGCGATTGCCGATTCACAGCAGAGAATCCTAA
mRNA sequenceShow/hide mRNA sequence
GGGCAATAAATCTCCACACTCGTCTATTAGCAGCAGCACCAAACAACCGCATTGGACGAACAAAAGCAGAGGGTTCAACTTCAAAAGTCATCCTATCGAAGCATTTACAG
TCGCCGCCGATTTGAAAGCACCGATTTCTAGCGCCGCCGTAGATTCGAAGATTGCAAAAGTCAAAAACCTTCCTAAACCCTAATCAAACACCACCGATTTTTGCTTTTCC
GCTGCCAGAAATCTCTTTCACGGCAAGAAATGGAACATTTGCCGGACGAAATTAGCCGCCGATGAAGCTCCGACCAACATTTATCCGGCCCACCATCAACTATTCAGGTC
CAAAACCTCCATGGTTCCATTGTTTTCATTCGCCCACTAACCTAATCGCCACTTCCAATGAGGTCGCCGCCATTATCGAAACTGTTTCCCCCTTCGAAGGTGCATTGGAG
GCCATAGTCCCGCATCTATCCCCTGTTGTAATTACCTCCGTTATCGAAGAACAGCCGGATCCCCGACTTGGATTTCGACTGTTCATTTGGTCGTTGAGAAGAAATCACCT
GTGCTGCCGCGCCTCGCAGGATTTGATCGTCGACAGGTTAGTGAAGGACAATGCCTTTGAATTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCTGCTATTTCGATTT
CATCGGACGCTTTCTCCGTGTTGATAGAGGCGTACTCGAAAGCGAGTATGGATGAGAAAGCCATTGAATCGTTTGGCCGGATTAGTGATTTTGAATGTAAGCCCAATAGT
TTTGCTTACAATTTGATTTTGCATGTTTTAGTTCGAAAAGAAGCGTTTATTTTAGCATTAGCTCTGTATAATCAGATGCTGAAATGTAATTTGAATCCTCATGTGGTTAC
TTACAGCATTTTGATTCATGGATTTTGTAAAACTAGTAAGATTAAAGAGGCCCTTGAACTGTTTGATGAAATGACTGATAGAGGAATATTTCCCAACCAGATAACTTATT
CGGTTATTCTTTCTGGATTGTGTGAAGCTAAGAAGATTGATGATGCACATAGATTGTTTTGTAATATGAGAGCTAGTGGATGTAGTCCAGATGTAATCACCTACAATGTC
ATGCTTAATGGATTTTGTAAGTTGGGTTATCTTGATGAAGCTTTTGCATTCTTGAGATCATTTGAAAAGGATGGCCATATTCTTGGAATCAATGGGTACAGTTGTTTGAT
TGATGCCTTGTTTAAGGCTAGGAGATATGATGAAGCACATATGTGGTACCAAAGAATGTTGCAGAAAAATGTAAAGCCTGATGTTATCTTGTATACTATAATGATCCAAG
GTTTATCCCAAGAAGGCTACGCCAACGAGGCGCTGGCGTTGTTGGGTGAGATGCCAGAAAGTGGGTTAAGCCCGGATACTACTTGTTACAATGCTGTAATAAAAGGATTT
TGTGATATTGGTCTTTTGGATAAGGCTCAGTCTCTTCAACTCGAGATTTCGAAACACGACTGTTTCCCTGACAGCCACACATACTCTATTCTCATTTGTGGTATGTGCAA
GAATGGGCTAGTCAGTGAGGCACAACATATATTCAATGAAATGGAGAAGCTTGGATGCCTTCCTTCTGTTGTGACCTTCAATTCGCTCATTGATGGACTTTGCAAGGCTG
GTAGGCTTAAGGAAGCTCATCTTCTATTTTACAAAATGGAGATTGGAAGAAAACCTTCTCTGTTTCTTCGGCTTTCTCAGGGTGTCGATAAGGTTCTTGATAGTGCCGGG
CTACAAGTTAGGATGGAACAATTAATTGAGTCAGGGATGATTCTTAAGGCATATAACCTTCTTATGCAGCTTGTCGAGAGTGGAGTTTTGCCAGACATTAGAACTTACAA
CATCCTGATCAATGGATTTTGCAAGAACAACAACGTCAATGGTGCTTTCAAGCTCTTCAAAGATATGCAACTTAAAGGGCGCTTGCCAGATTCGGTTACGTACGGGACTA
TAATAGATGGGCTCCATAGAGTTGGTAGGGATGAGGATGCACTAGGGATTTTTGAACTAATGGTAAAGAATGGGTGCAAGCCGTTGTCTCCTGTTTACAACTGCATCATG
ACGTGGTCATGTCGAAAAAAGAAGGTTTCACAAGCTTTTAGTATTTGGATGAAGTATTTGAGGAATTTTCGTGGCTGGAAAGAAGAAAATGTCATAGTAGTAGAGGAAAG
TTTTGGTAAAGGAGAGATTGGAAAGGCAATCCGGAGATTAATCGAAATGGACTTGGAATCAAAAGACTTCGACTTAACTCCATACACCATTTTTCTCGTTGGATTGTGTC
AAGCAGGGAGGGTTTCTGAAGCCATTGAATTATTTTATGTCCTCAAGGACTTCAAAATGAATATACCTGCAACAAGTTGTGTGATGTTGATTGGTGGGCTTTGCGTGGAA
GAAAAATTTGGTCTAGCTATGGAAGTTTTCCTTTATACACTAGAAACGGGCCTTATGTTGATGCCTCGAATTTGTAACCAACTGCTAGAGCATCTTCTTAATTCGGAGGA
CGGAAAAGACCATGCTTTTGCTCTTGTACGTAGAATGGAGGCTTTTGGATATGATATGAATGCTCATCTCCTCTACAGTACTAAGTTACTTCTTTACGATCATTGGAAGT
CATTGAAAAATAAAGGCAGATATGTTCAGCGATTGCCGATTCACAGCAGAGAATCCTAAATGCCCCATTTTGCATGGTTGAAAATAATGGGAGGAATTGATACGCTAACC
AGCGCACAACTCAATTGGCATTAAGTGTGTGCTTATGACCAAAAGGTCATGATTTCTAATTTTTCCACCCTCAACTGCCTTTTTAATTATAGGAAACGAAAAACCATTTT
ATTGATAAAATGAAAAAGTTTCTCGATTAAGAGCTCCATAAGTGGAAAGTTGGATAAACGAACTGATATATAGTAGGAGGGCAGGTACCAGAGCAAATAATGTTGGTACA
TTTTCGGGCTGGTACCAGGGCAGATAATGCTGCTTTCCTTTCAGGTATTGAGTCATGTCATGTATTGAGCCTCTCTTCTATAAAAGAAGTGAAGATTAGCTCAACTGATT
GAGACTTACGGAACATAAAATGCACAAGAGGTGAAAGAAAG
Protein sequenceShow/hide protein sequence
MKLRPTFIRPTINYSGPKPPWFHCFHSPTNLIATSNEVAAIIETVSPFEGALEAIVPHLSPVVITSVIEEQPDPRLGFRLFIWSLRRNHLCCRASQDLIVDRLVKDNAFE
LYWKTLQELKDSAISISSDAFSVLIEAYSKASMDEKAIESFGRISDFECKPNSFAYNLILHVLVRKEAFILALALYNQMLKCNLNPHVVTYSILIHGFCKTSKIKEALEL
FDEMTDRGIFPNQITYSVILSGLCEAKKIDDAHRLFCNMRASGCSPDVITYNVMLNGFCKLGYLDEAFAFLRSFEKDGHILGINGYSCLIDALFKARRYDEAHMWYQRML
QKNVKPDVILYTIMIQGLSQEGYANEALALLGEMPESGLSPDTTCYNAVIKGFCDIGLLDKAQSLQLEISKHDCFPDSHTYSILICGMCKNGLVSEAQHIFNEMEKLGCL
PSVVTFNSLIDGLCKAGRLKEAHLLFYKMEIGRKPSLFLRLSQGVDKVLDSAGLQVRMEQLIESGMILKAYNLLMQLVESGVLPDIRTYNILINGFCKNNNVNGAFKLFK
DMQLKGRLPDSVTYGTIIDGLHRVGRDEDALGIFELMVKNGCKPLSPVYNCIMTWSCRKKKVSQAFSIWMKYLRNFRGWKEENVIVVEESFGKGEIGKAIRRLIEMDLES
KDFDLTPYTIFLVGLCQAGRVSEAIELFYVLKDFKMNIPATSCVMLIGGLCVEEKFGLAMEVFLYTLETGLMLMPRICNQLLEHLLNSEDGKDHAFALVRRMEAFGYDMN
AHLLYSTKLLLYDHWKSLKNKGRYVQRLPIHSRES