CuGenDBv2

MC01g0196 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0196
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: MC01:8291214..8292923
RNA-Seq Expression: MC01g0196
Synteny: MC01g0196
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6595676.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]; e value: 0.0; % identity: 76.44
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------
        T I+FLQSCA+ KNLN+GKQLHS+MITYGFSHSPSSITSLINMYSKCG+MEEA+LVFHDPC+E NVFAYNA+ISG                         
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------

Query:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF
                                             ALVNTYLK+G ME+AQ+VFEEL IRDVVLWNA+INGYAQIGCLDEALE+FRRM IEG+ PSRF
Subjt:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF

Query:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL
        T+TGILSIFAL+G L+NGRTVH IV KMGY+ GVAV NALIDMYGKCKHI DALM+F+ ++EKDIFSWNSIISVHEQ GDHDG LRLFDKMLGSG LPDL
Subjt:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL

Query:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGG---VDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM
        VTVTT+LPACSHLAALMHGREIHGYMIVNG G+DG    +DDLLVNNAVMDMYAKCGSMKNA  VF+ M+NKDVASWNI+IMGYGMHGYGM+ALD+FS M
Subjt:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGG---VDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM

Query:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA
        CEA+IKPDEVTFVGVLSACNHAGFV QGR FLAQME +FGVIPTIEHYTCVIDMLGRAGHL+DAY+LAQ MPIQANP+VWRALLGACRLHGNAELAE+AA
Subjt:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA

Query:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
        +KVMQL+PEHCGSYVLMSNVYGVVGRY EVLEVR TMKEQ+V+KTPGCSWIELKDGVHVFLTGDRTH ELNAL
Subjt:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

XP_022152247.1 pentatricopeptide repeat-containing protein At3g14730-like [Momordica charantia]; e value: 0.0; % identity: 89.12
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------
        TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG                         
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------

Query:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF
                                             ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF
Subjt:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF

Query:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL
        TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL
Subjt:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL

Query:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEA
        VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEA
Subjt:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEA

Query:  RIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKV
        RIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKV
Subjt:  RIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKV

Query:  MQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
        MQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
Subjt:  MQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

XP_022966216.1 pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita maxima]; e value: 0.0; % identity: 76.09
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------
        T I+FLQSCA+ KNLN+GKQLHS+MITYGFSHSPSSITSLINMYSKCG+MEEA+LVFHDPC+E NVFAYNA+ISG                         
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------

Query:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF
                                             ALVNTYLK+G ME+AQ+VFEEL IRDVVLWNA+INGYAQIGCLDEALE+F+RM IEG+ PSRF
Subjt:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF

Query:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL
        T+TGILSIFAL+G L+NGRTVH IV KMGY+ GVAV NALIDMYGKCKHI DALM+F+ ++EKDIFSWNSIISVHEQ GDHDG LRLFDKMLGSG LPDL
Subjt:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL

Query:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGG---VDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM
        VTVTT+LPACSHLAALMHGREIHGYMIVNG G+DG    +DDLLVNNAVMDMYAKCGSM NA  VF+ M+NKDVASWNI+IMGYGMHGYGM+ALD+FS M
Subjt:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGG---VDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM

Query:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA
        CEA+IKPDEVTFVGVLSACNHAGFV QGR+FLAQME +FGVIPTIEHYTCVIDMLGRAGHL+DAY+LAQ MPIQANP+VWRALLGACRLHGNAELAE+AA
Subjt:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA

Query:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
        +KVMQL+PEHCGSYVLMSNVYGVVGRY EVLEVR TMKEQ+V+KTPGCSWIELKDGVHVFLTGDRTH ELNAL
Subjt:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

XP_023517268.1 pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita pepo subsp. pepo]; e value: 0.0; % identity: 76.09
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------
        T I+FLQSCA+ KNLN+GKQLHS+MITYGFSHSPSSITSLINMYSKCG+MEEA+LVFHDPC+E NVFAYNA+ISG                         
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------

Query:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF
                                             ALVNTYLK+G ME+AQ+VFEEL IRDVVLWNA+INGYAQIGCLDEALE+FRRM IEG+ PSRF
Subjt:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF

Query:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL
        T+TGILSIFAL+G L+NGRTVH IV KMGY+ GVAV NAL+DMYGKCKHI DALM+F+ ++EKDIFSWNSIISVHEQ GDHDG LRLFDKMLGSG LPDL
Subjt:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL

Query:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGG---VDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM
        VTVTT+LPACSHLAALMHGREIHGYMIVNG G+DG    +DDLLVNNAVMDMYAKCGSMKNA  VF+ M+NKDVASWNI+IMGYGMHGYGM+ALD+FS M
Subjt:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGG---VDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM

Query:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA
        CEA+IKPDEVTFVGVLSACNHAGFV QGR FLAQME +FGVIPTIEHYTCVIDMLGRAGHL+DAY+LAQ MPI+ANP+VWRALLGACRLHGNAELAE+AA
Subjt:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA

Query:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
        +KVMQL+PEHCGSYVLMSNVYGVVGRY EVLEVR TMKEQ+V+KTPGCSWIELKDGVHVFLTGDRTH ELNAL
Subjt:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

XP_038881250.1 pentatricopeptide repeat-containing protein At3g14730-like [Benincasa hispida]; e value: 0.0; % identity: 79.06
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------
        T I+FLQSCADHKNLN+GKQLHSLMITYGFS SP SITSLINMYSKCGQM EAILVFHDPCHERNVFAYNA+ISG                         
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------

Query:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF
                                             ALVNTYLKIG MENAQKVFEE+SIRDVVLWNA+INGYAQIGCLDEALEVFRRM IEGI PSRF
Subjt:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF

Query:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL
        T+TGIL IFA RGDL+NG+TVH IV KMGY+ GVAV NALIDMYGKCKHIRDAL+IF+MI+EKDIFSWNSIISVHEQ GDHDGTLRLFDKMLGS ILPDL
Subjt:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL

Query:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKD---GGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM
        VT+TTVLPACSHLAA MHGREIHGYMIVNG GKD   G VDDLLVNNAVMDMYAKCGSM NAL VFD MSNKDVASWNI+IMGYGMHGYGM+ALD+FS M
Subjt:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKD---GGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM

Query:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA
        CE   KPDEVT VGVLSACNH GFVSQGRL LAQMES+FGVIPTIEHYTCVIDMLGRAGHL+DAY++ QKMPIQANP+VWRALLGACRLHGNAELAE+AA
Subjt:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA

Query:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
        R+VMQLEPEHCGSYVLMSNVYGV+GR+ EVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
Subjt:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E0R7 pentatricopeptide repeat-containing protein At3g14730-like; e value: 0.0; % identity: 77.23
Query:  ISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG---------------------------
        I+FLQSCADHKNLN+GKQ HSLMITYGFS SP SITSLINMYSKCGQM EAILVF+DPCHERNVFAYNA+ISG                           
Subjt:  ISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG---------------------------

Query:  -----------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTV
                                           ALVNTYLK G ME+AQKVF EL +RDVVLWNA+INGYA+IGCLDEALEVFRRM +EGI P RFT+
Subjt:  -----------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTV

Query:  TGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVT
        TGILSIFA RGDL+NG+TVH IV KMGY+ GVAV NALIDMYGKCKHI DAL+IF+MI+EKDIFSWNSIISVHEQ GDH GTLRLFDKMLGSGILPDLVT
Subjt:  TGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVT

Query:  VTTVLPACSHLAALMHGREIHGYMIVNGFGKD---GGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCE
        +TTVLPACSHLAALM GREIHGYMI+NGFGKD   G +DDL V+NAVMDMYAKCGSM NAL +FD MSNKDVASWNI+IMGYGMHGY +EALD+FS MCE
Subjt:  VTTVLPACSHLAALMHGREIHGYMIVNGFGKD---GGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCE

Query:  ARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARK
        A  KPDEVT VGVLSACNHAGFVSQGRLF AQMES FGVIPTIEHYTCVIDMLGRAGHL+DAYE+AQKMPIQANP+VWRALLGACRLHGNAELAEVAAR+
Subjt:  ARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARK

Query:  VMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
        V+QLEPEHCGSYVLMSNVYGV+GRY EVLEVRKTMKEQNVKKTPGCSWIELKDG+HVF TGDRTHSELNAL
Subjt:  VMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

A0A5A7TK17 Pentatricopeptide repeat-containing protein; e value: 0.0; % identity: 77.23
Query:  ISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG---------------------------
        I+FLQSCADHKNLN+GKQ HSLMITYGFS SP SITSLINMYSKCGQM EAILVF+DPCHERNVFAYNA+ISG                           
Subjt:  ISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG---------------------------

Query:  -----------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTV
                                           ALVNTYLK G ME+AQKVF EL +RDVVLWNA+INGYA+IGCLDEALEVFRRM +EGI P RFT+
Subjt:  -----------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTV

Query:  TGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVT
        TGILSIFA RGDL+NG+TVH IV KMGY+ GVAV NALIDMYGKCKHI DAL+IF+MI+EKDIFSWNSIISVHEQ GDH GTLRLFDKMLGSGILPDLVT
Subjt:  TGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVT

Query:  VTTVLPACSHLAALMHGREIHGYMIVNGFGKD---GGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCE
        +TTVLPACSHLAALM GREIHGYMI+NGFGKD   G +DDL V+NAVMDMYAKCGSM NAL +FD MSNKDVASWNI+IMGYGMHGY +EALD+FS MCE
Subjt:  VTTVLPACSHLAALMHGREIHGYMIVNGFGKD---GGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCE

Query:  ARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARK
        A  KPDEVT VGVLSACNHAGFVSQGRLF AQMES FGVIPTIEHYTCVIDMLGRAGHL+DAYE+AQKMPIQANP+VWRALLGACRLHGNAELAEVAAR+
Subjt:  ARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARK

Query:  VMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
        V+QLEPEHCGSYVLMSNVYGV+GRY EVLEVRKTMKEQNVKKTPGCSWIELKDG+HVF TGDRTHSELNAL
Subjt:  VMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

A0A6J1DFN8 pentatricopeptide repeat-containing protein At3g14730-like; e value: 0.0; % identity: 89.12
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------
        TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG                         
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------

Query:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF
                                             ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF
Subjt:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF

Query:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL
        TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL
Subjt:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL

Query:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEA
        VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEA
Subjt:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEA

Query:  RIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKV
        RIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKV
Subjt:  RIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKV

Query:  MQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
        MQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
Subjt:  MQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

A0A6J1HME0 pentatricopeptide repeat-containing protein At3g14730-like isoform X2; e value: 0.0; % identity: 76.09
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------
        T I+FLQSCA+ KNLN+GKQLHS+MITYGFSHSPSSITSLINMYSKCG+MEEA+LVFHDPC+E NVFAYNA+ISG                         
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------

Query:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF
                                             ALVNTYLK+G ME+AQ+VFEEL IRDVVLWNA+INGYAQIGCLDEALE+F+RM IEG+ PSRF
Subjt:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF

Query:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL
        T+TGILSIFAL+G L+NGRTVH IV KMGY+ GVAV NALIDMYGKCKHI DALM+F+ ++EKDIFSWNSIISVHEQ GDHDG LRLFDKMLGSG LPDL
Subjt:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL

Query:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGG---VDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM
        VTVTT+LPACSHLAALMHGREIHGYMIVNG G+DG    +DDLLVNNAVMDMYAKCGSM NA  VF+ M+NKDVASWNI+IMGYGMHGYGM+ALD+FS M
Subjt:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGG---VDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM

Query:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA
        CEA+IKPDEVTFVGVLSACNHAGFV QGR+FLAQME +FGVIPTIEHYTCVIDMLGRAGHL+DAY+LAQ MPIQANP+VWRALLGACRLHGNAELAE+AA
Subjt:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA

Query:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
        +KVMQL+PEHCGSYVLMSNVYGVVGRY EVLEVR TMKEQ+V+KTPGCSWIELKDGVHVFLTGDRTH ELNAL
Subjt:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

A0A6J1HNR1 pentatricopeptide repeat-containing protein At3g14730-like isoform X1; e value: 0.0; % identity: 76.09
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------
        T I+FLQSCA+ KNLN+GKQLHS+MITYGFSHSPSSITSLINMYSKCG+MEEA+LVFHDPC+E NVFAYNA+ISG                         
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISG-------------------------

Query:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF
                                             ALVNTYLK+G ME+AQ+VFEEL IRDVVLWNA+INGYAQIGCLDEALE+F+RM IEG+ PSRF
Subjt:  -------------------------------------ALVNTYLKIGLMENAQKVFEELSIRDVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRF

Query:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL
        T+TGILSIFAL+G L+NGRTVH IV KMGY+ GVAV NALIDMYGKCKHI DALM+F+ ++EKDIFSWNSIISVHEQ GDHDG LRLFDKMLGSG LPDL
Subjt:  TVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDL

Query:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGG---VDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM
        VTVTT+LPACSHLAALMHGREIHGYMIVNG G+DG    +DDLLVNNAVMDMYAKCGSM NA  VF+ M+NKDVASWNI+IMGYGMHGYGM+ALD+FS M
Subjt:  VTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGG---VDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCM

Query:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA
        CEA+IKPDEVTFVGVLSACNHAGFV QGR+FLAQME +FGVIPTIEHYTCVIDMLGRAGHL+DAY+LAQ MPIQANP+VWRALLGACRLHGNAELAE+AA
Subjt:  CEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAA

Query:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
        +KVMQL+PEHCGSYVLMSNVYGVVGRY EVLEVR TMKEQ+V+KTPGCSWIELKDGVHVFLTGDRTH ELNAL
Subjt:  RKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

SwissProt top hits (e value, % identity, alignment)
Q9LFL5 Pentatricopeptide repeat-containing protein At5g16860; e value: 4.1e-103; % identity: 39.85
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR
        T ++ L  CA     + GKQLH   +T     +      L++MY+KCG M+EA  VF +    ++V ++NA+++G     Y +IG  E+A ++FE++   
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR

Query:  ----DVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAI-------VTKMGYNLGVAVLNALIDMYGKCKHIR
            DVV W+A I+GYAQ G   EAL V R+M   GI P+  T+  +LS  A  G L +G+ +H         + K G+     V+N LIDMY KCK + 
Subjt:  ----DVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAI-------VTKMGYNLGVAVLNALIDMYGKCKHIR

Query:  DALMIFKMID--EKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSG--ILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVM
         A  +F  +   E+D+ +W  +I  + Q+GD +  L L  +M        P+  T++  L AC+ LAAL  G++IH Y + N          L V+N ++
Subjt:  DALMIFKMID--EKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSG--ILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVM

Query:  DMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTC
        DMYAKCGS+ +A +VFD M  K+  +W  ++ GYGMHGYG EAL IF  M     K D VT + VL AC+H+G + QG  +  +M++ FGV P  EHY C
Subjt:  DMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTC

Query:  VIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSW
        ++D+LGRAG L  A  L ++MP++  P+VW A L  CR+HG  EL E AA K+ +L   H GSY L+SN+Y   GR+ +V  +R  M+ + VKK PGCSW
Subjt:  VIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSW

Query:  IELKDGVHVFLTGDRTH
        +E   G   F  GD+TH
Subjt:  IELKDGVHVFLTGDRTH

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic; e value: 5.0e-109; % identity: 39.84
Query:  LQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIRDVVLW
        L+SCA  K    G+Q+H  ++  G        TSLI+MY + G++E+A  VF D    R+V +Y ALI G     Y   G +ENAQK+F+E+ ++DVV W
Subjt:  LQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIRDVVLW

Query:  NALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFS
        NA+I+GYA+ G   EALE+F+ M    + P   T+  ++S  A  G +  GR VH  +   G+   + ++NALID+Y KC  +  A  +F+ +  KD+ S
Subjt:  NALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFS

Query:  WNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMS
        WN++I  +     +   L LF +ML SG  P+ VT+ ++LPAC+HL A+  GR IH Y+        G  +   +  +++DMYAKCG ++ A  VF+ + 
Subjt:  WNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMS

Query:  NKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQK
        +K ++SWN +I G+ MHG    + D+FS M +  I+PD++TFVG+LSAC+H+G +  GR     M  ++ + P +EHY C+ID+LG +G  K+A E+   
Subjt:  NKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQK

Query:  MPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTH
        M ++ + ++W +LL AC++HGN EL E  A  ++++EPE+ GSYVL+SN+Y   GR+ EV + R  + ++ +KK PGCS IE+   VH F+ GD+ H
Subjt:  MPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTH

Q9LUC2 Pentatricopeptide repeat-containing protein At3g14730; e value: 1.9e-161; % identity: 50.17
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGF-SHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVN--------------------
        T I+ LQ CA  K+   G+Q+H  M+  GF   SP + TSL+NMY+KCG M  A+LVF     ER+VF YNALISG +VN                    
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGF-SHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVN--------------------

Query:  --------------------------------------------TYLKIGLMENAQKVFEELSIR-DVVLWNALINGYAQIGCLDEALEVFRRMSIEGIL
                                                    +Y K   +E+AQKVF+EL  R D VLWNAL+NGY+QI   ++AL VF +M  EG+ 
Subjt:  --------------------------------------------TYLKIGLMENAQKVFEELSIR-DVVLWNALINGYAQIGCLDEALEVFRRMSIEGIL

Query:  PSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGI
         SR T+T +LS F + GD++NGR++H +  K G    + V NALIDMYGK K + +A  IF+ +DE+D+F+WNS++ VH+  GDHDGTL LF++ML SGI
Subjt:  PSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGI

Query:  LPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSC
         PD+VT+TTVLP C  LA+L  GREIHGYMIV+G       ++  ++N++MDMY KCG +++A MVFD M  KD ASWNI+I GYG+   G  ALD+FSC
Subjt:  LPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSC

Query:  MCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVA
        MC A +KPDE+TFVG+L AC+H+GF+++GR FLAQME+ + ++PT +HY CVIDMLGRA  L++AYELA   PI  NP+VWR++L +CRLHGN +LA VA
Subjt:  MCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVA

Query:  ARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
         +++ +LEPEHCG YVLMSNVY   G+Y EVL+VR  M++QNVKKTPGCSWI LK+GVH F TG++TH E  ++
Subjt:  ARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070; e value: 1.0e-101; % identity: 35.91
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR
        T  + L S A  + +  GK++HS ++  G   + S   SL+NMY+KCG    A  VF D    R++ ++NA+I+      ++++G M+ A   FE+++ R
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR

Query:  DVVLWNALINGYAQIGCLDEALEVFRRMSIEGIL-PSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHI-----------
        D+V WN++I+G+ Q G    AL++F +M  + +L P RFT+  +LS  A    L  G+ +H+ +   G+++   VLNALI MY +C  +           
Subjt:  DVVLWNALINGYAQIGCLDEALEVFRRMSIEGIL-PSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHI-----------

Query:  ----------------------RDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIV
                                A  IF  + ++D+ +W ++I  +EQ+G +   + LF  M+G G  P+  T+  +L   S LA+L HG++IHG  + 
Subjt:  ----------------------RDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIV

Query:  NGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLM-SNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRL
            K G +  + V+NA++ MYAK G++ +A   FDL+   +D  SW  +I+    HG+  EAL++F  M    ++PD +T+VGV SAC HAG V+QGR 
Subjt:  NGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLM-SNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRL

Query:  FLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEV
        +   M+    +IPT+ HY C++D+ GRAG L++A E  +KMPI+ + + W +LL ACR+H N +L +VAA +++ LEPE+ G+Y  ++N+Y   G++ E 
Subjt:  FLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEV

Query:  LEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
         ++RK+MK+  VKK  G SWIE+K  VHVF   D TH E N +
Subjt:  LEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic; e value: 2.7e-102; % identity: 38.42
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR
        T +S    CAD + ++ G+ +HS+ +   FS       +L++MYSKCG                                      +++A+ VF E+S R
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR

Query:  DVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDE
         VV + ++I GYA+ G   EA+++F  M  EGI P  +TVT +L+  A    L+ G+ VH  + +      + V NAL+DMY KC  +++A ++F  +  
Subjt:  DVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDE

Query:  KDIFSWNSIISVHEQYGDHDGTLRLFDKML-GSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALM
        KDI SWN+II  + +    +  L LF+ +L      PD  TV  VLPAC+ L+A   GREIHGY++ NG+       D  V N+++DMYAKCG++  A M
Subjt:  KDIFSWNSIISVHEQYGDHDGTLRLFDKML-GSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALM

Query:  VFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDA
        +FD +++KD+ SW ++I GYGMHG+G EA+ +F+ M +A I+ DE++FV +L AC+H+G V +G  F   M  E  + PT+EHY C++DML R G L  A
Subjt:  VFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDA

Query:  YELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGD
        Y   + MPI  +  +W ALL  CR+H + +LAE  A KV +LEPE+ G YVLM+N+Y    ++ +V  +RK + ++ ++K PGCSWIE+K  V++F+ GD
Subjt:  YELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGD

Query:  RTHSE
         ++ E
Subjt:  RTHSE

Arabidopsis top hits (e value, % identity, alignment)
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value: 3.6e-110; % identity: 39.84
Query:  LQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIRDVVLW
        L+SCA  K    G+Q+H  ++  G        TSLI+MY + G++E+A  VF D    R+V +Y ALI G     Y   G +ENAQK+F+E+ ++DVV W
Subjt:  LQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIRDVVLW

Query:  NALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFS
        NA+I+GYA+ G   EALE+F+ M    + P   T+  ++S  A  G +  GR VH  +   G+   + ++NALID+Y KC  +  A  +F+ +  KD+ S
Subjt:  NALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFS

Query:  WNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMS
        WN++I  +     +   L LF +ML SG  P+ VT+ ++LPAC+HL A+  GR IH Y+        G  +   +  +++DMYAKCG ++ A  VF+ + 
Subjt:  WNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMS

Query:  NKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQK
        +K ++SWN +I G+ MHG    + D+FS M +  I+PD++TFVG+LSAC+H+G +  GR     M  ++ + P +EHY C+ID+LG +G  K+A E+   
Subjt:  NKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQK

Query:  MPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTH
        M ++ + ++W +LL AC++HGN EL E  A  ++++EPE+ GSYVL+SN+Y   GR+ EV + R  + ++ +KK PGCS IE+   VH F+ GD+ H
Subjt:  MPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTH

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein; e value: 7.2e-103; % identity: 35.91
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR
        T  + L S A  + +  GK++HS ++  G   + S   SL+NMY+KCG    A  VF D    R++ ++NA+I+      ++++G M+ A   FE+++ R
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR

Query:  DVVLWNALINGYAQIGCLDEALEVFRRMSIEGIL-PSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHI-----------
        D+V WN++I+G+ Q G    AL++F +M  + +L P RFT+  +LS  A    L  G+ +H+ +   G+++   VLNALI MY +C  +           
Subjt:  DVVLWNALINGYAQIGCLDEALEVFRRMSIEGIL-PSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHI-----------

Query:  ----------------------RDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIV
                                A  IF  + ++D+ +W ++I  +EQ+G +   + LF  M+G G  P+  T+  +L   S LA+L HG++IHG  + 
Subjt:  ----------------------RDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIV

Query:  NGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLM-SNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRL
            K G +  + V+NA++ MYAK G++ +A   FDL+   +D  SW  +I+    HG+  EAL++F  M    ++PD +T+VGV SAC HAG V+QGR 
Subjt:  NGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLM-SNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRL

Query:  FLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEV
        +   M+    +IPT+ HY C++D+ GRAG L++A E  +KMPI+ + + W +LL ACR+H N +L +VAA +++ LEPE+ G+Y  ++N+Y   G++ E 
Subjt:  FLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEV

Query:  LEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
         ++RK+MK+  VKK  G SWIE+K  VHVF   D TH E N +
Subjt:  LEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

AT3G14730.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.4e-162 | %identity: 50.17
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGF-SHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVN--------------------
        T I+ LQ CA  K+   G+Q+H  M+  GF   SP + TSL+NMY+KCG M  A+LVF     ER+VF YNALISG +VN                    
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGF-SHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVN--------------------

Query:  --------------------------------------------TYLKIGLMENAQKVFEELSIR-DVVLWNALINGYAQIGCLDEALEVFRRMSIEGIL
                                                    +Y K   +E+AQKVF+EL  R D VLWNAL+NGY+QI   ++AL VF +M  EG+ 
Subjt:  --------------------------------------------TYLKIGLMENAQKVFEELSIR-DVVLWNALINGYAQIGCLDEALEVFRRMSIEGIL

Query:  PSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGI
         SR T+T +LS F + GD++NGR++H +  K G    + V NALIDMYGK K + +A  IF+ +DE+D+F+WNS++ VH+  GDHDGTL LF++ML SGI
Subjt:  PSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSGI

Query:  LPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSC
         PD+VT+TTVLP C  LA+L  GREIHGYMIV+G       ++  ++N++MDMY KCG +++A MVFD M  KD ASWNI+I GYG+   G  ALD+FSC
Subjt:  LPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSC

Query:  MCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVA
        MC A +KPDE+TFVG+L AC+H+GF+++GR FLAQME+ + ++PT +HY CVIDMLGRA  L++AYELA   PI  NP+VWR++L +CRLHGN +LA VA
Subjt:  MCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVA

Query:  ARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
         +++ +LEPEHCG YVLMSNVY   G+Y EVL+VR  M++QNVKKTPGCSWI LK+GVH F TG++TH E  ++
Subjt:  ARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.9e-103 | %identity: 38.42
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR
        T +S    CAD + ++ G+ +HS+ +   FS       +L++MYSKCG                                      +++A+ VF E+S R
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR

Query:  DVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDE
         VV + ++I GYA+ G   EA+++F  M  EGI P  +TVT +L+  A    L+ G+ VH  + +      + V NAL+DMY KC  +++A ++F  +  
Subjt:  DVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDE

Query:  KDIFSWNSIISVHEQYGDHDGTLRLFDKML-GSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALM
        KDI SWN+II  + +    +  L LF+ +L      PD  TV  VLPAC+ L+A   GREIHGY++ NG+       D  V N+++DMYAKCG++  A M
Subjt:  KDIFSWNSIISVHEQYGDHDGTLRLFDKML-GSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALM

Query:  VFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDA
        +FD +++KD+ SW ++I GYGMHG+G EA+ +F+ M +A I+ DE++FV +L AC+H+G V +G  F   M  E  + PT+EHY C++DML R G L  A
Subjt:  VFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDA

Query:  YELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGD
        Y   + MPI  +  +W ALL  CR+H + +LAE  A KV +LEPE+ G YVLM+N+Y    ++ +V  +RK + ++ ++K PGCSWIE+K  V++F+ GD
Subjt:  YELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGD

Query:  RTHSE
         ++ E
Subjt:  RTHSE

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 2.9e-104 | %identity: 39.85
Query:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR
        T ++ L  CA     + GKQLH   +T     +      L++MY+KCG M+EA  VF +    ++V ++NA+++G     Y +IG  E+A ++FE++   
Subjt:  TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIR

Query:  ----DVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAI-------VTKMGYNLGVAVLNALIDMYGKCKHIR
            DVV W+A I+GYAQ G   EAL V R+M   GI P+  T+  +LS  A  G L +G+ +H         + K G+     V+N LIDMY KCK + 
Subjt:  ----DVVLWNALINGYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAI-------VTKMGYNLGVAVLNALIDMYGKCKHIR

Query:  DALMIFKMID--EKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSG--ILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVM
         A  +F  +   E+D+ +W  +I  + Q+GD +  L L  +M        P+  T++  L AC+ LAAL  G++IH Y + N          L V+N ++
Subjt:  DALMIFKMID--EKDIFSWNSIISVHEQYGDHDGTLRLFDKMLGSG--ILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVM

Query:  DMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTC
        DMYAKCGS+ +A +VFD M  K+  +W  ++ GYGMHGYG EAL IF  M     K D VT + VL AC+H+G + QG  +  +M++ FGV P  EHY C
Subjt:  DMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALDIFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTC

Query:  VIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSW
        ++D+LGRAG L  A  L ++MP++  P+VW A L  CR+HG  EL E AA K+ +L   H GSY L+SN+Y   GR+ +V  +R  M+ + VKK PGCSW
Subjt:  VIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQLEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSW

Query:  IELKDGVHVFLTGDRTH
        +E   G   F  GD+TH
Subjt:  IELKDGVHVFLTGDRTH


Sequences
CDS sequence
ACATCCATTTCATTTCTACAATCATGTGCTGACCACAAGAATCTCAACAGAGGAAAACAGCTTCACTCCCTCATGATCACCTATGGTTTTTCTCATTCACCTTCATCCAT
CACTAGCTTAATCAACATGTACTCCAAATGTGGTCAAATGGAGGAGGCCATTTTGGTTTTCCATGATCCATGTCACGAGCGTAATGTGTTTGCATATAATGCTTTGATTT
CTGGTGCTTTAGTAAATACTTACTTAAAGATTGGCTTAATGGAGAATGCACAAAAAGTATTTGAAGAACTTTCAATAAGAGATGTTGTGCTTTGGAATGCACTGATCAAT
GGGTATGCCCAGATTGGTTGCCTTGATGAGGCATTGGAAGTTTTCAGAAGAATGAGTATAGAAGGGATTTTACCTAGTAGGTTTACAGTTACTGGCATTTTATCTATTTT
TGCTTTAAGGGGAGATTTAAACAATGGGAGAACAGTTCATGCAATTGTGACAAAAATGGGTTATAATTTGGGAGTTGCAGTTTTGAACGCGCTAATTGATATGTATGGGA
AATGCAAGCATATCAGAGATGCTCTAATGATTTTCAAGATGATTGATGAGAAGGACATATTCTCATGGAATTCGATTATATCGGTTCATGAACAATATGGTGATCACGAT
GGTACCTTGAGGCTTTTTGATAAGATGTTAGGGTCCGGGATTCTACCTGATTTAGTAACCGTCACAACTGTGCTTCCAGCTTGCTCTCATTTGGCTGCCCTCATGCATGG
TAGAGAAATCCATGGATATATGATTGTTAATGGATTTGGAAAGGATGGAGGTGTAGATGATTTGCTTGTAAATAATGCTGTTATGGATATGTATGCAAAATGTGGAAGTA
TGAAAAATGCCCTCATGGTTTTTGATCTAATGAGCAATAAGGATGTGGCATCATGGAACATTGTGATTATGGGTTATGGCATGCATGGATATGGCATGGAGGCATTGGAT
ATATTTTCTTGCATGTGCGAGGCCCGAATTAAGCCAGATGAAGTTACGTTTGTTGGAGTTTTATCGGCATGCAATCATGCAGGTTTCGTGAGTCAAGGGCGTTTGTTTTT
AGCTCAAATGGAGTCTGAATTCGGTGTTATTCCAACTATTGAGCATTATACGTGTGTAATTGATATGCTCGGTCGAGCTGGGCATCTGAAGGACGCTTATGAGTTGGCCC
AAAAAATGCCTATTCAAGCCAATCCCATTGTGTGGAGGGCTCTATTAGGAGCATGTCGACTTCATGGGAATGCAGAGTTGGCTGAAGTTGCTGCACGAAAAGTAATGCAA
CTTGAACCAGAGCATTGTGGGAGTTATGTATTGATGTCTAACGTTTATGGTGTTGTAGGTCGATACGGAGAGGTCTTAGAGGTTAGAAAAACAATGAAGGAACAAAATGT
TAAGAAGACACCAGGTTGTAGTTGGATTGAACTCAAGGATGGGGTGCATGTTTTTCTTACTGGAGATCGGACACATTCAGAATTGAATGCATTG
mRNA sequence
ACATCCATTTCATTTCTACAATCATGTGCTGACCACAAGAATCTCAACAGAGGAAAACAGCTTCACTCCCTCATGATCACCTATGGTTTTTCTCATTCACCTTCATCCAT
CACTAGCTTAATCAACATGTACTCCAAATGTGGTCAAATGGAGGAGGCCATTTTGGTTTTCCATGATCCATGTCACGAGCGTAATGTGTTTGCATATAATGCTTTGATTT
CTGGTGCTTTAGTAAATACTTACTTAAAGATTGGCTTAATGGAGAATGCACAAAAAGTATTTGAAGAACTTTCAATAAGAGATGTTGTGCTTTGGAATGCACTGATCAAT
GGGTATGCCCAGATTGGTTGCCTTGATGAGGCATTGGAAGTTTTCAGAAGAATGAGTATAGAAGGGATTTTACCTAGTAGGTTTACAGTTACTGGCATTTTATCTATTTT
TGCTTTAAGGGGAGATTTAAACAATGGGAGAACAGTTCATGCAATTGTGACAAAAATGGGTTATAATTTGGGAGTTGCAGTTTTGAACGCGCTAATTGATATGTATGGGA
AATGCAAGCATATCAGAGATGCTCTAATGATTTTCAAGATGATTGATGAGAAGGACATATTCTCATGGAATTCGATTATATCGGTTCATGAACAATATGGTGATCACGAT
GGTACCTTGAGGCTTTTTGATAAGATGTTAGGGTCCGGGATTCTACCTGATTTAGTAACCGTCACAACTGTGCTTCCAGCTTGCTCTCATTTGGCTGCCCTCATGCATGG
TAGAGAAATCCATGGATATATGATTGTTAATGGATTTGGAAAGGATGGAGGTGTAGATGATTTGCTTGTAAATAATGCTGTTATGGATATGTATGCAAAATGTGGAAGTA
TGAAAAATGCCCTCATGGTTTTTGATCTAATGAGCAATAAGGATGTGGCATCATGGAACATTGTGATTATGGGTTATGGCATGCATGGATATGGCATGGAGGCATTGGAT
ATATTTTCTTGCATGTGCGAGGCCCGAATTAAGCCAGATGAAGTTACGTTTGTTGGAGTTTTATCGGCATGCAATCATGCAGGTTTCGTGAGTCAAGGGCGTTTGTTTTT
AGCTCAAATGGAGTCTGAATTCGGTGTTATTCCAACTATTGAGCATTATACGTGTGTAATTGATATGCTCGGTCGAGCTGGGCATCTGAAGGACGCTTATGAGTTGGCCC
AAAAAATGCCTATTCAAGCCAATCCCATTGTGTGGAGGGCTCTATTAGGAGCATGTCGACTTCATGGGAATGCAGAGTTGGCTGAAGTTGCTGCACGAAAAGTAATGCAA
CTTGAACCAGAGCATTGTGGGAGTTATGTATTGATGTCTAACGTTTATGGTGTTGTAGGTCGATACGGAGAGGTCTTAGAGGTTAGAAAAACAATGAAGGAACAAAATGT
TAAGAAGACACCAGGTTGTAGTTGGATTGAACTCAAGGATGGGGTGCATGTTTTTCTTACTGGAGATCGGACACATTCAGAATTGAATGCATTG
Protein sequence
TSISFLQSCADHKNLNRGKQLHSLMITYGFSHSPSSITSLINMYSKCGQMEEAILVFHDPCHERNVFAYNALISGALVNTYLKIGLMENAQKVFEELSIRDVVLWNALIN
GYAQIGCLDEALEVFRRMSIEGILPSRFTVTGILSIFALRGDLNNGRTVHAIVTKMGYNLGVAVLNALIDMYGKCKHIRDALMIFKMIDEKDIFSWNSIISVHEQYGDHD
GTLRLFDKMLGSGILPDLVTVTTVLPACSHLAALMHGREIHGYMIVNGFGKDGGVDDLLVNNAVMDMYAKCGSMKNALMVFDLMSNKDVASWNIVIMGYGMHGYGMEALD
IFSCMCEARIKPDEVTFVGVLSACNHAGFVSQGRLFLAQMESEFGVIPTIEHYTCVIDMLGRAGHLKDAYELAQKMPIQANPIVWRALLGACRLHGNAELAEVAARKVMQ
LEPEHCGSYVLMSNVYGVVGRYGEVLEVRKTMKEQNVKKTPGCSWIELKDGVHVFLTGDRTHSELNAL
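
The CDS and protein sequences listed above can be cross-checked by translating the CDS in frame. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the variable names are hypothetical and not part of the database. Only the first 54 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids."""
    # Drop whitespace and line breaks so a pasted multi-line block works.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 54 nucleotides of the CDS and first 18 residues of the protein listed above.
cds_prefix = "ACATCCATTTCATTTCTACAATCATGTGCTGACCACAAGAATCTCAACAGAGGA"
protein_prefix = "TSISFLQSCADHKNLNRG"

print(translate(cds_prefix) == protein_prefix)  # prints True
```

Pasting the full CDS block into `translate` should give a string that can be compared against the full protein sequence in the same way.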