CuGenDBv2

Lsi09G017040 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi09G017040
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr09:25541845..25550476
RNA-Seq Expression: Lsi09G017040
Synteny: Lsi09G017040
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG7013843.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 83.23
Query:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG
        MRHG GGF++MESRAT TLS+LADLLLVASITKTLSESGTRTLQH+SL +SEPLLLQIL SRSVHPSNKLDFFKWCSL+PNF+HSPSTYSQIF ILCRSG
Subjt:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG

Query:  YLHEVPLLLSSMKRDG--VAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATS
        YLHEVPLLLSSMKRDG  V VDS TFKVLLDAFIRSGK+DAALEILDHME+LGTSLELNTYNSVLVAL+RKNQVGLALSIFFKLFDAF+ GGQEG+A  S
Subjt:  YLHEVPLLLSSMKRDG--VAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATS

Query:  FPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VK
        F FLPN+LACNELLVALRKSDMR EFKKVFDKLR IRSFEFN                                                        V 
Subjt:  FPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VK

Query:  DALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFR
        DALIVWEELKGSGHEPDAFTYR+IIQGCCKSYRMDDAT IFNEMEYNGF PDTIVYNSLLDGLFKAR+V EACQ FDKMVQEGVRASPWTYNILIDGLFR
Subjt:  DALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFR

Query:  NGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANM
        NGRAEASY+LFCDLKKKGQFVDGVTYSII+LQLCKEGL EEALQLVEEMEARGFV+DL+TVTSLLIAMH+QGQWEGLERLMKHIREGDLVPNVLKWKANM
Subjt:  NGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANM

Query:  EDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVN
        EDS+KYQ+NKRKDY+ LFSPKEDLSEIISSRASSV KVN+DD  E TEE+D D+WSSSPHVDLLANLAKSTGD LQPFSLS GQR+QAKGDNSFDI+MVN
Subjt:  EDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVN

Query:  TFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLD
        TFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLS+FVKKGYFHQAWGIFNEMGEKVCPADIATYN+IIQGLGKMGRADLASSVLEKLME+GGYLD
Subjt:  TFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLD

Query:  IVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN
        IVMYNTL+NALGKAGRMDDVNKLF+QM++SGINPDVV+FNTLIEVHSKAGRFKDAY FLKMMLDSGCSPNHVTDT LDFLGREIEK++      +  KN
Subjt:  IVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN

XP_008459805.1 PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Cucumis melo] | e value: 0.0e+00 | % identity: 82.27
Query:  MRHG--RGGFLSME--SRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHIL
        MRHG  R  FL +E  SR  STLSQL+DLLLVASITKTLSESGTRTLQH SLP+S PLLLQILHSRS++PS+KLDFFKWCSL PNFNHSPSTYSQIFHIL
Subjt:  MRHG--RGGFLSME--SRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHIL

Query:  CRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAA
        CRSGYLHEVP LL SMKRDGV+VDS TFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFD  NNGGQ+ +AA
Subjt:  CRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAA

Query:  TSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------
        TSF FLPNSLACNELLVALRK DMRVEF+KVFDKLRAI +FEFN                                                        
Subjt:  TSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------

Query:  VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGL
        VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDAT IFNEMEYNG  PD IVYNSLL+GLFKARKVTEACQ FDKMVQE VRASPWTYNILIDGL
Subjt:  VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGL

Query:  FRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKA
        FRNGRAEA Y+LFCDLKKKGQFVDGVTYSII+LQLCKEGL EEALQLVEEMEARGFVVDLIT+TSLLIAMH+QGQWEGLERLMKHIREGDLVPNVLKWK 
Subjt:  FRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKA

Query:  NMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINM
        NMEDSIKYQ+NKR+D++SLFSPKEDL E+ISSRASS  +VNID++ E TEE D D WSSSPHVD LANLA ST D LQPFSL QG+RIQ KG+NSFDINM
Subjt:  NMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINM

Query:  VNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY
        VNTFLSIFLAKGKL+LACKLFEIFSDMGVNPV+YTYNSMLSSFVKKGYFHQAWGIFNEMGE VCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY
Subjt:  VNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY

Query:  LDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPK
        LDIVMYNTLINALGKAGRMDDVNKLFDQM+NSGINPDVVTFNTLIEVHSKAGRFKDAY FLKMMLDSGCSPNHVTDTTLDFLGREIEK++      +  K
Subjt:  LDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPK

Query:  N
        N
Subjt:  N

XP_022929794.1 pentatricopeptide repeat-containing protein At4g01570 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 83.31
Query:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG
        MRHG GGF++MESRAT TLS+LADLLLVASITKTLSESGTRTLQH+SL +SEPLLLQIL SRSVHPSNKLDFFKWCSL+PNF+HSPSTYSQIF ILCRSG
Subjt:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG

Query:  YLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFP
        YLHEVPLLLSSMKRDGV VDS TFKVLLDAFIRSGK+D AL+ILDHME+LGTSLELNTYNSVLVAL+RKNQVGLALSIFFKLFDAF+ GGQEG+A  SF 
Subjt:  YLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFP

Query:  FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VKDA
        FLPN+LACNELLVALRKSDMRVEFKKVFDKLR IRSFEFN                                                        V DA
Subjt:  FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VKDA

Query:  LIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNG
        LIVWEELKGSGHEPDAFTYR+IIQGCCKSYRMDDAT IFNEMEYNGF PDTIVYNSLLDGLFKAR+V EACQ FDKMVQEGVRASPWTYNILIDGLFRNG
Subjt:  LIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNG

Query:  RAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMED
        RAEASY+LFCDLKKKGQFVDGVTYSII+LQLCKEGL EEALQLVEEMEARGFV+DL+TVTSLLIAMH+QGQWEGLERLMKHIREGDLVPNVLKWKANMED
Subjt:  RAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMED

Query:  SIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTF
        S+KYQ+NKRKDY+ LFSPKEDLSEIISSRASSV KV  DD  E TEE+D D+WSSSPHVDLLANLAKSTGD LQPFSLS GQR+QAKGDNSFDI+MVNTF
Subjt:  SIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTF

Query:  LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV
        LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLS+FVKKGYF QAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV
Subjt:  LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV

Query:  MYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN
        MYNTL+NALGKAGRMDDVNKLF+QM++SGINPDVV+FNTLIEVHSKAGRFKDAY FLKMMLDSGCSPNHVTDT LDFLGREIEK++      +  KN
Subjt:  MYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN

XP_022992119.1 pentatricopeptide repeat-containing protein At4g01570 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 82.06
Query:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG
        MRHG GGF++MESRAT TLS+LADLLLVASITKTLSESGTRTLQH+SL +SEPLLLQIL SRSVHPSNKLDFFKWCSL+PNF+HS STYSQIF ILCRSG
Subjt:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG

Query:  YLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFP
        Y HEVPLLLSSMKRDGV VDS TFKVLLDAFIRSGK+DAALEILDHME+LGTSLELNTYNSVLVAL+RKNQVGLALSIFFKLFDAF+ GGQEG+A  SF 
Subjt:  YLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFP

Query:  FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VKDA
        FLPN+LACNELLVALRKSDMRVEFK VFDKLR IRSFEFN                                                        V DA
Subjt:  FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VKDA

Query:  LIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNG
        LIVWEELKGSGHEPDAFTYR+IIQGCCKSYRMDDAT IFNEMEYNGF P+TIVYNSLLDGLFKAR+V EACQ FDKMVQ+GVRASPWTYNILIDGLFRNG
Subjt:  LIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNG

Query:  RAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMED
        RAEASYSLFCDLKKKGQFVDGVTYSII+LQLCKEGL EEALQLVEEMEARGFV+DL+TVTSLLIAM++QGQWEGLERLMKHIREGDLVPNVLKWKANMED
Subjt:  RAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMED

Query:  SIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTF
        S+KYQ+NKRKDY+ LFSPKEDLSEIISSRA+SV KVN+DD  E TEE+D D+WSSSPHVDLLAN AKSTGD LQ FSLS GQR+Q+KG+NSFDI+MVNTF
Subjt:  SIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTF

Query:  LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV
        LSIFLAKGKLSLACKLF+IFSDMGVNPVRYTYNSMLS+FVKKGYFHQAWGIFNEMGEKVCPADIATYN+IIQGLGKMGRADLASSVLEKLMEQGGYLDIV
Subjt:  LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV

Query:  MYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN
        MYNTL+NALGKAGRMDDVNKLF+QM++SGI PDVV+FNTLIEVHSKAGRFKDAY +LKMMLDSGCSPNHVTDT LDFLGREIEK++      +  KN
Subjt:  MYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN

XP_023549441.1 pentatricopeptide repeat-containing protein At4g01570 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 83.06
Query:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG
        MRH  GGF++MESRAT TLS+LADLLLVASITKTLSESGTRTLQH+SL +SEPLLLQIL SRSVHPSNKLDFFKWCSL+PNF+HS STYSQIF  LCRSG
Subjt:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG

Query:  YLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFP
        YLHEVPL+LSSMKRDGV VDS TFKVLLDAFIRSGK+DAALEILDHME+LGTSLELNTYNSVLVAL+RKNQVGLALSIFFKLFDAF+ GGQEG+A  SF 
Subjt:  YLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFP

Query:  FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VKDA
        FLPN+LACNELLVALRKSDMRVEFKKVFDKLR IRSFEFN                                                        V DA
Subjt:  FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VKDA

Query:  LIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNG
        LIVWEELKGSGHEPDAFTYR+IIQGCCKSYRMDDAT IFNEMEYNGF PDTIVYNSLLDGLFKAR+V EACQ FDKMVQEGVRASPWTYNILIDGLFRNG
Subjt:  LIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNG

Query:  RAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMED
        RAEASYSLFCDLKKKGQFVDGVTYSII+LQLCKEGL EEALQLVEEMEARGFV+DL+TVTSLLIAMH+QGQWEGLERLMKHIREGDLVPNVLKWKANMED
Subjt:  RAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMED

Query:  SIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTF
        S+KYQ+NKRK+Y+SLFSPKEDLSEIISSRASSV KVN+ D  E TEE+D D+WSSSPHVDLLANLAKSTGD LQPFSLS GQR++AKGDNSFDI+MVNTF
Subjt:  SIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTF

Query:  LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV
        LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLS+FVKKGYFHQAWGIFNEMGEKVCPADIATYN+IIQGLGKMGRADLASSVLEKLMEQGGYLDIV
Subjt:  LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV

Query:  MYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN
        MYNTL+NALGKAGRMDDVNKLF+QM++SGINPDVV+FNTLIEVHSKAGRFKDAY FLKMMLDSGCSPNHVTDT LDFLGREIEK++      +  KN
Subjt:  MYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KFG9 Uncharacterized protein | e value: 0.0e+00 | % identity: 81.4
Query:  MRHGRGG--FLSME--SRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHIL
        MRHGR    FLS+E  SR  STLS L+ LLL+ASITKTLSESGTRTLQH SLP+S PLLLQILHSRS++PS+KLDFFKWCSL PNFNHSPSTYSQIFHIL
Subjt:  MRHGRGG--FLSME--SRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHIL

Query:  CRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAA
        CRSGYLHEVP LL SMKRDGV+VDS TFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKL D FNNGGQ  +AA
Subjt:  CRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAA

Query:  TSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------
        T+F FLPNSLACNELLVALRK DMRVEFKKVFDKLRAI SFEF+                                                        
Subjt:  TSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------

Query:  VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGL
        VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKS RMDDAT IFNEMEYNG  PDTIVYNSLL+GLFKARKVTEACQ FDKMVQE VRASPWTYNILIDGL
Subjt:  VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGL

Query:  FRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKA
        FRNGRAEA Y+LFCDLKKKGQ VD VTYSII+LQLCKE L EEALQLVEEMEARGFVVDLIT+TSLLIAMH+QGQW+GLERLMKHIREGDLVPNVLKWK 
Subjt:  FRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKA

Query:  NMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINM
        NME SIKYQ+NKRKD++SLFSPKEDLSE+ISSRASS  KVNID++FE TEERD DSWSSSP+V+ LANLA ST D LQPFS+ QG+RIQ K DNSFDINM
Subjt:  NMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINM

Query:  VNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY
        VNTFLSIFLAKGKL+LACKLFEIFSDMGVNPV+YTYNSMLSSFVKKGYFHQAWGIFNEMGE VCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY
Subjt:  VNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY

Query:  LDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPK
        LDIVMYNTLINALGKAGRMDDVNKLF QM+NSGINPDVVTFNTLIEVHSKAGR KDAY FLKMMLDSGCSPNHVTDTTLDFLGRE+EK++      +  K
Subjt:  LDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPK

Query:  N
        N
Subjt:  N

A0A1S3CBH7 pentatricopeptide repeat-containing protein At4g01570 | e value: 0.0e+00 | % identity: 82.27
Query:  MRHG--RGGFLSME--SRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHIL
        MRHG  R  FL +E  SR  STLSQL+DLLLVASITKTLSESGTRTLQH SLP+S PLLLQILHSRS++PS+KLDFFKWCSL PNFNHSPSTYSQIFHIL
Subjt:  MRHG--RGGFLSME--SRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHIL

Query:  CRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAA
        CRSGYLHEVP LL SMKRDGV+VDS TFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFD  NNGGQ+ +AA
Subjt:  CRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAA

Query:  TSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------
        TSF FLPNSLACNELLVALRK DMRVEF+KVFDKLRAI +FEFN                                                        
Subjt:  TSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------

Query:  VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGL
        VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDAT IFNEMEYNG  PD IVYNSLL+GLFKARKVTEACQ FDKMVQE VRASPWTYNILIDGL
Subjt:  VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGL

Query:  FRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKA
        FRNGRAEA Y+LFCDLKKKGQFVDGVTYSII+LQLCKEGL EEALQLVEEMEARGFVVDLIT+TSLLIAMH+QGQWEGLERLMKHIREGDLVPNVLKWK 
Subjt:  FRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKA

Query:  NMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINM
        NMEDSIKYQ+NKR+D++SLFSPKEDL E+ISSRASS  +VNID++ E TEE D D WSSSPHVD LANLA ST D LQPFSL QG+RIQ KG+NSFDINM
Subjt:  NMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINM

Query:  VNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY
        VNTFLSIFLAKGKL+LACKLFEIFSDMGVNPV+YTYNSMLSSFVKKGYFHQAWGIFNEMGE VCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY
Subjt:  VNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY

Query:  LDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPK
        LDIVMYNTLINALGKAGRMDDVNKLFDQM+NSGINPDVVTFNTLIEVHSKAGRFKDAY FLKMMLDSGCSPNHVTDTTLDFLGREIEK++      +  K
Subjt:  LDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPK

Query:  N
        N
Subjt:  N

A0A5A7TE47 Pentatricopeptide repeat-containing protein | e value: 0.0e+00 | % identity: 82.27
Query:  MRHG--RGGFLSME--SRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHIL
        MRHG  R  FL +E  SR  STLSQL+DLLLVASITKTLSESGTRTLQH SLP+S PLLLQILHSRS++PS+KLDFFKWCSL PNFNHSPSTYSQIFHIL
Subjt:  MRHG--RGGFLSME--SRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHIL

Query:  CRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAA
        CRSGYLHEVP LL SMKRDGV+VDS TFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFD  NNGGQ+ +AA
Subjt:  CRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAA

Query:  TSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------
        TSF FLPNSLACNELLVALRK DMRVEF+KVFDKLRAI +FEFN                                                        
Subjt:  TSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------

Query:  VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGL
        VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDAT IFNEMEYNG  PD IVYNSLL+GLFKARKVTEACQ FDKMVQE VRASPWTYNILIDGL
Subjt:  VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGL

Query:  FRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKA
        FRNGRAEA Y+LFCDLKKKGQFVDGVTYSII+LQLCKEGL EEALQLVEEMEARGFVVDLIT+TSLLIAMH+QGQWEGLERLMKHIREGDLVPNVLKWK 
Subjt:  FRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKA

Query:  NMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINM
        NMEDSIKYQ+NKR+D++SLFSPKEDL E+ISSRASS  +VNID++ E TEE D D WSSSPHVD LANLA ST D LQPFSL QG+RIQ KG+NSFDINM
Subjt:  NMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINM

Query:  VNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY
        VNTFLSIFLAKGKL+LACKLFEIFSDMGVNPV+YTYNSMLSSFVKKGYFHQAWGIFNEMGE VCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY
Subjt:  VNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGY

Query:  LDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPK
        LDIVMYNTLINALGKAGRMDDVNKLFDQM+NSGINPDVVTFNTLIEVHSKAGRFKDAY FLKMMLDSGCSPNHVTDTTLDFLGREIEK++      +  K
Subjt:  LDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPK

Query:  N
        N
Subjt:  N

A0A6J1EPT7 pentatricopeptide repeat-containing protein At4g01570 | e value: 0.0e+00 | % identity: 83.31
Query:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG
        MRHG GGF++MESRAT TLS+LADLLLVASITKTLSESGTRTLQH+SL +SEPLLLQIL SRSVHPSNKLDFFKWCSL+PNF+HSPSTYSQIF ILCRSG
Subjt:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG

Query:  YLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFP
        YLHEVPLLLSSMKRDGV VDS TFKVLLDAFIRSGK+D AL+ILDHME+LGTSLELNTYNSVLVAL+RKNQVGLALSIFFKLFDAF+ GGQEG+A  SF 
Subjt:  YLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFP

Query:  FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VKDA
        FLPN+LACNELLVALRKSDMRVEFKKVFDKLR IRSFEFN                                                        V DA
Subjt:  FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VKDA

Query:  LIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNG
        LIVWEELKGSGHEPDAFTYR+IIQGCCKSYRMDDAT IFNEMEYNGF PDTIVYNSLLDGLFKAR+V EACQ FDKMVQEGVRASPWTYNILIDGLFRNG
Subjt:  LIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNG

Query:  RAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMED
        RAEASY+LFCDLKKKGQFVDGVTYSII+LQLCKEGL EEALQLVEEMEARGFV+DL+TVTSLLIAMH+QGQWEGLERLMKHIREGDLVPNVLKWKANMED
Subjt:  RAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMED

Query:  SIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTF
        S+KYQ+NKRKDY+ LFSPKEDLSEIISSRASSV KV  DD  E TEE+D D+WSSSPHVDLLANLAKSTGD LQPFSLS GQR+QAKGDNSFDI+MVNTF
Subjt:  SIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTF

Query:  LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV
        LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLS+FVKKGYF QAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV
Subjt:  LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV

Query:  MYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN
        MYNTL+NALGKAGRMDDVNKLF+QM++SGINPDVV+FNTLIEVHSKAGRFKDAY FLKMMLDSGCSPNHVTDT LDFLGREIEK++      +  KN
Subjt:  MYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN

A0A6J1JWP1 pentatricopeptide repeat-containing protein At4g01570 | e value: 0.0e+00 | % identity: 82.06
Query:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG
        MRHG GGF++MESRAT TLS+LADLLLVASITKTLSESGTRTLQH+SL +SEPLLLQIL SRSVHPSNKLDFFKWCSL+PNF+HS STYSQIF ILCRSG
Subjt:  MRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHILCRSG

Query:  YLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFP
        Y HEVPLLLSSMKRDGV VDS TFKVLLDAFIRSGK+DAALEILDHME+LGTSLELNTYNSVLVAL+RKNQVGLALSIFFKLFDAF+ GGQEG+A  SF 
Subjt:  YLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFP

Query:  FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VKDA
        FLPN+LACNELLVALRKSDMRVEFK VFDKLR IRSFEFN                                                        V DA
Subjt:  FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN--------------------------------------------------------VKDA

Query:  LIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNG
        LIVWEELKGSGHEPDAFTYR+IIQGCCKSYRMDDAT IFNEMEYNGF P+TIVYNSLLDGLFKAR+V EACQ FDKMVQ+GVRASPWTYNILIDGLFRNG
Subjt:  LIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNG

Query:  RAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMED
        RAEASYSLFCDLKKKGQFVDGVTYSII+LQLCKEGL EEALQLVEEMEARGFV+DL+TVTSLLIAM++QGQWEGLERLMKHIREGDLVPNVLKWKANMED
Subjt:  RAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMED

Query:  SIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTF
        S+KYQ+NKRKDY+ LFSPKEDLSEIISSRA+SV KVN+DD  E TEE+D D+WSSSPHVDLLAN AKSTGD LQ FSLS GQR+Q+KG+NSFDI+MVNTF
Subjt:  SIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTF

Query:  LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV
        LSIFLAKGKLSLACKLF+IFSDMGVNPVRYTYNSMLS+FVKKGYFHQAWGIFNEMGEKVCPADIATYN+IIQGLGKMGRADLASSVLEKLMEQGGYLDIV
Subjt:  LSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIV

Query:  MYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN
        MYNTL+NALGKAGRMDDVNKLF+QM++SGI PDVV+FNTLIEVHSKAGRFKDAY +LKMMLDSGCSPNHVTDT LDFLGREIEK++      +  KN
Subjt:  MYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKN

SwissProt top hits (e value, % identity, alignment)
Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial | e value: 1.2e-47 | % identity: 26.03
Query:  GHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFC
        G+EPD  T+  ++ G C   R+ +A  + + M      PD +  ++L++GL    +V+EA    D+MV+ G +    TY  +++ L ++G +  +  LF 
Subjt:  GHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFC

Query:  DLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSIKYQQNKRK
         ++++      V YSI++  LCK+G F++AL L  EME +G   D++T +SL+  +   G+W+   ++++ +   +++P+V+ + A ++  +K  +    
Subjt:  DLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSIKYQQNKRK

Query:  DYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTFLSIFLAKGKL
            L   KE  +E+I+   +        DT  Y    D     +  H            +  Q F L     + +KG    DI   +  ++ +    ++
Subjt:  DYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTFLSIFLAKGKL

Query:  SLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALG
            +LF   S  G+ P   TYN+++  F + G  + A  +F EM  +  P  + TY +++ GL   G  + A  + EK+ +    L I +YN +I+ + 
Subjt:  SLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALG

Query:  KAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVT
         A ++DD   LF  + + G+ PDVVT+N +I    K G   +A    + M + GC+P+  T
Subjt:  KAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVT

Q8VZE4 Pentatricopeptide repeat-containing protein At4g01570 | e value: 1.1e-250 | % identity: 56.84
Query:  MRHGRGGFLS-----MESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWC-SLTPNFNHSPSTYSQIFH
        MRHGRG  +S     +     S   QL ++LLVAS++KTLS+SGTR+L   S+P+SEP++LQIL   S+ PS KLDFF+WC SL P + HS + YSQIF 
Subjt:  MRHGRGGFLS-----MESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWC-SLTPNFNHSPSTYSQIFH

Query:  ILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGT
         +CR+G L EVP LL SMK DGV +D    K+LLD+ IRSGK+++AL +LD+ME+LG  L  + Y+SVL+AL++K+++ LALSI FKL +A +N   + T
Subjt:  ILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGT

Query:  AATSF-PFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN-----------------------------------------------------
               +LP ++A NELLV LR++DMR EFK+VF+KL+ ++ F+F+                                                     
Subjt:  AATSF-PFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN-----------------------------------------------------

Query:  ----VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNIL
             KDALIVW+ELK SGHEPD  TYRI+IQGCCKSYRMDDA +I+ EM+YNGF PDTIVYN LLDG  KARKVTEACQ F+KMVQEGVRAS WTYNIL
Subjt:  ----VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNIL

Query:  IDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVL
        IDGLFRNGRAEA ++LFCDLKKKGQFVD +T+SI+ LQLC+EG  E A++LVEEME RGF VDL+T++SLLI  H+QG+W+  E+LMKHIREG+LVPNVL
Subjt:  IDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVL

Query:  KWKANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDD--TFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDN
        +W A +E S+K  Q+K KDYT +F  K    +I+S   S       DD  + E     + D WSSSP++D LA+           F L++GQR++AK D 
Subjt:  KWKANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDD--TFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDN

Query:  SFDINMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPV-RYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEK
        SFD++M+NTFLSI+L+KG LSLACKLFEIF+ MGV  +  YTYNSM+SSFVKKGYF  A G+ ++M E  C ADIATYNVIIQGLGKMGRADLAS+VL++
Subjt:  SFDINMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPV-RYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEK

Query:  LMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSK
        L +QGGYLDIVMYNTLINALGKA R+D+  +LFD MK++GINPDVV++NT+IEV+SKAG+ K+AY +LK MLD+GC PNHVTDT LD+LG+E+EK++
Subjt:  LMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSK

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g02860 (e value: 3.2e-48; % identity: 23.36)
Query:  STYSQIFHILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAF
        S  + I  +L + G +     + + ++ DG ++D  ++  L+ AF  SG+Y  A+ +   ME+ G    L TYN +             L++F K+   +
Subjt:  STYSQIFHILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAF

Query:  NNGGQEGTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNVKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEM
        N          S    P++   N L+   ++  +  E  +VF                    EE+K +G   D  TY  ++    KS+R  +A K+ NEM
Subjt:  NNGGQEGTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNVKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEM

Query:  EYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQ
          NGF+P  + YNSL+    +   + EA +  ++M ++G +   +TY  L+ G  R G+ E++ S+F +++  G   +  T++  +      G F E ++
Subjt:  EYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQ

Query:  LVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTF
        + +E+   G   D++T  +LL    + G    +  + K ++    VP   +   N   S   +    +   +++    D        A   P ++  +T 
Subjt:  LVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTF

Query:  EYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSL----SQGQRIQAKGDNSFDIN---------MVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVR
             R    W  S  V       +   + L   SL    + G+ I      + ++          ++ T + +      L  A + F    + G +P  
Subjt:  EYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSL----SQGQRIQAKGDNSFDIN---------MVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVR

Query:  YTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSG
         T NSM+S + ++    +A G+ + M E+     +ATYN ++    +      +  +L +++ +G   DI+ YNT+I A  +  RM D +++F +M+NSG
Subjt:  YTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSG

Query:  INPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVT
        I PDV+T+NT I  ++    F++A   ++ M+  GC PN  T
Subjt:  INPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVT

Q9SR00 Pentatricopeptide repeat-containing protein At3g04760, chloroplastic (e value: 4.6e-47; % identity: 27.5)
Query:  NVKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDG
        N+  A+ V E L+  G +PD F Y  +I G CK  R+DDAT++ + M    F+PDT+ YN ++  L    K+  A +  ++++ +  + +  TY ILI+ 
Subjt:  NVKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDG

Query:  LFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWK
            G  + +  L  ++  +G   D  TY+ I+  +CKEG+ + A ++V  +E +G   D+I+   LL A+  QG+WE  E+LM                
Subjt:  LFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWK

Query:  ANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDIN
                         T +FS K D            P V        T  RD         ++   NL K                ++ KG    D  
Subjt:  ANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDIN

Query:  MVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGG
          +  ++ F  +G+L +A +  E     G  P    YN++L++  K G   QA  IF ++GE  C  + ++YN +   L   G    A  ++ ++M  G 
Subjt:  MVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGG

Query:  YLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTL
          D + YN++I+ L + G +D+  +L   M++   +P VVT+N ++    KA R +DA N L+ M+ +GC PN  T T L
Subjt:  YLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTL

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic (e value: 5.1e-54; % identity: 27.29)
Query:  HSPS--TYSQIFHILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFK
        H P   TY  +      +  L  V    S M++DG   D +TF +L+DA  ++G +  A + LD M D G    L+TYN+++  LLR +++  AL +F  
Subjt:  HSPS--TYSQIFHILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFK

Query:  L---------------FDAFNNGGQEGTAATSFP------FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNVKDALIVWEELKGSGHEPDA
        +                D +   G   +A  +F         PN +ACN  L +L K+    E K++F          + +KD  +V          PD+
Subjt:  L---------------FDAFNNGGQEGTAATSFP------FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNVKDALIVWEELKGSGHEPDA

Query:  FTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKG
         TY ++++   K   +D+A K+ +EM  NG  PD IV NSL++ L+KA +V EA + F +M +  ++ +  TYN L+ GL +NG+ + +  LF  + +KG
Subjt:  FTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKG

Query:  QFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQ-------WEGLERLM--KHIREGDLVPNVLKWKANMEDSIK----
           + +T++ +   LCK      AL+++ +M   G V D+ T  +++  + + GQ       +  +++L+    +    L+P V+K  + +ED+ K    
Subjt:  QFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQ-------WEGLERLM--KHIREGDLVPNVLKWKANMEDSIK----

Query:  YQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTFLSI
        +  N      +LF   EDL        S + +  ID+   ++E              L+AN     GD +    L    R   K +N             
Subjt:  YQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTFLSI

Query:  FLAKGKLSLACKLFEIFS-DMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMY
              +S A  LFE F+ D+GV P   TYN ++   ++      A  +F ++    C  D+ATYN ++   GK G+ D    + +++       + + +
Subjt:  FLAKGKLSLACKLFEIFS-DMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMY

Query:  NTLINALGKAGRMDDVNKL-FDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPN
        N +I+ L KAG +DD   L +D M +   +P   T+  LI+  SK+GR  +A    + MLD GC PN
Subjt:  NTLINALGKAGRMDDVNKL-FDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPN

Arabidopsis top hits (e value, % identity, alignment)
AT3G04760.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 3.3e-48; % identity: 27.5)
Query:  NVKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDG
        N+  A+ V E L+  G +PD F Y  +I G CK  R+DDAT++ + M    F+PDT+ YN ++  L    K+  A +  ++++ +  + +  TY ILI+ 
Subjt:  NVKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDG

Query:  LFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWK
            G  + +  L  ++  +G   D  TY+ I+  +CKEG+ + A ++V  +E +G   D+I+   LL A+  QG+WE  E+LM                
Subjt:  LFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWK

Query:  ANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDIN
                         T +FS K D            P V        T  RD         ++   NL K                ++ KG    D  
Subjt:  ANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDIN

Query:  MVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGG
          +  ++ F  +G+L +A +  E     G  P    YN++L++  K G   QA  IF ++GE  C  + ++YN +   L   G    A  ++ ++M  G 
Subjt:  MVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGG

Query:  YLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTL
          D + YN++I+ L + G +D+  +L   M++   +P VVT+N ++    KA R +DA N L+ M+ +GC PN  T T L
Subjt:  YLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTL

AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 8.6e-49; % identity: 26.03)
Query:  GHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFC
        G+EPD  T+  ++ G C   R+ +A  + + M      PD +  ++L++GL    +V+EA    D+MV+ G +    TY  +++ L ++G +  +  LF 
Subjt:  GHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFC

Query:  DLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSIKYQQNKRK
         ++++      V YSI++  LCK+G F++AL L  EME +G   D++T +SL+  +   G+W+   ++++ +   +++P+V+ + A ++  +K  +    
Subjt:  DLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSIKYQQNKRK

Query:  DYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTFLSIFLAKGKL
            L   KE  +E+I+   +        DT  Y    D     +  H            +  Q F L     + +KG    DI   +  ++ +    ++
Subjt:  DYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTFLSIFLAKGKL

Query:  SLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALG
            +LF   S  G+ P   TYN+++  F + G  + A  +F EM  +  P  + TY +++ GL   G  + A  + EK+ +    L I +YN +I+ + 
Subjt:  SLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALG

Query:  KAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVT
         A ++DD   LF  + + G+ PDVVT+N +I    K G   +A    + M + GC+P+  T
Subjt:  KAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVT

AT4G01570.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 7.9e-252; % identity: 56.84)
Query:  MRHGRGGFLS-----MESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWC-SLTPNFNHSPSTYSQIFH
        MRHGRG  +S     +     S   QL ++LLVAS++KTLS+SGTR+L   S+P+SEP++LQIL   S+ PS KLDFF+WC SL P + HS + YSQIF 
Subjt:  MRHGRGGFLS-----MESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWC-SLTPNFNHSPSTYSQIFH

Query:  ILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGT
         +CR+G L EVP LL SMK DGV +D    K+LLD+ IRSGK+++AL +LD+ME+LG  L  + Y+SVL+AL++K+++ LALSI FKL +A +N   + T
Subjt:  ILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGT

Query:  AATSF-PFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN-----------------------------------------------------
               +LP ++A NELLV LR++DMR EFK+VF+KL+ ++ F+F+                                                     
Subjt:  AATSF-PFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFN-----------------------------------------------------

Query:  ----VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNIL
             KDALIVW+ELK SGHEPD  TYRI+IQGCCKSYRMDDA +I+ EM+YNGF PDTIVYN LLDG  KARKVTEACQ F+KMVQEGVRAS WTYNIL
Subjt:  ----VKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNIL

Query:  IDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVL
        IDGLFRNGRAEA ++LFCDLKKKGQFVD +T+SI+ LQLC+EG  E A++LVEEME RGF VDL+T++SLLI  H+QG+W+  E+LMKHIREG+LVPNVL
Subjt:  IDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVL

Query:  KWKANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDD--TFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDN
        +W A +E S+K  Q+K KDYT +F  K    +I+S   S       DD  + E     + D WSSSP++D LA+           F L++GQR++AK D 
Subjt:  KWKANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDD--TFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDN

Query:  SFDINMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPV-RYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEK
        SFD++M+NTFLSI+L+KG LSLACKLFEIF+ MGV  +  YTYNSM+SSFVKKGYF  A G+ ++M E  C ADIATYNVIIQGLGKMGRADLAS+VL++
Subjt:  SFDINMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPV-RYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEK

Query:  LMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSK
        L +QGGYLDIVMYNTLINALGKA R+D+  +LFD MK++GINPDVV++NT+IEV+SKAG+ K+AY +LK MLD+GC PNHVTDT LD+LG+E+EK++
Subjt:  LMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSK

AT4G31850.1 proton gradient regulation 3 (e value: 3.6e-55; % identity: 27.29)
Query:  HSPS--TYSQIFHILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFK
        H P   TY  +      +  L  V    S M++DG   D +TF +L+DA  ++G +  A + LD M D G    L+TYN+++  LLR +++  AL +F  
Subjt:  HSPS--TYSQIFHILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFK

Query:  L---------------FDAFNNGGQEGTAATSFP------FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNVKDALIVWEELKGSGHEPDA
        +                D +   G   +A  +F         PN +ACN  L +L K+    E K++F          + +KD  +V          PD+
Subjt:  L---------------FDAFNNGGQEGTAATSFP------FLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNVKDALIVWEELKGSGHEPDA

Query:  FTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKG
         TY ++++   K   +D+A K+ +EM  NG  PD IV NSL++ L+KA +V EA + F +M +  ++ +  TYN L+ GL +NG+ + +  LF  + +KG
Subjt:  FTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKG

Query:  QFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQ-------WEGLERLM--KHIREGDLVPNVLKWKANMEDSIK----
           + +T++ +   LCK      AL+++ +M   G V D+ T  +++  + + GQ       +  +++L+    +    L+P V+K  + +ED+ K    
Subjt:  QFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQ-------WEGLERLM--KHIREGDLVPNVLKWKANMEDSIK----

Query:  YQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTFLSI
        +  N      +LF   EDL        S + +  ID+   ++E              L+AN     GD +    L    R   K +N             
Subjt:  YQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNSFDINMVNTFLSI

Query:  FLAKGKLSLACKLFEIFS-DMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMY
              +S A  LFE F+ D+GV P   TYN ++   ++      A  +F ++    C  D+ATYN ++   GK G+ D    + +++       + + +
Subjt:  FLAKGKLSLACKLFEIFS-DMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMY

Query:  NTLINALGKAGRMDDVNKL-FDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPN
        N +I+ L KAG +DD   L +D M +   +P   T+  LI+  SK+GR  +A    + MLD GC PN
Subjt:  NTLINALGKAGRMDDVNKL-FDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPN

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 2.3e-49; % identity: 23.36)
Query:  STYSQIFHILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAF
        S  + I  +L + G +     + + ++ DG ++D  ++  L+ AF  SG+Y  A+ +   ME+ G    L TYN +             L++F K+   +
Subjt:  STYSQIFHILCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAF

Query:  NNGGQEGTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNVKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEM
        N          S    P++   N L+   ++  +  E  +VF                    EE+K +G   D  TY  ++    KS+R  +A K+ NEM
Subjt:  NNGGQEGTAATSFPFLPNSLACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNVKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEM

Query:  EYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQ
          NGF+P  + YNSL+    +   + EA +  ++M ++G +   +TY  L+ G  R G+ E++ S+F +++  G   +  T++  +      G F E ++
Subjt:  EYNGFAPDTIVYNSLLDGLFKARKVTEACQTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQ

Query:  LVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTF
        + +E+   G   D++T  +LL    + G    +  + K ++    VP   +   N   S   +    +   +++    D        A   P ++  +T 
Subjt:  LVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKHIREGDLVPNVLKWKANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTF

Query:  EYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSL----SQGQRIQAKGDNSFDIN---------MVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVR
             R    W  S  V       +   + L   SL    + G+ I      + ++          ++ T + +      L  A + F    + G +P  
Subjt:  EYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSL----SQGQRIQAKGDNSFDIN---------MVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVR

Query:  YTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSG
         T NSM+S + ++    +A G+ + M E+     +ATYN ++    +      +  +L +++ +G   DI+ YNT+I A  +  RM D +++F +M+NSG
Subjt:  YTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFDQMKNSG

Query:  INPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVT
        I PDV+T+NT I  ++    F++A   ++ M+  GC PN  T
Subjt:  INPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVT


Sequences
CDS sequence
ATGTGGATAAGGAATTCTGAAGCTTCCATTAGAAATGCCAATGCAATGCGCCATGGAAGAGGTGGTTTCCTTTCCATGGAATCAAGGGCAACTTCAACTCTCTCTCAATT
GGCCGATCTCCTCCTCGTTGCTTCCATTACCAAAACCCTTTCCGAATCAGGTACTCGAACCCTCCAACACCGTTCTCTTCCATTATCGGAGCCTCTCCTTCTTCAAATCC
TGCATTCCAGATCTGTTCATCCTTCCAACAAGCTCGATTTCTTCAAATGGTGTTCTCTCACCCCCAATTTCAACCATTCACCCTCCACATATTCCCAAATCTTCCATATC
CTCTGCCGCTCTGGATACCTCCACGAGGTCCCCCTTTTACTCTCCTCGATGAAGCGAGACGGTGTCGCTGTTGATTCCCTCACTTTCAAGGTCCTTCTCGATGCGTTTAT
CAGGTCTGGTAAATATGATGCTGCCCTTGAAATTTTAGACCATATGGAAGATTTGGGAACTAGCTTGGAACTCAACACCTACAACTCTGTTCTTGTCGCTCTGCTCAGAA
AAAACCAAGTGGGTTTGGCCTTATCAATTTTCTTTAAGCTGTTTGATGCGTTTAATAATGGAGGGCAAGAAGGTACTGCTGCAACTAGTTTTCCTTTCTTGCCTAATTCA
CTTGCTTGTAATGAATTGTTGGTCGCTCTTAGGAAATCAGACATGAGGGTTGAGTTCAAAAAGGTTTTTGACAAGCTTAGAGCAATTAGAAGCTTTGAGTTTAATGTTAA
GGATGCACTTATTGTGTGGGAGGAACTTAAAGGGTCAGGTCATGAGCCTGATGCCTTCACTTACCGTATCATAATTCAGGGTTGCTGTAAATCTTACCGAATGGACGATG
CAACCAAGATTTTTAATGAAATGGAGTACAATGGATTTGCCCCAGATACCATTGTGTATAATTCTCTCCTCGACGGGCTATTTAAGGCTCGGAAAGTTACTGAAGCATGT
CAAACTTTTGATAAAATGGTGCAAGAAGGTGTAAGAGCTTCTCCTTGGACATACAATATTCTAATTGATGGATTGTTTAGGAATGGAAGAGCTGAAGCTAGCTACTCTTT
ATTCTGTGATTTGAAGAAAAAGGGTCAATTTGTGGATGGTGTTACTTACAGCATCATTGTATTACAACTGTGTAAAGAGGGACTGTTTGAGGAAGCACTACAATTGGTTG
AAGAAATGGAAGCGAGAGGCTTTGTTGTTGATCTTATTACTGTAACATCTCTTTTAATTGCAATGCACAGACAAGGCCAGTGGGAAGGGTTAGAGAGGCTCATGAAGCAC
ATTAGAGAAGGTGATTTGGTCCCCAATGTTCTTAAGTGGAAGGCTAACATGGAAGATTCAATCAAATATCAGCAAAATAAAAGGAAAGACTACACATCCCTGTTCTCCCC
AAAGGAGGATTTGAGTGAGATTATTAGTTCAAGAGCTTCTTCTGTTCCGAAAGTTAATATTGATGACACTTTCGAATACACAGAAGAAAGAGATGCTGACAGTTGGTCAT
CATCCCCACATGTAGATCTTTTGGCTAATCTTGCAAAGTCCACCGGCGATTTTTTGCAACCATTCTCTCTAAGTCAGGGGCAACGAATCCAAGCTAAAGGGGACAACTCA
TTCGATATCAATATGGTCAATACATTTTTGTCTATTTTTCTAGCAAAAGGAAAATTGAGCTTAGCGTGTAAGTTGTTTGAGATCTTCAGTGATATGGGTGTTAACCCAGT
GAGGTACACCTACAATTCAATGCTGAGTTCATTTGTGAAGAAGGGATACTTTCACCAGGCATGGGGTATATTTAATGAAATGGGCGAGAAGGTATGTCCAGCCGATATAG
CCACATACAATGTGATAATTCAAGGACTCGGGAAGATGGGTAGAGCAGATCTTGCAAGTTCTGTTCTGGAAAAGCTAATGGAGCAGGGTGGCTATCTCGATATTGTAATG
TACAACACTTTGATCAATGCACTAGGGAAGGCAGGCCGAATGGATGATGTAAATAAGCTTTTTGATCAGATGAAGAACAGTGGGATAAACCCAGATGTTGTCACTTTTAA
TACACTTATTGAAGTTCACAGCAAAGCAGGTCGGTTTAAGGATGCTTACAACTTCTTGAAAATGATGCTGGATTCGGGCTGTTCCCCAAATCATGTCACAGATACAACTT
TGGATTTTCTAGGGAGAGAGATTGAGAAAAGTAAGAGGCTTACCAACAGAGCTCTTGCTCCAAAGAACTTGACTGTCAATGGGTGTGAGATGTTTGTCATCAAGGCGTTT
CCATGTTCTGATAGATTTGACAGTTCCCCAATTTCCTTTTGCTGTTCTTTCCAAGTTAGCCACTACCACTCTAGCTCTCCTAGACCATTCAGCCTTCCCATAAGCATGGT
ACCCAAACCCTCCAGCATAGCAAAGGTTAATCCCAGTAAGCACTCCACAGAAATCATTGAGATGATCATGACCTGTAAACACTGCTTTTACATCTCCTGCTTCAACCATG
GAGGTAAAGAAACCAGAGTTCACAGAAGGAGAGCTGATCCCTTCTTGTCGAACACCAGTAAGAAAGAAGAAAATACCAGTAAAAACGATGAAATCAGGTTTCTCAGCAAG
AATCATCCGGCGAAGGAAGGCGGTAGTGTTGAGGTCAGAGCAAGAAGCAAGCTGAGTGGGAAGAACATCCTCACATGGCGTACCTTTCCCATTAGCATAATGCATATCCG
CTACTTGCAGAATCTTAAACTCCCCATTTTTCCCAAATCTCAGCCGCATCGGGTGGTTCCGTTGAACGGCGGTGGATTTGGCCGCCGGGAAAGTCAGCGACAAGAAAAGG
AGCAGAAACAGGGGAGATAG
mRNA sequence
ATGTGGATAAGGAATTCTGAAGCTTCCATTAGAAATGCCAATGCAATGCGCCATGGAAGAGGTGGTTTCCTTTCCATGGAATCAAGGGCAACTTCAACTCTCTCTCAATT
GGCCGATCTCCTCCTCGTTGCTTCCATTACCAAAACCCTTTCCGAATCAGGTACTCGAACCCTCCAACACCGTTCTCTTCCATTATCGGAGCCTCTCCTTCTTCAAATCC
TGCATTCCAGATCTGTTCATCCTTCCAACAAGCTCGATTTCTTCAAATGGTGTTCTCTCACCCCCAATTTCAACCATTCACCCTCCACATATTCCCAAATCTTCCATATC
CTCTGCCGCTCTGGATACCTCCACGAGGTCCCCCTTTTACTCTCCTCGATGAAGCGAGACGGTGTCGCTGTTGATTCCCTCACTTTCAAGGTCCTTCTCGATGCGTTTAT
CAGGTCTGGTAAATATGATGCTGCCCTTGAAATTTTAGACCATATGGAAGATTTGGGAACTAGCTTGGAACTCAACACCTACAACTCTGTTCTTGTCGCTCTGCTCAGAA
AAAACCAAGTGGGTTTGGCCTTATCAATTTTCTTTAAGCTGTTTGATGCGTTTAATAATGGAGGGCAAGAAGGTACTGCTGCAACTAGTTTTCCTTTCTTGCCTAATTCA
CTTGCTTGTAATGAATTGTTGGTCGCTCTTAGGAAATCAGACATGAGGGTTGAGTTCAAAAAGGTTTTTGACAAGCTTAGAGCAATTAGAAGCTTTGAGTTTAATGTTAA
GGATGCACTTATTGTGTGGGAGGAACTTAAAGGGTCAGGTCATGAGCCTGATGCCTTCACTTACCGTATCATAATTCAGGGTTGCTGTAAATCTTACCGAATGGACGATG
CAACCAAGATTTTTAATGAAATGGAGTACAATGGATTTGCCCCAGATACCATTGTGTATAATTCTCTCCTCGACGGGCTATTTAAGGCTCGGAAAGTTACTGAAGCATGT
CAAACTTTTGATAAAATGGTGCAAGAAGGTGTAAGAGCTTCTCCTTGGACATACAATATTCTAATTGATGGATTGTTTAGGAATGGAAGAGCTGAAGCTAGCTACTCTTT
ATTCTGTGATTTGAAGAAAAAGGGTCAATTTGTGGATGGTGTTACTTACAGCATCATTGTATTACAACTGTGTAAAGAGGGACTGTTTGAGGAAGCACTACAATTGGTTG
AAGAAATGGAAGCGAGAGGCTTTGTTGTTGATCTTATTACTGTAACATCTCTTTTAATTGCAATGCACAGACAAGGCCAGTGGGAAGGGTTAGAGAGGCTCATGAAGCAC
ATTAGAGAAGGTGATTTGGTCCCCAATGTTCTTAAGTGGAAGGCTAACATGGAAGATTCAATCAAATATCAGCAAAATAAAAGGAAAGACTACACATCCCTGTTCTCCCC
AAAGGAGGATTTGAGTGAGATTATTAGTTCAAGAGCTTCTTCTGTTCCGAAAGTTAATATTGATGACACTTTCGAATACACAGAAGAAAGAGATGCTGACAGTTGGTCAT
CATCCCCACATGTAGATCTTTTGGCTAATCTTGCAAAGTCCACCGGCGATTTTTTGCAACCATTCTCTCTAAGTCAGGGGCAACGAATCCAAGCTAAAGGGGACAACTCA
TTCGATATCAATATGGTCAATACATTTTTGTCTATTTTTCTAGCAAAAGGAAAATTGAGCTTAGCGTGTAAGTTGTTTGAGATCTTCAGTGATATGGGTGTTAACCCAGT
GAGGTACACCTACAATTCAATGCTGAGTTCATTTGTGAAGAAGGGATACTTTCACCAGGCATGGGGTATATTTAATGAAATGGGCGAGAAGGTATGTCCAGCCGATATAG
CCACATACAATGTGATAATTCAAGGACTCGGGAAGATGGGTAGAGCAGATCTTGCAAGTTCTGTTCTGGAAAAGCTAATGGAGCAGGGTGGCTATCTCGATATTGTAATG
TACAACACTTTGATCAATGCACTAGGGAAGGCAGGCCGAATGGATGATGTAAATAAGCTTTTTGATCAGATGAAGAACAGTGGGATAAACCCAGATGTTGTCACTTTTAA
TACACTTATTGAAGTTCACAGCAAAGCAGGTCGGTTTAAGGATGCTTACAACTTCTTGAAAATGATGCTGGATTCGGGCTGTTCCCCAAATCATGTCACAGATACAACTT
TGGATTTTCTAGGGAGAGAGATTGAGAAAAGTAAGAGGCTTACCAACAGAGCTCTTGCTCCAAAGAACTTGACTGTCAATGGGTGTGAGATGTTTGTCATCAAGGCGTTT
CCATGTTCTGATAGATTTGACAGTTCCCCAATTTCCTTTTGCTGTTCTTTCCAAGTTAGCCACTACCACTCTAGCTCTCCTAGACCATTCAGCCTTCCCATAAGCATGGT
ACCCAAACCCTCCAGCATAGCAAAGGTTAATCCCAGTAAGCACTCCACAGAAATCATTGAGATGATCATGACCTGTAAACACTGCTTTTACATCTCCTGCTTCAACCATG
GAGGTAAAGAAACCAGAGTTCACAGAAGGAGAGCTGATCCCTTCTTGTCGAACACCAGTAAGAAAGAAGAAAATACCAGTAAAAACGATGAAATCAGGTTTCTCAGCAAG
AATCATCCGGCGAAGGAAGGCGGTAGTGTTGAGGTCAGAGCAAGAAGCAAGCTGAGTGGGAAGAACATCCTCACATGGCGTACCTTTCCCATTAGCATAATGCATATCCG
CTACTTGCAGAATCTTAAACTCCCCATTTTTCCCAAATCTCAGCCGCATCGGGTGGTTCCGTTGAACGGCGGTGGATTTGGCCGCCGGGAAAGTCAGCGACAAGAAAAGG
AGCAGAAACAGGGGAGATAG
Protein sequence
MWIRNSEASIRNANAMRHGRGGFLSMESRATSTLSQLADLLLVASITKTLSESGTRTLQHRSLPLSEPLLLQILHSRSVHPSNKLDFFKWCSLTPNFNHSPSTYSQIFHI
LCRSGYLHEVPLLLSSMKRDGVAVDSLTFKVLLDAFIRSGKYDAALEILDHMEDLGTSLELNTYNSVLVALLRKNQVGLALSIFFKLFDAFNNGGQEGTAATSFPFLPNS
LACNELLVALRKSDMRVEFKKVFDKLRAIRSFEFNVKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSYRMDDATKIFNEMEYNGFAPDTIVYNSLLDGLFKARKVTEAC
QTFDKMVQEGVRASPWTYNILIDGLFRNGRAEASYSLFCDLKKKGQFVDGVTYSIIVLQLCKEGLFEEALQLVEEMEARGFVVDLITVTSLLIAMHRQGQWEGLERLMKH
IREGDLVPNVLKWKANMEDSIKYQQNKRKDYTSLFSPKEDLSEIISSRASSVPKVNIDDTFEYTEERDADSWSSSPHVDLLANLAKSTGDFLQPFSLSQGQRIQAKGDNS
FDINMVNTFLSIFLAKGKLSLACKLFEIFSDMGVNPVRYTYNSMLSSFVKKGYFHQAWGIFNEMGEKVCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVM
YNTLINALGKAGRMDDVNKLFDQMKNSGINPDVVTFNTLIEVHSKAGRFKDAYNFLKMMLDSGCSPNHVTDTTLDFLGREIEKSKRLTNRALAPKNLTVNGCEMFVIKAF
PCSDRFDSSPISFCCSFQVSHYHSSSPRPFSLPISMVPKPSSIAKVNPSKHSTEIIEMIMTCKHCFYISCFNHGGKETRVHRRRADPFLSNTSKKEENTSKNDEIRFLSK
NHPAKEGGSVEVRARSKLSGKNILTWRTFPISIMHIRYLQNLKLPIFPKSQPHRVVPLNGGGFGRRESQRQEKEQKQGR