; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005072 (gene) of Snake gourd v1 genome

Gene IDTan0005072
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG03:78248536..78257077
RNA-Seq ExpressionTan0005072
SyntenyTan0005072
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594603.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.2e-29591.27Show/hide
Query:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL
        MN+TKSLITVE++Q A QHPVTHI  SRPHSP++AALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSL+CAAGSITYAR+L PTVPNPDSFLFNSL
Subjt:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL

Query:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS
        LKATSK+GFSVDAVLFYRHMLF GV QSNYTFTSVIKACADLSALRLGREIHSHV+V GYGSDMYVQAALIALYAKAGDMKVAQK+FD MPQRT IAWNS
Subjt:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS

Query:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT
        LISGYEQNGLP +SI LFNLM+ SGFQPD AT+VSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMK RNVVTWT
Subjt:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT

Query:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP
        AMISGYGMHGYG+QAMKLF EMR YGPRPNNITFVAVLSACAHSGLIDDGR+AF+SMKEVYGLVPGVEHHVCMVDMFGRAGLL DAYQFI+ FLPEEPGP
Subjt:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP

Query:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL
        AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMM+R RLKKQVGYSTIEI++KTY+FSMGDKSHPET+ IYRYL
Subjt:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL

Query:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR
        DEL+G CSESGYIPA  SLMHDLEEEERDYALRYHSEKLALAFGLLKTN+
Subjt:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR

KAG7026573.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.8e-29591.09Show/hide
Query:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL
        MN+TKSLITVE++Q A QHPVTHI  SRPHSP++AALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSL+CAAGSITYAR+L PTVPNPDSFLFNSL
Subjt:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL

Query:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS
        LKATSK+GFSVDAVLFYRHMLF GV QSNYTFTSVIKACADLSALRLGREIHSHV+V GYGSDMYVQAALIALYAKAGDMKVAQK+FD MPQRT IAWNS
Subjt:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS

Query:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT
        LISGYEQNGLP +SI LFNLM+ SGFQPD AT+VSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMK RNVVTWT
Subjt:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT

Query:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP
        AMISGYGMHGYG+QAMKLF EMR YGPRPNNITFVAVLSACAHSGLIDDGR+AF+SMKEVYGLVPGVEHHVCMVDMFGRAGLL DAYQFI+ FLPEEPGP
Subjt:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP

Query:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL
        AVWTSMLGACRMHK FDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMM+R RLKKQVGYSTIEI++KTY+FSMGDKSHPET+ IYRYL
Subjt:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL

Query:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR
        DEL+G CSESGYIPA  SLMHDLEEEERDYALRYHSEKLALAFGLLKTN+
Subjt:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR

XP_022926430.1 pentatricopeptide repeat-containing protein At2g33760 isoform X1 [Cucurbita moschata]5.2e-29691.27Show/hide
Query:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL
        MN+TKSLITVE++Q A QHPVTHI  SRPHSP++AALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSL+CAAGSITYAR+L PTVPNPDSFLFNSL
Subjt:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL

Query:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS
        LKATSK+GFSVDAVLFYRHMLF GV QSNYTFTSVIKACADLSALRLGREIHSHV+V GYGSDMYVQAALIALYAKAGDMKVAQK+FD MPQRT IAWNS
Subjt:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS

Query:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT
        LISGYEQNGLP +SI LFNLM+ SGFQPD AT+VSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMK RNVVTWT
Subjt:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT

Query:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP
        AMISGYGMHGYG+QAMKLF EMR YGPRPNNITFVAVLSACAHSGLIDDGR+AF+SMKEVYGLVPGVEHHVCMVDMFGRAGLL DAYQFI+ FLPEEPGP
Subjt:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP

Query:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL
        AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMM+R RLKKQVGYSTIEI++KTY+FSMGDKSHPET+ IYRYL
Subjt:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL

Query:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR
        DEL+G CSESGYIPA  SLMHDLEEEERDYALRYHSEKLALAFGLLKTN+
Subjt:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR

XP_022926431.1 pentatricopeptide repeat-containing protein At2g33760 isoform X2 [Cucurbita moschata]5.2e-29691.27Show/hide
Query:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL
        MN+TKSLITVE++Q A QHPVTHI  SRPHSP++AALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSL+CAAGSITYAR+L PTVPNPDSFLFNSL
Subjt:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL

Query:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS
        LKATSK+GFSVDAVLFYRHMLF GV QSNYTFTSVIKACADLSALRLGREIHSHV+V GYGSDMYVQAALIALYAKAGDMKVAQK+FD MPQRT IAWNS
Subjt:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS

Query:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT
        LISGYEQNGLP +SI LFNLM+ SGFQPD AT+VSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMK RNVVTWT
Subjt:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT

Query:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP
        AMISGYGMHGYG+QAMKLF EMR YGPRPNNITFVAVLSACAHSGLIDDGR+AF+SMKEVYGLVPGVEHHVCMVDMFGRAGLL DAYQFI+ FLPEEPGP
Subjt:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP

Query:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL
        AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMM+R RLKKQVGYSTIEI++KTY+FSMGDKSHPET+ IYRYL
Subjt:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL

Query:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR
        DEL+G CSESGYIPA  SLMHDLEEEERDYALRYHSEKLALAFGLLKTN+
Subjt:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR

XP_023517887.1 pentatricopeptide repeat-containing protein At2g33760 [Cucurbita pepo subsp. pepo]6.8e-29691.27Show/hide
Query:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL
        MN+TKSLITVE++Q A QHPVTHI  SRPHSP++AALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRL PTVPNPDSFLFNSL
Subjt:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL

Query:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS
        LKATSK+GFSVDAVLFYRHMLF GV QSNYTFTSVIKACADLSALRLGREIHSHV+V GYGSDMYVQAALIALY+KAGDMKVAQK+FD MPQRT IAWNS
Subjt:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS

Query:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT
        LISGYEQNGLP +SI LFNLM+ SGFQPD AT+VSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMK RNVVTWT
Subjt:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT

Query:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP
        AMISGYGMHGYG+QAMKLF EMR YGPRPNNITFVAVLSACAHSGLIDDGR+AF+SMKE+YGLVPGVEHHVCMVDMFGRAGLL DAYQFI+ FLPEEPGP
Subjt:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP

Query:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL
        AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMM+R RLKKQVGYSTIEI++KTY+FSMGDKSHPET+ IYRYL
Subjt:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL

Query:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR
        DEL+G CSESGYIPA  SLMHDLEEEERDYALRYHSEKLALAFGLLKTN+
Subjt:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR

TrEMBL top hitse value%identityAlignment
A0A1S4DTH4 pentatricopeptide repeat-containing protein At2g337605.0e-28488.03Show/hide
Query:  ITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKF
        I ++ +Q AFQHPVT    ++  SPV+ ALL++GPRLRNLQQVHAHIIVSG HRSRSLLTKL+SLVC AGSITYARRLFPTVPNPDSFLF+SLLK TSKF
Subjt:  ITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKF

Query:  GFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQ
        GFS+DAVLFYRHMLFSG P SNYTFTSVIKACADLSALRLG+EIHSHVMVCGYGSDMYVQAALIALYAKA DMKVA+K+FD MPQRTIIAWNSLISG+EQ
Subjt:  GFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQ

Query:  NGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYG
        NGLP+ESI LF+LM++SG QPDPATIVSLLSSCSQLGALDFGCWLHDY+N N FDLNVVLGTSLINMYTRCGNVSKA+EVFDSMKERNVVTWTAMISGYG
Subjt:  NGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYG

Query:  MHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSML
        MHG+G+QAMKLFGEMR YGPRPNNITFVAVLSACAHSGLIDDGR+ F+SMKEVYGLVPGVEH+VCMVDMFGRAGLLNDAYQFIK  LP+EPGPAVWTSML
Subjt:  MHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSML

Query:  GACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSC
        GACRMH+NFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMM R RLKKQVGYSTIEI+RKTYLFSMGDKSHP+T+TIYRYLDEL+  C
Subjt:  GACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSC

Query:  SESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR
        SESGY+PAP SLMHDLEEEERDYALRYHSEKLALAFGLL TN+
Subjt:  SESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR

A0A5D3CPN1 Pentatricopeptide repeat-containing protein5.0e-28488.03Show/hide
Query:  ITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKF
        I ++ +Q AFQHPVT    ++  SPV+ ALL++GPRLRNLQQVHAHIIVSG HRSRSLLTKL+SLVC AGSITYARRLFPTVPNPDSFLF+SLLK TSKF
Subjt:  ITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKF

Query:  GFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQ
        GFS+DAVLFYRHMLFSG P SNYTFTSVIKACADLSALRLG+EIHSHVMVCGYGSDMYVQAALIALYAKA DMKVA+K+FD MPQRTIIAWNSLISG+EQ
Subjt:  GFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQ

Query:  NGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYG
        NGLP+ESI LF+LM++SG QPDPATIVSLLSSCSQLGALDFGCWLHDY+N N FDLNVVLGTSLINMYTRCGNVSKA+EVFDSMKERNVVTWTAMISGYG
Subjt:  NGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYG

Query:  MHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSML
        MHG+G+QAMKLFGEMR YGPRPNNITFVAVLSACAHSGLIDDGR+ F+SMKEVYGLVPGVEH+VCMVDMFGRAGLLNDAYQFIK  LP+EPGPAVWTSML
Subjt:  MHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSML

Query:  GACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSC
        GACRMH+NFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMM R RLKKQVGYSTIEI+RKTYLFSMGDKSHP+T+TIYRYLDEL+  C
Subjt:  GACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSC

Query:  SESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR
        SESGY+PAP SLMHDLEEEERDYALRYHSEKLALAFGLL TN+
Subjt:  SESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR

A0A6J1EEW7 pentatricopeptide repeat-containing protein At2g33760 isoform X22.5e-29691.27Show/hide
Query:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL
        MN+TKSLITVE++Q A QHPVTHI  SRPHSP++AALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSL+CAAGSITYAR+L PTVPNPDSFLFNSL
Subjt:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL

Query:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS
        LKATSK+GFSVDAVLFYRHMLF GV QSNYTFTSVIKACADLSALRLGREIHSHV+V GYGSDMYVQAALIALYAKAGDMKVAQK+FD MPQRT IAWNS
Subjt:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS

Query:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT
        LISGYEQNGLP +SI LFNLM+ SGFQPD AT+VSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMK RNVVTWT
Subjt:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT

Query:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP
        AMISGYGMHGYG+QAMKLF EMR YGPRPNNITFVAVLSACAHSGLIDDGR+AF+SMKEVYGLVPGVEHHVCMVDMFGRAGLL DAYQFI+ FLPEEPGP
Subjt:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP

Query:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL
        AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMM+R RLKKQVGYSTIEI++KTY+FSMGDKSHPET+ IYRYL
Subjt:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL

Query:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR
        DEL+G CSESGYIPA  SLMHDLEEEERDYALRYHSEKLALAFGLLKTN+
Subjt:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR

A0A6J1EL35 pentatricopeptide repeat-containing protein At2g33760 isoform X12.5e-29691.27Show/hide
Query:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL
        MN+TKSLITVE++Q A QHPVTHI  SRPHSP++AALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSL+CAAGSITYAR+L PTVPNPDSFLFNSL
Subjt:  MNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSL

Query:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS
        LKATSK+GFSVDAVLFYRHMLF GV QSNYTFTSVIKACADLSALRLGREIHSHV+V GYGSDMYVQAALIALYAKAGDMKVAQK+FD MPQRT IAWNS
Subjt:  LKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNS

Query:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT
        LISGYEQNGLP +SI LFNLM+ SGFQPD AT+VSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMK RNVVTWT
Subjt:  LISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWT

Query:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP
        AMISGYGMHGYG+QAMKLF EMR YGPRPNNITFVAVLSACAHSGLIDDGR+AF+SMKEVYGLVPGVEHHVCMVDMFGRAGLL DAYQFI+ FLPEEPGP
Subjt:  AMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGP

Query:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL
        AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMM+R RLKKQVGYSTIEI++KTY+FSMGDKSHPET+ IYRYL
Subjt:  AVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYL

Query:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR
        DEL+G CSESGYIPA  SLMHDLEEEERDYALRYHSEKLALAFGLLKTN+
Subjt:  DELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR

A0A6J1KND5 pentatricopeptide repeat-containing protein At2g337602.7e-29090.94Show/hide
Query:  VEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGF
        +E++Q A QHPVTHI  SRPHSP++AALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRL PTVPNPDSFLFNSLLKATSK+GF
Subjt:  VEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGF

Query:  SVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNG
        SVDAVLFYRHMLF GV QSNYTFTSVIKACADLSALRLGREIHSHV+V GYGSDMYVQAALIALYAKAGDMKVAQK+FD MPQRT IAWNSLISGYEQNG
Subjt:  SVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNG

Query:  LPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMH
        LP +SI LFN M+ SGFQPD AT+VSLLSSCSQLGALDFGCWLHDYANSNSF+LNVVLGTSLINMYTRCGNVSKAREVFDSMK RNVVTWTAMISGYGMH
Subjt:  LPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMH

Query:  GYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGA
        GYG+QAMKLF EMR YGPRPNNITFVAVLSACAHSGLIDDGRQAF+SMKEVYGLVPGVEHHVCMVDMFGRAGLL DAYQFI+ FLPEEPGPAVWTSMLGA
Subjt:  GYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGA

Query:  CRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSE
        CRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMD+VEMVRNMM+R RLKKQVGYSTIEI++KTY+FSMGDKSHPET+ IYRYLDEL+G CSE
Subjt:  CRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSE

Query:  SGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR
        SGYIPA  SLMHDLEEEERDYALRYHSE+LALAFGLLKTN+
Subjt:  SGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNR

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210651.0e-10838.8Show/hide
Query:  LRNLQQVHAHII-----VSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNP-DSFLFNSLLKATSKFGFSVDAVLFYRHMLFSG-VPQSNYTFTSVI
        +  L+Q+HA  I     +S     + L+  L+SL  +   ++YA ++F  +  P + F++N+L++  ++ G S+ A   YR M  SG V    +T+  +I
Subjt:  LRNLQQVHAHII-----VSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNP-DSFLFNSLLKATSKFGFSVDAVLFYRHMLFSG-VPQSNYTFTSVI

Query:  KACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSL
        KA   ++ +RLG  IHS V+  G+GS +YVQ +L+ LYA  GD+  A K+FD MP++ ++AWNS+I+G+ +NG P+E++ L+  M   G +PD  TIVSL
Subjt:  KACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSL

Query:  LSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRG-YGPRPNNITFV
        LS+C+++GAL  G  +H Y        N+     L+++Y RCG V +A+ +FD M ++N V+WT++I G  ++G+G++A++LF  M    G  P  ITFV
Subjt:  LSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRG-YGPRPNNITFV

Query:  AVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPG
         +L AC+H G++ +G + F  M+E Y + P +EH  CMVD+  RAG +  AY++IK+ +P +P   +W ++LGAC +H + DL       +L +EP + G
Subjt:  AVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPG

Query:  HYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYH
         YV+LSN+YA   R   V+ +R  M+R  +KK  G+S +E+  + + F MGDKSHP++  IY  L E+ G     GY+P   ++  D+EEEE++ A+ YH
Subjt:  HYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYH

Query:  SEKLALAFGLLKTNRANP
        SEK+A+AF L+ T   +P
Subjt:  SEKLALAFGLLKTNRANP

P93011 Pentatricopeptide repeat-containing protein At2g337603.9e-19360.54Show/hide
Query:  HSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSN
        +S  + A+++AGPR++ LQQVHAH+IV+G+ RSRSLLTKL++L C+A +I Y   LF +VP PD FLFNS++K+TSK    +  V +YR ML S V  SN
Subjt:  HSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSN

Query:  YTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPD
        YTFTSVIK+CADLSALR+G+ +H H +V G+G D YVQAAL+  Y+K GDM+ A+++FD MP+++I+AWNSL+SG+EQNGL  E+I +F  M ESGF+PD
Subjt:  YTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPD

Query:  PATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMR-GYGPR
         AT VSLLS+C+Q GA+  G W+H Y  S   DLNV LGT+LIN+Y+RCG+V KAREVFD MKE NV  WTAMIS YG HGYGQQA++LF +M    GP 
Subjt:  PATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMR-GYGPR

Query:  PNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNF--LPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHV
        PNN+TFVAVLSACAH+GL+++GR  +  M + Y L+PGVEHHVCMVDM GRAG L++AY+FI       +   PA+WT+MLGAC+MH+N+DLGV++A+ +
Subjt:  PNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNF--LPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHV

Query:  LAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEE
        +A+EP+NPGH+VMLSNIYAL+G+ D V  +R+ M+R  L+KQVGYS IE++ KTY+FSMGD+SH ET  IYRYL+ LI  C E GY P    +MH +EEE
Subjt:  LAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEE

Query:  ERDYALRYHSEKLALAFGLLKT
        E+++ALRYHSEKLA+AFGLLKT
Subjt:  ERDYALRYHSEKLALAFGLLKT

Q9SUH6 Pentatricopeptide repeat-containing protein At4g307004.1e-11038.81Show/hide
Query:  LRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLS
        LR   Q+H+    +G +    +LT  +SL    G I     LF     PD   +N+++   +  G +  ++  ++ ++ SG    + T  S++     L 
Subjt:  LRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLS

Query:  ALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQL
         +     IH + +   + S   V  AL  +Y+K  +++ A+K+FD  P++++ +WN++ISGY QNGL +++I LF  M +S F P+P TI  +LS+C+QL
Subjt:  ALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQL

Query:  GALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAH
        GAL  G W+HD   S  F+ ++ + T+LI MY +CG++++AR +FD M ++N VTW  MISGYG+HG GQ+A+ +F EM   G  P  +TF+ VL AC+H
Subjt:  GALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAH

Query:  SGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNI
        +GL+ +G + F SM   YG  P V+H+ CMVD+ GRAG L  A QFI+  +  EPG +VW ++LGACR+HK+ +L   V+E +  ++P+N G++V+LSNI
Subjt:  SGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNI

Query:  YALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAF
        ++      +   VR    + +L K  GY+ IEI    ++F+ GD+SHP+   IY  L++L G   E+GY P     +HD+EEEER+  ++ HSE+LA+AF
Subjt:  YALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAF

Query:  GLLKT
        GL+ T
Subjt:  GLLKT

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic1.1e-11037.43Show/hide
Query:  AALLQAGPRLRNLQQVHAHIIVSGF---HRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYT
        A L+     +  + Q+HA I+        R   L  KL     + G I ++  LF    +PD FLF + +   S  G    A L Y  +L S +  + +T
Subjt:  AALLQAGPRLRNLQQVHAHIIVSGF---HRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYT

Query:  FTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRT-------------------------------IIAWNS
        F+S++K+C+  S    G+ IH+HV+  G G D YV   L+ +YAK GD+  AQK+FD MP+R+                               I++WN 
Subjt:  FTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRT-------------------------------IIAWNS

Query:  LISGYEQNGLPKESIGLFNLMLESGF-QPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTW
        +I GY Q+G P +++ LF  +L  G  +PD  T+V+ LS+CSQ+GAL+ G W+H +  S+   LNV + T LI+MY++CG++ +A  VF+    +++V W
Subjt:  LISGYEQNGLPKESIGLFNLMLESGF-QPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTW

Query:  TAMISGYGMHGYGQQAMKLFGEMRGY-GPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEP
         AMI+GY MHGY Q A++LF EM+G  G +P +ITF+  L ACAH+GL+++G + F SM + YG+ P +EH+ C+V + GRAG L  AY+ IKN +  + 
Subjt:  TAMISGYGMHGYGQQAMKLFGEMRGY-GPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEP

Query:  GPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYR
           +W+S+LG+C++H +F LG ++AE+++ +  +N G YV+LSNIYA  G  + V  VRN+M    + K+ G STIEI+ K + F  GD+ H ++  IY 
Subjt:  GPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYR

Query:  YLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNRANPPPVWNS
         L ++       GY+P   +++ DLEE E++ +L+ HSE+LA+A+GL+ T   +P  ++ +
Subjt:  YLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNRANPPPVWNS

Q9ZVF4 Pentatricopeptide repeat-containing protein At2g01510, mitochondrial8.5e-11637.84Show/hide
Query:  RNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSA
        + L+++HA ++ +GF    SLLT+LL  +   G + YAR++F  +  P  FL+N+L K   +     +++L Y+ M   GV    +T+  V+KA + L  
Subjt:  RNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSA

Query:  LRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLG
           G  +H+HV+  G+G    V   L+ +Y K G++  A+ +F++M  + ++AWN+ ++   Q G    ++  FN M     Q D  T+VS+LS+C QLG
Subjt:  LRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLG

Query:  ALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHS
        +L+ G  ++D A     D N+++  + ++M+ +CGN   AR +F+ MK+RNVV+W+ MI GY M+G  ++A+ LF  M+  G RPN +TF+ VLSAC+H+
Subjt:  ALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHS

Query:  GLIDDGRQAFASMKEV--YGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSN
        GL+++G++ F+ M +     L P  EH+ CMVD+ GR+GLL +AY+FIK  +P EP   +W ++LGAC +H++  LG KVA+ ++   P+   ++V+LSN
Subjt:  GLIDDGRQAFASMKEV--YGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSN

Query:  IYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALA
        IYA AG+ D V+ VR+ M +L  KK   YS++E + K + F+ GDKSHP++  IY  LDE++    + GY+P   S+ HD+E EE++ +L +HSEKLA+A
Subjt:  IYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALA

Query:  FGLLKTNRANPPPVWNSAGLKQLMHFYSSFNTRAGQFYTVMPPKGSF
        FGL+K    +P  V  +       H +S F +       +M  K  F
Subjt:  FGLLKTNRANPPPVWNSAGLKQLMHFYSSFNTRAGQFYTVMPPKGSF

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.2e-11038.22Show/hide
Query:  VHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTF
        VH +L+    +   L+  H  +     HR     T L+    + G I  A++LF  +P  D   +N+++   ++ G   +A+  ++ M+ + V     T 
Subjt:  VHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTF

Query:  TSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPAT
         +V+ ACA   ++ LGR++H  +   G+GS++ +  ALI LY+K G+++ A  +F+ +P + +I+WN+LI GY    L KE++ LF  ML SG  P+  T
Subjt:  TSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPAT

Query:  IVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVV--LGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPN
        ++S+L +C+ LGA+D G W+H Y +     +     L TSLI+MY +CG++  A +VF+S+  +++ +W AMI G+ MHG    +  LF  MR  G +P+
Subjt:  IVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVV--LGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPN

Query:  NITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVE
        +ITFV +LSAC+HSG++D GR  F +M + Y + P +EH+ CM+D+ G +GL  +A + I N +  EP   +W S+L AC+MH N +LG   AE+++ +E
Subjt:  NITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVE

Query:  PENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDY
        PENPG YV+LSNIYA AGR + V   R ++    +KK  G S+IEID   + F +GDK HP    IY  L+E+     ++G++P    ++ ++EEE ++ 
Subjt:  PENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDY

Query:  ALRYHSEKLALAFGLLKT
        ALR+HSEKLA+AFGL+ T
Subjt:  ALRYHSEKLALAFGLLKT

AT2G01510.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.1e-11737.84Show/hide
Query:  RNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSA
        + L+++HA ++ +GF    SLLT+LL  +   G + YAR++F  +  P  FL+N+L K   +     +++L Y+ M   GV    +T+  V+KA + L  
Subjt:  RNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSA

Query:  LRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLG
           G  +H+HV+  G+G    V   L+ +Y K G++  A+ +F++M  + ++AWN+ ++   Q G    ++  FN M     Q D  T+VS+LS+C QLG
Subjt:  LRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLG

Query:  ALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHS
        +L+ G  ++D A     D N+++  + ++M+ +CGN   AR +F+ MK+RNVV+W+ MI GY M+G  ++A+ LF  M+  G RPN +TF+ VLSAC+H+
Subjt:  ALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHS

Query:  GLIDDGRQAFASMKEV--YGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSN
        GL+++G++ F+ M +     L P  EH+ CMVD+ GR+GLL +AY+FIK  +P EP   +W ++LGAC +H++  LG KVA+ ++   P+   ++V+LSN
Subjt:  GLIDDGRQAFASMKEV--YGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSN

Query:  IYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALA
        IYA AG+ D V+ VR+ M +L  KK   YS++E + K + F+ GDKSHP++  IY  LDE++    + GY+P   S+ HD+E EE++ +L +HSEKLA+A
Subjt:  IYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALA

Query:  FGLLKTNRANPPPVWNSAGLKQLMHFYSSFNTRAGQFYTVMPPKGSF
        FGL+K    +P  V  +       H +S F +       +M  K  F
Subjt:  FGLLKTNRANPPPVWNSAGLKQLMHFYSSFNTRAGQFYTVMPPKGSF

AT2G33760.1 Pentatricopeptide repeat (PPR) superfamily protein2.8e-19460.54Show/hide
Query:  HSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSN
        +S  + A+++AGPR++ LQQVHAH+IV+G+ RSRSLLTKL++L C+A +I Y   LF +VP PD FLFNS++K+TSK    +  V +YR ML S V  SN
Subjt:  HSPVHAALLQAGPRLRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSN

Query:  YTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPD
        YTFTSVIK+CADLSALR+G+ +H H +V G+G D YVQAAL+  Y+K GDM+ A+++FD MP+++I+AWNSL+SG+EQNGL  E+I +F  M ESGF+PD
Subjt:  YTFTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPD

Query:  PATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMR-GYGPR
         AT VSLLS+C+Q GA+  G W+H Y  S   DLNV LGT+LIN+Y+RCG+V KAREVFD MKE NV  WTAMIS YG HGYGQQA++LF +M    GP 
Subjt:  PATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMR-GYGPR

Query:  PNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNF--LPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHV
        PNN+TFVAVLSACAH+GL+++GR  +  M + Y L+PGVEHHVCMVDM GRAG L++AY+FI       +   PA+WT+MLGAC+MH+N+DLGV++A+ +
Subjt:  PNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNF--LPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHV

Query:  LAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEE
        +A+EP+NPGH+VMLSNIYAL+G+ D V  +R+ M+R  L+KQVGYS IE++ KTY+FSMGD+SH ET  IYRYL+ LI  C E GY P    +MH +EEE
Subjt:  LAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEE

Query:  ERDYALRYHSEKLALAFGLLKT
        E+++ALRYHSEKLA+AFGLLKT
Subjt:  ERDYALRYHSEKLALAFGLLKT

AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein2.9e-11138.81Show/hide
Query:  LRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLS
        LR   Q+H+    +G +    +LT  +SL    G I     LF     PD   +N+++   +  G +  ++  ++ ++ SG    + T  S++     L 
Subjt:  LRNLQQVHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLS

Query:  ALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQL
         +     IH + +   + S   V  AL  +Y+K  +++ A+K+FD  P++++ +WN++ISGY QNGL +++I LF  M +S F P+P TI  +LS+C+QL
Subjt:  ALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQL

Query:  GALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAH
        GAL  G W+HD   S  F+ ++ + T+LI MY +CG++++AR +FD M ++N VTW  MISGYG+HG GQ+A+ +F EM   G  P  +TF+ VL AC+H
Subjt:  GALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAH

Query:  SGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNI
        +GL+ +G + F SM   YG  P V+H+ CMVD+ GRAG L  A QFI+  +  EPG +VW ++LGACR+HK+ +L   V+E +  ++P+N G++V+LSNI
Subjt:  SGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNI

Query:  YALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAF
        ++      +   VR    + +L K  GY+ IEI    ++F+ GD+SHP+   IY  L++L G   E+GY P     +HD+EEEER+  ++ HSE+LA+AF
Subjt:  YALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAF

Query:  GLLKT
        GL+ T
Subjt:  GLLKT

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.7e-11237.43Show/hide
Query:  AALLQAGPRLRNLQQVHAHIIVSGF---HRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYT
        A L+     +  + Q+HA I+        R   L  KL     + G I ++  LF    +PD FLF + +   S  G    A L Y  +L S +  + +T
Subjt:  AALLQAGPRLRNLQQVHAHIIVSGF---HRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYT

Query:  FTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRT-------------------------------IIAWNS
        F+S++K+C+  S    G+ IH+HV+  G G D YV   L+ +YAK GD+  AQK+FD MP+R+                               I++WN 
Subjt:  FTSVIKACADLSALRLGREIHSHVMVCGYGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRT-------------------------------IIAWNS

Query:  LISGYEQNGLPKESIGLFNLMLESGF-QPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTW
        +I GY Q+G P +++ LF  +L  G  +PD  T+V+ LS+CSQ+GAL+ G W+H +  S+   LNV + T LI+MY++CG++ +A  VF+    +++V W
Subjt:  LISGYEQNGLPKESIGLFNLMLESGF-QPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGTSLINMYTRCGNVSKAREVFDSMKERNVVTW

Query:  TAMISGYGMHGYGQQAMKLFGEMRGY-GPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEP
         AMI+GY MHGY Q A++LF EM+G  G +P +ITF+  L ACAH+GL+++G + F SM + YG+ P +EH+ C+V + GRAG L  AY+ IKN +  + 
Subjt:  TAMISGYGMHGYGQQAMKLFGEMRGY-GPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGRAGLLNDAYQFIKNFLPEEP

Query:  GPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYR
           +W+S+LG+C++H +F LG ++AE+++ +  +N G YV+LSNIYA  G  + V  VRN+M    + K+ G STIEI+ K + F  GD+ H ++  IY 
Subjt:  GPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKSHPETSTIYR

Query:  YLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNRANPPPVWNS
         L ++       GY+P   +++ DLEE E++ +L+ HSE+LA+A+GL+ T   +P  ++ +
Subjt:  YLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNRANPPPVWNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCGTCGTCGTCATCGACTTTCCCCGTGATCTGCCTTCTTCACTCTGTCGTCGCAATCACGAGCGGGACGCTCATGATGTTCTATATGAAGGAGATTTACACGAT
TGGCCATGGGATCGACATCGCCACTAAGTTAATGGGCTCTACGCCGCACGATCAGCTCCTGATTCAGACCTCCGATTCGTTTTCCGGGTTGCTCTTATTCGCAATCGGAT
TTCTTCTCTTCATGGTTTCGTTCGTCAAGGACAGGGAATTTCAAGGGTTCTTCGCTAAGGGATGTACGGTGCTTCACGTATCAATGGCGATGTGGAGATTCTATTTCGAG
CGAAGAGTGGAGGACCTGGCCTGGGATTGGCTGAGGCAGATTGTTGGTGACATTCTTCTTGCCCTATCTTGGGTGAATCTGTCAGAAAGGAGAAAAATCCATCCCACTTT
CCTTTCAAAAGCTTACACTCGAAGCCAAACCGGCGGAGAGGAGTTTCAGGGAGGGATAGCAATCAATCCAATTATGAACGAAACAAAGTCTCTAATTACAGTGGAAGCCG
AGCAATTTGCATTTCAACATCCTGTAACTCACATTCTTTACTCGCGACCGCACTCTCCAGTTCATGCAGCGCTTCTTCAAGCAGGTCCCCGTCTGAGAAACCTTCAACAA
GTTCATGCCCATATCATCGTTTCCGGATTCCATAGAAGTCGATCCCTCCTCACTAAGCTTCTTTCACTGGTTTGTGCCGCTGGTTCAATCACCTATGCTCGTCGGTTGTT
CCCCACTGTCCCAAATCCTGATTCATTCCTATTTAATTCCCTCCTCAAAGCGACTTCCAAGTTCGGTTTCTCTGTTGATGCCGTCTTGTTTTACCGTCATATGCTTTTCT
CAGGTGTTCCCCAGTCGAATTACACCTTTACGTCCGTTATCAAAGCCTGTGCAGATCTTTCAGCTCTGAGGTTGGGTCGAGAAATTCATTCTCATGTTATGGTTTGTGGT
TATGGTTCAGATATGTACGTTCAGGCTGCACTAATTGCTCTCTATGCTAAAGCTGGTGATATGAAAGTCGCCCAGAAGATGTTTGATACAATGCCACAAAGAACAATTAT
AGCTTGGAACTCACTTATATCAGGGTACGAGCAGAATGGATTACCGAAGGAATCAATTGGTTTATTTAATCTGATGTTGGAGTCGGGCTTTCAACCTGATCCAGCAACAA
TAGTGAGCTTGTTGTCTTCTTGTTCTCAGCTGGGGGCTCTTGATTTCGGATGCTGGTTGCATGATTATGCTAATAGTAATAGTTTTGATCTCAACGTAGTTCTTGGTACT
TCATTGATTAACATGTACACTAGATGTGGGAATGTAAGCAAAGCGCGGGAAGTTTTTGACTCCATGAAGGAAAGGAATGTTGTTACCTGGACAGCCATGATTTCAGGGTA
CGGGATGCATGGTTATGGTCAGCAAGCAATGAAGCTTTTTGGTGAAATGAGAGGTTATGGCCCTCGCCCTAACAATATCACATTTGTTGCAGTCTTGTCTGCATGTGCTC
ATTCAGGGTTGATTGATGATGGTCGTCAGGCATTTGCAAGCATGAAGGAAGTATATGGGTTAGTTCCAGGAGTAGAACATCATGTCTGCATGGTAGATATGTTTGGACGT
GCTGGATTGCTCAACGATGCTTATCAATTTATTAAGAATTTTCTTCCTGAAGAGCCAGGTCCGGCAGTTTGGACTTCAATGCTTGGGGCTTGCAGAATGCATAAAAATTT
TGACCTTGGAGTTAAGGTTGCAGAACATGTTTTAGCAGTTGAACCAGAAAACCCTGGTCATTATGTTATGCTTTCTAACATATATGCATTGGCCGGTAGGATGGATCGAG
TGGAAATGGTACGAAACATGATGATTAGACTACGCCTAAAGAAGCAAGTAGGCTATAGCACCATAGAGATAGATCGAAAGACCTATTTGTTTAGCATGGGTGACAAGTCT
CATCCCGAGACAAGTACAATCTATAGATATTTAGACGAATTAATAGGTAGTTGTAGTGAATCAGGCTATATACCAGCACCTGGGTCCTTGATGCATGATTTGGAAGAAGA
GGAAAGGGATTATGCCCTTAGATATCATAGTGAGAAGCTTGCACTAGCATTTGGTTTACTTAAAACCAATCGAGCTAATCCACCACCAGTGTGGAACTCCGCAGGTTTGA
AGCAGCTTATGCATTTTTATTCAAGCTTCAACACCAGGGCTGGGCAGTTTTATACAGTCATGCCGCCAAAAGGTTCTTTCCCTAAAAAATTCTTTGAGGTGAATGATGAG
ATGGAGCTTCAAGAAGCAAACATTTGGATTTTAAATTGCGTGTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATCGTCGTCGTCATCGACTTTCCCCGTGATCTGCCTTCTTCACTCTGTCGTCGCAATCACGAGCGGGACGCTCATGATGTTCTATATGAAGGAGATTTACACGAT
TGGCCATGGGATCGACATCGCCACTAAGTTAATGGGCTCTACGCCGCACGATCAGCTCCTGATTCAGACCTCCGATTCGTTTTCCGGGTTGCTCTTATTCGCAATCGGAT
TTCTTCTCTTCATGGTTTCGTTCGTCAAGGACAGGGAATTTCAAGGGTTCTTCGCTAAGGGATGTACGGTGCTTCACGTATCAATGGCGATGTGGAGATTCTATTTCGAG
CGAAGAGTGGAGGACCTGGCCTGGGATTGGCTGAGGCAGATTGTTGGTGACATTCTTCTTGCCCTATCTTGGGTGAATCTGTCAGAAAGGAGAAAAATCCATCCCACTTT
CCTTTCAAAAGCTTACACTCGAAGCCAAACCGGCGGAGAGGAGTTTCAGGGAGGGATAGCAATCAATCCAATTATGAACGAAACAAAGTCTCTAATTACAGTGGAAGCCG
AGCAATTTGCATTTCAACATCCTGTAACTCACATTCTTTACTCGCGACCGCACTCTCCAGTTCATGCAGCGCTTCTTCAAGCAGGTCCCCGTCTGAGAAACCTTCAACAA
GTTCATGCCCATATCATCGTTTCCGGATTCCATAGAAGTCGATCCCTCCTCACTAAGCTTCTTTCACTGGTTTGTGCCGCTGGTTCAATCACCTATGCTCGTCGGTTGTT
CCCCACTGTCCCAAATCCTGATTCATTCCTATTTAATTCCCTCCTCAAAGCGACTTCCAAGTTCGGTTTCTCTGTTGATGCCGTCTTGTTTTACCGTCATATGCTTTTCT
CAGGTGTTCCCCAGTCGAATTACACCTTTACGTCCGTTATCAAAGCCTGTGCAGATCTTTCAGCTCTGAGGTTGGGTCGAGAAATTCATTCTCATGTTATGGTTTGTGGT
TATGGTTCAGATATGTACGTTCAGGCTGCACTAATTGCTCTCTATGCTAAAGCTGGTGATATGAAAGTCGCCCAGAAGATGTTTGATACAATGCCACAAAGAACAATTAT
AGCTTGGAACTCACTTATATCAGGGTACGAGCAGAATGGATTACCGAAGGAATCAATTGGTTTATTTAATCTGATGTTGGAGTCGGGCTTTCAACCTGATCCAGCAACAA
TAGTGAGCTTGTTGTCTTCTTGTTCTCAGCTGGGGGCTCTTGATTTCGGATGCTGGTTGCATGATTATGCTAATAGTAATAGTTTTGATCTCAACGTAGTTCTTGGTACT
TCATTGATTAACATGTACACTAGATGTGGGAATGTAAGCAAAGCGCGGGAAGTTTTTGACTCCATGAAGGAAAGGAATGTTGTTACCTGGACAGCCATGATTTCAGGGTA
CGGGATGCATGGTTATGGTCAGCAAGCAATGAAGCTTTTTGGTGAAATGAGAGGTTATGGCCCTCGCCCTAACAATATCACATTTGTTGCAGTCTTGTCTGCATGTGCTC
ATTCAGGGTTGATTGATGATGGTCGTCAGGCATTTGCAAGCATGAAGGAAGTATATGGGTTAGTTCCAGGAGTAGAACATCATGTCTGCATGGTAGATATGTTTGGACGT
GCTGGATTGCTCAACGATGCTTATCAATTTATTAAGAATTTTCTTCCTGAAGAGCCAGGTCCGGCAGTTTGGACTTCAATGCTTGGGGCTTGCAGAATGCATAAAAATTT
TGACCTTGGAGTTAAGGTTGCAGAACATGTTTTAGCAGTTGAACCAGAAAACCCTGGTCATTATGTTATGCTTTCTAACATATATGCATTGGCCGGTAGGATGGATCGAG
TGGAAATGGTACGAAACATGATGATTAGACTACGCCTAAAGAAGCAAGTAGGCTATAGCACCATAGAGATAGATCGAAAGACCTATTTGTTTAGCATGGGTGACAAGTCT
CATCCCGAGACAAGTACAATCTATAGATATTTAGACGAATTAATAGGTAGTTGTAGTGAATCAGGCTATATACCAGCACCTGGGTCCTTGATGCATGATTTGGAAGAAGA
GGAAAGGGATTATGCCCTTAGATATCATAGTGAGAAGCTTGCACTAGCATTTGGTTTACTTAAAACCAATCGAGCTAATCCACCACCAGTGTGGAACTCCGCAGGTTTGA
AGCAGCTTATGCATTTTTATTCAAGCTTCAACACCAGGGCTGGGCAGTTTTATACAGTCATGCCGCCAAAAGGTTCTTTCCCTAAAAAATTCTTTGAGGTGAATGATGAG
ATGGAGCTTCAAGAAGCAAACATTTGGATTTTAAATTGCGTGTCATAA
Protein sequenceShow/hide protein sequence
MGSSSSSTFPVICLLHSVVAITSGTLMMFYMKEIYTIGHGIDIATKLMGSTPHDQLLIQTSDSFSGLLLFAIGFLLFMVSFVKDREFQGFFAKGCTVLHVSMAMWRFYFE
RRVEDLAWDWLRQIVGDILLALSWVNLSERRKIHPTFLSKAYTRSQTGGEEFQGGIAINPIMNETKSLITVEAEQFAFQHPVTHILYSRPHSPVHAALLQAGPRLRNLQQ
VHAHIIVSGFHRSRSLLTKLLSLVCAAGSITYARRLFPTVPNPDSFLFNSLLKATSKFGFSVDAVLFYRHMLFSGVPQSNYTFTSVIKACADLSALRLGREIHSHVMVCG
YGSDMYVQAALIALYAKAGDMKVAQKMFDTMPQRTIIAWNSLISGYEQNGLPKESIGLFNLMLESGFQPDPATIVSLLSSCSQLGALDFGCWLHDYANSNSFDLNVVLGT
SLINMYTRCGNVSKAREVFDSMKERNVVTWTAMISGYGMHGYGQQAMKLFGEMRGYGPRPNNITFVAVLSACAHSGLIDDGRQAFASMKEVYGLVPGVEHHVCMVDMFGR
AGLLNDAYQFIKNFLPEEPGPAVWTSMLGACRMHKNFDLGVKVAEHVLAVEPENPGHYVMLSNIYALAGRMDRVEMVRNMMIRLRLKKQVGYSTIEIDRKTYLFSMGDKS
HPETSTIYRYLDELIGSCSESGYIPAPGSLMHDLEEEERDYALRYHSEKLALAFGLLKTNRANPPPVWNSAGLKQLMHFYSSFNTRAGQFYTVMPPKGSFPKKFFEVNDE
MELQEANIWILNCVS