; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036546 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036546
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr3:48283275..48288048
RNA-Seq ExpressionLag0036546
SyntenyLag0036546
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR005162 - Retrotransposon gag domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135765.1 pentatricopeptide repeat-containing protein At5g48910 isoform X1 [Cucumis sativus]0.0e+0087.67Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        MSSI+T+HLPSP KP  FS EK IPTSKLPQKTVLKLFDSKS TSLQYL+++HGLVLRSGHFQDHYVSGAL+KCYANPHFSNF FALKVFSSIPNPNVFI
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW
        WNIVIKGCLEN+K FKAIYFYGRM++DARPNKFTYPTLFKACSVA+AVQEGRQIHGHVVKHG+GSD+HIKSAGI MYASFG LEDARK+F +GESDVVCW
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV
        NTMIDGYLKCG LEAAKGLF QMP++NIGSWNVMING AKGG LGDARK+FDEMSERDEI+WSSMVDGYISAG YKEALEIFQ MQRE+TRPG+FIL SV
Subjt:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV

Query:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS
        LAACS+IGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKC RLDM WEVFEEMKE+E+FTWNAMIGG AIHGRA DALELFSK+QEGR+KPNG+TLV 
Subjt:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS

Query:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY
        +LTACAHAGFVD+GLRIF+TM+EFYGV+ E+EHYGC+VDLLGRSGLFSEAEDLI SMPMKPNAAVWGALLGACRIHGN ++AERVGKILLELEP NSGRY
Subjt:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY

Query:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE
         LLSNIYAK GRFDDV+KIRK MKDRGIKT+PGVS+VDLNG VHEFKMGDGSHPQMKEIY KL+ I ERLQMAG+SPDTSQVLFDI+EEEKETAV YHSE
Subjt:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE

Query:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        KLAIAFGLINT PGK IHIVKNLRVCDDCHSATKLISQI++REIIVRDR+RYHHFKNGTCSC+DF
Subjt:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

XP_016900941.1 PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X1 [Cucumis melo]0.0e+0089.47Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        MSSI+T+HLPSP KP   S EKKIPTSKLPQKTVLKLFDSKS TSLQYL +VHGLVLRSGHFQDHYVSGAL+KCYANPHFSNF FALKVFSSIPNPNVFI
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW
        WNIVIKGCLEN+K FKAIYFYGRM++DARPNKFTYPTLFKACSVA+AVQEGRQIHGHVVKHG+GSDMHIKSAGIQMYASFG LEDARKLF +GESDVVCW
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV
        NTMIDGYLKCGDLEAAKGLF QMP+RNIGSWNVMING AKGGKLGDARK+FDEMSERDEI+WSSMVDGYISAGCYKEALEIFQ MQRE+TRPG+FIL SV
Subjt:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV

Query:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS
        LAACS+IGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKC RLDM WEVFEEMKE+E+FTWNAMIGG AIHGRA DALELFSKMQEGR+KPNGVTLV 
Subjt:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS

Query:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY
        +LTACAHAGFVD+GLRIF+TM+EFYGV+ E+EHYGC+VDLLGRSGLFSEAEDLI SMPMKPNAAVWGALLGACRIHGN  +AERVGKILLELEP NSGRY
Subjt:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY

Query:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE
         LLSNIYA  GRFDDVAKIRK MKDRGIKTLPGVS VDLNG VHEFKMGDGSH QMKEIY KL+ I ERLQMAG+SPDTSQVLFDIEEEEKETAVQYHSE
Subjt:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE

Query:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        KLAIAFGLINT PG+ IHIVKNLRVCDDCHSATKLISQIY+REIIVRDR+RYHHFKNGTCSC+DF
Subjt:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

XP_022989356.1 pentatricopeptide repeat-containing protein At5g66520-like isoform X1 [Cucurbita maxima]0.0e+0087.37Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        MSSI+T  LPSP +PEHF  E+K+PTSKLPQKTVLKLFDSKS TSL+YLS+VHGL+LRSGHFQDHYVSGAL+KCYANPHFSNF FALKVFSSIPNPNVFI
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW
        WNIVIKGCLEN+ PFKAIYFYGRM++DARPNKF+YPTLFKACSVA+AV EG QIH HVVKHG G+DMHIKSAGIQMY SFG  EDARKL DNGESDVVCW
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV
        NT+IDGYLKCG+LEAAKGLFEQMP RNIGSWNVMI+GFAKGGKLGDARKVFDEM +RDEITWSSMVDGYISAGCYKEALEIFQLMQR++  PG+FILCSV
Subjt:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV

Query:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS
        LAACSS+GAIDQGRWVHAYL+RNSIKLDAVLGTALLDMYAKC RLDMAW+VF E++++EVFTWNAMIGG AIHGRA DALELFSKMQ+GRLKPNGVTLVS
Subjt:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS

Query:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY
        ILTACAHAGFVDRGLRIFETMKEFYGVE EM HYGCVVDLLGRSGLFSEAE+LI SMPMKPNAAVWGALLG CRIHGN+E+AERVGKILLEL+  NSG Y
Subjt:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY

Query:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE
         LLSNIYAKAGRFDDVAKIRK MK++GIKT+PGVSMVDLNG VHEFKMGD SHPQMKEIY KLE+I ERL+MAGYSPDTSQVLFDIEEEEKE+AV+YHSE
Subjt:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE

Query:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        KLAIAFGLINTSPGK IHIVKNLR+CDDCHSATKLISQIY++EIIVRDR+RYHHFK GTCSC+DF
Subjt:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

XP_023529861.1 pentatricopeptide repeat-containing protein At5g48910-like isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0087.97Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        MSSI+T  LPSP KPEHF  E+K+PTSKLPQKTVLKLFDSKS TSL+YLS+VHGL+LRSGHFQDHYVSGALVKCYANPHF NF FALKVFSSIPNPNVFI
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW
        WNIVIKGCLEN+ PFKAIYFYGRM++DARPNKF+YPTLFKACSVA+AV EGRQIH HVVKHG GSDMHIKSAGIQMY SFG  EDARKL DNGESDVVCW
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV
        NT+IDGYLKCG+LEAAKGLFEQMP  NIGSWNVMI+GFAKGGKLGDARKVFDEM +RDEITWSSMVDGYISAGCYKEALEIFQLMQR+   PG+FILCSV
Subjt:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV

Query:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS
        LAACSS+GAIDQGRWVHAYL+RNSIKLDAVLGTALLDMYAKC RLDMAW+VF E++E+EVFTWNAMIGG AIHGRA DALELFSKMQ+GRLKPNGVTLVS
Subjt:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS

Query:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY
        ILTACAHAGFVDRGLRIFETMKEFYGVE EM HYGCVVDLLGRSGLFSEAE+LI SMPMKPNAAVWGALLG CRIHGN+E+AERVGKILLEL+ HNSG Y
Subjt:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY

Query:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE
         LLSNIYAKAGRFDDVAKIRK MK++GIKT+PGVSMVDLNG VHEFKMGD SHPQMKEIY KLE+I ERLQMAGYSPDTSQVLFDIEEEEKE+AV+ HSE
Subjt:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE

Query:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        KLAIAFGLIN+SPGK IHIVKNLR+CDDCHSATKLISQIY++EIIVRDR+RYHHFKNGTCSC+DF
Subjt:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

XP_038878241.1 pentatricopeptide repeat-containing protein At5g48910-like isoform X1 [Benincasa hispida]0.0e+0088.72Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        MSSI+T+HLPSP KP  FS EK+IP SKL QKTVLKLFDSKS  SL YLS+VHGLVLRSGHFQDHYVSGALVKCYA+PHFSNF FALKVFSSIPNPNVFI
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW
        WNIVIKGCLEN+KPF+AIYFYGRM++DARPNKFTYPTLFKACSVA+AVQEGRQIHGHVVKHG+ SDMHIKSAGIQMYASFGGLEDA+KLFD+GESD+VCW
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV
        NTMIDGYLKCGDLEAAKGLF QMPIRNIGSWNVMING AKGGKLGDARKVFDEMSERDEI+WSSMVDGYISAG YKEALEIFQ MQ+E+ RPGKFIL SV
Subjt:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV

Query:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS
        LAACS+IGAIDQGRWVH YLKRNSIKLDAVLGTALLDMYAKC RLDMAWEVFEEMKE+E+FTWNAMIGG A+HGRA DALELFSKMQ+GRLKPNGVTLVS
Subjt:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS

Query:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY
        +LTACAHAGFVD+GLRIF+TM+EFYGVE ++EHYGC+VDLLGRSGLFSEAEDLI SMPMKPNAAVWGALLGACRIHGN E+AERVGKILLELEPHNSGRY
Subjt:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY

Query:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE
         LLSNIYAKAGRFDDVAKIRK MKDRGIKTLPGVS+VDLNG VHEFKMGDGSHPQMKEIY KL+ I ERLQMAG+SP TSQVLFDI+EEEKETA +YHSE
Subjt:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE

Query:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        +LAIAFGLINT PGKPIHI+KNLRVC+DCHSATKLISQIY+REIIVRDR+RYHHF+NGTCSC+DF
Subjt:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

TrEMBL top hitse value%identityAlignment
A0A0A0LYD3 DYW_deaminase domain-containing protein0.0e+0087.67Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        MSSI+T+HLPSP KP  FS EK IPTSKLPQKTVLKLFDSKS TSLQYL+++HGLVLRSGHFQDHYVSGAL+KCYANPHFSNF FALKVFSSIPNPNVFI
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW
        WNIVIKGCLEN+K FKAIYFYGRM++DARPNKFTYPTLFKACSVA+AVQEGRQIHGHVVKHG+GSD+HIKSAGI MYASFG LEDARK+F +GESDVVCW
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV
        NTMIDGYLKCG LEAAKGLF QMP++NIGSWNVMING AKGG LGDARK+FDEMSERDEI+WSSMVDGYISAG YKEALEIFQ MQRE+TRPG+FIL SV
Subjt:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV

Query:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS
        LAACS+IGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKC RLDM WEVFEEMKE+E+FTWNAMIGG AIHGRA DALELFSK+QEGR+KPNG+TLV 
Subjt:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS

Query:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY
        +LTACAHAGFVD+GLRIF+TM+EFYGV+ E+EHYGC+VDLLGRSGLFSEAEDLI SMPMKPNAAVWGALLGACRIHGN ++AERVGKILLELEP NSGRY
Subjt:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY

Query:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE
         LLSNIYAK GRFDDV+KIRK MKDRGIKT+PGVS+VDLNG VHEFKMGDGSHPQMKEIY KL+ I ERLQMAG+SPDTSQVLFDI+EEEKETAV YHSE
Subjt:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE

Query:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        KLAIAFGLINT PGK IHIVKNLRVCDDCHSATKLISQI++REIIVRDR+RYHHFKNGTCSC+DF
Subjt:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

A0A1S4DY81 pentatricopeptide repeat-containing protein At5g48910-like isoform X10.0e+0089.47Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        MSSI+T+HLPSP KP   S EKKIPTSKLPQKTVLKLFDSKS TSLQYL +VHGLVLRSGHFQDHYVSGAL+KCYANPHFSNF FALKVFSSIPNPNVFI
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW
        WNIVIKGCLEN+K FKAIYFYGRM++DARPNKFTYPTLFKACSVA+AVQEGRQIHGHVVKHG+GSDMHIKSAGIQMYASFG LEDARKLF +GESDVVCW
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV
        NTMIDGYLKCGDLEAAKGLF QMP+RNIGSWNVMING AKGGKLGDARK+FDEMSERDEI+WSSMVDGYISAGCYKEALEIFQ MQRE+TRPG+FIL SV
Subjt:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV

Query:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS
        LAACS+IGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKC RLDM WEVFEEMKE+E+FTWNAMIGG AIHGRA DALELFSKMQEGR+KPNGVTLV 
Subjt:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS

Query:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY
        +LTACAHAGFVD+GLRIF+TM+EFYGV+ E+EHYGC+VDLLGRSGLFSEAEDLI SMPMKPNAAVWGALLGACRIHGN  +AERVGKILLELEP NSGRY
Subjt:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY

Query:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE
         LLSNIYA  GRFDDVAKIRK MKDRGIKTLPGVS VDLNG VHEFKMGDGSH QMKEIY KL+ I ERLQMAG+SPDTSQVLFDIEEEEKETAVQYHSE
Subjt:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE

Query:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        KLAIAFGLINT PG+ IHIVKNLRVCDDCHSATKLISQIY+REIIVRDR+RYHHFKNGTCSC+DF
Subjt:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

A0A5A7UBJ6 Pentatricopeptide repeat-containing protein0.0e+0089.47Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        MSSI+T+HLPSP KP   S EKKIPTSKLPQKTVLKLFDSKS TSLQYL +VHGLVLRSGHFQDHYVSGAL+KCYANPHFSNF FALKVFSSIPNPNVFI
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW
        WNIVIKGCLEN+K FKAIYFYGRM++DARPNKFTYPTLFKACSVA+AVQEGRQIHGHVVKHG+GSDMHIKSAGIQMYASFG LEDARKLF +GESDVVCW
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV
        NTMIDGYLKCGDLEAAKGLF QMP+RNIGSWNVMING AKGGKLGDARK+FDEMSERDEI+WSSMVDGYISAGCYKEALEIFQ MQRE+TRPG+FIL SV
Subjt:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV

Query:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS
        LAACS+IGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKC RLDM WEVFEEMKE+E+FTWNAMIGG AIHGRA DALELFSKMQEGR+KPNGVTLV 
Subjt:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS

Query:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY
        +LTACAHAGFVD+GLRIF+TM+EFYGV+ E+EHYGC+VDLLGRSGLFSEAEDLI SMPMKPNAAVWGALLGACRIHGN  +AERVGKILLELEP NSGRY
Subjt:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY

Query:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE
         LLSNIYA  GRFDDVAKIRK MKDRGIKTLPGVS VDLNG VHEFKMGDGSH QMKEIY KL+ I ERLQMAG+SPDTSQVLFDIEEEEKETAVQYHSE
Subjt:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE

Query:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        KLAIAFGLINT PG+ IHIVKNLRVCDDCHSATKLISQIY+REIIVRDR+RYHHFKNGTCSC+DF
Subjt:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

A0A6J1EZS2 pentatricopeptide repeat-containing protein At5g66520-like isoform X10.0e+0086.77Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        M SI+T  LPSP KPEHF  E+K+PTSKLPQKTVLKLFDSKS TSL+YLS+VHGL+LRSGHFQDHYVSGALVKCYANPHF NF FALKVFS IPNPNVFI
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW
        WNIVIKGCLEN  PFKAIYFYGRM++DARPNKF+YPTLF ACSVA+AV EG QIH HVVKHG GSDMHIKSAGIQMY SFG  EDARKL DNGESDVVCW
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV
        NT+IDGYLKCG+LEAAKGLFEQMP RNIGSWNVMI+GFAKGGKLGDARK+FDEM +RDEITWSSMVDGYISAGCYKEALEIFQLMQR++  PG+FILCSV
Subjt:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV

Query:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS
        LAACSS+GAIDQGRWVHAYL+RNSIKLDAVLGTALLDMYAKC RLDMAW+VF E++E+E+FTWNAMIGG AIHGRA DALE+FSKMQ+GRLKPNGVTLVS
Subjt:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS

Query:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY
        ILTACA+AGFVDRGLRIFE MKEFYGVE EM HYGCVVDLLGRSGLFSEAE+LI SMPMKPNAAVWGALLG CRIHGN+E+AERVGKILLEL+ HNSG Y
Subjt:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY

Query:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE
         LLSNIYAKAGRFDDVAKIRK MK++GIKT+PGVSMVDLNG VHEFKMGD +H QMKEIY KLE+I ERLQMAGYSPDTSQVLFDIEEEEKE+AV+YHSE
Subjt:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE

Query:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        KLAIAFGLINTSPGK IHIVKNLR+CDDCHSATKLISQIY++EIIVRDR+RYHHFKNGTCSC+DF
Subjt:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

A0A6J1JJU5 pentatricopeptide repeat-containing protein At5g66520-like isoform X10.0e+0087.37Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        MSSI+T  LPSP +PEHF  E+K+PTSKLPQKTVLKLFDSKS TSL+YLS+VHGL+LRSGHFQDHYVSGAL+KCYANPHFSNF FALKVFSSIPNPNVFI
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW
        WNIVIKGCLEN+ PFKAIYFYGRM++DARPNKF+YPTLFKACSVA+AV EG QIH HVVKHG G+DMHIKSAGIQMY SFG  EDARKL DNGESDVVCW
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV
        NT+IDGYLKCG+LEAAKGLFEQMP RNIGSWNVMI+GFAKGGKLGDARKVFDEM +RDEITWSSMVDGYISAGCYKEALEIFQLMQR++  PG+FILCSV
Subjt:  NTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSV

Query:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS
        LAACSS+GAIDQGRWVHAYL+RNSIKLDAVLGTALLDMYAKC RLDMAW+VF E++++EVFTWNAMIGG AIHGRA DALELFSKMQ+GRLKPNGVTLVS
Subjt:  LAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVS

Query:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY
        ILTACAHAGFVDRGLRIFETMKEFYGVE EM HYGCVVDLLGRSGLFSEAE+LI SMPMKPNAAVWGALLG CRIHGN+E+AERVGKILLEL+  NSG Y
Subjt:  ILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRY

Query:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE
         LLSNIYAKAGRFDDVAKIRK MK++GIKT+PGVSMVDLNG VHEFKMGD SHPQMKEIY KLE+I ERL+MAGYSPDTSQVLFDIEEEEKE+AV+YHSE
Subjt:  ALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSE

Query:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        KLAIAFGLINTSPGK IHIVKNLR+CDDCHSATKLISQIY++EIIVRDR+RYHHFK GTCSC+DF
Subjt:  KLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic7.9e-16139.19Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        M+  +T    S P+  +FS   + PT+   +   + L +     SL+ L + HG ++R+G F D Y +  L    A   F++  +A KVF  IP PN F 
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDAR--PNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDN-GESDV
        WN +I+       P  +I+ +  M+ +++  PNK+T+P L KA +   ++  G+ +HG  VK  +GSD+ + ++ I  Y S G L+ A K+F    E DV
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDAR--PNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDN-GESDV

Query:  VCWNTMIDGYLKCGDLEAAKGLFEQM--------------------PIRNIGSW-------------------NVMINGFAKGGKLGDARKVFDEMSERD
        V WN+MI+G+++ G  + A  LF++M                     IRN+                      N M++ + K G + DA+++FD M E+D
Subjt:  VCWNTMIDGYLKCGDLEAAKGLFEQM--------------------PIRNIGSW-------------------NVMINGFAKGGKLGDARKVFDEMSERD

Query:  EITWSSMVDGYISAGCYKEALEIFQLMQRED--------------------------------TRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIK
         +TW++M+DGY  +  Y+ A E+   M ++D                                 +  +  L S L+AC+ +GA++ GRW+H+Y+K++ I+
Subjt:  EITWSSMVDGYISAGCYKEALEIFQLMQRED--------------------------------TRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIK

Query:  LDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYG
        ++  + +AL+ MY+KC  L+ + EVF  +++++VF W+AMIGG A+HG   +A+++F KMQE  +KPNGVT  ++  AC+H G VD    +F  M+  YG
Subjt:  LDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYG

Query:  VESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDR
        +  E +HY C+VD+LGRSG   +A   I +MP+ P+ +VWGALLGAC+IH NL +AE     LLELEP N G + LLSNIYAK G++++V+++RK M+  
Subjt:  VESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDR

Query:  GIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEE-KETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRV
        G+K  PG S ++++GM+HEF  GD +HP  +++Y KL ++ E+L+  GY P+ SQVL  IEEEE KE ++  HSEKLAI +GLI+T   K I ++KNLRV
Subjt:  GIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEE-KETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRV

Query:  CDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        C DCHS  KLISQ+Y+REIIVRDR R+HHF+NG CSC DF
Subjt:  CDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

Q9FG16 Pentatricopeptide repeat-containing protein At5g065405.1e-15240.44Show/hide
Query:  KLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVK-CYANPHFSN----FGFALKVFSSIPNPNVFIWNIVIKGCLENDKPFKAIYFYGRMIVD-AR
        KL   +S +S   L  +HG +LR+    D +V+  L+  C  +  F+      G+A  +FS I NPN+F++N++I+      +P KA  FY +M+     
Subjt:  KLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVK-CYANPHFSN----FGFALKVFSSIPNPNVFIWNIVIKGCLENDKPFKAIYFYGRMIVD-AR

Query:  PNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDN-GESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNI
        P+  T+P L KA S  E V  G Q H  +V+ G  +D++++++ + MYA+ G +  A ++F   G  DVV W +M+ GY KCG +E A+ +F++MP RN+
Subjt:  PNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDN-GESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNI

Query:  GSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIKLD
         +W++MING+AK                                 C+++A+++F+ M+RE     + ++ SV+++C+ +GA++ G   + Y+ ++ + ++
Subjt:  GSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIKLD

Query:  AVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYGVE
         +LGTAL+DM+ +C  ++ A  VFE + E +  +W+++I G A+HG A  A+  FS+M      P  VT  ++L+AC+H G V++GL I+E MK+ +G+E
Subjt:  AVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYGVE

Query:  SEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDRGI
          +EHYGC+VD+LGR+G  +EAE+ I  M +KPNA + GALLGAC+I+ N E+AERVG +L++++P +SG Y LLSNIYA AG++D +  +R  MK++ +
Subjt:  SEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDRGI

Query:  KTLPGVSMVDLNGMVHEFKMGDG-SHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRVCD
        K  PG S+++++G +++F MGD   HP+M +I  K E+I  ++++ GY  +T    FD++EEEKE+++  HSEKLAIA+G++ T PG  I IVKNLRVC+
Subjt:  KTLPGVSMVDLNGMVHEFKMGDG-SHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRVCD

Query:  DCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        DCH+ TKLIS++Y RE+IVRDR R+HHF+NG CSCRD+
Subjt:  DCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

Q9FI80 Pentatricopeptide repeat-containing protein At5g489101.3e-16843.71Show/hide
Query:  TTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANP--HFSNFGFALKVFSSIPNPNVFIWNI
        T  L SP      S     P+S  PQ          +  +++ LS++H + ++SG  +D   +  +++  A    H  +  +A K+F+ +P  N F WN 
Subjt:  TTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANP--HFSNFGFALKVFSSIPNPNVFIWNI

Query:  VIKGCLEN--DKPFKAIYFYGRMIVD--ARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNG--ESDV
        +I+G  E+  DK   AI  +  M+ D    PN+FT+P++ KAC+    +QEG+QIHG  +K+G G D  + S  ++MY   G ++DAR LF     E D+
Subjt:  VIKGCLEN--DKPFKAIYFYGRMIVD--ARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNG--ESDV

Query:  VCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFIL
        V    M D   + G+               I  WNVMI+G+ + G    AR +FD+M +R  ++W++M+ GY   G +K+A+E+F+ M++ D RP    L
Subjt:  VCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFIL

Query:  CSVLAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVT
         SVL A S +G+++ G W+H Y + + I++D VLG+AL+DMY+KC  ++ A  VFE +  + V TW+AMI GFAIHG+AGDA++ F KM++  ++P+ V 
Subjt:  CSVLAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVT

Query:  LVSILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNS
         +++LTAC+H G V+ G R F  M    G+E  +EHYGC+VDLLGRSGL  EAE+ I +MP+KP+  +W ALLGACR+ GN+EM +RV  IL+++ PH+S
Subjt:  LVSILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNS

Query:  GRYALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQY
        G Y  LSN+YA  G + +V+++R RMK++ I+  PG S++D++G++HEF + D SHP+ KEI S L +I+++L++AGY P T+QVL ++EEE+KE  + Y
Subjt:  GRYALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQY

Query:  HSEKLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        HSEK+A AFGLI+TSPGKPI IVKNLR+C+DCHS+ KLIS++Y R+I VRDR R+HHF++G+CSC D+
Subjt:  HSEKLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665201.0e-15541.13Show/hide
Query:  LSEVHGLVLRSGHFQDHY-VSGALVKCYANPHFSNFGFALKVFSSIPNPNVFIWNIVIKGCLENDKPFKAIYFYGRMIVDARP-NKFTYPTLFKACSVAE
        L ++H  +L++G  QD Y ++  L  C ++       +A  VF     P+ F+WN++I+G   +D+P +++  Y RM+  + P N +T+P+L KACS   
Subjt:  LSEVHGLVLRSGHFQDHY-VSGALVKCYANPHFSNFGFALKVFSSIPNPNVFIWNIVIKGCLENDKPFKAIYFYGRMIVDARP-NKFTYPTLFKACSVAE

Query:  AVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGD
        A +E  QIH  + K G                               E+DV   N++I+ Y   G+ + A  LF+++P  +  SWN +I G+ K GK+  
Subjt:  AVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGD

Query:  ARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLD
        A  +F +M+E++ I+W++M+ GY+ A   KEAL++F  MQ  D  P    L + L+AC+ +GA++QG+W+H+YL +  I++D+VLG  L+DMYAKC  ++
Subjt:  ARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLD

Query:  MAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGL
         A EVF+ +K+K V  W A+I G+A HG   +A+  F +MQ+  +KPN +T  ++LTAC++ G V+ G  IF +M+  Y ++  +EHYGC+VDLLGR+GL
Subjt:  MAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGL

Query:  FSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEF
          EA+  I+ MP+KPNA +WGALL ACRIH N+E+ E +G+IL+ ++P++ GRY   +NI+A   ++D  A+ R+ MK++G+  +PG S + L G  HEF
Subjt:  FSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEF

Query:  KMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFD-IEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREII
          GD SHP++++I SK   +  +L+  GY P+  ++L D ++++E+E  V  HSEKLAI +GLI T PG  I I+KNLRVC DCH  TKLIS+IY R+I+
Subjt:  KMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFD-IEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREII

Query:  VRDRIRYHHFKNGTCSCRDF
        +RDR R+HHF++G CSC D+
Subjt:  VRDRIRYHHFKNGTCSCRDF

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic9.0e-16539.65Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKS----TTSLQYLSEVHGLVLRSGHFQDHYVSGALVK-CYANPHFSNFGFALKVFSSIPN
        M S +   +PS   P HF     +P+S  P    ++   S S      +LQ L  +H  +++ G    +Y    L++ C  +PHF    +A+ VF +I  
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKS----TTSLQYLSEVHGLVLRSGHFQDHYVSGALVK-CYANPHFSNFGFALKVFSSIPN

Query:  PNVFIWNIVIKGCLENDKPFKAIYFYGRMI-VDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNG-
        PN+ IWN + +G   +  P  A+  Y  MI +   PN +T+P + K+C+ ++A +EG+QIHGHV+K G   D+++ ++ I MY   G LEDA K+FD   
Subjt:  PNVFIWNIVIKGCLENDKPFKAIYFYGRMI-VDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNG-

Query:  ESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSE-------------------------------------
          DVV +  +I GY   G +E A+ LF+++P++++ SWN MI+G+A+ G   +A ++F +M +                                     
Subjt:  ESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSE-------------------------------------

Query:  ---------------------------------RDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKR-
                                         +D I+W++++ GY     YKEAL +FQ M R    P    + S+L AC+ +GAID GRW+H Y+ + 
Subjt:  ---------------------------------RDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKR-

Query:  -NSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETM
           +   + L T+L+DMYAKC  ++ A +VF  +  K + +WNAMI GFA+HGRA  + +LFS+M++  ++P+ +T V +L+AC+H+G +D G  IF TM
Subjt:  -NSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETM

Query:  KEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRK
         + Y +  ++EHYGC++DLLG SGLF EAE++I  M M+P+  +W +LL AC++HGN+E+ E   + L+++EP N G Y LLSNIYA AGR+++VAK R 
Subjt:  KEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRK

Query:  RMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVK
         + D+G+K +PG S ++++ +VHEF +GD  HP+ +EIY  LE++   L+ AG+ PDTS+VL ++EEE KE A+++HSEKLAIAFGLI+T PG  + IVK
Subjt:  RMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVK

Query:  NLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        NLRVC +CH ATKLIS+IY REII RDR R+HHF++G CSC D+
Subjt:  NLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.4e-16639.65Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKS----TTSLQYLSEVHGLVLRSGHFQDHYVSGALVK-CYANPHFSNFGFALKVFSSIPN
        M S +   +PS   P HF     +P+S  P    ++   S S      +LQ L  +H  +++ G    +Y    L++ C  +PHF    +A+ VF +I  
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKS----TTSLQYLSEVHGLVLRSGHFQDHYVSGALVK-CYANPHFSNFGFALKVFSSIPN

Query:  PNVFIWNIVIKGCLENDKPFKAIYFYGRMI-VDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNG-
        PN+ IWN + +G   +  P  A+  Y  MI +   PN +T+P + K+C+ ++A +EG+QIHGHV+K G   D+++ ++ I MY   G LEDA K+FD   
Subjt:  PNVFIWNIVIKGCLENDKPFKAIYFYGRMI-VDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNG-

Query:  ESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSE-------------------------------------
          DVV +  +I GY   G +E A+ LF+++P++++ SWN MI+G+A+ G   +A ++F +M +                                     
Subjt:  ESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSE-------------------------------------

Query:  ---------------------------------RDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKR-
                                         +D I+W++++ GY     YKEAL +FQ M R    P    + S+L AC+ +GAID GRW+H Y+ + 
Subjt:  ---------------------------------RDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKR-

Query:  -NSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETM
           +   + L T+L+DMYAKC  ++ A +VF  +  K + +WNAMI GFA+HGRA  + +LFS+M++  ++P+ +T V +L+AC+H+G +D G  IF TM
Subjt:  -NSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETM

Query:  KEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRK
         + Y +  ++EHYGC++DLLG SGLF EAE++I  M M+P+  +W +LL AC++HGN+E+ E   + L+++EP N G Y LLSNIYA AGR+++VAK R 
Subjt:  KEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRK

Query:  RMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVK
         + D+G+K +PG S ++++ +VHEF +GD  HP+ +EIY  LE++   L+ AG+ PDTS+VL ++EEE KE A+++HSEKLAIAFGLI+T PG  + IVK
Subjt:  RMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVK

Query:  NLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        NLRVC +CH ATKLIS+IY REII RDR R+HHF++G CSC D+
Subjt:  NLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.6e-16239.19Show/hide
Query:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI
        M+  +T    S P+  +FS   + PT+   +   + L +     SL+ L + HG ++R+G F D Y +  L    A   F++  +A KVF  IP PN F 
Subjt:  MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENDKPFKAIYFYGRMIVDAR--PNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDN-GESDV
        WN +I+       P  +I+ +  M+ +++  PNK+T+P L KA +   ++  G+ +HG  VK  +GSD+ + ++ I  Y S G L+ A K+F    E DV
Subjt:  WNIVIKGCLENDKPFKAIYFYGRMIVDAR--PNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDN-GESDV

Query:  VCWNTMIDGYLKCGDLEAAKGLFEQM--------------------PIRNIGSW-------------------NVMINGFAKGGKLGDARKVFDEMSERD
        V WN+MI+G+++ G  + A  LF++M                     IRN+                      N M++ + K G + DA+++FD M E+D
Subjt:  VCWNTMIDGYLKCGDLEAAKGLFEQM--------------------PIRNIGSW-------------------NVMINGFAKGGKLGDARKVFDEMSERD

Query:  EITWSSMVDGYISAGCYKEALEIFQLMQRED--------------------------------TRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIK
         +TW++M+DGY  +  Y+ A E+   M ++D                                 +  +  L S L+AC+ +GA++ GRW+H+Y+K++ I+
Subjt:  EITWSSMVDGYISAGCYKEALEIFQLMQRED--------------------------------TRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIK

Query:  LDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYG
        ++  + +AL+ MY+KC  L+ + EVF  +++++VF W+AMIGG A+HG   +A+++F KMQE  +KPNGVT  ++  AC+H G VD    +F  M+  YG
Subjt:  LDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYG

Query:  VESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDR
        +  E +HY C+VD+LGRSG   +A   I +MP+ P+ +VWGALLGAC+IH NL +AE     LLELEP N G + LLSNIYAK G++++V+++RK M+  
Subjt:  VESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDR

Query:  GIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEE-KETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRV
        G+K  PG S ++++GM+HEF  GD +HP  +++Y KL ++ E+L+  GY P+ SQVL  IEEEE KE ++  HSEKLAI +GLI+T   K I ++KNLRV
Subjt:  GIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEE-KETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRV

Query:  CDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        C DCHS  KLISQ+Y+REIIVRDR R+HHF+NG CSC DF
Subjt:  CDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein3.6e-15340.44Show/hide
Query:  KLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVK-CYANPHFSN----FGFALKVFSSIPNPNVFIWNIVIKGCLENDKPFKAIYFYGRMIVD-AR
        KL   +S +S   L  +HG +LR+    D +V+  L+  C  +  F+      G+A  +FS I NPN+F++N++I+      +P KA  FY +M+     
Subjt:  KLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVK-CYANPHFSN----FGFALKVFSSIPNPNVFIWNIVIKGCLENDKPFKAIYFYGRMIVD-AR

Query:  PNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDN-GESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNI
        P+  T+P L KA S  E V  G Q H  +V+ G  +D++++++ + MYA+ G +  A ++F   G  DVV W +M+ GY KCG +E A+ +F++MP RN+
Subjt:  PNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDN-GESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNI

Query:  GSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIKLD
         +W++MING+AK                                 C+++A+++F+ M+RE     + ++ SV+++C+ +GA++ G   + Y+ ++ + ++
Subjt:  GSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIKLD

Query:  AVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYGVE
         +LGTAL+DM+ +C  ++ A  VFE + E +  +W+++I G A+HG A  A+  FS+M      P  VT  ++L+AC+H G V++GL I+E MK+ +G+E
Subjt:  AVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYGVE

Query:  SEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDRGI
          +EHYGC+VD+LGR+G  +EAE+ I  M +KPNA + GALLGAC+I+ N E+AERVG +L++++P +SG Y LLSNIYA AG++D +  +R  MK++ +
Subjt:  SEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDRGI

Query:  KTLPGVSMVDLNGMVHEFKMGDG-SHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRVCD
        K  PG S+++++G +++F MGD   HP+M +I  K E+I  ++++ GY  +T    FD++EEEKE+++  HSEKLAIA+G++ T PG  I IVKNLRVC+
Subjt:  KTLPGVSMVDLNGMVHEFKMGDG-SHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRVCD

Query:  DCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        DCH+ TKLIS++Y RE+IVRDR R+HHF+NG CSCRD+
Subjt:  DCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein9.6e-17043.71Show/hide
Query:  TTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANP--HFSNFGFALKVFSSIPNPNVFIWNI
        T  L SP      S     P+S  PQ          +  +++ LS++H + ++SG  +D   +  +++  A    H  +  +A K+F+ +P  N F WN 
Subjt:  TTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANP--HFSNFGFALKVFSSIPNPNVFIWNI

Query:  VIKGCLEN--DKPFKAIYFYGRMIVD--ARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNG--ESDV
        +I+G  E+  DK   AI  +  M+ D    PN+FT+P++ KAC+    +QEG+QIHG  +K+G G D  + S  ++MY   G ++DAR LF     E D+
Subjt:  VIKGCLEN--DKPFKAIYFYGRMIVD--ARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNG--ESDV

Query:  VCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFIL
        V    M D   + G+               I  WNVMI+G+ + G    AR +FD+M +R  ++W++M+ GY   G +K+A+E+F+ M++ D RP    L
Subjt:  VCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFIL

Query:  CSVLAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVT
         SVL A S +G+++ G W+H Y + + I++D VLG+AL+DMY+KC  ++ A  VFE +  + V TW+AMI GFAIHG+AGDA++ F KM++  ++P+ V 
Subjt:  CSVLAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVT

Query:  LVSILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNS
         +++LTAC+H G V+ G R F  M    G+E  +EHYGC+VDLLGRSGL  EAE+ I +MP+KP+  +W ALLGACR+ GN+EM +RV  IL+++ PH+S
Subjt:  LVSILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNS

Query:  GRYALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQY
        G Y  LSN+YA  G + +V+++R RMK++ I+  PG S++D++G++HEF + D SHP+ KEI S L +I+++L++AGY P T+QVL ++EEE+KE  + Y
Subjt:  GRYALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQY

Query:  HSEKLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF
        HSEK+A AFGLI+TSPGKPI IVKNLR+C+DCHS+ KLIS++Y R+I VRDR R+HHF++G+CSC D+
Subjt:  HSEKLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTCSCRDF

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.1e-15741.13Show/hide
Query:  LSEVHGLVLRSGHFQDHY-VSGALVKCYANPHFSNFGFALKVFSSIPNPNVFIWNIVIKGCLENDKPFKAIYFYGRMIVDARP-NKFTYPTLFKACSVAE
        L ++H  +L++G  QD Y ++  L  C ++       +A  VF     P+ F+WN++I+G   +D+P +++  Y RM+  + P N +T+P+L KACS   
Subjt:  LSEVHGLVLRSGHFQDHY-VSGALVKCYANPHFSNFGFALKVFSSIPNPNVFIWNIVIKGCLENDKPFKAIYFYGRMIVDARP-NKFTYPTLFKACSVAE

Query:  AVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGD
        A +E  QIH  + K G                               E+DV   N++I+ Y   G+ + A  LF+++P  +  SWN +I G+ K GK+  
Subjt:  AVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCWNTMIDGYLKCGDLEAAKGLFEQMPIRNIGSWNVMINGFAKGGKLGD

Query:  ARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLD
        A  +F +M+E++ I+W++M+ GY+ A   KEAL++F  MQ  D  P    L + L+AC+ +GA++QG+W+H+YL +  I++D+VLG  L+DMYAKC  ++
Subjt:  ARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCARLD

Query:  MAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGL
         A EVF+ +K+K V  W A+I G+A HG   +A+  F +MQ+  +KPN +T  ++LTAC++ G V+ G  IF +M+  Y ++  +EHYGC+VDLLGR+GL
Subjt:  MAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDLLGRSGL

Query:  FSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEF
          EA+  I+ MP+KPNA +WGALL ACRIH N+E+ E +G+IL+ ++P++ GRY   +NI+A   ++D  A+ R+ MK++G+  +PG S + L G  HEF
Subjt:  FSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEF

Query:  KMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFD-IEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREII
          GD SHP++++I SK   +  +L+  GY P+  ++L D ++++E+E  V  HSEKLAI +GLI T PG  I I+KNLRVC DCH  TKLIS+IY R+I+
Subjt:  KMGDGSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFD-IEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREII

Query:  VRDRIRYHHFKNGTCSCRDF
        +RDR R+HHF++G CSC D+
Subjt:  VRDRIRYHHFKNGTCSCRDF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCAATCACCACCACACACCTCCCTTCTCCCCCCAAACCAGAACACTTCTCGGGAGAGAAGAAGATTCCCACATCAAAACTTCCACAGAAAACAGTTCTAAAGCT
TTTCGACTCAAAATCCACCACTTCTCTGCAATACCTCAGCGAAGTTCATGGGCTTGTATTGAGAAGCGGCCATTTCCAAGACCATTATGTCTCTGGCGCGTTGGTGAAGT
GTTATGCAAATCCCCATTTCAGCAATTTCGGTTTCGCCTTGAAGGTATTCTCCTCAATTCCAAATCCCAACGTTTTCATTTGGAATATTGTGATTAAAGGGTGTTTAGAG
AACGACAAACCGTTTAAAGCTATTTACTTCTATGGTAGGATGATTGTTGATGCTAGGCCCAATAAATTCACATACCCAACTCTGTTTAAAGCTTGTTCTGTGGCAGAAGC
TGTTCAAGAAGGGAGGCAAATTCATGGCCATGTGGTGAAACATGGCCTTGGTAGTGACATGCATATAAAAAGCGCTGGAATTCAAATGTATGCCTCTTTTGGTGGCTTAG
AAGATGCTAGGAAACTGTTTGATAATGGGGAATCTGATGTTGTCTGTTGGAATACTATGATTGATGGGTACCTGAAATGTGGGGATTTGGAGGCTGCTAAAGGGTTGTTT
GAGCAAATGCCAATCAGAAACATTGGATCGTGGAATGTGATGATCAATGGCTTTGCTAAGGGTGGGAAGTTGGGAGATGCAAGGAAGGTGTTTGATGAAATGAGTGAAAG
AGATGAAATTACTTGGAGCTCTATGGTAGATGGTTACATATCAGCAGGTTGTTACAAGGAAGCATTGGAAATTTTCCAGCTAATGCAAAGAGAGGATACTAGACCTGGAA
AGTTCATTTTGTGCAGTGTGCTAGCTGCGTGTTCCAGTATTGGAGCCATTGATCAAGGGAGGTGGGTTCATGCTTATCTCAAGAGGAACTCCATTAAATTGGATGCAGTG
TTGGGGACTGCCTTATTGGATATGTATGCTAAATGTGCACGGCTGGACATGGCATGGGAGGTATTTGAAGAAATGAAGGAAAAAGAGGTCTTCACTTGGAATGCCATGAT
TGGTGGGTTTGCTATACATGGAAGAGCAGGGGATGCACTTGAGCTTTTCTCTAAGATGCAGGAGGGGAGGTTGAAACCTAATGGAGTCACGCTTGTTAGCATCCTAACTG
CTTGTGCTCATGCAGGTTTTGTTGACAGAGGCCTGAGGATTTTTGAAACGATGAAAGAGTTTTATGGTGTTGAGTCTGAAATGGAACATTATGGATGTGTGGTTGATTTG
TTAGGGAGGTCAGGGTTATTTTCTGAAGCAGAGGATTTGATAAGGTCAATGCCCATGAAGCCCAATGCAGCTGTTTGGGGAGCACTCTTAGGTGCCTGCAGGATTCATGG
AAATTTGGAAATGGCTGAAAGAGTGGGGAAGATTTTGCTCGAATTAGAGCCACACAACAGTGGCCGGTATGCATTACTGTCAAATATATATGCAAAGGCAGGTAGGTTTG
ATGATGTTGCTAAAATAAGAAAAAGGATGAAGGATAGGGGGATAAAAACATTGCCTGGTGTCAGCATGGTTGATCTAAATGGTATGGTTCACGAATTCAAAATGGGTGAT
GGATCGCATCCACAAATGAAGGAAATTTATAGCAAGCTGGAACAAATAAATGAGAGGCTGCAGATGGCAGGTTATTCACCTGATACATCTCAAGTTTTATTTGATATTGA
AGAGGAAGAGAAGGAAACTGCAGTCCAATACCATAGTGAAAAGCTTGCAATTGCTTTTGGATTGATTAATACCTCGCCGGGTAAACCAATTCACATAGTGAAGAACTTGA
GGGTCTGTGATGATTGCCATTCAGCCACAAAGCTCATTTCTCAAATTTATAATCGAGAAATAATTGTAAGGGATCGCATTCGTTATCACCACTTTAAAAATGGAACTTGC
TCATGTAGAGATTTTTGTATACAAGCCAAGCTTGGAGTTGAATTCTTCAAAGCTTGTCCTTGGCAATGCTTTGGTGAGAATGTCAGTGTCAGGCCTCCAATTATTCACAC
CTATGTTCGTAGGGGCAGACCAGGGAATTCACCAAGAGCGGTTATCTTGGGAAAGACACGGAGAATGGCGCAGAAACAACTGGAAGATCACCTAGCCAAATCCGAAAAAA
AATTAGAAGGAATGAGGGAGAAATTACCGGAGATCGAAAAGTCAGTTGCGGAACTTAATCGAATCATGGAAAGATTGTTTAACAGTGTGGAAGACCAAAGACAACTCTCT
TTGGAGAATCTGCAAGCATTAGCAAACCTTGTCCAAGGCGGGTTCCGAGGAGGACCATCAACAGAGAAAGAGTCGGTACCTGGGCAAAAACGTAAGTTTCCTGAAGACGA
AAGTGCTATGTCTAGCCGGAAGGAGCATAGAGGAGAGGAACAATTTCACGAGCGTCACAAATTCAAGAAGGTGGAAATGCCCGAGTTTGATGGTGACCAACCTGACGACT
GGCTCTTTAGGGCAGAGAGATACTTCGACATCCATCAACTAACAGATCAAGAGAAGATAACAGCGACAACAATCTGTTCCACGGGAGCCGCGCTCAGGTGGTACCGTTGG
GCGGAGGGAAGACAACCCTTCTCTGGGTGGAAGAACTTGAAGTACAGAATCCTAGAGCGATTCAGGCCTTCACAAGAAGGATCGCTATGTGTGCGTTTCCTAGCGGTGAG
AAAGATGAAATCGGTAGCAAAGTACCGGGAACGGTTTGAGGCTTTGGCTTTGCCTTTGCCGCACCTATCTGATGAAGTCCTGGAAGGAACGTTCCTAAATGGGCTATCAC
CGGAAATTAAGACCGAAGTCATGTGCTTTGAACCCGTGGGCCTTGAGGCCATGATGAAAGCGGCCCAGCGAATAGAAGATAAAAATTCTGCTCTCTCTTCGAAGACTGGA
CCACGAACGTGGAAGCCCGCTACATCCGGAACTGTTGGGCTACCCAACACTCTGGGCCCTTCTTACACGCCAAAATTGCTTGAACCCGGCCCAATCAAGACTATTACTCT
GCCCAATATCCCGGAAAAAGGGTTGTGTTATCGATGCGATGAGAAGTATTTTGTAGGGCATAAATGCAAAAGTAAGGAATTGAAGGTTCTCGTTGTGTCTGACGAACAAG
GAGGAAAGGAAGAGATATCCGAGAATCCAGACCCACTAGAGTGCCGGGTAGAAGAAGTGGAAGAGTCTTCCGCCGAGATGGATACAACGGAGCTATCCCTCAACACGGTG
GTAGGGTTTTCTTCACCGAGAACTTTGAAGGTACGGGGACGACTTGAAGATCGTGAAGTAGTGATCCTGATAGACTGTGGGGCTACTCACAATTTTATTTCCCAGAAACT
TGTGGATGAGCTCAAGTTACCAGTTTCTGAAACTCTGAACTATGGGATAATAATGGGTACGGGAACAGCGGTTAAGGGAAGAGGCATTTGCAACGAAGTGATTGAAAGAT
CTAACTATCGCCGAAGATTACTTACCACTGGAGTTGGGAAGTATCGATATCATTCTCGGCATGCAGTGGCTGAGAACTCTCGGAGTAACAACCGTAGACTGGAAATCGTT
ATCGTTGACCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCAATCACCACCACACACCTCCCTTCTCCCCCCAAACCAGAACACTTCTCGGGAGAGAAGAAGATTCCCACATCAAAACTTCCACAGAAAACAGTTCTAAAGCT
TTTCGACTCAAAATCCACCACTTCTCTGCAATACCTCAGCGAAGTTCATGGGCTTGTATTGAGAAGCGGCCATTTCCAAGACCATTATGTCTCTGGCGCGTTGGTGAAGT
GTTATGCAAATCCCCATTTCAGCAATTTCGGTTTCGCCTTGAAGGTATTCTCCTCAATTCCAAATCCCAACGTTTTCATTTGGAATATTGTGATTAAAGGGTGTTTAGAG
AACGACAAACCGTTTAAAGCTATTTACTTCTATGGTAGGATGATTGTTGATGCTAGGCCCAATAAATTCACATACCCAACTCTGTTTAAAGCTTGTTCTGTGGCAGAAGC
TGTTCAAGAAGGGAGGCAAATTCATGGCCATGTGGTGAAACATGGCCTTGGTAGTGACATGCATATAAAAAGCGCTGGAATTCAAATGTATGCCTCTTTTGGTGGCTTAG
AAGATGCTAGGAAACTGTTTGATAATGGGGAATCTGATGTTGTCTGTTGGAATACTATGATTGATGGGTACCTGAAATGTGGGGATTTGGAGGCTGCTAAAGGGTTGTTT
GAGCAAATGCCAATCAGAAACATTGGATCGTGGAATGTGATGATCAATGGCTTTGCTAAGGGTGGGAAGTTGGGAGATGCAAGGAAGGTGTTTGATGAAATGAGTGAAAG
AGATGAAATTACTTGGAGCTCTATGGTAGATGGTTACATATCAGCAGGTTGTTACAAGGAAGCATTGGAAATTTTCCAGCTAATGCAAAGAGAGGATACTAGACCTGGAA
AGTTCATTTTGTGCAGTGTGCTAGCTGCGTGTTCCAGTATTGGAGCCATTGATCAAGGGAGGTGGGTTCATGCTTATCTCAAGAGGAACTCCATTAAATTGGATGCAGTG
TTGGGGACTGCCTTATTGGATATGTATGCTAAATGTGCACGGCTGGACATGGCATGGGAGGTATTTGAAGAAATGAAGGAAAAAGAGGTCTTCACTTGGAATGCCATGAT
TGGTGGGTTTGCTATACATGGAAGAGCAGGGGATGCACTTGAGCTTTTCTCTAAGATGCAGGAGGGGAGGTTGAAACCTAATGGAGTCACGCTTGTTAGCATCCTAACTG
CTTGTGCTCATGCAGGTTTTGTTGACAGAGGCCTGAGGATTTTTGAAACGATGAAAGAGTTTTATGGTGTTGAGTCTGAAATGGAACATTATGGATGTGTGGTTGATTTG
TTAGGGAGGTCAGGGTTATTTTCTGAAGCAGAGGATTTGATAAGGTCAATGCCCATGAAGCCCAATGCAGCTGTTTGGGGAGCACTCTTAGGTGCCTGCAGGATTCATGG
AAATTTGGAAATGGCTGAAAGAGTGGGGAAGATTTTGCTCGAATTAGAGCCACACAACAGTGGCCGGTATGCATTACTGTCAAATATATATGCAAAGGCAGGTAGGTTTG
ATGATGTTGCTAAAATAAGAAAAAGGATGAAGGATAGGGGGATAAAAACATTGCCTGGTGTCAGCATGGTTGATCTAAATGGTATGGTTCACGAATTCAAAATGGGTGAT
GGATCGCATCCACAAATGAAGGAAATTTATAGCAAGCTGGAACAAATAAATGAGAGGCTGCAGATGGCAGGTTATTCACCTGATACATCTCAAGTTTTATTTGATATTGA
AGAGGAAGAGAAGGAAACTGCAGTCCAATACCATAGTGAAAAGCTTGCAATTGCTTTTGGATTGATTAATACCTCGCCGGGTAAACCAATTCACATAGTGAAGAACTTGA
GGGTCTGTGATGATTGCCATTCAGCCACAAAGCTCATTTCTCAAATTTATAATCGAGAAATAATTGTAAGGGATCGCATTCGTTATCACCACTTTAAAAATGGAACTTGC
TCATGTAGAGATTTTTGTATACAAGCCAAGCTTGGAGTTGAATTCTTCAAAGCTTGTCCTTGGCAATGCTTTGGTGAGAATGTCAGTGTCAGGCCTCCAATTATTCACAC
CTATGTTCGTAGGGGCAGACCAGGGAATTCACCAAGAGCGGTTATCTTGGGAAAGACACGGAGAATGGCGCAGAAACAACTGGAAGATCACCTAGCCAAATCCGAAAAAA
AATTAGAAGGAATGAGGGAGAAATTACCGGAGATCGAAAAGTCAGTTGCGGAACTTAATCGAATCATGGAAAGATTGTTTAACAGTGTGGAAGACCAAAGACAACTCTCT
TTGGAGAATCTGCAAGCATTAGCAAACCTTGTCCAAGGCGGGTTCCGAGGAGGACCATCAACAGAGAAAGAGTCGGTACCTGGGCAAAAACGTAAGTTTCCTGAAGACGA
AAGTGCTATGTCTAGCCGGAAGGAGCATAGAGGAGAGGAACAATTTCACGAGCGTCACAAATTCAAGAAGGTGGAAATGCCCGAGTTTGATGGTGACCAACCTGACGACT
GGCTCTTTAGGGCAGAGAGATACTTCGACATCCATCAACTAACAGATCAAGAGAAGATAACAGCGACAACAATCTGTTCCACGGGAGCCGCGCTCAGGTGGTACCGTTGG
GCGGAGGGAAGACAACCCTTCTCTGGGTGGAAGAACTTGAAGTACAGAATCCTAGAGCGATTCAGGCCTTCACAAGAAGGATCGCTATGTGTGCGTTTCCTAGCGGTGAG
AAAGATGAAATCGGTAGCAAAGTACCGGGAACGGTTTGAGGCTTTGGCTTTGCCTTTGCCGCACCTATCTGATGAAGTCCTGGAAGGAACGTTCCTAAATGGGCTATCAC
CGGAAATTAAGACCGAAGTCATGTGCTTTGAACCCGTGGGCCTTGAGGCCATGATGAAAGCGGCCCAGCGAATAGAAGATAAAAATTCTGCTCTCTCTTCGAAGACTGGA
CCACGAACGTGGAAGCCCGCTACATCCGGAACTGTTGGGCTACCCAACACTCTGGGCCCTTCTTACACGCCAAAATTGCTTGAACCCGGCCCAATCAAGACTATTACTCT
GCCCAATATCCCGGAAAAAGGGTTGTGTTATCGATGCGATGAGAAGTATTTTGTAGGGCATAAATGCAAAAGTAAGGAATTGAAGGTTCTCGTTGTGTCTGACGAACAAG
GAGGAAAGGAAGAGATATCCGAGAATCCAGACCCACTAGAGTGCCGGGTAGAAGAAGTGGAAGAGTCTTCCGCCGAGATGGATACAACGGAGCTATCCCTCAACACGGTG
GTAGGGTTTTCTTCACCGAGAACTTTGAAGGTACGGGGACGACTTGAAGATCGTGAAGTAGTGATCCTGATAGACTGTGGGGCTACTCACAATTTTATTTCCCAGAAACT
TGTGGATGAGCTCAAGTTACCAGTTTCTGAAACTCTGAACTATGGGATAATAATGGGTACGGGAACAGCGGTTAAGGGAAGAGGCATTTGCAACGAAGTGATTGAAAGAT
CTAACTATCGCCGAAGATTACTTACCACTGGAGTTGGGAAGTATCGATATCATTCTCGGCATGCAGTGGCTGAGAACTCTCGGAGTAACAACCGTAGACTGGAAATCGTT
ATCGTTGACCATTGA
Protein sequenceShow/hide protein sequence
MSSITTTHLPSPPKPEHFSGEKKIPTSKLPQKTVLKLFDSKSTTSLQYLSEVHGLVLRSGHFQDHYVSGALVKCYANPHFSNFGFALKVFSSIPNPNVFIWNIVIKGCLE
NDKPFKAIYFYGRMIVDARPNKFTYPTLFKACSVAEAVQEGRQIHGHVVKHGLGSDMHIKSAGIQMYASFGGLEDARKLFDNGESDVVCWNTMIDGYLKCGDLEAAKGLF
EQMPIRNIGSWNVMINGFAKGGKLGDARKVFDEMSERDEITWSSMVDGYISAGCYKEALEIFQLMQREDTRPGKFILCSVLAACSSIGAIDQGRWVHAYLKRNSIKLDAV
LGTALLDMYAKCARLDMAWEVFEEMKEKEVFTWNAMIGGFAIHGRAGDALELFSKMQEGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKEFYGVESEMEHYGCVVDL
LGRSGLFSEAEDLIRSMPMKPNAAVWGALLGACRIHGNLEMAERVGKILLELEPHNSGRYALLSNIYAKAGRFDDVAKIRKRMKDRGIKTLPGVSMVDLNGMVHEFKMGD
GSHPQMKEIYSKLEQINERLQMAGYSPDTSQVLFDIEEEEKETAVQYHSEKLAIAFGLINTSPGKPIHIVKNLRVCDDCHSATKLISQIYNREIIVRDRIRYHHFKNGTC
SCRDFCIQAKLGVEFFKACPWQCFGENVSVRPPIIHTYVRRGRPGNSPRAVILGKTRRMAQKQLEDHLAKSEKKLEGMREKLPEIEKSVAELNRIMERLFNSVEDQRQLS
LENLQALANLVQGGFRGGPSTEKESVPGQKRKFPEDESAMSSRKEHRGEEQFHERHKFKKVEMPEFDGDQPDDWLFRAERYFDIHQLTDQEKITATTICSTGAALRWYRW
AEGRQPFSGWKNLKYRILERFRPSQEGSLCVRFLAVRKMKSVAKYRERFEALALPLPHLSDEVLEGTFLNGLSPEIKTEVMCFEPVGLEAMMKAAQRIEDKNSALSSKTG
PRTWKPATSGTVGLPNTLGPSYTPKLLEPGPIKTITLPNIPEKGLCYRCDEKYFVGHKCKSKELKVLVVSDEQGGKEEISENPDPLECRVEEVEESSAEMDTTELSLNTV
VGFSSPRTLKVRGRLEDREVVILIDCGATHNFISQKLVDELKLPVSETLNYGIIMGTGTAVKGRGICNEVIERSNYRRRLLTTGVGKYRYHSRHAVAENSRSNNRRLEIV
IVDH