; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G13380 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G13380
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat (PPR-like) superfamily protein
Genome locationClcChr07:28198692..28201538
RNA-Seq ExpressionClc07G13380
SyntenyClc07G13380
Gene Ontology termsGO:0031930 - mitochondria-nucleus signaling pathway (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051178.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0087.61Show/hide
Query:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG
        ML A SYQRFKS SFCFP LSINFHSQFSSITYD+DL +FFDHLL+QC+ IQHSKQVHSATVVTGAYCSAFV+ARLVSIY+RYGLVSDARKVF SAPFE 
Subjt:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG

Query:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM
        LSN LLWNSIIRANVYHGYC EAL LYGKMR+YGVLGDGFTFPLVLRASSNLG+ ++CKNLHCHVVQFGFQNHLHV NELIGMYAKLERMDDARK+FDKM
Subjt:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM

Query:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY
        RIKSVVSWNTM+SGYAYNYDVNGASRMFHQMELEGV+PNPVTWTSLLSSHARCGHL ET+VLF KMRMKGVGATAEMLAVVLSVCADL TLN G+MIHGY
Subjt:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY

Query:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL
        MVKGGF+DYLFAKNALIT+YGKGGD+GDAEKLFHEMK KNLVSWNALISS+AESG+YDKA EL SQLEKME YPEMKPNVITWS++ICGF+SKGLGEESL
Subjt:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL

Query:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA
        EVFRKMQLANVKANSVTIASV SICAMLAALNLGRE+HGHVIRA MD+N+LVGNGLINMYTKCGSFKPG +VFEKLENRDSISWNSMI GYG HGLGKDA
Subjt:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA

Query:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT
        L+T N MIKSG+RPD VTFIAALSACSHAGLVAEG WLF QM Q FKI+P++EHY+CMVDLLGRAGLVEEASNI+K MP EPNAYIWS++LNSCRMHKDT
Subjt:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT

Query:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI
        DLAEE A++I NLNS+ITGSHMLLSNIFAASCRWEDSARVRISAR+KGLKKVPG SWIEVKKKVY+FKAG++  EGLEKVDEILHDLA QIEN+D DD I
Subjt:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI

Query:  IE
        IE
Subjt:  IE

XP_004146851.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucumis sativus]0.0e+0088.75Show/hide
Query:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG
        ML A SYQRFKS SFCFP LSINFHSQFSSITYD+DL DFFDHLL+QC+ IQHSKQVHSATVVTGAYCSAFV+ARLVSIY+RYGLVSDARKVF SAPFE 
Subjt:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG

Query:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM
         SN LLWNSIIRANVYHGYC EALQLYGKMR+YGVLGDGFTFPL+LRASSNLG+FN+CKNLHCHVVQFGFQNHLHV NELIGMYAKLERMDDARK+FDKM
Subjt:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM

Query:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY
        RIKSVVSWNTM+SGYAYNYDVNGASRMFHQMELEGV+PNPVTWTSLLSSHARCGHLEET+VLF KMRMKGVG TAEMLAVVLSVCADL TLN G+MIHGY
Subjt:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY

Query:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL
        MVKGGF+DYLFAKNALIT+YGKGG +GDAEKLFHEMK KNLVSWNALISS+AESG+YDKA EL SQLEKME YPEMKPNVITWSA+ICGFASKGLGEESL
Subjt:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL

Query:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA
        EVFRKMQLANVKANSVTIASV SICAMLAALNLGRE+HGHVIRA MDDN+LVGNGLINMYTKCGSFKPG MVFEKLENRDSISWNSMI GYG HGLGKDA
Subjt:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA

Query:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT
        L+TFN MIKSG+RPD VTFIAALSACSHAGLVAEG WLF QM Q FKI+P++EHY+CMVDLLGRAGLVEEASNI+KGMP EPNAYIWS++LNSCRMHKDT
Subjt:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT

Query:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI
        DLAEE A++I NLNS+ITGSHMLLSNIFAASCRWEDSARVRISAR KGLKKVPG SWIEVKKKVYMFKAG++I EGLEKVDEILHDLA QIEN++ DD I
Subjt:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI

Query:  IE
        IE
Subjt:  IE

XP_008447611.1 PREDICTED: putative pentatricopeptide repeat-containing protein At1g17630 [Cucumis melo]0.0e+0087.46Show/hide
Query:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG
        ML A SYQRFKS SFCFP LSINFHSQFSSITYD+DL +FFDHLL+QC+ IQHSKQVHSATVVTGAYCSAFV+ARLVSIY+RYGLVSDARKVF SAPFE 
Subjt:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG

Query:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM
        LSN LLWNSIIRANVYHGYC EAL LYGKMR+YGVLGDGFTFPLVLRASSNLG+ ++CKNLHCHVVQFGFQNHLHV NELIGMYAKLERMDDARK+FDKM
Subjt:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM

Query:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY
        RIKSVVSWNTM+SGYAYNYDVNGASRMFHQMELEGV+PNPVTWTSLLSSHARCGHL ET+VLF KMRMKGVGATAEMLAVVLSVCADL TLN G+MIHGY
Subjt:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY

Query:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL
        MVKGGF+DYLFAKNALIT+YGKGGD+GDAEKLFHEMK KNLVSWNALISS+AESG+YDKA EL SQLEKME YPEMKPNVITWS++ICGF+SKGLGEESL
Subjt:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL

Query:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA
        EVFRKMQLANVKANSVTIASV SICAMLAALNLGRE+HGHVIRA MD+N+LVGNGLINMYTKCGSFKPG +VFEKLENRDSISWNSMI  YG HGLGKDA
Subjt:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA

Query:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT
        L+T N MIKSG+RPD VTFIAALSACSHAGLVAEG WLF QM Q FKI+P++EHY+CMVDLLGRAGLVEEASNI+K MP EPNAYIWS++LNSCRMHKDT
Subjt:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT

Query:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI
        DLAEE A++I NLNS+ITGSHMLLSNIFAASCRWEDSARVRISAR+KGLKKVPG SWIEVKKKVY+FKAG++  EGLEKVDEILHDLA QIEN+D DD I
Subjt:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI

Query:  IE
        IE
Subjt:  IE

XP_022997825.1 putative pentatricopeptide repeat-containing protein At1g17630 [Cucurbita maxima]0.0e+0085.63Show/hide
Query:  MLYACSYQRFKSASFCFPRLSIN--------FHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKV
        MLYA SYQRF SASFCFPR +IN        F+S FSSITYDD+LLD FD LL+QC+ I+H KQVHS TVV GA  SAFVAARLVS+YAR G V DARKV
Subjt:  MLYACSYQRFKSASFCFPRLSIN--------FHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKV

Query:  FNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDD
        F++ PFEGLSNLLLWNSIIRANV  GYC+EALQLYGKMR++GVL DGFTFPLVLRASSNLG FNLCK LHCHVVQFGFQNHLHVVNELIGMY KL RM D
Subjt:  FNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDD

Query:  ARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLN
        ARK+FDKMR+KSV+SWNTM+SGYAYNYDVNGA RMF QMELEGV+PNPVTWTSLLSSHARCGHLEET+ LFSKMRMKGVGATAEMLAVVLSVCADL TL+
Subjt:  ARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLN

Query:  RGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFAS
        RG+M+HGY+VKGGF+DYLFAKNALITVYGKGGD+ DAEKLFHEMK KNLVSWN+LISSYAESGLYDKAFE FS+LEKME  PEMKPNVITWSAVICGFAS
Subjt:  RGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYG
         G GEESLEVFR+MQLANVKANSVTI+SV SICAMLAALNLGRE+HGHVIRA M+DNILVGNGLINMYTKCGSFKPGC+VFEKLENRD ISWNSMI GYG
Subjt:  KGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYG

Query:  MHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILN
        MHGLGKDAL TF++MIKSGFRPDDVTFIAALSACSHAGLVAEGRWLF QM+Q FKIKPQMEHY+CMVDLLGRAGLVEEASN+VK MP +PN YIWSA+LN
Subjt:  MHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILN

Query:  SCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIE
        SCRMHK TDLAEET SQI NLNSEI GSHMLLSNIF+ASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAG+S+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIE

Query:  NHDFDDSIIE
        + DFDDSIIE
Subjt:  NHDFDDSIIE

XP_038903135.1 putative pentatricopeptide repeat-containing protein At1g17630 [Benincasa hispida]0.0e+0090.88Show/hide
Query:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG
        MLY CS QRFKS SFCFPRLSINFHSQ S+ITYDDDLLDFFDHLL+QC+ IQHSKQVHSATVVTG Y SAFV ARLVSIY RYGLVSDARKVF+SAPFE 
Subjt:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG

Query:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM
        LSN+LLWNSIIRANV HGYC+EALQLY KMR+YGV GD FTFPLVLRASSNLGSFNLCKNLHCHVVQFG QNHLHVVNEL+GMYAKLERMDDARK+FDKM
Subjt:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM

Query:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY
        RIKSVVSWNTM+SGYAYNYDVNG SRMFHQMELEGV+PN VTWTSLLSSHARCG LEET+VLFSK+RMKGVGATAEMLAVVLSVCADL TLN+G+MIHGY
Subjt:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY

Query:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL
        MVKGGF DYLFAKNALIT+YGKGGDL DAEKLFHEMK KNLVSWNALISSYAESGLYDKAFELFS+LE+MEVYPEMKPNVITWSAVICGFASKGLGEESL
Subjt:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL

Query:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA
        EVFRKMQLANVKANSVTIASV SICA LAALNLGRE+HGHVIRA MDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMI GYGMHGLG +A
Subjt:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA

Query:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT
        L+TFNQMIKSGFRP+DVTFI+AL+ACSHAGLVAEGRWLFYQM+Q FKI+PQMEHY+CMVDLLGRAGLVEEASNIVKGMP EPNAYIWSA+LNSCRMHKDT
Subjt:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT

Query:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI
        D+AEETASQIF LNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKK+VY FKAGHSI EGLEKVDEIL DLALQIENHDFDDS+
Subjt:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI

Query:  IE
        IE
Subjt:  IE

TrEMBL top hitse value%identityAlignment
A0A0A0LFT1 Uncharacterized protein0.0e+0088.75Show/hide
Query:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG
        ML A SYQRFKS SFCFP LSINFHSQFSSITYD+DL DFFDHLL+QC+ IQHSKQVHSATVVTGAYCSAFV+ARLVSIY+RYGLVSDARKVF SAPFE 
Subjt:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG

Query:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM
         SN LLWNSIIRANVYHGYC EALQLYGKMR+YGVLGDGFTFPL+LRASSNLG+FN+CKNLHCHVVQFGFQNHLHV NELIGMYAKLERMDDARK+FDKM
Subjt:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM

Query:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY
        RIKSVVSWNTM+SGYAYNYDVNGASRMFHQMELEGV+PNPVTWTSLLSSHARCGHLEET+VLF KMRMKGVG TAEMLAVVLSVCADL TLN G+MIHGY
Subjt:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY

Query:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL
        MVKGGF+DYLFAKNALIT+YGKGG +GDAEKLFHEMK KNLVSWNALISS+AESG+YDKA EL SQLEKME YPEMKPNVITWSA+ICGFASKGLGEESL
Subjt:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL

Query:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA
        EVFRKMQLANVKANSVTIASV SICAMLAALNLGRE+HGHVIRA MDDN+LVGNGLINMYTKCGSFKPG MVFEKLENRDSISWNSMI GYG HGLGKDA
Subjt:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA

Query:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT
        L+TFN MIKSG+RPD VTFIAALSACSHAGLVAEG WLF QM Q FKI+P++EHY+CMVDLLGRAGLVEEASNI+KGMP EPNAYIWS++LNSCRMHKDT
Subjt:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT

Query:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI
        DLAEE A++I NLNS+ITGSHMLLSNIFAASCRWEDSARVRISAR KGLKKVPG SWIEVKKKVYMFKAG++I EGLEKVDEILHDLA QIEN++ DD I
Subjt:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI

Query:  IE
        IE
Subjt:  IE

A0A1S3BIG8 putative pentatricopeptide repeat-containing protein At1g176300.0e+0087.46Show/hide
Query:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG
        ML A SYQRFKS SFCFP LSINFHSQFSSITYD+DL +FFDHLL+QC+ IQHSKQVHSATVVTGAYCSAFV+ARLVSIY+RYGLVSDARKVF SAPFE 
Subjt:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG

Query:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM
        LSN LLWNSIIRANVYHGYC EAL LYGKMR+YGVLGDGFTFPLVLRASSNLG+ ++CKNLHCHVVQFGFQNHLHV NELIGMYAKLERMDDARK+FDKM
Subjt:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM

Query:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY
        RIKSVVSWNTM+SGYAYNYDVNGASRMFHQMELEGV+PNPVTWTSLLSSHARCGHL ET+VLF KMRMKGVGATAEMLAVVLSVCADL TLN G+MIHGY
Subjt:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY

Query:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL
        MVKGGF+DYLFAKNALIT+YGKGGD+GDAEKLFHEMK KNLVSWNALISS+AESG+YDKA EL SQLEKME YPEMKPNVITWS++ICGF+SKGLGEESL
Subjt:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL

Query:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA
        EVFRKMQLANVKANSVTIASV SICAMLAALNLGRE+HGHVIRA MD+N+LVGNGLINMYTKCGSFKPG +VFEKLENRDSISWNSMI  YG HGLGKDA
Subjt:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA

Query:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT
        L+T N MIKSG+RPD VTFIAALSACSHAGLVAEG WLF QM Q FKI+P++EHY+CMVDLLGRAGLVEEASNI+K MP EPNAYIWS++LNSCRMHKDT
Subjt:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT

Query:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI
        DLAEE A++I NLNS+ITGSHMLLSNIFAASCRWEDSARVRISAR+KGLKKVPG SWIEVKKKVY+FKAG++  EGLEKVDEILHDLA QIEN+D DD I
Subjt:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI

Query:  IE
        IE
Subjt:  IE

A0A5A7U7B1 Putative pentatricopeptide repeat-containing protein0.0e+0087.61Show/hide
Query:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG
        ML A SYQRFKS SFCFP LSINFHSQFSSITYD+DL +FFDHLL+QC+ IQHSKQVHSATVVTGAYCSAFV+ARLVSIY+RYGLVSDARKVF SAPFE 
Subjt:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG

Query:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM
        LSN LLWNSIIRANVYHGYC EAL LYGKMR+YGVLGDGFTFPLVLRASSNLG+ ++CKNLHCHVVQFGFQNHLHV NELIGMYAKLERMDDARK+FDKM
Subjt:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM

Query:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY
        RIKSVVSWNTM+SGYAYNYDVNGASRMFHQMELEGV+PNPVTWTSLLSSHARCGHL ET+VLF KMRMKGVGATAEMLAVVLSVCADL TLN G+MIHGY
Subjt:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY

Query:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL
        MVKGGF+DYLFAKNALIT+YGKGGD+GDAEKLFHEMK KNLVSWNALISS+AESG+YDKA EL SQLEKME YPEMKPNVITWS++ICGF+SKGLGEESL
Subjt:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL

Query:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA
        EVFRKMQLANVKANSVTIASV SICAMLAALNLGRE+HGHVIRA MD+N+LVGNGLINMYTKCGSFKPG +VFEKLENRDSISWNSMI GYG HGLGKDA
Subjt:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA

Query:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT
        L+T N MIKSG+RPD VTFIAALSACSHAGLVAEG WLF QM Q FKI+P++EHY+CMVDLLGRAGLVEEASNI+K MP EPNAYIWS++LNSCRMHKDT
Subjt:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT

Query:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI
        DLAEE A++I NLNS+ITGSHMLLSNIFAASCRWEDSARVRISAR+KGLKKVPG SWIEVKKKVY+FKAG++  EGLEKVDEILHDLA QIEN+D DD I
Subjt:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI

Query:  IE
        IE
Subjt:  IE

A0A5D3BVI7 Putative pentatricopeptide repeat-containing protein0.0e+0087.46Show/hide
Query:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG
        ML A SYQRFKS SFCFP LSINFHSQFSSITYD+DL +FFDHLL+QC+ IQHSKQVHSATVVTGAYCSAFV+ARLVSIY+RYGLVSDARKVF SAPFE 
Subjt:  MLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG

Query:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM
        LSN LLWNSIIRANVYHGYC EAL LYGKMR+YGVLGDGFTFPLVLRASSNLG+ ++CKNLHCHVVQFGFQNHLHV NELIGMYAKLERMDDARK+FDKM
Subjt:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM

Query:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY
        RIKSVVSWNTM+SGYAYNYDVNGASRMFHQMELEGV+PNPVTWTSLLSSHARCGHL ET+VLF KMRMKGVGATAEMLAVVLSVCADL TLN G+MIHGY
Subjt:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY

Query:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL
        MVKGGF+DYLFAKNALIT+YGKGGD+GDAEKLFHEMK KNLVSWNALISS+AESG+YDKA EL SQLEKME YPEMKPNVITWS++ICGF+SKGLGEESL
Subjt:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL

Query:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA
        EVFRKMQLANVKANSVTIASV SICAMLAALNLGRE+HGHVIRA MD+N+LVGNGLINMYTKCGSFKPG +VFEKLENRDSISWNSMI  YG HGLGKDA
Subjt:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA

Query:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT
        L+T N MIKSG+RPD VTFIAALSACSHAGLVAEG WLF QM Q FKI+P++EHY+CMVDLLGRAGLVEEASNI+K MP EPNAYIWS++LNSCRMHKDT
Subjt:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT

Query:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI
        DLAEE A++I NLNS+ITGSHMLLSNIFAASCRWEDSARVRISAR+KGLKKVPG SWIEVKKKVY+FKAG++  EGLEKVDEILHDLA QIEN+D DD I
Subjt:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDSI

Query:  IE
        IE
Subjt:  IE

A0A6J1KAZ0 putative pentatricopeptide repeat-containing protein At1g176300.0e+0085.63Show/hide
Query:  MLYACSYQRFKSASFCFPRLSIN--------FHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKV
        MLYA SYQRF SASFCFPR +IN        F+S FSSITYDD+LLD FD LL+QC+ I+H KQVHS TVV GA  SAFVAARLVS+YAR G V DARKV
Subjt:  MLYACSYQRFKSASFCFPRLSIN--------FHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKV

Query:  FNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDD
        F++ PFEGLSNLLLWNSIIRANV  GYC+EALQLYGKMR++GVL DGFTFPLVLRASSNLG FNLCK LHCHVVQFGFQNHLHVVNELIGMY KL RM D
Subjt:  FNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDD

Query:  ARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLN
        ARK+FDKMR+KSV+SWNTM+SGYAYNYDVNGA RMF QMELEGV+PNPVTWTSLLSSHARCGHLEET+ LFSKMRMKGVGATAEMLAVVLSVCADL TL+
Subjt:  ARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLN

Query:  RGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFAS
        RG+M+HGY+VKGGF+DYLFAKNALITVYGKGGD+ DAEKLFHEMK KNLVSWN+LISSYAESGLYDKAFE FS+LEKME  PEMKPNVITWSAVICGFAS
Subjt:  RGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFAS

Query:  KGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYG
         G GEESLEVFR+MQLANVKANSVTI+SV SICAMLAALNLGRE+HGHVIRA M+DNILVGNGLINMYTKCGSFKPGC+VFEKLENRD ISWNSMI GYG
Subjt:  KGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYG

Query:  MHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILN
        MHGLGKDAL TF++MIKSGFRPDDVTFIAALSACSHAGLVAEGRWLF QM+Q FKIKPQMEHY+CMVDLLGRAGLVEEASN+VK MP +PN YIWSA+LN
Subjt:  MHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILN

Query:  SCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIE
        SCRMHK TDLAEET SQI NLNSEI GSHMLLSNIF+ASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAG+S+ EGLE+VDEILHDLALQIE
Subjt:  SCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIE

Query:  NHDFDDSIIE
        + DFDDSIIE
Subjt:  NHDFDDSIIE

SwissProt top hitse value%identityAlignment
Q9LFL5 Pentatricopeptide repeat-containing protein At5g168603.4e-10634.5Show/hide
Query:  SITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGK
        S T D+    F      + SS++  +  H+ ++VTG   + FV   LV++Y+R   +SDARKVF+      + +++ WNSII +    G  K AL+++ +
Subjt:  SITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGK

Query:  M-RDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMF
        M  ++G   D  T   VL   ++LG+ +L K LHC  V      ++ V N L+ MYAK   MD+A  +F  M +K VVSWN M++GY+       A R+F
Subjt:  M-RDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMF

Query:  HQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGD
         +M+ E +  + VTW++ +S +A+ G   E L +  +M   G+      L  VLS CA +  L  GK IH Y +K   D     KN      G G +   
Subjt:  HQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGD

Query:  AEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRKM--QLANVKANSVTIASVSSICA
                   N+V  N LI  YA+    D A  +F  L   E       +V+TW+ +I G++  G   ++LE+  +M  +    + N+ TI+     CA
Subjt:  AEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRKM--QLANVKANSVTIASVSSICA

Query:  MLAALNLGREVHGHVIRAGMDD-NILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSA
         LAAL +G+++H + +R   +   + V N LI+MY KCGS     +VF+ +  ++ ++W S++ GYGMHG G++AL  F++M + GF+ D VT +  L A
Subjt:  MLAALNLGREVHGHVIRAGMDD-NILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSA

Query:  CSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLS
        CSH+G++ +G   F +M   F + P  EHY+C+VDLLGRAG +  A  +++ MP EP   +W A L+ CR+H   +L E  A +I  L S   GS+ LLS
Subjt:  CSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLS

Query:  NIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIEN-----------HDFDD
        N++A + RW+D  R+R   R KG+KK PGCSW+E  K    F  G       +++ ++L D   +I++           HD DD
Subjt:  NIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIEN-----------HDFDD

Q9LNP2 Putative pentatricopeptide repeat-containing protein At1g176301.2e-19650.36Show/hide
Query:  SFCF-----PRLSINFHSQFSSITY-------DDDLLDFFDHLLQQCSSIQHSKQVHSATVVTG-AYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG
        +FCF     P  SI+     S  +Y       D  L  +FDHLL  C + Q  +QVH+  +++   + S  +AA L+S+YAR GL+ DAR VF +     
Subjt:  SFCF-----PRLSINFHSQFSSITY-------DDDLLDFFDHLLQQCSSIQHSKQVHSATVVTG-AYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG

Query:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM
        LS+L LWNSI++ANV HG  + AL+LY  MR  G+ GDG+  PL+LRA   LG F LC+  H  V+Q G + +LHVVNEL+ +Y K  RM DA  LF +M
Subjt:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM

Query:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY
         +++ +SWN MI G++  YD   A ++F  M+ E   P+ VTWTS+LS H++CG  E+ L  F  MRM G   + E LAV  SVCA+L  L+  + +HGY
Subjt:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY

Query:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL
        ++KGGF++YL ++NALI VYGK G + DAE LF +++ K + SWN+LI+S+ ++G  D+A  LFS+LE+M     +K NV+TW++VI G   +G G++SL
Subjt:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL

Query:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA
        E FR+MQ + V ANSVTI  + SICA L ALNLGRE+HGHVIR  M +NILV N L+NMY KCG    G +VFE + ++D ISWNS+I GYGMHG  + A
Subjt:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA

Query:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT
        LS F++MI SGF PD +  +A LSACSHAGLV +GR +FY M ++F ++PQ EHY+C+VDLLGR G ++EAS IVK MP EP   +  A+LNSCRMHK+ 
Subjt:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT

Query:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDL
        D+AE  ASQ+  L  E TGS+MLLSNI++A  RWE+SA VR  A+ K LKKV G SWIEVKKK Y F +G  +    E +  +L DL
Subjt:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDL

Q9LNU6 Pentatricopeptide repeat-containing protein At1g202305.7e-12235.77Show/hide
Query:  SSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRA
        SS+  + Q H+  + +GA    +++A+L++ Y+ Y   +DA  V  S P      +  ++S+I A        +++ ++ +M  +G++ D    P + + 
Subjt:  SSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRA

Query:  SSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLS
         + L +F + K +HC     G      V   +  MY +  RM DARK+FD+M  K VV+ + ++  YA    +    R+  +ME  G++ N V+W  +LS
Subjt:  SSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLS

Query:  SHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALI
           R G+ +E +V+F K+   G       ++ VL    D   LN G++IHGY++K G        +A+I +YGK G +     LF++ +       NA I
Subjt:  SHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALI

Query:  SSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDD
        +  + +GL DKA E+F   ++      M+ NV++W+++I G A  G   E+LE+FR+MQ+A VK N VTI S+   C  +AAL  GR  HG  +R  + D
Subjt:  SSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDD

Query:  NILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKI
        N+ VG+ LI+MY KCG      +VF  +  ++ + WNS++ G+ MHG  K+ +S F  ++++  +PD ++F + LSAC   GL  EG   F  M +++ I
Subjt:  NILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKI

Query:  KPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKG
        KP++EHYSCMV+LLGRAG ++EA +++K MPFEP++ +W A+LNSCR+  + DLAE  A ++F+L  E  G+++LLSNI+AA   W +   +R      G
Subjt:  KPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKG

Query:  LKKVPGCSWIEVKKKVYMFKAG---HSILEGL-EKVDEILHDL
        LKK PGCSWI+VK +VY   AG   H  ++ + EK+DEI  ++
Subjt:  LKKVPGCSWIEVKKKVYMFKAG---HSILEGL-EKVDEILHDL

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226901.7e-10531.61Show/hide
Query:  LQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGL---VSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFT
        L+ C +I   K  H +    G         +LV+     G    +S A++VF ++  E      ++NS+IR     G C EA+ L+ +M + G+  D +T
Subjt:  LQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGL---VSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFT

Query:  FPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYA-YNYDVNGASRMFHQMELEGVDPNP
        FP  L A +   +      +H  +V+ G+   L V N L+  YA+   +D ARK+FD+M  ++VVSW +MI GYA  ++  +     F  +  E V PN 
Subjt:  FPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYA-YNYDVNGASRMFHQMELEGVDPNP

Query:  VTWTSLLSSHARCGHLEETLVLFSKMRMKGV------------------------------------------------GATAEMLAV------------
        VT   ++S+ A+   LE    +++ +R  G+                                                G T E L V            
Subjt:  VTWTSLLSSHARCGHLEETLVLFSKMRMKGV------------------------------------------------GATAEMLAV------------

Query:  ------VLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYP
               +S C+ L  +  GK  HGY+++ GF+ +    NALI +Y K      A ++F  M  K +V+WN++++ Y E+G  D A+E F      E  P
Subjt:  ------VLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYP

Query:  EMKPNVITWSAVICGFASKGLGEESLEVFRKMQ-LANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVF
        E   N+++W+ +I G     L EE++EVF  MQ    V A+ VT+ S++S C  L AL+L + ++ ++ + G+  ++ +G  L++M+++CG  +    +F
Subjt:  EMKPNVITWSAVICGFASKGLGEESLEVFRKMQ-LANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVF

Query:  EKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASN
          L NRD  +W + I    M G  + A+  F+ MI+ G +PD V F+ AL+ACSH GLV +G+ +FY M++   + P+  HY CMVDLLGRAGL+EEA  
Subjt:  EKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASN

Query:  IVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSI
        +++ MP EPN  IW+++L +CR+  + ++A   A +I  L  E TGS++LLSN++A++ RW D A+VR+S + KGL+K PG S I+++ K + F +G   
Subjt:  IVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSI

Query:  LEGLEKVDEILHDLA
           +  ++ +L +++
Subjt:  LEGLEKVDEILHDLA

Q9SJZ3 Pentatricopeptide repeat-containing protein At2g22410, mitochondrial4.9e-10531.86Show/hide
Query:  LNPRNRFMLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKV-
        L P+    LY+ S +R +S      +  IN++S  S + ++  L      LL++C  + H KQ+ +  ++ G     F ++RL++  A    +S++R + 
Subjt:  LNPRNRFMLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKV-

Query:  FNSAPFEGLS--NLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVL---GDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKL
        ++    +G+   N+  WN  IR        KE+  LY +M  +G      D FT+P++ +  ++L   +L   +  HV++   +   HV N  I M+A  
Subjt:  FNSAPFEGLS--NLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVL---GDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKL

Query:  ERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCAD
          M++ARK+FD+  ++ +VSWN +I+GY    +   A  ++  ME EGV P+ VT   L+SS                                   C+ 
Subjt:  ERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCAD

Query:  LPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVI
        L  LNRGK  + Y+ + G    +   NAL+ ++ K GD+ +A ++F  ++ + +VSW  +IS YA  GL D + +LF  +E+ +        V+ W+A+I
Subjt:  LPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVI

Query:  CGFASKGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSM
         G      G+++L +F++MQ +N K + +T+    S C+ L AL++G  +H ++ +  +  N+ +G  L++MY KCG+      VF  ++ R+S+++ ++
Subjt:  CGFASKGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSM

Query:  IVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIW
        I G  +HG    A+S FN+MI +G  PD++TFI  LSAC H G++  GR  F QM  +F + PQ++HYS MVDLLGRAGL+EEA  +++ MP E +A +W
Subjt:  IVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIW

Query:  SAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDL
         A+L  CRMH + +L E+ A ++  L+   +G ++LL  ++  +  WED+ R R     +G++K+PGCS IEV   V  F          EK+ + LH L
Subjt:  SAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDL

Arabidopsis top hitse value%identityAlignment
AT1G17630.1 Pentatricopeptide repeat (PPR-like) superfamily protein8.6e-19850.36Show/hide
Query:  SFCF-----PRLSINFHSQFSSITY-------DDDLLDFFDHLLQQCSSIQHSKQVHSATVVTG-AYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG
        +FCF     P  SI+     S  +Y       D  L  +FDHLL  C + Q  +QVH+  +++   + S  +AA L+S+YAR GL+ DAR VF +     
Subjt:  SFCF-----PRLSINFHSQFSSITY-------DDDLLDFFDHLLQQCSSIQHSKQVHSATVVTG-AYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEG

Query:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM
        LS+L LWNSI++ANV HG  + AL+LY  MR  G+ GDG+  PL+LRA   LG F LC+  H  V+Q G + +LHVVNEL+ +Y K  RM DA  LF +M
Subjt:  LSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKM

Query:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY
         +++ +SWN MI G++  YD   A ++F  M+ E   P+ VTWTS+LS H++CG  E+ L  F  MRM G   + E LAV  SVCA+L  L+  + +HGY
Subjt:  RIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGY

Query:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL
        ++KGGF++YL ++NALI VYGK G + DAE LF +++ K + SWN+LI+S+ ++G  D+A  LFS+LE+M     +K NV+TW++VI G   +G G++SL
Subjt:  MVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESL

Query:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA
        E FR+MQ + V ANSVTI  + SICA L ALNLGRE+HGHVIR  M +NILV N L+NMY KCG    G +VFE + ++D ISWNS+I GYGMHG  + A
Subjt:  EVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDA

Query:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT
        LS F++MI SGF PD +  +A LSACSHAGLV +GR +FY M ++F ++PQ EHY+C+VDLLGR G ++EAS IVK MP EP   +  A+LNSCRMHK+ 
Subjt:  LSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDT

Query:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDL
        D+AE  ASQ+  L  E TGS+MLLSNI++A  RWE+SA VR  A+ K LKKV G SWIEVKKK Y F +G  +    E +  +L DL
Subjt:  DLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDL

AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein4.1e-12335.77Show/hide
Query:  SSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRA
        SS+  + Q H+  + +GA    +++A+L++ Y+ Y   +DA  V  S P      +  ++S+I A        +++ ++ +M  +G++ D    P + + 
Subjt:  SSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRA

Query:  SSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLS
         + L +F + K +HC     G      V   +  MY +  RM DARK+FD+M  K VV+ + ++  YA    +    R+  +ME  G++ N V+W  +LS
Subjt:  SSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLS

Query:  SHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALI
           R G+ +E +V+F K+   G       ++ VL    D   LN G++IHGY++K G        +A+I +YGK G +     LF++ +       NA I
Subjt:  SHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALI

Query:  SSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDD
        +  + +GL DKA E+F   ++      M+ NV++W+++I G A  G   E+LE+FR+MQ+A VK N VTI S+   C  +AAL  GR  HG  +R  + D
Subjt:  SSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDD

Query:  NILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKI
        N+ VG+ LI+MY KCG      +VF  +  ++ + WNS++ G+ MHG  K+ +S F  ++++  +PD ++F + LSAC   GL  EG   F  M +++ I
Subjt:  NILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKI

Query:  KPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKG
        KP++EHYSCMV+LLGRAG ++EA +++K MPFEP++ +W A+LNSCR+  + DLAE  A ++F+L  E  G+++LLSNI+AA   W +   +R      G
Subjt:  KPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKG

Query:  LKKVPGCSWIEVKKKVYMFKAG---HSILEGL-EKVDEILHDL
        LKK PGCSWI+VK +VY   AG   H  ++ + EK+DEI  ++
Subjt:  LKKVPGCSWIEVKKKVYMFKAG---HSILEGL-EKVDEILHDL

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)1.2e-10631.61Show/hide
Query:  LQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGL---VSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFT
        L+ C +I   K  H +    G         +LV+     G    +S A++VF ++  E      ++NS+IR     G C EA+ L+ +M + G+  D +T
Subjt:  LQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGL---VSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFT

Query:  FPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYA-YNYDVNGASRMFHQMELEGVDPNP
        FP  L A +   +      +H  +V+ G+   L V N L+  YA+   +D ARK+FD+M  ++VVSW +MI GYA  ++  +     F  +  E V PN 
Subjt:  FPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYA-YNYDVNGASRMFHQMELEGVDPNP

Query:  VTWTSLLSSHARCGHLEETLVLFSKMRMKGV------------------------------------------------GATAEMLAV------------
        VT   ++S+ A+   LE    +++ +R  G+                                                G T E L V            
Subjt:  VTWTSLLSSHARCGHLEETLVLFSKMRMKGV------------------------------------------------GATAEMLAV------------

Query:  ------VLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYP
               +S C+ L  +  GK  HGY+++ GF+ +    NALI +Y K      A ++F  M  K +V+WN++++ Y E+G  D A+E F      E  P
Subjt:  ------VLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYP

Query:  EMKPNVITWSAVICGFASKGLGEESLEVFRKMQ-LANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVF
        E   N+++W+ +I G     L EE++EVF  MQ    V A+ VT+ S++S C  L AL+L + ++ ++ + G+  ++ +G  L++M+++CG  +    +F
Subjt:  EMKPNVITWSAVICGFASKGLGEESLEVFRKMQ-LANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVF

Query:  EKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASN
          L NRD  +W + I    M G  + A+  F+ MI+ G +PD V F+ AL+ACSH GLV +G+ +FY M++   + P+  HY CMVDLLGRAGL+EEA  
Subjt:  EKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASN

Query:  IVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSI
        +++ MP EPN  IW+++L +CR+  + ++A   A +I  L  E TGS++LLSN++A++ RW D A+VR+S + KGL+K PG S I+++ K + F +G   
Subjt:  IVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSI

Query:  LEGLEKVDEILHDLA
           +  ++ +L +++
Subjt:  LEGLEKVDEILHDLA

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.2e-10631.61Show/hide
Query:  LQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGL---VSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFT
        L+ C +I   K  H +    G         +LV+     G    +S A++VF ++  E      ++NS+IR     G C EA+ L+ +M + G+  D +T
Subjt:  LQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGL---VSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFT

Query:  FPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYA-YNYDVNGASRMFHQMELEGVDPNP
        FP  L A +   +      +H  +V+ G+   L V N L+  YA+   +D ARK+FD+M  ++VVSW +MI GYA  ++  +     F  +  E V PN 
Subjt:  FPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYA-YNYDVNGASRMFHQMELEGVDPNP

Query:  VTWTSLLSSHARCGHLEETLVLFSKMRMKGV------------------------------------------------GATAEMLAV------------
        VT   ++S+ A+   LE    +++ +R  G+                                                G T E L V            
Subjt:  VTWTSLLSSHARCGHLEETLVLFSKMRMKGV------------------------------------------------GATAEMLAV------------

Query:  ------VLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYP
               +S C+ L  +  GK  HGY+++ GF+ +    NALI +Y K      A ++F  M  K +V+WN++++ Y E+G  D A+E F      E  P
Subjt:  ------VLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYP

Query:  EMKPNVITWSAVICGFASKGLGEESLEVFRKMQ-LANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVF
        E   N+++W+ +I G     L EE++EVF  MQ    V A+ VT+ S++S C  L AL+L + ++ ++ + G+  ++ +G  L++M+++CG  +    +F
Subjt:  EMKPNVITWSAVICGFASKGLGEESLEVFRKMQ-LANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVF

Query:  EKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASN
          L NRD  +W + I    M G  + A+  F+ MI+ G +PD V F+ AL+ACSH GLV +G+ +FY M++   + P+  HY CMVDLLGRAGL+EEA  
Subjt:  EKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASN

Query:  IVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSI
        +++ MP EPN  IW+++L +CR+  + ++A   A +I  L  E TGS++LLSN++A++ RW D A+VR+S + KGL+K PG S I+++ K + F +G   
Subjt:  IVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSI

Query:  LEGLEKVDEILHDLA
           +  ++ +L +++
Subjt:  LEGLEKVDEILHDLA

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-10734.5Show/hide
Query:  SITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGK
        S T D+    F      + SS++  +  H+ ++VTG   + FV   LV++Y+R   +SDARKVF+      + +++ WNSII +    G  K AL+++ +
Subjt:  SITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCSAFVAARLVSIYARYGLVSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGK

Query:  M-RDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMF
        M  ++G   D  T   VL   ++LG+ +L K LHC  V      ++ V N L+ MYAK   MD+A  +F  M +K VVSWN M++GY+       A R+F
Subjt:  M-RDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHLHVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMF

Query:  HQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGD
         +M+ E +  + VTW++ +S +A+ G   E L +  +M   G+      L  VLS CA +  L  GK IH Y +K   D     KN      G G +   
Subjt:  HQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLAVVLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGD

Query:  AEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRKM--QLANVKANSVTIASVSSICA
                   N+V  N LI  YA+    D A  +F  L   E       +V+TW+ +I G++  G   ++LE+  +M  +    + N+ TI+     CA
Subjt:  AEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWSAVICGFASKGLGEESLEVFRKM--QLANVKANSVTIASVSSICA

Query:  MLAALNLGREVHGHVIRAGMDD-NILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSA
         LAAL +G+++H + +R   +   + V N LI+MY KCGS     +VF+ +  ++ ++W S++ GYGMHG G++AL  F++M + GF+ D VT +  L A
Subjt:  MLAALNLGREVHGHVIRAGMDD-NILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIVGYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSA

Query:  CSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLS
        CSH+G++ +G   F +M   F + P  EHY+C+VDLLGRAG +  A  +++ MP EP   +W A L+ CR+H   +L E  A +I  L S   GS+ LLS
Subjt:  CSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSCRMHKDTDLAEETASQIFNLNSEITGSHMLLS

Query:  NIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIEN-----------HDFDD
        N++A + RW+D  R+R   R KG+KK PGCSW+E  K    F  G       +++ ++L D   +I++           HD DD
Subjt:  NIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIEN-----------HDFDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGACGGGTCAATGTCGTCCCAAATGCGAACTGTGGGTGTCTCATAAATTCGCGCCAATCTCAAATTTGTGGGTTTCAAAAACCCTAAACCCTCGAAATCGA
TTCATGCTGTATGCTTGTTCTTATCAGCGTTTCAAATCAGCTTCGTTTTGTTTTCCCCGATTATCGATCAATTTCCACTCCCAATTTTCCTCAATCACGTATGAC
GACGACCTTCTCGATTTCTTCGATCATCTTCTTCAGCAGTGCAGCAGTATTCAACATAGCAAACAAGTTCATTCCGCCACTGTTGTCACCGGCGCCTATTGTTCA
GCGTTCGTTGCCGCCCGGCTTGTGTCCATCTATGCCCGTTATGGGCTTGTTTCTGATGCCCGTAAAGTGTTTAATTCTGCGCCATTTGAAGGTTTGTCAAACTTG
CTGTTATGGAATTCGATTATAAGAGCAAATGTATATCATGGGTATTGCAAAGAAGCGCTTCAACTTTATGGGAAAATGAGAGATTATGGGGTTTTGGGGGATGGG
TTTACTTTTCCTCTGGTTTTGAGGGCTTCTTCCAATTTGGGTAGTTTCAACTTGTGCAAGAATCTACATTGTCATGTTGTGCAATTTGGGTTCCAAAATCATTTG
CATGTTGTGAATGAATTGATAGGGATGTACGCGAAGCTTGAACGAATGGATGATGCCCGGAAATTGTTTGATAAAATGCGCATCAAAAGTGTAGTTTCTTGGAAT
ACCATGATTTCTGGTTATGCCTATAATTATGATGTTAATGGTGCTTCTAGGATGTTCCATCAAATGGAGTTGGAAGGGGTTGATCCGAACCCTGTAACTTGGACT
TCGTTGTTGTCGAGCCACGCCCGGTGCGGTCATCTCGAAGAAACTCTAGTGTTGTTTAGCAAGATGAGGATGAAAGGTGTTGGTGCCACTGCTGAAATGCTTGCT
GTGGTGTTATCTGTTTGTGCTGATTTACCTACATTGAACAGGGGTAAGATGATTCATGGATATATGGTAAAGGGAGGTTTCGATGATTACTTGTTCGCTAAGAAC
GCACTTATAACTGTATATGGAAAAGGAGGAGACTTAGGAGATGCAGAGAAGCTATTTCATGAGATGAAAGCGAAAAATCTTGTGAGTTGGAACGCTCTTATATCC
TCCTATGCTGAATCTGGACTATATGACAAAGCTTTTGAATTGTTTTCTCAGCTGGAGAAAATGGAAGTCTATCCAGAGATGAAACCAAATGTTATAACTTGGAGT
GCAGTCATTTGTGGATTTGCTTCCAAGGGACTAGGAGAAGAATCTTTGGAAGTTTTTCGTAAAATGCAGCTTGCAAATGTAAAGGCGAACTCAGTGACGATAGCT
AGTGTTTCATCGATTTGTGCTATGCTAGCAGCTCTAAATCTTGGTAGGGAAGTGCATGGTCATGTCATTAGAGCTGGGATGGATGATAACATATTGGTAGGAAAT
GGATTGATTAACATGTATACAAAATGTGGAAGTTTCAAGCCAGGCTGTATGGTGTTTGAAAAACTTGAAAATCGAGATTCAATCTCGTGGAATTCAATGATTGTA
GGATATGGAATGCATGGACTTGGTAAAGATGCTCTATCAACTTTTAATCAGATGATAAAATCAGGATTTAGACCAGATGATGTTACCTTTATTGCTGCTCTGTCT
GCTTGTAGTCACGCCGGTCTTGTTGCCGAAGGCCGTTGGCTTTTTTATCAGATGGTACAGAAGTTCAAGATCAAACCTCAGATGGAGCACTATTCGTGTATGGTT
GATCTTCTCGGTCGTGCTGGGCTTGTGGAAGAAGCAAGTAACATAGTCAAGGGCATGCCATTTGAACCCAATGCTTATATCTGGAGTGCTATTCTCAACTCTTGC
AGAATGCACAAAGATACGGATCTTGCAGAAGAAACTGCATCTCAAATTTTCAATCTGAATTCCGAGATAACTGGAAGTCATATGTTGCTCTCGAATATTTTTGCT
GCAAGCTGTAGATGGGAGGATTCTGCAAGGGTGAGAATTTCAGCAAGAATGAAGGGCTTAAAGAAGGTTCCTGGATGCAGTTGGATTGAGGTGAAGAAGAAGGTT
TATATGTTCAAAGCAGGACACTCAATACTAGAAGGTCTAGAGAAAGTTGATGAAATTCTTCATGATTTGGCTCTTCAGATAGAAAATCATGATTTTGATGATAGT
ATCATCGAATAG
mRNA sequenceShow/hide mRNA sequence
CTTTCATCCGACTTGTCGTTTCGTTTTTTCTCTCGACAAAATTTTACCAATGCGGACGGGTCAATGTCGTCCCAAATGCGAACTGTGGGTGTCTCATAAATTCGC
GCCAATCTCAAATTTGTGGGTTTCAAAAACCCTAAACCCTCGAAATCGATTCATGCTGTATGCTTGTTCTTATCAGCGTTTCAAATCAGCTTCGTTTTGTTTTCC
CCGATTATCGATCAATTTCCACTCCCAATTTTCCTCAATCACGTATGACGACGACCTTCTCGATTTCTTCGATCATCTTCTTCAGCAGTGCAGCAGTATTCAACA
TAGCAAACAAGTTCATTCCGCCACTGTTGTCACCGGCGCCTATTGTTCAGCGTTCGTTGCCGCCCGGCTTGTGTCCATCTATGCCCGTTATGGGCTTGTTTCTGA
TGCCCGTAAAGTGTTTAATTCTGCGCCATTTGAAGGTTTGTCAAACTTGCTGTTATGGAATTCGATTATAAGAGCAAATGTATATCATGGGTATTGCAAAGAAGC
GCTTCAACTTTATGGGAAAATGAGAGATTATGGGGTTTTGGGGGATGGGTTTACTTTTCCTCTGGTTTTGAGGGCTTCTTCCAATTTGGGTAGTTTCAACTTGTG
CAAGAATCTACATTGTCATGTTGTGCAATTTGGGTTCCAAAATCATTTGCATGTTGTGAATGAATTGATAGGGATGTACGCGAAGCTTGAACGAATGGATGATGC
CCGGAAATTGTTTGATAAAATGCGCATCAAAAGTGTAGTTTCTTGGAATACCATGATTTCTGGTTATGCCTATAATTATGATGTTAATGGTGCTTCTAGGATGTT
CCATCAAATGGAGTTGGAAGGGGTTGATCCGAACCCTGTAACTTGGACTTCGTTGTTGTCGAGCCACGCCCGGTGCGGTCATCTCGAAGAAACTCTAGTGTTGTT
TAGCAAGATGAGGATGAAAGGTGTTGGTGCCACTGCTGAAATGCTTGCTGTGGTGTTATCTGTTTGTGCTGATTTACCTACATTGAACAGGGGTAAGATGATTCA
TGGATATATGGTAAAGGGAGGTTTCGATGATTACTTGTTCGCTAAGAACGCACTTATAACTGTATATGGAAAAGGAGGAGACTTAGGAGATGCAGAGAAGCTATT
TCATGAGATGAAAGCGAAAAATCTTGTGAGTTGGAACGCTCTTATATCCTCCTATGCTGAATCTGGACTATATGACAAAGCTTTTGAATTGTTTTCTCAGCTGGA
GAAAATGGAAGTCTATCCAGAGATGAAACCAAATGTTATAACTTGGAGTGCAGTCATTTGTGGATTTGCTTCCAAGGGACTAGGAGAAGAATCTTTGGAAGTTTT
TCGTAAAATGCAGCTTGCAAATGTAAAGGCGAACTCAGTGACGATAGCTAGTGTTTCATCGATTTGTGCTATGCTAGCAGCTCTAAATCTTGGTAGGGAAGTGCA
TGGTCATGTCATTAGAGCTGGGATGGATGATAACATATTGGTAGGAAATGGATTGATTAACATGTATACAAAATGTGGAAGTTTCAAGCCAGGCTGTATGGTGTT
TGAAAAACTTGAAAATCGAGATTCAATCTCGTGGAATTCAATGATTGTAGGATATGGAATGCATGGACTTGGTAAAGATGCTCTATCAACTTTTAATCAGATGAT
AAAATCAGGATTTAGACCAGATGATGTTACCTTTATTGCTGCTCTGTCTGCTTGTAGTCACGCCGGTCTTGTTGCCGAAGGCCGTTGGCTTTTTTATCAGATGGT
ACAGAAGTTCAAGATCAAACCTCAGATGGAGCACTATTCGTGTATGGTTGATCTTCTCGGTCGTGCTGGGCTTGTGGAAGAAGCAAGTAACATAGTCAAGGGCAT
GCCATTTGAACCCAATGCTTATATCTGGAGTGCTATTCTCAACTCTTGCAGAATGCACAAAGATACGGATCTTGCAGAAGAAACTGCATCTCAAATTTTCAATCT
GAATTCCGAGATAACTGGAAGTCATATGTTGCTCTCGAATATTTTTGCTGCAAGCTGTAGATGGGAGGATTCTGCAAGGGTGAGAATTTCAGCAAGAATGAAGGG
CTTAAAGAAGGTTCCTGGATGCAGTTGGATTGAGGTGAAGAAGAAGGTTTATATGTTCAAAGCAGGACACTCAATACTAGAAGGTCTAGAGAAAGTTGATGAAAT
TCTTCATGATTTGGCTCTTCAGATAGAAAATCATGATTTTGATGATAGTATCATCGAATAGAATGTTCAAGAACACTATAAAGGGGTATTTGGGTTGAGGAGATG
AGATGAGATGAGATGAGATGAGTTTAGATATGAATTCAATTTATTGTTTGAGAGGCCAATTTCATACGTCACTGACATTTTATACCATGTCCATCAACAATGCAC
TCAATCTTTATCACCACTAATGTCTCATTGCTCAACTCTAGCGACTACCTCCAACGGCCAACTTCAGCAACCACTCTTGACGGCTAACTCCGACTACCAACTTTT
GACAACCACCTTCAACGGTCAAATCTAACAACCACCTCCAACGACCAACTCCAATAACATTCGACCAACACTATTGACGGCCAACTCCGGCAACCTCCGATTGTG
GCCACTCCACCAACTCCAACAATCATTATTTTAAATTATAACTACTACAAATTTGGGCTATGTTGACACGAGTTTTTTGTTTTGTCAATGCTCGTGACATAAAAC
CGTCAATAAAACCTTAATTTAAAGCTTAAAAATAACTATTAATTTAAACATACATATATAACATCAATAAATGTTTATATTTAAGTTGACGCATAAATTTATCAA
TAAAAACACATT
Protein sequenceShow/hide protein sequence
MRTGQCRPKCELWVSHKFAPISNLWVSKTLNPRNRFMLYACSYQRFKSASFCFPRLSINFHSQFSSITYDDDLLDFFDHLLQQCSSIQHSKQVHSATVVTGAYCS
AFVAARLVSIYARYGLVSDARKVFNSAPFEGLSNLLLWNSIIRANVYHGYCKEALQLYGKMRDYGVLGDGFTFPLVLRASSNLGSFNLCKNLHCHVVQFGFQNHL
HVVNELIGMYAKLERMDDARKLFDKMRIKSVVSWNTMISGYAYNYDVNGASRMFHQMELEGVDPNPVTWTSLLSSHARCGHLEETLVLFSKMRMKGVGATAEMLA
VVLSVCADLPTLNRGKMIHGYMVKGGFDDYLFAKNALITVYGKGGDLGDAEKLFHEMKAKNLVSWNALISSYAESGLYDKAFELFSQLEKMEVYPEMKPNVITWS
AVICGFASKGLGEESLEVFRKMQLANVKANSVTIASVSSICAMLAALNLGREVHGHVIRAGMDDNILVGNGLINMYTKCGSFKPGCMVFEKLENRDSISWNSMIV
GYGMHGLGKDALSTFNQMIKSGFRPDDVTFIAALSACSHAGLVAEGRWLFYQMVQKFKIKPQMEHYSCMVDLLGRAGLVEEASNIVKGMPFEPNAYIWSAILNSC
RMHKDTDLAEETASQIFNLNSEITGSHMLLSNIFAASCRWEDSARVRISARMKGLKKVPGCSWIEVKKKVYMFKAGHSILEGLEKVDEILHDLALQIENHDFDDS
IIE