; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014426 (gene) of Snake gourd v1 genome

Gene IDTan0014426
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG10:1177243..1179184
RNA-Seq ExpressionTan0014426
SyntenyTan0014426
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147487.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Momordica charantia]1.4e-26886.69Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MNQLK IHAY LR+GVDYTKFLIEKLLQIPNLPYAC LFDLIPKPSVFLYNKFIQ++SS+G HHRCW LYYQMC QGCSPN+HSFTFLFAACASL N  P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
        GQMLH+HFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMP RDIPTWNSM+AGY+RSGDMGAALELFDRMP R+VVSWTALISG++QNGKYAKAL+
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +FLRLE E+ IKPNEVT+ASVLPACAQLGALDIG+RIEA+AR NG+FKNLYV NAILEVHARCGNIEEA++VFDEIGSKRNLCSWN+MIMGLAVHGRC+ 
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        A++LYDQML QR+RPDDVTF+GLLLACTHGGMVAKGRQLFESM+S+FQIAPKLEHYGC+VDLLGRAGEL+EAY+ I++MPM PDSVIWGALLGACSFH +
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH
        VELAEVAAESLFKLEPWNPGNYVILSNIYAS GDW GVARLRK MKGG ITKRAGYS IEV DGIHEFIVEDRSHLK+DEIYALLHGIY+IIKL KP  H
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH

Query:  NQNEGDELLYS
        +QNEG+ELL+S
Subjt:  NQNEGDELLYS

XP_022924284.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita moschata]4.4e-26787.57Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MNQLK IHAYSLRNGVDYTKFLI+KLLQIPNLPYACTLFDLIPKPSVFLYNKFIQ FSS GHHHRCWLLYYQMCLQGCSPN HSFTFLF ACAS  NA+P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
        GQMLHSHF KSGFASDVFALTALLDMY KLG+L+SARQLFDEMP RDIPTWNSMIAGYARSG MGAALELFD+MPTR+V+SWTALISG+AQNGKYAKAL+
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +FLRLE EK  KPNEVTIASVLPACA LGALDIGKRIEA+ARKNG+FKNLYV NAILEVHARCGNIEEA+RVFDEIGSKRNLCSWN+MIMGLAVHGRC D
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        ALQLYDQMLIQR RPDDVTFVGLLLACTHGGMVAKGRQLFESM+ +FQIAPKLEHYGC+VDLLGRAGE++EAYS I+SMPM PDSVIWGALLGACSFH N
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH
        VEL EVAAESLFKLEPWNPGNYVILSNIYAS GDWSGVAR RKMMKGG + KRAG S IEV DGIHEF+VEDRSH K+DEIYALLH +YAIIKL     H
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH

Query:  NQNEG-DELLYSSCI
        NQNEG +ELLYSS I
Subjt:  NQNEG-DELLYSSCI

XP_022979295.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita maxima]2.2e-26687.77Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MNQLK IHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQ FSS GH HRCWLLYYQMCLQGCSPN HSFTFLF ACAS  NA+P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
        GQMLHSHFCKSGFASDVFALTALLDMY KLG+L+SARQLFDEMP RDIPTWNSMIAGYARSG MGAALELFD+MP R+V+SWTALISG+AQNGKYAKAL+
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +FLRLE EK  KPNEVTIASVLPACAQLGALDIGKRIE +ARKNG+FKNLYV NAILEVHARCGNIEEA+RVFDEIGSKRNLCSWN+MIMGLAVHGRC D
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        ALQLYDQMLIQR RPDDVTFVGLLLACTHGGMVAKGRQ+FESM+ +FQIAPKLEHYGC+VDLLGRAGE++EAYS I+SMPM PDSVIWGALLGACSFH N
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH
        VEL EVAAESLFKLEPWNPGNYVILSNIYAS GDWSGVAR+RKMMKGG I KRAG S IEV DGIHEFIVEDRSH K+DEIYALLH IYAIIKL     H
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH

Query:  NQNEG-DELLYSSCI
        +QNEG +ELLYSS I
Subjt:  NQNEG-DELLYSSCI

XP_023527365.1 pentatricopeptide repeat-containing protein At5g08510 [Cucurbita pepo subsp. pepo]1.5e-26787.77Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MNQLK IHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQ FSS GHHHRCWLLYYQMCLQGCSPN HSFTFLF ACAS  NA+P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
        GQMLHSHFCKSGFASDVFALTALLDMY KLG+L+SARQLFDE P RDIPTWNSMIAGYARSG MGAAL+LFD+MPTR+V+SWTALISG+AQNGKYAKAL+
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +FLRLE EK  KPNEVTIASVLPACA LGALDIGKRIEA+ARKNG+FKNLYV NAILEVHARCGNIEEA+RVFDEIGSKRNLCSWN+MIMGLAVHGRC D
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        ALQLYDQML+QR RPDDVTFVGLLLACTHGGMVAKGRQLFESM+ +FQIAPKLEHYGC+VDLLGRAGE++EAYS I+SMPM PDSVIWGALLGACSFH N
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH
        VEL EVAAESLFKLEPWNPGNYVILSNIYAS GDWSGVAR+RKMMKGG I KRAG S IEV DGIHEFIVEDRSH K+DEIYALLH +YAIIKL     H
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH

Query:  NQNEG-DELLYSSCI
        NQNEG +ELLYSS I
Subjt:  NQNEG-DELLYSSCI

XP_038877076.1 pentatricopeptide repeat-containing protein At5g08510-like [Benincasa hispida]5.2e-26888.16Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        M QLK IHAYSLRNG+DYTKFLIEKLLQ PNLPYACTLFDLIP+PSVFLYNKFIQ FSS G  HRCWLLYYQMCLQGCSPN+HSFTFLFAACASL NA+P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
        GQMLHSHFCKSGFASDVFA TALLDMYAKLGMLRSARQLFDEMP RDIPTWNSMIAGYARSGDM AA +LFD+MP RSVVSWT LISG+AQNGKYAKAL+
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFK-NLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCN
        +FLRLE EK IKPNEVTIASVLPACAQLGALDIGKRIEA+AR NG+FK N YV NAILEVHARCGNI EA+RVFDEIGSKRNLCSWN+MIMGLAVHGRC+
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFK-NLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCN

Query:  DALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHD
        DALQLYDQMLI+RMRPDDVTFVGLLLACTHGGMVA+GRQ+FESM+S FQIAPKLEHYGC+VDLLGRAGELQEAY  I++MPMAPDSVIWGALLGACSFH 
Subjt:  DALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHD

Query:  NVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVP
        NVEL EVAAESLF LEPWNPGNYVILSNIYAS GDWSGVARLRKMMKGG+ITKRAGYS IEV DGIHEFIVEDRSHL++ EIYALLHGIY IIKLHK   
Subjt:  NVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVP

Query:  HNQNEGDELLYSSCI
        HNQNE DELLYSS I
Subjt:  HNQNEGDELLYSSCI

TrEMBL top hitse value%identityAlignment
A0A0A0LY28 Uncharacterized protein1.3e-26184.77Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MNQLK IHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD IPKPSV+LYNKFIQ FSS GH HRCWLLY QMC QGCSPN++SFTFLF ACASLFN +P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
        GQMLHSHFCKSGFASD+FA+TALLDMYAKLGMLRSARQLFDEMP RDIPTWNS+IAGYARSG M AALELF++MP R+V+SWTALISG+AQNGKYAKAL+
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +F+ LE EK  KPNEV+IASVLPAC+QLGALDIGKRIEA+AR NG+FKN YV NA+LE+HARCGNIEEA++VFDEIGSKRNLCSWN+MIMGLAVHGRC D
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        ALQLYDQMLI++MRPDDVTFVGLLLACTHGGMVA+GRQLFESM+S+FQ+APKLEHYGC+VDLLGRAGELQEAY+ I++MPMAPDSVIWG LLGACSFH N
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH
        VEL EVAAESLFKLEPWNPGNYVILSNIYA  GDWSGVARLRKMMKGG ITKRAGYS IEV DGIHEFIVEDRSHLK+ EIYALLH IY IIKLHK V H
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH

Query:  NQNEGDELLYSS
        + NE +ELLYSS
Subjt:  NQNEGDELLYSS

A0A6J1D160 pentatricopeptide repeat-containing protein At5g08510 isoform X23.0e-26686.93Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MNQLK IHAY LR+GVDYTKFLIEKLLQIPNLPYAC LFDLIPKPSVFLYNKFIQ++SS+G HHRCW LYYQMC QGCSPN+HSFTFLFAACASL N  P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
        GQMLH+HFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMP RDIPTWNSM+AGY+RSGDMGAALELFDRMP R+VVSWTALISG++QNGKYAKAL+
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +FLRLE E+ IKPNEVT+ASVLPACAQLGALDIG+RIEA+AR NG+FKNLYV NAILEVHARCGNIEEA++VFDEIGSKRNLCSWN+MIMGLAVHGRC+ 
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        A++LYDQML QR+RPDDVTF+GLLLACTHGGMVAKGRQLFESM+S+FQIAPKLEHYGC+VDLLGRAGEL+EAY+ I++MPM PDSVIWGALLGACSFH +
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH
        VELAEVAAESLFKLEPWNPGNYVILSNIYAS GDW GVARLRK MKGG ITKRAGYS IEV DGIHEFIVEDRSHLK+DEIYALLHGIY+IIKL KP  H
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH

Query:  NQNEG
        +QNEG
Subjt:  NQNEG

A0A6J1D2G9 pentatricopeptide repeat-containing protein At5g08510 isoform X16.6e-26986.69Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MNQLK IHAY LR+GVDYTKFLIEKLLQIPNLPYAC LFDLIPKPSVFLYNKFIQ++SS+G HHRCW LYYQMC QGCSPN+HSFTFLFAACASL N  P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
        GQMLH+HFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMP RDIPTWNSM+AGY+RSGDMGAALELFDRMP R+VVSWTALISG++QNGKYAKAL+
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +FLRLE E+ IKPNEVT+ASVLPACAQLGALDIG+RIEA+AR NG+FKNLYV NAILEVHARCGNIEEA++VFDEIGSKRNLCSWN+MIMGLAVHGRC+ 
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        A++LYDQML QR+RPDDVTF+GLLLACTHGGMVAKGRQLFESM+S+FQIAPKLEHYGC+VDLLGRAGEL+EAY+ I++MPM PDSVIWGALLGACSFH +
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH
        VELAEVAAESLFKLEPWNPGNYVILSNIYAS GDW GVARLRK MKGG ITKRAGYS IEV DGIHEFIVEDRSHLK+DEIYALLHGIY+IIKL KP  H
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH

Query:  NQNEGDELLYS
        +QNEG+ELL+S
Subjt:  NQNEGDELLYS

A0A6J1E8Q2 pentatricopeptide repeat-containing protein At5g08510 isoform X12.1e-26787.57Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MNQLK IHAYSLRNGVDYTKFLI+KLLQIPNLPYACTLFDLIPKPSVFLYNKFIQ FSS GHHHRCWLLYYQMCLQGCSPN HSFTFLF ACAS  NA+P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
        GQMLHSHF KSGFASDVFALTALLDMY KLG+L+SARQLFDEMP RDIPTWNSMIAGYARSG MGAALELFD+MPTR+V+SWTALISG+AQNGKYAKAL+
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +FLRLE EK  KPNEVTIASVLPACA LGALDIGKRIEA+ARKNG+FKNLYV NAILEVHARCGNIEEA+RVFDEIGSKRNLCSWN+MIMGLAVHGRC D
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        ALQLYDQMLIQR RPDDVTFVGLLLACTHGGMVAKGRQLFESM+ +FQIAPKLEHYGC+VDLLGRAGE++EAYS I+SMPM PDSVIWGALLGACSFH N
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH
        VEL EVAAESLFKLEPWNPGNYVILSNIYAS GDWSGVAR RKMMKGG + KRAG S IEV DGIHEF+VEDRSH K+DEIYALLH +YAIIKL     H
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH

Query:  NQNEG-DELLYSSCI
        NQNEG +ELLYSS I
Subjt:  NQNEG-DELLYSSCI

A0A6J1ISU4 pentatricopeptide repeat-containing protein At5g08510 isoform X11.0e-26687.77Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MNQLK IHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQ FSS GH HRCWLLYYQMCLQGCSPN HSFTFLF ACAS  NA+P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
        GQMLHSHFCKSGFASDVFALTALLDMY KLG+L+SARQLFDEMP RDIPTWNSMIAGYARSG MGAALELFD+MP R+V+SWTALISG+AQNGKYAKAL+
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +FLRLE EK  KPNEVTIASVLPACAQLGALDIGKRIE +ARKNG+FKNLYV NAILEVHARCGNIEEA+RVFDEIGSKRNLCSWN+MIMGLAVHGRC D
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        ALQLYDQMLIQR RPDDVTFVGLLLACTHGGMVAKGRQ+FESM+ +FQIAPKLEHYGC+VDLLGRAGE++EAYS I+SMPM PDSVIWGALLGACSFH N
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH
        VEL EVAAESLFKLEPWNPGNYVILSNIYAS GDWSGVAR+RKMMKGG I KRAG S IEV DGIHEFIVEDRSH K+DEIYALLH IYAIIKL     H
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPH

Query:  NQNEG-DELLYSSCI
        +QNEG +ELLYSS I
Subjt:  NQNEG-DELLYSSCI

SwissProt top hitse value%identityAlignment
Q9C501 Pentatricopeptide repeat-containing protein At1g333501.5e-10839.8Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKL-----LQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCS---PNEHSFTFLFAAC
        +N LK + ++ + +G+ ++ FL  KL     L++ NL YA  +FD    P+  LY   + A+SS+   H      +   +   S   PN   +  +  + 
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKL-----LQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCS---PNEHSFTFLFAAC

Query:  ASLFNAHPGQMLHSHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQ
          L +A    ++H+H  KSGF   V   TALL  YA  +  +  ARQLFDEM  R++ +W +M++GYARSGD+  A+ LF+ MP R V SW A+++   Q
Subjt:  ASLFNAHPGQMLHSHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQ

Query:  NGKYAKALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMG
        NG + +A+ +F R+  E  I+PNEVT+  VL ACAQ G L + K I AFA +     +++V N++++++ +CGN+EEA  VF ++ SK++L +WNSMI  
Subjt:  NGKYAKALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMG

Query:  LAVHGRCNDALQLYDQML---IQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIW
         A+HGR  +A+ ++++M+   I  ++PD +TF+GLL ACTHGG+V+KGR  F+ M + F I P++EHYGC++DLLGRAG   EA   + +M M  D  IW
Subjt:  LAVHGRCNDALQLYDQML---IQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIW

Query:  GALLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGI
        G+LL AC  H +++LAEVA ++L  L P N G   +++N+Y  +G+W    R RKM+K     K  G+S IE+++ +H+F   D+SH + +EIY +L  +
Subjt:  GALLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGI

Q9FFG8 Pentatricopeptide repeat-containing protein At5g442301.9e-10841.06Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQ------IPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACAS
        +NQ+K IH + LR G+D + +++ KL++      +P  PYA  + + +   + FL+   I+ ++  G       +Y  M  +  +P   +F+ L  AC +
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQ------IPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACAS

Query:  LFNAHPGQMLHSH-FCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNG
        + + + G+  H+  F   GF   V+    ++DMY K   +  AR++FDEMP RD+ +W  +IA YAR G+M  A ELF+ +PT+ +V+WTA+++GFAQN 
Subjt:  LFNAHPGQMLHSH-FCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNG

Query:  KYAKALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGY--FKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMG
        K  +AL+ F R+EK   I+ +EVT+A  + ACAQLGA     R    A+K+GY    ++ +G+A+++++++CGN+EEA  VF  + +K N+ +++SMI+G
Subjt:  KYAKALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGY--FKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMG

Query:  LAVHGRCNDALQLYDQMLIQ-RMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGA
        LA HGR  +AL L+  M+ Q  ++P+ VTFVG L+AC+H G+V +GRQ+F+SM   F + P  +HY CMVDLLGR G LQEA   I++M + P   +WGA
Subjt:  LAVHGRCNDALQLYDQMLIQ-RMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGA

Query:  LLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDG-IHEFIVEDRSHLKNDEI
        LLGAC  H+N E+AE+AAE LF+LEP   GNY++LSN+YAS GDW GV R+RK++K   + K    S +  ++G +H+F   + +H  +++I
Subjt:  LLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDG-IHEFIVEDRSHLKNDEI

Q9FNN7 Pentatricopeptide repeat-containing protein At5g085101.5e-16957.34Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MN +K +HA+ LR GVD TK L+++LL IPNL YA  LFD       FLYNK IQA+      H   +LY  +   G  P+ H+F F+FAA AS  +A P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
         ++LHS F +SGF SD F  T L+  YAKLG L  AR++FDEM  RD+P WN+MI GY R GDM AA+ELFD MP ++V SWT +ISGF+QNG Y++AL 
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +FL +EK+K +KPN +T+ SVLPACA LG L+IG+R+E +AR+NG+F N+YV NA +E++++CG I+ AKR+F+E+G++RNLCSWNSMI  LA HG+ ++
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        AL L+ QML +  +PD VTFVGLLLAC HGGMV KG++LF+SM+   +I+PKLEHYGCM+DLLGR G+LQEAY  I++MPM PD+V+WG LLGACSFH N
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYS-CIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHK
        VE+AE+A+E+LFKLEP NPGN VI+SNIYA+   W GV R+RK+MK   +TK AGYS  +EV   +H+F VED+SH ++ EIY +L  I+  +KL K
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYS-CIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHK

Q9LS72 Pentatricopeptide repeat-containing protein At3g292309.0e-10635.92Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNL----PYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLF
        +NQ+K +HA  +R  +     +  KL+   +L      A  +F+ + +P+V L N  I+A +     ++ + ++ +M   G   +  ++ FL  AC+   
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNL----PYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLF

Query:  NAHPGQMLHSHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGD
             +M+H+H  K G +SD++   AL+D Y+                                 K G LR AR+LFDEMP RD+ +WN+M+ GYAR  +
Subjt:  NAHPGQMLHSHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGD

Query:  MGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALDVF------------------------LRLEKEKVI--------KPNEVTIASVLPACAQLGAL
        M  A ELF++MP R+ VSW+ ++ G+++ G    A  +F                        L  E ++++        K +   + S+L AC + G L
Subjt:  MGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALDVF------------------------LRLEKEKVI--------KPNEVTIASVLPACAQLGAL

Query:  DIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCNDALQLYDQMLIQRMRPDDVTFVGLLLACTHGG
         +G RI +  +++    N YV NA+L+++A+CGN+++A  VF++I  K++L SWN+M+ GL VHG   +A++L+ +M  + +RPD VTF+ +L +C H G
Subjt:  DIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCNDALQLYDQMLIQRMRPDDVTFVGLLLACTHGG

Query:  MVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYAS
        ++ +G   F SM+  + + P++EHYGC+VDLLGR G L+EA   +++MPM P+ VIWGALLGAC  H+ V++A+   ++L KL+P +PGNY +LSNIYA+
Subjt:  MVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYAS

Query:  VGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALL
          DW GVA +R  MK   + K +G S +E+EDGIHEF V D+SH K+D+IY +L
Subjt:  VGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALL

Q9SIL5 Pentatricopeptide repeat-containing protein At2g205406.6e-10940.38Show/hide
Query:  NQLKHIHAYSLRNGVDYTKFLIEKLL----QIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCS-PNEHSFTFLFAACASLF
        N+ K I+A  + +G+  + F++ K++    +I ++ YA  LF+ +  P+VFLYN  I+A++    +     +Y Q+  +    P+  +F F+F +CASL 
Subjt:  NQLKHIHAYSLRNGVDYTKFLIEKLL----QIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCS-PNEHSFTFLFAACASLF

Query:  NAHPGQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYA
        + + G+ +H H CK G    V    AL+DMY K   L  A ++FDEM  RD+ +WNS+++GYAR G M  A  LF  M  +++VSWTA+ISG+   G Y 
Subjt:  NAHPGQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYA

Query:  KALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHG
        +A+D F  ++    I+P+E+++ SVLP+CAQLG+L++GK I  +A + G+ K   V NA++E++++CG I +A ++F ++  K ++ SW++MI G A HG
Subjt:  KALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHG

Query:  RCNDALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACS
          + A++ +++M   +++P+ +TF+GLL AC+H GM  +G + F+ M+ ++QI PK+EHYGC++D+L RAG+L+ A    ++MPM PDS IWG+LL +C 
Subjt:  RCNDALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACS

Query:  FHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRS
           N+++A VA + L +LEP + GNYV+L+NIYA +G W  V+RLRKM++   + K  G S IEV + + EF+  D S
Subjt:  FHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRS

Arabidopsis top hitse value%identityAlignment
AT1G33350.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-10939.8Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKL-----LQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCS---PNEHSFTFLFAAC
        +N LK + ++ + +G+ ++ FL  KL     L++ NL YA  +FD    P+  LY   + A+SS+   H      +   +   S   PN   +  +  + 
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKL-----LQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCS---PNEHSFTFLFAAC

Query:  ASLFNAHPGQMLHSHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQ
          L +A    ++H+H  KSGF   V   TALL  YA  +  +  ARQLFDEM  R++ +W +M++GYARSGD+  A+ LF+ MP R V SW A+++   Q
Subjt:  ASLFNAHPGQMLHSHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQ

Query:  NGKYAKALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMG
        NG + +A+ +F R+  E  I+PNEVT+  VL ACAQ G L + K I AFA +     +++V N++++++ +CGN+EEA  VF ++ SK++L +WNSMI  
Subjt:  NGKYAKALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMG

Query:  LAVHGRCNDALQLYDQML---IQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIW
         A+HGR  +A+ ++++M+   I  ++PD +TF+GLL ACTHGG+V+KGR  F+ M + F I P++EHYGC++DLLGRAG   EA   + +M M  D  IW
Subjt:  LAVHGRCNDALQLYDQML---IQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIW

Query:  GALLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGI
        G+LL AC  H +++LAEVA ++L  L P N G   +++N+Y  +G+W    R RKM+K     K  G+S IE+++ +H+F   D+SH + +EIY +L  +
Subjt:  GALLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGI

AT2G20540.1 mitochondrial editing factor 214.7e-11040.38Show/hide
Query:  NQLKHIHAYSLRNGVDYTKFLIEKLL----QIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCS-PNEHSFTFLFAACASLF
        N+ K I+A  + +G+  + F++ K++    +I ++ YA  LF+ +  P+VFLYN  I+A++    +     +Y Q+  +    P+  +F F+F +CASL 
Subjt:  NQLKHIHAYSLRNGVDYTKFLIEKLL----QIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCS-PNEHSFTFLFAACASLF

Query:  NAHPGQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYA
        + + G+ +H H CK G    V    AL+DMY K   L  A ++FDEM  RD+ +WNS+++GYAR G M  A  LF  M  +++VSWTA+ISG+   G Y 
Subjt:  NAHPGQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYA

Query:  KALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHG
        +A+D F  ++    I+P+E+++ SVLP+CAQLG+L++GK I  +A + G+ K   V NA++E++++CG I +A ++F ++  K ++ SW++MI G A HG
Subjt:  KALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHG

Query:  RCNDALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACS
          + A++ +++M   +++P+ +TF+GLL AC+H GM  +G + F+ M+ ++QI PK+EHYGC++D+L RAG+L+ A    ++MPM PDS IWG+LL +C 
Subjt:  RCNDALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACS

Query:  FHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRS
           N+++A VA + L +LEP + GNYV+L+NIYA +G W  V+RLRKM++   + K  G S IEV + + EF+  D S
Subjt:  FHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRS

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.4e-10735.92Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNL----PYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLF
        +NQ+K +HA  +R  +     +  KL+   +L      A  +F+ + +P+V L N  I+A +     ++ + ++ +M   G   +  ++ FL  AC+   
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNL----PYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLF

Query:  NAHPGQMLHSHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGD
             +M+H+H  K G +SD++   AL+D Y+                                 K G LR AR+LFDEMP RD+ +WN+M+ GYAR  +
Subjt:  NAHPGQMLHSHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGD

Query:  MGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALDVF------------------------LRLEKEKVI--------KPNEVTIASVLPACAQLGAL
        M  A ELF++MP R+ VSW+ ++ G+++ G    A  +F                        L  E ++++        K +   + S+L AC + G L
Subjt:  MGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALDVF------------------------LRLEKEKVI--------KPNEVTIASVLPACAQLGAL

Query:  DIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCNDALQLYDQMLIQRMRPDDVTFVGLLLACTHGG
         +G RI +  +++    N YV NA+L+++A+CGN+++A  VF++I  K++L SWN+M+ GL VHG   +A++L+ +M  + +RPD VTF+ +L +C H G
Subjt:  DIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCNDALQLYDQMLIQRMRPDDVTFVGLLLACTHGG

Query:  MVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYAS
        ++ +G   F SM+  + + P++EHYGC+VDLLGR G L+EA   +++MPM P+ VIWGALLGAC  H+ V++A+   ++L KL+P +PGNY +LSNIYA+
Subjt:  MVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYAS

Query:  VGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALL
          DW GVA +R  MK   + K +G S +E+EDGIHEF V D+SH K+D+IY +L
Subjt:  VGDWSGVARLRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALL

AT5G08510.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-17057.34Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP
        MN +K +HA+ LR GVD TK L+++LL IPNL YA  LFD       FLYNK IQA+      H   +LY  +   G  P+ H+F F+FAA AS  +A P
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHP

Query:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD
         ++LHS F +SGF SD F  T L+  YAKLG L  AR++FDEM  RD+P WN+MI GY R GDM AA+ELFD MP ++V SWT +ISGF+QNG Y++AL 
Subjt:  GQMLHSHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALD

Query:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND
        +FL +EK+K +KPN +T+ SVLPACA LG L+IG+R+E +AR+NG+F N+YV NA +E++++CG I+ AKR+F+E+G++RNLCSWNSMI  LA HG+ ++
Subjt:  VFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCND

Query:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN
        AL L+ QML +  +PD VTFVGLLLAC HGGMV KG++LF+SM+   +I+PKLEHYGCM+DLLGR G+LQEAY  I++MPM PD+V+WG LLGACSFH N
Subjt:  ALQLYDQMLIQRMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDN

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYS-CIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHK
        VE+AE+A+E+LFKLEP NPGN VI+SNIYA+   W GV R+RK+MK   +TK AGYS  +EV   +H+F VED+SH ++ EIY +L  I+  +KL K
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYS-CIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHK

AT5G44230.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-10941.06Show/hide
Query:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQ------IPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACAS
        +NQ+K IH + LR G+D + +++ KL++      +P  PYA  + + +   + FL+   I+ ++  G       +Y  M  +  +P   +F+ L  AC +
Subjt:  MNQLKHIHAYSLRNGVDYTKFLIEKLLQ------IPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACAS

Query:  LFNAHPGQMLHSH-FCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNG
        + + + G+  H+  F   GF   V+    ++DMY K   +  AR++FDEMP RD+ +W  +IA YAR G+M  A ELF+ +PT+ +V+WTA+++GFAQN 
Subjt:  LFNAHPGQMLHSH-FCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNG

Query:  KYAKALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGY--FKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMG
        K  +AL+ F R+EK   I+ +EVT+A  + ACAQLGA     R    A+K+GY    ++ +G+A+++++++CGN+EEA  VF  + +K N+ +++SMI+G
Subjt:  KYAKALDVFLRLEKEKVIKPNEVTIASVLPACAQLGALDIGKRIEAFARKNGY--FKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMG

Query:  LAVHGRCNDALQLYDQMLIQ-RMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGA
        LA HGR  +AL L+  M+ Q  ++P+ VTFVG L+AC+H G+V +GRQ+F+SM   F + P  +HY CMVDLLGR G LQEA   I++M + P   +WGA
Subjt:  LAVHGRCNDALQLYDQMLIQ-RMRPDDVTFVGLLLACTHGGMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGA

Query:  LLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDG-IHEFIVEDRSHLKNDEI
        LLGAC  H+N E+AE+AAE LF+LEP   GNY++LSN+YAS GDW GV R+RK++K   + K    S +  ++G +H+F   + +H  +++I
Subjt:  LLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVARLRKMMKGGRITKRAGYSCIEVEDG-IHEFIVEDRSHLKNDEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCAATTGAAGCACATTCATGCCTACAGTCTCAGAAACGGCGTAGATTATACCAAATTCCTCATCGAAAAACTCCTGCAAATCCCAAATCTTCCATATGCCTGTAC
CCTGTTCGACCTTATTCCTAAGCCCTCTGTTTTTCTCTACAACAAGTTCATTCAAGCATTTTCTTCTACCGGTCACCATCACCGATGCTGGTTGCTTTACTACCAAATGT
GCCTCCAGGGCTGCTCTCCAAACGAGCATTCATTCACATTCCTCTTTGCCGCATGCGCTTCGCTTTTTAATGCTCACCCAGGTCAGATGCTTCATTCCCATTTCTGTAAG
TCGGGATTTGCCTCTGATGTTTTTGCTTTGACGGCGTTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGCCTGCTCGAGA
TATACCCACTTGGAACTCGATGATTGCTGGTTATGCCAGGTCCGGGGACATGGGGGCAGCGTTAGAATTGTTTGACAGAATGCCTACGAGAAGTGTCGTTTCGTGGACAG
CATTGATTTCTGGGTTTGCTCAGAATGGGAAGTATGCGAAGGCCTTGGATGTGTTTTTGCGATTGGAAAAAGAGAAAGTCATTAAACCAAATGAGGTGACCATTGCAAGT
GTTCTTCCTGCCTGTGCTCAGCTTGGGGCCTTGGATATTGGGAAAAGGATTGAAGCATTTGCACGAAAGAATGGCTACTTCAAAAACTTGTATGTGGGTAATGCGATACT
GGAAGTGCATGCTAGGTGCGGTAACATCGAGGAAGCTAAGCGAGTTTTTGATGAGATTGGAAGCAAGAGAAATTTGTGCTCGTGGAATTCCATGATAATGGGATTGGCTG
TGCATGGAAGATGCAATGATGCTCTTCAGCTTTATGATCAAATGTTGATACAAAGAATGAGACCCGATGATGTGACGTTTGTAGGGCTTCTCTTGGCTTGCACTCATGGA
GGCATGGTTGCGAAAGGTCGACAACTCTTTGAATCAATGAAAAGTGAGTTTCAAATTGCTCCCAAATTAGAGCACTATGGCTGCATGGTTGATCTATTAGGCAGGGCCGG
AGAGCTACAAGAAGCTTACAGTCGCATTCGAAGCATGCCAATGGCTCCTGATTCTGTAATATGGGGAGCTCTTCTGGGAGCTTGCAGCTTCCATGACAATGTTGAATTGG
CCGAAGTAGCAGCTGAGTCTCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTCTCTAACATTTACGCGTCAGTTGGCGATTGGTCTGGAGTTGCAAGG
CTAAGGAAGATGATGAAGGGAGGACGCATTACAAAGAGAGCAGGGTATAGTTGTATTGAAGTGGAAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCGCATTTGAA
GAATGATGAAATATATGCTTTACTTCATGGAATTTATGCCATTATTAAACTTCACAAGCCTGTACCTCATAATCAAAATGAAGGTGATGAACTACTTTATTCTTCATGTA
TCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACCAATTGAAGCACATTCATGCCTACAGTCTCAGAAACGGCGTAGATTATACCAAATTCCTCATCGAAAAACTCCTGCAAATCCCAAATCTTCCATATGCCTGTAC
CCTGTTCGACCTTATTCCTAAGCCCTCTGTTTTTCTCTACAACAAGTTCATTCAAGCATTTTCTTCTACCGGTCACCATCACCGATGCTGGTTGCTTTACTACCAAATGT
GCCTCCAGGGCTGCTCTCCAAACGAGCATTCATTCACATTCCTCTTTGCCGCATGCGCTTCGCTTTTTAATGCTCACCCAGGTCAGATGCTTCATTCCCATTTCTGTAAG
TCGGGATTTGCCTCTGATGTTTTTGCTTTGACGGCGTTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGCCTGCTCGAGA
TATACCCACTTGGAACTCGATGATTGCTGGTTATGCCAGGTCCGGGGACATGGGGGCAGCGTTAGAATTGTTTGACAGAATGCCTACGAGAAGTGTCGTTTCGTGGACAG
CATTGATTTCTGGGTTTGCTCAGAATGGGAAGTATGCGAAGGCCTTGGATGTGTTTTTGCGATTGGAAAAAGAGAAAGTCATTAAACCAAATGAGGTGACCATTGCAAGT
GTTCTTCCTGCCTGTGCTCAGCTTGGGGCCTTGGATATTGGGAAAAGGATTGAAGCATTTGCACGAAAGAATGGCTACTTCAAAAACTTGTATGTGGGTAATGCGATACT
GGAAGTGCATGCTAGGTGCGGTAACATCGAGGAAGCTAAGCGAGTTTTTGATGAGATTGGAAGCAAGAGAAATTTGTGCTCGTGGAATTCCATGATAATGGGATTGGCTG
TGCATGGAAGATGCAATGATGCTCTTCAGCTTTATGATCAAATGTTGATACAAAGAATGAGACCCGATGATGTGACGTTTGTAGGGCTTCTCTTGGCTTGCACTCATGGA
GGCATGGTTGCGAAAGGTCGACAACTCTTTGAATCAATGAAAAGTGAGTTTCAAATTGCTCCCAAATTAGAGCACTATGGCTGCATGGTTGATCTATTAGGCAGGGCCGG
AGAGCTACAAGAAGCTTACAGTCGCATTCGAAGCATGCCAATGGCTCCTGATTCTGTAATATGGGGAGCTCTTCTGGGAGCTTGCAGCTTCCATGACAATGTTGAATTGG
CCGAAGTAGCAGCTGAGTCTCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTCTCTAACATTTACGCGTCAGTTGGCGATTGGTCTGGAGTTGCAAGG
CTAAGGAAGATGATGAAGGGAGGACGCATTACAAAGAGAGCAGGGTATAGTTGTATTGAAGTGGAAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCGCATTTGAA
GAATGATGAAATATATGCTTTACTTCATGGAATTTATGCCATTATTAAACTTCACAAGCCTGTACCTCATAATCAAAATGAAGGTGATGAACTACTTTATTCTTCATGTA
TCTGA
Protein sequenceShow/hide protein sequence
MNQLKHIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQAFSSTGHHHRCWLLYYQMCLQGCSPNEHSFTFLFAACASLFNAHPGQMLHSHFCK
SGFASDVFALTALLDMYAKLGMLRSARQLFDEMPARDIPTWNSMIAGYARSGDMGAALELFDRMPTRSVVSWTALISGFAQNGKYAKALDVFLRLEKEKVIKPNEVTIAS
VLPACAQLGALDIGKRIEAFARKNGYFKNLYVGNAILEVHARCGNIEEAKRVFDEIGSKRNLCSWNSMIMGLAVHGRCNDALQLYDQMLIQRMRPDDVTFVGLLLACTHG
GMVAKGRQLFESMKSEFQIAPKLEHYGCMVDLLGRAGELQEAYSRIRSMPMAPDSVIWGALLGACSFHDNVELAEVAAESLFKLEPWNPGNYVILSNIYASVGDWSGVAR
LRKMMKGGRITKRAGYSCIEVEDGIHEFIVEDRSHLKNDEIYALLHGIYAIIKLHKPVPHNQNEGDELLYSSCI