; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005368 (gene) of Snake gourd v1 genome

Gene IDTan0005368
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG05:17529023..17530845
RNA-Seq ExpressionTan0005368
SyntenyTan0005368
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571131.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.8e-24685.17Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS
        MSIPLLKR+LW I N +F  PFSPSFFSSSPA  PSPSTKPS+STVVSVLTHHRSKSRWRFLNSLCPDGFDPG+FSDIVLQIKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR
        LCNH+LVSYSTVIHILARGRLRTHAK VIQTA+RA+ LEDGD CSKCERFSSSRPLKLFETLVKTYK+CGSAPFVFDLLIK+LLDSKKL+ AIQIVRMLR
Subjt:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR

Query:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV
        SRGISPQIGTLNSLIL +SKCEGANAGYA+FREVFGL+C+IEE+NVK+KARVSPNVHTFNTLMVCFYQDGLVG+VKEIWDQL +SNSI NSYSY ILMAV
Subjt:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV

Query:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE
        LCEEKRM EAE+LW+EM+MKKLE DAVAYNTIIGGFCKAGNI+RAEEFFREMEL G EST+STFEHLINGYCE+GDVDS LLVYKDMRRK F+++   LE
Subjt:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE

Query:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS
        A+ R LCA+TRLLEALDVFG ATED+N CPTMETYELLI+GLCQ+G +EAAFKLQ+QMVGKGFKPNSKIY SF+DAY+KEGNEEMV+KL +E+LEIQLS
Subjt:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS

KAG7010942.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.4e-24685.17Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS
        MSIPLLKR+LW I N +F  PFSPSFFSSSPA  PSPSTKPS+STVVSVLTHHRSKSRWRFLNSLCPDGFDPG+FSDIVLQIKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR
        LCNH+LVSYSTVIHILARGRLRTHAK VIQTA+RA+ LEDGD CSKCERFSSSRPLKLFETLVKTYK+CGSAPFVFDLLIK+LLDSKKL+ AIQIVRMLR
Subjt:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR

Query:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV
        SRGISPQIGTLNSLIL +SKCEGANAGYA+FREVFGL+C+IEE+NVK+KARVSPNVHTFNTLMVCFYQDGLVG+VKEIWDQL +SNSI NSYSY ILMAV
Subjt:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV

Query:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE
        LCEEKRM EAE+LW+EM+MKKLE DAVAYNTIIGGFCKAGNI+RAEEFFREMEL G EST+STFEHLINGYCE+GDVDS LLVYKDMRRK F+++   LE
Subjt:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE

Query:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS
        A+ R LCA+TRLLEALDVFG ATED+N CPTMETYELLI+GLCQ+G +EAAFKLQ QMVGKGFKPNSKIY SF+DAY+KEGNEEMV+KL +E+LEIQLS
Subjt:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS

XP_022944388.1 pentatricopeptide repeat-containing protein At2g15980 [Cucurbita moschata]8.3e-24785.57Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS
        MSIPLLKR+LW I N +F  PFSPSFFSSSPA  PSPSTKPS+STVVSVLTHHRSKSRWRFLNSLCPDGFDPG+FSDIVLQIKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR
        LCNH+LVSYSTVIHILARGRLRT AK VIQTA+RA+ LEDGDDCSKCERFSSSRPLKLFETLVKTYK+CGSAPFVFDLLIK+LLDSKKL+ AIQIVRMLR
Subjt:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR

Query:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV
        SRGISPQIGTLNSLIL +SKCEGANAGYA+FREVFGL+C+IEE+NVK+KARVSPNVHTFNTLMVCFYQDGLVG+ KEIWDQL +SNSI NSYSY ILMAV
Subjt:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV

Query:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE
        LCEEKRM EAE+LW+EM+MKKLE DAVAYNTIIGGFCKAGNI+RAEEFFREMEL G EST+STFEHLINGYCETGDVDS LLVYKDMRRK F+L+   LE
Subjt:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE

Query:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS
        A+ R LCA+TRLLEALD+FG ATED+N CPTMETYELLI+GLCQEG +EAAFKLQAQMVGKGFKPNSKIY SF+DAY+KEGNEEMV+KL +E+LEIQLS
Subjt:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS

XP_023512842.1 pentatricopeptide repeat-containing protein At2g15980 [Cucurbita pepo subsp. pepo]8.3e-24785.37Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS
        MSIPLLKR+LW I N +F  PFSPSFFSSSPA  PSPSTKPS+STVVSVLTHHRSKSRWRFLNSLCPDGFDPG+FSDIVLQIKNN HL L FFLWT++KS
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR
        LCNH+LVSYSTVIHILARGRLRTHAK VIQTA+RA+ LEDGDDCS CERFSSSRPLKLFETLVKTYK+CGSAPFVFDLLIK+LLDSKKL+ AIQIVRMLR
Subjt:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR

Query:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV
        SRGISPQIGTLNSLIL +SKCEGANAGYA+FREVFGL+C+IEE+NVK+KARVSPNVHTFNTLMVCFYQDGLVG+VKEIWDQL +S SI NSYSY ILMAV
Subjt:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV

Query:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE
        LCEEKRM EAE+LW+EM+MKKLE DAVAYNTIIGGFCKAGN++RAEEFFREMEL G EST+STFEHLINGYCETGDVDS LLVYKDMRRK F+L+   LE
Subjt:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE

Query:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS
        A+ R LCA+TRLLEALDVFG A E +NFCPTMETYELLI+GLCQEG +EAAFKLQAQMVGKGFKPNSKIY SF+DAY+KEGNEEMV+KLG+E+LEIQLS
Subjt:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS

XP_038901621.1 pentatricopeptide repeat-containing protein At2g15980 [Benincasa hispida]7.3e-25186.97Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS
        MS+PLL+RTLWPIRN +F  PFS SFFSSSP G PSPSTKPS+STVVSVLTHHRSKSRWRFLNSLCPDGFDPG+FSDI+LQIKNNPHLALRFF WTQNKS
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR
        LCNHNLVSYSTVIHILARGRLRTHAK VIQTA+RA+ELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIK+LLDSKKLESAIQIVRMLR
Subjt:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR

Query:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV
        SRGISPQ+GTLNSLIL VSK +GANAGYAIF+EVFGLDC+IEEENVKLKA VSPNVHTFNTLM CFYQDGLVG+VK+IWDQL +SNSI NSYSY ILMAV
Subjt:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV

Query:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE
         CEEKRM EAE+LW EM++KKLE DAVAYNTIIGGFCKAGN+ RAEEF+REMELSGIEST+STFEHLINGYCETGDVDS LLVYKDMRRK F  +A  LE
Subjt:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE

Query:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS
         +IR LCA+TRLLEALDVF  A EDSNFCPT+ETYELLI+GLCQEG IE AFKLQAQMVGKGFKPN KIY SF+DAY KEGNEEMVEKLGKE+LEIQLS
Subjt:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS

TrEMBL top hitse value%identityAlignment
A0A5A7TQU6 Pentatricopeptide repeat-containing protein1.4e-23682.77Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS
        MS PLLKRTL PI N +    FS SFFSSSP   PSPSTKPS+STVVSVLTH RSKSRWRFLNSLCP+GFDPG+FSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR
        LCNHNL+SYST+IHILARGRLRTHAK VIQTA+RA+ELED D+ S+ ERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIK+LLDSKKL+S+I+IVRMLR
Subjt:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR

Query:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV
        SRGISPQ+ TLNSLIL VSKC+GAN  YAIF EVFGLDC+IE+E+VKLK RVSPNVHTFNTLM CFYQDG VG+VKEIWDQL +SNSI NSYSY ILMAV
Subjt:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV

Query:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE
        LCEEKRM EAE+LW+EM+MKKLE D VAYNTIIGGFCKAGN QRAEEF+REMELSGIEST+ST EHLINGYC+TGDVDS LLVYKDMRRK F+L+ASTLE
Subjt:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE

Query:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS
         +I  LCA+ RLLEALDVFG A EDS+FCPTMET+E+LI+ LCQEG IE AFKLQAQMVGKGFKPN KIY SF+DAY KEGN EMVEKLGKE+ EIQLS
Subjt:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS

A0A5D3CQ25 Pentatricopeptide repeat-containing protein1.4e-23682.77Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS
        MS PLLKRTL PI N +    FS SFFSSSP   PSPSTKPS+STVVSVLTH RSKSRWRFLNSLCP+GFDPG+FSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR
        LCNHNL+SYST+IHILARGRLRTHAK VIQTA+RA+ELED D+ S+ ERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIK+LLDSKKL+S+I+IVRMLR
Subjt:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR

Query:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV
        SRGISPQ+ TLNSLIL VSKC+GAN  YAIF EVFGLDC+IE+E+VKLK RVSPNVHTFNTLM CFYQDG VG+VKEIWDQL +SNSI NSYSY ILMAV
Subjt:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV

Query:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE
        LCEEKRM EAE+LW+EM+MKKLE D VAYNTIIGGFCKAGN QRAEEF+REMELSGIEST+ST EHLINGYC+TGDVDS LLVYKDMRRK F+L+ASTLE
Subjt:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE

Query:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS
         +I  LCA+ RLLEALDVFG A EDS+FCPTMET+E+LI+ LCQEG IE AFKLQAQMVGKGFKPN KIY SF+DAY KEGN EMVEKLGKE+ EIQLS
Subjt:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS

A0A6J1D472 pentatricopeptide repeat-containing protein At2g159805.1e-24283.97Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS
        MSIPLLKRTL  IRNP FK PFSPSF SS      SPS KPS+STVVSVLTHHRSKSRWRFLNSLCPDGFDPG+FSDIVL IKNNPHL+LRFFLWTQNKS
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR
        LC HNLVSYSTVIHILARGRLRTHAKAVIQTA+RA+ELED D CS C++F  SRPL+LF+TLVKTYKRCGSAPFVFDLLIK+LLDS+KLE AIQI+RMLR
Subjt:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR

Query:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV
        SRGISPQ+ TLNSLIL VSKCEGANAGYAIFREVFGLDC+++EE VK+KA+ SPNVH+FNTLM+CFYQDGLVG+VKEIWDQLTESNSI NSYSY ILM V
Subjt:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV

Query:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE
         C+++RMVEAE+LWKEMR+KKLE DAVAYNTIIGGFCKAG+IQRAEE FREMELSGIEST+STFEHLINGYCETGD+DS LLVYKDMRRK+F+L+ASTLE
Subjt:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE

Query:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS
        A++R L A+TRLLEALDVFG  TEDSNFCPTMETYELLI+GLC+EG IEAAFKLQAQMVGKGFKP+SK+Y SF+DAYT EGNEEMVEKL KE+LEIQLS
Subjt:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS

A0A6J1FY05 pentatricopeptide repeat-containing protein At2g159804.0e-24785.57Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS
        MSIPLLKR+LW I N +F  PFSPSFFSSSPA  PSPSTKPS+STVVSVLTHHRSKSRWRFLNSLCPDGFDPG+FSDIVLQIKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR
        LCNH+LVSYSTVIHILARGRLRT AK VIQTA+RA+ LEDGDDCSKCERFSSSRPLKLFETLVKTYK+CGSAPFVFDLLIK+LLDSKKL+ AIQIVRMLR
Subjt:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR

Query:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV
        SRGISPQIGTLNSLIL +SKCEGANAGYA+FREVFGL+C+IEE+NVK+KARVSPNVHTFNTLMVCFYQDGLVG+ KEIWDQL +SNSI NSYSY ILMAV
Subjt:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV

Query:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE
        LCEEKRM EAE+LW+EM+MKKLE DAVAYNTIIGGFCKAGNI+RAEEFFREMEL G EST+STFEHLINGYCETGDVDS LLVYKDMRRK F+L+   LE
Subjt:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE

Query:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS
        A+ R LCA+TRLLEALD+FG ATED+N CPTMETYELLI+GLCQEG +EAAFKLQAQMVGKGFKPNSKIY SF+DAY+KEGNEEMV+KL +E+LEIQLS
Subjt:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS

A0A6J1JGQ1 pentatricopeptide repeat-containing protein At2g159801.2e-24684.97Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS
        MSIPLLKR+LW I N +F  PFSPSFFSSSPA  P PSTKPS+STVVSVLTHHRSKSRWRFLNSLCPDGFDPG+FSDIVLQIKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR
        LCNH+LVSYSTVIHILARGRLRTHAK VIQ A+RA+ LED DDCS+CERFSSSRPLKLFETLVKTYK+CGSAPFVFDLLIK+LLDSKKL+ AIQIVRMLR
Subjt:  LCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLR

Query:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV
        SRGISPQIGTLNSLIL +SKCEGANAGYA+FREVFGL+C+IEEENVK+KAR SPNVHTFNTLMVCFYQDGLVG+VKEIWDQL +SNSI NSYSY ILMAV
Subjt:  SRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAV

Query:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE
        LCEEKRM EAE+LW+EM+MKKLE DAVAYNTIIGGFCKAGN++RAEEFFREMEL G EST+STFEHLINGYCETGDVDS LLVYKDMRRK F+L+   LE
Subjt:  LCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLE

Query:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS
        A+ R LC +TRLLEALDVFG ATE +NFCPTMETYELLI+GLCQ+G +EAAFKLQAQMVGKGFKPNSKIY SF+DAY+KEGNEEMV+KLG+E+LEIQLS
Subjt:  AVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS

SwissProt top hitse value%identityAlignment
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial4.3e-3626.65Show/hide
Query:  IVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFD
        ++++IK +  L L FF W +++   + NL S   VIH+    +    A+++I +     +L   D             ++ F+ LV TYK  GS P VFD
Subjt:  IVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFD

Query:  LLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSK-CEGANAGYAIFREV--FGLDCDIEEENV---------------------KLKARVS
        +  + L+D   L  A ++   + + G+   + + N  +  +SK C        +FRE    G+  ++   N+                     +LK   +
Subjt:  LLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSK-CEGANAGYAIFREV--FGLDCDIEEENV---------------------KLKARVS

Query:  PNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREME
        P+V +++T++  + + G + +V ++ + +       NSY Y  ++ +LC   ++ EAE+ + EM  + +  D V Y T+I GFCK G+I+ A +FF EM 
Subjt:  PNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREME

Query:  LSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFK
           I     T+  +I+G+C+ GD+     ++ +M  K    D+ T   +I   C    + +A  V     + +   P + TY  LI GLC+EG +++A +
Subjt:  LSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFK

Query:  LQAQMVGKGFKPNSKIYCSFMDAYTKEGN-EEMVEKLGK
        L  +M   G +PN   Y S ++   K GN EE V+ +G+
Subjt:  LQAQMVGKGFKPNSKIYCSFMDAYTKEGN-EEMVEKLGK

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011102.0e-3326.22Show/hide
Query:  PIRN--PSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRF-----FLWTQNKSLCNH
        P++N   S    F PS  SSS +   S S   S S +V  +     +      N L     +P    +++ + +N+  L  RF     F +   K    H
Subjt:  PIRN--PSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRF-----FLWTQNKSLCNH

Query:  NLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDC-SKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLRSRG
          +S S +IHIL                VR+  L D   C  +  R S    L++  +L  T+  CGS   VFDLLI++ + ++KL  A +   +LRS+G
Subjt:  NLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDC-SKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLRSRG

Query:  ISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCE
         +  I   N+LI S+ +       + +++E+              ++ V  NV+T N ++    +DG + +V     Q+ E     +  +Y  L++    
Subjt:  ISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCE

Query:  EKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVI
        +  M EA +L   M  K        YNT+I G CK G  +RA+E F EM  SG+    +T+  L+   C+ GDV     V+ DMR +    D     +++
Subjt:  EKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVI

Query:  RALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLE
                L +AL  F  + +++   P    Y +LI G C++G I  A  L+ +M+ +G   +   Y + +    K       +KL  E+ E
Subjt:  RALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLE

Q9LQ14 Pentatricopeptide repeat-containing protein At1g62930, chloroplastic6.8e-3426.52Show/hide
Query:  NLVSYSTVIHILARGRLRTHAKAVIQTAVRASELED----GDDCS-KCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRML
        N V+++T+IH L      + A A+I   V      D    G   +  C+R      L L + + K   +  +   ++  +I +L + K +  A+ +   +
Subjt:  NLVSYSTVIHILARGRLRTHAKAVIQTAVRASELED----GDDCS-KCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRML

Query:  RSRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMA
         ++GI P + T NSLI  +         Y  + +   L  D+      ++ +++PNV TF+ L+  F ++G + + ++++D++ + +   + ++Y  L+ 
Subjt:  RSRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMA

Query:  VLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTL
          C   R+ EA+ +++ M  K    + V YNT+I GFCKA  ++   E FREM   G+     T+  LI G  + GD D    ++K M       D  T 
Subjt:  VLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTL

Query:  EAVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLE
          ++  LC   +L +AL VF    + S   P + TY ++I G+C+ G +E  + L   +  KG KPN  IY + +  + ++G +E  + L +E+ E
Subjt:  EAVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLE

Q9SZ10 Pentatricopeptide repeat-containing protein At4g26680, mitochondrial3.4e-4125.38Show/hide
Query:  RPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAV
        R +P  K      V+V   H  +S W  LN L  D  D     +++L+I+ +  L+L FF W + ++  +H+L +++ V+H L + R    A+++++  +
Subjt:  RPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAV

Query:  RASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSKCEGANAGYAIFRE
            +  G D           P K+F+ L+ +Y+ C S P VFD L K+    KK  +A      ++  G  P + + N+ + S+      +     +RE
Subjt:  RASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSKCEGANAGYAIFRE

Query:  VFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTII
        +              + ++SPN +T N +M  + + G + +  E+   +          SY  L+A  CE+  +  A KL   M    L+ + V +NT+I
Subjt:  VFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTII

Query:  GGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTME
         GFC+A  +Q A + F EM+   +     T+  LINGY + GD +     Y+DM       D  T  A+I  LC + +  +A   F    +  N  P   
Subjt:  GGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTME

Query:  TYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVL
        T+  LI G C     +  F+L   M+  G  PN + +   + A+ +  + +   ++ +E++
Subjt:  TYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVL

Q9XIM8 Pentatricopeptide repeat-containing protein At2g159801.5e-12647.31Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFS--SSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQN
        MS  +L+R L P R P      S S  +  SSP   PSP + P +S  VS+LTHHRSKSRW  L SL P GF P  FS+I L ++NNPHL+LRFFL+T+ 
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFS--SSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQN

Query:  KSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRM
         SLC+H+  S ST+IHIL+R RL++HA  +I+ A+R +  ++ +D          R LK+F +L+K+Y RCGSAPFVFDLLIKS LDSK+++ A+ ++R 
Subjt:  KSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRM

Query:  LRSRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTES-NSIQNSYSYIIL
        LRSRGI+ QI T N+LI  VS+  GA+ GY ++REVFGLD    +E  K+  ++ PN  TFN++MV FY++G    V+ IW ++ E      N YSY +L
Subjt:  LRSRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTES-NSIQNSYSYIIL

Query:  MAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDAS
        M   C    M EAEK+W+EM+++ + +D VAYNT+IGG C    + +A+E FR+M L GIE T  T+EHL+NGYC+ GDVDS L+VY++M+RK F  D  
Subjt:  MAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDAS

Query:  TLEAVIRALCAK---TRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVL
        T+EA++  LC      R++EA D+   A  ++ F P+   YELL+  LC++G ++ A  +QA+MVGKGFKP+ + Y +F+D Y   G+EE    L  E+ 
Subjt:  TLEAVIRALCAK---TRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVL

Query:  E
        E
Subjt:  E

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.0e-3726.65Show/hide
Query:  IVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFD
        ++++IK +  L L FF W +++   + NL S   VIH+    +    A+++I +     +L   D             ++ F+ LV TYK  GS P VFD
Subjt:  IVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFD

Query:  LLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSK-CEGANAGYAIFREV--FGLDCDIEEENV---------------------KLKARVS
        +  + L+D   L  A ++   + + G+   + + N  +  +SK C        +FRE    G+  ++   N+                     +LK   +
Subjt:  LLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSK-CEGANAGYAIFREV--FGLDCDIEEENV---------------------KLKARVS

Query:  PNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREME
        P+V +++T++  + + G + +V ++ + +       NSY Y  ++ +LC   ++ EAE+ + EM  + +  D V Y T+I GFCK G+I+ A +FF EM 
Subjt:  PNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREME

Query:  LSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFK
           I     T+  +I+G+C+ GD+     ++ +M  K    D+ T   +I   C    + +A  V     + +   P + TY  LI GLC+EG +++A +
Subjt:  LSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFK

Query:  LQAQMVGKGFKPNSKIYCSFMDAYTKEGN-EEMVEKLGK
        L  +M   G +PN   Y S ++   K GN EE V+ +G+
Subjt:  LQAQMVGKGFKPNSKIYCSFMDAYTKEGN-EEMVEKLGK

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein3.0e-3726.65Show/hide
Query:  IVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFD
        ++++IK +  L L FF W +++   + NL S   VIH+    +    A+++I +     +L   D             ++ F+ LV TYK  GS P VFD
Subjt:  IVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFD

Query:  LLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSK-CEGANAGYAIFREV--FGLDCDIEEENV---------------------KLKARVS
        +  + L+D   L  A ++   + + G+   + + N  +  +SK C        +FRE    G+  ++   N+                     +LK   +
Subjt:  LLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSK-CEGANAGYAIFREV--FGLDCDIEEENV---------------------KLKARVS

Query:  PNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREME
        P+V +++T++  + + G + +V ++ + +       NSY Y  ++ +LC   ++ EAE+ + EM  + +  D V Y T+I GFCK G+I+ A +FF EM 
Subjt:  PNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREME

Query:  LSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFK
           I     T+  +I+G+C+ GD+     ++ +M  K    D+ T   +I   C    + +A  V     + +   P + TY  LI GLC+EG +++A +
Subjt:  LSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFK

Query:  LQAQMVGKGFKPNSKIYCSFMDAYTKEGN-EEMVEKLGK
        L  +M   G +PN   Y S ++   K GN EE V+ +G+
Subjt:  LQAQMVGKGFKPNSKIYCSFMDAYTKEGN-EEMVEKLGK

AT2G15980.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-12747.31Show/hide
Query:  MSIPLLKRTLWPIRNPSFKHPFSPSFFS--SSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQN
        MS  +L+R L P R P      S S  +  SSP   PSP + P +S  VS+LTHHRSKSRW  L SL P GF P  FS+I L ++NNPHL+LRFFL+T+ 
Subjt:  MSIPLLKRTLWPIRNPSFKHPFSPSFFS--SSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQN

Query:  KSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRM
         SLC+H+  S ST+IHIL+R RL++HA  +I+ A+R +  ++ +D          R LK+F +L+K+Y RCGSAPFVFDLLIKS LDSK+++ A+ ++R 
Subjt:  KSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRM

Query:  LRSRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTES-NSIQNSYSYIIL
        LRSRGI+ QI T N+LI  VS+  GA+ GY ++REVFGLD    +E  K+  ++ PN  TFN++MV FY++G    V+ IW ++ E      N YSY +L
Subjt:  LRSRGISPQIGTLNSLILSVSKCEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTES-NSIQNSYSYIIL

Query:  MAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDAS
        M   C    M EAEK+W+EM+++ + +D VAYNT+IGG C    + +A+E FR+M L GIE T  T+EHL+NGYC+ GDVDS L+VY++M+RK F  D  
Subjt:  MAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDAS

Query:  TLEAVIRALCAK---TRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVL
        T+EA++  LC      R++EA D+   A  ++ F P+   YELL+  LC++G ++ A  +QA+MVGKGFKP+ + Y +F+D Y   G+EE    L  E+ 
Subjt:  TLEAVIRALCAK---TRLLEALDVFGLATEDSNFCPTMETYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVL

Query:  E
        E
Subjt:  E

AT4G26680.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-4225.38Show/hide
Query:  RPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAV
        R +P  K      V+V   H  +S W  LN L  D  D     +++L+I+ +  L+L FF W + ++  +H+L +++ V+H L + R    A+++++  +
Subjt:  RPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAV

Query:  RASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSKCEGANAGYAIFRE
            +  G D           P K+F+ L+ +Y+ C S P VFD L K+    KK  +A      ++  G  P + + N+ + S+      +     +RE
Subjt:  RASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSKCEGANAGYAIFRE

Query:  VFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTII
        +              + ++SPN +T N +M  + + G + +  E+   +          SY  L+A  CE+  +  A KL   M    L+ + V +NT+I
Subjt:  VFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTII

Query:  GGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTME
         GFC+A  +Q A + F EM+   +     T+  LINGY + GD +     Y+DM       D  T  A+I  LC + +  +A   F    +  N  P   
Subjt:  GGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTME

Query:  TYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVL
        T+  LI G C     +  F+L   M+  G  PN + +   + A+ +  + +   ++ +E++
Subjt:  TYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVL

AT4G26680.2 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-4225.38Show/hide
Query:  RPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAV
        R +P  K      V+V   H  +S W  LN L  D  D     +++L+I+ +  L+L FF W + ++  +H+L +++ V+H L + R    A+++++  +
Subjt:  RPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYSTVIHILARGRLRTHAKAVIQTAV

Query:  RASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSKCEGANAGYAIFRE
            +  G D           P K+F+ L+ +Y+ C S P VFD L K+    KK  +A      ++  G  P + + N+ + S+      +     +RE
Subjt:  RASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSKCEGANAGYAIFRE

Query:  VFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTII
        +              + ++SPN +T N +M  + + G + +  E+   +          SY  L+A  CE+  +  A KL   M    L+ + V +NT+I
Subjt:  VFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYNTII

Query:  GGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTME
         GFC+A  +Q A + F EM+   +     T+  LINGY + GD +     Y+DM       D  T  A+I  LC + +  +A   F    +  N  P   
Subjt:  GGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTME

Query:  TYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVL
        T+  LI G C     +  F+L   M+  G  PN + +   + A+ +  + +   ++ +E++
Subjt:  TYELLIHGLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCATTCCGCTGCTTAAACGAACCCTCTGGCCGATCCGAAACCCAAGCTTTAAGCACCCATTTTCCCCTTCCTTCTTCTCATCCTCACCGGCTGGCCGACCTTCGCC
GTCGACAAAACCCTCACTCTCGACCGTGGTTTCAGTTCTCACTCATCACCGCTCAAAATCTCGCTGGCGATTCCTCAACTCCCTCTGTCCCGACGGCTTCGATCCCGGCG
ACTTTTCCGATATCGTTCTCCAAATCAAGAACAATCCCCATCTCGCCCTCCGTTTCTTCCTCTGGACTCAGAACAAGTCCCTCTGCAATCACAATCTCGTTTCTTACTCT
ACCGTCATCCACATCCTTGCCCGCGGCCGGCTCAGAACTCACGCGAAGGCGGTTATTCAGACCGCCGTTAGGGCTTCAGAGCTTGAAGATGGTGACGATTGTTCCAAATG
TGAGCGGTTTTCGTCTTCGAGGCCTTTGAAGCTGTTTGAAACCCTCGTGAAGACGTATAAACGGTGTGGCTCTGCTCCCTTTGTGTTTGATTTATTGATTAAATCCCTTT
TAGATTCTAAAAAGCTCGAATCGGCTATTCAAATTGTTAGAATGTTGCGGTCTCGTGGGATTAGCCCACAAATTGGTACGTTGAATTCATTGATTTTGTCCGTGTCGAAG
TGTGAGGGGGCTAATGCAGGTTATGCAATTTTTAGAGAGGTTTTTGGCTTAGATTGTGACATTGAGGAAGAAAATGTGAAATTGAAGGCTCGGGTTAGTCCTAATGTGCA
TACTTTTAATACATTAATGGTGTGTTTTTATCAAGATGGGTTGGTGGGGCAGGTGAAGGAGATATGGGATCAATTAACCGAGTCAAATTCGATTCAAAACAGTTACAGTT
ATATTATTCTAATGGCGGTTTTATGTGAAGAGAAAAGAATGGTTGAAGCAGAGAAGTTGTGGAAAGAAATGAGAATGAAGAAGTTGGAGTTTGATGCTGTAGCTTACAAT
ACTATAATTGGAGGGTTTTGTAAAGCAGGAAATATTCAAAGGGCTGAAGAATTTTTCAGAGAAATGGAACTGAGTGGAATAGAGAGTACTTACTCCACCTTTGAGCATCT
CATCAATGGCTATTGTGAGACTGGAGATGTTGACTCTGTATTACTTGTGTATAAGGATATGCGCAGGAAACATTTTAATCTCGACGCATCTACGTTGGAAGCAGTTATTC
GAGCATTGTGTGCCAAGACTAGGCTCTTAGAAGCTTTAGACGTTTTCGGTTTAGCTACAGAAGATTCTAACTTTTGCCCGACAATGGAAACTTACGAACTTCTGATACAT
GGTTTGTGTCAGGAAGGGACAATTGAAGCTGCATTTAAGCTTCAGGCACAGATGGTAGGGAAAGGTTTTAAGCCAAATTCAAAGATTTACTGTTCTTTTATGGATGCCTA
TACGAAAGAAGGAAATGAAGAAATGGTAGAAAAGTTGGGGAAGGAAGTACTTGAAATCCAGTTGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATCTATGTACCTCCTTGGAATTCAATGTCCATTCCGCTGCTTAAACGAACCCTCTGGCCGATCCGAAACCCAAGCTTTAAGCACCCATTTTCCCCTTCCTTCTTCTCATC
CTCACCGGCTGGCCGACCTTCGCCGTCGACAAAACCCTCACTCTCGACCGTGGTTTCAGTTCTCACTCATCACCGCTCAAAATCTCGCTGGCGATTCCTCAACTCCCTCT
GTCCCGACGGCTTCGATCCCGGCGACTTTTCCGATATCGTTCTCCAAATCAAGAACAATCCCCATCTCGCCCTCCGTTTCTTCCTCTGGACTCAGAACAAGTCCCTCTGC
AATCACAATCTCGTTTCTTACTCTACCGTCATCCACATCCTTGCCCGCGGCCGGCTCAGAACTCACGCGAAGGCGGTTATTCAGACCGCCGTTAGGGCTTCAGAGCTTGA
AGATGGTGACGATTGTTCCAAATGTGAGCGGTTTTCGTCTTCGAGGCCTTTGAAGCTGTTTGAAACCCTCGTGAAGACGTATAAACGGTGTGGCTCTGCTCCCTTTGTGT
TTGATTTATTGATTAAATCCCTTTTAGATTCTAAAAAGCTCGAATCGGCTATTCAAATTGTTAGAATGTTGCGGTCTCGTGGGATTAGCCCACAAATTGGTACGTTGAAT
TCATTGATTTTGTCCGTGTCGAAGTGTGAGGGGGCTAATGCAGGTTATGCAATTTTTAGAGAGGTTTTTGGCTTAGATTGTGACATTGAGGAAGAAAATGTGAAATTGAA
GGCTCGGGTTAGTCCTAATGTGCATACTTTTAATACATTAATGGTGTGTTTTTATCAAGATGGGTTGGTGGGGCAGGTGAAGGAGATATGGGATCAATTAACCGAGTCAA
ATTCGATTCAAAACAGTTACAGTTATATTATTCTAATGGCGGTTTTATGTGAAGAGAAAAGAATGGTTGAAGCAGAGAAGTTGTGGAAAGAAATGAGAATGAAGAAGTTG
GAGTTTGATGCTGTAGCTTACAATACTATAATTGGAGGGTTTTGTAAAGCAGGAAATATTCAAAGGGCTGAAGAATTTTTCAGAGAAATGGAACTGAGTGGAATAGAGAG
TACTTACTCCACCTTTGAGCATCTCATCAATGGCTATTGTGAGACTGGAGATGTTGACTCTGTATTACTTGTGTATAAGGATATGCGCAGGAAACATTTTAATCTCGACG
CATCTACGTTGGAAGCAGTTATTCGAGCATTGTGTGCCAAGACTAGGCTCTTAGAAGCTTTAGACGTTTTCGGTTTAGCTACAGAAGATTCTAACTTTTGCCCGACAATG
GAAACTTACGAACTTCTGATACATGGTTTGTGTCAGGAAGGGACAATTGAAGCTGCATTTAAGCTTCAGGCACAGATGGTAGGGAAAGGTTTTAAGCCAAATTCAAAGAT
TTACTGTTCTTTTATGGATGCCTATACGAAAGAAGGAAATGAAGAAATGGTAGAAAAGTTGGGGAAGGAAGTACTTGAAATCCAGTTGAGTTGAGACGGGAATTGAATCA
CACTGTATTGTACATTGTCATCTGGTTGCAGTATTCCAGCTGATCGGAAAGAAGTTTCTGTAAACACTAGATGTGAGTTGAACTTATTTCTGATACATTCTTGAAAATGC
ATTTCTTGTATCATTACGATCATCATTTTTGAGCTTAGATTAGTTTGTAAATTCATTTTTGTTCTTGGAAATGAGTTAAAGCATAAGGGTAATCCACATTTTGTGGTGGT
TGCATTAGGTTTCTTCATCAATCTGAAGCTTGAGGGCCCAAAATCATTGTTAGTAAATCGGAC
Protein sequenceShow/hide protein sequence
MSIPLLKRTLWPIRNPSFKHPFSPSFFSSSPAGRPSPSTKPSLSTVVSVLTHHRSKSRWRFLNSLCPDGFDPGDFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLVSYS
TVIHILARGRLRTHAKAVIQTAVRASELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKSLLDSKKLESAIQIVRMLRSRGISPQIGTLNSLILSVSK
CEGANAGYAIFREVFGLDCDIEEENVKLKARVSPNVHTFNTLMVCFYQDGLVGQVKEIWDQLTESNSIQNSYSYIILMAVLCEEKRMVEAEKLWKEMRMKKLEFDAVAYN
TIIGGFCKAGNIQRAEEFFREMELSGIESTYSTFEHLINGYCETGDVDSVLLVYKDMRRKHFNLDASTLEAVIRALCAKTRLLEALDVFGLATEDSNFCPTMETYELLIH
GLCQEGTIEAAFKLQAQMVGKGFKPNSKIYCSFMDAYTKEGNEEMVEKLGKEVLEIQLS