; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0962 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0962
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationMC02:7721127..7723716
RNA-Seq ExpressionMC02g0962
SyntenyMC02g0962
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571131.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]6.33e-29681.16Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS
        MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS

Query:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR
        LC H+LVSYSTVIHILARGRLRTHAK VIQTAIRA  LED DGCS C++FS  RPL+LF+TLVKTYK+CGSAPFVFDLLIKALLDS+KL+PAIQI+RMLR
Subjt:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR

Query:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV
        SRGISPQ+ TLNSLILW+SKCEGANAGYA+FREVFGL+CE++E+ VK+KA+ SPNVH+FNTLM+CFYQDGLVGRVKEIWDQL +SNSIPNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV

Query:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE
         C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+I+RAEE FREMEL G ESTFSTFEHLINGYCE+GD+DSALLVYKDMRRK+FS+N   LE
Subjt:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE

Query:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
        A+ RGL AETRLLEALDVFGF TED+N CPTMETYELLINGLC++G++EAAFKLQ+QMVGKGFKP+SK+Y+SFIDAY+ EGNEEMV+KLR+E+LEIQLS
Subjt:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS

KAG7010942.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]8.99e-29681.16Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS
        MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS

Query:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR
        LC H+LVSYSTVIHILARGRLRTHAK VIQTAIRA  LED DGCS C++FS  RPL+LF+TLVKTYK+CGSAPFVFDLLIKALLDS+KL+PAIQI+RMLR
Subjt:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR

Query:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV
        SRGISPQ+ TLNSLILW+SKCEGANAGYA+FREVFGL+CE++E+ VK+KA+ SPNVH+FNTLM+CFYQDGLVGRVKEIWDQL +SNSIPNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV

Query:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE
         C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+I+RAEE FREMEL G ESTFSTFEHLINGYCE+GD+DSALLVYKDMRRK+FS+N   LE
Subjt:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE

Query:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
        A+ RGL AETRLLEALDVFGF TED+N CPTMETYELLINGLC++G++EAAFKLQ QMVGKGFKP+SK+Y+SFIDAY+ EGNEEMV+KLR+E+LEIQLS
Subjt:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS

XP_022148504.1 pentatricopeptide repeat-containing protein At2g15980 [Momordica charantia]0.0100Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNL
        MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNL
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNL

Query:  VSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQV
        VSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQV
Subjt:  VSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQV

Query:  STLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV
        STLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV
Subjt:  STLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV

Query:  EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVA
        EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVA
Subjt:  EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVA

Query:  ETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
        ETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
Subjt:  ETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS

XP_022944388.1 pentatricopeptide repeat-containing protein At2g15980 [Cucurbita moschata]3.00e-29481.16Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS
        MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS

Query:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR
        LC H+LVSYSTVIHILARGRLRT AK VIQTAIRA  LED D CS C++FS  RPL+LF+TLVKTYK+CGSAPFVFDLLIKALLDS+KL+PAIQI+RMLR
Subjt:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR

Query:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV
        SRGISPQ+ TLNSLILW+SKCEGANAGYA+FREVFGL+CE++E+ VK+KA+ SPNVH+FNTLM+CFYQDGLVGR KEIWDQL +SNSIPNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV

Query:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE
         C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+I+RAEE FREMEL G ESTFSTFEHLINGYCETGD+DSALLVYKDMRRK+FSLN   LE
Subjt:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE

Query:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
        A+ RGL AETRLLEALD+FGF TED+N CPTMETYELLINGLC+EG++EAAFKLQAQMVGKGFKP+SK+Y+SFIDAY+ EGNEEMV+KLR+E+LEIQLS
Subjt:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS

XP_022986508.1 pentatricopeptide repeat-containing protein At2g15980 [Cucurbita maxima]4.25e-29481.36Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS
        MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS

Query:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR
        LC H+LVSYSTVIHILARGRLRTHAK VIQ AIRA  LEDDD CS C++FS  RPL+LF+TLVKTYK+CGSAPFVFDLLIKALLDS+KL+PAIQI+RMLR
Subjt:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR

Query:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV
        SRGISPQ+ TLNSLIL +SKCEGANAGYA+FREVFGL+CE++EE VK+KA+ASPNVH+FNTLM+CFYQDGLVGRVKEIWDQL +SNSIPNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV

Query:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE
         C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+++RAEE FREMEL G ESTFSTFEHLINGYCETGD+DSALLVYKDMRRK+FSLN   LE
Subjt:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE

Query:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
        AI RGL  ETRLLEALDVFGF TE +NFCPTMETYELLINGLC++G++EAAFKLQAQMVGKGFKP+SK+Y+SFIDAY+ EGNEEMV+KL +E+LEIQLS
Subjt:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS

TrEMBL top hitse value%identityAlignment
A0A5A7TQU6 Pentatricopeptide repeat-containing protein4.21e-28078.96Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSP------SAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS
        MS PLLKRTL  I N    L FS SF SSSP      S KPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDIVL IKNNPHL+LRFFLWTQNKS
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSP------SAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS

Query:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR
        LC HNL+SYST+IHILARGRLRTHAK VIQTAIRAAELED D  S  ++FS  RPL+LF+TLVKTYKRCGSAPFVFDLLIKALLDS+KL+ +I+I+RMLR
Subjt:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR

Query:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV
        SRGISPQVSTLNSLIL VSKC+GAN  YAIF EVFGLDCE+++E VK+K + SPNVH+FNTLM CFYQDG VGRVKEIWDQL +SNSIPNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV

Query:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE
         C+++RM EAEELW+EM++KKLELD VAYNTIIGGFCKAG+ QRAEE +REMELSGIESTFST EHLINGYC+TGD+DSALLVYKDMRRK FSLNASTLE
Subjt:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE

Query:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
         ++  L AE RLLEALDVFGF  EDS+FCPTMET+E+LIN LC+EG+IE AFKLQAQMVGKGFKP+ K+Y+SFIDAY  EGN EMVEKL KE+ EIQLS
Subjt:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS

A0A5D3CQ25 Pentatricopeptide repeat-containing protein4.21e-28078.96Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSP------SAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS
        MS PLLKRTL  I N    L FS SF SSSP      S KPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDIVL IKNNPHL+LRFFLWTQNKS
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSP------SAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS

Query:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR
        LC HNL+SYST+IHILARGRLRTHAK VIQTAIRAAELED D  S  ++FS  RPL+LF+TLVKTYKRCGSAPFVFDLLIKALLDS+KL+ +I+I+RMLR
Subjt:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR

Query:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV
        SRGISPQVSTLNSLIL VSKC+GAN  YAIF EVFGLDCE+++E VK+K + SPNVH+FNTLM CFYQDG VGRVKEIWDQL +SNSIPNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV

Query:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE
         C+++RM EAEELW+EM++KKLELD VAYNTIIGGFCKAG+ QRAEE +REMELSGIESTFST EHLINGYC+TGD+DSALLVYKDMRRK FSLNASTLE
Subjt:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE

Query:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
         ++  L AE RLLEALDVFGF  EDS+FCPTMET+E+LIN LC+EG+IE AFKLQAQMVGKGFKP+ K+Y+SFIDAY  EGN EMVEKL KE+ EIQLS
Subjt:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS

A0A6J1D472 pentatricopeptide repeat-containing protein At2g159800.0100Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNL
        MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNL
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNL

Query:  VSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQV
        VSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQV
Subjt:  VSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQV

Query:  STLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV
        STLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV
Subjt:  STLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV

Query:  EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVA
        EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVA
Subjt:  EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVA

Query:  ETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
        ETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
Subjt:  ETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS

A0A6J1FY05 pentatricopeptide repeat-containing protein At2g159801.45e-29481.16Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS
        MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS

Query:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR
        LC H+LVSYSTVIHILARGRLRT AK VIQTAIRA  LED D CS C++FS  RPL+LF+TLVKTYK+CGSAPFVFDLLIKALLDS+KL+PAIQI+RMLR
Subjt:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR

Query:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV
        SRGISPQ+ TLNSLILW+SKCEGANAGYA+FREVFGL+CE++E+ VK+KA+ SPNVH+FNTLM+CFYQDGLVGR KEIWDQL +SNSIPNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV

Query:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE
         C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+I+RAEE FREMEL G ESTFSTFEHLINGYCETGD+DSALLVYKDMRRK+FSLN   LE
Subjt:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE

Query:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
        A+ RGL AETRLLEALD+FGF TED+N CPTMETYELLINGLC+EG++EAAFKLQAQMVGKGFKP+SK+Y+SFIDAY+ EGNEEMV+KLR+E+LEIQLS
Subjt:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS

A0A6J1JGQ1 pentatricopeptide repeat-containing protein At2g159802.06e-29481.36Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS
        MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KS
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKS

Query:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR
        LC H+LVSYSTVIHILARGRLRTHAK VIQ AIRA  LEDDD CS C++FS  RPL+LF+TLVKTYK+CGSAPFVFDLLIKALLDS+KL+PAIQI+RMLR
Subjt:  LCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFS--RPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR

Query:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV
        SRGISPQ+ TLNSLIL +SKCEGANAGYA+FREVFGL+CE++EE VK+KA+ASPNVH+FNTLM+CFYQDGLVGRVKEIWDQL +SNSIPNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV

Query:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE
         C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+++RAEE FREMEL G ESTFSTFEHLINGYCETGD+DSALLVYKDMRRK+FSLN   LE
Subjt:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE

Query:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
        AI RGL  ETRLLEALDVFGF TE +NFCPTMETYELLINGLC++G++EAAFKLQAQMVGKGFKP+SK+Y+SFIDAY+ EGNEEMV+KL +E+LEIQLS
Subjt:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS

SwissProt top hitse value%identityAlignment
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial2.2e-3726.58Show/hide
Query:  IVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLL
        +++ IK +  L L FF W +++     NL S   VIH+    +    A+++I +     +L   D           ++ F  LV TYK  GS P VFD+ 
Subjt:  IVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLL

Query:  IKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSK-CEGANAGYAIFRE---------------VFGLDCE---VKEER-----VKMKAQASPN
         + L+D   L  A ++   + + G+   V + N  +  +SK C        +FRE               V    C+   +KE       +++K   +P+
Subjt:  IKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSK-CEGANAGYAIFRE---------------VFGLDCE---VKEER-----VKMKAQASPN

Query:  VHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAE---------
        V S++T++  + + G + +V ++ + +      PNSY Y  ++ + C   ++ EAEE + EM  + +  D V Y T+I GFCK G I+ A          
Subjt:  VHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAE---------

Query:  --------------------------ELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG
                                  +LF EM   G+E    TF  LINGYC+ G +  A  V+  M +   S N  T   ++ GL  E  L  A ++  
Subjt:  --------------------------ELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG

Query:  FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL
                 P + TY  ++NGLC+ G IE A KL  +    G   D+  Y + +DAY   G  +  +++ KE+L
Subjt:  FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL

Q9SH26 Pentatricopeptide repeat-containing protein At1g634002.9e-3728.66Show/hide
Query:  VFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGR
        ++  +I +L   R  + A+ +   + ++G+ P V T +SLI            Y  + +   L  ++ E ++      +PNV +FN L+  F ++G +  
Subjt:  VFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGR

Query:  VKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCET
         ++++D++ + +  P+ ++YS L+  FC   R+ EA+ +++ M  K    + V YNT+I GFCKA  I    ELFREM   G+     T+  LI+G+ + 
Subjt:  VKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCET

Query:  GDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFI
         D D+A +V+K M       N  T   ++ GL    +L +A+ VF +  + S   PT+ TY ++I G+C+ G++E  + L   +  KG KPD  +Y + I
Subjt:  GDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFI

Query:  DAYTNEGNEEMVEKLRKELLE
          +  +G +E  + L +++ E
Subjt:  DAYTNEGNEEMVEKLRKELLE

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial1.7e-3727.09Show/hide
Query:  NLVSYSTVIHILARGRLRTHAKAVIQTAIRAA---ELEDDDGCSN--CKQFSRPLRLFQTLVKTYK-RCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR
        N V+++T+IH L      + A A+I   +      +L       N  CK+    L  F  L K  + +      +++ +I  L   + ++ A+ + + + 
Subjt:  NLVSYSTVIHILARGRLRTHAKAVIQTAIRAA---ELEDDDGCSN--CKQFSRPLRLFQTLVKTYK-RCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR

Query:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV
        ++GI P V T +SLI            Y  + +   L  ++ E ++      +P+V +F+ L+  F ++G +   ++++D++ + +  P+  +YS L+  
Subjt:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV

Query:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE
        FC   R+ EA+++++ M  K    D V YNT+I GFCK   ++   E+FREM   G+     T+  LI G  + GD D A  ++K+M       N  T  
Subjt:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE

Query:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLE
         ++ GL    +L +A+ VF +  + S   PT+ TY ++I G+C+ G++E  + L   +  KG KPD   Y + I  +  +G++E  + L KE+ E
Subjt:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLE

Q9SZ10 Pentatricopeptide repeat-containing protein At4g26680, mitochondrial1.8e-3923.85Show/hide
Query:  SPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRA
        +P  K      V+V   H  +S W  LN L  D  D     +++L I+ +  LSL FF W + ++  +H+L +++ V+H L + R    A+++++  +  
Subjt:  SPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRA

Query:  AELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGL
          ++             P ++F  L+ +Y+ C S P VFD L K     +K   A      ++  G  P V + N+   ++S   G             +
Subjt:  AELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGL

Query:  DCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFC
        D  ++  R   + + SPN ++ N +M  + + G + +  E+   +          SY+ L+   C++  +  A +L   M    L+ + V +NT+I GFC
Subjt:  DCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFC

Query:  KAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYEL
        +A  +Q A ++F EM+   +     T+  LINGY + GD + A   Y+DM       +  T  A++ GL  + +  +A   F    +  N  P   T+  
Subjt:  KAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYEL

Query:  LINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL
        LI G C     +  F+L   M+  G  P+ + +   + A+    + +   ++ +E++
Subjt:  LINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL

Q9XIM8 Pentatricopeptide repeat-containing protein At2g159802.3e-12746.98Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSF-----SSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSL
        MS  +L+R L   R P      S S      S  SP + P IS  VS+LTHHRSKSRW  L SL P GF P +FS+I L ++NNPHLSLRFFL+T+  SL
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSF-----SSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSL

Query:  CTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRG
        C+H+  S ST+IHIL+R RL++HA  +I+ A+R A  ++D+         R L++F++L+K+Y RCGSAPFVFDLLIK+ LDS++++ A+ ++R LRSRG
Subjt:  CTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRG

Query:  ISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTES-NSIPNSYSYSILMTVFC
        I+ Q+ST N+LI  VS+  GA+ GY ++REVFGLD    +E  KM  +  PN  +FN++M+ FY++G    V+ IW ++ E     PN YSY++LM  +C
Subjt:  ISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTES-NSIPNSYSYSILMTVFC

Query:  DQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAI
         +  M EAE++W+EM+++ +  D VAYNT+IGG C    + +A+ELFR+M L GIE T  T+EHL+NGYC+ GD+DS L+VY++M+RK F  +  T+EA+
Subjt:  DQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAI

Query:  VRGLVAE---TRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLE
        V GL  +    R++EA D+      ++ F P+   YELL+  LC +G+++ A  +QA+MVGKGFKP  + YR+FID Y   G+EE    L  E+ E
Subjt:  VRGLVAE---TRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLE

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.6e-3826.58Show/hide
Query:  IVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLL
        +++ IK +  L L FF W +++     NL S   VIH+    +    A+++I +     +L   D           ++ F  LV TYK  GS P VFD+ 
Subjt:  IVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLL

Query:  IKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSK-CEGANAGYAIFRE---------------VFGLDCE---VKEER-----VKMKAQASPN
         + L+D   L  A ++   + + G+   V + N  +  +SK C        +FRE               V    C+   +KE       +++K   +P+
Subjt:  IKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSK-CEGANAGYAIFRE---------------VFGLDCE---VKEER-----VKMKAQASPN

Query:  VHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAE---------
        V S++T++  + + G + +V ++ + +      PNSY Y  ++ + C   ++ EAEE + EM  + +  D V Y T+I GFCK G I+ A          
Subjt:  VHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAE---------

Query:  --------------------------ELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG
                                  +LF EM   G+E    TF  LINGYC+ G +  A  V+  M +   S N  T   ++ GL  E  L  A ++  
Subjt:  --------------------------ELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG

Query:  FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL
                 P + TY  ++NGLC+ G IE A KL  +    G   D+  Y + +DAY   G  +  +++ KE+L
Subjt:  FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL

AT1G62670.1 rna processing factor 21.2e-3827.09Show/hide
Query:  NLVSYSTVIHILARGRLRTHAKAVIQTAIRAA---ELEDDDGCSN--CKQFSRPLRLFQTLVKTYK-RCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR
        N V+++T+IH L      + A A+I   +      +L       N  CK+    L  F  L K  + +      +++ +I  L   + ++ A+ + + + 
Subjt:  NLVSYSTVIHILARGRLRTHAKAVIQTAIRAA---ELEDDDGCSN--CKQFSRPLRLFQTLVKTYK-RCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLR

Query:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV
        ++GI P V T +SLI            Y  + +   L  ++ E ++      +P+V +F+ L+  F ++G +   ++++D++ + +  P+  +YS L+  
Subjt:  SRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV

Query:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE
        FC   R+ EA+++++ M  K    D V YNT+I GFCK   ++   E+FREM   G+     T+  LI G  + GD D A  ++K+M       N  T  
Subjt:  FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE

Query:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLE
         ++ GL    +L +A+ VF +  + S   PT+ TY ++I G+C+ G++E  + L   +  KG KPD   Y + I  +  +G++E  + L KE+ E
Subjt:  AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLE

AT2G15980.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-12846.98Show/hide
Query:  MSIPLLKRTLSSIRNPGFKLPFSPSF-----SSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSL
        MS  +L+R L   R P      S S      S  SP + P IS  VS+LTHHRSKSRW  L SL P GF P +FS+I L ++NNPHLSLRFFL+T+  SL
Subjt:  MSIPLLKRTLSSIRNPGFKLPFSPSF-----SSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSL

Query:  CTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRG
        C+H+  S ST+IHIL+R RL++HA  +I+ A+R A  ++D+         R L++F++L+K+Y RCGSAPFVFDLLIK+ LDS++++ A+ ++R LRSRG
Subjt:  CTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRG

Query:  ISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTES-NSIPNSYSYSILMTVFC
        I+ Q+ST N+LI  VS+  GA+ GY ++REVFGLD    +E  KM  +  PN  +FN++M+ FY++G    V+ IW ++ E     PN YSY++LM  +C
Subjt:  ISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTES-NSIPNSYSYSILMTVFC

Query:  DQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAI
         +  M EAE++W+EM+++ +  D VAYNT+IGG C    + +A+ELFR+M L GIE T  T+EHL+NGYC+ GD+DS L+VY++M+RK F  +  T+EA+
Subjt:  DQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAI

Query:  VRGLVAE---TRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLE
        V GL  +    R++EA D+      ++ F P+   YELL+  LC +G+++ A  +QA+MVGKGFKP  + YR+FID Y   G+EE    L  E+ E
Subjt:  VRGLVAE---TRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLE

AT4G26680.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-4023.85Show/hide
Query:  SPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRA
        +P  K      V+V   H  +S W  LN L  D  D     +++L I+ +  LSL FF W + ++  +H+L +++ V+H L + R    A+++++  +  
Subjt:  SPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRA

Query:  AELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGL
          ++             P ++F  L+ +Y+ C S P VFD L K     +K   A      ++  G  P V + N+   ++S   G             +
Subjt:  AELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGL

Query:  DCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFC
        D  ++  R   + + SPN ++ N +M  + + G + +  E+   +          SY+ L+   C++  +  A +L   M    L+ + V +NT+I GFC
Subjt:  DCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFC

Query:  KAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYEL
        +A  +Q A ++F EM+   +     T+  LINGY + GD + A   Y+DM       +  T  A++ GL  + +  +A   F    +  N  P   T+  
Subjt:  KAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYEL

Query:  LINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL
        LI G C     +  F+L   M+  G  P+ + +   + A+    + +   ++ +E++
Subjt:  LINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL

AT4G26680.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-4023.85Show/hide
Query:  SPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRA
        +P  K      V+V   H  +S W  LN L  D  D     +++L I+ +  LSL FF W + ++  +H+L +++ V+H L + R    A+++++  +  
Subjt:  SPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRA

Query:  AELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGL
          ++             P ++F  L+ +Y+ C S P VFD L K     +K   A      ++  G  P V + N+   ++S   G             +
Subjt:  AELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGL

Query:  DCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFC
        D  ++  R   + + SPN ++ N +M  + + G + +  E+   +          SY+ L+   C++  +  A +L   M    L+ + V +NT+I GFC
Subjt:  DCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFC

Query:  KAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYEL
        +A  +Q A ++F EM+   +     T+  LINGY + GD + A   Y+DM       +  T  A++ GL  + +  +A   F    +  N  P   T+  
Subjt:  KAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYEL

Query:  LINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL
        LI G C     +  F+L   M+  G  P+ + +   + A+    + +   ++ +E++
Subjt:  LINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCATCCCTCTGCTGAAACGAACCCTGTCGTCAATCCGAAACCCAGGATTTAAGCTCCCATTTTCCCCTTCATTCTCCTCTTCCTCGCCGTCGGCAAAACCCTCGAT
CTCGACCGTGGTTTCAGTTCTCACTCACCACCGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGCCCCGACGGCTTCGATCCCGGCGAGTTTTCCGATATCGTCC
TCCACATCAAGAACAATCCCCATCTCTCCCTTCGATTCTTTCTCTGGACTCAGAACAAGTCCCTCTGCACTCACAATCTCGTTTCCTACTCAACCGTCATCCACATCCTT
GCCCGCGGTCGCCTCAGAACTCACGCCAAGGCCGTTATTCAGACCGCCATTAGGGCTGCGGAGCTCGAAGACGACGATGGCTGTTCCAATTGTAAGCAATTTTCTAGGCC
TTTGAGGCTGTTTCAGACTCTCGTCAAGACGTATAAACGGTGTGGTTCTGCTCCCTTCGTGTTTGATTTATTGATTAAAGCTCTCCTGGATTCTAGAAAGCTCGAGCCGG
CCATTCAAATTATTAGAATGTTAAGGTCTCGTGGGATTAGCCCGCAGGTTAGTACATTGAATTCGTTGATTTTGTGGGTGTCGAAGTGCGAGGGGGCTAACGCGGGTTAT
GCTATTTTTAGAGAGGTTTTTGGTTTAGATTGTGAAGTCAAGGAAGAACGTGTGAAAATGAAGGCTCAGGCTAGTCCCAATGTACATTCTTTTAATACATTAATGATGTG
TTTTTATCAAGATGGATTGGTGGGGCGGGTGAAGGAGATATGGGATCAATTAACCGAGTCAAATTCGATTCCGAACAGTTACAGTTATAGTATTCTAATGACGGTTTTCT
GTGATCAAAGAAGAATGGTTGAAGCAGAGGAGTTGTGGAAAGAAATGAGATTGAAGAAGTTGGAGCTTGATGCTGTAGCTTATAACACTATAATTGGAGGGTTTTGTAAA
GCAGGAAGTATTCAGAGAGCTGAAGAGCTTTTCAGAGAAATGGAACTGAGTGGAATAGAGAGTACTTTCTCCACCTTTGAGCATCTCATCAATGGCTATTGTGAAACTGG
AGATATTGACTCTGCATTACTGGTGTATAAGGATATGCGGAGGAAAAATTTTAGTCTGAACGCATCGACGTTGGAAGCGATTGTTAGAGGATTGGTTGCCGAGACTAGAC
TTTTAGAAGCTTTAGATGTTTTTGGGTTCACCACAGAGGACTCCAACTTTTGCCCAACAATGGAAACTTACGAACTTCTGATAAATGGTTTGTGTCGGGAAGGGGAAATT
GAAGCTGCATTTAAGCTTCAGGCGCAGATGGTAGGGAAAGGCTTTAAGCCAGATTCGAAGGTTTACCGTTCTTTTATCGATGCTTATACGAACGAAGGAAATGAAGAAAT
GGTCGAGAAGTTGAGGAAGGAATTACTTGAAATCCAGCTGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCATCCCTCTGCTGAAACGAACCCTGTCGTCAATCCGAAACCCAGGATTTAAGCTCCCATTTTCCCCTTCATTCTCCTCTTCCTCGCCGTCGGCAAAACCCTCGAT
CTCGACCGTGGTTTCAGTTCTCACTCACCACCGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGCCCCGACGGCTTCGATCCCGGCGAGTTTTCCGATATCGTCC
TCCACATCAAGAACAATCCCCATCTCTCCCTTCGATTCTTTCTCTGGACTCAGAACAAGTCCCTCTGCACTCACAATCTCGTTTCCTACTCAACCGTCATCCACATCCTT
GCCCGCGGTCGCCTCAGAACTCACGCCAAGGCCGTTATTCAGACCGCCATTAGGGCTGCGGAGCTCGAAGACGACGATGGCTGTTCCAATTGTAAGCAATTTTCTAGGCC
TTTGAGGCTGTTTCAGACTCTCGTCAAGACGTATAAACGGTGTGGTTCTGCTCCCTTCGTGTTTGATTTATTGATTAAAGCTCTCCTGGATTCTAGAAAGCTCGAGCCGG
CCATTCAAATTATTAGAATGTTAAGGTCTCGTGGGATTAGCCCGCAGGTTAGTACATTGAATTCGTTGATTTTGTGGGTGTCGAAGTGCGAGGGGGCTAACGCGGGTTAT
GCTATTTTTAGAGAGGTTTTTGGTTTAGATTGTGAAGTCAAGGAAGAACGTGTGAAAATGAAGGCTCAGGCTAGTCCCAATGTACATTCTTTTAATACATTAATGATGTG
TTTTTATCAAGATGGATTGGTGGGGCGGGTGAAGGAGATATGGGATCAATTAACCGAGTCAAATTCGATTCCGAACAGTTACAGTTATAGTATTCTAATGACGGTTTTCT
GTGATCAAAGAAGAATGGTTGAAGCAGAGGAGTTGTGGAAAGAAATGAGATTGAAGAAGTTGGAGCTTGATGCTGTAGCTTATAACACTATAATTGGAGGGTTTTGTAAA
GCAGGAAGTATTCAGAGAGCTGAAGAGCTTTTCAGAGAAATGGAACTGAGTGGAATAGAGAGTACTTTCTCCACCTTTGAGCATCTCATCAATGGCTATTGTGAAACTGG
AGATATTGACTCTGCATTACTGGTGTATAAGGATATGCGGAGGAAAAATTTTAGTCTGAACGCATCGACGTTGGAAGCGATTGTTAGAGGATTGGTTGCCGAGACTAGAC
TTTTAGAAGCTTTAGATGTTTTTGGGTTCACCACAGAGGACTCCAACTTTTGCCCAACAATGGAAACTTACGAACTTCTGATAAATGGTTTGTGTCGGGAAGGGGAAATT
GAAGCTGCATTTAAGCTTCAGGCGCAGATGGTAGGGAAAGGCTTTAAGCCAGATTCGAAGGTTTACCGTTCTTTTATCGATGCTTATACGAACGAAGGAAATGAAGAAAT
GGTCGAGAAGTTGAGGAAGGAATTACTTGAAATCCAGCTGAGTTGAGAAGGGGAATTGCATCCACATTGTATCTTCATCTTCTGATGGCAGTATTGCAGCTGATCAGACC
AAAGTTCCCATAAAAACTAGAGCTTCATGGCCAATATTACTGCTAGCAAACATAGGATTGATCTGAATAGTTGGACAAATAAAAATACCTATAAGGGTGGTCAAGGGGCA
TGAGCGCCCGAGCTCGAAAATTGAAAACCAGCAGCAAGGGATTGAGCTGCTATGCTTTAAATGTATTTTAAGGTGGTATTTGTTCAAAGTTCTGAGTCTAGGAAACAGGT
TCTTAAAACTTTGATGCAGAGGAGAACTGTATTCAAAACCTTATCAATTTTTGTTGTAGAATTTCAAAAGTTTGCACTTATCAACCATTAGTTGGAGTTTGTATGTTTGA
CACACTTCTTAGATTCAGCCAAACAATGGGTGAGAGCAGACAAGCTGGGTATATCTCACTCATCAACTTGGCATAGCTAACTCCTGGTTGAAGTTGAACCAAAACCACGA
CATCCCCATGCACATCATGATACACTTAAGCCCCGACATGACTCGATTCATACGAAGACACGTCTCTACCTAGTCATACCTAGCTTAGGGATCCCTCGGCTGCAGCCTCA
CATAGAAGTACTCAAAACTACAAGGGATCAGTTTGGCTAGCTAGCTTTGATTGTCGATTGTCGATTGTCGATTGTTTGAAACATTAGCATAGATTCTTAGGTCAAGTCTA
CAAGTTGAGAGAGATGGGAATCGACGATTGATAATTGTATGGAAAACATGAAAAGGGTAAGTGTATTTGATTCCCACAAAATTCATGGTTGGTAGAACCAAAAGTACTGA
ATGTGGGCATCTCCATACTTGGAGATCACTTGTATAGAATGTTCTTTCACATGTTCCTTTCAATGAAAAAAGAAAACTAAAACAAAAACTACCTTTAAAATCAGCTTAGA
GAGATAAAAAGC
Protein sequenceShow/hide protein sequence
MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHIL
ARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGY
AIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCK
AGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEI
EAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS