CuGenDBv2

HG10006432 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10006432
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr07:18540412..18542016
RNA-Seq Expression: HG10006432
Synteny: HG10006432
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022946255.1 pentatricopeptide repeat-containing protein At3g05340 isoform X2 [Cucurbita moschata] (e value: 1.4e-163; % identity: 93.53)
Query:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
        MCLVGDCKFDKATLTTILSACDGLE CCIIKM+HGL FLSGY QEITVGNALISSYFKCGCV  G QVFYEM ERNVITWTAVISGLAQNG HEHSL+LF
Subjt:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF

Query:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ
        REM+SCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK G IGDAWKIFESAEE DMVSLTVILAGFT NGCEEEAIQ
Subjt:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ

Query:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL
        IFLKMLKMGI+IDENVVSAVLGVFGADTSL LGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQ RNSVTWNSMIAAFARHGDGLKAL
Subjt:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL

Query:  QLYENMKLE
         LYENMKLE
Subjt:  QLYENMKLE

XP_022946256.1 pentatricopeptide repeat-containing protein At3g05340 isoform X3 [Cucurbita moschata] (e value: 1.4e-163; % identity: 93.53)
Query:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
        MCLVGDCKFDKATLTTILSACDGLE CCIIKM+HGL FLSGY QEITVGNALISSYFKCGCV  G QVFYEM ERNVITWTAVISGLAQNG HEHSL+LF
Subjt:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF

Query:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ
        REM+SCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK G IGDAWKIFESAEE DMVSLTVILAGFT NGCEEEAIQ
Subjt:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ

Query:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL
        IFLKMLKMGI+IDENVVSAVLGVFGADTSL LGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQ RNSVTWNSMIAAFARHGDGLKAL
Subjt:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL

Query:  QLYENMKLE
         LYENMKLE
Subjt:  QLYENMKLE

XP_022999024.1 pentatricopeptide repeat-containing protein At3g05340 [Cucurbita maxima] (e value: 1.3e-161; % identity: 92.23)
Query:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
        MCLVGDCKFDKATLTTILSACDG E CCII+M+HGL FLSGY+QEITVGNALISSYFKCGCV  G Q+FYEM ERNVITWTAVISGLAQNG HEHSL+LF
Subjt:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF

Query:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ
        REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK G IGDAWKIFESAEE DMVSLTVILAGFT NGCEEEAIQ
Subjt:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ

Query:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL
        IFLKMLKMGI+ID NVVSAVLGVFGADTSL LGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQ RNSVTWNSMIAAFARHGDG KAL
Subjt:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL

Query:  QLYENMKLE
         LYENMKLE
Subjt:  QLYENMKLE

XP_038877028.1 pentatricopeptide repeat-containing protein At3g05340 [Benincasa hispida] (e value: 6.2e-164; % identity: 92.88)
Query:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
        MCLV DCKFDKATLTTILSACDGLE CCIIKM+HGLAF SGY+Q ITVGNALISSYFKCGC+DLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
Subjt:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF

Query:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ
        REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESA+  DMVSLT++LAGFT NGCEEEAIQ
Subjt:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ

Query:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL
        IFLKMLKMGI+IDENVVS VLGVFGADTSL LGQQVHSF+VKKNFS NPFVSNGLINMYSKCGALDESVK+FDRM+ERNSVTWNSMIAAFARHGDG KAL
Subjt:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL

Query:  QLYENMKLE
        QLYENM+LE
Subjt:  QLYENMKLE

XP_038877028.1 pentatricopeptide repeat-containing protein At3g05340 [Benincasa hispida] (e value: 1.5e-05; % identity: 24.41)
Query:  LGMQVFYEMGERNVITWTAVISGL----AQNGRHEHSLKLFREMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYS
        LG QV   + ++N      V +GL    ++ G  + S+K+F  M      E NS+T+ S++ A +      +  Q++  +   G +       +L+   S
Subjt:  LGMQVFYEMGERNVITWTAVISGL----AQNGRHEHSLKLFREMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYS

Query:  KSGRIGDAWKIFESAEEFDMVS-----LTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNG
          G +    +  ES  +   ++        ++      G   EA     K+ +    +    +     ++G D+ +G     H F ++   S  P+V   
Subjt:  KSGRIGDAWKIFESAEEFDMVS-----LTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNG

Query:  LINMYSKCGALDESVKVFDRMQE-----RNSVTW---NSMIAAFARHGDGLKALQLYENMKLE------DEGYMPDKKFILYYLDDDRRDPIDNG
        L N+YS  G   E  +   +M+E        ++W   +  + +F          ++   + +E      DEGY+PDKKFIL+YLDD+RRDPIDNG
Subjt:  LINMYSKCGALDESVKVFDRMQE-----RNSVTW---NSMIAAFARHGDGLKALQLYENMKLE------DEGYMPDKKFILYYLDDDRRDPIDNG

XP_038877028.1 pentatricopeptide repeat-containing protein At3g05340 [Benincasa hispida] (e value: 1.4e-163; % identity: 93.53)
Query:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
        MCLVGDCKFDKATLTTILSACDGLE CCIIKM+HGL FLSGY QEITVGNALISSYFKCGCV  G QVFYEM ERNVITWTAVISGLAQNG HEHSL+LF
Subjt:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF

Query:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ
        REM+SCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK G IGDAWKIFESAEE DMVSLTVILAGFT NGCEEEAIQ
Subjt:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ

Query:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL
        IFLKMLKMGI+IDENVVSAVLGVFGADTSL LGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQ RNSVTWNSMIAAFARHGDGLKAL
Subjt:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL

Query:  QLYENMKLE
         LYENMKLE
Subjt:  QLYENMKLE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LGC8 DYW_deaminase domain-containing protein (e value: 8.2e-162; % identity: 92.56)
Query:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
        MCLVGDC+FDKATLTTILSACDGLEFC IIKM+HGLAFLSGY QEITVGNALISSYFKCGCV LGMQVFYEMGERNVITWTAVISGLAQNG HEHSLKLF
Subjt:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF

Query:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ
        +EMMS GSVEPNSLTYLSLLTACSGLEAL+EGCQIHGLI+KLGIQSDLCIGSALMDMYSKSGRIG+AWKIFE AEE DMVSLTVILAGFTHNGCEEEAIQ
Subjt:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ

Query:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL
        IFLKMLKMGI+ID NVVS VLGVFGADTSL LGQQVHSFVVKKNF CNPFVSNGLINMYSKCGALDES+KVFDRM+ERNSVTWNSMIAAFARHGD LKAL
Subjt:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL

Query:  QLYENMKLE
        QLYE+M+LE
Subjt:  QLYENMKLE

A0A0A0LGC8 DYW_deaminase domain-containing protein (e value: 6.3e-13; % identity: 25.94)
Query:  KFDKATLTTIL-----SACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGL----AQNGRHEHSLK
        + D  +LT IL     + C+       +KM+       G + +  V + ++  +     + LG QV   + ++N I    V +GL    ++ G  + S+K
Subjt:  KFDKATLTTIL-----SACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGL----AQNGRHEHSLK

Query:  LFREMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVS-----LTVILAGFTHNG
        +F  M      E NS+T+ S++ A +      +  Q++  +   G +       +L+   S +G +    +  +S  +   ++        ++      G
Subjt:  LFREMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVS-----LTVILAGFTHNG

Query:  CEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQE-----RNSVTW---NS
           EA     K+ +    +    +     ++G D+ +G     H F    + S  P+V   L N+YS  G   E  +   +M+E        ++W   + 
Subjt:  CEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQE-----RNSVTW---NS

Query:  MIAAFARHGDGLKALQ------LYE-NMKLEDEGYMPDKKFILYYLDDDRRDPIDNGLANRQNVKETDVVWELF
         + +F   GD +          L+E  + + DEGY+PDKKFILYYLDDDRRDPI NG A  QN  ET+VVWELF
Subjt:  MIAAFARHGDGLKALQ------LYE-NMKLEDEGYMPDKKFILYYLDDDRRDPIDNGLANRQNVKETDVVWELF

A0A6J1G350 pentatricopeptide repeat-containing protein At3g05340 isoform X3 (e value: 6.7e-164; % identity: 93.53)
Query:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
        MCLVGDCKFDKATLTTILSACDGLE CCIIKM+HGL FLSGY QEITVGNALISSYFKCGCV  G QVFYEM ERNVITWTAVISGLAQNG HEHSL+LF
Subjt:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF

Query:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ
        REM+SCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK G IGDAWKIFESAEE DMVSLTVILAGFT NGCEEEAIQ
Subjt:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ

Query:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL
        IFLKMLKMGI+IDENVVSAVLGVFGADTSL LGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQ RNSVTWNSMIAAFARHGDGLKAL
Subjt:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL

Query:  QLYENMKLE
         LYENMKLE
Subjt:  QLYENMKLE

A0A6J1G3A2 pentatricopeptide repeat-containing protein At3g05340 isoform X1 (e value: 6.7e-164; % identity: 93.53)
Query:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
        MCLVGDCKFDKATLTTILSACDGLE CCIIKM+HGL FLSGY QEITVGNALISSYFKCGCV  G QVFYEM ERNVITWTAVISGLAQNG HEHSL+LF
Subjt:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF

Query:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ
        REM+SCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK G IGDAWKIFESAEE DMVSLTVILAGFT NGCEEEAIQ
Subjt:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ

Query:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL
        IFLKMLKMGI+IDENVVSAVLGVFGADTSL LGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQ RNSVTWNSMIAAFARHGDGLKAL
Subjt:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL

Query:  QLYENMKLE
         LYENMKLE
Subjt:  QLYENMKLE

A0A6J1G3C0 pentatricopeptide repeat-containing protein At3g05340 isoform X2 (e value: 6.7e-164; % identity: 93.53)
Query:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
        MCLVGDCKFDKATLTTILSACDGLE CCIIKM+HGL FLSGY QEITVGNALISSYFKCGCV  G QVFYEM ERNVITWTAVISGLAQNG HEHSL+LF
Subjt:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF

Query:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ
        REM+SCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK G IGDAWKIFESAEE DMVSLTVILAGFT NGCEEEAIQ
Subjt:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ

Query:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL
        IFLKMLKMGI+IDENVVSAVLGVFGADTSL LGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQ RNSVTWNSMIAAFARHGDGLKAL
Subjt:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL

Query:  QLYENMKLE
         LYENMKLE
Subjt:  QLYENMKLE

A0A6J1KFW9 pentatricopeptide repeat-containing protein At3g05340 (e value: 6.3e-162; % identity: 92.23)
Query:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF
        MCLVGDCKFDKATLTTILSACDG E CCII+M+HGL FLSGY+QEITVGNALISSYFKCGCV  G Q+FYEM ERNVITWTAVISGLAQNG HEHSL+LF
Subjt:  MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLF

Query:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ
        REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK G IGDAWKIFESAEE DMVSLTVILAGFT NGCEEEAIQ
Subjt:  REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQ

Query:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL
        IFLKMLKMGI+ID NVVSAVLGVFGADTSL LGQQVHSF+VKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQ RNSVTWNSMIAAFARHGDG KAL
Subjt:  IFLKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKAL

Query:  QLYENMKLE
         LYENMKLE
Subjt:  QLYENMKLE

SwissProt top hits (e value, % identity, alignment)
P93005 Pentatricopeptide repeat-containing protein At2g33680 (e value: 9.6e-51; % identity: 34.77)
Query:  TTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSVEPNSL
        T +LS+     +  + + +H +   +G    + + NAL++ Y KC  ++   ++F   G+RN ITW+A+++G +QNG    ++KLF  M S G ++P+  
Subjt:  TTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSVEPNSL

Query:  TYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDE
        T + +L ACS +  LEEG Q+H  +LKLG +  L   +AL+DMY+K+G + DA K F+  +E D+   T +++G+  N   EEA+ ++ +M   GI  ++
Subjt:  TYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDE

Query:  NVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKLEDEGYM
          +++VL    +  +L LG+QVH   +K  F     + + L  MYSKCG+L++   VF R   ++ V+WN+MI+  + +G G +AL+L+E M    EG  
Subjt:  NVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKLEDEGYM

Query:  PD
        PD
Subjt:  PD

Q7XJN6 Pentatricopeptide repeat-containing protein At2g40720 (e value: 1.5e-48; % identity: 34.32)
Query:  DKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMM-SCGS
        D  TL+ ++S C  L      K VH   F        T+ +AL++ Y KCGC      VF  M E++++ W ++ISGL +NG+ + +LK+F +M     S
Subjt:  DKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMM-SCGS

Query:  VEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKM
        ++P+S    S+  AC+GLEAL  G Q+HG ++K G+  ++ +GS+L+D+YSK G    A K+F S    +MV+   +++ ++ N   E +I +F  ML  
Subjt:  VEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKM

Query:  GIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKL
        GI  D   +++VL    +  SL  G+ +H + ++     +  + N LI+MY KCG    +  +F +MQ ++ +TWN MI  +  HGD + AL L++ MK 
Subjt:  GIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKL

Query:  EDE
          E
Subjt:  EDE

Q9LIC3 Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial (e value: 6.9e-49; % identity: 35.27)
Query:  VHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSVEPNSLTYLSLLTACSGLEALEEG
        VH     + Y     +   L+  Y KC C++   +V  EM E+NV++WTA+IS  +Q G    +L +F EMM     +PN  T+ ++LT+C     L  G
Subjt:  VHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSVEPNSLTYLSLLTACSGLEALEEG

Query:  CQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLGL
         QIHGLI+K    S + +GS+L+DMY+K+G+I +A +IFE   E D+VS T I+AG+   G +EEA+++F ++   G+  +    +++L        L  
Subjt:  CQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLGL

Query:  GQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKLEDEGYMPDKKFILYYL
        G+Q H  V+++       + N LI+MYSKCG L  + ++FD M ER +++WN+M+  +++HG G + L+L+  M+ +++   PD   +L  L
Subjt:  GQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKLEDEGYMPDKKFILYYL

Q9LU94 Putative pentatricopeptide repeat-containing protein At3g25970 (e value: 2.0e-48; % identity: 35.88)
Query:  DKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEM-GERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGS
        D  T   +L+  D   FC ++K VH      G   EIT+ NA+ISSY  CG V    +VF  + G +++I+W ++I+G +++   E + +LF +M     
Subjt:  DKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEM-GERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGS

Query:  VEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK--SGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKML
        VE +  TY  LL+ACSG E    G  +HG+++K G++      +AL+ MY +  +G + DA  +FES +  D++S   I+ GF   G  E+A++ F  + 
Subjt:  VEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK--SGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKML

Query:  KMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNS-VTWNSMIAAFARHGDGLKALQLYEN
           IK+D+   SA+L       +L LGQQ+H+   K  F  N FV + LI MYSKCG ++ + K F ++  ++S V WN+MI  +A+HG G  +L L+  
Subjt:  KMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNS-VTWNSMIAAFARHGDGLKALQLYEN

Query:  M
        M
Subjt:  M

Q9MA85 Pentatricopeptide repeat-containing protein At3g05340 (e value: 1.3e-103; % identity: 63.82)
Query:  LVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFRE
        ++G   FD ATLT +LS CD  EFC + KM+H LA LSGYD+EI+VGN LI+SYFKCGC   G  VF  M  RNVIT TAVISGL +N  HE  L+LF  
Subjt:  LVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFRE

Query:  MMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIF
        +M  G V PNS+TYLS L ACSG + + EG QIH L+ K GI+S+LCI SALMDMYSK G I DAW IFES  E D VS+TVIL G   NG EEEAIQ F
Subjt:  MMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIF

Query:  LKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQL
        ++ML+ G++ID NVVSAVLGV   D SLGLG+Q+HS V+K+ FS N FV+NGLINMYSKCG L +S  VF RM +RN V+WNSMIAAFARHG GL AL+L
Subjt:  LKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQL

Query:  YENM
        YE M
Subjt:  YENM

Arabidopsis top hits (e value, % identity, alignment)
AT2G33680.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 6.8e-52; % identity: 34.77)
Query:  TTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSVEPNSL
        T +LS+     +  + + +H +   +G    + + NAL++ Y KC  ++   ++F   G+RN ITW+A+++G +QNG    ++KLF  M S G ++P+  
Subjt:  TTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSVEPNSL

Query:  TYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDE
        T + +L ACS +  LEEG Q+H  +LKLG +  L   +AL+DMY+K+G + DA K F+  +E D+   T +++G+  N   EEA+ ++ +M   GI  ++
Subjt:  TYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDE

Query:  NVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKLEDEGYM
          +++VL    +  +L LG+QVH   +K  F     + + L  MYSKCG+L++   VF R   ++ V+WN+MI+  + +G G +AL+L+E M    EG  
Subjt:  NVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKLEDEGYM

Query:  PD
        PD
Subjt:  PD

AT2G40720.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.1e-49; % identity: 34.32)
Query:  DKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMM-SCGS
        D  TL+ ++S C  L      K VH   F        T+ +AL++ Y KCGC      VF  M E++++ W ++ISGL +NG+ + +LK+F +M     S
Subjt:  DKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMM-SCGS

Query:  VEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKM
        ++P+S    S+  AC+GLEAL  G Q+HG ++K G+  ++ +GS+L+D+YSK G    A K+F S    +MV+   +++ ++ N   E +I +F  ML  
Subjt:  VEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKM

Query:  GIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKL
        GI  D   +++VL    +  SL  G+ +H + ++     +  + N LI+MY KCG    +  +F +MQ ++ +TWN MI  +  HGD + AL L++ MK 
Subjt:  GIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKL

Query:  EDE
          E
Subjt:  EDE

AT3G05340.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 9.1e-105; % identity: 63.82)
Query:  LVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFRE
        ++G   FD ATLT +LS CD  EFC + KM+H LA LSGYD+EI+VGN LI+SYFKCGC   G  VF  M  RNVIT TAVISGL +N  HE  L+LF  
Subjt:  LVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFRE

Query:  MMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIF
        +M  G V PNS+TYLS L ACSG + + EG QIH L+ K GI+S+LCI SALMDMYSK G I DAW IFES  E D VS+TVIL G   NG EEEAIQ F
Subjt:  MMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIF

Query:  LKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQL
        ++ML+ G++ID NVVSAVLGV   D SLGLG+Q+HS V+K+ FS N FV+NGLINMYSKCG L +S  VF RM +RN V+WNSMIAAFARHG GL AL+L
Subjt:  LKMLKMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQL

Query:  YENM
        YE M
Subjt:  YENM

AT3G13770.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.9e-50; % identity: 35.27)
Query:  VHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSVEPNSLTYLSLLTACSGLEALEEG
        VH     + Y     +   L+  Y KC C++   +V  EM E+NV++WTA+IS  +Q G    +L +F EMM     +PN  T+ ++LT+C     L  G
Subjt:  VHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSVEPNSLTYLSLLTACSGLEALEEG

Query:  CQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLGL
         QIHGLI+K    S + +GS+L+DMY+K+G+I +A +IFE   E D+VS T I+AG+   G +EEA+++F ++   G+  +    +++L        L  
Subjt:  CQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDENVVSAVLGVFGADTSLGL

Query:  GQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKLEDEGYMPDKKFILYYL
        G+Q H  V+++       + N LI+MYSKCG L  + ++FD M ER +++WN+M+  +++HG G + L+L+  M+ +++   PD   +L  L
Subjt:  GQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKLEDEGYMPDKKFILYYL

AT3G25970.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.4e-49; % identity: 35.88)
Query:  DKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEM-GERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGS
        D  T   +L+  D   FC ++K VH      G   EIT+ NA+ISSY  CG V    +VF  + G +++I+W ++I+G +++   E + +LF +M     
Subjt:  DKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEM-GERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGS

Query:  VEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK--SGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKML
        VE +  TY  LL+ACSG E    G  +HG+++K G++      +AL+ MY +  +G + DA  +FES +  D++S   I+ GF   G  E+A++ F  + 
Subjt:  VEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK--SGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKML

Query:  KMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNS-VTWNSMIAAFARHGDGLKALQLYEN
           IK+D+   SA+L       +L LGQQ+H+   K  F  N FV + LI MYSKCG ++ + K F ++  ++S V WN+MI  +A+HG G  +L L+  
Subjt:  KMGIKIDENVVSAVLGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNS-VTWNSMIAAFARHGDGLKALQLYEN

Query:  M
        M
Subjt:  M


Sequences
CDS sequence
ATGTGTTTAGTTGGTGATTGTAAATTTGACAAAGCTACTTTGACGACGATTTTATCTGCTTGTGATGGCTTGGAGTTCTGTTGCATTATTAAAATGGTGCATGGTTTGGC
GTTTTTGAGTGGGTATGATCAAGAAATTACTGTGGGAAATGCTTTGATTAGTTCGTATTTTAAATGTGGATGTGTTGATTTGGGGATGCAAGTTTTCTACGAGATGGGGG
AGAGAAATGTGATTACTTGGACGGCTGTGATCTCTGGTTTGGCTCAAAATGGGCGTCATGAGCACAGCCTGAAGCTGTTTAGGGAGATGATGAGTTGTGGGTCTGTGGAG
CCAAATTCTTTAACTTATTTGAGTTTACTCACTGCTTGTTCTGGTTTGGAGGCATTAGAGGAAGGATGTCAAATTCATGGTCTTATTTTGAAGCTGGGAATTCAGTCAGA
TTTGTGCATTGGAAGTGCTCTGATGGATATGTACTCAAAGTCTGGAAGAATTGGAGATGCTTGGAAGATTTTCGAGTCGGCTGAGGAATTTGATATGGTTTCATTAACTG
TTATACTTGCAGGGTTCACACACAATGGATGTGAGGAAGAAGCCATCCAGATCTTTCTGAAAATGTTGAAGATGGGGATCAAGATTGACGAAAATGTTGTTTCAGCTGTT
CTTGGGGTGTTTGGTGCTGATACATCTTTGGGACTGGGTCAACAAGTTCACTCGTTTGTTGTCAAGAAGAACTTTAGTTGCAATCCTTTTGTGAGCAATGGGCTTATAAA
CATGTACTCCAAGTGTGGAGCATTGGATGAGTCAGTGAAGGTCTTTGATAGGATGCAAGAGAGGAACTCGGTGACATGGAACTCCATGATTGCAGCATTTGCCCGACATG
GAGATGGCTTGAAAGCTCTACAACTTTATGAGAATATGAAACTGGAAGATGAAGGATATATGCCAGATAAGAAGTTCATCCTCTACTACTTGGATGATGACAGGAGGGAT
CCGATCGATAACGGTCTTGCTAACCGTCAAAATGTCAAAGAAACCGACGTTGTTTGGGAGCTGTTCTGA
mRNA sequence
ATGTGTTTAGTTGGTGATTGTAAATTTGACAAAGCTACTTTGACGACGATTTTATCTGCTTGTGATGGCTTGGAGTTCTGTTGCATTATTAAAATGGTGCATGGTTTGGC
GTTTTTGAGTGGGTATGATCAAGAAATTACTGTGGGAAATGCTTTGATTAGTTCGTATTTTAAATGTGGATGTGTTGATTTGGGGATGCAAGTTTTCTACGAGATGGGGG
AGAGAAATGTGATTACTTGGACGGCTGTGATCTCTGGTTTGGCTCAAAATGGGCGTCATGAGCACAGCCTGAAGCTGTTTAGGGAGATGATGAGTTGTGGGTCTGTGGAG
CCAAATTCTTTAACTTATTTGAGTTTACTCACTGCTTGTTCTGGTTTGGAGGCATTAGAGGAAGGATGTCAAATTCATGGTCTTATTTTGAAGCTGGGAATTCAGTCAGA
TTTGTGCATTGGAAGTGCTCTGATGGATATGTACTCAAAGTCTGGAAGAATTGGAGATGCTTGGAAGATTTTCGAGTCGGCTGAGGAATTTGATATGGTTTCATTAACTG
TTATACTTGCAGGGTTCACACACAATGGATGTGAGGAAGAAGCCATCCAGATCTTTCTGAAAATGTTGAAGATGGGGATCAAGATTGACGAAAATGTTGTTTCAGCTGTT
CTTGGGGTGTTTGGTGCTGATACATCTTTGGGACTGGGTCAACAAGTTCACTCGTTTGTTGTCAAGAAGAACTTTAGTTGCAATCCTTTTGTGAGCAATGGGCTTATAAA
CATGTACTCCAAGTGTGGAGCATTGGATGAGTCAGTGAAGGTCTTTGATAGGATGCAAGAGAGGAACTCGGTGACATGGAACTCCATGATTGCAGCATTTGCCCGACATG
GAGATGGCTTGAAAGCTCTACAACTTTATGAGAATATGAAACTGGAAGATGAAGGATATATGCCAGATAAGAAGTTCATCCTCTACTACTTGGATGATGACAGGAGGGAT
CCGATCGATAACGGTCTTGCTAACCGTCAAAATGTCAAAGAAACCGACGTTGTTTGGGAGCTGTTCTGA
Protein sequence
MCLVGDCKFDKATLTTILSACDGLEFCCIIKMVHGLAFLSGYDQEITVGNALISSYFKCGCVDLGMQVFYEMGERNVITWTAVISGLAQNGRHEHSLKLFREMMSCGSVE
PNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSKSGRIGDAWKIFESAEEFDMVSLTVILAGFTHNGCEEEAIQIFLKMLKMGIKIDENVVSAV
LGVFGADTSLGLGQQVHSFVVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQERNSVTWNSMIAAFARHGDGLKALQLYENMKLEDEGYMPDKKFILYYLDDDRRD
PIDNGLANRQNVKETDVVWELF
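
The protein sequence should correspond to the translation of the CDS under the standard genetic code. The following is a minimal Python sketch of that check, assuming the standard code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and only the first 30 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalise case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS sequence listed above.
print(translate("ATGTGTTTAGTTGGTGATTGTAAATTTGAC"))  # prints MCLVGDCKFD
```

The printed peptide matches the first ten residues of the protein sequence; running the helper on the full CDS is the way to compare the two sequences end to end.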