CuGenDBv2

CmoCh13G008670 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh13G008670
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: protein PLASTID TRANSCRIPTIONALLY ACTIVE 14
Genome location: Cmo_Chr13:7910169..7919597
RNA-Seq Expression: CmoCh13G008670
Synteny: CmoCh13G008670
Gene Ontology terms:
GO:0009416 - response to light stimulus (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0010027 - thylakoid membrane organization (biological process)
GO:0018026 - peptidyl-lysine monomethylation (biological process)
GO:0042793 - plastid transcription (biological process)
GO:0000427 - plastid-encoded plastid RNA polymerase complex (cellular component)
GO:0009534 - chloroplast thylakoid (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0016279 - protein-lysine N-methyltransferase activity (molecular function)
InterPro domains:
IPR001214 - SET domain
IPR013219 - Ribosomal protein S27/S33, mitochondrial
IPR036464 - Rubisco LSMT, substrate-binding domain superfamily
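
The genome location field above has the form `seqid:start..end`. A minimal sketch of splitting it into its parts is given here, assuming the coordinates are 1-based and inclusive (the page does not state this); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end, length).

    Assumption: start and end are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start")
    return seqid, start, end, end - start + 1

# Location string copied from the record above.
print(parse_location("Cmo_Chr13:7910169..7919597"))
```

Under the inclusive-coordinate assumption the gene region spans 9429 bp.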


Homology
GenBank top hits | e value | % identity
KAA0055365.1 protein PLASTID TRANSCRIPTIONALLY ACTIVE 14 isoform X2 [Cucumis melo var. makuwa] | 8.7e-246 | 92.49
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
        MANSISFH PTHRFISCPQVKDFRSF SPRF+   STSPK+RLRPIKAAT   AFPLLQPPKADE SPSELEPADPDFYKIG+VRSMRAYGIEFKEGPDG
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG

Query:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
        FGVYASKDVEPLRRARVIMEIPLE+MLTISQKLPWMFFPDIIPVGHPIFDIINST+PETDWDLRLACLLLYAFDR+DNFWQLYGDFLPSIDECTSLLLAS
Subjt:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS

Query:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
        EEEL ELQDQNLAS+IRDQQ RAL+FWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINM+TRIGALVQ+ANMLIPYADMLNHSF PNCFFHWRFKD
Subjt:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD

Query:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
        RMLEVMINAGQQIKKGQEMTVNYMNGQ N+MFLQRYGFSS VNPWDMI+FSGNA IHLDSFLSVFNIAGLP+ YYYNGRLS +EDTFVDGAVIAAARSLP
Subjt:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP

Query:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKILGTPCSPCLLP
        SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSE+DQK+LGT CSPCLLP
Subjt:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKILGTPCSPCLLP

KAG6584179.1 Protein PLASTID TRANSCRIPTIONALLY ACTIVE 14, partial [Cucurbita argyrosperma subsp. sororia] | 2.6e-258 | 99.1
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
        MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPS SPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG

Query:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
        FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
Subjt:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS

Query:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
        EEELLELQDQNLAS+IRDQQHRALDFWERNWHSGVPLKIKRLA+DPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
Subjt:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD

Query:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
        RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSS VNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
Subjt:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP

Query:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
        SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
Subjt:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL

XP_022923924.1 protein PLASTID TRANSCRIPTIONALLY ACTIVE 14 [Cucurbita moschata] | 2.1e-260 | 100
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
        MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG

Query:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
        FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
Subjt:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS

Query:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
        EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
Subjt:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD

Query:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
        RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
Subjt:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP

Query:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
        SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
Subjt:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL

XP_023000752.1 protein PLASTID TRANSCRIPTIONALLY ACTIVE 14 [Cucurbita maxima] | 3.8e-257 | 98.65
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
        MANSISFHQPTHRFISCPQVKDFRSFSSPRF NYPS+SPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG

Query:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
        FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
Subjt:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS

Query:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
        EEELLELQDQNLAS+IRDQQHRAL+FWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
Subjt:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD

Query:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
        RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSS VNPWDMIEFSGNA IHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
Subjt:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP

Query:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
        SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
Subjt:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL

XP_023520259.1 protein PLASTID TRANSCRIPTIONALLY ACTIVE 14 [Cucurbita pepo subsp. pepo] | 9.0e-259 | 99.32
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
        MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG

Query:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
        FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
Subjt:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS

Query:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
        EEELLELQD NLAS+IRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
Subjt:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD

Query:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
        RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSS VNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
Subjt:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP

Query:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
        SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
Subjt:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL

TrEMBL top hits | e value | % identity
A0A0A0LTZ8 SET domain-containing protein | 8.2e-242 | 93
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
        MANSISFHQPTHRFISCPQVKDFRSF SPRF+N  S SPK+RLRPIKAAT   AFPLLQPPKADE SPSELEPADPDFYKIG+VRSMRAYGIEFKEGPDG
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG

Query:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
        FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIP+GHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
Subjt:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS

Query:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
        EEELLELQDQNLAS+IRDQQ RAL+FWERNWHSGVPLKIKRLARDPKRFIWA+SIAQSRCINM+TRIGALVQ+ANMLIPYADMLNHSF+PNCFFHWRFKD
Subjt:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD

Query:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
        RMLEVMINAGQQIKKGQEMTVNYMNGQ N+MFLQRYGFSS VNPWDMIEFS NA IHLDSFLSVFNIAGLP+ YYYNGRLS++EDTFVDGAVIAAARSLP
Subjt:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP

Query:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
        SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTS++DQK+L
Subjt:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL

A0A5A7UP05 Protein PLASTID TRANSCRIPTIONALLY ACTIVE 14 isoform X2 | 4.2e-246 | 92.49
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
        MANSISFH PTHRFISCPQVKDFRSF SPRF+   STSPK+RLRPIKAAT   AFPLLQPPKADE SPSELEPADPDFYKIG+VRSMRAYGIEFKEGPDG
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG

Query:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
        FGVYASKDVEPLRRARVIMEIPLE+MLTISQKLPWMFFPDIIPVGHPIFDIINST+PETDWDLRLACLLLYAFDR+DNFWQLYGDFLPSIDECTSLLLAS
Subjt:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS

Query:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
        EEEL ELQDQNLAS+IRDQQ RAL+FWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINM+TRIGALVQ+ANMLIPYADMLNHSF PNCFFHWRFKD
Subjt:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD

Query:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
        RMLEVMINAGQQIKKGQEMTVNYMNGQ N+MFLQRYGFSS VNPWDMI+FSGNA IHLDSFLSVFNIAGLP+ YYYNGRLS +EDTFVDGAVIAAARSLP
Subjt:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP

Query:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKILGTPCSPCLLP
        SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSE+DQK+LGT CSPCLLP
Subjt:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKILGTPCSPCLLP

A0A5D3BHR6 Protein PLASTID TRANSCRIPTIONALLY ACTIVE 14 isoform X1 | 1.6e-240 | 85.66
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
        MANSISFH PTHRFISCPQVKDFRSF SPRF+   STSPK+RLRPIKAAT   AFPLLQPPKADE SPSELEPADPDFYKIG+VRSMRAYGIEFKEGPDG
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG

Query:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
        FGVYASKDVEPLRRARVIMEIPLE+MLTISQKLPWMFFPDIIPVGHPIFDIINST+PETDWDLRLACLLLYAFDR+DNFWQLYGDFLPSIDECTSLLLAS
Subjt:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS

Query:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
        EEEL ELQDQNLAS+IRDQQ RAL+FWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINM+TRIGALVQ+ANMLIPYADM+NHSF PNCFFHWRFKD
Subjt:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD

Query:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYN-----------------------
        RMLEVMINAGQQIKKGQEMTVNYMNGQ N+MFLQRYGFSS VNPWDMI+FSGNA IHLDSFLSVFNIAGLP+ YYYN                       
Subjt:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYN-----------------------

Query:  ------------GRLSNEEDTFVDGAVIAAARSLPSWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKILGTPCSPCLLP
                    GRLS +EDTFVDGAVIAAARSLPSWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSE+DQK+LGT CSPCLLP
Subjt:  ------------GRLSNEEDTFVDGAVIAAARSLPSWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKILGTPCSPCLLP

A0A6J1E7Q3 protein PLASTID TRANSCRIPTIONALLY ACTIVE 14 | 1.0e-260 | 100
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
        MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG

Query:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
        FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
Subjt:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS

Query:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
        EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
Subjt:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD

Query:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
        RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
Subjt:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP

Query:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
        SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
Subjt:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL

A0A6J1KEJ3 protein PLASTID TRANSCRIPTIONALLY ACTIVE 14 | 1.8e-257 | 98.65
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
        MANSISFHQPTHRFISCPQVKDFRSFSSPRF NYPS+SPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDG

Query:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
        FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS
Subjt:  FGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLAS

Query:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
        EEELLELQDQNLAS+IRDQQHRAL+FWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD
Subjt:  EEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKD

Query:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
        RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSS VNPWDMIEFSGNA IHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP
Subjt:  RMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLP

Query:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
        SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
Subjt:  SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL

SwissProt top hits | e value | % identity
P53305 Mitochondrial 37S ribosomal protein S27 | 4.1e-04 | 41.11
Query:  VTEARARIFGHVLNPTGQRSTHKLLRKKLIGDKVAEWYPY-DIKK--------DDPLIMARQEQERLSKLEMLKRRGKGPPKKGQGRRAA
        V E  A+IF    NP+G R+  K+L ++L G  VA +Y   DI K         D   +  +EQ RLS +E  KRRGKG PKK +   AA
Subjt:  VTEARARIFGHVLNPTGQRSTHKLLRKKLIGDKVAEWYPY-DIKK--------DDPLIMARQEQERLSKLEMLKRRGKGPPKKGQGRRAA

Q84JF5 Protein PLASTID TRANSCRIPTIONALLY ACTIVE 14 | 2.3e-188 | 71.85
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAAT-ETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPD
        MA+S+S    T+ FIS PQ       S+PR  +      ++ +RPIK A+ ET  FPL Q P ++ESS SELE ADPDFYKIG+VRS+RAYG+EFKEGPD
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAAT-ETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPD

Query:  GFGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLA
        GFGVYASKD+EP RRARVIMEIPLELM+TI QK PWMFFPDI+P+GHPIFDIINST+PE DWD+RLACLLL++FDRDD+FW+LYGDFLP+ DEC+SLLLA
Subjt:  GFGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLA

Query:  SEEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFK
        +EE+L ELQD +L S+IR QQ R LDFWE+NWHSGVPLKIKRLA DP+RFIWAVS+AQ+RCI+MQTR+GALVQ+ NM+IPYADMLNHSF PNCF HWR K
Subjt:  SEEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFK

Query:  DRMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSL
        DRMLEVM NAGQ IKKG+EMT+NYM GQ N+M ++RYGFS+ VNPWD I+FSG++RIHL+SFLSVFNI GLP+ YY++  LS   DTFVDGAVIAAAR+L
Subjt:  DRMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSL

Query:  PSWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
        P+WSD D+PP PS ERKAVKELQ+EC++MLA +PTT+EQDQK+L
Subjt:  PSWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL

Arabidopsis top hits | e value | % identity
AT1G24610.1 Rubisco methyltransferase family protein | 1.9e-09 | 23.62
Query:  SSPSELEPADPDFYKI-----GFVRSMRAYGIEF-KEGPDGFGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPET
        S PS L P  PD  +      GFV     + ++  +E   G G+ +++ + P      ++ +P  + L            D       +   +    PE 
Subjt:  SSPSELEPADPDFYKI-----GFVRSMRAYGIEF-KEGPDGFGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPET

Query:  DWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLASEEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPK------------
         W ++L   LL      D+FW  Y   LP  +  T  +    E++  LQ   L   +  +    L+F +         +I+R   D K            
Subjt:  DWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLASEEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPK------------

Query:  --RFIWAVSIAQSRCINM---QTRIGALVQDANMLIPYADMLNHSFRPNC--FFHWRFKDRMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSS
             W +S   +R   +   +   G    D  M++P  DM NHSF+PN          D    V + A  ++K+   + +NY    +ND FL  YGF  
Subjt:  --RFIWAVSIAQSRCINM---QTRIGALVQDANMLIPYADMLNHSFRPNC--FFHWRFKDRMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSS

Query:  SVNPWDMIE
          NP+D IE
Subjt:  SVNPWDMIE

AT4G20130.1 plastid transcriptionally active 14 | 1.6e-189 | 71.85
Query:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAAT-ETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPD
        MA+S+S    T+ FIS PQ       S+PR  +      ++ +RPIK A+ ET  FPL Q P ++ESS SELE ADPDFYKIG+VRS+RAYG+EFKEGPD
Subjt:  MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAAT-ETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPD

Query:  GFGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLA
        GFGVYASKD+EP RRARVIMEIPLELM+TI QK PWMFFPDI+P+GHPIFDIINST+PE DWD+RLACLLL++FDRDD+FW+LYGDFLP+ DEC+SLLLA
Subjt:  GFGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLA

Query:  SEEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFK
        +EE+L ELQD +L S+IR QQ R LDFWE+NWHSGVPLKIKRLA DP+RFIWAVS+AQ+RCI+MQTR+GALVQ+ NM+IPYADMLNHSF PNCF HWR K
Subjt:  SEEELLELQDQNLASSIRDQQHRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFK

Query:  DRMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSL
        DRMLEVM NAGQ IKKG+EMT+NYM GQ N+M ++RYGFS+ VNPWD I+FSG++RIHL+SFLSVFNI GLP+ YY++  LS   DTFVDGAVIAAAR+L
Subjt:  DRMLEVMINAGQQIKKGQEMTVNYMNGQTNDMFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSL

Query:  PSWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL
        P+WSD D+PP PS ERKAVKELQ+EC++MLA +PTT+EQDQK+L
Subjt:  PSWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKIL

AT5G44710.1 CONTAINS InterPro DOMAIN/s: Ribosomal protein S27/S33, mitochondrial (InterPro:IPR013219); Has 101 Blast hits to 101 proteins in 55 species: Archae - 0; Bacteria - 0; Metazoa - 8; Fungi - 59; Plants - 26; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). | 6.2e-40 | 77.45
Query:  MATNSLKGMIAAALNKGVTEARARIFGHVLNPTGQRSTHKLLRKKLIGDKVAEWYPYDIKKDDPLIMARQEQERLSKLEMLKRRGKGPPKKGQGRRAAKR
        MA+ SLK +I++A+ +GVTEARARIFGH+LNPTGQRS HK+LRKKLIGDKVAEWYPYDIK +DP ++AR+E+ER+SKLEMLKRR KGPPKKG G+RAAKR
Subjt:  MATNSLKGMIAAALNKGVTEARARIFGHVLNPTGQRSTHKLLRKKLIGDKVAEWYPYDIKKDDPLIMARQEQERLSKLEMLKRRGKGPPKKGQGRRAAKR

Query:  NK
        NK
Subjt:  NK
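
The unlabelled middle line of each alignment block above appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention holds here, the sketch below tallies identities and positives from `(query, midline)` pairs. The function names are hypothetical, and the result only approximates the reported % identity because the page does not say how that figure was computed.

```python
def midline_stats(query, midline):
    """Tally one alignment segment from its query line and midline.

    Assumption: BLAST-style midline (letter = identity, '+' = conservative
    substitution, space = mismatch or gap). The midline is padded to the
    query length because trailing spaces are often lost in copied text.
    """
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "length": len(query),
        "identities": identities,
        "positives": identities + conservative,
    }

def percent_identity(segments):
    """Percent identity over a list of (query, midline) segments."""
    stats = [midline_stats(q, m) for q, m in segments]
    total = sum(s["length"] for s in stats)
    ident = sum(s["identities"] for s in stats)
    return 100.0 * ident / total if total else 0.0

# Last segment of the first GenBank hit, copied from the alignment above.
query = "SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQKILGTPCSPCLLP"
mid   = "SWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSE+DQK+LGT CSPCLLP"
print(midline_stats(query, mid))
```

Passing every segment of one hit to `percent_identity` gives a figure for the whole alignment. Gap columns count toward the length here, which is one common definition but not the only one.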


Sequences
CDS sequence
ATGGCGAATTCTATCTCCTTCCATCAGCCCACTCATCGCTTCATCTCCTGCCCGCAGGTGAAGGATTTCCGGTCTTTCTCCTCACCAAGATTCAGTAACTATCCGTCCAC
TTCCCCTAAAAGCAGATTGCGGCCGATTAAAGCTGCAACCGAAACCGCCGCATTTCCTCTTCTACAACCTCCGAAAGCCGACGAATCTTCTCCTTCTGAGTTGGAACCAG
CAGACCCTGATTTCTATAAGATAGGATTTGTTAGAAGTATGCGAGCTTATGGAATCGAATTCAAAGAAGGGCCTGATGGGTTTGGTGTATATGCTTCCAAAGATGTTGAA
CCTCTTCGCCGTGCTAGGGTAATCATGGAAATTCCATTAGAACTAATGTTAACCATAAGCCAGAAACTCCCCTGGATGTTTTTCCCTGATATAATTCCAGTGGGTCATCC
GATATTCGATATAATTAATTCGACAAATCCAGAGACTGATTGGGATCTAAGGTTAGCGTGTCTTCTGTTGTATGCATTTGATCGAGATGACAACTTTTGGCAGCTCTATG
GTGATTTTCTACCAAGTATTGACGAATGTACAAGCTTACTTCTAGCCTCCGAGGAAGAACTTTTGGAGCTGCAGGATCAAAATCTTGCTTCATCGATCAGAGACCAGCAG
CATCGTGCTTTAGATTTCTGGGAAAGGAACTGGCACAGTGGTGTACCCCTTAAGATTAAGCGGCTTGCTCGAGATCCTAAACGATTCATTTGGGCTGTGAGTATAGCACA
ATCACGATGCATAAACATGCAAACAAGGATCGGAGCTTTAGTACAAGATGCAAATATGCTAATTCCTTATGCTGATATGCTGAATCATTCTTTCCGACCAAATTGTTTTT
TCCACTGGCGTTTTAAGGATCGGATGCTTGAGGTGATGATAAATGCTGGGCAGCAGATCAAAAAAGGGCAAGAGATGACAGTCAATTACATGAATGGCCAAACGAACGAC
ATGTTTCTGCAGAGATATGGTTTTTCATCGTCTGTGAACCCTTGGGATATGATCGAGTTCTCCGGGAATGCGCGTATTCACTTAGATTCGTTCTTATCAGTTTTCAACAT
AGCTGGGCTTCCTGATGGTTATTACTACAATGGTCGGTTATCGAACGAGGAAGATACATTCGTTGATGGAGCGGTAATCGCAGCAGCAAGATCTCTGCCTTCATGGTCAG
ATGGAGATATCCCACCTAGCCCAAGCAGGGAGAGGAAAGCAGTTAAAGAGTTACAAGAAGAATGCCAACGGATGCTCGCAGCATTCCCGACCACATCAGAACAAGACCAA
AAAATCCTAGGTACGCCATGCAGTCCTTGCCTTCTCCCAAAATTCTATGCCACAAGCTACAAGAACGCTAGAAGCCTCGATCAAATACAGATTGCACCGAAAGTTGTTCA
TGGAGAAAGTGATCAAGGCATTGGATGTCTATCAAGAACGGATACTGTTCTAGTTCTGCAAATTTTCTACTGCAATTTGAGCTTAGCAGGTATTATATTGGGAAAACTTG
AAGAGAGGATGGCTACGAATAGCCTGAAGGGTATGATTGCTGCAGCACTCAATAAGGGAGTAACAGAAGCAAGGGCAAGGATATTTGGTCACGTACTTAACCCGACAGGT
CAACGATCTACCCACAAGCTTCTGCGCAAAAAACTCATCGGTGACAAAGTAGCAGAGTGGTATCCATATGACATCAAGAAGGATGATCCTCTTATCATGGCTCGTCAAGA
ACAAGAGCGCTTGTCGAAGCTTGAAATGCTGAAGCGGCGTGGTAAGGGACCACCTAAGAAGGGTCAAGGCAGGCGTGCAGCCAAGCGCAACAAGTAG
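
The CDS above can be checked against the protein sequence listed on this page by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the helper name `translate` is hypothetical and is not part of any CuGenDB tooling.

```python
BASES = "TCAG"
# Amino acids for the standard genetic code, ordered by codon with each
# position cycling through T, C, A, G (first base changes slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS above.
print(translate("ATGGCGAATTCTATCTCCTTCCATCAGCCC"))
```

Applied to the full CDS, with the line breaks left in, the output can be compared directly with the protein sequence section of this record.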
mRNA sequence
TCTTAAAAAATGGGAGGGAACGCGGAGGCGGGGAGCTGAAGTGGATAACGTTCTTCTGCAAAACATACAGCAACCTTTTGCGCTGCTCTGCTGTTGATCGGTACCTCTAA
TGGCGAATTCTATCTCCTTCCATCAGCCCACTCATCGCTTCATCTCCTGCCCGCAGGTGAAGGATTTCCGGTCTTTCTCCTCACCAAGATTCAGTAACTATCCGTCCACT
TCCCCTAAAAGCAGATTGCGGCCGATTAAAGCTGCAACCGAAACCGCCGCATTTCCTCTTCTACAACCTCCGAAAGCCGACGAATCTTCTCCTTCTGAGTTGGAACCAGC
AGACCCTGATTTCTATAAGATAGGATTTGTTAGAAGTATGCGAGCTTATGGAATCGAATTCAAAGAAGGGCCTGATGGGTTTGGTGTATATGCTTCCAAAGATGTTGAAC
CTCTTCGCCGTGCTAGGGTAATCATGGAAATTCCATTAGAACTAATGTTAACCATAAGCCAGAAACTCCCCTGGATGTTTTTCCCTGATATAATTCCAGTGGGTCATCCG
ATATTCGATATAATTAATTCGACAAATCCAGAGACTGATTGGGATCTAAGGTTAGCGTGTCTTCTGTTGTATGCATTTGATCGAGATGACAACTTTTGGCAGCTCTATGG
TGATTTTCTACCAAGTATTGACGAATGTACAAGCTTACTTCTAGCCTCCGAGGAAGAACTTTTGGAGCTGCAGGATCAAAATCTTGCTTCATCGATCAGAGACCAGCAGC
ATCGTGCTTTAGATTTCTGGGAAAGGAACTGGCACAGTGGTGTACCCCTTAAGATTAAGCGGCTTGCTCGAGATCCTAAACGATTCATTTGGGCTGTGAGTATAGCACAA
TCACGATGCATAAACATGCAAACAAGGATCGGAGCTTTAGTACAAGATGCAAATATGCTAATTCCTTATGCTGATATGCTGAATCATTCTTTCCGACCAAATTGTTTTTT
CCACTGGCGTTTTAAGGATCGGATGCTTGAGGTGATGATAAATGCTGGGCAGCAGATCAAAAAAGGGCAAGAGATGACAGTCAATTACATGAATGGCCAAACGAACGACA
TGTTTCTGCAGAGATATGGTTTTTCATCGTCTGTGAACCCTTGGGATATGATCGAGTTCTCCGGGAATGCGCGTATTCACTTAGATTCGTTCTTATCAGTTTTCAACATA
GCTGGGCTTCCTGATGGTTATTACTACAATGGTCGGTTATCGAACGAGGAAGATACATTCGTTGATGGAGCGGTAATCGCAGCAGCAAGATCTCTGCCTTCATGGTCAGA
TGGAGATATCCCACCTAGCCCAAGCAGGGAGAGGAAAGCAGTTAAAGAGTTACAAGAAGAATGCCAACGGATGCTCGCAGCATTCCCGACCACATCAGAACAAGACCAAA
AAATCCTAGGTACGCCATGCAGTCCTTGCCTTCTCCCAAAATTCTATGCCACAAGCTACAAGAACGCTAGAAGCCTCGATCAAATACAGATTGCACCGAAAGTTGTTCAT
GGAGAAAGTGATCAAGGCATTGGATGTCTATCAAGAACGGATACTGTTCTAGTTCTGCAAATTTTCTACTGCAATTTGAGCTTAGCAGGTATTATATTGGGAAAACTTGA
AGAGAGGATGGCTACGAATAGCCTGAAGGGTATGATTGCTGCAGCACTCAATAAGGGAGTAACAGAAGCAAGGGCAAGGATATTTGGTCACGTACTTAACCCGACAGGTC
AACGATCTACCCACAAGCTTCTGCGCAAAAAACTCATCGGTGACAAAGTAGCAGAGTGGTATCCATATGACATCAAGAAGGATGATCCTCTTATCATGGCTCGTCAAGAA
CAAGAGCGCTTGTCGAAGCTTGAAATGCTGAAGCGGCGTGGTAAGGGACCACCTAAGAAGGGTCAAGGCAGGCGTGCAGCCAAGCGCAACAAGTAGAGACAATGACAGTG
GAATTGTTCCTGCTAAGGCTTTATCTTTTTGTTGTTTTTTCCTGCCTAAAGTTTAGGTTATGGACCAGTTTCTCGTAGTTGTAACGACTTCTATTCATTATAAGCGATAT
TTACTCTGAAGATTCTTCGTTCATAATTGAGCGCCCTTGAAGCAAAGTAATACATACTATATTATGCAAAAAGCCAGAAACAGAGCAGCGCACCATGCGTAGTCTGGAAA
CAAAAGGCATTAATAGAACAGTATACTTGTATGTTGGAGAGAACCAACTGCTAGAATTCATTACTAGAGCTTTGTTTCGTGAACAATTCATTGACTGCAAATTGATTAGG
AATGATAATTTGATTGCCTTGGAGCTGAAATGCGGTTCATTTTCCTCCCAAGATCAAAATCTAAGGTCCTGCACTGGGCGTGGTTTCTTCATTGATCTGCTTTTGGGTCA
TTAATTTGTCGTCTTGATGAGCCATATCATAGGTTGCATGAAGACCAAAGAAAACATAATAGATAAGCATCACCAGTGTGCAGATTCCAAATCTTTCAAAAGCCTCAGGA
CCCAATGATCCCATGAGAAAGACGTTGGTTGCAATCGACAAACTCGGCAGCCATGGCACGAGAGGTACGCCCCAAACTTTTGGCTTTCTTTGCATTGGCAGAAGCAATGA
AATCCCCAACGTCCCTAGAAACCATATGGGAACGGTCACGACGTATCCTACCCAACCATTTGGGTAAAGCCCCCAGTATGCAGAAGTTGCCATGGAGGAAGCAATGATGG
TTATCAGTAGAATGAAAAGCTTCAACTGGTGTGACCGTGGAGTGACCCCTCTTGCATAGTATCTCCTGACCAGAAGCGCCACGGCCATCATCATAAAGACGAACAGCGTG
CTGACCGACAACAAGCTCGCCAGAACATCCAAGCTAGAGAAGAAAGCAACTGAACCACTTGCTACGGTTATCAACAGTGTAGCATTAATCGGAGTTCCAGTCTTTGGATG
AACAAGAGCAAACCAAGGAGGAATCATGTGAGCTCGAGCAATGTGCGTTATATACCGACCTTGCCCGAGCGCCCCAACCAGAAGAACAGTAGTCATCCCCTTCAGAGCAC
CAAGAGCCACCAAATACTTAGCCCATTTCATTCCAACACTCTCAAAGGCAACAGAGTAGGCTGCATCTGGGTTGATATCAGTATACTTCTGCATCATACTCAGCGACAGA
GCCATCAGACAGTATATCACAGTGATAACCGACATTGATCCCAACAATCCCAAAGGTATGAAGTTTTCCGCGTGCTGATCATCGCTATCGTTGCAGCGATCGTCAAGACC
CCAACAGCAATTGGGTCAAGAAGATTGAACCCATCTTTCAGATTCGTGTGGATACGCAACGAATTGTCGGGTCTGTCCAGCAGCGAGGTTAAGTAAGAAGTCCAAGCTCG
AGCGACAGCGGCGGTACCGACAATGCTCTCGAGGAGAATGTTGCCAGCAGTGATGAAAGCAGCAAAGTCCCCCAATTCAATCCTCAGGTAAGCAAAAGAGCCCCCTGCAA
CTGGGATTTCAACGGCGAACTCTGTATAGCAAAAAACAGAGAGCATTGCAGAAACACCGGAAGCTACATAGGACAAAACAATGGCTGGTCCAGCATGTTTATTGGCTTCT
TGACCGGTAAGAACGAAGATCCCTGCACCAATTACAGCCCCAAATCCAAACCAGGTGAGATCCCACCAAGTTAAGCAGCGTTTCATTTCATTCTCACTTCGTTTTCTTAG
CTCCCCGATTTCATTTTCGTCAAATGATCTGCTTTGAAGACGGTCCATGAACCGAAACCATGTCTGAGACAGAGCAGTCCGGTAATTGCTCAAGCTCTGGAAGGATTCTT
CCGGCAAGAAATCTTGTTTGCTCCATCTCCACCAATAGCTTCTCTGCTGCTGAACAATATCGTCGCCCATTCTGCTCTCTATGGAAATGGGCATGTGACAACGTCGTCTA
TATAGAGTTTAGATCCGTGAGCAATGTGGGCACTGGCTAATGGTTAGGAGGGGGGGTTTCGAAGATGCCGATGGTGGGCTTGGATTGATGTTGCTTGTCTCGTAGCTCCA
ACCCGAAAATGACCGTCTTACAGAAATATCTGAATGCTCCCATTGACGTTAGGATCGAAGACTTCCAAGTCAACTACTTTGGCTGTGCTTGGTTGTTGCTCAATCTCATC
AATTCTTTCATTGACTTTTATGCTTACTCTGTTTTCTGGTTTCTGTAGGATTATATAATCTTTTCCAAATATTCTCATGTTTACTGTGTAGCTGCTTGTCTTCCA
Protein sequence
MANSISFHQPTHRFISCPQVKDFRSFSSPRFSNYPSTSPKSRLRPIKAATETAAFPLLQPPKADESSPSELEPADPDFYKIGFVRSMRAYGIEFKEGPDGFGVYASKDVE
PLRRARVIMEIPLELMLTISQKLPWMFFPDIIPVGHPIFDIINSTNPETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLLASEEELLELQDQNLASSIRDQQ
HRALDFWERNWHSGVPLKIKRLARDPKRFIWAVSIAQSRCINMQTRIGALVQDANMLIPYADMLNHSFRPNCFFHWRFKDRMLEVMINAGQQIKKGQEMTVNYMNGQTND
MFLQRYGFSSSVNPWDMIEFSGNARIHLDSFLSVFNIAGLPDGYYYNGRLSNEEDTFVDGAVIAAARSLPSWSDGDIPPSPSRERKAVKELQEECQRMLAAFPTTSEQDQ
KILGTPCSPCLLPKFYATSYKNARSLDQIQIAPKVVHGESDQGIGCLSRTDTVLVLQIFYCNLSLAGIILGKLEERMATNSLKGMIAAALNKGVTEARARIFGHVLNPTG
QRSTHKLLRKKLIGDKVAEWYPYDIKKDDPLIMARQEQERLSKLEMLKRRGKGPPKKGQGRRAAKRNK