CuGenDBv2

Carg03810 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg03810
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: Carg_Chr08:73865..76155
RNA-Seq Expression: Carg03810
Synteny: Carg03810
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
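The genome location in this record follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this notation, not something the record itself states), the string can be split and the span measured as below. The function name `parse_location` is hypothetical and used only for illustration.

```python
import re


def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
    return seqid, start, end, end - start + 1


print(parse_location("Carg_Chr08:73865..76155"))
```

Under that assumption the gene model spans 2291 bp on Carg_Chr08.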


Homology
GenBank top hits | e value | % identity | Alignment
KAE8023775.1 hypothetical protein FH972_009438 [Carpinus fangiana] | 1.9e-54 | 30.95
Query:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIG----------
        M  AGVSV+  ++K LL+  G LR LPDG+ +H  L+RTV++P GF ++   +  C CGS ++A++LFDEMLER LVSW I+IS YA+ G          
Subjt:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIG----------

Query:  ---------------------------LSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDV
                                   L  LG Q+ S V ++G  SN  ++TA++NMYVKCGWLE AEL FD M E KN VAW GLMVGY +++ +L+D 
Subjt:  ---------------------------LSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDV

Query:  LSLFGKMVRDGV--------------------------------------------------------------------------CVF-----------
        L+LF KMV++GV                                                                          C+            
Subjt:  LSLFGKMVRDGV--------------------------------------------------------------------------CVF-----------

Query:  ---------------------DCPQACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYAS
                                QAC+AL   + G Q H D        YL+GESAM  +                    DT AW A+I GY  HG A+
Subjt:  ---------------------DCPQACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYAS

Query:  ESLRLFIKMQAFSVRPNAVTFI-----------------------------------------------------LAAIQAWYPGNACW--VVAG----S
        E+LRLF +M+   VRPNAVTFI                                                     L    A+ P    W  ++ G     
Subjt:  ESLRLFIKMQAFSVRPNAVTFI-----------------------------------------------------LAAIQAWYPGNACW--VVAG----S

Query:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS
           +GK AAEN   +ES   + +++ MFN   +FGKW+EAAH RK MA + L+KE+SCS
Subjt:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS

KAG6788016.1 hypothetical protein POTOM_004068 [Populus tomentosa] | 4.4e-51 | 30.05
Query:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL---------
        M  AG+SV+ +++KCL +  G ++ L DG+  H  +QRTV+NPP FL++   K  C CGSL++AR++FDEM ER+LVSW  +ISAYA+ G+         
Subjt:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL---------

Query:  --------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS
                                    +G Q+ S   + GL SNAS+ TA+ NMYVKCGWLE AELVF+ M+E KN VAW G+MVGY+    ++ D L+
Subjt:  --------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS

Query:  LFGKMVRDGV----CVFD------------------------------------------------------------------------CP--------
        LF KMV +GV     VF                                                                         C         
Subjt:  LFGKMVRDGV----CVFD------------------------------------------------------------------------CP--------

Query:  ----------------------QACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASES
                              QACSAL  F+ G Q H D        Y  GESAM  +                    D  AW A+I GY   G A E+
Subjt:  ----------------------QACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASES

Query:  LRLFIKMQAFSVRPNAVTFILAAIQAWYPG-------------------------------------------------------------NACWVVAGS
        L+LF +MQ   VRPNAVTFI       + G                                                               CW     
Subjt:  LRLFIKMQAFSVRPNAVTFILAAIQAWYPG-------------------------------------------------------------NACWVVAGS

Query:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS
           +G+ AAEN   ++    +  ++ MFN   +FGKW+EAA+VRK MA +NL+KELSCS
Subjt:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS

KAG7025229.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | 6.1e-218 | 100
Query:  MAEAEKKKLLKLWQSNPGIMARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFI
        MAEAEKKKLLKLWQSNPGIMARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFI
Subjt:  MAEAEKKKLLKLWQSNPGIMARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFI

Query:  MISAYAQIGLSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVCVFDCP
        MISAYAQIGLSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVCVFDCP
Subjt:  MISAYAQIGLSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVCVFDCP

Query:  QACSALTCFDFGTQVHGDGFPYLFGESAMTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFILAAIQAWYPGNACWVVAGSTRTVGKTA
        QACSALTCFDFGTQVHGDGFPYLFGESAMTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFILAAIQAWYPGNACWVVAGSTRTVGKTA
Subjt:  QACSALTCFDFGTQVHGDGFPYLFGESAMTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFILAAIQAWYPGNACWVVAGSTRTVGKTA

Query:  AENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMAKNLKKELSCSCGIPLKAQRTDRKQLLVHSERVAIA
        AENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMAKNLKKELSCSCGIPLKAQRTDRKQLLVHSERVAIA
Subjt:  AENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMAKNLKKELSCSCGIPLKAQRTDRKQLLVHSERVAIA

PNT53060.1 hypothetical protein POPTR_001G066100 [Populus trichocarpa] | 5.8e-51 | 30.05
Query:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL---------
        M  AG+SV+ +++KCL +  G ++ L DG+  H  +QRTV+NPP FL++   K  C CGSL++AR++FDEM ER+LVSW  +ISAYA+ G+         
Subjt:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL---------

Query:  --------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS
                                    +G Q+ S   + GL SNAS+ TA+ NMYVKCGWLE AELVF+ M+E KN VAW G+MVGY+    ++ D L+
Subjt:  --------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS

Query:  LFGKMVRDGV----CVFD------------------------------------------------------------------------CP--------
        LF KMV +GV     VF                                                                         C         
Subjt:  LFGKMVRDGV----CVFD------------------------------------------------------------------------CP--------

Query:  ----------------------QACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASES
                              QACSAL  F+ G Q H D        Y  GESAM  +                    D  AW A+I GY   G A E+
Subjt:  ----------------------QACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASES

Query:  LRLFIKMQAFSVRPNAVTFILAAIQAWYPG-------------------------------------------------------------NACWVVAGS
        L+LF +MQ   VRPNAVTFI       + G                                                               CW     
Subjt:  LRLFIKMQAFSVRPNAVTFILAAIQAWYPG-------------------------------------------------------------NACWVVAGS

Query:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS
           +G+ AAEN   ++    +  ++ MFN   +FGKW+EAA+VRK MA +NL+KELSCS
Subjt:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS

XP_022148899.1 pentatricopeptide repeat-containing protein At5g13270, chloroplastic [Momordica charantia] | 1.3e-82 | 40.11
Query:  AGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSH----------
        AGVSV AKTFKCL +V GNLR LPDG+FVHHLLQRTV NPPGFLQ+CA K  C CGS S+AR++FDEMLERDLVSW IMISAYAQ GL +          
Subjt:  AGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSH----------

Query:  -------------------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFG
                                 LG QM S + +HGLS+NASIETAV+NMYVKCGWLE A LVFD MT  KNKVAW GLMVGYSS+S++LQ  L+LF 
Subjt:  -------------------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFG

Query:  KMVRDGV---------------------------------------------------CV--------------------------------FDCP----
        KMV +GV                                                   C                                 FD      
Subjt:  KMVRDGV---------------------------------------------------CV--------------------------------FDCP----

Query:  -------------------QACSALTCFDFGTQVHGDG-----FPYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASESLRL
                           QACSA TCFD GTQVH D        YL+GESAM  +                    DT AW A+I GY  HGYASE+LRL
Subjt:  -------------------QACSALTCFDFGTQVHGDG-----FPYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASESLRL

Query:  FIKMQAFSVRPNAVTFILAAIQAWYPGN-------------------------------------------------------------ACWVVAGSTRT
        F +MQ   VRPNAVTFI       + G+                                                              CW+       
Subjt:  FIKMQAFSVRPNAVTFILAAIQAWYPGN-------------------------------------------------------------ACWVVAGSTRT

Query:  VGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS
        +GKTAAEN   ++    + S+VTMFN   AFGKWQEAA VRK MA KNLKKELSCS
Subjt:  VGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS

TrEMBL top hits | e value | % identity | Alignment
A0A2K2BTF2 DYW_deaminase domain-containing protein | 2.8e-51 | 30.05
Query:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL---------
        M  AG+SV+ +++KCL +  G ++ L DG+  H  +QRTV+NPP FL++   K  C CGSL++AR++FDEM ER+LVSW  +ISAYA+ G+         
Subjt:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL---------

Query:  --------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS
                                    +G Q+ S   + GL SNAS+ TA+ NMYVKCGWLE AELVF+ M+E KN VAW G+MVGY+    ++ D L+
Subjt:  --------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS

Query:  LFGKMVRDGV----CVFD------------------------------------------------------------------------CP--------
        LF KMV +GV     VF                                                                         C         
Subjt:  LFGKMVRDGV----CVFD------------------------------------------------------------------------CP--------

Query:  ----------------------QACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASES
                              QACSAL  F+ G Q H D        Y  GESAM  +                    D  AW A+I GY   G A E+
Subjt:  ----------------------QACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASES

Query:  LRLFIKMQAFSVRPNAVTFILAAIQAWYPG-------------------------------------------------------------NACWVVAGS
        L+LF +MQ   VRPNAVTFI       + G                                                               CW     
Subjt:  LRLFIKMQAFSVRPNAVTFILAAIQAWYPG-------------------------------------------------------------NACWVVAGS

Query:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS
           +G+ AAEN   ++    +  ++ MFN   +FGKW+EAA+VRK MA +NL+KELSCS
Subjt:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS

A0A314XTU1 Pentatricopeptide repeat-containing protein | 2.4e-50 | 30.23
Query:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL--------S
        M  AGVSV+ +++KCL +  G +  L DGKFVH  LQ+T+++PP FL++ A +  C CGSLS+A+++FDEML ++LVSW I+ISAYAQ G+        S
Subjt:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL--------S

Query:  H---------------------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS
        H                           LG Q+ S + + G++SN SI+T++ NMYVKCGWLE A+LVFD M + KN V W GLMVGY +   +L++VL 
Subjt:  H---------------------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS

Query:  LFGKMVRDGVCVFD----------------------------------------------------------------------------CP--------
        LF +MVR  V V D                                                                            C         
Subjt:  LFGKMVRDGVCVFD----------------------------------------------------------------------------CP--------

Query:  ----------------------QACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASES
                              QACSA+   + GTQVH D         L G SAM  +                    DT AW ++I GY  HG ASE+
Subjt:  ----------------------QACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASES

Query:  LRLFIKMQAFSVRPNAVTFILAAIQAWYPG-------------------------------------------------------------NACWVVAGS
        LRLF +MQ   VRPN+VTFI       + G                                                               CW+    
Subjt:  LRLFIKMQAFSVRPNAVTFILAAIQAWYPG-------------------------------------------------------------NACWVVAGS

Query:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS
           +GK AAEN   ++    + S++ MFN   + GKW+EAA  R+ MA +NL+KE+ CS
Subjt:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS

A0A5N6R4U8 DYW_deaminase domain-containing protein | 9.3e-55 | 30.95
Query:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIG----------
        M  AGVSV+  ++K LL+  G LR LPDG+ +H  L+RTV++P GF ++   +  C CGS ++A++LFDEMLER LVSW I+IS YA+ G          
Subjt:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIG----------

Query:  ---------------------------LSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDV
                                   L  LG Q+ S V ++G  SN  ++TA++NMYVKCGWLE AEL FD M E KN VAW GLMVGY +++ +L+D 
Subjt:  ---------------------------LSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDV

Query:  LSLFGKMVRDGV--------------------------------------------------------------------------CVF-----------
        L+LF KMV++GV                                                                          C+            
Subjt:  LSLFGKMVRDGV--------------------------------------------------------------------------CVF-----------

Query:  ---------------------DCPQACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYAS
                                QAC+AL   + G Q H D        YL+GESAM  +                    DT AW A+I GY  HG A+
Subjt:  ---------------------DCPQACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYAS

Query:  ESLRLFIKMQAFSVRPNAVTFI-----------------------------------------------------LAAIQAWYPGNACW--VVAG----S
        E+LRLF +M+   VRPNAVTFI                                                     L    A+ P    W  ++ G     
Subjt:  ESLRLFIKMQAFSVRPNAVTFI-----------------------------------------------------LAAIQAWYPGNACW--VVAG----S

Query:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS
           +GK AAEN   +ES   + +++ MFN   +FGKW+EAAH RK MA + L+KE+SCS
Subjt:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS

A0A6J1D5E7 pentatricopeptide repeat-containing protein At5g13270, chloroplastic | 6.2e-83 | 40.11
Query:  AGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSH----------
        AGVSV AKTFKCL +V GNLR LPDG+FVHHLLQRTV NPPGFLQ+CA K  C CGS S+AR++FDEMLERDLVSW IMISAYAQ GL +          
Subjt:  AGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSH----------

Query:  -------------------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFG
                                 LG QM S + +HGLS+NASIETAV+NMYVKCGWLE A LVFD MT  KNKVAW GLMVGYSS+S++LQ  L+LF 
Subjt:  -------------------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFG

Query:  KMVRDGV---------------------------------------------------CV--------------------------------FDCP----
        KMV +GV                                                   C                                 FD      
Subjt:  KMVRDGV---------------------------------------------------CV--------------------------------FDCP----

Query:  -------------------QACSALTCFDFGTQVHGDG-----FPYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASESLRL
                           QACSA TCFD GTQVH D        YL+GESAM  +                    DT AW A+I GY  HGYASE+LRL
Subjt:  -------------------QACSALTCFDFGTQVHGDG-----FPYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASESLRL

Query:  FIKMQAFSVRPNAVTFILAAIQAWYPGN-------------------------------------------------------------ACWVVAGSTRT
        F +MQ   VRPNAVTFI       + G+                                                              CW+       
Subjt:  FIKMQAFSVRPNAVTFILAAIQAWYPGN-------------------------------------------------------------ACWVVAGSTRT

Query:  VGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS
        +GKTAAEN   ++    + S+VTMFN   AFGKWQEAA VRK MA KNLKKELSCS
Subjt:  VGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS

A0A6J5TNM9 DYW_deaminase domain-containing protein | 3.1e-50 | 30.23
Query:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL--------S
        M  AGVSV+ +++KCL +  G +  L DGK VH  L++T+++PP FL++ A +  C CGSLS+A+++FDEML ++LVSW I+ISAYAQ G+        S
Subjt:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL--------S

Query:  H---------------------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS
        H                           LG Q+ S V + G++SN SI+T++ NMYVKCGWLE A+LVFD M + KN V W GLMVGY +   +L++VL 
Subjt:  H---------------------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS

Query:  LFGKMVRDGVCVFD--------------------------------------------------------------------------------------
        LF +MVR  V V D                                                                                      
Subjt:  LFGKMVRDGVCVFD--------------------------------------------------------------------------------------

Query:  -------CP-------------QACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASES
               C              QACSA+   +FGTQVH D         L G SAM  +                    DT AW ++I GY  HG ASE+
Subjt:  -------CP-------------QACSALTCFDFGTQVHGDGF-----PYLFGESAMTNL--------------------DTAAWIAVIFGYGNHGYASES

Query:  LRLFIKMQAFSVRPNAVTFILAAIQAWYPG-------------------------------------------------------------NACWVVAGS
        LRLF +MQ   VRPN+VTFI       + G                                                               CW+    
Subjt:  LRLFIKMQAFSVRPNAVTFILAAIQAWYPG-------------------------------------------------------------NACWVVAGS

Query:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS
           +GK AAEN   ++    + S++ MFN   + GKW+EAA  R+ MA +NL+KE+ CS
Subjt:  TRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMA-KNLKKELSCS

SwissProt top hits | e value | % identity | Alignment
O23169 Pentatricopeptide repeat-containing protein At4g37170 | 2.7e-19 | 24.42
Query:  AKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL------------------
        A T+  L+QV    R L +GK VH  ++ +   P   + +   +    CGSL +AR++FDEM  RDL SW +M++ YA++GL                  
Subjt:  AKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL------------------

Query:  -------------------------------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKN
                                                            G ++   + + GL S+  + +++M+MY KCG ++ A  +FD + E K+
Subjt:  -------------------------------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKN

Query:  KVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVC-------VFDCPQACSALTCFDFGTQVHG----DGF-PYLFGESAMTNL-----------------
         V+W  ++  Y  +S+  ++  SLF ++V  G C             AC+ LT  + G QVHG     GF PY F  S++ ++                 
Subjt:  KVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVC-------VFDCPQACSALTCFDFGTQVHG----DGF-PYLFGESAMTNL-----------------

Query:  ---DTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFI
           D  +W ++I G   +G   E+L+ F  +     +P+ VTF+
Subjt:  ---DTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFI

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic | 2.1e-19 | 25.41
Query:  FKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSHLGNQMLSQV----------
        F  LL+V G+   L  GK +H LL ++  +   F  +        C  ++ AR++FD M ERDLVSW  +++ Y+Q G++ +  +M+  +          
Subjt:  FKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSHLGNQMLSQV----------

Query:  -------------------------FKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVCV
                                  + G  S  +I TA+++MY KCG LE A  +FD M E +N V+W  ++  Y    N  ++ + +F KM+ +GV  
Subjt:  -------------------------FKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVCV

Query:  FDCPQACSALTCFDFGTQVHGDGFPYLFGESAM-----------------TNLDTAA-------------WIAVIFGYGNHGYASESLRLFIKMQAFSVR
         D     +   C D G    G     L  E  +                   +DTAA             W A+I G+  +G   ++L  F +M++ +V+
Subjt:  FDCPQACSALTCFDFGTQVHGDGFPYLFGESAM-----------------TNLDTAA-------------WIAVIFGYGNHGYASESLRLFIKMQAFSVR

Query:  PNAVTFI
        P+  T++
Subjt:  PNAVTFI

Q56X05 Pentatricopeptide repeat-containing protein At1g06143 | 2.7e-19 | 29.2
Query:  GSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSN
        G +  AR++FDEM ERD ++W  M+SAY ++      N + +Q+ +     N +    ++N Y+  G LE AE +F+ M   K+ ++W  ++ GY S++ 
Subjt:  GSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSN

Query:  ELQDVLSLFGKMVRDG-----VCVFDCPQACSALTCFDFGTQVH----GDGFPY-LFGESAMTNL--------------------DTAAWIAVIFGYGNH
          ++ +++F KM+ +G     V +     AC+ L   + G +VH     +GF   ++  SA+ ++                    +   W ++I G   H
Subjt:  ELQDVLSLFGKMVRDG-----VCVFDCPQACSALTCFDFGTQVH----GDGFPY-LFGESAMTNL--------------------DTAAWIAVIFGYGNH

Query:  GYASESLRLFIKMQAFSVRPNAVTFI
        G+A E+L++F KM+  SV+PNAVTF+
Subjt:  GYASESLRLFIKMQAFSVRPNAVTFI

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial | 3.2e-20 | 23.65
Query:  FKCLLQVYGNLRVLPDGKFVH-HLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQ----------------IGLSH--
        +  LL+     ++L  G+ VH H+LQ   R+    + +        CGSL  AR++F++M +RD V+W  +IS Y+Q                 G S   
Subjt:  FKCLLQVYGNLRVLPDGKFVH-HLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQ----------------IGLSH--

Query:  -----------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDG--
                          G+Q+     K G  SN  + +A++++Y + G ++ A+LVFD + E +N V+W  L+ G++ RS   +  L LF  M+RDG  
Subjt:  -----------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDG--

Query:  ---VCVFDCPQACSALTCFDFGTQVH--------------GDGFPYLFGESA-----------MTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSV
                   ACS+    + G  VH              G+    ++ +S            +   D  +W +++  Y  HG+  E++  F +M+   +
Subjt:  ---VCVFDCPQACSALTCFDFGTQVH--------------GDGFPYLFGESA-----------MTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSV

Query:  RPNAVTF--ILAAI-----------------------QAWY---------------------------PGNACWVV------AGSTRTVGKTAAENPPSV
        RPN ++F  +L A                        +AW+                           P  A W              +G  AAE+   +
Subjt:  RPNAVTF--ILAAI-----------------------QAWY---------------------------PGNACWVV------AGSTRTVGKTAAENPPSV

Query:  ESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMAKN-LKKELSCS
        +       HV ++N   + G+W +AA VRKKM ++ +KKE +CS
Subjt:  ESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMAKN-LKKELSCS

Q9LYU9 Pentatricopeptide repeat-containing protein At5g13270, chloroplastic | 1.1e-31 | 37.13
Query:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL---------
        M +AGVSV++ +++CL +    LR L  G+ +H  ++  + NP   LQ+C  +  C C SL +A +LFDEM E + VS   MISAYA+ G+         
Subjt:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL---------

Query:  --------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS
                                     G Q+ + V + GL SN SIET ++NMYVKCGWL  A+ VFD M  KK  VA  GLMVGY +++   +D L 
Subjt:  --------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS

Query:  LFGKMVRDGV----CVFDCP-QACSALTCFDFGTQVH
        LF  +V +GV     VF    +AC++L   + G Q+H
Subjt:  LFGKMVRDGV----CVFDCP-QACSALTCFDFGTQVH

Q9LYU9 Pentatricopeptide repeat-containing protein At5g13270, chloroplastic | 6.5e-05 | 52.17
Query:  AMTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFI
        +M N D  AW A I G+  +G ASE+LRLF KM +  ++PN+VTFI
Subjt:  AMTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFI

Arabidopsis top hits | e value | % identity | Alignment
AT1G06150.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | 1.9e-20 | 29.2
Query:  GSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSN
        G +  AR++FDEM ERD ++W  M+SAY ++      N + +Q+ +     N +    ++N Y+  G LE AE +F+ M   K+ ++W  ++ GY S++ 
Subjt:  GSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSN

Query:  ELQDVLSLFGKMVRDG-----VCVFDCPQACSALTCFDFGTQVH----GDGFPY-LFGESAMTNL--------------------DTAAWIAVIFGYGNH
          ++ +++F KM+ +G     V +     AC+ L   + G +VH     +GF   ++  SA+ ++                    +   W ++I G   H
Subjt:  ELQDVLSLFGKMVRDG-----VCVFDCPQACSALTCFDFGTQVH----GDGFPY-LFGESAMTNL--------------------DTAAWIAVIFGYGNH

Query:  GYASESLRLFIKMQAFSVRPNAVTFI
        G+A E+L++F KM+  SV+PNAVTF+
Subjt:  GYASESLRLFIKMQAFSVRPNAVTFI

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.5e-20 | 25.41
Query:  FKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSHLGNQMLSQV----------
        F  LL+V G+   L  GK +H LL ++  +   F  +        C  ++ AR++FD M ERDLVSW  +++ Y+Q G++ +  +M+  +          
Subjt:  FKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGLSHLGNQMLSQV----------

Query:  -------------------------FKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVCV
                                  + G  S  +I TA+++MY KCG LE A  +FD M E +N V+W  ++  Y    N  ++ + +F KM+ +GV  
Subjt:  -------------------------FKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVCV

Query:  FDCPQACSALTCFDFGTQVHGDGFPYLFGESAM-----------------TNLDTAA-------------WIAVIFGYGNHGYASESLRLFIKMQAFSVR
         D     +   C D G    G     L  E  +                   +DTAA             W A+I G+  +G   ++L  F +M++ +V+
Subjt:  FDCPQACSALTCFDFGTQVHGDGFPYLFGESAM-----------------TNLDTAA-------------WIAVIFGYGNHGYASESLRLFIKMQAFSVR

Query:  PNAVTFI
        P+  T++
Subjt:  PNAVTFI

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 2.3e-21 | 23.65
Query:  FKCLLQVYGNLRVLPDGKFVH-HLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQ----------------IGLSH--
        +  LL+     ++L  G+ VH H+LQ   R+    + +        CGSL  AR++F++M +RD V+W  +IS Y+Q                 G S   
Subjt:  FKCLLQVYGNLRVLPDGKFVH-HLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQ----------------IGLSH--

Query:  -----------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDG--
                          G+Q+     K G  SN  + +A++++Y + G ++ A+LVFD + E +N V+W  L+ G++ RS   +  L LF  M+RDG  
Subjt:  -----------------LGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDG--

Query:  ---VCVFDCPQACSALTCFDFGTQVH--------------GDGFPYLFGESA-----------MTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSV
                   ACS+    + G  VH              G+    ++ +S            +   D  +W +++  Y  HG+  E++  F +M+   +
Subjt:  ---VCVFDCPQACSALTCFDFGTQVH--------------GDGFPYLFGESA-----------MTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSV

Query:  RPNAVTF--ILAAI-----------------------QAWY---------------------------PGNACWVV------AGSTRTVGKTAAENPPSV
        RPN ++F  +L A                        +AW+                           P  A W              +G  AAE+   +
Subjt:  RPNAVTF--ILAAI-----------------------QAWY---------------------------PGNACWVV------AGSTRTVGKTAAENPPSV

Query:  ESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMAKN-LKKELSCS
        +       HV ++N   + G+W +AA VRKKM ++ +KKE +CS
Subjt:  ESRGHSISHVTMFNSCTAFGKWQEAAHVRKKMAKN-LKKELSCS

AT4G37170.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.9e-20 | 24.42
Query:  AKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL------------------
        A T+  L+QV    R L +GK VH  ++ +   P   + +   +    CGSL +AR++FDEM  RDL SW +M++ YA++GL                  
Subjt:  AKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL------------------

Query:  -------------------------------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKN
                                                            G ++   + + GL S+  + +++M+MY KCG ++ A  +FD + E K+
Subjt:  -------------------------------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKN

Query:  KVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVC-------VFDCPQACSALTCFDFGTQVHG----DGF-PYLFGESAMTNL-----------------
         V+W  ++  Y  +S+  ++  SLF ++V  G C             AC+ LT  + G QVHG     GF PY F  S++ ++                 
Subjt:  KVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVC-------VFDCPQACSALTCFDFGTQVHG----DGF-PYLFGESAMTNL-----------------

Query:  ---DTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFI
           D  +W ++I G   +G   E+L+ F  +     +P+ VTF+
Subjt:  ---DTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFI

AT5G13270.1 Pentatricopeptide repeat (PPR) superfamily protein | 7.6e-33 | 37.13
Query:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL---------
        M +AGVSV++ +++CL +    LR L  G+ +H  ++  + NP   LQ+C  +  C C SL +A +LFDEM E + VS   MISAYA+ G+         
Subjt:  MARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL---------

Query:  --------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS
                                     G Q+ + V + GL SN SIET ++NMYVKCGWL  A+ VFD M  KK  VA  GLMVGY +++   +D L 
Subjt:  --------------------------SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLS

Query:  LFGKMVRDGV----CVFDCP-QACSALTCFDFGTQVH
        LF  +V +GV     VF    +AC++L   + G Q+H
Subjt:  LFGKMVRDGV----CVFDCP-QACSALTCFDFGTQVH

AT5G13270.1 Pentatricopeptide repeat (PPR) superfamily protein | 4.6e-06 | 52.17
Query:  AMTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFI
        +M N D  AW A I G+  +G ASE+LRLF KM +  ++PN+VTFI
Subjt:  AMTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFI
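The % identity values in the hit tables can be related to the alignment midlines. As a rough sketch, assuming the usual BLAST-style convention that identity is the number of identical positions (letters on the midline; `+` marks a conservative substitution and a space marks a mismatch or gap) divided by the alignment length, the short AT5G13270.1 alignment listed last can be checked as follows. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query_line, midline):
    """Percent of alignment columns whose midline character is a letter."""
    # Letters on the midline mark identical residues; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    # Alignment length is taken from the query line, gaps included.
    return 100.0 * identical / len(query_line)


query = "AMTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFI"
midline = "+M N D  AW A I G+  +G ASE+LRLF KM +  ++PN+VTFI"
print(round(percent_identity(query, midline), 2))
```

For this alignment the midline carries 24 identical positions over 46 columns, which agrees with the 52.17 reported in the table.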


Sequences
CDS sequence
ATGGCCGAGGCTGAAAAGAAAAAACTTCTGAAACTTTGGCAAAGCAACCCAGGCATTATGGCAAGGGCGGGAGTTTCTGTTGCTGCCAAAACGTTCAAATGCCTTCTCCA
AGTGTATGGGAATCTGAGGGTTTTACCAGATGGTAAATTCGTTCACCATCTACTGCAAAGAACGGTGCGCAACCCACCTGGTTTCCTTCAGAGTTGTGCTTTTAAAAATG
ATTGTTATTGTGGTAGCTTGTCAAATGCCCGAAGACTGTTCGATGAAATGCTCGAAAGAGACTTGGTGTCATGGTTTATAATGATATCTGCATATGCGCAGATTGGGCTT
TCTCACCTTGGCAATCAAATGCTTTCTCAGGTGTTTAAACATGGACTGAGCTCCAATGCCTCGATTGAAACTGCAGTCATGAACATGTATGTCAAGTGTGGTTGGTTAGA
AGCAGCTGAGCTCGTCTTTGATATGATGACTGAAAAGAAAAACAAGGTGGCTTGGATTGGACTGATGGTAGGATACTCTTCACGAAGTAACGAACTGCAGGATGTTCTAT
CACTCTTTGGGAAAATGGTCAGAGATGGTGTTTGTGTTTTCGATTGTCCTCAAGCTTGTTCTGCTCTTACATGTTTCGATTTTGGAACTCAAGTTCATGGCGATGGATTT
CCTTACCTATTTGGAGAGAGTGCAATGACAAACCTTGATACTGCTGCTTGGATTGCTGTAATATTTGGTTATGGAAATCATGGGTATGCTTCTGAATCTTTGAGGCTTTT
TATTAAAATGCAGGCGTTCAGTGTTCGGCCAAATGCTGTTACCTTTATTCTTGCAGCCATTCAGGCTTGGTACCCTGGAAATGCTTGTTGGGTGGTTGCTGGATCCACAA
GAACCGTCGGGAAGACTGCAGCTGAAAACCCTCCTTCAGTTGAATCCAGGGGACACTCAATCAGTCATGTAACCATGTTTAATTCGTGTACCGCATTTGGGAAATGGCAA
GAAGCAGCTCACGTGAGAAAGAAAATGGCTAAGAACTTGAAGAAGGAGCTCAGCTGCAGTTGTGGAATACCATTAAAGGCTCAACGCACAGATCGAAAACAACTTCTGGT
TCATAGTGAGAGAGTTGCCATTGCTTAA
mRNA sequence
ATGGCCGAGGCTGAAAAGAAAAAACTTCTGAAACTTTGGCAAAGCAACCCAGGCATTATGGCAAGGGCGGGAGTTTCTGTTGCTGCCAAAACGTTCAAATGCCTTCTCCA
AGTGTATGGGAATCTGAGGGTTTTACCAGATGGTAAATTCGTTCACCATCTACTGCAAAGAACGGTGCGCAACCCACCTGGTTTCCTTCAGAGTTGTGCTTTTAAAAATG
ATTGTTATTGTGGTAGCTTGTCAAATGCCCGAAGACTGTTCGATGAAATGCTCGAAAGAGACTTGGTGTCATGGTTTATAATGATATCTGCATATGCGCAGATTGGGCTT
TCTCACCTTGGCAATCAAATGCTTTCTCAGGTGTTTAAACATGGACTGAGCTCCAATGCCTCGATTGAAACTGCAGTCATGAACATGTATGTCAAGTGTGGTTGGTTAGA
AGCAGCTGAGCTCGTCTTTGATATGATGACTGAAAAGAAAAACAAGGTGGCTTGGATTGGACTGATGGTAGGATACTCTTCACGAAGTAACGAACTGCAGGATGTTCTAT
CACTCTTTGGGAAAATGGTCAGAGATGGTGTTTGTGTTTTCGATTGTCCTCAAGCTTGTTCTGCTCTTACATGTTTCGATTTTGGAACTCAAGTTCATGGCGATGGATTT
CCTTACCTATTTGGAGAGAGTGCAATGACAAACCTTGATACTGCTGCTTGGATTGCTGTAATATTTGGTTATGGAAATCATGGGTATGCTTCTGAATCTTTGAGGCTTTT
TATTAAAATGCAGGCGTTCAGTGTTCGGCCAAATGCTGTTACCTTTATTCTTGCAGCCATTCAGGCTTGGTACCCTGGAAATGCTTGTTGGGTGGTTGCTGGATCCACAA
GAACCGTCGGGAAGACTGCAGCTGAAAACCCTCCTTCAGTTGAATCCAGGGGACACTCAATCAGTCATGTAACCATGTTTAATTCGTGTACCGCATTTGGGAAATGGCAA
GAAGCAGCTCACGTGAGAAAGAAAATGGCTAAGAACTTGAAGAAGGAGCTCAGCTGCAGTTGTGGAATACCATTAAAGGCTCAACGCACAGATCGAAAACAACTTCTGGT
TCATAGTGAGAGAGTTGCCATTGCTTAA
Protein sequence
MAEAEKKKLLKLWQSNPGIMARAGVSVAAKTFKCLLQVYGNLRVLPDGKFVHHLLQRTVRNPPGFLQSCAFKNDCYCGSLSNARRLFDEMLERDLVSWFIMISAYAQIGL
SHLGNQMLSQVFKHGLSSNASIETAVMNMYVKCGWLEAAELVFDMMTEKKNKVAWIGLMVGYSSRSNELQDVLSLFGKMVRDGVCVFDCPQACSALTCFDFGTQVHGDGF
PYLFGESAMTNLDTAAWIAVIFGYGNHGYASESLRLFIKMQAFSVRPNAVTFILAAIQAWYPGNACWVVAGSTRTVGKTAAENPPSVESRGHSISHVTMFNSCTAFGKWQ
EAAHVRKKMAKNLKKELSCSCGIPLKAQRTDRKQLLVHSERVAIA
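The CDS and protein sequences listed here should correspond through translation. The sketch below assumes the standard nuclear genetic code and reading from the first base; `translate` is a hypothetical helper, not part of any CuGenDB tool. It is shown on the first six and last five codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGCCGAGGCTGAAAAG"))  # first six codons of the CDS
print(translate("GTTGCCATTGCTTAA"))     # last five codons, including the stop
```

The two fragments translate to MAEAEK and VAIA, matching the start and end of the protein sequence in this record. Joining the CDS lines into one string and passing it to `translate` would extend the same check to the whole sequence.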