; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G003300 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G003300
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmo_Chr06:1611639..1613533
RNA-Seq ExpressionCmoCh06G003300
SyntenyCmoCh06G003300
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580575.1 ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. sororia]4.6e-11561.07Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        R+I+SWNAM+A YSQG FY +CKELFK ML   E +PNA+TAV VLQACAQSN LIFGME                          CGSLDY  ELFE M
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN
        PEKDEVT+   I   ++H                          G +Q    D             C  ++  L         F   K GKEIHAYAVRN
Subjt:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN

Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP
         YDGN+ V TAIIDSYAKSGYLH AR V DQ K RSLIIWTAIISAYAA+G AN  LS FYE+LTNGIRPD V FT V  +CAHSGELDEAWKIFNVLLP
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        E+GIQPL+EHYACMVGVLS A KLSDAV+FISKMPIEPT KVWGALLNGASVAGDVELGK VFD LLD EPENTGN IIM NLYS FGRWK++
Subjt:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

KAG6596437.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]4.9e-14969.64Show/hide
Query:  MGSASNPPQRRDELRCLWPLHPARTDFLFVRLGNCFALVLFYVLSLPRISSDRSSSPSNQNLAALEMPTMCSRKLCLIECLREILSWNAMLAEYSQGWFY
        MGSASNPPQRRDELR LWPLHPART FLFVRLGNCFALVLFYVLSLPRISSDR SSPSNQNLAALEMPTMCS                            
Subjt:  MGSASNPPQRRDELRCLWPLHPARTDFLFVRLGNCFALVLFYVLSLPRISSDRSSSPSNQNLAALEMPTMCSRKLCLIECLREILSWNAMLAEYSQGWFY

Query:  VECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGMECGSLDYVWELFEEMPEKDEVTWRDDIVLH---------------VGWLQTEYYDTCERS
                                                 CGSLDYVWELFEEMPEKDEVT RDDI LH               VGWLQTEYYDTCERS
Subjt:  VECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGMECGSLDYVWELFEEMPEKDEVTWRDDIVLH---------------VGWLQTEYYDTCERS

Query:  SHLLTFFNTKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV
        SHLLTF NTKDG                                  ARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV
Subjt:  SHLLTFFNTKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV

Query:  CSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNII
        CSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEK SDAVDFISKMPIEPTTKVW ALLNGASVAGDVELGKCVFDSLLDTEPENTGNNII
Subjt:  CSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNII

Query:  MDNLYSLFGRWKKSG
        MDNLYSLFGRWKKSG
Subjt:  MDNLYSLFGRWKKSG

KAG7017327.1 ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyrosperma]3.9e-11460.81Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        R+I+SWNAM+A YSQG FY +CKELFK ML   E +PNA+TAV VLQACAQSN LIFGME                          CGSLDY  ELFE M
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN
        PEKDEVT+   I   ++H                          G +Q    D             C  ++  L         F   K GKEIHAYAVRN
Subjt:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN

Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP
         YDGN+ V TAIIDSYAKSGYL  AR V DQ K RSLIIWTAIISAYAA+G AN  LS FYE+LTNGIRPD V FT V  +CAHSGELDEAWKIFNVLLP
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        E+GIQPL+EHYACMVGVLS A KLSDAV+FISKMPIEPT KVWGALLNGASVAGDVELGK VFD LLD EPENTGN IIM NLYS FGRWK++
Subjt:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

KAG7027978.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-11885.21Show/hide
Query:  LFEEMPEKDEVTWRDDIVLH-------VGWLQTEYYDTCERSSHLLTFFNTKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRS
        +F+ MP +D V   D  +         +    TEYYDTCERSSHLLTF NTKDG         NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRS
Subjt:  LFEEMPEKDEVTWRDDIVLH-------VGWLQTEYYDTCERSSHLLTFFNTKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRS

Query:  LIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIE
        LIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEK SDAVDFISKMPIE
Subjt:  LIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIE

Query:  PTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKSG
        PTTKVW ALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKSG
Subjt:  PTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKSG

XP_022145703.1 pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia]9.3e-11661.07Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        R+I+SWNAM+A +SQG FY ECKELFKEMLS VEL+PNA+TAV VLQACAQSN LIFGME                          CGSLDY  ELFEEM
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN
        PEKDEVT+   I   ++H                          G +Q    D             C  ++  L         F   K GKEIHAYAVRN
Subjt:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN

Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP
        GY+GN+ V TAIIDSYAKSGYLH A  V D  KGRSLIIWTAIISAYAA+G ANVALS FYE+L NGI+PD V FT V  +CAHSGELDEAWKIFN++LP
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        EYGIQPL+EHYACMVGVLS A KLSDAVDFISKMPIEP+ KVWGALLNGASVAGDVELGK VFD LL+ EPENTG  IIM NLYS  GRWK++
Subjt:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

TrEMBL top hitse value%identityAlignment
A0A0A0LFN1 Uncharacterized protein1.1e-10958.82Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        R+I+SWNAMLA YSQG  Y +CKELF+ MLS +E++PNA+TAV VLQACAQSN LIFG+E                          CGSLDY  ELFEEM
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD-----------------TCERSSHLLTF--FNT-KDGKEIHAYAVRN
         EKD +T+   I   ++H                          G +Q    +                 T   +S L  F  F+T K GKEIH YA+RN
Subjt:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD-----------------TCERSSHLLTF--FNT-KDGKEIHAYAVRN

Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP
         YD N+ V TAIIDSYAK GYLH A+LV DQ KGRSLI WT+IISAYA +G ANVALS FYE+LTNGI+PD V FT V  +CAHSGELDEAWKIFNVLLP
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK
        EYGIQPL+EHYACMVGVLS A KLSDAV+FISKMP+EPT KVWGALLNGASVAGDVELGK VFD L + EPENTGN +IM NLYS  GRWK
Subjt:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK

A0A5A7TRM4 Pentatricopeptide repeat-containing protein2.7e-10857.25Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        R+I+SWNAMLA YSQG  Y +CKELF+ M S +E++PNA+TAV VLQACAQSN LIFGME                          CGSLDY  ELFEEM
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYDTC-----ERSSH--------------LLTFFNT-KDGKEIHAYAVRN
        PEKD +T+   I   ++H                          G +Q    D          SH              + + F+T K GKEIH YA+RN
Subjt:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYDTC-----ERSSH--------------LLTFFNT-KDGKEIHAYAVRN

Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP
         YDGN+ V TAIIDSYAK GYL  AR V DQ KGRSLI WT+IISAYA +G ANVALS FYE+LT GI+PD V FT V  +CAHSGELDEAWKIFN+LLP
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        +YGIQPL+EHYACMVGVLS A KLSDAV+FISKMP+EP  KVWGALLNGASVAGDVELGK VFD L + EP NTGN +IM NLYS  GRWK++
Subjt:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

A0A6J1CWN9 pentatricopeptide repeat-containing protein At2g373104.5e-11661.07Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        R+I+SWNAM+A +SQG FY ECKELFKEMLS VEL+PNA+TAV VLQACAQSN LIFGME                          CGSLDY  ELFEEM
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN
        PEKDEVT+   I   ++H                          G +Q    D             C  ++  L         F   K GKEIHAYAVRN
Subjt:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN

Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP
        GY+GN+ V TAIIDSYAKSGYLH A  V D  KGRSLIIWTAIISAYAA+G ANVALS FYE+L NGI+PD V FT V  +CAHSGELDEAWKIFN++LP
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        EYGIQPL+EHYACMVGVLS A KLSDAVDFISKMPIEP+ KVWGALLNGASVAGDVELGK VFD LL+ EPENTG  IIM NLYS  GRWK++
Subjt:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

A0A6J1F110 pentatricopeptide repeat-containing protein At2g373104.2e-11460.56Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        R+I+SWNAM+A YSQG FY +CKELFK ML   E +PNA+TAV VLQACA SN LIFGME                          CGSLDY  ELFE M
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN
        PEKDEVT+   I   ++H                          G +Q    D             C  ++  L         F   K GKEIHAYAVRN
Subjt:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN

Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP
         YDGN+ V TAIIDSYAKSGYL  AR V DQ K RSLIIWTAIISAYAA+G AN  LS FYE+LTNGIRPD V FT V  +CAHSGELDEAWKIFNVLLP
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        E+GIQPL+EHYACMVGVLS A KLSDAV+FISKMPIEPT KVWGALLNGASVAGDVELGK VFD LLD EPENTGN IIM NLYS FGRWK++
Subjt:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

A0A6J1J0S5 pentatricopeptide repeat-containing protein At2g373106.1e-11360.31Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        R+I+SWNAM+A YSQG FY +CKELFK ML   E +PNA+TAV VLQACAQSN LIFGME                          CGSLDY  ELFE M
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN
        PEKDEVT+   I   ++H                          G +Q    D             C  ++  L         F   K GKEIHAYAVRN
Subjt:  PEKDEVTWRDDI---VLH-------------------------VGWLQTEYYD------------TCERSSHLLT--------FFNTKDGKEIHAYAVRN

Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP
         YDGN+ V TAIIDSYAKSGYL  AR V DQ K RSLIIWTAIISAYAA+G AN  LS FYE+LTNGIRPD V FT V  +CAHSGEL+EAWKIFNVLLP
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        E+GIQPL+EHYACMVGVLS A KLSDAV+FISKMPIEPT KVWGALLNGASVAGDVELGK VFD LLD EPENTGN IIM NLYS FG WK++
Subjt:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic6.9e-4530.1Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        ++++SWN+M+  + Q     +  ELFK+M S  +++ + VT V VL ACA+   L FG +                          CGS++    LF+ M
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI--------------VLH-------VGW--LQTEYYDTCERSSHLLTFFN--------------------------TKDGKEIHAYAVR
         EKD VTW   +              VL+       V W  L + Y    + +  L+ F                             + G+ IH+Y  +
Subjt:  PEKDEVTWRDDI--------------VLH-------VGW--LQTEYYDTCERSSHLLTFFN--------------------------TKDGKEIHAYAVR

Query:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLL
        +G   N  V +A+I  Y+K G L ++R V +  + R + +W+A+I   A +G  N A+  FY++    ++P+ V FT V C+C+H+G +DEA  +F+ + 
Subjt:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLL

Query:  PEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK
          YGI P  +HYAC+V VL  +  L  AV FI  MPI P+T VWGALL    +  ++ L +     LL+ EP N G ++++ N+Y+  G+W+
Subjt:  PEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK

P0C899 Putative pentatricopeptide repeat-containing protein At3g491423.4e-4432.54Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGMECGSLDYVWELFEEMPEKDEVTWRDDIVLHVGWLQ----TE
        R+++SWN+++  Y+Q   + +  E+ +EM S V++  +A T   +L A + +          ++ YV ++F +M +K  V+W   I +++         E
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGMECGSLDYVWELFEEMPEKDEVTWRDDIVLHVGWLQ----TE

Query:  YY----------DTCERSSHLLTFFNTKD---GKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVAL
         Y          D    +S L    +T     GK+IH Y  R     N+ +  A+ID YAK G L +AR V +  K R ++ WTA+ISAY  +G    A+
Subjt:  YY----------DTCERSSHLLTFFNTKD---GKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVAL

Query:  SFFYEILTNGIRPDSVAF-TPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVE
        + F ++  +G+ PDS+AF T + +C+H+G L+E    F ++   Y I P +EH ACMV +L  A K+ +A  FI  M +EP  +VWGALL    V  D +
Subjt:  SFFYEILTNGIRPDSVAF-TPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVE

Query:  LGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK
        +G    D L    PE +G  +++ N+Y+  GRW++
Subjt:  LGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK

Q9FND6 Pentatricopeptide repeat-containing protein At5g40410, mitochondrial4.4e-4430.56Show/hide
Query:  MCSRKLCLIECLREILSWNAMLAEYSQGWFYVECKE-LFKEMLSLVELRPNAVTAVIVLQACAQSN-----------YLIFGM---------------EC
        +C+ KL      R+++SWN++++ YS   +  +C E L + M+S V  RPN VT + ++ AC                + FG+               + 
Subjt:  MCSRKLCLIECLREILSWNAMLAEYSQGWFYVECKE-LFKEMLSLVELRPNAVTAVIVLQACAQSN-----------YLIFGM---------------EC

Query:  GSLDYVWELFEEMPEKDEVTWRDDIVLHVGWLQTE----YYDTCERSSH-------LLTFFNTKD------GKEIHAYAVRNGYDGNVCVVTAIIDSYAK
        G L    +LFE++  K+ V+W   IV+H+     E    Y++   R  H       L    + +D       + IH   +  G+ GN C+ TA++D Y+K
Subjt:  GSLDYVWELFEEMPEKDEVTWRDDIVLHVGWLQTE----YYDTCERSSH-------LLTFFNTKD------GKEIHAYAVRNGYDGNVCVVTAIIDSYAK

Query:  SGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVL
         G L  +  V  +      + WTA+++AYA +G    A+  F  ++  GI PD V FT +  +C+HSG ++E    F  +   Y I P ++HY+CMV +L
Subjt:  SGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVL

Query:  SLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK
          +  L DA   I +MP+EP++ VWGALL    V  D +LG    + L + EP +  N +++ N+YS  G WK
Subjt:  SLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic6.9e-4529.83Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFG--------------------------MECGSLDYVWELFEEM
        R ++S+ +M+A Y++     E  +LF+EM     + P+  T   VL  CA+   L  G                           +CGS+     +F EM
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFG--------------------------MECGSLDYVWELFEEM

Query:  PEKDEVTW-------------RDDIVLHVGWLQTEYYDTCERSSHLL-----TFFNTKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCD
          KD ++W              + + L    L+ + +   ER+   +     +      G+EIH Y +RNGY  +  V  +++D YAK G L  A ++ D
Subjt:  PEKDEVTW-------------RDDIVLHVGWLQTEYYDTCERSSHLL-----TFFNTKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCD

Query:  QFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDF
            + L+ WT +I+ Y  +G    A++ F ++   GI  D ++F  +  +C+HSG +DE W+ FN++  E  I+P +EHYAC+V +L+    L  A  F
Subjt:  QFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDF

Query:  ISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK
        I  MPI P   +WGALL G  +  DV+L + V + + + EPENTG  ++M N+Y+   +W++
Subjt:  ISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK

Q9ZUT5 Pentatricopeptide repeat-containing protein At2g373109.5e-7943.51Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        R+++SWN+M++ YSQ   + +CK+++K ML+  + +PN VT + V QAC QS+ LIFG+E                          CGSLDY   LF+EM
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI----------------------------VLHVGWLQTEYYD-----------------TCERSSHL--LTF-FNTKDGKEIHAYAVRN
         EKD VT+   I                             +  G +Q  +++                 T   SS L  LT+  N K GKEIHA+A+RN
Subjt:  PEKDEVTWRDDI----------------------------VLHVGWLQTEYYD-----------------TCERSSHL--LTF-FNTKDGKEIHAYAVRN

Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSC-AHSGELDEAWKIFNVLLP
        G D N+ V T+IID+YAK G+L  A+ V D  K RSLI WTAII+AYA +G ++ A S F ++   G +PD V  T V S  AHSG+ D A  IF+ +L 
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSC-AHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        +Y I+P +EHYACMV VLS A KLSDA++FISKMPI+P  KVWGALLNGASV GD+E+ +   D L + EPENTGN  IM NLY+  GRW+++
Subjt:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

Arabidopsis top hitse value%identityAlignment
AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.9e-4630.1Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        ++++SWN+M+  + Q     +  ELFK+M S  +++ + VT V VL ACA+   L FG +                          CGS++    LF+ M
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI--------------VLH-------VGW--LQTEYYDTCERSSHLLTFFN--------------------------TKDGKEIHAYAVR
         EKD VTW   +              VL+       V W  L + Y    + +  L+ F                             + G+ IH+Y  +
Subjt:  PEKDEVTWRDDI--------------VLH-------VGW--LQTEYYDTCERSSHLLTFFN--------------------------TKDGKEIHAYAVR

Query:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLL
        +G   N  V +A+I  Y+K G L ++R V +  + R + +W+A+I   A +G  N A+  FY++    ++P+ V FT V C+C+H+G +DEA  +F+ + 
Subjt:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLL

Query:  PEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK
          YGI P  +HYAC+V VL  +  L  AV FI  MPI P+T VWGALL    +  ++ L +     LL+ EP N G ++++ N+Y+  G+W+
Subjt:  PEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK

AT2G37310.1 Pentatricopeptide repeat (PPR) superfamily protein6.7e-8043.51Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM
        R+++SWN+M++ YSQ   + +CK+++K ML+  + +PN VT + V QAC QS+ LIFG+E                          CGSLDY   LF+EM
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGME--------------------------CGSLDYVWELFEEM

Query:  PEKDEVTWRDDI----------------------------VLHVGWLQTEYYD-----------------TCERSSHL--LTF-FNTKDGKEIHAYAVRN
         EKD VT+   I                             +  G +Q  +++                 T   SS L  LT+  N K GKEIHA+A+RN
Subjt:  PEKDEVTWRDDI----------------------------VLHVGWLQTEYYD-----------------TCERSSHL--LTF-FNTKDGKEIHAYAVRN

Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSC-AHSGELDEAWKIFNVLLP
        G D N+ V T+IID+YAK G+L  A+ V D  K RSLI WTAII+AYA +G ++ A S F ++   G +PD V  T V S  AHSG+ D A  IF+ +L 
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSC-AHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        +Y I+P +EHYACMV VLS A KLSDA++FISKMPI+P  KVWGALLNGASV GD+E+ +   D L + EPENTGN  IM NLY+  GRW+++
Subjt:  EYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-4532.54Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGMECGSLDYVWELFEEMPEKDEVTWRDDIVLHVGWLQ----TE
        R+++SWN+++  Y+Q   + +  E+ +EM S V++  +A T   +L A + +          ++ YV ++F +M +K  V+W   I +++         E
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFGMECGSLDYVWELFEEMPEKDEVTWRDDIVLHVGWLQ----TE

Query:  YY----------DTCERSSHLLTFFNTKD---GKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVAL
         Y          D    +S L    +T     GK+IH Y  R     N+ +  A+ID YAK G L +AR V +  K R ++ WTA+ISAY  +G    A+
Subjt:  YY----------DTCERSSHLLTFFNTKD---GKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVAL

Query:  SFFYEILTNGIRPDSVAF-TPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVE
        + F ++  +G+ PDS+AF T + +C+H+G L+E    F ++   Y I P +EH ACMV +L  A K+ +A  FI  M +EP  +VWGALL    V  D +
Subjt:  SFFYEILTNGIRPDSVAF-TPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVE

Query:  LGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK
        +G    D L    PE +G  +++ N+Y+  GRW++
Subjt:  LGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein4.9e-4629.83Show/hide
Query:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFG--------------------------MECGSLDYVWELFEEM
        R ++S+ +M+A Y++     E  +LF+EM     + P+  T   VL  CA+   L  G                           +CGS+     +F EM
Subjt:  REILSWNAMLAEYSQGWFYVECKELFKEMLSLVELRPNAVTAVIVLQACAQSNYLIFG--------------------------MECGSLDYVWELFEEM

Query:  PEKDEVTW-------------RDDIVLHVGWLQTEYYDTCERSSHLL-----TFFNTKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCD
          KD ++W              + + L    L+ + +   ER+   +     +      G+EIH Y +RNGY  +  V  +++D YAK G L  A ++ D
Subjt:  PEKDEVTW-------------RDDIVLHVGWLQTEYYDTCERSSHLL-----TFFNTKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLVCD

Query:  QFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDF
            + L+ WT +I+ Y  +G    A++ F ++   GI  D ++F  +  +C+HSG +DE W+ FN++  E  I+P +EHYAC+V +L+    L  A  F
Subjt:  QFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKLSDAVDF

Query:  ISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK
        I  MPI P   +WGALL G  +  DV+L + V + + + EPENTG  ++M N+Y+   +W++
Subjt:  ISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK

AT5G40410.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.2e-4530.56Show/hide
Query:  MCSRKLCLIECLREILSWNAMLAEYSQGWFYVECKE-LFKEMLSLVELRPNAVTAVIVLQACAQSN-----------YLIFGM---------------EC
        +C+ KL      R+++SWN++++ YS   +  +C E L + M+S V  RPN VT + ++ AC                + FG+               + 
Subjt:  MCSRKLCLIECLREILSWNAMLAEYSQGWFYVECKE-LFKEMLSLVELRPNAVTAVIVLQACAQSN-----------YLIFGM---------------EC

Query:  GSLDYVWELFEEMPEKDEVTWRDDIVLHVGWLQTE----YYDTCERSSH-------LLTFFNTKD------GKEIHAYAVRNGYDGNVCVVTAIIDSYAK
        G L    +LFE++  K+ V+W   IV+H+     E    Y++   R  H       L    + +D       + IH   +  G+ GN C+ TA++D Y+K
Subjt:  GSLDYVWELFEEMPEKDEVTWRDDIVLHVGWLQTE----YYDTCERSSH-------LLTFFNTKD------GKEIHAYAVRNGYDGNVCVVTAIIDSYAK

Query:  SGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVL
         G L  +  V  +      + WTA+++AYA +G    A+  F  ++  GI PD V FT +  +C+HSG ++E    F  +   Y I P ++HY+CMV +L
Subjt:  SGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVL

Query:  SLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK
          +  L DA   I +MP+EP++ VWGALL    V  D +LG    + L + EP +  N +++ N+YS  G WK
Subjt:  SLAEKLSDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTGCAAGCAATCCGCCGCAACGACGGGATGAACTACGGTGCCTATGGCCGCTTCATCCAGCTCGCACCGACTTCCTCTTCGTCCGCCTCGGCAACTGCTTCGC
GCTCGTCTTGTTCTATGTTCTGTCGCTCCCGAGAATTTCCTCGGATCGAAGCTCATCGCCTTCTAATCAAAATCTGGCAGCCTTAGAGATGCCTACAATGTGTTCGCGAA
AACTGTGTTTGATAGAATGCCTGCGAGAGATATTGTCTTGGAATGCGATGTTGGCTGAGTACTCTCAGGGTTGGTTTTATGTGGAATGCAAGGAACTATTTAAAGAGATG
TTGAGTTTAGTGGAGTTGAGGCCTAATGCAGTGACTGCTGTCATTGTTTTACAAGCTTGTGCTCAGTCAAATTATCTCATTTTTGGAATGGAATGCGGTAGCTTGGATTA
TGTTTGGGAGTTGTTTGAAGAAATGCCTGAGAAGGATGAGGTCACATGGCGCGATGATATCGTTCTACATGTTGGATGGTTGCAGACCGAATACTATGACACTTGCGAGC
GTTCTTCCCATCTTCTCACATTTTTCAACACCAAGGATGGGAAAGAAATTCATGCTTATGCCGTTAGAAACGGTTACGATGGGAATGTTTGTGTTGTCACTGCTATCATT
GATTCTTATGCTAAGTCTGGTTACCTCCACAGGGCACGACTGGTTTGTGATCAATTTAAAGGTAGGAGTCTAATCATTTGGACAGCAATAATTTCAGCCTATGCTGCCAA
TGGAGGTGCTAACGTGGCTCTGAGTTTTTTTTATGAGATTCTGACAAATGGGATTCGGCCTGATTCAGTAGCCTTTACTCCAGTGTGTTCCTGTGCCCATTCAGGGGAGT
TAGATGAAGCCTGGAAGATATTTAACGTCTTGTTACCTGAGTATGGAATTCAACCACTAATCGAGCACTATGCTTGCATGGTAGGAGTTCTTAGTCTAGCAGAAAAGCTC
TCCGATGCTGTTGACTTTATTTCTAAAATGCCAATTGAACCCACTACAAAAGTTTGGGGTGCTTTGCTCAATGGGGCTTCCGTTGCTGGTGATGTTGAGCTTGGAAAGTG
CGTTTTTGATAGTCTGCTTGACACCGAGCCTGAAAATACAGGTAACAATATCATCATGGATAACTTATATTCACTGTTTGGAAGGTGGAAAAAATCTGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCTGCAAGCAATCCGCCGCAACGACGGGATGAACTACGGTGCCTATGGCCGCTTCATCCAGCTCGCACCGACTTCCTCTTCGTCCGCCTCGGCAACTGCTTCGC
GCTCGTCTTGTTCTATGTTCTGTCGCTCCCGAGAATTTCCTCGGATCGAAGCTCATCGCCTTCTAATCAAAATCTGGCAGCCTTAGAGATGCCTACAATGTGTTCGCGAA
AACTGTGTTTGATAGAATGCCTGCGAGAGATATTGTCTTGGAATGCGATGTTGGCTGAGTACTCTCAGGGTTGGTTTTATGTGGAATGCAAGGAACTATTTAAAGAGATG
TTGAGTTTAGTGGAGTTGAGGCCTAATGCAGTGACTGCTGTCATTGTTTTACAAGCTTGTGCTCAGTCAAATTATCTCATTTTTGGAATGGAATGCGGTAGCTTGGATTA
TGTTTGGGAGTTGTTTGAAGAAATGCCTGAGAAGGATGAGGTCACATGGCGCGATGATATCGTTCTACATGTTGGATGGTTGCAGACCGAATACTATGACACTTGCGAGC
GTTCTTCCCATCTTCTCACATTTTTCAACACCAAGGATGGGAAAGAAATTCATGCTTATGCCGTTAGAAACGGTTACGATGGGAATGTTTGTGTTGTCACTGCTATCATT
GATTCTTATGCTAAGTCTGGTTACCTCCACAGGGCACGACTGGTTTGTGATCAATTTAAAGGTAGGAGTCTAATCATTTGGACAGCAATAATTTCAGCCTATGCTGCCAA
TGGAGGTGCTAACGTGGCTCTGAGTTTTTTTTATGAGATTCTGACAAATGGGATTCGGCCTGATTCAGTAGCCTTTACTCCAGTGTGTTCCTGTGCCCATTCAGGGGAGT
TAGATGAAGCCTGGAAGATATTTAACGTCTTGTTACCTGAGTATGGAATTCAACCACTAATCGAGCACTATGCTTGCATGGTAGGAGTTCTTAGTCTAGCAGAAAAGCTC
TCCGATGCTGTTGACTTTATTTCTAAAATGCCAATTGAACCCACTACAAAAGTTTGGGGTGCTTTGCTCAATGGGGCTTCCGTTGCTGGTGATGTTGAGCTTGGAAAGTG
CGTTTTTGATAGTCTGCTTGACACCGAGCCTGAAAATACAGGTAACAATATCATCATGGATAACTTATATTCACTGTTTGGAAGGTGGAAAAAATCTGGCTAG
Protein sequenceShow/hide protein sequence
MGSASNPPQRRDELRCLWPLHPARTDFLFVRLGNCFALVLFYVLSLPRISSDRSSSPSNQNLAALEMPTMCSRKLCLIECLREILSWNAMLAEYSQGWFYVECKELFKEM
LSLVELRPNAVTAVIVLQACAQSNYLIFGMECGSLDYVWELFEEMPEKDEVTWRDDIVLHVGWLQTEYYDTCERSSHLLTFFNTKDGKEIHAYAVRNGYDGNVCVVTAII
DSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKL
SDAVDFISKMPIEPTTKVWGALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKSG