; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003851 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003851
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr08:10411756..10413249
RNA-Seq ExpressionHG10003851
SyntenyHG10003851
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ97821.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]1.0e-13556.97Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        MKSM VRPNCFTFPLVLKSCAK GAF+EGEEIHCEVIKGGFEGNQFVATTLIDVYS GRAIGSAYKVFV MLERNIVAWTSMISGYILCN VALARRLFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
        LAPERD+VLWNIM                                              LFEEMP+RNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
Subjt:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL

Query:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------
        VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG                                                                 
Subjt:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL
                                                                                             IYKN+DLAELALQKL
Subjt:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL

Query:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ
        I LEPKN ANYV+LSNIYGDLGRWKDVARLKIL+RDTGSKKLPGCSLIEVNDSVV+FYSLDERHSQS+EIYGVL GLMKLLRSFGYEP IMELQQ
Subjt:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ

XP_011656468.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sativus]6.5e-13857.55Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        MKSMDVRPNCFTFPLVLKSCAK GAF+EGEEIHCEVIKGG EGNQFVATTLIDVYSGGRAIGSAYK+FV MLERNIVAWTSMISGYILCN VALARRLFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
        LAPERD+VLWNIM                                              LFEEMP+RNVFSWNGLIGGYAHNG FFEVLRCFKRMLIDGL
Subjt:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL

Query:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------
        VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG                                                                 
Subjt:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL
                                                                                             IYKNIDLAELALQKL
Subjt:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL

Query:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS
        I LEPKNPANYV+LSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVV+FYSLDERHSQSKEIYGVLKGLMKLLRSFGYEP +MEL QGS
Subjt:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS

XP_016901901.1 PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucumis melo]2.7e-13656.94Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        MKSM VRPNCFTFPLVLKSCAK GAF+EGEEIH EVIKGGFEGNQFVATTLIDVYS GRAIGSAYKVFV MLERNIVAWTSMISGYILCN VALARRLFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
        LAPERD+VLWNIM                                              +FEEMP+RNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
Subjt:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL

Query:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------
        VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG                                                                 
Subjt:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL
                                                                                             IYKN+DLAELALQKL
Subjt:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL

Query:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS
        I LEPKNPANYV+LSNIYGDLGRWKDVARLKILMRDTG KKLPGCSLIEVNDSVV+FYSLDERHSQS+EIYGVLKGLMKLLRSFGYEP IMELQQ S
Subjt:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS

XP_022139698.1 pentatricopeptide repeat-containing protein At3g29230-like [Momordica charantia]1.8e-13556.34Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        MKS DVRPNCFTFPLVLKSCAK  AF+EGEEIHCEVIKGGF GNQFVATTLIDVYSGGRAIGSAYKVFV MLERNIVAWTSMISGYILCNDV  ARRLFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
        LAPERD+VLW+IM                                              LFEEMP+RNVFSWNGLIGGYAHNGRFF+VL CFKRML+DG 
Subjt:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL

Query:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIGI----------------------------------------------------------------
        VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG                                                                 
Subjt:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIGI----------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------YKNIDLAELALQKL
                                                                                              YKNIDLAELALQKL
Subjt:  --------------------------------------------------------------------------------------YKNIDLAELALQKL

Query:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS
        IQLEPKNPANYVMLSNIYGDL RWKDVARLKILMRDTG KKLPGCSLIEVNDSVV+FYSLDERHSQSKEIYGVLKGLMKLLRS+GYEP IMELQQGS
Subjt:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS

XP_038886719.1 pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Benincasa hispida]9.1e-14057.75Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        MKSMDVRPNCFTFPLVLKSCAK  AF+EGEEIHCEVIKGGFEGNQFVATTLIDVYSGGR IGSAYKVFV MLERNIVAWTSMISGYILCNDVALARRLFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
        LAPERD+VLWNIM                                              LFEEMP+RNVFSWNGLIGGYAHNGRFFEVLRCFKRML D +
Subjt:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL

Query:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------
        VVPNDATLVTVLSACARLGALDLGKWVH+YAATIG                                                                 
Subjt:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL
                                                                                             IYKNIDLAELALQKL
Subjt:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL

Query:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS
        IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKK+PGCSLIEVNDSVV+FYSLDERHSQSKEIYGVLKGLMKLLRSFGYEP IMELQQGS
Subjt:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS

TrEMBL top hitse value%identityAlignment
A0A0A0KBY4 Uncharacterized protein3.2e-13857.55Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        MKSMDVRPNCFTFPLVLKSCAK GAF+EGEEIHCEVIKGG EGNQFVATTLIDVYSGGRAIGSAYK+FV MLERNIVAWTSMISGYILCN VALARRLFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
        LAPERD+VLWNIM                                              LFEEMP+RNVFSWNGLIGGYAHNG FFEVLRCFKRMLIDGL
Subjt:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL

Query:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------
        VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG                                                                 
Subjt:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL
                                                                                             IYKNIDLAELALQKL
Subjt:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL

Query:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS
        I LEPKNPANYV+LSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVV+FYSLDERHSQSKEIYGVLKGLMKLLRSFGYEP +MEL QGS
Subjt:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS

A0A1S4E0Z4 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like1.3e-13656.94Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        MKSM VRPNCFTFPLVLKSCAK GAF+EGEEIH EVIKGGFEGNQFVATTLIDVYS GRAIGSAYKVFV MLERNIVAWTSMISGYILCN VALARRLFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
        LAPERD+VLWNIM                                              +FEEMP+RNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
Subjt:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL

Query:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------
        VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG                                                                 
Subjt:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL
                                                                                             IYKN+DLAELALQKL
Subjt:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL

Query:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS
        I LEPKNPANYV+LSNIYGDLGRWKDVARLKILMRDTG KKLPGCSLIEVNDSVV+FYSLDERHSQS+EIYGVLKGLMKLLRSFGYEP IMELQQ S
Subjt:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS

A0A5A7V6Y9 Pentatricopeptide repeat-containing protein1.3e-13656.94Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        MKSM VRPNCFTFPLVLKSCAK GAF+EGEEIH EVIKGGFEGNQFVATTLIDVYS GRAIGSAYKVFV MLERNIVAWTSMISGYILCN VALARRLFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
        LAPERD+VLWNIM                                              +FEEMP+RNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
Subjt:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL

Query:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------
        VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG                                                                 
Subjt:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL
                                                                                             IYKN+DLAELALQKL
Subjt:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL

Query:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS
        I LEPKNPANYV+LSNIYGDLGRWKDVARLKILMRDTG KKLPGCSLIEVNDSVV+FYSLDERHSQS+EIYGVLKGLMKLLRSFGYEP IMELQQ S
Subjt:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS

A0A5D3BGP3 Pentatricopeptide repeat-containing protein5.0e-13656.97Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        MKSM VRPNCFTFPLVLKSCAK GAF+EGEEIHCEVIKGGFEGNQFVATTLIDVYS GRAIGSAYKVFV MLERNIVAWTSMISGYILCN VALARRLFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
        LAPERD+VLWNIM                                              LFEEMP+RNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
Subjt:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL

Query:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------
        VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG                                                                 
Subjt:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-----------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL
                                                                                             IYKN+DLAELALQKL
Subjt:  -------------------------------------------------------------------------------------IYKNIDLAELALQKL

Query:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ
        I LEPKN ANYV+LSNIYGDLGRWKDVARLKIL+RDTGSKKLPGCSLIEVNDSVV+FYSLDERHSQS+EIYGVL GLMKLLRSFGYEP IMELQQ
Subjt:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ

A0A6J1CGA7 pentatricopeptide repeat-containing protein At3g29230-like8.6e-13656.34Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        MKS DVRPNCFTFPLVLKSCAK  AF+EGEEIHCEVIKGGF GNQFVATTLIDVYSGGRAIGSAYKVFV MLERNIVAWTSMISGYILCNDV  ARRLFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL
        LAPERD+VLW+IM                                              LFEEMP+RNVFSWNGLIGGYAHNGRFF+VL CFKRML+DG 
Subjt:  LAPERDIVLWNIM----------------------------------------------LFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGL

Query:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIGI----------------------------------------------------------------
        VVPNDATLVTVLSACARLGALDLGKWVHVYAATIG                                                                 
Subjt:  VVPNDATLVTVLSACARLGALDLGKWVHVYAATIGI----------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------YKNIDLAELALQKL
                                                                                              YKNIDLAELALQKL
Subjt:  --------------------------------------------------------------------------------------YKNIDLAELALQKL

Query:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS
        IQLEPKNPANYVMLSNIYGDL RWKDVARLKILMRDTG KKLPGCSLIEVNDSVV+FYSLDERHSQSKEIYGVLKGLMKLLRS+GYEP IMELQQGS
Subjt:  IQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic4.4e-3626.04Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        M+S DV+ +  T   VL +CAK      G ++   + +     N  +A  ++D+Y+   +I  A ++F  M E++ V WT+M+ GY +  D   AR + +
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIML--------------------------------------------------------------------------------------
          P++DIV WN ++                                                                                      
Subjt:  LAPERDIVLWNIML--------------------------------------------------------------------------------------

Query:  FEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALD----------------------------LGK-------
        F  +  R+VF W+ +IGG A +G   E +  F +M  +  V PN  T   V  AC+  G +D                            LG+       
Subjt:  FEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALD----------------------------LGK-------

Query:  -----------WVHVYAATIG---IYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLD
                      V+ A +G   I+ N++LAE+A  +L++LEP+N   +V+LSNIY  LG+W++V+ L+  MR TG KK PGCS IE++  + +F S D
Subjt:  -----------WVHVYAATIG---IYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLD

Query:  ERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ
          H  S+++YG L  +M+ L+S GYEP I ++ Q
Subjt:  ERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ

Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic6.8e-3725.55Show/hide
Query:  KSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYI--------------
        +S  +  N  T   V+ +C ++GAF   E IH  V+K G + ++FV  TL+D+YS    I  A ++F +M +R++V W +MI+GY+              
Subjt:  KSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYI--------------

Query:  ---------------------------------------------------LCNDVALARRLFDLAPERDIVLWNIMLFEEMPDRNVFSWNGLIGGYAHN
                                                           L  DVA+   L D+  +   +  +  +F+++P +NV +WN +I  Y  +
Subjt:  ---------------------------------------------------LCNDVALARRLFDLAPERDIVLWNIMLFEEMPDRNVFSWNGLIGGYAHN

Query:  GRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALD---------------------------------------------------LGKWVHVYA
        G   E +   + M++ G V PN+ T ++V +AC+  G +D                                                    G W  +  
Subjt:  GRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALD---------------------------------------------------LGKWVHVYA

Query:  ATIGIYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLL
        A+  I+ N+++ E+A Q LIQLEP   ++YV+L+NIY   G W     ++  M++ G +K PGCS IE  D V +F + D  H QS+++ G L+ L + +
Subjt:  ATIGIYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLL

Query:  RSFGYEP
        R  GY P
Subjt:  RSFGYEP

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic2.0e-4425.19Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        M S+ + PN +TFP VLKSCAK+ AF EG++IH  V+K G + + +V T+LI +Y     +  A+KVF +   R++V++T++I GY     +  A++LFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM-------------------------------------------------------------------------------------LF
          P +D+V WN M                                                                                     LF
Subjt:  LAPERDIVLWNIM-------------------------------------------------------------------------------------LF

Query:  EEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALDLGKWVHVY----------AATI-----------------
        E +P ++V SWN LIGGY H   + E L  F+ ML  G   PND T++++L ACA LGA+D+G+W+HVY          A+++                 
Subjt:  EEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALDLGKWVHVY----------AATI-----------------

Query:  ------------------------------------------------------------------------------------------------GIYK
                                                                                                        G++K
Subjt:  ------------------------------------------------------------------------------------------------GIYK

Query:  -----------------------------NIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYS
                                     N++L E   + LI++EP+NP +YV+LSNIY   GRW +VA+ + L+ D G KK+PGCS IE++  V +F  
Subjt:  -----------------------------NIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYS

Query:  LDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ
         D+ H +++EIYG+L+ +  LL   G+ P   E+ Q
Subjt:  LDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ

Q9LZ19 Pentatricopeptide repeat-containing protein At5g04780, mitochondrial1.7e-3530.26Show/hide
Query:  MDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFDLAP
        M +  N FT   V+ +C+   A +EG+++H  + K GF  N FVA++ +D+Y+   ++  +Y +F E+ E+N+  W ++ISG+                P
Subjt:  MDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFDLAP

Query:  ERDIVLWNIMLFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLG----ALDLGKWVHV---------YAATIGI
        +  ++L+  M  + M    V +++ L+    H G   E  R FK M     + PN      ++    R G    A +L K +             A+  +
Subjt:  ERDIVLWNIMLFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLG----ALDLGKWVHV---------YAATIGI

Query:  YKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGY
        YKN++LAE+A +KL +LEP+N  N+V+LSNIY    +W+++A+ + L+RD   KK+ G S I++ D V  F   +  H + +EI   L  L+   R FGY
Subjt:  YKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGY

Query:  EPII
        +P +
Subjt:  EPII

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic9.8e-3627.93Show/hide
Query:  SMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFDLA
        S ++ PN FTF  +LKSC+       G+ IH  V+K G   + +VAT L+DVY+ G  + SA KVF  M ER++V+ T+MI+ Y    +V  AR LFD  
Subjt:  SMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFDLA

Query:  PERDIVLWNIML--------------------------------------------------------------------------------------FE
         ERDIV WN+M+                                                                                      F 
Subjt:  PERDIVLWNIML--------------------------------------------------------------------------------------FE

Query:  EMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRML-IDGLVVPNDATLVTVLSACARLGALDLG---------------KWVH------------------
        + P +++ +WN +I GYA +G   + LR F  M  I GL  P D T +  L ACA  G ++ G               K  H                  
Subjt:  EMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRML-IDGLVVPNDATLVTVLSACARLGALDLG---------------KWVH------------------

Query:  -------------VYAATIG---IYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDE
                     ++++ +G   ++ +  L +   + LI L  KN   YV+LSNIY  +G ++ VA+++ LM++ G  K PG S IE+ + V +F + D 
Subjt:  -------------VYAATIG---IYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDE

Query:  RHSQSKEIYGVLKGLMKLLRSFGYEP
         HS+SKEIY +L+ + + ++S GY P
Subjt:  RHSQSKEIYGVLKGLMKLLRSFGYEP

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-4525.19Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        M S+ + PN +TFP VLKSCAK+ AF EG++IH  V+K G + + +V T+LI +Y     +  A+KVF +   R++V++T++I GY     +  A++LFD
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIM-------------------------------------------------------------------------------------LF
          P +D+V WN M                                                                                     LF
Subjt:  LAPERDIVLWNIM-------------------------------------------------------------------------------------LF

Query:  EEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALDLGKWVHVY----------AATI-----------------
        E +P ++V SWN LIGGY H   + E L  F+ ML  G   PND T++++L ACA LGA+D+G+W+HVY          A+++                 
Subjt:  EEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALDLGKWVHVY----------AATI-----------------

Query:  ------------------------------------------------------------------------------------------------GIYK
                                                                                                        G++K
Subjt:  ------------------------------------------------------------------------------------------------GIYK

Query:  -----------------------------NIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYS
                                     N++L E   + LI++EP+NP +YV+LSNIY   GRW +VA+ + L+ D G KK+PGCS IE++  V +F  
Subjt:  -----------------------------NIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYS

Query:  LDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ
         D+ H +++EIYG+L+ +  LL   G+ P   E+ Q
Subjt:  LDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ

AT1G13410.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.0e-5733.49Show/hide
Query:  IGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFDLAPERDIVLWNIML----------------------------------------------
        I SA KVF EM+E+N+V WTSMI+GY+L  D+  ARR FDL+PERDIVLWN M+                                              
Subjt:  IGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFDLAPERDIVLWNIML----------------------------------------------

Query:  FEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-------------------------
        F++MP+RNVFSWNGLI GYA NGR  EVL  FKRM+ +G VVPNDAT+  VLSACA+LGA D GKWVH Y  T+G                         
Subjt:  FEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALDLGKWVHVYAATIG-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------IYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYS
                                  +YK +D+ E+AL++LI+LEP+NPAN+VMLSNIYGD GR+ D ARLK+ MRDTG KK  G S IE +D +V+FYS
Subjt:  --------------------------IYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYS

Query:  LDERHSQSKEIYGVLKGL
          E+H +++E+  +L+ L
Subjt:  LDERHSQSKEIYGVLKGL

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-3726.04Show/hide
Query:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD
        M+S DV+ +  T   VL +CAK      G ++   + +     N  +A  ++D+Y+   +I  A ++F  M E++ V WT+M+ GY +  D   AR + +
Subjt:  MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFD

Query:  LAPERDIVLWNIML--------------------------------------------------------------------------------------
          P++DIV WN ++                                                                                      
Subjt:  LAPERDIVLWNIML--------------------------------------------------------------------------------------

Query:  FEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALD----------------------------LGK-------
        F  +  R+VF W+ +IGG A +G   E +  F +M  +  V PN  T   V  AC+  G +D                            LG+       
Subjt:  FEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALD----------------------------LGK-------

Query:  -----------WVHVYAATIG---IYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLD
                      V+ A +G   I+ N++LAE+A  +L++LEP+N   +V+LSNIY  LG+W++V+ L+  MR TG KK PGCS IE++  + +F S D
Subjt:  -----------WVHVYAATIG---IYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLD

Query:  ERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ
          H  S+++YG L  +M+ L+S GYEP I ++ Q
Subjt:  ERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQ

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.8e-3825.55Show/hide
Query:  KSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYI--------------
        +S  +  N  T   V+ +C ++GAF   E IH  V+K G + ++FV  TL+D+YS    I  A ++F +M +R++V W +MI+GY+              
Subjt:  KSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYI--------------

Query:  ---------------------------------------------------LCNDVALARRLFDLAPERDIVLWNIMLFEEMPDRNVFSWNGLIGGYAHN
                                                           L  DVA+   L D+  +   +  +  +F+++P +NV +WN +I  Y  +
Subjt:  ---------------------------------------------------LCNDVALARRLFDLAPERDIVLWNIMLFEEMPDRNVFSWNGLIGGYAHN

Query:  GRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALD---------------------------------------------------LGKWVHVYA
        G   E +   + M++ G V PN+ T ++V +AC+  G +D                                                    G W  +  
Subjt:  GRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALD---------------------------------------------------LGKWVHVYA

Query:  ATIGIYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLL
        A+  I+ N+++ E+A Q LIQLEP   ++YV+L+NIY   G W     ++  M++ G +K PGCS IE  D V +F + D  H QS+++ G L+ L + +
Subjt:  ATIGIYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLL

Query:  RSFGYEP
        R  GY P
Subjt:  RSFGYEP

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.9e-3727.93Show/hide
Query:  SMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFDLA
        S ++ PN FTF  +LKSC+       G+ IH  V+K G   + +VAT L+DVY+ G  + SA KVF  M ER++V+ T+MI+ Y    +V  AR LFD  
Subjt:  SMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFDLA

Query:  PERDIVLWNIML--------------------------------------------------------------------------------------FE
         ERDIV WN+M+                                                                                      F 
Subjt:  PERDIVLWNIML--------------------------------------------------------------------------------------FE

Query:  EMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRML-IDGLVVPNDATLVTVLSACARLGALDLG---------------KWVH------------------
        + P +++ +WN +I GYA +G   + LR F  M  I GL  P D T +  L ACA  G ++ G               K  H                  
Subjt:  EMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRML-IDGLVVPNDATLVTVLSACARLGALDLG---------------KWVH------------------

Query:  -------------VYAATIG---IYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDE
                     ++++ +G   ++ +  L +   + LI L  KN   YV+LSNIY  +G ++ VA+++ LM++ G  K PG S IE+ + V +F + D 
Subjt:  -------------VYAATIG---IYKNIDLAELALQKLIQLEPKNPANYVMLSNIYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDE

Query:  RHSQSKEIYGVLKGLMKLLRSFGYEP
         HS+SKEIY +L+ + + ++S GY P
Subjt:  RHSQSKEIYGVLKGLMKLLRSFGYEP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGCATGGACGTGAGACCCAACTGCTTCACGTTCCCTCTTGTCCTCAAATCTTGTGCGAAGAACGGTGCCTTTTTGGAAGGTGAAGAGATACATTGTGAGGTGAT
TAAGGGAGGGTTTGAAGGGAACCAATTCGTGGCTACTACGCTGATCGATGTGTATTCTGGTGGGAGGGCGATTGGGTCTGCGTACAAGGTGTTTGTTGAAATGCTCGAGA
GAAATATAGTTGCTTGGACTTCCATGATTAGTGGCTACATTTTGTGTAATGATGTGGCACTTGCTCGCCGACTTTTTGATTTGGCACCAGAACGGGATATTGTCCTGTGG
AACATTATGCTGTTTGAAGAGATGCCTGACCGGAATGTTTTCTCCTGGAATGGATTGATTGGAGGATATGCTCATAATGGGCGTTTCTTTGAAGTATTGCGTTGTTTCAA
ACGAATGCTAATCGATGGGCTTGTTGTTCCTAATGATGCTACCCTTGTCACTGTGCTATCCGCTTGTGCAAGATTAGGAGCTCTTGACTTGGGAAAGTGGGTGCATGTAT
ATGCTGCGACGATCGGGATTTACAAAAACATAGATCTGGCTGAGTTAGCTCTTCAAAAACTCATTCAGCTTGAACCCAAAAACCCTGCAAACTATGTCATGCTATCAAAT
ATCTACGGAGATCTTGGTAGATGGAAAGATGTTGCACGGTTGAAGATTTTAATGAGGGATACCGGGTCCAAAAAATTGCCAGGATGTAGCTTGATTGAGGTGAATGATAG
CGTGGTTCAATTTTATTCCTTAGATGAGAGGCATTCTCAGAGCAAGGAAATCTATGGAGTTTTAAAGGGGTTGATGAAATTGTTAAGATCATTTGGGTATGAACCAATTA
TTATGGAACTCCAGCAAGGATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGCATGGACGTGAGACCCAACTGCTTCACGTTCCCTCTTGTCCTCAAATCTTGTGCGAAGAACGGTGCCTTTTTGGAAGGTGAAGAGATACATTGTGAGGTGAT
TAAGGGAGGGTTTGAAGGGAACCAATTCGTGGCTACTACGCTGATCGATGTGTATTCTGGTGGGAGGGCGATTGGGTCTGCGTACAAGGTGTTTGTTGAAATGCTCGAGA
GAAATATAGTTGCTTGGACTTCCATGATTAGTGGCTACATTTTGTGTAATGATGTGGCACTTGCTCGCCGACTTTTTGATTTGGCACCAGAACGGGATATTGTCCTGTGG
AACATTATGCTGTTTGAAGAGATGCCTGACCGGAATGTTTTCTCCTGGAATGGATTGATTGGAGGATATGCTCATAATGGGCGTTTCTTTGAAGTATTGCGTTGTTTCAA
ACGAATGCTAATCGATGGGCTTGTTGTTCCTAATGATGCTACCCTTGTCACTGTGCTATCCGCTTGTGCAAGATTAGGAGCTCTTGACTTGGGAAAGTGGGTGCATGTAT
ATGCTGCGACGATCGGGATTTACAAAAACATAGATCTGGCTGAGTTAGCTCTTCAAAAACTCATTCAGCTTGAACCCAAAAACCCTGCAAACTATGTCATGCTATCAAAT
ATCTACGGAGATCTTGGTAGATGGAAAGATGTTGCACGGTTGAAGATTTTAATGAGGGATACCGGGTCCAAAAAATTGCCAGGATGTAGCTTGATTGAGGTGAATGATAG
CGTGGTTCAATTTTATTCCTTAGATGAGAGGCATTCTCAGAGCAAGGAAATCTATGGAGTTTTAAAGGGGTTGATGAAATTGTTAAGATCATTTGGGTATGAACCAATTA
TTATGGAACTCCAGCAAGGATCATGA
Protein sequenceShow/hide protein sequence
MKSMDVRPNCFTFPLVLKSCAKNGAFLEGEEIHCEVIKGGFEGNQFVATTLIDVYSGGRAIGSAYKVFVEMLERNIVAWTSMISGYILCNDVALARRLFDLAPERDIVLW
NIMLFEEMPDRNVFSWNGLIGGYAHNGRFFEVLRCFKRMLIDGLVVPNDATLVTVLSACARLGALDLGKWVHVYAATIGIYKNIDLAELALQKLIQLEPKNPANYVMLSN
IYGDLGRWKDVARLKILMRDTGSKKLPGCSLIEVNDSVVQFYSLDERHSQSKEIYGVLKGLMKLLRSFGYEPIIMELQQGS