; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G13950 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G13950
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr1:9536412..9538771
RNA-Seq ExpressionCSPI01G13950
SyntenyCSPI01G13950
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032594.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]4.6e-16286.42Show/hide
Query:  NRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKV-FKSSEAYDMV
        NRIAS PVGNSQIWPLYAIKC SHQSS T I P+EVKVGDE LNQIIAPRENAS C HEI+DACIDKIC LGHLAAAA LLKSLCNEK+   SS+AYDMV
Subjt:  NRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKV-FKSSEAYDMV

Query:  LLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYT
        LLAASERGDTPLLC+VFKVA++SCKSLSSASYMSFARAFTKTND SKLLECVKEI+E+TSQ C+VINRIIFAFS+ REIDKAFQIFNQMKCLSCTPDLYT
Subjt:  LLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYT

Query:  YNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIR
        YNI+LDM+GRAGRV+EILH+FVSMK+EGIAPDIVSYNTLINSLRKVGRLDIS+IYFREMVAMGI+PDLLTYTALIES+GRFGN+EEALTLLKEMKL  I 
Subjt:  YNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIR

Query:  PSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK
        PSSYIY+SLI NS  M KVELAT+LLNEMKLS+S+LARPEDFKRRK
Subjt:  PSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK

XP_008462480.1 PREDICTED: pentatricopeptide repeat-containing protein At1g11900 isoform X1 [Cucumis melo]3.5e-17085.99Show/hide
Query:  MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLK
        MSLY L I+RN LHYSYANRIAS PVGNSQIWPLYAIKC SHQSS T I P+EVKVGDE LNQIIAPRENAS C HEI+DACIDKIC LGHLAAAA LLK
Subjt:  MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLK

Query:  SLCNEKV-FKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKA
        SLCNEK+   SS+AYDMVLLAASERGDTPLLC+VFKVA++SCKSLSSASYMSFARAFTKTND SKLLECVKEI+E+TSQ C+VINRIIFAFS+ REIDKA
Subjt:  SLCNEKV-FKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKA

Query:  FQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFG
        FQIFNQMKCLSCTPDLYTYNI+LDM+GRAGRV+EILH+FVSMK+EGIAPDIVSYNTLINSLRKVGRLDIS+IYFREMVAMGI+PDLLTYTALIES+GRFG
Subjt:  FQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFG

Query:  NLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK
        N+EEALTLLKEMKL  I PSSYIY+SLI NS  M KVELAT+LLNEMKLS+S+LARPEDFKRRK
Subjt:  NLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK

XP_011654081.1 pentatricopeptide repeat-containing protein At1g11900 isoform X1 [Cucumis sativus]1.0e-19899.45Show/hide
Query:  MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLK
        MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAP ENASKCIHEIIDACIDKICRLGHLAAAAHLLK
Subjt:  MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLK

Query:  SLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQ
        SLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQKC VINRIIFAFSERREIDKAFQ
Subjt:  SLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQ

Query:  IFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNL
        IFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNL
Subjt:  IFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNL

Query:  EEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRKM
        EEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRKM
Subjt:  EEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRKM

XP_011654087.1 pentatricopeptide repeat-containing protein At1g11900 isoform X2 [Cucumis sativus]8.3e-18899.42Show/hide
Query:  NRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKVFKSSEAYDMVL
        NRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAP ENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKVFKSSEAYDMVL
Subjt:  NRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKVFKSSEAYDMVL

Query:  LAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYN
        LAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQKC VINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYN
Subjt:  LAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYN

Query:  IILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPS
        IILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPS
Subjt:  IILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPS

Query:  SYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRKM
        SYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRKM
Subjt:  SYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRKM

XP_038893594.1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g11900 [Benincasa hispida]2.9e-14877.47Show/hide
Query:  MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLK
        MS+Y L +HRN LH SYAN IAS PVGNSQ WPLYAI+ LSHQSS T I P+EVKVGDE LNQIIAPRENAS+C HE  DACIDK+CR+GHLAAAA LLK
Subjt:  MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLK

Query:  SLCNEKV-FKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDS-KLLECVKEIIEITSQKCTVINRIIFAFSERREIDKA
        SLC+ K+   SS+AYDMVLLAASE GDT LL +VFK +L+SCKSLSS SY SFA AFT+TNDS KLLE VKEIIE+T   C VINRIIFAFS+ REIDKA
Subjt:  SLCNEKV-FKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDS-KLLECVKEIIEITSQKCTVINRIIFAFSERREIDKA

Query:  FQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFG
         QIFNQMK LSC PDLYTYNIILDM+GRAGRVDEILH+FVSMKE+GIAPDIVSYNTLINS RKVGRLD+ ++YF+EMVA+ IEPDLLTYTALIES+GR G
Subjt:  FQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFG

Query:  NLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK
        N+EEA TLL+EMKL NI PSSYIY+SLI NSM M KVELA +LL EMKLS S+LA P+DFKRRK
Subjt:  NLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK

TrEMBL top hitse value%identityAlignment
A0A0A0LXZ6 Uncharacterized protein5.1e-19999.45Show/hide
Query:  MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLK
        MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAP ENASKCIHEIIDACIDKICRLGHLAAAAHLLK
Subjt:  MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLK

Query:  SLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQ
        SLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQKC VINRIIFAFSERREIDKAFQ
Subjt:  SLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQ

Query:  IFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNL
        IFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNL
Subjt:  IFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNL

Query:  EEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRKM
        EEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRKM
Subjt:  EEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRKM

A0A1S3CIJ8 pentatricopeptide repeat-containing protein At1g11900 isoform X11.7e-17085.99Show/hide
Query:  MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLK
        MSLY L I+RN LHYSYANRIAS PVGNSQIWPLYAIKC SHQSS T I P+EVKVGDE LNQIIAPRENAS C HEI+DACIDKIC LGHLAAAA LLK
Subjt:  MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLK

Query:  SLCNEKV-FKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKA
        SLCNEK+   SS+AYDMVLLAASERGDTPLLC+VFKVA++SCKSLSSASYMSFARAFTKTND SKLLECVKEI+E+TSQ C+VINRIIFAFS+ REIDKA
Subjt:  SLCNEKV-FKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKA

Query:  FQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFG
        FQIFNQMKCLSCTPDLYTYNI+LDM+GRAGRV+EILH+FVSMK+EGIAPDIVSYNTLINSLRKVGRLDIS+IYFREMVAMGI+PDLLTYTALIES+GRFG
Subjt:  FQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFG

Query:  NLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK
        N+EEALTLLKEMKL  I PSSYIY+SLI NS  M KVELAT+LLNEMKLS+S+LARPEDFKRRK
Subjt:  NLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK

A0A5D3DB98 Pentatricopeptide repeat-containing protein2.2e-16286.42Show/hide
Query:  NRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKV-FKSSEAYDMV
        NRIAS PVGNSQIWPLYAIKC SHQSS T I P+EVKVGDE LNQIIAPRENAS C HEI+DACIDKIC LGHLAAAA LLKSLCNEK+   SS+AYDMV
Subjt:  NRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKV-FKSSEAYDMV

Query:  LLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYT
        LLAASERGDTPLLC+VFKVA++SCKSLSSASYMSFARAFTKTND SKLLECVKEI+E+TSQ C+VINRIIFAFS+ REIDKAFQIFNQMKCLSCTPDLYT
Subjt:  LLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYT

Query:  YNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIR
        YNI+LDM+GRAGRV+EILH+FVSMK+EGIAPDIVSYNTLINSLRKVGRLDIS+IYFREMVAMGI+PDLLTYTALIES+GRFGN+EEALTLLKEMKL  I 
Subjt:  YNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIR

Query:  PSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK
        PSSYIY+SLI NS  M KVELAT+LLNEMKLS+S+LARPEDFKRRK
Subjt:  PSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK

A0A6J1HKH0 pentatricopeptide repeat-containing protein At1g11900-like2.7e-13172.02Show/hide
Query:  YHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLC
        + L   RN LHYSY N I S PVGN Q W LYAI+   HQ S T I P+E KV DE LNQI A RENAS C HE  D CIDK+CR  +L AAA LLKS C
Subjt:  YHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLC

Query:  NEKV-FKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQI
        + K+   SS+AYDMVLLAASERGDT LLC+VFK +L+S K LSS SYM+FA+AF +T+D SKLLE VKEIIE+T     VINRIIFAFSE REIDKA QI
Subjt:  NEKV-FKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQI

Query:  FNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLE
        FNQMK LS  PDLYTYNIILD +GRAGR+DEILH+FVSMKE+GIAPDIVSYNTLINSLRKVGRLD+ +IYFREMVAM IEPDLLTYTALIES+GR GN+E
Subjt:  FNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLE

Query:  EALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK
        EALTLL+EMKL +IRPSSYIY+SLI NSM + KVELA +LL EMKLS S+LA P+DFKR++
Subjt:  EALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK

A0A6J1HYS7 pentatricopeptide repeat-containing protein At1g119002.2e-13373.52Show/hide
Query:  RNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKV-F
        RN LHYSY N I S PVGN Q W LYAI+   HQSS   I P+E KV DE LNQI A RENAS+C HE  D CIDK+CR G+L AAA LLKSLC+ K+  
Subjt:  RNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKV-F

Query:  KSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQIFNQMKC
         SS+AYDMVLLAASERGDT LLC+VFK +L+S K LSS SYM+FA+AF +T+D SKLLE VKEIIE+T     VINRIIFAFSE REIDKA QIFNQMK 
Subjt:  KSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTND-SKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQIFNQMKC

Query:  LSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLL
        LS  PDLYTYNIILD +GRAGR+DEILH+FVSMKE+GIAPDIVSYNTLINSLRKVGRLD+ +IYFREMVAM IEPDLLTYTALIES+GR GN+EEALTLL
Subjt:  LSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLL

Query:  KEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK
        +EMKL +IRPSSYIY+SLI NSM + KVELA +LL EMKLS S+LA P+DFKR++
Subjt:  KEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMKLSKSELARPEDFKRRK

SwissProt top hitse value%identityAlignment
O80958 Pentatricopeptide repeat-containing protein At2g39230, mitochondrial2.4e-2023.96Show/hide
Query:  LNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKVFK-SSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKT
        +NQ+ A    A++ I+  I   I+ +C++G  + A  +L++L  EK +  S  +Y+ ++    + GDT    E ++    + KS +  ++ S    F K+
Subjt:  LNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKVFK-SSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKT

Query:  NDSKL-LECVKEIIEITSQ-KCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLIN
        N   L LE   E+  +  +        +I  F ++ ++  A+ +F+++  L   P++  YN ++      G++D  + ++  M  +GI+ D+ +Y T+I+
Subjt:  NDSKL-LECVKEIIEITSQ-KCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLIN

Query:  SLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEM
         L K G ++++   + E++ +GI PD + +  L+    + G   +A  +L+EMK  ++ P+  +Y ++I        +  A  L +EM
Subjt:  SLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEM

P0C894 Putative pentatricopeptide repeat-containing protein At2g021501.3e-2129.07Show/hide
Query:  AHLLKSLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQ--------KCTVINRIIF
        AH+L   C    + ++     ++L+ ++       C+VF V L S +++    +  F   F+   D  +LE   E I+  S+        K    N ++ 
Subjt:  AHLLKSLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQ--------KCTVINRIIF

Query:  AFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTY
         F++  + D   + F  M      P ++TYNI++D + + G V+    +F  MK  G+ PD V+YN++I+   KVGRLD +V +F EM  M  EPD++TY
Subjt:  AFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTY

Query:  TALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLI----RNSMTMKKVELATDL
         ALI  + +FG L   L   +EMK N ++P+   Y +L+    +  M  + ++   D+
Subjt:  TALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLI----RNSMTMKKVELATDL

Q5BIV3 Pentatricopeptide repeat-containing protein At1g119006.7e-5542.14Show/hide
Query:  VKVGDED----LNQIIAPRENASKCIHEI-IDACIDKICRLGHLAAAAHLLKSLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALL--SCKSLSS
        +  G+ED    L +I+   E+ SK I +I     ++K  R G+L+ A  LL+SL  + +      +  +L AA E  D  L C VF+  L+    + LSS
Subjt:  VKVGDED----LNQIIAPRENASKCIHEI-IDACIDKICRLGHLAAAAHLLKSLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALL--SCKSLSS

Query:  ASYMSFARAFTKTND-SKLLECVKEIIEIT-SQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEE-
          Y++ ARAF  T+D + L   +KEI E +   +  V+NRIIFAF+E R+IDK   I  +MK   C PD+ TYN +LD++GRAG V+EIL +  +MKE+ 
Subjt:  ASYMSFARAFTKTND-SKLLECVKEIIEIT-SQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEE-

Query:  GIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLN
         ++ +I++YNT++N +RK  R D+ ++ + EMV  GIEPDLL+YTA+I+S GR GN++E+L L  EMK   IRPS Y+YR+LI         + A  L +
Subjt:  GIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLN

Query:  EMKLSKS-ELARPEDFKR
        E+K + S +LA P+DFKR
Subjt:  EMKLSKS-ELARPEDFKR

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic8.2e-2134.76Show/hide
Query:  NRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEP
        N ++ A  +  ++D AF+I  QM      P++ +Y+ ++D   +AGR DE L++F  M+  GIA D VSYNTL++   KVGR + ++   REM ++GI+ 
Subjt:  NRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEP

Query:  DLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMK
        D++TY AL+  YG+ G  +E   +  EMK  ++ P+   Y +LI         + A ++  E K
Subjt:  DLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMK

Q9ZU27 Pentatricopeptide repeat-containing protein At1g51965, mitochondrial8.8e-2326.69Show/hide
Query:  VGDEDLNQIIAPRENASKCIHEIIDACIDKICR---------LGHLAAAAHLLK---SLCNEKVFKSSEAYDMVLLAASERGDTP----LLCEVFKVALL
        VG   L Q++A  +   K I ++    ++  CR         L  L A   L++    +   K + +   Y  ++   S+ G       L C+++   + 
Subjt:  VGDEDLNQIIAPRENASKCIHEIIDACIDKICR---------LGHLAAAAHLLK---SLCNEKVFKSSEAYDMVLLAASERGDTP----LLCEVFKVALL

Query:  SCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQ----KCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILH
          +     SYMS   +       K +E ++ + +I  +       + N +  A  + ++I     +F +MK    +PD++TYNI++   GR G VDE ++
Subjt:  SCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQ----KCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILH

Query:  IFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKV
        IF  ++     PDI+SYN+LIN L K G +D + + F+EM   G+ PD++TY+ L+E +G+   +E A +L +EM +   +P+   Y  L+       + 
Subjt:  IFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKV

Query:  ELATDLLNEMK
          A DL ++MK
Subjt:  ELATDLLNEMK

Arabidopsis top hitse value%identityAlignment
AT1G11900.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.7e-5642.14Show/hide
Query:  VKVGDED----LNQIIAPRENASKCIHEI-IDACIDKICRLGHLAAAAHLLKSLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALL--SCKSLSS
        +  G+ED    L +I+   E+ SK I +I     ++K  R G+L+ A  LL+SL  + +      +  +L AA E  D  L C VF+  L+    + LSS
Subjt:  VKVGDED----LNQIIAPRENASKCIHEI-IDACIDKICRLGHLAAAAHLLKSLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALL--SCKSLSS

Query:  ASYMSFARAFTKTND-SKLLECVKEIIEIT-SQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEE-
          Y++ ARAF  T+D + L   +KEI E +   +  V+NRIIFAF+E R+IDK   I  +MK   C PD+ TYN +LD++GRAG V+EIL +  +MKE+ 
Subjt:  ASYMSFARAFTKTND-SKLLECVKEIIEIT-SQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEE-

Query:  GIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLN
         ++ +I++YNT++N +RK  R D+ ++ + EMV  GIEPDLL+YTA+I+S GR GN++E+L L  EMK   IRPS Y+YR+LI         + A  L +
Subjt:  GIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLN

Query:  EMKLSKS-ELARPEDFKR
        E+K + S +LA P+DFKR
Subjt:  EMKLSKS-ELARPEDFKR

AT1G51965.1 ABA Overly-Sensitive 56.2e-2426.69Show/hide
Query:  VGDEDLNQIIAPRENASKCIHEIIDACIDKICR---------LGHLAAAAHLLK---SLCNEKVFKSSEAYDMVLLAASERGDTP----LLCEVFKVALL
        VG   L Q++A  +   K I ++    ++  CR         L  L A   L++    +   K + +   Y  ++   S+ G       L C+++   + 
Subjt:  VGDEDLNQIIAPRENASKCIHEIIDACIDKICR---------LGHLAAAAHLLK---SLCNEKVFKSSEAYDMVLLAASERGDTP----LLCEVFKVALL

Query:  SCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQ----KCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILH
          +     SYMS   +       K +E ++ + +I  +       + N +  A  + ++I     +F +MK    +PD++TYNI++   GR G VDE ++
Subjt:  SCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQ----KCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILH

Query:  IFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKV
        IF  ++     PDI+SYN+LIN L K G +D + + F+EM   G+ PD++TY+ L+E +G+   +E A +L +EM +   +P+   Y  L+       + 
Subjt:  IFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKV

Query:  ELATDLLNEMK
          A DL ++MK
Subjt:  ELATDLLNEMK

AT2G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.0e-2329.07Show/hide
Query:  AHLLKSLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQ--------KCTVINRIIF
        AH+L   C    + ++     ++L+ ++       C+VF V L S +++    +  F   F+   D  +LE   E I+  S+        K    N ++ 
Subjt:  AHLLKSLCNEKVFKSSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQ--------KCTVINRIIF

Query:  AFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTY
         F++  + D   + F  M      P ++TYNI++D + + G V+    +F  MK  G+ PD V+YN++I+   KVGRLD +V +F EM  M  EPD++TY
Subjt:  AFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTY

Query:  TALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLI----RNSMTMKKVELATDL
         ALI  + +FG L   L   +EMK N ++P+   Y +L+    +  M  + ++   D+
Subjt:  TALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLI----RNSMTMKKVELATDL

AT2G31400.1 genomes uncoupled 15.8e-2234.76Show/hide
Query:  NRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEP
        N ++ A  +  ++D AF+I  QM      P++ +Y+ ++D   +AGR DE L++F  M+  GIA D VSYNTL++   KVGR + ++   REM ++GI+ 
Subjt:  NRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEP

Query:  DLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMK
        D++TY AL+  YG+ G  +E   +  EMK  ++ P+   Y +LI         + A ++  E K
Subjt:  DLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEMK

AT2G39230.1 LATERAL ORGAN JUNCTION1.7e-2123.96Show/hide
Query:  LNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKVFK-SSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKT
        +NQ+ A    A++ I+  I   I+ +C++G  + A  +L++L  EK +  S  +Y+ ++    + GDT    E ++    + KS +  ++ S    F K+
Subjt:  LNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKVFK-SSEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKT

Query:  NDSKL-LECVKEIIEITSQ-KCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLIN
        N   L LE   E+  +  +        +I  F ++ ++  A+ +F+++  L   P++  YN ++      G++D  + ++  M  +GI+ D+ +Y T+I+
Subjt:  NDSKL-LECVKEIIEITSQ-KCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNIILDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLIN

Query:  SLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEM
         L K G ++++   + E++ +GI PD + +  L+    + G   +A  +L+EMK  ++ P+  +Y ++I        +  A  L +EM
Subjt:  SLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSMTMKKVELATDLLNEM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCTTTACCATCTTGCTATCCATAGGAATTTTCTGCATTACTCGTATGCTAATAGAATTGCATCTGCCCCAGTTGGTAACTCTCAAATCTGGCCGCTTTATGCCAT
CAAATGCCTTAGCCATCAGTCATCCGGTACAAAAATCTTTCCTAATGAAGTGAAAGTGGGGGATGAAGACTTGAATCAGATTATTGCTCCAAGGGAAAATGCCTCAAAAT
GTATCCATGAGATCATTGATGCTTGCATTGATAAGATTTGTCGACTGGGACATCTTGCAGCTGCAGCTCATTTACTTAAATCATTGTGCAATGAGAAAGTATTTAAATCC
TCGGAGGCATATGATATGGTTTTGCTTGCAGCAAGTGAAAGGGGAGACACTCCTCTTTTATGTGAAGTTTTTAAAGTTGCCCTACTTTCATGTAAATCACTGAGTTCTGC
TTCTTACATGAGTTTTGCCAGGGCCTTTACCAAGACAAATGATAGCAAGCTGTTGGAATGTGTCAAAGAAATAATTGAGATTACCTCTCAGAAGTGCACAGTTATAAACA
GAATTATCTTTGCCTTCTCCGAGCGTAGGGAGATTGATAAAGCCTTTCAAATATTTAATCAGATGAAGTGTCTGTCATGTACACCAGATTTGTATACATACAACATCATT
TTGGACATGGTAGGTCGTGCAGGTCGCGTGGATGAAATTCTTCATATATTTGTTTCCATGAAAGAAGAAGGCATAGCCCCAGATATTGTGTCTTATAATACATTGATAAA
TAGCTTAAGAAAGGTGGGTCGACTAGATATATCCGTGATTTACTTCAGGGAAATGGTTGCAATGGGAATTGAACCTGATTTGCTTACTTATACAGCTTTGATAGAGAGTT
ATGGTCGATTTGGAAACCTTGAAGAAGCTTTGACACTCCTCAAAGAGATGAAGCTTAATAACATCCGTCCTTCAAGCTATATTTACAGGTCCCTTATCAGAAATTCAATG
ACGATGAAGAAGGTGGAATTGGCTACGGACCTTCTCAATGAAATGAAATTAAGTAAATCAGAACTTGCTCGTCCAGAGGATTTCAAACGAAGAAAAATGTAA
mRNA sequenceShow/hide mRNA sequence
GCCGAACTCCGCTTCCCTCACCGTCTTCTCTCTTCACGTCTCACCTGCCGCGTCTCCTAGCCGCCCGTTCCTCCTCGCGTTGCCGCTTATCTGACGCCCAGACGAGCGGA
GCCGACGTCTCCTTCGCTGCAGCTGGAGTTCGTCGCGCCGCCGTGTGGTTGTTTCACACCCGTCGTCCGAAGCCTTCGCCGCTGCGACTGCGTTGCCCATCTGCCGCCGC
CGGTCCCGTCAACCACTGAAGTTAGGTACAGAGTTTGGGAATGGGTTTGGTAGTAATCTTAGCATTCATCTTTATCGAGTTAGGGCTGATTTCACGATAGGCAAAGGCTG
AGCAGAAAAGCAAGATGCAAACCCTGGAATTTTGTAACGAAGAAGAATCGCAAAAAGCAGTGTCGTGGTTTTAGGGTTTTGGGTTTAGAGTGCTTTGGCCTTCTTCCACA
TTTCACCTTATGGTTCGAACCACTTTCCCATTTACTTGCATTTTCAATTTCAGTTCTGGCCGTTTATGTCGCTTTACCATCTTGCTATCCATAGGAATTTTCTGCATTAC
TCGTATGCTAATAGAATTGCATCTGCCCCAGTTGGTAACTCTCAAATCTGGCCGCTTTATGCCATCAAATGCCTTAGCCATCAGTCATCCGGTACAAAAATCTTTCCTAA
TGAAGTGAAAGTGGGGGATGAAGACTTGAATCAGATTATTGCTCCAAGGGAAAATGCCTCAAAATGTATCCATGAGATCATTGATGCTTGCATTGATAAGATTTGTCGAC
TGGGACATCTTGCAGCTGCAGCTCATTTACTTAAATCATTGTGCAATGAGAAAGTATTTAAATCCTCGGAGGCATATGATATGGTTTTGCTTGCAGCAAGTGAAAGGGGA
GACACTCCTCTTTTATGTGAAGTTTTTAAAGTTGCCCTACTTTCATGTAAATCACTGAGTTCTGCTTCTTACATGAGTTTTGCCAGGGCCTTTACCAAGACAAATGATAG
CAAGCTGTTGGAATGTGTCAAAGAAATAATTGAGATTACCTCTCAGAAGTGCACAGTTATAAACAGAATTATCTTTGCCTTCTCCGAGCGTAGGGAGATTGATAAAGCCT
TTCAAATATTTAATCAGATGAAGTGTCTGTCATGTACACCAGATTTGTATACATACAACATCATTTTGGACATGGTAGGTCGTGCAGGTCGCGTGGATGAAATTCTTCAT
ATATTTGTTTCCATGAAAGAAGAAGGCATAGCCCCAGATATTGTGTCTTATAATACATTGATAAATAGCTTAAGAAAGGTGGGTCGACTAGATATATCCGTGATTTACTT
CAGGGAAATGGTTGCAATGGGAATTGAACCTGATTTGCTTACTTATACAGCTTTGATAGAGAGTTATGGTCGATTTGGAAACCTTGAAGAAGCTTTGACACTCCTCAAAG
AGATGAAGCTTAATAACATCCGTCCTTCAAGCTATATTTACAGGTCCCTTATCAGAAATTCAATGACGATGAAGAAGGTGGAATTGGCTACGGACCTTCTCAATGAAATG
AAATTAAGTAAATCAGAACTTGCTCGTCCAGAGGATTTCAAACGAAGAAAAATGTAACCAATTGCATAGTTTCACAGCAGACTTTGAGATTGATAAGCATGTGAATGTCA
TGGCTTTAATCTTCAAAGTGAGATCTGCCAGAGTGACTGCATGCATAGGTAATATTTCTACCTATTTATCTACTTCTCTTATATCTGACATGAAATACACAAATTGTTGT
CATAAGTTGCAATAGAAGTCTTGTGACGAATTTTGGAGATTCATTTTAGGACCAATATACAAAAAC
Protein sequenceShow/hide protein sequence
MSLYHLAIHRNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPRENASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKVFKS
SEAYDMVLLAASERGDTPLLCEVFKVALLSCKSLSSASYMSFARAFTKTNDSKLLECVKEIIEITSQKCTVINRIIFAFSERREIDKAFQIFNQMKCLSCTPDLYTYNII
LDMVGRAGRVDEILHIFVSMKEEGIAPDIVSYNTLINSLRKVGRLDISVIYFREMVAMGIEPDLLTYTALIESYGRFGNLEEALTLLKEMKLNNIRPSSYIYRSLIRNSM
TMKKVELATDLLNEMKLSKSELARPEDFKRRKM