; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G013900 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G013900
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCG_Chr01:27950160..27953890
RNA-Seq ExpressionClCG01G013900
SyntenyClCG01G013900
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032594.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]1.3e-15973.38Show/hide
Query:  MVRATFPFTCFFDFSSGRSCQFTNLLS-----TRNLLYY-----------------------------SYANSIACIPVSNPQIWPLYAIKCLSHQSSTT
        MVR T PFTC FDFSSGR C+FTNLLS     T  +L+                                 N IA IPV N QIWPLYAIKC SHQSS+T
Subjt:  MVRATFPFTCFFDFSSGRSCQFTNLLS-----TRNLLYY-----------------------------SYANSIACIPVSNPQIWPLYAIKCLSHQSSTT

Query:  NISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSS
        NISPDEVKVGDEVLNQII PREN S CSHE  DACIDK+C  G++AAAAQ LKSLC+ KISL+SSKAYDMV LAASERGD  LLCQVFK  +VSCKSLSS
Subjt:  NISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSS

Query:  TAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGI
         +YM+FA+AFT+TND SKLLE VKEI+E++  NC VINRIIFAFSKCREIDKA QIFNQMK LSC PDLYTYNI+LDMLGRAGR+NE+LH+FVSMK++GI
Subjt:  TAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGI

Query:  APDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM
        APDIVSYNTLINSLRKVGRLD+ LIYFREMVAM ++PDLLTYTALIESFGR GNIEEALTLL+EMKL+ ICPSSYIYKSLI NSKKMGKVELA NLL EM
Subjt:  APDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM

Query:  KLSDSKLAGLKDFKRRK
        KLS+SKLA  +DFKRRK
Subjt:  KLSDSKLAGLKDFKRRK

KAG6583593.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.3e-15983.29Show/hide
Query:  LLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDR
        LLS RN+L+YSY NSI  +PV NPQ W LYAI+   HQ STTNISPDE KV DEVLNQI   REN S CSHETFD CIDKMC+S N+ AAAQ LKSLCDR
Subjt:  LLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDR

Query:  KISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFN
        KISLSSSKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF RT+D SKLLE+VKEIIEM+FPN +VINRIIFAFS+CREIDKALQIFN
Subjt:  KISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFN

Query:  QMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEA
        QMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EEA
Subjt:  QMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEA

Query:  LTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        LTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  LTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

XP_022970322.1 pentatricopeptide repeat-containing protein At1g11900 [Cucurbita maxima]2.4e-16183.89Show/hide
Query:  NLLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCD
        +LLS RN+L+YSY NSI  IPV NPQ W LYAI+   HQSST NISPDE KV DEVLNQI   REN SRCSHETFD CIDKMC+SGN+ AAAQ LKSLCD
Subjt:  NLLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCD

Query:  RKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIF
        RKISLSSSKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF RT+D SKLLE+VKEIIEM+FPN +VINRIIFAFS+CREIDKALQIF
Subjt:  RKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIF

Query:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE
        NQMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EE
Subjt:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE

Query:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        ALTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

XP_023518970.1 pentatricopeptide repeat-containing protein At1g11900 [Cucurbita pepo subsp. pepo]5.0e-15983.06Show/hide
Query:  NLLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCD
        +LLS RN+L+YSY NSI  IPV NPQ W LYAI+   HQ STTNISPDE KV DEVLNQ    REN S CSHETFD CIDKMC+S N+ AAAQ LKSLCD
Subjt:  NLLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCD

Query:  RKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIF
        RKISLSSSKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF RT+D SKLLE+VKEIIEM+FPN +VINRIIFAFS+CREIDKALQIF
Subjt:  RKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIF

Query:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE
        NQMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EE
Subjt:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE

Query:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        ALTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

XP_038893594.1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g11900 [Benincasa hispida]3.5e-16886.76Show/hide
Query:  RNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISL
        RNLL+ SYANSIA +PV N Q WPLYAI+ LSHQSS+TNI PDEVKVGDEVLNQII PREN SRCSHETFDACIDKMC+ G++AAAAQ LKSLCD KI L
Subjt:  RNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISL

Query:  SSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKL
        SSSKAYDMV LAASE GD TLL QVFKD LVSCKSLSST+Y +FA AFTRTND +KLLE+VKEIIEM+FPNC+VINRIIFAFSKCREIDKALQIFNQMKL
Subjt:  SSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKL

Query:  LSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLL
        LSCRPDLYTYNIILDMLGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINS RKVGRLDMCL+YF+EMVA+R+EPDLLTYTALIESFGRSGNIEEA TLL
Subjt:  LSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLL

Query:  REMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        REMKL+NICPSSYIYKSLIGNS KMGKVELAMNLLKEMKLSDSKLAG KDFKRRK
Subjt:  REMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

TrEMBL top hitse value%identityAlignment
A0A1S3CIJ8 pentatricopeptide repeat-containing protein At1g11900 isoform X12.3e-15781.69Show/hide
Query:  RNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISL
        RNLL+YSYAN IA IPV N QIWPLYAIKC SHQSS+TNISPDEVKVGDEVLNQII PREN S CSHE  DACIDK+C  G++AAAAQ LKSLC+ KISL
Subjt:  RNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISL

Query:  SSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKL
        +SSKAYDMV LAASERGD  LLCQVFK  +VSCKSLSS +YM+FA+AFT+TND SKLLE VKEI+E++  NC VINRIIFAFSKCREIDKA QIFNQMK 
Subjt:  SSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKL

Query:  LSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLL
        LSC PDLYTYNI+LDMLGRAGR+NE+LH+FVSMK++GIAPDIVSYNTLINSLRKVGRLD+ LIYFREMVAM ++PDLLTYTALIESFGR GNIEEALTLL
Subjt:  LSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLL

Query:  REMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        +EMKL+ ICPSSYIYKSLI NSKKMGKVELA NLL EMKLS+SKLA  +DFKRRK
Subjt:  REMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

A0A5D3DB98 Pentatricopeptide repeat-containing protein6.4e-16073.38Show/hide
Query:  MVRATFPFTCFFDFSSGRSCQFTNLLS-----TRNLLYY-----------------------------SYANSIACIPVSNPQIWPLYAIKCLSHQSSTT
        MVR T PFTC FDFSSGR C+FTNLLS     T  +L+                                 N IA IPV N QIWPLYAIKC SHQSS+T
Subjt:  MVRATFPFTCFFDFSSGRSCQFTNLLS-----TRNLLYY-----------------------------SYANSIACIPVSNPQIWPLYAIKCLSHQSSTT

Query:  NISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSS
        NISPDEVKVGDEVLNQII PREN S CSHE  DACIDK+C  G++AAAAQ LKSLC+ KISL+SSKAYDMV LAASERGD  LLCQVFK  +VSCKSLSS
Subjt:  NISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSS

Query:  TAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGI
         +YM+FA+AFT+TND SKLLE VKEI+E++  NC VINRIIFAFSKCREIDKA QIFNQMK LSC PDLYTYNI+LDMLGRAGR+NE+LH+FVSMK++GI
Subjt:  TAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGI

Query:  APDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM
        APDIVSYNTLINSLRKVGRLD+ LIYFREMVAM ++PDLLTYTALIESFGR GNIEEALTLL+EMKL+ ICPSSYIYKSLI NSKKMGKVELA NLL EM
Subjt:  APDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM

Query:  KLSDSKLAGLKDFKRRK
        KLS+SKLA  +DFKRRK
Subjt:  KLSDSKLAGLKDFKRRK

A0A6J1D2E8 pentatricopeptide repeat-containing protein At1g11900-like isoform X12.1e-15576.3Show/hide
Query:  VRATFPFTCFFDFSSGRSCQFTNLLSTRNLLYYSYANSIACIPVSNPQIWPLYAI--KCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFD
        VR    F+   DFSSG SC+F NL STR +L+Y Y N IA  PV +PQIW  +A   K  SHQS  T+ SPDE KV DEVLNQI+  R+N SR SHETFD
Subjt:  VRATFPFTCFFDFSSGRSCQFTNLLSTRNLLYYSYANSIACIPVSNPQIWPLYAI--KCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFD

Query:  ACIDKMCQSGNVAAAAQ-LKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPN
        ACI KMC+SGN+AAAAQ LKSLCD KISLS+SKAYDMV LAASERGD +L CQVFKD LVSCKSLSS  YMN AKAF  TND  KLLE+VKE+IEM+FPN
Subjt:  ACIDKMCQSGNVAAAAQ-LKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPN

Query:  CMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAM
         +VIN+IIFAFSKCREI+KAL+IFNQMKLLSC+PDLYTYNIILDMLGRAGR++E+LH+FVSMKE GIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAM
Subjt:  CMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAM

Query:  RVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        R+EPDLLTYTA+IESFGRSGNIEEAL LLREMKL+N+ PS+YIYKSLI NS K+GKVELAM+LL E+KLS S LA  KDFKRRK
Subjt:  RVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

A0A6J1HKH0 pentatricopeptide repeat-containing protein At1g11900-like4.1e-15983.01Show/hide
Query:  LLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDR
        LLS RN+L+YSY NSI  +PV NPQ W LYAI+   HQ STTNISPDE KV DEVLNQI   REN S CSHETFD CIDKMC+S N+ AAAQ LKS CDR
Subjt:  LLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCDR

Query:  KISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFN
        KISLSSSKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF RT+D SKLLE+VKEIIEM+FPN +VINRIIFAFS+CREIDKALQIFN
Subjt:  KISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFN

Query:  QMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEA
        QMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EEA
Subjt:  QMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEA

Query:  LTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        LTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  LTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

A0A6J1HYS7 pentatricopeptide repeat-containing protein At1g119001.2e-16183.89Show/hide
Query:  NLLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCD
        +LLS RN+L+YSY NSI  IPV NPQ W LYAI+   HQSST NISPDE KV DEVLNQI   REN SRCSHETFD CIDKMC+SGN+ AAAQ LKSLCD
Subjt:  NLLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQ-LKSLCD

Query:  RKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIF
        RKISLSSSKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF RT+D SKLLE+VKEIIEM+FPN +VINRIIFAFS+CREIDKALQIF
Subjt:  RKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIF

Query:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE
        NQMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EE
Subjt:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE

Query:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        ALTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

SwissProt top hitse value%identityAlignment
Q5BIV3 Pentatricopeptide repeat-containing protein At1g119004.5e-6244.37Show/hide
Query:  DEVLNQIITPRENVSR-CSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLV--SCKSLSSTAYMNFA
        +E+L +I+   E+ S+  S   +   ++K  + GN++ A   L+SL ++ I L  S  +  +  AA E  D+ L C+VF++ L+    + LSS  Y+N A
Subjt:  DEVLNQIITPRENVSR-CSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLV--SCKSLSSTAYMNFA

Query:  KAFTRTNDCSKLLEFVKEIIEMSFP-NCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKED-GIAPDIV
        +AF  T+DC+ L   +KEI E S P   +V+NRIIFAF++ R+IDK L I  +MK   C+PD+ TYN +LD+LGRAG +NE+L +  +MKED  ++ +I+
Subjt:  KAFTRTNDCSKLLEFVKEIIEMSFP-NCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKED-GIAPDIV

Query:  SYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDS
        +YNT++N +RK  R DMCL+ + EMV   +EPDLL+YTA+I+S GRSGN++E+L L  EMK R I PS Y+Y++LI   KK G  + A+ L  E+K + S
Subjt:  SYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDS

Query:  -KLAGLKDFKR
          LAG +DFKR
Subjt:  -KLAGLKDFKR

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.3e-2427.78Show/hide
Query:  VSRCSHETFDACIDKMCQSGNVAAAAQLKSLCDRKIS----------LSSSKAYDMVFLAA-SERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRT
        V +   ET+D C         V  +    SL D+ +S          +    +Y+ V  A    + +I+    VFK+ L S  S +   Y    + F   
Subjt:  VSRCSHETFDACIDKMCQSGNVAAAAQLKSLCDRKIS----------LSSSKAYDMVFLAA-SERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRT

Query:  NDCSKLLE-FVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLIN
         +    L  F K   +   PN +  N +I  + K R+ID   ++   M L    P+L +YN++++ L R GR+ E+  +   M   G + D V+YNTLI 
Subjt:  NDCSKLLE-FVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLIN

Query:  SLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM
           K G     L+   EM+   + P ++TYT+LI S  ++GN+  A+  L +M++R +CP+   Y +L+    + G +  A  +L+EM
Subjt:  SLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM

Q9LER0 Pentatricopeptide repeat-containing protein At5g14770, mitochondrial1.6e-2228.36Show/hide
Query:  TFDACIDKMCQSGNVAAAAQLKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLE---FVKEIIE
        +++  ID  C+ GN   A   K+L D    +S         L +S   ++  + + ++D ++S         + F+    R     K+LE    ++E+ E
Subjt:  TFDACIDKMCQSGNVAAAAQLKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLE---FVKEIIE

Query:  MS-FPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYF
        MS +PN +    ++ +  K      AL +++QM +     DL  Y +++D L +AG L E    F  + ED   P++V+Y  L++ L K G L       
Subjt:  MS-FPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYF

Query:  REMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKL
         +M+   V P+++TY+++I  + + G +EEA++LLR+M+ +N+ P+ + Y ++I    K GK E+A+ L KEM+L
Subjt:  REMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKL

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial9.2e-2325.17Show/hide
Query:  DEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQLKS-LCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAF
        D+ LN          R +  T+ + I  +C  G  + A++L S + +RKI+      +  +  A  + G +    +++ + +      S   Y +    F
Subjt:  DEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQLKS-LCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAF

Query:  ---TRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSY
            R ++  ++ EF+  + +  FP+ +  N +I  F K + +++ +++F +M       +  TYNI++  L +AG  +    IF  M  DG+ P+I++Y
Subjt:  ---TRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSY

Query:  NTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK
        NTL++ L K G+L+  ++ F  +   ++EP + TY  +IE   ++G +E+   L   + L+ + P    Y ++I    + G  E A  L KEMK
Subjt:  NTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK

Q9ZU27 Pentatricopeptide repeat-containing protein At1g51965, mitochondrial5.8e-2528.94Show/hide
Query:  YDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMS-FPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCR
        Y  +    S+ G ++   ++F D           +YM+  ++        + +E + +I E     + M+ N +  A  K ++I     +F +MK     
Subjt:  YDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMS-FPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCR

Query:  PDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMK
        PD++TYNI++   GR G ++E ++IF  ++     PDI+SYN+LIN L K G +D   + F+EM    + PD++TY+ L+E FG++  +E A +L  EM 
Subjt:  PDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMK

Query:  LRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK
        ++   P+   Y  L+   +K G+   A++L  +MK
Subjt:  LRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK

Arabidopsis top hitse value%identityAlignment
AT1G11900.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.2e-6344.37Show/hide
Query:  DEVLNQIITPRENVSR-CSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLV--SCKSLSSTAYMNFA
        +E+L +I+   E+ S+  S   +   ++K  + GN++ A   L+SL ++ I L  S  +  +  AA E  D+ L C+VF++ L+    + LSS  Y+N A
Subjt:  DEVLNQIITPRENVSR-CSHETFDACIDKMCQSGNVAAAAQ-LKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLV--SCKSLSSTAYMNFA

Query:  KAFTRTNDCSKLLEFVKEIIEMSFP-NCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKED-GIAPDIV
        +AF  T+DC+ L   +KEI E S P   +V+NRIIFAF++ R+IDK L I  +MK   C+PD+ TYN +LD+LGRAG +NE+L +  +MKED  ++ +I+
Subjt:  KAFTRTNDCSKLLEFVKEIIEMSFP-NCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKED-GIAPDIV

Query:  SYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDS
        +YNT++N +RK  R DMCL+ + EMV   +EPDLL+YTA+I+S GRSGN++E+L L  EMK R I PS Y+Y++LI   KK G  + A+ L  E+K + S
Subjt:  SYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDS

Query:  -KLAGLKDFKR
          LAG +DFKR
Subjt:  -KLAGLKDFKR

AT1G51965.1 ABA Overly-Sensitive 54.1e-2628.94Show/hide
Query:  YDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMS-FPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCR
        Y  +    S+ G ++   ++F D           +YM+  ++        + +E + +I E     + M+ N +  A  K ++I     +F +MK     
Subjt:  YDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMS-FPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCR

Query:  PDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMK
        PD++TYNI++   GR G ++E ++IF  ++     PDI+SYN+LIN L K G +D   + F+EM    + PD++TY+ L+E FG++  +E A +L  EM 
Subjt:  PDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMK

Query:  LRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK
        ++   P+   Y  L+   +K G+   A++L  +MK
Subjt:  LRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK

AT1G62670.1 rna processing factor 26.6e-2425.17Show/hide
Query:  DEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQLKS-LCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAF
        D+ LN          R +  T+ + I  +C  G  + A++L S + +RKI+      +  +  A  + G +    +++ + +      S   Y +    F
Subjt:  DEVLNQIITPRENVSRCSHETFDACIDKMCQSGNVAAAAQLKS-LCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAF

Query:  ---TRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSY
            R ++  ++ EF+  + +  FP+ +  N +I  F K + +++ +++F +M       +  TYNI++  L +AG  +    IF  M  DG+ P+I++Y
Subjt:  ---TRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSY

Query:  NTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK
        NTL++ L K G+L+  ++ F  +   ++EP + TY  +IE   ++G +E+   L   + L+ + P    Y ++I    + G  E A  L KEMK
Subjt:  NTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK

AT5G14770.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-2328.36Show/hide
Query:  TFDACIDKMCQSGNVAAAAQLKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLE---FVKEIIE
        +++  ID  C+ GN   A   K+L D    +S         L +S   ++  + + ++D ++S         + F+    R     K+LE    ++E+ E
Subjt:  TFDACIDKMCQSGNVAAAAQLKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLE---FVKEIIE

Query:  MS-FPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYF
        MS +PN +    ++ +  K      AL +++QM +     DL  Y +++D L +AG L E    F  + ED   P++V+Y  L++ L K G L       
Subjt:  MS-FPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYF

Query:  REMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKL
         +M+   V P+++TY+++I  + + G +EEA++LLR+M+ +N+ P+ + Y ++I    K GK E+A+ L KEM+L
Subjt:  REMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKL

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.1e-2627.78Show/hide
Query:  VSRCSHETFDACIDKMCQSGNVAAAAQLKSLCDRKIS----------LSSSKAYDMVFLAA-SERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRT
        V +   ET+D C         V  +    SL D+ +S          +    +Y+ V  A    + +I+    VFK+ L S  S +   Y    + F   
Subjt:  VSRCSHETFDACIDKMCQSGNVAAAAQLKSLCDRKIS----------LSSSKAYDMVFLAA-SERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRT

Query:  NDCSKLLE-FVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLIN
         +    L  F K   +   PN +  N +I  + K R+ID   ++   M L    P+L +YN++++ L R GR+ E+  +   M   G + D V+YNTLI 
Subjt:  NDCSKLLE-FVKEIIEMSFPNCMVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLIN

Query:  SLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM
           K G     L+   EM+   + P ++TYT+LI S  ++GN+  A+  L +M++R +CP+   Y +L+    + G +  A  +L+EM
Subjt:  SLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGCCACTTTTCCATTTACTTGCTTTTTCGATTTCAGTTCTGGCAGGTCATGTCAGTTTACGAACTTGCTATCTACCAGGAATCTTCTGTATTACTCG
TACGCTAATAGCATTGCATGTATCCCTGTTAGTAACCCTCAAATCTGGCCGCTTTATGCCATCAAATGCCTTAGCCATCAGTCATCTACTACAAATATATCTCCT
GATGAAGTGAAAGTAGGGGATGAAGTCTTGAATCAGATCATTACCCCAAGGGAAAATGTCTCAAGGTGTAGCCATGAAACCTTTGATGCTTGCATTGATAAGATG
TGTCAATCTGGAAATGTTGCAGCTGCTGCTCAACTTAAATCTTTGTGCGATAGGAAAATATCTCTTAGCTCCTCCAAGGCATATGATATGGTTTTTCTTGCAGCA
AGTGAAAGGGGAGACATTACCCTTTTATGTCAAGTTTTTAAAGATTACCTGGTTTCCTGTAAATCATTGAGTTCAACCGCTTACATGAATTTTGCCAAGGCCTTT
ACCAGGACAAATGATTGTAGCAAGCTACTGGAATTTGTCAAAGAAATAATTGAGATGAGCTTTCCAAACTGCATGGTTATAAACAGAATTATATTTGCCTTCTCC
AAATGTAGGGAGATTGATAAAGCCCTTCAGATATTTAATCAGATGAAGCTTCTGTCATGTAGACCAGATTTGTATACGTATAACATCATTTTGGATATGCTAGGC
CGTGCAGGTCGCTTGAATGAACTTCTTCATATATTTGTTTCCATGAAAGAAGATGGCATTGCCCCAGATATTGTGTCCTATAATACATTGATAAATAGCTTAAGG
AAGGTGGGTAGACTAGATATGTGCTTGATTTACTTCAGGGAAATGGTTGCAATGAGAGTGGAACCTGATTTGCTTACTTATACTGCTTTGATAGAGAGTTTTGGT
CGATCTGGAAATATTGAGGAAGCTTTGACACTCCTCAGGGAGATGAAGCTTAGGAATATCTGTCCTTCAAGCTATATCTACAAGTCCCTTATCGGAAATTCAAAG
AAGATGGGGAAGGTGGAATTGGCTATGAACCTTCTCAAGGAAATGAAATTAAGTGATTCAAAACTTGCAGGTCTGAAGGATTTCAAACGAAGAAAAATGTAA
mRNA sequenceShow/hide mRNA sequence
CTGAAAATCACGATGGTCAGAGCGTGAGAGAAAAACAACACGAAAACGCTGGAAGTTTAGAACGGAGAAGAAGAATCGCAGAGCAGCGGCGTGGGTTTAGGGTTT
AGAGTACTTGAGCCTTCTTCATACCAGCTGTATGGTTCGAGCCACTTTTCCATTTACTTGCTTTTTCGATTTCAGTTCTGGCAGGTCATGTCAGTTTACGAACTT
GCTATCTACCAGGAATCTTCTGTATTACTCGTACGCTAATAGCATTGCATGTATCCCTGTTAGTAACCCTCAAATCTGGCCGCTTTATGCCATCAAATGCCTTAG
CCATCAGTCATCTACTACAAATATATCTCCTGATGAAGTGAAAGTAGGGGATGAAGTCTTGAATCAGATCATTACCCCAAGGGAAAATGTCTCAAGGTGTAGCCA
TGAAACCTTTGATGCTTGCATTGATAAGATGTGTCAATCTGGAAATGTTGCAGCTGCTGCTCAACTTAAATCTTTGTGCGATAGGAAAATATCTCTTAGCTCCTC
CAAGGCATATGATATGGTTTTTCTTGCAGCAAGTGAAAGGGGAGACATTACCCTTTTATGTCAAGTTTTTAAAGATTACCTGGTTTCCTGTAAATCATTGAGTTC
AACCGCTTACATGAATTTTGCCAAGGCCTTTACCAGGACAAATGATTGTAGCAAGCTACTGGAATTTGTCAAAGAAATAATTGAGATGAGCTTTCCAAACTGCAT
GGTTATAAACAGAATTATATTTGCCTTCTCCAAATGTAGGGAGATTGATAAAGCCCTTCAGATATTTAATCAGATGAAGCTTCTGTCATGTAGACCAGATTTGTA
TACGTATAACATCATTTTGGATATGCTAGGCCGTGCAGGTCGCTTGAATGAACTTCTTCATATATTTGTTTCCATGAAAGAAGATGGCATTGCCCCAGATATTGT
GTCCTATAATACATTGATAAATAGCTTAAGGAAGGTGGGTAGACTAGATATGTGCTTGATTTACTTCAGGGAAATGGTTGCAATGAGAGTGGAACCTGATTTGCT
TACTTATACTGCTTTGATAGAGAGTTTTGGTCGATCTGGAAATATTGAGGAAGCTTTGACACTCCTCAGGGAGATGAAGCTTAGGAATATCTGTCCTTCAAGCTA
TATCTACAAGTCCCTTATCGGAAATTCAAAGAAGATGGGGAAGGTGGAATTGGCTATGAACCTTCTCAAGGAAATGAAATTAAGTGATTCAAAACTTGCAGGTCT
GAAGGATTTCAAACGAAGAAAAATGTAACCAATTACAAGGTTTCATAGTGACTCTGAGGTTGATAAACCTGTGAATGGCATGACTTTAATCTTCAAACTAAGATC
TGCCAGAGTGACTGCATCTACAGTTTGGTCAGATTGGAAGTTCCCGGAGCTGTGACACCAACAAGAAGATGGCACAAACAGATGATTCGCTCTTGAATGTTATGT
GCTTGAACTCTTGGGGCCAGGGCTGATTTGGCCGTTACTGTTGGTACTATACGAATGGATTTGATTGCCAATCAGGATACGATTTGTTGAAGTTCAACGTAGTTG
ATCTAGAACTGACATGACGAAGATGAGGAACTGATCGATTTGGCAGGATCCCAAATGTAGCTTTGAGAGAACTGAATAGAAAAGGGGAATGGGGAATGGGGAATG
GTTTCCTTGAACAATAGAAATTGATTCTATTTCCTTTGCAATTGGAAGATGGCAAAGACCCATCTTATATCAAGATTCAATCTCTTTCTTGAAGCCAAGCTGTCT
TTGGCAGGAAATTTCATGTAAATCTTTAGATGATTTCTAATTCAGTTGTGCCATTTGTGAATTAATTTCGAACTTTCAAGAATTTCCTCTGCAGTCTTTTCTTTT
CTCCTGTGAATT
Protein sequenceShow/hide protein sequence
MVRATFPFTCFFDFSSGRSCQFTNLLSTRNLLYYSYANSIACIPVSNPQIWPLYAIKCLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDACIDKM
CQSGNVAAAAQLKSLCDRKISLSSSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTRTNDCSKLLEFVKEIIEMSFPNCMVINRIIFAFS
KCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFG
RSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRKM