; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G013780 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G013780
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCicolChr01:26411031..26416734
RNA-Seq ExpressionCcUC01G013780
SyntenyCcUC01G013780
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583593.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.9e-15883.01Show/hide
Query:  LLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDR
        LLS RN+L+YSY NSI  +PV N Q W LYAI+R  HQ STTNISPDE KV DEVLNQI   REN S CSHETFD CIDKMC+S N+ AAAQ LKSLCDR
Subjt:  LLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDR

Query:  KISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFN
        KISLS SKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF  T+DSSKLLE+VKEIIEM FPN +VINRIIFAFS+CREIDKALQIFN
Subjt:  KISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFN

Query:  QMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEA
        QMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EEA
Subjt:  QMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEA

Query:  LTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        LTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  LTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

XP_022964991.1 pentatricopeptide repeat-containing protein At1g11900-like [Cucurbita moschata]1.2e-15782.73Show/hide
Query:  LLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDR
        LLS RN+L+YSY NSI  +PV N Q W LYAI+R  HQ STTNISPDE KV DEVLNQI   REN S CSHETFD CIDKMC+S N+ AAAQ LKS CDR
Subjt:  LLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDR

Query:  KISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFN
        KISLS SKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF  T+DSSKLLE+VKEIIEM FPN +VINRIIFAFS+CREIDKALQIFN
Subjt:  KISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFN

Query:  QMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEA
        QMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EEA
Subjt:  QMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEA

Query:  LTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        LTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  LTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

XP_022970322.1 pentatricopeptide repeat-containing protein At1g11900 [Cucurbita maxima]3.5e-16083.61Show/hide
Query:  NLLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCD
        +LLS RN+L+YSY NSI  IPV N Q W LYAI+R  HQSST NISPDE KV DEVLNQI   REN SRCSHETFD CIDKMC+SGN+ AAAQ LKSLCD
Subjt:  NLLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCD

Query:  RKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIF
        RKISLS SKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF  T+DSSKLLE+VKEIIEM FPN +VINRIIFAFS+CREIDKALQIF
Subjt:  RKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIF

Query:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE
        NQMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EE
Subjt:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE

Query:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        ALTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

XP_023518970.1 pentatricopeptide repeat-containing protein At1g11900 [Cucurbita pepo subsp. pepo]7.2e-15882.78Show/hide
Query:  NLLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCD
        +LLS RN+L+YSY NSI  IPV N Q W LYAI+R  HQ STTNISPDE KV DEVLNQ    REN S CSHETFD CIDKMC+S N+ AAAQ LKSLCD
Subjt:  NLLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCD

Query:  RKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIF
        RKISLS SKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF  T+DSSKLLE+VKEIIEM FPN +VINRIIFAFS+CREIDKALQIF
Subjt:  RKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIF

Query:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE
        NQMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EE
Subjt:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE

Query:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        ALTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

XP_038893594.1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g11900 [Benincasa hispida]2.2e-16786.76Show/hide
Query:  RNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDRKISL
        RNLL+ SYANSIA +PV NSQ WPLYAI+ LSHQSS+TNI PDEVKVGDEVLNQII PREN SRCSHETFD CIDKMC+ G++AAAAQ LKSLCD KI L
Subjt:  RNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDRKISL

Query:  SPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFNQMKL
        S SKAYDMV LAASE GD TLL QVFKD LVSCKSLSST+Y +FA AFT TNDS+KLLE+VKEIIEM FPNCIVINRIIFAFSKCREIDKALQIFNQMKL
Subjt:  SPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFNQMKL

Query:  LSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLL
        LSCRPDLYTYNIILDMLGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINS RKVGRLDMCL+YF+EMVA+R+EPDLLTYTALIESFGRSGNIEEA TLL
Subjt:  LSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLL

Query:  REMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        REMKL+NICPSSYIYKSLIGNS KMGKVELAMNLLKEMKLSDSKLAG KDFKRRK
Subjt:  REMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

TrEMBL top hitse value%identityAlignment
A0A1S3CIJ8 pentatricopeptide repeat-containing protein At1g11900 isoform X11.2e-15581.46Show/hide
Query:  RNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDRKISL
        RNLL+YSYAN IA IPV NSQIWPLYAIK  SHQSS+TNISPDEVKVGDEVLNQII PREN S CSHE  D CIDK+C  G++AAAAQ LKSLC+ KISL
Subjt:  RNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDRKISL

Query:  SPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFNQMKL
        + SKAYDMV LAASERGD  LLCQVFK  +VSCKSLSS +YM+FA+AFT TNDSSKLLE VKEI+E+   NC VINRIIFAFSKCREIDKA QIFNQMK 
Subjt:  SPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFNQMKL

Query:  LSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLL
        LSC PDLYTYNI+LDMLGRAGR+NE+LH+FVSMK++GIAPDIVSYNTLINSLRKVGRLD+ LIYFREMVAM ++PDLLTYTALIESFGR GNIEEALTLL
Subjt:  LSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLL

Query:  REMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRKR
        +EMKL+ ICPSSYIYKSLI NSKKMGKVELA NLL EMKLS+SKLA  +DFKRRKR
Subjt:  REMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRKR

A0A5D3DB98 Pentatricopeptide repeat-containing protein1.7e-15772.97Show/hide
Query:  MVRATFPFTCFFDFSSGRSCQFMNLLS-----TRNLLYY-----------------------------SYANSIACIPVSNSQIWPLYAIKRLSHQSSTT
        MVR T PFTC FDFSSGR C+F NLLS     T  +L+                                 N IA IPV NSQIWPLYAIK  SHQSS+T
Subjt:  MVRATFPFTCFFDFSSGRSCQFMNLLS-----TRNLLYY-----------------------------SYANSIACIPVSNSQIWPLYAIKRLSHQSSTT

Query:  NISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDRKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSS
        NISPDEVKVGDEVLNQII PREN S CSHE  D CIDK+C  G++AAAAQ LKSLC+ KISL+ SKAYDMV LAASERGD  LLCQVFK  +VSCKSLSS
Subjt:  NISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDRKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSS

Query:  TAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGI
         +YM+FA+AFT TNDSSKLLE VKEI+E+   NC VINRIIFAFSKCREIDKA QIFNQMK LSC PDLYTYNI+LDMLGRAGR+NE+LH+FVSMK++GI
Subjt:  TAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGI

Query:  APDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM
        APDIVSYNTLINSLRKVGRLD+ LIYFREMVAM ++PDLLTYTALIESFGR GNIEEALTLL+EMKL+ ICPSSYIYKSLI NSKKMGKVELA NLL EM
Subjt:  APDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM

Query:  KLSDSKLAGLKDFKRRKR
        KLS+SKLA  +DFKRRKR
Subjt:  KLSDSKLAGLKDFKRRKR

A0A6J1D2E8 pentatricopeptide repeat-containing protein At1g11900-like isoform X13.6e-15576.36Show/hide
Query:  VRATFPFTCFFDFSSGRSCQFMNLLSTRNLLYYSYANSIACIPVSNSQIWPLYAI--KRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFD
        VR    F+   DFSSG SC+F NL STR +L+Y Y N IA  PV + QIW  +A   KR SHQS  T+ SPDE KV DEVLNQI+  R+N SR SHETFD
Subjt:  VRATFPFTCFFDFSSGRSCQFMNLLSTRNLLYYSYANSIACIPVSNSQIWPLYAI--KRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFD

Query:  TCIDKMCQSGNVAAAAQ-LKSLCDRKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPN
         CI KMC+SGN+AAAAQ LKSLCD KISLS SKAYDMV LAASERGD +L CQVFKD LVSCKSLSS  YMN AKAF +TND  KLLE+VKE+IEM FPN
Subjt:  TCIDKMCQSGNVAAAAQ-LKSLCDRKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPN

Query:  CIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAM
         IVIN+IIFAFSKCREI+KAL+IFNQMKLLSC+PDLYTYNIILDMLGRAGR++E+LH+FVSMKE GIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAM
Subjt:  CIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAM

Query:  RVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRKR
        R+EPDLLTYTA+IESFGRSGNIEEAL LLREMKL+N+ PS+YIYKSLI NS K+GKVELAM+LL E+KLS S LA  KDFKRRKR
Subjt:  RVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRKR

A0A6J1HKH0 pentatricopeptide repeat-containing protein At1g11900-like6.0e-15882.73Show/hide
Query:  LLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDR
        LLS RN+L+YSY NSI  +PV N Q W LYAI+R  HQ STTNISPDE KV DEVLNQI   REN S CSHETFD CIDKMC+S N+ AAAQ LKS CDR
Subjt:  LLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDR

Query:  KISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFN
        KISLS SKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF  T+DSSKLLE+VKEIIEM FPN +VINRIIFAFS+CREIDKALQIFN
Subjt:  KISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFN

Query:  QMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEA
        QMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EEA
Subjt:  QMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEA

Query:  LTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        LTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  LTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

A0A6J1HYS7 pentatricopeptide repeat-containing protein At1g119001.7e-16083.61Show/hide
Query:  NLLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCD
        +LLS RN+L+YSY NSI  IPV N Q W LYAI+R  HQSST NISPDE KV DEVLNQI   REN SRCSHETFD CIDKMC+SGN+ AAAQ LKSLCD
Subjt:  NLLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGNVAAAAQ-LKSLCD

Query:  RKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIF
        RKISLS SKAYDMV LAASERGD TLLCQVFKD LVS K LSST+YMNFAKAF  T+DSSKLLE+VKEIIEM FPN +VINRIIFAFS+CREIDKALQIF
Subjt:  RKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIF

Query:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE
        NQMKLLS RPDLYTYNIILD LGRAGR++E+LH+FVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMR+EPDLLTYTALIESFGRSGN+EE
Subjt:  NQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEE

Query:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK
        ALTLLREMKL++I PSSYIYKSLI NS K+GKVELAMNLLKEMKLS SKLAG KDFKR++
Subjt:  ALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRK

SwissProt top hitse value%identityAlignment
P0C894 Putative pentatricopeptide repeat-containing protein At2g021507.1e-2331.67Show/hide
Query:  CQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMR----FPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLG
        C VF D L S +++    +  F   F+   D   L E ++   +M+    FP     N ++  F+K  + D   + F  M     RP ++TYNI++D + 
Subjt:  CQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMR----FPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLG

Query:  RAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSL
        + G +     +F  MK  G+ PD V+YN++I+   KVGRLD  + +F EM  M  EPD++TY ALI  F + G +   L   REMK   + P+   Y +L
Subjt:  RAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSL

Query:  IGNSKKMGKVELAMNLLKEMK
        +    K G ++ A+    +M+
Subjt:  IGNSKKMGKVELAMNLLKEMK

Q5BIV3 Pentatricopeptide repeat-containing protein At1g119002.9e-6143.95Show/hide
Query:  DEVLNQIITPRENVSR-CSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDRKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLV--SCKSLSSTAYMNFA
        +E+L +I+   E+ S+  S   +   ++K  + GN++ A   L+SL ++ I L P   +  +  AA E  D+ L C+VF++ L+    + LSS  Y+N A
Subjt:  DEVLNQIITPRENVSR-CSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDRKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLV--SCKSLSSTAYMNFA

Query:  KAFTTTNDSSKLLEFVKEIIEMRFP-NCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKED-GIAPDIV
        +AF  T+D + L   +KEI E   P   IV+NRIIFAF++ R+IDK L I  +MK   C+PD+ TYN +LD+LGRAG +NE+L +  +MKED  ++ +I+
Subjt:  KAFTTTNDSSKLLEFVKEIIEMRFP-NCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKED-GIAPDIV

Query:  SYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDS
        +YNT++N +RK  R DMCL+ + EMV   +EPDLL+YTA+I+S GRSGN++E+L L  EMK R I PS Y+Y++LI   KK G  + A+ L  E+K + S
Subjt:  SYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDS

Query:  -KLAGLKDFKRRKR
          LAG +DFKR  R
Subjt:  -KLAGLKDFKRRKR

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.3e-2428.12Show/hide
Query:  VSRCSHETFDTCIDKMCQSGNVAAAAQLKSLCDRKIS----------LSPSKAYDMVFLAA-SERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFT-T
        V +   ET+D C         V  +    SL D+ +S          +    +Y+ V  A    + +I+    VFK+ L S  S +   Y    + F   
Subjt:  VSRCSHETFDTCIDKMCQSGNVAAAAQLKSLCDRKIS----------LSPSKAYDMVFLAA-SERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFT-T

Query:  TNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLIN
         N    L  F K   +   PN +  N +I  + K R+ID   ++   M L    P+L +YN++++ L R GR+ E+  +   M   G + D V+YNTLI 
Subjt:  TNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLIN

Query:  SLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM
           K G     L+   EM+   + P ++TYT+LI S  ++GN+  A+  L +M++R +CP+   Y +L+    + G +  A  +L+EM
Subjt:  SLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic2.1e-2234.76Show/hide
Query:  NRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEP
        N ++ A  K  ++D A +I  QM +    P++ +Y+ ++D   +AGR +E L++F  M+  GIA D VSYNTL++   KVGR +  L   REM ++ ++ 
Subjt:  NRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEP

Query:  DLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK
        D++TY AL+  +G+ G  +E   +  EMK  ++ P+   Y +LI    K G  + AM + +E K
Subjt:  DLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK

Q9ZU27 Pentatricopeptide repeat-containing protein At1g51965, mitochondrial2.9e-2428.51Show/hide
Query:  YDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMR-FPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCR
        Y  +    S+ G ++   ++F D           +YM+  ++      + + +E + +I E     + ++ N +  A  K ++I     +F +MK     
Subjt:  YDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMR-FPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCR

Query:  PDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMK
        PD++TYNI++   GR G ++E ++IF  ++     PDI+SYN+LIN L K G +D   + F+EM    + PD++TY+ L+E FG++  +E A +L  EM 
Subjt:  PDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMK

Query:  LRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK
        ++   P+   Y  L+   +K G+   A++L  +MK
Subjt:  LRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK

Arabidopsis top hitse value%identityAlignment
AT1G11900.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-6243.95Show/hide
Query:  DEVLNQIITPRENVSR-CSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDRKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLV--SCKSLSSTAYMNFA
        +E+L +I+   E+ S+  S   +   ++K  + GN++ A   L+SL ++ I L P   +  +  AA E  D+ L C+VF++ L+    + LSS  Y+N A
Subjt:  DEVLNQIITPRENVSR-CSHETFDTCIDKMCQSGNVAAAAQ-LKSLCDRKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLV--SCKSLSSTAYMNFA

Query:  KAFTTTNDSSKLLEFVKEIIEMRFP-NCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKED-GIAPDIV
        +AF  T+D + L   +KEI E   P   IV+NRIIFAF++ R+IDK L I  +MK   C+PD+ TYN +LD+LGRAG +NE+L +  +MKED  ++ +I+
Subjt:  KAFTTTNDSSKLLEFVKEIIEMRFP-NCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKED-GIAPDIV

Query:  SYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDS
        +YNT++N +RK  R DMCL+ + EMV   +EPDLL+YTA+I+S GRSGN++E+L L  EMK R I PS Y+Y++LI   KK G  + A+ L  E+K + S
Subjt:  SYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDS

Query:  -KLAGLKDFKRRKR
          LAG +DFKR  R
Subjt:  -KLAGLKDFKRRKR

AT1G51965.1 ABA Overly-Sensitive 52.0e-2528.51Show/hide
Query:  YDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMR-FPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCR
        Y  +    S+ G ++   ++F D           +YM+  ++      + + +E + +I E     + ++ N +  A  K ++I     +F +MK     
Subjt:  YDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMR-FPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCR

Query:  PDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMK
        PD++TYNI++   GR G ++E ++IF  ++     PDI+SYN+LIN L K G +D   + F+EM    + PD++TY+ L+E FG++  +E A +L  EM 
Subjt:  PDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMK

Query:  LRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK
        ++   P+   Y  L+   +K G+   A++L  +MK
Subjt:  LRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK

AT2G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.0e-2431.67Show/hide
Query:  CQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMR----FPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLG
        C VF D L S +++    +  F   F+   D   L E ++   +M+    FP     N ++  F+K  + D   + F  M     RP ++TYNI++D + 
Subjt:  CQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMR----FPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLG

Query:  RAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSL
        + G +     +F  MK  G+ PD V+YN++I+   KVGRLD  + +F EM  M  EPD++TY ALI  F + G +   L   REMK   + P+   Y +L
Subjt:  RAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSL

Query:  IGNSKKMGKVELAMNLLKEMK
        +    K G ++ A+    +M+
Subjt:  IGNSKKMGKVELAMNLLKEMK

AT2G31400.1 genomes uncoupled 11.5e-2334.76Show/hide
Query:  NRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEP
        N ++ A  K  ++D A +I  QM +    P++ +Y+ ++D   +AGR +E L++F  M+  GIA D VSYNTL++   KVGR +  L   REM ++ ++ 
Subjt:  NRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEP

Query:  DLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK
        D++TY AL+  +G+ G  +E   +  EMK  ++ P+   Y +LI    K G  + AM + +E K
Subjt:  DLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMK

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.1e-2628.12Show/hide
Query:  VSRCSHETFDTCIDKMCQSGNVAAAAQLKSLCDRKIS----------LSPSKAYDMVFLAA-SERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFT-T
        V +   ET+D C         V  +    SL D+ +S          +    +Y+ V  A    + +I+    VFK+ L S  S +   Y    + F   
Subjt:  VSRCSHETFDTCIDKMCQSGNVAAAAQLKSLCDRKIS----------LSPSKAYDMVFLAA-SERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFT-T

Query:  TNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLIN
         N    L  F K   +   PN +  N +I  + K R+ID   ++   M L    P+L +YN++++ L R GR+ E+  +   M   G + D V+YNTLI 
Subjt:  TNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLIN

Query:  SLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM
           K G     L+   EM+   + P ++TYT+LI S  ++GN+  A+  L +M++R +CP+   Y +L+    + G +  A  +L+EM
Subjt:  SLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREMKLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGCCACTTTTCCATTTACTTGCTTTTTCGATTTCAGTTCTGGCAGGTCATGTCAGTTTATGAACTTGCTATCTACCAGGAATCTTCTGTATTACTCGTACGC
TAATAGCATTGCATGTATCCCTGTTAGTAACTCTCAAATCTGGCCACTTTATGCCATCAAACGCCTTAGCCATCAGTCATCTACTACAAATATATCTCCTGATGAAGTGA
AAGTAGGGGATGAAGTCTTGAATCAGATTATTACCCCAAGGGAAAATGTCTCAAGGTGTAGCCATGAAACCTTTGATACTTGCATTGATAAGATGTGTCAATCTGGAAAT
GTTGCAGCTGCTGCTCAACTTAAATCTTTGTGCGATAGGAAAATATCTCTTAGCCCCTCCAAGGCATATGATATGGTTTTTCTTGCAGCAAGTGAAAGGGGAGACATTAC
CCTTTTATGTCAAGTTTTTAAAGATTACCTGGTTTCCTGTAAATCATTGAGTTCAACCGCTTACATGAATTTTGCCAAGGCCTTTACCACGACAAATGATAGTAGCAAGC
TACTGGAATTTGTCAAAGAAATAATTGAGATGAGATTTCCAAACTGCATAGTTATAAACAGAATTATATTTGCCTTCTCCAAATGTAGGGAGATTGATAAAGCCCTTCAG
ATATTTAATCAGATGAAGCTTCTGTCATGTAGACCAGATTTGTATACGTATAACATCATTTTGGATATGCTAGGCCGTGCAGGTCGCTTGAATGAACTTCTTCATATATT
TGTTTCCATGAAAGAAGATGGCATTGCCCCAGATATTGTGTCCTATAATACATTGATAAATAGCTTAAGGAAGGTGGGTAGACTAGATATGTGCTTGATTTACTTCAGGG
AAATGGTTGCAATGAGAGTGGAACCTGATTTGCTTACTTATACTGCTTTGATAGAGAGTTTTGGTCGATCTGGAAACATTGAGGAAGCTTTGACACTCCTCAGGGAGATG
AAGCTTAGGAATATCTGTCCTTCAAGCTATATCTACAAATCCCTTATCGGAAATTCAAAGAAGATGGGGAAGGTGGAATTGGCTATGAACCTTCTCAAGGAAATGAAATT
AAGTGATTCAAAACTTGCAGGTCTGAAGGATTTCAAACGAAGAAAAAGGTAA
mRNA sequenceShow/hide mRNA sequence
TGAAGAGGCATTTAATTTACGGATAGTATAATGGTAATTTTAGCATTCATCTTCATCGAGTTAGGGCTGAAAATCACGATGGTCAGAGCGTGAGAGAAAAACAACACGAA
AACGCTGGAAGTTTAGAACGGAGAAGAAGAATCGCATAGCAGCGGCGTGGGTTTAGGGTTTAGAGTACTTGAGCCTTCTTCATACCAGCTGTATGGTTCGAGCCACTTTT
CCATTTACTTGCTTTTTCGATTTCAGTTCTGGCAGGTCATGTCAGTTTATGAACTTGCTATCTACCAGGAATCTTCTGTATTACTCGTACGCTAATAGCATTGCATGTAT
CCCTGTTAGTAACTCTCAAATCTGGCCACTTTATGCCATCAAACGCCTTAGCCATCAGTCATCTACTACAAATATATCTCCTGATGAAGTGAAAGTAGGGGATGAAGTCT
TGAATCAGATTATTACCCCAAGGGAAAATGTCTCAAGGTGTAGCCATGAAACCTTTGATACTTGCATTGATAAGATGTGTCAATCTGGAAATGTTGCAGCTGCTGCTCAA
CTTAAATCTTTGTGCGATAGGAAAATATCTCTTAGCCCCTCCAAGGCATATGATATGGTTTTTCTTGCAGCAAGTGAAAGGGGAGACATTACCCTTTTATGTCAAGTTTT
TAAAGATTACCTGGTTTCCTGTAAATCATTGAGTTCAACCGCTTACATGAATTTTGCCAAGGCCTTTACCACGACAAATGATAGTAGCAAGCTACTGGAATTTGTCAAAG
AAATAATTGAGATGAGATTTCCAAACTGCATAGTTATAAACAGAATTATATTTGCCTTCTCCAAATGTAGGGAGATTGATAAAGCCCTTCAGATATTTAATCAGATGAAG
CTTCTGTCATGTAGACCAGATTTGTATACGTATAACATCATTTTGGATATGCTAGGCCGTGCAGGTCGCTTGAATGAACTTCTTCATATATTTGTTTCCATGAAAGAAGA
TGGCATTGCCCCAGATATTGTGTCCTATAATACATTGATAAATAGCTTAAGGAAGGTGGGTAGACTAGATATGTGCTTGATTTACTTCAGGGAAATGGTTGCAATGAGAG
TGGAACCTGATTTGCTTACTTATACTGCTTTGATAGAGAGTTTTGGTCGATCTGGAAACATTGAGGAAGCTTTGACACTCCTCAGGGAGATGAAGCTTAGGAATATCTGT
CCTTCAAGCTATATCTACAAATCCCTTATCGGAAATTCAAAGAAGATGGGGAAGGTGGAATTGGCTATGAACCTTCTCAAGGAAATGAAATTAAGTGATTCAAAACTTGC
AGGTCTGAAGGATTTCAAACGAAGAAAAAGGTAACCAATTACAAGGTTTCATAGTGACTCTGAGGTTGATAAACCTGTGAATGGCATGACTTTAATCTTCAAACTGAGAT
CTGCCAGAGTGATTCCATCTACAGGTCAATGAATCTACTTCGCATCTTTGTACGATTTAGTGGGGAATGAGATCAGTCCGGATGCAACTATGTTGGAGGTCAAATGAAAG
GTCT
Protein sequenceShow/hide protein sequence
MVRATFPFTCFFDFSSGRSCQFMNLLSTRNLLYYSYANSIACIPVSNSQIWPLYAIKRLSHQSSTTNISPDEVKVGDEVLNQIITPRENVSRCSHETFDTCIDKMCQSGN
VAAAAQLKSLCDRKISLSPSKAYDMVFLAASERGDITLLCQVFKDYLVSCKSLSSTAYMNFAKAFTTTNDSSKLLEFVKEIIEMRFPNCIVINRIIFAFSKCREIDKALQ
IFNQMKLLSCRPDLYTYNIILDMLGRAGRLNELLHIFVSMKEDGIAPDIVSYNTLINSLRKVGRLDMCLIYFREMVAMRVEPDLLTYTALIESFGRSGNIEEALTLLREM
KLRNICPSSYIYKSLIGNSKKMGKVELAMNLLKEMKLSDSKLAGLKDFKRRKR