; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0003896 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0003896
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr12:19333534..19334999
RNA-Seq ExpressionPI0003896
SyntenyPI0003896
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052071.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]2.2e-20080.53Show/hide
Query:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW
        MHSIVSATSVSSILVKGNG  GCQ TMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAP+PSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW
Subjt:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW

Query:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG
        DSDEIEAISSLFQGRIPQKP                                          VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG
Subjt:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG

Query:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI
        SLSLTIKELGHMGLPDR+LKTFCW QEQ RLFPDDRVLASTVEVL+RNHELKVP+NLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAA+K +    
Subjt:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI

Query:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME
        P+                      + +LGQREALKLNQQD+TTIIKVCTRL KFEIAEKLY WYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME
Subjt:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME

Query:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSG
        SANCPFDLPAY+VVIKLFVALGDLSRAVRYFAKLKEAGF+PTYDVYRNMITIYLVSG
Subjt:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSG

XP_004139567.1 pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus]2.7e-21479.75Show/hide
Query:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW
        M SIVSATSVSSILVKGNG  GCQNTMVHFKANSRRRPPKNLLCPRRAKLPP+PAVNQF NNKTSAP+P FTDLISSKIFQDEHEEIHA+DYTKDTDVVW
Subjt:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW

Query:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG
        DSDEIEAISSLFQGRIPQKP                                          VYKRPDFLIGLAR IRDLSPEENVSKVLNRWGPFLQKG
Subjt:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG

Query:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI
        SLSLTIKELGHMGLPDR+L TFCWAQEQHRLFPDDRVLASTVEVL+RNHELKV +NLEEFTKLASRGVLEAMMRGFI+GGSLNLAWKLLVAA+K +    
Subjt:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI

Query:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME
        P+                      + +LGQREALKLNQQD TTI+KVCTRLGKFEIAEKLYSWYVESGHEPS+VMYTALVHSRYSDRKYREALSLVWEME
Subjt:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME

Query:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR
        S NCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGF+PTY+VYRNMITIYLVSGRLAKCKEIYKEA+NAGF+MDKQITSMLLQ KR
Subjt:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR

XP_008462173.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo]5.9e-21781.19Show/hide
Query:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW
        MHSIVSATSVSSILVKGNG  GCQ TMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAP+PSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW
Subjt:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW

Query:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG
        DSDEIEAISSLFQGRIPQKP                                          VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG
Subjt:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG

Query:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI
        SLSLTIKELGHMGLPDR+LKTFCW QEQ RLFPDDRVLASTVEVL+RNHELKVP+NLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAA+K +    
Subjt:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI

Query:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME
        P+                      + +LGQREALKLNQQD+TTIIKVCTRL KFEIAEKLY WYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME
Subjt:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME

Query:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR
        SANCPFDLPAY+VVIKLFVALGDLSRAVRYFAKLKEAGF+PTYDVYRNMITIYLVSGRLAK KEIYKEA+NAGFIMDKQITSMLLQ KR
Subjt:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR

XP_022153119.1 pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Momordica charantia]3.7e-17968.28Show/hide
Query:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQ------DEHEEIHAYDY--
        M  I S++ VSSI+VKGNG   CQ +M  F AN+RRR PKNLL PRR KLPPDP VNQFL N TS   PSFTD  SS+  +      D+HEE    +Y  
Subjt:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQ------DEHEEIHAYDY--

Query:  -TKDTDVVWDSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLN
          KD +++WDSDEIEAISSLFQGRIPQKP                                          VYKRPDFLIGLARAIRDLS EENVSKVLN
Subjt:  -TKDTDVVWDSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLN

Query:  RWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVA
        RW PFL KGSLSLTI+ELGHMGL DR+L++FCWAQEQ RLFPDDRVLASTVEVL+RNHELKVPLNLEEFT+LASRGVLEAM+RGFIKGGSLNLAWKLLV 
Subjt:  RWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVA

Query:  ARKVRECWIPA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYRE
        A+K      P+                      + +LGQREALKLNQQD T I+KVCTRLGKFEIAE+LY WYVES HEPS+VMYTAL+HSRYS++KYRE
Subjt:  ARKVRECWIPA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYRE

Query:  ALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQ
        ALS+VWEME+ANCPFDLPAY+VVIKLFVALGDLSRA RYFAKLKEAGFAPTYD+YRN+ITIYLVSGRLAKCKEIYKEAKNAGFI+DKQITS LLQ
Subjt:  ALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQ

XP_038893977.1 pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida]2.1e-20678.18Show/hide
Query:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQ------DEHEEIHAYDYTK
        M SI SATSVSSILVKGNG  GCQ TM HFK NSRRR PKNLLCPRRAKLPPDPAVNQFL NKTSAP+PS TDLISS+IFQ      DEHEEIHAYDY K
Subjt:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQ------DEHEEIHAYDYTK

Query:  DTDVVWDSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWG
        DTDVVWDSDEIEAISSLFQGRIPQKP                                          VYKRPDFLIGLARAIRDLSPEENVSKVLNRWG
Subjt:  DTDVVWDSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWG

Query:  PFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARK
        PFLQKGSLSLTIKELGHMGLPDR+LKTF WAQEQ RLFPDDRVLASTVEVLARNHELKVPL+LEEFTKLASRGVLEAM+RGFIKGGSLNLAWKLLVAA+K
Subjt:  PFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARK

Query:  VRECWIPA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALS
         +    P+                      + +LGQREAL LNQQD TTIIKVCTRLGKFEIAEKLYSWYVESGHEPS+VMYTALVHSRYSDRKYREALS
Subjt:  VRECWIPA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALS

Query:  LVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR
        LVWEME+ANCPFDLPAYSV+IKLFV LGDLSRAVRYFAKLKEAGFAPTYDVYR MITIYLVSGRLAKCKEIYKEA+NAGFIMDKQITSMLLQ KR
Subjt:  LVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR

TrEMBL top hitse value%identityAlignment
A0A0A0LVM0 Uncharacterized protein1.3e-21479.75Show/hide
Query:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW
        M SIVSATSVSSILVKGNG  GCQNTMVHFKANSRRRPPKNLLCPRRAKLPP+PAVNQF NNKTSAP+P FTDLISSKIFQDEHEEIHA+DYTKDTDVVW
Subjt:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW

Query:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG
        DSDEIEAISSLFQGRIPQKP                                          VYKRPDFLIGLAR IRDLSPEENVSKVLNRWGPFLQKG
Subjt:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG

Query:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI
        SLSLTIKELGHMGLPDR+L TFCWAQEQHRLFPDDRVLASTVEVL+RNHELKV +NLEEFTKLASRGVLEAMMRGFI+GGSLNLAWKLLVAA+K +    
Subjt:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI

Query:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME
        P+                      + +LGQREALKLNQQD TTI+KVCTRLGKFEIAEKLYSWYVESGHEPS+VMYTALVHSRYSDRKYREALSLVWEME
Subjt:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME

Query:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR
        S NCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGF+PTY+VYRNMITIYLVSGRLAKCKEIYKEA+NAGF+MDKQITSMLLQ KR
Subjt:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR

A0A1S3CGD0 pentatricopeptide repeat-containing protein At2g018602.8e-21781.19Show/hide
Query:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW
        MHSIVSATSVSSILVKGNG  GCQ TMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAP+PSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW
Subjt:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW

Query:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG
        DSDEIEAISSLFQGRIPQKP                                          VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG
Subjt:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG

Query:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI
        SLSLTIKELGHMGLPDR+LKTFCW QEQ RLFPDDRVLASTVEVL+RNHELKVP+NLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAA+K +    
Subjt:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI

Query:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME
        P+                      + +LGQREALKLNQQD+TTIIKVCTRL KFEIAEKLY WYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME
Subjt:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME

Query:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR
        SANCPFDLPAY+VVIKLFVALGDLSRAVRYFAKLKEAGF+PTYDVYRNMITIYLVSGRLAK KEIYKEA+NAGFIMDKQITSMLLQ KR
Subjt:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR

A0A5D3BQZ3 Pentatricopeptide repeat-containing protein1.1e-20080.53Show/hide
Query:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW
        MHSIVSATSVSSILVKGNG  GCQ TMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAP+PSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW
Subjt:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVW

Query:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG
        DSDEIEAISSLFQGRIPQKP                                          VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG
Subjt:  DSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG

Query:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI
        SLSLTIKELGHMGLPDR+LKTFCW QEQ RLFPDDRVLASTVEVL+RNHELKVP+NLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAA+K +    
Subjt:  SLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWI

Query:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME
        P+                      + +LGQREALKLNQQD+TTIIKVCTRL KFEIAEKLY WYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME
Subjt:  PA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEME

Query:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSG
        SANCPFDLPAY+VVIKLFVALGDLSRAVRYFAKLKEAGF+PTYDVYRNMITIYLVSG
Subjt:  SANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSG

A0A6J1DI37 pentatricopeptide repeat-containing protein At2g01860 isoform X11.8e-17968.28Show/hide
Query:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQ------DEHEEIHAYDY--
        M  I S++ VSSI+VKGNG   CQ +M  F AN+RRR PKNLL PRR KLPPDP VNQFL N TS   PSFTD  SS+  +      D+HEE    +Y  
Subjt:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQ------DEHEEIHAYDY--

Query:  -TKDTDVVWDSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLN
          KD +++WDSDEIEAISSLFQGRIPQKP                                          VYKRPDFLIGLARAIRDLS EENVSKVLN
Subjt:  -TKDTDVVWDSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSKVLN

Query:  RWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVA
        RW PFL KGSLSLTI+ELGHMGL DR+L++FCWAQEQ RLFPDDRVLASTVEVL+RNHELKVPLNLEEFT+LASRGVLEAM+RGFIKGGSLNLAWKLLV 
Subjt:  RWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVA

Query:  ARKVRECWIPA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYRE
        A+K      P+                      + +LGQREALKLNQQD T I+KVCTRLGKFEIAE+LY WYVES HEPS+VMYTAL+HSRYS++KYRE
Subjt:  ARKVRECWIPA---------------------FMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYRE

Query:  ALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQ
        ALS+VWEME+ANCPFDLPAY+VVIKLFVALGDLSRA RYFAKLKEAGFAPTYD+YRN+ITIYLVSGRLAKCKEIYKEAKNAGFI+DKQITS LLQ
Subjt:  ALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQ

A0A6J1GIR9 pentatricopeptide repeat-containing protein At2g01860 isoform X21.2e-17870.21Show/hide
Query:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTP--SFTDLISSKIF------QDEHEEIHAYDY
        M S+ S T++SSILVK NG   CQ  + HF+ NSRRRPPKNLL PRR KLPPDP VNQFL  +TS P P  SF DLISS+         DE EE  A +Y
Subjt:  MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTP--SFTDLISSKIF------QDEHEEIHAYDY

Query:  ----TKDTDVVWDSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSK
              D+DVVWDS+EIEAI+SLF+GRIPQKP                                          VYKRPDFLIGLARAIRDL PEENVSK
Subjt:  ----TKDTDVVWDSDEIEAISSLFQGRIPQKP------------------------------------------VYKRPDFLIGLARAIRDLSPEENVSK

Query:  VLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKL
        VLNRW PFLQKGSLSLTIKELGHMGL DR+LKTFCW QEQ RL+PDDRVLASTVEVLARNHELK+P NL+EFTKLASRGVLEAMMRGFIKGG L+LAWKL
Subjt:  VLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKL

Query:  LVAARKVRECWIPAFMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLP
        LVAA+            +LGQREAL LNQQD + IIKV TRLGKFEIAE+LYSWYVESGHEPS+VMYTALVH+RYS+RKYREALS+VWEME+AN PFDLP
Subjt:  LVAARKVRECWIPAFMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLP

Query:  AYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR
        AYSVV+KLFVALGDLSRAVRYFAKLKEAGF PTY +YRN+ITIYL +GRLAKCKEIYKEA+NAG++MDKQITSMLLQ KR
Subjt:  AYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR

SwissProt top hitse value%identityAlignment
O64624 Pentatricopeptide repeat-containing protein At2g18940, chloroplastic2.6e-1026.39Show/hide
Query:  EALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYF
        + LK ++   +T++  C R G    A++ ++     G+EP  V Y AL+        Y EALS++ EME  +CP D   Y+ ++  +V  G    A    
Subjt:  EALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYF

Query:  AKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFI
          + + G  P    Y  +I  Y  +G+  +  +++   K AG +
Subjt:  AKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFI

Q5XET4 Pentatricopeptide repeat-containing protein At2g018601.0e-10747.58Show/hide
Query:  NSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQGRIPQKP------------
        N  ++  KNL  PRR KLPPD  VN FL      P           +  D+ E++       D  VVW+ +EIEAISSLFQ RIPQKP            
Subjt:  NSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQGRIPQKP------------

Query:  -----------------------------VYKRPDFLIGLARAIRDL-SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLF
                                     VYK P FLIGLAR I+ L S + +VS VLN+W  FL+KGSLS TI+ELGHMGLP+R+L+T+ WA++   L 
Subjt:  -----------------------------VYKRPDFLIGLARAIRDL-SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLF

Query:  PDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLL---------------------VAARKVRECWIPAFMSKLGQRE
        PD+R+LAST++VLA++HELK+   L+    LAS+ V+EAM++G I+GG LNLA KL+                     +A    +   + A + +L +RE
Subjt:  PDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLL---------------------VAARKVRECWIPAFMSKLGQRE

Query:  ALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFA
         LKL+QQD T+I+K+C +LG+FE+ E L+ W+  S  EPS+VMYT ++HSRYS++KYREA+S+VWEME +NC  DLPAY VVIKLFVAL DL RA+RY++
Subjt:  ALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFA

Query:  KLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQ
        KLKEAGF+PTYD+YR+MI++Y  SGRL KCKEI KE ++AG  +DK  +  LLQ
Subjt:  KLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQ

Q8GZ63 Pentatricopeptide repeat-containing protein At5g256301.2e-1027.56Show/hide
Query:  QDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAG
        +  T ++ V    G+   A+ ++    E+GH PS++ YT L+ +    ++Y    S+V E+E +    D   ++ VI  F   G++  AV+   K+KE G
Subjt:  QDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAG

Query:  FAPTYDVYRNMITIYLVSGRLAKCKEI
          PT   Y  +I  Y ++G+  +  E+
Subjt:  FAPTYDVYRNMITIYLVSGRLAKCKEI

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic9.6e-1320.87Show/hide
Query:  LARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGV----
        L   +  L P  ++++ L+ +   L     +L  KE    G   RSL+ F + Q Q    P++ +    + +L R  E  +   LE F ++ S+GV    
Subjt:  LARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGV----

Query:  --LEAMMRGFIKGGSLNLAWKLLVAARKVRECWIPAFMS-----------------KLG-----QREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYV
            A++  + + G    + +LL   R   E   P+ ++                  LG     + E ++ +     T++  C   G  + AE ++    
Subjt:  --LEAMMRGFIKGGSLNLAWKLLVAARKVRECWIPAFMS-----------------KLG-----QREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYV

Query:  ESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEI
        + G  P +  Y+ LV +    R+  +   L+ EM S     D+ +Y+V+++ +   G +  A+  F +++ AG  P  + Y  ++ ++  SGR    +++
Subjt:  ESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEI

Query:  YKEAKNAGFIMDKQITSMLLQ
        + E K++    D    ++L++
Subjt:  YKEAKNAGFIMDKQITSMLLQ

Q9ZUA2 Pentatricopeptide repeat-containing protein At2g017401.2e-1023.59Show/hide
Query:  MMRGFIKGGSLNLAWKLLVAARKVRECWIPAFMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREA
        ++ G+ K G L +A  L    R+VR                + LN    T +I    + G+ + AE++YS  VE   EP+ ++YT ++   +       A
Subjt:  MMRGFIKGGSLNLAWKLLVAARKVRECWIPAFMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREA

Query:  LSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQG
        +  + +M +     D+ AY V+I      G L  A      ++++   P   ++  M+  Y  SGR+     +Y +    GF  D    S ++ G
Subjt:  LSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQG

Arabidopsis top hitse value%identityAlignment
AT1G74850.1 plastid transcriptionally active 26.8e-1420.87Show/hide
Query:  LARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGV----
        L   +  L P  ++++ L+ +   L     +L  KE    G   RSL+ F + Q Q    P++ +    + +L R  E  +   LE F ++ S+GV    
Subjt:  LARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGV----

Query:  --LEAMMRGFIKGGSLNLAWKLLVAARKVRECWIPAFMS-----------------KLG-----QREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYV
            A++  + + G    + +LL   R   E   P+ ++                  LG     + E ++ +     T++  C   G  + AE ++    
Subjt:  --LEAMMRGFIKGGSLNLAWKLLVAARKVRECWIPAFMS-----------------KLG-----QREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYV

Query:  ESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEI
        + G  P +  Y+ LV +    R+  +   L+ EM S     D+ +Y+V+++ +   G +  A+  F +++ AG  P  + Y  ++ ++  SGR    +++
Subjt:  ESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEI

Query:  YKEAKNAGFIMDKQITSMLLQ
        + E K++    D    ++L++
Subjt:  YKEAKNAGFIMDKQITSMLLQ

AT2G01740.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.4e-1223.59Show/hide
Query:  MMRGFIKGGSLNLAWKLLVAARKVRECWIPAFMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREA
        ++ G+ K G L +A  L    R+VR                + LN    T +I    + G+ + AE++YS  VE   EP+ ++YT ++   +       A
Subjt:  MMRGFIKGGSLNLAWKLLVAARKVRECWIPAFMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREA

Query:  LSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQG
        +  + +M +     D+ AY V+I      G L  A      ++++   P   ++  M+  Y  SGR+     +Y +    GF  D    S ++ G
Subjt:  LSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQG

AT2G01860.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.4e-10947.58Show/hide
Query:  NSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQGRIPQKP------------
        N  ++  KNL  PRR KLPPD  VN FL      P           +  D+ E++       D  VVW+ +EIEAISSLFQ RIPQKP            
Subjt:  NSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQGRIPQKP------------

Query:  -----------------------------VYKRPDFLIGLARAIRDL-SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLF
                                     VYK P FLIGLAR I+ L S + +VS VLN+W  FL+KGSLS TI+ELGHMGLP+R+L+T+ WA++   L 
Subjt:  -----------------------------VYKRPDFLIGLARAIRDL-SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLF

Query:  PDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLL---------------------VAARKVRECWIPAFMSKLGQRE
        PD+R+LAST++VLA++HELK+   L+    LAS+ V+EAM++G I+GG LNLA KL+                     +A    +   + A + +L +RE
Subjt:  PDDRVLASTVEVLARNHELKVPLNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLL---------------------VAARKVRECWIPAFMSKLGQRE

Query:  ALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFA
         LKL+QQD T+I+K+C +LG+FE+ E L+ W+  S  EPS+VMYT ++HSRYS++KYREA+S+VWEME +NC  DLPAY VVIKLFVAL DL RA+RY++
Subjt:  ALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFA

Query:  KLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQ
        KLKEAGF+PTYD+YR+MI++Y  SGRL KCKEI KE ++AG  +DK  +  LLQ
Subjt:  KLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQ

AT5G25630.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.4e-1227.56Show/hide
Query:  QDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAG
        +  T ++ V    G+   A+ ++    E+GH PS++ YT L+ +    ++Y    S+V E+E +    D   ++ VI  F   G++  AV+   K+KE G
Subjt:  QDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAG

Query:  FAPTYDVYRNMITIYLVSGRLAKCKEI
          PT   Y  +I  Y ++G+  +  E+
Subjt:  FAPTYDVYRNMITIYLVSGRLAKCKEI

AT5G25630.2 Tetratricopeptide repeat (TPR)-like superfamily protein8.4e-1227.56Show/hide
Query:  QDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAG
        +  T ++ V    G+   A+ ++    E+GH PS++ YT L+ +    ++Y    S+V E+E +    D   ++ VI  F   G++  AV+   K+KE G
Subjt:  QDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAG

Query:  FAPTYDVYRNMITIYLVSGRLAKCKEI
          PT   Y  +I  Y ++G+  +  E+
Subjt:  FAPTYDVYRNMITIYLVSGRLAKCKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTCTATTGTCTCAGCAACTTCAGTGTCTTCCATTCTGGTGAAAGGAAATGGAGCAACTGGCTGCCAGAATACAATGGTTCATTTCAAGGCTAACTCCAGAAGACG
CCCACCTAAAAACCTCCTCTGTCCACGACGGGCCAAGCTTCCTCCTGACCCCGCCGTCAACCAATTCTTGAACAACAAAACCTCTGCCCCTACCCCATCCTTCACCGATT
TGATTTCCTCTAAGATTTTCCAAGATGAGCATGAAGAAATCCATGCTTATGATTATACCAAGGATACTGATGTTGTTTGGGATTCAGATGAAATTGAAGCTATTTCCTCA
CTCTTCCAAGGGAGAATTCCTCAGAAACCTGTCTACAAGCGTCCTGATTTTCTTATTGGCCTTGCTAGGGCGATTAGAGATCTATCTCCAGAGGAAAATGTGTCCAAGGT
TCTCAATCGGTGGGGTCCGTTTTTGCAGAAGGGATCTCTTTCATTGACAATCAAGGAACTAGGGCATATGGGTCTTCCTGATAGATCTCTAAAGACGTTCTGTTGGGCAC
AGGAACAACATCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTAAA
CTTGCAAGTCGTGGTGTGCTCGAGGCAATGATGAGAGGGTTTATCAAAGGTGGGAGCTTAAATCTTGCTTGGAAGCTTCTTGTAGCTGCGAGGAAGGTAAGAGAATGTTG
GATCCCAGCGTTTATGTCAAAGCTAGGACAAAGAGAAGCCTTGAAGTTAAACCAACAAGATAATACAACTATAATTAAGGTCTGCACAAGGCTTGGTAAATTTGAAATTG
CTGAGAAACTTTATAGCTGGTATGTTGAATCTGGACATGAACCGAGTATGGTTATGTATACTGCCTTAGTTCATAGTCGCTACTCAGACAGGAAATATAGGGAGGCATTA
TCTTTAGTGTGGGAAATGGAGTCTGCAAACTGTCCTTTTGATCTTCCTGCTTATAGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATA
CTTTGCTAAGCTTAAGGAAGCTGGTTTTGCTCCTACATATGATGTATATAGGAATATGATCACCATTTATTTAGTTTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTATA
AGGAAGCAAAGAATGCTGGATTTATCATGGATAAACAAATTACTTCAATGTTGTTGCAAGGAAAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGCATTCTATTGTCTCAGCAACTTCAGTGTCTTCCATTCTGGTGAAAGGAAATGGAGCAACTGGCTGCCAGAATACAATGGTTCATTTCAAGGCTAACTCCAGAAGACG
CCCACCTAAAAACCTCCTCTGTCCACGACGGGCCAAGCTTCCTCCTGACCCCGCCGTCAACCAATTCTTGAACAACAAAACCTCTGCCCCTACCCCATCCTTCACCGATT
TGATTTCCTCTAAGATTTTCCAAGATGAGCATGAAGAAATCCATGCTTATGATTATACCAAGGATACTGATGTTGTTTGGGATTCAGATGAAATTGAAGCTATTTCCTCA
CTCTTCCAAGGGAGAATTCCTCAGAAACCTGTCTACAAGCGTCCTGATTTTCTTATTGGCCTTGCTAGGGCGATTAGAGATCTATCTCCAGAGGAAAATGTGTCCAAGGT
TCTCAATCGGTGGGGTCCGTTTTTGCAGAAGGGATCTCTTTCATTGACAATCAAGGAACTAGGGCATATGGGTCTTCCTGATAGATCTCTAAAGACGTTCTGTTGGGCAC
AGGAACAACATCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTAAA
CTTGCAAGTCGTGGTGTGCTCGAGGCAATGATGAGAGGGTTTATCAAAGGTGGGAGCTTAAATCTTGCTTGGAAGCTTCTTGTAGCTGCGAGGAAGGTAAGAGAATGTTG
GATCCCAGCGTTTATGTCAAAGCTAGGACAAAGAGAAGCCTTGAAGTTAAACCAACAAGATAATACAACTATAATTAAGGTCTGCACAAGGCTTGGTAAATTTGAAATTG
CTGAGAAACTTTATAGCTGGTATGTTGAATCTGGACATGAACCGAGTATGGTTATGTATACTGCCTTAGTTCATAGTCGCTACTCAGACAGGAAATATAGGGAGGCATTA
TCTTTAGTGTGGGAAATGGAGTCTGCAAACTGTCCTTTTGATCTTCCTGCTTATAGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATA
CTTTGCTAAGCTTAAGGAAGCTGGTTTTGCTCCTACATATGATGTATATAGGAATATGATCACCATTTATTTAGTTTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTATA
AGGAAGCAAAGAATGCTGGATTTATCATGGATAAACAAATTACTTCAATGTTGTTGCAAGGAAAGAGATGA
Protein sequenceShow/hide protein sequence
MHSIVSATSVSSILVKGNGATGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFLNNKTSAPTPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVWDSDEIEAISS
LFQGRIPQKPVYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRSLKTFCWAQEQHRLFPDDRVLASTVEVLARNHELKVPLNLEEFTK
LASRGVLEAMMRGFIKGGSLNLAWKLLVAARKVRECWIPAFMSKLGQREALKLNQQDNTTIIKVCTRLGKFEIAEKLYSWYVESGHEPSMVMYTALVHSRYSDRKYREAL
SLVWEMESANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNMITIYLVSGRLAKCKEIYKEAKNAGFIMDKQITSMLLQGKR