; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008114 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008114
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionpentatricopeptide repeat-containing protein At2g01390
Genome locationchr9:12554949..12558403
RNA-Seq ExpressionLag0008114
SyntenyLag0008114
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011168.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.3e-28785.15Show/hide
Query:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD
        +CSN FS L+ NYVV SAICKRIYQNIS+K LHS HQ KQEKP   FSRK RKG K VKKEEV    YTRDTVRNIYNILRNCSWGSAQGH+ETLPIRWD
Subjt:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD

Query:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC
        SYLINQVLKTHPPLEKAWLFFNWASRLQ ++HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSN GDVDGAI+VW+EMKANGC
Subjt:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC

Query:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM
        YPTVV+YTAYIKILLD+ +V++ATD Y+EMLQSGLSPNCCTYT+LMEYLIGE K KEALDIFHKMQDAG YPDKAACNILIQKCCKS EMLVMTQILEYM
Subjt:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM

Query:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI
        KEKRLVLRYPVFVEA E LKSCSV+ TLL QVNPHIE+ESVSKG+V+ DVSTS  +I P+VD ELV  LL+E+KLIAVDHILIGM +KNIQLDS II SI
Subjt:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI

Query:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY
        IEVNCKRNRPN ALLAF+YCLKNGV ++RNLYL LIG+LIRSSIYS LLEIVQEMY +GHCLG+YHATLILYRLGKAGKPQYARKVFNMLP+ELKCTATY
Subjt:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY

Query:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGEL
        TALV AYF  G+ GKGLKIYE MRKKGFTPSLGTYNVLL+GLVKS RVVELDIYRREKK  EISHHSHH TILEEERICDLLFGEL
Subjt:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGEL

XP_022928072.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita moschata]1.6e-28885.37Show/hide
Query:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD
        +CSN FS L+ NYVV SAICKRIYQNIS+K LHS HQ KQEKP   FSRK RKG K VKKEEV    YTRDTVRNIYNILRNCSW SAQGH+ETLPIRWD
Subjt:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD

Query:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC
        SYLINQVLKTHPPLEKAWLFFNWASRLQ ++HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSN GDVDGAI+VW+EMKANGC
Subjt:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC

Query:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM
        YPTVV+YTAYIKILLD+ +V++ATD Y+EMLQSGLSPNCCTYT+LMEYLIGE K KEALDIFHKMQDAG YPDKAACNILIQKCCKS EMLVMTQILEYM
Subjt:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM

Query:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI
        KEKRLVLRYPVFVEA E LKSCSV+ TLL QVNPHIE+ESVSKG+V+ DVSTS  +I P+VD ELV  LL+E+KLIAVDHILIGM +KNIQLDS II SI
Subjt:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI

Query:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY
        IEVNCKRNRPN ALLAF+YCLKNGV ++RNLYL LIG+LIRSSIYS LLEIVQEMY +GHCLG+YHATLILYRLGKAGKPQYARKVFNMLP+ELKCTATY
Subjt:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY

Query:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS
        TALV AYFS G+ GKGLKIYETMRKKGFTPSLGTYNVLL+GLVKS RVVELDIYRREKK  EISHHSHH TILEEERICDLLFGELVS
Subjt:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS

XP_022971714.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita maxima]4.6e-29186.05Show/hide
Query:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD
        +CSNSFS L+ NYVV SAICKRIYQNIS+K LHS HQ KQEKP   FSRK RKG K VKKEEV    YTRDTVRNIYNILRNCSWGSAQGH+ETLPIRWD
Subjt:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD

Query:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC
        SYLINQVLKTHPPLEKAWLFFNWASRLQ +KHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSN GDVDGAI+VW+EMKANGC
Subjt:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC

Query:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM
        YPTVV+YTAYIKILLD+ +V++ATDTY+EMLQSGLSPNCCTYT+LMEYLIGE K KEALDIFHKMQDAGVYPDKAACNILIQKCCKS EMLVMTQILEYM
Subjt:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM

Query:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI
        KEKRLVLRYPVFVEA E LKSCSV+ TLL QVNPHIE+ESVSKG+V+ DVSTS  +I P+VD ELV  LL+E+KLIAVDHILIGM +KNIQLDS II SI
Subjt:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI

Query:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY
        IEVNCKRNRPN ALLAF+YCLKNGV ++RNLYL LIG+LIRSSIYS LLEIVQ+MY +GHCLG+YHATLILYRLGKAGKPQYARKVFNMLP+ELKCTATY
Subjt:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY

Query:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS
        TALV AYFS G+ GKGLKIYETMRKKGFTPSLGTYNVLL+GLVKS RVVELDIYRREKK  EISHHSHH TILEEERICDLLFGELVS
Subjt:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS

XP_023512200.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita pepo subsp. pepo]2.8e-28884.52Show/hide
Query:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD
        +CSN FS  + NYVV SAICKR+YQNIS+K LHS HQ KQEKP   F+RK RKG K VKKEE+ P  YTRDTVRNIYNILRNCSWG AQGH+ETLPIRWD
Subjt:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD

Query:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC
        SYLINQVLKTHPPLEKAWLFFNWASRLQ ++HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSN GDVDGAI+VW+EMKANGC
Subjt:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC

Query:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM
        YPTVV+YTAYIKILLD+ +V++ATD Y+EMLQSGLSPNCCTYT+LMEYLIGE K KEALDIFHKMQDAG YPDKAACNILIQKCCKS EMLVMTQILEYM
Subjt:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM

Query:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI
        KEKRLVLRYPVFVEA E LKSCSV+ TLL QVNPHIE+ESVSKG+V+ DVSTS  +I P+VD ELV  LL+E+KLIAVDHILIGM +KNIQLDS II SI
Subjt:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI

Query:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY
        IEVNCKRNRPN ALLAF+YCLKNGV ++RNLYLGLIG+LIRSSIYSKLLE+VQEMY +GHCLG+YHATL LYRLGKAGKPQYARKVFNMLP+ELKCTATY
Subjt:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY

Query:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS
        TALV AYFS G+ GKGLKIYETMRKKGFTPSLGTYNVLL+GLVKS RV ELDIYRREKK  EISHHSHH TILEEERICDLLFGE VS
Subjt:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS

XP_038901985.1 pentatricopeptide repeat-containing protein At2g01390 [Benincasa hispida]1.1e-28785.15Show/hide
Query:  CSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDS
        CSNSFS LL NYVV SAI KRIYQNIS+K LHSFHQ KQEKPIK F+RKSRKG KVVKKEEV  R YTRDTVRNIYNILR CSWGSAQ HLE LPIRWDS
Subjt:  CSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDS

Query:  YLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCY
        YLINQVLKTHPPLEK WLFFNWASRLQ++KHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEK IKIDAVTYTSLMHWRSN GDV+GAIKVWKEMKANGCY
Subjt:  YLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCY

Query:  PTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMK
        PTVV+YTAYIKILLDSDQ+KEATDTY+EMLQSGL PNCCTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKS E LVMTQILEYMK
Subjt:  PTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMK

Query:  EKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSII
        +KRLVLRYPVFVEA ETLKSCSV+ TLLRQVNPHIE+ESVSKG+V+ +VST S I+PPNVD EL+ ILL+E+KL A+D++L G++++NIQLDS II SI 
Subjt:  EKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSII

Query:  EVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYT
        EVNCK NRPN ALLAFNYCLK+GV+I+R LYL LIGILIRSSIY KLLEIVQ+MY QGHCLG+YHATLILYRLGKAGKPQYARKVFN+LP+ELKCTATYT
Subjt:  EVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYT

Query:  ALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELV
        ALV AYFS G+SGKGLKIYETMRKKGF PSLGTYNVLL GL K GR+ EL IYR+E+KS EISHHSH  TILEEERICDLL+GELV
Subjt:  ALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELV

TrEMBL top hitse value%identityAlignment
A0A0A0LJM3 Uncharacterized protein1.0e-28083.25Show/hide
Query:  NSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDSYL
        N FSLLL NYVV SAI KRIYQNIS+K LHS HQ K++KPI  FSR+SRKG KV KKEEV PRLYTRDTVRNI NILRNCSW SAQ HLE LPIRWDSYL
Subjt:  NSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDSYL

Query:  INQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPT
        INQVLKTHPPLEK WLFFNWAS LQ++KHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSN GDVDGAIK+WKEMKANGC+PT
Subjt:  INQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPT

Query:  VVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEK
        VV+YTAYIKILLD+ Q+ EAT TY++MLQSGLSPNCCTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKS E LVMTQILE+MKE 
Subjt:  VVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEK

Query:  RLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEV
        R VLRYPVFVEA ETLKSCSV+  LL+QVNPH+E+ES+SKG+V+ DVST S  +PPNVD EL+ +LL+++KL AVDH+LIG+++KNIQLDS II+SIIEV
Subjt:  RLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEV

Query:  NCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTAL
        NCK NRPNSALLAF+YCLKN V+I R LYL LIGILIRSSIY KLLEIVQEMY QGHCLG+YHATLIL  LGKAGKPQYARKVFNMLP+ELKCTATYTAL
Subjt:  NCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTAL

Query:  VGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS
        V  YFS G+SGKGLKI+ETMRKKGFTPSLGTYNVLL GL K+GR VEL+IYRREKKS EISHHS  NTIL++ERICDLLFGELVS
Subjt:  VGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS

A0A1S4E1N2 pentatricopeptide repeat-containing protein At2g01390-like7.9e-28183.08Show/hide
Query:  NSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDSYL
        N FSLLL NYVVISAI KRIYQNIS K LHS HQ K+EKPI  FSR SRKG KVVKKEEV PR+YTRDTV NI NILRNCSW SAQ HLE LPIRWDSYL
Subjt:  NSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDSYL

Query:  INQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPT
        INQVLKTHPPLEK WLFFNWASRL+++KHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDA TYTSLMHWRSN GDVDGAIKVWKEMKANGC+PT
Subjt:  INQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPT

Query:  VVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEK
        VV+YTAYIKILLD+ Q KEAT TY+EML++GLSPNCCTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKS E LVMTQILE+MKE 
Subjt:  VVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEK

Query:  RLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEV
        R VLRYPVFVEA E LKSCSV   LLRQVNPHIE+ES+SKG+V +DVST S  +PPNVD EL+ +LL+++KL A+DH+LIG+++KNIQLDS II+SIIEV
Subjt:  RLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEV

Query:  NCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTAL
        NCK NRPNSA+LAF+YCLKNGV+I R LYL LIGILIRSSIY KLLEIVQEMY QGHC+G+YHATLILY LG+AGKPQYARKVFN+LP+ELKCTATYT+L
Subjt:  NCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTAL

Query:  VGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS
        V AYFS G+SGKGLKI+ETMRKKGFTPSLGTYNVLL GL KSGR VEL+IYRREKKS EISHHS  NTIL++ERICDLLFGELVS
Subjt:  VGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS

A0A6J1DMJ2 pentatricopeptide repeat-containing protein At2g01390 isoform X11.3e-28383.28Show/hide
Query:  SNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDSY
        SNSFSLLL NYVVISAI K+IY NIS K LHS  Q KQEKPIK+FSRK RKG KVV+KEEV P+LYTRDTVRNIYNILRN SW SAQ HLE LP+RWDSY
Subjt:  SNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDSY

Query:  LINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYP
        LINQV+KTHPPLEKAWLFFNWA RL+ +KHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDAVTYTSLMHWRS  GDVDGAIKVWKEMK NGCYP
Subjt:  LINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYP

Query:  TVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKE
        TVV+YTAYIKILLD+DQVKEATDTY+EMLQSGLSPNCCTYT+LMEYLIG GKCKEALDIFHKMQDAGVYPDKAACNILI KCC+S EMLVMT ILEYMKE
Subjt:  TVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKE

Query:  KRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIE
         R VLRYPVFVEA +TLKSCSV++TLLRQVNPHIE ESVSK +VI  V TSS IIP NVD EL+EILL+++KLIAVD++L GM++KNIQLDS II +IIE
Subjt:  KRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIE

Query:  VNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTA
        VNCK NRP+ ALL F++CLK+GV++ RNLYLGLIG+LIRSSIYSKLLEIV EMY+QGHCLG+YHATLILYRLGKAGKPQYA K+FN+LP+ELKCTATYTA
Subjt:  VNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTA

Query:  LVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS
        LVGAYFS G+SGKGLKIYETMRKKGF+PSLGTYNVLLTGL KSGRVVEL+IYRREKKS EI ++SHH+ ILEE+RICDLL+GE++S
Subjt:  LVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS

A0A6J1EIW0 pentatricopeptide repeat-containing protein At2g013907.9e-28985.37Show/hide
Query:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD
        +CSN FS L+ NYVV SAICKRIYQNIS+K LHS HQ KQEKP   FSRK RKG K VKKEEV    YTRDTVRNIYNILRNCSW SAQGH+ETLPIRWD
Subjt:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD

Query:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC
        SYLINQVLKTHPPLEKAWLFFNWASRLQ ++HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSN GDVDGAI+VW+EMKANGC
Subjt:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC

Query:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM
        YPTVV+YTAYIKILLD+ +V++ATD Y+EMLQSGLSPNCCTYT+LMEYLIGE K KEALDIFHKMQDAG YPDKAACNILIQKCCKS EMLVMTQILEYM
Subjt:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM

Query:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI
        KEKRLVLRYPVFVEA E LKSCSV+ TLL QVNPHIE+ESVSKG+V+ DVSTS  +I P+VD ELV  LL+E+KLIAVDHILIGM +KNIQLDS II SI
Subjt:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI

Query:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY
        IEVNCKRNRPN ALLAF+YCLKNGV ++RNLYL LIG+LIRSSIYS LLEIVQEMY +GHCLG+YHATLILYRLGKAGKPQYARKVFNMLP+ELKCTATY
Subjt:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY

Query:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS
        TALV AYFS G+ GKGLKIYETMRKKGFTPSLGTYNVLL+GLVKS RVVELDIYRREKK  EISHHSHH TILEEERICDLLFGELVS
Subjt:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS

A0A6J1I9C5 pentatricopeptide repeat-containing protein At2g013902.2e-29186.05Show/hide
Query:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD
        +CSNSFS L+ NYVV SAICKRIYQNIS+K LHS HQ KQEKP   FSRK RKG K VKKEEV    YTRDTVRNIYNILRNCSWGSAQGH+ETLPIRWD
Subjt:  QCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWD

Query:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC
        SYLINQVLKTHPPLEKAWLFFNWASRLQ +KHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSN GDVDGAI+VW+EMKANGC
Subjt:  SYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGC

Query:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM
        YPTVV+YTAYIKILLD+ +V++ATDTY+EMLQSGLSPNCCTYT+LMEYLIGE K KEALDIFHKMQDAGVYPDKAACNILIQKCCKS EMLVMTQILEYM
Subjt:  YPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYM

Query:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI
        KEKRLVLRYPVFVEA E LKSCSV+ TLL QVNPHIE+ESVSKG+V+ DVSTS  +I P+VD ELV  LL+E+KLIAVDHILIGM +KNIQLDS II SI
Subjt:  KEKRLVLRYPVFVEARETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSI

Query:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY
        IEVNCKRNRPN ALLAF+YCLKNGV ++RNLYL LIG+LIRSSIYS LLEIVQ+MY +GHCLG+YHATLILYRLGKAGKPQYARKVFNMLP+ELKCTATY
Subjt:  IEVNCKRNRPNSALLAFNYCLKNGVSIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATY

Query:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS
        TALV AYFS G+ GKGLKIYETMRKKGFTPSLGTYNVLL+GLVKS RVVELDIYRREKK  EISHHSHH TILEEERICDLLFGELVS
Subjt:  TALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS

SwissProt top hitse value%identityAlignment
Q76C99 Protein Rf1, mitochondrial7.5e-2623.5Show/hide
Query:  DQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREMLQ
        D  +YTT+++ F + G        + +M ++GI  D VTY S++        +D A++V   M  NG  P  +TY + +     S Q KEA    ++M  
Subjt:  DQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREMLQ

Query:  SGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEKRLVLRYPVFVEARETLKSCSVTDTLLRQV
         G+ P+  TY++LM+YL   G+C EA  IF  M   G+ P+      L+Q       ++ M  +L+ M    +   + VF     ++  C+         
Subjt:  SGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEKRLVLRYPVFVEARETLKSCSVTDTLLRQV

Query:  NPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEVNCKRNRPNSALLAFNYCLKNGVSIDRNLY
                  +GKV   +   SK                             M ++ +  +++   ++I + CK  R   A+L F   +  G+S    +Y
Subjt:  NPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEVNCKRNRPNSALLAFNYCLKNGVSIDRNLY

Query:  LGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPK--ELKCTATYTALVGAYFSTGNSGKGLKIYETMRKKGFTP
          LI  L   + + +  E++ EM  +G CL       I+    K G+   + K+F ++ +        TY  L+  Y   G   + +K+   M   G  P
Subjt:  LGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPK--ELKCTATYTALVGAYFSTGNSGKGLKIYETMRKKGFTP

Query:  SLGTYNVLLTGLVKSGRVVELDIYRREKKSLEIS
        +  TY+ L+ G  K  R+ +  +  +E +S  +S
Subjt:  SLGTYNVLLTGLVKSGRVVELDIYRREKKSLEIS

Q8GYP6 Pentatricopeptide repeat-containing protein At1g189008.0e-2833.64Show/hide
Query:  VRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK       A  FF W  R   +KHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVY
        Y  L+H       ++ A+ V+ +M+  GC P  VTY   I I   +  +  A D Y+ M   GLSP+  TY++++  L   G    A  +F +M D G  
Subjt:  YTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVY

Query:  PDKAACNILIQKCCKSR
        P+    NI++    K+R
Subjt:  PDKAACNILIQKCCKSR

Q9SFV9 Pentatricopeptide repeat-containing protein At3g07290, mitochondrial2.2e-2526Show/hide
Query:  DQYTYTTMLDIFGEAGRISSMNYVFQQM-KEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREML
        D +  T++L  F     +     VF  M KE     ++V+Y+ L+H     G ++ A  +  +M   GC P+  TYT  IK L D   + +A + + EM+
Subjt:  DQYTYTTMLDIFGEAGRISSMNYVFQQM-KEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREML

Query:  QSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEKRLVLRYPVFVEARETLKSCSVTDTLLRQ
          G  PN  TYT+L++ L  +GK +EA  +  KM    ++P     N LI   CK   ++   ++L  M+++        F E  E          L R 
Subjt:  QSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEKRLVLRYPVFVEARETLKSCSVTDTLLRQ

Query:  VNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEVNCKRNRPNSALLAFNYCLKNGVSIDRNL
          P+   ++V   K ++D   S  I+  NV   L++ L  E  +     +L  M   +I+ D L   +II   CK+ + + A       L+ G+S+D   
Subjt:  VNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEVNCKRNRPNSALLAFNYCLKNGVSIDRNL

Query:  YLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPK--ELKCTATYTALVGAYFSTGNSGKGLKIYETMRKKGFT
           LI  + +       L I++ + K       +   +IL  L K  K +    +   + K   +    TYT LV     +G+     +I E M+  G  
Subjt:  YLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPK--ELKCTATYTALVGAYFSTGNSGKGLKIYETMRKKGFT

Query:  PSLGTYNVLLTGLVKSGRVVELD
        P++  Y +++ GL + GRV E +
Subjt:  PSLGTYNVLLTGLVKSGRVVELD

Q9SSF9 Pentatricopeptide repeat-containing protein At1g747507.2e-2931.92Show/hide
Query:  LHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRD--TVRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQ
        +HS         ++ F + SR+ +KV  +    PR +      V N+ +ILR   WG +A+  L     R D+Y  NQVLK       A  FF W  R  
Subjt:  LHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRD--TVRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQ

Query:  IYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYR
         +KHD +TYTTM+   G A +   +N +  +M   G K + VTY  L+H       +  A+ V+ +M+  GC P  VTY   I I   +  +  A D Y+
Subjt:  IYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYR

Query:  EMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSR
         M ++GLSP+  TY++++  L   G    A  +F +M   G  P+    NI+I    K+R
Subjt:  EMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSR

Q9ZU29 Pentatricopeptide repeat-containing protein At2g013903.0e-15251.52Show/hide
Query:  STKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEV-GPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASR
        S K LHS  + K     K FS+K     K+VK + +  P +YTRD V NIYNIL+  +W SAQ  L  L +RWDS++IN+VLK HPP++KAWLFFNWA++
Subjt:  STKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEV-GPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASR

Query:  LQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDT
        ++ +KHD +TYTTMLDIFGEAGRI SM  VF  MKEKG+ ID VTYTSL+HW S+ GDVDGA+++W+EM+ NGC PTVV+YTAY+K+L    +V+EAT+ 
Subjt:  LQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDT

Query:  YREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEKRLVLRYPVFVEARETLKSCSVTD
        Y+EML+S +SPNC TYT+LMEYL+  GKC+EALDIF KMQ+ GV PDKAACNILI K  K  E   MT++L YMKE  +VLRYP+FVEA ETLK+   +D
Subjt:  YREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEKRLVLRYPVFVEARETLKSCSVTD

Query:  TLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELV-EILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEVNCKRNRPNSALLAFNYCLKNGV
         LLR+VN HI VES+    +    +        + D  ++  +LL +  L+AVD +L  M ++NI+LDS ++ +IIE NC R R   A LAF+Y L+ G+
Subjt:  TLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELV-EILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEVNCKRNRPNSALLAFNYCLKNGV

Query:  SIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTALVGAYFSTGNSGKGLKIYETMRK
         + ++ YL LIG  +RS+   K++E+V+EM K  H LG Y   ++++RLG   +P+ A  VF++LP + K  A YTAL+  Y S G+  K +KI   MR+
Subjt:  SIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTALVGAYFSTGNSGKGLKIYETMRK

Query:  KGFTPSLGTYNVLLTGLVK-SGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLF
        +   PSLGTY+VLL+GL K S    E+ + R+EKKSL  S     N +  E++ICDLLF
Subjt:  KGFTPSLGTYNVLLTGLVK-SGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLF

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein5.7e-2933.64Show/hide
Query:  VRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK       A  FF W  R   +KHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVY
        Y  L+H       ++ A+ V+ +M+  GC P  VTY   I I   +  +  A D Y+ M   GLSP+  TY++++  L   G    A  +F +M D G  
Subjt:  YTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVY

Query:  PDKAACNILIQKCCKSR
        P+    NI++    K+R
Subjt:  PDKAACNILIQKCCKSR

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein5.7e-2933.64Show/hide
Query:  VRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK       A  FF W  R   +KHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVY
        Y  L+H       ++ A+ V+ +M+  GC P  VTY   I I   +  +  A D Y+ M   GLSP+  TY++++  L   G    A  +F +M D G  
Subjt:  YTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVY

Query:  PDKAACNILIQKCCKSR
        P+    NI++    K+R
Subjt:  PDKAACNILIQKCCKSR

AT1G18900.3 Pentatricopeptide repeat (PPR) superfamily protein5.7e-2933.64Show/hide
Query:  VRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++LR   WG +A+  L+ L +R D+Y  NQVLK       A  FF W  R   +KHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVY
        Y  L+H       ++ A+ V+ +M+  GC P  VTY   I I   +  +  A D Y+ M   GLSP+  TY++++  L   G    A  +F +M D G  
Subjt:  YTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVY

Query:  PDKAACNILIQKCCKSR
        P+    NI++    K+R
Subjt:  PDKAACNILIQKCCKSR

AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein5.1e-3031.92Show/hide
Query:  LHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRD--TVRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQ
        +HS         ++ F + SR+ +KV  +    PR +      V N+ +ILR   WG +A+  L     R D+Y  NQVLK       A  FF W  R  
Subjt:  LHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRD--TVRNIYNILRNCSWG-SAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQ

Query:  IYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYR
         +KHD +TYTTM+   G A +   +N +  +M   G K + VTY  L+H       +  A+ V+ +M+  GC P  VTY   I I   +  +  A D Y+
Subjt:  IYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDTYR

Query:  EMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSR
         M ++GLSP+  TY++++  L   G    A  +F +M   G  P+    NI+I    K+R
Subjt:  EMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSR

AT2G01390.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-15351.52Show/hide
Query:  STKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEV-GPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASR
        S K LHS  + K     K FS+K     K+VK + +  P +YTRD V NIYNIL+  +W SAQ  L  L +RWDS++IN+VLK HPP++KAWLFFNWA++
Subjt:  STKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEV-GPRLYTRDTVRNIYNILRNCSWGSAQGHLETLPIRWDSYLINQVLKTHPPLEKAWLFFNWASR

Query:  LQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDT
        ++ +KHD +TYTTMLDIFGEAGRI SM  VF  MKEKG+ ID VTYTSL+HW S+ GDVDGA+++W+EM+ NGC PTVV+YTAY+K+L    +V+EAT+ 
Subjt:  LQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVVTYTAYIKILLDSDQVKEATDT

Query:  YREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEKRLVLRYPVFVEARETLKSCSVTD
        Y+EML+S +SPNC TYT+LMEYL+  GKC+EALDIF KMQ+ GV PDKAACNILI K  K  E   MT++L YMKE  +VLRYP+FVEA ETLK+   +D
Subjt:  YREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEKRLVLRYPVFVEARETLKSCSVTD

Query:  TLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELV-EILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEVNCKRNRPNSALLAFNYCLKNGV
         LLR+VN HI VES+    +    +        + D  ++  +LL +  L+AVD +L  M ++NI+LDS ++ +IIE NC R R   A LAF+Y L+ G+
Subjt:  TLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELV-EILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEVNCKRNRPNSALLAFNYCLKNGV

Query:  SIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTALVGAYFSTGNSGKGLKIYETMRK
         + ++ YL LIG  +RS+   K++E+V+EM K  H LG Y   ++++RLG   +P+ A  VF++LP + K  A YTAL+  Y S G+  K +KI   MR+
Subjt:  SIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTALVGAYFSTGNSGKGLKIYETMRK

Query:  KGFTPSLGTYNVLLTGLVK-SGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLF
        +   PSLGTY+VLL+GL K S    E+ + R+EKKSL  S     N +  E++ICDLLF
Subjt:  KGFTPSLGTYNVLLTGLVK-SGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCCTACCCAACCAACTACGAGCGCGAGCGCTCTGGCGATTGGGCAGTTTCTGATCAATGGCGACCTGATCTACGACAATCACGCTCTTGTGGGACCGATCTTCCC
CTCTCTCACGGCGGCGACAGACCTACGCAACAGGTCTGCAGTGGCGCGAAACTCAATTGACAATGATGACCAGCAGCGGCTTCTCCCACAGGCGCCGTTGGTCGGAAACT
GCCGAATCGAAGCCAATTCCAATTCGATTCACTTTCTACGGGTGAATTTTTCTAAAGTTCCTCAGGGTCGTCTCTCGAGTCTCAATACCTCTCGCCAAACCCGGGGAAAC
TCAAATTTCCATGTTGGTCACCATTATCACTTCCCTAGTTCAAGCCAGTGTTCTAATAGTTTCTCTTTACTTCTGGGTAACTACGTGGTTATTTCTGCCATCTGTAAAAG
AATTTATCAGAATATTTCCACTAAAGGTTTGCATTCCTTTCACCAATCTAAACAAGAAAAACCCATCAAAGTATTCAGCAGAAAGTCGAGGAAAGGAATTAAGGTGGTTA
AGAAGGAAGAAGTAGGTCCAAGACTTTACACGAGAGATACAGTGAGGAACATATACAATATTTTGAGAAATTGTTCATGGGGTTCTGCTCAAGGACACCTAGAGACGCTC
CCTATAAGATGGGATTCTTATCTCATCAACCAGGTTCTGAAAACGCATCCACCATTGGAGAAGGCATGGTTATTCTTTAATTGGGCCTCTAGGCTGCAAATCTACAAGCA
TGACCAGTATACCTACACAACGATGCTGGACATTTTTGGAGAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATG
CAGTTACATATACTTCATTGATGCATTGGCGTTCAAACTGGGGGGATGTTGATGGGGCTATAAAGGTATGGAAGGAAATGAAAGCCAATGGTTGTTATCCGACAGTAGTT
ACTTATACTGCTTATATAAAGATTTTGTTGGACAGTGACCAAGTCAAGGAGGCCACTGATACATACAGGGAGATGCTTCAGTCTGGGCTTTCTCCAAATTGCTGTACTTA
CACCATCTTAATGGAATACCTTATTGGAGAGGGTAAATGCAAAGAAGCCCTTGATATTTTTCACAAAATGCAAGATGCAGGAGTATATCCTGATAAAGCGGCTTGCAATA
TATTGATTCAGAAGTGCTGTAAATCACGGGAGATGCTGGTAATGACACAAATCCTTGAGTACATGAAAGAAAAACGCCTTGTGCTTCGATACCCTGTGTTTGTTGAAGCA
CGTGAAACTTTAAAAAGTTGTTCTGTAACTGATACCCTACTCAGGCAAGTTAATCCTCATATAGAAGTTGAATCAGTCAGTAAGGGTAAGGTTATTATGGATGTTAGTAC
AAGTTCTAAAATTATTCCTCCCAATGTAGATTGTGAGCTTGTGGAAATTTTGTTGGAGGAGGATAAACTTATCGCTGTTGACCATATATTAATTGGGATGATAGAGAAGA
ACATACAGTTGGATTCTTTGATTATTTTTTCCATCATCGAGGTAAATTGCAAACGTAATCGACCTAACAGTGCTCTGCTGGCTTTCAACTACTGTTTAAAAAACGGTGTT
AGCATTGACAGAAATCTGTACCTTGGCTTGATCGGGATTCTGATCCGATCGAGTATATATTCAAAGTTACTGGAAATTGTTCAAGAAATGTATAAGCAAGGGCATTGCCT
TGGAATCTATCATGCCACACTTATACTTTATAGGCTTGGGAAAGCTGGAAAACCTCAATATGCTAGGAAAGTTTTCAATATGTTGCCTAAGGAATTGAAGTGCACTGCAA
CTTACACTGCTCTGGTTGGTGCTTATTTTTCTACTGGAAATTCTGGTAAAGGGCTTAAAATTTACGAAACAATGCGAAAGAAAGGATTTACACCATCTTTAGGCACATAC
AATGTGCTGTTAACTGGTCTTGTGAAGAGCGGTAGAGTTGTTGAATTAGATATTTATAGAAGGGAAAAGAAGAGTTTGGAGATCAGTCATCACTCTCATCATAATACAAT
ACTGGAGGAAGAAAGGATTTGTGATCTTCTTTTTGGAGAATTGGTATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTCCTACCCAACCAACTACGAGCGCGAGCGCTCTGGCGATTGGGCAGTTTCTGATCAATGGCGACCTGATCTACGACAATCACGCTCTTGTGGGACCGATCTTCCC
CTCTCTCACGGCGGCGACAGACCTACGCAACAGGTCTGCAGTGGCGCGAAACTCAATTGACAATGATGACCAGCAGCGGCTTCTCCCACAGGCGCCGTTGGTCGGAAACT
GCCGAATCGAAGCCAATTCCAATTCGATTCACTTTCTACGGGTGAATTTTTCTAAAGTTCCTCAGGGTCGTCTCTCGAGTCTCAATACCTCTCGCCAAACCCGGGGAAAC
TCAAATTTCCATGTTGGTCACCATTATCACTTCCCTAGTTCAAGCCAGTGTTCTAATAGTTTCTCTTTACTTCTGGGTAACTACGTGGTTATTTCTGCCATCTGTAAAAG
AATTTATCAGAATATTTCCACTAAAGGTTTGCATTCCTTTCACCAATCTAAACAAGAAAAACCCATCAAAGTATTCAGCAGAAAGTCGAGGAAAGGAATTAAGGTGGTTA
AGAAGGAAGAAGTAGGTCCAAGACTTTACACGAGAGATACAGTGAGGAACATATACAATATTTTGAGAAATTGTTCATGGGGTTCTGCTCAAGGACACCTAGAGACGCTC
CCTATAAGATGGGATTCTTATCTCATCAACCAGGTTCTGAAAACGCATCCACCATTGGAGAAGGCATGGTTATTCTTTAATTGGGCCTCTAGGCTGCAAATCTACAAGCA
TGACCAGTATACCTACACAACGATGCTGGACATTTTTGGAGAAGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATG
CAGTTACATATACTTCATTGATGCATTGGCGTTCAAACTGGGGGGATGTTGATGGGGCTATAAAGGTATGGAAGGAAATGAAAGCCAATGGTTGTTATCCGACAGTAGTT
ACTTATACTGCTTATATAAAGATTTTGTTGGACAGTGACCAAGTCAAGGAGGCCACTGATACATACAGGGAGATGCTTCAGTCTGGGCTTTCTCCAAATTGCTGTACTTA
CACCATCTTAATGGAATACCTTATTGGAGAGGGTAAATGCAAAGAAGCCCTTGATATTTTTCACAAAATGCAAGATGCAGGAGTATATCCTGATAAAGCGGCTTGCAATA
TATTGATTCAGAAGTGCTGTAAATCACGGGAGATGCTGGTAATGACACAAATCCTTGAGTACATGAAAGAAAAACGCCTTGTGCTTCGATACCCTGTGTTTGTTGAAGCA
CGTGAAACTTTAAAAAGTTGTTCTGTAACTGATACCCTACTCAGGCAAGTTAATCCTCATATAGAAGTTGAATCAGTCAGTAAGGGTAAGGTTATTATGGATGTTAGTAC
AAGTTCTAAAATTATTCCTCCCAATGTAGATTGTGAGCTTGTGGAAATTTTGTTGGAGGAGGATAAACTTATCGCTGTTGACCATATATTAATTGGGATGATAGAGAAGA
ACATACAGTTGGATTCTTTGATTATTTTTTCCATCATCGAGGTAAATTGCAAACGTAATCGACCTAACAGTGCTCTGCTGGCTTTCAACTACTGTTTAAAAAACGGTGTT
AGCATTGACAGAAATCTGTACCTTGGCTTGATCGGGATTCTGATCCGATCGAGTATATATTCAAAGTTACTGGAAATTGTTCAAGAAATGTATAAGCAAGGGCATTGCCT
TGGAATCTATCATGCCACACTTATACTTTATAGGCTTGGGAAAGCTGGAAAACCTCAATATGCTAGGAAAGTTTTCAATATGTTGCCTAAGGAATTGAAGTGCACTGCAA
CTTACACTGCTCTGGTTGGTGCTTATTTTTCTACTGGAAATTCTGGTAAAGGGCTTAAAATTTACGAAACAATGCGAAAGAAAGGATTTACACCATCTTTAGGCACATAC
AATGTGCTGTTAACTGGTCTTGTGAAGAGCGGTAGAGTTGTTGAATTAGATATTTATAGAAGGGAAAAGAAGAGTTTGGAGATCAGTCATCACTCTCATCATAATACAAT
ACTGGAGGAAGAAAGGATTTGTGATCTTCTTTTTGGAGAATTGGTATCTTGA
Protein sequenceShow/hide protein sequence
MIPTQPTTSASALAIGQFLINGDLIYDNHALVGPIFPSLTAATDLRNRSAVARNSIDNDDQQRLLPQAPLVGNCRIEANSNSIHFLRVNFSKVPQGRLSSLNTSRQTRGN
SNFHVGHHYHFPSSSQCSNSFSLLLGNYVVISAICKRIYQNISTKGLHSFHQSKQEKPIKVFSRKSRKGIKVVKKEEVGPRLYTRDTVRNIYNILRNCSWGSAQGHLETL
PIRWDSYLINQVLKTHPPLEKAWLFFNWASRLQIYKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNWGDVDGAIKVWKEMKANGCYPTVV
TYTAYIKILLDSDQVKEATDTYREMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSREMLVMTQILEYMKEKRLVLRYPVFVEA
RETLKSCSVTDTLLRQVNPHIEVESVSKGKVIMDVSTSSKIIPPNVDCELVEILLEEDKLIAVDHILIGMIEKNIQLDSLIIFSIIEVNCKRNRPNSALLAFNYCLKNGV
SIDRNLYLGLIGILIRSSIYSKLLEIVQEMYKQGHCLGIYHATLILYRLGKAGKPQYARKVFNMLPKELKCTATYTALVGAYFSTGNSGKGLKIYETMRKKGFTPSLGTY
NVLLTGLVKSGRVVELDIYRREKKSLEISHHSHHNTILEEERICDLLFGELVS