; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10000731 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10000731
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionpentatricopeptide repeat-containing protein At2g01390-like
Genome locationChr09:8611443..8613780
RNA-Seq ExpressionHG10000731
SyntenyHG10000731
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011168.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]4.9e-28784.12Show/hide
Query:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW
        M  SN FS L+ NYVVTSAICKRIYQNISSK LHS HQYK+EKP +RFSRK RKGTK VKK++V+   YTRDTVRNIYNIL+NCSWGSAQGH+E LPIRW
Subjt:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG
        DSYLINQVLKTHPPLEK WLFFNWASRLQ F+HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAI+VW+EMKA+G
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG

Query:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY
        CYPTVVSYTAYIKILLD+G++++ATD YKEMLQSGLSPNCCTYT+LMEYLIGE K KEALDIF KMQDAG YPDKAACNILIQKCCKSGE LVMTQILEY
Subjt:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY

Query:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI
        MKEKRLVLRYPVFVEAHE LKSCSVS TLL QVNPHIEIESV KGEVVDV+TS N++ P+VD EL+A LLKE KL A+D+ILIG  DKNIQLDSSIILSI
Subjt:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI

Query:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY
        IEVNCK NRPN ALLAFDY  KNGV ++RNLYL LIG+LIRSSIY  LL+IVQ+MYT+GHCLGLYHATLI+Y LGKAGKPQYARKVFNMLPEELKC ATY
Subjt:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY

Query:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGELSMRHHP
        TALV AYF AGS GKGLKIYE MR KGFTPSLGTYNVLL+GLVKS RVVELDIYRREK  FEISH SH  TILEEERICDLLFGEL+    P
Subjt:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGELSMRHHP

XP_011649371.1 pentatricopeptide repeat-containing protein At2g01390 [Cucumis sativus]5.6e-29987.54Show/hide
Query:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW
        MH+ N FSLLL NYVV+SAI KRIYQNISSK LHS HQYKR+KPI+RFSR+SRKGTK+ KK++V  RLYTRDTVRNI NIL+NCSW SAQ HLEMLPIRW
Subjt:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG
        DSYLINQVLKTHPPLEKTWLFFNWAS LQ+FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIK+WKEMKA+G
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG

Query:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY
        C+PTVVSYTAYIKILLD+GQI EAT TYK+MLQSGLSPNCCTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGERLVMTQILE+
Subjt:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY

Query:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI
        MKE R VLRYPVFVEAHETLKSCSVSY LL+QVNPH+EIES+ KGEVVDV+T SN VPPNVDNELLA+LLK+NKLTA+D++LIG VDKNIQLDSSII SI
Subjt:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI

Query:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY
        IEVNCK NRPNSALLAFDY  KN VNIKR LYL LIGILIRSSIYPKLL+IVQ+MYTQGHCLGLYHATLI+ SLGKAGKPQYARKVFNMLPEELKC ATY
Subjt:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY

Query:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL
        TALVD YFSAGSSGKGLKI+ETMR KGFTPSLGTYNVLL GL K+GR VEL+IYRREK SFEISH S LNTIL++ERICDLLFGEL
Subjt:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL

XP_016902133.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Cucumis melo]5.8e-29686.52Show/hide
Query:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW
        MH+ N FSLLL NYVV SAI KRIYQNIS K LHS HQYKREKPI+RFSR SRKGTK+VKK++V  R+YTRDTV NI NIL+NCSW SAQ HLEMLPIRW
Subjt:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG
        DSYLINQVLKTHPPLEKTWLFFNWASRL++FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDA TYTSLMHWRSNSGDVDGAIKVWKEMKA+G
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG

Query:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY
        C+PTVVSYTAYIKILLD+GQ KEAT TYKEML++GLSPNCCTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGERLVMTQILE+
Subjt:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY

Query:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI
        MKE R VLRYPVFVEAHE LKSCSV + LLRQVNPHIEIES+ KGEV+DV+T SN VPPNVDNELLA+LLK+NKLTAID++LIG VDKNIQLDSSII SI
Subjt:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI

Query:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY
        IEVNCK NRPNSA+LAFDY  KNGVNI R LYL LIGILIRSSIYPKLL+IVQ+MYTQGHC+GLYHATLI+YSLG+AGKPQYARKVFN+LPEELKC ATY
Subjt:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY

Query:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL
        T+LVDAYFSAGSSGKGLKI+ETMR KGFTPSLGTYNVLL GL KSGR VEL+IYRREK SFEISH S LNTIL++ERICDLLFGEL
Subjt:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL

XP_022971714.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita maxima]6.2e-29085.84Show/hide
Query:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW
        M  SNSFS L+ NYVVTSAICKRIYQNISSK LHS HQYK+EKP +RFSRK RKGTK VKK++V+   YTRDTVRNIYNIL+NCSWGSAQGH+E LPIRW
Subjt:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG
        DSYLINQVLKTHPPLEK WLFFNWASRLQ FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAI+VW+EMKA+G
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG

Query:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY
        CYPTVVSYTAYIKILLD+G++++ATDTYKEMLQSGLSPNCCTYT+LMEYLIGE K KEALDIF KMQDAGVYPDKAACNILIQKCCKSGE LVMTQILEY
Subjt:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY

Query:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI
        MKEKRLVLRYPVFVEAHE LKSCSVS TLL QVNPHIEIESV KGEVVDV+TS N++ P+VD EL+A LLKE KL A+D+ILIG  DKNIQLDSSIILSI
Subjt:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI

Query:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY
        IEVNCK NRPN ALLAFDY  KNGV ++RNLYL LIG+LIRSSIY  LL+IVQ MYT+GHCLGLYHATLI+Y LGKAGKPQYARKVFNMLPEELKC ATY
Subjt:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY

Query:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL
        TALV AYFSAGS GKGLKIYETMR KGFTPSLGTYNVLL+GLVKS RVVELDIYRREK  FEISH SH  TILEEERICDLLFGEL
Subjt:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL

XP_038901985.1 pentatricopeptide repeat-containing protein At2g01390 [Benincasa hispida]9.9e-30489.25Show/hide
Query:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW
        MH SNSFS LL NYVVTSAI KRIYQNISSK LHSFHQYK+EKPI +F+RKSRKGTK+VKK++VD R YTRDTVRNIYNIL+ CSWGSAQ HLEMLPIRW
Subjt:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG
        DSYLINQVLKTHPPLEKTWLFFNWASRLQ+FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEK IKIDAVTYTSLMHWRSNSGDV+GAIKVWKEMKA+G
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG

Query:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY
        CYPTVVSYTAYIKILLDS QIKEATDTYKEMLQSGL PNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGE LVMTQILEY
Subjt:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY

Query:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI
        MK+KRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESV KGEVV+V+T SNIVPPNVD+ELLAILLKENKLTAIDY+L G VD+NIQLDSSIILSI
Subjt:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI

Query:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY
         EVNCK NRPN ALLAF+Y  K+GVNI+R LYL LIGILIRSSIYPKLL+IVQKMYTQGHCLGLYHATLI+Y LGKAGKPQYARKVFN+LPEELKC ATY
Subjt:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY

Query:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL
        TALVDAYFSAGSSGKGLKIYETMR KGF PSLGTYNVLL GL K GR+ EL IYR+E+ SFEISH SHL TILEEERICDLL+GEL
Subjt:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL

TrEMBL top hitse value%identityAlignment
A0A0A0LJM3 Uncharacterized protein2.7e-29987.54Show/hide
Query:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW
        MH+ N FSLLL NYVV+SAI KRIYQNISSK LHS HQYKR+KPI+RFSR+SRKGTK+ KK++V  RLYTRDTVRNI NIL+NCSW SAQ HLEMLPIRW
Subjt:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG
        DSYLINQVLKTHPPLEKTWLFFNWAS LQ+FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIK+WKEMKA+G
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG

Query:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY
        C+PTVVSYTAYIKILLD+GQI EAT TYK+MLQSGLSPNCCTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGERLVMTQILE+
Subjt:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY

Query:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI
        MKE R VLRYPVFVEAHETLKSCSVSY LL+QVNPH+EIES+ KGEVVDV+T SN VPPNVDNELLA+LLK+NKLTA+D++LIG VDKNIQLDSSII SI
Subjt:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI

Query:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY
        IEVNCK NRPNSALLAFDY  KN VNIKR LYL LIGILIRSSIYPKLL+IVQ+MYTQGHCLGLYHATLI+ SLGKAGKPQYARKVFNMLPEELKC ATY
Subjt:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY

Query:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL
        TALVD YFSAGSSGKGLKI+ETMR KGFTPSLGTYNVLL GL K+GR VEL+IYRREK SFEISH S LNTIL++ERICDLLFGEL
Subjt:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL

A0A1S4E1N2 pentatricopeptide repeat-containing protein At2g01390-like2.8e-29686.52Show/hide
Query:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW
        MH+ N FSLLL NYVV SAI KRIYQNIS K LHS HQYKREKPI+RFSR SRKGTK+VKK++V  R+YTRDTV NI NIL+NCSW SAQ HLEMLPIRW
Subjt:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG
        DSYLINQVLKTHPPLEKTWLFFNWASRL++FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDA TYTSLMHWRSNSGDVDGAIKVWKEMKA+G
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG

Query:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY
        C+PTVVSYTAYIKILLD+GQ KEAT TYKEML++GLSPNCCTYTILMEYLIGEGKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGERLVMTQILE+
Subjt:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY

Query:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI
        MKE R VLRYPVFVEAHE LKSCSV + LLRQVNPHIEIES+ KGEV+DV+T SN VPPNVDNELLA+LLK+NKLTAID++LIG VDKNIQLDSSII SI
Subjt:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI

Query:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY
        IEVNCK NRPNSA+LAFDY  KNGVNI R LYL LIGILIRSSIYPKLL+IVQ+MYTQGHC+GLYHATLI+YSLG+AGKPQYARKVFN+LPEELKC ATY
Subjt:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY

Query:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL
        T+LVDAYFSAGSSGKGLKI+ETMR KGFTPSLGTYNVLL GL KSGR VEL+IYRREK SFEISH S LNTIL++ERICDLLFGEL
Subjt:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL

A0A6J1DMJ2 pentatricopeptide repeat-containing protein At2g01390 isoform X11.2e-27881.74Show/hide
Query:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW
        MHYSNSFSLLL NYVV SAI K+IY NIS KALHS  QYK+EKPI  FSRK RKG K+V+K++VD +LYTRDTVRNIYNIL+N SW SAQ HLE LP+RW
Subjt:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG
        DSYLINQV+KTHPPLEK WLFFNWA RL+ FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKGIKIDAVTYTSLMHWRS SGDVDGAIKVWKEMK +G
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG

Query:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY
        CYPTVVSYTAYIKILLD+ Q+KEATDTYKEMLQSGLSPNCCTYT+LMEYLIG GKCKEALDIF KMQDAGVYPDKAACNILI KCC+SGE LVMT ILEY
Subjt:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY

Query:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI
        MKE R VLRYPVFVEAH+TLKSCSVS TLLRQVNPHIE ESV K EV+ V TSS I+P NVD+EL+ ILLK+ KL A+DY+L G VDKNIQLDS+II +I
Subjt:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI

Query:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY
        IEVNCK NRP+ ALL FD+  K+GVN+KRNLYL LIG+LIRSSIY KLL+IV +MY QGHCLGLYHATLI+Y LGKAGKPQYA K+FN+LPEELKC ATY
Subjt:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY

Query:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL
        TALV AYFSAGSSGKGLKIYETMR KGF+PSLGTYNVLLTGL KSGRVVEL+IYRREK SFEI + SH + ILEE+RICDLL+GE+
Subjt:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL

A0A6J1EIW0 pentatricopeptide repeat-containing protein At2g013909.0e-28784.81Show/hide
Query:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW
        M  SN FS L+ NYVVTSAICKRIYQNISSK LHS HQYK+EKP +RFSRK RKGTK VKK++V+   YTRDTVRNIYNIL+NCSW SAQGH+E LPIRW
Subjt:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG
        DSYLINQVLKTHPPLEK WLFFNWASRLQ F+HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAI+VW+EMKA+G
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG

Query:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY
        CYPTVVSYTAYIKILLD+ ++++ATD YKEMLQSGLSPNCCTYT+LMEYLIGE K KEALDIF KMQDAG YPDKAACNILIQKCCKSGE LVMTQILEY
Subjt:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY

Query:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI
        MKEKRLVLRYPVFVEAHE LKSCSVS TLL QVNPHIEIESV KGEVVDV+TS N++ P+VD EL+A LLKE KL A+D+ILIG  DKNIQLDSSIILSI
Subjt:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI

Query:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY
        IEVNCK NRPN ALLAFDY  KNGV ++RNLYL LIG+LIRSSIY  LL+IVQ+MYT+GHCLGLYHATLI+Y LGKAGKPQYARKVFNMLPEELKC ATY
Subjt:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY

Query:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL
        TALV AYFSAGS GKGLKIYETMR KGFTPSLGTYNVLL+GLVKS RVVELDIYRREK  FEISH SH  TILEEERICDLLFGEL
Subjt:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL

A0A6J1I9C5 pentatricopeptide repeat-containing protein At2g013903.0e-29085.84Show/hide
Query:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW
        M  SNSFS L+ NYVVTSAICKRIYQNISSK LHS HQYK+EKP +RFSRK RKGTK VKK++V+   YTRDTVRNIYNIL+NCSWGSAQGH+E LPIRW
Subjt:  MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRW

Query:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG
        DSYLINQVLKTHPPLEK WLFFNWASRLQ FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAI+VW+EMKA+G
Subjt:  DSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHG

Query:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY
        CYPTVVSYTAYIKILLD+G++++ATDTYKEMLQSGLSPNCCTYT+LMEYLIGE K KEALDIF KMQDAGVYPDKAACNILIQKCCKSGE LVMTQILEY
Subjt:  CYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEY

Query:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI
        MKEKRLVLRYPVFVEAHE LKSCSVS TLL QVNPHIEIESV KGEVVDV+TS N++ P+VD EL+A LLKE KL A+D+ILIG  DKNIQLDSSIILSI
Subjt:  MKEKRLVLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSI

Query:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY
        IEVNCK NRPN ALLAFDY  KNGV ++RNLYL LIG+LIRSSIY  LL+IVQ MYT+GHCLGLYHATLI+Y LGKAGKPQYARKVFNMLPEELKC ATY
Subjt:  IEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATY

Query:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL
        TALV AYFSAGS GKGLKIYETMR KGFTPSLGTYNVLL+GLVKS RVVELDIYRREK  FEISH SH  TILEEERICDLLFGEL
Subjt:  TALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLFGEL

SwissProt top hitse value%identityAlignment
Q76C99 Protein Rf1, mitochondrial9.1e-2623.38Show/hide
Query:  DAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQD
        D V+YT++++     GD D A   + EM   G  P VV+Y + I  L  +  + +A +    M+++G+ P+C TY  ++      G+ KEA+    KM+ 
Subjt:  DAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQD

Query:  AGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKRL---VLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPN-VDNE
         GV PD    ++L+   CK+G  +   +I + M ++ L   +  Y   ++ + T  +    + LL                  D+   + I P + V + 
Subjt:  AGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKRL---VLRYPVFVEAHETLKSCSVSYTLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPN-VDNE

Query:  LLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSIIEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGL
        L+    K+ K+     +      + +  ++    ++I + CK  R   A+L F+     G++    +Y  LI  L   + + +  +++ +M  +G CL  
Subjt:  LLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSIIEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGL

Query:  YHATLIIYSLGKAGKPQYARKVFNMLPE--ELKCAATYTALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFE
             II S  K G+   + K+F ++          TY  L++ Y  AG   + +K+   M + G  P+  TY+ L+ G  K  R+ +  +  +E  S  
Subjt:  YHATLIIYSLGKAGKPQYARKVFNMLPE--ELKCAATYTALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVELDIYRREKNSFE

Query:  IS
        +S
Subjt:  IS

Q8GYP6 Pentatricopeptide repeat-containing protein At1g189008.8e-2933.33Show/hide
Query:  VRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++L+   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVY
        Y  L+H    +  ++ A+ V+ +M+  GC P  V+Y   I I   +G +  A D Y+ M   GLSP+  TY++++  L   G    A  +FC+M D G  
Subjt:  YTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVY

Query:  PDKAACNILIQKCCKS
        P+    NI++    K+
Subjt:  PDKAACNILIQKCCKS

Q9SSF9 Pentatricopeptide repeat-containing protein At1g747504.4e-2831.1Show/hide
Query:  ALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRD--TVRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRL
        ++HS         +  F + SR+  K+  +     R +      V N+ +IL+   WG +A+  L     R D+Y  NQVLK          FF W  R 
Subjt:  ALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRD--TVRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRL

Query:  QIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTY
          FKHD +TYTTM+   G A +   +N +  +M   G K + VTY  L+H    +  +  A+ V+ +M+  GC P  V+Y   I I   +G +  A D Y
Subjt:  QIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTY

Query:  KEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILI
        + M ++GLSP+  TY++++  L   G    A  +FC+M   G  P+    NI+I
Subjt:  KEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILI

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic1.6e-2524.47Show/hide
Query:  SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVD
        S  G+L ++        + + L+    LE+    F+   + +I K D  TY T+       G +    Y  ++M+E G  ++A +Y  L+H    S    
Subjt:  SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVD

Query:  GAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCK
         A++V++ M   G  P++ +Y++ +  L     I       KEM   GL PN  T+TI +  L   GK  EA +I  +M D G  PD     +LI   C 
Subjt:  GAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCK

Query:  SGERLVMTQILEYMKEKRLVLRYPVFVEAHETLKSCSVSY-TLLRQVNPHIEIESVGK--------GEVVDVNTSSNIVPPNVDNELLAILLKENKLTAI
        + +     ++ E MK  R               K   V+Y TLL + + + +++SV +        G V DV T + +V     + L            +
Subjt:  SGERLVMTQILEYMKEKRLVLRYPVFVEAHETLKSCSVSY-TLLRQVNPHIEIESVGK--------GEVVDVNTSSNIVPPNVDNELLAILLKENKLTAI

Query:  DYILIGTVDKNIQLDSSIILSIIEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAG
        D +    +  N+   +++I  ++ V    +R + AL  F      GV      Y+  I    +S      L+  +KM T+G    +      +YSL KAG
Subjt:  DYILIGTVDKNIQLDSSIILSIIEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAG

Query:  KPQYARKVFNMLPE--ELKCAATYTALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVE
        + + A+++F  L +   +  + TY  ++  Y   G   + +K+   M   G  P +   N L+  L K+ RV E
Subjt:  KPQYARKVFNMLPE--ELKCAATYTALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVE

Q9ZU29 Pentatricopeptide repeat-containing protein At2g013904.2e-15652.24Show/hide
Query:  SSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKV-DQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASR
        S K LHS  + K      RFS+K     K+VK   + D  +YTRD V NIYNILK  +W SAQ  L  L +RWDS++IN+VLK HPP++K WLFFNWA++
Subjt:  SSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKV-DQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASR

Query:  LQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDT
        ++ FKHD +TYTTMLDIFGEAGRI SM  VF  MKEKG+ ID VTYTSL+HW S+SGDVDGA+++W+EM+ +GC PTVVSYTAY+K+L   G+++EAT+ 
Subjt:  LQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDT

Query:  YKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKRLVLRYPVFVEAHETLKSCSVSY
        YKEML+S +SPNC TYT+LMEYL+  GKC+EALDIF KMQ+ GV PDKAACNILI K  K GE   MT++L YMKE  +VLRYP+FVEA ETLK+   S 
Subjt:  YKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKRLVLRYPVFVEAHETLKSCSVSY

Query:  TLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNE--LLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSIIEVNCKCNRPNSALLAFDYGFKNGV
         LLR+VN HI +ES+   ++ +  T+      N D+   + ++LL +  L A+D +L    D+NI+LDS ++ +IIE NC   R   A LAFDY  + G+
Subjt:  TLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNE--LLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSIIEVNCKCNRPNSALLAFDYGFKNGV

Query:  NIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATYTALVDAYFSAGSSGKGLKIYETMRN
        ++K++ YL LIG  +RS+  PK++++V++M    H LG Y   ++I+ LG   +P+ A  VF++LP++ K  A YTAL+D Y SAGS  K +KI   MR 
Subjt:  NIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATYTALVDAYFSAGSSGKGLKIYETMRN

Query:  KGFTPSLGTYNVLLTGLVK-SGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLF
        +   PSLGTY+VLL+GL K S    E+ + R+EK S   S R   N +  E++ICDLLF
Subjt:  KGFTPSLGTYNVLLTGLVK-SGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLF

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein6.2e-3033.33Show/hide
Query:  VRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++L+   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVY
        Y  L+H    +  ++ A+ V+ +M+  GC P  V+Y   I I   +G +  A D Y+ M   GLSP+  TY++++  L   G    A  +FC+M D G  
Subjt:  YTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVY

Query:  PDKAACNILIQKCCKS
        P+    NI++    K+
Subjt:  PDKAACNILIQKCCKS

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein6.2e-3033.33Show/hide
Query:  VRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++L+   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVY
        Y  L+H    +  ++ A+ V+ +M+  GC P  V+Y   I I   +G +  A D Y+ M   GLSP+  TY++++  L   G    A  +FC+M D G  
Subjt:  YTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVY

Query:  PDKAACNILIQKCCKS
        P+    NI++    K+
Subjt:  PDKAACNILIQKCCKS

AT1G18900.3 Pentatricopeptide repeat (PPR) superfamily protein6.2e-3033.33Show/hide
Query:  VRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT
        V N+ ++L+   WG +A+  L+ L +R D+Y  NQVLK          FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVT

Query:  YTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVY
        Y  L+H    +  ++ A+ V+ +M+  GC P  V+Y   I I   +G +  A D Y+ M   GLSP+  TY++++  L   G    A  +FC+M D G  
Subjt:  YTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVY

Query:  PDKAACNILIQKCCKS
        P+    NI++    K+
Subjt:  PDKAACNILIQKCCKS

AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein3.1e-2931.1Show/hide
Query:  ALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRD--TVRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRL
        ++HS         +  F + SR+  K+  +     R +      V N+ +IL+   WG +A+  L     R D+Y  NQVLK          FF W  R 
Subjt:  ALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRD--TVRNIYNILKNCSWG-SAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASRL

Query:  QIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTY
          FKHD +TYTTM+   G A +   +N +  +M   G K + VTY  L+H    +  +  A+ V+ +M+  GC P  V+Y   I I   +G +  A D Y
Subjt:  QIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDTY

Query:  KEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILI
        + M ++GLSP+  TY++++  L   G    A  +FC+M   G  P+    NI+I
Subjt:  KEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILI

AT2G01390.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-15752.24Show/hide
Query:  SSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKV-DQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASR
        S K LHS  + K      RFS+K     K+VK   + D  +YTRD V NIYNILK  +W SAQ  L  L +RWDS++IN+VLK HPP++K WLFFNWA++
Subjt:  SSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKV-DQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRWDSYLINQVLKTHPPLEKTWLFFNWASR

Query:  LQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDT
        ++ FKHD +TYTTMLDIFGEAGRI SM  VF  MKEKG+ ID VTYTSL+HW S+SGDVDGA+++W+EM+ +GC PTVVSYTAY+K+L   G+++EAT+ 
Subjt:  LQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQIKEATDT

Query:  YKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKRLVLRYPVFVEAHETLKSCSVSY
        YKEML+S +SPNC TYT+LMEYL+  GKC+EALDIF KMQ+ GV PDKAACNILI K  K GE   MT++L YMKE  +VLRYP+FVEA ETLK+   S 
Subjt:  YKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKRLVLRYPVFVEAHETLKSCSVSY

Query:  TLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNE--LLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSIIEVNCKCNRPNSALLAFDYGFKNGV
         LLR+VN HI +ES+   ++ +  T+      N D+   + ++LL +  L A+D +L    D+NI+LDS ++ +IIE NC   R   A LAFDY  + G+
Subjt:  TLLRQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNE--LLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSIIEVNCKCNRPNSALLAFDYGFKNGV

Query:  NIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATYTALVDAYFSAGSSGKGLKIYETMRN
        ++K++ YL LIG  +RS+  PK++++V++M    H LG Y   ++I+ LG   +P+ A  VF++LP++ K  A YTAL+D Y SAGS  K +KI   MR 
Subjt:  NIKRNLYLCLIGILIRSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATYTALVDAYFSAGSSGKGLKIYETMRN

Query:  KGFTPSLGTYNVLLTGLVK-SGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLF
        +   PSLGTY+VLL+GL K S    E+ + R+EK S   S R   N +  E++ICDLLF
Subjt:  KGFTPSLGTYNVLLTGLVK-SGRVVELDIYRREKNSFEISHRSHLNTILEEERICDLLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTATTCTAATAGTTTTTCTTTACTTCTGTGTAACTATGTGGTTACCTCTGCCATTTGTAAAAGGATTTATCAAAATATTTCCTCTAAAGCCTTGCATTCCTTCCA
CCAATACAAACGAGAGAAACCCATCACACGATTCAGTAGAAAGTCAAGGAAGGGAACTAAGATAGTTAAGAAGGATAAAGTAGATCAAAGGCTTTACACTAGAGATACAG
TGAGGAACATATACAATATTCTGAAAAATTGCTCATGGGGTTCTGCTCAAGGACACCTAGAGATGCTACCTATAAGATGGGATTCTTATCTCATCAACCAGGTTCTGAAA
ACACATCCACCATTGGAGAAGACATGGTTATTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAACATGACCAGTATACGTACACGACAATGCTGGATATTTTTGGAGA
AGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACATATACTTCATTAATGCACTGGCGTTCAAATTCAG
GAGATGTTGATGGAGCTATAAAGGTTTGGAAGGAAATGAAAGCCCATGGCTGCTATCCAACGGTAGTTTCGTATACTGCTTATATAAAGATTTTGTTGGACAGTGGCCAA
ATTAAGGAGGCCACTGATACATACAAGGAGATGCTTCAATCTGGGCTATCTCCAAATTGTTGTACTTACACCATCTTAATGGAATACCTCATTGGAGAGGGTAAATGCAA
AGAAGCCCTTGATATTTTTTGCAAAATGCAAGATGCTGGAGTATATCCTGATAAAGCGGCTTGCAATATATTGATTCAGAAATGCTGTAAATCAGGGGAGAGGCTAGTGA
TGACACAAATCCTTGAGTACATGAAAGAAAAACGCCTTGTGCTTCGATACCCTGTGTTTGTTGAAGCACATGAAACTTTAAAAAGTTGTTCTGTTAGTTATACCCTACTC
AGGCAAGTTAATCCTCATATAGAAATTGAATCAGTTGGTAAGGGCGAGGTTGTGGATGTTAATACAAGTTCTAATATTGTTCCTCCCAATGTAGATAACGAGCTTTTGGC
AATTCTGTTGAAAGAGAATAAACTTACTGCTATTGACTACATACTCATTGGGACAGTAGATAAGAACATACAGTTGGATTCTTCAATTATTTTATCCATCATTGAGGTGA
ATTGCAAATGTAATAGACCCAACAGTGCTCTACTGGCTTTCGACTACGGTTTTAAAAATGGTGTTAACATTAAGAGAAATCTGTATCTTTGCTTGATTGGGATTCTGATA
CGATCGAGTATATATCCGAAGTTGTTGCAAATTGTTCAGAAAATGTATACACAAGGGCATTGCCTTGGACTCTATCATGCCACACTTATAATTTATAGTCTTGGCAAAGC
TGGAAAACCTCAATATGCAAGGAAAGTTTTTAATATGTTGCCTGAAGAATTGAAGTGCGCTGCAACTTACACTGCTCTGGTTGATGCTTATTTCTCTGCTGGAAGTTCTG
GTAAAGGGCTTAAAATTTACGAAACAATGCGAAATAAAGGATTTACACCATCTTTAGGCACGTATAATGTGCTGTTAACTGGTCTTGTGAAGAGTGGTAGAGTTGTTGAA
TTAGATATTTATAGAAGGGAGAAGAATAGTTTTGAGATTAGTCATCGTTCTCATCTCAATACAATATTGGAGGAAGAAAGGATTTGTGATCTTCTTTTTGGAGAATTGTC
GATGCGCCACCATCCGACGAGGTTCCATTTCAGGAATGCCAGCGTGGTGTGGGATCCAGATGAGGAGAACTGCATTACGCGTCTTCAGGGTTGGAAACTTGGGGTTTCTC
CAACTGCCAGTCATGATGAGGATGCTCTGAAGGATGCAACTTCTGTACCCGGCCAGATGAATGCTCAACCATCCCGTCCTATGCAGACCCATTTACTGCCTTATCGTCGC
TTTGAGCATATGGCAGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATTATTCTAATAGTTTTTCTTTACTTCTGTGTAACTATGTGGTTACCTCTGCCATTTGTAAAAGGATTTATCAAAATATTTCCTCTAAAGCCTTGCATTCCTTCCA
CCAATACAAACGAGAGAAACCCATCACACGATTCAGTAGAAAGTCAAGGAAGGGAACTAAGATAGTTAAGAAGGATAAAGTAGATCAAAGGCTTTACACTAGAGATACAG
TGAGGAACATATACAATATTCTGAAAAATTGCTCATGGGGTTCTGCTCAAGGACACCTAGAGATGCTACCTATAAGATGGGATTCTTATCTCATCAACCAGGTTCTGAAA
ACACATCCACCATTGGAGAAGACATGGTTATTCTTCAATTGGGCCTCTAGGCTGCAAATCTTCAAACATGACCAGTATACGTACACGACAATGCTGGATATTTTTGGAGA
AGCTGGGAGAATTTCATCCATGAATTATGTATTTCAACAGATGAAGGAGAAGGGGATAAAGATAGATGCAGTTACATATACTTCATTAATGCACTGGCGTTCAAATTCAG
GAGATGTTGATGGAGCTATAAAGGTTTGGAAGGAAATGAAAGCCCATGGCTGCTATCCAACGGTAGTTTCGTATACTGCTTATATAAAGATTTTGTTGGACAGTGGCCAA
ATTAAGGAGGCCACTGATACATACAAGGAGATGCTTCAATCTGGGCTATCTCCAAATTGTTGTACTTACACCATCTTAATGGAATACCTCATTGGAGAGGGTAAATGCAA
AGAAGCCCTTGATATTTTTTGCAAAATGCAAGATGCTGGAGTATATCCTGATAAAGCGGCTTGCAATATATTGATTCAGAAATGCTGTAAATCAGGGGAGAGGCTAGTGA
TGACACAAATCCTTGAGTACATGAAAGAAAAACGCCTTGTGCTTCGATACCCTGTGTTTGTTGAAGCACATGAAACTTTAAAAAGTTGTTCTGTTAGTTATACCCTACTC
AGGCAAGTTAATCCTCATATAGAAATTGAATCAGTTGGTAAGGGCGAGGTTGTGGATGTTAATACAAGTTCTAATATTGTTCCTCCCAATGTAGATAACGAGCTTTTGGC
AATTCTGTTGAAAGAGAATAAACTTACTGCTATTGACTACATACTCATTGGGACAGTAGATAAGAACATACAGTTGGATTCTTCAATTATTTTATCCATCATTGAGGTGA
ATTGCAAATGTAATAGACCCAACAGTGCTCTACTGGCTTTCGACTACGGTTTTAAAAATGGTGTTAACATTAAGAGAAATCTGTATCTTTGCTTGATTGGGATTCTGATA
CGATCGAGTATATATCCGAAGTTGTTGCAAATTGTTCAGAAAATGTATACACAAGGGCATTGCCTTGGACTCTATCATGCCACACTTATAATTTATAGTCTTGGCAAAGC
TGGAAAACCTCAATATGCAAGGAAAGTTTTTAATATGTTGCCTGAAGAATTGAAGTGCGCTGCAACTTACACTGCTCTGGTTGATGCTTATTTCTCTGCTGGAAGTTCTG
GTAAAGGGCTTAAAATTTACGAAACAATGCGAAATAAAGGATTTACACCATCTTTAGGCACGTATAATGTGCTGTTAACTGGTCTTGTGAAGAGTGGTAGAGTTGTTGAA
TTAGATATTTATAGAAGGGAGAAGAATAGTTTTGAGATTAGTCATCGTTCTCATCTCAATACAATATTGGAGGAAGAAAGGATTTGTGATCTTCTTTTTGGAGAATTGTC
GATGCGCCACCATCCGACGAGGTTCCATTTCAGGAATGCCAGCGTGGTGTGGGATCCAGATGAGGAGAACTGCATTACGCGTCTTCAGGGTTGGAAACTTGGGGTTTCTC
CAACTGCCAGTCATGATGAGGATGCTCTGAAGGATGCAACTTCTGTACCCGGCCAGATGAATGCTCAACCATCCCGTCCTATGCAGACCCATTTACTGCCTTATCGTCGC
TTTGAGCATATGGCAGCCTAA
Protein sequenceShow/hide protein sequence
MHYSNSFSLLLCNYVVTSAICKRIYQNISSKALHSFHQYKREKPITRFSRKSRKGTKIVKKDKVDQRLYTRDTVRNIYNILKNCSWGSAQGHLEMLPIRWDSYLINQVLK
THPPLEKTWLFFNWASRLQIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGIKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKAHGCYPTVVSYTAYIKILLDSGQ
IKEATDTYKEMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFCKMQDAGVYPDKAACNILIQKCCKSGERLVMTQILEYMKEKRLVLRYPVFVEAHETLKSCSVSYTLL
RQVNPHIEIESVGKGEVVDVNTSSNIVPPNVDNELLAILLKENKLTAIDYILIGTVDKNIQLDSSIILSIIEVNCKCNRPNSALLAFDYGFKNGVNIKRNLYLCLIGILI
RSSIYPKLLQIVQKMYTQGHCLGLYHATLIIYSLGKAGKPQYARKVFNMLPEELKCAATYTALVDAYFSAGSSGKGLKIYETMRNKGFTPSLGTYNVLLTGLVKSGRVVE
LDIYRREKNSFEISHRSHLNTILEEERICDLLFGELSMRHHPTRFHFRNASVVWDPDEENCITRLQGWKLGVSPTASHDEDALKDATSVPGQMNAQPSRPMQTHLLPYRR
FEHMAA