; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003454 (gene) of Snake gourd v1 genome

Gene IDTan0003454
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG05:60816167..60817757
RNA-Seq ExpressionTan0003454
SyntenyTan0003454
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012718.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-25783.84Show/hide
Query:  MNSVPAHTAIP--TQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITL
        M+SVPAHTAIP   QLQQY+N       SNP+ +N PR P+SS          N I PIVLWTSSIARYCRN QLAEAA EFTRMRL GVEPNHIT ITL
Subjt:  MNSVPAHTAIP--TQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITL

Query:  LSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL+ G+SLHGY RK GLDTGHV+VGTALI MYAKC+QLGLAR VFD+L MKNSV+WNTMLDGYMRNGE+ELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNS
        ING LKQGYSEQAL+CFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+ QQ+F DNIRISNSLIDMYSRCGCIEF+ QVF+KMPKRTLVSWNS
Subjt:  INGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNS

Query:  IIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVV
        +IVGFA NGFADESLEFFDAMQKEGFK DGVSYTGALTACSHAGLVNKGLELFD MKRVHRITPRIEHYGCIVDLY RAGRL+EALNVIE MPMKPNEVV
Subjt:  IIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLEL
        LGSLLAACRTHGDVSLAERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGAN VRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYHADA++IYSMLE+
Subjt:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLEL

Query:  LFHELKICGYVPETDAFLNIKESSKD
        LFHELKICGYVPET  F+N  ESSK+
Subjt:  LFHELKICGYVPETDAFLNIKESSKD

XP_022142716.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Momordica charantia]5.4e-27388.74Show/hide
Query:  MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLS
        M+S+PA+TA   QLQQY NP   IPL NP  IN PRS +SSNR+ISSKST NSIDPIVLWTSSIARYCRNGQLAEAA EFT MRL GVEPNH+TLITLLS
Subjt:  MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLS

Query:  GCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALIN
        GCADFPS+SLY GSSLHGYARK GLDT HV+VGT+++DMYAKC+QLGLAR+VFD+L MKNSVSWNTMLDGY RNGE+ELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSII
        GLLKQGYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFV QQ+F DNIRISNSLIDMYSRCGCI FA QVFE+M KRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSII

Query:  VGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLG
        VG+AANGFADESLEFFDAMQKEGFKPD VSYTGALTACSHAGLVNKGLELFD MKRVHRI PRIEHYGCIVDLYGRAGRLE+AL+VIEKMPMKPNEVVLG
Subjt:  VGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLF
        SLLAACRTHGDVSLAERLMKHL KLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADA+SIYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLF

Query:  HELKICGYVPETDAFLNIKESSKD
        HELKICG VPET+ FLN KESSKD
Subjt:  HELKICGYVPETDAFLNIKESSKD

XP_022945035.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita moschata]1.4e-25783.84Show/hide
Query:  MNSVPAHTAIP--TQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITL
        M+SVPAHTAIP   QLQQY+N       SNP+ +N PR P+SS          N I PIVLWTSSIARYCRN QL EAA EFTRMRL GVEPNHIT ITL
Subjt:  MNSVPAHTAIP--TQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITL

Query:  LSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL+ G+SLHGY RK GLDTGHV+VGTALI MYAKC+QLGLAR VFD+L MKNSV+WNTMLDGYMRNGE+ELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNS
        ING LKQGYSEQAL+CFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+ QQ+F DNIRISNSLIDMYSRCGCIEFA QVF+KMPKRTLVSWNS
Subjt:  INGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNS

Query:  IIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVV
        +IVGFA NGFADESLEFFDAMQKEGFK DGVSYTGALTACSHAGLVNKGLELFD MKRVHRITPRIEHYGCIVDLY RAGRL+EALNVIE MPMKPNEVV
Subjt:  IIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLEL
        LGSLLAACRTHGDVSLAERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGAN VRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYHADA++IYSMLE+
Subjt:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLEL

Query:  LFHELKICGYVPETDAFLNIKESSKD
        LFHELKICGYVPET  F+N  ESSK+
Subjt:  LFHELKICGYVPETDAFLNIKESSKD

XP_022967078.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita maxima]3.4e-25984.03Show/hide
Query:  MNSVPAHTAIP--TQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITL
        M+SVP+HT IP   QLQQY+NPP PIP SNP+ ++ PR+P+SS          N I PIVLWTSSIARYCRN QLAEAA EFTRMRL GVEPNHIT ITL
Subjt:  MNSVPAHTAIP--TQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITL

Query:  LSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL+ G+SLHGY RK GLDTGHV+VGTALI MYAKC+QLGLAR VFD+LAMKNSV+WNTMLDGYMRNGE+ELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNS
        ING LKQGYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+ QQ+F DNIRISNSLIDMYSRCGCIEFA QVF+KM K TLVSWNS
Subjt:  INGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNS

Query:  IIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVV
        +IVGFA NGFADESLEFFDAMQKEGF  DGVSYTGALTACSHAGLVNKGLELFD MKRVHRITPRIEHYGCIVDLY RAGRL+EALNVIE MPMKPNEVV
Subjt:  IIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLEL
        LGSLLAACRTHGDVSLAERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYH DA++IYSMLE+
Subjt:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLEL

Query:  LFHELKICGYVPETDAFLNIKESSKD
        LFHELKI GYVPET  F+N  ESSK+
Subjt:  LFHELKICGYVPETDAFLNIKESSKD

XP_038877228.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Benincasa hispida]2.4e-27387.98Show/hide
Query:  MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLS
        M+S P++TAIP+QLQQY NPP  IPLSNPTK+N PRSP+SS+RNISSK   NSIDPIVLWTSS+ARYCRNGQL+EAATEFTRMRL GVEPNH+T ITLLS
Subjt:  MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLS

Query:  GCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALIN
        GC DFPS+SL+ GSSLHGYARK GLDTGHV+VGTAL+DMYAKC+Q  LARKVFD+L MKNSV+WNTMLDGY RNGE+ELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSII
        GLLKQG+SEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFV QQ+F DNIRISNSL+DMYSRCGCIEFA QVFEKMPKRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSII

Query:  VGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLG
        VGFA NGFADESLEFFDAMQ EGFKPDGVSYTGALTACSHAGLVNKGLELFD MKR+H+ITPRIEHYGCIVDLYGRAGRLE+ALNVIE+MPMKPNEVVLG
Subjt:  VGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLF
        SLLAACRT+GDVSLAE+LMKHL KLDP GDSNYVLLSNIYAAIG+WEGANKVRRTMKARGVQKKPG SSVEIDGKVHEFVAGDKYHADA++IYSMLELLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLF

Query:  HELKICGYVPETDAFLNIKESSKD
        HELKI GYVP+T+  LN KE SKD
Subjt:  HELKICGYVPETDAFLNIKESSKD

TrEMBL top hitse value%identityAlignment
A0A0A0LYD6 Uncharacterized protein7.7e-25783.4Show/hide
Query:  MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLS
        M+S+P+HTA P+QLQ     P  IPLSNPTK+N PRSP+S +RNISSK   NS+DPIVLWTSS+ARYCRNGQL+EAA EFTRMRL GVEPNHIT ITLLS
Subjt:  MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLS

Query:  GCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALIN
         CADFPS+S +  SSLHGYA K+GLDTGHV+VGTALIDMY+KC+QLG ARKVF  L +KNSVSWNTML+G+MRNGE+ELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSII
        GLLK GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALTLGLWV+RFV  Q+F DNI+ISNSLIDMYSRCGCIEFA QVF KM KRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSII

Query:  VGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFD MK VH+ITPRIEHYGCIVDLYGRAGRLE+ALN+IE+MPMKPNEVVLG
Subjt:  VGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLF
        SLLAACRTHGDV+LAERLMKHLFKLDP GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKKPG+SSVEIDGKVHEFVAGD YHADA++IYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLF

Query:  HELKICGYVPETDAFLNIKESSKD
        HELK+CGYVP +D  LN KES+KD
Subjt:  HELKICGYVPETDAFLNIKESSKD

A0A5A7UJB6 Pentatricopeptide repeat-containing protein1.7e-25684.35Show/hide
Query:  MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLS
        M+S+P+H A P+QLQQ   P   IPLSNPTK+N PRSP S + NI SK T NS+ PIV WTSSIARYC NGQL EAA EFTRMRL GVEPNHIT ITLLS
Subjt:  MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLS

Query:  GCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALIN
        GCADFPS+S +  SSLHGYA KFGLDTGHV+VGTALIDMY+KCSQLGLA+KVFD+L +KNSVSWNTML+G+MRNGE+ELAI LFDEMPTRDAISWTALIN
Subjt:  GCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSII
        GLLK GYSEQALECFHQMQ SG+  DYVSIIAVLAACADLGALT GLWVNRFV QQ+F DN+RISNSLIDMYSRCGCIEFA QVF KM KRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSII

Query:  VGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLG
        VGFA NGFADESLEFF AMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFD MKRVH+ITP IEHYGCIVDLYGRAGRLE+A NVIE+MPMKPNEVVLG
Subjt:  VGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLF
        SLLAACRTHGDV LAERLMKH+FKLD  GDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKK G+SSVEIDGKVHEFVAGDKYHADA++IYSML+LLF
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLF

Query:  HELKICGYVPETDAFLNIKESSKD
        HELK+CGYVP+TD  LN K+S+KD
Subjt:  HELKICGYVPETDAFLNIKESSKD

A0A6J1CN07 pentatricopeptide repeat-containing protein At1g05750, chloroplastic2.6e-27388.74Show/hide
Query:  MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLS
        M+S+PA+TA   QLQQY NP   IPL NP  IN PRS +SSNR+ISSKST NSIDPIVLWTSSIARYCRNGQLAEAA EFT MRL GVEPNH+TLITLLS
Subjt:  MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLS

Query:  GCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALIN
        GCADFPS+SLY GSSLHGYARK GLDT HV+VGT+++DMYAKC+QLGLAR+VFD+L MKNSVSWNTMLDGY RNGE+ELAIDLFDEMPTRDAISWTALIN
Subjt:  GCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALIN

Query:  GLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSII
        GLLKQGYSEQALECFHQMQCSGI+PDYVSIIAVLAACADLG LTLGLWVNRFV QQ+F DNIRISNSLIDMYSRCGCI FA QVFE+M KRTLVSWNSII
Subjt:  GLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSII

Query:  VGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLG
        VG+AANGFADESLEFFDAMQKEGFKPD VSYTGALTACSHAGLVNKGLELFD MKRVHRI PRIEHYGCIVDLYGRAGRLE+AL+VIEKMPMKPNEVVLG
Subjt:  VGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLG

Query:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLF
        SLLAACRTHGDVSLAERLMKHL KLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADA+SIYSML+LL 
Subjt:  SLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLF

Query:  HELKICGYVPETDAFLNIKESSKD
        HELKICG VPET+ FLN KESSKD
Subjt:  HELKICGYVPETDAFLNIKESSKD

A0A6J1FZR8 pentatricopeptide repeat-containing protein At1g05750, chloroplastic7.0e-25883.84Show/hide
Query:  MNSVPAHTAIP--TQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITL
        M+SVPAHTAIP   QLQQY+N       SNP+ +N PR P+SS          N I PIVLWTSSIARYCRN QL EAA EFTRMRL GVEPNHIT ITL
Subjt:  MNSVPAHTAIP--TQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITL

Query:  LSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL+ G+SLHGY RK GLDTGHV+VGTALI MYAKC+QLGLAR VFD+L MKNSV+WNTMLDGYMRNGE+ELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNS
        ING LKQGYSEQAL+CFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+ QQ+F DNIRISNSLIDMYSRCGCIEFA QVF+KMPKRTLVSWNS
Subjt:  INGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNS

Query:  IIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVV
        +IVGFA NGFADESLEFFDAMQKEGFK DGVSYTGALTACSHAGLVNKGLELFD MKRVHRITPRIEHYGCIVDLY RAGRL+EALNVIE MPMKPNEVV
Subjt:  IIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLEL
        LGSLLAACRTHGDVSLAERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGAN VRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYHADA++IYSMLE+
Subjt:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLEL

Query:  LFHELKICGYVPETDAFLNIKESSKD
        LFHELKICGYVPET  F+N  ESSK+
Subjt:  LFHELKICGYVPETDAFLNIKESSKD

A0A6J1HU31 pentatricopeptide repeat-containing protein At1g05750, chloroplastic1.7e-25984.03Show/hide
Query:  MNSVPAHTAIP--TQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITL
        M+SVP+HT IP   QLQQY+NPP PIP SNP+ ++ PR+P+SS          N I PIVLWTSSIARYCRN QLAEAA EFTRMRL GVEPNHIT ITL
Subjt:  MNSVPAHTAIP--TQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITL

Query:  LSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTAL
        LSGCADFPS SL+ G+SLHGY RK GLDTGHV+VGTALI MYAKC+QLGLAR VFD+LAMKNSV+WNTMLDGYMRNGE+ELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTAL

Query:  INGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNS
        ING LKQGYSEQALECFH+MQCSGIEPDYVSIIAVLAACADLGAL+ GLWVNRF+ QQ+F DNIRISNSLIDMYSRCGCIEFA QVF+KM K TLVSWNS
Subjt:  INGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNS

Query:  IIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVV
        +IVGFA NGFADESLEFFDAMQKEGF  DGVSYTGALTACSHAGLVNKGLELFD MKRVHRITPRIEHYGCIVDLY RAGRL+EALNVIE MPMKPNEVV
Subjt:  IIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLEL
        LGSLLAACRTHGDVSLAERL+K+LF+LDPGGDS+YVLLSNIYAA+G+WEGANKVRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYH DA++IYSMLE+
Subjt:  LGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLEL

Query:  LFHELKICGYVPETDAFLNIKESSKD
        LFHELKI GYVPET  F+N  ESSK+
Subjt:  LFHELKICGYVPETDAFLNIKESSKD

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic8.9e-10136.97Show/hide
Query:  IVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFL
        +V W S I  + + G   +A   F +M    V+ +H+T++ +LS CA     +L  G  +  Y  +  ++  ++ +  A++DMY KC  +  A+++FD +
Subjt:  IVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFL

Query:  AMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQ
          K++V+W TMLDGY  + + E A ++ + MP +D ++W ALI+   + G   +AL  FH++Q    ++ + +++++ L+ACA +GAL LG W++ ++ +
Subjt:  AMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQ

Query:  QDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMK
             N  ++++LI MYS+CG +E + +VF  + KR +  W+++I G A +G  +E+++ F  MQ+   KP+GV++T    ACSH GLV++   LF +M+
Subjt:  QDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMK

Query:  RVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRT
          + I P  +HY CIVD+ GR+G LE+A+  IE MP+ P+  V G+LL AC+ H +++LAE     L +L+P  D  +VLLSNIYA +GKWE  +++R+ 
Subjt:  RVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRT

Query:  MKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFLNIKESSK
        M+  G++K+PG SS+EIDG +HEF++GD  H  +E +Y  L  +  +LK  GY PE    L I E  +
Subjt:  MKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFLNIKESSK

Q9LSB8 Putative pentatricopeptide repeat-containing protein At3g159304.7e-10237.34Show/hide
Query:  RNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLAR
        R   + +  W   I+ Y R  +  E+      M    V P  +TL+ +LS C+      L     +H Y  +   +   + +  AL++ YA C ++ +A 
Subjt:  RNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLAR

Query:  KVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVN
        ++F  +  ++ +SW +++ GY+  G ++LA   FD+MP RD ISWT +I+G L+ G   ++LE F +MQ +G+ PD  ++++VL ACA LG+L +G W+ 
Subjt:  KVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVN

Query:  RFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL
         ++++    +++ + N+LIDMY +CGC E A +VF  M +R   +W +++VG A NG   E+++ F  MQ    +PD ++Y G L+AC+H+G+V++  + 
Subjt:  RFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL

Query:  FDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGAN
        F KM+  HRI P + HYGC+VD+ GRAG ++EA  ++ KMPM PN +V G+LL A R H D  +AE   K + +L+P   + Y LL NIYA   +W+   
Subjt:  FDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGAN

Query:  KVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFL
        +VRR +    ++K PGFS +E++G  HEFVAGDK H  +E IY  LE L  E     Y+P+T   L
Subjt:  KVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFL

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226901.3e-10439.4Show/hide
Query:  LWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAM
        L  +  + Y R G   EA   F  M  +GV P+ I++++ +S C+     ++  G S HGY  + G ++   +   ALIDMY KC +   A ++FD ++ 
Subjt:  LWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAM

Query:  KNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQD
        K  V+WN+++ GY+ NGEV+ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LGAL L  W+  ++ +  
Subjt:  KNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQD

Query:  FNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRV
           ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A  G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G E+F  M ++
Subjt:  FNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRV

Query:  HRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H ++P   HYGC+VDL GRAG LEEA+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPE-TDAFLNIKESSK
         +G++K PG SS++I GK HEF +GD+ H +  +I +ML+ +       G+VP+ ++  +++ E  K
Subjt:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPE-TDAFLNIKESSK

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic1.9e-16759.51Show/hide
Query:  PPIPLSNPTKINIPR--SPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGY
        P + +++P  I      +P     N S+  T       V WTS I    RNG+LAEAA EF+ M L GVEPNHIT I LLSGC DF S S  LG  LHGY
Subjt:  PPIPLSNPTKINIPR--SPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGY

Query:  ARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQ
        A K GLD  HV+VGTA+I MY+K  +   AR VFD++  KNSV+WNTM+DGYMR+G+V+ A  +FD+MP RD ISWTA+ING +K+GY E+AL  F +MQ
Subjt:  ARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQ

Query:  CSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAM
         SG++PDYV+IIA L AC +LGAL+ GLWV+R+V  QDF +N+R+SNSLID+Y RCGC+EFA QVF  M KRT+VSWNS+IVGFAANG A ESL +F  M
Subjt:  CSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAM

Query:  QKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHG-DVSLAERL
        Q++GFKPD V++TGALTACSH GLV +GL  F  MK  +RI+PRIEHYGC+VDLY RAGRLE+AL +++ MPMKPNEVV+GSLLAAC  HG ++ LAERL
Subjt:  QKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHG-DVSLAERL

Query:  MKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPET
        MKHL  L+    SNYV+LSN+YAA GKWEGA+K+RR MK  G++K+PGFSS+EID  +H F+AGD  H +   I  +LEL+  +L++ G V ET
Subjt:  MKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPET

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic6.8e-10138.78Show/hide
Query:  SIDP-IVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARK
        +IDP + L+T++I     NG   +A   + ++  + + PN  T  +LL  C      S   G  +H +  KFGL      V T L+D+YAK   +  A+K
Subjt:  SIDP-IVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARK

Query:  VFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN
        VFD +  ++ VS   M+  Y + G VE A  LFD M  RD +SW  +I+G  + G+   AL  F ++   G  +PD ++++A L+AC+ +GAL  G W++
Subjt:  VFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN

Query:  RFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGLE
         FV       N+++   LIDMYS+CG +E A  VF   P++ +V+WN++I G+A +G++ ++L  F+ MQ   G +P  +++ G L AC+HAGLVN+G+ 
Subjt:  RFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGLE

Query:  LFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA
        +F+ M + + I P+IEHYGC+V L GRAG+L+ A   I+ M M  + V+  S+L +C+ HGD  L + + ++L  L+      YVLLSNIYA++G +EG 
Subjt:  LFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA

Query:  NKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFL-NIKESSKDR
         KVR  MK +G+ K+PG S++EI+ KVHEF AGD+ H+ ++ IY+ML  +   +K  GYVP T+  L +++E+ K++
Subjt:  NKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFL-NIKESSKDR

Arabidopsis top hitse value%identityAlignment
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-16859.51Show/hide
Query:  PPIPLSNPTKINIPR--SPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGY
        P + +++P  I      +P     N S+  T       V WTS I    RNG+LAEAA EF+ M L GVEPNHIT I LLSGC DF S S  LG  LHGY
Subjt:  PPIPLSNPTKINIPR--SPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGY

Query:  ARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQ
        A K GLD  HV+VGTA+I MY+K  +   AR VFD++  KNSV+WNTM+DGYMR+G+V+ A  +FD+MP RD ISWTA+ING +K+GY E+AL  F +MQ
Subjt:  ARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQ

Query:  CSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAM
         SG++PDYV+IIA L AC +LGAL+ GLWV+R+V  QDF +N+R+SNSLID+Y RCGC+EFA QVF  M KRT+VSWNS+IVGFAANG A ESL +F  M
Subjt:  CSGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAM

Query:  QKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHG-DVSLAERL
        Q++GFKPD V++TGALTACSH GLV +GL  F  MK  +RI+PRIEHYGC+VDLY RAGRLE+AL +++ MPMKPNEVV+GSLLAAC  HG ++ LAERL
Subjt:  QKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHG-DVSLAERL

Query:  MKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPET
        MKHL  L+    SNYV+LSN+YAA GKWEGA+K+RR MK  G++K+PGFSS+EID  +H F+AGD  H +   I  +LEL+  +L++ G V ET
Subjt:  MKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPET

AT3G15930.1 Pentatricopeptide repeat (PPR) superfamily protein3.3e-10337.34Show/hide
Query:  RNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLAR
        R   + +  W   I+ Y R  +  E+      M    V P  +TL+ +LS C+      L     +H Y  +   +   + +  AL++ YA C ++ +A 
Subjt:  RNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLAR

Query:  KVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVN
        ++F  +  ++ +SW +++ GY+  G ++LA   FD+MP RD ISWT +I+G L+ G   ++LE F +MQ +G+ PD  ++++VL ACA LG+L +G W+ 
Subjt:  KVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSGIEPDYVSIIAVLAACADLGALTLGLWVN

Query:  RFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL
         ++++    +++ + N+LIDMY +CGC E A +VF  M +R   +W +++VG A NG   E+++ F  MQ    +PD ++Y G L+AC+H+G+V++  + 
Subjt:  RFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL

Query:  FDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGAN
        F KM+  HRI P + HYGC+VD+ GRAG ++EA  ++ KMPM PN +V G+LL A R H D  +AE   K + +L+P   + Y LL NIYA   +W+   
Subjt:  FDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGAN

Query:  KVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFL
        +VRR +    ++K PGFS +E++G  HEFVAGDK H  +E IY  LE L  E     Y+P+T   L
Subjt:  KVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFL

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)9.4e-10639.4Show/hide
Query:  LWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAM
        L  +  + Y R G   EA   F  M  +GV P+ I++++ +S C+     ++  G S HGY  + G ++   +   ALIDMY KC +   A ++FD ++ 
Subjt:  LWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAM

Query:  KNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQD
        K  V+WN+++ GY+ NGEV+ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LGAL L  W+  ++ +  
Subjt:  KNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQD

Query:  FNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRV
           ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A  G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G E+F  M ++
Subjt:  FNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRV

Query:  HRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H ++P   HYGC+VDL GRAG LEEA+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPE-TDAFLNIKESSK
         +G++K PG SS++I GK HEF +GD+ H +  +I +ML+ +       G+VP+ ++  +++ E  K
Subjt:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPE-TDAFLNIKESSK

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification9.4e-10639.4Show/hide
Query:  LWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAM
        L  +  + Y R G   EA   F  M  +GV P+ I++++ +S C+     ++  G S HGY  + G ++   +   ALIDMY KC +   A ++FD ++ 
Subjt:  LWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAM

Query:  KNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQD
        K  V+WN+++ GY+ NGEV+ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ   G+  D V+++++ +AC  LGAL L  W+  ++ +  
Subjt:  KNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC-SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQD

Query:  FNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRV
           ++R+  +L+DM+SRCG  E A  +F  +  R + +W + I   A  G A+ ++E FD M ++G KPDGV++ GALTACSH GLV +G E+F  M ++
Subjt:  FNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDKMKRV

Query:  HRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK
        H ++P   HYGC+VDL GRAG LEEA+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    +YVLLSN+YA+ G+W    KVR +MK
Subjt:  HRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGANKVRRTMK

Query:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPE-TDAFLNIKESSK
         +G++K PG SS++I GK HEF +GD+ H +  +I +ML+ +       G+VP+ ++  +++ E  K
Subjt:  ARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPE-TDAFLNIKESSK

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.8e-10238.78Show/hide
Query:  SIDP-IVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARK
        +IDP + L+T++I     NG   +A   + ++  + + PN  T  +LL  C      S   G  +H +  KFGL      V T L+D+YAK   +  A+K
Subjt:  SIDP-IVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSLYLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARK

Query:  VFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN
        VFD +  ++ VS   M+  Y + G VE A  LFD M  RD +SW  +I+G  + G+   AL  F ++   G  +PD ++++A L+AC+ +GAL  G W++
Subjt:  VFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQCSG-IEPDYVSIIAVLAACADLGALTLGLWVN

Query:  RFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGLE
         FV       N+++   LIDMYS+CG +E A  VF   P++ +V+WN++I G+A +G++ ++L  F+ MQ   G +P  +++ G L AC+HAGLVN+G+ 
Subjt:  RFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQK-EGFKPDGVSYTGALTACSHAGLVNKGLE

Query:  LFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA
        +F+ M + + I P+IEHYGC+V L GRAG+L+ A   I+ M M  + V+  S+L +C+ HGD  L + + ++L  L+      YVLLSNIYA++G +EG 
Subjt:  LFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIYAAIGKWEGA

Query:  NKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFL-NIKESSKDR
         KVR  MK +G+ K+PG S++EI+ KVHEF AGD+ H+ ++ IY+ML  +   +K  GYVP T+  L +++E+ K++
Subjt:  NKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFL-NIKESSKDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAGCGTTCCAGCACACACAGCCATTCCGACCCAACTCCAACAATATACTAATCCGCCTCCTCCAATCCCACTCTCAAACCCAACAAAAATCAACATCCCTCGCTC
TCCCAGTTCCTCAAATCGCAATATCTCCTCCAAATCCACCCGCAATTCTATCGACCCCATTGTTCTATGGACCTCTTCTATTGCTCGTTACTGCCGCAACGGCCAATTAG
CCGAAGCCGCCACAGAGTTTACGAGGATGAGACTCACCGGAGTTGAGCCGAACCACATCACATTGATTACGCTTCTCTCTGGCTGTGCTGATTTTCCGTCTGACAGCCTT
TACTTAGGCTCTTCGCTTCATGGTTACGCTCGTAAATTCGGTTTGGATACAGGACATGTAGTAGTGGGGACTGCTCTAATTGATATGTATGCCAAATGTTCTCAATTGGG
TCTTGCTAGGAAGGTTTTTGATTTCCTGGCCATGAAAAACTCTGTCTCTTGGAACACGATGCTCGATGGTTACATGAGGAATGGGGAGGTTGAGTTGGCCATTGACCTGT
TTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTAATTAATGGTCTTTTGAAACAGGGGTACTCTGAACAAGCGTTGGAGTGCTTCCATCAAATGCAATGC
TCGGGTATCGAGCCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCATGTGCTGATTTGGGCGCGCTTACTTTGGGGTTATGGGTTAATCGGTTCGTTAACCAGCAGGA
CTTTAACGATAATATTAGGATAAGTAATTCCTTGATAGATATGTATTCTCGATGTGGATGTATTGAGTTTGCCCATCAAGTGTTTGAGAAAATGCCCAAGCGAACTTTAG
TGTCTTGGAACTCCATCATTGTGGGGTTTGCTGCTAATGGGTTTGCAGATGAGTCTCTCGAGTTTTTTGATGCAATGCAGAAGGAAGGATTCAAGCCAGATGGAGTTAGC
TACACAGGAGCTCTTACTGCGTGTAGCCATGCTGGCTTAGTGAATAAGGGGCTGGAATTGTTTGATAAGATGAAGAGGGTGCACAGAATTACTCCCAGGATTGAGCATTA
TGGGTGCATTGTTGACCTCTATGGCCGTGCAGGGAGGTTGGAGGAAGCATTGAATGTGATTGAGAAAATGCCGATGAAACCGAATGAAGTTGTATTGGGGTCGTTGCTGG
CTGCCTGTAGGACTCATGGTGATGTGAGCCTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTGGATCCGGGAGGCGATTCAAATTACGTGCTCCTCTCGAACATATAT
GCAGCAATTGGGAAGTGGGAAGGTGCTAATAAGGTGAGGAGAACAATGAAGGCCCGAGGTGTGCAGAAGAAACCAGGGTTTAGTTCAGTTGAGATTGATGGTAAGGTTCA
CGAGTTTGTTGCTGGTGACAAATACCATGCTGATGCAGAGAGTATTTACTCGATGTTAGAGCTGTTGTTTCATGAACTAAAGATATGTGGCTATGTTCCTGAAACTGATG
CCTTTCTGAATATTAAAGAATCTAGTAAAGACCGTTGA
mRNA sequenceShow/hide mRNA sequence
TGGAATTCGAACGATGAACAGCGTTCCAGCACACACAGCCATTCCGACCCAACTCCAACAATATACTAATCCGCCTCCTCCAATCCCACTCTCAAACCCAACAAAAATCA
ACATCCCTCGCTCTCCCAGTTCCTCAAATCGCAATATCTCCTCCAAATCCACCCGCAATTCTATCGACCCCATTGTTCTATGGACCTCTTCTATTGCTCGTTACTGCCGC
AACGGCCAATTAGCCGAAGCCGCCACAGAGTTTACGAGGATGAGACTCACCGGAGTTGAGCCGAACCACATCACATTGATTACGCTTCTCTCTGGCTGTGCTGATTTTCC
GTCTGACAGCCTTTACTTAGGCTCTTCGCTTCATGGTTACGCTCGTAAATTCGGTTTGGATACAGGACATGTAGTAGTGGGGACTGCTCTAATTGATATGTATGCCAAAT
GTTCTCAATTGGGTCTTGCTAGGAAGGTTTTTGATTTCCTGGCCATGAAAAACTCTGTCTCTTGGAACACGATGCTCGATGGTTACATGAGGAATGGGGAGGTTGAGTTG
GCCATTGACCTGTTTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTAATTAATGGTCTTTTGAAACAGGGGTACTCTGAACAAGCGTTGGAGTGCTTCCA
TCAAATGCAATGCTCGGGTATCGAGCCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCATGTGCTGATTTGGGCGCGCTTACTTTGGGGTTATGGGTTAATCGGTTCG
TTAACCAGCAGGACTTTAACGATAATATTAGGATAAGTAATTCCTTGATAGATATGTATTCTCGATGTGGATGTATTGAGTTTGCCCATCAAGTGTTTGAGAAAATGCCC
AAGCGAACTTTAGTGTCTTGGAACTCCATCATTGTGGGGTTTGCTGCTAATGGGTTTGCAGATGAGTCTCTCGAGTTTTTTGATGCAATGCAGAAGGAAGGATTCAAGCC
AGATGGAGTTAGCTACACAGGAGCTCTTACTGCGTGTAGCCATGCTGGCTTAGTGAATAAGGGGCTGGAATTGTTTGATAAGATGAAGAGGGTGCACAGAATTACTCCCA
GGATTGAGCATTATGGGTGCATTGTTGACCTCTATGGCCGTGCAGGGAGGTTGGAGGAAGCATTGAATGTGATTGAGAAAATGCCGATGAAACCGAATGAAGTTGTATTG
GGGTCGTTGCTGGCTGCCTGTAGGACTCATGGTGATGTGAGCCTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTGGATCCGGGAGGCGATTCAAATTACGTGCTCCT
CTCGAACATATATGCAGCAATTGGGAAGTGGGAAGGTGCTAATAAGGTGAGGAGAACAATGAAGGCCCGAGGTGTGCAGAAGAAACCAGGGTTTAGTTCAGTTGAGATTG
ATGGTAAGGTTCACGAGTTTGTTGCTGGTGACAAATACCATGCTGATGCAGAGAGTATTTACTCGATGTTAGAGCTGTTGTTTCATGAACTAAAGATATGTGGCTATGTT
CCTGAAACTGATGCCTTTCTGAATATTAAAGAATCTAGTAAAGACCGTTGA
Protein sequenceShow/hide protein sequence
MNSVPAHTAIPTQLQQYTNPPPPIPLSNPTKINIPRSPSSSNRNISSKSTRNSIDPIVLWTSSIARYCRNGQLAEAATEFTRMRLTGVEPNHITLITLLSGCADFPSDSL
YLGSSLHGYARKFGLDTGHVVVGTALIDMYAKCSQLGLARKVFDFLAMKNSVSWNTMLDGYMRNGEVELAIDLFDEMPTRDAISWTALINGLLKQGYSEQALECFHQMQC
SGIEPDYVSIIAVLAACADLGALTLGLWVNRFVNQQDFNDNIRISNSLIDMYSRCGCIEFAHQVFEKMPKRTLVSWNSIIVGFAANGFADESLEFFDAMQKEGFKPDGVS
YTGALTACSHAGLVNKGLELFDKMKRVHRITPRIEHYGCIVDLYGRAGRLEEALNVIEKMPMKPNEVVLGSLLAACRTHGDVSLAERLMKHLFKLDPGGDSNYVLLSNIY
AAIGKWEGANKVRRTMKARGVQKKPGFSSVEIDGKVHEFVAGDKYHADAESIYSMLELLFHELKICGYVPETDAFLNIKESSKDR