; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G006260 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G006260
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmo_Chr18:8009329..8010861
RNA-Seq ExpressionCmoCh18G006260
SyntenyCmoCh18G006260
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573584.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]8.7e-26891.18Show/hide
Query:  MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS
        MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQL EAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS
Subjt:  MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS

Query:  LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF
        LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF
Subjt:  LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF

Query:  HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF
        HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEF+RQVFDKMPKRTLVSWNSMIVGFA NGFADESLEF
Subjt:  HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF

Query:  FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA
        FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA
Subjt:  FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA

Query:  ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF
        E                          VGR +         ++  +QKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF
Subjt:  ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF

Query:  MNGNESSKEY
        MNGNESSKEY
Subjt:  MNGNESSKEY

KAG7012718.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-30099.41Show/hide
Query:  MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS
        MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQL EAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS
Subjt:  MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS

Query:  LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF
        LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF
Subjt:  LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF

Query:  HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF
        HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEF+RQVFDKMPKRTLVSWNSMIVGFA NGFADESLEF
Subjt:  HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF

Query:  FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA
        FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA
Subjt:  FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA

Query:  ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF
        ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF
Subjt:  ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF

Query:  MNGNESSKEY
        MNGNESSKEY
Subjt:  MNGNESSKEY

XP_022945035.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita moschata]5.4e-302100Show/hide
Query:  MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS
        MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS
Subjt:  MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS

Query:  LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF
        LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF
Subjt:  LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF

Query:  HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF
        HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF
Subjt:  HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF

Query:  FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA
        FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA
Subjt:  FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA

Query:  ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF
        ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF
Subjt:  ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF

Query:  MNGNESSKEY
        MNGNESSKEY
Subjt:  MNGNESSKEY

XP_022967078.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita maxima]8.1e-29095.94Show/hide
Query:  MSSVPAHTAIPFQFQLQQYSN-------SNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSH
        MSSVP+HT IPFQFQLQQYSN       SNPSNL+FPR PNSSNPIKPIVLWTSSIARYCRNAQL EAAAEFTRMRLAGVEPNHITFITLLSGCADFPSH
Subjt:  MSSVPAHTAIPFQFQLQQYSN-------SNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSH

Query:  SLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYS
        SLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYL MKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYS
Subjt:  SLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYS

Query:  EQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGF
        EQAL+CFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKM K TLVSWNSMIVGFA NGF
Subjt:  EQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGF

Query:  ADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRT
        ADESLEFFDAMQKEGF ADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRT
Subjt:  ADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRT

Query:  HGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGY
        HGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGAN VRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYH DADNIYSMLEVLFHELKI GY
Subjt:  HGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGY

Query:  VPETATFMNGNESSKEY
        VPETATFMNGNESSKEY
Subjt:  VPETATFMNGNESSKEY

XP_023521260.1 pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucurbita pepo subsp. pepo]1.2e-29898.82Show/hide
Query:  MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS
        MSSVPAHTA+PFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRN QLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS
Subjt:  MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS

Query:  LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF
        LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQAL+CF
Subjt:  LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF

Query:  HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF
        HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFA NGFADESLEF
Subjt:  HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF

Query:  FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA
        FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA
Subjt:  FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA

Query:  ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF
        ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGAN VRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETAT 
Subjt:  ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF

Query:  MNGNESSKEY
        MNGNESSKEY
Subjt:  MNGNESSKEY

TrEMBL top hitse value%identityAlignment
A0A1S3C956 pentatricopeptide repeat-containing protein At1g05750, chloroplastic6.8e-24279.35Show/hide
Query:  MSSVPAHTAIPFQFQLQQYSN---SNPSNLNFPRYPNS----------SNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGC
        MSS+P+H A P Q Q    S+   SNP+ +NFPR P S          +N + PIV WTSSIARYC N QLPEAAAEFTRMRLAGVEPNHITFITLLSGC
Subjt:  MSSVPAHTAIPFQFQLQQYSN---SNPSNLNFPRYPNS----------SNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGC

Query:  ADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGF
        ADFPS S  F +SLHGY  K GLDTGHVMVGTALI MY+KC+QLGLA+ VFDYL +KNSV+WNTML+G+MRNGEIELAI+LFDEMPTRDAISWTALING 
Subjt:  ADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGF

Query:  LKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVG
        LK GYSEQAL+CFH+MQ SG+  DYVSIIAVLAACADLGAL+ GLWVNRF+MQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNS+IVG
Subjt:  LKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVG

Query:  FATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSL
        FA NGFADESLEFF AMQKEGFK DGVSYTGALTACSHAGLVNKGLELFDNMKRVH+ITP IEHYGCIVDLY RAGRL++A NVIE MPMKPNEVVLGSL
Subjt:  FATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSL

Query:  LAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHE
        LAACRTHGDV LAERL+K++F+LD  GDS+YVLLSNIYAA+G+WEGAN VRRTMKARGVQKK G+SS+EIDGKVHEFVAGDKYHADADNIYSML++LFHE
Subjt:  LAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHE

Query:  LKICGYVPETATFMNGNESSKEY
        LK+CGYVP+T   +N  +S+K++
Subjt:  LKICGYVPETATFMNGNESSKEY

A0A5A7UJB6 Pentatricopeptide repeat-containing protein6.8e-24279.35Show/hide
Query:  MSSVPAHTAIPFQFQLQQYSN---SNPSNLNFPRYPNS----------SNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGC
        MSS+P+H A P Q Q    S+   SNP+ +NFPR P S          +N + PIV WTSSIARYC N QLPEAAAEFTRMRLAGVEPNHITFITLLSGC
Subjt:  MSSVPAHTAIPFQFQLQQYSN---SNPSNLNFPRYPNS----------SNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGC

Query:  ADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGF
        ADFPS S  F +SLHGY  K GLDTGHVMVGTALI MY+KC+QLGLA+ VFDYL +KNSV+WNTML+G+MRNGEIELAI+LFDEMPTRDAISWTALING 
Subjt:  ADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGF

Query:  LKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVG
        LK GYSEQAL+CFH+MQ SG+  DYVSIIAVLAACADLGAL+ GLWVNRF+MQQEFKDN+RISNSLIDMYSRCGCIEFARQVF KM KRTLVSWNS+IVG
Subjt:  LKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVG

Query:  FATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSL
        FA NGFADESLEFF AMQKEGFK DGVSYTGALTACSHAGLVNKGLELFDNMKRVH+ITP IEHYGCIVDLY RAGRL++A NVIE MPMKPNEVVLGSL
Subjt:  FATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSL

Query:  LAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHE
        LAACRTHGDV LAERL+K++F+LD  GDS+YVLLSNIYAA+G+WEGAN VRRTMKARGVQKK G+SS+EIDGKVHEFVAGDKYHADADNIYSML++LFHE
Subjt:  LAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHE

Query:  LKICGYVPETATFMNGNESSKEY
        LK+CGYVP+T   +N  +S+K++
Subjt:  LKICGYVPETATFMNGNESSKEY

A0A6J1CN07 pentatricopeptide repeat-containing protein At1g05750, chloroplastic2.5e-25281.97Show/hide
Query:  MSSVPAHTAIPFQFQLQQYSN-------SNPSNLNFPRYPNSS----------NPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITL
        MSS+PA+TA     QLQQY N        NP  +NFPR  NSS          N I PIVLWTSSIARYCRN QL EAAAEFT MRLAGVEPNH+T ITL
Subjt:  MSSVPAHTAIPFQFQLQQYSN-------SNPSNLNFPRYPNSS----------NPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITL

Query:  LSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTAL
        LSGCADFPS SL+FG+SLHGY RKLGLDT HVMVGT+++ MYAKCAQLGLAR VFDYL MKNSV+WNTMLDGY RNGEIELAI+LFDEMPTRDAISWTAL
Subjt:  LSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTAL

Query:  INGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNS
        ING LKQGYSEQAL+CFH+MQCSGI+PDYVSIIAVLAACADLG L+ GLWVNRF+MQQEFKDNIRISNSLIDMYSRCGCI FARQVF++M KRTLVSWNS
Subjt:  INGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNS

Query:  MIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVV
        +IVG+A NGFADESLEFFDAMQKEGFK D VSYTGALTACSHAGLVNKGLELFDNMKRVHRI PRIEHYGCIVDLY RAGRL++AL+VIE MPMKPNEVV
Subjt:  MIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVV

Query:  LGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEV
        LGSLLAACRTHGDVSLAERL+K+L +LDPGGDS+YVLLSNIYAA+G+WEGAN VRRTMKARGVQKKPGFSS+EIDGKVHEFVAGDKYHADAD+IYSML++
Subjt:  LGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEV

Query:  LFHELKICGYVPETATFMNGNESSKEY
        L HELKICG VPET TF+N  ESSK++
Subjt:  LFHELKICGYVPETATFMNGNESSKEY

A0A6J1FZR8 pentatricopeptide repeat-containing protein At1g05750, chloroplastic2.6e-302100Show/hide
Query:  MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS
        MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS
Subjt:  MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGAS

Query:  LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF
        LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF
Subjt:  LHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCF

Query:  HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF
        HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF
Subjt:  HEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEF

Query:  FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA
        FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA
Subjt:  FDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLA

Query:  ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF
        ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF
Subjt:  ERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATF

Query:  MNGNESSKEY
        MNGNESSKEY
Subjt:  MNGNESSKEY

A0A6J1HU31 pentatricopeptide repeat-containing protein At1g05750, chloroplastic3.9e-29095.94Show/hide
Query:  MSSVPAHTAIPFQFQLQQYSN-------SNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSH
        MSSVP+HT IPFQFQLQQYSN       SNPSNL+FPR PNSSNPIKPIVLWTSSIARYCRNAQL EAAAEFTRMRLAGVEPNHITFITLLSGCADFPSH
Subjt:  MSSVPAHTAIPFQFQLQQYSN-------SNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSH

Query:  SLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYS
        SLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYL MKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYS
Subjt:  SLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYS

Query:  EQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGF
        EQAL+CFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKM K TLVSWNSMIVGFA NGF
Subjt:  EQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGF

Query:  ADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRT
        ADESLEFFDAMQKEGF ADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRT
Subjt:  ADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRT

Query:  HGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGY
        HGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGAN VRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYH DADNIYSMLEVLFHELKI GY
Subjt:  HGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGY

Query:  VPETATFMNGNESSKEY
        VPETATFMNGNESSKEY
Subjt:  VPETATFMNGNESSKEY

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148205.6e-10037.11Show/hide
Query:  LNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYV--RKLGLDTGHVMVGTALIAM
        +N+ R        + +V W + I RYCR   + EA   F  M+ + V P+ +    ++S C    + ++ +  +++ ++    + +DT H++  TAL+ M
Subjt:  LNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYV--RKLGLDTGHVMVGTALIAM

Query:  YAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACAD
        YA    + +AR  F  + ++N      M+ GY + G ++ A  +FD+   +D + WT +I+ +++  Y ++AL  F EM CSGI+PD VS+ +V++ACA+
Subjt:  YAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACAD

Query:  LGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACS
        LG L    WV+  +     +  + I+N+LI+MY++CG ++  R VF+KMP+R +VSW+SMI   + +G A ++L  F  M++E  + + V++ G L  CS
Subjt:  LGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACS

Query:  HAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNI
        H+GLV +G ++F +M   + ITP++EHYGC+VDL+ RA  L EAL VIE+MP+  N V+ GSL++ACR HG++ L +   K + EL+P  D + VL+SNI
Subjt:  HAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNI

Query:  YAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATFM
        YA   RWE    +RR M+ + V K+ G S I+ +GK HEF+ GDK H  ++ IY+ L+ +  +LK+ GYVP+  + +
Subjt:  YAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATFM

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic5.6e-10037.34Show/hide
Query:  KPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFD
        K +V W S I  + +     +A   F +M    V+ +H+T + +LS CA     +L FG  +  Y+ +  ++  ++ +  A++ MY KC  +  A+ +FD
Subjt:  KPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFD

Query:  YLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQC-SGIEPDYVSIIAVLAACADLGALSFGLWVNRFL
         ++ K++VTW TMLDGY  + + E A E+ + MP +D ++W ALI+ + + G   +AL  FHE+Q    ++ + +++++ L+ACA +GAL  G W++ ++
Subjt:  YLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQC-SGIEPDYVSIIAVLAACADLGALSFGLWVNRFL

Query:  MQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDN
         +   + N  ++++LI MYS+CG +E +R+VF+ + KR +  W++MI G A +G  +E+++ F  MQ+   K +GV++T    ACSH GLV++   LF  
Subjt:  MQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDN

Query:  MKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVR
        M+  + I P  +HY CIVD+  R+G L++A+  IE MP+ P+  V G+LL AC+ H +++LAE     L EL+P  D ++VLLSNIYA +G+WE  + +R
Subjt:  MKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVR

Query:  RTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPE
        + M+  G++K+PG SSIEIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY PE
Subjt:  RTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPE

Q9LSB8 Putative pentatricopeptide repeat-containing protein At3g159301.6e-9937.42Show/hide
Query:  WTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMK
        W   I+ Y R  +  E+      M    V P  +T + +LS C+      L     +H YV +   +   + +  AL+  YA C ++ +A  +F  +  +
Subjt:  WTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMK

Query:  NSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFK
        + ++W +++ GY+  G ++LA   FD+MP RD ISWT +I+G+L+ G   ++L+ F EMQ +G+ PD  ++++VL ACA LG+L  G W+  ++ + + K
Subjt:  NSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQEFK

Query:  DNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHR
        +++ + N+LIDMY +CGC E A++VF  M +R   +W +M+VG A NG   E+++ F  MQ    + D ++Y G L+AC+H+G+V++  + F  M+  HR
Subjt:  DNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRVHR

Query:  ITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKAR
        I P + HYGC+VD+  RAG + EA  ++  MPM PN +V G+LL A R H D  +AE   K + EL+P   + Y LL NIYA   RW+    VRR +   
Subjt:  ITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMKAR

Query:  GVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATFM
         ++K PGFS IE++G  HEFVAGDK H  ++ IY  LE L  E     Y+P+T+  +
Subjt:  GVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATFM

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226905.0e-10138.97Show/hide
Query:  LWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDM
        L  +  + Y R     EA   F  M  +GV P+ I+ ++ +S C+     ++ +G S HGYV + G ++    +  ALI MY KC +   A  +FD +  
Subjt:  LWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDM

Query:  KNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQC-SGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQE
        K  VTWN+++ GY+ NGE++ A E F+ MP ++ +SW  +I+G ++    E+A++ F  MQ   G+  D V+++++ +AC  LGAL    W+  ++ +  
Subjt:  KNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQC-SGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQE

Query:  FKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRV
         + ++R+  +L+DM+SRCG  E A  +F+ +  R + +W + I   A  G A+ ++E FD M ++G K DGV++ GALTACSH GLV +G E+F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRV

Query:  HRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMK
        H ++P   HYGC+VDL  RAG L+EA+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    SYVLLSN+YA+ GRW     VR +MK
Subjt:  HRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMK

Query:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETA-TFMNGNESSK
         +G++K PG SSI+I GK HEF +GD+ H +  NI +ML+ +       G+VP+ +   M+ +E  K
Subjt:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETA-TFMNGNESSK

Q9MA50 Pentatricopeptide repeat-containing protein At1g05750, chloroplastic1.6e-16860.33Show/hide
Query:  SNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGT
        ++ N +N    R+  S++  +  V WTS I    RN +L EAA EF+ M LAGVEPNHITFI LLSGC DF S S   G  LHGY  KLGLD  HVMVGT
Subjt:  SNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGT

Query:  ALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVL
        A+I MY+K  +   AR VFDY++ KNSVTWNTM+DGYMR+G+++ A ++FD+MP RD ISWTA+INGF+K+GY E+AL  F EMQ SG++PDYV+IIA L
Subjt:  ALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVL

Query:  AACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGA
         AC +LGALSFGLWV+R+++ Q+FK+N+R+SNSLID+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFK D V++TGA
Subjt:  AACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGA

Query:  LTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHG-DVSLAERLIKYLFELDPGGDSSY
        LTACSH GLV +GL  F  MK  +RI+PRIEHYGC+VDLYSRAGRL++AL ++++MPMKPNEVV+GSLLAAC  HG ++ LAERL+K+L +L+    S+Y
Subjt:  LTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHG-DVSLAERLIKYLFELDPGGDSSY

Query:  VLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPET
        V+LSN+YAA G+WEGA+ +RR MK  G++K+PGFSSIEID  +H F+AGD  H +   I  +LE++  +L++ G V ET
Subjt:  VLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPET

Arabidopsis top hitse value%identityAlignment
AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-16960.33Show/hide
Query:  SNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGT
        ++ N +N    R+  S++  +  V WTS I    RN +L EAA EF+ M LAGVEPNHITFI LLSGC DF S S   G  LHGY  KLGLD  HVMVGT
Subjt:  SNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGT

Query:  ALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVL
        A+I MY+K  +   AR VFDY++ KNSVTWNTM+DGYMR+G+++ A ++FD+MP RD ISWTA+INGF+K+GY E+AL  F EMQ SG++PDYV+IIA L
Subjt:  ALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVL

Query:  AACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGA
         AC +LGALSFGLWV+R+++ Q+FK+N+R+SNSLID+Y RCGC+EFARQVF  M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFK D V++TGA
Subjt:  AACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGA

Query:  LTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHG-DVSLAERLIKYLFELDPGGDSSY
        LTACSH GLV +GL  F  MK  +RI+PRIEHYGC+VDLYSRAGRL++AL ++++MPMKPNEVV+GSLLAAC  HG ++ LAERL+K+L +L+    S+Y
Subjt:  LTACSHAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHG-DVSLAERLIKYLFELDPGGDSSY

Query:  VLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPET
        V+LSN+YAA G+WEGA+ +RR MK  G++K+PGFSSIEID  +H F+AGD  H +   I  +LE++  +L++ G V ET
Subjt:  VLLSNIYAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPET

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.0e-10137.34Show/hide
Query:  KPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFD
        K +V W S I  + +     +A   F +M    V+ +H+T + +LS CA     +L FG  +  Y+ +  ++  ++ +  A++ MY KC  +  A+ +FD
Subjt:  KPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFD

Query:  YLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQC-SGIEPDYVSIIAVLAACADLGALSFGLWVNRFL
         ++ K++VTW TMLDGY  + + E A E+ + MP +D ++W ALI+ + + G   +AL  FHE+Q    ++ + +++++ L+ACA +GAL  G W++ ++
Subjt:  YLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQC-SGIEPDYVSIIAVLAACADLGALSFGLWVNRFL

Query:  MQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDN
         +   + N  ++++LI MYS+CG +E +R+VF+ + KR +  W++MI G A +G  +E+++ F  MQ+   K +GV++T    ACSH GLV++   LF  
Subjt:  MQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDN

Query:  MKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVR
        M+  + I P  +HY CIVD+  R+G L++A+  IE MP+ P+  V G+LL AC+ H +++LAE     L EL+P  D ++VLLSNIYA +G+WE  + +R
Subjt:  MKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVR

Query:  RTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPE
        + M+  G++K+PG SSIEIDG +HEF++GD  H  ++ +Y  L  +  +LK  GY PE
Subjt:  RTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPE

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)3.6e-10238.97Show/hide
Query:  LWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDM
        L  +  + Y R     EA   F  M  +GV P+ I+ ++ +S C+     ++ +G S HGYV + G ++    +  ALI MY KC +   A  +FD +  
Subjt:  LWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDM

Query:  KNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQC-SGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQE
        K  VTWN+++ GY+ NGE++ A E F+ MP ++ +SW  +I+G ++    E+A++ F  MQ   G+  D V+++++ +AC  LGAL    W+  ++ +  
Subjt:  KNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQC-SGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQE

Query:  FKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRV
         + ++R+  +L+DM+SRCG  E A  +F+ +  R + +W + I   A  G A+ ++E FD M ++G K DGV++ GALTACSH GLV +G E+F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRV

Query:  HRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMK
        H ++P   HYGC+VDL  RAG L+EA+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    SYVLLSN+YA+ GRW     VR +MK
Subjt:  HRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMK

Query:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETA-TFMNGNESSK
         +G++K PG SSI+I GK HEF +GD+ H +  NI +ML+ +       G+VP+ +   M+ +E  K
Subjt:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETA-TFMNGNESSK

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification3.6e-10238.97Show/hide
Query:  LWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDM
        L  +  + Y R     EA   F  M  +GV P+ I+ ++ +S C+     ++ +G S HGYV + G ++    +  ALI MY KC +   A  +FD +  
Subjt:  LWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGLDTGHVMVGTALIAMYAKCAQLGLARNVFDYLDM

Query:  KNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQC-SGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQE
        K  VTWN+++ GY+ NGE++ A E F+ MP ++ +SW  +I+G ++    E+A++ F  MQ   G+  D V+++++ +AC  LGAL    W+  ++ +  
Subjt:  KNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQC-SGIEPDYVSIIAVLAACADLGALSFGLWVNRFLMQQE

Query:  FKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRV
         + ++R+  +L+DM+SRCG  E A  +F+ +  R + +W + I   A  G A+ ++E FD M ++G K DGV++ GALTACSH GLV +G E+F +M ++
Subjt:  FKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVNKGLELFDNMKRV

Query:  HRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMK
        H ++P   HYGC+VDL  RAG L+EA+ +IE MPM+PN+V+  SLLAACR  G+V +A    + +  L P    SYVLLSN+YA+ GRW     VR +MK
Subjt:  HRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRTMK

Query:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETA-TFMNGNESSK
         +G++K PG SSI+I GK HEF +GD+ H +  NI +ML+ +       G+VP+ +   M+ +E  K
Subjt:  ARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETA-TFMNGNESSK

AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein4.0e-10137.11Show/hide
Query:  LNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYV--RKLGLDTGHVMVGTALIAM
        +N+ R        + +V W + I RYCR   + EA   F  M+ + V P+ +    ++S C    + ++ +  +++ ++    + +DT H++  TAL+ M
Subjt:  LNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYV--RKLGLDTGHVMVGTALIAM

Query:  YAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACAD
        YA    + +AR  F  + ++N      M+ GY + G ++ A  +FD+   +D + WT +I+ +++  Y ++AL  F EM CSGI+PD VS+ +V++ACA+
Subjt:  YAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLAACAD

Query:  LGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACS
        LG L    WV+  +     +  + I+N+LI+MY++CG ++  R VF+KMP+R +VSW+SMI   + +G A ++L  F  M++E  + + V++ G L  CS
Subjt:  LGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACS

Query:  HAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNI
        H+GLV +G ++F +M   + ITP++EHYGC+VDL+ RA  L EAL VIE+MP+  N V+ GSL++ACR HG++ L +   K + EL+P  D + VL+SNI
Subjt:  HAGLVNKGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNI

Query:  YAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATFM
        YA   RWE    +RR M+ + V K+ G S I+ +GK HEF+ GDK H  ++ IY+ L+ +  +LK+ GYVP+  + +
Subjt:  YAAVGRWEGANMVRRTMKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGCGTTCCAGCGCACACCGCCATTCCATTCCAATTCCAACTCCAACAATATTCTAATTCAAATCCATCGAATCTCAATTTCCCTCGCTATCCCAATTCCTCAAA
TCCCATTAAACCCATTGTTCTATGGACCTCTTCTATTGCTCGCTACTGCCGCAACGCCCAATTACCCGAAGCCGCCGCAGAGTTTACCAGGATGAGACTCGCCGGAGTAG
AGCCGAACCACATCACATTCATTACGCTTCTCTCCGGCTGTGCTGATTTTCCGTCACACAGCCTCCACTTCGGCGCTTCTCTTCATGGGTACGTCCGTAAATTAGGTTTG
GATACAGGGCATGTAATGGTTGGGACTGCTCTAATTGCTATGTATGCCAAATGTGCTCAATTGGGTCTTGCTAGGAATGTTTTTGATTATCTAGACATGAAAAACTCTGT
CACTTGGAACACGATGCTCGATGGGTACATGAGGAATGGGGAGATTGAGTTGGCCATTGAACTGTTTGATGAAATGCCTACAAGAGATGCGATTTCCTGGACGGCTTTAA
TTAATGGTTTTTTGAAACAGGGGTACTCTGAACAAGCATTGGACTGCTTCCATGAAATGCAATGCTCGGGGATCGAGCCTGATTATGTGTCAATAATTGCTGTTCTTGCG
GCGTGTGCTGATTTGGGTGCGCTTTCTTTTGGGTTATGGGTTAATCGGTTCCTTATGCAGCAGGAGTTTAAGGATAATATTAGGATAAGTAATTCATTGATAGATATGTA
TTCTCGATGTGGATGCATTGAGTTTGCCCGCCAAGTGTTTGATAAAATGCCCAAACGAACTTTGGTATCTTGGAATTCCATGATTGTGGGATTTGCTACTAATGGCTTTG
CAGATGAATCTCTGGAGTTTTTTGATGCAATGCAGAAGGAAGGATTCAAGGCAGATGGAGTTAGCTACACGGGAGCTCTTACTGCGTGTAGCCATGCTGGCTTAGTGAAC
AAGGGGCTGGAATTGTTTGATAACATGAAGAGAGTACATAGAATTACTCCTAGGATTGAGCATTATGGATGCATTGTTGACCTCTATAGCCGTGCAGGGAGGTTGGATGA
AGCGTTGAACGTGATCGAGACAATGCCCATGAAACCGAATGAAGTTGTCCTCGGGTCGCTGCTGGCTGCCTGCAGGACTCATGGTGATGTGAGCCTGGCTGAAAGGTTGA
TCAAATATCTCTTTGAGTTGGACCCTGGTGGTGATTCGAGTTACGTGCTGCTTTCGAATATATATGCAGCAGTCGGGAGGTGGGAAGGCGCCAACATGGTCCGGAGAACA
ATGAAAGCCCGAGGTGTGCAGAAAAAACCGGGGTTTAGCTCGATTGAGATTGACGGTAAGGTTCATGAGTTTGTTGCTGGTGACAAATACCATGCTGATGCAGATAATAT
CTACTCCATGTTAGAGGTGTTGTTTCATGAACTCAAGATATGTGGCTATGTTCCTGAAACTGCTACCTTTATGAATGGTAATGAATCTAGTAAAGAGTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAGCGTTCCAGCGCACACCGCCATTCCATTCCAATTCCAACTCCAACAATATTCTAATTCAAATCCATCGAATCTCAATTTCCCTCGCTATCCCAATTCCTCAAA
TCCCATTAAACCCATTGTTCTATGGACCTCTTCTATTGCTCGCTACTGCCGCAACGCCCAATTACCCGAAGCCGCCGCAGAGTTTACCAGGATGAGACTCGCCGGAGTAG
AGCCGAACCACATCACATTCATTACGCTTCTCTCCGGCTGTGCTGATTTTCCGTCACACAGCCTCCACTTCGGCGCTTCTCTTCATGGGTACGTCCGTAAATTAGGTTTG
GATACAGGGCATGTAATGGTTGGGACTGCTCTAATTGCTATGTATGCCAAATGTGCTCAATTGGGTCTTGCTAGGAATGTTTTTGATTATCTAGACATGAAAAACTCTGT
CACTTGGAACACGATGCTCGATGGGTACATGAGGAATGGGGAGATTGAGTTGGCCATTGAACTGTTTGATGAAATGCCTACAAGAGATGCGATTTCCTGGACGGCTTTAA
TTAATGGTTTTTTGAAACAGGGGTACTCTGAACAAGCATTGGACTGCTTCCATGAAATGCAATGCTCGGGGATCGAGCCTGATTATGTGTCAATAATTGCTGTTCTTGCG
GCGTGTGCTGATTTGGGTGCGCTTTCTTTTGGGTTATGGGTTAATCGGTTCCTTATGCAGCAGGAGTTTAAGGATAATATTAGGATAAGTAATTCATTGATAGATATGTA
TTCTCGATGTGGATGCATTGAGTTTGCCCGCCAAGTGTTTGATAAAATGCCCAAACGAACTTTGGTATCTTGGAATTCCATGATTGTGGGATTTGCTACTAATGGCTTTG
CAGATGAATCTCTGGAGTTTTTTGATGCAATGCAGAAGGAAGGATTCAAGGCAGATGGAGTTAGCTACACGGGAGCTCTTACTGCGTGTAGCCATGCTGGCTTAGTGAAC
AAGGGGCTGGAATTGTTTGATAACATGAAGAGAGTACATAGAATTACTCCTAGGATTGAGCATTATGGATGCATTGTTGACCTCTATAGCCGTGCAGGGAGGTTGGATGA
AGCGTTGAACGTGATCGAGACAATGCCCATGAAACCGAATGAAGTTGTCCTCGGGTCGCTGCTGGCTGCCTGCAGGACTCATGGTGATGTGAGCCTGGCTGAAAGGTTGA
TCAAATATCTCTTTGAGTTGGACCCTGGTGGTGATTCGAGTTACGTGCTGCTTTCGAATATATATGCAGCAGTCGGGAGGTGGGAAGGCGCCAACATGGTCCGGAGAACA
ATGAAAGCCCGAGGTGTGCAGAAAAAACCGGGGTTTAGCTCGATTGAGATTGACGGTAAGGTTCATGAGTTTGTTGCTGGTGACAAATACCATGCTGATGCAGATAATAT
CTACTCCATGTTAGAGGTGTTGTTTCATGAACTCAAGATATGTGGCTATGTTCCTGAAACTGCTACCTTTATGAATGGTAATGAATCTAGTAAAGAGTATTGA
Protein sequenceShow/hide protein sequence
MSSVPAHTAIPFQFQLQQYSNSNPSNLNFPRYPNSSNPIKPIVLWTSSIARYCRNAQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSHSLHFGASLHGYVRKLGL
DTGHVMVGTALIAMYAKCAQLGLARNVFDYLDMKNSVTWNTMLDGYMRNGEIELAIELFDEMPTRDAISWTALINGFLKQGYSEQALDCFHEMQCSGIEPDYVSIIAVLA
ACADLGALSFGLWVNRFLMQQEFKDNIRISNSLIDMYSRCGCIEFARQVFDKMPKRTLVSWNSMIVGFATNGFADESLEFFDAMQKEGFKADGVSYTGALTACSHAGLVN
KGLELFDNMKRVHRITPRIEHYGCIVDLYSRAGRLDEALNVIETMPMKPNEVVLGSLLAACRTHGDVSLAERLIKYLFELDPGGDSSYVLLSNIYAAVGRWEGANMVRRT
MKARGVQKKPGFSSIEIDGKVHEFVAGDKYHADADNIYSMLEVLFHELKICGYVPETATFMNGNESSKEY