; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021254 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021254
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr7:5912262..5914835
RNA-Seq ExpressionLag0021254
SyntenyLag0021254
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597439.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0091.46Show/hide
Query:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR
        METLSH+TSILPLQ+P YPTRP ALSAALSSATSLLH+KQVHAQILRSKFERSDSDSLLFKLILSSC+LSPSLDYALSVFDQ+PEPK+RFCNKLLRELSR
Subjt:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR

Query:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC
        GS+PE+AL +YEKMRAEGLSLDRFCFPPLLKAASRNLS RTGMEIHGLASKLGFGSDPFVETGL+RMYAAC RIM+ARLVFDKMS +D+VTWSIMIDGYC
Subjt:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC

Query:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL
        ISG Y+LAFQLFEEMKRTGLEPDEMILST+LSACARAGNLDFGTK+HEFITKKNI+MDPHLQSALI MYASC S DLAWDLYEK+SPKNMV+STAMVSGL
Subjt:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL

Query:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM
        AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESD PQEALVLFK+MQQ+G+KPDVVTMLSVISACAHLGALDQAKWIQ YVDKNGF KALS++NALIDM
Subjt:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM

Query:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV
        YAKCGSLEGARE+F KMPKKNVISWTSMINA AMHGDAH ALSLFHQMKVENVEPNWITFVG+LYACSH GLVEEG+RIFHSM NEYGISPKHEHFGCMV
Subjt:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV

Query:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE
        DLFGRAKLLREALEV+EAMPFAPNAIIWGSLMAACQLH   ELGEFAAKQVLKLEPDHDGALVVLSN+YAKERRWE+ GEVRKLMNEMG+SKERGCSRIE
Subjt:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE

Query:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
        LNNEVHEFQMADRKHKQA+ IYQKL+EVVQ LKLAGY PQTNCVLVDLDEEEKKELVL HSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
Subjt:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY

Query:  AREIIIRDRTRFHHYRDGSCSCKDYW
        AREI++RDRTRFHHYR GSCSCKDYW
Subjt:  AREIIIRDRTRFHHYRDGSCSCKDYW

XP_022959359.1 pentatricopeptide repeat-containing protein At4g14820 [Cucurbita moschata]0.0e+0091.46Show/hide
Query:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR
        METLSH+TSILPLQ+P YPT+P ALSAALSSATSLLH+KQVHAQILRSKFERSDSDSLLFKLILSSC+LSPSLDYALSVFDQ+PEPK+RFCNKLLRELSR
Subjt:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR

Query:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC
        GS+PE+AL +YEKMRAEGLSLDRFCFPPLLKAASRNLS RTGMEIHGLASKLGFGSDPFVETGL+RMYAAC RIM+ARLVFDKMS +D+VTWSIMIDGYC
Subjt:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC

Query:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL
        ISG Y+LAFQLFEEMKRTGLEPDEMILST+LSACARAGNLDFGTK+HEFITKKNI+MDPHLQSALI MYASC S DLAWDLYEK+SPKNMV+STAMVSGL
Subjt:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL

Query:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM
        AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESD PQEALVLFK+MQQ+G+KPDVVTMLSVISACAHLGALDQAKWIQ YVDKNGF KALS+NNALIDM
Subjt:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM

Query:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV
        YAKCGSLEGARE+F KMPKKNVISWTSMINA AMHGDAH ALSLFHQMKVENVEPNWITFVG+LYACSH GLVEEG+RIFHSM NEYGISPKHEHFGCMV
Subjt:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV

Query:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE
        DLFGRAKLLREALEV+EAMPFAPNAIIWGSLMAACQLH + ELGEFAAKQVLKLEPDHDGALVVLSN+YAKERRWE+ GEVRKLMNEMG+SKERGCSRIE
Subjt:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE

Query:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
        LNNEVHEFQMADRKHKQA+ IYQKL+EVVQ LKLAGY PQTNCVLVDLD+EEKKELVL HSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
Subjt:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY

Query:  AREIIIRDRTRFHHYRDGSCSCKDYW
        AREI++RDRTRFHHYRDGSCSCKDYW
Subjt:  AREIIIRDRTRFHHYRDGSCSCKDYW

XP_022974384.1 pentatricopeptide repeat-containing protein At4g14820-like [Cucurbita maxima]0.0e+0091.87Show/hide
Query:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR
        METLSH+TSILPLQ+P YPTRP ALSAALSSATSLLH+KQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQ+PEPKTRFCNKLLRELSR
Subjt:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR

Query:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC
        GS+PE+AL +YEKMRAEGLSLDRFCFPPLLKAASRNLS RTGMEIHGLASKLGFGSDPFVETGL+RMYAAC RIM+ARLVFDKMS +D+VTWSIMIDGYC
Subjt:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC

Query:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL
        +SG Y+LAFQLFEEMKRTGLEPDEMILST+LSACARAGNLDFGTKIHEFITKKNI+MDPHLQSALI MYAS  S DLAWDLYEK+SPKNMV+STAMVSGL
Subjt:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL

Query:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM
        AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESD PQEALVLFK+MQQ+G+KPDVVTMLSVISACAHLGALDQAKWIQ YVDKNGF KALS+NNALIDM
Subjt:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM

Query:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV
        YAKCGSLEGARE+F KMPKKNVISWTSMINA AMHGDAHNALSLFHQMKVENVEPNWITFVG+LYACSH GLV+EG+RIFHSM NEYGISPKHEHFGCMV
Subjt:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV

Query:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE
        DLFGRAKLLREALEV+EAMPFAPNAIIWGSLMAACQLHS+ ELGEFAAKQVLKLEPDHDGALVVLSN+YAKERRWE+ GEVRKLMNEMG+SKERGCSRIE
Subjt:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE

Query:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
        LNNEVHEFQMADRKHKQA+ IY KL+EVVQKLKLAGY PQTNCVLVDLDEEEKKELVL HSEKLALCYALMNEGSRICI KNLRICEDCHAFMKLASKVY
Subjt:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY

Query:  AREIIIRDRTRFHHYRDGSCSCKDYW
        AREI++RDRTRFHHYRDGSCSCKDYW
Subjt:  AREIIIRDRTRFHHYRDGSCSCKDYW

XP_023538947.1 pentatricopeptide repeat-containing protein At4g14820 [Cucurbita pepo subsp. pepo]0.0e+0091.87Show/hide
Query:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR
        METLSH+TSILPL +P YPTRPTALSAALSSATSLLH+KQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQ+PEPKTRFCNKLLRELSR
Subjt:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR

Query:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC
        GS+PE+AL LYEKMRAEGLSLDRFCFPPLLKAASRNLS RTGMEIHGLASKLGFGSDPFVETGL+RMYAAC RIM+ARLVFDKMS +D+VTWSIMIDGYC
Subjt:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC

Query:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL
        ISG Y+LAFQLFEEMKRTGLEPDEMILST+LSACARAGNLDFGTKIHEFITK NI+MDPHLQSALI MYASC S DLAWDLYEK++PKNMV+STAMVSGL
Subjt:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL

Query:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM
        AKGGQIGDAR VFDQMVEKDLICWSAMISGYTESD PQEALVLFK+MQQ+G+KPDVVTMLSVISACAHLGALDQAKWIQ YVDKNGF KALS+NNALIDM
Subjt:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM

Query:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV
        YAKCGSLEGARE+F KMPKKNVISWTSMINA AMHGDAHNALSLFHQMKVENVEPNWITFVG+LYACSH GLVEEG+RIFHSM NEYGISPKHEHFGCMV
Subjt:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV

Query:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE
        DLFGRAKLLREALEV+EAMPFAPNAIIWGSLMAACQLH + ELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWE+ G+VRKLMNEMG+SKERGCSRIE
Subjt:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE

Query:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
        LNNEVHEFQMADRKHKQA+ IYQKL+EVVQ LKLAGY PQ NCVLVDLDEEEKKELVL HSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
Subjt:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY

Query:  AREIIIRDRTRFHHYRDGSCSCKDYW
        AREI++RDRTRFHHYRDGSCSCKDYW
Subjt:  AREIIIRDRTRFHHYRDGSCSCKDYW

XP_038902272.1 pentatricopeptide repeat-containing protein At4g14820 [Benincasa hispida]0.0e+0092.29Show/hide
Query:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR
        METLSHSTS+LPLQI TYPTRPTALSAALSSA+SLLHLKQVHAQILRSKFE  DS+SLLF+LILSSCAL PSLDYALSVFDQ+P+PKTRFCNKLLRELSR
Subjt:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR

Query:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC
        GS+PE ALLLYEKMRAEGLSLDRFCFPPLLKAASRNLS RTGMEIHGLASKLGFGSDPFVETGLV+MYAACGRIM+ARLVFDKMSH+D+VTWSIMIDGYC
Subjt:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC

Query:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL
         SGCY+LAFQLFE+MKRT LEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNI+MDPHLQSALITMYASC SMDLAWDL+EK+ PKNMVVSTAMVSGL
Subjt:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL

Query:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM
        AKGGQIGDARYVFDQMV KDLICWSAMISGYTESD PQEAL+LFK+MQQ G+KPDVVTMLSVISACAHLGALDQA WIQ YVDKNGF KALSVNNALIDM
Subjt:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM

Query:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV
        YAKCGSLEGAREVF KMPKKNVISWTSMINA AMHGDAH+A+SLFHQMKVENVEPNWITFVGVLYACSH GLVEEGRRIFHSMTNEYGISPKHEHFGCMV
Subjt:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV

Query:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE
        DLFGRA LLREALEVIEAMPFAPNAIIWGSLMAACQ+H E ELGEFAAKQVLKLEPDHDGALVVLSN+YAKERRWE+VGEVRKLMN+MG+SKERGCSRIE
Subjt:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE

Query:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
        LNNEVHEFQMADRKHKQA+QIYQKLDEVVQKL LAGY PQTNCVL DLDEEEKKELVL HSEKLA CYALMNEG RICIIKNLRICEDCHAFMKLASKVY
Subjt:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY

Query:  AREIIIRDRTRFHHYRDGSCSCKDYW
        AREIIIRDR+RFHHYRDGSCSCKDYW
Subjt:  AREIIIRDRTRFHHYRDGSCSCKDYW

TrEMBL top hitse value%identityAlignment
A0A438KLK6 Pentatricopeptide repeat-containing protein7.7e-30871.84Show/hide
Query:  STSILPLQIPTYPTRPT------ALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR
        S + L L     P  PT       L +ALSSATSL HLKQVHAQILRSK +R  S SLL KL++SSCALS SLDYALSVF+ +P+P+T  CN+ LRELSR
Subjt:  STSILPLQIPTYPTRPT------ALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR

Query:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC
          +PE  LL+YE+MR +GL++DRF FPPLLKA SR  S   G+EIHGLA+KLGF SDPFV+TGLVRMYAACGRI +ARL+FDKM H+D+VTWSIMIDGYC
Subjt:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC

Query:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL
         SG +  A  LFEEMK   +EPDEM+LSTVLSAC RAGNL +G  IH+FI + NI++DPHLQSAL+TMYASC SMDLA +L+EKM+PKN+V STAMV+G 
Subjt:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL

Query:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM
        +K GQI +AR VF+QMV+KDL+CWSAMISGY ESD PQEAL LF EMQ +GIKPD VTMLSVI+ACAHLGALDQAKWI  +VDKNGF  AL +NNALI+M
Subjt:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM

Query:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV
        YAKCGSLE AR +F KMP+KNVISWT MI+AFAMHGDA +AL  FHQM+ EN+EPN ITFVGVLYACSHAGLVEEGR+IF+SM NE+ I+PKH H+GCMV
Subjt:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV

Query:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE
        DLFGRA LLREALE++EAMP APN IIWGSLMAAC++H E ELGEFAAK++L+L+PDHDGA V LSN+YAK +RWE+VG+VRKLM   GISKERGCSRIE
Subjt:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE

Query:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRIC--IIKNLRICEDCHAFMKLASK
        LNNE+HEF +ADR HK A++IY+KLDEVV KLKL GY P T  +LVDL+EEEKKE+VL HSEKLALCY LM +G+  C  I KNLR+CEDCH F+KLASK
Subjt:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRIC--IIKNLRICEDCHAFMKLASK

Query:  VYAREIIIRDRTRFHHYRDGSCSCKDYW
        VY REI++RDRTRFHHY+DG CSCKDYW
Subjt:  VYAREIIIRDRTRFHHYRDGSCSCKDYW

A0A6J1C8N9 pentatricopeptide repeat-containing protein At4g148200.0e+0088.02Show/hide
Query:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR
        METLSHSTS LPLQ   + TRPT L+AAL+SA++LLHLKQVH QILRSKFER DSDSLLFKL+LSSCALS SLDYALSVFDQ+PEPKTRFCNKLLRELSR
Subjt:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR

Query:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC
        G  PE+AL +YEKMRAEGLSLDRF FPP+LKAASRNLS RTGMEIHGLASKLGFG DPFVETGLVRMYAACGR+M+ARLVFDKMSH+D+VTWSIMIDGYC
Subjt:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC

Query:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL
        ISGCY+LAFQLFEEMKRT +EPDEMILST++SACARAGNLD+GT+IHEFITKKNI+MDPHLQSALITMYASC SMDLAWDLYEK+SPKNMVVSTAMVSGL
Subjt:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL

Query:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM
        +K GQIGDARYVFDQMVEKDLICWSAMISGYTESD PQEALVLFK+MQ +GIKPDVVTMLSVISACAHLGAL+QA WI TYVDKNGF KALSVNNALIDM
Subjt:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM

Query:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV
        YAKCGSLEGAREVF+KMPKKNVISWT MINA AMHGD+HNAL+LFHQMK ENVEPNWITFVGVLYACSH GLVEEGR+IFHSM N+YGISPKHEHFGCMV
Subjt:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV

Query:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE
        DLFGRA LLREALE+IEAMPFAPNAIIWGSLMAACQ++ E ELGEFAAKQVLKLEP+HDGA VVLSN+YAKERRWE+VGEVRKLMNEMG++KERGCSR+E
Subjt:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE

Query:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
        LNNEVHEFQMADRKH+QA+QIYQKLDEVVQKLK+AGY P+ +CVLVDLDEEE+KE +L HSEKLALCYALMNEGS I IIKNLRICEDCH FMKLASKVY
Subjt:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY

Query:  AREIIIRDRTRFHHYRDGSCSCKDYW
        AREIIIRDRTRFHHYRDGSCSC DYW
Subjt:  AREIIIRDRTRFHHYRDGSCSCKDYW

A0A6J1H7U5 pentatricopeptide repeat-containing protein At4g148200.0e+0091.46Show/hide
Query:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR
        METLSH+TSILPLQ+P YPT+P ALSAALSSATSLLH+KQVHAQILRSKFERSDSDSLLFKLILSSC+LSPSLDYALSVFDQ+PEPK+RFCNKLLRELSR
Subjt:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR

Query:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC
        GS+PE+AL +YEKMRAEGLSLDRFCFPPLLKAASRNLS RTGMEIHGLASKLGFGSDPFVETGL+RMYAAC RIM+ARLVFDKMS +D+VTWSIMIDGYC
Subjt:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC

Query:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL
        ISG Y+LAFQLFEEMKRTGLEPDEMILST+LSACARAGNLDFGTK+HEFITKKNI+MDPHLQSALI MYASC S DLAWDLYEK+SPKNMV+STAMVSGL
Subjt:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL

Query:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM
        AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESD PQEALVLFK+MQQ+G+KPDVVTMLSVISACAHLGALDQAKWIQ YVDKNGF KALS+NNALIDM
Subjt:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM

Query:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV
        YAKCGSLEGARE+F KMPKKNVISWTSMINA AMHGDAH ALSLFHQMKVENVEPNWITFVG+LYACSH GLVEEG+RIFHSM NEYGISPKHEHFGCMV
Subjt:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV

Query:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE
        DLFGRAKLLREALEV+EAMPFAPNAIIWGSLMAACQLH + ELGEFAAKQVLKLEPDHDGALVVLSN+YAKERRWE+ GEVRKLMNEMG+SKERGCSRIE
Subjt:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE

Query:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
        LNNEVHEFQMADRKHKQA+ IYQKL+EVVQ LKLAGY PQTNCVLVDLD+EEKKELVL HSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
Subjt:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY

Query:  AREIIIRDRTRFHHYRDGSCSCKDYW
        AREI++RDRTRFHHYRDGSCSCKDYW
Subjt:  AREIIIRDRTRFHHYRDGSCSCKDYW

A0A6J1ICZ7 pentatricopeptide repeat-containing protein At4g14820-like isoform X20.0e+0090.38Show/hide
Query:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR
        METLSH+TSILPLQ+P YPTRP ALSAALSSATSLLH+KQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQ+PEPKTRFCNKLLRELSR
Subjt:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR

Query:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC
        GS+PE+AL +YEKMRAEGLSLDRFCFPPLLKAASRNLS RTGMEIHGLASKLGFGSDPFVETGL+RMYAAC RIM+ARLVFDKMS +D+VTWSIMIDGYC
Subjt:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC

Query:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL
        +SG Y+LAFQLFEEMKRTGLEPDEMILST+LSACARAGNLDFGTKIHEFITKKNI+MDPHLQSALI MYAS  S DLAWDLYEK+SPKNMV+STAMVSGL
Subjt:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL

Query:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM
        AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESD PQEALVLFK+MQQ+G+KPDVVTMLSVISACAHLGALDQAKWIQ YVDKNGF KALS+NNALIDM
Subjt:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM

Query:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV
        YAKCGSLEGARE+F KMPKKNVISWTSMINA AMHGDAHNALSLFHQMKVENVEPNWITFVG+LYACSH GLV+EG+RIFHSM NEYGISPKHEHFGCMV
Subjt:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV

Query:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE
        DLFGRAKLLREALEV+EAMPFAPNAIIWGSLMAACQLHS+ ELGEFAAKQVLKLEPDHDGALVVLSN+YAKERRWE+ GEVRKLMNEMG+SKERGCSRIE
Subjt:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE

Query:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSR------------ICIIKNLRICED
        LNNEVHEFQMADRKHKQA+ IY KL+EVVQKLKLAGY PQTNCVLVDLDEEEKKELVL HSEKLALCYALMNEGSR            ICI KNLRICED
Subjt:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSR------------ICIIKNLRICED

Query:  CHAFMKLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW
        CHAFMKLASKVYAREI++RDRTRFHHYRDGSCSCKDYW
Subjt:  CHAFMKLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW

A0A6J1IE29 pentatricopeptide repeat-containing protein At4g14820-like0.0e+0091.87Show/hide
Query:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR
        METLSH+TSILPLQ+P YPTRP ALSAALSSATSLLH+KQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQ+PEPKTRFCNKLLRELSR
Subjt:  METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSR

Query:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC
        GS+PE+AL +YEKMRAEGLSLDRFCFPPLLKAASRNLS RTGMEIHGLASKLGFGSDPFVETGL+RMYAAC RIM+ARLVFDKMS +D+VTWSIMIDGYC
Subjt:  GSDPEDALLLYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC

Query:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL
        +SG Y+LAFQLFEEMKRTGLEPDEMILST+LSACARAGNLDFGTKIHEFITKKNI+MDPHLQSALI MYAS  S DLAWDLYEK+SPKNMV+STAMVSGL
Subjt:  ISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGL

Query:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM
        AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESD PQEALVLFK+MQQ+G+KPDVVTMLSVISACAHLGALDQAKWIQ YVDKNGF KALS+NNALIDM
Subjt:  AKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDM

Query:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV
        YAKCGSLEGARE+F KMPKKNVISWTSMINA AMHGDAHNALSLFHQMKVENVEPNWITFVG+LYACSH GLV+EG+RIFHSM NEYGISPKHEHFGCMV
Subjt:  YAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMV

Query:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE
        DLFGRAKLLREALEV+EAMPFAPNAIIWGSLMAACQLHS+ ELGEFAAKQVLKLEPDHDGALVVLSN+YAKERRWE+ GEVRKLMNEMG+SKERGCSRIE
Subjt:  DLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIE

Query:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY
        LNNEVHEFQMADRKHKQA+ IY KL+EVVQKLKLAGY PQTNCVLVDLDEEEKKELVL HSEKLALCYALMNEGSRICI KNLRICEDCHAFMKLASKVY
Subjt:  LNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVY

Query:  AREIIIRDRTRFHHYRDGSCSCKDYW
        AREI++RDRTRFHHYRDGSCSCKDYW
Subjt:  AREIIIRDRTRFHHYRDGSCSCKDYW

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148207.4e-24757.52Show/hide
Query:  LPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPE-PKTRFCNKLLRELSRGSDPEDALL
        + L  P   T    +   LS   SL H+KQ+HA ILR+       +S LF L +SS ++  +L YAL+VF  +P  P++   N  LR+LSR S+P   +L
Subjt:  LPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPE-PKTRFCNKLLRELSRGSDPEDALL

Query:  LYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYCISGCYELAF
         Y+++R  G  LD+F F P+LKA S+  +   GME+HG+A K+    DPFVETG + MYA+CGRI  AR VFD+MSH+D+VTW+ MI+ YC  G  + AF
Subjt:  LYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYCISGCYELAF

Query:  QLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDA
        +LFEEMK + + PDEMIL  ++SAC R GN+ +   I+EF+ + ++ MD HL +AL+TMYA    MD+A + + KMS +N+ VSTAMVSG +K G++ DA
Subjt:  QLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDA

Query:  RYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEG
        + +FDQ  +KDL+CW+ MIS Y ESD+PQEAL +F+EM   GIKPDVV+M SVISACA+LG LD+AKW+ + +  NG    LS+NNALI+MYAKCG L+ 
Subjt:  RYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEG

Query:  AREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLL
         R+VF KMP++NV+SW+SMINA +MHG+A +ALSLF +MK ENVEPN +TFVGVLY CSH+GLVEEG++IF SMT+EY I+PK EH+GCMVDLFGRA LL
Subjt:  AREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLL

Query:  REALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQ
        REALEVIE+MP A N +IWGSLM+AC++H E ELG+FAAK++L+LEPDHDGALV++SN+YA+E+RWE+V  +R++M E  + KE+G SRI+ N + HEF 
Subjt:  REALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQ

Query:  MADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSR--------ICIIKNLRICEDCHAFMKLASKVYA
        + D++HKQ+N+IY KLDEVV KLKLAGY+P    VLVD++EEEKK+LVL HSEKLALC+ LMNE           I I+KNLR+CEDCH F KL SKVY 
Subjt:  MADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSR--------ICIIKNLRICEDCHAFMKLASKVYA

Query:  REIIIRDRTRFHHYRDGSCSCKDYW
        REII+RDRTRFH Y++G CSC+DYW
Subjt:  REIIIRDRTRFHHYRDGSCSCKDYW

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic8.3e-16641.2Show/hide
Query:  PLQIPTY-----PTRPTALS------AALSSATSLLHLKQVHAQILRS-KFERSDSDSLLFKL-ILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLREL
        PL +P +     P +PT  +      + +    SL  LKQ H  ++R+  F    S S LF +  LSS A   SL+YA  VFD++P+P +   N L+R  
Subjt:  PLQIPTY-----PTRPTALS------AALSSATSLLHLKQVHAQILRS-KFERSDSDSLLFKL-ILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLREL

Query:  SRGSDPEDALLLYEKMRAEGLSL-DRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMID
        + G DP  ++  +  M +E     +++ FP L+KAA+   S   G  +HG+A K   GSD FV   L+  Y +CG +  A  VF  +  KD+V+W+ MI+
Subjt:  SRGSDPEDALLLYEKMRAEGLSL-DRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMID

Query:  GYCISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMV
        G+   G  + A +LF++M+   ++   + +  VLSACA+  NL+FG ++  +I +  + ++  L +A++ MY  C S++ A  L++ M  K+ V  T M+
Subjt:  GYCISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMV

Query:  SGLAKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQ-QIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNA
         G A       AR V + M +KD++ W+A+IS Y ++  P EAL++F E+Q Q  +K + +T++S +SACA +GAL+  +WI +Y+ K+G      V +A
Subjt:  SGLAKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQ-QIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNA

Query:  LIDMYAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHF
        LI MY+KCG LE +REVF  + K++V  W++MI   AMHG  + A+ +F++M+  NV+PN +TF  V  ACSH GLV+E   +FH M + YGI P+ +H+
Subjt:  LIDMYAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHF

Query:  GCMVDLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGC
         C+VD+ GR+  L +A++ IEAMP  P+  +WG+L+ AC++H+   L E A  ++L+LEP +DGA V+LSN+YAK  +WENV E+RK M   G+ KE GC
Subjt:  GCMVDLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGC

Query:  SRIELNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVL-CHSEKLALCYALMNEGSR--ICIIKNLRICEDCHAFM
        S IE++  +HEF   D  H  + ++Y KL EV++KLK  GY P+ + VL  ++EEE KE  L  HSEKLA+CY L++  +   I +IKNLR+C DCH+  
Subjt:  SRIELNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVL-CHSEKLALCYALMNEGSR--ICIIKNLRICEDCHAFM

Query:  KLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW
        KL S++Y REII+RDR RFHH+R+G CSC D+W
Subjt:  KLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.5e-15940.08Show/hide
Query:  LSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSP---SLDYALSVFDQVPEPKTRFCNKLLRELSRGSDPEDALLLYEKMRAEGLSLDRFC
        L +  +L  L+ +HAQ++  K    +++  L KLI   C LSP    L YA+SVF  + EP     N + R  +  SDP  AL LY  M + GL  + + 
Subjt:  LSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSP---SLDYALSVFDQVPEPKTRFCNKLLRELSRGSDPEDALLLYEKMRAEGLSLDRFC

Query:  FPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSH-------------------------------KDIVTWSI
        FP +LK+ +++ + + G +IHG   KLG   D +V T L+ MY   GR+ DA  VFDK  H                               KD+V+W+ 
Subjt:  FPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSH-------------------------------KDIVTWSI

Query:  MIDGYCISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVST
        MI GY  +G Y+ A +LF++M +T + PDE  + TV+SACA++G+++ G ++H +I       +  + +ALI +Y+ C  ++ A  L+E++         
Subjt:  MIDGYCISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVST

Query:  AMVSGLAKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDK--NGFSKALS
                                KD+I W+ +I GYT  +  +EAL+LF+EM + G  P+ VTMLS++ ACAHLGA+D  +WI  Y+DK   G + A S
Subjt:  AMVSGLAKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDK--NGFSKALS

Query:  VNNALIDMYAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPK
        +  +LIDMYAKCG +E A +VF  +  K++ SW +MI  FAMHG A  +  LF +M+   ++P+ ITFVG+L ACSH+G+++ GR IF +MT +Y ++PK
Subjt:  VNNALIDMYAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPK

Query:  HEHFGCMVDLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISK
         EH+GCM+DL G + L +EA E+I  M   P+ +IW SL+ AC++H   ELGE  A+ ++K+EP++ G+ V+LSN+YA   RW  V + R L+N+ G+ K
Subjt:  HEHFGCMVDLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISK

Query:  ERGCSRIELNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMN--EGSRICIIKNLRICEDCH
          GCS IE+++ VHEF + D+ H +  +IY  L+E+   L+ AG++P T+ VL +++EE K+  +  HSEKLA+ + L++   G+++ I+KNLR+C +CH
Subjt:  ERGCSRIELNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMN--EGSRICIIKNLRICEDCH

Query:  AFMKLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW
           KL SK+Y REII RDRTRFHH+RDG CSC DYW
Subjt:  AFMKLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127701.6e-14537.73Show/hide
Query:  SAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSRGSDPEDALLLYEKMRAEGLSLDRFC
        ++ + SAT    LKQ+HA++L    +   S  L+ KLI +S +    + +A  VFD +P P+    N ++R  SR +  +DALL+Y  M+   +S D F 
Subjt:  SAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSRGSDPEDALLLYEKMRAEGLSLDRFC

Query:  FPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFD--KMSHKDIVTWSIMIDGYCISGCYELAFQLFEEMKRTGLEPD
        FP LLKA S     + G  +H    +LGF +D FV+ GL+ +YA C R+  AR VF+   +  + IV+W+ ++  Y  +G    A ++F +M++  ++PD
Subjt:  FPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFD--KMSHKDIVTWSIMIDGYCISGCYELAFQLFEEMKRTGLEPD

Query:  EMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDARYVFDQMVEKDLIC
         + L +VL+A     +L  G  IH  + K  + ++P L  +L TMYA C                               GQ+  A+ +FD+M   +LI 
Subjt:  EMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDARYVFDQMVEKDLIC

Query:  WSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEGAREVFRKMPKKNVI
        W+AMISGY ++ + +EA+ +F EM    ++PD +++ S ISACA +G+L+QA+ +  YV ++ +   + +++ALIDM+AKCGS+EGAR VF +   ++V+
Subjt:  WSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEGAREVFRKMPKKNVI

Query:  SWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLLREALEVIEAMPFAP
         W++MI  + +HG A  A+SL+  M+   V PN +TF+G+L AC+H+G+V EG   F+ M  ++ I+P+ +H+ C++DL GRA  L +A EVI+ MP  P
Subjt:  SWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLLREALEVIEAMPFAP

Query:  NAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQMADRKHKQANQIYQ
           +WG+L++AC+ H   ELGE+AA+Q+  ++P + G  V LSNLYA  R W+ V EVR  M E G++K+ GCS +E+   +  F++ D+ H +  +I +
Subjt:  NAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQMADRKHKQANQIYQ

Query:  KLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMN--EGSRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRTRFHHYRDGSCS
        +++ +  +LK  G++   +  L DL++EE +E +  HSE++A+ Y L++  +G+ + I KNLR C +CHA  KL SK+  REI++RD  RFHH++DG CS
Subjt:  KLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMN--EGSRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRTRFHHYRDGSCS

Query:  CKDYW
        C DYW
Subjt:  CKDYW

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226907.8e-14835.14Show/hide
Query:  ALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALS--PSLDYALSVFDQVPEPKTRFC-NKLLRELSRGSDPEDALLLYEKMRAEGLS
        A  ++L +  ++  LK  H  + +   +  +  S + KL+  SC L    SL +A  VF+      T F  N L+R  +      +A+LL+ +M   G+S
Subjt:  ALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALS--PSLDYALSVFDQVPEPKTRFC-NKLLRELSRGSDPEDALLLYEKMRAEGLS

Query:  LDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC--------------------
         D++ FP  L A +++ +   G++IHGL  K+G+  D FV+  LV  YA CG +  AR VFD+MS +++V+W+ MI GY                     
Subjt:  LDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC--------------------

Query:  -----------ISGCYEL----------------------------------------AFQLFEE-------------------------------MKRT
                   IS C +L                                        A +LF+E                               M  +
Subjt:  -----------ISGCYEL----------------------------------------AFQLFEE-------------------------------MKRT

Query:  GLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDARYVFDQMVE
        G+ PD + + + +S+C++  N+ +G   H ++ +       ++ +ALI MY  C   D A+ ++++MS K +V   ++V+G  + G++  A   F+ M E
Subjt:  GLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDARYVFDQMVE

Query:  KDLICWSAMISGYTESDFPQEALVLFKEMQ-QIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEGAREVFRKM
        K+++ W+ +ISG  +    +EA+ +F  MQ Q G+  D VTM+S+ SAC HLGALD AKWI  Y++KNG    + +   L+DM+++CG  E A  +F  +
Subjt:  KDLICWSAMISGYTESDFPQEALVLFKEMQ-QIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEGAREVFRKM

Query:  PKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLLREALEVIE
          ++V +WT+ I A AM G+A  A+ LF  M  + ++P+ + FVG L ACSH GLV++G+ IF+SM   +G+SP+  H+GCMVDL GRA LL EA+++IE
Subjt:  PKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLLREALEVIE

Query:  AMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQMADRKHKQ
         MP  PN +IW SL+AAC++    E+  +AA+++  L P+  G+ V+LSN+YA   RW ++ +VR  M E G+ K  G S I++  + HEF   D  H +
Subjt:  AMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQMADRKHKQ

Query:  ANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALM--NEGSRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRTRFHHY
           I   LDEV Q+    G++P  + VL+D+DE+EK  ++  HSEKLA+ Y L+  N+G+ I I+KNLR+C DCH+F K ASKVY REII+RD  RFH+ 
Subjt:  ANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALM--NEGSRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRTRFHHY

Query:  RDGSCSCKDYW
        R G CSC D+W
Subjt:  RDGSCSCKDYW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-16040.08Show/hide
Query:  LSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSP---SLDYALSVFDQVPEPKTRFCNKLLRELSRGSDPEDALLLYEKMRAEGLSLDRFC
        L +  +L  L+ +HAQ++  K    +++  L KLI   C LSP    L YA+SVF  + EP     N + R  +  SDP  AL LY  M + GL  + + 
Subjt:  LSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSP---SLDYALSVFDQVPEPKTRFCNKLLRELSRGSDPEDALLLYEKMRAEGLSLDRFC

Query:  FPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSH-------------------------------KDIVTWSI
        FP +LK+ +++ + + G +IHG   KLG   D +V T L+ MY   GR+ DA  VFDK  H                               KD+V+W+ 
Subjt:  FPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSH-------------------------------KDIVTWSI

Query:  MIDGYCISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVST
        MI GY  +G Y+ A +LF++M +T + PDE  + TV+SACA++G+++ G ++H +I       +  + +ALI +Y+ C  ++ A  L+E++         
Subjt:  MIDGYCISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVST

Query:  AMVSGLAKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDK--NGFSKALS
                                KD+I W+ +I GYT  +  +EAL+LF+EM + G  P+ VTMLS++ ACAHLGA+D  +WI  Y+DK   G + A S
Subjt:  AMVSGLAKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDK--NGFSKALS

Query:  VNNALIDMYAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPK
        +  +LIDMYAKCG +E A +VF  +  K++ SW +MI  FAMHG A  +  LF +M+   ++P+ ITFVG+L ACSH+G+++ GR IF +MT +Y ++PK
Subjt:  VNNALIDMYAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPK

Query:  HEHFGCMVDLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISK
         EH+GCM+DL G + L +EA E+I  M   P+ +IW SL+ AC++H   ELGE  A+ ++K+EP++ G+ V+LSN+YA   RW  V + R L+N+ G+ K
Subjt:  HEHFGCMVDLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISK

Query:  ERGCSRIELNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMN--EGSRICIIKNLRICEDCH
          GCS IE+++ VHEF + D+ H +  +IY  L+E+   L+ AG++P T+ VL +++EE K+  +  HSEKLA+ + L++   G+++ I+KNLR+C +CH
Subjt:  ERGCSRIELNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMN--EGSRICIIKNLRICEDCH

Query:  AFMKLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW
           KL SK+Y REII RDRTRFHH+RDG CSC DYW
Subjt:  AFMKLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.9e-16741.2Show/hide
Query:  PLQIPTY-----PTRPTALS------AALSSATSLLHLKQVHAQILRS-KFERSDSDSLLFKL-ILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLREL
        PL +P +     P +PT  +      + +    SL  LKQ H  ++R+  F    S S LF +  LSS A   SL+YA  VFD++P+P +   N L+R  
Subjt:  PLQIPTY-----PTRPTALS------AALSSATSLLHLKQVHAQILRS-KFERSDSDSLLFKL-ILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLREL

Query:  SRGSDPEDALLLYEKMRAEGLSL-DRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMID
        + G DP  ++  +  M +E     +++ FP L+KAA+   S   G  +HG+A K   GSD FV   L+  Y +CG +  A  VF  +  KD+V+W+ MI+
Subjt:  SRGSDPEDALLLYEKMRAEGLSL-DRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMID

Query:  GYCISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMV
        G+   G  + A +LF++M+   ++   + +  VLSACA+  NL+FG ++  +I +  + ++  L +A++ MY  C S++ A  L++ M  K+ V  T M+
Subjt:  GYCISGCYELAFQLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMV

Query:  SGLAKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQ-QIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNA
         G A       AR V + M +KD++ W+A+IS Y ++  P EAL++F E+Q Q  +K + +T++S +SACA +GAL+  +WI +Y+ K+G      V +A
Subjt:  SGLAKGGQIGDARYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQ-QIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNA

Query:  LIDMYAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHF
        LI MY+KCG LE +REVF  + K++V  W++MI   AMHG  + A+ +F++M+  NV+PN +TF  V  ACSH GLV+E   +FH M + YGI P+ +H+
Subjt:  LIDMYAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHF

Query:  GCMVDLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGC
         C+VD+ GR+  L +A++ IEAMP  P+  +WG+L+ AC++H+   L E A  ++L+LEP +DGA V+LSN+YAK  +WENV E+RK M   G+ KE GC
Subjt:  GCMVDLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGC

Query:  SRIELNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVL-CHSEKLALCYALMNEGSR--ICIIKNLRICEDCHAFM
        S IE++  +HEF   D  H  + ++Y KL EV++KLK  GY P+ + VL  ++EEE KE  L  HSEKLA+CY L++  +   I +IKNLR+C DCH+  
Subjt:  SRIELNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVL-CHSEKLALCYALMNEGSR--ICIIKNLRICEDCHAFM

Query:  KLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW
        KL S++Y REII+RDR RFHH+R+G CSC D+W
Subjt:  KLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)1.0e-14735.06Show/hide
Query:  ALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALS--PSLDYALSVFDQVPEPKTRFC-NKLLRELSRGSDPEDALLLYEKMRAEGLS
        A  ++L +  ++  LK  H  + +   +  +  S + KL+  SC L    SL +A  VF+      T F  N L+R  +      +A+LL+ +M   G+S
Subjt:  ALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALS--PSLDYALSVFDQVPEPKTRFC-NKLLRELSRGSDPEDALLLYEKMRAEGLS

Query:  LDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC--------------------
         D++ FP  L A +++ +   G++IHGL  K+G+  D FV+  LV  YA CG +  AR VFD+MS +++V+W+ MI GY                     
Subjt:  LDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC--------------------

Query:  -----------ISGCYEL----------------------------------------AFQLFEE-------------------------------MKRT
                   IS C +L                                        A +LF+E                               M  +
Subjt:  -----------ISGCYEL----------------------------------------AFQLFEE-------------------------------MKRT

Query:  GLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDARYVFDQMVE
        G+ PD + + + +S+C++  N+ +G   H ++ +       ++ +ALI MY  C   D A+ ++++MS K +V   ++V+G  + G++  A   F+ M E
Subjt:  GLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDARYVFDQMVE

Query:  KDLICWSAMISGYTESDFPQEALVLFKEMQ-QIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEGAREVFRKM
        K+++ W+ +ISG  +    +EA+ +F  MQ Q G+  D VTM+S+ SAC HLGALD AKWI  Y++KNG    + +   L+DM+++CG  E A  +F  +
Subjt:  KDLICWSAMISGYTESDFPQEALVLFKEMQ-QIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEGAREVFRKM

Query:  PKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLLREALEVIE
          ++V +WT+ I A AM G+A  A+ LF  M  + ++P+ + FVG L ACSH GLV++G+ IF+SM   +G+SP+  H+GCMVDL GRA LL EA+++IE
Subjt:  PKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLLREALEVIE

Query:  AMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQMADRKHKQ
         MP  PN +IW SL+AAC++    E+  +AA+++  L P+  G+ V+LSN+YA   RW ++ +VR  M E G+ K  G S I++  + HEF   D  H +
Subjt:  AMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQMADRKHKQ

Query:  ANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALM--NEGSRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRTRFHHY
           I   LDEV Q+    G++P  + VL+D+DE+EK  ++  HSEKLA+ Y L+  N+G+ I I+KNLR+C DCH+F K ASKVY REII+RD  RFH+ 
Subjt:  ANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALM--NEGSRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRTRFHHY

Query:  RDGSCSCKDY
        R G CSC D+
Subjt:  RDGSCSCKDY

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification5.6e-14935.14Show/hide
Query:  ALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALS--PSLDYALSVFDQVPEPKTRFC-NKLLRELSRGSDPEDALLLYEKMRAEGLS
        A  ++L +  ++  LK  H  + +   +  +  S + KL+  SC L    SL +A  VF+      T F  N L+R  +      +A+LL+ +M   G+S
Subjt:  ALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALS--PSLDYALSVFDQVPEPKTRFC-NKLLRELSRGSDPEDALLLYEKMRAEGLS

Query:  LDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC--------------------
         D++ FP  L A +++ +   G++IHGL  K+G+  D FV+  LV  YA CG +  AR VFD+MS +++V+W+ MI GY                     
Subjt:  LDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYC--------------------

Query:  -----------ISGCYEL----------------------------------------AFQLFEE-------------------------------MKRT
                   IS C +L                                        A +LF+E                               M  +
Subjt:  -----------ISGCYEL----------------------------------------AFQLFEE-------------------------------MKRT

Query:  GLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDARYVFDQMVE
        G+ PD + + + +S+C++  N+ +G   H ++ +       ++ +ALI MY  C   D A+ ++++MS K +V   ++V+G  + G++  A   F+ M E
Subjt:  GLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDARYVFDQMVE

Query:  KDLICWSAMISGYTESDFPQEALVLFKEMQ-QIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEGAREVFRKM
        K+++ W+ +ISG  +    +EA+ +F  MQ Q G+  D VTM+S+ SAC HLGALD AKWI  Y++KNG    + +   L+DM+++CG  E A  +F  +
Subjt:  KDLICWSAMISGYTESDFPQEALVLFKEMQ-QIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEGAREVFRKM

Query:  PKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLLREALEVIE
          ++V +WT+ I A AM G+A  A+ LF  M  + ++P+ + FVG L ACSH GLV++G+ IF+SM   +G+SP+  H+GCMVDL GRA LL EA+++IE
Subjt:  PKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLLREALEVIE

Query:  AMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQMADRKHKQ
         MP  PN +IW SL+AAC++    E+  +AA+++  L P+  G+ V+LSN+YA   RW ++ +VR  M E G+ K  G S I++  + HEF   D  H +
Subjt:  AMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQMADRKHKQ

Query:  ANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALM--NEGSRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRTRFHHY
           I   LDEV Q+    G++P  + VL+D+DE+EK  ++  HSEKLA+ Y L+  N+G+ I I+KNLR+C DCH+F K ASKVY REII+RD  RFH+ 
Subjt:  ANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALM--NEGSRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRTRFHHY

Query:  RDGSCSCKDYW
        R G CSC D+W
Subjt:  RDGSCSCKDYW

AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein5.2e-24857.52Show/hide
Query:  LPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPE-PKTRFCNKLLRELSRGSDPEDALL
        + L  P   T    +   LS   SL H+KQ+HA ILR+       +S LF L +SS ++  +L YAL+VF  +P  P++   N  LR+LSR S+P   +L
Subjt:  LPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPE-PKTRFCNKLLRELSRGSDPEDALL

Query:  LYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYCISGCYELAF
         Y+++R  G  LD+F F P+LKA S+  +   GME+HG+A K+    DPFVETG + MYA+CGRI  AR VFD+MSH+D+VTW+ MI+ YC  G  + AF
Subjt:  LYEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYCISGCYELAF

Query:  QLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDA
        +LFEEMK + + PDEMIL  ++SAC R GN+ +   I+EF+ + ++ MD HL +AL+TMYA    MD+A + + KMS +N+ VSTAMVSG +K G++ DA
Subjt:  QLFEEMKRTGLEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDA

Query:  RYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEG
        + +FDQ  +KDL+CW+ MIS Y ESD+PQEAL +F+EM   GIKPDVV+M SVISACA+LG LD+AKW+ + +  NG    LS+NNALI+MYAKCG L+ 
Subjt:  RYVFDQMVEKDLICWSAMISGYTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEG

Query:  AREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLL
         R+VF KMP++NV+SW+SMINA +MHG+A +ALSLF +MK ENVEPN +TFVGVLY CSH+GLVEEG++IF SMT+EY I+PK EH+GCMVDLFGRA LL
Subjt:  AREVFRKMPKKNVISWTSMINAFAMHGDAHNALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLL

Query:  REALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQ
        REALEVIE+MP A N +IWGSLM+AC++H E ELG+FAAK++L+LEPDHDGALV++SN+YA+E+RWE+V  +R++M E  + KE+G SRI+ N + HEF 
Subjt:  REALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQVLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQ

Query:  MADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSR--------ICIIKNLRICEDCHAFMKLASKVYA
        + D++HKQ+N+IY KLDEVV KLKLAGY+P    VLVD++EEEKK+LVL HSEKLALC+ LMNE           I I+KNLR+CEDCH F KL SKVY 
Subjt:  MADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCHSEKLALCYALMNEGSR--------ICIIKNLRICEDCHAFMKLASKVYA

Query:  REIIIRDRTRFHHYRDGSCSCKDYW
        REII+RDRTRFH Y++G CSC+DYW
Subjt:  REIIIRDRTRFHHYRDGSCSCKDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACTCTCAGCCACTCAACCTCCATCCTCCCTCTTCAAATTCCCACATACCCCACAAGACCCACCGCTCTCTCCGCCGCTCTCTCGTCCGCCACCAGCCTCCTCCA
CCTCAAACAAGTCCACGCTCAAATCCTCCGCTCCAAATTCGAACGCTCCGATTCCGATTCCCTTCTTTTCAAACTTATTCTCTCCTCTTGTGCTCTCTCGCCCAGCCTCG
ACTATGCCCTCTCTGTGTTTGATCAAGTTCCCGAGCCCAAAACCCGTTTCTGCAACAAGCTTCTACGCGAATTATCTCGCGGTTCTGACCCGGAGGATGCGCTTTTACTG
TACGAGAAGATGAGGGCTGAGGGTCTGAGTTTGGATAGGTTCTGCTTCCCACCCCTGTTGAAAGCGGCTTCCAGGAATCTTTCGTCGAGAACTGGGATGGAGATTCATGG
GCTCGCGTCGAAGTTGGGATTTGGTTCAGACCCATTTGTGGAGACGGGGTTGGTGAGAATGTACGCGGCCTGTGGAAGGATTATGGATGCACGCCTGGTGTTTGATAAAA
TGTCTCACAAGGATATCGTCACTTGGAGCATCATGATTGATGGGTATTGCATAAGTGGCTGTTACGAACTCGCCTTTCAACTCTTTGAAGAAATGAAGAGAACAGGCTTA
GAACCAGATGAGATGATTCTTTCAACAGTTCTTTCTGCCTGCGCCCGTGCTGGAAATTTGGATTTTGGAACAAAAATACACGAGTTCATTACCAAGAAGAATATTATCAT
GGATCCTCACTTACAAAGTGCTCTCATCACGATGTATGCAAGCTGTGACTCAATGGACTTGGCTTGGGATTTGTATGAAAAGATGTCCCCTAAGAACATGGTTGTTTCAA
CTGCCATGGTTTCTGGGCTTGCAAAAGGTGGACAGATTGGTGATGCTCGCTATGTGTTTGATCAAATGGTGGAGAAGGACTTGATATGTTGGAGTGCAATGATTTCTGGA
TATACTGAGAGTGATTTCCCTCAAGAGGCTCTTGTATTGTTCAAAGAAATGCAACAAATCGGAATAAAGCCTGATGTAGTCACCATGTTGAGTGTCATCTCAGCTTGTGC
TCATCTTGGTGCATTAGATCAAGCCAAATGGATCCAAACTTATGTTGATAAAAATGGGTTCAGCAAGGCATTATCTGTCAATAATGCACTCATTGATATGTATGCTAAAT
GTGGGAGTCTTGAAGGAGCGAGAGAGGTCTTTAGAAAGATGCCAAAGAAAAATGTTATATCTTGGACCAGTATGATTAATGCTTTTGCAATGCACGGAGATGCTCATAAT
GCTTTAAGCCTATTTCATCAAATGAAAGTTGAAAATGTTGAGCCTAATTGGATCACATTTGTAGGGGTGCTTTATGCTTGTAGCCACGCAGGTTTAGTTGAGGAGGGCCG
AAGAATATTTCACTCAATGACCAATGAGTATGGCATAAGTCCCAAGCACGAACACTTTGGTTGCATGGTTGACCTTTTTGGCCGTGCAAAACTGCTAAGAGAAGCTCTTG
AGGTGATAGAGGCAATGCCATTTGCTCCTAATGCTATTATTTGGGGGTCCCTTATGGCTGCTTGTCAGCTCCACAGTGAGGCTGAGTTAGGAGAATTTGCTGCTAAACAA
GTTCTCAAGCTCGAGCCAGATCACGATGGGGCCCTTGTTGTCTTATCGAACTTATATGCAAAAGAAAGAAGATGGGAAAACGTTGGGGAGGTTAGAAAACTAATGAATGA
GATGGGTATTTCCAAAGAGAGAGGATGCAGTAGAATTGAATTGAACAATGAGGTGCATGAATTTCAAATGGCAGATAGAAAGCACAAGCAAGCAAATCAAATATATCAGA
AGTTAGATGAGGTTGTTCAAAAACTGAAACTGGCTGGTTATATGCCACAGACAAATTGTGTGCTCGTTGATTTAGACGAAGAGGAAAAGAAGGAATTAGTCCTCTGCCAT
AGCGAGAAATTGGCACTGTGCTATGCCCTTATGAATGAAGGGTCACGCATTTGCATTATAAAGAACCTAAGGATTTGTGAGGATTGTCATGCTTTTATGAAATTAGCCTC
AAAGGTCTATGCCAGAGAGATTATCATTAGGGACAGAACTAGGTTTCACCATTACAGAGACGGTTCTTGTTCTTGTAAAGACTACTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACTCTCAGCCACTCAACCTCCATCCTCCCTCTTCAAATTCCCACATACCCCACAAGACCCACCGCTCTCTCCGCCGCTCTCTCGTCCGCCACCAGCCTCCTCCA
CCTCAAACAAGTCCACGCTCAAATCCTCCGCTCCAAATTCGAACGCTCCGATTCCGATTCCCTTCTTTTCAAACTTATTCTCTCCTCTTGTGCTCTCTCGCCCAGCCTCG
ACTATGCCCTCTCTGTGTTTGATCAAGTTCCCGAGCCCAAAACCCGTTTCTGCAACAAGCTTCTACGCGAATTATCTCGCGGTTCTGACCCGGAGGATGCGCTTTTACTG
TACGAGAAGATGAGGGCTGAGGGTCTGAGTTTGGATAGGTTCTGCTTCCCACCCCTGTTGAAAGCGGCTTCCAGGAATCTTTCGTCGAGAACTGGGATGGAGATTCATGG
GCTCGCGTCGAAGTTGGGATTTGGTTCAGACCCATTTGTGGAGACGGGGTTGGTGAGAATGTACGCGGCCTGTGGAAGGATTATGGATGCACGCCTGGTGTTTGATAAAA
TGTCTCACAAGGATATCGTCACTTGGAGCATCATGATTGATGGGTATTGCATAAGTGGCTGTTACGAACTCGCCTTTCAACTCTTTGAAGAAATGAAGAGAACAGGCTTA
GAACCAGATGAGATGATTCTTTCAACAGTTCTTTCTGCCTGCGCCCGTGCTGGAAATTTGGATTTTGGAACAAAAATACACGAGTTCATTACCAAGAAGAATATTATCAT
GGATCCTCACTTACAAAGTGCTCTCATCACGATGTATGCAAGCTGTGACTCAATGGACTTGGCTTGGGATTTGTATGAAAAGATGTCCCCTAAGAACATGGTTGTTTCAA
CTGCCATGGTTTCTGGGCTTGCAAAAGGTGGACAGATTGGTGATGCTCGCTATGTGTTTGATCAAATGGTGGAGAAGGACTTGATATGTTGGAGTGCAATGATTTCTGGA
TATACTGAGAGTGATTTCCCTCAAGAGGCTCTTGTATTGTTCAAAGAAATGCAACAAATCGGAATAAAGCCTGATGTAGTCACCATGTTGAGTGTCATCTCAGCTTGTGC
TCATCTTGGTGCATTAGATCAAGCCAAATGGATCCAAACTTATGTTGATAAAAATGGGTTCAGCAAGGCATTATCTGTCAATAATGCACTCATTGATATGTATGCTAAAT
GTGGGAGTCTTGAAGGAGCGAGAGAGGTCTTTAGAAAGATGCCAAAGAAAAATGTTATATCTTGGACCAGTATGATTAATGCTTTTGCAATGCACGGAGATGCTCATAAT
GCTTTAAGCCTATTTCATCAAATGAAAGTTGAAAATGTTGAGCCTAATTGGATCACATTTGTAGGGGTGCTTTATGCTTGTAGCCACGCAGGTTTAGTTGAGGAGGGCCG
AAGAATATTTCACTCAATGACCAATGAGTATGGCATAAGTCCCAAGCACGAACACTTTGGTTGCATGGTTGACCTTTTTGGCCGTGCAAAACTGCTAAGAGAAGCTCTTG
AGGTGATAGAGGCAATGCCATTTGCTCCTAATGCTATTATTTGGGGGTCCCTTATGGCTGCTTGTCAGCTCCACAGTGAGGCTGAGTTAGGAGAATTTGCTGCTAAACAA
GTTCTCAAGCTCGAGCCAGATCACGATGGGGCCCTTGTTGTCTTATCGAACTTATATGCAAAAGAAAGAAGATGGGAAAACGTTGGGGAGGTTAGAAAACTAATGAATGA
GATGGGTATTTCCAAAGAGAGAGGATGCAGTAGAATTGAATTGAACAATGAGGTGCATGAATTTCAAATGGCAGATAGAAAGCACAAGCAAGCAAATCAAATATATCAGA
AGTTAGATGAGGTTGTTCAAAAACTGAAACTGGCTGGTTATATGCCACAGACAAATTGTGTGCTCGTTGATTTAGACGAAGAGGAAAAGAAGGAATTAGTCCTCTGCCAT
AGCGAGAAATTGGCACTGTGCTATGCCCTTATGAATGAAGGGTCACGCATTTGCATTATAAAGAACCTAAGGATTTGTGAGGATTGTCATGCTTTTATGAAATTAGCCTC
AAAGGTCTATGCCAGAGAGATTATCATTAGGGACAGAACTAGGTTTCACCATTACAGAGACGGTTCTTGTTCTTGTAAAGACTACTGGTGA
Protein sequenceShow/hide protein sequence
METLSHSTSILPLQIPTYPTRPTALSAALSSATSLLHLKQVHAQILRSKFERSDSDSLLFKLILSSCALSPSLDYALSVFDQVPEPKTRFCNKLLRELSRGSDPEDALLL
YEKMRAEGLSLDRFCFPPLLKAASRNLSSRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMDARLVFDKMSHKDIVTWSIMIDGYCISGCYELAFQLFEEMKRTGL
EPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIIMDPHLQSALITMYASCDSMDLAWDLYEKMSPKNMVVSTAMVSGLAKGGQIGDARYVFDQMVEKDLICWSAMISG
YTESDFPQEALVLFKEMQQIGIKPDVVTMLSVISACAHLGALDQAKWIQTYVDKNGFSKALSVNNALIDMYAKCGSLEGAREVFRKMPKKNVISWTSMINAFAMHGDAHN
ALSLFHQMKVENVEPNWITFVGVLYACSHAGLVEEGRRIFHSMTNEYGISPKHEHFGCMVDLFGRAKLLREALEVIEAMPFAPNAIIWGSLMAACQLHSEAELGEFAAKQ
VLKLEPDHDGALVVLSNLYAKERRWENVGEVRKLMNEMGISKERGCSRIELNNEVHEFQMADRKHKQANQIYQKLDEVVQKLKLAGYMPQTNCVLVDLDEEEKKELVLCH
SEKLALCYALMNEGSRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRTRFHHYRDGSCSCKDYW