; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021505 (gene) of Snake gourd v1 genome

Gene IDTan0021505
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG04:36811399..36816869
RNA-Seq ExpressionTan0021505
SyntenyTan0021505
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601646.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.62Show/hide
Query:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV
        MLQRFFSLPR FSRLAGPLLEMGHH+   TYA Q  LS+LDQTM SVQFISL+SCFY MGLCRNIDTL+KFHGLLIVHGLVG+LLCDTKLVGVYGALGDV
Subjt:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV

Query:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI
        GSARMVFDQM DPDFYAWKVMIRWYFLNDMFA+II FYNRMRMSF++CDNIIFSIILKA SELREIDEGR+VHCQIVKVG PDSFVLTGLIDMY KCGQI
Subjt:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI

Query:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNV-VEHSSFLATAFLDMYVKCG
        EC+SAVFEGIIDKNVVSWT+MIAGYVQNDCA+EGLVLFNRMRE+LVESNQFTLGSIITACT+LRALHQGKWVHGYA KNV +E +SFLATAFLDMYVKCG
Subjt:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNV-VEHSSFLATAFLDMYVKCG

Query:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH
        QTRDARMIFDELP+IDLVSWTAMIVGY+QAGQPN+ALRLFTD+IR  LLPNSVTAAS+LS+CSVSGNLS+GM VHGLGIKLGLEECAVKNALIDMYAKCH
Subjt:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH

Query:  MIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK
        MIDDAY +F GVLEKD ITWNSMI GYAQ+GSAY+AL+LFNQM+SDS APDAITLVS LSASA LGAVQVGSSLHAYSIKEGLFSSNLYIGTALLN YAK
Subjt:  MIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK

Query:  CGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLL
        CGDAKSARMVFDSM DKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPN+VIFTTVLSACSYSGMVEEGWRYFKSMSQDYN+ PSMKHYACMVDLL
Subjt:  CGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLL

Query:  ARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNA
        +RSGRL+EAL+FIKKMPV+PDISLYGAFLHGCGLYSRFDLGEV+VREML+LHPNEAC+YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPGYS VETNA
Subjt:  ARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNA

Query:  GVL
        G++
Subjt:  GVL

KAG7032406.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0089.93Show/hide
Query:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV
        MLQRFFSLPR FSRLAGPLLEMGHH++  TYA Q  LS+LDQTM SVQFISL+SCFY MGLCRNIDTL+KFHGLLIVHGLVG+LLCDTKLVGVYGALGDV
Subjt:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV

Query:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI
        GSARMVFDQM DPDFYAWKVMIRWYFLNDMFA+II FYNRMRMSF++CDNIIFSIILKA SELREIDEGR+VHCQIVKVG PDSFVLTGLIDMY KCGQI
Subjt:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI

Query:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNV-VEHSSFLATAFLDMYVKCG
        EC+SAVFEGIIDKNVVSWT+MIAGYVQNDCA+EGLVLFNRMRE+LVESNQFTLGSIITACT+LRALHQGKWVHGYA KNV +E +SFLATAFLDMYVKCG
Subjt:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNV-VEHSSFLATAFLDMYVKCG

Query:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH
        QTRDARMIFDELP+IDLVSWTAMIVGY+QAGQPN+ALRLFTD+IR  LLPNSVTAAS+LS+CSVSGNLS+GM VHGLGIKLGLEECAVKNALIDMYAKCH
Subjt:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH

Query:  MIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK
        MIDDAY +F GVLEKD ITWNSMI GYAQ+GSAY+AL+LFNQM+SDS APDAITLVS LSASA LGAVQVGSSLHAYSIKEGLFSSNLYIGTALLN YAK
Subjt:  MIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK

Query:  CGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLL
        CGDAKSARMVFDSM DKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPN+VIFTTVLSACSYSGMVEEGWRYFKSMSQDYN+APSMKHYACMVDLL
Subjt:  CGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLL

Query:  ARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNA
        +RSGRL+EAL+FIKKMPV+PDISLYGAFLHGCGLYSRFDLGEV+VREML+LHPNEAC+YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPGYS VETNA
Subjt:  ARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNA

Query:  GVLFH
        GV FH
Subjt:  GVLFH

XP_022957571.1 pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Cucurbita moschata]0.0e+0090.07Show/hide
Query:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV
        MLQRFFSLPR FSRLAGPLLEMGHH++  TYA Q  LS+LDQTM SVQFISL+SCFY MGLCRNIDTL+KFHGLLIVHGLVG+LLCDTKLVGVYGALGDV
Subjt:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV

Query:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI
        GSARMVFDQM DPDFYAWKVMIRWYFLNDMFA+II FYNRMRMSF++CDNIIFSIILKA SELREIDEGR+VHCQIVKVG PDSFVLTGLIDMY KCGQI
Subjt:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI

Query:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVV-EHSSFLATAFLDMYVKCG
        EC+SAVFEGIIDKNVVSWT+MIAGYVQNDCA+EGLVLFNRMRE+LVESNQFTLGSIITACT+LRALHQGKWVHGYA KNV+ E +SFLATAFLDMYVKCG
Subjt:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVV-EHSSFLATAFLDMYVKCG

Query:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH
        QTRDARMIFDELP+IDLVSWTAMIVGY+QAGQPNEALRLFTD+IR  LLPNSVTAAS+LS+CSVSGNLS+GM VHGLGIKLGLEECAVKNALIDMYAKCH
Subjt:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH

Query:  MIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK
        MIDDAY +F GVLEKD ITWNSMI GYAQ+GSAY+AL+LFNQMRSDS APDAITLVS LSASA LGAVQVGSSLH YSIKEGLFSSNLYIGTALLN YAK
Subjt:  MIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK

Query:  CGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLL
        CGDAKSARMVFDSM DKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPN+VIFTTVLSACSYSGMVEEGWRYFKSMSQDYN+APSMKHYACMVDLL
Subjt:  CGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLL

Query:  ARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNA
        +RSGRL+EALDFIKKMPV+PDISLYGAFLHGCGLYSRFDLGEV+VREML+LHPNEA  YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPGYS VETNA
Subjt:  ARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNA

Query:  GVLFH
        GV FH
Subjt:  GVLFH

XP_022997487.1 pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Cucurbita maxima]0.0e+0090.62Show/hide
Query:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV
        MLQRFFSLPR FSRLAGPLLEMGHH++  TYA Q  LS+LDQTM SVQFISL+SCFY MGLCRNIDTL+KFHGLLIVHGLVG+LLCDTKLVGVYGALGDV
Subjt:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV

Query:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI
        GSARMVFDQM DPDFYAWKVMIRWYFLNDMFA+II FYNRMRMSFR+CDNIIFSIILKA  ELREIDEGR+VHCQIVKVG PDSFVLTGLIDMY KCGQI
Subjt:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI

Query:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ
        EC+SAVFEGIIDKNVVSWT+MIAGYVQNDCA+EGLVLFNRMRE+LVESNQFTLGSIITACT+LRALHQGKWVHGYA KNV+E SSFLATAFLDMYVKCGQ
Subjt:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEAL LFTD+IR  LLPNSVTAAS+LS+CSVSGNLS+GMSVHGLGIKLGL ECAVKNALIDMYAKCHM
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        IDDAY +F GVLEKD ITWNSMI GY QNGSAY+AL+LFNQMRSDS APDAITLVSTLSASA LGAVQ+GSSLHAYSIKEGLFSSNLYIGTALLN YAKC
Subjt:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA
        GDAKSARMVFDSM DKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPN+VIFTTVLSACSYSGMVEEGW+YFKSMSQDYN+APSMKHYACMVDLL+
Subjt:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA

Query:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAG
        RSGRL+EALDFIKKMPV+PDISLYGAFLHGCGLYSRFDLGEV+VREML+LHPNEAC YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPGYS VETNAG
Subjt:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAG

Query:  VLFH
        V FH
Subjt:  VLFH

XP_038893238.1 pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Benincasa hispida]0.0e+0089.77Show/hide
Query:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV
        MLQRFF+LPR FSRLAGPLL+MGHHM+   YA    LS+L QTM S Q ISL+SCFYL+GLC+NIDTL+KFHGLLIVHGLVG+LLCDTKLVGVYGALGDV
Subjt:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV

Query:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI
        GSARMVFDQM DPDFYAWKVMIRWYFLND+F DII FYNRMRMSFR+CD IIFSIILKA SELREIDEGR+VHCQIVKVG PDSFVLTGLIDMY KCGQI
Subjt:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI

Query:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ
        E +S VFE I+DKNVVSWTSMIAGYVQN+CA+EGLVLFNRMREALVESN FTLGSIITACTKLRALHQGKWVHGYA KN++E SSFLATAFLDMYVKCGQ
Subjt:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        T+DARMIFDELPTIDLVSWTAM+VGYTQ GQPNEALRLF DEIR  LLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMY+KCHM
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        IDDAY IF GVLEKD ITWNSMI GYAQNGSAYE+L+LFN MRSD  APDAITLVSTLSASA++GAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
Subjt:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA
        GDAKSARMVFD M DKNIITWSAMIGGYGVQGDGSGSL++FSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSM QDYNF PSMKHYACMVDLLA
Subjt:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA

Query:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAG
        RSGRLDEALDFIKKMPV+PDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVR+LMLQRGLNKVPGYS VE+N G
Subjt:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAG

Query:  VLFH
        VLFH
Subjt:  VLFH

TrEMBL top hitse value%identityAlignment
A0A1S4E1F1 pentatricopeptide repeat-containing protein At2g03380, mitochondrial0.0e+0088.64Show/hide
Query:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV
        MLQRFFSLPR FS L G LL+MGH M+  TYA    LS+L QTM SVQFISL+SC YLMGL RNIDTL+KFHGLLIVHGL+G+LLCDTKLVGVYGALGDV
Subjt:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV

Query:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI
         SARMVFDQM DPDFYAWKVMIRWYFLND+F D+I FYN MRMSFR+CDNIIFSIILKA SELREIDEGR+VHCQIVKVG PDSFV+TGLIDMY KC Q+
Subjt:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI

Query:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ
        EC+SAVFE I+DKNVVSWTSMIAGYVQN+CA+EGLVLFNRMR+ALVESN FTLGSII ACTKLRALHQGKWVHGYA KN+VE SSFLAT FLDMYVKCGQ
Subjt:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        TRDARMIFDELPTIDLVSWTAMIVGYTQA QPN+ LRLF DEIR  LLPNSVTAASVLS+CSVSGNL+LGMSVHGLGIKLGLEECAVKNALIDMYAKCH 
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        I DAY IFHGVLEKD ITWNSMI GYAQNGSAY+AL+LFNQMRS S APDAITLVSTLSASA+LGAVQVGSSLHAYS+KEGLFSSNLYIGTALLNFYAKC
Subjt:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA
        GDAKSAR VFDSM DKNIITWSAMIGGYGVQGDGSGSL+IFSDMLKEDLKPNEVIFTT+LSACS SGMVEEGWRYFKSM QDYNF PSMKHYACMVDLLA
Subjt:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA

Query:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAG
        RSGRLDEALDFIKKMPV+PD+SLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDG+WGQVNEVR+LMLQRGLNKVPGYS VETNAG
Subjt:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAG

Query:  VLFH
        VLFH
Subjt:  VLFH

A0A5D3DQT2 Pentatricopeptide repeat-containing protein0.0e+0088.56Show/hide
Query:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV
        MLQRFFSLPR FS L G LL+MGH M+  TYA    LS+L QTM SVQFISL+SC YLMGL RNIDTL+KFHGLLIVHGL+G+LLCDTKLVGVYGALGDV
Subjt:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV

Query:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI
         SARMVFDQM DPDFYAWKVMIRWYFLND+F D+I FYN MRMSFR+CDNIIFSIILKA SELREIDEGR+VHCQIVKVG PDSFV+TGLIDMY KC Q+
Subjt:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI

Query:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ
        EC+SAVFE I+DKNVVSWTSMIAGYVQN+CA+EGLVLFNRMR+ALVESN FTLGSII ACTKLRALHQGKWVHGYA KN+VE SSFLAT FLDMYVKCGQ
Subjt:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        TRDARMIFDELPTIDLVSWTAMIVGYTQA QPN+ LRLF DEIR  LLPNSVTAASVLS+CSVSGNL+LGMSVHGLGIKLGLEECAVKNALIDMYAKCH 
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        I DAY IFHGVLEKD ITWNSMI GYAQNGSAY+AL+LFNQMRS S APDAITLVSTLSASA+LGAVQVGSSLHAYS+KEGLFSSNLYIGTALLNFYAKC
Subjt:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA
        GDAKSAR VFDSM DKNIITWSAMIGGYGVQGDGSGSL+IFSDMLKEDLKPNEVIFTT+LSACS SGMVEEGWRYFKSM QDYNF PSMKHYACMVDLLA
Subjt:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA

Query:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNA
        RSGRLDEALDFIKKMPV+PD+SLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDG+WGQVNEVR+LMLQRGLNKVPGYS VETNA
Subjt:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNA

A0A6J1DCB9 pentatricopeptide repeat-containing protein At2g03380, mitochondrial isoform X20.0e+0088.78Show/hide
Query:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV
        MLQRF SL R+FSRLAG L+E+G +M C TY PQ QLSELDQTM SVQFISLNS FYLMGLCRNIDTL+KFHGLLIVHGLVG+LLCDTKLVGVYGALGDV
Subjt:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV

Query:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI
         SAR+VFDQM D DFYAWKVMIRWYFLNDMFADII FYN MRMSF + DNIIFSIILKA SELREI EGR+VH QIVKVG PDSFVLTGL+DMYAKCGQI
Subjt:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI

Query:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ
        EC+ AVFEGIIDKNVVSWTSMIAGYVQNDCA+EGLVLFNRMR ALVESNQFTLGSIITACTKLRALHQGKWVHGYA KNVV+ S+FL T FLDMYVKCGQ
Subjt:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        TRDAR++FDELPTID VSWTAMIVGYTQAGQPNEALRLF  EIR  LLPNSVTAA+VLSSCSVSGNL+LGMSVHGLGIKLGLEEC VKNALIDMYAKCHM
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        ID+AY IFHGVLEKD ITWNSMI GYAQNG+A EAL LFNQMRS S APDAITLVS LSASA+LGAV VGSSLHAYSIK GLFS NLYIGT LLNFYAKC
Subjt:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA
        GDAKSARMVFDSM DKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKED+KPNEVIFTTVLSACSYSGMV EGWRYFKSMSQDYNF PSMKHYACMVDLLA
Subjt:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA

Query:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAG
        RSGRLDEALDFIKKMPV+PDISLYGAFLHGCGLYSRFDLGEVVV+EMLQLHP+EACYYVLLSNLYASDGRWGQVN+VRELM QRGLNK PGYS VET+AG
Subjt:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAG

Query:  VLFH
        +LFH
Subjt:  VLFH

A0A6J1GZK8 pentatricopeptide repeat-containing protein At2g03380, mitochondrial0.0e+0090.07Show/hide
Query:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV
        MLQRFFSLPR FSRLAGPLLEMGHH++  TYA Q  LS+LDQTM SVQFISL+SCFY MGLCRNIDTL+KFHGLLIVHGLVG+LLCDTKLVGVYGALGDV
Subjt:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV

Query:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI
        GSARMVFDQM DPDFYAWKVMIRWYFLNDMFA+II FYNRMRMSF++CDNIIFSIILKA SELREIDEGR+VHCQIVKVG PDSFVLTGLIDMY KCGQI
Subjt:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI

Query:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVV-EHSSFLATAFLDMYVKCG
        EC+SAVFEGIIDKNVVSWT+MIAGYVQNDCA+EGLVLFNRMRE+LVESNQFTLGSIITACT+LRALHQGKWVHGYA KNV+ E +SFLATAFLDMYVKCG
Subjt:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVV-EHSSFLATAFLDMYVKCG

Query:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH
        QTRDARMIFDELP+IDLVSWTAMIVGY+QAGQPNEALRLFTD+IR  LLPNSVTAAS+LS+CSVSGNLS+GM VHGLGIKLGLEECAVKNALIDMYAKCH
Subjt:  QTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCH

Query:  MIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK
        MIDDAY +F GVLEKD ITWNSMI GYAQ+GSAY+AL+LFNQMRSDS APDAITLVS LSASA LGAVQVGSSLH YSIKEGLFSSNLYIGTALLN YAK
Subjt:  MIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAK

Query:  CGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLL
        CGDAKSARMVFDSM DKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPN+VIFTTVLSACSYSGMVEEGWRYFKSMSQDYN+APSMKHYACMVDLL
Subjt:  CGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLL

Query:  ARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNA
        +RSGRL+EALDFIKKMPV+PDISLYGAFLHGCGLYSRFDLGEV+VREML+LHPNEA  YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPGYS VETNA
Subjt:  ARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNA

Query:  GVLFH
        GV FH
Subjt:  GVLFH

A0A6J1KBI2 pentatricopeptide repeat-containing protein At2g03380, mitochondrial0.0e+0090.62Show/hide
Query:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV
        MLQRFFSLPR FSRLAGPLLEMGHH++  TYA Q  LS+LDQTM SVQFISL+SCFY MGLCRNIDTL+KFHGLLIVHGLVG+LLCDTKLVGVYGALGDV
Subjt:  MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDV

Query:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI
        GSARMVFDQM DPDFYAWKVMIRWYFLNDMFA+II FYNRMRMSFR+CDNIIFSIILKA  ELREIDEGR+VHCQIVKVG PDSFVLTGLIDMY KCGQI
Subjt:  GSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQI

Query:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ
        EC+SAVFEGIIDKNVVSWT+MIAGYVQNDCA+EGLVLFNRMRE+LVESNQFTLGSIITACT+LRALHQGKWVHGYA KNV+E SSFLATAFLDMYVKCGQ
Subjt:  ECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQ

Query:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM
        TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEAL LFTD+IR  LLPNSVTAAS+LS+CSVSGNLS+GMSVHGLGIKLGL ECAVKNALIDMYAKCHM
Subjt:  TRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHM

Query:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC
        IDDAY +F GVLEKD ITWNSMI GY QNGSAY+AL+LFNQMRSDS APDAITLVSTLSASA LGAVQ+GSSLHAYSIKEGLFSSNLYIGTALLN YAKC
Subjt:  IDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKC

Query:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA
        GDAKSARMVFDSM DKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPN+VIFTTVLSACSYSGMVEEGW+YFKSMSQDYN+APSMKHYACMVDLL+
Subjt:  GDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLA

Query:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAG
        RSGRL+EALDFIKKMPV+PDISLYGAFLHGCGLYSRFDLGEV+VREML+LHPNEAC YVLLSNLYASDGRWGQVNEVR+LML+RGL KVPGYS VETNAG
Subjt:  RSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAG

Query:  VLFH
        V FH
Subjt:  VLFH

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic3.4e-12436.55Show/hide
Query:  LMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNII--FSI
        L+  C ++  L +   L+  +GL  +    TKLV ++   G V  A  VF+ +       +  M++ +         ++F+ RMR  + D + ++  F+ 
Subjt:  LMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNII--FSI

Query:  ILKASSELREIDEGRRVHCQIVKVG-APDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLG
        +LK   +  E+  G+ +H  +VK G + D F +TGL +MYAKC Q+  A  VF+ + ++++VSW +++AGY QN  A+  L +   M E  ++ +  T+ 
Subjt:  ILKASSELREIDEGRRVHCQIVKVG-APDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLG

Query:  SIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTA
        S++ A + LR +  GK +HGYA ++  +    ++TA +DMY KCG    AR +FD +   ++VSW +MI  Y Q   P EA+ +F   +   + P  V+ 
Subjt:  SIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTA

Query:  ASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAIT
           L +C+  G+L  G  +H L ++LGL+   +V N+LI MY KC  +D A ++F  +  +  ++WN+MI G+AQNG   +AL  F+QMRS +  PD  T
Subjt:  ASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAIT

Query:  LVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNE
         VS ++A A L        +H   ++  L   N+++ TAL++ YAKCG    AR++FD M ++++ TW+AMI GYG  G G  +L +F +M K  +KPN 
Subjt:  LVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNE

Query:  VIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPN
        V F +V+SACS+SG+VE G + F  M ++Y+   SM HY  MVDLL R+GRL+EA DFI +MPV+P +++YGA L  C ++   +  E     + +L+P+
Subjt:  VIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPN

Query:  EACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVE
        +  Y+VLL+N+Y +   W +V +VR  ML++GL K PG S VE
Subjt:  EACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVE

Q9C507 Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial5.3e-11733.6Show/hide
Query:  KFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEG
        K HG +I  G+  D + +T L+ +YG  G++  A  VFD M   D  AW  ++     N      +R +  M     + D +    +++  +EL  +   
Subjt:  KFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEG

Query:  RRVHCQIV-KVGAPDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQ
        R VH QI  K+   D  +   L+ MY+KCG +  +  +FE I  KN VSWT+MI+ Y + + +++ L  F+ M ++ +E N  TL S++++C  +  + +
Subjt:  RRVHCQIV-KVGAPDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQ

Query:  GKWVHGYAFKNVVE-HSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNL
        GK VHG+A +  ++ +   L+ A +++Y +CG+  D   +   +   ++V+W ++I  Y   G   +AL LF   +   + P++ T AS +S+C  +G +
Subjt:  GKWVHGYAFKNVVE-HSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNL

Query:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAV
         LG  +HG  I+  + +  V+N+LIDMY+K   +D A  +F+ +  +  +TWNSM+CG++QNG++ EA+ LF+ M       + +T ++ + A +S+G++
Subjt:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAV

Query:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSG
        + G  +H   I  GL   +L+  TAL++ YAKCGD  +A  VF +M  ++I++WS+MI  YG+ G    +++ F+ M++   KPNEV+F  VLSAC +SG
Subjt:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSG

Query:  MVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYAS
         VEEG +Y+ ++ + +  +P+ +H+AC +DLL+RSG L EA   IK+MP   D S++G+ ++GC ++ + D+ + +  ++  +  ++  YY LLSN+YA 
Subjt:  MVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYAS

Query:  DGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGV
        +G W +   +R  M    L KVPGYS +E +  V
Subjt:  DGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGV

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.4e-11234.94Show/hide
Query:  RNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSE
        + +D  ++ +G +I      D    +KL  +Y   GD+  A  VFD++       W +++     +  F+  I  + +M  S  + D+  FS + K+ S 
Subjt:  RNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSE

Query:  LREIDEGRRVHCQIVKVGAPD-SFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACT
        LR +  G ++H  I+K G  + + V   L+  Y K  +++ A  VF+ + +++V+SW S+I GYV N  A++GL +F +M  + +E +  T+ S+   C 
Subjt:  LREIDEGRRVHCQIVKVGAPD-SFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACT

Query:  KLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSC
          R +  G+ VH    K             LDMY KCG    A+ +F E+    +VS+T+MI GY + G   EA++LF +     + P+  T  +VL+ C
Subjt:  KLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSC

Query:  SVSGNLSLGMSVHGLGIK---LGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSP-APDAITLVST
        +    L  G  VH   IK   LG  +  V NAL+DMYAKC  + +A  +F  +  KD I+WN++I GY++N  A EAL LFN +  +   +PD  T+   
Subjt:  SVSGNLSLGMSVHGLGIK---LGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSP-APDAITLVST

Query:  LSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFT
        L A ASL A   G  +H Y ++ G FS   ++  +L++ YAKCG    A M+FD +  K++++W+ MI GYG+ G G  ++A+F+ M +  ++ +E+ F 
Subjt:  LSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFT

Query:  TVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACY
        ++L ACS+SG+V+EGWR+F  M  +    P+++HYAC+VD+LAR+G L +A  FI+ MP+ PD +++GA L GC ++    L E V  ++ +L P    Y
Subjt:  TVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACY

Query:  YVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGV
        YVL++N+YA   +W QV  +R+ + QRGL K PG S +E    V
Subjt:  YVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGV

Q9STE1 Pentatricopeptide repeat-containing protein At4g213002.6e-11133.91Show/hide
Query:  LVKFHGLLIVHGLVGDLLCD------TKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASS
        L  F G+  +   V  L  D      + L+  Y   G +     +FD++L  D   W VM+  Y        +I+ ++ MRM     + + F  +L   +
Subjt:  LVKFHGLLIVHGLVGDLLCD------TKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASS

Query:  ELREIDEGRRVHCQIVKVGAP-DSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITAC
            ID G ++H  +V  G   +  +   L+ MY+KCG+ + AS +F  +   + V+W  MI+GYVQ+   +E L  F  M  + V  +  T  S++ + 
Subjt:  ELREIDEGRRVHCQIVKVGAP-DSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITAC

Query:  TKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSS
        +K   L   K +H Y  ++ +    FL +A +D Y KC     A+ IF +  ++D+V +TAMI GY   G   ++L +F   ++  + PN +T  S+L  
Subjt:  TKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSS

Query:  CSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLS
          +   L LG  +HG  IK G +  C +  A+IDMYAKC  ++ AY IF  + ++D ++WNSMI   AQ+ +   A+ +F QM       D +++ + LS
Subjt:  CSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLS

Query:  ASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDML-KEDLKPNEVIFTT
        A A+L +   G ++H + IK  L +S++Y  + L++ YAKCG+ K+A  VF +MK+KNI++W+++I   G  G    SL +F +M+ K  ++P+++ F  
Subjt:  ASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDML-KEDLKPNEVIFTT

Query:  VLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYY
        ++S+C + G V+EG R+F+SM++DY   P  +HYAC+VDL  R+GRL EA + +K MP  PD  ++G  L  C L+   +L EV   +++ L P+ + YY
Subjt:  VLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYY

Query:  VLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETN
        VL+SN +A+   W  V +VR LM +R + K+PGYS +E N
Subjt:  VLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETN

Q9ZQ74 Pentatricopeptide repeat-containing protein At2g03380, mitochondrial3.9e-22157.19Show/hide
Query:  TSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYN-RMRM
        +S+ + + + CF L+  C NID+L + HG+L  +GL+GD+   TKLV +YG  G    AR+VFDQ+ +PDFY WKVM+R Y LN    ++++ Y+  M+ 
Subjt:  TSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYN-RMRM

Query:  SFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMRE
         FR  D+I+FS  LKA +EL+++D G+++HCQ+VKV + D+ VLTGL+DMYAKCG+I+ A  VF  I  +NVV WTSMIAGYV+ND  +EGLVLFNRMRE
Subjt:  SFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMRE

Query:  ALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEI
          V  N++T G++I ACTKL ALHQGKW HG   K+ +E SS L T+ LDMYVKCG   +AR +F+E   +DLV WTAMIVGYT  G  NEAL LF    
Subjt:  ALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEI

Query:  RFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMR
           + PN VT ASVLS C +  NL LG SVHGL IK+G+ +  V NAL+ MYAKC+   DA  +F    EKD + WNS+I G++QNGS +EAL LF++M 
Subjt:  RFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMR

Query:  SDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFS
        S+S  P+ +T+ S  SA ASLG++ VGSSLHAYS+K G L SS++++GTALL+FYAKCGD +SAR++FD++++KN ITWSAMIGGYG QGD  GSL +F 
Subjt:  SDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFS

Query:  DMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEV
        +MLK+  KPNE  FT++LSAC ++GMV EG +YF SM +DYNF PS KHY CMVD+LAR+G L++ALD I+KMP++PD+  +GAFLHGCG++SRFDLGE+
Subjt:  DMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEV

Query:  VVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVE
        V+++ML LHP++A YYVL+SNLYASDGRW Q  EVR LM QRGL+K+ G+S +E
Subjt:  VVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVE

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-12536.55Show/hide
Query:  LMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNII--FSI
        L+  C ++  L +   L+  +GL  +    TKLV ++   G V  A  VF+ +       +  M++ +         ++F+ RMR  + D + ++  F+ 
Subjt:  LMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNII--FSI

Query:  ILKASSELREIDEGRRVHCQIVKVG-APDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLG
        +LK   +  E+  G+ +H  +VK G + D F +TGL +MYAKC Q+  A  VF+ + ++++VSW +++AGY QN  A+  L +   M E  ++ +  T+ 
Subjt:  ILKASSELREIDEGRRVHCQIVKVG-APDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLG

Query:  SIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTA
        S++ A + LR +  GK +HGYA ++  +    ++TA +DMY KCG    AR +FD +   ++VSW +MI  Y Q   P EA+ +F   +   + P  V+ 
Subjt:  SIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTA

Query:  ASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAIT
           L +C+  G+L  G  +H L ++LGL+   +V N+LI MY KC  +D A ++F  +  +  ++WN+MI G+AQNG   +AL  F+QMRS +  PD  T
Subjt:  ASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAIT

Query:  LVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNE
         VS ++A A L        +H   ++  L   N+++ TAL++ YAKCG    AR++FD M ++++ TW+AMI GYG  G G  +L +F +M K  +KPN 
Subjt:  LVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNE

Query:  VIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPN
        V F +V+SACS+SG+VE G + F  M ++Y+   SM HY  MVDLL R+GRL+EA DFI +MPV+P +++YGA L  C ++   +  E     + +L+P+
Subjt:  VIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPN

Query:  EACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVE
        +  Y+VLL+N+Y +   W +V +VR  ML++GL K PG S VE
Subjt:  EACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVE

AT1G69350.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-11833.6Show/hide
Query:  KFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEG
        K HG +I  G+  D + +T L+ +YG  G++  A  VFD M   D  AW  ++     N      +R +  M     + D +    +++  +EL  +   
Subjt:  KFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEG

Query:  RRVHCQIV-KVGAPDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQ
        R VH QI  K+   D  +   L+ MY+KCG +  +  +FE I  KN VSWT+MI+ Y + + +++ L  F+ M ++ +E N  TL S++++C  +  + +
Subjt:  RRVHCQIV-KVGAPDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQ

Query:  GKWVHGYAFKNVVE-HSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNL
        GK VHG+A +  ++ +   L+ A +++Y +CG+  D   +   +   ++V+W ++I  Y   G   +AL LF   +   + P++ T AS +S+C  +G +
Subjt:  GKWVHGYAFKNVVE-HSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNL

Query:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAV
         LG  +HG  I+  + +  V+N+LIDMY+K   +D A  +F+ +  +  +TWNSM+CG++QNG++ EA+ LF+ M       + +T ++ + A +S+G++
Subjt:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASASLGAV

Query:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSG
        + G  +H   I  GL   +L+  TAL++ YAKCGD  +A  VF +M  ++I++WS+MI  YG+ G    +++ F+ M++   KPNEV+F  VLSAC +SG
Subjt:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSACSYSG

Query:  MVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYAS
         VEEG +Y+ ++ + +  +P+ +H+AC +DLL+RSG L EA   IK+MP   D S++G+ ++GC ++ + D+ + +  ++  +  ++  YY LLSN+YA 
Subjt:  MVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLSNLYAS

Query:  DGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGV
        +G W +   +R  M    L KVPGYS +E +  V
Subjt:  DGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGV

AT2G03380.1 Pentatricopeptide repeat (PPR) superfamily protein2.8e-22257.19Show/hide
Query:  TSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYN-RMRM
        +S+ + + + CF L+  C NID+L + HG+L  +GL+GD+   TKLV +YG  G    AR+VFDQ+ +PDFY WKVM+R Y LN    ++++ Y+  M+ 
Subjt:  TSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYN-RMRM

Query:  SFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMRE
         FR  D+I+FS  LKA +EL+++D G+++HCQ+VKV + D+ VLTGL+DMYAKCG+I+ A  VF  I  +NVV WTSMIAGYV+ND  +EGLVLFNRMRE
Subjt:  SFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMRE

Query:  ALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEI
          V  N++T G++I ACTKL ALHQGKW HG   K+ +E SS L T+ LDMYVKCG   +AR +F+E   +DLV WTAMIVGYT  G  NEAL LF    
Subjt:  ALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEI

Query:  RFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMR
           + PN VT ASVLS C +  NL LG SVHGL IK+G+ +  V NAL+ MYAKC+   DA  +F    EKD + WNS+I G++QNGS +EAL LF++M 
Subjt:  RFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMR

Query:  SDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFS
        S+S  P+ +T+ S  SA ASLG++ VGSSLHAYS+K G L SS++++GTALL+FYAKCGD +SAR++FD++++KN ITWSAMIGGYG QGD  GSL +F 
Subjt:  SDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFS

Query:  DMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEV
        +MLK+  KPNE  FT++LSAC ++GMV EG +YF SM +DYNF PS KHY CMVD+LAR+G L++ALD I+KMP++PD+  +GAFLHGCG++SRFDLGE+
Subjt:  DMLKEDLKPNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEV

Query:  VVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVE
        V+++ML LHP++A YYVL+SNLYASDGRW Q  EVR LM QRGL+K+ G+S +E
Subjt:  VVREMLQLHPNEACYYVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVE

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein9.6e-11434.94Show/hide
Query:  RNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSE
        + +D  ++ +G +I      D    +KL  +Y   GD+  A  VFD++       W +++     +  F+  I  + +M  S  + D+  FS + K+ S 
Subjt:  RNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSE

Query:  LREIDEGRRVHCQIVKVGAPD-SFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACT
        LR +  G ++H  I+K G  + + V   L+  Y K  +++ A  VF+ + +++V+SW S+I GYV N  A++GL +F +M  + +E +  T+ S+   C 
Subjt:  LREIDEGRRVHCQIVKVGAPD-SFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACT

Query:  KLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSC
          R +  G+ VH    K             LDMY KCG    A+ +F E+    +VS+T+MI GY + G   EA++LF +     + P+  T  +VL+ C
Subjt:  KLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFTDEIRFHLLPNSVTAASVLSSC

Query:  SVSGNLSLGMSVHGLGIK---LGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSP-APDAITLVST
        +    L  G  VH   IK   LG  +  V NAL+DMYAKC  + +A  +F  +  KD I+WN++I GY++N  A EAL LFN +  +   +PD  T+   
Subjt:  SVSGNLSLGMSVHGLGIK---LGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSP-APDAITLVST

Query:  LSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFT
        L A ASL A   G  +H Y ++ G FS   ++  +L++ YAKCG    A M+FD +  K++++W+ MI GYG+ G G  ++A+F+ M +  ++ +E+ F 
Subjt:  LSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFT

Query:  TVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACY
        ++L ACS+SG+V+EGWR+F  M  +    P+++HYAC+VD+LAR+G L +A  FI+ MP+ PD +++GA L GC ++    L E V  ++ +L P    Y
Subjt:  TVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACY

Query:  YVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGV
        YVL++N+YA   +W QV  +R+ + QRGL K PG S +E    V
Subjt:  YVLLSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGV

AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-11234.43Show/hide
Query:  KFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYF-LNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDE
        +    L+  G   D+   T L+  Y   G++  AR+VFD + +     W  MI     +   +  +  FY  M  +    D  I S +L A S L  ++ 
Subjt:  KFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQMLDPDFYAWKVMIRWYF-LNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDE

Query:  GRRVHCQIVKVGAP-DSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALH
        G+++H  I++ G   D+ ++  LID Y KCG++  A  +F G+ +KN++SWT++++GY QN   KE + LF  M +  ++ + +   SI+T+C  L AL 
Subjt:  GRRVHCQIVKVGAP-DSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTSMIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALH

Query:  QGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAG---QPNEALRLFTDEIRFHLL-PNSVTAASVLSSCSV
         G  VH Y  K  + + S++  + +DMY KC    DAR +FD     D+V + AMI GY++ G   + +EAL +F D +RF L+ P+ +T  S+L + + 
Subjt:  QGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAG---QPNEALRLFTDEIRFHLL-PNSVTAASVLSSCSV

Query:  SGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASA
          +L L   +HGL  K GL  +    +ALID+Y+ C+ + D+  +F  +  KD + WNSM  GY Q     EAL LF +++     PD  T  + ++A+ 
Subjt:  SGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFNQMRSDSPAPDAITLVSTLSASA

Query:  SLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSA
        +L +VQ+G   H   +K GL   N YI  ALL+ YAKCG  + A   FDS   ++++ W+++I  Y   G+G  +L +   M+ E ++PN + F  VLSA
Subjt:  SLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFTTVLSA

Query:  CSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLS
        CS++G+VE+G + F+ M + +   P  +HY CMV LL R+GRL++A + I+KMP +P   ++ + L GC      +L E      +   P ++  + +LS
Subjt:  CSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVLLS

Query:  NLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGV
        N+YAS G W +  +VRE M   G+ K PG S +  N  V
Subjt:  NLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCAACGCTTCTTTAGCCTTCCTCGAACCTTCTCTCGCCTCGCAGGTCCTTTACTTGAAATGGGCCACCACATGACTTGTCCGACATATGCTCCACAGCACCAACT
CTCGGAGCTCGATCAAACCATGACTTCGGTACAATTTATTTCGTTAAACTCGTGCTTTTATCTCATGGGACTTTGCAGGAACATTGATACCCTCGTCAAATTCCATGGAT
TGCTCATAGTACACGGCCTTGTTGGTGATCTTCTTTGTGATACGAAGTTAGTCGGTGTTTATGGCGCACTTGGGGACGTGGGGTCTGCGCGCATGGTGTTCGATCAAATG
CTCGACCCAGATTTTTATGCGTGGAAGGTGATGATTAGGTGGTATTTTTTGAATGACATGTTTGCGGATATTATTCGGTTCTATAATCGCATGAGAATGTCTTTTAGGGA
CTGTGATAACATTATTTTTTCGATTATCTTGAAAGCAAGTAGTGAATTGCGTGAAATTGATGAAGGGAGAAGGGTCCATTGCCAGATTGTGAAGGTGGGGGCCCCGGATA
GTTTCGTATTGACTGGTTTGATTGATATGTATGCGAAATGTGGGCAGATTGAGTGCGCAAGCGCTGTGTTTGAAGGAATTATTGACAAGAATGTGGTTTCTTGGACTTCA
ATGATTGCGGGATATGTACAAAATGATTGTGCGAAGGAGGGTCTGGTTTTGTTCAATCGGATGAGAGAAGCATTGGTTGAAAGCAACCAATTTACTTTAGGGAGCATAAT
CACTGCGTGTACAAAATTAAGAGCCTTGCATCAGGGGAAATGGGTGCATGGCTATGCCTTTAAGAACGTTGTTGAACATAGCTCTTTCTTAGCAACGGCTTTTTTAGACA
TGTATGTTAAATGTGGGCAAACTAGAGATGCTCGCATGATATTTGATGAGCTACCTACTATTGATCTCGTTTCATGGACTGCAATGATTGTTGGATATACCCAAGCTGGC
CAACCGAATGAGGCATTGAGGCTTTTCACAGATGAAATAAGGTTTCATCTCTTACCTAATTCTGTCACTGCTGCGAGTGTTCTTTCATCGTGTTCGGTTTCTGGTAATTT
AAGTTTAGGAATGTCAGTTCATGGACTTGGGATTAAACTTGGGCTGGAAGAGTGTGCAGTGAAGAATGCTCTTATTGACATGTATGCTAAATGCCATATGATTGATGATG
CTTATGCTATATTTCACGGGGTTTTGGAAAAAGATGCAATTACTTGGAACTCAATGATATGTGGGTATGCTCAGAATGGGTCTGCATATGAAGCCCTCCAACTCTTTAAT
CAAATGAGATCAGACTCCCCTGCACCTGATGCAATAACACTGGTGAGCACCCTTTCAGCATCTGCATCCCTAGGTGCTGTGCAGGTCGGTTCATCACTTCATGCTTACTC
GATAAAAGAAGGCTTATTTTCATCTAATCTTTACATTGGCACTGCACTTTTGAACTTTTATGCCAAATGTGGCGATGCTAAATCAGCACGGATGGTGTTTGACAGTATGA
AAGATAAGAACATTATCACGTGGAGTGCAATGATAGGTGGTTATGGAGTGCAGGGTGATGGAAGTGGATCCCTCGCCATTTTCTCTGATATGTTGAAGGAGGATTTGAAA
CCCAATGAAGTAATTTTCACAACGGTATTATCTGCTTGTAGCTATTCAGGAATGGTTGAAGAGGGATGGAGATATTTCAAATCTATGAGTCAGGATTATAACTTTGCGCC
TTCCATGAAACACTATGCCTGCATGGTTGATCTTTTAGCTCGTTCGGGTAGACTGGATGAAGCATTGGACTTTATTAAGAAAATGCCAGTTCGACCAGACATTAGTTTGT
ATGGAGCTTTTCTTCATGGATGTGGATTATACTCAAGGTTTGATCTTGGAGAAGTGGTAGTGAGGGAAATGTTACAGCTTCATCCAAACGAAGCTTGCTATTATGTGCTT
TTGTCTAACCTGTATGCTTCAGATGGGAGATGGGGTCAAGTTAATGAGGTGAGAGAGTTGATGCTACAGAGAGGATTGAACAAAGTCCCTGGGTATAGCCAAGTAGAAAC
TAATGCAGGTGTGCTGTTTCATTAG
mRNA sequenceShow/hide mRNA sequence
TGTTTACAAAAATTAGACGTTCATTGCTCCCGCTGCGTTTTAAAAACTAGCATCCGATATCACACCATCCGTAAAGAATGGCGGCATGACGGTTGAACATTTCCCAACTC
GAAATGTTGCAACGCTTCTTTAGCCTTCCTCGAACCTTCTCTCGCCTCGCAGGTCCTTTACTTGAAATGGGCCACCACATGACTTGTCCGACATATGCTCCACAGCACCA
ACTCTCGGAGCTCGATCAAACCATGACTTCGGTACAATTTATTTCGTTAAACTCGTGCTTTTATCTCATGGGACTTTGCAGGAACATTGATACCCTCGTCAAATTCCATG
GATTGCTCATAGTACACGGCCTTGTTGGTGATCTTCTTTGTGATACGAAGTTAGTCGGTGTTTATGGCGCACTTGGGGACGTGGGGTCTGCGCGCATGGTGTTCGATCAA
ATGCTCGACCCAGATTTTTATGCGTGGAAGGTGATGATTAGGTGGTATTTTTTGAATGACATGTTTGCGGATATTATTCGGTTCTATAATCGCATGAGAATGTCTTTTAG
GGACTGTGATAACATTATTTTTTCGATTATCTTGAAAGCAAGTAGTGAATTGCGTGAAATTGATGAAGGGAGAAGGGTCCATTGCCAGATTGTGAAGGTGGGGGCCCCGG
ATAGTTTCGTATTGACTGGTTTGATTGATATGTATGCGAAATGTGGGCAGATTGAGTGCGCAAGCGCTGTGTTTGAAGGAATTATTGACAAGAATGTGGTTTCTTGGACT
TCAATGATTGCGGGATATGTACAAAATGATTGTGCGAAGGAGGGTCTGGTTTTGTTCAATCGGATGAGAGAAGCATTGGTTGAAAGCAACCAATTTACTTTAGGGAGCAT
AATCACTGCGTGTACAAAATTAAGAGCCTTGCATCAGGGGAAATGGGTGCATGGCTATGCCTTTAAGAACGTTGTTGAACATAGCTCTTTCTTAGCAACGGCTTTTTTAG
ACATGTATGTTAAATGTGGGCAAACTAGAGATGCTCGCATGATATTTGATGAGCTACCTACTATTGATCTCGTTTCATGGACTGCAATGATTGTTGGATATACCCAAGCT
GGCCAACCGAATGAGGCATTGAGGCTTTTCACAGATGAAATAAGGTTTCATCTCTTACCTAATTCTGTCACTGCTGCGAGTGTTCTTTCATCGTGTTCGGTTTCTGGTAA
TTTAAGTTTAGGAATGTCAGTTCATGGACTTGGGATTAAACTTGGGCTGGAAGAGTGTGCAGTGAAGAATGCTCTTATTGACATGTATGCTAAATGCCATATGATTGATG
ATGCTTATGCTATATTTCACGGGGTTTTGGAAAAAGATGCAATTACTTGGAACTCAATGATATGTGGGTATGCTCAGAATGGGTCTGCATATGAAGCCCTCCAACTCTTT
AATCAAATGAGATCAGACTCCCCTGCACCTGATGCAATAACACTGGTGAGCACCCTTTCAGCATCTGCATCCCTAGGTGCTGTGCAGGTCGGTTCATCACTTCATGCTTA
CTCGATAAAAGAAGGCTTATTTTCATCTAATCTTTACATTGGCACTGCACTTTTGAACTTTTATGCCAAATGTGGCGATGCTAAATCAGCACGGATGGTGTTTGACAGTA
TGAAAGATAAGAACATTATCACGTGGAGTGCAATGATAGGTGGTTATGGAGTGCAGGGTGATGGAAGTGGATCCCTCGCCATTTTCTCTGATATGTTGAAGGAGGATTTG
AAACCCAATGAAGTAATTTTCACAACGGTATTATCTGCTTGTAGCTATTCAGGAATGGTTGAAGAGGGATGGAGATATTTCAAATCTATGAGTCAGGATTATAACTTTGC
GCCTTCCATGAAACACTATGCCTGCATGGTTGATCTTTTAGCTCGTTCGGGTAGACTGGATGAAGCATTGGACTTTATTAAGAAAATGCCAGTTCGACCAGACATTAGTT
TGTATGGAGCTTTTCTTCATGGATGTGGATTATACTCAAGGTTTGATCTTGGAGAAGTGGTAGTGAGGGAAATGTTACAGCTTCATCCAAACGAAGCTTGCTATTATGTG
CTTTTGTCTAACCTGTATGCTTCAGATGGGAGATGGGGTCAAGTTAATGAGGTGAGAGAGTTGATGCTACAGAGAGGATTGAACAAAGTCCCTGGGTATAGCCAAGTAGA
AACTAATGCAGGTGTGCTGTTTCATTAGCTAAAGTGACTAGTGTGGTTCATATCTTCTTGTTTTTGCTGGGATTATCAAGAGACCTTGCTCAGGAAAGCTCTAATGTACT
GATCTATTTATGCTTCATTCACCACTGACTAACCTACTCAGCTGGAATCCTCGTACTAGCAAACTCACTTTCTTATGAGCAGTTCATCAATAGAAAAGACCAGTGGTCTG
CCTAAAACATTCACACCTTCTGTGGTGTGCGACTAAGGTCTTGCTGCTTAGCTGTCTTCGTTAGAACACGTAAACTGAAATGGGGCCTTGTCATATCCATGCCCTCTAAT
AAAATTGTTGAGGTGGCTTTGCAATGTTTTGTGGTGAATAGACGTCTCAAAGGGCAAGGAGTTACGAACTCGATGGTTTATATCTGTGAGTGTTCGGACTAGCTTGCGTG
CACGTTAGCTATTCTCACGGGACAACAACTTGACCCTTCTACATTTGGTTGCCAAGGTAACTCATAGGATATTAAATTCTTGGTAGGTGACCACCATTATTTGAATCCAT
TATCTCTAAGCCCTTTATTATTTACCTGACCCTTTTTTGACTACTAGGTCGGTCAACCAAGGGTGGTTATAGAATTAAATTTGTTGAGAACCTTTTCTGCAACCACAACT
TACACTTAAGCTGATTATATACAGCTTGTCGAGGACTTGAGCTAAAAGAATTAATAAATTACATCAAGCTAATTAGAGCATCATCCTCCAAGACTATGGTAAACTTTTTG
TTTGGCCAGAGACAGCGACATTGCCAATTCTAGAAGTGAGCTTCCATTTTCATAATTTTCAGCTTTCTAGAATATTTAGCTTCTTCTGATGTGCTGGATAAAATTTCTAA
GCCTATACTAAGTTTGGTTTCTTGGCCAATTTTCAGTAGAAAGATCTTTTTTCCTGCTGACTTCCAGTTGGATATTTCTTGTTCTAATTGTGGTAGCATTGAAGTGCCTG
TTTGCATAAATATTTCTCTGAATTTTCAATATGGAGATTACATAAAATTCATCATTATAGCTTAGGAATTAGTATTTGCTTAACAAACGTGGCGCATCTTGGTGCATTCC
TGGCATTCTATTGAATGAATTCGACTGTCTTGAAATCACCAACTTATTGTGCGTCTACTTATCTGGGATTGTTGTCTGTTAAAAAGGAATAGGATCAAGTTTTCCTTTGC
CAATCTTGATGTGTTTGATGGGAGAGTGGCGTTTTCCTTGTATTGCTTGTTATAAGCTCAAGAGAAGGATGTTTTATGGAAGAGGGTCATTGCTAGTATTTTTGGTCTGG
GACCAAATGATTGAGTCACTCCTTATCCTAAAGGCAGAGCAATGGGCAGACTGTGGTTTAATATAGCCAAGAACATTGCTGTTAAGTTTACGAAGTTTAGAAAAGGGAAT
GGAAGAAGAATCAGATTTTGGAAAGAGAAATGGGCAGGGAACATTGCTGTTTTATTGGAGGGCCGTATTTTTGGGGCATTTGGTTGAAGGGCAATAATAGGATCTTCAAA
GAGGTCTTGAGAGGAGGTTTGGTCCATAGCTAGGTTTAATGGTTGTTTTGGACGTATGTGTCTAAGGATTTCTGAGATTATTCGTTAGGTTTTATTCTGTTGGATTGAAG
TCCATTCTTGGTTTAGATTGACTCGTTTGGCAGCTGGCTTTTTTGTATGGCTGGCTGCCGGTCTCTGACATAGAAAATAAAAAGTAAGCAAACATTAATGGCAACTTCTC
TTCTATTGCTGAAAATGGTCGAACTAAGGAAGATATGAAATCTATTGATCCTTGTGGATTCCAAGGTCACATGAATGATGAGTCGTTGGTAAATCATCATGAAGCCACTG
GAAGCAAGAAGTCTCGTGACAGTGAAATTGGTGAAAATTTGCATGTAGCGTGTCAAGAGGGTAATTTGTATGTTTCAGAATGTCCTCCCAGCTGCAGCTCTAAAGGTAGA
GATCTTTTTCATGAAACGATGCATAATAAAAAAAGGACTTGGATGAATGCCCGTTACATCTTCAAATAAATTCATGGAGAGTGGACCACAGAACTCCTAAAGATTATATG
GAAAGCAATGGCGATGAGCAAACTTGTTCGAGTGGATTCTTTTCTCAACTCTCATATGCTCAGAATGCAAATGGTTCAAGTGTAAAAAGTGCAAAACTCTCTTGTGACGG
TAACGATGCCAACTCTAAAAGATTATTTGGGCAAAGAAGTGAGTTCTTATCTTTAATCTATATAACTTCAGTATATTACTGTCTTAATACTTTGTTAGTCTTTCCTTGAC
ATGATAGATTAAATTCACAATTCTATATTACCCTTGGCTCTGCAATATAGCAAACTGGTTTCGAGTTAACTTCATATGGAAACTATCTTGTTTTAATTGGCGGCATTAGA
ACACCTTTTTGCAGGTTGCGAGAGTTTTCATGAATTAAAATACGTACTACTCTGAACAACAATAGTATGAAAACTTATTTTACAAAGATATTCTCAGGACGGAGAATATA
AATTGTTCATGCTCAATATGTGAATCTGGCAAGTTTGAAGCGAATGTAATGACGATACATCTGGCTAGTGTTCGTGGATGGAGGGGTGCATCCTGTGGGAATTTTCTTTC
TTTGCTGTTATTTGAGAGTTATGATCGGAGAGAAATAGGAGAGTTTTCGAGGGGCTCGAGAGATCTTGGGAGGATGTGCGGTCCTTAACACCTCCTTTTGGGCATTAGTG
GCTAAAGCTTTTTATAATTATCCTCCAGAATTGGTCCTCTTAGATGGGACCCTCTTTTTTGTTAGCTTTTCTAGGTTTAGGGCATTTCGATTCGTTGGGTTAGGTTTGGG
TTGGGTTTTTGTATTCTTTTATTTATTCTTAATGAAAGCATGTTTTCTCATTAAAGAAAAAAAAAAAGTTAAATATGGCTACATGTCAATCATGGCAAGCTTGACTGCTT
GAGAAGTGCTGGTATACATTTTATATTGGTTTGTGAACCTGATAACTTGTTGCTGTTAATTATTCGTTTCATTCAATTTTGATATAATATTAATATACATCATAATTAGT
TTACTCATATTACGGATTTCAATCCATTAGATTTAAGCTCTTCAATCTAGATGCACCC
Protein sequenceShow/hide protein sequence
MLQRFFSLPRTFSRLAGPLLEMGHHMTCPTYAPQHQLSELDQTMTSVQFISLNSCFYLMGLCRNIDTLVKFHGLLIVHGLVGDLLCDTKLVGVYGALGDVGSARMVFDQM
LDPDFYAWKVMIRWYFLNDMFADIIRFYNRMRMSFRDCDNIIFSIILKASSELREIDEGRRVHCQIVKVGAPDSFVLTGLIDMYAKCGQIECASAVFEGIIDKNVVSWTS
MIAGYVQNDCAKEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAFKNVVEHSSFLATAFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQAG
QPNEALRLFTDEIRFHLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHMIDDAYAIFHGVLEKDAITWNSMICGYAQNGSAYEALQLFN
QMRSDSPAPDAITLVSTLSASASLGAVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMKDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLK
PNEVIFTTVLSACSYSGMVEEGWRYFKSMSQDYNFAPSMKHYACMVDLLARSGRLDEALDFIKKMPVRPDISLYGAFLHGCGLYSRFDLGEVVVREMLQLHPNEACYYVL
LSNLYASDGRWGQVNEVRELMLQRGLNKVPGYSQVETNAGVLFH