; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001608 (gene) of Snake gourd v1 genome

Gene IDTan0001608
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG11:8032878..8036156
RNA-Seq ExpressionTan0001608
SyntenyTan0001608
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578825.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]8.4e-23572.41Show/hide
Query:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------
        MGFLHVCHFA   G+     +FQFL+LR+C S+F ASSS I+ECEK TTEDF  TH                                            
Subjt:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------

Query:  ------------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAA
                                                              AAM+LNVFVATALLDVYAKCGLMNDAA VFESM ERSVVTWSSMAA
Subjt:  ------------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAA

Query:  GYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMI
        GYVQN +YEEALALF KA ETGLK D+FLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGI EAY VFRDVE RNVVLWNAMI
Subjt:  GYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMI

Query:  CGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWG
         GLSRHARSLEVMILFEKMQQ+GL+PNDVTFVSVLS CGHMGLVEKGQKYFDLMIK++HL PNV HYSCMVD LSRAG T +AY+LI KMPF ASASMWG
Subjt:  CGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWG

Query:  SLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELM
        SLLASCR+HGNLELAEVAAKNLF+IEP NAGNYLLLSNMYAANGKW+EVAKARKLLKESDVKKERGKSWIEIK++VHSFMVGERNHPKI+EIYSKLNEL+
Subjt:  SLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELM

Query:  EELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        EEL KLGY+VET+HDLHQV ESRKQELLRHHSEKLAFT GLLFLPPNAP+RIMKNLRICGDCHSFMKLAS  V+RDV+VRDTNRFHHF +GHCSCGDF
Subjt:  EELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

XP_004140992.1 pentatricopeptide repeat-containing protein At5g04780, mitochondrial [Cucumis sativus]3.1e-23769.39Show/hide
Query:  MGFLHVCHFAAKLGRYKEKG----LFQFLNLRLCTSQF----SASSSIVECEKATTEDFNGTH-------------------------------------
        MGFLHVCHFA+  GRY+EKG    +FQFL+LR+CT+QF    S+SS IVECEK TT+DFN TH                                     
Subjt:  MGFLHVCHFAAKLGRYKEKG----LFQFLNLRLCTSQF----SASSSIVECEKATTEDFNGTH-------------------------------------

Query:  ---------------------------------------------------------------------------------------------AAMDLNV
                                                                                                     AAMDLNV
Subjt:  ---------------------------------------------------------------------------------------------AAMDLNV

Query:  FVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSN
        FVATALLDVYAKCGLM DA CVFESMP+RSVVTWSSMAAGYVQNE+YE+ALALF KA ETGLK D+FLMSSVICACAGLAAMIEG Q+NALLSKSGFCSN
Subjt:  FVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSN

Query:  IFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLA
        IFVASSLIDMYAKCGGI E+Y VFRDVE RNVVLWNAMI GLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLS CGHMGLV KGQKYFDLM KEHHLA
Subjt:  IFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLA

Query:  PNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDV
        PNV HYSCMVD LSRAGQ FEAY+LISK+PFNASASMWGSLLASCR+HGNLELAEVAAK LF+IEP N+GNYLLLSNMYAANGKW+EVAK RKLLKESDV
Subjt:  PNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDV

Query:  KKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGD
        KKERGKSWIEIKDKVH FMVGERNHPKI EIYSKLNE+M+EL KLGYKVET+HDLHQVGES KQELLRHHSEKLAFT GLLFLPPNAPIRIMKNLRICGD
Subjt:  KKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGD

Query:  CHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        CHSFMKLAS F  RDVIVRDTNRFHHFK+G CSCGDF
Subjt:  CHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

XP_008456610.1 PREDICTED: pentatricopeptide repeat-containing protein At5g04780 [Cucumis melo]4.5e-23669.11Show/hide
Query:  MGFLHVCHFAAKLGRYKE--------KGLFQFLNLRLCTSQFSASSS----IVECEKATTEDFNGTH---------------------------------
        MGFLHVCHFA+  GRY+E        KG+FQFL+LRLCT+QF ASSS    IVECEK T++DFN TH                                 
Subjt:  MGFLHVCHFAAKLGRYKE--------KGLFQFLNLRLCTSQFSASSS----IVECEKATTEDFNGTH---------------------------------

Query:  -------------------------------------------------------------------------------------------------AAM
                                                                                                         AAM
Subjt:  -------------------------------------------------------------------------------------------------AAM

Query:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG
        DLNVFVATALLDVYAKCGLM DA  VFESMP+RSVVTWSSMAAGYVQNE+YEEALALF KA ETGLK D+FLMSSVICACAGLAAMIEG QVNALLSKSG
Subjt:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG

Query:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE
        FCSNIFVASSLIDMYAKCGGI E+Y VF+DVE RNVVLWNAMI GLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLS CGHMGLV+KGQKYFDLMIKE
Subjt:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE

Query:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK
        HHLAPNV+HYSCMVD LSRAGQTFEAY+LISKMPFNASASMWGSLLASCR+HGNLELAE AAK LF+IEP N+GNYLLLSNMYAANGKW+EVAK RKLLK
Subjt:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK

Query:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR
        ESDVKKERGKSWIEIKDKVH FMVGERNHPKI EIYSKLNE+M+EL KLGYK ET+HDLHQVGES KQELLRHHSEKLAF  GLLFLPP+APIRIMKNLR
Subjt:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR

Query:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        ICGDCHSFMKLAS FV RDVIVRDTNRFHHFK+G CSCGDF
Subjt:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

XP_022992701.1 pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X2 [Cucurbita maxima]1.4e-23773.08Show/hide
Query:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------
        MGFLHVCHFA   G+     +FQFL+LR+C S+F ASSS I+ECEK TTEDF  TH                                            
Subjt:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------

Query:  ------------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAA
                                                              AAM+LNVFVATALLDVYAKCGLMNDAA VFESM ERSVVTWSSMAA
Subjt:  ------------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAA

Query:  GYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMI
        GYVQN +YEEALALF KA ETGLK D+FLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGI EAY VFRDVE RNVVLWNAMI
Subjt:  GYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMI

Query:  CGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWG
         GLSRHARSLEVMILFEKMQQ+GL+PNDVTFVSVLS CGHMGLVEKGQKYFDLMIKE+HLAPNV HYSCMVD LSRAG+T +AY+LI KMPF ASASMWG
Subjt:  CGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWG

Query:  SLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELM
        SLLASCR+HGNLELAEVAAKNLF+IEPQNAGNYLLLSNMYAANGKW+EVAKARKLLKESDVKKERGKSWIEIK++VHSFMVGERNHPKI+EIYSKLNEL+
Subjt:  SLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELM

Query:  EELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        EEL KLGY+VET+HDLHQVGESRKQELLRHHSEKLAFT GLLFLPPNAP+RIMKNLRICGDCHSFMKLAS  V+RDV+VRDTNRFHHF +GHCSCGDF
Subjt:  EELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

XP_022992702.1 pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X3 [Cucurbita maxima]1.1e-23773.2Show/hide
Query:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------
        MGFLHVCHFA   G+     +FQFL+LR+C S+F ASSS I+ECEK TTEDF  TH                                            
Subjt:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------

Query:  -----------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAG
                                                             AAM+LNVFVATALLDVYAKCGLMNDAA VFESM ERSVVTWSSMAAG
Subjt:  -----------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAG

Query:  YVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMIC
        YVQN +YEEALALF KA ETGLK D+FLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGI EAY VFRDVE RNVVLWNAMI 
Subjt:  YVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMIC

Query:  GLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGS
        GLSRHARSLEVMILFEKMQQ+GL+PNDVTFVSVLS CGHMGLVEKGQKYFDLMIKE+HLAPNV HYSCMVD LSRAG+T +AY+LI KMPF ASASMWGS
Subjt:  GLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGS

Query:  LLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELME
        LLASCR+HGNLELAEVAAKNLF+IEPQNAGNYLLLSNMYAANGKW+EVAKARKLLKESDVKKERGKSWIEIK++VHSFMVGERNHPKI+EIYSKLNEL+E
Subjt:  LLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELME

Query:  ELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        EL KLGY+VET+HDLHQVGESRKQELLRHHSEKLAFT GLLFLPPNAP+RIMKNLRICGDCHSFMKLAS  V+RDV+VRDTNRFHHF +GHCSCGDF
Subjt:  ELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

TrEMBL top hitse value%identityAlignment
A0A1S3C4B6 pentatricopeptide repeat-containing protein At5g047802.2e-23669.11Show/hide
Query:  MGFLHVCHFAAKLGRYKE--------KGLFQFLNLRLCTSQFSASSS----IVECEKATTEDFNGTH---------------------------------
        MGFLHVCHFA+  GRY+E        KG+FQFL+LRLCT+QF ASSS    IVECEK T++DFN TH                                 
Subjt:  MGFLHVCHFAAKLGRYKE--------KGLFQFLNLRLCTSQFSASSS----IVECEKATTEDFNGTH---------------------------------

Query:  -------------------------------------------------------------------------------------------------AAM
                                                                                                         AAM
Subjt:  -------------------------------------------------------------------------------------------------AAM

Query:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG
        DLNVFVATALLDVYAKCGLM DA  VFESMP+RSVVTWSSMAAGYVQNE+YEEALALF KA ETGLK D+FLMSSVICACAGLAAMIEG QVNALLSKSG
Subjt:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG

Query:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE
        FCSNIFVASSLIDMYAKCGGI E+Y VF+DVE RNVVLWNAMI GLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLS CGHMGLV+KGQKYFDLMIKE
Subjt:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE

Query:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK
        HHLAPNV+HYSCMVD LSRAGQTFEAY+LISKMPFNASASMWGSLLASCR+HGNLELAE AAK LF+IEP N+GNYLLLSNMYAANGKW+EVAK RKLLK
Subjt:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK

Query:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR
        ESDVKKERGKSWIEIKDKVH FMVGERNHPKI EIYSKLNE+M+EL KLGYK ET+HDLHQVGES KQELLRHHSEKLAF  GLLFLPP+APIRIMKNLR
Subjt:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR

Query:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        ICGDCHSFMKLAS FV RDVIVRDTNRFHHFK+G CSCGDF
Subjt:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

A0A6J1FED4 pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X21.2e-23472.41Show/hide
Query:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------
        MGFLHVCHFA   G+     +FQFL+LR+C S+F ASSS I+ECEK TTEDF  TH                                            
Subjt:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------

Query:  ------------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAA
                                                              AAM+LNVFVATALLDVYAKCGLMNDAA VFESM ERSVVTWSSMAA
Subjt:  ------------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAA

Query:  GYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMI
        GYVQN +YEEALALF KA ETGLK D+FLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGI EAY VFRDVE RNVVLWNAMI
Subjt:  GYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMI

Query:  CGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWG
         GLSRHARSLEVMILFEKMQQ+GL+PNDVTFVSVLS CGHMGLVEKGQKYFDLMIKE+HLAPNV HYSCMVD LSRAG+T +AY LI KMPF ASAS+WG
Subjt:  CGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWG

Query:  SLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELM
        SLLASCR+HGNLELAEVAAKNLF+IEP NAGNYLLLSNMYAANGKW+EVAKARKLLKESDVKKERGKSWIEIK++VH FMVGERNHPKI+EIYSKLNEL+
Subjt:  SLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELM

Query:  EELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        EEL KLGY+VET+HDLHQV ESRKQELLRHHSEKLAFT GLLFLPPNAP+RIMKNLRICGDCHSFMKLAS  V+RDV+VRDTNRFHHF +GHCSCGDF
Subjt:  EELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

A0A6J1FJ71 pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X39.1e-23572.53Show/hide
Query:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------
        MGFLHVCHFA   G+     +FQFL+LR+C S+F ASSS I+ECEK TTEDF  TH                                            
Subjt:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------

Query:  -----------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAG
                                                             AAM+LNVFVATALLDVYAKCGLMNDAA VFESM ERSVVTWSSMAAG
Subjt:  -----------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAG

Query:  YVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMIC
        YVQN +YEEALALF KA ETGLK D+FLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGI EAY VFRDVE RNVVLWNAMI 
Subjt:  YVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMIC

Query:  GLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGS
        GLSRHARSLEVMILFEKMQQ+GL+PNDVTFVSVLS CGHMGLVEKGQKYFDLMIKE+HLAPNV HYSCMVD LSRAG+T +AY LI KMPF ASAS+WGS
Subjt:  GLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGS

Query:  LLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELME
        LLASCR+HGNLELAEVAAKNLF+IEP NAGNYLLLSNMYAANGKW+EVAKARKLLKESDVKKERGKSWIEIK++VH FMVGERNHPKI+EIYSKLNEL+E
Subjt:  LLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELME

Query:  ELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        EL KLGY+VET+HDLHQV ESRKQELLRHHSEKLAFT GLLFLPPNAP+RIMKNLRICGDCHSFMKLAS  V+RDV+VRDTNRFHHF +GHCSCGDF
Subjt:  ELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

A0A6J1JQM7 pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X35.2e-23873.2Show/hide
Query:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------
        MGFLHVCHFA   G+     +FQFL+LR+C S+F ASSS I+ECEK TTEDF  TH                                            
Subjt:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------

Query:  -----------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAG
                                                             AAM+LNVFVATALLDVYAKCGLMNDAA VFESM ERSVVTWSSMAAG
Subjt:  -----------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAG

Query:  YVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMIC
        YVQN +YEEALALF KA ETGLK D+FLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGI EAY VFRDVE RNVVLWNAMI 
Subjt:  YVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMIC

Query:  GLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGS
        GLSRHARSLEVMILFEKMQQ+GL+PNDVTFVSVLS CGHMGLVEKGQKYFDLMIKE+HLAPNV HYSCMVD LSRAG+T +AY+LI KMPF ASASMWGS
Subjt:  GLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGS

Query:  LLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELME
        LLASCR+HGNLELAEVAAKNLF+IEPQNAGNYLLLSNMYAANGKW+EVAKARKLLKESDVKKERGKSWIEIK++VHSFMVGERNHPKI+EIYSKLNEL+E
Subjt:  LLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELME

Query:  ELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        EL KLGY+VET+HDLHQVGESRKQELLRHHSEKLAFT GLLFLPPNAP+RIMKNLRICGDCHSFMKLAS  V+RDV+VRDTNRFHHF +GHCSCGDF
Subjt:  ELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

A0A6J1JUA1 pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X26.7e-23873.08Show/hide
Query:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------
        MGFLHVCHFA   G+     +FQFL+LR+C S+F ASSS I+ECEK TTEDF  TH                                            
Subjt:  MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSS-IVECEKATTEDFNGTH--------------------------------------------

Query:  ------------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAA
                                                              AAM+LNVFVATALLDVYAKCGLMNDAA VFESM ERSVVTWSSMAA
Subjt:  ------------------------------------------------------AAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAA

Query:  GYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMI
        GYVQN +YEEALALF KA ETGLK D+FLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGI EAY VFRDVE RNVVLWNAMI
Subjt:  GYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMI

Query:  CGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWG
         GLSRHARSLEVMILFEKMQQ+GL+PNDVTFVSVLS CGHMGLVEKGQKYFDLMIKE+HLAPNV HYSCMVD LSRAG+T +AY+LI KMPF ASASMWG
Subjt:  CGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWG

Query:  SLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELM
        SLLASCR+HGNLELAEVAAKNLF+IEPQNAGNYLLLSNMYAANGKW+EVAKARKLLKESDVKKERGKSWIEIK++VHSFMVGERNHPKI+EIYSKLNEL+
Subjt:  SLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELM

Query:  EELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        EEL KLGY+VET+HDLHQVGESRKQELLRHHSEKLAFT GLLFLPPNAP+RIMKNLRICGDCHSFMKLAS  V+RDV+VRDTNRFHHF +GHCSCGDF
Subjt:  EELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

SwissProt top hitse value%identityAlignment
Q9CAA8 Putative pentatricopeptide repeat-containing protein At1g689305.1e-11042.6Show/hide
Query:  NVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFC
        +++V +AL+D+Y KC  ++ A  VF+ M +++VV+W++M  GY Q    EEA+ +F   + +G+  D + +   I ACA ++++ EG+Q +     SG  
Subjt:  NVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFC

Query:  SNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHH
          + V++SL+ +Y KCG I ++  +F ++ VR+ V W AM+   ++  R++E + LF+KM Q GL P+ VT   V+S C   GLVEKGQ+YF LM  E+ 
Subjt:  SNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHH

Query:  LAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKES
        + P++ HYSCM+D+ SR+G+  EA   I+ MPF   A  W +LL++CR+ GNLE+ + AA++L E++P +   Y LLS++YA+ GKW+ VA+ R+ ++E 
Subjt:  LAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKES

Query:  DVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRIC
        +VKKE G+SWI+ K K+HSF   + + P + +IY+KL EL  ++   GYK +T    H V E+ K ++L +HSE+LA   GL+F+P   PIR+ KNLR+C
Subjt:  DVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRIC

Query:  GDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
         DCH+  K  S    R+++VRD  RFH FKDG CSCGDF
Subjt:  GDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

Q9LTF4 Putative pentatricopeptide repeat-containing protein At5g526301.4e-11547.17Show/hide
Query:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG
        D +VFV ++L+D+YAKCG +  A  +F+ MP+R+VVTWS M  GY Q    EEAL LF +A    L  +++  SSVI  CA    +  G Q++ L  KS 
Subjt:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG

Query:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE
        F S+ FV SSL+ +Y+KCG    AY VF +V V+N+ +WNAM+   ++H+ + +V+ LF++M+  G+ PN +TF++VL+ C H GLV++G+ YFD M KE
Subjt:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE

Query:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK
          + P   HY+ +VD+L RAG+  EA  +I+ MP + + S+WG+LL SC  H N ELA  AA  +FE+ P ++G ++ LSN YAA+G++ + AKARKLL+
Subjt:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK

Query:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR
        +   KKE G SW+E ++KVH+F  GER H K  EIY KL EL EE+ K GY  +T + L +V    K + +R+HSE+LA   GL+  P + PIR+MKNLR
Subjt:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR

Query:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        +CGDCH+ +K  S   +R +IVRD NRFH F+DG CSC D+
Subjt:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

Q9LZ19 Pentatricopeptide repeat-containing protein At5g04780, mitochondrial1.5e-16558.7Show/hide
Query:  RYKEKGLFQFLNLR---LCTSQFSASSSI----VECEKATTEDFN--GTHAAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQ
        R + + L  FL +R      S+F+ SS +    V C+    +  +       +DLN++V TALLD+YAKCG++ DA  VFESM ++S VTWSSM AGYVQ
Subjt:  RYKEKGLFQFLNLR---LCTSQFSASSSI----VECEKATTEDFN--GTHAAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQ

Query:  NELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLS
        N+ YEEAL L+ +A+   L+Q++F +SSVICAC+ LAA+IEG Q++A++ KSGF SN+FVASS +DMYAKCG + E+Y +F +V+ +N+ LWN +I G +
Subjt:  NELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLS

Query:  RHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLA
        +HAR  EVMILFEKMQQ G+ PN+VTF S+LSVCGH GLVE+G+++F LM   + L+PNV+HYSCMVDIL RAG   EAY LI  +PF+ +AS+WGSLLA
Subjt:  RHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLA

Query:  SCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELP
        SCR + NLELAEVAA+ LFE+EP+NAGN++LLSN+YAAN +W E+AK+RKLL++ DVKK RGKSWI+IKDKVH+F VGE  HP+I EI S L+ L+ +  
Subjt:  SCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELP

Query:  KLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        K GYK   EH+LH V   +K+ELL  HSEKLA   GL+ LP ++P+RIMKNLRIC DCH FMK AS   +R +IVRD NRFHHF DGHCSCGDF
Subjt:  KLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g331701.5e-10943.31Show/hide
Query:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG
        DL+++V++ +LD+Y KCG M+ A   F+S+P    V W++M +G ++N   E A  +F + R  G+  DEF ++++  A + L A+ +G Q++A   K  
Subjt:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG

Query:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE
          ++ FV +SL+DMYAKCG I +AY +F+ +E+ N+  WNAM+ GL++H    E + LF++M+ +G+ P+ VTF+ VLS C H GLV +  K+   M  +
Subjt:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE

Query:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK
        + + P + HYSC+ D L RAG   +A NLI  M   ASASM+ +LLA+CR  G+ E  +  A  L E+EP ++  Y+LLSNMYAA  KW+E+  AR ++K
Subjt:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK

Query:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR
           VKK+ G SWIE+K+K+H F+V +R++ +   IY K+ +++ ++ + GY  ET+  L  V E  K+  L +HSEKLA   GLL  PP+ PIR++KNLR
Subjt:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR

Query:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        +CGDCH+ MK  +    R++++RD NRFH FKDG CSCGD+
Subjt:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

Q9SY02 Pentatricopeptide repeat-containing protein At4g027509.4e-11244.19Show/hide
Query:  NVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFC
        NV     ++  YA+CG +++A  +F+ MP+R  V+W++M AGY Q+    EAL LF +    G + +    SS +  CA + A+  G Q++  L K G+ 
Subjt:  NVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFC

Query:  SNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHH
        +  FV ++L+ MY KCG I EA ++F+++  +++V WN MI G SRH      +  FE M++ GL P+D T V+VLS C H GLV+KG++YF  M +++ 
Subjt:  SNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHH

Query:  LAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKES
        + PN  HY+CMVD+L RAG   +A+NL+  MPF   A++WG+LL + R HGN ELAE AA  +F +EP+N+G Y+LLSN+YA++G+W +V K R  +++ 
Subjt:  LAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKES

Query:  DVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRIC
         VKK  G SWIEI++K H+F VG+  HP+  EI++ L EL   + K GY  +T   LH V E  K+ ++R+HSE+LA   G++ +    PIR++KNLR+C
Subjt:  DVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRIC

Query:  GDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
         DCH+ +K  +    R +I+RD NRFHHFKDG CSCGD+
Subjt:  GDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

Arabidopsis top hitse value%identityAlignment
AT1G68930.1 pentatricopeptide (PPR) repeat-containing protein3.7e-11142.6Show/hide
Query:  NVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFC
        +++V +AL+D+Y KC  ++ A  VF+ M +++VV+W++M  GY Q    EEA+ +F   + +G+  D + +   I ACA ++++ EG+Q +     SG  
Subjt:  NVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFC

Query:  SNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHH
          + V++SL+ +Y KCG I ++  +F ++ VR+ V W AM+   ++  R++E + LF+KM Q GL P+ VT   V+S C   GLVEKGQ+YF LM  E+ 
Subjt:  SNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHH

Query:  LAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKES
        + P++ HYSCM+D+ SR+G+  EA   I+ MPF   A  W +LL++CR+ GNLE+ + AA++L E++P +   Y LLS++YA+ GKW+ VA+ R+ ++E 
Subjt:  LAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKES

Query:  DVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRIC
        +VKKE G+SWI+ K K+HSF   + + P + +IY+KL EL  ++   GYK +T    H V E+ K ++L +HSE+LA   GL+F+P   PIR+ KNLR+C
Subjt:  DVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRIC

Query:  GDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
         DCH+  K  S    R+++VRD  RFH FKDG CSCGDF
Subjt:  GDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.7e-11344.19Show/hide
Query:  NVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFC
        NV     ++  YA+CG +++A  +F+ MP+R  V+W++M AGY Q+    EAL LF +    G + +    SS +  CA + A+  G Q++  L K G+ 
Subjt:  NVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFC

Query:  SNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHH
        +  FV ++L+ MY KCG I EA ++F+++  +++V WN MI G SRH      +  FE M++ GL P+D T V+VLS C H GLV+KG++YF  M +++ 
Subjt:  SNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHH

Query:  LAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKES
        + PN  HY+CMVD+L RAG   +A+NL+  MPF   A++WG+LL + R HGN ELAE AA  +F +EP+N+G Y+LLSN+YA++G+W +V K R  +++ 
Subjt:  LAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKES

Query:  DVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRIC
         VKK  G SWIEI++K H+F VG+  HP+  EI++ L EL   + K GY  +T   LH V E  K+ ++R+HSE+LA   G++ +    PIR++KNLR+C
Subjt:  DVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRIC

Query:  GDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
         DCH+ +K  +    R +I+RD NRFHHFKDG CSCGD+
Subjt:  GDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-11043.31Show/hide
Query:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG
        DL+++V++ +LD+Y KCG M+ A   F+S+P    V W++M +G ++N   E A  +F + R  G+  DEF ++++  A + L A+ +G Q++A   K  
Subjt:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG

Query:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE
          ++ FV +SL+DMYAKCG I +AY +F+ +E+ N+  WNAM+ GL++H    E + LF++M+ +G+ P+ VTF+ VLS C H GLV +  K+   M  +
Subjt:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE

Query:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK
        + + P + HYSC+ D L RAG   +A NLI  M   ASASM+ +LLA+CR  G+ E  +  A  L E+EP ++  Y+LLSNMYAA  KW+E+  AR ++K
Subjt:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK

Query:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR
           VKK+ G SWIE+K+K+H F+V +R++ +   IY K+ +++ ++ + GY  ET+  L  V E  K+  L +HSEKLA   GLL  PP+ PIR++KNLR
Subjt:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR

Query:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        +CGDCH+ MK  +    R++++RD NRFH FKDG CSCGD+
Subjt:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

AT5G04780.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-16658.7Show/hide
Query:  RYKEKGLFQFLNLR---LCTSQFSASSSI----VECEKATTEDFN--GTHAAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQ
        R + + L  FL +R      S+F+ SS +    V C+    +  +       +DLN++V TALLD+YAKCG++ DA  VFESM ++S VTWSSM AGYVQ
Subjt:  RYKEKGLFQFLNLR---LCTSQFSASSSI----VECEKATTEDFN--GTHAAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQ

Query:  NELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLS
        N+ YEEAL L+ +A+   L+Q++F +SSVICAC+ LAA+IEG Q++A++ KSGF SN+FVASS +DMYAKCG + E+Y +F +V+ +N+ LWN +I G +
Subjt:  NELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLS

Query:  RHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLA
        +HAR  EVMILFEKMQQ G+ PN+VTF S+LSVCGH GLVE+G+++F LM   + L+PNV+HYSCMVDIL RAG   EAY LI  +PF+ +AS+WGSLLA
Subjt:  RHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLA

Query:  SCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELP
        SCR + NLELAEVAA+ LFE+EP+NAGN++LLSN+YAAN +W E+AK+RKLL++ DVKK RGKSWI+IKDKVH+F VGE  HP+I EI S L+ L+ +  
Subjt:  SCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELP

Query:  KLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        K GYK   EH+LH V   +K+ELL  HSEKLA   GL+ LP ++P+RIMKNLRIC DCH FMK AS   +R +IVRD NRFHHF DGHCSCGDF
Subjt:  KLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF

AT5G52630.1 mitochondrial RNAediting factor 19.9e-11747.17Show/hide
Query:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG
        D +VFV ++L+D+YAKCG +  A  +F+ MP+R+VVTWS M  GY Q    EEAL LF +A    L  +++  SSVI  CA    +  G Q++ L  KS 
Subjt:  DLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYEEALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSG

Query:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE
        F S+ FV SSL+ +Y+KCG    AY VF +V V+N+ +WNAM+   ++H+ + +V+ LF++M+  G+ PN +TF++VL+ C H GLV++G+ YFD M KE
Subjt:  FCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKE

Query:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK
          + P   HY+ +VD+L RAG+  EA  +I+ MP + + S+WG+LL SC  H N ELA  AA  +FE+ P ++G ++ LSN YAA+G++ + AKARKLL+
Subjt:  HHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQNAGNYLLLSNMYAANGKWNEVAKARKLLK

Query:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR
        +   KKE G SW+E ++KVH+F  GER H K  EIY KL EL EE+ K GY  +T + L +V    K + +R+HSE+LA   GL+  P + PIR+MKNLR
Subjt:  ESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTTGLLFLPPNAPIRIMKNLR

Query:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF
        +CGDCH+ +K  S   +R +IVRD NRFH F+DG CSC D+
Subjt:  ICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATTTCTCCATGTTTGCCATTTTGCAGCAAAGTTAGGACGATACAAAGAGAAGGGACTCTTTCAGTTTCTTAATCTTCGTCTTTGCACTAGCCAATTTTCTGCATC
ATCATCCATTGTTGAGTGTGAAAAAGCAACTACAGAGGATTTCAATGGTACCCATGCTGCAATGGATCTGAATGTTTTTGTTGCAACTGCATTGCTTGATGTTTATGCAA
AATGTGGTTTGATGAATGATGCGGCTTGTGTTTTTGAGTCCATGCCTGAGAGGAGTGTTGTCACATGGAGTTCAATGGCAGCAGGGTACGTGCAAAATGAGCTATATGAG
GAAGCTTTGGCATTGTTTTGTAAAGCTCGGGAAACGGGGTTGAAACAGGACGAGTTTCTAATGTCATCTGTGATTTGTGCTTGTGCTGGATTGGCAGCCATGATAGAAGG
GAACCAGGTGAATGCTTTGCTATCTAAATCTGGTTTTTGTTCTAATATCTTTGTTGCTTCTTCTCTTATTGACATGTATGCAAAATGTGGTGGCATTGGAGAAGCTTACA
ATGTGTTTCGAGATGTAGAAGTGAGAAATGTTGTTTTGTGGAATGCTATGATATGTGGCTTGTCGAGACATGCTCGTTCACTCGAGGTGATGATTTTATTTGAGAAAATG
CAGCAGATGGGCTTGAGTCCAAATGATGTTACTTTTGTTTCTGTTTTGTCTGTTTGTGGTCATATGGGATTGGTTGAAAAAGGACAGAAATATTTTGACCTGATGATAAA
AGAGCATCATTTGGCACCAAATGTCCTTCACTATTCTTGTATGGTTGACATTCTTAGTCGGGCAGGGCAGACTTTTGAGGCTTACAATTTGATAAGTAAAATGCCTTTTA
ATGCCTCTGCTTCCATGTGGGGTTCCCTTCTGGCTTCTTGTAGGAGCCATGGGAATCTTGAACTTGCTGAGGTTGCTGCTAAAAATTTGTTTGAGATTGAACCACAAAAT
GCAGGAAACTATTTGTTGCTATCCAACATGTATGCAGCAAATGGGAAGTGGAACGAAGTTGCGAAGGCAAGGAAGCTCCTTAAAGAAAGTGATGTGAAGAAAGAGAGGGG
CAAGAGTTGGATTGAGATTAAGGACAAGGTTCACTCGTTTATGGTCGGAGAGAGGAATCATCCTAAGATTTCTGAAATTTACTCAAAGTTGAACGAGTTGATGGAAGAAT
TACCGAAGCTTGGTTACAAGGTGGAGACCGAGCATGATCTTCATCAAGTGGGAGAGAGTAGAAAACAAGAACTTTTGAGGCATCACAGCGAGAAACTTGCTTTTACTACG
GGATTATTGTTCTTACCTCCAAATGCACCCATTAGGATTATGAAAAACCTTAGAATCTGTGGAGACTGCCACTCTTTTATGAAGCTTGCCTCGGGATTTGTTCAGAGAGA
TGTAATAGTCAGGGACACCAACCGATTTCACCATTTTAAGGACGGGCATTGTTCTTGTGGGGATTTTTGCACTTGTCTACCGACCTCCATTTCCAGAAGCAGACTGTTAA
TAGTTGCCATTGCATTTGTTTCTCCAGCTTGCAGAGACGAGAAACCCATATATGAACAAGAACATCTAGTTCTAATCACCTCTCTTGTTTTTGCACTTCTAAGAATCCTT
CTGGACTTCTGTTTATGTGACCAAGGGAGGCTAAACACTGACCATATTCTCCAGTTTATTGAAATATCATTCCCATTCCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATTTCTCCATGTTTGCCATTTTGCAGCAAAGTTAGGACGATACAAAGAGAAGGGACTCTTTCAGTTTCTTAATCTTCGTCTTTGCACTAGCCAATTTTCTGCATC
ATCATCCATTGTTGAGTGTGAAAAAGCAACTACAGAGGATTTCAATGGTACCCATGCTGCAATGGATCTGAATGTTTTTGTTGCAACTGCATTGCTTGATGTTTATGCAA
AATGTGGTTTGATGAATGATGCGGCTTGTGTTTTTGAGTCCATGCCTGAGAGGAGTGTTGTCACATGGAGTTCAATGGCAGCAGGGTACGTGCAAAATGAGCTATATGAG
GAAGCTTTGGCATTGTTTTGTAAAGCTCGGGAAACGGGGTTGAAACAGGACGAGTTTCTAATGTCATCTGTGATTTGTGCTTGTGCTGGATTGGCAGCCATGATAGAAGG
GAACCAGGTGAATGCTTTGCTATCTAAATCTGGTTTTTGTTCTAATATCTTTGTTGCTTCTTCTCTTATTGACATGTATGCAAAATGTGGTGGCATTGGAGAAGCTTACA
ATGTGTTTCGAGATGTAGAAGTGAGAAATGTTGTTTTGTGGAATGCTATGATATGTGGCTTGTCGAGACATGCTCGTTCACTCGAGGTGATGATTTTATTTGAGAAAATG
CAGCAGATGGGCTTGAGTCCAAATGATGTTACTTTTGTTTCTGTTTTGTCTGTTTGTGGTCATATGGGATTGGTTGAAAAAGGACAGAAATATTTTGACCTGATGATAAA
AGAGCATCATTTGGCACCAAATGTCCTTCACTATTCTTGTATGGTTGACATTCTTAGTCGGGCAGGGCAGACTTTTGAGGCTTACAATTTGATAAGTAAAATGCCTTTTA
ATGCCTCTGCTTCCATGTGGGGTTCCCTTCTGGCTTCTTGTAGGAGCCATGGGAATCTTGAACTTGCTGAGGTTGCTGCTAAAAATTTGTTTGAGATTGAACCACAAAAT
GCAGGAAACTATTTGTTGCTATCCAACATGTATGCAGCAAATGGGAAGTGGAACGAAGTTGCGAAGGCAAGGAAGCTCCTTAAAGAAAGTGATGTGAAGAAAGAGAGGGG
CAAGAGTTGGATTGAGATTAAGGACAAGGTTCACTCGTTTATGGTCGGAGAGAGGAATCATCCTAAGATTTCTGAAATTTACTCAAAGTTGAACGAGTTGATGGAAGAAT
TACCGAAGCTTGGTTACAAGGTGGAGACCGAGCATGATCTTCATCAAGTGGGAGAGAGTAGAAAACAAGAACTTTTGAGGCATCACAGCGAGAAACTTGCTTTTACTACG
GGATTATTGTTCTTACCTCCAAATGCACCCATTAGGATTATGAAAAACCTTAGAATCTGTGGAGACTGCCACTCTTTTATGAAGCTTGCCTCGGGATTTGTTCAGAGAGA
TGTAATAGTCAGGGACACCAACCGATTTCACCATTTTAAGGACGGGCATTGTTCTTGTGGGGATTTTTGCACTTGTCTACCGACCTCCATTTCCAGAAGCAGACTGTTAA
TAGTTGCCATTGCATTTGTTTCTCCAGCTTGCAGAGACGAGAAACCCATATATGAACAAGAACATCTAGTTCTAATCACCTCTCTTGTTTTTGCACTTCTAAGAATCCTT
CTGGACTTCTGTTTATGTGACCAAGGGAGGCTAAACACTGACCATATTCTCCAGTTTATTGAAATATCATTCCCATTCCTTTAA
Protein sequenceShow/hide protein sequence
MGFLHVCHFAAKLGRYKEKGLFQFLNLRLCTSQFSASSSIVECEKATTEDFNGTHAAMDLNVFVATALLDVYAKCGLMNDAACVFESMPERSVVTWSSMAAGYVQNELYE
EALALFCKARETGLKQDEFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIGEAYNVFRDVEVRNVVLWNAMICGLSRHARSLEVMILFEKM
QQMGLSPNDVTFVSVLSVCGHMGLVEKGQKYFDLMIKEHHLAPNVLHYSCMVDILSRAGQTFEAYNLISKMPFNASASMWGSLLASCRSHGNLELAEVAAKNLFEIEPQN
AGNYLLLSNMYAANGKWNEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELMEELPKLGYKVETEHDLHQVGESRKQELLRHHSEKLAFTT
GLLFLPPNAPIRIMKNLRICGDCHSFMKLASGFVQRDVIVRDTNRFHHFKDGHCSCGDFCTCLPTSISRSRLLIVAIAFVSPACRDEKPIYEQEHLVLITSLVFALLRIL
LDFCLCDQGRLNTDHILQFIEISFPFL