; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022305 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022305
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00154107:389083..402475
RNA-Seq ExpressionSgr022305
SyntenySgr022305
Gene Ontology termsGO:0006644 - phospholipid metabolic process (biological process)
GO:0009451 - RNA modification (biological process)
GO:0015986 - ATP synthesis coupled proton transport (biological process)
GO:0050482 - arachidonic acid secretion (biological process)
GO:0000276 - mitochondrial proton-transporting ATP synthase complex, coupling factor F(o) (cellular component)
GO:0015078 - proton transmembrane transporter activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0004623 - phospholipase A2 activity (molecular function)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR036444 - Phospholipase A2 domain superfamily
IPR033113 - Phospholipase A2, histidine active site
IPR032867 - DYW domain
IPR029004 - Ribosomal L28e/Mak16
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR006808 - ATP synthase, F0 complex, subunit G, mitochondrial
IPR002885 - Pentatricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586480.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]4.7e-26471.23Show/hide
Query:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLCTNA----RNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR
        MASSIPS TT NSII++NF S     HSPAV            KSFRPT  C NA     NRK  RRYDG +TTNTLSKSQN+TSDSV            
Subjt:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLCTNA----RNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR

Query:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT
        SLPSTVDLVALCE G+V D LEFIGQGAN DYG+ TALLNSC N KLLEAGRRVD LLK TKF GDV LNN LIE+YSCCGCMKDARRVFDKMPDRD RT
Subjt:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT

Query:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT
        WNLMIKGY ENG GD GLALFE+MKSV LQPNSETF +VLAACAMAEAVEEG+ YF SMENEYGIIP  EHYLGVVDVLGK  HLIEAEEF++K+PI PT
Subjt:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT

Query:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH
         KIWDALR YARLHGNIELEDRAEEL+ + DPSMA T AK PLP PRKQ ATNMLEEKDRVREFRCA+PYKEE + GKLKGLNGQMREAGYVPDTRYVLH
Subjt:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH

Query:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYWSKLVWAEGKQARSLSKRR
        DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCH+AIKIMSKIVGRELIVRDNKRFHHFKDGKCS                       
Subjt:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYWSKLVWAEGKQARSLSKRR

Query:  RSARLLYIMGFSQRAFTLREPSKSGTSLRFVPRCPLKFSFLYCVAHTLRARLLKLAATESSHFYHILPFSKSSALVSFGLSRSFFSFILDLMASKLHQLQ
                            P+ S             FS L+ V                       P S +S     G  + F SF+LDLMASKLHQLQ
Subjt:  RSARLLYIMGFSQRAFTLREPSKSGTSLRFVPRCPLKFSFLYCVAHTLRARLLKLAATESSHFYHILPFSKSSALVSFGLSRSFFSFILDLMASKLHQLQ

Query:  SKATQASQFALKHGSSYYKQLLEQNKQYIQEPATVEKCNLLSKQLLYTRLASIPGRCESFWKELDYVKNLWKNRQELKVEDAGIAALFGLECFAWFCAGE
        SKA +ASQ+A+KHGSSYYKQLLEQNKQYIQEPATVEKC+LLS+QL YTRLASIPGR ESFWKELD VKNLWKNRQELKVEDAGIAALFGLECFAWFCAGE
Subjt:  SKATQASQFALKHGSSYYKQLLEQNKQYIQEPATVEKCNLLSKQLLYTRLASIPGRCESFWKELDYVKNLWKNRQELKVEDAGIAALFGLECFAWFCAGE

Query:  IVGRGFTFT
        IVGRGFTFT
Subjt:  IVGRGFTFT

KAG7021339.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-26471.37Show/hide
Query:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLCTNA----RNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR
        MASSIPS TT NSII++NF S     HSPAV            KSFRPT  C NA     NRK  RRYDG +TTNTLSKSQN+TSDSV            
Subjt:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLCTNA----RNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR

Query:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT
        SLPSTVDLVALCE G+V D LEFIGQGAN DYG+ TALLNSC N KLLEAGRRVD LLK TKF GDV LNNKLIE+YSCCGCMKDARRVFDKMPDRD RT
Subjt:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT

Query:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT
        WNLMIKGY ENG GD GLALFE+MKSV LQPNSETF +VLAACAMAEAVEEG+ YF SMENEYGIIP  EHYLGVVDVLGK  HLIEAEEF++K+PI PT
Subjt:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT

Query:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH
         KIWDALR YARLHGNIELEDRAEEL+ + DPSMA T AK PLP PRKQ ATNMLEEKDRVREFRCA+PYKEE + GKLKGLNGQMREAGYVPDTRYVLH
Subjt:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH

Query:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYWSKLVWAEGKQARSLSKRR
        DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCH+AIKIMSKIVGRELIVRDNKRFHHFKDGKCS                       
Subjt:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYWSKLVWAEGKQARSLSKRR

Query:  RSARLLYIMGFSQRAFTLREPSKSGTSLRFVPRCPLKFSFLYCVAHTLRARLLKLAATESSHFYHILPFSKSSALVSFGLSRSFFSFILDLMASKLHQLQ
                            P+ S             FS L+ V                       P S +S     G  + F SF+LDLMASKLHQLQ
Subjt:  RSARLLYIMGFSQRAFTLREPSKSGTSLRFVPRCPLKFSFLYCVAHTLRARLLKLAATESSHFYHILPFSKSSALVSFGLSRSFFSFILDLMASKLHQLQ

Query:  SKATQASQFALKHGSSYYKQLLEQNKQYIQEPATVEKCNLLSKQLLYTRLASIPGRCESFWKELDYVKNLWKNRQELKVEDAGIAALFGLECFAWFCAGE
        SKA +ASQ+A+KHGSSYYKQLLEQNKQYIQEPATVEKC+LLS+QL YTRLASIPGR ESFWKELD VKNLWKNRQELKVEDAGIAALFGLECFAWFCAGE
Subjt:  SKATQASQFALKHGSSYYKQLLEQNKQYIQEPATVEKCNLLSKQLLYTRLASIPGRCESFWKELDYVKNLWKNRQELKVEDAGIAALFGLECFAWFCAGE

Query:  IVGRGFTFT
        IVGRGFTFT
Subjt:  IVGRGFTFT

XP_022154624.1 pentatricopeptide repeat-containing protein At2g15690-like [Momordica charantia]2.6e-21481.09Show/hide
Query:  MASSIPSHTTRNSIISSNFNSHSPAVIRTATFSSSRCPKSFRPTTLC----TNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFRSLPST
        MASSIPS T  NSIIS NF SH     R++ FS S C KSFRPT  C    +NARNRK  RRYDG  TTN LSKSQNQTS+SVFS            PST
Subjt:  MASSIPSHTTRNSIISSNFNSHSPAVIRTATFSSSRCPKSFRPTTLC----TNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFRSLPST

Query:  VDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMI
         DLVALCEEGKVTDALE+I  GA+ADYG+FTALLNSCGNLKLL+AGRRVDGLLKRTKFRGDVELNNKLIELYS CGCMKDARRVF+KMPDRD RTWNLMI
Subjt:  VDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMI

Query:  KGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWD
        KGY ENG GDDG+ALFEQMK V LQPNSETF +VLAACAMAEAVEEG+ YF SMENEYG  PG EHYLGVVDVLGK  HLIEA EF+EKMPI P   IWD
Subjt:  KGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWD

Query:  ALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEA
        ALRNYAR+HG++ELEDRAEELMLALDPS+ ATA   P+P P+KQ ATNMLEEK+RVREFRCA+PYKEE +GKLKGL+GQMREAGYVPDTRYVLHDIDEEA
Subjt:  ALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEA

Query:  KQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        KQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCH+AIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
Subjt:  KQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

XP_022938076.1 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like isoform X1 [Cucurbita moschata]3.7e-20879.67Show/hide
Query:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLC----TNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR
        MASSIPS TT NSII++NF S     HSPAV            KSFRPT  C    +N+ NRK  RRYDG +TTNTLSKSQN+TSDSV            
Subjt:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLC----TNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR

Query:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT
        SLPSTVDLVALCE G+V D LEFIGQGAN DYG+ TALLNSC N KLLEAGRRVD LLK TKF GDV LNN LIE+YSCCGCMKDARRVFDKMPDRD RT
Subjt:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT

Query:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT
        WNLMIKGY ENG GD GLALFE+MKSV LQPNSETF +VLAACAMAEAVEEG+ YF SMENEYGIIP  EHYLGVVDVLGK  HLIEAEEF++K+PI PT
Subjt:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT

Query:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH
         KIWDALR YARLHGNIELEDRAEEL+ + DPSMA TAAK PL  PRKQ ATNMLEEKDRVREFRCA+PYKEE + GKLKGLNGQMREAGYVPDTRYVLH
Subjt:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH

Query:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCH+AIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
Subjt:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

XP_023538219.1 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita pepo subsp. pepo]2.4e-20779.25Show/hide
Query:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLCTNA----RNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR
        MASSIPS T  NSII++NF S     HSPAV            KSFRP   C NA     NRK  RRYDG +TTNTLSKSQN+TSDSV            
Subjt:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLCTNA----RNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR

Query:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT
        SLPSTVDLVALCE G+V D LEFIGQGA+ DYG+FTALLNSC N KLLEAGRRVD LLK TKF GDV LNNKLIE+YSCCGCMKDARRVFDKMPDRD RT
Subjt:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT

Query:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT
        WNLMIKGY ENG GD GLALFE+MK+V LQPNSETF  VLAACAMAEAVEEG+ YF SMENEYGIIP  EHYLGVVDVLGK  HLIEAEEF++K+PI PT
Subjt:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT

Query:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH
         KIWDALR YARLHGNIELEDRAEEL+ + DPSMA T AK PLP PRKQ ATNMLEEKDRVREFRCA+PYKEE + GKLKGLNGQMREAGYVPDTRYVLH
Subjt:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH

Query:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCH+AIKIMSKIVGRELIVRDNKRFHHFKD KCSCGDYW
Subjt:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

TrEMBL top hitse value%identityAlignment
A0A5A7V2K9 Pentatricopeptide repeat-containing protein1.3e-20377.75Show/hide
Query:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLCTNA----RNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR
        M+SSI SH T NSI ++NF S     H PAV            KSFR T L TNA    R RK  R+YD  STTNTLS+SQNQTSDSV+           
Subjt:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLCTNA----RNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR

Query:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT
        S PSTVDL+ALC+EGKV DALE+IGQGA  DYG+FTALLNSC NLKLLEAGRRVDGLLK TKFRGDVELNNKLIE+YS CGCMK+AR+VFDKMP++DTRT
Subjt:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT

Query:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT
        WNLMIKGY ENG GD+GLALFEQMK+V LQPNSETF +VLAACAMAEAVEEG+ YF  M NEYGI P  EHYLGVVDVLGK  HLIEAEEF+EKMPI PT
Subjt:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT

Query:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHD
         KIWDALRNYARLHGN+ELEDRAEELM +LDPS  AT  K  LP  RKQ +TNMLEEKDRVREFR A+PYKEE +GKLKGLNGQM+EAGYVPDTRYVLHD
Subjt:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHD

Query:  IDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        IDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCH+AIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
Subjt:  IDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

A0A6J1DMP2 pentatricopeptide repeat-containing protein At2g15690-like1.3e-21481.09Show/hide
Query:  MASSIPSHTTRNSIISSNFNSHSPAVIRTATFSSSRCPKSFRPTTLC----TNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFRSLPST
        MASSIPS T  NSIIS NF SH     R++ FS S C KSFRPT  C    +NARNRK  RRYDG  TTN LSKSQNQTS+SVFS            PST
Subjt:  MASSIPSHTTRNSIISSNFNSHSPAVIRTATFSSSRCPKSFRPTTLC----TNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFRSLPST

Query:  VDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMI
         DLVALCEEGKVTDALE+I  GA+ADYG+FTALLNSCGNLKLL+AGRRVDGLLKRTKFRGDVELNNKLIELYS CGCMKDARRVF+KMPDRD RTWNLMI
Subjt:  VDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMI

Query:  KGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWD
        KGY ENG GDDG+ALFEQMK V LQPNSETF +VLAACAMAEAVEEG+ YF SMENEYG  PG EHYLGVVDVLGK  HLIEA EF+EKMPI P   IWD
Subjt:  KGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWD

Query:  ALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEA
        ALRNYAR+HG++ELEDRAEELMLALDPS+ ATA   P+P P+KQ ATNMLEEK+RVREFRCA+PYKEE +GKLKGL+GQMREAGYVPDTRYVLHDIDEEA
Subjt:  ALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEA

Query:  KQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        KQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCH+AIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
Subjt:  KQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

A0A6J1FC52 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like isoform X11.8e-20879.67Show/hide
Query:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLC----TNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR
        MASSIPS TT NSII++NF S     HSPAV            KSFRPT  C    +N+ NRK  RRYDG +TTNTLSKSQN+TSDSV            
Subjt:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLC----TNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR

Query:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT
        SLPSTVDLVALCE G+V D LEFIGQGAN DYG+ TALLNSC N KLLEAGRRVD LLK TKF GDV LNN LIE+YSCCGCMKDARRVFDKMPDRD RT
Subjt:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT

Query:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT
        WNLMIKGY ENG GD GLALFE+MKSV LQPNSETF +VLAACAMAEAVEEG+ YF SMENEYGIIP  EHYLGVVDVLGK  HLIEAEEF++K+PI PT
Subjt:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT

Query:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH
         KIWDALR YARLHGNIELEDRAEEL+ + DPSMA TAAK PL  PRKQ ATNMLEEKDRVREFRCA+PYKEE + GKLKGLNGQMREAGYVPDTRYVLH
Subjt:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH

Query:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCH+AIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
Subjt:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

A0A6J1FHU4 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like isoform X23.3e-20779.63Show/hide
Query:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLC----TNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR
        MASSIPS TT NSII++NF S     HSPAV            KSFRPT  C    +N+ NRK  RRYDG +TTNTLSKSQN+TSDSV            
Subjt:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLC----TNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR

Query:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT
        SLPSTVDLVALCE G+V D LEFIGQGAN DYG+ TALLNSC N KLLEAGRRVD LLK TKF GDV LNN LIE+YSCCGCMKDARRVFDKMPDRD RT
Subjt:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT

Query:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT
        WNLMIKGY ENG GD GLALFE+MKSV LQPNSETF +VLAACAMAEAVEEG+ YF SMENEYGIIP  EHYLGVVDVLGK  HLIEAEEF++K+PI PT
Subjt:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT

Query:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH
         KIWDALR YARLHGNIELEDRAEEL+ + DPSMA TAAK PL  PRKQ ATNMLEEKDRVREFRCA+PYKEE + GKLKGLNGQMREAGYVPDTRYVLH
Subjt:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH

Query:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDY
        DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCH+AIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDY

A0A6J1HQH5 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like1.2e-20478.63Show/hide
Query:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLCTNA----RNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR
        MASSIPS TT NSII++NF S     HSPAV            KSFRPT  C NA     NRK  RRYDG +TTNTLSKSQN+TSDSV S          
Subjt:  MASSIPSHTTRNSIISSNFNS-----HSPAVIRTATFSSSRCPKSFRPTTLCTNA----RNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFR

Query:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT
          PSTV LVALCE G+V D LEFIG GAN DYG+FTALLNSC N KLLEAGRRVD LLK TKF GDV LNNKLIE+YSCCGCMKDARRVFDKMPDRD RT
Subjt:  SLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRT

Query:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT
        WNLMIKGY ENG GD GLALFE+MK+V LQPNSETF  VLAACAMAEAVEEG+ YF SMENEYGIIP  EHYL VVDVLGK  HLIEAEEF++KMPI PT
Subjt:  WNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPT

Query:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH
         KIWDALR YARLHGNIELEDRAEEL+ + DPSMA TAAK PLP  RKQ  TNMLEEKDRVR+FRC++P KEE + GKLKGLNGQMREAGYVPDTRYVLH
Subjt:  TKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEAD-GKLKGLNGQMREAGYVPDTRYVLH

Query:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCH+AIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
Subjt:  DIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

SwissProt top hitse value%identityAlignment
Q680H3 Pentatricopeptide repeat-containing protein At2g255801.6e-7337.37Show/hide
Query:  ALCEEGKVTDALEFIGQGANADY----GLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMI
        A C+ GKV  AL  I   A+ +Y         L   CG  + L+  + V G +  +    D+  N+ L+E+YS CG   +A  VF+KM +++  TW ++I
Subjt:  ALCEEGKVTDALEFIGQGANADY----GLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMI

Query:  KGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWD
        + + +NG G+D + +F + K     P+ + F+ +  AC M   V+EG+++F+SM  +YGI P  E Y+ +V++      L EA EFVE+MP++P   +W+
Subjt:  KGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWD

Query:  ALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRP-LPLPRKQLATNMLEE--------KDRVREFRCA---IPYKEEADGKLKGLNGQMREAGYVPD
         L N +R+HGN+EL D   E++  LDP+     ++   +P+    +    L++        K  ++EFR     +P  +E    L+ L   M E GYV +
Subjt:  ALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRP-LPLPRKQLATNMLEE--------KDRVREFRCA---IPYKEEADGKLKGLNGQMREAGYVPD

Query:  TRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        TR  LHDID+E+K+  L  HSER+A A  ++++  R    +IKNLR+C DCH+A+KIMS IVGRE+I RD KRFH  K+G C+C DYW
Subjt:  TRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

Q9SUH6 Pentatricopeptide repeat-containing protein At4g307005.5e-7437.17Show/hide
Query:  DSDFRSLPSTVDLVA-LCEEGKVTDALEFIGQGANADYG----LFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVF
        +S  +SLPS   +++   + G   DA+    +   +++       T +L++C  L  L  G+ V  L++ T F   + ++  LI +Y+ CG + +ARR+F
Subjt:  DSDFRSLPSTVDLVA-LCEEGKVTDALEFIGQGANADYG----LFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVF

Query:  DKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEE
        D M  ++  TWN MI GY  +G G + L +F +M +  + P   TF  VL AC+ A  V+EG   F SM + YG  P  +HY  +VD+LG+  HL  A +
Subjt:  DKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEE

Query:  FVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEELMLALDP-----------------------SMAATAAKRPLPLPRKQLATNMLEEKDRVREFRC-
        F+E M I+P + +W+ L    R+H +  L     E +  LDP                       ++  TA KR L    K     ++E  +    F   
Subjt:  FVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEELMLALDP-----------------------SMAATAAKRPLPLPRKQLATNMLEEKDRVREFRC-

Query:  --AIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDN
          + P  +E   KL+ L G+MREAGY P+T   LHD++EE ++  ++ HSERLAIA+GLI+T   T +RIIKNLR+C DCH+  K++SKI  R ++VRD 
Subjt:  --AIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDN

Query:  KRFHHFKDGKCSCGDYW
         RFHHFKDG CSCGDYW
Subjt:  KRFHHFKDGKCSCGDYW

Q9SUU7 Pentatricopeptide repeat-containing protein At4g32450, mitochondrial1.6e-7336.62Show/hide
Query:  DGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFRSLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALL----NSCGNLKLLEAGRRVDGLLKRTKFR
        DG S+  T      Q ++   +G      D     S  +L ++C EGKV  A+E I    N  Y +    L      CG+ + L+  + V   +  +   
Subjt:  DGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFRSLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALL----NSCGNLKLLEAGRRVDGLLKRTKFR

Query:  GDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYG
         D+   N +IE+YS CG ++DA  VF+ MP+R+  TW  +I+ + +NG G+D +  F + K    +P+ E F+ +  AC +   + EG+++F+SM  EYG
Subjt:  GDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYG

Query:  IIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPL-PLPRKQLATNMLEEKDRVRE
        IIP  EHY+ +V +L +  +L EA  FVE M  +P   +W+ L N +R+HG++ L DR ++++  LD S     +K  L P+    L    L+   +   
Subjt:  IIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPL-PLPRKQLATNMLEEKDRVRE

Query:  F--------RCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIV
        +          + P   E    LK L   M E GYVP ++  LHD+D+E+K + L  H+ER A     + TPAR+ +R++KNLR+C DCH+A+K+MSKIV
Subjt:  F--------RCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIV

Query:  GRELIVRDNKRFHHFKDGKCSCGDYW
        GRELI RD KRFHH KDG CSC +YW
Subjt:  GRELIVRDNKRFHHFKDGKCSCGDYW

Q9SY02 Pentatricopeptide repeat-containing protein At4g027502.3e-7236.15Show/hide
Query:  QGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMK
        +G   +   F++ L++C ++  LE G+++ G L +  +     + N L+ +Y  CG +++A  +F +M  +D  +WN MI GY+ +G G+  L  FE MK
Subjt:  QGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMK

Query:  SVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEE
           L+P+  T   VL+AC+    V++G  YF +M  +YG++P  +HY  +VD+LG+   L +A   ++ MP +P   IW  L   +R+HGN EL + A +
Subjt:  SVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEE

Query:  LMLALDPSMA----------------ATAAKRPLPL----PRKQLATNMLEEKDRVREFRCA---IPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDID
         + A++P  +                    K  + +     +K    + +E +++   F       P K+E    L+ L+ +M++AGYV  T  VLHD++
Subjt:  LMLALDPSMA----------------ATAAKRPLPL----PRKQLATNMLEEKDRVREFRCA---IPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDID

Query:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        EE K++ ++YHSERLA+AYG++   +   +R+IKNLR+C DCH+AIK M++I GR +I+RDN RFHHFKDG CSCGDYW
Subjt:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

Q9ZQE5 Pentatricopeptide repeat-containing protein At2g15690, mitochondrial1.6e-11855.15Show/hide
Query:  PSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWN
        PS  +++ LC+     DA+E + +GA  D   F  L  SC NLK LE  ++V     ++KFRGD +LNN +I ++  C  + DA+RVFD M D+D  +W+
Subjt:  PSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWN

Query:  LMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTK
        LM+  Y++NG+GDD L LFE+M    L+PN ETF  V  ACA    +EE  ++F SM+NE+GI P  EHYLGV+ VLGKC HL+EAE+++  +P +PT  
Subjt:  LMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTK

Query:  IWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDID
         W+A+RNYARLHG+I+LED  EELM+ +DPS  A   K P P P+    TNM+  K R+ EFR    YK+EA  ++    G +    YVPDTR+VLHDID
Subjt:  IWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDID

Query:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        +EAK+QAL YHSERLAIAYG+I TP R TL IIKNLR+CGDCH+ IKIMSKI+GR LIVRDNKRFHHFKDGKCSCGDYW
Subjt:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

Arabidopsis top hitse value%identityAlignment
AT2G15690.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-11955.15Show/hide
Query:  PSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWN
        PS  +++ LC+     DA+E + +GA  D   F  L  SC NLK LE  ++V     ++KFRGD +LNN +I ++  C  + DA+RVFD M D+D  +W+
Subjt:  PSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWN

Query:  LMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTK
        LM+  Y++NG+GDD L LFE+M    L+PN ETF  V  ACA    +EE  ++F SM+NE+GI P  EHYLGV+ VLGKC HL+EAE+++  +P +PT  
Subjt:  LMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTK

Query:  IWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDID
         W+A+RNYARLHG+I+LED  EELM+ +DPS  A   K P P P+    TNM+  K R+ EFR    YK+EA  ++    G +    YVPDTR+VLHDID
Subjt:  IWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDID

Query:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        +EAK+QAL YHSERLAIAYG+I TP R TL IIKNLR+CGDCH+ IKIMSKI+GR LIVRDNKRFHHFKDGKCSCGDYW
Subjt:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

AT2G25580.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-7437.37Show/hide
Query:  ALCEEGKVTDALEFIGQGANADY----GLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMI
        A C+ GKV  AL  I   A+ +Y         L   CG  + L+  + V G +  +    D+  N+ L+E+YS CG   +A  VF+KM +++  TW ++I
Subjt:  ALCEEGKVTDALEFIGQGANADY----GLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMI

Query:  KGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWD
        + + +NG G+D + +F + K     P+ + F+ +  AC M   V+EG+++F+SM  +YGI P  E Y+ +V++      L EA EFVE+MP++P   +W+
Subjt:  KGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWD

Query:  ALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRP-LPLPRKQLATNMLEE--------KDRVREFRCA---IPYKEEADGKLKGLNGQMREAGYVPD
         L N +R+HGN+EL D   E++  LDP+     ++   +P+    +    L++        K  ++EFR     +P  +E    L+ L   M E GYV +
Subjt:  ALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRP-LPLPRKQLATNMLEE--------KDRVREFRCA---IPYKEEADGKLKGLNGQMREAGYVPD

Query:  TRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        TR  LHDID+E+K+  L  HSER+A A  ++++  R    +IKNLR+C DCH+A+KIMS IVGRE+I RD KRFH  K+G C+C DYW
Subjt:  TRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-7336.15Show/hide
Query:  QGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMK
        +G   +   F++ L++C ++  LE G+++ G L +  +     + N L+ +Y  CG +++A  +F +M  +D  +WN MI GY+ +G G+  L  FE MK
Subjt:  QGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMK

Query:  SVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEE
           L+P+  T   VL+AC+    V++G  YF +M  +YG++P  +HY  +VD+LG+   L +A   ++ MP +P   IW  L   +R+HGN EL + A +
Subjt:  SVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEE

Query:  LMLALDPSMA----------------ATAAKRPLPL----PRKQLATNMLEEKDRVREFRCA---IPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDID
         + A++P  +                    K  + +     +K    + +E +++   F       P K+E    L+ L+ +M++AGYV  T  VLHD++
Subjt:  LMLALDPSMA----------------ATAAKRPLPL----PRKQLATNMLEEKDRVREFRCA---IPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDID

Query:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW
        EE K++ ++YHSERLA+AYG++   +   +R+IKNLR+C DCH+AIK M++I GR +I+RDN RFHHFKDG CSCGDYW
Subjt:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDNKRFHHFKDGKCSCGDYW

AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein3.9e-7537.17Show/hide
Query:  DSDFRSLPSTVDLVA-LCEEGKVTDALEFIGQGANADYG----LFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVF
        +S  +SLPS   +++   + G   DA+    +   +++       T +L++C  L  L  G+ V  L++ T F   + ++  LI +Y+ CG + +ARR+F
Subjt:  DSDFRSLPSTVDLVA-LCEEGKVTDALEFIGQGANADYG----LFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVF

Query:  DKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEE
        D M  ++  TWN MI GY  +G G + L +F +M +  + P   TF  VL AC+ A  V+EG   F SM + YG  P  +HY  +VD+LG+  HL  A +
Subjt:  DKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEE

Query:  FVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEELMLALDP-----------------------SMAATAAKRPLPLPRKQLATNMLEEKDRVREFRC-
        F+E M I+P + +W+ L    R+H +  L     E +  LDP                       ++  TA KR L    K     ++E  +    F   
Subjt:  FVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEELMLALDP-----------------------SMAATAAKRPLPLPRKQLATNMLEEKDRVREFRC-

Query:  --AIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDN
          + P  +E   KL+ L G+MREAGY P+T   LHD++EE ++  ++ HSERLAIA+GLI+T   T +RIIKNLR+C DCH+  K++SKI  R ++VRD 
Subjt:  --AIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIVGRELIVRDN

Query:  KRFHHFKDGKCSCGDYW
         RFHHFKDG CSCGDYW
Subjt:  KRFHHFKDGKCSCGDYW

AT4G32450.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-7436.62Show/hide
Query:  DGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFRSLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALL----NSCGNLKLLEAGRRVDGLLKRTKFR
        DG S+  T      Q ++   +G      D     S  +L ++C EGKV  A+E I    N  Y +    L      CG+ + L+  + V   +  +   
Subjt:  DGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFRSLPSTVDLVALCEEGKVTDALEFIGQGANADYGLFTALL----NSCGNLKLLEAGRRVDGLLKRTKFR

Query:  GDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYG
         D+   N +IE+YS CG ++DA  VF+ MP+R+  TW  +I+ + +NG G+D +  F + K    +P+ E F+ +  AC +   + EG+++F+SM  EYG
Subjt:  GDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMKSVDLQPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYG

Query:  IIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPL-PLPRKQLATNMLEEKDRVRE
        IIP  EHY+ +V +L +  +L EA  FVE M  +P   +W+ L N +R+HG++ L DR ++++  LD S     +K  L P+    L    L+   +   
Subjt:  IIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAAKRPL-PLPRKQLATNMLEEKDRVRE

Query:  F--------RCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIV
        +          + P   E    LK L   M E GYVP ++  LHD+D+E+K + L  H+ER A     + TPAR+ +R++KNLR+C DCH+A+K+MSKIV
Subjt:  F--------RCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIKIMSKIV

Query:  GRELIVRDNKRFHHFKDGKCSCGDYW
        GRELI RD KRFHH KDG CSC +YW
Subjt:  GRELIVRDNKRFHHFKDGKCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTCGATACCTTCTCACACCACTCGAAACTCCATTATCTCTTCCAACTTCAACTCTCACTCTCCCGCCGTCATTAGAACCGCCACATTTTCCTCTTCTCGTTG
CCCCAAATCCTTCCGGCCGACGACCTTGTGCACCAACGCCCGCAATCGGAAGGTGCCTCGCCGCTACGACGGCCATAGCACGACGAACACACTCTCCAAATCTCAGAACC
AGACGAGCGATTCAGTTTTTAGTGGAAGCCAGAGTGTGGATAGCGATTTTCGCAGTCTTCCGTCGACGGTCGATTTAGTGGCTCTATGCGAGGAGGGTAAGGTAACGGAT
GCTCTGGAATTTATTGGTCAAGGTGCTAATGCCGATTATGGTCTTTTTACTGCTTTGCTGAATTCGTGTGGGAATTTGAAGTTGCTTGAGGCTGGAAGACGAGTAGATGG
GCTGTTGAAGCGAACGAAGTTTCGTGGCGATGTGGAATTGAACAATAAGTTGATTGAATTGTACTCGTGTTGTGGCTGTATGAAAGATGCACGCAGGGTGTTTGATAAAA
TGCCCGACAGGGATACAAGGACGTGGAATTTGATGATCAAGGGATATACAGAGAACGGGCTAGGAGATGATGGTTTGGCCTTGTTTGAGCAAATGAAGAGCGTGGATTTG
CAGCCAAATTCAGAAACATTTCAGCTGGTTTTAGCAGCTTGTGCCATGGCTGAAGCTGTGGAGGAAGGTATGGTTTACTTTAAATCAATGGAAAATGAATATGGAATCAT
TCCTGGATTTGAGCATTACTTGGGAGTTGTTGATGTTCTTGGGAAATGTAGCCATTTGATCGAAGCAGAGGAGTTCGTTGAGAAAATGCCCATCAAACCCACAACCAAAA
TCTGGGATGCCCTTAGGAACTATGCTCGACTCCATGGAAACATAGAGCTTGAAGATCGAGCTGAAGAGTTGATGCTCGCTCTCGACCCGTCCATGGCAGCCACGGCTGCC
AAGCGACCGCTCCCTCTGCCGAGGAAGCAATTGGCCACCAACATGCTGGAGGAGAAGGACAGGGTGAGGGAGTTCAGATGTGCAATCCCTTACAAGGAAGAGGCTGACGG
GAAGCTAAAAGGGTTGAATGGACAAATGAGAGAAGCAGGCTATGTGCCAGATACAAGATATGTGCTACATGACATTGATGAAGAGGCAAAACAGCAGGCTTTGCAATACC
ATAGCGAGCGGTTGGCCATTGCTTATGGATTGATCAGTACACCGGCAAGAACAACACTGAGGATCATCAAGAATCTTCGGATCTGCGGGGACTGCCATAGTGCAATCAAG
ATAATGTCAAAGATTGTTGGGAGAGAGTTGATTGTTAGAGATAATAAGCGGTTTCATCACTTTAAAGATGGCAAATGCTCTTGTGGGGATTACTGGAGCAAATTGGTTTG
GGCCGAGGGCAAACAGGCCCGGAGTCTGAGCAAAAGACGTCGCTCGGCTCGGCTTCTATATATAATGGGTTTCAGTCAGCGGGCTTTTACGTTACGAGAGCCCTCCAAGA
GTGGAACAAGTCTTCGCTTCGTCCCTCGCTGTCCATTGAAATTCTCATTTCTCTATTGCGTTGCGCACACTCTCCGAGCTCGCCTTCTCAAACTAGCGGCGACGGAAAGC
AGCCACTTCTATCACATTCTCCCCTTCTCTAAATCCTCCGCTCTCGTTTCTTTTGGTCTCTCTCGAAGTTTTTTTAGCTTTATTTTGGATTTGATGGCATCCAAGTTGCA
TCAGTTGCAATCAAAGGCCACTCAAGCTTCTCAATTTGCCTTGAAGCACGGGTCTTCCTATTATAAGCAGTTATTGGAGCAGAACAAGCAATACATCCAAGAACCAGCTA
CTGTGGAAAAATGCAACTTGCTTTCAAAGCAATTGTTGTATACTCGGCTTGCTAGCATCCCAGGTCGTTGTGAATCATTCTGGAAAGAACTTGATTATGTGAAAAACTTA
TGGAAGAACAGGCAGGAGCTGAAAGTTGAAGATGCTGGCATCGCTGCCTTATTCGGACTGGAGTGCTTTGCATGGTTTTGCGCGGGTGAGATCGTAGGAAGGGGCTTTAC
GTTCACAGAGTCAAGCGGACGGGAGACGGAGCAGACGAGGAGGGAGACACGGCATAGAAGACGAAGAACTTCGCCGCCGAAGATGGTCCGTAATGCCAGCAGTTCCCTTC
GCAAACGTGTCTCATCGGCGATTCTCCTCGTGCTCGTTTGTCTGAACGTCGTCGCCGAGTGTTCCAACAATGAATCTCGGGTTGAATGCAGCAAAACTTGTGTCGCAATC
AACTGCAACTCTGTTGGGATTCGGTACGGAAAGTTCTGCGGAGTAGGATGGACGGGTTGTGCTGGTGAAAAGCCTTGCGATGATCTTGATGCCTGTTGCAAAGTTCACGA
CGAATGTGTTGAAAGAAAAGTTCACTCCAAAATTCTGGCACAGTTGTTGGGCATGAGTCGGTTGCTTGTCTTCAAGCTCCAAGCCACCAATTCCTCCCCTCCAAATTCCA
TCAATGAATCCTCGAGGCTACAGTACATTGGGCTAAAATCACAAAAGGAGAACATAATTCCTGTAGCTTATGTTCAATACCAGTTCTGGACATCTTCCTTGGACTGCCAG
CATGTAGAGGCTGATTTTCATCCTCAAAATGCCTATACGGTGCTATACTATAGTGAATGGGTGAAATGTAAGGACTTGACTGTTGGCACTTTTGCCTCAATTATGGCGAC
CATTGAGCGGATTATACCATGTCACGAGAAGTTCAAGAGTTGTATTAAGAGAGTTCAGAAATCTGGGAAGGCTGGTTTCTCACAGGAGTGCCCATATGCCACAGCTGTTC
CTACAATGGTACAGGGCATGGATTTGGCCATCATGCTCAGCCAGTTCGACGAAGCCGGTGTACTCTCTTCTACTAGTTGCAGCACAGGTGCGTTCTTCGACCTTTCAGAT
CTCCTTCGCGCTTCTCGTTTCATCATCTTCAATCAGATTTCGCCTCCTTTTACGAATTCCATGGCGAATATTCCGGGGCAGTTGGTCTGGGAAATCGTTAAGAAAAACAA
CTGCTTCCTCGTTAAAGAGTTTGGAAGAGGTAATGCTGGCGTGCAATTCAGCAAAGAGCCCAACAACCTCTACAACCTCAACTCCTACAAGCATTCTGGCTTGGCAAACC
GGAAGACAGTAACCATTCAGCCAGCAGGCAAGGATTTGTCAGTATTGCTCGCGACAACAAAGACAAAGAAGCAGAACAAACCTGCGACTTTGCTCCACAAATCAGTCATG
AGGAAGGAATTTCCTCGTATGGTCAAGGCTGTGACTAATCAGATTGGAATTGAACGGAATGTGGGATGTTGCAGTGATGAGAAGCTAACATATCCACTTGATTCAGTGAG
GAGTGTACTTGTTGAATTGCAGGTAGCTGACAATTACTACCGCCCGGACTTGAAGAAGGCTGCTCTTGCAAAACTTAGCGCAGTTCACAGGAGCCTTAAGGTGGCCAAGT
CAGGTGTGAAAAAGAGGAACAGGCAGGCAGTTAAGCCCCGCGGTAGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTCGATACCTTCTCACACCACTCGAAACTCCATTATCTCTTCCAACTTCAACTCTCACTCTCCCGCCGTCATTAGAACCGCCACATTTTCCTCTTCTCGTTG
CCCCAAATCCTTCCGGCCGACGACCTTGTGCACCAACGCCCGCAATCGGAAGGTGCCTCGCCGCTACGACGGCCATAGCACGACGAACACACTCTCCAAATCTCAGAACC
AGACGAGCGATTCAGTTTTTAGTGGAAGCCAGAGTGTGGATAGCGATTTTCGCAGTCTTCCGTCGACGGTCGATTTAGTGGCTCTATGCGAGGAGGGTAAGGTAACGGAT
GCTCTGGAATTTATTGGTCAAGGTGCTAATGCCGATTATGGTCTTTTTACTGCTTTGCTGAATTCGTGTGGGAATTTGAAGTTGCTTGAGGCTGGAAGACGAGTAGATGG
GCTGTTGAAGCGAACGAAGTTTCGTGGCGATGTGGAATTGAACAATAAGTTGATTGAATTGTACTCGTGTTGTGGCTGTATGAAAGATGCACGCAGGGTGTTTGATAAAA
TGCCCGACAGGGATACAAGGACGTGGAATTTGATGATCAAGGGATATACAGAGAACGGGCTAGGAGATGATGGTTTGGCCTTGTTTGAGCAAATGAAGAGCGTGGATTTG
CAGCCAAATTCAGAAACATTTCAGCTGGTTTTAGCAGCTTGTGCCATGGCTGAAGCTGTGGAGGAAGGTATGGTTTACTTTAAATCAATGGAAAATGAATATGGAATCAT
TCCTGGATTTGAGCATTACTTGGGAGTTGTTGATGTTCTTGGGAAATGTAGCCATTTGATCGAAGCAGAGGAGTTCGTTGAGAAAATGCCCATCAAACCCACAACCAAAA
TCTGGGATGCCCTTAGGAACTATGCTCGACTCCATGGAAACATAGAGCTTGAAGATCGAGCTGAAGAGTTGATGCTCGCTCTCGACCCGTCCATGGCAGCCACGGCTGCC
AAGCGACCGCTCCCTCTGCCGAGGAAGCAATTGGCCACCAACATGCTGGAGGAGAAGGACAGGGTGAGGGAGTTCAGATGTGCAATCCCTTACAAGGAAGAGGCTGACGG
GAAGCTAAAAGGGTTGAATGGACAAATGAGAGAAGCAGGCTATGTGCCAGATACAAGATATGTGCTACATGACATTGATGAAGAGGCAAAACAGCAGGCTTTGCAATACC
ATAGCGAGCGGTTGGCCATTGCTTATGGATTGATCAGTACACCGGCAAGAACAACACTGAGGATCATCAAGAATCTTCGGATCTGCGGGGACTGCCATAGTGCAATCAAG
ATAATGTCAAAGATTGTTGGGAGAGAGTTGATTGTTAGAGATAATAAGCGGTTTCATCACTTTAAAGATGGCAAATGCTCTTGTGGGGATTACTGGAGCAAATTGGTTTG
GGCCGAGGGCAAACAGGCCCGGAGTCTGAGCAAAAGACGTCGCTCGGCTCGGCTTCTATATATAATGGGTTTCAGTCAGCGGGCTTTTACGTTACGAGAGCCCTCCAAGA
GTGGAACAAGTCTTCGCTTCGTCCCTCGCTGTCCATTGAAATTCTCATTTCTCTATTGCGTTGCGCACACTCTCCGAGCTCGCCTTCTCAAACTAGCGGCGACGGAAAGC
AGCCACTTCTATCACATTCTCCCCTTCTCTAAATCCTCCGCTCTCGTTTCTTTTGGTCTCTCTCGAAGTTTTTTTAGCTTTATTTTGGATTTGATGGCATCCAAGTTGCA
TCAGTTGCAATCAAAGGCCACTCAAGCTTCTCAATTTGCCTTGAAGCACGGGTCTTCCTATTATAAGCAGTTATTGGAGCAGAACAAGCAATACATCCAAGAACCAGCTA
CTGTGGAAAAATGCAACTTGCTTTCAAAGCAATTGTTGTATACTCGGCTTGCTAGCATCCCAGGTCGTTGTGAATCATTCTGGAAAGAACTTGATTATGTGAAAAACTTA
TGGAAGAACAGGCAGGAGCTGAAAGTTGAAGATGCTGGCATCGCTGCCTTATTCGGACTGGAGTGCTTTGCATGGTTTTGCGCGGGTGAGATCGTAGGAAGGGGCTTTAC
GTTCACAGAGTCAAGCGGACGGGAGACGGAGCAGACGAGGAGGGAGACACGGCATAGAAGACGAAGAACTTCGCCGCCGAAGATGGTCCGTAATGCCAGCAGTTCCCTTC
GCAAACGTGTCTCATCGGCGATTCTCCTCGTGCTCGTTTGTCTGAACGTCGTCGCCGAGTGTTCCAACAATGAATCTCGGGTTGAATGCAGCAAAACTTGTGTCGCAATC
AACTGCAACTCTGTTGGGATTCGGTACGGAAAGTTCTGCGGAGTAGGATGGACGGGTTGTGCTGGTGAAAAGCCTTGCGATGATCTTGATGCCTGTTGCAAAGTTCACGA
CGAATGTGTTGAAAGAAAAGTTCACTCCAAAATTCTGGCACAGTTGTTGGGCATGAGTCGGTTGCTTGTCTTCAAGCTCCAAGCCACCAATTCCTCCCCTCCAAATTCCA
TCAATGAATCCTCGAGGCTACAGTACATTGGGCTAAAATCACAAAAGGAGAACATAATTCCTGTAGCTTATGTTCAATACCAGTTCTGGACATCTTCCTTGGACTGCCAG
CATGTAGAGGCTGATTTTCATCCTCAAAATGCCTATACGGTGCTATACTATAGTGAATGGGTGAAATGTAAGGACTTGACTGTTGGCACTTTTGCCTCAATTATGGCGAC
CATTGAGCGGATTATACCATGTCACGAGAAGTTCAAGAGTTGTATTAAGAGAGTTCAGAAATCTGGGAAGGCTGGTTTCTCACAGGAGTGCCCATATGCCACAGCTGTTC
CTACAATGGTACAGGGCATGGATTTGGCCATCATGCTCAGCCAGTTCGACGAAGCCGGTGTACTCTCTTCTACTAGTTGCAGCACAGGTGCGTTCTTCGACCTTTCAGAT
CTCCTTCGCGCTTCTCGTTTCATCATCTTCAATCAGATTTCGCCTCCTTTTACGAATTCCATGGCGAATATTCCGGGGCAGTTGGTCTGGGAAATCGTTAAGAAAAACAA
CTGCTTCCTCGTTAAAGAGTTTGGAAGAGGTAATGCTGGCGTGCAATTCAGCAAAGAGCCCAACAACCTCTACAACCTCAACTCCTACAAGCATTCTGGCTTGGCAAACC
GGAAGACAGTAACCATTCAGCCAGCAGGCAAGGATTTGTCAGTATTGCTCGCGACAACAAAGACAAAGAAGCAGAACAAACCTGCGACTTTGCTCCACAAATCAGTCATG
AGGAAGGAATTTCCTCGTATGGTCAAGGCTGTGACTAATCAGATTGGAATTGAACGGAATGTGGGATGTTGCAGTGATGAGAAGCTAACATATCCACTTGATTCAGTGAG
GAGTGTACTTGTTGAATTGCAGGTAGCTGACAATTACTACCGCCCGGACTTGAAGAAGGCTGCTCTTGCAAAACTTAGCGCAGTTCACAGGAGCCTTAAGGTGGCCAAGT
CAGGTGTGAAAAAGAGGAACAGGCAGGCAGTTAAGCCCCGCGGTAGGAAGTGA
Protein sequenceShow/hide protein sequence
MASSIPSHTTRNSIISSNFNSHSPAVIRTATFSSSRCPKSFRPTTLCTNARNRKVPRRYDGHSTTNTLSKSQNQTSDSVFSGSQSVDSDFRSLPSTVDLVALCEEGKVTD
ALEFIGQGANADYGLFTALLNSCGNLKLLEAGRRVDGLLKRTKFRGDVELNNKLIELYSCCGCMKDARRVFDKMPDRDTRTWNLMIKGYTENGLGDDGLALFEQMKSVDL
QPNSETFQLVLAACAMAEAVEEGMVYFKSMENEYGIIPGFEHYLGVVDVLGKCSHLIEAEEFVEKMPIKPTTKIWDALRNYARLHGNIELEDRAEELMLALDPSMAATAA
KRPLPLPRKQLATNMLEEKDRVREFRCAIPYKEEADGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHSAIK
IMSKIVGRELIVRDNKRFHHFKDGKCSCGDYWSKLVWAEGKQARSLSKRRRSARLLYIMGFSQRAFTLREPSKSGTSLRFVPRCPLKFSFLYCVAHTLRARLLKLAATES
SHFYHILPFSKSSALVSFGLSRSFFSFILDLMASKLHQLQSKATQASQFALKHGSSYYKQLLEQNKQYIQEPATVEKCNLLSKQLLYTRLASIPGRCESFWKELDYVKNL
WKNRQELKVEDAGIAALFGLECFAWFCAGEIVGRGFTFTESSGRETEQTRRETRHRRRRTSPPKMVRNASSSLRKRVSSAILLVLVCLNVVAECSNNESRVECSKTCVAI
NCNSVGIRYGKFCGVGWTGCAGEKPCDDLDACCKVHDECVERKVHSKILAQLLGMSRLLVFKLQATNSSPPNSINESSRLQYIGLKSQKENIIPVAYVQYQFWTSSLDCQ
HVEADFHPQNAYTVLYYSEWVKCKDLTVGTFASIMATIERIIPCHEKFKSCIKRVQKSGKAGFSQECPYATAVPTMVQGMDLAIMLSQFDEAGVLSSTSCSTGAFFDLSD
LLRASRFIIFNQISPPFTNSMANIPGQLVWEIVKKNNCFLVKEFGRGNAGVQFSKEPNNLYNLNSYKHSGLANRKTVTIQPAGKDLSVLLATTKTKKQNKPATLLHKSVM
RKEFPRMVKAVTNQIGIERNVGCCSDEKLTYPLDSVRSVLVELQVADNYYRPDLKKAALAKLSAVHRSLKVAKSGVKKRNRQAVKPRGRK