; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028945 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028945
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153210:1859403..1874731
RNA-Seq ExpressionSgr028945
SyntenySgr028945
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR003888 - FY-rich, N-terminal
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150652.1 uncharacterized protein LOC111018735 isoform X1 [Momordica charantia]0.0e+0075.77Show/hide
Query:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI
        A +KEQADASLEIISIGSLYSG WDKKYWSSSRGKDRYPYPVGY A+RAYNGIKYKMEIHEGPKGPLFMILSMDG SFSGQTPDIAWE FQRKGCLH KI
Subjt:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI

Query:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA
        WHGKRSSCKVDGVE          RLLRELVAN+SGTAE+NLLPSNLCN ASGSAQT VEH+SA ECE AELI C ERSKI RKRSR+H IE  K   GA
Subjt:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA

Query:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV
         LK+VRN  PKIKSMTAKLSSS SVNE+NQSFCG YEK++ KG SLT  KN+  SNRP  F AAVPA++IEK M    ED +S +T+VAHN   DEKL  
Subjt:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV

Query:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEICS
                      HDRLSMDKLEGI+R+METDDNSGVASFQKDC D ED+HHHASDT DLKQ        SL KN+L    I++PEELVMDSHPEEICS
Subjt:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEICS

Query:  LNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDNL
        LN NSGSERND DSVGQDMVKSMMTFLLPQAIPLLK+TSGRKK +TSTLE+LPCD NTKDILPMEKEDREKQE+M TQH + +S VPSLELS+P+ L NL
Subjt:  LNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDNL

Query:  EDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMHN
        E EQH D  +ING+FSSI DDGRAKED+KPI+SCGF   GRM+DESLVNHHET GSKKS  SEIG+ LHGTCQE    V ECLP  +SSG  LFDETMHN
Subjt:  EDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMHN

Query:  NTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVS
        N RM+ECPLNLQINS KVDLRTPKDY+ESNGDEQ C   SFSQL HAQNA  S   S S++SEA NKEV+AGK+A GIDTLPSSQVPSIVYSRRKAQNVS
Subjt:  NTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVS

Query:  ILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASL--RAN
         L KE N PPNEAY T+CLG H GAE SST+SPHSSD  ++ILP NQQRED LSEPT GEQSPINCSYK TMKS+AGLEK+C  SP LDQ+EASL  RAN
Subjt:  ILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASL--RAN

Query:  KNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGA-LRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVL
         N HNS LL KPV K+DLEGCV EEMIE NN+ STNK ELSHD+GA  R NNK SYPHCNVELYREAEGMSKIVG YLHPMPV SVFL N+EN+IHICVL
Subjt:  KNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGA-LRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVL

Query:  CGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQ
        CGL VDKNRTL+TYTVE+ EPRLGYPSLVGHT V LPTL DYLGKEIA+ERTGFQLTPDG Y+VLIG ++T FCRTG+INCSCSTCTSGKFEEN+V IVQ
Subjt:  CGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQ

Query:  VKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL----------
        VKYGYVSIMASLKSADC HCILVCEPD LVAVGRGGRLHLWVMDSTWGKQIESHTIPS DHISPNLV+LKR+PK A+LVVGHNG GEFSL          
Subjt:  VKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL----------

Query:  ---------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANS
                                    N NS DYVK+L  ATN SSKNTEE SS++ KDTAIWL ASTISDS+DSH YLP DCQINH GLWKLALLANS
Subjt:  ---------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANS

Query:  TVTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAAD
        TVTFG E+DLRASAIGAS+GRGIIGT+DGLVY+WELSTGNKLGTLLRF+GASV CIATD+RE GVVAVAAD
Subjt:  TVTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAAD

XP_022150653.1 uncharacterized protein LOC111018735 isoform X2 [Momordica charantia]0.0e+0077.43Show/hide
Query:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI
        A +KEQADASLEIISIGSLYSG WDKKYWSSSRGKDRYPYPVGY A+RAYNGIKYKMEIHEGPKGPLFMILSMDG SFSGQTPDIAWE FQRKGCLH KI
Subjt:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI

Query:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA
        WHGKRSSCKVDGVE          RLLRELVAN+SGTAE+NLLPSNLCN ASGSAQT VEH+SA ECE AELI C ERSKI RKRSR+H IE  K   GA
Subjt:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA

Query:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV
         LK+VRN  PKIKSMTAKLSSS SVNE+NQSFCG YEK++ KG SLT  KN+  SNRP  F AAVPA++IEK M    ED +S +T+VAHN   DEKL  
Subjt:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV

Query:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEICS
                      HDRLSMDKLEGI+R+METDDNSGVASFQKDC D ED+HHHASDT DLKQ        SL KN+L    I++PEELVMDSHPEEICS
Subjt:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEICS

Query:  LNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDNL
        LN NSGSERND DSVGQDMVKSMMTFLLPQAIPLLK+TSGRKK +TSTLE+LPCD NTKDILPMEKEDREKQE+M TQH + +S VPSLELS+P+ L NL
Subjt:  LNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDNL

Query:  EDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMHN
        E EQH D  +ING+FSSI DDGRAKED+KPI+SCGF   GRM+DESLVNHHET GSKKS  SEIG+ LHGTCQE    V ECLP  +SSG  LFDETMHN
Subjt:  EDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMHN

Query:  NTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVS
        N RM+ECPLNLQINS KVDLRTPKDY+ESNGDEQ C   SFSQL HAQNA  S   S S++SEA NKEV+AGK+A GIDTLPSSQVPSIVYSRRKAQNVS
Subjt:  NTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVS

Query:  ILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASL--RAN
         L KE N PPNEAY T+CLG H GAE SST+SPHSSD  ++ILP NQQRED LSEPT GEQSPINCSYK TMKS+AGLEK+C  SP LDQ+EASL  RAN
Subjt:  ILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASL--RAN

Query:  KNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGA-LRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVL
         N HNS LL KPV K+DLEGCV EEMIE NN+ STNK ELSHD+GA  R NNK SYPHCNVELYREAEGMSKIVG YLHPMPV SVFL N+EN+IHICVL
Subjt:  KNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGA-LRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVL

Query:  CGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQ
        CGL VDKNRTL+TYTVE+ EPRLGYPSLVGHT V LPTL DYLGKEIA+ERTGFQLTPDG Y+VLIG ++T FCRTG+INCSCSTCTSGKFEEN+V IVQ
Subjt:  CGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQ

Query:  VKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL
        VKYGYVSIMASLKSADC HCILVCEPD LVAVGRGGRLHLWVMDSTWGKQIESHTIPS DHISPNLV+LKR+PK A+LVVGHNG GEFSL
Subjt:  VKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL

XP_022945885.1 uncharacterized protein LOC111449993 [Cucurbita moschata]0.0e+0069.42Show/hide
Query:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI
        A LK+QADASLEIISIGSLYSGPW KKYWSSSRGKDR+PYPVGY AVR YNGIKYK+E+HEGPKGPLFMILSMDG SFSGQTPDIAWEMFQRK CLHTKI
Subjt:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI

Query:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA
        WHGKRSSCKVDGVE          RLLRELVANVSGTAEL+  PSNLC+KASGSAQTAVE +   EC+ A+L+S  ERSK ARKRSR   IET K  NG+
Subjt:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA

Query:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV
        NLK+ RN G  I+SMTA+L+S  S N+ NQ FC                                     EK +  QEE AVSE+TQVAHN SIDEK   
Subjt:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV

Query:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKD-CSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLT--HIIIPEELVMDSHPEEIC
                      HDRLS DKLE ISR+ME DDNSGVASFQKD C D EDN+H ASDT D KQVIFESAP+S EK +L    III EE VMD+ PEEIC
Subjt:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKD-CSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLT--HIIIPEELVMDSHPEEIC

Query:  SLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDN
        SLN NSGS+RND DSVGQDMVKSMMT+LLPQA+PLL+  SGRKKTATS LET PCD NTKD+ P E+E REKQEYM+ QH N +  VP LEL K  GLDN
Subjt:  SLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDN

Query:  LEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMH
        LE EQH D AN+NG+FSS  D+ +AKEDMKP+DSCGF F GRM +E LVNHHE +G KKSR SE G+ L GTCQE    VSEC P CSSSGRVL      
Subjt:  LEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMH

Query:  NNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNV
             +ECPLNLQINS KVD +TP+DY E NGDEQ C  ESFSQLSHAQ+AN SSVRS SA+SEA NKEVI GKEA+GIDT P SQVPSIVYSRRK Q V
Subjt:  NNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNV

Query:  SILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANK
        S LAKE N  P+EA  TS LG H G E SSTKSPHSS IN+  LP NQ REDLLSEPT  E  PINCSY+TTMK++ GLEK+C RSP LD +EAS + + 
Subjt:  SILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANK

Query:  NKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGALR-DNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLC
          HNSGLLDK V K+DLEGCV   MIE NN+LS NK EL +D+G    D +K SYPH NVELYREAEGMSKIVG YLHPMPV S+FLSN+ENVIHICVLC
Subjt:  NKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGALR-DNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLC

Query:  GLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQV
        GLSV+KNRTL+TYTVE+KEPRLGYPS+VGHT VM+PTLKDYLGKE+A+ERTGFQ TPDGN+LVL+G ++   CRTGSINC CSTCTS KFEEN+VKIVQV
Subjt:  GLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQV

Query:  KYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL-----------
        KYGYVSI+A+L+S D VHCILVC PD LVAVG GGRLHLWVMDSTW KQIESHTIPSEDHISPNLVEL++VP+ ++LVVGHNG+GEFSL           
Subjt:  KYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL-----------

Query:  --------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANST
                                   N NSRDYVKEL CATNTSS   +E SS++LKDTAIWLFAST SDS+D H YLP  CQ NH  LWKL LLANST
Subjt:  --------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANST

Query:  VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAA
        VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASV CIATDNRE GVVAVAA
Subjt:  VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAA

XP_022968243.1 uncharacterized protein LOC111467538 isoform X1 [Cucurbita maxima]0.0e+0069.03Show/hide
Query:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI
        A LK+QADASLEIISIGSLYSGPW KKYWSSSRGKDR+PYPVGY AVR YNGIKYK+E+HEGPKGPLFMILSMDG SFSGQTPDIAWEMFQRK CLHTKI
Subjt:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI

Query:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA
        WHGKRSSCKVDGVE          RLLRELVANVSGTAEL+  PSNLC+KASGSAQTAVE +   EC+ A+L+S  ERSK ARKRSR   IET K  NG+
Subjt:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA

Query:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV
        NLK+ RN G  I+SMTA+ +S  S N+ NQ FC                                     EK +  QEE AVSE+TQVAHN SI +K   
Subjt:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV

Query:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKD-CSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEIC
                      HDRLS DKLE ISR+ME DDNSG ASFQKD C D EDN+H ASDT D KQVIFESAP+S EK +L    IIIPEE VMD+HPEEIC
Subjt:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKD-CSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEIC

Query:  SLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDN
        S N NSGS+RND DSVGQDMVKSMMT+LLPQA+PLL+  S RKKTATS LET PCD NTKD+   EKE REKQEYM+ QH N +  VP LEL K  GLDN
Subjt:  SLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDN

Query:  LEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMH
        LE  QH D ANING+FSS  D+ +AKEDMKP+D  GF F GRM +E LVNHHE +GSKKSR SE GK L GTCQE    VSEC P CS SGRVL      
Subjt:  LEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMH

Query:  NNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNV
             +ECPLNLQ NS KVD +TP+DY ESNGDEQ C  ESFSQLSHAQ+AN SSVRS SA+SEA NKEVI GKEA+GIDT P SQVPSIVYSRRKAQ V
Subjt:  NNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNV

Query:  SILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANK
        S LAKE N  P+EA  TS L  H G E SSTKSPHSS IN+  LP NQ REDLLSEPT  E  PINCSY+TTMK++ GLEK+C RSP LD +EAS + + 
Subjt:  SILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANK

Query:  NKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIG-ALRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLC
          HNSGLLDK V K+DLEGCV   MIE NN+LS NK EL  D+G   RD +K SYPH NVELYREAEGMSKIVG YLHPMPV S+FLSN+ENVIHICVLC
Subjt:  NKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIG-ALRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLC

Query:  GLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQV
        GLSV+KNRTL+TYTVE+KEPRLGYPS+VGHT VM+PTLKDYLGKE+A+ERTGFQ T DGN+LVL+G I+   CRTGSINC CSTCTS KFEEN+VKIVQV
Subjt:  GLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQV

Query:  KYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL-----------
        KYGYVSI+A+L+S D VHCILVC PD LVAVG GGRLHLWVMDSTW KQIE HTIPSEDHISPNLVEL++VP+ ++LVVGHNG+GEFSL           
Subjt:  KYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL-----------

Query:  --------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANST
                                   NFNSRDYVKEL CATNTSS   +E SS++LKDTAIWLFAST SDS+D H YLP  CQ NH  LWKL LLANST
Subjt:  --------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANST

Query:  VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAA
        VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASV CIATDNRE GVVAVA+
Subjt:  VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAA

XP_023542996.1 uncharacterized protein LOC111802747 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0069.58Show/hide
Query:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI
        A LK+Q+DASLEIISIGSLYSGPW KKYWSSSRGKDR+PYPVGY AVR YNGIKYK+E+HEGPKGPLFMILSMDG SFSGQTPDIAWEMFQRK CLHTKI
Subjt:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI

Query:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA
        WHGKRSSCKVDGVE          RLLRELVANVSGTAEL+  PSNLC+KASGSAQTAVE +   EC+ A+L+S  ERSK ARKRSR   IET K   G+
Subjt:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA

Query:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV
        NLK+ RN G +I+SMTA+L+S  S N+ NQ FC                                     EK +  QEE AV E+TQVAHN SIDEK   
Subjt:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV

Query:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKD-CSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLT--HIIIPEELVMDSHPEEIC
                      HDRLS DKLE ISR+ME DDNSGVASFQKD C D EDN+H ASDT D KQVIFESAP+S EK +L    IIIPEE V+D+HPEEIC
Subjt:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKD-CSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLT--HIIIPEELVMDSHPEEIC

Query:  SLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDN
        SLN NSGS+RND DSVGQDMVKSMMT+LLPQA+PLL+  SGRKKTATS LET PCD NTKD+ P EKE REKQEYM+ QH N +  VP LEL K  GLDN
Subjt:  SLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDN

Query:  LEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMH
        LE EQH D ANING+FSS  D+ +AKEDMKP+DSCGF F GRM +E LVNHHE +GSKKSR SE G+ L GTCQE    VSEC P CSSSGRVL      
Subjt:  LEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMH

Query:  NNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNV
             +ECPLNLQINS KVD +TP+DY ESNGDEQ C  ESFSQ SHAQ+AN SSVRS SA+SE  NKEV+ GKEA+GIDT P SQVPSIVYSRRKAQ V
Subjt:  NNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNV

Query:  SILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANK
        S LAKE N  P+EA  TS LG H G E SS+KSPHSS IN+  LP NQ REDLLSEPT  E  PINCSY+TTMK++ GLEK+C  SP LD +EAS + + 
Subjt:  SILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANK

Query:  NKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIG-ALRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLC
          HNSGLLDK V K+DLEGCV   MIE NN+LS NK EL HD+G   RD +K SYPH NVELYREAEGMSKIVG YLHPMPV S+FLSN+ENVIHICVLC
Subjt:  NKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIG-ALRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLC

Query:  GLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQV
        GLSV+KNRTL+TYTVE+KEPRLGYPS+VGHT VM+PTLKDYLGKE+A+ERTGFQ TPDGN+LVL+G I+   CRTGSINC CSTCTS KFEEN+VKIVQV
Subjt:  GLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQV

Query:  KYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL-----------
        KYGYVSI+A+L+S D VHCILVC PD LVAVG GGRLHLWVMDSTW KQIESHTIPSEDHISPNLVEL++VP+ ++LVVGHNG+GEFSL           
Subjt:  KYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL-----------

Query:  --------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANST
                                   NFNSRDYVKEL CATNTSS   +E SS++LKDTAIWLFAST SDS+D H YLP  CQ NH  LWKL LLANST
Subjt:  --------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANST

Query:  VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAA
        VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASV CIATDNRE GVVAVAA
Subjt:  VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAA

TrEMBL top hitse value%identityAlignment
A0A6J1DA01 uncharacterized protein LOC111018735 isoform X20.0e+0077.43Show/hide
Query:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI
        A +KEQADASLEIISIGSLYSG WDKKYWSSSRGKDRYPYPVGY A+RAYNGIKYKMEIHEGPKGPLFMILSMDG SFSGQTPDIAWE FQRKGCLH KI
Subjt:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI

Query:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA
        WHGKRSSCKVDGVE          RLLRELVAN+SGTAE+NLLPSNLCN ASGSAQT VEH+SA ECE AELI C ERSKI RKRSR+H IE  K   GA
Subjt:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA

Query:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV
         LK+VRN  PKIKSMTAKLSSS SVNE+NQSFCG YEK++ KG SLT  KN+  SNRP  F AAVPA++IEK M    ED +S +T+VAHN   DEKL  
Subjt:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV

Query:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEICS
                      HDRLSMDKLEGI+R+METDDNSGVASFQKDC D ED+HHHASDT DLKQ        SL KN+L    I++PEELVMDSHPEEICS
Subjt:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEICS

Query:  LNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDNL
        LN NSGSERND DSVGQDMVKSMMTFLLPQAIPLLK+TSGRKK +TSTLE+LPCD NTKDILPMEKEDREKQE+M TQH + +S VPSLELS+P+ L NL
Subjt:  LNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDNL

Query:  EDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMHN
        E EQH D  +ING+FSSI DDGRAKED+KPI+SCGF   GRM+DESLVNHHET GSKKS  SEIG+ LHGTCQE    V ECLP  +SSG  LFDETMHN
Subjt:  EDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMHN

Query:  NTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVS
        N RM+ECPLNLQINS KVDLRTPKDY+ESNGDEQ C   SFSQL HAQNA  S   S S++SEA NKEV+AGK+A GIDTLPSSQVPSIVYSRRKAQNVS
Subjt:  NTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVS

Query:  ILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASL--RAN
         L KE N PPNEAY T+CLG H GAE SST+SPHSSD  ++ILP NQQRED LSEPT GEQSPINCSYK TMKS+AGLEK+C  SP LDQ+EASL  RAN
Subjt:  ILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASL--RAN

Query:  KNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGA-LRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVL
         N HNS LL KPV K+DLEGCV EEMIE NN+ STNK ELSHD+GA  R NNK SYPHCNVELYREAEGMSKIVG YLHPMPV SVFL N+EN+IHICVL
Subjt:  KNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGA-LRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVL

Query:  CGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQ
        CGL VDKNRTL+TYTVE+ EPRLGYPSLVGHT V LPTL DYLGKEIA+ERTGFQLTPDG Y+VLIG ++T FCRTG+INCSCSTCTSGKFEEN+V IVQ
Subjt:  CGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQ

Query:  VKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL
        VKYGYVSIMASLKSADC HCILVCEPD LVAVGRGGRLHLWVMDSTWGKQIESHTIPS DHISPNLV+LKR+PK A+LVVGHNG GEFSL
Subjt:  VKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL

A0A6J1DAQ6 uncharacterized protein LOC111018735 isoform X30.0e+0077.1Show/hide
Query:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI
        A +KEQADASLEIISIGSLYSG WDKKYWSSSRGKDRYPYPVGY A+RAYNGIKYKMEIHEGPKGPLFMILSMDG SFSGQTPDIAWE FQRKGCLH KI
Subjt:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI

Query:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA
        WHGKRSSCKVDGVE          RLLRELVAN+SGTAE+NLLPSNLCN ASGSAQT VEH+SA ECE AELI C ERSKI RKRSR+H IE  K   GA
Subjt:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA

Query:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV
         LK+VRN  PKIKSMTAKLSSS SVNE+NQSFCG YEK++ KG SLT  KN+  SNRP  F AAVPA++IEK M    ED +S +T+VAHN   DEKL  
Subjt:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV

Query:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEICS
                      HDRLSMDKLEGI+R+METDDNSGVASFQKDC D ED+HHHASDT DLKQ        SL KN+L    I++PEELVMDSHPEEICS
Subjt:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEICS

Query:  LNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDNL
        LN NSGSERND DSVGQDMVKSMMTFLLPQAIPLLK+TSGRKK +TSTLE+LPCD NTKDILPMEKEDREKQE+M TQH + +S VPSLELS+P+ L NL
Subjt:  LNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDNL

Query:  EDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMHN
        E EQH D  +ING+FSSI DDGRAKED+KPI+SCGF   GRM+DESLVNHHET GSKKS  SEIG+ LHGTCQE    V ECLP  +SSG  LFDETMHN
Subjt:  EDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMHN

Query:  NTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVS
        N RM+ECPLNLQINS KVDLRTPKDY+ESNGDEQ C   SFSQL HAQNA  S   S S++SEA NKEV+AGK+A GIDTLPSSQVPSIVYSRRKAQNVS
Subjt:  NTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVS

Query:  ILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASL--RAN
         L KE N PPNEAY T+CLG H GAE SST+SPHSSD  ++ILP NQQRED LSEPT GEQSPINCSYK TMKS+AGLEK+C  SP LDQ+EASL  RAN
Subjt:  ILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASL--RAN

Query:  KNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGA-LRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVL
         N HNS LL KPV K+DLEGCV EEMIE NN+ STNK ELSHD+GA  R NNK SYPHCNVELYREAEGMSKIVG YLHPMPV SVFL N+EN+IHICVL
Subjt:  KNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGA-LRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVL

Query:  CGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQ
        CGL VDKNRTL+TYTVE+ EPRLGYPSLVGHT V LPTL DYLGKEIA+ERTGFQLTPDG Y+VLIG ++T FCRTG+INCSCSTCTSGKFEEN+V IVQ
Subjt:  CGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQ

Query:  VKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWG
        VKYGYVSIMASLKSADC HCILVCEPD LVAVGRGGRLHLWVMDSTWG
Subjt:  VKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWG

A0A6J1DBA8 uncharacterized protein LOC111018735 isoform X10.0e+0075.77Show/hide
Query:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI
        A +KEQADASLEIISIGSLYSG WDKKYWSSSRGKDRYPYPVGY A+RAYNGIKYKMEIHEGPKGPLFMILSMDG SFSGQTPDIAWE FQRKGCLH KI
Subjt:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI

Query:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA
        WHGKRSSCKVDGVE          RLLRELVAN+SGTAE+NLLPSNLCN ASGSAQT VEH+SA ECE AELI C ERSKI RKRSR+H IE  K   GA
Subjt:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA

Query:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV
         LK+VRN  PKIKSMTAKLSSS SVNE+NQSFCG YEK++ KG SLT  KN+  SNRP  F AAVPA++IEK M    ED +S +T+VAHN   DEKL  
Subjt:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV

Query:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEICS
                      HDRLSMDKLEGI+R+METDDNSGVASFQKDC D ED+HHHASDT DLKQ        SL KN+L    I++PEELVMDSHPEEICS
Subjt:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEICS

Query:  LNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDNL
        LN NSGSERND DSVGQDMVKSMMTFLLPQAIPLLK+TSGRKK +TSTLE+LPCD NTKDILPMEKEDREKQE+M TQH + +S VPSLELS+P+ L NL
Subjt:  LNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDNL

Query:  EDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMHN
        E EQH D  +ING+FSSI DDGRAKED+KPI+SCGF   GRM+DESLVNHHET GSKKS  SEIG+ LHGTCQE    V ECLP  +SSG  LFDETMHN
Subjt:  EDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMHN

Query:  NTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVS
        N RM+ECPLNLQINS KVDLRTPKDY+ESNGDEQ C   SFSQL HAQNA  S   S S++SEA NKEV+AGK+A GIDTLPSSQVPSIVYSRRKAQNVS
Subjt:  NTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVS

Query:  ILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASL--RAN
         L KE N PPNEAY T+CLG H GAE SST+SPHSSD  ++ILP NQQRED LSEPT GEQSPINCSYK TMKS+AGLEK+C  SP LDQ+EASL  RAN
Subjt:  ILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASL--RAN

Query:  KNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGA-LRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVL
         N HNS LL KPV K+DLEGCV EEMIE NN+ STNK ELSHD+GA  R NNK SYPHCNVELYREAEGMSKIVG YLHPMPV SVFL N+EN+IHICVL
Subjt:  KNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGA-LRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVL

Query:  CGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQ
        CGL VDKNRTL+TYTVE+ EPRLGYPSLVGHT V LPTL DYLGKEIA+ERTGFQLTPDG Y+VLIG ++T FCRTG+INCSCSTCTSGKFEEN+V IVQ
Subjt:  CGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQ

Query:  VKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL----------
        VKYGYVSIMASLKSADC HCILVCEPD LVAVGRGGRLHLWVMDSTWGKQIESHTIPS DHISPNLV+LKR+PK A+LVVGHNG GEFSL          
Subjt:  VKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL----------

Query:  ---------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANS
                                    N NS DYVK+L  ATN SSKNTEE SS++ KDTAIWL ASTISDS+DSH YLP DCQINH GLWKLALLANS
Subjt:  ---------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANS

Query:  TVTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAAD
        TVTFG E+DLRASAIGAS+GRGIIGT+DGLVY+WELSTGNKLGTLLRF+GASV CIATD+RE GVVAVAAD
Subjt:  TVTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAAD

A0A6J1G291 uncharacterized protein LOC1114499930.0e+0069.42Show/hide
Query:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI
        A LK+QADASLEIISIGSLYSGPW KKYWSSSRGKDR+PYPVGY AVR YNGIKYK+E+HEGPKGPLFMILSMDG SFSGQTPDIAWEMFQRK CLHTKI
Subjt:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI

Query:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA
        WHGKRSSCKVDGVE          RLLRELVANVSGTAEL+  PSNLC+KASGSAQTAVE +   EC+ A+L+S  ERSK ARKRSR   IET K  NG+
Subjt:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA

Query:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV
        NLK+ RN G  I+SMTA+L+S  S N+ NQ FC                                     EK +  QEE AVSE+TQVAHN SIDEK   
Subjt:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV

Query:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKD-CSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLT--HIIIPEELVMDSHPEEIC
                      HDRLS DKLE ISR+ME DDNSGVASFQKD C D EDN+H ASDT D KQVIFESAP+S EK +L    III EE VMD+ PEEIC
Subjt:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKD-CSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLT--HIIIPEELVMDSHPEEIC

Query:  SLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDN
        SLN NSGS+RND DSVGQDMVKSMMT+LLPQA+PLL+  SGRKKTATS LET PCD NTKD+ P E+E REKQEYM+ QH N +  VP LEL K  GLDN
Subjt:  SLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDN

Query:  LEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMH
        LE EQH D AN+NG+FSS  D+ +AKEDMKP+DSCGF F GRM +E LVNHHE +G KKSR SE G+ L GTCQE    VSEC P CSSSGRVL      
Subjt:  LEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMH

Query:  NNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNV
             +ECPLNLQINS KVD +TP+DY E NGDEQ C  ESFSQLSHAQ+AN SSVRS SA+SEA NKEVI GKEA+GIDT P SQVPSIVYSRRK Q V
Subjt:  NNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNV

Query:  SILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANK
        S LAKE N  P+EA  TS LG H G E SSTKSPHSS IN+  LP NQ REDLLSEPT  E  PINCSY+TTMK++ GLEK+C RSP LD +EAS + + 
Subjt:  SILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANK

Query:  NKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGALR-DNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLC
          HNSGLLDK V K+DLEGCV   MIE NN+LS NK EL +D+G    D +K SYPH NVELYREAEGMSKIVG YLHPMPV S+FLSN+ENVIHICVLC
Subjt:  NKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGALR-DNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLC

Query:  GLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQV
        GLSV+KNRTL+TYTVE+KEPRLGYPS+VGHT VM+PTLKDYLGKE+A+ERTGFQ TPDGN+LVL+G ++   CRTGSINC CSTCTS KFEEN+VKIVQV
Subjt:  GLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQV

Query:  KYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL-----------
        KYGYVSI+A+L+S D VHCILVC PD LVAVG GGRLHLWVMDSTW KQIESHTIPSEDHISPNLVEL++VP+ ++LVVGHNG+GEFSL           
Subjt:  KYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL-----------

Query:  --------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANST
                                   N NSRDYVKEL CATNTSS   +E SS++LKDTAIWLFAST SDS+D H YLP  CQ NH  LWKL LLANST
Subjt:  --------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANST

Query:  VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAA
        VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASV CIATDNRE GVVAVAA
Subjt:  VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAA

A0A6J1HXG7 uncharacterized protein LOC111467538 isoform X10.0e+0069.03Show/hide
Query:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI
        A LK+QADASLEIISIGSLYSGPW KKYWSSSRGKDR+PYPVGY AVR YNGIKYK+E+HEGPKGPLFMILSMDG SFSGQTPDIAWEMFQRK CLHTKI
Subjt:  AHLKEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKI

Query:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA
        WHGKRSSCKVDGVE          RLLRELVANVSGTAEL+  PSNLC+KASGSAQTAVE +   EC+ A+L+S  ERSK ARKRSR   IET K  NG+
Subjt:  WHGKRSSCKVDGVE----------RLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGA

Query:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV
        NLK+ RN G  I+SMTA+ +S  S N+ NQ FC                                     EK +  QEE AVSE+TQVAHN SI +K   
Subjt:  NLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIV

Query:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKD-CSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEIC
                      HDRLS DKLE ISR+ME DDNSG ASFQKD C D EDN+H ASDT D KQVIFESAP+S EK +L    IIIPEE VMD+HPEEIC
Subjt:  IVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKD-CSDIEDNHHHASDTLDLKQVIFESAPVSLEKNDLTH--IIIPEELVMDSHPEEIC

Query:  SLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDN
        S N NSGS+RND DSVGQDMVKSMMT+LLPQA+PLL+  S RKKTATS LET PCD NTKD+   EKE REKQEYM+ QH N +  VP LEL K  GLDN
Subjt:  SLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQHRNCESAVPSLELSKPAGLDN

Query:  LEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMH
        LE  QH D ANING+FSS  D+ +AKEDMKP+D  GF F GRM +E LVNHHE +GSKKSR SE GK L GTCQE    VSEC P CS SGRVL      
Subjt:  LEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQE----VSECLPRCSSSGRVLFDETMH

Query:  NNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNV
             +ECPLNLQ NS KVD +TP+DY ESNGDEQ C  ESFSQLSHAQ+AN SSVRS SA+SEA NKEVI GKEA+GIDT P SQVPSIVYSRRKAQ V
Subjt:  NNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNV

Query:  SILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANK
        S LAKE N  P+EA  TS L  H G E SSTKSPHSS IN+  LP NQ REDLLSEPT  E  PINCSY+TTMK++ GLEK+C RSP LD +EAS + + 
Subjt:  SILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANK

Query:  NKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIG-ALRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLC
          HNSGLLDK V K+DLEGCV   MIE NN+LS NK EL  D+G   RD +K SYPH NVELYREAEGMSKIVG YLHPMPV S+FLSN+ENVIHICVLC
Subjt:  NKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIG-ALRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLC

Query:  GLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQV
        GLSV+KNRTL+TYTVE+KEPRLGYPS+VGHT VM+PTLKDYLGKE+A+ERTGFQ T DGN+LVL+G I+   CRTGSINC CSTCTS KFEEN+VKIVQV
Subjt:  GLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQV

Query:  KYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL-----------
        KYGYVSI+A+L+S D VHCILVC PD LVAVG GGRLHLWVMDSTW KQIE HTIPSEDHISPNLVEL++VP+ ++LVVGHNG+GEFSL           
Subjt:  KYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSL-----------

Query:  --------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANST
                                   NFNSRDYVKEL CATNTSS   +E SS++LKDTAIWLFAST SDS+D H YLP  CQ NH  LWKL LLANST
Subjt:  --------------------------CNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPIDCQINHVGLWKLALLANST

Query:  VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAA
        VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASV CIATDNRE GVVAVA+
Subjt:  VTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAA

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic1.1e-10335.2Show/hide
Query:  YLMGLCRNIDTL---IKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRCELREIDEGRKVHCQIEKVGIECSSAMFEG
        YL+ +C +   L    + HGLL+  G   +L   T L  +Y     V  AR VFD+MP+ D                                       
Subjt:  YLMGLCRNIDTL---IKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRCELREIDEGRKVHCQIEKVGIECSSAMFEG

Query:  ILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFD
             +VSW +++AGY QN  A   L +   M E  ++ +  T+ S++ A + LR +  GK +HGYA+++  +    ++T  +DMY KCG    A  +FD
Subjt:  ILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFD

Query:  ELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHIIDDAYVIF
         +   ++VSW +MI  Y Q   P EA+ +F   +   + P  V+    L +C+  G+L  G  +H L ++LGL+   +V N+LI MY KC  +D A  +F
Subjt:  ELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHIIDDAYVIF

Query:  HGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARM
          +  + +++WN+MI G+AQNG   +AL  F+ MRS +  PD  T VS ++A A L        +H   ++  L   N+++ TAL++ YAKCG    AR+
Subjt:  HGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARM

Query:  VFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEA
        +FD M ++++ TW+AMI GYG  G G  +L +F +M K  +KPN V F++V+SACS+SG+V+ G + F  M ++Y+ + SM HY  MVDLL R+GRL+EA
Subjt:  VFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEA

Query:  LDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPGAHLKE
         DFI +MPV+P +++YGA L  C ++   +  E     + +L+P++  Y+VLL+N+Y +   W +V QVR  M ++GL K PG  + E
Subjt:  LDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPGAHLKE

Q9C507 Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial3.4e-10232.21Show/hide
Query:  KFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRCELRE-------------IDEG------------------------
        K HG +I  G+  + + +T L+ +YG  G++  A  VFD MP  D  AW  ++   L               +D+G                        
Subjt:  KFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRCELRE-------------IDEG------------------------

Query:  RKVHCQIEKVGIE-----CSSAM---------------FEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQ
        R VH QI +   +     C+S +               FE I  KN VSWT+MI+ Y + + +E+ L  F+ M ++ +E N  TL S++++C  +  + +
Subjt:  RKVHCQIEKVGIE-----CSSAM---------------FEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQ

Query:  GKWVHGYAIKNVVEFS-SFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNL
        GK VHG+A++  ++ +   L+   +++Y +CG+  D   +   +   ++V+W ++I  Y   G   +AL LF   +   + P++ T AS +S+C  +G +
Subjt:  GKWVHGYAIKNVVEFS-SFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNL

Query:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSV
         LG  +HG  I+  + +  V+N+LIDMY+K   +D A  +F+ +  + V+TWNSM+ G++QNG++ EA+ LF+ M  S    + +T ++ + A +++GS+
Subjt:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSV

Query:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSG
        + G  +H   I  GL   +L+  TAL++ YAKCGD  +A  VF +M  ++I++WS+MI  YG+ G    +++ F+ M++   KPNEV+FM VLSAC +SG
Subjt:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSG

Query:  MVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYAS
         V+EG +Y+ ++ + +   P+ +H+AC +DLL+RSG L EA   IK+MP   D S++G+ ++GC ++ + D+ + +  ++  +  ++  YY LLSN+YA 
Subjt:  MVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYAS

Query:  DGRWGQVNQVRELMQQRGLNKVPG
        +G W +  ++R  M+   L KVPG
Subjt:  DGRWGQVNQVRELMQQRGLNKVPG

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136002.7e-9932.17Show/hide
Query:  HGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRCELREIDEGRKVHCQIEKVG-IECSSAMFEGILDKNVVSWTSMIAGYV
        H  +I  G    +    +L+  Y   G +E  R VFD+MP  + Y W  ++                + K+G ++ + ++F  + +++  +W SM++G+ 
Subjt:  HGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRCELREIDEGRKVHCQIEKVG-IECSSAMFEGILDKNVVSWTSMIAGYV

Query:  QNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGY
        Q+D  EE L  F  M +     N+++  S+++AC+ L  +++G  VH    K+      ++ +  +DMY KCG   DA  +FDE+   ++VSW ++I  +
Subjt:  QNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGY

Query:  TQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLG--LEECAVKNALIDMYAKCHIIDDAYVIFHG--------------
         Q G   EAL +F   + S + P+ VT ASV+S+C+    + +G  VHG  +K      +  + NA +DMYAKC  I +A  IF                
Subjt:  TQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLG--LEECAVKNALIDMYAKCHIIDDAYVIFHG--------------

Query:  -----------------VLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEGL-FSS----NLY
                         + +++V++WN++I+GY QNG   EAL LF L++  S  P   +  + L A A L  + +G   H + +K G  F S    +++
Subjt:  -----------------VLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEGL-FSS----NLY

Query:  IGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPS
        +G +L++ Y KCG  +   +VF  M +++ ++W+AMI G+   G G+ +L +F +ML+   KP+ +  + VLSAC ++G V+EG  YF SM++D+   P 
Subjt:  IGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPS

Query:  MKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNK
          HY CMVDLL R+G L+EA   I++MP+QPD  ++G+ L  C ++    LG+ V  ++L++ P+ +  YVLLSN+YA  G+W  V  VR+ M++ G+ K
Subjt:  MKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNK

Query:  VPG
         PG
Subjt:  VPG

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic5.4e-10033.12Show/hide
Query:  RNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIR-------------------------------------CE
        + +D  I+ +G +I   L       +KL  +Y   GD++ A  VFD++       W +++                                        
Subjt:  RNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIR-------------------------------------CE

Query:  LREIDEGRKVHCQIEKVG--------------------IECSSAMFEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACT
        LR +  G ++H  I K G                    ++ +  +F+ + +++V+SW S+I GYV N  AE+GL +F +M  + +E +  T+ S+   C 
Subjt:  LREIDEGRKVHCQIEKVG--------------------IECSSAMFEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACT

Query:  KLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSC
          R +  G+ VH   +K           T LDMY KCG    A  +F E+    +VS+T+MI GY + G   EA++LF +     + P+  T  +VL+ C
Subjt:  KLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSC

Query:  SVSGNLSLGMSVHGLGIK---LGLEECAVKNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSP-APDAITLVSA
        +    L  G  VH   IK   LG  +  V NAL+DMYAKC  + +A ++F  +  KD+I+WN++I GY++N  A EAL LFNL+      +PD  T+   
Subjt:  SVSGNLSLGMSVHGLGIK---LGLEECAVKNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSP-APDAITLVSA

Query:  LSASATLGSVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFM
        L A A+L +   G  +H Y ++ G FS   ++  +L++ YAKCG    A M+FD +  K++++W+ MI GYG+ G G  ++A+F+ M +  ++ +E+ F+
Subjt:  LSASATLGSVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFM

Query:  TVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACY
        ++L ACS+SG+VDEGWR+F  M  +   +P+++HYAC+VD+LAR+G L +A  FI+ MP+ PD +++GA L GC ++    L E V  ++ +L P    Y
Subjt:  TVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACY

Query:  YVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPG
        YVL++N+YA   +W QV ++R+ + QRGL K PG
Subjt:  YVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPG

Q9ZQ74 Pentatricopeptide repeat-containing protein At2g03380, mitochondrial6.9e-19653.09Show/hide
Query:  ASVQFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRC--------------------
        +S+ + + + CF L+  C NID+L + HG+L  +GL+G++   TKLV +YG  G  + ARLVFDQ+P+PDFY WKVM+RC                    
Subjt:  ASVQFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRC--------------------

Query:  -----------------ELREIDEGRKVHCQIEKV-------------------GIECSSAMFEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREA
                         EL+++D G+K+HCQ+ KV                    I+ +  +F  I  +NVV WTSMIAGYV+ND  EEGLVLFNRMRE 
Subjt:  -----------------ELREIDEGRKVHCQIEKV-------------------GIECSSAMFEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREA

Query:  LVESNQFTLGSIITACTKLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIR
         V  N++T G++I ACTKL ALHQGKW HG  +K+ +E SS L T+ LDMYVKCG   +A  +F+E   +DLV WTAMIVGYT  G  NEAL LF     
Subjt:  LVESNQFTLGSIITACTKLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIR

Query:  SDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRS
         ++ PN VT ASVLS C +  NL LG SVHGL IK+G+ +  V NAL+ MYAKC+   DA  +F    +KD++ WNS+ISG++QNGS +EAL LF+ M S
Subjt:  SDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRS

Query:  SSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSD
         S  P+ +T+ S  SA A+LGS+ VGSSLHAYS+K G L SS++++GTALL+FYAKCGD +SAR++FD++ +KN ITWSAMIGGYG QGD  GSL +F +
Subjt:  SSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSD

Query:  MLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVV
        MLK+  KPNE  F ++LSAC ++GMV+EG +YF SM +DYNF PS KHY CMVD+LAR+G L++ALD I+KMP+QPD+  +GAFLHGCG+++RFDLGE+V
Subjt:  MLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVV

Query:  IGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPG
        I +ML LHP++A YYVL+SNLYASDGRW Q  +VR LM+QRGL+K+ G
Subjt:  IGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPG

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein7.5e-10535.2Show/hide
Query:  YLMGLCRNIDTL---IKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRCELREIDEGRKVHCQIEKVGIECSSAMFEG
        YL+ +C +   L    + HGLL+  G   +L   T L  +Y     V  AR VFD+MP+ D                                       
Subjt:  YLMGLCRNIDTL---IKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRCELREIDEGRKVHCQIEKVGIECSSAMFEG

Query:  ILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFD
             +VSW +++AGY QN  A   L +   M E  ++ +  T+ S++ A + LR +  GK +HGYA+++  +    ++T  +DMY KCG    A  +FD
Subjt:  ILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFD

Query:  ELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHIIDDAYVIF
         +   ++VSW +MI  Y Q   P EA+ +F   +   + P  V+    L +C+  G+L  G  +H L ++LGL+   +V N+LI MY KC  +D A  +F
Subjt:  ELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLE-ECAVKNALIDMYAKCHIIDDAYVIF

Query:  HGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARM
          +  + +++WN+MI G+AQNG   +AL  F+ MRS +  PD  T VS ++A A L        +H   ++  L   N+++ TAL++ YAKCG    AR+
Subjt:  HGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARM

Query:  VFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEA
        +FD M ++++ TW+AMI GYG  G G  +L +F +M K  +KPN V F++V+SACS+SG+V+ G + F  M ++Y+ + SM HY  MVDLL R+GRL+EA
Subjt:  VFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEA

Query:  LDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPGAHLKE
         DFI +MPV+P +++YGA L  C ++   +  E     + +L+P++  Y+VLL+N+Y +   W +V QVR  M ++GL K PG  + E
Subjt:  LDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPGAHLKE

AT1G26330.1 DNA binding3.8e-12528.89Show/hide
Query:  KEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKIWHG
        +++    +EI+S+G+LY+G WDKKYWSSSRGKDR+PYPVGY AVRA++G  Y MEI EG KGPLF+I  +D  S++GQTPDIAW   Q+    H KIWHG
Subjt:  KEQADASLEIISIGSLYSGPWDKKYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKIWHG

Query:  KRSSCKVDG----------VERLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGANLK
        KR +CK+ G          V+RLLRELV N  G  E         + +S ++   V       C N  L+  L+     +KRSR   I T + S   ++ 
Subjt:  KRSSCKVDG----------VERLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGANLK

Query:  QVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLT----FAKNNGN-------SNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNP
           ++ P+ +     L+    +N    S C    +  + G  +     F  N+         S +P      +P     +     +   +S+ ++  H  
Subjt:  QVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLT----FAKNNGN-------SNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNP

Query:  SIDEKLIVIVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLD-LKQVIFESAP------VSLEKNDLTH--III
         + EK         EN  N                     DDN     F  +  ++  +   A DTLD L+     SAP        ++K +LTH  +++
Subjt:  SIDEKLIVIVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLD-LKQVIFESAP------VSLEKNDLTH--III

Query:  PEELVMDSHPEEICSLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPC----DRNTKDILPMEKEDREKQEYMHTQHRN
         E ++ + + E++     N  S+++DSD V Q+  K+MM+ LLPQAIPLLK+TS +K       +        D +   +    +E     E M     +
Subjt:  PEELVMDSHPEEICSLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPC----DRNTKDILPMEKEDREKQEYMHTQHRN

Query:  CESAVPSLELSKPAGLDNLEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSR-----VSEIGKILHGTCQEVS
         +    S     P       DE H          SS  +   A     PID    P    +   +L  +++    K+        S +  IL    QE  
Subjt:  CESAVPSLELSKPAGLDNLEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSR-----VSEIGKILHGTCQEVS

Query:  ECLPRCSSSGRVLFDETMHNNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDT
        E   RC++ G +L  E    N        + + N + VD    +       + +   R+  S   H  N N SS   NS  +   +  +        +  
Subjt:  ECLPRCSSSGRVLFDETMHNNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDT

Query:  LPSSQVPSIVYSRRKAQNVSILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQRE------DLLSEPTLGEQSPINCSYKTTMKS
        + SS++  +    +   + + ++   NP          LGN  G  T   + P    +N +     +  E      D  S    G  S  + S+  +   
Subjt:  LPSSQVPSIVYSRRKAQNVSILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQRE------DLLSEPTLGEQSPINCSYKTTMKS

Query:  DAGL---EKMCLRSPILDQDEASLRANKNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGALRDNNKVSYPHCNVELYREAEGMSKIV
        D      E++ ++       E+  + N ++  + + + P S                  L  N+                     +V++  E E   +++
Subjt:  DAGL---EKMCLRSPILDQDEASLRANKNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGALRDNNKVSYPHCNVELYREAEGMSKIV

Query:  GCYLHPMPVSSVFLSNIENVIHICVLCGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFC
        GCY HPMPVSSV L  + N I+I VL   + D+ RTL  Y +  + P  G+PS++GHT  +LP + D       +E +    TPDG +L+L G+I+T +C
Subjt:  GCYLHPMPVSSVFLSNIENVIHICVLCGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFC

Query:  RTGSINCSCSTCTSGKFEENMVKIVQVKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPK
        R    +CSC  CTS  FEEN V+IVQVK G+VS++  L++ D V C++VC+P++L+A  + G L +W M+S W    E + I +   IS  ++ELK++PK
Subjt:  RTGSINCSCSTCTSGKFEENMVKIVQVKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPK

Query:  SASLVVGHNGFGEFSLCNFNSRDYVKELLCATN---------------TSSKNTEEDS----------------------SVELKDTAIWLFASTISDSH
           LV+GHNG GEF++ + + R  V   +  +N                 S +T ED+                        E+KDTAIWL  ST  DS 
Subjt:  SASLVVGHNGFGEFSLCNFNSRDYVKELLCATN---------------TSSKNTEEDS----------------------SVELKDTAIWLFASTISDSH

Query:  DSHGYLPIDCQINHVGLWKLALLANSTVTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAADD
                D   + V  W+LALL    +  G++LD RA   G  +G G+ GT DGLVY+W+LSTG KLG+L  F+G  V CI+TD+     + +A++D
Subjt:  DSHGYLPIDCQINHVGLWKLALLANSTVTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAADD

AT1G26330.2 DNA binding2.9e-11728.24Show/hide
Query:  KEQADASLEIISIGSLYSGPWDKKYWSSSRGKD-----RYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHT
        +++    +EI+S+G+LY+G WDKKYWSSSR  +        Y  GY AVRA++G  Y MEI EG KGPLF+I  +D  S++GQTPDIAW   Q+    H 
Subjt:  KEQADASLEIISIGSLYSGPWDKKYWSSSRGKD-----RYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHT

Query:  KIWHGKRSSCKVDG----------VERLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSN
        KIWHGKR +CK+ G          V+RLLRELV N  G  E         + +S ++   V       C N  L+  L+     +KRSR   I T + S 
Subjt:  KIWHGKRSSCKVDG----------VERLLRELVANVSGTAELNLLPSNLCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSN

Query:  GANLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLT----FAKNNGN-------SNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQ
          ++    ++ P+ +     L+    +N    S C    +  + G  +     F  N+         S +P      +P     +     +   +S+ ++
Subjt:  GANLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLT----FAKNNGN-------SNRPAAFSAAVPATSIEKTMHEQEEDAVSESTQ

Query:  VAHNPSIDEKLIVIVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLD-LKQVIFESAP------VSLEKNDLTH
          H   + EK         EN  N                     DDN     F  +  ++  +   A DTLD L+     SAP        ++K +LTH
Subjt:  VAHNPSIDEKLIVIVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLD-LKQVIFESAP------VSLEKNDLTH

Query:  --IIIPEELVMDSHPEEICSLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPC----DRNTKDILPMEKEDREKQEYMH
          +++ E ++ + + E++     N  S+++DSD V Q+  K+MM+ LLPQAIPLLK+TS +K       +        D +   +    +E     E M 
Subjt:  --IIIPEELVMDSHPEEICSLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPC----DRNTKDILPMEKEDREKQEYMH

Query:  TQHRNCESAVPSLELSKPAGLDNLEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSR-----VSEIGKILHGT
            + +    S     P       DE H          SS  +   A     PID    P    +   +L  +++    K+        S +  IL   
Subjt:  TQHRNCESAVPSLELSKPAGLDNLEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSR-----VSEIGKILHGT

Query:  CQEVSECLPRCSSSGRVLFDETMHNNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEA
         QE  E   RC++ G +L  E    N        + + N + VD    +       + +   R+  S   H  N N SS   NS  +   +  +      
Subjt:  CQEVSECLPRCSSSGRVLFDETMHNNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEA

Query:  MGIDTLPSSQVPSIVYSRRKAQNVSILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQRE------DLLSEPTLGEQSPINCSYK
          +  + SS++  +    +   + + ++   NP          LGN  G  T   + P    +N +     +  E      D  S    G  S  + S+ 
Subjt:  MGIDTLPSSQVPSIVYSRRKAQNVSILAKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQRE------DLLSEPTLGEQSPINCSYK

Query:  TTMKSDAGL---EKMCLRSPILDQDEASLRANKNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGALRDNNKVSYPHCNVELYREAEG
         +   D      E++ ++       E+  + N ++  + + + P S                  L  N+                     +V++  E E 
Subjt:  TTMKSDAGL---EKMCLRSPILDQDEASLRANKNKHNSGLLDKPVSKKDLEGCVVEEMIERNNLLSTNKNELSHDIGALRDNNKVSYPHCNVELYREAEG

Query:  MSKIVGCYLHPMPVSSVFLSNIENVIHICVLCGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSI
          +++GCY HPMPVSSV L  + N I+I VL   + D+ RTL  Y +  + P  G+PS++GHT  +LP + D       +E +    TPDG +L+L G+I
Subjt:  MSKIVGCYLHPMPVSSVFLSNIENVIHICVLCGLSVDKNRTLVTYTVEVKEPRLGYPSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSI

Query:  QTRFCRTGSINCSCSTCTSGKFEENMVKIVQVKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVEL
        +T +CR    +CSC  CTS  FEEN V+IVQVK G+VS++  L++ D V C++VC+P++L+A  + G L +W M+S W    E + I +   IS  ++EL
Subjt:  QTRFCRTGSINCSCSTCTSGKFEENMVKIVQVKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRGGRLHLWVMDSTWGKQIESHTIPSEDHISPNLVEL

Query:  KRVPKSASLVVGHNGFGEFSLCNFNSRDYVKELLCATN---------------TSSKNTEEDS----------------------SVELKDTAIWLFAST
        K++PK   LV+GHNG GEF++ + + R  V   +  +N                 S +T ED+                        E+KDTAIWL  ST
Subjt:  KRVPKSASLVVGHNGFGEFSLCNFNSRDYVKELLCATN---------------TSSKNTEEDS----------------------SVELKDTAIWLFAST

Query:  ISDSHDSHGYLPIDCQINHVGLWKLALLANSTVTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVA
          DS         D   + V  W+LALL    +  G++LD RA   G  +G G+ GT DGLVY+W+LSTG KLG+L  F+G  V CI+TD+     + +A
Subjt:  ISDSHDSHGYLPIDCQINHVGLWKLALLANSTVTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVA

Query:  ADD
        ++D
Subjt:  ADD

AT1G69350.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-10332.21Show/hide
Query:  KFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRCELRE-------------IDEG------------------------
        K HG +I  G+  + + +T L+ +YG  G++  A  VFD MP  D  AW  ++   L               +D+G                        
Subjt:  KFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRCELRE-------------IDEG------------------------

Query:  RKVHCQIEKVGIE-----CSSAM---------------FEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQ
        R VH QI +   +     C+S +               FE I  KN VSWT+MI+ Y + + +E+ L  F+ M ++ +E N  TL S++++C  +  + +
Subjt:  RKVHCQIEKVGIE-----CSSAM---------------FEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQ

Query:  GKWVHGYAIKNVVEFS-SFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNL
        GK VHG+A++  ++ +   L+   +++Y +CG+  D   +   +   ++V+W ++I  Y   G   +AL LF   +   + P++ T AS +S+C  +G +
Subjt:  GKWVHGYAIKNVVEFS-SFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNL

Query:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSV
         LG  +HG  I+  + +  V+N+LIDMY+K   +D A  +F+ +  + V+TWNSM+ G++QNG++ EA+ LF+ M  S    + +T ++ + A +++GS+
Subjt:  SLGMSVHGLGIKLGLEECAVKNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSV

Query:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSG
        + G  +H   I  GL   +L+  TAL++ YAKCGD  +A  VF +M  ++I++WS+MI  YG+ G    +++ F+ M++   KPNEV+FM VLSAC +SG
Subjt:  QVGSSLHAYSIKEGLFSSNLYIGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSG

Query:  MVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYAS
         V+EG +Y+ ++ + +   P+ +H+AC +DLL+RSG L EA   IK+MP   D S++G+ ++GC ++ + D+ + +  ++  +  ++  YY LLSN+YA 
Subjt:  MVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYAS

Query:  DGRWGQVNQVRELMQQRGLNKVPG
        +G W +  ++R  M+   L KVPG
Subjt:  DGRWGQVNQVRELMQQRGLNKVPG

AT2G03380.1 Pentatricopeptide repeat (PPR) superfamily protein4.9e-19753.09Show/hide
Query:  ASVQFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRC--------------------
        +S+ + + + CF L+  C NID+L + HG+L  +GL+G++   TKLV +YG  G  + ARLVFDQ+P+PDFY WKVM+RC                    
Subjt:  ASVQFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMPDPDFYAWKVMIRC--------------------

Query:  -----------------ELREIDEGRKVHCQIEKV-------------------GIECSSAMFEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREA
                         EL+++D G+K+HCQ+ KV                    I+ +  +F  I  +NVV WTSMIAGYV+ND  EEGLVLFNRMRE 
Subjt:  -----------------ELREIDEGRKVHCQIEKV-------------------GIECSSAMFEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREA

Query:  LVESNQFTLGSIITACTKLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIR
         V  N++T G++I ACTKL ALHQGKW HG  +K+ +E SS L T+ LDMYVKCG   +A  +F+E   +DLV WTAMIVGYT  G  NEAL LF     
Subjt:  LVESNQFTLGSIITACTKLRALHQGKWVHGYAIKNVVEFSSFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIR

Query:  SDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRS
         ++ PN VT ASVLS C +  NL LG SVHGL IK+G+ +  V NAL+ MYAKC+   DA  +F    +KD++ WNS+ISG++QNGS +EAL LF+ M S
Subjt:  SDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAVKNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRS

Query:  SSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSD
         S  P+ +T+ S  SA A+LGS+ VGSSLHAYS+K G L SS++++GTALL+FYAKCGD +SAR++FD++ +KN ITWSAMIGGYG QGD  GSL +F +
Subjt:  SSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEG-LFSSNLYIGTALLNFYAKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSD

Query:  MLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVV
        MLK+  KPNE  F ++LSAC ++GMV+EG +YF SM +DYNF PS KHY CMVD+LAR+G L++ALD I+KMP+QPD+  +GAFLHGCG+++RFDLGE+V
Subjt:  MLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDEALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVV

Query:  IGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPG
        I +ML LHP++A YYVL+SNLYASDGRW Q  +VR LM+QRGL+K+ G
Subjt:  IGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCAACGCTTCCTTAGCCTGGGTCGAAGCTTTTCTCGCCTCGCAGGACCTTTACTTGAACTGGGGAACATGTCTTCTACGACATATCCTCGGCAGCCCCAACTGTC
TGAGCTTGACCAGACCATGGCTTCGGTGCAGTTTATTTCATTAAACTCGTGCTTTTATCTCATGGGACTATGTAGGAACATAGATACCCTCATCAAGTTCCATGGGTTGC
TTATAGTACACGGCCTTGTCGGTAATCTTCTCTGTGATACAAAGTTAGTTGGTGTTTATGGCGCACTTGGGGACGTAGAGTCTGCTCGTTTGGTGTTTGATCAAATGCCG
GACCCGGACTTTTATGCATGGAAGGTGATGATTAGGTGTGAATTGCGGGAAATTGATGAAGGGAGGAAGGTCCATTGCCAGATTGAGAAGGTGGGGATTGAGTGCTCAAG
CGCAATGTTCGAAGGAATTCTTGATAAGAACGTGGTTTCTTGGACTTCAATGATTGCAGGATATGTACAGAATGATTGCGCGGAGGAGGGTCTGGTTTTATTCAATCGGA
TGAGAGAAGCATTGGTTGAAAGCAACCAGTTTACTTTAGGGAGCATAATTACTGCGTGTACAAAATTAAGAGCTTTGCATCAGGGAAAATGGGTACATGGCTATGCCATT
AAGAATGTTGTTGAATTTAGCTCTTTCTTAGCGACGACTTTTTTAGACATGTATGTCAAGTGTGGGCAAACTAGAGACGCTTGCATGATATTTGACGAGCTTCCTACTAT
TGATCTCGTTTCATGGACTGCAATGATTGTTGGATATACCCAAGCTGGCCAACCCAACGAAGCATTGAGGCTTTTCGCTGATGAAATAAGATCTGATCTCTTGCCCAATT
CTGTCACCGCTGCAAGTGTTCTTTCATCGTGTTCTGTTTCTGGTAATTTAAGTTTAGGAATGTCAGTGCATGGACTTGGAATTAAACTTGGGCTGGAAGAGTGTGCAGTG
AAGAATGCTCTCATTGACATGTATGCTAAATGCCATATAATTGATGACGCTTATGTTATATTTCATGGGGTTTTGGATAAAGATGTGATTACTTGGAACTCAATGATATC
TGGGTATGCTCAGAATGGGTCTGCATATGAAGCTCTCCGGCTCTTTAATCTAATGAGATCAAGCTCACCTGCCCCTGACGCCATTACACTGGTGAGCGCCCTTTCAGCAT
CTGCCACTCTAGGTTCTGTACAGGTCGGTTCATCACTTCATGCTTACTCGATAAAAGAAGGCCTCTTTTCATCTAATCTTTACATTGGCACTGCGCTTTTGAACTTTTAT
GCCAAATGTGGTGATGCTAAATCAGCACGTATGGTATTTGATAGTATGGGAGATAAGAACATTATCACGTGGAGTGCAATGATAGGTGGTTATGGAGTGCAGGGTGATGG
TAGTGGATCGCTTGCCATATTCTCTGACATGTTGAAGGAGGATCTGAAACCCAATGAAGTGATTTTTATGACAGTATTATCTGCTTGTAGCTATTCAGGAATGGTTGACG
AGGGATGGAGATATTTCAAATCTATGAGTCAGGATTATAACTTTAAGCCTTCCATGAAACACTATGCCTGTATGGTTGATCTTTTGGCTCGATCTGGCAGGTTGGATGAA
GCATTGGACTTTATTAAGAAAATGCCAGTTCAACCAGACATTAGTTTGTATGGAGCTTTTCTTCATGGATGTGGATTATACACGAGGTTTGATCTCGGAGAAGTGGTAAT
CGGGGAAATGTTACAGCTTCATCCGAATGAAGCTTGCTATTATGTGCTTTTGTCTAACCTATATGCTTCAGATGGGAGATGGGGTCAAGTCAATCAGGTGAGAGAGTTGA
TGCAACAGAGAGGATTGAACAAAGTCCCAGGAGCTCATCTCAAAGAACAGGCAGACGCTAGTCTTGAAATAATTTCCATCGGATCCTTGTACAGTGGACCATGGGACAAG
AAGTACTGGAGCAGCTCTAGGGGTAAAGATCGATATCCCTATCCAGTTGGGTATCATGCTGTTCGGGCTTACAATGGGATCAAATATAAAATGGAAATTCATGAGGGTCC
CAAAGGGCCTTTATTTATGATTTTGTCCATGGATGGATGTTCTTTTTCTGGGCAAACCCCTGATATTGCATGGGAGATGTTTCAGAGGAAAGGCTGCCTCCACACTAAAA
TTTGGCACGGGAAAAGGTCTTCATGCAAAGTTGATGGCGTTGAGAGGTTACTTAGGGAGCTAGTGGCAAATGTCAGTGGAACAGCGGAATTAAATCTACTTCCTTCAAAC
TTATGCAATAAGGCTTCTGGATCTGCACAGACTGCAGTTGAGCATTATTCTGCTGGTGAATGTGAAAATGCTGAATTAATTTCTTGCCTTGAAAGATCAAAGATTGCAAG
AAAGAGAAGCAGGAGTCATAGAATTGAAACGGTAAAGTTATCCAACGGTGCTAATCTTAAACAGGTAAGAAATCGTGGTCCGAAAATTAAATCCATGACTGCAAAACTGT
CAAGTTCAGGGTCCGTTAATGAACAAAATCAAAGTTTTTGTGGTAACTATGAAAAGAAACGAAGCAAAGGGAATTCCTTGACTTTTGCAAAAAATAATGGCAACAGTAAT
AGGCCAGCAGCTTTTTCGGCAGCAGTACCTGCAACATCTATAGAGAAAACTATGCACGAACAGGAGGAGGATGCTGTCTCTGAATCCACTCAAGTAGCTCATAACCCGTC
CATTGATGAAAAACTTATTGTGATTGTTCTGTATTGGTATGAGAACACTCTTAACTTCTTACAGCATGATAGGTTGTCGATGGACAAGTTGGAAGGTATTAGTCGAGATA
TGGAAACAGATGATAACAGTGGAGTTGCTTCCTTCCAAAAAGATTGTTCAGATATAGAAGACAATCATCATCATGCTTCAGATACGTTAGATCTAAAGCAAGTTATTTTT
GAGTCTGCTCCAGTTAGCTTGGAAAAGAATGATCTCACTCATATCATCATACCTGAAGAGTTGGTGATGGATTCCCACCCAGAAGAAATCTGCTCATTAAACCCAAACTC
AGGTTCTGAAAGAAATGATTCTGATTCAGTAGGTCAAGACATGGTAAAGTCAATGATGACATTTTTGCTTCCACAAGCAATTCCTCTGCTTAAGAGGACTTCTGGAAGGA
AAAAAACTGCCACTTCTACTTTGGAAACTTTGCCTTGTGATAGAAATACAAAAGATATATTGCCTATGGAGAAAGAAGATAGAGAAAAGCAGGAATATATGCACACTCAA
CATAGGAATTGCGAATCTGCTGTCCCAAGTCTTGAGCTTTCAAAACCTGCTGGTCTTGACAATCTAGAGGATGAGCAACATGATGACTGTGCAAACATTAATGGCAACTT
CTCTTCTATCACTGACGATGGTCGAGCTAAGGAAGATATGAAACCCATTGATTCTTGTGGATTCCCATTCTGTGGTCGCATGCATGATGAGTCATTGGTGAACCATCATG
AAACTAATGGAAGCAAGAAGTCCCGTGTCAGTGAAATTGGTAAAATTTTGCATGGAACATGTCAGGAGGTTTCAGAATGTCTTCCCAGGTGCTCTTCTTCTGGTAGAGTT
CTTTTTGATGAAACTATGCACAATAATACGAGGATGGACGAATGCCCGTTAAATCTTCAAATAAATTCCTGGAAAGTGGACCTGAGAACTCCTAAAGATTATATGGAAAG
CAATGGTGATGAGCAAGCTTGTTCAAGAGAATCCTTTTCTCAACTCTCGCATGCTCAGAATGCAAATGGTTCAAGTGTAAGAAGCAACTCTGCATATTCAGAAGCACCGA
ATAAGGAAGTCATTGCAGGAAAAGAGGCTATGGGAATCGACACTTTGCCATCTTCTCAAGTTCCAAGCATTGTGTATAGTCGGAGAAAAGCTCAAAATGTGTCTATTTTG
GCTAAAGAATTCAATCCCCCACCCAACGAAGCTTACATCACTAGTTGCCTTGGAAATCATTGTGGTGCCGAAACATCTTCCACTAAATCTCCACATTCTTCCGATATCAA
TCTTAGCATTCTACCTGAAAACCAACAACGTGAAGATTTACTTTCTGAACCTACACTAGGAGAACAATCCCCCATCAATTGCAGTTATAAAACTACTATGAAATCTGATG
CAGGATTAGAAAAAATGTGCCTTCGCAGTCCTATATTAGACCAAGATGAAGCATCACTGAGAGCCAACAAGAACAAGCATAATTCAGGACTTCTAGATAAACCTGTTTCG
AAGAAAGATTTGGAAGGTTGTGTTGTCGAGGAGATGATTGAACGTAATAACCTTTTAAGTACAAATAAAAACGAGTTATCTCATGATATTGGAGCCTTAAGAGACAATAA
TAAGGTTTCTTATCCTCATTGCAATGTGGAGCTCTATCGTGAGGCAGAAGGAATGTCAAAGATAGTGGGATGTTATTTGCACCCTATGCCTGTATCATCAGTATTTCTCA
GCAACATTGAGAATGTAATCCACATTTGTGTTTTGTGTGGTCTTTCGGTGGACAAGAACAGAACACTTGTTACTTACACGGTGGAAGTGAAAGAACCAAGGTTGGGATAC
CCATCTTTGGTTGGCCACACGGCAGTAATGTTACCAACGCTAAAAGATTATTTGGGCAAAGAAATTGCAATAGAACGAACCGGTTTCCAGTTAACTCCAGATGGGAACTA
TCTTGTTTTGATTGGTAGCATTCAAACACGTTTTTGCAGGACAGGGAGTATTAATTGTTCTTGTTCAACATGTACATCTGGCAAGTTTGAAGAGAATATGGTGAAGATAG
TGCAAGTTAAATATGGCTATGTGTCGATCATGGCAAGCTTGAAAAGTGCTGACTGTGTACATTGTATTTTGGTTTGTGAACCTGACCATCTTGTTGCTGTTGGAAGGGGT
GGACGGCTGCATCTTTGGGTCATGGATTCAACTTGGGGCAAACAGATAGAAAGTCATACCATACCGTCTGAGGATCATATATCTCCCAACTTGGTGGAGCTTAAAAGGGT
TCCAAAGTCTGCCAGTCTGGTTGTAGGCCATAATGGTTTTGGTGAATTCAGTTTATGCAACTTTAATTCAAGGGACTATGTTAAAGAGCTGTTGTGTGCAACGAATACAA
GCTCAAAGAACACTGAGGAAGATTCTTCAGTTGAGCTAAAGGACACTGCCATATGGCTTTTTGCCTCAACCATATCAGATTCTCATGACTCACATGGCTATTTACCAATT
GATTGTCAGATAAATCATGTAGGATTGTGGAAACTAGCACTACTTGCCAACAGTACGGTTACATTTGGTGCAGAGCTGGATTTGAGGGCTTCTGCCATTGGGGCATCAGC
TGGTCGAGGTATCATTGGGACTCAGGACGGCCTAGTTTATGTGTGGGAATTGTCTACAGGAAATAAACTGGGCACTCTTCTTCGTTTCGAAGGTGCAAGTGTTGATTGTA
TTGCAACTGATAATAGAGAGGAAGGTGTCGTGGCTGTAGCGGCTGATGATATGAGCTTTCCACTCATCTCTGGCATTCCTGTTACAGCTTCTAAAACAGGAACCAATCTA
TCCAATTCTTCTTCAACAGAAATCATAGGGGCCATGGGCCGTGTCGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGCAACGCTTCCTTAGCCTGGGTCGAAGCTTTTCTCGCCTCGCAGGACCTTTACTTGAACTGGGGAACATGTCTTCTACGACATATCCTCGGCAGCCCCAACTGTC
TGAGCTTGACCAGACCATGGCTTCGGTGCAGTTTATTTCATTAAACTCGTGCTTTTATCTCATGGGACTATGTAGGAACATAGATACCCTCATCAAGTTCCATGGGTTGC
TTATAGTACACGGCCTTGTCGGTAATCTTCTCTGTGATACAAAGTTAGTTGGTGTTTATGGCGCACTTGGGGACGTAGAGTCTGCTCGTTTGGTGTTTGATCAAATGCCG
GACCCGGACTTTTATGCATGGAAGGTGATGATTAGGTGTGAATTGCGGGAAATTGATGAAGGGAGGAAGGTCCATTGCCAGATTGAGAAGGTGGGGATTGAGTGCTCAAG
CGCAATGTTCGAAGGAATTCTTGATAAGAACGTGGTTTCTTGGACTTCAATGATTGCAGGATATGTACAGAATGATTGCGCGGAGGAGGGTCTGGTTTTATTCAATCGGA
TGAGAGAAGCATTGGTTGAAAGCAACCAGTTTACTTTAGGGAGCATAATTACTGCGTGTACAAAATTAAGAGCTTTGCATCAGGGAAAATGGGTACATGGCTATGCCATT
AAGAATGTTGTTGAATTTAGCTCTTTCTTAGCGACGACTTTTTTAGACATGTATGTCAAGTGTGGGCAAACTAGAGACGCTTGCATGATATTTGACGAGCTTCCTACTAT
TGATCTCGTTTCATGGACTGCAATGATTGTTGGATATACCCAAGCTGGCCAACCCAACGAAGCATTGAGGCTTTTCGCTGATGAAATAAGATCTGATCTCTTGCCCAATT
CTGTCACCGCTGCAAGTGTTCTTTCATCGTGTTCTGTTTCTGGTAATTTAAGTTTAGGAATGTCAGTGCATGGACTTGGAATTAAACTTGGGCTGGAAGAGTGTGCAGTG
AAGAATGCTCTCATTGACATGTATGCTAAATGCCATATAATTGATGACGCTTATGTTATATTTCATGGGGTTTTGGATAAAGATGTGATTACTTGGAACTCAATGATATC
TGGGTATGCTCAGAATGGGTCTGCATATGAAGCTCTCCGGCTCTTTAATCTAATGAGATCAAGCTCACCTGCCCCTGACGCCATTACACTGGTGAGCGCCCTTTCAGCAT
CTGCCACTCTAGGTTCTGTACAGGTCGGTTCATCACTTCATGCTTACTCGATAAAAGAAGGCCTCTTTTCATCTAATCTTTACATTGGCACTGCGCTTTTGAACTTTTAT
GCCAAATGTGGTGATGCTAAATCAGCACGTATGGTATTTGATAGTATGGGAGATAAGAACATTATCACGTGGAGTGCAATGATAGGTGGTTATGGAGTGCAGGGTGATGG
TAGTGGATCGCTTGCCATATTCTCTGACATGTTGAAGGAGGATCTGAAACCCAATGAAGTGATTTTTATGACAGTATTATCTGCTTGTAGCTATTCAGGAATGGTTGACG
AGGGATGGAGATATTTCAAATCTATGAGTCAGGATTATAACTTTAAGCCTTCCATGAAACACTATGCCTGTATGGTTGATCTTTTGGCTCGATCTGGCAGGTTGGATGAA
GCATTGGACTTTATTAAGAAAATGCCAGTTCAACCAGACATTAGTTTGTATGGAGCTTTTCTTCATGGATGTGGATTATACACGAGGTTTGATCTCGGAGAAGTGGTAAT
CGGGGAAATGTTACAGCTTCATCCGAATGAAGCTTGCTATTATGTGCTTTTGTCTAACCTATATGCTTCAGATGGGAGATGGGGTCAAGTCAATCAGGTGAGAGAGTTGA
TGCAACAGAGAGGATTGAACAAAGTCCCAGGAGCTCATCTCAAAGAACAGGCAGACGCTAGTCTTGAAATAATTTCCATCGGATCCTTGTACAGTGGACCATGGGACAAG
AAGTACTGGAGCAGCTCTAGGGGTAAAGATCGATATCCCTATCCAGTTGGGTATCATGCTGTTCGGGCTTACAATGGGATCAAATATAAAATGGAAATTCATGAGGGTCC
CAAAGGGCCTTTATTTATGATTTTGTCCATGGATGGATGTTCTTTTTCTGGGCAAACCCCTGATATTGCATGGGAGATGTTTCAGAGGAAAGGCTGCCTCCACACTAAAA
TTTGGCACGGGAAAAGGTCTTCATGCAAAGTTGATGGCGTTGAGAGGTTACTTAGGGAGCTAGTGGCAAATGTCAGTGGAACAGCGGAATTAAATCTACTTCCTTCAAAC
TTATGCAATAAGGCTTCTGGATCTGCACAGACTGCAGTTGAGCATTATTCTGCTGGTGAATGTGAAAATGCTGAATTAATTTCTTGCCTTGAAAGATCAAAGATTGCAAG
AAAGAGAAGCAGGAGTCATAGAATTGAAACGGTAAAGTTATCCAACGGTGCTAATCTTAAACAGGTAAGAAATCGTGGTCCGAAAATTAAATCCATGACTGCAAAACTGT
CAAGTTCAGGGTCCGTTAATGAACAAAATCAAAGTTTTTGTGGTAACTATGAAAAGAAACGAAGCAAAGGGAATTCCTTGACTTTTGCAAAAAATAATGGCAACAGTAAT
AGGCCAGCAGCTTTTTCGGCAGCAGTACCTGCAACATCTATAGAGAAAACTATGCACGAACAGGAGGAGGATGCTGTCTCTGAATCCACTCAAGTAGCTCATAACCCGTC
CATTGATGAAAAACTTATTGTGATTGTTCTGTATTGGTATGAGAACACTCTTAACTTCTTACAGCATGATAGGTTGTCGATGGACAAGTTGGAAGGTATTAGTCGAGATA
TGGAAACAGATGATAACAGTGGAGTTGCTTCCTTCCAAAAAGATTGTTCAGATATAGAAGACAATCATCATCATGCTTCAGATACGTTAGATCTAAAGCAAGTTATTTTT
GAGTCTGCTCCAGTTAGCTTGGAAAAGAATGATCTCACTCATATCATCATACCTGAAGAGTTGGTGATGGATTCCCACCCAGAAGAAATCTGCTCATTAAACCCAAACTC
AGGTTCTGAAAGAAATGATTCTGATTCAGTAGGTCAAGACATGGTAAAGTCAATGATGACATTTTTGCTTCCACAAGCAATTCCTCTGCTTAAGAGGACTTCTGGAAGGA
AAAAAACTGCCACTTCTACTTTGGAAACTTTGCCTTGTGATAGAAATACAAAAGATATATTGCCTATGGAGAAAGAAGATAGAGAAAAGCAGGAATATATGCACACTCAA
CATAGGAATTGCGAATCTGCTGTCCCAAGTCTTGAGCTTTCAAAACCTGCTGGTCTTGACAATCTAGAGGATGAGCAACATGATGACTGTGCAAACATTAATGGCAACTT
CTCTTCTATCACTGACGATGGTCGAGCTAAGGAAGATATGAAACCCATTGATTCTTGTGGATTCCCATTCTGTGGTCGCATGCATGATGAGTCATTGGTGAACCATCATG
AAACTAATGGAAGCAAGAAGTCCCGTGTCAGTGAAATTGGTAAAATTTTGCATGGAACATGTCAGGAGGTTTCAGAATGTCTTCCCAGGTGCTCTTCTTCTGGTAGAGTT
CTTTTTGATGAAACTATGCACAATAATACGAGGATGGACGAATGCCCGTTAAATCTTCAAATAAATTCCTGGAAAGTGGACCTGAGAACTCCTAAAGATTATATGGAAAG
CAATGGTGATGAGCAAGCTTGTTCAAGAGAATCCTTTTCTCAACTCTCGCATGCTCAGAATGCAAATGGTTCAAGTGTAAGAAGCAACTCTGCATATTCAGAAGCACCGA
ATAAGGAAGTCATTGCAGGAAAAGAGGCTATGGGAATCGACACTTTGCCATCTTCTCAAGTTCCAAGCATTGTGTATAGTCGGAGAAAAGCTCAAAATGTGTCTATTTTG
GCTAAAGAATTCAATCCCCCACCCAACGAAGCTTACATCACTAGTTGCCTTGGAAATCATTGTGGTGCCGAAACATCTTCCACTAAATCTCCACATTCTTCCGATATCAA
TCTTAGCATTCTACCTGAAAACCAACAACGTGAAGATTTACTTTCTGAACCTACACTAGGAGAACAATCCCCCATCAATTGCAGTTATAAAACTACTATGAAATCTGATG
CAGGATTAGAAAAAATGTGCCTTCGCAGTCCTATATTAGACCAAGATGAAGCATCACTGAGAGCCAACAAGAACAAGCATAATTCAGGACTTCTAGATAAACCTGTTTCG
AAGAAAGATTTGGAAGGTTGTGTTGTCGAGGAGATGATTGAACGTAATAACCTTTTAAGTACAAATAAAAACGAGTTATCTCATGATATTGGAGCCTTAAGAGACAATAA
TAAGGTTTCTTATCCTCATTGCAATGTGGAGCTCTATCGTGAGGCAGAAGGAATGTCAAAGATAGTGGGATGTTATTTGCACCCTATGCCTGTATCATCAGTATTTCTCA
GCAACATTGAGAATGTAATCCACATTTGTGTTTTGTGTGGTCTTTCGGTGGACAAGAACAGAACACTTGTTACTTACACGGTGGAAGTGAAAGAACCAAGGTTGGGATAC
CCATCTTTGGTTGGCCACACGGCAGTAATGTTACCAACGCTAAAAGATTATTTGGGCAAAGAAATTGCAATAGAACGAACCGGTTTCCAGTTAACTCCAGATGGGAACTA
TCTTGTTTTGATTGGTAGCATTCAAACACGTTTTTGCAGGACAGGGAGTATTAATTGTTCTTGTTCAACATGTACATCTGGCAAGTTTGAAGAGAATATGGTGAAGATAG
TGCAAGTTAAATATGGCTATGTGTCGATCATGGCAAGCTTGAAAAGTGCTGACTGTGTACATTGTATTTTGGTTTGTGAACCTGACCATCTTGTTGCTGTTGGAAGGGGT
GGACGGCTGCATCTTTGGGTCATGGATTCAACTTGGGGCAAACAGATAGAAAGTCATACCATACCGTCTGAGGATCATATATCTCCCAACTTGGTGGAGCTTAAAAGGGT
TCCAAAGTCTGCCAGTCTGGTTGTAGGCCATAATGGTTTTGGTGAATTCAGTTTATGCAACTTTAATTCAAGGGACTATGTTAAAGAGCTGTTGTGTGCAACGAATACAA
GCTCAAAGAACACTGAGGAAGATTCTTCAGTTGAGCTAAAGGACACTGCCATATGGCTTTTTGCCTCAACCATATCAGATTCTCATGACTCACATGGCTATTTACCAATT
GATTGTCAGATAAATCATGTAGGATTGTGGAAACTAGCACTACTTGCCAACAGTACGGTTACATTTGGTGCAGAGCTGGATTTGAGGGCTTCTGCCATTGGGGCATCAGC
TGGTCGAGGTATCATTGGGACTCAGGACGGCCTAGTTTATGTGTGGGAATTGTCTACAGGAAATAAACTGGGCACTCTTCTTCGTTTCGAAGGTGCAAGTGTTGATTGTA
TTGCAACTGATAATAGAGAGGAAGGTGTCGTGGCTGTAGCGGCTGATGATATGAGCTTTCCACTCATCTCTGGCATTCCTGTTACAGCTTCTAAAACAGGAACCAATCTA
TCCAATTCTTCTTCAACAGAAATCATAGGGGCCATGGGCCGTGTCGACTGA
Protein sequenceShow/hide protein sequence
MLQRFLSLGRSFSRLAGPLLELGNMSSTTYPRQPQLSELDQTMASVQFISLNSCFYLMGLCRNIDTLIKFHGLLIVHGLVGNLLCDTKLVGVYGALGDVESARLVFDQMP
DPDFYAWKVMIRCELREIDEGRKVHCQIEKVGIECSSAMFEGILDKNVVSWTSMIAGYVQNDCAEEGLVLFNRMREALVESNQFTLGSIITACTKLRALHQGKWVHGYAI
KNVVEFSSFLATTFLDMYVKCGQTRDACMIFDELPTIDLVSWTAMIVGYTQAGQPNEALRLFADEIRSDLLPNSVTAASVLSSCSVSGNLSLGMSVHGLGIKLGLEECAV
KNALIDMYAKCHIIDDAYVIFHGVLDKDVITWNSMISGYAQNGSAYEALRLFNLMRSSSPAPDAITLVSALSASATLGSVQVGSSLHAYSIKEGLFSSNLYIGTALLNFY
AKCGDAKSARMVFDSMGDKNIITWSAMIGGYGVQGDGSGSLAIFSDMLKEDLKPNEVIFMTVLSACSYSGMVDEGWRYFKSMSQDYNFKPSMKHYACMVDLLARSGRLDE
ALDFIKKMPVQPDISLYGAFLHGCGLYTRFDLGEVVIGEMLQLHPNEACYYVLLSNLYASDGRWGQVNQVRELMQQRGLNKVPGAHLKEQADASLEIISIGSLYSGPWDK
KYWSSSRGKDRYPYPVGYHAVRAYNGIKYKMEIHEGPKGPLFMILSMDGCSFSGQTPDIAWEMFQRKGCLHTKIWHGKRSSCKVDGVERLLRELVANVSGTAELNLLPSN
LCNKASGSAQTAVEHYSAGECENAELISCLERSKIARKRSRSHRIETVKLSNGANLKQVRNRGPKIKSMTAKLSSSGSVNEQNQSFCGNYEKKRSKGNSLTFAKNNGNSN
RPAAFSAAVPATSIEKTMHEQEEDAVSESTQVAHNPSIDEKLIVIVLYWYENTLNFLQHDRLSMDKLEGISRDMETDDNSGVASFQKDCSDIEDNHHHASDTLDLKQVIF
ESAPVSLEKNDLTHIIIPEELVMDSHPEEICSLNPNSGSERNDSDSVGQDMVKSMMTFLLPQAIPLLKRTSGRKKTATSTLETLPCDRNTKDILPMEKEDREKQEYMHTQ
HRNCESAVPSLELSKPAGLDNLEDEQHDDCANINGNFSSITDDGRAKEDMKPIDSCGFPFCGRMHDESLVNHHETNGSKKSRVSEIGKILHGTCQEVSECLPRCSSSGRV
LFDETMHNNTRMDECPLNLQINSWKVDLRTPKDYMESNGDEQACSRESFSQLSHAQNANGSSVRSNSAYSEAPNKEVIAGKEAMGIDTLPSSQVPSIVYSRRKAQNVSIL
AKEFNPPPNEAYITSCLGNHCGAETSSTKSPHSSDINLSILPENQQREDLLSEPTLGEQSPINCSYKTTMKSDAGLEKMCLRSPILDQDEASLRANKNKHNSGLLDKPVS
KKDLEGCVVEEMIERNNLLSTNKNELSHDIGALRDNNKVSYPHCNVELYREAEGMSKIVGCYLHPMPVSSVFLSNIENVIHICVLCGLSVDKNRTLVTYTVEVKEPRLGY
PSLVGHTAVMLPTLKDYLGKEIAIERTGFQLTPDGNYLVLIGSIQTRFCRTGSINCSCSTCTSGKFEENMVKIVQVKYGYVSIMASLKSADCVHCILVCEPDHLVAVGRG
GRLHLWVMDSTWGKQIESHTIPSEDHISPNLVELKRVPKSASLVVGHNGFGEFSLCNFNSRDYVKELLCATNTSSKNTEEDSSVELKDTAIWLFASTISDSHDSHGYLPI
DCQINHVGLWKLALLANSTVTFGAELDLRASAIGASAGRGIIGTQDGLVYVWELSTGNKLGTLLRFEGASVDCIATDNREEGVVAVAADDMSFPLISGIPVTASKTGTNL
SNSSSTEIIGAMGRVD