; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017243 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017243
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionFUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: Ubiquitin system component Cue protein .
Genome locationtig00153033:872565..886110
RNA-Seq ExpressionSgr017243
SyntenySgr017243
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsIPR006726 - Para-hydroxybenzoic acid efflux pump subunit AaeB/fusaric acid resistance protein
IPR015216 - SANT associated


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027320.1 hypothetical protein SDJN02_11332, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0068.9Show/hide
Query:  TAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-AALA
        TAVLWR RLG+ LR A ACS++GGV +FGPA VR LL+F +FSYFT I IVL+D VS+GDAVRG W V +AV  VLV SVPCL+L+GP RF  G+ AA+A
Subjt:  TAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-AALA

Query:  VAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEGFGA
        VA+S FVVALP RTH+L+KRIAFGQLVIVYVGTVVHG +TSF MHP++VASSTA GALAAV AM+LPYPRLAS QIRK+ RAYCEN CER G + EGFGA
Subjt:  VAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEGFGA

Query:  KSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVLKLQQ
        K+KAEAVAS+AEAK+LSTTGT+LL++I+ NL GM+WER +   A        E LE  EVA+RGMEAALTS SIAIG MDEELCN   NLK K  LKLQ 
Subjt:  KSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVLKLQQ

Query:  LKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAA---PNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTSR
         K S PPNATTAPE TK  F    PLN+S IT + LP SFFLRCME+L Y STAA    N+VS VE  RK +G E   L + G+K TRWGILSN+ PT +
Subjt:  LKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAA---PNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTSR

Query:  SLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQA
        S  FALKCSITLGLAVFLGL YTK NGYWSGLTVAIS ATERQA FTVANARAQGTA+GSIYGV+CCFIL+K EYLWLLPLLPWV+F+SFL+HSRMYG A
Subjt:  SLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQA

Query:  GGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLNDSSRYIPLIEQHKNLKSH
        G  +SALGALLVLGRKNYGIPSEFANARITEACIGL+C++T+E++FNPTRAATLAKTEFS+SLEALQ+ IKRV LIP+K++N SS    LIE HK LKSH
Subjt:  GGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLNDSSRYIPLIEQHKNLKSH

Query:  VGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKELH
        V QLEKFI EA FEPNFWFTPFQ GCYD LLKSLQKTV+IL FL HEM  L LEL+RSG+V KE+HDS +EDM+ F KKVGCSL FMEK+S+   +KEL 
Subjt:  VGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKELH

Query:  DKNQNLSDEMEMGKS-PSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPSIH
        + N+N   EMEMGK  P+DGCRALAL EED+EKIVGSFCQHANE+ +K YTN EGEAN +GQMTL LSSIGFC+ECLMRETM MEKEV+QLLKLENPS+H
Subjt:  DKNQNLSDEMEMGKS-PSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPSIH

Query:  MNLQELSTK-------------------------------------------------------------------------------------------
        +NLQ+LSTK                                                                                           
Subjt:  MNLQELSTK-------------------------------------------------------------------------------------------

Query:  -----EVDMSAGVCGKRVGFEEIFGSSSPTACSSAKRSRWSSFGSPTRSEFGSGPDDSASVLLQMFPGV-GAEVPSFDDFNARVDSATIGNCSTIPDERT
             E  MSAGVCGKRVGFEEIFGSSSPTACSSAKRSRWSSFGS TRSEFGSGP+DSASVLLQMFPGV GAEVPSFDD+NA VDSA I NCST+PDERT
Subjt:  -----EVDMSAGVCGKRVGFEEIFGSSSPTACSSAKRSRWSSFGSPTRSEFGSGPDDSASVLLQMFPGV-GAEVPSFDDFNARVDSATIGNCSTIPDERT

Query:  ATCSQMLP-EKVEEAKDVSSAIAEGNAMHGSKWVDMFVQEMTNAIDLDDARIRAAKILEAFEQNITVYSRESEELKHASLKEHLQSLVNDNQILKRAVAI
        ATCSQM   EKVEEA+DV SAIAEGN MHG KWVDMFVQEM +A+D+DDAR+RAA+ILEAFE +IT +SRESEELKH SLKEH QSLVNDNQILKRAVAI
Subjt:  ATCSQMLP-EKVEEAKDVSSAIAEGNAMHGSKWVDMFVQEMTNAIDLDDARIRAAKILEAFEQNITVYSRESEELKHASLKEHLQSLVNDNQILKRAVAI

Query:  QHERNLEQEEKTREVHQLKQVLCQYQEQIQTV
        QHERNLEQEEKT+EVHQLK VLCQYQEQIQ++
Subjt:  QHERNLEQEEKTREVHQLKQVLCQYQEQIQTV

XP_004142207.1 uncharacterized protein LOC101207339 [Cucumis sativus]0.0e+0072.87Show/hide
Query:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-
        +  AT ++WR RLG ALRAALAC IVG VT+FGPAP+R LL FS+FSYFT I ++LSD VS+GDAVRG W V +AV  VLVSSVPCLWLIGP RF S   
Subjt:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-

Query:  AALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAE
        AA+AVAVSGFVVALPERTHLL+KRIAFGQLVIVYVGTV+HG + SFV HP++VASSTA GALAAV AM++P+PRLA  QIRK+ + YCEN  +R+  + E
Subjt:  AALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAE

Query:  GFGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL
        G GAK+K EAVA + EAKSLST  T+LLQ IK N+RG++WER      +      EE LEE+EVA++GMEAALTSPS+  G++DE+L N   NLK K +L
Subjt:  GFGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL

Query:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTS
        KLQQ K ++PP +TTAPE TK +F    PLNIS IT + LP SFFLRCMEILLYDSTA  N+VS VE G++ +GE+A  LG+ G+K+T WGILSNM PT+
Subjt:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTS

Query:  RSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQ
        +SL FALKCSITLGLAVFLGL YTKPNGYWSGLTVAISFATE+QA FTVANARAQGTAIGSIYGV+CCFIL+KYEYLWLLPLLPWV+FTSFL+HSRMYGQ
Subjt:  RSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQ

Query:  AGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLN-DSSRYIPLIEQHKNLK
        +GGIASALGALLVLGRK+YG+PSEFANARITEACIGLLC++TVE++FNPTR ATLAKTEFS +L AL+D IKRV LIP+K+LN ++S ++ LI+ HK L+
Subjt:  AGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLN-DSSRYIPLIEQHKNLK

Query:  SHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKE
        SHV QLEKFIVEAGFEPNFWFTPFQG CY+KLLKSLQKT+DILQ ++HE+KFLSLEL+RSGL+VKELHDS  EDM +F+KK+GCSLKFMEK+SLIKSLKE
Subjt:  SHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKE

Query:  LHDKNQNLSDEMEMGKSPS-DGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPS
        L +KNQN   +MEMGK  S DGC+A AL EED+EKIVGSFCQHANE+ SK Y+N E E NLKGQMTL LSSIGFC+ECLMRETMVMEKEV Q+LKLENPS
Subjt:  LHDKNQNLSDEMEMGKSPS-DGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPS

Query:  IHMNLQELSTK
        IH+NLQELST+
Subjt:  IHMNLQELSTK

XP_022143731.1 uncharacterized protein LOC111013575 [Momordica charantia]0.0e+0080.79Show/hide
Query:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVA
        MAVATA LWR RLG+ALRAALACSIVG VTLFGPAPVRH+LTFS+FSY T + IVLSD VSLG AVRG   V +AV  VLV+SVPCLWLIG ARFNS  A
Subjt:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVA

Query:  ALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEG
        A AVAVS FVVAL ER HLL+KRIAFGQLVIVYVGT +HG ETSF MHPL+VASSTA GALAA  AM++PYPRL+SCQIRK+ R YCEN CERVG + EG
Subjt:  ALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEG

Query:  FGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPN-LMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL
        FGAK+KA A+ SIAEAKSLS T T+LL++IKI L GM WERPR+   KP+  M     LEE EVA+RGMEAALTSPS A GTMDE+LCNLPKNLK K + 
Subjt:  FGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPN-LMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL

Query:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP---NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMW
        KLQQLK S+PPNATTAPE TKA F       IS I   NLP SFFLRCMEI+LYDSTAA    N+V  VENG++ +GEEA+ LG  G+K TRWGILSNM 
Subjt:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP---NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMW

Query:  PTSRSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRM
        PT++SLYFALKCS+TLGLAVFLGL YTKPNGYWSGLTVAISFATERQA FTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWV+FTSFLIHSRM
Subjt:  PTSRSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRM

Query:  YGQAGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDL-NDSSRYIPLIEQHK
        YGQ+GGIASALGALLVLGRKNYGIPSEFANARITEACIGLLC+VTVEVVFNPTRAATLAK EFSKSL ALQDCI+RV LIP+K+L N+SS +I LIE+HK
Subjt:  YGQAGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDL-NDSSRYIPLIEQHK

Query:  NLKSHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKS
         LKSHVGQLEKFI+EAG+EPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLEL+RSGLVVKELHDS +EDMELFNKKVGCSLKFMEKVSL+KS
Subjt:  NLKSHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKS

Query:  LKELHDKNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLEN
        LKEL +KNQNL DEMEMGKS +DGCRAL LSEED+EKIVGSFCQ+ANE+ SKVYTN EGEANLKGQMTL L SIGFC+ECLMRETMVMEKEV QLLKLEN
Subjt:  LKELHDKNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLEN

Query:  PSIHMNLQELST
        PSIHMNLQELST
Subjt:  PSIHMNLQELST

XP_022143874.1 uncharacterized protein LOC111013682 [Momordica charantia]0.0e+0077.78Show/hide
Query:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVA
        MA ATAVLWR RLG+A+RAA+ACSIVGGVTLFGPAPVR +LTFS+FSY T I  VLSD V+ G AVR G+ V  AVA+VLVSSVPCLWLIGPARF+   A
Subjt:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVA

Query:  ALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEG
        +LAVAVSGFVVAL ER HLL+KRIAFGQLVIVYVGTV++G ETSF MHPL+VASSTA GA+AA  AM++PYPRL+SCQ++ + + Y EN CERVG+L + 
Subjt:  ALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEG

Query:  FGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVLK
        FGAKSKAEA  SIAE KSLST GT+LL  IK N+RGM WERPR+GH + N +   E LEE EVAVRGME ALTSPS A  TMDEELCNLPKNLK +++LK
Subjt:  FGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVLK

Query:  LQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP-NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTS
        LQQLKCS+P +ATTAPE TK  F +TWPL  S I  +N+P SFFLRC++ILLYDSTAA   +V  VENGR+ +GEEA+ LG  G+K TR  ILSNM PT+
Subjt:  LQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP-NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTS

Query:  RSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQ
        +SL FALKCSITLGLAVFLGL+YTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLW+LPL+PWV FTSFL+HSRMYGQ
Subjt:  RSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQ

Query:  AGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDL-NDSSRYIPLIEQHKNLK
         GGI+SALGALLVLGRKNYGIPSEFANARITEACIGLLC+VTVEVVFNPTR+ATLAK EFSKSL ALQDCI+RV LIP+K+L N+SS  + LIEQHK LK
Subjt:  AGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDL-NDSSRYIPLIEQHKNLK

Query:  SHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKE
        SHV QLEKFIVEAG+EPNFWFTPFQG CYDKLLKSLQKTVDILQFLVHEM+ LSLEL+R G VVKELHDS +EDMELF+ KV  SLKFMEK SL+KSLK+
Subjt:  SHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKE

Query:  LHDKNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPSI
        L +KNQN SDEMEMGKS +DGCRAL LSEE++EKIVGSF Q+ANE+ SKVYTN EGEANLKGQMTL LSS+GFC+ECLMRETMVMEKEV QLLKLENPSI
Subjt:  LHDKNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPSI

Query:  HMNLQELSTK
        HMNLQELST+
Subjt:  HMNLQELSTK

XP_038882781.1 uncharacterized protein LOC120073933 isoform X1 [Benincasa hispida]0.0e+0076.14Show/hide
Query:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-
        +  ATA++WR RLG ALRAALAC+IVG VT+FGPAPVR LL FS+FSY T I IVLSD VS+GDAVRG W V +AV  V+V SVPCLWLIGP RF     
Subjt:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-

Query:  AALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAE
        AALAV VS FVVALPERTHLL+KRIAFGQLVIVYVGTV+HG + SFVMHP++VASSTA GALAAV AM+LP+PRLA  QIRK+ R YCEN CER+G + E
Subjt:  AALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAE

Query:  GFGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL
        G GAKSKAEA+A + EAKSLST GT+LLQ+IK N+RGM+WER +MG         EE +EE+EVA+RGMEAALTSPS+A G MDE+LCN   NLK K +L
Subjt:  GFGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL

Query:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP---NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMW
        KLQQ K S+P  +TTAPE TK  F    PLNIS IT + LP SFFLRCMEILLYDSTA+    N++S VE GR+ +GEEA    +  +K TRWGILSNM 
Subjt:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP---NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMW

Query:  PTSRSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRM
        PT++SL FALKCSITLGLAVFLGL YTKPNGYWSGLTVAISFATERQA FTVAN RAQGTAIGSIYGV+CCFIL+KYEYLWLLPLLPWV+FTSFL+HSRM
Subjt:  PTSRSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRM

Query:  YGQAGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLNDSSRYIPLIEQHKN
        YGQ+GGIASALGALLVLGRKNYGIPSEFANARITEACIGLLC++TVE+VFNPTRAATLAKTEFS SL A+QD IKRV LIP+K+LN++S +I LIE HK 
Subjt:  YGQAGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLNDSSRYIPLIEQHKN

Query:  LKSHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSL
        LKSHV QLEKFIVEAGFEPNFWFTPFQGGCY+KLLKSLQKTVDILQ ++HEMKFLSLEL+RSGLVVKELHDS +EDME F+KKVGCSL+FM+KVSLIKSL
Subjt:  LKSHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSL

Query:  KELHDKNQNLSDEMEMGKSPS-DGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLEN
        KEL +KN N   EMEMGK  S DGC+A ALSEED+EKIVGSFCQ ANE+ SK YTN E E NLKGQMTL LSSIGFC+ECLMRETMVMEKEV+QLLKLEN
Subjt:  KELHDKNQNLSDEMEMGKSPS-DGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLEN

Query:  PSIHMNLQELSTK
        PSIH+NLQELSTK
Subjt:  PSIHMNLQELSTK

TrEMBL top hitse value%identityAlignment
A0A0A0L0W0 Uncharacterized protein0.0e+0072.87Show/hide
Query:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-
        +  AT ++WR RLG ALRAALAC IVG VT+FGPAP+R LL FS+FSYFT I ++LSD VS+GDAVRG W V +AV  VLVSSVPCLWLIGP RF S   
Subjt:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-

Query:  AALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAE
        AA+AVAVSGFVVALPERTHLL+KRIAFGQLVIVYVGTV+HG + SFV HP++VASSTA GALAAV AM++P+PRLA  QIRK+ + YCEN  +R+  + E
Subjt:  AALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAE

Query:  GFGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL
        G GAK+K EAVA + EAKSLST  T+LLQ IK N+RG++WER      +      EE LEE+EVA++GMEAALTSPS+  G++DE+L N   NLK K +L
Subjt:  GFGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL

Query:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTS
        KLQQ K ++PP +TTAPE TK +F    PLNIS IT + LP SFFLRCMEILLYDSTA  N+VS VE G++ +GE+A  LG+ G+K+T WGILSNM PT+
Subjt:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTS

Query:  RSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQ
        +SL FALKCSITLGLAVFLGL YTKPNGYWSGLTVAISFATE+QA FTVANARAQGTAIGSIYGV+CCFIL+KYEYLWLLPLLPWV+FTSFL+HSRMYGQ
Subjt:  RSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQ

Query:  AGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLN-DSSRYIPLIEQHKNLK
        +GGIASALGALLVLGRK+YG+PSEFANARITEACIGLLC++TVE++FNPTR ATLAKTEFS +L AL+D IKRV LIP+K+LN ++S ++ LI+ HK L+
Subjt:  AGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLN-DSSRYIPLIEQHKNLK

Query:  SHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKE
        SHV QLEKFIVEAGFEPNFWFTPFQG CY+KLLKSLQKT+DILQ ++HE+KFLSLEL+RSGL+VKELHDS  EDM +F+KK+GCSLKFMEK+SLIKSLKE
Subjt:  SHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKE

Query:  LHDKNQNLSDEMEMGKSPS-DGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPS
        L +KNQN   +MEMGK  S DGC+A AL EED+EKIVGSFCQHANE+ SK Y+N E E NLKGQMTL LSSIGFC+ECLMRETMVMEKEV Q+LKLENPS
Subjt:  LHDKNQNLSDEMEMGKSPS-DGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPS

Query:  IHMNLQELSTK
        IH+NLQELST+
Subjt:  IHMNLQELSTK

A0A1S3BHE3 uncharacterized protein LOC1034901000.0e+0073.24Show/hide
Query:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-
        +  AT ++WR RLG ALRAALAC IVG VT+FGPAPVR LL FS+FSYFT I IVLSD VSLGDAVRG W V +AV  V+VSS+PCLWLIGP RF S   
Subjt:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-

Query:  AALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAE
        AA+AVAVS FVVALPERTHLL+KRIAFGQLVIVYVGTV+HG + SFV HP++VASSTA GALAAV AM++P+PRLA  QIRK+ + YCEN  +RV  + E
Subjt:  AALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAE

Query:  GFGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL
        G GAK+K EAVA + EAKSLST  T+LLQ IK N+RG++WER +MG         EE LEE+EVA++GMEAALTSPS+  G+MDE+L N   NLK K + 
Subjt:  GFGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL

Query:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTS
        KLQQ K ++PP +TTAPE TK  F    PLNIS IT + LP SFFLRCMEILLYDSTA  N+VS VE GR+ +GE+A  LG+  +K+T WG LSNM PT+
Subjt:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTS

Query:  RSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQ
        +SL FALKCSITLGLAVFLGL YTKPNGYWSGLTVAISFATERQA FTVANARAQGTAIGSIYGV+CCFIL+KYEYLWLLPLLPWV+FTSFL+HSRMYGQ
Subjt:  RSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQ

Query:  AGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLN-DSSRYIPLIEQHKNLK
        +GGIASALGALLVLGRK+YG+PSEFANAR+TEACIGLLC++TVE++FNPTR ATLAKTEFS +L AL+D IKRV L+P+K+LN ++S ++ LI+ HK LK
Subjt:  AGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLN-DSSRYIPLIEQHKNLK

Query:  SHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKE
        SHV QL KFIVEAGFEPNFWFTPFQGGCY+K+LKSLQKT+DILQ ++HE+KFLSLEL+ SGL+VKELHDS  EDME+F+KK+GCSLKFMEK+S IKSLKE
Subjt:  SHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKE

Query:  LHDKNQNLSDEMEMGKSPS-DGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPS
        L +KNQN   EMEMGK  S DGC+A AL EED+EKIVGSFCQHANE+ SK YTN E E NLKGQMTL LSSIGFC+ECLMRETMVMEKEV Q+LKLENPS
Subjt:  LHDKNQNLSDEMEMGKSPS-DGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPS

Query:  IHMNLQELSTK
        IH+NLQELST+
Subjt:  IHMNLQELSTK

A0A5D3DIP8 FUSC_2 domain-containing protein0.0e+0073.54Show/hide
Query:  VLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-AALAVA
        ++WR RLG ALRAALAC IVG VT+FGPAPVR LL FS+FSYFT I IVLSD VSLGDAVRG W V +AV  V+VSS+PCLWLIGP RF S   AA+AVA
Subjt:  VLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGV-AALAVA

Query:  VSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEGFGAKS
        VS FVVALPERTHLL+KRIAFGQLVIVYVGTV+HG + SFV HP++VASSTA GALAAV AM++P+PRLA  QIRK+ + YCEN  +RV  + EG GAK+
Subjt:  VSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEGFGAKS

Query:  KAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVLKLQQLK
        K EAVA + EAKSLST  T+LLQ IK N+RG++WER +MG         EE LEE+EVA++GMEAALTSPS+  G+MDE+L N   NLK K + KLQQ K
Subjt:  KAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVLKLQQLK

Query:  CSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTSRSLYFA
         ++PP +TTAPE TK  F    PLNIS IT + LP SFFLRCMEILLYDSTA  N+VS VE GR+ +GE+A  LG+  +K+T WG LSNM PT++SL FA
Subjt:  CSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTSRSLYFA

Query:  LKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQAGGIAS
        LKCSITLGLAVFLGL YTKPNGYWSGLTVAISFATERQA FTVANARAQGTAIGSIYGV+CCFIL+KYEYLWLLPLLPWV+FTSFL+HSRMYGQ+GGIAS
Subjt:  LKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQAGGIAS

Query:  ALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLN-DSSRYIPLIEQHKNLKSHVGQL
        ALGALLVLGRK+YG+PSEFANAR+TEACIGLLC++TVE++FNPTR ATLAKTEFS +L AL+D IKRV L+P+K+LN ++S ++ LI+ HK LKSHV QL
Subjt:  ALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLN-DSSRYIPLIEQHKNLKSHVGQL

Query:  EKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKELHDKNQ
         KFIVEAGFEPNFWFTPFQGGCY+K+LKSLQKT+DILQ ++HE+KFLSLEL+ SGL+VKELHDS  EDME+F+KK+GCSLKFMEK+S IKSLKEL +KNQ
Subjt:  EKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKELHDKNQ

Query:  NLSDEMEMGKSPS-DGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPSIHMNLQ
        N   EMEMGK  S DGC+A AL EED+EKIVGSFCQHANE+ SK YTN E E NLKGQMTL LSSIGFC+ECLMRETMVMEKEV Q+LKLENPSIH+NLQ
Subjt:  NLSDEMEMGKSPS-DGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPSIHMNLQ

Query:  ELSTK
        ELST+
Subjt:  ELSTK

A0A6J1CQM6 uncharacterized protein LOC1110136820.0e+0077.78Show/hide
Query:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVA
        MA ATAVLWR RLG+A+RAA+ACSIVGGVTLFGPAPVR +LTFS+FSY T I  VLSD V+ G AVR G+ V  AVA+VLVSSVPCLWLIGPARF+   A
Subjt:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVA

Query:  ALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEG
        +LAVAVSGFVVAL ER HLL+KRIAFGQLVIVYVGTV++G ETSF MHPL+VASSTA GA+AA  AM++PYPRL+SCQ++ + + Y EN CERVG+L + 
Subjt:  ALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEG

Query:  FGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVLK
        FGAKSKAEA  SIAE KSLST GT+LL  IK N+RGM WERPR+GH + N +   E LEE EVAVRGME ALTSPS A  TMDEELCNLPKNLK +++LK
Subjt:  FGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVLK

Query:  LQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP-NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTS
        LQQLKCS+P +ATTAPE TK  F +TWPL  S I  +N+P SFFLRC++ILLYDSTAA   +V  VENGR+ +GEEA+ LG  G+K TR  ILSNM PT+
Subjt:  LQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP-NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTS

Query:  RSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQ
        +SL FALKCSITLGLAVFLGL+YTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLW+LPL+PWV FTSFL+HSRMYGQ
Subjt:  RSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQ

Query:  AGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDL-NDSSRYIPLIEQHKNLK
         GGI+SALGALLVLGRKNYGIPSEFANARITEACIGLLC+VTVEVVFNPTR+ATLAK EFSKSL ALQDCI+RV LIP+K+L N+SS  + LIEQHK LK
Subjt:  AGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDL-NDSSRYIPLIEQHKNLK

Query:  SHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKE
        SHV QLEKFIVEAG+EPNFWFTPFQG CYDKLLKSLQKTVDILQFLVHEM+ LSLEL+R G VVKELHDS +EDMELF+ KV  SLKFMEK SL+KSLK+
Subjt:  SHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKE

Query:  LHDKNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPSI
        L +KNQN SDEMEMGKS +DGCRAL LSEE++EKIVGSF Q+ANE+ SKVYTN EGEANLKGQMTL LSS+GFC+ECLMRETMVMEKEV QLLKLENPSI
Subjt:  LHDKNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLENPSI

Query:  HMNLQELSTK
        HMNLQELST+
Subjt:  HMNLQELSTK

A0A6J1CR62 uncharacterized protein LOC1110135750.0e+0080.79Show/hide
Query:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVA
        MAVATA LWR RLG+ALRAALACSIVG VTLFGPAPVRH+LTFS+FSY T + IVLSD VSLG AVRG   V +AV  VLV+SVPCLWLIG ARFNS  A
Subjt:  MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVA

Query:  ALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEG
        A AVAVS FVVAL ER HLL+KRIAFGQLVIVYVGT +HG ETSF MHPL+VASSTA GALAA  AM++PYPRL+SCQIRK+ R YCEN CERVG + EG
Subjt:  ALAVAVSGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEG

Query:  FGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPN-LMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL
        FGAK+KA A+ SIAEAKSLS T T+LL++IKI L GM WERPR+   KP+  M     LEE EVA+RGMEAALTSPS A GTMDE+LCNLPKNLK K + 
Subjt:  FGAKSKAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGHAKPN-LMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVL

Query:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP---NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMW
        KLQQLK S+PPNATTAPE TKA F       IS I   NLP SFFLRCMEI+LYDSTAA    N+V  VENG++ +GEEA+ LG  G+K TRWGILSNM 
Subjt:  KLQQLKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP---NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMW

Query:  PTSRSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRM
        PT++SLYFALKCS+TLGLAVFLGL YTKPNGYWSGLTVAISFATERQA FTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWV+FTSFLIHSRM
Subjt:  PTSRSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRM

Query:  YGQAGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDL-NDSSRYIPLIEQHK
        YGQ+GGIASALGALLVLGRKNYGIPSEFANARITEACIGLLC+VTVEVVFNPTRAATLAK EFSKSL ALQDCI+RV LIP+K+L N+SS +I LIE+HK
Subjt:  YGQAGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDL-NDSSRYIPLIEQHK

Query:  NLKSHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKS
         LKSHVGQLEKFI+EAG+EPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLEL+RSGLVVKELHDS +EDMELFNKKVGCSLKFMEKVSL+KS
Subjt:  NLKSHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKS

Query:  LKELHDKNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLEN
        LKEL +KNQNL DEMEMGKS +DGCRAL LSEED+EKIVGSFCQ+ANE+ SKVYTN EGEANLKGQMTL L SIGFC+ECLMRETMVMEKEV QLLKLEN
Subjt:  LKELHDKNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTN-EGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEVYQLLKLEN

Query:  PSIHMNLQELST
        PSIHMNLQELST
Subjt:  PSIHMNLQELST

SwissProt top hitse value%identityAlignment
F4KCE9 Kinetochore-associated protein KNL-2 homolog1.9e-2538.1Show/hide
Query:  SASYFQKTVCLHDWWLVRAENDFNGKTLAVAGLTSRPE-QPVRVFSSAPIIKRYDIFTLETADGICVVIKGFINKLRTTDNGFTAEIFKHFVFGFPPNWQ
        S S FQKTV L DWWL++   +F GK   VAG     E + +RVF+S+PI K  D+FTL  +DGI + ++GF+NK R   NGF  EI + F+FGFPP W+
Subjt:  SASYFQKTVCLHDWWLVRAENDFNGKTLAVAGLTSRPE-QPVRVFSSAPIIKRYDIFTLETADGICVVIKGFINKLRTTDNGFTAEIFKHFVFGFPPNWQ

Query:  TYAANCFEGEAFDSAAAGNVSDTDKSKNGGIHCSPTNMNSPLCENCPMNKQDSGSPLESDSRQGLDHG
            +CFEG++F +      S  +K+    +  SP   ++   ++ P   ++  +  E+D  +  D G
Subjt:  TYAANCFEGEAFDSAAAGNVSDTDKSKNGGIHCSPTNMNSPLCENCPMNKQDSGSPLESDSRQGLDHG

F4KCE9 Kinetochore-associated protein KNL-2 homolog1.0e-1041.57Show/hide
Query:  WKKAKRKVDFDSPGSGGIGRSTRQRKCVIVSAECMSYGRSRSGRLLIPTLEFWRNQLPVYDADRGIRGIQEGLPEPQIQPPTNAGSKSQ
        W+K KRK+DFD   +    +  +Q+K    S + +   RSRSGR+L+ +LEFWRNQ+PVYD DR +  +++G  E    P    GS S+
Subjt:  WKKAKRKVDFDSPGSGGIGRSTRQRKCVIVSAECMSYGRSRSGRLLIPTLEFWRNQLPVYDADRGIRGIQEGLPEPQIQPPTNAGSKSQ

Q8RWD7 Protein EMBRYO DEFECTIVE 16746.9e-1543.16Show/hide
Query:  KTVCLHDWWLVRAENDFNGKTLAVAGLTSRPEQPVRVFSSAPIIKRYDIFTLETADGICVVIKGFINKLRTTDNGFTAEIFKHFVFGFPPNWQTY
        K+V L DWWL +      GK L + G  S     VR+FSS  I KR++  TLE  DGI + I GFIN+ R  +NG + E+   F  GFP +W+ Y
Subjt:  KTVCLHDWWLVRAENDFNGKTLAVAGLTSRPEQPVRVFSSAPIIKRYDIFTLETADGICVVIKGFINKLRTTDNGFTAEIFKHFVFGFPPNWQTY

Arabidopsis top hitse value%identityAlignment
AT2G28780.1 unknown protein5.0e-14639.9Show/hide
Query:  LWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVAALAVAVS
        +WR  L +A R ALAC+IVG  TL+GP  +   + F +FSY TVI ++++D  +LGD +RG W   YA    +  ++  L LI PAR  +   ALA A++
Subjt:  LWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVAALAVAVS

Query:  GFVVALP-ERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEGFGAKSK
         FVV LP   THL++KRIA GQ+V++YV   + GA+T  VMHPLQVA+STA+G +A V A+L+P PRLA+C++++  +   +N   RV +  + F +   
Subjt:  GFVVALP-ERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEGFGAKSK

Query:  AEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERP-----RMGHAKPNLMTFEENLEELEVAVRGMEAALTS-----PSIAIGTMDEELCNLPKNLKAK
          A AS+++A+ L+ + ++L Q +K     M WER      R  +   N     E L+ +E+A+RGME  + S      S+  G + E+L    KN++ +
Subjt:  AEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERP-----RMGHAKPNLMTFEENLEELEVAVRGMEAALTS-----PSIAIGTMDEELCNLPKNLKAK

Query:  VVLKLQQLKCSLPPNATTAPETTKA-AFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP--NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILS
        V+L ++++  S  P+ T   +       L T  L     T ++LP  FFL C+ +L     A P  N V V+EN  K                TR  I  
Subjt:  VVLKLQQLKCSLPPNATTAPETTKA-AFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAP--NIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILS

Query:  NMWPTSRSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIH
        + W  S+ +  ALK S++LGLA+ LG +++KPNGYW+GL VA+SFA  R+ATF V N +AQGT IG++YGV+ CF+ QK+  +  L LLPW +F+SFL  
Subjt:  NMWPTSRSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIH

Query:  SRMYGQAGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLNDSSRYIPLIEQ
        S+MYGQAGGI++A+GA+L+LGRKN+G PSEFA  RI E  IGL C + VE+VF PTRAA +AK E S+S  AL +C         K          ++E 
Subjt:  SRMYGQAGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLNDSSRYIPLIEQ

Query:  HKNLKSHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLI
         K L+SH+ +L+KF  EA  EP+FWF+PF   CY+KL KSL K  D+LQF  + + FL  +        KE+  + ++D++   + +G   K  E+++L+
Subjt:  HKNLKSHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLI

Query:  KSLKELHD---KNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVY--TNEGEANL---KGQMTLSLSSIGFCLECLMRETMVMEKE
        KSL  L     K+ N S ++E+GK+P+    + A+SE   EKI+ ++ QH   V   ++    +GE  +   K ++ LSL ++GFC+E + +ET  +E+ 
Subjt:  KSLKELHD---KNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVY--TNEGEANL---KGQMTLSLSSIGFCLECLMRETMVMEKE

Query:  VYQLLKLENPSIHMNLQELSTK
        V ++++ ENPS H+NL E+S K
Subjt:  VYQLLKLENPSIHMNLQELSTK

AT3G09450.1 CONTAINS InterPro DOMAIN/s: Fusaric acid resistance protein, conserved region (InterPro:IPR006726)2.7e-13139.37Show/hide
Query:  WRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDG-VSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSG-VAALAVAV
        W  RLG ALR A+AC IV   TL+GP P+RH  TF +FSY T I I LSD   + G+ ++    V YA    +  ++  + ++GPA   +G VA +AVA+
Subjt:  WRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDG-VSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSG-VAALAVAV

Query:  SGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFV-MHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEGFGAKS
        + F+VA P  T LL+KRIAFGQ+V+VYV  VV   E + V M P+ VA STA+GA+A++ A+LLP+PRLA  Q+ K  + Y ENA ER+ +  E   A+ 
Subjt:  SGFVVALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFV-MHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEGFGAKS

Query:  KAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGH-AKPNLMTFEENLEELEVAVRGMEAALTS-PSIAIGTMDEELCNLPKNLKAKVVLKLQQ
           A   IA A SLS      L+NIKI+   + WERP     ++   +   E L   +  +RG+E AL S  S   G   +EL  L +  +  +      
Subjt:  KAEAVASIAEAKSLSTTGTRLLQNIKINLRGMVWERPRMGH-AKPNLMTFEENLEELEVAVRGMEAALTS-PSIAIGTMDEELCNLPKNLKAKVVLKLQQ

Query:  LKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTR--WGILSNMWPTSRS
                A  +  T K+     W     S++   LP  FF  C+E+   D  +       V NGR  + EE +   N+G    R  W IL  +W     
Subjt:  LKCSLPPNATTAPETTKAAFLNTWPLNISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTR--WGILSNMWPTSRS

Query:  LYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQAG
          FA KCSI+LGLAV  G++Y K NGYWSGLTVAIS  + RQAT TVAN+R QGTA+GS+YG+ICC + Q+ E    LPLLPW+I   F+ HS++YGQ G
Subjt:  LYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFATERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQAG

Query:  GIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLNDSSRYIPLIEQHKNLKSHV
        G+ +A+ ALL+LGR+NYG P+EFA ARI EA IGLLC+V  E++  P RAATLA+TE S  L+AL DCI+  +L+   +  +      L +    LKSHV
Subjt:  GIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTRAATLAKTEFSKSLEALQDCIKRVNLIPKKDLNDSSRYIPLIEQHKNLKSHV

Query:  GQLEKFIVEAGFEPNFWF-TPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKELH
          LE+F  EA  EP   F        Y++LL S  K  D+  ++   +K     LS     +    D+   ++  F +K+  S+K ++++S  KS   L 
Subjt:  GQLEKFIVEAGFEPNFWF-TPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGLVVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKELH

Query:  DKNQ--NLSDEMEMGKSPSDGCRALAL--SEEDIEKIVGSFCQHANEVPSKVYTNEGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEV
         + Q   +  ++E G + +D    + L  S+ D+E+   SF     E   K+  N  +   K +  L LSS+GFC+  LM+ET+ +  E+
Subjt:  DKNQ--NLSDEMEMGKSPSDGCRALAL--SEEDIEKIVGSFCQHANEVPSKVYTNEGEANLKGQMTLSLSSIGFCLECLMRETMVMEKEV

AT5G02510.1 BEST Arabidopsis thaliana protein match is: Ubiquitin system component Cue protein (TAIR:AT5G32440.1)7.8e-2256Show/hide
Query:  GSKWVDMFVQEMTNAIDLDDARIRAAKILEAFEQNITVYSRESEELKHASLKEHLQSLVNDNQILKRAVAIQHERNLEQEEKTREVHQLKQVLCQYQEQI
        G+KWVD  V EMT AI++DD R R A ILEA E  I   +  S++L++AS+KE LQSL+NDNQILKR +A QH+R+ E EEK ++V  L+ V+ QYQEQ+
Subjt:  GSKWVDMFVQEMTNAIDLDDARIRAAKILEAFEQNITVYSRESEELKHASLKEHLQSLVNDNQILKRAVAIQHERNLEQEEKTREVHQLKQVLCQYQEQI

AT5G02520.1 CONTAINS InterPro DOMAIN/s: SANT associated (InterPro:IPR015216)1.4e-2638.1Show/hide
Query:  SASYFQKTVCLHDWWLVRAENDFNGKTLAVAGLTSRPE-QPVRVFSSAPIIKRYDIFTLETADGICVVIKGFINKLRTTDNGFTAEIFKHFVFGFPPNWQ
        S S FQKTV L DWWL++   +F GK   VAG     E + +RVF+S+PI K  D+FTL  +DGI + ++GF+NK R   NGF  EI + F+FGFPP W+
Subjt:  SASYFQKTVCLHDWWLVRAENDFNGKTLAVAGLTSRPE-QPVRVFSSAPIIKRYDIFTLETADGICVVIKGFINKLRTTDNGFTAEIFKHFVFGFPPNWQ

Query:  TYAANCFEGEAFDSAAAGNVSDTDKSKNGGIHCSPTNMNSPLCENCPMNKQDSGSPLESDSRQGLDHG
            +CFEG++F +      S  +K+    +  SP   ++   ++ P   ++  +  E+D  +  D G
Subjt:  TYAANCFEGEAFDSAAAGNVSDTDKSKNGGIHCSPTNMNSPLCENCPMNKQDSGSPLESDSRQGLDHG

AT5G02520.1 CONTAINS InterPro DOMAIN/s: SANT associated (InterPro:IPR015216)7.3e-1241.57Show/hide
Query:  WKKAKRKVDFDSPGSGGIGRSTRQRKCVIVSAECMSYGRSRSGRLLIPTLEFWRNQLPVYDADRGIRGIQEGLPEPQIQPPTNAGSKSQ
        W+K KRK+DFD   +    +  +Q+K    S + +   RSRSGR+L+ +LEFWRNQ+PVYD DR +  +++G  E    P    GS S+
Subjt:  WKKAKRKVDFDSPGSGGIGRSTRQRKCVIVSAECMSYGRSRSGRLLIPTLEFWRNQLPVYDADRGIRGIQEGLPEPQIQPPTNAGSKSQ

AT5G32440.1 Ubiquitin system component Cue protein3.0e-2131.54Show/hide
Query:  MSAGVCGKRVGFEEIFGSSSPTACSSAKRSRWSSFGSPTRSEFGSGPDDSASVLLQ----MFPGVGAEV--PSFDDFNARVDSA-------TIGNCSTIP
        MSA VCGKR  FE++  +S P +       +   F S + S F      S+S+LL     +FP +  ++   + ++    +DSA        + + +   
Subjt:  MSAGVCGKRVGFEEIFGSSSPTACSSAKRSRWSSFGSPTRSEFGSGPDDSASVLLQ----MFPGVGAEV--PSFDDFNARVDSA-------TIGNCSTIP

Query:  DERTATCSQMLPEKVEEAKDVSSAIAEGNAMH--GSKWVDMFVQEMTNAIDLDDARIRAAKILEAFEQNITVY----SRESEELKHASLKEHLQSLVNDN
        D  T     ++ E   E +   SA  E N ++  G++WV++FV+EM NA D+ DA+ RAA+ LEA E++I       + ++ + ++  LK+ L+++V +N
Subjt:  DERTATCSQMLPEKVEEAKDVSSAIAEGNAMH--GSKWVDMFVQEMTNAIDLDDARIRAAKILEAFEQNITVY----SRESEELKHASLKEHLQSLVNDN

Query:  QILKRAVAIQHERNLEQEEKTREVHQLKQVLCQYQEQIQTV
         +LKRAV  Q +R  E E++++E+  L+Q++ QYQEQ++T+
Subjt:  QILKRAVAIQHERNLEQEEKTREVHQLKQVLCQYQEQIQTV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTGGCGACGGCGGTGCTGTGGCGGGCTCGCCTAGGCGCGGCGCTGCGAGCTGCTCTGGCATGCAGCATAGTGGGCGGCGTCACGCTTTTCGGGCCAGCGCCGGT
GAGGCACTTGCTGACCTTCTCGTCTTTCTCTTATTTCACCGTCATTCCGATAGTACTTTCCGACGGGGTGTCGCTAGGCGACGCCGTGAGAGGTGGCTGGCAGGTGGGGT
ACGCGGTGGCCATCGTTCTGGTTTCGTCTGTACCATGCTTGTGGCTGATCGGACCGGCGCGGTTCAACAGTGGGGTGGCGGCGTTGGCGGTGGCGGTGAGTGGGTTTGTG
GTGGCGCTGCCGGAACGCACCCACTTGTTGTCGAAGCGAATCGCGTTCGGGCAGCTGGTGATCGTGTACGTCGGGACAGTGGTTCACGGCGCTGAAACGAGTTTTGTGAT
GCACCCACTTCAAGTTGCGTCGAGTACGGCCGTCGGAGCTCTGGCCGCCGTCGGAGCCATGTTGCTTCCGTACCCTCGGCTCGCCTCCTGCCAGATAAGGAAAGTGGTTA
GAGCATATTGCGAGAATGCGTGTGAGAGGGTGGGGGTTTTGGCAGAAGGGTTCGGTGCGAAGAGCAAAGCAGAAGCAGTTGCGTCAATTGCCGAAGCCAAGTCTCTATCC
ACAACAGGAACAAGGCTTCTTCAAAACATCAAAATAAATTTGCGGGGAATGGTTTGGGAGAGACCACGGATGGGCCATGCGAAGCCCAATCTCATGACATTTGAAGAAAA
TTTAGAAGAATTGGAAGTAGCAGTGAGAGGGATGGAAGCCGCCTTAACTTCCCCTTCCATTGCCATTGGAACCATGGACGAAGAACTCTGCAACTTGCCCAAGAATCTCA
AAGCCAAAGTGGTCTTAAAGCTACAGCAACTCAAGTGCTCCCTGCCGCCTAATGCCACCACTGCGCCGGAGACGACGAAGGCTGCGTTCTTAAACACTTGGCCGCTCAAC
ATTTCCTCCATTACCCGCGAGAATCTGCCGGCTTCTTTCTTCTTGCGTTGCATGGAAATCCTTCTGTACGACTCAACCGCAGCCCCGAACATCGTCTCCGTCGTGGAGAA
CGGTCGGAAACCCAGCGGGGAAGAAGCAGTCGTGTTGGGGAATCAGGGCTCCAAACGAACTCGTTGGGGCATTTTGTCGAACATGTGGCCGACGAGTCGGAGTTTGTATT
TTGCGCTGAAGTGCTCGATTACATTGGGTCTTGCTGTGTTTCTGGGTCTGATTTATACAAAGCCGAACGGGTATTGGTCAGGATTGACGGTGGCCATCAGCTTTGCAACA
GAGAGACAAGCAACATTTACTGTTGCAAACGCTCGAGCTCAAGGAACGGCAATCGGGTCAATCTATGGTGTCATATGCTGTTTTATTTTGCAAAAATATGAGTATTTATG
GCTCTTACCTCTTCTCCCTTGGGTTATTTTTACCAGTTTTCTAATCCATAGTAGAATGTATGGTCAAGCTGGTGGGATCGCATCAGCATTAGGCGCATTGTTAGTTCTTG
GGAGGAAGAATTATGGCATTCCATCTGAGTTTGCAAATGCTAGAATCACAGAAGCTTGCATTGGATTGCTTTGTTATGTAACAGTGGAGGTTGTCTTCAATCCAACAAGA
GCAGCAACTTTGGCAAAAACTGAGTTCTCAAAAAGTTTGGAGGCCCTTCAAGATTGCATCAAAAGGGTAAATCTCATTCCCAAAAAGGACTTGAATGATTCTTCCCGTTA
CATTCCATTGATAGAACAGCACAAAAATCTGAAATCCCATGTTGGTCAGTTAGAAAAGTTCATTGTAGAAGCTGGGTTTGAGCCTAATTTCTGGTTCACGCCATTCCAAG
GTGGCTGCTATGATAAGCTTCTGAAATCTCTCCAGAAAACAGTGGATATCTTACAATTTCTGGTTCATGAAATGAAGTTTCTCTCTCTAGAGCTTAGTAGATCTGGGCTT
GTTGTGAAGGAGCTCCATGATAGTTTTAATGAAGATATGGAGCTTTTCAACAAAAAAGTTGGATGTTCTCTGAAATTCATGGAGAAGGTGAGCCTAATAAAGTCACTAAA
GGAATTGCACGACAAAAACCAGAACCTGAGTGATGAAATGGAGATGGGGAAGTCTCCAAGTGATGGATGCAGAGCTCTTGCTCTGAGTGAAGAAGACATTGAGAAAATTG
TTGGTTCCTTCTGTCAACATGCAAATGAAGTACCGAGCAAAGTTTACACAAATGAAGGTGAAGCAAATCTGAAAGGCCAAATGACACTTTCTTTGAGTTCAATTGGGTTC
TGTCTGGAATGTTTGATGAGAGAAACAATGGTGATGGAGAAAGAAGTATATCAACTGTTGAAGCTGGAGAATCCATCCATTCATATGAATCTGCAAGAACTTTCAACAAA
AGAGGTGGATATGTCGGCTGGAGTCTGTGGAAAGCGGGTGGGGTTTGAGGAGATCTTTGGATCTTCTTCTCCGACGGCCTGTTCTTCGGCTAAGAGGTCCCGGTGGTCCA
GTTTCGGATCCCCGACCCGATCGGAGTTCGGGTCCGGTCCAGATGATTCTGCTTCGGTTTTGCTCCAAATGTTTCCTGGAGTCGGTGCCGAGGTGCCTTCCTTTGATGAT
TTTAATGCTCGTGTTGATTCTGCAACAATTGGAAATTGTTCTACTATTCCTGATGAGAGAACAGCTACATGCAGTCAGATGTTGCCCGAGAAGGTTGAAGAAGCTAAAGA
CGTTAGTTCAGCCATTGCTGAAGGAAATGCAATGCATGGGTCCAAGTGGGTGGATATGTTTGTGCAAGAGATGACAAATGCGATAGATCTTGATGATGCTAGAATTAGAG
CAGCAAAGATTTTAGAAGCTTTTGAACAAAACATAACTGTGTATTCAAGAGAATCAGAGGAGTTAAAGCATGCTTCTTTGAAGGAGCATTTGCAAAGCTTGGTAAACGAC
AACCAAATTTTAAAGAGAGCAGTAGCCATTCAGCACGAGCGCAATCTCGAGCAAGAAGAGAAGACAAGAGAAGTCCATCAGTTAAAGCAGGTCTTATGCCAGTATCAAGA
ACAAATTCAAACAGTGTCAGGATTTGGGATTTCTGAATTTGGGACTTGGGATGGGGTTGTAATCAGAATTGTTGATGCCATTCCTTGTAAGAAATTACACGTTAAATCTC
CATCTACAAAGGGCACAATCTGTTTCAGGACACTTCCACCAAGACATATTTTAAGGATCATGGCTTTCACTCCAGAATCCCATCGAACAACAACCCTCAACAGCGACAGC
AAGAACCGCGTCGGCGCTTCAGCTTCTTACTTCCAGAAAACAGTCTGTTTGCACGATTGGTGGTTGGTCAGAGCTGAAAATGACTTCAATGGGAAAACACTAGCCGTCGC
TGGGTTAACATCCAGACCGGAACAGCCTGTTCGAGTATTTTCTTCAGCGCCGATTATTAAGAGATACGATATTTTCACTCTCGAGACCGCTGATGGAATCTGTGTTGTTA
TTAAGGGTTTCATAAACAAACTGCGTACTACTGATAATGGGTTCACAGCTGAGATTTTCAAGCATTTTGTGTTTGGCTTTCCTCCCAACTGGCAAACTTATGCAGCAAAT
TGCTTTGAGGGAGAAGCTTTTGATAGTGCTGCTGCGGGAAATGTTTCTGATACGGACAAATCAAAAAACGGTGGCATACACTGCTCTCCCACCAATATGAATTCTCCTTT
GTGTGAAAATTGTCCAATGAATAAGCAAGATAGTGGCAGTCCACTGGAAAGTGATTCAAGACAAGGATTGGATCACGGAGAAGCTATGGCTGGAGGTGTGATGCAGAGTA
GCAGTGCAACTAAAAGTGCAGTGTCTTTCACTGATGAACATACTGGTGATGATATTGAAGAGGAGGAAATTAAAAGGAAAGGAAAAGAGGTAAAAGAATGTTGGAAGAAA
GCTAAGAGGAAAGTTGATTTTGATTCACCTGGAAGTGGTGGTATAGGAAGGAGTACAAGGCAAAGGAAGTGTGTTATAGTATCGGCAGAATGTATGAGTTATGGGCGATC
CCGATCAGGGAGATTACTTATACCCACATTGGAATTTTGGCGGAACCAGTTACCTGTTTATGATGCGGATAGGGGGATTAGAGGAATCCAAGAAGGCTTGCCTGAGCCTC
AGATACAACCACCTACAAATGCAGGGAGCAAAAGCCAAGGGGTTGCCGCTACCACCATGGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGTGGCGACGGCGGTGCTGTGGCGGGCTCGCCTAGGCGCGGCGCTGCGAGCTGCTCTGGCATGCAGCATAGTGGGCGGCGTCACGCTTTTCGGGCCAGCGCCGGT
GAGGCACTTGCTGACCTTCTCGTCTTTCTCTTATTTCACCGTCATTCCGATAGTACTTTCCGACGGGGTGTCGCTAGGCGACGCCGTGAGAGGTGGCTGGCAGGTGGGGT
ACGCGGTGGCCATCGTTCTGGTTTCGTCTGTACCATGCTTGTGGCTGATCGGACCGGCGCGGTTCAACAGTGGGGTGGCGGCGTTGGCGGTGGCGGTGAGTGGGTTTGTG
GTGGCGCTGCCGGAACGCACCCACTTGTTGTCGAAGCGAATCGCGTTCGGGCAGCTGGTGATCGTGTACGTCGGGACAGTGGTTCACGGCGCTGAAACGAGTTTTGTGAT
GCACCCACTTCAAGTTGCGTCGAGTACGGCCGTCGGAGCTCTGGCCGCCGTCGGAGCCATGTTGCTTCCGTACCCTCGGCTCGCCTCCTGCCAGATAAGGAAAGTGGTTA
GAGCATATTGCGAGAATGCGTGTGAGAGGGTGGGGGTTTTGGCAGAAGGGTTCGGTGCGAAGAGCAAAGCAGAAGCAGTTGCGTCAATTGCCGAAGCCAAGTCTCTATCC
ACAACAGGAACAAGGCTTCTTCAAAACATCAAAATAAATTTGCGGGGAATGGTTTGGGAGAGACCACGGATGGGCCATGCGAAGCCCAATCTCATGACATTTGAAGAAAA
TTTAGAAGAATTGGAAGTAGCAGTGAGAGGGATGGAAGCCGCCTTAACTTCCCCTTCCATTGCCATTGGAACCATGGACGAAGAACTCTGCAACTTGCCCAAGAATCTCA
AAGCCAAAGTGGTCTTAAAGCTACAGCAACTCAAGTGCTCCCTGCCGCCTAATGCCACCACTGCGCCGGAGACGACGAAGGCTGCGTTCTTAAACACTTGGCCGCTCAAC
ATTTCCTCCATTACCCGCGAGAATCTGCCGGCTTCTTTCTTCTTGCGTTGCATGGAAATCCTTCTGTACGACTCAACCGCAGCCCCGAACATCGTCTCCGTCGTGGAGAA
CGGTCGGAAACCCAGCGGGGAAGAAGCAGTCGTGTTGGGGAATCAGGGCTCCAAACGAACTCGTTGGGGCATTTTGTCGAACATGTGGCCGACGAGTCGGAGTTTGTATT
TTGCGCTGAAGTGCTCGATTACATTGGGTCTTGCTGTGTTTCTGGGTCTGATTTATACAAAGCCGAACGGGTATTGGTCAGGATTGACGGTGGCCATCAGCTTTGCAACA
GAGAGACAAGCAACATTTACTGTTGCAAACGCTCGAGCTCAAGGAACGGCAATCGGGTCAATCTATGGTGTCATATGCTGTTTTATTTTGCAAAAATATGAGTATTTATG
GCTCTTACCTCTTCTCCCTTGGGTTATTTTTACCAGTTTTCTAATCCATAGTAGAATGTATGGTCAAGCTGGTGGGATCGCATCAGCATTAGGCGCATTGTTAGTTCTTG
GGAGGAAGAATTATGGCATTCCATCTGAGTTTGCAAATGCTAGAATCACAGAAGCTTGCATTGGATTGCTTTGTTATGTAACAGTGGAGGTTGTCTTCAATCCAACAAGA
GCAGCAACTTTGGCAAAAACTGAGTTCTCAAAAAGTTTGGAGGCCCTTCAAGATTGCATCAAAAGGGTAAATCTCATTCCCAAAAAGGACTTGAATGATTCTTCCCGTTA
CATTCCATTGATAGAACAGCACAAAAATCTGAAATCCCATGTTGGTCAGTTAGAAAAGTTCATTGTAGAAGCTGGGTTTGAGCCTAATTTCTGGTTCACGCCATTCCAAG
GTGGCTGCTATGATAAGCTTCTGAAATCTCTCCAGAAAACAGTGGATATCTTACAATTTCTGGTTCATGAAATGAAGTTTCTCTCTCTAGAGCTTAGTAGATCTGGGCTT
GTTGTGAAGGAGCTCCATGATAGTTTTAATGAAGATATGGAGCTTTTCAACAAAAAAGTTGGATGTTCTCTGAAATTCATGGAGAAGGTGAGCCTAATAAAGTCACTAAA
GGAATTGCACGACAAAAACCAGAACCTGAGTGATGAAATGGAGATGGGGAAGTCTCCAAGTGATGGATGCAGAGCTCTTGCTCTGAGTGAAGAAGACATTGAGAAAATTG
TTGGTTCCTTCTGTCAACATGCAAATGAAGTACCGAGCAAAGTTTACACAAATGAAGGTGAAGCAAATCTGAAAGGCCAAATGACACTTTCTTTGAGTTCAATTGGGTTC
TGTCTGGAATGTTTGATGAGAGAAACAATGGTGATGGAGAAAGAAGTATATCAACTGTTGAAGCTGGAGAATCCATCCATTCATATGAATCTGCAAGAACTTTCAACAAA
AGAGGTGGATATGTCGGCTGGAGTCTGTGGAAAGCGGGTGGGGTTTGAGGAGATCTTTGGATCTTCTTCTCCGACGGCCTGTTCTTCGGCTAAGAGGTCCCGGTGGTCCA
GTTTCGGATCCCCGACCCGATCGGAGTTCGGGTCCGGTCCAGATGATTCTGCTTCGGTTTTGCTCCAAATGTTTCCTGGAGTCGGTGCCGAGGTGCCTTCCTTTGATGAT
TTTAATGCTCGTGTTGATTCTGCAACAATTGGAAATTGTTCTACTATTCCTGATGAGAGAACAGCTACATGCAGTCAGATGTTGCCCGAGAAGGTTGAAGAAGCTAAAGA
CGTTAGTTCAGCCATTGCTGAAGGAAATGCAATGCATGGGTCCAAGTGGGTGGATATGTTTGTGCAAGAGATGACAAATGCGATAGATCTTGATGATGCTAGAATTAGAG
CAGCAAAGATTTTAGAAGCTTTTGAACAAAACATAACTGTGTATTCAAGAGAATCAGAGGAGTTAAAGCATGCTTCTTTGAAGGAGCATTTGCAAAGCTTGGTAAACGAC
AACCAAATTTTAAAGAGAGCAGTAGCCATTCAGCACGAGCGCAATCTCGAGCAAGAAGAGAAGACAAGAGAAGTCCATCAGTTAAAGCAGGTCTTATGCCAGTATCAAGA
ACAAATTCAAACAGTGTCAGGATTTGGGATTTCTGAATTTGGGACTTGGGATGGGGTTGTAATCAGAATTGTTGATGCCATTCCTTGTAAGAAATTACACGTTAAATCTC
CATCTACAAAGGGCACAATCTGTTTCAGGACACTTCCACCAAGACATATTTTAAGGATCATGGCTTTCACTCCAGAATCCCATCGAACAACAACCCTCAACAGCGACAGC
AAGAACCGCGTCGGCGCTTCAGCTTCTTACTTCCAGAAAACAGTCTGTTTGCACGATTGGTGGTTGGTCAGAGCTGAAAATGACTTCAATGGGAAAACACTAGCCGTCGC
TGGGTTAACATCCAGACCGGAACAGCCTGTTCGAGTATTTTCTTCAGCGCCGATTATTAAGAGATACGATATTTTCACTCTCGAGACCGCTGATGGAATCTGTGTTGTTA
TTAAGGGTTTCATAAACAAACTGCGTACTACTGATAATGGGTTCACAGCTGAGATTTTCAAGCATTTTGTGTTTGGCTTTCCTCCCAACTGGCAAACTTATGCAGCAAAT
TGCTTTGAGGGAGAAGCTTTTGATAGTGCTGCTGCGGGAAATGTTTCTGATACGGACAAATCAAAAAACGGTGGCATACACTGCTCTCCCACCAATATGAATTCTCCTTT
GTGTGAAAATTGTCCAATGAATAAGCAAGATAGTGGCAGTCCACTGGAAAGTGATTCAAGACAAGGATTGGATCACGGAGAAGCTATGGCTGGAGGTGTGATGCAGAGTA
GCAGTGCAACTAAAAGTGCAGTGTCTTTCACTGATGAACATACTGGTGATGATATTGAAGAGGAGGAAATTAAAAGGAAAGGAAAAGAGGTAAAAGAATGTTGGAAGAAA
GCTAAGAGGAAAGTTGATTTTGATTCACCTGGAAGTGGTGGTATAGGAAGGAGTACAAGGCAAAGGAAGTGTGTTATAGTATCGGCAGAATGTATGAGTTATGGGCGATC
CCGATCAGGGAGATTACTTATACCCACATTGGAATTTTGGCGGAACCAGTTACCTGTTTATGATGCGGATAGGGGGATTAGAGGAATCCAAGAAGGCTTGCCTGAGCCTC
AGATACAACCACCTACAAATGCAGGGAGCAAAAGCCAAGGGGTTGCCGCTACCACCATGGCCTAA
Protein sequenceShow/hide protein sequence
MAVATAVLWRARLGAALRAALACSIVGGVTLFGPAPVRHLLTFSSFSYFTVIPIVLSDGVSLGDAVRGGWQVGYAVAIVLVSSVPCLWLIGPARFNSGVAALAVAVSGFV
VALPERTHLLSKRIAFGQLVIVYVGTVVHGAETSFVMHPLQVASSTAVGALAAVGAMLLPYPRLASCQIRKVVRAYCENACERVGVLAEGFGAKSKAEAVASIAEAKSLS
TTGTRLLQNIKINLRGMVWERPRMGHAKPNLMTFEENLEELEVAVRGMEAALTSPSIAIGTMDEELCNLPKNLKAKVVLKLQQLKCSLPPNATTAPETTKAAFLNTWPLN
ISSITRENLPASFFLRCMEILLYDSTAAPNIVSVVENGRKPSGEEAVVLGNQGSKRTRWGILSNMWPTSRSLYFALKCSITLGLAVFLGLIYTKPNGYWSGLTVAISFAT
ERQATFTVANARAQGTAIGSIYGVICCFILQKYEYLWLLPLLPWVIFTSFLIHSRMYGQAGGIASALGALLVLGRKNYGIPSEFANARITEACIGLLCYVTVEVVFNPTR
AATLAKTEFSKSLEALQDCIKRVNLIPKKDLNDSSRYIPLIEQHKNLKSHVGQLEKFIVEAGFEPNFWFTPFQGGCYDKLLKSLQKTVDILQFLVHEMKFLSLELSRSGL
VVKELHDSFNEDMELFNKKVGCSLKFMEKVSLIKSLKELHDKNQNLSDEMEMGKSPSDGCRALALSEEDIEKIVGSFCQHANEVPSKVYTNEGEANLKGQMTLSLSSIGF
CLECLMRETMVMEKEVYQLLKLENPSIHMNLQELSTKEVDMSAGVCGKRVGFEEIFGSSSPTACSSAKRSRWSSFGSPTRSEFGSGPDDSASVLLQMFPGVGAEVPSFDD
FNARVDSATIGNCSTIPDERTATCSQMLPEKVEEAKDVSSAIAEGNAMHGSKWVDMFVQEMTNAIDLDDARIRAAKILEAFEQNITVYSRESEELKHASLKEHLQSLVND
NQILKRAVAIQHERNLEQEEKTREVHQLKQVLCQYQEQIQTVSGFGISEFGTWDGVVIRIVDAIPCKKLHVKSPSTKGTICFRTLPPRHILRIMAFTPESHRTTTLNSDS
KNRVGASASYFQKTVCLHDWWLVRAENDFNGKTLAVAGLTSRPEQPVRVFSSAPIIKRYDIFTLETADGICVVIKGFINKLRTTDNGFTAEIFKHFVFGFPPNWQTYAAN
CFEGEAFDSAAAGNVSDTDKSKNGGIHCSPTNMNSPLCENCPMNKQDSGSPLESDSRQGLDHGEAMAGGVMQSSSATKSAVSFTDEHTGDDIEEEEIKRKGKEVKECWKK
AKRKVDFDSPGSGGIGRSTRQRKCVIVSAECMSYGRSRSGRLLIPTLEFWRNQLPVYDADRGIRGIQEGLPEPQIQPPTNAGSKSQGVAATTMA