; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G016480 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G016480
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSulfate adenylyltransferase
Genome locationchr05:24123857..24132411
RNA-Seq ExpressionLsi05G016480
SyntenyLsi05G016480
Gene Ontology termsGO:0000103 - sulfate assimilation (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0004020 - adenylylsulfate kinase activity (molecular function)
GO:0004781 - sulfate adenylyltransferase (ATP) activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR002650 - Sulphate adenylyltransferase
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold
IPR015947 - PUA-like superfamily
IPR024951 - Sulphate adenylyltransferase catalytic domain
IPR025980 - ATP-sulfurylase PUA-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049322.1 serine-aspartate repeat-containing protein F isoform X1 [Cucumis melo var. makuwa]0.0e+0075.66Show/hide
Query:  LIASLDLLATTDQAVKYHHHHNNGFVELNMGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEE
        +++SLD L+TT QAVKYHHHH+NGFVELNMGNEMGNNNTSEFREEEK KAEGPE  LLEGG+N  +ADEVAGF QKEAR +SG+AD+VNGDHH++EKEEE
Subjt:  LIASLDLLATTDQAVKYHHHHNNGFVELNMGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEE

Query:  KNKACNIAEFQVVSEKLLDRTKEDNEVDVKPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAES
        KNK  +I EFQVVSE L D+T EDN V+VK KE+KEM  +SEENRSD       +QAS+QKEEE +RGSNLNTTVLSL++P++E                
Subjt:  KNKACNIAEFQVVSEKLLDRTKEDNEVDVKPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAES

Query:  LIEDSSKSQEECKDALELSLNNTYHSADCVMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDG
              K++E+CKDALE S  NT HSADC + DSEENTNM+QVIDTDR TN+ENDGEKGK S I+MIT+ASEDEKSE TSD ++DQVIDND DTDKE+DG
Subjt:  LIEDSSKSQEECKDALELSLNNTYHSADCVMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDG

Query:  EKGKGSNFNMITHASRDPKSEMNSEFDTDQVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIH
        E+G+GS+FN  THA +DPKSE+NS+FD+DQVIDTD+DA+KEN+EERR GSNF+   +S++P  EK SNL+++QVV TD D DKENDE RGN SNFD LI 
Subjt:  EKGKGSNFNMITHASRDPKSEMNSEFDTDQVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIH

Query:  TSKDPNSETSSDLKSIQHESLETNAESLTRSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVP
         SKDPNSE SS+L+  QHES ETNAESLT SSDDG TDMEKKKGDLVEPR CHGY  P AK VDTKD+GT TD+ICHN+S SLAEE LVIESP SS+QVP
Subjt:  TSKDPNSETSSDLKSIQHESLETNAESLTRSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVP

Query:  QVEDRCTVLKKESQLREEHLGTETVVEDNIPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSS
        +VED     K E QLREE LGTETV EDNIPTQSKI+N VQE+FNT  SHSE NAEE EVSPEFVA  EN+NEAP EDCEDSDGEYLEISEQGM+ILN S
Subjt:  QVEDRCTVLKKESQLREEHLGTETVVEDNIPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSS

Query:  IGDCKHKNEEMGEMTEISTNDEH-AERREPDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETK
        IGDCKHKNEEMGE TE+STN+EH A+RREPDES FEP+LGFQPQ QQKE +I  QTAESTDE +SAP QE DT TEKSK+NPSDS SY QTAS+T TETK
Subjt:  IGDCKHKNEEMGEMTEISTNDEH-AERREPDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETK

Query:  PSINPIDEQSLTTLPFSTFGGEDQDSPGRTSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGR
        P+ NPIDEQSL TLPFSTFGGEDQDSPGRTSNESISENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL NLQEISFPNPMEKRV+KLGR
Subjt:  PSINPIDEQSLTTLPFSTFGGEDQDSPGRTSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGR

Query:  SDSEKSRSSFPGFAKEKEESGMEFKAIDQNNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        SDSEKSR SFPGF KEKEESGMEFKAIDQNNF  +KKVAK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  SDSEKSRSSFPGFAKEKEESGMEFKAIDQNNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

XP_008438604.1 PREDICTED: serine-aspartate repeat-containing protein F isoform X1 [Cucumis melo]0.0e+0075.89Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV
        MGNEMGNNNTSEFREEEK KAEGPE  LLEGG+N  +ADEVAGF QKEAR +SG+AD+VNGDHH++EKEEEKNK  +I EFQVVSE L D+T EDN V+V
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV

Query:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC
        K KE+KEM  +SEENRSD       +QAS+QKEEE +RGSNLNTTVLSL++P++E                      K++E+CKDALE S  NT HSADC
Subjt:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC

Query:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD
         + DSEENTNM+QVIDTDR TN+ENDGEKGK S I+MIT+ASEDEKSE TSD ++DQVIDND DTDKE+DGE+G+GS+FN  THA +DPKSE+NS+FD+D
Subjt:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD

Query:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT
        QVIDTD+DA+KEN+EERR GSNF+   +S++P  EK SNL+++QVV TD D DKENDE RGN SNFD LI  SKDPNSE SS+L+  QHES ETNAESLT
Subjt:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT

Query:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN
         SSDDG TDMEKKKGDLVEPR CHGY  P AK VDTKD+GT TD+ICHN+S SLAEE LVIESP SS+QVP+VED     K E QLREE LGTETV EDN
Subjt:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN

Query:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE
        IPTQSKI+N VQE+FNT  SHSE NAEE EVSPEFVA  EN+NEAP EDCEDSDGEYLEISEQGM+ILN SIGDCKHKNEEMGE TE+ST++EH AERRE
Subjt:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE

Query:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR
        PDES FEP+LGFQPQ QQKE +I  QTAESTDE +SAP QE DT TEKSK+NPSDS SY QTAS+T TETKPS NPIDEQSL TLPFSTFGGEDQDSPGR
Subjt:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR

Query:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ
        TSNESISENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL NLQEISFPNPMEKRVVKLGRSDSEKSR SFPGF KEKEESGMEFKAIDQ
Subjt:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ

Query:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NNF  +KKVAK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

XP_008438605.1 PREDICTED: serine-aspartate repeat-containing protein F isoform X2 [Cucumis melo]0.0e+0073.87Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV
        MGNEMGNNNTSEFREEEK KAEGPE  LLEGG+N  +ADEVAGF QKEAR +SG+AD+VNGDHH++E                         KEDN V+V
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV

Query:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC
        K KE+KEM  +SEENRSD       +QAS+QKEEE +RGSNLNTTVLSL++P++E                      K++E+CKDALE S  NT HSADC
Subjt:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC

Query:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD
         + DSEENTNM+QVIDTDR TN+ENDGEKGK S I+MIT+ASEDEKSE TSD ++DQVIDND DTDKE+DGE+G+GS+FN  THA +DPKSE+NS+FD+D
Subjt:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD

Query:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT
        QVIDTD+DA+KEN+EERR GSNF+   +S++P  EK SNL+++QVV TD D DKENDE RGN SNFD LI  SKDPNSE SS+L+  QHES ETNAESLT
Subjt:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT

Query:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN
         SSDDG TDMEKKKGDLVEPR CHGY  P AK VDTKD+GT TD+ICHN+S SLAEE LVIESP SS+QVP+VED     K E QLREE LGTETV EDN
Subjt:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN

Query:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE
        IPTQSKI+N VQE+FNT  SHSE NAEE EVSPEFVA  EN+NEAP EDCEDSDGEYLEISEQGM+ILN SIGDCKHKNEEMGE TE+ST++EH AERRE
Subjt:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE

Query:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR
        PDES FEP+LGFQPQ QQKE +I  QTAESTDE +SAP QE DT TEKSK+NPSDS SY QTAS+T TETKPS NPIDEQSL TLPFSTFGGEDQDSPGR
Subjt:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR

Query:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ
        TSNESISENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL NLQEISFPNPMEKRVVKLGRSDSEKSR SFPGF KEKEESGMEFKAIDQ
Subjt:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ

Query:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NNF  +KKVAK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

XP_011650954.1 myb-like protein X isoform X1 [Cucumis sativus]0.0e+0076.13Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV
        MGNEMGNNNTSEFREEEK KAEGPE  LLEG +N  + DEVAGF QKEAR +SG AD+VNG+HH+ EKEEEKNK CNI EFQVVSE L DRT EDN V+V
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV

Query:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC
        K KE+K++ FNSEENRSDGN  EHE+QASNQKEEE +RGSNLN  VLSL++P +E                      K++E+CKDALE S  NT HSADC
Subjt:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC

Query:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD
         + DSEENTNMD VIDTDR TN+ENDGEKG  S IDMIT+ASEDEKSEKTSD +IDQV+DNDRDTDKE+DGE+G+GS+ N  THA +DPKSE NS FD+D
Subjt:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD

Query:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT
        QVIDTD    KEN+EERR GSNF+   +S++P  EK SNL+T+QVV TD D DK NDE+RGN SNFD LI  SK+PNSE SSDL+SIQHE  ETNAESLT
Subjt:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT

Query:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN
         SSDDGDTDMEKKKGDLVEPR CHGY +P AK VDTKD+GT TDL CHN+SCSLAEE LVIESP SS+Q+P+VE+     K+E QLR EHLGTETV EDN
Subjt:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN

Query:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE
        IPTQSKI+N V+E+FNT  SHSENNAEE EVSPEFV   EN+NEAP EDCEDSDGEYLEISEQGM+ILN SIGDCKHKNEEMGE TE+STN+EH  ERRE
Subjt:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE

Query:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR
        PDESLFEP+LGFQPQ QQKE +I  QTAESTDE +SAP QE DT TEKSK+NPSDSPSY QTAS+T TET+PS NPIDEQSL TLPFSTFGGEDQDSPGR
Subjt:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR

Query:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ
        TSNESISENSIGHIEMRKSPSFNIDIQ EG+ GETEK PLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSR SFPGF KEKEES ME KAIDQ
Subjt:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ

Query:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NNF ++KK AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

XP_038881451.1 uncharacterized protein LOC120072973 [Benincasa hispida]1.5e-29570.92Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV
        MGNEMGNNNTSEFREEEKAK+EGPEKSL EGG N V+ADEVAG  QKEAR +SG+ADR NGD H+TEK EEKNKACNI EFQ+V ++L DRTKEDNEVD+
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV

Query:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC
        K KE KEM F+SEENRS+GNEHEHE+QASNQKEE+GERGSNLNT VLSL +P +EKI DFK NQHELLAESLIEDS+KS EECKDALELSLN   HSADC
Subjt:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC

Query:  VMADSEENTNMDQVIDTDRGTNRENDGEKGKSNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTDQ
         MADSEENTNMDQVIDT                                           DRDTD+E+DGEK KGSNF+MIT+A                
Subjt:  VMADSEENTNMDQVIDTDRGTNRENDGEKGKSNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTDQ

Query:  VIDTDLDAYKENNEERRNGSNFDTSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLTRSSD
                                                                            SKDPNSE S DLKS+QHES ETNA+SLT SSD
Subjt:  VIDTDLDAYKENNEERRNGSNFDTSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLTRSSD

Query:  DGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDNIPTQ
        DGDTDMEKKKGDLVEPRPCHGY +P AK VDTKDEGTA DLICHNSSCSLAEESLVIESP S MQVP+VE+RCTVLK+E QLREEHLGTETVV+DNIPTQ
Subjt:  DGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDNIPTQ

Query:  SKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERREPD-E
        SKI+NGVQ +FNTIGSHSEN+AE+TEV PEFVAK++N+ EAP EDCEDSDGEYLEISEQG +I N SIGD KHK+E+ GE TE+STN+ H  ERREPD E
Subjt:  SKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERREPD-E

Query:  SLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGRTSN
        SLFEPVLGFQ QIQQKE SI  QTAEST + +SAP QE  T TEKSK NPSDSPSYIQTASATLTETKPS NP+DEQS T LPFSTFGGEDQ+SPGRTSN
Subjt:  SLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGRTSN

Query:  ESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQNNF
        ES SENSIGHIEMRKSPSFNIDIQSEGR GETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSE SR SFPGFAKE+EE GMEFKAIDQNNF
Subjt:  ESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQNNF

Query:  AADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        A D K AKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  AADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

TrEMBL top hitse value%identityAlignment
A0A0A0L8I8 Uncharacterized protein0.0e+0076.13Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV
        MGNEMGNNNTSEFREEEK KAEGPE  LLEG +N  + DEVAGF QKEAR +SG AD+VNG+HH+ EKEEEKNK CNI EFQVVSE L DRT EDN V+V
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV

Query:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC
        K KE+K++ FNSEENRSDGN  EHE+QASNQKEEE +RGSNLN  VLSL++P +E                      K++E+CKDALE S  NT HSADC
Subjt:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC

Query:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD
         + DSEENTNMD VIDTDR TN+ENDGEKG  S IDMIT+ASEDEKSEKTSD +IDQV+DNDRDTDKE+DGE+G+GS+ N  THA +DPKSE NS FD+D
Subjt:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD

Query:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT
        QVIDTD    KEN+EERR GSNF+   +S++P  EK SNL+T+QVV TD D DK NDE+RGN SNFD LI  SK+PNSE SSDL+SIQHE  ETNAESLT
Subjt:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT

Query:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN
         SSDDGDTDMEKKKGDLVEPR CHGY +P AK VDTKD+GT TDL CHN+SCSLAEE LVIESP SS+Q+P+VE+     K+E QLR EHLGTETV EDN
Subjt:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN

Query:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE
        IPTQSKI+N V+E+FNT  SHSENNAEE EVSPEFV   EN+NEAP EDCEDSDGEYLEISEQGM+ILN SIGDCKHKNEEMGE TE+STN+EH  ERRE
Subjt:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE

Query:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR
        PDESLFEP+LGFQPQ QQKE +I  QTAESTDE +SAP QE DT TEKSK+NPSDSPSY QTAS+T TET+PS NPIDEQSL TLPFSTFGGEDQDSPGR
Subjt:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR

Query:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ
        TSNESISENSIGHIEMRKSPSFNIDIQ EG+ GETEK PLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSR SFPGF KEKEES ME KAIDQ
Subjt:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ

Query:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NNF ++KK AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

A0A1S3AXG5 serine-aspartate repeat-containing protein F isoform X20.0e+0073.87Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV
        MGNEMGNNNTSEFREEEK KAEGPE  LLEGG+N  +ADEVAGF QKEAR +SG+AD+VNGDHH++E                         KEDN V+V
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV

Query:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC
        K KE+KEM  +SEENRSD       +QAS+QKEEE +RGSNLNTTVLSL++P++E                      K++E+CKDALE S  NT HSADC
Subjt:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC

Query:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD
         + DSEENTNM+QVIDTDR TN+ENDGEKGK S I+MIT+ASEDEKSE TSD ++DQVIDND DTDKE+DGE+G+GS+FN  THA +DPKSE+NS+FD+D
Subjt:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD

Query:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT
        QVIDTD+DA+KEN+EERR GSNF+   +S++P  EK SNL+++QVV TD D DKENDE RGN SNFD LI  SKDPNSE SS+L+  QHES ETNAESLT
Subjt:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT

Query:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN
         SSDDG TDMEKKKGDLVEPR CHGY  P AK VDTKD+GT TD+ICHN+S SLAEE LVIESP SS+QVP+VED     K E QLREE LGTETV EDN
Subjt:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN

Query:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE
        IPTQSKI+N VQE+FNT  SHSE NAEE EVSPEFVA  EN+NEAP EDCEDSDGEYLEISEQGM+ILN SIGDCKHKNEEMGE TE+ST++EH AERRE
Subjt:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE

Query:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR
        PDES FEP+LGFQPQ QQKE +I  QTAESTDE +SAP QE DT TEKSK+NPSDS SY QTAS+T TETKPS NPIDEQSL TLPFSTFGGEDQDSPGR
Subjt:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR

Query:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ
        TSNESISENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL NLQEISFPNPMEKRVVKLGRSDSEKSR SFPGF KEKEESGMEFKAIDQ
Subjt:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ

Query:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NNF  +KKVAK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

A0A1S3AXG7 serine-aspartate repeat-containing protein F isoform X31.3e-28476.09Show/hide
Query:  NSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADCVMADSEENTN
        +SEENRSD       +QAS+QKEEE +RGSNLNTTVLSL++P++E                      K++E+CKDALE S  NT HSADC + DSEENTN
Subjt:  NSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADCVMADSEENTN

Query:  MDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTDQVIDTDLDAY
        M+QVIDTDR TN+ENDGEKGK S I+MIT+ASEDEKSE TSD ++DQVIDND DTDKE+DGE+G+GS+FN  THA +DPKSE+NS+FD+DQVIDTD+DA+
Subjt:  MDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTDQVIDTDLDAY

Query:  KENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLTRSSDDGDTDM
        KEN+EERR GSNF+   +S++P  EK SNL+++QVV TD D DKENDE RGN SNFD LI  SKDPNSE SS+L+  QHES ETNAESLT SSDDG TDM
Subjt:  KENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLTRSSDDGDTDM

Query:  EKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDNIPTQSKITNG
        EKKKGDLVEPR CHGY  P AK VDTKD+GT TD+ICHN+S SLAEE LVIESP SS+QVP+VED     K E QLREE LGTETV EDNIPTQSKI+N 
Subjt:  EKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDNIPTQSKITNG

Query:  VQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERREPDESLFEPVL
        VQE+FNT  SHSE NAEE EVSPEFVA  EN+NEAP EDCEDSDGEYLEISEQGM+ILN SIGDCKHKNEEMGE TE+ST++EH AERREPDES FEP+L
Subjt:  VQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERREPDESLFEPVL

Query:  GFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGRTSNESISENS
        GFQPQ QQKE +I  QTAESTDE +SAP QE DT TEKSK+NPSDS SY QTAS+T TETKPS NPIDEQSL TLPFSTFGGEDQDSPGRTSNESISENS
Subjt:  GFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGRTSNESISENS

Query:  IGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQNNFAADKKVA
        IG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL NLQEISFPNPMEKRVVKLGRSDSEKSR SFPGF KEKEESGMEFKAIDQNNF  +KKVA
Subjt:  IGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQNNFAADKKVA

Query:  KDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        K+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  KDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

A0A1S4DSH5 serine-aspartate repeat-containing protein F isoform X10.0e+0075.89Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV
        MGNEMGNNNTSEFREEEK KAEGPE  LLEGG+N  +ADEVAGF QKEAR +SG+AD+VNGDHH++EKEEEKNK  +I EFQVVSE L D+T EDN V+V
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDV

Query:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC
        K KE+KEM  +SEENRSD       +QAS+QKEEE +RGSNLNTTVLSL++P++E                      K++E+CKDALE S  NT HSADC
Subjt:  KPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADC

Query:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD
         + DSEENTNM+QVIDTDR TN+ENDGEKGK S I+MIT+ASEDEKSE TSD ++DQVIDND DTDKE+DGE+G+GS+FN  THA +DPKSE+NS+FD+D
Subjt:  VMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTD

Query:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT
        QVIDTD+DA+KEN+EERR GSNF+   +S++P  EK SNL+++QVV TD D DKENDE RGN SNFD LI  SKDPNSE SS+L+  QHES ETNAESLT
Subjt:  QVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDPNSETSSDLKSIQHESLETNAESLT

Query:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN
         SSDDG TDMEKKKGDLVEPR CHGY  P AK VDTKD+GT TD+ICHN+S SLAEE LVIESP SS+QVP+VED     K E QLREE LGTETV EDN
Subjt:  RSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQLREEHLGTETVVEDN

Query:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE
        IPTQSKI+N VQE+FNT  SHSE NAEE EVSPEFVA  EN+NEAP EDCEDSDGEYLEISEQGM+ILN SIGDCKHKNEEMGE TE+ST++EH AERRE
Subjt:  IPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEH-AERRE

Query:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR
        PDES FEP+LGFQPQ QQKE +I  QTAESTDE +SAP QE DT TEKSK+NPSDS SY QTAS+T TETKPS NPIDEQSL TLPFSTFGGEDQDSPGR
Subjt:  PDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGR

Query:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ
        TSNESISENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL NLQEISFPNPMEKRVVKLGRSDSEKSR SFPGF KEKEESGMEFKAIDQ
Subjt:  TSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQ

Query:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NNF  +KKVAK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

A0A5D3D317 Serine-aspartate repeat-containing protein F isoform X10.0e+0075.66Show/hide
Query:  LIASLDLLATTDQAVKYHHHHNNGFVELNMGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEE
        +++SLD L+TT QAVKYHHHH+NGFVELNMGNEMGNNNTSEFREEEK KAEGPE  LLEGG+N  +ADEVAGF QKEAR +SG+AD+VNGDHH++EKEEE
Subjt:  LIASLDLLATTDQAVKYHHHHNNGFVELNMGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEVAGFHQKEARRNSGDADRVNGDHHITEKEEE

Query:  KNKACNIAEFQVVSEKLLDRTKEDNEVDVKPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAES
        KNK  +I EFQVVSE L D+T EDN V+VK KE+KEM  +SEENRSD       +QAS+QKEEE +RGSNLNTTVLSL++P++E                
Subjt:  KNKACNIAEFQVVSEKLLDRTKEDNEVDVKPKEDKEM-FNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQPEVEKIADFKSNQHELLAES

Query:  LIEDSSKSQEECKDALELSLNNTYHSADCVMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDG
              K++E+CKDALE S  NT HSADC + DSEENTNM+QVIDTDR TN+ENDGEKGK S I+MIT+ASEDEKSE TSD ++DQVIDND DTDKE+DG
Subjt:  LIEDSSKSQEECKDALELSLNNTYHSADCVMADSEENTNMDQVIDTDRGTNRENDGEKGK-SNIDMITNASEDEKSEKTSDFDIDQVIDNDRDTDKESDG

Query:  EKGKGSNFNMITHASRDPKSEMNSEFDTDQVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIH
        E+G+GS+FN  THA +DPKSE+NS+FD+DQVIDTD+DA+KEN+EERR GSNF+   +S++P  EK SNL+++QVV TD D DKENDE RGN SNFD LI 
Subjt:  EKGKGSNFNMITHASRDPKSEMNSEFDTDQVIDTDLDAYKENNEERRNGSNFD---TSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIH

Query:  TSKDPNSETSSDLKSIQHESLETNAESLTRSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVP
         SKDPNSE SS+L+  QHES ETNAESLT SSDDG TDMEKKKGDLVEPR CHGY  P AK VDTKD+GT TD+ICHN+S SLAEE LVIESP SS+QVP
Subjt:  TSKDPNSETSSDLKSIQHESLETNAESLTRSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVP

Query:  QVEDRCTVLKKESQLREEHLGTETVVEDNIPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSS
        +VED     K E QLREE LGTETV EDNIPTQSKI+N VQE+FNT  SHSE NAEE EVSPEFVA  EN+NEAP EDCEDSDGEYLEISEQGM+ILN S
Subjt:  QVEDRCTVLKKESQLREEHLGTETVVEDNIPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSS

Query:  IGDCKHKNEEMGEMTEISTNDEH-AERREPDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETK
        IGDCKHKNEEMGE TE+STN+EH A+RREPDES FEP+LGFQPQ QQKE +I  QTAESTDE +SAP QE DT TEKSK+NPSDS SY QTAS+T TETK
Subjt:  IGDCKHKNEEMGEMTEISTNDEH-AERREPDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETK

Query:  PSINPIDEQSLTTLPFSTFGGEDQDSPGRTSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGR
        P+ NPIDEQSL TLPFSTFGGEDQDSPGRTSNESISENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL NLQEISFPNPMEKRV+KLGR
Subjt:  PSINPIDEQSLTTLPFSTFGGEDQDSPGRTSNESISENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGR

Query:  SDSEKSRSSFPGFAKEKEESGMEFKAIDQNNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        SDSEKSR SFPGF KEKEESGMEFKAIDQNNF  +KKVAK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  SDSEKSRSSFPGFAKEKEESGMEFKAIDQNNFAADKKVAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

SwissProt top hitse value%identityAlignment
O23324 ATP-sulfurylase 3, chloroplastic2.1e-21580.56Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYSHLGAPIKVSIA-SSKSKKTHLR-LRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSE
        MASM+T F K P+ F S Q  + SH    +  SI+  S SK   LR + V AGLIEPDGG+LV+L+V EP R  KK EA  LPR+ L++ID+QW+HVLSE
Subjt:  MASMATRFTKTPSPFHSIQRTSYSHLGAPIKVSIA-SSKSKKTHLR-LRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSE

Query:  GWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQA
        GWASPL GFMRESEFLQTLHFN L L DGSV NMSVPIVLAIDD QK  IG+S  V+L D  +NPIAIL DIEIYKHPKEERIARTWGTTAPGLPYV++A
Subjt:  GWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQA

Query:  ITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWR
        ITNAG+WLIGGDLEV+EP+KY+DGLDRFR SP ELR+E  +R ADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGG+TKADDVPLSWR
Subjt:  ITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWR

Query:  MKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKT
        MKQHEKVLEDGVLDPETTVVSIFPSPM YAGPTEVQWHAKARINAGANFYIVGRDPAGMGHP EKRDLYDADHGKKVLSMA GLERLNILPFRVAAYDKT
Subjt:  MKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKT

Query:  QGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPE
        QGKMAFFDPSRAQDFLFISGTKMR LAKN+ENPP+GFMCPGGWKVLV+YYDSL    + KLPE
Subjt:  QGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPE

Q27128 Bifunctional 3'-phosphoadenosine 5'-phosphosulfate synthase8.5e-14062.02Show/hide
Query:  ELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKN
        EL V E      K E + LP +E++ +D QWV VLSEGWA+PLTGFMRE E+LQ+ HF  L   DG V N S+PIVL +  A K R+  S++ AL   + 
Subjt:  ELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKN

Query:  NPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALL
          IAIL+  E Y+H KEER +R +GT+  G PYV + I  +G+WL+GGDLEV+E I ++DGLD +R +P ELR +F   NADAVFAFQLRNPVHNGHALL
Subjt:  NPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALL

Query:  MTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPT
        MTDTRRRL E GYK P+LLLHPLGG+TK DDVPL+WRMKQH+ +L++ VLDP+ TV++IFPSPM YAGPTEVQWHAKAR++ GANFYIVGRDPAGM HP 
Subjt:  MTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPT

Query:  EKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYY
         K+DLY+A HG KVL+MA GL +L I+PFRVAAY+KT+  M F+DP R  +F+FISGTKMR +A+  E PP GFM P  WK++VEYY
Subjt:  EKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYY

Q43870 ATP sulfurylase 25.4e-18774.94Show/hide
Query:  RLRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQ
        ++ V + LI+PDGG LVEL+V E   G+KK E+ ++P+++L+ ID++WVHV+SEGWASPL GFMRE E+LQ+LHFNSLRL +G+  NMS+PIVLAIDD  
Subjt:  RLRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQ

Query:  KHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADA
        K +IG S +VAL   + + I  L+ +EIYKH KEERIARTWGTT+PGLPYV++ IT +GNWLIGGDLEV EPIKY+DGLD +R SP +LREEF  R ADA
Subjt:  KHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADA

Query:  VFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAG
        VFAFQLRNPVHNGHALLM DTR+RLLEMGYKNP+LLLHPLGG+TKADDVPL  RM+QH KVLEDGVLDP+TT+VSIFPSPMHYAGPTEVQWHAKARINAG
Subjt:  VFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAG

Query:  ANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVL
        ANFYIVGRDPAGMGHPTEKRDLYD DHGK+VLSMA GLE+LNILPFRVAAYD  + KMAFFDPSRA++FLFISGTKMRT A+  ENPP+GFMCP GW VL
Subjt:  ANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVL

Query:  VEYYDSL
        V+YY+SL
Subjt:  VEYYDSL

Q9LIK9 ATP sulfurylase 1, chloroplastic1.8e-22281.39Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYSHLGAPIKVSIASSKSKKTHLRL-RVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEG
        MASMA   +KTP     + ++S +   + +  +  S  SK    R+  + AGLI PDGG+LVEL+VEEP R  KK EA  LPR+EL++ID+QW+HVLSEG
Subjt:  MASMATRFTKTPSPFHSIQRTSYSHLGAPIKVSIASSKSKKTHLRL-RVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEG

Query:  WASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAI
        WASPL GFMRESEFLQTLHFNSLRL DGSV NMSVPIVLAIDD QK RIG+ST VALF+   NP+AIL DIEIYKHPKEERIARTWGTTAPGLPYVD+AI
Subjt:  WASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAI

Query:  TNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRM
        TNAGNWLIGGDLEV+EP+KY+DGLDRFR SPAELR+E  +RNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGG+TKADDVPL WRM
Subjt:  TNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRM

Query:  KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQ
        KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHP EKRDLYDADHGKKVLSMA GLERLNILPFRVAAYDKTQ
Subjt:  KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQ

Query:  GKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPE
        GKMAFFDPSR QDFLFISGTKMRTLAKN ENPP+GFMCPGGWKVLV+YY+SL PA +G+LPE
Subjt:  GKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPE

Q9S7D8 ATP sulfurylase 4, chloroplastic1.6e-20776.65Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYSHLG--APIKVSIASSKSKKTHLRLRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLP-RIELSSIDIQWVHVLS
        MAS A     + SPF S       H    AP  +S+ S   + +   L V +GLIEPDGG+L+ L+VEE  R + K EA ++P RI+L+ +D++WVHVLS
Subjt:  MASMATRFTKTPSPFHSIQRTSYSHLG--APIKVSIASSKSKKTHLRLRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLP-RIELSSIDIQWVHVLS

Query:  EGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQ
        EGWASPL GFMR+SEFLQTLHFNS RL DGSV NMSVPIVLAIDD QK RIGDS  V L D   NPIAIL DIEIYKHPKEERIARTWGTTA GLPY ++
Subjt:  EGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQ

Query:  AITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSW
        AIT AGNWLIGGDL+V+EPIKY+DGLDRFR SP++LREEF RR ADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNP+LLL+PLGG+TKADDVPLSW
Subjt:  AITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSW

Query:  RMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDK
        RM+QHEKVLEDGVLDPETTVVSIFPSPM YAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMA GLERLNILPF+VAAYDK
Subjt:  RMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDK

Query:  TQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPEPEAVLAS
        TQGKMAFFDPSR+QDFLFISGTKMR LAK KENPP+GFMCP GWKVLV+YYDSL  A +G     EAV ++
Subjt:  TQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPEPEAVLAS

Arabidopsis top hitse value%identityAlignment
AT1G19920.1 Pseudouridine synthase/archaeosine transglycosylase-like family protein3.9e-18874.94Show/hide
Query:  RLRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQ
        ++ V + LI+PDGG LVEL+V E   G+KK E+ ++P+++L+ ID++WVHV+SEGWASPL GFMRE E+LQ+LHFNSLRL +G+  NMS+PIVLAIDD  
Subjt:  RLRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQ

Query:  KHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADA
        K +IG S +VAL   + + I  L+ +EIYKH KEERIARTWGTT+PGLPYV++ IT +GNWLIGGDLEV EPIKY+DGLD +R SP +LREEF  R ADA
Subjt:  KHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADA

Query:  VFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAG
        VFAFQLRNPVHNGHALLM DTR+RLLEMGYKNP+LLLHPLGG+TKADDVPL  RM+QH KVLEDGVLDP+TT+VSIFPSPMHYAGPTEVQWHAKARINAG
Subjt:  VFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAG

Query:  ANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVL
        ANFYIVGRDPAGMGHPTEKRDLYD DHGK+VLSMA GLE+LNILPFRVAAYD  + KMAFFDPSRA++FLFISGTKMRT A+  ENPP+GFMCP GW VL
Subjt:  ANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVL

Query:  VEYYDSL
        V+YY+SL
Subjt:  VEYYDSL

AT3G22890.1 ATP sulfurylase 11.3e-22381.39Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYSHLGAPIKVSIASSKSKKTHLRL-RVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEG
        MASMA   +KTP     + ++S +   + +  +  S  SK    R+  + AGLI PDGG+LVEL+VEEP R  KK EA  LPR+EL++ID+QW+HVLSEG
Subjt:  MASMATRFTKTPSPFHSIQRTSYSHLGAPIKVSIASSKSKKTHLRL-RVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEG

Query:  WASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAI
        WASPL GFMRESEFLQTLHFNSLRL DGSV NMSVPIVLAIDD QK RIG+ST VALF+   NP+AIL DIEIYKHPKEERIARTWGTTAPGLPYVD+AI
Subjt:  WASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAI

Query:  TNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRM
        TNAGNWLIGGDLEV+EP+KY+DGLDRFR SPAELR+E  +RNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGG+TKADDVPL WRM
Subjt:  TNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRM

Query:  KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQ
        KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHP EKRDLYDADHGKKVLSMA GLERLNILPFRVAAYDKTQ
Subjt:  KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQ

Query:  GKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPE
        GKMAFFDPSR QDFLFISGTKMRTLAKN ENPP+GFMCPGGWKVLV+YY+SL PA +G+LPE
Subjt:  GKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPE

AT4G14680.1 Pseudouridine synthase/archaeosine transglycosylase-like family protein1.5e-21680.56Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYSHLGAPIKVSIA-SSKSKKTHLR-LRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSE
        MASM+T F K P+ F S Q  + SH    +  SI+  S SK   LR + V AGLIEPDGG+LV+L+V EP R  KK EA  LPR+ L++ID+QW+HVLSE
Subjt:  MASMATRFTKTPSPFHSIQRTSYSHLGAPIKVSIA-SSKSKKTHLR-LRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSE

Query:  GWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQA
        GWASPL GFMRESEFLQTLHFN L L DGSV NMSVPIVLAIDD QK  IG+S  V+L D  +NPIAIL DIEIYKHPKEERIARTWGTTAPGLPYV++A
Subjt:  GWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQA

Query:  ITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWR
        ITNAG+WLIGGDLEV+EP+KY+DGLDRFR SP ELR+E  +R ADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGG+TKADDVPLSWR
Subjt:  ITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWR

Query:  MKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKT
        MKQHEKVLEDGVLDPETTVVSIFPSPM YAGPTEVQWHAKARINAGANFYIVGRDPAGMGHP EKRDLYDADHGKKVLSMA GLERLNILPFRVAAYDKT
Subjt:  MKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKT

Query:  QGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPE
        QGKMAFFDPSRAQDFLFISGTKMR LAKN+ENPP+GFMCPGGWKVLV+YYDSL    + KLPE
Subjt:  QGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPE

AT5G43780.1 Pseudouridine synthase/archaeosine transglycosylase-like family protein1.2e-20876.65Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYSHLG--APIKVSIASSKSKKTHLRLRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLP-RIELSSIDIQWVHVLS
        MAS A     + SPF S       H    AP  +S+ S   + +   L V +GLIEPDGG+L+ L+VEE  R + K EA ++P RI+L+ +D++WVHVLS
Subjt:  MASMATRFTKTPSPFHSIQRTSYSHLG--APIKVSIASSKSKKTHLRLRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLP-RIELSSIDIQWVHVLS

Query:  EGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQ
        EGWASPL GFMR+SEFLQTLHFNS RL DGSV NMSVPIVLAIDD QK RIGDS  V L D   NPIAIL DIEIYKHPKEERIARTWGTTA GLPY ++
Subjt:  EGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQ

Query:  AITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSW
        AIT AGNWLIGGDL+V+EPIKY+DGLDRFR SP++LREEF RR ADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNP+LLL+PLGG+TKADDVPLSW
Subjt:  AITNAGNWLIGGDLEVIEPIKYHDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSW

Query:  RMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDK
        RM+QHEKVLEDGVLDPETTVVSIFPSPM YAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMA GLERLNILPF+VAAYDK
Subjt:  RMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDK

Query:  TQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPEPEAVLAS
        TQGKMAFFDPSR+QDFLFISGTKMR LAK KENPP+GFMCP GWKVLV+YYDSL  A +G     EAV ++
Subjt:  TQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGGWKVLVEYYDSLVPASSGKLPEPEAVLAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCATGGCCACTCGTTTCACCAAAACCCCATCCCCATTTCATTCTATTCAAAGAACTTCTTACTCCCATTTGGGCGCTCCGATTAAAGTCTCCATTGCTAGCTC
TAAGTCGAAGAAGACCCATCTCCGCCTCCGGGTTTCCGCCGGTTTGATCGAACCCGACGGCGGAAGGCTGGTGGAGCTTCTTGTTGAGGAGCCTCTGAGGGGTCTGAAGA
AGAAGGAAGCCTTATCCTTGCCGAGGATCGAGCTTTCTAGCATCGATATTCAATGGGTTCATGTGCTCAGCGAAGGATGGGCCAGTCCTCTTACCGGATTCATGAGAGAA
TCCGAGTTCCTCCAAACGCTTCATTTCAATTCCCTCAGGCTTCCGGATGGGTCTGTGGCGAACATGTCAGTGCCCATTGTTCTGGCTATTGACGATGCCCAAAAGCACCG
CATCGGGGATTCCACCAGTGTGGCTCTCTTTGACGACAAGAACAACCCCATCGCTATTCTCAAGGATATTGAGATCTACAAGCACCCTAAAGAAGAAAGGATAGCTAGAA
CCTGGGGCACCACTGCCCCGGGGCTCCCTTATGTTGATCAAGCTATAACCAATGCTGGTAATTGGCTGATAGGAGGTGACTTAGAGGTTATAGAGCCAATCAAGTACCAT
GATGGCCTTGACCGTTTTAGACAGTCGCCTGCAGAACTACGGGAGGAATTTACCCGGCGAAATGCAGATGCCGTGTTTGCATTTCAACTTCGAAACCCAGTGCATAACGG
GCATGCATTACTAATGACTGATACCCGCCGTCGACTGCTCGAGATGGGATACAAGAATCCCATTTTGTTGCTTCATCCTTTGGGTGGCTACACCAAGGCAGATGATGTTC
CACTTAGTTGGCGAATGAAGCAACATGAGAAGGTGCTCGAGGACGGTGTTCTTGATCCAGAGACAACAGTCGTCTCCATATTTCCATCTCCAATGCACTATGCAGGGCCA
ACTGAGGTGCAGTGGCATGCAAAGGCAAGAATTAATGCAGGAGCCAACTTTTACATTGTTGGACGCGACCCAGCTGGAATGGGTCATCCTACTGAGAAAAGGGATTTATA
TGATGCTGACCATGGGAAAAAAGTGTTGAGCATGGCACGTGGGCTAGAGCGGCTAAACATCCTTCCTTTCAGAGTTGCTGCTTACGATAAAACTCAGGGGAAGATGGCCT
TCTTTGACCCCTCTAGAGCTCAGGATTTCCTTTTCATATCCGGCACCAAGATGAGAACTCTAGCAAAGAATAAAGAGAATCCACCAGAAGGGTTTATGTGCCCAGGTGGA
TGGAAGGTGTTGGTTGAATATTACGATAGTTTGGTACCTGCAAGCAGTGGCAAACTTCCTGAACCTGAAGCTGTTCTGGCAAGCATTGCTCAATTTGCCTCTCTTTGTAA
GTTAGATGCCCTCATCGCATCTTTGGATTTGCTGGCTACAACAGATCAAGCTGTCAAATATCACCACCATCATAACAATGGATTTGTTGAGTTGAATATGGGAAATGAGA
TGGGAAACAATAACACATCTGAGTTTAGAGAGGAAGAAAAGGCAAAAGCTGAAGGTCCAGAAAAATCTTTACTAGAAGGCGGGGAAAATGGAGTCAAAGCAGATGAAGTT
GCAGGCTTCCACCAAAAAGAGGCAAGGCGGAATTCTGGTGATGCAGATAGAGTAAATGGAGATCATCATATCACAGAGAAAGAGGAAGAGAAAAACAAAGCATGTAATAT
AGCAGAATTTCAAGTGGTTTCTGAAAAATTACTTGACCGAACTAAAGAGGACAATGAAGTCGACGTCAAGCCTAAAGAAGATAAAGAAATGTTCAACTCAGAAGAGAATA
GGTCCGATGGAAATGAGCACGAGCATGAAAGGCAGGCTTCAAATCAGAAAGAGGAAGAAGGGGAGAGAGGTTCCAACTTGAACACAACAGTACTTTCATTAGATCAGCCG
GAAGTTGAAAAGATTGCTGATTTCAAGAGTAATCAGCACGAATTGCTAGCCGAATCATTGATAGAAGATAGCAGCAAATCCCAAGAAGAATGCAAGGATGCTTTGGAATT
GAGCTTGAACAACACTTACCATTCAGCAGATTGTGTCATGGCAGATTCAGAAGAAAACACCAATATGGATCAAGTTATTGATACTGACCGAGGTACTAATAGGGAAAATG
ATGGAGAGAAGGGAAAGTCCAACATCGACATGATAACAAATGCATCAGAAGATGAAAAGTCTGAAAAGACATCAGATTTTGATATTGATCAAGTCATTGATAATGATCGA
GATACAGATAAGGAAAGTGATGGAGAGAAAGGAAAAGGTTCAAACTTCAACATGATAACACATGCATCAAGAGATCCAAAATCTGAAATGAATTCAGAATTCGATACTGA
TCAAGTAATTGATACTGATCTAGACGCATATAAGGAAAATAATGAAGAGAGGAGAAACGGTTCCAACTTCGACACTTCAAAAGATCCAAATTTTGAAAAGATTTCAAATT
TGGAAACTGATCAAGTTGTTCATACTGATCGAGATACAGATAAGGAAAACGATGAAGAGAGGGGCAACGGCTCCAACTTCGACACATTAATACATACATCAAAAGATCCA
AATTCTGAAACTAGTTCAGATCTTAAGTCTATTCAGCATGAGTCACTTGAAACCAACGCTGAATCATTGACTCGATCAAGTGATGATGGAGACACAGATATGGAGAAGAA
AAAAGGTGATCTAGTAGAGCCAAGGCCATGCCATGGATATAAAATGCCAACTGCCAAAATTGTTGATACTAAAGATGAAGGAACTGCAACTGATTTGATATGTCATAATT
CTTCATGTTCACTAGCAGAAGAGAGTCTGGTGATAGAATCACCCACTTCATCTATGCAAGTTCCTCAGGTTGAAGACAGATGCACGGTTTTGAAAAAAGAATCCCAATTA
AGAGAAGAACATTTAGGAACTGAAACGGTCGTTGAAGACAATATTCCAACTCAATCTAAGATTACAAATGGGGTCCAAGAGAAATTTAACACGATTGGGTCACACTCGGA
GAACAATGCAGAAGAAACTGAAGTTTCTCCTGAATTTGTTGCCAAGAACGAGAACCAGAACGAGGCTCCTGAAGAGGATTGCGAGGATTCAGATGGAGAATATCTGGAAA
TTTCTGAACAAGGTATGAATATTTTGAATTCATCTATTGGAGATTGCAAGCATAAGAATGAGGAGATGGGAGAGATGACGGAAATTTCAACAAACGATGAACATGCAGAA
CGACGAGAACCAGATGAAAGCCTATTTGAACCTGTTTTAGGATTTCAACCTCAAATCCAGCAAAAGGAAAATTCAATCAAATCTCAAACTGCAGAAAGTACAGATGAATT
GATGTCAGCACCGGGGCAAGAAATCGATACAGCAACTGAAAAATCCAAGAACAATCCCTCGGATTCCCCAAGTTATATTCAAACAGCTTCAGCGACACTCACAGAAACTA
AACCATCGATAAACCCGATCGACGAACAGAGTTTAACAACACTTCCATTCTCCACATTCGGAGGAGAAGATCAAGATTCTCCAGGAAGAACAAGCAACGAATCAATTTCC
GAAAATTCAATTGGTCATATCGAGATGCGTAAATCGCCTAGCTTCAATATAGATATCCAAAGCGAAGGAAGGGCAGGAGAAACAGAGAAAACCCCATTGCTATACCAAAT
TAAGACAATCGAAGACTTACCAAATCTGCAGGAGATTAGCTTTCCAAATCCAATGGAGAAACGAGTGGTGAAGTTGGGAAGAAGCGATTCAGAGAAATCGAGATCGTCTT
TCCCAGGGTTCGCGAAAGAAAAAGAAGAATCCGGGATGGAATTCAAAGCAATCGATCAAAATAACTTCGCCGCCGATAAGAAGGTAGCAAAAGACTTACCACCGCCATCG
CCGATTCGTAAAGGGAAGCGCAGAACGAAATCCCTAATTTTTGGGACTTGCATCTGCTGTGCAACAGCGATCAATTGA
mRNA sequenceShow/hide mRNA sequence
AAAAACTAAAAAGCAAAGAAAGAGGACAATGACGACAGGGGAAAAGGGTGTCCAGGTTCATCCTCCCAACCAAAGGAGAAAAAAGTTATTCAATACTCTCTCAAACACAA
GGAATCTTCATTCCACCTTCAATTTTTTTTTATCTCTTCTAAGTTGCATCATTATAAATTTATATCCCATCCCACTCTCTCAAAAAAGTTGCAAAATCCCCCTCCCTGAT
TCCCCTTTTTGTTCCTTTTTTCTCTCCCACTTTTCACCTCTCGGATTAATCCTCTTCCCTAATTTCAGCTTAATTTTCCCTTAATCTCGCTTTCTAGCCTTGGTCTCCAT
CTCAGATGGCGTCCATGGCCACTCGTTTCACCAAAACCCCATCCCCATTTCATTCTATTCAAAGAACTTCTTACTCCCATTTGGGCGCTCCGATTAAAGTCTCCATTGCT
AGCTCTAAGTCGAAGAAGACCCATCTCCGCCTCCGGGTTTCCGCCGGTTTGATCGAACCCGACGGCGGAAGGCTGGTGGAGCTTCTTGTTGAGGAGCCTCTGAGGGGTCT
GAAGAAGAAGGAAGCCTTATCCTTGCCGAGGATCGAGCTTTCTAGCATCGATATTCAATGGGTTCATGTGCTCAGCGAAGGATGGGCCAGTCCTCTTACCGGATTCATGA
GAGAATCCGAGTTCCTCCAAACGCTTCATTTCAATTCCCTCAGGCTTCCGGATGGGTCTGTGGCGAACATGTCAGTGCCCATTGTTCTGGCTATTGACGATGCCCAAAAG
CACCGCATCGGGGATTCCACCAGTGTGGCTCTCTTTGACGACAAGAACAACCCCATCGCTATTCTCAAGGATATTGAGATCTACAAGCACCCTAAAGAAGAAAGGATAGC
TAGAACCTGGGGCACCACTGCCCCGGGGCTCCCTTATGTTGATCAAGCTATAACCAATGCTGGTAATTGGCTGATAGGAGGTGACTTAGAGGTTATAGAGCCAATCAAGT
ACCATGATGGCCTTGACCGTTTTAGACAGTCGCCTGCAGAACTACGGGAGGAATTTACCCGGCGAAATGCAGATGCCGTGTTTGCATTTCAACTTCGAAACCCAGTGCAT
AACGGGCATGCATTACTAATGACTGATACCCGCCGTCGACTGCTCGAGATGGGATACAAGAATCCCATTTTGTTGCTTCATCCTTTGGGTGGCTACACCAAGGCAGATGA
TGTTCCACTTAGTTGGCGAATGAAGCAACATGAGAAGGTGCTCGAGGACGGTGTTCTTGATCCAGAGACAACAGTCGTCTCCATATTTCCATCTCCAATGCACTATGCAG
GGCCAACTGAGGTGCAGTGGCATGCAAAGGCAAGAATTAATGCAGGAGCCAACTTTTACATTGTTGGACGCGACCCAGCTGGAATGGGTCATCCTACTGAGAAAAGGGAT
TTATATGATGCTGACCATGGGAAAAAAGTGTTGAGCATGGCACGTGGGCTAGAGCGGCTAAACATCCTTCCTTTCAGAGTTGCTGCTTACGATAAAACTCAGGGGAAGAT
GGCCTTCTTTGACCCCTCTAGAGCTCAGGATTTCCTTTTCATATCCGGCACCAAGATGAGAACTCTAGCAAAGAATAAAGAGAATCCACCAGAAGGGTTTATGTGCCCAG
GTGGATGGAAGGTGTTGGTTGAATATTACGATAGTTTGGTACCTGCAAGCAGTGGCAAACTTCCTGAACCTGAAGCTGTTCTGGCAAGCATTGCTCAATTTGCCTCTCTT
TGTAAGTTAGATGCCCTCATCGCATCTTTGGATTTGCTGGCTACAACAGATCAAGCTGTCAAATATCACCACCATCATAACAATGGATTTGTTGAGTTGAATATGGGAAA
TGAGATGGGAAACAATAACACATCTGAGTTTAGAGAGGAAGAAAAGGCAAAAGCTGAAGGTCCAGAAAAATCTTTACTAGAAGGCGGGGAAAATGGAGTCAAAGCAGATG
AAGTTGCAGGCTTCCACCAAAAAGAGGCAAGGCGGAATTCTGGTGATGCAGATAGAGTAAATGGAGATCATCATATCACAGAGAAAGAGGAAGAGAAAAACAAAGCATGT
AATATAGCAGAATTTCAAGTGGTTTCTGAAAAATTACTTGACCGAACTAAAGAGGACAATGAAGTCGACGTCAAGCCTAAAGAAGATAAAGAAATGTTCAACTCAGAAGA
GAATAGGTCCGATGGAAATGAGCACGAGCATGAAAGGCAGGCTTCAAATCAGAAAGAGGAAGAAGGGGAGAGAGGTTCCAACTTGAACACAACAGTACTTTCATTAGATC
AGCCGGAAGTTGAAAAGATTGCTGATTTCAAGAGTAATCAGCACGAATTGCTAGCCGAATCATTGATAGAAGATAGCAGCAAATCCCAAGAAGAATGCAAGGATGCTTTG
GAATTGAGCTTGAACAACACTTACCATTCAGCAGATTGTGTCATGGCAGATTCAGAAGAAAACACCAATATGGATCAAGTTATTGATACTGACCGAGGTACTAATAGGGA
AAATGATGGAGAGAAGGGAAAGTCCAACATCGACATGATAACAAATGCATCAGAAGATGAAAAGTCTGAAAAGACATCAGATTTTGATATTGATCAAGTCATTGATAATG
ATCGAGATACAGATAAGGAAAGTGATGGAGAGAAAGGAAAAGGTTCAAACTTCAACATGATAACACATGCATCAAGAGATCCAAAATCTGAAATGAATTCAGAATTCGAT
ACTGATCAAGTAATTGATACTGATCTAGACGCATATAAGGAAAATAATGAAGAGAGGAGAAACGGTTCCAACTTCGACACTTCAAAAGATCCAAATTTTGAAAAGATTTC
AAATTTGGAAACTGATCAAGTTGTTCATACTGATCGAGATACAGATAAGGAAAACGATGAAGAGAGGGGCAACGGCTCCAACTTCGACACATTAATACATACATCAAAAG
ATCCAAATTCTGAAACTAGTTCAGATCTTAAGTCTATTCAGCATGAGTCACTTGAAACCAACGCTGAATCATTGACTCGATCAAGTGATGATGGAGACACAGATATGGAG
AAGAAAAAAGGTGATCTAGTAGAGCCAAGGCCATGCCATGGATATAAAATGCCAACTGCCAAAATTGTTGATACTAAAGATGAAGGAACTGCAACTGATTTGATATGTCA
TAATTCTTCATGTTCACTAGCAGAAGAGAGTCTGGTGATAGAATCACCCACTTCATCTATGCAAGTTCCTCAGGTTGAAGACAGATGCACGGTTTTGAAAAAAGAATCCC
AATTAAGAGAAGAACATTTAGGAACTGAAACGGTCGTTGAAGACAATATTCCAACTCAATCTAAGATTACAAATGGGGTCCAAGAGAAATTTAACACGATTGGGTCACAC
TCGGAGAACAATGCAGAAGAAACTGAAGTTTCTCCTGAATTTGTTGCCAAGAACGAGAACCAGAACGAGGCTCCTGAAGAGGATTGCGAGGATTCAGATGGAGAATATCT
GGAAATTTCTGAACAAGGTATGAATATTTTGAATTCATCTATTGGAGATTGCAAGCATAAGAATGAGGAGATGGGAGAGATGACGGAAATTTCAACAAACGATGAACATG
CAGAACGACGAGAACCAGATGAAAGCCTATTTGAACCTGTTTTAGGATTTCAACCTCAAATCCAGCAAAAGGAAAATTCAATCAAATCTCAAACTGCAGAAAGTACAGAT
GAATTGATGTCAGCACCGGGGCAAGAAATCGATACAGCAACTGAAAAATCCAAGAACAATCCCTCGGATTCCCCAAGTTATATTCAAACAGCTTCAGCGACACTCACAGA
AACTAAACCATCGATAAACCCGATCGACGAACAGAGTTTAACAACACTTCCATTCTCCACATTCGGAGGAGAAGATCAAGATTCTCCAGGAAGAACAAGCAACGAATCAA
TTTCCGAAAATTCAATTGGTCATATCGAGATGCGTAAATCGCCTAGCTTCAATATAGATATCCAAAGCGAAGGAAGGGCAGGAGAAACAGAGAAAACCCCATTGCTATAC
CAAATTAAGACAATCGAAGACTTACCAAATCTGCAGGAGATTAGCTTTCCAAATCCAATGGAGAAACGAGTGGTGAAGTTGGGAAGAAGCGATTCAGAGAAATCGAGATC
GTCTTTCCCAGGGTTCGCGAAAGAAAAAGAAGAATCCGGGATGGAATTCAAAGCAATCGATCAAAATAACTTCGCCGCCGATAAGAAGGTAGCAAAAGACTTACCACCGC
CATCGCCGATTCGTAAAGGGAAGCGCAGAACGAAATCCCTAATTTTTGGGACTTGCATCTGCTGTGCAACAGCGATCAATTGAATGGATTTTGTCGAGATCATTCACATC
CAGCATTGGAGTTTCAGAGTGGTTTTCTTTCAATGATTTTTCTTCTTTTTTCATATTTCAGCCACGTTTCAAGAGCGAAGAACGCCATGATCGGCTTGAAGTTGAACTCG
CTGTTCGCCTTCGAAAAGCGTTGCACTTGCTGTTCTTATGAATTTATTTTGTTCCCGCCAAAATACATGTGATCCTATGATTTAAATTTAACCCCTTATGTGTTTAATGA
TTTGTTTGTCAGTGGGATTTGTAATTTATTTGATATATAAATTTTAAATCTGTCTCTAAAGGTTTTTTTTTTAATAAAATATATATAATCTGCCA
Protein sequenceShow/hide protein sequence
MASMATRFTKTPSPFHSIQRTSYSHLGAPIKVSIASSKSKKTHLRLRVSAGLIEPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRE
SEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHRIGDSTSVALFDDKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYH
DGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGP
TEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMARGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPGG
WKVLVEYYDSLVPASSGKLPEPEAVLASIAQFASLCKLDALIASLDLLATTDQAVKYHHHHNNGFVELNMGNEMGNNNTSEFREEEKAKAEGPEKSLLEGGENGVKADEV
AGFHQKEARRNSGDADRVNGDHHITEKEEEKNKACNIAEFQVVSEKLLDRTKEDNEVDVKPKEDKEMFNSEENRSDGNEHEHERQASNQKEEEGERGSNLNTTVLSLDQP
EVEKIADFKSNQHELLAESLIEDSSKSQEECKDALELSLNNTYHSADCVMADSEENTNMDQVIDTDRGTNRENDGEKGKSNIDMITNASEDEKSEKTSDFDIDQVIDNDR
DTDKESDGEKGKGSNFNMITHASRDPKSEMNSEFDTDQVIDTDLDAYKENNEERRNGSNFDTSKDPNFEKISNLETDQVVHTDRDTDKENDEERGNGSNFDTLIHTSKDP
NSETSSDLKSIQHESLETNAESLTRSSDDGDTDMEKKKGDLVEPRPCHGYKMPTAKIVDTKDEGTATDLICHNSSCSLAEESLVIESPTSSMQVPQVEDRCTVLKKESQL
REEHLGTETVVEDNIPTQSKITNGVQEKFNTIGSHSENNAEETEVSPEFVAKNENQNEAPEEDCEDSDGEYLEISEQGMNILNSSIGDCKHKNEEMGEMTEISTNDEHAE
RREPDESLFEPVLGFQPQIQQKENSIKSQTAESTDELMSAPGQEIDTATEKSKNNPSDSPSYIQTASATLTETKPSINPIDEQSLTTLPFSTFGGEDQDSPGRTSNESIS
ENSIGHIEMRKSPSFNIDIQSEGRAGETEKTPLLYQIKTIEDLPNLQEISFPNPMEKRVVKLGRSDSEKSRSSFPGFAKEKEESGMEFKAIDQNNFAADKKVAKDLPPPS
PIRKGKRRTKSLIFGTCICCATAIN