; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G004780 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G004780
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionSulfate adenylyltransferase
Genome locationCG_Chr05:4592372..4600803
RNA-Seq ExpressionClCG05G004780
SyntenyClCG05G004780
Gene Ontology termsGO:0000103 - sulfate assimilation (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0004020 - adenylylsulfate kinase activity (molecular function)
GO:0004781 - sulfate adenylyltransferase (ATP) activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR002650 - Sulphate adenylyltransferase
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold
IPR015947 - PUA-like superfamily
IPR024951 - Sulphate adenylyltransferase catalytic domain
IPR025980 - ATP-sulfurylase PUA-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049322.1 serine-aspartate repeat-containing protein F isoform X1 [Cucumis melo var. makuwa]0.0e+0073.66Show/hide
Query:  SPCFAIAILSTTDQAVTYHHHHHNGKKVLRFVELNMGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHV
        +P  +   LSTT QAV YHHHHHNG     FVELNMGNEMGNNNTSEFREEEK KAEGPE  L+EGG+N+ EADEVAGF QKE +PSSG+ D+ NGDHHV
Subjt:  SPCFAIAILSTTDQAVTYHHHHHNGKKVLRFVELNMGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHV

Query:  TEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEVNSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQH
        +EKEEEK+K  +  EFQVVSE L D+T EDN VEV SKE+KEME +SEENRSD       +QAS+QK+EE +RGSNLNTTVLSL+ P+L+KT        
Subjt:  TEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEVNSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQH

Query:  ELLAESLIEDSNKSPDECKDALELSLNNTSHSADFVIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------
                       ++CKDALE S  NT HSAD  + DSEENTNM+QVIDTDRDTN+EN GEKGKGS I+MI HASEDEKSE TS+ ++DQVI      
Subjt:  ELLAESLIEDSNKSPDECKDALELSLNNTSHSADFVIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------

Query:  DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTDQVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSN
        DKENDGE+G+GS+ NT  HA KDPKSE+NS+FD+DQVIDTD+DA+KENDEERRKGSNF   N+SE+P  EK SNL+++QVV T    DKENDE RGN SN
Subjt:  DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTDQVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSN

Query:  FNTLIHTSKHPNSETSSDLKSIQHESPETNAESLTGSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVN
        F+ LI  SK PNSE SS+L+  QHESPETNAESLTGSSDDG TD EKKKGDLVEPR  HGYT PA KNVDTKD+GT T + CH +S SLAEE L IES N
Subjt:  FNTLIHTSKHPNSETSSDLKSIQHESPETNAESLTGSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVN

Query:  SSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDNIPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGM
        SS+QVPEVED     K E +LR E LGTET  EDNIPTQ KISN VQEEFNT  SH E NAEE EVSP+FVA+NK  NEAP EDCEDSDGEYLEISEQGM
Subjt:  SSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDNIPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGM

Query:  DILNLSIGDCKHNNEEMGETTEISTNNEHEVERREPDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASA
        DILNLSIGDCKH NEEMGETTE+STNNEHE +RREPDES FEP+LGFQPQ Q+KET+I FQ+AESTDESIS PRQE DT TEKSK NPSDS SY +TAS+
Subjt:  DILNLSIGDCKHNNEEMGETTEISTNNEHEVERREPDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASA

Query:  TLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRTSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKR
        T T+T P+TNPIDEQSL TLPFSTFGGEDQDSPGRTSNES+SENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL N+QEISFPNPMEKR
Subjt:  TLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRTSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKR

Query:  VVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQNNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        V+KLGRSDSEKSRPSFPGF KE EES MEFKAIDQNNF  EKK AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  VVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQNNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

XP_008438604.1 PREDICTED: serine-aspartate repeat-containing protein F isoform X1 [Cucumis melo]0.0e+0074.47Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV
        MGNEMGNNNTSEFREEEK KAEGPE  L+EGG+N+ EADEVAGF QKE +PSSG+ D+ NGDHHV+EKEEEK+K  +  EFQVVSE L D+T EDN VEV
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV

Query:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF
         SKE+KEME +SEENRSD       +QAS+QK+EE +RGSNLNTTVLSL+ P+L+KT                       ++CKDALE S  NT HSAD 
Subjt:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF

Query:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD
         + DSEENTNM+QVIDTDRDTN+EN GEKGKGS I+MI HASEDEKSE TS+ ++DQVI      DKENDGE+G+GS+ NT  HA KDPKSE+NS+FD+D
Subjt:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD

Query:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT
        QVIDTD+DA+KENDEERRKGSNF   N+SE+P  EK SNL+++QVV T    DKENDE RGN SNF+ LI  SK PNSE SS+L+  QHESPETNAESLT
Subjt:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT

Query:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDN
        GSSDDG TD EKKKGDLVEPR  HGYT PA KNVDTKD+GT T + CH +S SLAEE L IES NSS+QVPEVED     K E +LR E LGTET  EDN
Subjt:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDN

Query:  IPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERRE
        IPTQ KISN VQEEFNT  SH E NAEE EVSP+FVA+NK  NEAP EDCEDSDGEYLEISEQGMDILNLSIGDCKH NEEMGETTE+ST+NEHE ERRE
Subjt:  IPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERRE

Query:  PDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGR
        PDES FEP+LGFQPQ Q+KET+I FQ+AESTDESIS PRQE DT TEKSK NPSDS SY +TAS+T T+T PSTNPIDEQSL TLPFSTFGGEDQDSPGR
Subjt:  PDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGR

Query:  TSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQ
        TSNES+SENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL N+QEISFPNPMEKRVVKLGRSDSEKSRPSFPGF KE EES MEFKAIDQ
Subjt:  TSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQ

Query:  NNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NNF  EKK AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

XP_008438605.1 PREDICTED: serine-aspartate repeat-containing protein F isoform X2 [Cucumis melo]1.0e-30972.68Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV
        MGNEMGNNNTSEFREEEK KAEGPE  L+EGG+N+ EADEVAGF QKE +PSSG+ D+ NGDHHV+E                         KEDN VEV
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV

Query:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF
         SKE+KEME +SEENRSD       +QAS+QK+EE +RGSNLNTTVLSL+ P+L+KT                       ++CKDALE S  NT HSAD 
Subjt:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF

Query:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD
         + DSEENTNM+QVIDTDRDTN+EN GEKGKGS I+MI HASEDEKSE TS+ ++DQVI      DKENDGE+G+GS+ NT  HA KDPKSE+NS+FD+D
Subjt:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD

Query:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT
        QVIDTD+DA+KENDEERRKGSNF   N+SE+P  EK SNL+++QVV T    DKENDE RGN SNF+ LI  SK PNSE SS+L+  QHESPETNAESLT
Subjt:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT

Query:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDN
        GSSDDG TD EKKKGDLVEPR  HGYT PA KNVDTKD+GT T + CH +S SLAEE L IES NSS+QVPEVED     K E +LR E LGTET  EDN
Subjt:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDN

Query:  IPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERRE
        IPTQ KISN VQEEFNT  SH E NAEE EVSP+FVA+NK  NEAP EDCEDSDGEYLEISEQGMDILNLSIGDCKH NEEMGETTE+ST+NEHE ERRE
Subjt:  IPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERRE

Query:  PDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGR
        PDES FEP+LGFQPQ Q+KET+I FQ+AESTDESIS PRQE DT TEKSK NPSDS SY +TAS+T T+T PSTNPIDEQSL TLPFSTFGGEDQDSPGR
Subjt:  PDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGR

Query:  TSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQ
        TSNES+SENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL N+QEISFPNPMEKRVVKLGRSDSEKSRPSFPGF KE EES MEFKAIDQ
Subjt:  TSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQ

Query:  NNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NNF  EKK AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

XP_011650954.1 myb-like protein X isoform X1 [Cucumis sativus]0.0e+0075.98Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV
        MGNEMGNNNTSEFREEEK KAEGPE  L+EG +N+ E DEVAGF QKE + SSG  D+ NG+HHV EKEEEK+K CN  EFQVVSE L DRT EDN VEV
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV

Query:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF
         SKE+K++EFNSEENRSDGN  EHE+QASNQK+EE +RGSNLN  VLSL+ P L+KT                       ++CKDALE S  NT HSAD 
Subjt:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF

Query:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD
         + DSEENTNMD VIDTDRDTN+EN GEKG  S IDMI HASEDEKSEKTS+ +IDQV+      DKENDGE+G+GS+ NT  HA KDPKSE NS FD+D
Subjt:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD

Query:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT
        QVIDTD    KENDEERRKGSNF  IN+SE+P  EK SNL+T+QVV T    DK NDE+RGN SNF+ LI  SK+PNSE SSDL+SIQHE PETNAESLT
Subjt:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT

Query:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELREHLGTETFFEDNI
        GSSDDGDTD EKKKGDLVEPR  HGYT+PA KNVDTKD+GT T LTCH +SCSLAEE L IES NSS+Q+PEVE+     KEE +LREHLGTET  EDNI
Subjt:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELREHLGTETFFEDNI

Query:  PTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERREP
        PTQ KISN V+EEFNT  SH ENNAEE EVSP+FV +N+  NEAP EDCEDSDGEYLEISEQGMDILNLSIGDCKH NEEMGETTE+STNNEHEVERREP
Subjt:  PTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERREP

Query:  DESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRT
        DESLFEP+LGFQPQ Q+KET+IAFQ+AESTDESIS PRQE DT TEKSK NPSDSPSY +TAS+T T+T PSTNPIDEQSL TLPFSTFGGEDQDSPGRT
Subjt:  DESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRT

Query:  SNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQN
        SNES+SENSIGHIEMRKSPSFNIDIQ EG+ GETEK PLLYQIKTIEDLPN+QEISFPNPMEKRVVKLGRSDSEKSRPSFPGF KE EESRME KAIDQN
Subjt:  SNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQN

Query:  NFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NF +EKK AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

XP_038881451.1 uncharacterized protein LOC120072973 [Benincasa hispida]4.5e-29570.47Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV
        MGNEMGNNNTSEFREEEKAK+EGPEKSL EGG N+VEADEVAG  QKE +PSSG+ DR NGD HVTEK EEK+KACN  EFQ+V ++LPDRTKEDNEV++
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV

Query:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF
          KE KEMEF+SEENRS+GNEHEHE+QASNQK+E+GERGSNLNT VLSL  P L+K TDFK NQHELLAESLIEDSNKSP+ECKDALELSLN   HSAD 
Subjt:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF

Query:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVIDKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTDQVIDTD
         +ADSEENTNMDQVIDTDRDT+REN GEK KGSN DMI +AS+D                                                        
Subjt:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVIDKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTDQVIDTD

Query:  IDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHTDKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLTGSSDDGDTDT
                                                                         PNSE S DLKS+QHES ETNA+SLTGSSDDGDTD 
Subjt:  IDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHTDKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLTGSSDDGDTDT

Query:  EKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDNIPTQPKISNG
        EKKKGDLVEPRP HGYT+PA KNVDTKDEGTA  L CH SSCSLAEESL IES  S MQVPEVE+RCTVLKEE +LR EHLGTET  +DNIPTQ KISNG
Subjt:  EKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDNIPTQPKISNG

Query:  VQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERREPD-ESLFEPV
        VQ EFNTIGSH EN+AE+TEV P+FVAK+KN+ EAPGEDCEDSDGEYLEISEQG DI NLSIGD KH +E+ GETTE+STNN HE ERREPD ESLFEPV
Subjt:  VQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERREPD-ESLFEPV

Query:  LGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRTSNESVSEN
        LGFQ QIQ+KETSI FQ+AEST +SIS PRQE  T TEKSK NPSDSPSYI+TASATLT+T PSTNP+DEQS T LPFSTFGGEDQ+SPGRTSNES SEN
Subjt:  LGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRTSNESVSEN

Query:  SIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQNNFAAEKKA
        SIGHIEMRKSPSFNIDIQ EGR GETEKTPLLYQIKTIEDLPN+QEISFPNPMEKRVVKLGRSDSE SRPSFPGFAKE EE  MEFKAIDQNNFA + KA
Subjt:  SIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQNNFAAEKKA

Query:  AKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        AKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  AKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

TrEMBL top hitse value%identityAlignment
A0A0A0L8I8 Uncharacterized protein0.0e+0075.98Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV
        MGNEMGNNNTSEFREEEK KAEGPE  L+EG +N+ E DEVAGF QKE + SSG  D+ NG+HHV EKEEEK+K CN  EFQVVSE L DRT EDN VEV
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV

Query:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF
         SKE+K++EFNSEENRSDGN  EHE+QASNQK+EE +RGSNLN  VLSL+ P L+KT                       ++CKDALE S  NT HSAD 
Subjt:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF

Query:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD
         + DSEENTNMD VIDTDRDTN+EN GEKG  S IDMI HASEDEKSEKTS+ +IDQV+      DKENDGE+G+GS+ NT  HA KDPKSE NS FD+D
Subjt:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD

Query:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT
        QVIDTD    KENDEERRKGSNF  IN+SE+P  EK SNL+T+QVV T    DK NDE+RGN SNF+ LI  SK+PNSE SSDL+SIQHE PETNAESLT
Subjt:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT

Query:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELREHLGTETFFEDNI
        GSSDDGDTD EKKKGDLVEPR  HGYT+PA KNVDTKD+GT T LTCH +SCSLAEE L IES NSS+Q+PEVE+     KEE +LREHLGTET  EDNI
Subjt:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELREHLGTETFFEDNI

Query:  PTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERREP
        PTQ KISN V+EEFNT  SH ENNAEE EVSP+FV +N+  NEAP EDCEDSDGEYLEISEQGMDILNLSIGDCKH NEEMGETTE+STNNEHEVERREP
Subjt:  PTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERREP

Query:  DESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRT
        DESLFEP+LGFQPQ Q+KET+IAFQ+AESTDESIS PRQE DT TEKSK NPSDSPSY +TAS+T T+T PSTNPIDEQSL TLPFSTFGGEDQDSPGRT
Subjt:  DESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRT

Query:  SNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQN
        SNES+SENSIGHIEMRKSPSFNIDIQ EG+ GETEK PLLYQIKTIEDLPN+QEISFPNPMEKRVVKLGRSDSEKSRPSFPGF KE EESRME KAIDQN
Subjt:  SNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQN

Query:  NFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NF +EKK AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

A0A1S3AXG5 serine-aspartate repeat-containing protein F isoform X25.0e-31072.68Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV
        MGNEMGNNNTSEFREEEK KAEGPE  L+EGG+N+ EADEVAGF QKE +PSSG+ D+ NGDHHV+E                         KEDN VEV
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV

Query:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF
         SKE+KEME +SEENRSD       +QAS+QK+EE +RGSNLNTTVLSL+ P+L+KT                       ++CKDALE S  NT HSAD 
Subjt:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF

Query:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD
         + DSEENTNM+QVIDTDRDTN+EN GEKGKGS I+MI HASEDEKSE TS+ ++DQVI      DKENDGE+G+GS+ NT  HA KDPKSE+NS+FD+D
Subjt:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD

Query:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT
        QVIDTD+DA+KENDEERRKGSNF   N+SE+P  EK SNL+++QVV T    DKENDE RGN SNF+ LI  SK PNSE SS+L+  QHESPETNAESLT
Subjt:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT

Query:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDN
        GSSDDG TD EKKKGDLVEPR  HGYT PA KNVDTKD+GT T + CH +S SLAEE L IES NSS+QVPEVED     K E +LR E LGTET  EDN
Subjt:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDN

Query:  IPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERRE
        IPTQ KISN VQEEFNT  SH E NAEE EVSP+FVA+NK  NEAP EDCEDSDGEYLEISEQGMDILNLSIGDCKH NEEMGETTE+ST+NEHE ERRE
Subjt:  IPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERRE

Query:  PDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGR
        PDES FEP+LGFQPQ Q+KET+I FQ+AESTDESIS PRQE DT TEKSK NPSDS SY +TAS+T T+T PSTNPIDEQSL TLPFSTFGGEDQDSPGR
Subjt:  PDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGR

Query:  TSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQ
        TSNES+SENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL N+QEISFPNPMEKRVVKLGRSDSEKSRPSFPGF KE EES MEFKAIDQ
Subjt:  TSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQ

Query:  NNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NNF  EKK AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

A0A1S3AXG7 serine-aspartate repeat-containing protein F isoform X34.9e-27974.56Show/hide
Query:  MEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADFVIADSEE
        ME +SEENRSD       +QAS+QK+EE +RGSNLNTTVLSL+ P+L+KT                       ++CKDALE S  NT HSAD  + DSEE
Subjt:  MEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADFVIADSEE

Query:  NTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTDQVIDTDI
        NTNM+QVIDTDRDTN+EN GEKGKGS I+MI HASEDEKSE TS+ ++DQVI      DKENDGE+G+GS+ NT  HA KDPKSE+NS+FD+DQVIDTD+
Subjt:  NTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTDQVIDTDI

Query:  DAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLTGSSDDGD
        DA+KENDEERRKGSNF   N+SE+P  EK SNL+++QVV T    DKENDE RGN SNF+ LI  SK PNSE SS+L+  QHESPETNAESLTGSSDDG 
Subjt:  DAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLTGSSDDGD

Query:  TDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDNIPTQPKI
        TD EKKKGDLVEPR  HGYT PA KNVDTKD+GT T + CH +S SLAEE L IES NSS+QVPEVED     K E +LR E LGTET  EDNIPTQ KI
Subjt:  TDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDNIPTQPKI

Query:  SNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERREPDESLFE
        SN VQEEFNT  SH E NAEE EVSP+FVA+NK  NEAP EDCEDSDGEYLEISEQGMDILNLSIGDCKH NEEMGETTE+ST+NEHE ERREPDES FE
Subjt:  SNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERREPDESLFE

Query:  PVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRTSNESVS
        P+LGFQPQ Q+KET+I FQ+AESTDESIS PRQE DT TEKSK NPSDS SY +TAS+T T+T PSTNPIDEQSL TLPFSTFGGEDQDSPGRTSNES+S
Subjt:  PVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRTSNESVS

Query:  ENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQNNFAAEK
        ENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL N+QEISFPNPMEKRVVKLGRSDSEKSRPSFPGF KE EES MEFKAIDQNNF  EK
Subjt:  ENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQNNFAAEK

Query:  KAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        K AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  KAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

A0A1S4DSH5 serine-aspartate repeat-containing protein F isoform X10.0e+0074.47Show/hide
Query:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV
        MGNEMGNNNTSEFREEEK KAEGPE  L+EGG+N+ EADEVAGF QKE +PSSG+ D+ NGDHHV+EKEEEK+K  +  EFQVVSE L D+T EDN VEV
Subjt:  MGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEV

Query:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF
         SKE+KEME +SEENRSD       +QAS+QK+EE +RGSNLNTTVLSL+ P+L+KT                       ++CKDALE S  NT HSAD 
Subjt:  NSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADF

Query:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD
         + DSEENTNM+QVIDTDRDTN+EN GEKGKGS I+MI HASEDEKSE TS+ ++DQVI      DKENDGE+G+GS+ NT  HA KDPKSE+NS+FD+D
Subjt:  VIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTD

Query:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT
        QVIDTD+DA+KENDEERRKGSNF   N+SE+P  EK SNL+++QVV T    DKENDE RGN SNF+ LI  SK PNSE SS+L+  QHESPETNAESLT
Subjt:  QVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSNFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLT

Query:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDN
        GSSDDG TD EKKKGDLVEPR  HGYT PA KNVDTKD+GT T + CH +S SLAEE L IES NSS+QVPEVED     K E +LR E LGTET  EDN
Subjt:  GSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDN

Query:  IPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERRE
        IPTQ KISN VQEEFNT  SH E NAEE EVSP+FVA+NK  NEAP EDCEDSDGEYLEISEQGMDILNLSIGDCKH NEEMGETTE+ST+NEHE ERRE
Subjt:  IPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGETTEISTNNEHEVERRE

Query:  PDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGR
        PDES FEP+LGFQPQ Q+KET+I FQ+AESTDESIS PRQE DT TEKSK NPSDS SY +TAS+T T+T PSTNPIDEQSL TLPFSTFGGEDQDSPGR
Subjt:  PDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGR

Query:  TSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQ
        TSNES+SENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL N+QEISFPNPMEKRVVKLGRSDSEKSRPSFPGF KE EES MEFKAIDQ
Subjt:  TSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQ

Query:  NNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        NNF  EKK AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  NNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

A0A5D3D317 Serine-aspartate repeat-containing protein F isoform X10.0e+0073.66Show/hide
Query:  SPCFAIAILSTTDQAVTYHHHHHNGKKVLRFVELNMGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHV
        +P  +   LSTT QAV YHHHHHNG     FVELNMGNEMGNNNTSEFREEEK KAEGPE  L+EGG+N+ EADEVAGF QKE +PSSG+ D+ NGDHHV
Subjt:  SPCFAIAILSTTDQAVTYHHHHHNGKKVLRFVELNMGNEMGNNNTSEFREEEKAKAEGPEKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHV

Query:  TEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEVNSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQH
        +EKEEEK+K  +  EFQVVSE L D+T EDN VEV SKE+KEME +SEENRSD       +QAS+QK+EE +RGSNLNTTVLSL+ P+L+KT        
Subjt:  TEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEVNSKEDKEMEFNSEENRSDGNEHEHERQASNQKDEEGERGSNLNTTVLSLDHPELKKTTDFKSNQH

Query:  ELLAESLIEDSNKSPDECKDALELSLNNTSHSADFVIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------
                       ++CKDALE S  NT HSAD  + DSEENTNM+QVIDTDRDTN+EN GEKGKGS I+MI HASEDEKSE TS+ ++DQVI      
Subjt:  ELLAESLIEDSNKSPDECKDALELSLNNTSHSADFVIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASEDEKSEKTSEFDIDQVI------

Query:  DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTDQVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSN
        DKENDGE+G+GS+ NT  HA KDPKSE+NS+FD+DQVIDTD+DA+KENDEERRKGSNF   N+SE+P  EK SNL+++QVV T    DKENDE RGN SN
Subjt:  DKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTDQVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHT----DKENDEERGNGSN

Query:  FNTLIHTSKHPNSETSSDLKSIQHESPETNAESLTGSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVN
        F+ LI  SK PNSE SS+L+  QHESPETNAESLTGSSDDG TD EKKKGDLVEPR  HGYT PA KNVDTKD+GT T + CH +S SLAEE L IES N
Subjt:  FNTLIHTSKHPNSETSSDLKSIQHESPETNAESLTGSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVN

Query:  SSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDNIPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGM
        SS+QVPEVED     K E +LR E LGTET  EDNIPTQ KISN VQEEFNT  SH E NAEE EVSP+FVA+NK  NEAP EDCEDSDGEYLEISEQGM
Subjt:  SSMQVPEVEDRCTVLKEESELR-EHLGTETFFEDNIPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGM

Query:  DILNLSIGDCKHNNEEMGETTEISTNNEHEVERREPDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASA
        DILNLSIGDCKH NEEMGETTE+STNNEHE +RREPDES FEP+LGFQPQ Q+KET+I FQ+AESTDESIS PRQE DT TEKSK NPSDS SY +TAS+
Subjt:  DILNLSIGDCKHNNEEMGETTEISTNNEHEVERREPDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASA

Query:  TLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRTSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKR
        T T+T P+TNPIDEQSL TLPFSTFGGEDQDSPGRTSNES+SENSIG IEMRKSPSFNIDIQ EG+ GETEKTPLLYQIKTIEDL N+QEISFPNPMEKR
Subjt:  TLTQTNPSTNPIDEQSLTTLPFSTFGGEDQDSPGRTSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKR

Query:  VVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQNNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        V+KLGRSDSEKSRPSFPGF KE EES MEFKAIDQNNF  EKK AK+LPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  VVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQNNFAAEKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN

SwissProt top hitse value%identityAlignment
O23324 ATP-sulfurylase 3, chloroplastic3.7e-20780.18Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYTHLGAPIKVSIA-SSKSKKTHLR-LRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSE
        MASM+T F K P+ F S Q  + +H    +  SI+  S SK   LR + V AGLI+PDGG+LV+L+V EP R  KK EA  LPR+ L++ID+QW+HVLSE
Subjt:  MASMATRFTKTPSPFHSIQRTSYTHLGAPIKVSIA-SSKSKKTHLR-LRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSE

Query:  GWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQA
        GWASPL GFMRESEFLQTLHFN L L DGSV NMSVPIVLAIDD QK  IG+S  V+L DS +NPIAIL DIEIYKHPKEERIARTWGTTAPGLPYV++A
Subjt:  GWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQA

Query:  ITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWR
        ITNAG+WLIGGDLEV+EP+KYNDGLDRFR SP ELR+E  +R ADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGG+TKADDVPLSWR
Subjt:  ITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWR

Query:  MKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKT
        MKQHEKVLEDGVLDPETTVVSIFPSPM YAGPTEVQWHAKARINAGANFYIVGRDPAGMGHP EKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKT
Subjt:  MKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKT

Query:  QGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDF
        QGKMAFFDPSRAQDFLFISGTKMR LAKN+ENPP+GFMCP     L+D+
Subjt:  QGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDF

Q27128 Bifunctional 3'-phosphoadenosine 5'-phosphosulfate synthase2.0e-13661.14Show/hide
Query:  ELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKN
        EL V E      K E + LP +E++ +D QWV VLSEGWA+PLTGFMRE E+LQ+ HF  L   DG V N S+PIVL +  A K  +  S++ AL   + 
Subjt:  ELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKN

Query:  NPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALL
          IAIL+  E Y+H KEER +R +GT+  G PYV + I  +G+WL+GGDLEV+E I +NDGLD +R +P ELR +F   NADAVFAFQLRNPVHNGHALL
Subjt:  NPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALL

Query:  MTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPT
        MTDTRRRL E GYK P+LLLHPLGG+TK DDVPL+WRMKQH+ +L++ VLDP+ TV++IFPSPM YAGPTEVQWHAKAR++ GANFYIVGRDPAGM HP 
Subjt:  MTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPT

Query:  EKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDF
         K+DLY+A HG KVL+MAPGL +L I+PFRVAAY+KT+  M F+DP R  +F+FISGTKMR +A+  E PP GFM P A   ++++
Subjt:  EKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDF

Q43870 ATP sulfurylase 24.1e-18273.83Show/hide
Query:  RLRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQ
        ++ V + LI PDGG LVEL+V E   G+KK E+ ++P+++L+ ID++WVHV+SEGWASPL GFMRE E+LQ+LHFNSLRL +G+  NMS+PIVLAIDD  
Subjt:  RLRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQ

Query:  KHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADA
        K  IG S +VAL   + + I  L+ +EIYKH KEERIARTWGTT+PGLPYV++ IT +GNWLIGGDLEV EPIKYNDGLD +R SP +LREEF  R ADA
Subjt:  KHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADA

Query:  VFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAG
        VFAFQLRNPVHNGHALLM DTR+RLLEMGYKNP+LLLHPLGG+TKADDVPL  RM+QH KVLEDGVLDP+TT+VSIFPSPMHYAGPTEVQWHAKARINAG
Subjt:  VFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAG

Query:  ANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHL
        ANFYIVGRDPAGMGHPTEKRDLYD DHGK+VLSMAPGLE+LNILPFRVAAYD  + KMAFFDPSRA++FLFISGTKMRT A+  ENPP+GFMCP   + L
Subjt:  ANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHL

Query:  LDFVQ
        + + +
Subjt:  LDFVQ

Q9LIK9 ATP sulfurylase 1, chloroplastic7.6e-21380.67Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYTHLGAPIKVSIASSKSKKTHLRL-RVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEG
        MASMA   +KTP     + ++S     + +  +  S  SK    R+  + AGLI PDGG+LVEL+VEEP R  KK EA  LPR+EL++ID+QW+HVLSEG
Subjt:  MASMATRFTKTPSPFHSIQRTSYTHLGAPIKVSIASSKSKKTHLRL-RVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEG

Query:  WASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAI
        WASPL GFMRESEFLQTLHFNSLRL DGSV NMSVPIVLAIDD QK  IG+ST VALF+S  NP+AIL DIEIYKHPKEERIARTWGTTAPGLPYVD+AI
Subjt:  WASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAI

Query:  TNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRM
        TNAGNWLIGGDLEV+EP+KYNDGLDRFR SPAELR+E  +RNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGG+TKADDVPL WRM
Subjt:  TNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRM

Query:  KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQ
        KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHP EKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQ
Subjt:  KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQ

Query:  GKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDFVQ
        GKMAFFDPSR QDFLFISGTKMRTLAKN ENPP+GFMCP     L+D+ +
Subjt:  GKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDFVQ

Q9S7D8 ATP sulfurylase 4, chloroplastic1.0e-20177.33Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYTHLG--APIKVSIASSKSKKTHLRLRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLP-RIELSSIDIQWVHVLS
        MAS A     + SPF S       H    AP  +S+ S   + +   L V +GLI+PDGG+L+ L+VEE  R + K EA ++P RI+L+ +D++WVHVLS
Subjt:  MASMATRFTKTPSPFHSIQRTSYTHLG--APIKVSIASSKSKKTHLRLRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLP-RIELSSIDIQWVHVLS

Query:  EGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQ
        EGWASPL GFMR+SEFLQTLHFNS RL DGSV NMSVPIVLAIDD QK  IGDS  V L DS  NPIAIL DIEIYKHPKEERIARTWGTTA GLPY ++
Subjt:  EGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQ

Query:  AITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSW
        AIT AGNWLIGGDL+V+EPIKYNDGLDRFR SP++LREEF RR ADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNP+LLL+PLGG+TKADDVPLSW
Subjt:  AITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSW

Query:  RMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDK
        RM+QHEKVLEDGVLDPETTVVSIFPSPM YAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPF+VAAYDK
Subjt:  RMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDK

Query:  TQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDF
        TQGKMAFFDPSR+QDFLFISGTKMR LAK KENPP+GFMCP     L+D+
Subjt:  TQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDF

Arabidopsis top hitse value%identityAlignment
AT1G19920.1 Pseudouridine synthase/archaeosine transglycosylase-like family protein2.9e-18373.83Show/hide
Query:  RLRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQ
        ++ V + LI PDGG LVEL+V E   G+KK E+ ++P+++L+ ID++WVHV+SEGWASPL GFMRE E+LQ+LHFNSLRL +G+  NMS+PIVLAIDD  
Subjt:  RLRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQ

Query:  KHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADA
        K  IG S +VAL   + + I  L+ +EIYKH KEERIARTWGTT+PGLPYV++ IT +GNWLIGGDLEV EPIKYNDGLD +R SP +LREEF  R ADA
Subjt:  KHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADA

Query:  VFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAG
        VFAFQLRNPVHNGHALLM DTR+RLLEMGYKNP+LLLHPLGG+TKADDVPL  RM+QH KVLEDGVLDP+TT+VSIFPSPMHYAGPTEVQWHAKARINAG
Subjt:  VFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAG

Query:  ANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHL
        ANFYIVGRDPAGMGHPTEKRDLYD DHGK+VLSMAPGLE+LNILPFRVAAYD  + KMAFFDPSRA++FLFISGTKMRT A+  ENPP+GFMCP   + L
Subjt:  ANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHL

Query:  LDFVQ
        + + +
Subjt:  LDFVQ

AT3G22890.1 ATP sulfurylase 15.4e-21480.67Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYTHLGAPIKVSIASSKSKKTHLRL-RVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEG
        MASMA   +KTP     + ++S     + +  +  S  SK    R+  + AGLI PDGG+LVEL+VEEP R  KK EA  LPR+EL++ID+QW+HVLSEG
Subjt:  MASMATRFTKTPSPFHSIQRTSYTHLGAPIKVSIASSKSKKTHLRL-RVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEG

Query:  WASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAI
        WASPL GFMRESEFLQTLHFNSLRL DGSV NMSVPIVLAIDD QK  IG+ST VALF+S  NP+AIL DIEIYKHPKEERIARTWGTTAPGLPYVD+AI
Subjt:  WASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAI

Query:  TNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRM
        TNAGNWLIGGDLEV+EP+KYNDGLDRFR SPAELR+E  +RNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGG+TKADDVPL WRM
Subjt:  TNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRM

Query:  KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQ
        KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHP EKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQ
Subjt:  KQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQ

Query:  GKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDFVQ
        GKMAFFDPSR QDFLFISGTKMRTLAKN ENPP+GFMCP     L+D+ +
Subjt:  GKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDFVQ

AT4G14680.1 Pseudouridine synthase/archaeosine transglycosylase-like family protein2.6e-20880.18Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYTHLGAPIKVSIA-SSKSKKTHLR-LRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSE
        MASM+T F K P+ F S Q  + +H    +  SI+  S SK   LR + V AGLI+PDGG+LV+L+V EP R  KK EA  LPR+ L++ID+QW+HVLSE
Subjt:  MASMATRFTKTPSPFHSIQRTSYTHLGAPIKVSIA-SSKSKKTHLR-LRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSE

Query:  GWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQA
        GWASPL GFMRESEFLQTLHFN L L DGSV NMSVPIVLAIDD QK  IG+S  V+L DS +NPIAIL DIEIYKHPKEERIARTWGTTAPGLPYV++A
Subjt:  GWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQA

Query:  ITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWR
        ITNAG+WLIGGDLEV+EP+KYNDGLDRFR SP ELR+E  +R ADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGG+TKADDVPLSWR
Subjt:  ITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWR

Query:  MKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKT
        MKQHEKVLEDGVLDPETTVVSIFPSPM YAGPTEVQWHAKARINAGANFYIVGRDPAGMGHP EKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKT
Subjt:  MKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKT

Query:  QGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDF
        QGKMAFFDPSRAQDFLFISGTKMR LAKN+ENPP+GFMCP     L+D+
Subjt:  QGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDF

AT5G43780.1 Pseudouridine synthase/archaeosine transglycosylase-like family protein7.3e-20377.33Show/hide
Query:  MASMATRFTKTPSPFHSIQRTSYTHLG--APIKVSIASSKSKKTHLRLRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLP-RIELSSIDIQWVHVLS
        MAS A     + SPF S       H    AP  +S+ S   + +   L V +GLI+PDGG+L+ L+VEE  R + K EA ++P RI+L+ +D++WVHVLS
Subjt:  MASMATRFTKTPSPFHSIQRTSYTHLG--APIKVSIASSKSKKTHLRLRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLP-RIELSSIDIQWVHVLS

Query:  EGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQ
        EGWASPL GFMR+SEFLQTLHFNS RL DGSV NMSVPIVLAIDD QK  IGDS  V L DS  NPIAIL DIEIYKHPKEERIARTWGTTA GLPY ++
Subjt:  EGWASPLTGFMRESEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQ

Query:  AITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSW
        AIT AGNWLIGGDL+V+EPIKYNDGLDRFR SP++LREEF RR ADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNP+LLL+PLGG+TKADDVPLSW
Subjt:  AITNAGNWLIGGDLEVIEPIKYNDGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSW

Query:  RMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDK
        RM+QHEKVLEDGVLDPETTVVSIFPSPM YAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPF+VAAYDK
Subjt:  RMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGPTEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDK

Query:  TQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDF
        TQGKMAFFDPSR+QDFLFISGTKMR LAK KENPP+GFMCP     L+D+
Subjt:  TQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDASSHLLDF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCATGGCGACTCGTTTCACCAAAACCCCATCCCCATTTCATTCTATTCAAAGAACTTCTTACACCCATTTGGGCGCTCCGATTAAAGTCTCCATTGCTAGCTC
TAAGTCGAAGAAGACCCATCTCCGCCTCCGGGTTTCCGCCGGTTTGATCCAACCGGACGGCGGAAGGCTGGTGGAGCTTCTAGTTGAGGAGCCTCTGAGGGGTCTGAAGA
AGAAGGAAGCCTTATCCTTGCCAAGGATCGAGCTTTCTAGCATCGATATTCAATGGGTTCATGTGCTCAGCGAAGGATGGGCCAGCCCTCTTACCGGATTCATGAGAGAA
TCCGAGTTCCTCCAAACGCTTCATTTCAATTCCCTCAGGCTTCCAGATGGGTCTGTGGCGAACATGTCAGTCCCCATTGTTCTGGCCATTGACGATGCCCAAAAGCACCA
CATCGGTGACTCCACCAGTGTGGCTCTCTTTGACTCCAAGAACAACCCCATCGCTATTCTCAAAGATATCGAGATCTACAAGCACCCTAAAGAAGAAAGGATTGCTAGAA
CGTGGGGCACCACTGCCCCGGGGCTCCCTTATGTTGATCAAGCTATAACCAATGCTGGTAACTGGCTGATAGGAGGCGACTTAGAGGTTATAGAGCCAATCAAGTACAAT
GATGGCCTTGACCGTTTTCGACAGTCACCTGCAGAGCTACGGGAGGAATTTACCCGGCGAAATGCAGATGCAGTGTTTGCATTTCAACTTCGAAACCCAGTACATAACGG
GCATGCATTACTAATGACTGATACCCGCCGTCGACTGCTCGAGATGGGATACAAGAACCCCATTTTGTTGCTTCATCCTTTGGGTGGCTACACCAAGGCAGACGATGTTC
CACTTAGTTGGCGAATGAAGCAACATGAGAAGGTACTCGAGGATGGTGTTCTTGATCCAGAGACAACAGTCGTCTCCATATTTCCCTCTCCAATGCATTATGCAGGGCCA
ACTGAGGTGCAGTGGCATGCAAAGGCAAGAATTAATGCAGGAGCCAACTTTTACATTGTTGGACGGGACCCAGCTGGAATGGGTCATCCTACTGAGAAAAGGGATTTATA
TGATGCTGACCATGGGAAAAAAGTGTTGAGCATGGCACCTGGACTAGAGCGGCTAAACATCCTTCCTTTCAGAGTTGCTGCTTATGATAAAACTCAGGGCAAAATGGCCT
TCTTCGACCCCTCTAGAGCTCAGGATTTCCTTTTCATATCTGGCACCAAGATGAGAACACTAGCGAAGAATAAAGAGAATCCACCAGAAGGGTTTATGTGCCCAGATGCC
TCATCGCATCTTTTGGATTTTGTACAACACAGCATATTGTGTAATGGTGTGCTTTATGTTCAATTCTATGAAGGAGATGGGTTTACTGAATTGTTTCAAGTAGTGATGGG
GACGAGTGAGCTTTTTTATTCATATGGTTACTTTAAAACTTATTCTCCCTGTTTTGCAATTGCAATTCTGTCTACAACAGATCAAGCTGTCACATATCATCACCATCATC
ACAATGGTAAGAAAGTTTTGAGATTTGTTGAGTTGAATATGGGAAATGAGATGGGAAACAATAACACATCTGAGTTTAGAGAGGAAGAAAAGGCAAAAGCTGAAGGTCCA
GAAAAATCTTTAGTAGAAGGCGGGGAAAATAAAGTCGAAGCGGATGAAGTTGCAGGCTTTTGCCAAAAAGAGGTGAAGCCGAGTTCTGGTGACACAGATAGAGCAAATGG
AGATCATCATGTCACAGAGAAAGAGGAAGAAAAAAGTAAAGCTTGTAATACAGCAGAGTTTCAAGTGGTTTCTGAAAAATTACCTGACCGAACTAAAGAGGACAATGAAG
TCGAGGTCAATTCTAAAGAGGATAAAGAAATGGAGTTCAACTCGGAAGAGAATAGATCTGACGGAAATGAGCACGAGCATGAAAGGCAGGCTTCAAATCAGAAAGATGAA
GAAGGGGAGAGAGGTTCCAATTTGAACACAACAGTACTTTCATTAGATCACCCGGAACTTAAAAAGACTACAGATTTCAAGAGTAATCAGCACGAATTGCTAGCCGAATC
ATTGATAGAAGATAGCAACAAATCCCCAGATGAATGCAAGGATGCTTTGGAATTGAGCTTGAACAACACTTCCCATTCAGCAGATTTTGTCATCGCAGATTCAGAAGAAA
ACACCAATATGGATCAAGTTATTGATACTGACCGAGATACTAATAGGGAAAATCATGGAGAGAAGGGGAAGGGTTCCAACATCGACATGATACCCCATGCATCAGAAGAT
GAAAAATCTGAAAAGACATCAGAGTTTGATATTGATCAAGTTATTGATAAGGAAAATGATGGAGAGAAGGGAAAAGGTTCAAACTCCAACACGAAAGCACACGCATCAAA
AGATCCAAAATCTGAAATGAATTCAGAATTCGACACTGATCAAGTTATTGATACTGATATAGATGCATATAAGGAAAATGATGAAGAGAGGAGAAAGGGTTCCAACTTCA
ACACGATAAATGCATCAGAAGATCCAAATTTTGAAAAGATTTCAAATTTGGAAACTGATCAAGTAGTTCATACAGATAAGGAAAACGATGAAGAGAGGGGCAACGGCTCC
AACTTCAACACGTTAATACATACATCAAAGCATCCAAATTCTGAAACTAGTTCAGATCTTAAGTCTATTCAGCATGAGTCACCTGAAACCAACGCTGAATCATTGACAGG
ATCAAGTGATGATGGAGACACAGATACGGAGAAGAAAAAAGGTGATCTAGTAGAGCCAAGGCCGTCCCATGGATATACAATGCCAGCTACCAAAAATGTGGATACTAAAG
ATGAAGGAACTGCGACCTATTTGACATGTCATAAATCTTCATGTTCACTAGCAGAAGAGAGTCTGGGGATAGAATCAGTCAATTCATCTATGCAAGTTCCTGAGGTTGAA
GACAGATGCACGGTTTTGAAAGAAGAATCTGAATTAAGAGAACATTTAGGAACTGAAACGTTCTTTGAAGACAATATTCCAACTCAACCTAAGATTTCAAATGGGGTCCA
AGAGGAATTTAACACGATTGGGTCACACTACGAGAACAATGCAGAAGAGACTGAAGTTTCTCCTAAATTTGTTGCCAAGAACAAGAACCAGAACGAGGCTCCTGGAGAGG
ATTGCGAGGATTCAGATGGAGAATATTTGGAAATTTCGGAACAAGGTATGGATATTTTAAATTTATCTATTGGAGATTGCAAGCATAACAATGAGGAGATGGGAGAGACA
ACTGAAATTTCAACAAACAATGAACATGAAGTGGAAAGACGAGAACCAGATGAAAGCCTATTTGAACCTGTTTTAGGGTTTCAACCTCAAATTCAGCGAAAGGAAACTTC
AATCGCATTTCAATCTGCAGAAAGTACAGACGAATCGATCTCAGAACCGAGGCAAGAAATCGATACAGCAACTGAAAAATCCAAGAAAAATCCCTCGGATTCCCCAAGTT
ATATTGAAACAGCTTCAGCAACACTCACACAAACTAATCCATCGACAAATCCGATCGACGAACAGAGTTTAACAACACTTCCATTCTCCACATTCGGGGGTGAAGATCAA
GATTCTCCAGGAAGAACGAGCAACGAATCAGTTTCCGAAAATTCAATTGGTCATATCGAGATGCGTAAATCACCGAGCTTCAATATAGATATCCAATGCGAAGGAAGGGC
AGGAGAAACAGAGAAAACTCCATTGCTATACCAAATTAAGACAATCGAAGACTTACCAAATATGCAGGAGATTAGCTTCCCAAATCCAATGGAGAAACGAGTTGTGAAGT
TAGGAAGAAGCGATTCAGAGAAATCGAGACCTTCTTTCCCAGGGTTCGCGAAAGAAACAGAAGAATCCAGGATGGAATTCAAAGCAATCGATCAAAATAACTTCGCGGCT
GAGAAGAAGGCAGCAAAAGACTTACCACCGCCATCGCCAATCCGTAAAGGGAAGCGCAGAACGAAATCCCTAATTTTTGGGACATGCATCTGCTGTGCTACAGCGATCAA
TTGA
mRNA sequenceShow/hide mRNA sequence
CATTCCACCTCCAATTTTAGTTGCATTATAAATTTATATCCCATCCCACTAACTCCAAAAAAAAGTTGCAAAATCCCCCTCTCCGATTCCCCTTTTTGTTCCTTTTTACC
TCTCTGGATTAATCTTCTCCCATTTCATCCTCTCTTCCCTAATTTCAGCTTAATTTTCCCTTAATCTCGCTTTGAAGCCTTGGTCTCCATCTCAGATGGCGTCCATGGCG
ACTCGTTTCACCAAAACCCCATCCCCATTTCATTCTATTCAAAGAACTTCTTACACCCATTTGGGCGCTCCGATTAAAGTCTCCATTGCTAGCTCTAAGTCGAAGAAGAC
CCATCTCCGCCTCCGGGTTTCCGCCGGTTTGATCCAACCGGACGGCGGAAGGCTGGTGGAGCTTCTAGTTGAGGAGCCTCTGAGGGGTCTGAAGAAGAAGGAAGCCTTAT
CCTTGCCAAGGATCGAGCTTTCTAGCATCGATATTCAATGGGTTCATGTGCTCAGCGAAGGATGGGCCAGCCCTCTTACCGGATTCATGAGAGAATCCGAGTTCCTCCAA
ACGCTTCATTTCAATTCCCTCAGGCTTCCAGATGGGTCTGTGGCGAACATGTCAGTCCCCATTGTTCTGGCCATTGACGATGCCCAAAAGCACCACATCGGTGACTCCAC
CAGTGTGGCTCTCTTTGACTCCAAGAACAACCCCATCGCTATTCTCAAAGATATCGAGATCTACAAGCACCCTAAAGAAGAAAGGATTGCTAGAACGTGGGGCACCACTG
CCCCGGGGCTCCCTTATGTTGATCAAGCTATAACCAATGCTGGTAACTGGCTGATAGGAGGCGACTTAGAGGTTATAGAGCCAATCAAGTACAATGATGGCCTTGACCGT
TTTCGACAGTCACCTGCAGAGCTACGGGAGGAATTTACCCGGCGAAATGCAGATGCAGTGTTTGCATTTCAACTTCGAAACCCAGTACATAACGGGCATGCATTACTAAT
GACTGATACCCGCCGTCGACTGCTCGAGATGGGATACAAGAACCCCATTTTGTTGCTTCATCCTTTGGGTGGCTACACCAAGGCAGACGATGTTCCACTTAGTTGGCGAA
TGAAGCAACATGAGAAGGTACTCGAGGATGGTGTTCTTGATCCAGAGACAACAGTCGTCTCCATATTTCCCTCTCCAATGCATTATGCAGGGCCAACTGAGGTGCAGTGG
CATGCAAAGGCAAGAATTAATGCAGGAGCCAACTTTTACATTGTTGGACGGGACCCAGCTGGAATGGGTCATCCTACTGAGAAAAGGGATTTATATGATGCTGACCATGG
GAAAAAAGTGTTGAGCATGGCACCTGGACTAGAGCGGCTAAACATCCTTCCTTTCAGAGTTGCTGCTTATGATAAAACTCAGGGCAAAATGGCCTTCTTCGACCCCTCTA
GAGCTCAGGATTTCCTTTTCATATCTGGCACCAAGATGAGAACACTAGCGAAGAATAAAGAGAATCCACCAGAAGGGTTTATGTGCCCAGATGCCTCATCGCATCTTTTG
GATTTTGTACAACACAGCATATTGTGTAATGGTGTGCTTTATGTTCAATTCTATGAAGGAGATGGGTTTACTGAATTGTTTCAAGTAGTGATGGGGACGAGTGAGCTTTT
TTATTCATATGGTTACTTTAAAACTTATTCTCCCTGTTTTGCAATTGCAATTCTGTCTACAACAGATCAAGCTGTCACATATCATCACCATCATCACAATGGTAAGAAAG
TTTTGAGATTTGTTGAGTTGAATATGGGAAATGAGATGGGAAACAATAACACATCTGAGTTTAGAGAGGAAGAAAAGGCAAAAGCTGAAGGTCCAGAAAAATCTTTAGTA
GAAGGCGGGGAAAATAAAGTCGAAGCGGATGAAGTTGCAGGCTTTTGCCAAAAAGAGGTGAAGCCGAGTTCTGGTGACACAGATAGAGCAAATGGAGATCATCATGTCAC
AGAGAAAGAGGAAGAAAAAAGTAAAGCTTGTAATACAGCAGAGTTTCAAGTGGTTTCTGAAAAATTACCTGACCGAACTAAAGAGGACAATGAAGTCGAGGTCAATTCTA
AAGAGGATAAAGAAATGGAGTTCAACTCGGAAGAGAATAGATCTGACGGAAATGAGCACGAGCATGAAAGGCAGGCTTCAAATCAGAAAGATGAAGAAGGGGAGAGAGGT
TCCAATTTGAACACAACAGTACTTTCATTAGATCACCCGGAACTTAAAAAGACTACAGATTTCAAGAGTAATCAGCACGAATTGCTAGCCGAATCATTGATAGAAGATAG
CAACAAATCCCCAGATGAATGCAAGGATGCTTTGGAATTGAGCTTGAACAACACTTCCCATTCAGCAGATTTTGTCATCGCAGATTCAGAAGAAAACACCAATATGGATC
AAGTTATTGATACTGACCGAGATACTAATAGGGAAAATCATGGAGAGAAGGGGAAGGGTTCCAACATCGACATGATACCCCATGCATCAGAAGATGAAAAATCTGAAAAG
ACATCAGAGTTTGATATTGATCAAGTTATTGATAAGGAAAATGATGGAGAGAAGGGAAAAGGTTCAAACTCCAACACGAAAGCACACGCATCAAAAGATCCAAAATCTGA
AATGAATTCAGAATTCGACACTGATCAAGTTATTGATACTGATATAGATGCATATAAGGAAAATGATGAAGAGAGGAGAAAGGGTTCCAACTTCAACACGATAAATGCAT
CAGAAGATCCAAATTTTGAAAAGATTTCAAATTTGGAAACTGATCAAGTAGTTCATACAGATAAGGAAAACGATGAAGAGAGGGGCAACGGCTCCAACTTCAACACGTTA
ATACATACATCAAAGCATCCAAATTCTGAAACTAGTTCAGATCTTAAGTCTATTCAGCATGAGTCACCTGAAACCAACGCTGAATCATTGACAGGATCAAGTGATGATGG
AGACACAGATACGGAGAAGAAAAAAGGTGATCTAGTAGAGCCAAGGCCGTCCCATGGATATACAATGCCAGCTACCAAAAATGTGGATACTAAAGATGAAGGAACTGCGA
CCTATTTGACATGTCATAAATCTTCATGTTCACTAGCAGAAGAGAGTCTGGGGATAGAATCAGTCAATTCATCTATGCAAGTTCCTGAGGTTGAAGACAGATGCACGGTT
TTGAAAGAAGAATCTGAATTAAGAGAACATTTAGGAACTGAAACGTTCTTTGAAGACAATATTCCAACTCAACCTAAGATTTCAAATGGGGTCCAAGAGGAATTTAACAC
GATTGGGTCACACTACGAGAACAATGCAGAAGAGACTGAAGTTTCTCCTAAATTTGTTGCCAAGAACAAGAACCAGAACGAGGCTCCTGGAGAGGATTGCGAGGATTCAG
ATGGAGAATATTTGGAAATTTCGGAACAAGGTATGGATATTTTAAATTTATCTATTGGAGATTGCAAGCATAACAATGAGGAGATGGGAGAGACAACTGAAATTTCAACA
AACAATGAACATGAAGTGGAAAGACGAGAACCAGATGAAAGCCTATTTGAACCTGTTTTAGGGTTTCAACCTCAAATTCAGCGAAAGGAAACTTCAATCGCATTTCAATC
TGCAGAAAGTACAGACGAATCGATCTCAGAACCGAGGCAAGAAATCGATACAGCAACTGAAAAATCCAAGAAAAATCCCTCGGATTCCCCAAGTTATATTGAAACAGCTT
CAGCAACACTCACACAAACTAATCCATCGACAAATCCGATCGACGAACAGAGTTTAACAACACTTCCATTCTCCACATTCGGGGGTGAAGATCAAGATTCTCCAGGAAGA
ACGAGCAACGAATCAGTTTCCGAAAATTCAATTGGTCATATCGAGATGCGTAAATCACCGAGCTTCAATATAGATATCCAATGCGAAGGAAGGGCAGGAGAAACAGAGAA
AACTCCATTGCTATACCAAATTAAGACAATCGAAGACTTACCAAATATGCAGGAGATTAGCTTCCCAAATCCAATGGAGAAACGAGTTGTGAAGTTAGGAAGAAGCGATT
CAGAGAAATCGAGACCTTCTTTCCCAGGGTTCGCGAAAGAAACAGAAGAATCCAGGATGGAATTCAAAGCAATCGATCAAAATAACTTCGCGGCTGAGAAGAAGGCAGCA
AAAGACTTACCACCGCCATCGCCAATCCGTAAAGGGAAGCGCAGAACGAAATCCCTAATTTTTGGGACATGCATCTGCTGTGCTACAGCGATCAATTGAATTGATTTTGT
CGAGATCATTCACATCCAGCATTGGAGTTTCAGAGTGGTTTTCTTTCAATGATTTTTCTTCTTTTTTCCTATTTCAAGAGCGAAGAACGCCATGATCTGCTTGAAGTTGA
ATTCGGTGTTCGCCTTCGAAAAGAAAAGAATGTTGCACTTGCGGTTCTTATGAATTTCTTTTCTTCCCGCCAAAATACATTGTCATCCTGTGGTTTCAAATTTAACCCTA
TTTATTGTTTGATTTGTTTGTTAGTGGAATTTGTTTTGGGAAGGGATTTGTAATTTATTTATACATTAAAATTTTAATCTGTTTGGGAAGGGATTTGTAATTTATTTATA
CATATAAAATTTTAATCTGTCTCTAAAATTCTTTTTTCCTATAAAATATATCCC
Protein sequenceShow/hide protein sequence
MASMATRFTKTPSPFHSIQRTSYTHLGAPIKVSIASSKSKKTHLRLRVSAGLIQPDGGRLVELLVEEPLRGLKKKEALSLPRIELSSIDIQWVHVLSEGWASPLTGFMRE
SEFLQTLHFNSLRLPDGSVANMSVPIVLAIDDAQKHHIGDSTSVALFDSKNNPIAILKDIEIYKHPKEERIARTWGTTAPGLPYVDQAITNAGNWLIGGDLEVIEPIKYN
DGLDRFRQSPAELREEFTRRNADAVFAFQLRNPVHNGHALLMTDTRRRLLEMGYKNPILLLHPLGGYTKADDVPLSWRMKQHEKVLEDGVLDPETTVVSIFPSPMHYAGP
TEVQWHAKARINAGANFYIVGRDPAGMGHPTEKRDLYDADHGKKVLSMAPGLERLNILPFRVAAYDKTQGKMAFFDPSRAQDFLFISGTKMRTLAKNKENPPEGFMCPDA
SSHLLDFVQHSILCNGVLYVQFYEGDGFTELFQVVMGTSELFYSYGYFKTYSPCFAIAILSTTDQAVTYHHHHHNGKKVLRFVELNMGNEMGNNNTSEFREEEKAKAEGP
EKSLVEGGENKVEADEVAGFCQKEVKPSSGDTDRANGDHHVTEKEEEKSKACNTAEFQVVSEKLPDRTKEDNEVEVNSKEDKEMEFNSEENRSDGNEHEHERQASNQKDE
EGERGSNLNTTVLSLDHPELKKTTDFKSNQHELLAESLIEDSNKSPDECKDALELSLNNTSHSADFVIADSEENTNMDQVIDTDRDTNRENHGEKGKGSNIDMIPHASED
EKSEKTSEFDIDQVIDKENDGEKGKGSNSNTKAHASKDPKSEMNSEFDTDQVIDTDIDAYKENDEERRKGSNFNTINASEDPNFEKISNLETDQVVHTDKENDEERGNGS
NFNTLIHTSKHPNSETSSDLKSIQHESPETNAESLTGSSDDGDTDTEKKKGDLVEPRPSHGYTMPATKNVDTKDEGTATYLTCHKSSCSLAEESLGIESVNSSMQVPEVE
DRCTVLKEESELREHLGTETFFEDNIPTQPKISNGVQEEFNTIGSHYENNAEETEVSPKFVAKNKNQNEAPGEDCEDSDGEYLEISEQGMDILNLSIGDCKHNNEEMGET
TEISTNNEHEVERREPDESLFEPVLGFQPQIQRKETSIAFQSAESTDESISEPRQEIDTATEKSKKNPSDSPSYIETASATLTQTNPSTNPIDEQSLTTLPFSTFGGEDQ
DSPGRTSNESVSENSIGHIEMRKSPSFNIDIQCEGRAGETEKTPLLYQIKTIEDLPNMQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFAKETEESRMEFKAIDQNNFAA
EKKAAKDLPPPSPIRKGKRRTKSLIFGTCICCATAIN