; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024195 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024195
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDUF4378 domain-containing protein
Genome locationtig00001047:4079224..4083099
RNA-Seq ExpressionSgr024195
SyntenySgr024195
Gene Ontology termsNA
InterPro domainsIPR022212 - Domain of unknown function DUF3741
IPR025486 - Domain of unknown function DUF4378


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05761.1 uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa]1.0e-25157.94Show/hide
Query:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----
        MEP++ TASVLE LMGFDE QSQ   PRHS+V S+ Y QR ASIGI KK  PS+CHPFRMTIEEP ELF +LKV ++F     C +L +RE+ADS     
Subjt:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----

Query:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP
         + LTRH  M EKHFS  ++IQTSK F DLPEV+DSMDIS RP+R K        NGPSVSK++Y L +GNNDAGTKFKDR+QGQA  SEDL LLKSSRP
Subjt:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP

Query:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE
        FLEW +KL FSSS PTSL+GSHLV++K K  H+SQNGK+I +EKER T+   +EPIKQ S+VS ILDGSRR   H+ +NL +K SRSE+ Y ++ R E  
Subjt:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE

Query:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS
              + LSNW AE KHSC FSVES+KA ES E V+EEQRKTE LM S +GR MNEMP +PHYATLPSDLN KP KY FQKH CS+ EHLHSGSPLCLS
Subjt:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQLSKNSHRLRF-------------------------------------VLILLSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVS
         K +RLD+L K  HRLRF                                      L   SN+KDA++PT KLSSKKL+IFPCPDSAS H+ NDGCM+  
Subjt:  CKDERLDQLSKNSHRLRF-------------------------------------VLILLSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVS

Query:  DLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCR
        DLK  VEKK  C QHS N L     VVF TQN PVK+ NQAT                                                          
Subjt:  DLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCR

Query:  ATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGE
            SIQQ        EG AFEHYPSKE DSIVSLEE +QPSPVSVLEPLFKEETL  SESSGINSRDL+MQLELLM DSPGTNSEGH++FVSSDDDGGE
Subjt:  ATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGE

Query:  GSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGV
        GSIC+S++IDDIM+TFKFKDSR FSYLVDVLSEA L CKNL+ G                                                L  SFVGV
Subjt:  GSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGV

Query:  PEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF
        PEWAKPVSRRFRPL++ EMIEEELWILLDSQERE+NK+L+DKQFGKEI WIDLGDEIDSIC+ELERLL+ EL AEFG+ + F
Subjt:  PEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF

XP_008463525.1 PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo]1.0e-25157.94Show/hide
Query:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----
        MEP++ TASVLE LMGFDE QSQ   PRHS+V S+ Y QR ASIGI KK  PS+CHPFRMTIEEP ELF +LKV ++F     C +L +RE+ADS     
Subjt:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----

Query:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP
         + LTRH  M EKHFS  ++IQTSK F DLPEV+DSMDIS RP+R K        NGPSVSK++Y L +GNNDAGTKFKDR+QGQA  SEDL LLKSSRP
Subjt:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP

Query:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE
        FLEW +KL FSSS PTSL+GSHLV++K K  H+SQNGK+I +EKER T+   +EPIKQ S+VS ILDGSRR   H+ +NL +K SRSE+ Y ++ R E  
Subjt:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE

Query:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS
              + LSNW AE KHSC FSVES+KA ES E V+EEQRKTE LM S +GR MNEMP +PHYATLPSDLN KP KY FQKH CS+ EHLHSGSPLCLS
Subjt:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQLSKNSHRLRF-------------------------------------VLILLSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVS
         K +RLD+L K  HRLRF                                      L   SN+KDA++PT KLSSKKL+IFPCPDSAS H+ NDGCM+  
Subjt:  CKDERLDQLSKNSHRLRF-------------------------------------VLILLSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVS

Query:  DLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCR
        DLK  VEKK  C QHS N L     VVF TQN PVK+ NQAT                                                          
Subjt:  DLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCR

Query:  ATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGE
            SIQQ        EG AFEHYPSKE DSIVSLEE +QPSPVSVLEPLFKEETL  SESSGINSRDL+MQLELLM DSPGTNSEGH++FVSSDDDGGE
Subjt:  ATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGE

Query:  GSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGV
        GSIC+S++IDDIM+TFKFKDSR FSYLVDVLSEA L CKNL+ G                                                L  SFVGV
Subjt:  GSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGV

Query:  PEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF
        PEWAKPVSRRFRPL++ EMIEEELWILLDSQERE+NK+L+DKQFGKEI WIDLGDEIDSIC+ELERLL+ EL AEFG+ + F
Subjt:  PEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF

XP_022133834.1 uncharacterized protein LOC111006294 [Momordica charantia]0.0e+0069.4Show/hide
Query:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----
        M  KQCTASVLEALMGF+EQQS  H  RHSRVLSEGY QR ASIG+ KK  PSKCHPFR T+EEP ELF TL V DSFKSD  CNELG REK  S     
Subjt:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----

Query:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDK-----GVRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP
         M LTRHNFM  +HF  D+MIQTS D  +LPEV DSMDIS RPTR+K      V NG S+SKSH+ L +G NDAGTKF +RKQGQAC+ +D DLLKSS P
Subjt:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDK-----GVRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP

Query:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE
         LEW+DKL FSSSS TSL+GSHLVSEK K FH SQNGKH+A+EKERKTMV VVEPIKQPS+VS+ILD S RKTRHD VNL MKASRSES Y DVHR+ETE
Subjt:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE

Query:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS
        ++ + S GLSN KAEYKHSC FSVES+KA   RE + EEQ++T+KL+LSRQG    EMPIL H+ATLP+DLN KP KY FQKHVCSNKEHLHSGSPLCLS
Subjt:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQLSKNSHRLRFVLILL--------------------------------------SNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIV
        CKDERLDQ+SKNSHRLRF                                           S+ KDAS+PT KL SKKLRIFPCP+SAS HI++DGC++V
Subjt:  CKDERLDQLSKNSHRLRFVLILL--------------------------------------SNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIV

Query:  SDLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSC
          L+ RVEKK LC Q S+NSLSS  +VVF  +NNP    N+A EC L SDY DDNFSG ASNVLAVKTDDA+V TVDKQEPDS+SC + ETDGDSSTNS 
Subjt:  SDLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSC

Query:  RATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDD-G
        R TC SIQQE ST FDKEGP FEHYP KELDSIVSLEEAYQPSPVSVLEPLFKEET+S SESSGINSRDLMMQLELLMSDSPG+NSEGHEMFVSSDDD G
Subjt:  RATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDD-G

Query:  GEGSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKGCA----------------------------------------------LCMSFV
        GEGS CSSEEIDDIM+TFKFKDSRDFSYL+DVLSEAGL+C NLDKGC                                               L  S V
Subjt:  GEGSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKGCA----------------------------------------------LCMSFV

Query:  GVPEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFG
        GVPEWAKPVSRRFRPLLDREM+EEELWILLDSQERELNKDLVDKQFGKEIGWIDLG+EI+SICRELERLLIKEL AEFG
Subjt:  GVPEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFG

XP_038889736.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida]3.8e-27060.25Show/hide
Query:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----
        ME +QCT SVLEALMGFDE+Q Q HAPRHS VLS+ Y QRVASIGI KK  PS+CHPFRMT+EEP ELF + KV ++F     CNEL + EKADS     
Subjt:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----

Query:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP
         M LTRH  M EKHFS  ++IQTSKDF +LPEV+DSMDIS RPTR K       +NGPSVSK HY+  + NNDAGTK KDRK GQ  SSEDLD LKSSRP
Subjt:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP

Query:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAE------EKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDV
         LEWRDKL FSSSSPTSL GSHLV++K KD  SSQNGK+IA+      ++ ++TM Y ++PIKQ S+VS ILD SRR TRH  VNL++K SR  + Y DV
Subjt:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAE------EKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDV

Query:  HREETEYKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSG
         R ET+Y+R+SS  LSNW A+YKHSCFFSVES+KA ESRE V EEQRKTE L+ S QGR MNEMP LPH+A+LPSDLN KP K+ FQKHVCSNKEH HSG
Subjt:  HREETEYKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSG

Query:  SPLCLSCKDERLDQLSKNSHRLRFVLILL-------------------------------------SNEKDASEPTSKLSSKKLRIFPCPDSASDHIIND
        SPLCLS K +RLDQL KNSHRLRF                                          SN+KDASEP+ KLSSKKL+IFPCPDSASDH+ ND
Subjt:  SPLCLSCKDERLDQLSKNSHRLRFVLILL-------------------------------------SNEKDASEPTSKLSSKKLRIFPCPDSASDHIIND

Query:  GCMIVSDLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDS
         CM+  DLK +VEKK  C QHSLN LS     VF TQN PVK+ NQAT                                                    
Subjt:  GCMIVSDLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDS

Query:  STNSCRATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSS
                  SIQQ        EG  FEHYPSKE DSIVSLEEA+QPSPVSVLEPLFK+ETL  SES GIN RDLMMQLELLMSDSPGTNSEGH++FVSS
Subjt:  STNSCRATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSS

Query:  DDDGGEGSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKGCALC----------------------------------------------
        DDDGGEGSICSS EIDDIM+TFKFKDSRDFSYLVDVLSEA LHCK+L+ G   C                                              
Subjt:  DDDGGEGSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKGCALC----------------------------------------------

Query:  MSFVGVPEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF
         SF GVPEWAKPVSRRFRPLL+ EMIEEELWILLDSQERE+NKDLVDKQFGKEIGWIDLGDEIDSICRELERLL+ EL AEFG+ + F
Subjt:  MSFVGVPEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF

XP_038889740.1 uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida]4.7e-26860.14Show/hide
Query:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----
        ME +QCT SVLEALMGFDE+Q Q HAPRHS VLS+ Y QRVASIGI KK  PS+CHPFRMT+EEP ELF + KV ++F     CNEL + EKADS     
Subjt:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----

Query:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP
         M LTRH  M EKHFS  ++IQTSKDF +LPEV+DSMDIS RPTR K       +NGPSVSK HY+  + NNDAGTK KDRK GQ  SSEDLD LKSSRP
Subjt:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP

Query:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAE------EKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDV
         LEWRDKL FSSSSPTSL GSHLV++K KD  SSQNGK+IA+      ++ ++TM Y ++PIKQ S+VS ILD SRR TRH  VNL++K SR  + Y DV
Subjt:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAE------EKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDV

Query:  HREETEYKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSG
         R ET+Y+R+SS  LSNW A+YKHSCFFSVES+KA ESRE V EEQRKTE L+ S QGR MNEMP LPH+A+LPSDLN KP K+ FQKHVCSNKEH HSG
Subjt:  HREETEYKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSG

Query:  SPLCLSCKDERLDQLSKNSHRLRFVLILL-------------------------------------SNEKDASEPTSKLSSKKLRIFPCPDSASDHIIND
        SPLCLS K +RLDQL KNSHRLRF                                          SN+KDASEP+ KLSSKKL+IFPCPDSASDH+ ND
Subjt:  SPLCLSCKDERLDQLSKNSHRLRFVLILL-------------------------------------SNEKDASEPTSKLSSKKLRIFPCPDSASDHIIND

Query:  GCMIVSDLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDS
         CM+  DLK +VEKK  C QHSLN LS     VF TQN PVK+ NQAT                                                    
Subjt:  GCMIVSDLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDS

Query:  STNSCRATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSS
                  SIQQ        EG  FEHYPSKE DSIVSLEEA+QPSPVSVLEPLFK+ETL  SES GIN  DLMMQLELLMSDSPGTNSEGH++FVSS
Subjt:  STNSCRATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSS

Query:  DDDGGEGSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKGCALC----------------------------------------------
        DDDGGEGSICSS EIDDIM+TFKFKDSRDFSYLVDVLSEA LHCK+L+ G   C                                              
Subjt:  DDDGGEGSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKGCALC----------------------------------------------

Query:  MSFVGVPEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF
         SF GVPEWAKPVSRRFRPLL+ EMIEEELWILLDSQERE+NKDLVDKQFGKEIGWIDLGDEIDSICRELERLL+ EL AEFG+ + F
Subjt:  MSFVGVPEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF

TrEMBL top hitse value%identityAlignment
A0A0A0KNN6 DUF4378 domain-containing protein3.1e-24958.38Show/hide
Query:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----
        MEP+Q TASVLEALMGFDE QSQ  A RHS+V S+ Y QRVASIGI KK  PS+CHPFRMTIEEP ELF +LKV ++F     C +L +RE+ADS     
Subjt:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----

Query:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP
           LTRH    EKHFS  ++IQTSK F DLPEV+DSMDIS RPTR K       ++G SVS +HY L +GNNDAGTKFKDRKQGQA  SEDL LLKSSRP
Subjt:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP

Query:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE
        FLEW +KL FSSS P SL+GSHLV++K K  H+SQNGK+IA+EKER T+   +EPIKQ S+VS ILDGSRR  R +  NL++K SRSE+ Y +V R    
Subjt:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE

Query:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS
            + + LSNW AE KHSC FSVES+KA ES E V+EEQRKT  LM S QGR MNEMP +P YATLPSDLN KP +Y FQKHVCS+KEHLHSGSPLCLS
Subjt:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQLSKNSHRLRF------------------------------------VLILLSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVSD
         K +RLD+L K  HRLRF                                     L   SN+KDA++PT KLSSKKL+IFPCPDSAS H  NDGCM+  D
Subjt:  CKDERLDQLSKNSHRLRF------------------------------------VLILLSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVSD

Query:  LKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCRA
         K  V+KK  C QHSLN L     VVF TQN PVK+ NQAT                                                           
Subjt:  LKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCRA

Query:  TCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGEG
           SIQQ        EG AF+HYPSKE DSIVSLEEA+QPSPVSVLEPLFKEETL  SES GINSRDL+MQLELLMSDSPGTNSEGH++FVSSDDD GEG
Subjt:  TCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGEG

Query:  SICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGVP
        SIC+S++IDDIM+TFKFKDSR FSYLVDVLSEA LHCKNL+ G                                                L  SFVGVP
Subjt:  SICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGVP

Query:  EWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGA
        EWAKPVSRRFRPLL+ EMIEEELWILLDSQERE+NK+LVDKQFGKEI WIDLGDEI+SICRELE LL+ EL AEFG+
Subjt:  EWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGA

A0A1S4E497 uncharacterized protein LOC1035016595.1e-25257.94Show/hide
Query:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----
        MEP++ TASVLE LMGFDE QSQ   PRHS+V S+ Y QR ASIGI KK  PS+CHPFRMTIEEP ELF +LKV ++F     C +L +RE+ADS     
Subjt:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----

Query:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP
         + LTRH  M EKHFS  ++IQTSK F DLPEV+DSMDIS RP+R K        NGPSVSK++Y L +GNNDAGTKFKDR+QGQA  SEDL LLKSSRP
Subjt:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP

Query:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE
        FLEW +KL FSSS PTSL+GSHLV++K K  H+SQNGK+I +EKER T+   +EPIKQ S+VS ILDGSRR   H+ +NL +K SRSE+ Y ++ R E  
Subjt:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE

Query:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS
              + LSNW AE KHSC FSVES+KA ES E V+EEQRKTE LM S +GR MNEMP +PHYATLPSDLN KP KY FQKH CS+ EHLHSGSPLCLS
Subjt:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQLSKNSHRLRF-------------------------------------VLILLSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVS
         K +RLD+L K  HRLRF                                      L   SN+KDA++PT KLSSKKL+IFPCPDSAS H+ NDGCM+  
Subjt:  CKDERLDQLSKNSHRLRF-------------------------------------VLILLSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVS

Query:  DLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCR
        DLK  VEKK  C QHS N L     VVF TQN PVK+ NQAT                                                          
Subjt:  DLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCR

Query:  ATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGE
            SIQQ        EG AFEHYPSKE DSIVSLEE +QPSPVSVLEPLFKEETL  SESSGINSRDL+MQLELLM DSPGTNSEGH++FVSSDDDGGE
Subjt:  ATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGE

Query:  GSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGV
        GSIC+S++IDDIM+TFKFKDSR FSYLVDVLSEA L CKNL+ G                                                L  SFVGV
Subjt:  GSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGV

Query:  PEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF
        PEWAKPVSRRFRPL++ EMIEEELWILLDSQERE+NK+L+DKQFGKEI WIDLGDEIDSIC+ELERLL+ EL AEFG+ + F
Subjt:  PEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF

A0A5D3C1E7 DUF4378 domain-containing protein5.1e-25257.94Show/hide
Query:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----
        MEP++ TASVLE LMGFDE QSQ   PRHS+V S+ Y QR ASIGI KK  PS+CHPFRMTIEEP ELF +LKV ++F     C +L +RE+ADS     
Subjt:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----

Query:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP
         + LTRH  M EKHFS  ++IQTSK F DLPEV+DSMDIS RP+R K        NGPSVSK++Y L +GNNDAGTKFKDR+QGQA  SEDL LLKSSRP
Subjt:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP

Query:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE
        FLEW +KL FSSS PTSL+GSHLV++K K  H+SQNGK+I +EKER T+   +EPIKQ S+VS ILDGSRR   H+ +NL +K SRSE+ Y ++ R E  
Subjt:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE

Query:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS
              + LSNW AE KHSC FSVES+KA ES E V+EEQRKTE LM S +GR MNEMP +PHYATLPSDLN KP KY FQKH CS+ EHLHSGSPLCLS
Subjt:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQLSKNSHRLRF-------------------------------------VLILLSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVS
         K +RLD+L K  HRLRF                                      L   SN+KDA++PT KLSSKKL+IFPCPDSAS H+ NDGCM+  
Subjt:  CKDERLDQLSKNSHRLRF-------------------------------------VLILLSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVS

Query:  DLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCR
        DLK  VEKK  C QHS N L     VVF TQN PVK+ NQAT                                                          
Subjt:  DLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCR

Query:  ATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGE
            SIQQ        EG AFEHYPSKE DSIVSLEE +QPSPVSVLEPLFKEETL  SESSGINSRDL+MQLELLM DSPGTNSEGH++FVSSDDDGGE
Subjt:  ATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGE

Query:  GSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGV
        GSIC+S++IDDIM+TFKFKDSR FSYLVDVLSEA L CKNL+ G                                                L  SFVGV
Subjt:  GSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGV

Query:  PEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF
        PEWAKPVSRRFRPL++ EMIEEELWILLDSQERE+NK+L+DKQFGKEI WIDLGDEIDSIC+ELERLL+ EL AEFG+ + F
Subjt:  PEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGASDAF

A0A6J1BX36 uncharacterized protein LOC1110062940.0e+0069.4Show/hide
Query:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----
        M  KQCTASVLEALMGF+EQQS  H  RHSRVLSEGY QR ASIG+ KK  PSKCHPFR T+EEP ELF TL V DSFKSD  CNELG REK  S     
Subjt:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADS-----

Query:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDK-----GVRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP
         M LTRHNFM  +HF  D+MIQTS D  +LPEV DSMDIS RPTR+K      V NG S+SKSH+ L +G NDAGTKF +RKQGQAC+ +D DLLKSS P
Subjt:  -MSLTRHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDK-----GVRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRP

Query:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE
         LEW+DKL FSSSS TSL+GSHLVSEK K FH SQNGKH+A+EKERKTMV VVEPIKQPS+VS+ILD S RKTRHD VNL MKASRSES Y DVHR+ETE
Subjt:  FLEWRDKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETE

Query:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS
        ++ + S GLSN KAEYKHSC FSVES+KA   RE + EEQ++T+KL+LSRQG    EMPIL H+ATLP+DLN KP KY FQKHVCSNKEHLHSGSPLCLS
Subjt:  YKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQLSKNSHRLRFVLILL--------------------------------------SNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIV
        CKDERLDQ+SKNSHRLRF                                           S+ KDAS+PT KL SKKLRIFPCP+SAS HI++DGC++V
Subjt:  CKDERLDQLSKNSHRLRFVLILL--------------------------------------SNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIV

Query:  SDLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSC
          L+ RVEKK LC Q S+NSLSS  +VVF  +NNP    N+A EC L SDY DDNFSG ASNVLAVKTDDA+V TVDKQEPDS+SC + ETDGDSSTNS 
Subjt:  SDLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSC

Query:  RATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDD-G
        R TC SIQQE ST FDKEGP FEHYP KELDSIVSLEEAYQPSPVSVLEPLFKEET+S SESSGINSRDLMMQLELLMSDSPG+NSEGHEMFVSSDDD G
Subjt:  RATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDD-G

Query:  GEGSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKGCA----------------------------------------------LCMSFV
        GEGS CSSEEIDDIM+TFKFKDSRDFSYL+DVLSEAGL+C NLDKGC                                               L  S V
Subjt:  GEGSICSSEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKGCA----------------------------------------------LCMSFV

Query:  GVPEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFG
        GVPEWAKPVSRRFRPLLDREM+EEELWILLDSQERELNKDLVDKQFGKEIGWIDLG+EI+SICRELERLLIKEL AEFG
Subjt:  GVPEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFG

A0A6J1HFD2 uncharacterized protein LOC1114637983.2e-17446.91Show/hide
Query:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASI-GILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADSMSLT
        MEP QC+ASVLEALMGFDE QS+  A  HSRVLSE Y QRVASI G  KK SPSKC PFRMTIEEP E+F               N L DRE       +
Subjt:  MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASI-GILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADSMSLT

Query:  RHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRPFLEWR
          NFMNEKHFS D +I TSKDF DLPEVIDSMDIS R TR K      V NGP+VSK H                                         
Subjt:  RHNFMNEKHFSADRMIQTSKDFLDLPEVIDSMDISRRPTRDKG-----VRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRPFLEWR

Query:  DKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETEYKRSS
                                                                                               + HR         
Subjt:  DKLHFSSSSPTSLEGSHLVSEKYKDFHSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETEYKRSS

Query:  SSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLSCKDER
               K E K SCF  VES+K GESRE V+EEQRK   LMLS+QGR MNEM ILPHYAT PSDLN KP  Y F   VC NK+HLHSGSPLCLSCKD R
Subjt:  SSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQRKTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLSCKDER

Query:  LDQLSKNSHR---------------------LRFVLIL----------------LSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVSDLKPR
         D+L K SHR                     LR    L                 SN+K+ SEP+SKLSSKKLRIFPCPDS SDH+ NDGC++ +DLK R
Subjt:  LDQLSKNSHR---------------------LRFVLIL----------------LSNEKDASEPTSKLSSKKLRIFPCPDSASDHIINDGCMIVSDLKPR

Query:  VEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCRATCNS
        VEK GLC QHS+NSLSS+ N                                                     E  SLS I+ ETDG SSTNSCRATC S
Subjt:  VEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNSCRATCNS

Query:  IQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGEGSICS
        IQQ        +G +FE Y SKELDSIV LEE YQPSPVSVLE  FKEE  S SESSG+N R    +L+LLM D+PGTNS+ HE+FVSS++D GEGSIC+
Subjt:  IQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGEGSICS

Query:  SEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGVPEWAK
        S+EI DIM+TFKFKDSRDFSYLVDV+SEAGLH +NL+KG                                                L  SFVGVPEWAK
Subjt:  SEEIDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKG----------------------------------------------CALCMSFVGVPEWAK

Query:  PVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGAS
        PVSRR RPLLD+EM+E+E+W LLDSQE+E NKDLVDKQFGKEIGWIDLGDEI SICRELE LLI EL AE G+S
Subjt:  PVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAEFGAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39435.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related2.5e-1730.86Show/hide
Query:  EEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMM--------QLELLMSDSPGTNSEGHEMFVSSDDDGGEGSICSSEEIDDIMNTFKFKDSRDFSYL
        E+A+QPSPVSVLEP+F E+ L  SE    +S DL          QLE L S+S  + S+G  M VSSD++    S     +  + +     ++SRD SY+
Subjt:  EEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMM--------QLELLMSDSPGTNSEGHEMFVSSDDDGGEGSICSSEEIDDIMNTFKFKDSRDFSYL

Query:  VDVLSEAGLHCKNLDKG----------------------------------------CALCMSFVGVPEWAKPVSRRFRPLLDREMIEEELWILLDSQER
         D+L+E  L  KN   G                                          +  SF   P W KPVSRR    L    +++ELW +L  QE+
Subjt:  VDVLSEAGLHCKNLDKG----------------------------------------CALCMSFVGVPEWAKPVSRRFRPLLDREMIEEELWILLDSQER

Query:  ELNKDLVDKQFGKEIG-WIDLGDEIDSICRELERLLIKELAAE
           K  + K    +I  W++L  + +S+  ELE +++ EL +E
Subjt:  ELNKDLVDKQFGKEIG-WIDLGDEIDSICRELERLLIKELAAE

AT2G39435.2 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related1.4e-1530.64Show/hide
Query:  EEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMM--------QLELLMSDSPGTNSEGHEMFVSSDDDGGEGSICSSEEIDDIMNTFKFKDSRDFSYL
        E+A+QPSPVSVLEP+F E+ L  SE    +S DL          QLE L S+S  + S+G  M VSSD++    S     +  + +     ++SRD SY+
Subjt:  EEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMM--------QLELLMSDSPGTNSEGHEMFVSSDDDGGEGSICSSEEIDDIMNTFKFKDSRDFSYL

Query:  VDVLSEAGLHCKNLDKG----------------------------------------CALCMSFVGVPEWAKPVSRRFRPLLDREMIEEELWILLDSQER
         D+L+E  L  KN   G                                          +  SF   P W KPVSRR    L    +++ELW +L  QE+
Subjt:  VDVLSEAGLHCKNLDKG----------------------------------------CALCMSFVGVPEWAKPVSRRFRPLLDREMIEEELWILLDSQER

Query:  ELNKDLVDKQFGKEIG-WIDLGDEIDSICRELERL
           K  + K    +I  W++L  + +S+  ELE++
Subjt:  ELNKDLVDKQFGKEIG-WIDLGDEIDSICRELERL

AT3G53540.1 unknown protein2.2e-0524.24Show/hide
Query:  EPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETEYKRSSSSGLSNWKAEYKHSC-FFSVESHKAGESREIVMEEQRKTEK----LML
        E  K  SR  K+  G+ R    ++      A    S   D   E      +S +  +  +  Y  S    S  S  + E++  + E  + T K    + +
Subjt:  EPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETEYKRSSSSGLSNWKAEYKHSC-FFSVESHKAGESREIVMEEQRKTEK----LML

Query:  SRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLSCKDERLDQLSKNSHRLRFVLILLSNEKDASEPTSKLSSKKLRIFPCPD
        SR G +  EM         P+  N    + G  K   +N +      P+ +S +D      S++  + R ++    N++ A   T  L    +       
Subjt:  SRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLSCKDERLDQLSKNSHRLRFVLILLSNEKDASEPTSKLSSKKLRIFPCPD

Query:  SASDHIINDGCMIVSDLKPRVEKKGLCYQHS--LNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSL
          S H  +    + S  +P   K    Y  S  ++   S    V+     P K    A+     S +  D  S T     +  +DD K  T    E   L
Subjt:  SASDHIINDGCMIVSDLKPRVEKKGLCYQHS--LNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSL

Query:  SCILLETDGDSSTNSCRATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGS---ESSGINSRDLMMQLELLMSDS
        S +   TD D S    R T   +            P      SKE D         QPSPVSVLE  F ++  SGS   ES   + R L MQL+LL  +S
Subjt:  SCILLETDGDSSTNSCRATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGS---ESSGINSRDLMMQLELLMSDS

Query:  PGTNSEGHEMFVSSDDDGGEGSICSSEEIDDIMNTFKFKDSR-DFSYLVDVLSEAGLH-----------------CKNLDKGCALCMSFVGVPE------
          T  EG  M VSSD+D  +    SS   D+ M T + ++     SYLVD+L+ +                     ++L+K  +   +   +        
Subjt:  PGTNSEGHEMFVSSDDDGGEGSICSSEEIDDIMNTFKFKDSR-DFSYLVDVLSEAGLH-----------------CKNLDKGCALCMSFVGVPE------

Query:  -----------------WAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAE
                         W K  S +  P  D   I+E L  L+  ++ + +K  V++   KE+ W+ L D+I+ I RE+E +L  EL  E
Subjt:  -----------------WAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICRELERLLIKELAAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCTAAACAGTGTACAGCTAGTGTTCTTGAAGCATTGATGGGCTTTGATGAGCAGCAATCTCAGCCCCATGCTCCAAGGCATTCTAGAGTTCTTTCTGAGGGTTA
TTTCCAAAGGGTTGCTTCTATTGGAATCCTGAAGAAGACATCCCCCTCCAAATGTCATCCTTTTAGGATGACCATAGAAGAGCCGAAAGAACTCTTTGGTACTCTCAAAG
TAGGAGATAGCTTCAAGAGTGATAGGCATTGCAACGAGTTAGGGGATAGGGAGAAGGCAGATTCTATGTCACTTACAAGACACAACTTCATGAATGAAAAGCACTTTTCA
GCAGATAGGATGATACAAACTTCAAAGGATTTTCTAGATTTACCAGAAGTTATAGATTCTATGGACATCTCACGAAGGCCTACAAGAGATAAGGGTGTCAGGAATGGACC
GAGTGTGTCAAAGTCACATTATGCTTTGGCACAAGGAAATAATGATGCAGGCACTAAATTTAAGGACAGGAAACAAGGACAAGCATGCTCATCTGAGGATCTTGATCTTT
TGAAGTCTTCAAGACCTTTTTTGGAGTGGAGAGATAAACTACACTTTTCTTCCTCTTCACCAACTTCTTTGGAAGGCTCACATTTAGTTAGTGAGAAATACAAAGATTTT
CATAGTTCTCAAAATGGAAAGCATATTGCTGAAGAAAAAGAAAGAAAGACAATGGTGTATGTAGTGGAGCCCATCAAGCAACCATCTCGAGTTTCAAAGATTCTGGATGG
AAGTAGGAGAAAAACAAGGCACGATTCCGTCAATTTGTATATGAAGGCATCAAGATCAGAATCCAAATATGGTGATGTGCATAGAGAAGAAACTGAGTACAAAAGGAGCT
CTTCCTCAGGTTTATCTAATTGGAAGGCTGAATACAAGCATTCCTGCTTCTTTTCAGTTGAGTCTCACAAGGCCGGAGAATCTAGGGAGATAGTCATGGAAGAACAAAGG
AAGACTGAAAAATTGATGCTATCTAGGCAAGGTAGGATAATGAATGAAATGCCTATACTACCTCATTATGCAACTTTGCCCAGTGATTTGAATCGCAAGCCCGGCAAGTA
TGGTTTCCAGAAGCATGTTTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCTGCAAGGATGAAAGACTAGATCAACTCAGTAAAAACTCCCACA
GATTGAGATTTGTTCTGATTCTACTTTCCAATGAAAAAGATGCTTCAGAACCTACCTCGAAATTGAGCTCTAAGAAGTTGAGGATTTTTCCTTGCCCTGATTCAGCAAGT
GATCACATTATCAATGATGGCTGCATGATTGTGAGTGATCTGAAGCCCAGAGTCGAGAAGAAGGGCCTTTGTTATCAGCATTCTTTAAATTCTCTATCATCAAGTTGCAA
TGTTGTTTTCTACACACAAAACAATCCCGTCAAACGAGAAAATCAGGCAACTGAGTGTTATTTGAACAGTGATTATCAAGATGATAATTTTTCAGGTACGGCTTCTAATG
TATTGGCTGTAAAGACTGATGATGCGAAGGTCCTTACTGTGGATAAGCAGGAACCTGATTCATTGTCCTGCATTCTTTTGGAGACTGATGGTGATTCATCTACCAACTCT
TGTCGTGCCACATGCAATTCTATTCAACAGGAAGTATCAACCAGATTTGATAAGGAAGGTCCTGCCTTTGAACACTACCCTAGCAAAGAGCTAGATTCTATTGTGAGTTT
GGAGGAGGCTTATCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAAACATTGTCCGGTTCCGAATCCTCAGGCATTAACAGTAGAGATTTGATGATGC
AACTTGAACTTCTGATGTCGGACTCCCCGGGAACTAACTCAGAAGGACATGAAATGTTCGTATCAAGTGATGATGATGGTGGAGAAGGATCGATATGCAGCTCCGAAGAA
ATTGACGACATAATGAACACATTCAAATTCAAAGATAGTAGGGATTTTTCATACCTTGTTGATGTCTTAAGTGAGGCAGGCTTGCATTGTAAAAACCTGGATAAGGGTTG
TGCTTTGTGCATGTCATTTGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCGAGAAGATTTCGGCCTTTGCTTGATCGGGAAATGATCGAGGAAGAACTATGGATTCTGC
TGGATAGCCAAGAAAGGGAATTGAACAAAGATTTAGTAGATAAGCAGTTTGGAAAGGAGATTGGGTGGATAGATCTTGGAGACGAGATTGATTCTATTTGTAGAGAACTA
GAGAGATTGTTGATCAAAGAGCTTGCTGCAGAGTTCGGAGCATCAGATGCTTTTGAACGGAATGATATATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAACCTAAACAGTGTACAGCTAGTGTTCTTGAAGCATTGATGGGCTTTGATGAGCAGCAATCTCAGCCCCATGCTCCAAGGCATTCTAGAGTTCTTTCTGAGGGTTA
TTTCCAAAGGGTTGCTTCTATTGGAATCCTGAAGAAGACATCCCCCTCCAAATGTCATCCTTTTAGGATGACCATAGAAGAGCCGAAAGAACTCTTTGGTACTCTCAAAG
TAGGAGATAGCTTCAAGAGTGATAGGCATTGCAACGAGTTAGGGGATAGGGAGAAGGCAGATTCTATGTCACTTACAAGACACAACTTCATGAATGAAAAGCACTTTTCA
GCAGATAGGATGATACAAACTTCAAAGGATTTTCTAGATTTACCAGAAGTTATAGATTCTATGGACATCTCACGAAGGCCTACAAGAGATAAGGGTGTCAGGAATGGACC
GAGTGTGTCAAAGTCACATTATGCTTTGGCACAAGGAAATAATGATGCAGGCACTAAATTTAAGGACAGGAAACAAGGACAAGCATGCTCATCTGAGGATCTTGATCTTT
TGAAGTCTTCAAGACCTTTTTTGGAGTGGAGAGATAAACTACACTTTTCTTCCTCTTCACCAACTTCTTTGGAAGGCTCACATTTAGTTAGTGAGAAATACAAAGATTTT
CATAGTTCTCAAAATGGAAAGCATATTGCTGAAGAAAAAGAAAGAAAGACAATGGTGTATGTAGTGGAGCCCATCAAGCAACCATCTCGAGTTTCAAAGATTCTGGATGG
AAGTAGGAGAAAAACAAGGCACGATTCCGTCAATTTGTATATGAAGGCATCAAGATCAGAATCCAAATATGGTGATGTGCATAGAGAAGAAACTGAGTACAAAAGGAGCT
CTTCCTCAGGTTTATCTAATTGGAAGGCTGAATACAAGCATTCCTGCTTCTTTTCAGTTGAGTCTCACAAGGCCGGAGAATCTAGGGAGATAGTCATGGAAGAACAAAGG
AAGACTGAAAAATTGATGCTATCTAGGCAAGGTAGGATAATGAATGAAATGCCTATACTACCTCATTATGCAACTTTGCCCAGTGATTTGAATCGCAAGCCCGGCAAGTA
TGGTTTCCAGAAGCATGTTTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCTGCAAGGATGAAAGACTAGATCAACTCAGTAAAAACTCCCACA
GATTGAGATTTGTTCTGATTCTACTTTCCAATGAAAAAGATGCTTCAGAACCTACCTCGAAATTGAGCTCTAAGAAGTTGAGGATTTTTCCTTGCCCTGATTCAGCAAGT
GATCACATTATCAATGATGGCTGCATGATTGTGAGTGATCTGAAGCCCAGAGTCGAGAAGAAGGGCCTTTGTTATCAGCATTCTTTAAATTCTCTATCATCAAGTTGCAA
TGTTGTTTTCTACACACAAAACAATCCCGTCAAACGAGAAAATCAGGCAACTGAGTGTTATTTGAACAGTGATTATCAAGATGATAATTTTTCAGGTACGGCTTCTAATG
TATTGGCTGTAAAGACTGATGATGCGAAGGTCCTTACTGTGGATAAGCAGGAACCTGATTCATTGTCCTGCATTCTTTTGGAGACTGATGGTGATTCATCTACCAACTCT
TGTCGTGCCACATGCAATTCTATTCAACAGGAAGTATCAACCAGATTTGATAAGGAAGGTCCTGCCTTTGAACACTACCCTAGCAAAGAGCTAGATTCTATTGTGAGTTT
GGAGGAGGCTTATCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAAACATTGTCCGGTTCCGAATCCTCAGGCATTAACAGTAGAGATTTGATGATGC
AACTTGAACTTCTGATGTCGGACTCCCCGGGAACTAACTCAGAAGGACATGAAATGTTCGTATCAAGTGATGATGATGGTGGAGAAGGATCGATATGCAGCTCCGAAGAA
ATTGACGACATAATGAACACATTCAAATTCAAAGATAGTAGGGATTTTTCATACCTTGTTGATGTCTTAAGTGAGGCAGGCTTGCATTGTAAAAACCTGGATAAGGGTTG
TGCTTTGTGCATGTCATTTGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCGAGAAGATTTCGGCCTTTGCTTGATCGGGAAATGATCGAGGAAGAACTATGGATTCTGC
TGGATAGCCAAGAAAGGGAATTGAACAAAGATTTAGTAGATAAGCAGTTTGGAAAGGAGATTGGGTGGATAGATCTTGGAGACGAGATTGATTCTATTTGTAGAGAACTA
GAGAGATTGTTGATCAAAGAGCTTGCTGCAGAGTTCGGAGCATCAGATGCTTTTGAACGGAATGATATATAG
Protein sequenceShow/hide protein sequence
MEPKQCTASVLEALMGFDEQQSQPHAPRHSRVLSEGYFQRVASIGILKKTSPSKCHPFRMTIEEPKELFGTLKVGDSFKSDRHCNELGDREKADSMSLTRHNFMNEKHFS
ADRMIQTSKDFLDLPEVIDSMDISRRPTRDKGVRNGPSVSKSHYALAQGNNDAGTKFKDRKQGQACSSEDLDLLKSSRPFLEWRDKLHFSSSSPTSLEGSHLVSEKYKDF
HSSQNGKHIAEEKERKTMVYVVEPIKQPSRVSKILDGSRRKTRHDSVNLYMKASRSESKYGDVHREETEYKRSSSSGLSNWKAEYKHSCFFSVESHKAGESREIVMEEQR
KTEKLMLSRQGRIMNEMPILPHYATLPSDLNRKPGKYGFQKHVCSNKEHLHSGSPLCLSCKDERLDQLSKNSHRLRFVLILLSNEKDASEPTSKLSSKKLRIFPCPDSAS
DHIINDGCMIVSDLKPRVEKKGLCYQHSLNSLSSSCNVVFYTQNNPVKRENQATECYLNSDYQDDNFSGTASNVLAVKTDDAKVLTVDKQEPDSLSCILLETDGDSSTNS
CRATCNSIQQEVSTRFDKEGPAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEETLSGSESSGINSRDLMMQLELLMSDSPGTNSEGHEMFVSSDDDGGEGSICSSEE
IDDIMNTFKFKDSRDFSYLVDVLSEAGLHCKNLDKGCALCMSFVGVPEWAKPVSRRFRPLLDREMIEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGDEIDSICREL
ERLLIKELAAEFGASDAFERNDI