; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0005503 (gene) of Chayote v1 genome

Gene IDSed0005503
OrganismSechium edule (Chayote v1)
DescriptionDUF4378 domain-containing protein
Genome locationLG01:23093934..23099411
RNA-Seq ExpressionSed0005503
SyntenySed0005503
Gene Ontology termsNA
InterPro domainsIPR022212 - Domain of unknown function DUF3741
IPR025486 - Domain of unknown function DUF4378


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05761.1 uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa]4.3e-30168.62Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP
        MEPR+ TASVLE LMGFDE QSQH  PRH +V S+ YLQR ASIGISKKK PSRCHPFRMTIEEP+E F+SL+VE+   RC KL +R++ADS LSAA +P
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP

Query:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD
        LTR H M  KHFST KVIQTSK FQDLPEV+D  DI PRP+R  N I +H ENG SVSK+ Y LTEG+ND+GTKFK  +QGQAHLS DL L KSSRP L+
Subjt:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD

Query:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERT
          +KL  SSS PTSLKGSH V DKC+  H+  NGK+I +EK R+   +L PIKQ S+VS+ILDGSRR    +++NLPLKTS+SE+IYD++ R E      
Subjt:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERT

Query:  PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKR
           LSNW  E KHSC  S  S KARE  EKVIEEQRK E+LM S + RKMNEMP +P YATL SD NCKPVK+DFQ H CS+ EHLHSGSPLCLS K KR
Subjt:  PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKR

Query:  LDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCMVGDDLKTR
        LD+  K  HRL+F+  +TVT RS TRSRYEALRNTWFLK EG GTWLQCKPLNRSSNKK  ++PTLKL+S   KIFPCPDS S HV +D CMVG DLKT 
Subjt:  LDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCMVGDDLKTR

Query:  VEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSG
        VEKK  CDQH  N   P   RS K  +   N     G+ +T        SIQQEG+AFEH PSKE DSIV+LEE FQPSPVSVLEPLF+EETLFS ESSG
Subjt:  VEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSG

Query:  INGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFET
        IN RDL+MQLELLM DS G NSE HDLFVSS DDGGEGS C+ D IDD++ TFKF DSR FSYLVDVLSEA L C+NLE G  SW  QE HVI P++FE 
Subjt:  INGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFET

Query:  LEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICRE
        LEKKFGEQ SWRRSERKLLFDRINSGL ELFQ FVGVPEWAKPVS+RFRP  + E +EEELW LLDSQEREVNK+L+DKQFGKEI W++LGDEIDSIC+E
Subjt:  LEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICRE

Query:  VERLLINELVAEFNSKEFF
        +ERLL+NELVAEF S E F
Subjt:  VERLLINELVAEFNSKEFF

XP_008463525.1 PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo]4.3e-30168.62Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP
        MEPR+ TASVLE LMGFDE QSQH  PRH +V S+ YLQR ASIGISKKK PSRCHPFRMTIEEP+E F+SL+VE+   RC KL +R++ADS LSAA +P
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP

Query:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD
        LTR H M  KHFST KVIQTSK FQDLPEV+D  DI PRP+R  N I +H ENG SVSK+ Y LTEG+ND+GTKFK  +QGQAHLS DL L KSSRP L+
Subjt:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD

Query:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERT
          +KL  SSS PTSLKGSH V DKC+  H+  NGK+I +EK R+   +L PIKQ S+VS+ILDGSRR    +++NLPLKTS+SE+IYD++ R E      
Subjt:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERT

Query:  PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKR
           LSNW  E KHSC  S  S KARE  EKVIEEQRK E+LM S + RKMNEMP +P YATL SD NCKPVK+DFQ H CS+ EHLHSGSPLCLS K KR
Subjt:  PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKR

Query:  LDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCMVGDDLKTR
        LD+  K  HRL+F+  +TVT RS TRSRYEALRNTWFLK EG GTWLQCKPLNRSSNKK  ++PTLKL+S   KIFPCPDS S HV +D CMVG DLKT 
Subjt:  LDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCMVGDDLKTR

Query:  VEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSG
        VEKK  CDQH  N   P   RS K  +   N     G+ +T        SIQQEG+AFEH PSKE DSIV+LEE FQPSPVSVLEPLF+EETLFS ESSG
Subjt:  VEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSG

Query:  INGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFET
        IN RDL+MQLELLM DS G NSE HDLFVSS DDGGEGS C+ D IDD++ TFKF DSR FSYLVDVLSEA L C+NLE G  SW  QE HVI P++FE 
Subjt:  INGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFET

Query:  LEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICRE
        LEKKFGEQ SWRRSERKLLFDRINSGL ELFQ FVGVPEWAKPVS+RFRP  + E +EEELW LLDSQEREVNK+L+DKQFGKEI W++LGDEIDSIC+E
Subjt:  LEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICRE

Query:  VERLLINELVAEFNSKEFF
        +ERLL+NELVAEF S E F
Subjt:  VERLLINELVAEFNSKEFF

XP_022133834.1 uncharacterized protein LOC111006294 [Momordica charantia]1.7e-30565.68Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKL---IRCDKLSDRKKADSALSAA
        M  +QCTASVLEALMGF+EQQS HH  RH RVLSEGYLQR ASIG+ KKK PS+CHPFR T+EEP E F++L+V D     I C++L  R+K  SALS+A
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKL---IRCDKLSDRKKADSALSAA

Query:  YMPLTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRP
         MPLTRH+FM  +HF TDK+IQTS D Q+LPEV D  DI PRPTRE  YI NHVENGLS+SKS +TLT G ND+GTKF   KQGQA    D DL KSS P
Subjt:  YMPLTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRP

Query:  LLDRRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGR-AMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETV
        LL+ +DKLC SSSS TSLKGSH V++KC+ FH   NGKH+A+EK R  M   + PIKQPS+VS ILD S RK+R D+VNL +K S+SESIYDDVHRKET 
Subjt:  LLDRRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGR-AMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETV

Query:  YERT-PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLS
        +  T   GLSN K EYKHSC  S  S KAR  RE  IEEQ++ + L+LS Q     EMPIL  +ATL +D NCKPVK+DFQ HVCSNKEHLHSGSPLCLS
Subjt:  YERT-PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLS

Query:  CKDKRLDQFSKNPHRLQFNRASTVTA-RSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNSK---IFPCPDSESDHVGSDDCMVG
        CKD+RLDQ SKN HRL+F  A+TVT  RS TRSRYE+LRNTWFLK EG  TWLQCKP ++SS+ K  S+PTLKL SK   IFPCP+S S H+  D C+V 
Subjt:  CKDKRLDQFSKNPHRLQFNRASTVTA-RSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNSK---IFPCPDSESDHVGSDDCMVG

Query:  DDLKTRVEKKGLCDQHPV--------------NHENPAIERSLKSDYQGDNF--------------------------------SETDGDSSTNSCRATR
          L+TRVEKK LC+Q  +              N+ N AIE SLKSDY  DNF                                SETDGDSSTNS R T 
Subjt:  DDLKTRVEKKGLCDQHPV--------------NHENPAIERSLKSDYQGDNF--------------------------------SETDGDSSTNSCRATR

Query:  ASIQQ--------EGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSGINGRDLMMQLELLMSDSQGANSE-HDLFVSSDD--GGEGS
         SIQQ        EG  FEH P KELDSIV+LEEA+QPSPVSVLEPLF+EET+ S ESSGIN RDLMMQLELLMSDS G+NSE H++FVSSDD  GGEGS
Subjt:  ASIQQ--------EGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSGINGRDLMMQLELLMSDSQGANSE-HDLFVSSDD--GGEGS

Query:  RCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPE
        +CS + IDD++ TFKF DSRDFSYL+DVLSEAGL+C NL+KGC SW GQE HVI PS+FETLEKKFGEQ SWRRSERKLLFDRINSGL+ELFQ  VGVPE
Subjt:  RCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPE

Query:  WAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICREVERLLINELVAEF
        WAKPVS+RFRP  D+E VEEELW LLDSQERE+NKDLVDKQFGKEIGW++LG+EI+SICRE+ERLLI EL+AEF
Subjt:  WAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICREVERLLINELVAEF

XP_038889736.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida]0.0e+0070.13Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP
        ME RQCT SVLEALMGFDE+Q QHHAPRH  VLS+ YLQRVASIGISKKK PSRCHPFRMT+EEP+E F+S +VE+   RC++L + +KADS+LSA  MP
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP

Query:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD
        LTRH  M  KHFST KVIQTSKDFQ+LPEV+D  DI PRPTR  N I N  +NG SVSK  Y+ TE +ND+GTK K  K GQ H S DLD  KSSRPLL+
Subjt:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD

Query:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHS-------LLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRK
         RDKLC SSSSPTSL+GSH VNDKC++  S         NGK+IA+E  R MEYAL PIKQ S+VS+ILD SRR +R  +VNL LK S+  +IYDDV R 
Subjt:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHS-------LLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRK

Query:  ETVYER-TPQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPL
        ET Y R +   LSNW  +YKHSC+ S  S KARE REKV EEQRK ENL+ STQ R+MNEMP LP +A+L SD NCKPVK DFQ HVCSNKEH HSGSPL
Subjt:  ETVYER-TPQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPL

Query:  CLSCKDKRLDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCM
        CLS K KRLDQ  KN HRL+F+  S VT RS TRSRYEALRNTWFLK EG G WLQCKP NRSSNKK  SEP+LKL+S   KIFPCPDS SDHV +DDCM
Subjt:  CLSCKDKRLDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCM

Query:  VGDDLKTRVEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREET
        VGDDLKT+VEKK  CDQH +N  +P   RS K  +   N     G+ +T        SIQQEG+ FEH PSKE DSIV+LEEAFQPSPVSVLEPLF++ET
Subjt:  VGDDLKTRVEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREET

Query:  LFSCESSGINGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHV
        LFS ES GINGRDLMMQLELLMSDS G NSE HDLFVSS DDGGEGS CS + IDD++ TFKF DSRDFSYLVDVLSEA LHC++LE G  S   QE  V
Subjt:  LFSCESSGINGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHV

Query:  IRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGD
        I P++FETLEKKFGEQ+SWRRSERKLLFDRINSGLVELFQ F GVPEWAKPVS+RFRP  + E +EEELW LLDSQEREVNKDLVDKQFGKEIGW++LGD
Subjt:  IRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGD

Query:  EIDSICREVERLLINELVAEFNSKEFF
        EIDSICRE+ERLL+NELVAEF S E F
Subjt:  EIDSICREVERLLINELVAEFNSKEFF

XP_038889740.1 uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida]0.0e+0069.89Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP
        ME RQCT SVLEALMGFDE+Q QHHAPRH  VLS+ YLQRVASIGISKKK PSRCHPFRMT+EEP+E F+S +VE+   RC++L + +KADS+LSA  MP
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP

Query:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD
        LTRH  M  KHFST KVIQTSKDFQ+LPEV+D  DI PRPTR  N I N  +NG SVSK  Y+ TE +ND+GTK K  K GQ H S DLD  KSSRPLL+
Subjt:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD

Query:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHS-------LLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRK
         RDKLC SSSSPTSL+GSH VNDKC++  S         NGK+IA+E  R MEYAL PIKQ S+VS+ILD SRR +R  +VNL LK S+  +IYDDV R 
Subjt:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHS-------LLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRK

Query:  ETVYER-TPQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPL
        ET Y R +   LSNW  +YKHSC+ S  S KARE REKV EEQRK ENL+ STQ R+MNEMP LP +A+L SD NCKPVK DFQ HVCSNKEH HSGSPL
Subjt:  ETVYER-TPQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPL

Query:  CLSCKDKRLDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCM
        CLS K KRLDQ  KN HRL+F+  S VT RS TRSRYEALRNTWFLK EG G WLQCKP NRSSNKK  SEP+LKL+S   KIFPCPDS SDHV +DDCM
Subjt:  CLSCKDKRLDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCM

Query:  VGDDLKTRVEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREET
        VGDDLKT+VEKK  CDQH +N  +P   RS K  +   N     G+ +T        SIQQEG+ FEH PSKE DSIV+LEEAFQPSPVSVLEPLF++ET
Subjt:  VGDDLKTRVEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREET

Query:  LFSCESSGINGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHV
        LFS ES GIN  DLMMQLELLMSDS G NSE HDLFVSS DDGGEGS CS + IDD++ TFKF DSRDFSYLVDVLSEA LHC++LE G  S   QE  V
Subjt:  LFSCESSGINGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHV

Query:  IRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGD
        I P++FETLEKKFGEQ+SWRRSERKLLFDRINSGLVELFQ F GVPEWAKPVS+RFRP  + E +EEELW LLDSQEREVNKDLVDKQFGKEIGW++LGD
Subjt:  IRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGD

Query:  EIDSICREVERLLINELVAEFNSKEFF
        EIDSICRE+ERLL+NELVAEF S E F
Subjt:  EIDSICREVERLLINELVAEFNSKEFF

TrEMBL top hitse value%identityAlignment
A0A0A0KNN6 DUF4378 domain-containing protein1.4e-29768.67Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP
        MEPRQ TASVLEALMGFDE QSQH A RH +V S+ YLQRVASIGISKKK PSRCHPFRMTIEEP+E F+SL+VE+   RC KL +R++ADS LSAAY P
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP

Query:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD
        LTRH     KHFST KVIQTSK FQDLPEV+D  DI PRPTR  N + +  ++GLSVS + Y LTEG+ND+GTKFK  KQGQAHLS DL L KSSRP L+
Subjt:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD

Query:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERT
          +KL  SSS P SLKGSH V DKC+  H+  NGK+IA+EK R    +L PIKQ S+VS+ILDGSRR  R ++ NL LKTS+SE+IYD+V R +      
Subjt:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERT

Query:  PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKR
           LSNW  E KHSC  S  S KARE  EKVIEEQRK  NLM STQ RKMNEMP +P+YATL SD NCKPV++DFQ HVCS+KEHLHSGSPLCLS K KR
Subjt:  PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKR

Query:  LDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCMVGDDLKTR
        LD+  K  HRL+F+  STVT RS TRSRYEAL NTWFLK EG GTWLQC PLNRSSNKK  ++PTLKL+S   KIFPCPDS S H  +D CMVG D KT 
Subjt:  LDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCMVGDDLKTR

Query:  VEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSG
        V+KK  CDQH +N   P   RS K  +   N     G+ +T        SIQQEG+AF+H PSKE DSIV+LEEAFQPSPVSVLEPLF+EETLFS ES G
Subjt:  VEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSG

Query:  INGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFET
        IN RDL+MQLELLMSDS G NSE HDLFVSS DD GEGS C+ D IDD++ TFKF DSR FSYLVDVLSEA LHC+NLE G  SW  QE+HVI P++FE 
Subjt:  INGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFET

Query:  LEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICRE
        LEKKFGEQ SWRRSERKLLFDRINSGL ELFQ FVGVPEWAKPVS+RFRP  + E +EEELW LLDSQEREVNK+LVDKQFGKEI W++LGDEI+SICRE
Subjt:  LEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICRE

Query:  VERLLINELVAEFNSKE
        +E LL+NELVAEF S E
Subjt:  VERLLINELVAEFNSKE

A0A1S4E497 uncharacterized protein LOC1035016592.1e-30168.62Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP
        MEPR+ TASVLE LMGFDE QSQH  PRH +V S+ YLQR ASIGISKKK PSRCHPFRMTIEEP+E F+SL+VE+   RC KL +R++ADS LSAA +P
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP

Query:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD
        LTR H M  KHFST KVIQTSK FQDLPEV+D  DI PRP+R  N I +H ENG SVSK+ Y LTEG+ND+GTKFK  +QGQAHLS DL L KSSRP L+
Subjt:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD

Query:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERT
          +KL  SSS PTSLKGSH V DKC+  H+  NGK+I +EK R+   +L PIKQ S+VS+ILDGSRR    +++NLPLKTS+SE+IYD++ R E      
Subjt:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERT

Query:  PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKR
           LSNW  E KHSC  S  S KARE  EKVIEEQRK E+LM S + RKMNEMP +P YATL SD NCKPVK+DFQ H CS+ EHLHSGSPLCLS K KR
Subjt:  PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKR

Query:  LDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCMVGDDLKTR
        LD+  K  HRL+F+  +TVT RS TRSRYEALRNTWFLK EG GTWLQCKPLNRSSNKK  ++PTLKL+S   KIFPCPDS S HV +D CMVG DLKT 
Subjt:  LDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCMVGDDLKTR

Query:  VEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSG
        VEKK  CDQH  N   P   RS K  +   N     G+ +T        SIQQEG+AFEH PSKE DSIV+LEE FQPSPVSVLEPLF+EETLFS ESSG
Subjt:  VEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSG

Query:  INGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFET
        IN RDL+MQLELLM DS G NSE HDLFVSS DDGGEGS C+ D IDD++ TFKF DSR FSYLVDVLSEA L C+NLE G  SW  QE HVI P++FE 
Subjt:  INGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFET

Query:  LEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICRE
        LEKKFGEQ SWRRSERKLLFDRINSGL ELFQ FVGVPEWAKPVS+RFRP  + E +EEELW LLDSQEREVNK+L+DKQFGKEI W++LGDEIDSIC+E
Subjt:  LEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICRE

Query:  VERLLINELVAEFNSKEFF
        +ERLL+NELVAEF S E F
Subjt:  VERLLINELVAEFNSKEFF

A0A5D3C1E7 DUF4378 domain-containing protein2.1e-30168.62Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP
        MEPR+ TASVLE LMGFDE QSQH  PRH +V S+ YLQR ASIGISKKK PSRCHPFRMTIEEP+E F+SL+VE+   RC KL +R++ADS LSAA +P
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMP

Query:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD
        LTR H M  KHFST KVIQTSK FQDLPEV+D  DI PRP+R  N I +H ENG SVSK+ Y LTEG+ND+GTKFK  +QGQAHLS DL L KSSRP L+
Subjt:  LTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLD

Query:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERT
          +KL  SSS PTSLKGSH V DKC+  H+  NGK+I +EK R+   +L PIKQ S+VS+ILDGSRR    +++NLPLKTS+SE+IYD++ R E      
Subjt:  RRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERT

Query:  PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKR
           LSNW  E KHSC  S  S KARE  EKVIEEQRK E+LM S + RKMNEMP +P YATL SD NCKPVK+DFQ H CS+ EHLHSGSPLCLS K KR
Subjt:  PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKR

Query:  LDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCMVGDDLKTR
        LD+  K  HRL+F+  +TVT RS TRSRYEALRNTWFLK EG GTWLQCKPLNRSSNKK  ++PTLKL+S   KIFPCPDS S HV +D CMVG DLKT 
Subjt:  LDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNS---KIFPCPDSESDHVGSDDCMVGDDLKTR

Query:  VEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSG
        VEKK  CDQH  N   P   RS K  +   N     G+ +T        SIQQEG+AFEH PSKE DSIV+LEE FQPSPVSVLEPLF+EETLFS ESSG
Subjt:  VEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSG

Query:  INGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFET
        IN RDL+MQLELLM DS G NSE HDLFVSS DDGGEGS C+ D IDD++ TFKF DSR FSYLVDVLSEA L C+NLE G  SW  QE HVI P++FE 
Subjt:  INGRDLMMQLELLMSDSQGANSE-HDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFET

Query:  LEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICRE
        LEKKFGEQ SWRRSERKLLFDRINSGL ELFQ FVGVPEWAKPVS+RFRP  + E +EEELW LLDSQEREVNK+L+DKQFGKEI W++LGDEIDSIC+E
Subjt:  LEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICRE

Query:  VERLLINELVAEFNSKEFF
        +ERLL+NELVAEF S E F
Subjt:  VERLLINELVAEFNSKEFF

A0A6J1BX36 uncharacterized protein LOC1110062948.1e-30665.68Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKL---IRCDKLSDRKKADSALSAA
        M  +QCTASVLEALMGF+EQQS HH  RH RVLSEGYLQR ASIG+ KKK PS+CHPFR T+EEP E F++L+V D     I C++L  R+K  SALS+A
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKL---IRCDKLSDRKKADSALSAA

Query:  YMPLTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRP
         MPLTRH+FM  +HF TDK+IQTS D Q+LPEV D  DI PRPTRE  YI NHVENGLS+SKS +TLT G ND+GTKF   KQGQA    D DL KSS P
Subjt:  YMPLTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRP

Query:  LLDRRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGR-AMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETV
        LL+ +DKLC SSSS TSLKGSH V++KC+ FH   NGKH+A+EK R  M   + PIKQPS+VS ILD S RK+R D+VNL +K S+SESIYDDVHRKET 
Subjt:  LLDRRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGR-AMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETV

Query:  YERT-PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLS
        +  T   GLSN K EYKHSC  S  S KAR  RE  IEEQ++ + L+LS Q     EMPIL  +ATL +D NCKPVK+DFQ HVCSNKEHLHSGSPLCLS
Subjt:  YERT-PQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLS

Query:  CKDKRLDQFSKNPHRLQFNRASTVTA-RSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNSK---IFPCPDSESDHVGSDDCMVG
        CKD+RLDQ SKN HRL+F  A+TVT  RS TRSRYE+LRNTWFLK EG  TWLQCKP ++SS+ K  S+PTLKL SK   IFPCP+S S H+  D C+V 
Subjt:  CKDKRLDQFSKNPHRLQFNRASTVTA-RSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNSK---IFPCPDSESDHVGSDDCMVG

Query:  DDLKTRVEKKGLCDQHPV--------------NHENPAIERSLKSDYQGDNF--------------------------------SETDGDSSTNSCRATR
          L+TRVEKK LC+Q  +              N+ N AIE SLKSDY  DNF                                SETDGDSSTNS R T 
Subjt:  DDLKTRVEKKGLCDQHPV--------------NHENPAIERSLKSDYQGDNF--------------------------------SETDGDSSTNSCRATR

Query:  ASIQQ--------EGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSGINGRDLMMQLELLMSDSQGANSE-HDLFVSSDD--GGEGS
         SIQQ        EG  FEH P KELDSIV+LEEA+QPSPVSVLEPLF+EET+ S ESSGIN RDLMMQLELLMSDS G+NSE H++FVSSDD  GGEGS
Subjt:  ASIQQ--------EGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSGINGRDLMMQLELLMSDSQGANSE-HDLFVSSDD--GGEGS

Query:  RCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPE
        +CS + IDD++ TFKF DSRDFSYL+DVLSEAGL+C NL+KGC SW GQE HVI PS+FETLEKKFGEQ SWRRSERKLLFDRINSGL+ELFQ  VGVPE
Subjt:  RCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPE

Query:  WAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICREVERLLINELVAEF
        WAKPVS+RFRP  D+E VEEELW LLDSQERE+NKDLVDKQFGKEIGW++LG+EI+SICRE+ERLLI EL+AEF
Subjt:  WAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICREVERLLINELVAEF

A0A6J1HFD2 uncharacterized protein LOC1114637981.6e-21654.9Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASI-GISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYM
        MEP QC+ASVLEALMGFDE QS+H A  H RVLSE YLQRVASI G  KKK+PS+C PFRMTIEEP E FS   V         L DR+           
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASI-GISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYM

Query:  PLTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLL
          +  +FMN KHFSTD++I TSKDF DLPEVID  DI PR TR  +   NHVENG +VSK                                        
Subjt:  PLTRHHFMNGKHFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLL

Query:  DRRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYER
                                                                                               +++ HR       
Subjt:  DRRDKLCSSSSSPTSLKGSHFVNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYER

Query:  TPQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDK
                KDE K SC++   S K  E REKVIEEQRK  NLMLS Q R MNEM ILP YAT  SD NCKPV +DF N VC NK+HLHSGSPLCLSCKD+
Subjt:  TPQGLSNWKDEYKHSCYLSAGSCKAREPREKVIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDK

Query:  RLDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNSK---IFPCPDSESDHVGSDDCMVGDDLKT
        R D+  K  HR   + A TV ARS  RSRYEALRNTWFLKPEG GTWLQ KPLN  SNKK+ SEP+ KL+SK   IFPCPDS SDHV +D C+VG+DLKT
Subjt:  RLDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKPEGHGTWLQCKPLNRSSNKKHTSEPTLKLNSK---IFPCPDSESDHVGSDDCMVGDDLKT

Query:  RVEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESS
        RVEK GLCDQH VN  +    R+L+         ETDG SSTNSCRAT  SIQQ+G++FE   SKELDSIV LEE +QPSPVSVLE  F+EE   S ESS
Subjt:  RVEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESS

Query:  GINGRDLMMQLELLMSDSQGANS-EHDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFE
        G+NGR    +L+LLM D+ G NS EH+LFVSS +D GEGS C+ D I D++ TFKF DSRDFSYLVDV+SEAGLH RNLEKG   W  QERHVI PS+FE
Subjt:  GINGRDLMMQLELLMSDSQGANS-EHDLFVSS-DDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFE

Query:  TLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICR
         LEKKFGEQ SWRRSERKLLFDRINSGL ELF+ FVGVPEWAKPVS+R RP  DQE VE+E+W LLDSQE+E NKDLVDKQFGKEIGW++LGDEI SICR
Subjt:  TLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICR

Query:  EVERLLINELVAEFNS
        E+E LLI ELVAE  S
Subjt:  EVERLLINELVAEFNS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39435.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related1.6e-3233.23Show/hide
Query:  GDDLKTRVEK----KGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFR
        G DL  + EK    K LCDQ         ++ SL      D  +        +S       I  E V    C     +   N E+A QPSPVSVLEP+F 
Subjt:  GDDLKTRVEK----KGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFR

Query:  EETLFSCESSGINGRDLMM--------QLELLMSDSQGANSEHDLFVSSD-DGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGC
        E+ L   E    +  DL          QLE L S+S+  +    + VSSD +    S        + +      +SRD SY+ D+L+E  L     +K C
Subjt:  EETLFSCESSGINGRDLMM--------QLELLMSDSQGANSEHDLFVSSD-DGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGC

Query:  ASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFG
            G+   VI P +FE LEKK+  + SW+RS+RK+LFDR+NS LVE+ + F   P W KPVS+R   +     +++ELW +L  QE+   K  + K   
Subjt:  ASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFG

Query:  KEIG-WMELGDEIDSICREVERLLINELVAE
         +I  W+EL  + +S+  E+E ++++EL++E
Subjt:  KEIG-WMELGDEIDSICREVERLLINELVAE

AT2G39435.2 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related2.6e-3033.13Show/hide
Query:  GDDLKTRVEK----KGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFR
        G DL  + EK    K LCDQ         ++ SL      D  +        +S       I  E V    C     +   N E+A QPSPVSVLEP+F 
Subjt:  GDDLKTRVEK----KGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQEGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFR

Query:  EETLFSCESSGINGRDLMM--------QLELLMSDSQGANSEHDLFVSSD-DGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGC
        E+ L   E    +  DL          QLE L S+S+  +    + VSSD +    S        + +      +SRD SY+ D+L+E  L     +K C
Subjt:  EETLFSCESSGINGRDLMM--------QLELLMSDSQGANSEHDLFVSSD-DGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEAGLHCRNLEKGC

Query:  ASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFG
            G+   VI P +FE LEKK+  + SW+RS+RK+LFDR+NS LVE+ + F   P W KPVS+R   +     +++ELW +L  QE+   K  + K   
Subjt:  ASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKDLVDKQFG

Query:  KEIG-WMELGDEIDSICREVERL
         +I  W+EL  + +S+  E+E++
Subjt:  KEIG-WMELGDEIDSICREVERL

AT3G53540.1 unknown protein2.7e-1931.54Show/hide
Query:  EEAFQPSPVSVLEPLFREETLFSC---ESSGINGRDLMMQLELLMSDSQGANSEHDLFVSSDDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEA
        +E  QPSPVSVLE  F ++        ES   + R L MQL+LL  +S     E  + VSSD+  +    S    + ++      +    SYLVD+L+ +
Subjt:  EEAFQPSPVSVLEPLFREETLFSC---ESSGINGRDLMMQLELLMSDSQGANSEHDLFVSSDDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLVDVLSEA

Query:  GLHCRNLEKGCASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQERE
             +     A+        + PSLFE LEKK+    +  R ERKLLFD+I+  ++ + +       W K  S +  P  D   ++E L +L+  ++ +
Subjt:  GLHCRNLEKGCASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQERE

Query:  VNKDLVDKQFGKEIGWMELGDEIDSICREVERLLINELVAE
         +K  V++   KE+ W+ L D+I+ I RE+E +L +EL+ E
Subjt:  VNKDLVDKQFGKEIGWMELGDEIDSICREVERLLINELVAE

AT4G00440.1 Protein of unknown function (DUF3741)3.2e-0432.04Show/hide
Query:  ERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFR-----PSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICREVERLLINEL
        + +LLFD IN  L+EL       P WA  V+ R R      S   E  E   W+LL          +V K   +   W+++  +ID I  E   L++NEL
Subjt:  ERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFR-----PSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICREVERLLINEL

Query:  VAE
        + E
Subjt:  VAE

AT4G00440.2 Protein of unknown function (DUF3741)3.2e-0432.04Show/hide
Query:  ERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFR-----PSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICREVERLLINEL
        + +LLFD IN  L+EL       P WA  V+ R R      S   E  E   W+LL          +V K   +   W+++  +ID I  E   L++NEL
Subjt:  ERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFR-----PSRDQETVEEELWNLLDSQEREVNKDLVDKQFGKEIGWMELGDEIDSICREVERLLINEL

Query:  VAE
        + E
Subjt:  VAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCTCGACAGTGTACAGCTAGTGTTCTTGAAGCATTGATGGGTTTTGATGAGCAGCAATCTCAGCACCATGCTCCAAGGCATTATAGAGTTCTTTCTGAGGGTTA
TTTGCAAAGGGTTGCTTCCATTGGAATCTCAAAGAAGAAAACCCCCTCTAGATGTCATCCATTTAGGATGACAATTGAAGAGCCATCAGAATTCTTTAGTTCTCTCGAAG
TTGAAGATAAACTTATTCGTTGCGACAAGTTATCGGATCGGAAGAAGGCTGATTCTGCTTTATCAGCAGCATATATGCCACTTACTAGACACCATTTCATGAATGGAAAA
CACTTTTCAACAGATAAGGTGATACAGACTTCAAAGGATTTTCAAGACTTGCCAGAGGTTATAGATTTTAAGGACATCTTTCCAAGACCTACAAGAGAAATGAATTACAT
ACCCAACCATGTCGAAAATGGATTGAGTGTGTCAAAGTCACGTTATACTTTGACAGAAGGAAGCAATGATTCCGGGACTAAATTTAAGAAAAGTAAACAAGGACAGGCAC
ACTTGTCTGTGGATCTAGATCTTTCGAAGTCTTCGAGACCTTTGTTGGATAGGAGAGATAAATTGTGCTCTTCTTCTTCTTCACCAACTTCTTTGAAAGGCTCCCATTTC
GTTAATGATAAGTGCAGAAATTTTCATAGTTTGCTAAATGGGAAGCATATTGCTGAAGAAAAAGGAAGGGCTATGGAGTATGCACTGCACCCGATCAAGCAACCATCTGA
AGTTTCAAATATTCTGGATGGAAGCAGGAGAAAATCAAGGCCCGATTATGTCAATTTGCCTTTGAAGACATCAAAATCAGAATCCATATATGATGATGTGCATAGAAAAG
AAACCGTGTACGAAAGAACTCCTCAAGGTTTATCTAATTGGAAGGATGAGTACAAGCATTCCTGCTACTTATCAGCTGGTTCATGCAAGGCCAGAGAACCCAGGGAAAAA
GTCATAGAAGAACAAAGGAAGATTGAAAACTTGATGCTATCTACTCAAAGTAGGAAAATGAATGAAATGCCTATACTACCTCAATATGCAACTTTGGCCAGTGATTTTAA
TTGTAAGCCTGTTAAGCATGATTTCCAGAACCATGTTTGTTCGAACAAGGAGCATTTGCATTCTGGAAGTCCCTTGTGTTTGAGCTGCAAGGATAAAAGGCTAGATCAAT
TCAGTAAAAACCCCCATAGATTGCAATTTAATCGTGCTTCTACGGTGACTGCAAGATCTATAACCAGGAGCAGATACGAGGCCCTCAGAAATACTTGGTTCTTAAAGCCT
GAAGGTCATGGTACTTGGCTACAATGCAAGCCACTGAACAGAAGTTCCAATAAAAAGCATACTTCAGAACCTACCTTGAAATTAAACTCCAAGATTTTCCCTTGCCCCGA
TTCAGAAAGCGATCATGTTGGCAGTGATGACTGCATGGTTGGGGATGATCTGAAGACCAGAGTTGAGAAGAAAGGCCTTTGTGATCAGCATCCCGTAAATCATGAAAATC
CGGCCATAGAGCGTTCATTGAAGAGTGATTATCAAGGTGATAATTTTTCAGAGACTGACGGTGATTCATCTACCAACTCTTGTCGAGCCACACGAGCTTCTATTCAACAG
GAAGGGGTTGCCTTTGAACACTGCCCTAGCAAAGAGCTAGATTCTATTGTGAATTTGGAAGAGGCTTTTCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAGAGA
AGAAACATTATTCAGTTGTGAATCCTCAGGCATTAATGGTAGAGATTTGATGATGCAACTTGAACTTCTTATGTCGGACTCCCAGGGAGCAAACTCAGAACATGATTTGT
TCGTATCAAGCGATGATGGTGGTGAAGGATCTAGATGCAGTTACGATGGAATTGATGATTTACTGAGGACTTTCAAGTTCAATGATAGTAGGGATTTTTCATACCTTGTT
GATGTCTTAAGCGAGGCAGGCTTGCATTGTAGAAACCTGGAGAAGGGTTGTGCTTCGTGGCAAGGTCAGGAACGTCACGTGATTCGCCCCTCGTTGTTCGAGACCTTAGA
GAAGAAGTTTGGGGAACAAGATTCTTGGAGGAGATCAGAAAGAAAGCTTCTCTTTGACAGAATAAATTCTGGGTTGGTAGAACTCTTTCAGCCATTCGTTGGCGTGCCAG
AATGGGCAAAGCCGGTATCGAAAAGATTTAGGCCTTCTCGTGACCAGGAAACGGTCGAGGAAGAACTATGGAACCTGCTGGATAGCCAAGAAAGGGAAGTGAACAAAGAT
TTGGTAGATAAGCAGTTTGGAAAGGAGATTGGTTGGATGGAACTTGGAGATGAGATTGATTCTATTTGTAGAGAAGTAGAGAGATTGTTGATCAATGAGCTTGTTGCTGA
GTTTAATAGCAAGGAATTTTTTTGA
mRNA sequenceShow/hide mRNA sequence
CTTGCGTAAGTTGGCGCGTAATTTTCTAGATACGACAATGGGAAAAGGGTTAGAAAAATACCCAAATCAAATGCATCAAATCTCCCATTTCATTTCACTGTTATTTTCTC
AGACCCATTTCCCTTCTCCAATGCGCCTCTTTCATTTTCTCTACTTTCAGTATCGTCGTTTAGCAACGATTAAGATCCATTACCCATCTTCCAAATTTTCCATTCTTGAC
CTTTATTTGTGTTTTTTTGCAGTATATATAACCAAACGTTTTCAAATCTTGTGTTTGACCTGAATCCCACTTGGTCGGTGTGTTCGTCAGAAAATTTGGGAAATGGGTTT
CGATTGGATTTTAGAATGATCCGGAGGGTTTTTGTAGTTGTTTTTGTTACGAATCATGGAGGTTCTGTTTTGGGGATTTACGGGAAAACCTTAGATCGGCGTTTTGCTGG
GGAAGGTTTTGAAAGAAGGAAGCCGAGCGATTGAGGAACAGAGATATCACGGGAAGACAAAGTAATAATTTTATTTTCAATATACCATAAAGTTTTGTCGGTATAGTTTC
TCCGATATGGAACCTCGACAGTGTACAGCTAGTGTTCTTGAAGCATTGATGGGTTTTGATGAGCAGCAATCTCAGCACCATGCTCCAAGGCATTATAGAGTTCTTTCTGA
GGGTTATTTGCAAAGGGTTGCTTCCATTGGAATCTCAAAGAAGAAAACCCCCTCTAGATGTCATCCATTTAGGATGACAATTGAAGAGCCATCAGAATTCTTTAGTTCTC
TCGAAGTTGAAGATAAACTTATTCGTTGCGACAAGTTATCGGATCGGAAGAAGGCTGATTCTGCTTTATCAGCAGCATATATGCCACTTACTAGACACCATTTCATGAAT
GGAAAACACTTTTCAACAGATAAGGTGATACAGACTTCAAAGGATTTTCAAGACTTGCCAGAGGTTATAGATTTTAAGGACATCTTTCCAAGACCTACAAGAGAAATGAA
TTACATACCCAACCATGTCGAAAATGGATTGAGTGTGTCAAAGTCACGTTATACTTTGACAGAAGGAAGCAATGATTCCGGGACTAAATTTAAGAAAAGTAAACAAGGAC
AGGCACACTTGTCTGTGGATCTAGATCTTTCGAAGTCTTCGAGACCTTTGTTGGATAGGAGAGATAAATTGTGCTCTTCTTCTTCTTCACCAACTTCTTTGAAAGGCTCC
CATTTCGTTAATGATAAGTGCAGAAATTTTCATAGTTTGCTAAATGGGAAGCATATTGCTGAAGAAAAAGGAAGGGCTATGGAGTATGCACTGCACCCGATCAAGCAACC
ATCTGAAGTTTCAAATATTCTGGATGGAAGCAGGAGAAAATCAAGGCCCGATTATGTCAATTTGCCTTTGAAGACATCAAAATCAGAATCCATATATGATGATGTGCATA
GAAAAGAAACCGTGTACGAAAGAACTCCTCAAGGTTTATCTAATTGGAAGGATGAGTACAAGCATTCCTGCTACTTATCAGCTGGTTCATGCAAGGCCAGAGAACCCAGG
GAAAAAGTCATAGAAGAACAAAGGAAGATTGAAAACTTGATGCTATCTACTCAAAGTAGGAAAATGAATGAAATGCCTATACTACCTCAATATGCAACTTTGGCCAGTGA
TTTTAATTGTAAGCCTGTTAAGCATGATTTCCAGAACCATGTTTGTTCGAACAAGGAGCATTTGCATTCTGGAAGTCCCTTGTGTTTGAGCTGCAAGGATAAAAGGCTAG
ATCAATTCAGTAAAAACCCCCATAGATTGCAATTTAATCGTGCTTCTACGGTGACTGCAAGATCTATAACCAGGAGCAGATACGAGGCCCTCAGAAATACTTGGTTCTTA
AAGCCTGAAGGTCATGGTACTTGGCTACAATGCAAGCCACTGAACAGAAGTTCCAATAAAAAGCATACTTCAGAACCTACCTTGAAATTAAACTCCAAGATTTTCCCTTG
CCCCGATTCAGAAAGCGATCATGTTGGCAGTGATGACTGCATGGTTGGGGATGATCTGAAGACCAGAGTTGAGAAGAAAGGCCTTTGTGATCAGCATCCCGTAAATCATG
AAAATCCGGCCATAGAGCGTTCATTGAAGAGTGATTATCAAGGTGATAATTTTTCAGAGACTGACGGTGATTCATCTACCAACTCTTGTCGAGCCACACGAGCTTCTATT
CAACAGGAAGGGGTTGCCTTTGAACACTGCCCTAGCAAAGAGCTAGATTCTATTGTGAATTTGGAAGAGGCTTTTCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTT
TAGAGAAGAAACATTATTCAGTTGTGAATCCTCAGGCATTAATGGTAGAGATTTGATGATGCAACTTGAACTTCTTATGTCGGACTCCCAGGGAGCAAACTCAGAACATG
ATTTGTTCGTATCAAGCGATGATGGTGGTGAAGGATCTAGATGCAGTTACGATGGAATTGATGATTTACTGAGGACTTTCAAGTTCAATGATAGTAGGGATTTTTCATAC
CTTGTTGATGTCTTAAGCGAGGCAGGCTTGCATTGTAGAAACCTGGAGAAGGGTTGTGCTTCGTGGCAAGGTCAGGAACGTCACGTGATTCGCCCCTCGTTGTTCGAGAC
CTTAGAGAAGAAGTTTGGGGAACAAGATTCTTGGAGGAGATCAGAAAGAAAGCTTCTCTTTGACAGAATAAATTCTGGGTTGGTAGAACTCTTTCAGCCATTCGTTGGCG
TGCCAGAATGGGCAAAGCCGGTATCGAAAAGATTTAGGCCTTCTCGTGACCAGGAAACGGTCGAGGAAGAACTATGGAACCTGCTGGATAGCCAAGAAAGGGAAGTGAAC
AAAGATTTGGTAGATAAGCAGTTTGGAAAGGAGATTGGTTGGATGGAACTTGGAGATGAGATTGATTCTATTTGTAGAGAAGTAGAGAGATTGTTGATCAATGAGCTTGT
TGCTGAGTTTAATAGCAAGGAATTTTTTTGAATGGTCCTGATATATAGTTGTCATGGATAAACATTAGCATACAAAAGCAAAGAGATTTGTTTCTTTTCTTTTCATTTTT
TTTAAAGAAAAAGAGAGCCATATATTTGTGTATAGGAACCAGTTTTGATCCAGTTTAAGTGTTTGAAGAGGATGTATTCTTTTATTCATGAGTGGAACCAGAGTTATTTG
TTGTCAAGTGTGACAGATTGATATTTATATCAAAGTTTTATCAGTTGAAAT
Protein sequenceShow/hide protein sequence
MEPRQCTASVLEALMGFDEQQSQHHAPRHYRVLSEGYLQRVASIGISKKKTPSRCHPFRMTIEEPSEFFSSLEVEDKLIRCDKLSDRKKADSALSAAYMPLTRHHFMNGK
HFSTDKVIQTSKDFQDLPEVIDFKDIFPRPTREMNYIPNHVENGLSVSKSRYTLTEGSNDSGTKFKKSKQGQAHLSVDLDLSKSSRPLLDRRDKLCSSSSSPTSLKGSHF
VNDKCRNFHSLLNGKHIAEEKGRAMEYALHPIKQPSEVSNILDGSRRKSRPDYVNLPLKTSKSESIYDDVHRKETVYERTPQGLSNWKDEYKHSCYLSAGSCKAREPREK
VIEEQRKIENLMLSTQSRKMNEMPILPQYATLASDFNCKPVKHDFQNHVCSNKEHLHSGSPLCLSCKDKRLDQFSKNPHRLQFNRASTVTARSITRSRYEALRNTWFLKP
EGHGTWLQCKPLNRSSNKKHTSEPTLKLNSKIFPCPDSESDHVGSDDCMVGDDLKTRVEKKGLCDQHPVNHENPAIERSLKSDYQGDNFSETDGDSSTNSCRATRASIQQ
EGVAFEHCPSKELDSIVNLEEAFQPSPVSVLEPLFREETLFSCESSGINGRDLMMQLELLMSDSQGANSEHDLFVSSDDGGEGSRCSYDGIDDLLRTFKFNDSRDFSYLV
DVLSEAGLHCRNLEKGCASWQGQERHVIRPSLFETLEKKFGEQDSWRRSERKLLFDRINSGLVELFQPFVGVPEWAKPVSKRFRPSRDQETVEEELWNLLDSQEREVNKD
LVDKQFGKEIGWMELGDEIDSICREVERLLINELVAEFNSKEFF