; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029857 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029857
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4378 domain-containing protein
Genome locationchr8:42616538..42620122
RNA-Seq ExpressionLag0029857
SyntenyLag0029857
Gene Ontology termsNA
InterPro domainsIPR022212 - Domain of unknown function DUF3741
IPR025486 - Domain of unknown function DUF4378


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05761.1 uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa]0.0e+0068.05Show/hide
Query:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGC----NREKVDSALSAAYMP
        ME R+ TA+VLE LMGFDE Q+QH  PRHS++ S+ YLQR ASIGI KKK PSRCHPFRMTIEEP+ELFNSLKVE++FS C     RE+ DS LSAA +P
Subjt:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGC----NREKVDSALSAAYMP

Query:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE
        LTRH  M  KHFST K+I+TSK FQDL EV+DSMDISPRP+R KN IF+H +NGP +SK++ +LTEGNNDAG +FKDR+ GQA  SEDL  LKSSRPF E
Subjt:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE

Query:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETEYKMT
        W +KL FSSS PTSL  SHLV DKCK  H+S+NGK+I +EKER+   +L+PIKQ SQVS ILDGSRR   ++F+NL LKTS SE+IYD++CR E      
Subjt:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETEYKMT

Query:  SSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQ------------RKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDK
            LSNW  E +HSC FSVESYKARE+ EKVIEEQ            RKMNEMP +PHYATLPSD +CKPVKYDFQKH CS+ EH HSGSPLCLS K K
Subjt:  SSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQ------------RKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDK

Query:  RLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT
        RLD++ K  HRLRFDS +TVT RSRTRSRYEALRNTWFLK EG GTWLQCKPLNRSSN             K+LKIFPCPDSAS HVD+DGCMVG DLKT
Subjt:  RLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT

Query:  --EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRT
          EKK  CD+HS N  P RSKVVFCT+N PVK+ NQ                                                               T
Subjt:  --EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRT

Query:  SIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDI
        SIQQEGLAFEHYPSKE DSIVSLEE  QPSPVSVLE  FKEETL  SESSGINSRDL+MQLELLM DS GTNSEGHDLFVSSDDDGGEGSIC+SD+IDDI
Subjt:  SIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDI

Query:  MSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFR
        MSTFKFKDSR FSYLVDVLSEA L  +NLE G VSW++QE HVIS +VFE LEKKFGEQISWRRSERKLLFDRI+SGL ELFQSF+GVPEWAKPVSRRFR
Subjt:  MSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFR

Query:  PLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF
        PL++ EMIEEELWILL+SQERE+NK+L+DKQFGKEI WIDLG+EIDSIC+ELERLL+NELVAEFGS+ELF
Subjt:  PLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF

XP_008463525.1 PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo]0.0e+0068.05Show/hide
Query:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGC----NREKVDSALSAAYMP
        ME R+ TA+VLE LMGFDE Q+QH  PRHS++ S+ YLQR ASIGI KKK PSRCHPFRMTIEEP+ELFNSLKVE++FS C     RE+ DS LSAA +P
Subjt:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGC----NREKVDSALSAAYMP

Query:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE
        LTRH  M  KHFST K+I+TSK FQDL EV+DSMDISPRP+R KN IF+H +NGP +SK++ +LTEGNNDAG +FKDR+ GQA  SEDL  LKSSRPF E
Subjt:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE

Query:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETEYKMT
        W +KL FSSS PTSL  SHLV DKCK  H+S+NGK+I +EKER+   +L+PIKQ SQVS ILDGSRR   ++F+NL LKTS SE+IYD++CR E      
Subjt:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETEYKMT

Query:  SSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQ------------RKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDK
            LSNW  E +HSC FSVESYKARE+ EKVIEEQ            RKMNEMP +PHYATLPSD +CKPVKYDFQKH CS+ EH HSGSPLCLS K K
Subjt:  SSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQ------------RKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDK

Query:  RLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT
        RLD++ K  HRLRFDS +TVT RSRTRSRYEALRNTWFLK EG GTWLQCKPLNRSSN             K+LKIFPCPDSAS HVD+DGCMVG DLKT
Subjt:  RLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT

Query:  --EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRT
          EKK  CD+HS N  P RSKVVFCT+N PVK+ NQ                                                               T
Subjt:  --EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRT

Query:  SIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDI
        SIQQEGLAFEHYPSKE DSIVSLEE  QPSPVSVLE  FKEETL  SESSGINSRDL+MQLELLM DS GTNSEGHDLFVSSDDDGGEGSIC+SD+IDDI
Subjt:  SIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDI

Query:  MSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFR
        MSTFKFKDSR FSYLVDVLSEA L  +NLE G VSW++QE HVIS +VFE LEKKFGEQISWRRSERKLLFDRI+SGL ELFQSF+GVPEWAKPVSRRFR
Subjt:  MSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFR

Query:  PLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF
        PL++ EMIEEELWILL+SQERE+NK+L+DKQFGKEI WIDLG+EIDSIC+ELERLL+NELVAEFGS+ELF
Subjt:  PLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF

XP_022133834.1 uncharacterized protein LOC111006294 [Momordica charantia]0.0e+0071.4Show/hide
Query:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFS---GCN----REKVDSALSAA
        M  +QCTA+VLEALMGF+EQQ+ HH  RHSR+LSE YLQR ASIG+PKKK PS+CHPFR T+EEP ELFN+L V DSF     CN    REK  SALS+A
Subjt:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFS---GCN----REKVDSALSAA

Query:  YMPLTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRP
         MPLTRHNFM  +HF TDKMI+TS D Q+L EV DSMDISPRPTREK YIFNHV+NG  +SKSH +LT G NDAG +F +RK GQA + +D D LKSS P
Subjt:  YMPLTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRP

Query:  FSEWRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKER-TMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETE
          EW+DKLCFSSSS TSL  SHLV++KCK FH S+NGKH+A+EKER TM   ++PIKQPSQVSRILD S RKTR+DFVNL +K S SESIYDDV RKETE
Subjt:  FSEWRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKER-TMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETE

Query:  YKMTSSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQRK------------MNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLS
        ++ T SPGLSN K EY+HSC FSVESYKAR  RE  IEEQ++              EMP+L H+ATLP+D +CKPVKYDFQKHVCSNKEH HSGSPLCLS
Subjt:  YKMTSSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQRK------------MNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLS

Query:  CKDKRLDQVSKNSHRLRFDSASTVTA-RSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSS-------------NKRLKIFPCPDSASDHVDDDGCMVG
        CKD+RLDQVSKNSHRLRF SA+TVT  RSRTRSRYE+LRNTWFLK EG  TWLQCKP ++SS             +K+L+IFPCP+SAS H+ DDGC+V 
Subjt:  CKDKRLDQVSKNSHRLRFDSASTVTA-RSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSS-------------NKRLKIFPCPDSASDHVDDDGCMVG

Query:  DDLKT--EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCC
          L+T  EKK LC++ S+NS  SR+ VVFC  NNP    N+ IE S K DY DDNFSG AS VLAVK DD EVP V K+  DS SC + E  GDSSTN  
Subjt:  DDLKT--EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCC

Query:  RATRTSIQQ--------EGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDD-G
        R T  SIQQ        EG  FEHYP KELDSIVSLEEA+QPSPVSVLE  FKEET+S SESSGINSRDLMMQLELLMSDS G+NSEGH++FVSSDDD G
Subjt:  RATRTSIQQ--------EGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDD-G

Query:  GEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFL
        GEGS CSS+EIDDIMSTFKFKDSRDFSYL+DVLSEAGL   NL+KGCVSW  QE HVIS SVFETLEKKFGEQ SWRRSERKLLFDRI+SGLIELFQS +
Subjt:  GEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFL

Query:  GVPEWAKPVSRRFRPLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSV
        GVPEWAKPVSRRFRPLLD+EM+EEELWILL+SQERE+NKDLVDKQFGKEIGWIDLGEEI+SICRELERLLI EL+AEFG +
Subjt:  GVPEWAKPVSRRFRPLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSV

XP_038889736.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida]0.0e+0071.49Show/hide
Query:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGCNR----EKVDSALSAAYMP
        ME RQCT +VLEALMGFDE+Q QHHAPRHS +LS+ YLQRVASIGI KKK PSRCHPFRMT+EEP+ELFNS KVE++FS CN     EK DS+LSA  MP
Subjt:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGCNR----EKVDSALSAAYMP

Query:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE
        LTRH  M  KHFST K+I+TSKDFQ+L EV+DSMDISPRPTR KN IFN  KNGP +SK H S TE NNDAG + KDRKLGQ  SSEDLD LKSSRP  E
Subjt:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE

Query:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAE-------EKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTSS-ESIYDDVCRK
        WRDKLCFSSSSPTSL  SHLVNDKCKD  SS+NGK+IA+       E +RTMEYALQPIKQ SQVS ILD SRR TR+ FVNLHLK S   +IYDDVCR 
Subjt:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAE-------EKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTSS-ESIYDDVCRK

Query:  ETEYKMTSSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQRK------------MNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPL
        ET+Y+  SSP LSNW  +Y+HSCFFSVESYKARE+REKV EEQRK            MNEMP LPH+A+LPSD +CKPVK+DFQKHVCSNKEHFHSGSPL
Subjt:  ETEYKMTSSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQRK------------MNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPL

Query:  CLSCKDKRLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCM
        CLS K KRLDQ+ KNSHRLRFDS S VT RSRTRSRYEALRNTWFLK EG G WLQCKP NRSSN             K+LKIFPCPDSASDHVD+D CM
Subjt:  CLSCKDKRLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCM

Query:  VGDDLKT--EKKGLCDRHSLNSPS-RSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTN
        VGDDLKT  EKK  CD+HSLN  S RSK VFCT+N PVK+ NQ                                                         
Subjt:  VGDDLKT--EKKGLCDRHSLNSPS-RSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTN

Query:  CCRATRTSIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICS
              TSIQQEGL FEHYPSKE DSIVSLEEA QPSPVSVLE  FK+ETL  SES GIN RDLMMQLELLMSDS GTNSEGHDLFVSSDDDGGEGSICS
Subjt:  CCRATRTSIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICS

Query:  SDEIDDIMSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAK
        S+EIDDIMSTFKFKDSRDFSYLVDVLSEA L  ++LE G VS H+QE  VIS +VFETLEKKFGEQ SWRRSERKLLFDRI+SGL+ELFQSF GVPEWAK
Subjt:  SDEIDDIMSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAK

Query:  PVSRRFRPLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF
        PVSRRFRPLL+ EMIEEELWILL+SQERE+NKDLVDKQFGKEIGWIDLG+EIDSICRELERLL+NELVAEFGS+ELF
Subjt:  PVSRRFRPLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF

XP_038889740.1 uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida]0.0e+0071.38Show/hide
Query:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGCNR----EKVDSALSAAYMP
        ME RQCT +VLEALMGFDE+Q QHHAPRHS +LS+ YLQRVASIGI KKK PSRCHPFRMT+EEP+ELFNS KVE++FS CN     EK DS+LSA  MP
Subjt:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGCNR----EKVDSALSAAYMP

Query:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE
        LTRH  M  KHFST K+I+TSKDFQ+L EV+DSMDISPRPTR KN IFN  KNGP +SK H S TE NNDAG + KDRKLGQ  SSEDLD LKSSRP  E
Subjt:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE

Query:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAE-------EKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTSS-ESIYDDVCRK
        WRDKLCFSSSSPTSL  SHLVNDKCKD  SS+NGK+IA+       E +RTMEYALQPIKQ SQVS ILD SRR TR+ FVNLHLK S   +IYDDVCR 
Subjt:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAE-------EKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTSS-ESIYDDVCRK

Query:  ETEYKMTSSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQRK------------MNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPL
        ET+Y+  SSP LSNW  +Y+HSCFFSVESYKARE+REKV EEQRK            MNEMP LPH+A+LPSD +CKPVK+DFQKHVCSNKEHFHSGSPL
Subjt:  ETEYKMTSSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQRK------------MNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPL

Query:  CLSCKDKRLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCM
        CLS K KRLDQ+ KNSHRLRFDS S VT RSRTRSRYEALRNTWFLK EG G WLQCKP NRSSN             K+LKIFPCPDSASDHVD+D CM
Subjt:  CLSCKDKRLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCM

Query:  VGDDLKT--EKKGLCDRHSLNSPS-RSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTN
        VGDDLKT  EKK  CD+HSLN  S RSK VFCT+N PVK+ NQ                                                         
Subjt:  VGDDLKT--EKKGLCDRHSLNSPS-RSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTN

Query:  CCRATRTSIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICS
              TSIQQEGL FEHYPSKE DSIVSLEEA QPSPVSVLE  FK+ETL  SES GIN  DLMMQLELLMSDS GTNSEGHDLFVSSDDDGGEGSICS
Subjt:  CCRATRTSIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICS

Query:  SDEIDDIMSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAK
        S+EIDDIMSTFKFKDSRDFSYLVDVLSEA L  ++LE G VS H+QE  VIS +VFETLEKKFGEQ SWRRSERKLLFDRI+SGL+ELFQSF GVPEWAK
Subjt:  SDEIDDIMSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAK

Query:  PVSRRFRPLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF
        PVSRRFRPLL+ EMIEEELWILL+SQERE+NKDLVDKQFGKEIGWIDLG+EIDSICRELERLL+NELVAEFGS+ELF
Subjt:  PVSRRFRPLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF

TrEMBL top hitse value%identityAlignment
A0A0A0KNN6 DUF4378 domain-containing protein0.0e+0068.35Show/hide
Query:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGC----NREKVDSALSAAYMP
        ME RQ TA+VLEALMGFDE Q+QH A RHS++ S+ YLQRVASIGI KKK PSRCHPFRMTIEEP+ELFNSLKVE++FS C     RE+ DS LSAAY P
Subjt:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGC----NREKVDSALSAAYMP

Query:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE
        LTRH     KHFST K+I+TSK FQDL EV+DSMDISPRPTR KN +F+  K+G  +S +H +LTEGNNDAG +FKDRK GQA  SEDL  LKSSRPF E
Subjt:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE

Query:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETEYKMT
        W +KL FSSS P SL  SHLV DKCK  H+S+NGK+IA+EKERT   +L+PIKQ SQVS ILDGSRR  R +F NLHLKTS SE+IYD+VCR +      
Subjt:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETEYKMT

Query:  SSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQ------------RKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDK
            LSNW  E +HSC FSVESYKARE+ EKVIEEQ            RKMNEMP +P YATLPSD +CKPV+YDFQKHVCS+KEH HSGSPLCLS K K
Subjt:  SSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQ------------RKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDK

Query:  RLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT
        RLD++ K  HRLRFDS STVT RSRTRSRYEAL NTWFLK EG GTWLQC PLNRSSN             K+LKIFPCPDSAS H D+DGCMVG D KT
Subjt:  RLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT

Query:  --EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRT
          +KK  CD+HSLN  P RSKVVFCT+N PVK+ NQ                                                               T
Subjt:  --EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRT

Query:  SIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDI
        SIQQEGLAF+HYPSKE DSIVSLEEA QPSPVSVLE  FKEETL  SES GINSRDL+MQLELLMSDS GTNSEGHDLFVSSDDD GEGSIC+SD+IDDI
Subjt:  SIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDI

Query:  MSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFR
        MSTFKFKDSR FSYLVDVLSEA L  +NLE G VSWH+QE HVIS +VFE LEKKFGEQISWRRSERKLLFDRI+SGL ELFQSF+GVPEWAKPVSRRFR
Subjt:  MSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFR

Query:  PLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVEL
        PLL+ EMIEEELWILL+SQERE+NK+LVDKQFGKEI WIDLG+EI+SICRELE LL+NELVAEFGS+EL
Subjt:  PLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVEL

A0A1S4E497 uncharacterized protein LOC1035016590.0e+0068.05Show/hide
Query:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGC----NREKVDSALSAAYMP
        ME R+ TA+VLE LMGFDE Q+QH  PRHS++ S+ YLQR ASIGI KKK PSRCHPFRMTIEEP+ELFNSLKVE++FS C     RE+ DS LSAA +P
Subjt:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGC----NREKVDSALSAAYMP

Query:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE
        LTRH  M  KHFST K+I+TSK FQDL EV+DSMDISPRP+R KN IF+H +NGP +SK++ +LTEGNNDAG +FKDR+ GQA  SEDL  LKSSRPF E
Subjt:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE

Query:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETEYKMT
        W +KL FSSS PTSL  SHLV DKCK  H+S+NGK+I +EKER+   +L+PIKQ SQVS ILDGSRR   ++F+NL LKTS SE+IYD++CR E      
Subjt:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETEYKMT

Query:  SSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQ------------RKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDK
            LSNW  E +HSC FSVESYKARE+ EKVIEEQ            RKMNEMP +PHYATLPSD +CKPVKYDFQKH CS+ EH HSGSPLCLS K K
Subjt:  SSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQ------------RKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDK

Query:  RLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT
        RLD++ K  HRLRFDS +TVT RSRTRSRYEALRNTWFLK EG GTWLQCKPLNRSSN             K+LKIFPCPDSAS HVD+DGCMVG DLKT
Subjt:  RLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT

Query:  --EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRT
          EKK  CD+HS N  P RSKVVFCT+N PVK+ NQ                                                               T
Subjt:  --EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRT

Query:  SIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDI
        SIQQEGLAFEHYPSKE DSIVSLEE  QPSPVSVLE  FKEETL  SESSGINSRDL+MQLELLM DS GTNSEGHDLFVSSDDDGGEGSIC+SD+IDDI
Subjt:  SIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDI

Query:  MSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFR
        MSTFKFKDSR FSYLVDVLSEA L  +NLE G VSW++QE HVIS +VFE LEKKFGEQISWRRSERKLLFDRI+SGL ELFQSF+GVPEWAKPVSRRFR
Subjt:  MSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFR

Query:  PLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF
        PL++ EMIEEELWILL+SQERE+NK+L+DKQFGKEI WIDLG+EIDSIC+ELERLL+NELVAEFGS+ELF
Subjt:  PLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF

A0A5D3C1E7 DUF4378 domain-containing protein0.0e+0068.05Show/hide
Query:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGC----NREKVDSALSAAYMP
        ME R+ TA+VLE LMGFDE Q+QH  PRHS++ S+ YLQR ASIGI KKK PSRCHPFRMTIEEP+ELFNSLKVE++FS C     RE+ DS LSAA +P
Subjt:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGC----NREKVDSALSAAYMP

Query:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE
        LTRH  M  KHFST K+I+TSK FQDL EV+DSMDISPRP+R KN IF+H +NGP +SK++ +LTEGNNDAG +FKDR+ GQA  SEDL  LKSSRPF E
Subjt:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE

Query:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETEYKMT
        W +KL FSSS PTSL  SHLV DKCK  H+S+NGK+I +EKER+   +L+PIKQ SQVS ILDGSRR   ++F+NL LKTS SE+IYD++CR E      
Subjt:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETEYKMT

Query:  SSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQ------------RKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDK
            LSNW  E +HSC FSVESYKARE+ EKVIEEQ            RKMNEMP +PHYATLPSD +CKPVKYDFQKH CS+ EH HSGSPLCLS K K
Subjt:  SSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQ------------RKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDK

Query:  RLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT
        RLD++ K  HRLRFDS +TVT RSRTRSRYEALRNTWFLK EG GTWLQCKPLNRSSN             K+LKIFPCPDSAS HVD+DGCMVG DLKT
Subjt:  RLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT

Query:  --EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRT
          EKK  CD+HS N  P RSKVVFCT+N PVK+ NQ                                                               T
Subjt:  --EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRT

Query:  SIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDI
        SIQQEGLAFEHYPSKE DSIVSLEE  QPSPVSVLE  FKEETL  SESSGINSRDL+MQLELLM DS GTNSEGHDLFVSSDDDGGEGSIC+SD+IDDI
Subjt:  SIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDI

Query:  MSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFR
        MSTFKFKDSR FSYLVDVLSEA L  +NLE G VSW++QE HVIS +VFE LEKKFGEQISWRRSERKLLFDRI+SGL ELFQSF+GVPEWAKPVSRRFR
Subjt:  MSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFR

Query:  PLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF
        PL++ EMIEEELWILL+SQERE+NK+L+DKQFGKEI WIDLG+EIDSIC+ELERLL+NELVAEFGS+ELF
Subjt:  PLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF

A0A6J1BX36 uncharacterized protein LOC1110062940.0e+0071.4Show/hide
Query:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFS---GCN----REKVDSALSAA
        M  +QCTA+VLEALMGF+EQQ+ HH  RHSR+LSE YLQR ASIG+PKKK PS+CHPFR T+EEP ELFN+L V DSF     CN    REK  SALS+A
Subjt:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFS---GCN----REKVDSALSAA

Query:  YMPLTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRP
         MPLTRHNFM  +HF TDKMI+TS D Q+L EV DSMDISPRPTREK YIFNHV+NG  +SKSH +LT G NDAG +F +RK GQA + +D D LKSS P
Subjt:  YMPLTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRP

Query:  FSEWRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKER-TMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETE
          EW+DKLCFSSSS TSL  SHLV++KCK FH S+NGKH+A+EKER TM   ++PIKQPSQVSRILD S RKTR+DFVNL +K S SESIYDDV RKETE
Subjt:  FSEWRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKER-TMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTS-SESIYDDVCRKETE

Query:  YKMTSSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQRK------------MNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLS
        ++ T SPGLSN K EY+HSC FSVESYKAR  RE  IEEQ++              EMP+L H+ATLP+D +CKPVKYDFQKHVCSNKEH HSGSPLCLS
Subjt:  YKMTSSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQRK------------MNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLS

Query:  CKDKRLDQVSKNSHRLRFDSASTVTA-RSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSS-------------NKRLKIFPCPDSASDHVDDDGCMVG
        CKD+RLDQVSKNSHRLRF SA+TVT  RSRTRSRYE+LRNTWFLK EG  TWLQCKP ++SS             +K+L+IFPCP+SAS H+ DDGC+V 
Subjt:  CKDKRLDQVSKNSHRLRFDSASTVTA-RSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSS-------------NKRLKIFPCPDSASDHVDDDGCMVG

Query:  DDLKT--EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCC
          L+T  EKK LC++ S+NS  SR+ VVFC  NNP    N+ IE S K DY DDNFSG AS VLAVK DD EVP V K+  DS SC + E  GDSSTN  
Subjt:  DDLKT--EKKGLCDRHSLNS-PSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCC

Query:  RATRTSIQQ--------EGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDD-G
        R T  SIQQ        EG  FEHYP KELDSIVSLEEA+QPSPVSVLE  FKEET+S SESSGINSRDLMMQLELLMSDS G+NSEGH++FVSSDDD G
Subjt:  RATRTSIQQ--------EGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDD-G

Query:  GEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFL
        GEGS CSS+EIDDIMSTFKFKDSRDFSYL+DVLSEAGL   NL+KGCVSW  QE HVIS SVFETLEKKFGEQ SWRRSERKLLFDRI+SGLIELFQS +
Subjt:  GEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFL

Query:  GVPEWAKPVSRRFRPLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSV
        GVPEWAKPVSRRFRPLLD+EM+EEELWILL+SQERE+NKDLVDKQFGKEIGWIDLGEEI+SICRELERLLI EL+AEFG +
Subjt:  GVPEWAKPVSRRFRPLLDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSV

A0A6J1JSS4 uncharacterized protein LOC1114871972.0e-22253.94Show/hide
Query:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASI-GIPKKKSPSRCHPFRMTIEEPSELF---NSLKVEDSFSGCNREKVDSALSAAYMP
        ME  QC+A+VLEALMGFDE Q++H A   SR LSE YLQRVASI G  KKKSPSRC PFRMTIEEP E+F   N L   + FS                 
Subjt:  MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASI-GIPKKKSPSRCHPFRMTIEEPSELF---NSLKVEDSFSGCNREKVDSALSAAYMP

Query:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE
            NFMN KHFSTD++I TSKDF DL E +DSMDISPR TR K+  FNHV+NGP +SK                                         
Subjt:  LTRHNFMNGKHFSTDKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSE

Query:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTSSESIYDDVCRKETEYKMTS
                      LN++H                                                                                 
Subjt:  WRDKLCFSSSSPTSLNDSHLVNDKCKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTSSESIYDDVCRKETEYKMTS

Query:  SPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQRK------------MNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDKR
               K EY+ SCF SVESYK  E+REKVIEEQRK            MNEM +LPHYAT PSD +CKPV+YDF K +C NK+H HSGSPLCLSCKD+R
Subjt:  SPGLSNWKKEYEHSCFFSVESYKAREAREKVIEEQRK------------MNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDKR

Query:  LDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT-
         D++SK  HR R DSA TV ARSR RSRYEALRNTWFLKPEGLGTWLQ KPLN  SN             K+L+IFPCPDS SDHVD+DGC+VG+DLKT 
Subjt:  LDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSSN-------------KRLKIFPCPDSASDHVDDDGCMVGDDLKT-

Query:  -EKKGLCDRHSLNSPSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRTSI
         EK GLCD+HS+N  S                N  IE+                                     S S I+ E  G SST  CRAT TSI
Subjt:  -EKKGLCDRHSLNSPSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRTSI

Query:  QQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMS
        QQ+GL+F+ Y SKELDSIV LEE +QPSPVSVLE HFKEET S  ESSGINSR    +LELLM DS GTNS+ H+LFVSS++DGGEGSIC+SDEI DIMS
Subjt:  QQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMS

Query:  TFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFRPL
        TFKFKDSRDFSYLVDV+SEAGL  RNLEKGCV WHDQE +VIS SVFE LEKKFGEQ+SWRRSERKLLFDRI+SGL ELFQSF+GVPEWAKPVSRRFRPL
Subjt:  TFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFRPL

Query:  LDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGS
        LDQEM+E++LW LL+SQE+E NKDLVDKQFGKEIGWIDL +EI SICRELE LLI ELVAE GS
Subjt:  LDQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39435.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related4.0e-3437.75Show/hide
Query:  EEAHQPSPVSVLESHFKEETLSISESSGINSRDLMM--------QLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYL
        E+AHQPSPVSVLE  F E+ L  SE    +S DL          QLE L S+S  + S+G  + VSSD++    S     +  + +     ++SRD SY+
Subjt:  EEAHQPSPVSVLESHFKEETLSISESSGINSRDLMM--------QLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYL

Query:  VDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFRPLLDQEMIEEELWIL
         D+L+E  L  +N   G      +   VI+  +FE LEKK+  + SW+RS+RK+LFDR++S L+E+ +SF   P W KPVSRR    L    +++ELW +
Subjt:  VDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFRPLLDQEMIEEELWIL

Query:  LESQEREMNKDLVDKQFGKEIG-WIDLGEEIDSICRELERLLINELVAE
        L  QE+   K  + K    +I  W++L  + +S+  ELE ++++EL++E
Subjt:  LESQEREMNKDLVDKQFGKEIG-WIDLGEEIDSICRELERLLINELVAE

AT2G39435.2 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related6.4e-3237.76Show/hide
Query:  EEAHQPSPVSVLESHFKEETLSISESSGINSRDLMM--------QLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYL
        E+AHQPSPVSVLE  F E+ L  SE    +S DL          QLE L S+S  + S+G  + VSSD++    S     +  + +     ++SRD SY+
Subjt:  EEAHQPSPVSVLESHFKEETLSISESSGINSRDLMM--------QLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYL

Query:  VDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFRPLLDQEMIEEELWIL
         D+L+E  L  +N   G      +   VI+  +FE LEKK+  + SW+RS+RK+LFDR++S L+E+ +SF   P W KPVSRR    L    +++ELW +
Subjt:  VDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFRPLLDQEMIEEELWIL

Query:  LESQEREMNKDLVDKQFGKEIG-WIDLGEEIDSICRELERL
        L  QE+   K  + K    +I  W++L  + +S+  ELE++
Subjt:  LESQEREMNKDLVDKQFGKEIG-WIDLGEEIDSICRELERL

AT3G53540.1 unknown protein1.5e-2029.89Show/hide
Query:  SPSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRTSIQQEGLAFEHYPSK
        +PS SK V+     P K  +    RSS        FSG A+       +D      +K    S +  L      +  +  R T   +    +     P +
Subjt:  SPSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCILLEAGGDSSTNCCRATRTSIQQEGLAFEHYPSK

Query:  ELDSIVSLEEAHQPSPVSVLESHFKEETLSIS---ESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSR-D
              S +E  QPSPVSVLE+ F ++  S S   ES   + R L MQL+LL  +S  T  EG  + VSSD+D  +    SS   D+ M T + ++    
Subjt:  ELDSIVSLEEAHQPSPVSVLESHFKEETLSIS---ESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSR-D

Query:  FSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISA------SVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFRPLLDQ
         SYLVD+L+ +            S+ D + +++ A      S+FE LEKK+    +  R ERKLLFD+I   ++ + +       W K  S +  P  D 
Subjt:  FSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISA------SVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFRPLLDQ

Query:  EMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAE
          I+E L  L+  ++ + +K  V++   KE+ W+ L ++I+ I RE+E +L +EL+ E
Subjt:  EMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAE

AT5G43880.1 Protein of unknown function (DUF3741)1.1e-0425.48Show/hide
Query:  SLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLEL---LMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDV
        S E   +PSP+SVLE+ F EE      SS +N     ++ E+   L+  S    S G  L   S DD       SS       +T    +  D   L++ 
Subjt:  SLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLEL---LMSDSLGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDV

Query:  LSEAGLRRRNLEKGCVSWHDQEC----HVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFL----------GVPEWAKPVSRRFRPLLD
        L  A       +     WH  E      + ++    T +K+ G  +      + L+FD +++ L+EL  S+L          G P     ++R    L  
Subjt:  LSEAGLRRRNLEKGCVSWHDQEC----HVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFL----------GVPEWAKPVSRRFRPLLD

Query:  QEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGE----EIDSICRELERLLINELVAE
           +E+  W     ++ +++   V+K    E+  I   E    E+DS+  ELE  L+ ELV E
Subjt:  QEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGE----EIDSICRELERLLINELVAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACATAGACAGTGTACAGCTACTGTTCTTGAAGCATTGATGGGCTTTGATGAGCAGCAAACTCAGCATCATGCTCCAAGGCATTCGAGAATTCTTTCTGAAAGTTA
TTTACAAAGGGTTGCTTCCATTGGAATCCCGAAGAAGAAATCCCCCTCCAGATGTCATCCATTTAGGATGACAATAGAAGAGCCATCAGAACTCTTTAATTCTCTCAAAG
TAGAAGATAGCTTTAGCGGTTGCAATCGTGAGAAGGTAGATTCTGCTTTATCAGCAGCATATATGCCACTTACAAGACACAACTTCATGAATGGAAAGCATTTTTCAACA
GATAAGATGATTGAGACTTCAAAGGATTTTCAAGATTTACGGGAGGTTATAGATTCTATGGACATCTCACCAAGACCTACAAGAGAAAAGAATTATATATTCAACCATGT
CAAAAATGGTCCGTGTATGTCAAAGTCACATAATTCTTTGACAGAAGGAAATAATGATGCAGGCATTAGATTTAAGGACAGAAAACTAGGACAGGCACGCTCGTCTGAGG
ATCTAGATCCTTTGAAGTCTTCAAGACCCTTTTCGGAGTGGAGAGATAAACTATGCTTTTCTTCCTCTTCACCAACTTCTTTGAATGACTCACATTTAGTTAACGACAAA
TGCAAAGATTTTCATAGTTCTGAAAATGGAAAGCATATTGCTGAAGAAAAAGAAAGGACTATGGAGTATGCACTGCAGCCCATCAAGCAACCATCTCAAGTTTCAAGAAT
TTTGGATGGAAGTAGGAGAAAAACGAGGAACGATTTTGTCAATTTGCATTTGAAGACCTCATCAGAATCCATATATGATGATGTGTGTAGAAAAGAAACCGAGTACAAAA
TGACTTCTTCCCCAGGTTTATCTAATTGGAAGAAGGAATACGAGCATTCCTGCTTCTTTTCAGTTGAGTCATACAAGGCCAGAGAAGCCAGGGAGAAAGTCATAGAAGAA
CAAAGGAAAATGAATGAAATGCCTGTACTACCTCATTATGCAACTTTGCCCAGTGATTTCAGTTGCAAGCCTGTCAAATATGATTTCCAGAAGCATGTTTGTTCAAACAA
GGAACATTTTCATTCTGGCAGTCCCTTGTGCCTGAGCTGCAAGGATAAAAGATTAGATCAGGTCAGTAAAAACTCCCACAGATTGAGATTCGATTCTGCTTCTACGGTGA
CTGCAAGATCTAGAACCAGGAGTAGATATGAAGCCCTTAGAAATACATGGTTCTTAAAGCCTGAAGGTCTTGGTACCTGGCTACAATGCAAGCCACTGAATAGAAGTTCC
AATAAGAGGTTGAAGATTTTTCCTTGCCCTGATTCAGCAAGCGATCATGTTGATGATGATGGCTGTATGGTTGGTGATGATCTGAAGACTGAGAAGAAAGGTCTTTGTGA
TCGACATTCTTTAAACTCTCCATCAAGGAGCAAAGTTGTTTTTTGCACACGAAACAATCCCGTCAAACGAGAAAATCAGACCATAGAGCGTTCTTCGAAATGTGATTATC
AAGATGATAATTTTTCAGGTACGGCTTCTAAGGTATTGGCTGTAAAGAATGATGATGTGGAGGTCCCGGCTGTGGTTAAACGAGGTTTTGATTCATTTTCTTGCATCCTT
CTAGAGGCTGGCGGTGATTCATCTACCAACTGTTGTCGTGCCACACGTACTTCTATTCAACAGGAAGGTCTTGCCTTTGAACACTACCCTAGCAAAGAGCTAGATTCTAT
TGTGAGTTTGGAGGAGGCTCATCAACCTAGCCCAGTTTCAGTCCTTGAATCACATTTCAAAGAAGAAACATTATCCATTTCTGAATCCTCAGGCATTAACAGTAGAGATT
TAATGATGCAACTTGAACTTCTGATGTCGGACTCCCTGGGAACTAATTCAGAAGGACATGATTTGTTCGTATCAAGTGATGATGATGGTGGAGAAGGATCTATATGCAGT
TCTGACGAAATTGATGACATAATGAGCACATTCAAGTTCAAAGATAGTAGGGATTTTTCGTACCTTGTTGATGTCTTAAGTGAGGCAGGCTTGCGTCGTAGAAACCTGGA
GAAAGGTTGTGTTTCGTGGCATGATCAGGAATGTCACGTGATTAGCGCCTCAGTCTTCGAGACCTTAGAGAAGAAGTTTGGGGAACAAATTTCTTGGAGGAGATCAGAAA
GAAAGCTTCTCTTTGACAGAATAGATTCTGGGTTAATAGAACTTTTTCAGTCATTTCTTGGTGTGCCAGAATGGGCAAAGCCTGTATCGAGACGATTTCGGCCATTGCTT
GACCAGGAAATGATCGAGGAAGAACTATGGATCCTGCTGGAGAGCCAAGAAAGGGAAATGAACAAAGATTTAGTTGATAAACAGTTTGGAAAGGAGATTGGGTGGATAGA
TCTTGGAGAGGAGATTGATTCTATTTGTAGAGAACTAGAGAGATTGTTGATCAATGAGCTCGTTGCAGAGTTTGGTAGCGTCGAATTATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACATAGACAGTGTACAGCTACTGTTCTTGAAGCATTGATGGGCTTTGATGAGCAGCAAACTCAGCATCATGCTCCAAGGCATTCGAGAATTCTTTCTGAAAGTTA
TTTACAAAGGGTTGCTTCCATTGGAATCCCGAAGAAGAAATCCCCCTCCAGATGTCATCCATTTAGGATGACAATAGAAGAGCCATCAGAACTCTTTAATTCTCTCAAAG
TAGAAGATAGCTTTAGCGGTTGCAATCGTGAGAAGGTAGATTCTGCTTTATCAGCAGCATATATGCCACTTACAAGACACAACTTCATGAATGGAAAGCATTTTTCAACA
GATAAGATGATTGAGACTTCAAAGGATTTTCAAGATTTACGGGAGGTTATAGATTCTATGGACATCTCACCAAGACCTACAAGAGAAAAGAATTATATATTCAACCATGT
CAAAAATGGTCCGTGTATGTCAAAGTCACATAATTCTTTGACAGAAGGAAATAATGATGCAGGCATTAGATTTAAGGACAGAAAACTAGGACAGGCACGCTCGTCTGAGG
ATCTAGATCCTTTGAAGTCTTCAAGACCCTTTTCGGAGTGGAGAGATAAACTATGCTTTTCTTCCTCTTCACCAACTTCTTTGAATGACTCACATTTAGTTAACGACAAA
TGCAAAGATTTTCATAGTTCTGAAAATGGAAAGCATATTGCTGAAGAAAAAGAAAGGACTATGGAGTATGCACTGCAGCCCATCAAGCAACCATCTCAAGTTTCAAGAAT
TTTGGATGGAAGTAGGAGAAAAACGAGGAACGATTTTGTCAATTTGCATTTGAAGACCTCATCAGAATCCATATATGATGATGTGTGTAGAAAAGAAACCGAGTACAAAA
TGACTTCTTCCCCAGGTTTATCTAATTGGAAGAAGGAATACGAGCATTCCTGCTTCTTTTCAGTTGAGTCATACAAGGCCAGAGAAGCCAGGGAGAAAGTCATAGAAGAA
CAAAGGAAAATGAATGAAATGCCTGTACTACCTCATTATGCAACTTTGCCCAGTGATTTCAGTTGCAAGCCTGTCAAATATGATTTCCAGAAGCATGTTTGTTCAAACAA
GGAACATTTTCATTCTGGCAGTCCCTTGTGCCTGAGCTGCAAGGATAAAAGATTAGATCAGGTCAGTAAAAACTCCCACAGATTGAGATTCGATTCTGCTTCTACGGTGA
CTGCAAGATCTAGAACCAGGAGTAGATATGAAGCCCTTAGAAATACATGGTTCTTAAAGCCTGAAGGTCTTGGTACCTGGCTACAATGCAAGCCACTGAATAGAAGTTCC
AATAAGAGGTTGAAGATTTTTCCTTGCCCTGATTCAGCAAGCGATCATGTTGATGATGATGGCTGTATGGTTGGTGATGATCTGAAGACTGAGAAGAAAGGTCTTTGTGA
TCGACATTCTTTAAACTCTCCATCAAGGAGCAAAGTTGTTTTTTGCACACGAAACAATCCCGTCAAACGAGAAAATCAGACCATAGAGCGTTCTTCGAAATGTGATTATC
AAGATGATAATTTTTCAGGTACGGCTTCTAAGGTATTGGCTGTAAAGAATGATGATGTGGAGGTCCCGGCTGTGGTTAAACGAGGTTTTGATTCATTTTCTTGCATCCTT
CTAGAGGCTGGCGGTGATTCATCTACCAACTGTTGTCGTGCCACACGTACTTCTATTCAACAGGAAGGTCTTGCCTTTGAACACTACCCTAGCAAAGAGCTAGATTCTAT
TGTGAGTTTGGAGGAGGCTCATCAACCTAGCCCAGTTTCAGTCCTTGAATCACATTTCAAAGAAGAAACATTATCCATTTCTGAATCCTCAGGCATTAACAGTAGAGATT
TAATGATGCAACTTGAACTTCTGATGTCGGACTCCCTGGGAACTAATTCAGAAGGACATGATTTGTTCGTATCAAGTGATGATGATGGTGGAGAAGGATCTATATGCAGT
TCTGACGAAATTGATGACATAATGAGCACATTCAAGTTCAAAGATAGTAGGGATTTTTCGTACCTTGTTGATGTCTTAAGTGAGGCAGGCTTGCGTCGTAGAAACCTGGA
GAAAGGTTGTGTTTCGTGGCATGATCAGGAATGTCACGTGATTAGCGCCTCAGTCTTCGAGACCTTAGAGAAGAAGTTTGGGGAACAAATTTCTTGGAGGAGATCAGAAA
GAAAGCTTCTCTTTGACAGAATAGATTCTGGGTTAATAGAACTTTTTCAGTCATTTCTTGGTGTGCCAGAATGGGCAAAGCCTGTATCGAGACGATTTCGGCCATTGCTT
GACCAGGAAATGATCGAGGAAGAACTATGGATCCTGCTGGAGAGCCAAGAAAGGGAAATGAACAAAGATTTAGTTGATAAACAGTTTGGAAAGGAGATTGGGTGGATAGA
TCTTGGAGAGGAGATTGATTCTATTTGTAGAGAACTAGAGAGATTGTTGATCAATGAGCTCGTTGCAGAGTTTGGTAGCGTCGAATTATTTTGA
Protein sequenceShow/hide protein sequence
MEHRQCTATVLEALMGFDEQQTQHHAPRHSRILSESYLQRVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDSFSGCNREKVDSALSAAYMPLTRHNFMNGKHFST
DKMIETSKDFQDLREVIDSMDISPRPTREKNYIFNHVKNGPCMSKSHNSLTEGNNDAGIRFKDRKLGQARSSEDLDPLKSSRPFSEWRDKLCFSSSSPTSLNDSHLVNDK
CKDFHSSENGKHIAEEKERTMEYALQPIKQPSQVSRILDGSRRKTRNDFVNLHLKTSSESIYDDVCRKETEYKMTSSPGLSNWKKEYEHSCFFSVESYKAREAREKVIEE
QRKMNEMPVLPHYATLPSDFSCKPVKYDFQKHVCSNKEHFHSGSPLCLSCKDKRLDQVSKNSHRLRFDSASTVTARSRTRSRYEALRNTWFLKPEGLGTWLQCKPLNRSS
NKRLKIFPCPDSASDHVDDDGCMVGDDLKTEKKGLCDRHSLNSPSRSKVVFCTRNNPVKRENQTIERSSKCDYQDDNFSGTASKVLAVKNDDVEVPAVVKRGFDSFSCIL
LEAGGDSSTNCCRATRTSIQQEGLAFEHYPSKELDSIVSLEEAHQPSPVSVLESHFKEETLSISESSGINSRDLMMQLELLMSDSLGTNSEGHDLFVSSDDDGGEGSICS
SDEIDDIMSTFKFKDSRDFSYLVDVLSEAGLRRRNLEKGCVSWHDQECHVISASVFETLEKKFGEQISWRRSERKLLFDRIDSGLIELFQSFLGVPEWAKPVSRRFRPLL
DQEMIEEELWILLESQEREMNKDLVDKQFGKEIGWIDLGEEIDSICRELERLLINELVAEFGSVELF