; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018267 (gene) of Snake gourd v1 genome

Gene IDTan0018267
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF4378 domain-containing protein
Genome locationLG11:6562407..6566338
RNA-Seq ExpressionTan0018267
SyntenyTan0018267
Gene Ontology termsNA
InterPro domainsIPR022212 - Domain of unknown function DUF3741
IPR025486 - Domain of unknown function DUF4378


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05761.1 uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa]0.0e+0070.16Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP
        MEPR+ TASVLE LMGFDE QSQH  P+HS+V S+ YLQ+ ASIGI KKK PSRCHPFRMTIEEP+ELFNSLKVE+NFSRC +LW+ E+ADS LS A +P
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP

Query:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE
        LTRH  M EKHFST  VI+TSK FQDL EV+DSMDISPRP+R KN IF+H +NGPSVSK++Y LTEGNNDAGTKFKDR+QGQAH SEDL +LKSSRPF E
Subjt:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE

Query:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTS
        W +KL  SSS PTSLKGSHLV  KCK  H+SQN K+  +EKER    L+P KQ SQVSSILDGSR    H+ +N  LKTSRSE+IYDN+ R E       
Subjt:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTS

Query:  SPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSP
           LSNW  E KHSC         CFSVESYKARES EKVIEEQRKTE+LM S +GRKMNEMP +PHYATLP++L CKPVKYDFQK    + EHLHSGSP
Subjt:  SPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSP

Query:  LCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDC
        LCLS K KRLD+L K  HRLRF   +TV  RSRTR+RYEALR+TWFLK EG GTWLQCKPLNRSSNK+DA++PT KLSSKKL+IFPCPDSAS HVDND C
Subjt:  LCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDC

Query:  MVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSST
        MVG DLKT VEKK  CDQHS N L  RSKVVFCTQN P ++ NQA                                                       
Subjt:  MVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSST

Query:  NSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC
               TSIQQ+GLAFEHYPSKE DSIVSLEE +QPSPVSVLEPLFKEE+  SSESSGIN RDL+MQLELLM DSPGTNSEGHDLFVSSDDDGGEGSIC
Subjt:  NSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC

Query:  SSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWA
        +SD+IDDIMSTFKFKDSR FSYLVDVL+EA L  +NLE G VS + QE HVISP VFE LEKKFGEQ+SWRRSERKLLFDRINSGL ELFQSF+GVPEWA
Subjt:  SSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWA

Query:  KSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF
        K VSRRFRPL++ EMIEEELWILLDSQEREVNK+L+DKQFGKEI W DLG EI+SIC+ELERLL+ ELVAEFGSIELF
Subjt:  KSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF

XP_008463525.1 PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo]0.0e+0070.16Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP
        MEPR+ TASVLE LMGFDE QSQH  P+HS+V S+ YLQ+ ASIGI KKK PSRCHPFRMTIEEP+ELFNSLKVE+NFSRC +LW+ E+ADS LS A +P
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP

Query:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE
        LTRH  M EKHFST  VI+TSK FQDL EV+DSMDISPRP+R KN IF+H +NGPSVSK++Y LTEGNNDAGTKFKDR+QGQAH SEDL +LKSSRPF E
Subjt:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE

Query:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTS
        W +KL  SSS PTSLKGSHLV  KCK  H+SQN K+  +EKER    L+P KQ SQVSSILDGSR    H+ +N  LKTSRSE+IYDN+ R E       
Subjt:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTS

Query:  SPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSP
           LSNW  E KHSC         CFSVESYKARES EKVIEEQRKTE+LM S +GRKMNEMP +PHYATLP++L CKPVKYDFQK    + EHLHSGSP
Subjt:  SPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSP

Query:  LCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDC
        LCLS K KRLD+L K  HRLRF   +TV  RSRTR+RYEALR+TWFLK EG GTWLQCKPLNRSSNK+DA++PT KLSSKKL+IFPCPDSAS HVDND C
Subjt:  LCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDC

Query:  MVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSST
        MVG DLKT VEKK  CDQHS N L  RSKVVFCTQN P ++ NQA                                                       
Subjt:  MVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSST

Query:  NSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC
               TSIQQ+GLAFEHYPSKE DSIVSLEE +QPSPVSVLEPLFKEE+  SSESSGIN RDL+MQLELLM DSPGTNSEGHDLFVSSDDDGGEGSIC
Subjt:  NSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC

Query:  SSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWA
        +SD+IDDIMSTFKFKDSR FSYLVDVL+EA L  +NLE G VS + QE HVISP VFE LEKKFGEQ+SWRRSERKLLFDRINSGL ELFQSF+GVPEWA
Subjt:  SSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWA

Query:  KSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF
        K VSRRFRPL++ EMIEEELWILLDSQEREVNK+L+DKQFGKEI W DLG EI+SIC+ELERLL+ ELVAEFGSIELF
Subjt:  KSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF

XP_022133834.1 uncharacterized protein LOC111006294 [Momordica charantia]0.0e+0073.26Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFS---RCNELWDGEKADSPLSVA
        M  +QCTASVLEALMGF+EQQS HH  +HSRVLSEGYLQ+ ASIG+PKKK PS+CHPFR T+EEP ELFN+L V D+F     CNEL   EK  S LS A
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFS---RCNELWDGEKADSPLSVA

Query:  YMPLTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRP
         MPLTRHNFM  +HF T+ +I+TS D Q+L EV DSMDISPRPTREK YIFNHV+NG S+SKSH+TLT G NDAGTKF +RKQGQA + +D D+LKSS P
Subjt:  YMPLTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRP

Query:  FFEWRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKER--VESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETE
          EW+DKLC SSSS TSLKGSHLV+ KCK FH SQN KH  +EKER  +  +++P KQPSQVS ILD S  +TRHD VN  +K SRSESIYD+VHRKETE
Subjt:  FFEWRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKER--VESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETE

Query:  YKRTSSPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHL
        ++ T SP LSN K EYKHSC         CFSVESYKAR  RE  IEEQ++T+ L+LS QG    EMPIL H+ATLPN+L CKPVKYDFQK    NKEHL
Subjt:  YKRTSSPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHL

Query:  HSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIA-RSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDH
        HSGSPLCLSCKD+RLDQ+SKNSHRLRF  A+TV   RSRTR+RYE+LR+TWFLK EG  TWLQCKP ++SS+ +DAS+PT KL SKKLRIFPCP+SAS H
Subjt:  HSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIA-RSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDH

Query:  VDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSET
        + +D C+V   L+TRVEKK LC+Q S+NSLSSR+ VVFC +NNP    N+AIE SLKSD+ DDNFSG AS+VLAVKTDDAEVPTV  QEPDS+SC +SET
Subjt:  VDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSET

Query:  DGDSSTNSCRATCTSIQQ--------DGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDL
        DGDSSTNS R TC SIQQ        +G  FEHYP KELDSIVSLEEAYQPSPVSVLEPLFKEE+ SSSESSGIN RDLMMQLELLMSDSPG+NSEGH++
Subjt:  DGDSSTNSCRATCTSIQQ--------DGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDL

Query:  FVSSDDD-GGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSG
        FVSSDDD GGEGS CSS+EIDDIMSTFKFKDSRDFSYL+DVL+EAGL+  NL+KGCVS  GQE HVISP VFETLEKKFGEQ SWRRSERKLLFDRINSG
Subjt:  FVSSDDD-GGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSG

Query:  LVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSI
        L+ELFQS +GVPEWAK VSRRFRPLLD+EM+EEELWILLDSQERE+NKDLVDKQFGKEIGW DLG EI SICRELERLLI EL+AEFG I
Subjt:  LVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSI

XP_038889736.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida]0.0e+0073.02Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP
        ME RQCT SVLEALMGFDE+Q QHHAP+HS VLS+ YLQ+VASIGI KKK PSRCHPFRMT+EEP+ELFNS KVE+NFSRCNELW+ EKADS LS   MP
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP

Query:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE
        LTRH  M EKHFST  VI+TSKDFQ+L EV+DSMDISPRPTR KN IFN  KNGPSVSK HY+ TE NNDAGTK KDRK GQ HSSEDLD LKSSRP  E
Subjt:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE

Query:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKH--------SVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRK
        WRDKLC SSSSPTSL+GSHLVN KCKD  SSQN K+        + E +  +E  LQP KQ SQVSSILD SR  TRH  VN HLK SR  +IYD+V R 
Subjt:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKH--------SVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRK

Query:  ETEYKRTSSPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNK
        ET+Y+R SSP LSNW  +YKHSCF         FSVESYKARESREKV EEQRKTENL+ STQGR+MNEMP LPH+A+LP++L CKPVK+DFQK    NK
Subjt:  ETEYKRTSSPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNK

Query:  EHLHSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSAS
        EH HSGSPLCLS K KRLDQL KNSHRLRF   S V  RSRTR+RYEALR+TWFLK EG G WLQCKP NRSSNK+DASEP+ KLSSKKL+IFPCPDSAS
Subjt:  EHLHSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSAS

Query:  DHVDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILS
        DHVDNDDCMVGDDLKT+VEKK  CDQHSLN LS RSK VFCTQN P ++ NQA                                               
Subjt:  DHVDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILS

Query:  ETDGDSSTNSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDD
                       TSIQQ+GL FEHYPSKE DSIVSLEEA+QPSPVSVLEPLFK+E+  SSES GINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDD
Subjt:  ETDGDSSTNSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDD

Query:  DGGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQS
        DGGEGSICSS+EIDDIMSTFKFKDSRDFSYLVDVL+EA LH ++LE G VS H QE  VISP VFETLEKKFGEQ SWRRSERKLLFDRINSGLVELFQS
Subjt:  DGGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQS

Query:  FIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF
        F GVPEWAK VSRRFRPLL+ EMIEEELWILLDSQEREVNKDLVDKQFGKEIGW DLG EI+SICRELERLL+ ELVAEFGSIELF
Subjt:  FIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF

XP_038889740.1 uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida]0.0e+0072.8Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP
        ME RQCT SVLEALMGFDE+Q QHHAP+HS VLS+ YLQ+VASIGI KKK PSRCHPFRMT+EEP+ELFNS KVE+NFSRCNELW+ EKADS LS   MP
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP

Query:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE
        LTRH  M EKHFST  VI+TSKDFQ+L EV+DSMDISPRPTR KN IFN  KNGPSVSK HY+ TE NNDAGTK KDRK GQ HSSEDLD LKSSRP  E
Subjt:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE

Query:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKH--------SVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRK
        WRDKLC SSSSPTSL+GSHLVN KCKD  SSQN K+        + E +  +E  LQP KQ SQVSSILD SR  TRH  VN HLK SR  +IYD+V R 
Subjt:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKH--------SVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRK

Query:  ETEYKRTSSPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNK
        ET+Y+R SSP LSNW  +YKHSCF         FSVESYKARESREKV EEQRKTENL+ STQGR+MNEMP LPH+A+LP++L CKPVK+DFQK    NK
Subjt:  ETEYKRTSSPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNK

Query:  EHLHSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSAS
        EH HSGSPLCLS K KRLDQL KNSHRLRF   S V  RSRTR+RYEALR+TWFLK EG G WLQCKP NRSSNK+DASEP+ KLSSKKL+IFPCPDSAS
Subjt:  EHLHSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSAS

Query:  DHVDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILS
        DHVDNDDCMVGDDLKT+VEKK  CDQHSLN LS RSK VFCTQN P ++ NQA                                               
Subjt:  DHVDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILS

Query:  ETDGDSSTNSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDD
                       TSIQQ+GL FEHYPSKE DSIVSLEEA+QPSPVSVLEPLFK+E+  SSES GIN  DLMMQLELLMSDSPGTNSEGHDLFVSSDD
Subjt:  ETDGDSSTNSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDD

Query:  DGGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQS
        DGGEGSICSS+EIDDIMSTFKFKDSRDFSYLVDVL+EA LH ++LE G VS H QE  VISP VFETLEKKFGEQ SWRRSERKLLFDRINSGLVELFQS
Subjt:  DGGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQS

Query:  FIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF
        F GVPEWAK VSRRFRPLL+ EMIEEELWILLDSQEREVNKDLVDKQFGKEIGW DLG EI+SICRELERLL+ ELVAEFGSIELF
Subjt:  FIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF

TrEMBL top hitse value%identityAlignment
A0A0A0KNN6 DUF4378 domain-containing protein0.0e+0070.47Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP
        MEPRQ TASVLEALMGFDE QSQH A +HS+V S+ YLQ+VASIGI KKK PSRCHPFRMTIEEP+ELFNSLKVE+NFSRC +LW+ E+ADS LS AY P
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP

Query:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE
        LTRH    EKHFST  VI+TSK FQDL EV+DSMDISPRPTR KN +F+  K+G SVS +HY LTEGNNDAGTKFKDRKQGQAH SEDL +LKSSRPF E
Subjt:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE

Query:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTS
        W +KL  SSS P SLKGSHLV  KCK  H+SQN K+  +EKER    L+P KQ SQVSSILDGSR   R +  N HLKTSRSE+IYDNV R +       
Subjt:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTS

Query:  SPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSP
           LSNW  E KHSC         CFSVESYKARES EKVIEEQRKT NLM STQGRKMNEMP +P YATLP++L CKPV+YDFQK    +KEHLHSGSP
Subjt:  SPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSP

Query:  LCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDC
        LCLS K KRLD+L K  HRLRF   STV  RSRTR+RYEAL +TWFLK EG GTWLQC PLNRSSNK+DA++PT KLSSKKL+IFPCPDSAS H DND C
Subjt:  LCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDC

Query:  MVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSST
        MVG D KT V+KK  CDQHSLN L  RSKVVFCTQN P ++ NQA                                                       
Subjt:  MVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSST

Query:  NSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC
               TSIQQ+GLAF+HYPSKE DSIVSLEEA+QPSPVSVLEPLFKEE+  SSES GIN RDL+MQLELLMSDSPGTNSEGHDLFVSSDDD GEGSIC
Subjt:  NSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC

Query:  SSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWA
        +SD+IDDIMSTFKFKDSR FSYLVDVL+EA LH +NLE G VS H QE+HVISP VFE LEKKFGEQ+SWRRSERKLLFDRINSGL ELFQSF+GVPEWA
Subjt:  SSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWA

Query:  KSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIEL
        K VSRRFRPLL+ EMIEEELWILLDSQEREVNK+LVDKQFGKEI W DLG EI SICRELE LL+ ELVAEFGSIEL
Subjt:  KSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIEL

A0A1S4E497 uncharacterized protein LOC1035016590.0e+0070.16Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP
        MEPR+ TASVLE LMGFDE QSQH  P+HS+V S+ YLQ+ ASIGI KKK PSRCHPFRMTIEEP+ELFNSLKVE+NFSRC +LW+ E+ADS LS A +P
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP

Query:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE
        LTRH  M EKHFST  VI+TSK FQDL EV+DSMDISPRP+R KN IF+H +NGPSVSK++Y LTEGNNDAGTKFKDR+QGQAH SEDL +LKSSRPF E
Subjt:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE

Query:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTS
        W +KL  SSS PTSLKGSHLV  KCK  H+SQN K+  +EKER    L+P KQ SQVSSILDGSR    H+ +N  LKTSRSE+IYDN+ R E       
Subjt:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTS

Query:  SPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSP
           LSNW  E KHSC         CFSVESYKARES EKVIEEQRKTE+LM S +GRKMNEMP +PHYATLP++L CKPVKYDFQK    + EHLHSGSP
Subjt:  SPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSP

Query:  LCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDC
        LCLS K KRLD+L K  HRLRF   +TV  RSRTR+RYEALR+TWFLK EG GTWLQCKPLNRSSNK+DA++PT KLSSKKL+IFPCPDSAS HVDND C
Subjt:  LCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDC

Query:  MVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSST
        MVG DLKT VEKK  CDQHS N L  RSKVVFCTQN P ++ NQA                                                       
Subjt:  MVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSST

Query:  NSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC
               TSIQQ+GLAFEHYPSKE DSIVSLEE +QPSPVSVLEPLFKEE+  SSESSGIN RDL+MQLELLM DSPGTNSEGHDLFVSSDDDGGEGSIC
Subjt:  NSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC

Query:  SSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWA
        +SD+IDDIMSTFKFKDSR FSYLVDVL+EA L  +NLE G VS + QE HVISP VFE LEKKFGEQ+SWRRSERKLLFDRINSGL ELFQSF+GVPEWA
Subjt:  SSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWA

Query:  KSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF
        K VSRRFRPL++ EMIEEELWILLDSQEREVNK+L+DKQFGKEI W DLG EI+SIC+ELERLL+ ELVAEFGSIELF
Subjt:  KSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF

A0A5D3C1E7 DUF4378 domain-containing protein0.0e+0070.16Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP
        MEPR+ TASVLE LMGFDE QSQH  P+HS+V S+ YLQ+ ASIGI KKK PSRCHPFRMTIEEP+ELFNSLKVE+NFSRC +LW+ E+ADS LS A +P
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMP

Query:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE
        LTRH  M EKHFST  VI+TSK FQDL EV+DSMDISPRP+R KN IF+H +NGPSVSK++Y LTEGNNDAGTKFKDR+QGQAH SEDL +LKSSRPF E
Subjt:  LTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFE

Query:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTS
        W +KL  SSS PTSLKGSHLV  KCK  H+SQN K+  +EKER    L+P KQ SQVSSILDGSR    H+ +N  LKTSRSE+IYDN+ R E       
Subjt:  WRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTS

Query:  SPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSP
           LSNW  E KHSC         CFSVESYKARES EKVIEEQRKTE+LM S +GRKMNEMP +PHYATLP++L CKPVKYDFQK    + EHLHSGSP
Subjt:  SPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSP

Query:  LCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDC
        LCLS K KRLD+L K  HRLRF   +TV  RSRTR+RYEALR+TWFLK EG GTWLQCKPLNRSSNK+DA++PT KLSSKKL+IFPCPDSAS HVDND C
Subjt:  LCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDC

Query:  MVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSST
        MVG DLKT VEKK  CDQHS N L  RSKVVFCTQN P ++ NQA                                                       
Subjt:  MVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSST

Query:  NSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC
               TSIQQ+GLAFEHYPSKE DSIVSLEE +QPSPVSVLEPLFKEE+  SSESSGIN RDL+MQLELLM DSPGTNSEGHDLFVSSDDDGGEGSIC
Subjt:  NSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC

Query:  SSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWA
        +SD+IDDIMSTFKFKDSR FSYLVDVL+EA L  +NLE G VS + QE HVISP VFE LEKKFGEQ+SWRRSERKLLFDRINSGL ELFQSF+GVPEWA
Subjt:  SSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWA

Query:  KSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF
        K VSRRFRPL++ EMIEEELWILLDSQEREVNK+L+DKQFGKEI W DLG EI+SIC+ELERLL+ ELVAEFGSIELF
Subjt:  KSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF

A0A6J1BX36 uncharacterized protein LOC1110062940.0e+0073.26Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFS---RCNELWDGEKADSPLSVA
        M  +QCTASVLEALMGF+EQQS HH  +HSRVLSEGYLQ+ ASIG+PKKK PS+CHPFR T+EEP ELFN+L V D+F     CNEL   EK  S LS A
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFS---RCNELWDGEKADSPLSVA

Query:  YMPLTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRP
         MPLTRHNFM  +HF T+ +I+TS D Q+L EV DSMDISPRPTREK YIFNHV+NG S+SKSH+TLT G NDAGTKF +RKQGQA + +D D+LKSS P
Subjt:  YMPLTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRP

Query:  FFEWRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKER--VESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETE
          EW+DKLC SSSS TSLKGSHLV+ KCK FH SQN KH  +EKER  +  +++P KQPSQVS ILD S  +TRHD VN  +K SRSESIYD+VHRKETE
Subjt:  FFEWRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKER--VESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETE

Query:  YKRTSSPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHL
        ++ T SP LSN K EYKHSC         CFSVESYKAR  RE  IEEQ++T+ L+LS QG    EMPIL H+ATLPN+L CKPVKYDFQK    NKEHL
Subjt:  YKRTSSPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHL

Query:  HSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIA-RSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDH
        HSGSPLCLSCKD+RLDQ+SKNSHRLRF  A+TV   RSRTR+RYE+LR+TWFLK EG  TWLQCKP ++SS+ +DAS+PT KL SKKLRIFPCP+SAS H
Subjt:  HSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIA-RSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDH

Query:  VDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSET
        + +D C+V   L+TRVEKK LC+Q S+NSLSSR+ VVFC +NNP    N+AIE SLKSD+ DDNFSG AS+VLAVKTDDAEVPTV  QEPDS+SC +SET
Subjt:  VDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSET

Query:  DGDSSTNSCRATCTSIQQ--------DGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDL
        DGDSSTNS R TC SIQQ        +G  FEHYP KELDSIVSLEEAYQPSPVSVLEPLFKEE+ SSSESSGIN RDLMMQLELLMSDSPG+NSEGH++
Subjt:  DGDSSTNSCRATCTSIQQ--------DGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDL

Query:  FVSSDDD-GGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSG
        FVSSDDD GGEGS CSS+EIDDIMSTFKFKDSRDFSYL+DVL+EAGL+  NL+KGCVS  GQE HVISP VFETLEKKFGEQ SWRRSERKLLFDRINSG
Subjt:  FVSSDDD-GGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSG

Query:  LVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSI
        L+ELFQS +GVPEWAK VSRRFRPLLD+EM+EEELWILLDSQERE+NKDLVDKQFGKEIGW DLG EI SICRELERLLI EL+AEFG I
Subjt:  LVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSI

A0A6J1HFD2 uncharacterized protein LOC1114637981.4e-24757.83Show/hide
Query:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASI-GIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYM
        MEP QC+ASVLEALMGFDE QS+H A  HSRVLSE YLQ+VASI G  KKKSPS+C PFRMTIEEP E+         FS  N LWD E           
Subjt:  MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASI-GIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYM

Query:  PLTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFF
          +  NFMNEKHFST+ +I TSKDF DL EVIDSMDISPR TR K+  FNHV+NGP+VSK H                                      
Subjt:  PLTRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFF

Query:  EWRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRT
                                                                                               +N HR        
Subjt:  EWRDKLCCSSSSPTSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRT

Query:  SSPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGS
                KDE K SCF +         VESYK  ESREKVIEEQRK  NLMLS QGR MNEM ILPHYAT P++L CKPV YDF     LNK+HLHSGS
Subjt:  SSPDLSNWKDEYKHSCFSVDEYKHSCFSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGS

Query:  PLCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDD
        PLCLSCKD+R D+L K SHR     A TVIARSR R+RYEALR+TWFLKPEGLGTWLQ KPLN  SNK++ SEP+SKLSSKKLRIFPCPDS SDHVDND 
Subjt:  PLCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDD

Query:  CMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSS
        C+VG+DLKTRVEK GLCDQHS+NSLSS S              N+A+E                                      SLS I+ ETDG SS
Subjt:  CMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSS

Query:  TNSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSI
        TNSCRATCTSIQQDGL+FE Y SKELDSIV LEE YQPSPVSVLE  FKEE+FSSSESSG+NGR    +L+LLM D+PGTNS+ H+LFVSS++D GEGSI
Subjt:  TNSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSI

Query:  CSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEW
        C+SDEI DIMSTFKFKDSRDFSYLVDV++EAGLH RNLEKG V  H QERHVISP VFE LEKKFGEQVSWRRSERKLLFDRINSGL ELF+SF+GVPEW
Subjt:  CSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEW

Query:  AKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGS
        AK VSRR RPLLDQEM+E+E+W LLDSQE+E NKDLVDKQFGKEIGW DLG EI SICRELE LLIIELVAE GS
Subjt:  AKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G20240.1 Protein of unknown function (DUF3741)5.9e-0425.4Show/hide
Query:  VSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSP-GTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVL
        V  E   QPSPVSVL+P F+EE   S +     G ++ ++  L+    P GT +      ++ +D+    +   +  I++        D   + ++  +L
Subjt:  VSLEEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMMQLELLMSDSP-GTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVL

Query:  TEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWR---RSERKLLFDRINSGLVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWILL
        T +G        G  SL  +   + SPL     +K   +++  R   RS RKL+FD +N+ + E   +          +++ F  L       E +W  L
Subjt:  TEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWR---RSERKLLFDRINSGLVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWILL

Query:  DSQEREVNKDLVDK--QFGKEIGWTDLGGEIESI-CRELERLLIIELV
          QE  VN ++  K   +G ++   +LG EIE I  +EL    + +L+
Subjt:  DSQEREVNKDLVDK--QFGKEIGWTDLGGEIESI-CRELERLLIIELV

AT2G39435.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related5.9e-3638.96Show/hide
Query:  EEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMM--------QLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYL
        E+A+QPSPVSVLEP+F E++   SE    +  DL          QLE L S+S  + S+G  + VSSD++    S     +  + +     ++SRD SY+
Subjt:  EEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMM--------QLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYL

Query:  VDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWIL
         D+L E  L  +N    CV   G+   VI+P +FE LEKK+  + SW+RS+RK+LFDR+NS LVE+ +SF   P W K VSRR    L    +++ELW +
Subjt:  VDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWIL

Query:  LDSQEREVNKDLVDKQFGKEIG-WTDLGGEIESICRELERLLIIELVAE
        L  QE+   K  + K    +I  W +L  + ES+  ELE +++ EL++E
Subjt:  LDSQEREVNKDLVDKQFGKEIG-WTDLGGEIESICRELERLLIIELVAE

AT2G39435.2 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related1.9e-3439Show/hide
Query:  EEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMM--------QLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYL
        E+A+QPSPVSVLEP+F E++   SE    +  DL          QLE L S+S  + S+G  + VSSD++    S     +  + +     ++SRD SY+
Subjt:  EEAYQPSPVSVLEPLFKEESFSSSESSGINGRDLMM--------QLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYL

Query:  VDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWIL
         D+L E  L  +N    CV   G+   VI+P +FE LEKK+  + SW+RS+RK+LFDR+NS LVE+ +SF   P W K VSRR    L    +++ELW +
Subjt:  VDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSWRRSERKLLFDRINSGLVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWIL

Query:  LDSQEREVNKDLVDKQFGKEIG-WTDLGGEIESICRELERL
        L  QE+   K  + K    +I  W +L  + ES+  ELE++
Subjt:  LDSQEREVNKDLVDKQFGKEIG-WTDLGGEIESICRELERL

AT3G53540.1 unknown protein5.3e-2123.67Show/hide
Query:  EPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMPL
        E ++ + S++  LMG D   SQ  + K  + +     Q+  S G    KS  +        ++  E+ ++   E N      L+   + ++ L+ A M  
Subjt:  EPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMPL

Query:  TRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFEW
         R  FM  K  ST+  +R SK+F D LE +DS          K+ +   +++  S+   H    +         +       +S   +D LK+ +     
Subjt:  TRHNFMNEKHFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFEW

Query:  RDKLCCSSSSP-TSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDN--VHRKETEYKR
        RD L  S  SP  +  G      +    H+S +      E+ R  S LQPTK       +L  +    R+    F   +S S+    +  +    T  ++
Subjt:  RDKLCCSSSSP-TSLKGSHLVNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDN--VHRKETEYKR

Query:  TSSPDL----SNWKDEYKHSCFSVDEYKHSC-------FSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKP
         S+ D+     N +D  + +     + K SC       F    ++     E        +E+ ++              ++ +LP++     V  + ++ 
Subjt:  TSSPDL----SNWKDEYKHSCFSVDEYKHSC-------FSVESYKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKP

Query:  GRLNKEHLHSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQC-KPLNRSSNKRDASEPTSKLSSKKLRIFP
         RL++          L+ K +   ++S++         S   AR  + N   +  D    + E    W +  +P+  SS  RD  + +   S  K R   
Subjt:  GRLNKEHLHSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEALRDTWFLKPEGLGTWLQC-KPLNRSSNKRDASEPTSKLSSKKLRIFP

Query:  CPDSASDH--------------VDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQ-DDNFSGTASDVLAVKT
          +SA  +              V  D    G+   +   + G    HS  S +S  +V      +     N  I     S F+   +FSG A+      +
Subjt:  CPDSASDH--------------VDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSLKSDFQ-DDNFSGTASDVLAVKT

Query:  DDAEVPTVVNQEPDSLSCILSETDGDSSTNSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSS---ESSGINGRDLMMQ
           ++ T ++ E   LS + S TD D S    R T   +    +     P +      S +E  QPSPVSVLE  F ++  S S   ES   + R L MQ
Subjt:  DDAEVPTVVNQEPDSLSCILSETDGDSSTNSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSS---ESSGINGRDLMMQ

Query:  LELLMSDSPGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSR-DFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQ
        L+LL  +S  T  EG  + VSSD+D  +    SS   D+ M T + ++     SYLVD+L  +     +      +        + P +FE LEKK+   
Subjt:  LELLMSDSPGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSR-DFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQ

Query:  VSWRRSERKLLFDRINSGLVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIE
         +  R ERKLLFD+I+  ++ + +       W KS   +  P  D   I+E L  L+  ++ + +K  V++   KE+ W  L  +IE I RE+E +L  E
Subjt:  VSWRRSERKLLFDRINSGLVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIE

Query:  LVAE
        L+ E
Subjt:  LVAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCTAGACAGTGTACAGCTAGTGTTCTTGAAGCATTGATGGGCTTTGATGAGCAGCAATCTCAGCACCATGCTCCAAAGCATTCTAGAGTTCTTTCTGAGGGTTA
TTTACAAAAGGTCGCTTCCATTGGAATTCCGAAGAAGAAATCCCCCTCCAGATGTCATCCATTTAGGATGACAATAGAAGAGCCATCAGAACTCTTTAATTCTCTCAAAG
TAGAAGATAACTTTAGCCGTTGCAACGAGTTATGGGATGGGGAGAAGGCTGATTCTCCTCTATCAGTAGCATACATGCCACTTACAAGACACAACTTCATGAATGAAAAG
CACTTTTCAACAAATGGCGTGATACGGACTTCAAAGGATTTTCAAGATTTACTGGAGGTTATAGATTCTATGGACATTTCACCAAGACCTACAAGAGAAAAGAATTATAT
ATTCAACCATGTCAAAAATGGACCAAGTGTGTCAAAGTCACATTATACTTTGACAGAAGGAAATAATGATGCAGGCACTAAATTTAAGGACAGGAAACAAGGACAGGCAC
ACTCATCTGAGGATCTAGATATTTTGAAGTCTTCAAGACCTTTTTTTGAGTGGAGAGATAAACTATGCTGTTCGTCCTCTTCTCCAACTTCTTTGAAAGGCTCACATTTA
GTTAACGGAAAATGCAAAGATTTTCATAGTTCTCAAAATAGAAAGCATAGTGTTGAAGAAAAAGAAAGAGTGGAAAGCATACTGCAGCCCACCAAGCAACCATCTCAAGT
TTCAAGTATTCTGGATGGAAGTAGGAGCAGAACAAGGCACGATGTTGTCAATTTTCATTTGAAGACGTCTAGATCAGAATCCATATATGACAATGTGCATAGAAAAGAAA
CCGAGTACAAAAGGACTTCTTCCCCAGATTTATCCAATTGGAAGGATGAATACAAGCATTCCTGCTTTTCAGTTGATGAATACAAGCATTCCTGCTTTTCAGTTGAGTCA
TACAAGGCCAGAGAATCCAGGGAGAAAGTTATAGAAGAACAAAGGAAGACTGAAAACTTGATGCTATCTACGCAAGGTAGGAAAATGAATGAAATGCCTATACTACCTCA
CTATGCAACTTTGCCCAATGAACTGTATTGCAAACCAGTCAAGTATGATTTCCAGAAGCCTGGTCGTTTGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGTTTGA
GCTGCAAGGATAAAAGATTAGATCAACTCAGTAAAAACTCTCATAGATTGAGATTTCATCCTGCTTCTACGGTGATTGCAAGATCTAGAACCAGGAACAGATACGAGGCC
CTTCGAGATACATGGTTCTTAAAGCCTGAAGGTCTTGGTACTTGGCTACAATGCAAGCCATTGAATAGAAGTTCCAATAAAAGGGATGCTTCAGAACCTACCTCGAAATT
AAGCTCTAAGAAGTTGAGGATTTTTCCTTGCCCTGATTCAGCAAGCGATCATGTTGACAATGATGACTGTATGGTTGGTGATGATCTGAAGACCAGAGTCGAGAAGAAAG
GCCTTTGTGATCAGCATTCTTTAAATTCTCTATCATCAAGGAGCAAAGTTGTTTTCTGCACACAAAACAATCCTGCCAGACGAGAAAATCAGGCCATAGAGCATTCTTTG
AAGAGTGATTTTCAAGATGATAATTTTTCAGGTACGGCTTCTGATGTATTAGCTGTAAAAACTGATGATGCGGAAGTCCCTACCGTGGTTAACCAAGAACCTGATTCATT
GTCCTGCATCCTTTCAGAGACTGACGGTGATTCATCTACCAACTCTTGTCGTGCCACATGTACTTCTATTCAACAGGATGGTCTTGCCTTTGAACACTACCCTAGCAAAG
AGCTAGATTCTATTGTGAGTTTGGAGGAGGCTTATCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAATCATTTTCCAGTTCTGAATCCTCGGGCATT
AACGGTAGAGATTTGATGATGCAACTTGAACTTCTGATGTCTGACTCCCCGGGAACTAACTCAGAAGGGCATGATTTGTTCGTATCAAGTGATGATGATGGTGGAGAAGG
ATCTATATGCAGTTCTGATGAAATTGATGACATAATGAGCACATTCAAGTTCAAAGATAGTAGGGATTTTTCATACCTTGTTGATGTCTTAACCGAGGCAGGCTTGCATA
GTAGAAACCTGGAGAAGGGTTGTGTTTCATTGCATGGTCAGGAACGCCACGTGATTAGCCCCTTGGTCTTCGAGACCTTAGAGAAGAAGTTTGGGGAACAAGTTTCTTGG
AGGAGATCAGAAAGAAAGCTTCTCTTTGACAGAATAAATTCTGGGTTAGTAGAACTCTTTCAGTCATTTATTGGTGTGCCAGAATGGGCAAAGTCTGTATCAAGAAGATT
TCGGCCATTGCTCGACCAGGAAATGATCGAGGAAGAACTATGGATCCTACTGGATAGCCAAGAAAGGGAAGTGAACAAAGATTTAGTAGATAAGCAGTTTGGAAAGGAGA
TTGGATGGACAGATCTTGGAGGAGAGATTGAGTCTATTTGTAGAGAACTAGAGAGATTGTTGATCATTGAGCTTGTTGCAGAGTTTGGTAGCATCGAATTATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACCTAGACAGTGTACAGCTAGTGTTCTTGAAGCATTGATGGGCTTTGATGAGCAGCAATCTCAGCACCATGCTCCAAAGCATTCTAGAGTTCTTTCTGAGGGTTA
TTTACAAAAGGTCGCTTCCATTGGAATTCCGAAGAAGAAATCCCCCTCCAGATGTCATCCATTTAGGATGACAATAGAAGAGCCATCAGAACTCTTTAATTCTCTCAAAG
TAGAAGATAACTTTAGCCGTTGCAACGAGTTATGGGATGGGGAGAAGGCTGATTCTCCTCTATCAGTAGCATACATGCCACTTACAAGACACAACTTCATGAATGAAAAG
CACTTTTCAACAAATGGCGTGATACGGACTTCAAAGGATTTTCAAGATTTACTGGAGGTTATAGATTCTATGGACATTTCACCAAGACCTACAAGAGAAAAGAATTATAT
ATTCAACCATGTCAAAAATGGACCAAGTGTGTCAAAGTCACATTATACTTTGACAGAAGGAAATAATGATGCAGGCACTAAATTTAAGGACAGGAAACAAGGACAGGCAC
ACTCATCTGAGGATCTAGATATTTTGAAGTCTTCAAGACCTTTTTTTGAGTGGAGAGATAAACTATGCTGTTCGTCCTCTTCTCCAACTTCTTTGAAAGGCTCACATTTA
GTTAACGGAAAATGCAAAGATTTTCATAGTTCTCAAAATAGAAAGCATAGTGTTGAAGAAAAAGAAAGAGTGGAAAGCATACTGCAGCCCACCAAGCAACCATCTCAAGT
TTCAAGTATTCTGGATGGAAGTAGGAGCAGAACAAGGCACGATGTTGTCAATTTTCATTTGAAGACGTCTAGATCAGAATCCATATATGACAATGTGCATAGAAAAGAAA
CCGAGTACAAAAGGACTTCTTCCCCAGATTTATCCAATTGGAAGGATGAATACAAGCATTCCTGCTTTTCAGTTGATGAATACAAGCATTCCTGCTTTTCAGTTGAGTCA
TACAAGGCCAGAGAATCCAGGGAGAAAGTTATAGAAGAACAAAGGAAGACTGAAAACTTGATGCTATCTACGCAAGGTAGGAAAATGAATGAAATGCCTATACTACCTCA
CTATGCAACTTTGCCCAATGAACTGTATTGCAAACCAGTCAAGTATGATTTCCAGAAGCCTGGTCGTTTGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGTTTGA
GCTGCAAGGATAAAAGATTAGATCAACTCAGTAAAAACTCTCATAGATTGAGATTTCATCCTGCTTCTACGGTGATTGCAAGATCTAGAACCAGGAACAGATACGAGGCC
CTTCGAGATACATGGTTCTTAAAGCCTGAAGGTCTTGGTACTTGGCTACAATGCAAGCCATTGAATAGAAGTTCCAATAAAAGGGATGCTTCAGAACCTACCTCGAAATT
AAGCTCTAAGAAGTTGAGGATTTTTCCTTGCCCTGATTCAGCAAGCGATCATGTTGACAATGATGACTGTATGGTTGGTGATGATCTGAAGACCAGAGTCGAGAAGAAAG
GCCTTTGTGATCAGCATTCTTTAAATTCTCTATCATCAAGGAGCAAAGTTGTTTTCTGCACACAAAACAATCCTGCCAGACGAGAAAATCAGGCCATAGAGCATTCTTTG
AAGAGTGATTTTCAAGATGATAATTTTTCAGGTACGGCTTCTGATGTATTAGCTGTAAAAACTGATGATGCGGAAGTCCCTACCGTGGTTAACCAAGAACCTGATTCATT
GTCCTGCATCCTTTCAGAGACTGACGGTGATTCATCTACCAACTCTTGTCGTGCCACATGTACTTCTATTCAACAGGATGGTCTTGCCTTTGAACACTACCCTAGCAAAG
AGCTAGATTCTATTGTGAGTTTGGAGGAGGCTTATCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAATCATTTTCCAGTTCTGAATCCTCGGGCATT
AACGGTAGAGATTTGATGATGCAACTTGAACTTCTGATGTCTGACTCCCCGGGAACTAACTCAGAAGGGCATGATTTGTTCGTATCAAGTGATGATGATGGTGGAGAAGG
ATCTATATGCAGTTCTGATGAAATTGATGACATAATGAGCACATTCAAGTTCAAAGATAGTAGGGATTTTTCATACCTTGTTGATGTCTTAACCGAGGCAGGCTTGCATA
GTAGAAACCTGGAGAAGGGTTGTGTTTCATTGCATGGTCAGGAACGCCACGTGATTAGCCCCTTGGTCTTCGAGACCTTAGAGAAGAAGTTTGGGGAACAAGTTTCTTGG
AGGAGATCAGAAAGAAAGCTTCTCTTTGACAGAATAAATTCTGGGTTAGTAGAACTCTTTCAGTCATTTATTGGTGTGCCAGAATGGGCAAAGTCTGTATCAAGAAGATT
TCGGCCATTGCTCGACCAGGAAATGATCGAGGAAGAACTATGGATCCTACTGGATAGCCAAGAAAGGGAAGTGAACAAAGATTTAGTAGATAAGCAGTTTGGAAAGGAGA
TTGGATGGACAGATCTTGGAGGAGAGATTGAGTCTATTTGTAGAGAACTAGAGAGATTGTTGATCATTGAGCTTGTTGCAGAGTTTGGTAGCATCGAATTATTTTGA
Protein sequenceShow/hide protein sequence
MEPRQCTASVLEALMGFDEQQSQHHAPKHSRVLSEGYLQKVASIGIPKKKSPSRCHPFRMTIEEPSELFNSLKVEDNFSRCNELWDGEKADSPLSVAYMPLTRHNFMNEK
HFSTNGVIRTSKDFQDLLEVIDSMDISPRPTREKNYIFNHVKNGPSVSKSHYTLTEGNNDAGTKFKDRKQGQAHSSEDLDILKSSRPFFEWRDKLCCSSSSPTSLKGSHL
VNGKCKDFHSSQNRKHSVEEKERVESILQPTKQPSQVSSILDGSRSRTRHDVVNFHLKTSRSESIYDNVHRKETEYKRTSSPDLSNWKDEYKHSCFSVDEYKHSCFSVES
YKARESREKVIEEQRKTENLMLSTQGRKMNEMPILPHYATLPNELYCKPVKYDFQKPGRLNKEHLHSGSPLCLSCKDKRLDQLSKNSHRLRFHPASTVIARSRTRNRYEA
LRDTWFLKPEGLGTWLQCKPLNRSSNKRDASEPTSKLSSKKLRIFPCPDSASDHVDNDDCMVGDDLKTRVEKKGLCDQHSLNSLSSRSKVVFCTQNNPARRENQAIEHSL
KSDFQDDNFSGTASDVLAVKTDDAEVPTVVNQEPDSLSCILSETDGDSSTNSCRATCTSIQQDGLAFEHYPSKELDSIVSLEEAYQPSPVSVLEPLFKEESFSSSESSGI
NGRDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICSSDEIDDIMSTFKFKDSRDFSYLVDVLTEAGLHSRNLEKGCVSLHGQERHVISPLVFETLEKKFGEQVSW
RRSERKLLFDRINSGLVELFQSFIGVPEWAKSVSRRFRPLLDQEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWTDLGGEIESICRELERLLIIELVAEFGSIELF