; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0771 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0771
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDUF3741 domain-containing protein/DUF4378 domain-containing protein
Genome locationMC02:6115918..6121399
RNA-Seq ExpressionMC02g0771
SyntenyMC02g0771
Gene Ontology termsNA
InterPro domainsIPR025486 - Domain of unknown function DUF4378
IPR044257 - Protein TRM32-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570707.1 Protein TRM32, partial [Cucurbita argyrosperma subsp. sororia]0.062.07Show/hide
Query:  MEFGPGEETRVGTKGARIHQSWV------------------RELGGLLIMFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSR
        MEFGPG+ETR GTK  RIH SWV                   ELG LLIMFKMEKHI+RQDSN+QFNRNV GCFWS+FHT+DYH WHNVKKMLPYR+HS+
Subjt:  MEFGPGEETRVGTKGARIHQSWV------------------RELGGLLIMFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSR

Query:  SKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEVITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRV
        SKGSPKST+N HHVAE  EQI DENK L CT ESCP G   GE +  EVI+K+ +EEESQK  K+N NSKR LSRT+SIHHLE S +             
Subjt:  SKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEVITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRV

Query:  ASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVKETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPK
         S+  +PMEL A G +SNSLNAMD+ +YF+ RKI  + TS  EKSNGV++TLE+KQI RN+SS SFKED+ IQEI+KANRKLFAELL G  + ++LQ P+
Subjt:  ASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVKETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPK

Query:  NPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQPSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWN
        N KSSASLAKSRSFPAPGLARKGY+KLSSL+HKQNESFPK +K  + PQPSKLVESESP NFH      +S  TSS N+  QTSSS  G  RGL+HGGWN
Subjt:  NPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQPSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWN

Query:  QLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAREESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLER
        QLV+KRFNFIK KI+H FKERKK NNQKT K  PT      GHEL  +REE+Q+SLG  T  D   IRGY ET + ENDN   GVQ K   A    SLER
Subjt:  QLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAREESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLER

Query:  YSQLAN-----------SGFNRNT--------------------------EAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHG
        YS+L++           S  N N                           EAKC +S+SLRL  E+ IPN EK +KS+GRNLS+PDIDLFCTLFTDP   
Subjt:  YSQLAN-----------SGFNRNT--------------------------EAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHG

Query:  VSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHLDILDSKHHQILGSE
        VSRTEKPK G+VHS+   N+R + NPA+LL N   SE LDSDSQS+V++SDD  NMPVD +G LN V N+E  AW D+LEEKIPHLD  D +HHQ+LGSE
Subjt:  VSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHLDILDSKHHQILGSE

Query:  SRFDDVSELSDVSQLLEI--------ETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS---NS---
           +DV     V Q+ E+        ETC +DD+  KLSD EG  LN     A+EL+P  DQP   S EAL   E  V  EIID A KISN    NS   
Subjt:  SRFDDVSELSDVSQLLEI--------ETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS---NS---

Query:  -IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYL
         I NADFNYMRYILQLSSFIESGH   R LSSS+F+GEEA FYKKLE YW+KV+KDSDHQLL DLVYET HNVFE+SF  FLK+FS +S+IRPMPLGQYL
Subjt:  -IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYL

Query:  LEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF
        LE+VREKVAWYLCLGPELDQSLDDVVGRDL KGDDWMNLQ++ E+ ALELEDLI+DELL+EVI F
Subjt:  LEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF

XP_022148387.1 uncharacterized protein LOC111017051 [Momordica charantia]0.098.36Show/hide
Query:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVNHHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEVI
        MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVNHHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEVI
Subjt:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVNHHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEVI

Query:  TKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVKE
        TKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVKE
Subjt:  TKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVKE

Query:  TLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQP
        TLETKQI RNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQP
Subjt:  TLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQP

Query:  SKLVESESPTNFHNSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPTGHELLPAREESQDSLGITTRK
        SKLVESESPTNFHNSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPTGHEL PAREESQDSLGITTRK
Subjt:  SKLVESESPTNFHNSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPTGHELLPAREESQDSLGITTRK

Query:  DSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPH
        DSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPH
Subjt:  DSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPH

Query:  GVSRTEKPKAGLVHSNMGNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAWDELEEKIPHLDILDSKHHQILGSES
        GVSRTEKPKAGLVHSNM NIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAWDELEEKIPHLDILDSKHHQILGSES
Subjt:  GVSRTEKPKAGLVHSNMGNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAWDELEEKIPHLDILDSKHHQILGSES

Query:  RFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS-------NSIENADFNY
        RFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISN        +SIEN DFNY
Subjt:  RFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS-------NSIENADFNY

Query:  MRYILQLSSFIESGHVRSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWY
        MRYILQLSSFIESGHVRSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQ LLEEVREKVAWY
Subjt:  MRYILQLSSFIESGHVRSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWY

Query:  LCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF
        LCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF
Subjt:  LCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF

XP_022943817.1 uncharacterized protein LOC111448443 isoform X4 [Cucurbita moschata]0.065.07Show/hide
Query:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV
        MFKMEKHI+RQDSN+QFNR+V GCFWS+FHT+DYH WHNVKKMLPYR+HS+SKGSPKST+N HHVAE  EQI DENK L CT ESCP G   GE +  EV
Subjt:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV

Query:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK
        I+K+ SEEESQ   K+N NSKR LSRT+SIHHLE S +              S+  +PMEL A G +SNSLNAMD+ +YF+ RKI  + TSF E+SNGV+
Subjt:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK

Query:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ
        +TLE+KQI RN+ S SFKED+ IQEI+KANRKLFAELL G  + ++LQ P+N KSSASLAKSRSFPAPGLARKGY+KLSSL+HKQNESFPK +K  + PQ
Subjt:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ

Query:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR
        PSKLVESESP NFH      +S  TSS N+  QTSSS  G  RGL+HGGWNQLV+KRFNFIK KI+H FKERKK NNQKT K  PT      GHEL  +R
Subjt:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR

Query:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPD
        EE+Q+SLG  T  D   IRGY ET + ENDN   GVQ K   ASL A LERYSQ          EAKC +S+SLRL  E+ IPN EK +KS+GRNLS+PD
Subjt:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPD

Query:  IDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHL
        IDLFCTLFTDP   VSRTEKPK G+VHS+   N+R + NPA+LL N   SE LDSDSQS+V++SDD  NMPVD +G LN V N+E  AW D+LEEKIPHL
Subjt:  IDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHL

Query:  DILDSKHHQILGSESRFDDVSELSDVSQLLEI--------ETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEA
        D  D +HHQ+LGSE   +DV     V Q+  +        ETC +DD+  KLSD EG  LN     A+EL+P  DQP   S EAL   E  V  EIID A
Subjt:  DILDSKHHQILGSESRFDDVSELSDVSQLLEI--------ETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEA

Query:  GKISNS---NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFS
         KISN    NS    I NADFNYMRYILQLSSFIESGH   R LSSS+F+GEEA FYKKLE YW+KVDKDSDHQLL DLVYET HNVFE+SF  FLKTFS
Subjt:  GKISNS---NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFS

Query:  WKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF
         +++IRPMPLGQYLLE+VREKVAWYLCLGPELDQSLDDVVGRDL KGDDWMNLQ++ E+ ALELEDLI+DELL+EVI F
Subjt:  WKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF

XP_023511930.1 uncharacterized protein LOC111776795 [Cucurbita pepo subsp. pepo]0.065.53Show/hide
Query:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV
        MFKMEKHI+RQDSN+QFNRNV GCFWS+FHT+DYH WHNVKKMLPYR+HS+SKGSPKST+N HHVAE  EQI DENK L CT ESCP G   GE +  EV
Subjt:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV

Query:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK
        I+K+ SEEESQK  KLN NSKR LSRT+SIHHLE S +               +  +PMEL A G +SNSLNAMD+ +YF+ RKI  + TSF EKSNGV+
Subjt:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK

Query:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ
        +TLE+KQI RN+SS SFKED+ IQEI+KANRKLFAELL G  + ++LQ P+N KSSASLAKSRSFPAPGLARKGY+KLSSL+HK NESFPK +K     Q
Subjt:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ

Query:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR
        PSKLVESESP NFH      +S  TSS N+  QTSSS  G  RGL+HGGWNQLV+KRFNFIK KI+H FKERKK NNQKT K  PT      GHEL  +R
Subjt:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR

Query:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPD
        EE+Q+SLG  T  D   IRGY ET + ENDN   GVQ K   ASL A LERYSQ          EAKC +S+SLRL  E+ IPN EK +KS+GRNLS+PD
Subjt:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPD

Query:  IDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHL
        IDLFCTLFTDP   VSRTEKPK G+VHS+   N+R + NPA+LL N   SE LDSDSQS+V++SDD  NMPVD +G LN V N+E  AW D+LEEKIPHL
Subjt:  IDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHL

Query:  DILDSKHHQILGSESRFDDVSELSDVSQLLEI--------ETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEA
        D  D +HHQ+LGSE   +DV     V Q+ E+        ETCF+DD+  KLSD EG  LN     A+EL+P  DQP   S EAL   E  V  EIID A
Subjt:  DILDSKHHQILGSESRFDDVSELSDVSQLLEI--------ETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEA

Query:  GKISNS---NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFS
         KISN    NS    I NADFNYMRYILQLSSFIESGH   R LSSS+F+GEEA FYKKLE YW+KVDKDSDHQLL DLVYET HNVFE+SF  FLK FS
Subjt:  GKISNS---NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFS

Query:  WKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF
         +S+IRPMPLGQYLLE+VREKVAWYLCLGPELDQSLDDVVGRDL KGDDWMNLQ++ E+ ALELEDLI+DELL+EVI F
Subjt:  WKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF

XP_038902824.1 uncharacterized protein LOC120089436 [Benincasa hispida]0.063.69Show/hide
Query:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV
        MFKMEKHIQRQDSNLQFN+NV GCFWS+F+T+DYH WHNVKKMLPYRKHSRSKG  KST+N HHVAEVPE I DEN++L+CT ESCP  ++S EAH  EV
Subjt:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV

Query:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK
        IT + S EE QK  KL+ NSKRR SRT+SIHHLEPS  S   NGE  DS     Q+ PM+LAASG  SNSLNAMD+ DY   R+IAIR TS  EKSNGVK
Subjt:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK

Query:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ
        +TLET +IIRN+SSRS+KED  IQEI+KANRKLFAELL G  +  +LQ  +N KS ASLAKS SFPAPGLARKGY+KLSSL+HKQ ESFPK +K V+ PQ
Subjt:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ

Query:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVK----LIPTGHELLPAREE
        PSKLVES SP NFH      +S  TS+H++RQQT+SS LG  RGL+HGGWNQLV+KRFNFIK KIKH FKERKK NNQKT K    L P+G EL   REE
Subjt:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVK----LIPTGHELLPAREE

Query:  SQDSLGITTRKDSTDIRGYGE-------------------------------------------TNSYENDNTGGGVQAKMRIASLNASLERYSQLANSG
        + +SLG TT +D +  RGY E                                           T+  ENDN    VQ K R ASL+ASLERYSQL+  G
Subjt:  SQDSLGITTRKDSTDIRGYGE-------------------------------------------TNSYENDNTGGGVQAKMRIASLNASLERYSQLANSG

Query:  FNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQS
        F++N EAK ++S SLRL SE+K+PN E+ +K FGRNLS PDIDLFCTLFTDP H VSRTEKPK GLVHS+   NIRT+ N  +   +   SE LDSDSQ 
Subjt:  FNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQS

Query:  MVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHLDILDSKHHQILGSESRFDD-------VSELSDVSQLLEIETCFQDDDFLKLSDLEGFEL
        M++R +D  NM VD +G LN + N+E  AW D L+E+IPHLDILD KHHQ+ G+E   +D       VSELS V+Q+LE+ TCFQDD+  K SD +G  L
Subjt:  MVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHLDILDSKHHQILGSESRFDD-------VSELSDVSQLLEIETCFQDDDFLKLSDLEGFEL

Query:  NTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS-------NSIENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKL
        N    IA+EL+P  DQ N   TEALP FE  V  EI+D+A KISN          I NADFNYMR+ILQLSSFIE+GH   R L+SS+F+GEEAHFYKKL
Subjt:  NTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS-------NSIENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKL

Query:  EYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFI
        E YW KVDKDSDHQLLLDLV ETLHNV+E+SFICFLKTFS +S+IRP+ LGQYLLEEVRE+V+WYLCLGPELDQSLDDVVGRDL KGDDWMNLQS+TE I
Subjt:  EYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFI

Query:  ALELEDLIIDELLDEVI
        ALELEDLI+DELLDEV+
Subjt:  ALELEDLIIDELLDEVI

TrEMBL top hitse value%identityAlignment
A0A6J1D4X8 uncharacterized protein LOC1110170510.098.36Show/hide
Query:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVNHHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEVI
        MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVNHHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEVI
Subjt:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVNHHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEVI

Query:  TKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVKE
        TKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVKE
Subjt:  TKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVKE

Query:  TLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQP
        TLETKQI RNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQP
Subjt:  TLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQP

Query:  SKLVESESPTNFHNSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPTGHELLPAREESQDSLGITTRK
        SKLVESESPTNFHNSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPTGHEL PAREESQDSLGITTRK
Subjt:  SKLVESESPTNFHNSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPTGHELLPAREESQDSLGITTRK

Query:  DSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPH
        DSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPH
Subjt:  DSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPH

Query:  GVSRTEKPKAGLVHSNMGNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAWDELEEKIPHLDILDSKHHQILGSES
        GVSRTEKPKAGLVHSNM NIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAWDELEEKIPHLDILDSKHHQILGSES
Subjt:  GVSRTEKPKAGLVHSNMGNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAWDELEEKIPHLDILDSKHHQILGSES

Query:  RFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS-------NSIENADFNY
        RFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISN        +SIEN DFNY
Subjt:  RFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS-------NSIENADFNY

Query:  MRYILQLSSFIESGHVRSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWY
        MRYILQLSSFIESGHVRSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQ LLEEVREKVAWY
Subjt:  MRYILQLSSFIESGHVRSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWY

Query:  LCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF
        LCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF
Subjt:  LCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF

A0A6J1FSR2 uncharacterized protein LOC111448443 isoform X30.062.86Show/hide
Query:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV
        MFKMEKHI+RQDSN+QFNR+V GCFWS+FHT+DYH WHNVKKMLPYR+HS+SKGSPKST+N HHVAE  EQI DENK L CT ESCP G   GE +  EV
Subjt:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV

Query:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK
        I+K+ SEEESQ   K+N NSKR LSRT+SIHHLE S +              S+  +PMEL A G +SNSLNAMD+ +YF+ RKI  + TSF E+SNGV+
Subjt:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK

Query:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ
        +TLE+KQI RN+ S SFKED+ IQEI+KANRKLFAELL G  + ++LQ P+N KSSASLAKSRSFPAPGLARKGY+KLSSL+HKQNESFPK +K  + PQ
Subjt:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ

Query:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR
        PSKLVESESP NFH      +S  TSS N+  QTSSS  G  RGL+HGGWNQLV+KRFNFIK KI+H FKERKK NNQKT K  PT      GHEL  +R
Subjt:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR

Query:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLN---------------------------------------ASLERYSQLANSGFN
        EE+Q+SLG  T  D   IRGY ET + ENDN   GVQ K   ASL                                        A LERYSQ       
Subjt:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLN---------------------------------------ASLERYSQLANSGFN

Query:  RNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMV
           EAKC +S+SLRL  E+ IPN EK +KS+GRNLS+PDIDLFCTLFTDP   VSRTEKPK G+VHS+   N+R + NPA+LL N   SE LDSDSQS+V
Subjt:  RNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMV

Query:  QRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHLDILDSKHHQILGSESRFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTEFGIASE
        ++SDD  NMPVD +G LN V N+E  AW D+LEEKIPHLD  D +HHQ+LGSE   +DV     V Q+ E  TC +DD+  KLSD EG  LN     A+E
Subjt:  QRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHLDILDSKHHQILGSESRFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTEFGIASE

Query:  LDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS---NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDK
        L+P  DQP   S EAL   E  V  EIID A KISN    NS    I NADFNYMRYILQLSSFIESGH   R LSSS+F+GEEA FYKKLE YW+KVDK
Subjt:  LDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS---NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDK

Query:  DSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLII
        DSDHQLL DLVYET HNVFE+SF  FLKTFS +++IRPMPLGQYLLE+VREKVAWYLCLGPELDQSLDDVVGRDL KGDDWMNLQ++ E+ ALELEDLI+
Subjt:  DSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLII

Query:  DELLDEVIYF
        DELL+EVI F
Subjt:  DELLDEVIYF

A0A6J1FU32 uncharacterized protein LOC111448443 isoform X40.065.07Show/hide
Query:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV
        MFKMEKHI+RQDSN+QFNR+V GCFWS+FHT+DYH WHNVKKMLPYR+HS+SKGSPKST+N HHVAE  EQI DENK L CT ESCP G   GE +  EV
Subjt:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV

Query:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK
        I+K+ SEEESQ   K+N NSKR LSRT+SIHHLE S +              S+  +PMEL A G +SNSLNAMD+ +YF+ RKI  + TSF E+SNGV+
Subjt:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK

Query:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ
        +TLE+KQI RN+ S SFKED+ IQEI+KANRKLFAELL G  + ++LQ P+N KSSASLAKSRSFPAPGLARKGY+KLSSL+HKQNESFPK +K  + PQ
Subjt:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ

Query:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR
        PSKLVESESP NFH      +S  TSS N+  QTSSS  G  RGL+HGGWNQLV+KRFNFIK KI+H FKERKK NNQKT K  PT      GHEL  +R
Subjt:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR

Query:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPD
        EE+Q+SLG  T  D   IRGY ET + ENDN   GVQ K   ASL A LERYSQ          EAKC +S+SLRL  E+ IPN EK +KS+GRNLS+PD
Subjt:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPD

Query:  IDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHL
        IDLFCTLFTDP   VSRTEKPK G+VHS+   N+R + NPA+LL N   SE LDSDSQS+V++SDD  NMPVD +G LN V N+E  AW D+LEEKIPHL
Subjt:  IDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHL

Query:  DILDSKHHQILGSESRFDDVSELSDVSQLLEI--------ETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEA
        D  D +HHQ+LGSE   +DV     V Q+  +        ETC +DD+  KLSD EG  LN     A+EL+P  DQP   S EAL   E  V  EIID A
Subjt:  DILDSKHHQILGSESRFDDVSELSDVSQLLEI--------ETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEA

Query:  GKISNS---NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFS
         KISN    NS    I NADFNYMRYILQLSSFIESGH   R LSSS+F+GEEA FYKKLE YW+KVDKDSDHQLL DLVYET HNVFE+SF  FLKTFS
Subjt:  GKISNS---NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFS

Query:  WKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF
         +++IRPMPLGQYLLE+VREKVAWYLCLGPELDQSLDDVVGRDL KGDDWMNLQ++ E+ ALELEDLI+DELL+EVI F
Subjt:  WKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF

A0A6J1FYA4 uncharacterized protein LOC111448443 isoform X10.062.31Show/hide
Query:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV
        MFKMEKHI+RQDSN+QFNR+V GCFWS+FHT+DYH WHNVKKMLPYR+HS+SKGSPKST+N HHVAE  EQI DENK L CT ESCP G   GE +  EV
Subjt:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV

Query:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK
        I+K+ SEEESQ   K+N NSKR LSRT+SIHHLE S +              S+  +PMEL A G +SNSLNAMD+ +YF+ RKI  + TSF E+SNGV+
Subjt:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK

Query:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ
        +TLE+KQI RN+ S SFKED+ IQEI+KANRKLFAELL G  + ++LQ P+N KSSASLAKSRSFPAPGLARKGY+KLSSL+HKQNESFPK +K  + PQ
Subjt:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ

Query:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR
        PSKLVESESP NFH      +S  TSS N+  QTSSS  G  RGL+HGGWNQLV+KRFNFIK KI+H FKERKK NNQKT K  PT      GHEL  +R
Subjt:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR

Query:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLN---------------------------------------ASLERYSQLANSGFN
        EE+Q+SLG  T  D   IRGY ET + ENDN   GVQ K   ASL                                        A LERYSQ       
Subjt:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLN---------------------------------------ASLERYSQLANSGFN

Query:  RNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMV
           EAKC +S+SLRL  E+ IPN EK +KS+GRNLS+PDIDLFCTLFTDP   VSRTEKPK G+VHS+   N+R + NPA+LL N   SE LDSDSQS+V
Subjt:  RNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMV

Query:  QRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHLDILDSKHHQILGSESRFDDVSELSDVSQLLEI--------ETCFQDDDFLKLSDLEGFELN
        ++SDD  NMPVD +G LN V N+E  AW D+LEEKIPHLD  D +HHQ+LGSE   +DV     V Q+  +        ETC +DD+  KLSD EG  LN
Subjt:  QRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHLDILDSKHHQILGSESRFDDVSELSDVSQLLEI--------ETCFQDDDFLKLSDLEGFELN

Query:  TEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS---NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLE
             A+EL+P  DQP   S EAL   E  V  EIID A KISN    NS    I NADFNYMRYILQLSSFIESGH   R LSSS+F+GEEA FYKKLE
Subjt:  TEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS---NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLE

Query:  YYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIA
         YW+KVDKDSDHQLL DLVYET HNVFE+SF  FLKTFS +++IRPMPLGQYLLE+VREKVAWYLCLGPELDQSLDDVVGRDL KGDDWMNLQ++ E+ A
Subjt:  YYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIA

Query:  LELEDLIIDELLDEVIYF
        LELEDLI+DELL+EVI F
Subjt:  LELEDLIIDELLDEVIYF

A0A6J1J8H7 uncharacterized protein LOC111484426 isoform X20.065.56Show/hide
Query:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV
        MFKMEKHI+RQDSN+QFNRNV GCFWS+FHT+DYH WHNVKKMLPYR+HS+SKGSPKST+N HHVAE  EQI DENK L CT ESCP G   GE +  EV
Subjt:  MFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVN-HHVAEVPEQIKDENKTLLCTTESCPFGKSSGEAHEKEV

Query:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK
        I+K+ SEEESQK  KLN NSKR LSRT+SIHHLE S +              S+  +PMEL A G +SNSLNAMD+ +YF+ RKI  + TSF EKSNGV+
Subjt:  ITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRSTSFREKSNGVK

Query:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ
        +TLE+KQ      S SFKED+ IQEI+KANRKLFAELL G  + ++LQ P+N KSSASLAKSRSFPAPGLARKGY+KLSSL+HKQNESFPK +K    PQ
Subjt:  ETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ

Query:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR
        PSKL ESESP NFH      +S  TSS N+  QTSSS LG  RG++HGGWNQLV+KRFNFIK KI+H FKERKK NNQKT K  PT      GHEL   R
Subjt:  PSKLVESESPTNFH------NSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPT------GHELLPAR

Query:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPD
        EE+Q+SLG  T  D   IRGY ET + ENDN   GVQ K   ASL A LERYSQ          EAKC +S+SLRL  E+ IPN EK +KS+GRNLS+PD
Subjt:  EESQDSLGITTRKDSTDIRGYGETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPD

Query:  IDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHL
        IDLFCTLFTDP   VSRTEKPK G+VHS+   N+R + NPA+LL N   SE LDSDSQS+V++SDD  NMPVD + CLN V N+E  AW D LEEKIPHL
Subjt:  IDLFCTLFTDPPHGVSRTEKPKAGLVHSNM-GNIRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAW-DELEEKIPHL

Query:  DILDSKHHQILGSESRFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS--
        D  D +HHQ+LGSE   +DV     V Q+ E  TCF+DD+  KLSD EG  LN    +A+EL+P  DQP     EAL   E  V  EIID A KISN   
Subjt:  DILDSKHHQILGSESRFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNS--

Query:  -NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPM
         NS    I NADFNYMRYILQLSSFIESGH   R LSSS+F+GEEA FYKKLE YW+KVDK SDHQLL DLVYET HNVFE+SF  FLKTFS +S+IRPM
Subjt:  -NS----IENADFNYMRYILQLSSFIESGHV--RSLSSSMFDGEEAHFYKKLEYYWKKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPM

Query:  PLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF
        PLGQYLLE+VREKV+WYLCLGPELDQSLDDVVGRDL KGDDWMNLQ++ E+ ALELEDLI+DELL+EVI F
Subjt:  PLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF

SwissProt top hitse value%identityAlignment
F4HSD5 Protein TRM325.0e-1634.07Show/hide
Query:  EGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQ-EIIDEAGKISNSNSI-------ENADFNYMRYILQLSSFIESGHVRSLSSSMFDGEEAHF
        E +E+  E    + LD  + +      +++P    T E+ E +   G   +S  I       E+A F Y++ +L++S F+E+      +   +  EE   
Subjt:  EGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQ-EIIDEAGKISNSNSI-------ENADFNYMRYILQLSSFIESGHVRSLSSSMFDGEEAHF

Query:  YKKLEYYWKKVDKD--SDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLC-LGPE-LDQSLDDVVGRD-LSKGDDWM
           L Y     +++  +D +LL DLV E +      S I F KTF         P G+  L+EV  +V W L  LG E  D+SLDD+VGRD L+K D WM
Subjt:  YKKLEYYWKKVDKD--SDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLC-LGPE-LDQSLDDVVGRD-LSKGDDWM

Query:  NLQSDTEFIALELEDLIIDELLDEVI
        NLQ ++E++ LELEDLI D++LDE++
Subjt:  NLQSDTEFIALELEDLIIDELLDEVI

Arabidopsis top hitse value%identityAlignment
AT1G07620.1 GTP-binding protein Obg/CgtA3.6e-1734.07Show/hide
Query:  EGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQ-EIIDEAGKISNSNSI-------ENADFNYMRYILQLSSFIESGHVRSLSSSMFDGEEAHF
        E +E+  E    + LD  + +      +++P    T E+ E +   G   +S  I       E+A F Y++ +L++S F+E+      +   +  EE   
Subjt:  EGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQ-EIIDEAGKISNSNSI-------ENADFNYMRYILQLSSFIESGHVRSLSSSMFDGEEAHF

Query:  YKKLEYYWKKVDKD--SDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLC-LGPE-LDQSLDDVVGRD-LSKGDDWM
           L Y     +++  +D +LL DLV E +      S I F KTF         P G+  L+EV  +V W L  LG E  D+SLDD+VGRD L+K D WM
Subjt:  YKKLEYYWKKVDKD--SDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLC-LGPE-LDQSLDDVVGRD-LSKGDDWM

Query:  NLQSDTEFIALELEDLIIDELLDEVI
        NLQ ++E++ LELEDLI D++LDE++
Subjt:  NLQSDTEFIALELEDLIIDELLDEVI

AT4G00440.1 Protein of unknown function (DUF3741)2.6e-0731.78Show/hide
Query:  DHQLLLDLVYETLHNVFERSFICFLKTFSWKS-RIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIID
        DH+LL D + E L         C     S+ + R R     + ++ EV+E V W+L   P L  +LD +V +D+++  +W++++ D + I  E  +LI++
Subjt:  DHQLLLDLVYETLHNVFERSFICFLKTFSWKS-RIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIID

Query:  ELLDEVI
        ELL+E+I
Subjt:  ELLDEVI

AT4G00440.2 Protein of unknown function (DUF3741)2.6e-0731.78Show/hide
Query:  DHQLLLDLVYETLHNVFERSFICFLKTFSWKS-RIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIID
        DH+LL D + E L         C     S+ + R R     + ++ EV+E V W+L   P L  +LD +V +D+++  +W++++ D + I  E  +LI++
Subjt:  DHQLLLDLVYETLHNVFERSFICFLKTFSWKS-RIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIID

Query:  ELLDEVI
        ELL+E+I
Subjt:  ELLDEVI

AT4G00440.3 Protein of unknown function (DUF3741)2.6e-0731.78Show/hide
Query:  DHQLLLDLVYETLHNVFERSFICFLKTFSWKS-RIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIID
        DH+LL D + E L         C     S+ + R R     + ++ EV+E V W+L   P L  +LD +V +D+++  +W++++ D + I  E  +LI++
Subjt:  DHQLLLDLVYETLHNVFERSFICFLKTFSWKS-RIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIID

Query:  ELLDEVI
        ELL+E+I
Subjt:  ELLDEVI

AT5G02390.1 Protein of unknown function (DUF3741)5.5e-2623.89Show/hide
Query:  RNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSP--KSTVNHHVAEVPE-----QIKDENKT-------------LLCTTESCPFGKSSGEAHEKE
        +N  G  W +F  + ++ W  +KK LP+++    + +P  K+ VN+ +   P+     +IK E+KT              + + ES   G+ + + H  E
Subjt:  RNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSP--KSTVNHHVAEVPE-----QIKDENKT-------------LLCTTESCPFGKSSGEAHEKE

Query:  VITKK-KSEEESQKDLKLNFNSKRRL---------SRTKSIHH------LEPSGFSLGDNGEIGDSRVASQQRTPMELAASGK--RSNSLNAMDAVDYFS
          +KK  SEE S+K       S + L         S+ K  HH      ++ +        E+G S   S          + K   S ++ +++ +   S
Subjt:  VITKK-KSEEESQKDLKLNFNSKRRL---------SRTKSIHH------LEPSGFSLGDNGEIGDSRVASQQRTPMELAASGK--RSNSLNAMDAVDYFS

Query:  HRKIAIRSTSFR--------EKSNGVKETLETKQIIRNVSS--------------RSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPK-----
               S  FR        + ++G K   + K+  +   S              + F E+ D     ++   L A  L  ++ N  L++ ++P      
Subjt:  HRKIAIRSTSFR--------EKSNGVKETLETKQIIRNVSS--------------RSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPK-----

Query:  --------SSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQPSKLVESESPTNFHNSAGTSSHN--VRQQTSSSYLGLGRGLKHGGW
                SS ++ K+ SFP  G+ R+ +          N++F         P     + +E     H + G    N  V +       G G   K G  
Subjt:  --------SSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQPSKLVESESPTNFHNSAGTSSHN--VRQQTSSSYLGLGRGLKHGGW

Query:  NQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPTGHELLPAREESQDSLGITTRKDSTDIRGYGETNS-YENDNTGGGVQAKM--RIASLNASLERYS
        NQ+VIKRF  ++ KIKH   E K   ++ T+  +    + +P +      L         DI  + + NS  +N+N   G + K   R +SL  SL+RY 
Subjt:  NQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPTGHELLPAREESQDSLGITTRKDSTDIRGYGETNS-YENDNTGGGVQAKM--RIASLNASLERYS

Query:  QLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHGVSRTEKPKAGLVHSNMGNIRTNPNPANLLVNADRSEALD
        QL    F R  +    NSE  ++  E+     +++ K  GR LS+P +        D    V   E+ + GL   N+  I           + D S++  
Subjt:  QLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHGVSRTEKPKAGLVHSNMGNIRTNPNPANLLVNADRSEALD

Query:  SDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAWDELEEKIPHLDILDSKHHQILGSESRFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTE
        S+ +++ Q  D  E+ PVD              A  E + +I  LD+   +H      ES  +  + +S V                   D++  E  + 
Subjt:  SDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAWDELEEKIPHLDILDSKHHQILGSESRFDDVSELSDVSQLLEIETCFQDDDFLKLSDLEGFELNTE

Query:  FGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNSNSIENADFNYMRYILQLSSFIESGHVRSLSSSMFDGE--EAHFYKKLEYYW------
        F I+ +LD                 E     + +DE  ++ N ++ +   FNY+R IL++S F       SLS    D +  +   Y+++          
Subjt:  FGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNSNSIENADFNYMRYILQLSSFIESGHVRSLSSSMFDGE--EAHFYKKLEYYW------

Query:  -------KKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLCLGP-ELDQSLDDVVGRDLSKGDDWMNLQSD
               ++   + +H LL DL+ E L  ++ERS+  + K  S   +I PMP+G  +L+EV  +++ YL   P +  QS D V+ RDLS+ D WM+LQ +
Subjt:  -------KKVDKDSDHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLCLGP-ELDQSLDDVVGRDLSKGDDWMNLQSD

Query:  TEFIALELEDLIIDELLDEVI
        +E + +E+EDLI +ELL+E++
Subjt:  TEFIALELEDLIIDELLDEVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTTGGACCAGGCGAAGAGACTCGTGTTGGGACCAAGGGTGCCAGGATTCACCAATCTTGGGTGCGAGAGCTTGGAGGACTTCTCATCATGTTTAAAATGGAAAA
GCACATCCAGCGCCAGGATTCCAATCTTCAGTTTAACAGAAACGTTCAAGGCTGCTTCTGGAGCATGTTCCATACTGTTGACTACCATGCCTGGCATAATGTTAAAAAGA
TGCTTCCTTACAGAAAGCATTCAAGAAGCAAGGGAAGTCCAAAATCAACTGTTAACCATCATGTTGCCGAAGTTCCAGAACAAATCAAGGATGAAAACAAGACTCTATTG
TGTACAACAGAGAGTTGCCCGTTTGGCAAAAGCTCGGGAGAAGCTCATGAAAAGGAAGTGATTACTAAGAAAAAGTCCGAGGAAGAAAGCCAAAAGGACTTGAAATTGAA
CTTCAATTCAAAACGAAGGTTAAGTCGAACTAAGTCGATTCATCATTTAGAGCCTTCAGGTTTTTCTCTAGGAGATAATGGCGAAATTGGAGACAGCAGAGTCGCTTCTC
AACAGAGAACTCCCATGGAATTAGCTGCATCTGGAAAGAGGAGTAATTCTCTGAATGCCATGGATGCTGTGGACTACTTTAGCCACAGAAAGATTGCTATCCGGTCAACA
TCTTTTAGAGAAAAATCTAATGGAGTAAAGGAAACCTTAGAAACTAAACAGATCATCAGAAACGTCTCCAGTCGCTCGTTTAAGGAAGACCTTGACATCCAAGAGATATA
CAAGGCAAACAGAAAACTATTTGCTGAATTGTTACATGGTGTAGATGCTAACCAGTCTCTCCAGATCCCGAAAAATCCAAAGTCCTCAGCAAGTCTAGCGAAATCAAGGT
CCTTTCCTGCCCCTGGTTTAGCACGGAAAGGATATAGAAAGCTCAGCTCACTCGAACACAAGCAGAATGAATCATTTCCAAAAGGAAAGAAATATGTTGCCGGCCCTCAA
CCATCAAAGCTGGTCGAATCTGAATCTCCTACGAATTTTCATAACTCTGCTGGTACTTCAAGCCATAACGTAAGACAACAAACTAGCTCTTCTTACTTGGGCCTGGGTCG
TGGATTAAAGCATGGGGGGTGGAATCAGTTGGTTATCAAGCGTTTCAATTTTATTAAGCATAAAATAAAGCACCCGTTCAAGGAGAGAAAGAAGGTAAATAACCAGAAAA
CAGTTAAACTAATACCAACTGGACATGAACTTCTCCCTGCTAGAGAAGAATCACAGGACAGTTTAGGAATTACCACAAGGAAAGATAGCACAGACATTAGAGGTTATGGT
GAGACTAACAGTTATGAGAATGATAATACCGGTGGTGGAGTTCAAGCAAAGATGAGAATTGCTTCATTAAATGCTTCGCTGGAAAGATATTCTCAATTGGCCAACTCCGG
TTTCAATAGAAACACAGAGGCAAAGTGCCACAACTCAGAAAGCTTAAGGCTGACAAGTGAAGATAAGATTCCGAATAGAGAGAAGTCTAAAAAATCCTTTGGAAGGAATC
TTTCTATGCCTGATATTGATCTTTTTTGTACACTATTTACTGATCCTCCTCATGGTGTTTCTCGCACAGAAAAACCAAAAGCGGGTTTGGTGCATTCAAATATGGGAAAT
ATTCGTACAAATCCAAATCCGGCCAATCTATTAGTGAATGCAGATAGATCCGAAGCACTGGATAGTGATTCACAGAGTATGGTACAAAGAAGTGATGACATTGAAAACAT
GCCTGTTGATCAAACAGGTTGTTTAAACGTGGTCCATAACAATGAGGTAACTGCTTGGGATGAGCTTGAGGAGAAAATACCTCACCTAGATATTTTAGACAGCAAACACC
ATCAGATATTGGGTAGTGAATCCAGATTCGATGATGTCAGTGAGCTTTCAGACGTCAGTCAACTCTTAGAAATTGAAACTTGTTTTCAAGATGATGACTTTTTAAAGCTC
TCCGACTTAGAAGGTTTTGAACTAAATACTGAGTTCGGTATTGCAAGTGAGCTTGACCCATTCTCTGACCAACCTAACATGGTTAGCACAGAGGCCTTACCAGATTTTGA
AGCCACGGTCGAGCAAGAGATAATTGATGAAGCTGGAAAGATTTCTAACTCGAATAGTATCGAAAATGCTGATTTCAACTATATGAGGTATATTCTTCAGCTCTCCAGCT
TTATCGAAAGTGGTCATGTGAGATCACTCAGTTCTTCGATGTTCGATGGAGAGGAGGCCCACTTTTACAAGAAACTTGAATACTATTGGAAGAAGGTTGACAAAGATTCT
GATCACCAGCTTCTGCTCGATTTAGTTTATGAGACATTACATAATGTTTTTGAAAGATCATTCATTTGTTTCCTCAAAACCTTCTCCTGGAAGAGCCGAATCCGCCCAAT
GCCACTGGGGCAATATCTTCTAGAGGAGGTCCGAGAAAAAGTTGCCTGGTACCTGTGCTTGGGACCGGAACTAGACCAGTCTCTAGATGATGTCGTGGGGCGAGATTTAA
GTAAAGGTGACGATTGGATGAACCTCCAATCTGATACAGAGTTCATAGCTCTTGAGTTGGAGGATTTGATTATTGATGAGCTTTTAGATGAAGTAATATATTTTTAG
mRNA sequenceShow/hide mRNA sequence
GTTTTCAGATACGGCTTCAACATTTTAGGTGATACGGCTGATGGAAGGGAGGAAGGATGAAGAATTGAAAGAGAAAACTAATTTCCATTTCATCCGTGGAGAATGGGGGA
ACGCACACTTGTTTGGTTGCCGAGAAACCGATGGAAAACTTTCAAGCTGAAGGTTTCAGGCTCTGAACATCCAAAATAACAATCACCAACCCAAAGGAACAGCCGAAATA
TAGACAAATTTCAGTTAGGTCAACCGAAAGGGGATTGAATTTGCAACCCATTTCAGTTTCGACGACTTGCGAGGAAATTGGAAGCTCTCAGACTCTGCCCTGAGGTCGAG
GTTATGGAGTTTGGACCAGGCGAAGAGACTCGTGTTGGGACCAAGGGTGCCAGGATTCACCAATCTTGGGTGCGAGAGCTTGGAGGACTTCTCATCATGTTTAAAATGGA
AAAGCACATCCAGCGCCAGGATTCCAATCTTCAGTTTAACAGAAACGTTCAAGGCTGCTTCTGGAGCATGTTCCATACTGTTGACTACCATGCCTGGCATAATGTTAAAA
AGATGCTTCCTTACAGAAAGCATTCAAGAAGCAAGGGAAGTCCAAAATCAACTGTTAACCATCATGTTGCCGAAGTTCCAGAACAAATCAAGGATGAAAACAAGACTCTA
TTGTGTACAACAGAGAGTTGCCCGTTTGGCAAAAGCTCGGGAGAAGCTCATGAAAAGGAAGTGATTACTAAGAAAAAGTCCGAGGAAGAAAGCCAAAAGGACTTGAAATT
GAACTTCAATTCAAAACGAAGGTTAAGTCGAACTAAGTCGATTCATCATTTAGAGCCTTCAGGTTTTTCTCTAGGAGATAATGGCGAAATTGGAGACAGCAGAGTCGCTT
CTCAACAGAGAACTCCCATGGAATTAGCTGCATCTGGAAAGAGGAGTAATTCTCTGAATGCCATGGATGCTGTGGACTACTTTAGCCACAGAAAGATTGCTATCCGGTCA
ACATCTTTTAGAGAAAAATCTAATGGAGTAAAGGAAACCTTAGAAACTAAACAGATCATCAGAAACGTCTCCAGTCGCTCGTTTAAGGAAGACCTTGACATCCAAGAGAT
ATACAAGGCAAACAGAAAACTATTTGCTGAATTGTTACATGGTGTAGATGCTAACCAGTCTCTCCAGATCCCGAAAAATCCAAAGTCCTCAGCAAGTCTAGCGAAATCAA
GGTCCTTTCCTGCCCCTGGTTTAGCACGGAAAGGATATAGAAAGCTCAGCTCACTCGAACACAAGCAGAATGAATCATTTCCAAAAGGAAAGAAATATGTTGCCGGCCCT
CAACCATCAAAGCTGGTCGAATCTGAATCTCCTACGAATTTTCATAACTCTGCTGGTACTTCAAGCCATAACGTAAGACAACAAACTAGCTCTTCTTACTTGGGCCTGGG
TCGTGGATTAAAGCATGGGGGGTGGAATCAGTTGGTTATCAAGCGTTTCAATTTTATTAAGCATAAAATAAAGCACCCGTTCAAGGAGAGAAAGAAGGTAAATAACCAGA
AAACAGTTAAACTAATACCAACTGGACATGAACTTCTCCCTGCTAGAGAAGAATCACAGGACAGTTTAGGAATTACCACAAGGAAAGATAGCACAGACATTAGAGGTTAT
GGTGAGACTAACAGTTATGAGAATGATAATACCGGTGGTGGAGTTCAAGCAAAGATGAGAATTGCTTCATTAAATGCTTCGCTGGAAAGATATTCTCAATTGGCCAACTC
CGGTTTCAATAGAAACACAGAGGCAAAGTGCCACAACTCAGAAAGCTTAAGGCTGACAAGTGAAGATAAGATTCCGAATAGAGAGAAGTCTAAAAAATCCTTTGGAAGGA
ATCTTTCTATGCCTGATATTGATCTTTTTTGTACACTATTTACTGATCCTCCTCATGGTGTTTCTCGCACAGAAAAACCAAAAGCGGGTTTGGTGCATTCAAATATGGGA
AATATTCGTACAAATCCAAATCCGGCCAATCTATTAGTGAATGCAGATAGATCCGAAGCACTGGATAGTGATTCACAGAGTATGGTACAAAGAAGTGATGACATTGAAAA
CATGCCTGTTGATCAAACAGGTTGTTTAAACGTGGTCCATAACAATGAGGTAACTGCTTGGGATGAGCTTGAGGAGAAAATACCTCACCTAGATATTTTAGACAGCAAAC
ACCATCAGATATTGGGTAGTGAATCCAGATTCGATGATGTCAGTGAGCTTTCAGACGTCAGTCAACTCTTAGAAATTGAAACTTGTTTTCAAGATGATGACTTTTTAAAG
CTCTCCGACTTAGAAGGTTTTGAACTAAATACTGAGTTCGGTATTGCAAGTGAGCTTGACCCATTCTCTGACCAACCTAACATGGTTAGCACAGAGGCCTTACCAGATTT
TGAAGCCACGGTCGAGCAAGAGATAATTGATGAAGCTGGAAAGATTTCTAACTCGAATAGTATCGAAAATGCTGATTTCAACTATATGAGGTATATTCTTCAGCTCTCCA
GCTTTATCGAAAGTGGTCATGTGAGATCACTCAGTTCTTCGATGTTCGATGGAGAGGAGGCCCACTTTTACAAGAAACTTGAATACTATTGGAAGAAGGTTGACAAAGAT
TCTGATCACCAGCTTCTGCTCGATTTAGTTTATGAGACATTACATAATGTTTTTGAAAGATCATTCATTTGTTTCCTCAAAACCTTCTCCTGGAAGAGCCGAATCCGCCC
AATGCCACTGGGGCAATATCTTCTAGAGGAGGTCCGAGAAAAAGTTGCCTGGTACCTGTGCTTGGGACCGGAACTAGACCAGTCTCTAGATGATGTCGTGGGGCGAGATT
TAAGTAAAGGTGACGATTGGATGAACCTCCAATCTGATACAGAGTTCATAGCTCTTGAGTTGGAGGATTTGATTATTGATGAGCTTTTAGATGAAGTAATATATTTTTAG
GATTTTTCTGGGTAGATGTTCATACTGTACAGTGGCAATCTCCAAATCTATGTACATAAGTTCTGATTCCAGAAAGCTGGTTCTCCTCCAGCATGAGTTTATAGTGTCAT
GTCCAAGTAAATGAAAAATATAAACAGCAAAATAGATGCAGGGATAGTTTGTTTTTCTTCTTTTTTCTTTCTTCTGAGAATTTTCAATCTCATGTAAATTATTATACATA
CTCTATATTTGTTCACCATAGAAATATGATTTCTTATAGTTACTTGTATGATTCATATCTTCATTTAGAATACTCTTAGTTCACTACAGTTTTTTAGAGTTACAATGAAG
TTTGCAGCAAACTTGGG
Protein sequenceShow/hide protein sequence
MEFGPGEETRVGTKGARIHQSWVRELGGLLIMFKMEKHIQRQDSNLQFNRNVQGCFWSMFHTVDYHAWHNVKKMLPYRKHSRSKGSPKSTVNHHVAEVPEQIKDENKTLL
CTTESCPFGKSSGEAHEKEVITKKKSEEESQKDLKLNFNSKRRLSRTKSIHHLEPSGFSLGDNGEIGDSRVASQQRTPMELAASGKRSNSLNAMDAVDYFSHRKIAIRST
SFREKSNGVKETLETKQIIRNVSSRSFKEDLDIQEIYKANRKLFAELLHGVDANQSLQIPKNPKSSASLAKSRSFPAPGLARKGYRKLSSLEHKQNESFPKGKKYVAGPQ
PSKLVESESPTNFHNSAGTSSHNVRQQTSSSYLGLGRGLKHGGWNQLVIKRFNFIKHKIKHPFKERKKVNNQKTVKLIPTGHELLPAREESQDSLGITTRKDSTDIRGYG
ETNSYENDNTGGGVQAKMRIASLNASLERYSQLANSGFNRNTEAKCHNSESLRLTSEDKIPNREKSKKSFGRNLSMPDIDLFCTLFTDPPHGVSRTEKPKAGLVHSNMGN
IRTNPNPANLLVNADRSEALDSDSQSMVQRSDDIENMPVDQTGCLNVVHNNEVTAWDELEEKIPHLDILDSKHHQILGSESRFDDVSELSDVSQLLEIETCFQDDDFLKL
SDLEGFELNTEFGIASELDPFSDQPNMVSTEALPDFEATVEQEIIDEAGKISNSNSIENADFNYMRYILQLSSFIESGHVRSLSSSMFDGEEAHFYKKLEYYWKKVDKDS
DHQLLLDLVYETLHNVFERSFICFLKTFSWKSRIRPMPLGQYLLEEVREKVAWYLCLGPELDQSLDDVVGRDLSKGDDWMNLQSDTEFIALELEDLIIDELLDEVIYF