CuGenDBv2

ClCG04G004860 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG04G004860
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: DUF3741 domain-containing protein/DUF4378 domain-containing protein
Genome location: CG_Chr04:18324581..18330471
RNA-Seq Expression: ClCG04G004860
Synteny: ClCG04G004860
Gene Ontology terms: NA
InterPro domains: IPR025486 - Domain of unknown function DUF4378
                  IPR044257 - Protein TRM32-like


Homology
GenBank top hits | e value | % identity
XP_008442053.1 PREDICTED: uncharacterized protein LOC103486033 isoform X1 [Cucumis melo] | 0.0e+00 | 76.03
Query:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV
        MFKMEK I RQ SNLQFNKNVPGCFW+IFHT+D+HRWHNV+KMLPYKKHSR+K GPKST NNHH+A+VS QSN+GNNPLMCTAESCP+ RKPGEA +NEV
Subjt:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV

Query:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-GVK
        I +E+ EEESQK+WKL+SSSKRRLIRTQSIHHIE  YYSPGY+ ENGD  IT RQKTP+KLAASGMRS+SL+AMDNEDY IQ    I+L SFT+KS GVK
Subjt:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-GVK

Query:  KTLETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPS
        K LE NK  RNVS RSFK D HIQEIFKANRKLFAELL+GAR KNTL T QNKKSSASLAKS SFPAPG A KGYKKL+SL HKQSES+PKQKSNSP PS
Subjt:  KTLETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPS

Query:  KQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQ
        + VESESPKNFHED++P DS  T SHNI+QQT PS  G+NRGLR GGWNQLVVKRFNFIKQKIR S+KERK+GN+QKTSKGI TV +SGHELP + EEA+
Subjt:  KQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQ

Query:  EESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGII-GYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE
        E    I   TSEN SG RGYSETG  ENDNLSNGVQTKT IASP+ASLERY QLS+GSGII GYSETDNS NDNLSN VQ KTGTASLSASLE YS+LSE
Subjt:  EESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGII-GYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE

Query:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP
         G NKNR+ K Y+SQS RLIS EKI NIE PKK FGRNLS  GIDLFCTLFTD PHAVSRTKKPKRGL HSSTYNNI+ DE  AHLL+ HV KP   DS 
Subjt:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP

Query:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN
         +I++GD+NV VDYS SL EV NDEGTAWV E  +KI H DIS+G+H QVSGSEC VEDVRE +DHV +LSHINQV+E E  FQDDETS L DS G +L+
Subjt:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN

Query:  PRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLE
        P CSI  EL+ SDDQPNE RTEAL   ET VS EIID+ EK S YLHLHS+      ADFNYMRYILQL S I+S H I QPLNSL FE E A+FYKKLE
Subjt:  PRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLE

Query:  CYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYIT
        CYW KVDKDSDHQLLLDLVYETLHN+ E+S    LKTFS   QIRPMPLG+YLLEEV+EKVAWYL LGPELDQCLDDVVGRD+NKGDDWMNLQ ETE+I+
Subjt:  CYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYIT

Query:  VELEDMILDELLDEILSF
        ++LEDMILDELLDE++SF
Subjt:  VELEDMILDELLDEILSF

XP_038882713.1 uncharacterized protein LOC120073877 isoform X1 [Benincasa hispida] | 0.0e+00 | 81.39
Query:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLM--CTAESCPLRRKPGEARIN
        MFKM K+   QDS LQFNKNVPGCFWSIFHTIDYHRW+NV+KMLPYKKHSRSK GPKST NNHH+AEVS+QSN+GNNPLM  CTAESCP+R+KPGEA +N
Subjt:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLM--CTAESCPLRRKPGEARIN

Query:  EVITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-G
        EVITKE+ EE+  K+WKL+SSSKRRLIRTQSIHH+EP Y SPGYNGENGD G+TPRQKTPMKLAASGMRS+SLNAMDNEDYFIQG IAI+L SFTEKS G
Subjt:  EVITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-G

Query:  VKKTLETNKNRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQP
         KKTLETN NRNVS RSFKEDTHIQEIFKANRKLFAELL+GARGKNTL +PQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQ ESFPKQKSN P P
Subjt:  VKKTLETNKNRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQP

Query:  SKQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEA
        SK VESESPKNFHED TPCDS  TSSHNIR+QT PS LG NRGLRHGGWNQLVVKRFN IKQKIRRS KERKKGNNQKTSK ISTV+ S HELP  R++ 
Subjt:  SKQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEA

Query:  QEESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE
         +ESIG A  TSEN SG+R YSETGNSENDN+SNGV TKTGIASPSASLERY QLS+ SGIIGYSETDNS+NDNLSNR QTK GTASLSASL+ YSQLSE
Subjt:  QEESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE

Query:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP
        Y  +KNRE K Y S +LRLI+E+KI NIEKPKK FGRNLSSP IDLFCTLFTD P AVSRTKK KRGLAHSSTYNNIR DE  AH+LS+HVF+P +RDSP
Subjt:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP

Query:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN
         MI+KG++N+ VD+SGSLNEVTNDEGTAWVDEL EK+PH DIS+G+HQQV GSEC VEDVRET+DH  + SHINQV+E E CFQDDETSEL DSEGA+L 
Subjt:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN

Query:  PRCSIANELQPSDDQPNEVRT-EALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKL
          CSIANEL+PSDDQPNE RT  AL T ETIV+DEIID+ EKI NYLHLHSELS +++ADFNYMRYILQL SFIESGH IDQ LN  IF  E AHFYKKL
Subjt:  PRCSIANELQPSDDQPNEVRT-EALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKL

Query:  ECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYI
        ECYWE VD DSDH LLLDLVYETLHNV E+S I FLKTFS TSQIRPMPLG YLLEEVR KVAWYL LGPELDQCLDDVVGRDL+KGDDWMNLQSET+YI
Subjt:  ECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYI

Query:  TVELEDMILDELLDEILSF
        T+ELED+ILDELLDE+LSF
Subjt:  TVELEDMILDELLDEILSF

XP_038882717.1 uncharacterized protein LOC120073877 isoform X2 [Benincasa hispida] | 0.0e+00 | 81.57
Query:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV
        MFKM K+   QDS LQFNKNVPGCFWSIFHTIDYHRW+NV+KMLPYKKHSRSK GPKST NNHH+AEVS+QSN+GNNPLMCTAESCP+R+KPGEA +NEV
Subjt:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV

Query:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-GVK
        ITKE+ EE+  K+WKL+SSSKRRLIRTQSIHH+EP Y SPGYNGENGD G+TPRQKTPMKLAASGMRS+SLNAMDNEDYFIQG IAI+L SFTEKS G K
Subjt:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-GVK

Query:  KTLETNKNRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPSK
        KTLETN NRNVS RSFKEDTHIQEIFKANRKLFAELL+GARGKNTL +PQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQ ESFPKQKSN P PSK
Subjt:  KTLETNKNRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPSK

Query:  QVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQE
         VESESPKNFHED TPCDS  TSSHNIR+QT PS LG NRGLRHGGWNQLVVKRFN IKQKIRRS KERKKGNNQKTSK ISTV+ S HELP  R++  +
Subjt:  QVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQE

Query:  ESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSEYG
        ESIG A  TSEN SG+R YSETGNSENDN+SNGV TKTGIASPSASLERY QLS+ SGIIGYSETDNS+NDNLSNR QTK GTASLSASL+ YSQLSEY 
Subjt:  ESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSEYG

Query:  SNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSPRM
         +KNRE K Y S +LRLI+E+KI NIEKPKK FGRNLSSP IDLFCTLFTD P AVSRTKK KRGLAHSSTYNNIR DE  AH+LS+HVF+P +RDSP M
Subjt:  SNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSPRM

Query:  IQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLNPR
        I+KG++N+ VD+SGSLNEVTNDEGTAWVDEL EK+PH DIS+G+HQQV GSEC VEDVRET+DH  + SHINQV+E E CFQDDETSEL DSEGA+L   
Subjt:  IQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLNPR

Query:  CSIANELQPSDDQPNEVRT-EALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLEC
        CSIANEL+PSDDQPNE RT  AL T ETIV+DEIID+ EKI NYLHLHSELS +++ADFNYMRYILQL SFIESGH IDQ LN  IF  E AHFYKKLEC
Subjt:  CSIANELQPSDDQPNEVRT-EALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLEC

Query:  YWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYITV
        YWE VD DSDH LLLDLVYETLHNV E+S I FLKTFS TSQIRPMPLG YLLEEVR KVAWYL LGPELDQCLDDVVGRDL+KGDDWMNLQSET+YIT+
Subjt:  YWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYITV

Query:  ELEDMILDELLDEILSF
        ELED+ILDELLDE+LSF
Subjt:  ELEDMILDELLDEILSF

XP_038882718.1 uncharacterized protein LOC120073877 isoform X3 [Benincasa hispida] | 0.0e+00 | 81.36
Query:  CTAESCPLRRKPGEARINEVITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYF
        CTAESCP+R+KPGEA +NEVITKE+ EE+  K+WKL+SSSKRRLIRTQSIHH+EP Y SPGYNGENGD G+TPRQKTPMKLAASGMRS+SLNAMDNEDYF
Subjt:  CTAESCPLRRKPGEARINEVITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYF

Query:  IQGNIAIQLTSFTEKS-GVKKTLETNKNRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSL
        IQG IAI+L SFTEKS G KKTLETN NRNVS RSFKEDTHIQEIFKANRKLFAELL+GARGKNTL +PQNKKSSASLAKSRSFPAPGLAGKGYKKLTSL
Subjt:  IQGNIAIQLTSFTEKS-GVKKTLETNKNRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSL

Query:  QHKQSESFPKQKSNSPQPSKQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKG
        QHKQ ESFPKQKSN P PSK VESESPKNFHED TPCDS  TSSHNIR+QT PS LG NRGLRHGGWNQLVVKRFN IKQKIRRS KERKKGNNQKTSK 
Subjt:  QHKQSESFPKQKSNSPQPSKQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKG

Query:  ISTVDASGHELPAYREEAQEESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTK
        ISTV+ S HELP  R++  +ESIG A  TSEN SG+R YSETGNSENDN+SNGV TKTGIASPSASLERY QLS+ SGIIGYSETDNS+NDNLSNR QTK
Subjt:  ISTVDASGHELPAYREEAQEESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTK

Query:  TGTASLSASLEGYSQLSEYGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEI
         GTASLSASL+ YSQLSEY  +KNRE K Y S +LRLI+E+KI NIEKPKK FGRNLSSP IDLFCTLFTD P AVSRTKK KRGLAHSSTYNNIR DE 
Subjt:  TGTASLSASLEGYSQLSEYGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEI

Query:  SAHLLSVHVFKPSSRDSPRMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEIC
         AH+LS+HVF+P +RDSP MI+KG++N+ VD+SGSLNEVTNDEGTAWVDEL EK+PH DIS+G+HQQV GSEC VEDVRET+DH  + SHINQV+E E C
Subjt:  SAHLLSVHVFKPSSRDSPRMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEIC

Query:  FQDDETSELLDSEGAMLNPRCSIANELQPSDDQPNEVRT-EALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQ
        FQDDETSEL DSEGA+L   CSIANEL+PSDDQPNE RT  AL T ETIV+DEIID+ EKI NYLHLHSELS +++ADFNYMRYILQL SFIESGH IDQ
Subjt:  FQDDETSELLDSEGAMLNPRCSIANELQPSDDQPNEVRT-EALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQ

Query:  PLNSLIFEGEVAHFYKKLECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGR
         LN  IF  E AHFYKKLECYWE VD DSDH LLLDLVYETLHNV E+S I FLKTFS TSQIRPMPLG YLLEEVR KVAWYL LGPELDQCLDDVVGR
Subjt:  PLNSLIFEGEVAHFYKKLECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGR

Query:  DLNKGDDWMNLQSETEYITVELEDMILDELLDEILSF
        DL+KGDDWMNLQSET+YIT+ELED+ILDELLDE+LSF
Subjt:  DLNKGDDWMNLQSETEYITVELEDMILDELLDEILSF

XP_038882719.1 uncharacterized protein LOC120073877 isoform X4 [Benincasa hispida] | 0.0e+00 | 81.38
Query:  MCTAESCPLRRKPGEARINEVITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDY
        MCTAESCP+R+KPGEA +NEVITKE+ EE+  K+WKL+SSSKRRLIRTQSIHH+EP Y SPGYNGENGD G+TPRQKTPMKLAASGMRS+SLNAMDNEDY
Subjt:  MCTAESCPLRRKPGEARINEVITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDY

Query:  FIQGNIAIQLTSFTEKS-GVKKTLETNKNRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTS
        FIQG IAI+L SFTEKS G KKTLETN NRNVS RSFKEDTHIQEIFKANRKLFAELL+GARGKNTL +PQNKKSSASLAKSRSFPAPGLAGKGYKKLTS
Subjt:  FIQGNIAIQLTSFTEKS-GVKKTLETNKNRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTS

Query:  LQHKQSESFPKQKSNSPQPSKQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSK
        LQHKQ ESFPKQKSN P PSK VESESPKNFHED TPCDS  TSSHNIR+QT PS LG NRGLRHGGWNQLVVKRFN IKQKIRRS KERKKGNNQKTSK
Subjt:  LQHKQSESFPKQKSNSPQPSKQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSK

Query:  GISTVDASGHELPAYREEAQEESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQT
         ISTV+ S HELP  R++  +ESIG A  TSEN SG+R YSETGNSENDN+SNGV TKTGIASPSASLERY QLS+ SGIIGYSETDNS+NDNLSNR QT
Subjt:  GISTVDASGHELPAYREEAQEESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQT

Query:  KTGTASLSASLEGYSQLSEYGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADE
        K GTASLSASL+ YSQLSEY  +KNRE K Y S +LRLI+E+KI NIEKPKK FGRNLSSP IDLFCTLFTD P AVSRTKK KRGLAHSSTYNNIR DE
Subjt:  KTGTASLSASLEGYSQLSEYGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADE

Query:  ISAHLLSVHVFKPSSRDSPRMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEI
          AH+LS+HVF+P +RDSP MI+KG++N+ VD+SGSLNEVTNDEGTAWVDEL EK+PH DIS+G+HQQV GSEC VEDVRET+DH  + SHINQV+E E 
Subjt:  ISAHLLSVHVFKPSSRDSPRMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEI

Query:  CFQDDETSELLDSEGAMLNPRCSIANELQPSDDQPNEVRT-EALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAID
        CFQDDETSEL DSEGA+L   CSIANEL+PSDDQPNE RT  AL T ETIV+DEIID+ EKI NYLHLHSELS +++ADFNYMRYILQL SFIESGH ID
Subjt:  CFQDDETSELLDSEGAMLNPRCSIANELQPSDDQPNEVRT-EALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAID

Query:  QPLNSLIFEGEVAHFYKKLECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVG
        Q LN  IF  E AHFYKKLECYWE VD DSDH LLLDLVYETLHNV E+S I FLKTFS TSQIRPMPLG YLLEEVR KVAWYL LGPELDQCLDDVVG
Subjt:  QPLNSLIFEGEVAHFYKKLECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVG

Query:  RDLNKGDDWMNLQSETEYITVELEDMILDELLDEILSF
        RDL+KGDDWMNLQSET+YIT+ELED+ILDELLDE+LSF
Subjt:  RDLNKGDDWMNLQSETEYITVELEDMILDELLDEILSF

TrEMBL top hits | e value | % identity
A0A0A0L1J3 DUF4378 domain-containing protein | 0.0e+00 | 73.64
Query:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV
        MFKMEK I RQ SNLQFNKNVPGCFW+IFHTIDYHRWHNV+K LPYKKHSR+K GPKST N+H + +VS+QSN+GN+PL+CTAESCP+ RKPGEA +NEV
Subjt:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV

Query:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-GVK
        + +E+ EEESQK+WK +SSSKRRLIRTQSIHHIE  YYSPGY+ ENGD GIT RQK+P+KLAASGMRSVSL+AMDNEDYFIQ  I IQL S T+KS GVK
Subjt:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-GVK

Query:  KTLETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPS
        K LE NK NRNVS RSFK D HIQEIFKANRKLFAELL+GA  KNTL T QNKKSSASLAKS SFPAP  A KGY+KL+SL+HKQSES+PKQKSNSP PS
Subjt:  KTLETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPS

Query:  KQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQ
        K VES+SP+NFHED+TPCDS  T SHNI+ QT PS  G+N GLRHGGWNQLVVKRFNFIKQKIR S+KERK+GN+QKTSKGI TV + GHELP + EEAQ
Subjt:  KQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQ

Query:  EESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGII-GYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE
        E            G+  RG+SETGNSENDNLSNGVQTKT IASP ASLERY Q ++GSGI+ GYSETDNS NDNL+N VQTKTGTASLSASLE YS+LSE
Subjt:  EESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGII-GYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE

Query:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP
         G +KNR+ K  +SQS RLIS EKI NIE PKK FGR+LS  GIDLFC LFTD PHAVSRTKKPKRGLAHSSTYNNIR DE   HLL+ HV  P   DS 
Subjt:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP

Query:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN
         +I++GD+NV VDYS SLNEV NDEG AWV E  +KI H DIS+G+H QVSGSEC VEDVRE +DHV +LSHINQV+E + CFQDDETS+L DS G +L+
Subjt:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN

Query:  PRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLE
        P CSI  EL+ S+ QPNE RTE L   ET VS EIID+ +K   YLHLHS+      ADFNYMRYILQL SFI+S H IDQPLNS IFEGE A FY+KLE
Subjt:  PRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLE

Query:  CYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYIT
        CYW KVDKDSDHQLL DLVYETLHN+ E+S +  LKTFS  SQIRPMPLG+YLLEEV+EK+AWYL LGPELDQCLDDVVGRDLNKGDDWMNL  ETE+I 
Subjt:  CYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYIT

Query:  VELEDMILDELLDEILSF
        ++LEDMILDELLDE++S+
Subjt:  VELEDMILDELLDEILSF

A0A1S3B4T4 uncharacterized protein LOC103486033 isoform X1 | 0.0e+00 | 76.03
Query:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV
        MFKMEK I RQ SNLQFNKNVPGCFW+IFHT+D+HRWHNV+KMLPYKKHSR+K GPKST NNHH+A+VS QSN+GNNPLMCTAESCP+ RKPGEA +NEV
Subjt:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV

Query:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-GVK
        I +E+ EEESQK+WKL+SSSKRRLIRTQSIHHIE  YYSPGY+ ENGD  IT RQKTP+KLAASGMRS+SL+AMDNEDY IQ    I+L SFT+KS GVK
Subjt:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-GVK

Query:  KTLETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPS
        K LE NK  RNVS RSFK D HIQEIFKANRKLFAELL+GAR KNTL T QNKKSSASLAKS SFPAPG A KGYKKL+SL HKQSES+PKQKSNSP PS
Subjt:  KTLETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPS

Query:  KQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQ
        + VESESPKNFHED++P DS  T SHNI+QQT PS  G+NRGLR GGWNQLVVKRFNFIKQKIR S+KERK+GN+QKTSKGI TV +SGHELP + EEA+
Subjt:  KQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQ

Query:  EESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGII-GYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE
        E    I   TSEN SG RGYSETG  ENDNLSNGVQTKT IASP+ASLERY QLS+GSGII GYSETDNS NDNLSN VQ KTGTASLSASLE YS+LSE
Subjt:  EESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGII-GYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE

Query:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP
         G NKNR+ K Y+SQS RLIS EKI NIE PKK FGRNLS  GIDLFCTLFTD PHAVSRTKKPKRGL HSSTYNNI+ DE  AHLL+ HV KP   DS 
Subjt:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP

Query:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN
         +I++GD+NV VDYS SL EV NDEGTAWV E  +KI H DIS+G+H QVSGSEC VEDVRE +DHV +LSHINQV+E E  FQDDETS L DS G +L+
Subjt:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN

Query:  PRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLE
        P CSI  EL+ SDDQPNE RTEAL   ET VS EIID+ EK S YLHLHS+      ADFNYMRYILQL S I+S H I QPLNSL FE E A+FYKKLE
Subjt:  PRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLE

Query:  CYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYIT
        CYW KVDKDSDHQLLLDLVYETLHN+ E+S    LKTFS   QIRPMPLG+YLLEEV+EKVAWYL LGPELDQCLDDVVGRD+NKGDDWMNLQ ETE+I+
Subjt:  CYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYIT

Query:  VELEDMILDELLDEILSF
        ++LEDMILDELLDE++SF
Subjt:  VELEDMILDELLDEILSF

A0A1S3CMA3 uncharacterized protein LOC103502086 | 0.0e+00 | 72.3
Query:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV
        MFKMEKHI RQDSNLQFNKNVPGCFWSIFHTIDYH WHNV+KMLP++KHSRSK  PKSTLN HH AE+ D         MC+ ESCP+ RKP  A +NEV
Subjt:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV

Query:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEK-SGVK
        IT  L EEESQKYWKL SSSKRRL RTQSIHH+EP +YSPGYNGE GD      QK  MKL ASG+RS SL+A+D+ DY  Q  IAI  TS TEK SGVK
Subjt:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEK-SGVK

Query:  KTLETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPK-QKSNSPQP
        KTLETN+ NRNVS RSFKED+H+QEIFKANRKLFAELL+GA  KNTL TPQNKKSSASLAKSRSFPAPGLA KGYKKL+SLQHKQ E+FPK QKS S QP
Subjt:  KTLETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPK-QKSNSPQP

Query:  SKQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEA
        SK VES SPKNFHED+ PCDS  T+ HNI+Q T+ S LG NRG +HGGWNQLVVKRFNFIKQKIR S KERKKGNNQKTSKGIS  D SGHEL  Y EEA
Subjt:  SKQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEA

Query:  QEESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE
          ES+G A  TSE+GSG+RGYSET  S +D LSN  QTKTGI S  AS ER  QLS GSG IG S TD+S+N+NLS+RVQT+TGTASLSASLE YSQLS 
Subjt:  QEESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE

Query:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP
        Y  +KNRE K Y+SQS+RLISEEKI N+E P+K FGRNLSSP IDLFCTLFTD PHAVSRT+KPKRGL HSST NNIR DE   H L+ H+ +P   DS 
Subjt:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP

Query:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN
         MI++GD+N+ +DYS SLNE+T DEGT W D L EKIPH DISDG+H QV G+E  VEDV  T+D    LSH  QV+E + CFQDDETS+L DSEGA++N
Subjt:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN

Query:  PRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLE
        PRCS+ANE + SDDQ NE  TEAL   ET V   IID+TEKISN+L+LHSEL  + NA+FNYMR+ILQL SFIE G  ID+PLN  IFEGE AHFYKKLE
Subjt:  PRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLE

Query:  CYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYIT
        CYWEKVDKDSDHQLLLDLVYETLHN+ E S  CFLKTFS  SQIRPMPLG+YLLE+VREKV+WYL LGPELDQ LDDVV RDL KG++WMNLQSETE I 
Subjt:  CYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYIT

Query:  VELEDMILDELLDEILS
        +ELED+ILDELLDE++S
Subjt:  VELEDMILDELLDEILS

A0A5A7V3N4 Protein TRM32 isoform X1 | 0.0e+00 | 72.21
Query:  MEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEVITK
        MEKHI RQDSNLQFNKNVPGCFWSIFHTIDYH WHNV+KMLP++KHSRSK  PKSTLN HH AE+ D         MC+ ESCP+ RKP  A +NEVIT 
Subjt:  MEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEVITK

Query:  ELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEK-SGVKKTL
         L EEESQKYWKL SSSKRRL RTQSIHH+EP +YSPGYNGE GD      QK  MKL ASG+RS SL+A+D+ DY  Q  IAI  TS TEK SGVKKTL
Subjt:  ELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEK-SGVKKTL

Query:  ETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPK-QKSNSPQPSKQ
        ETN+ NRNVS RSFKED+H+QEIFKANRKLFAELL+GA  KNTL TPQNKKSSASLAKSRSFPAPGLA KGYKKL+SLQHKQ E+FPK QKS S QPSK 
Subjt:  ETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPK-QKSNSPQPSKQ

Query:  VESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQEE
        VES SPKNFHED+ PCDS  T+ HNI+Q T+ S LG NRG +HGGWNQLVVKRFNFIKQKIR S KERKKGNNQKTSKGIS  D SGHEL  Y EEA  E
Subjt:  VESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQEE

Query:  SIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSEYGS
        S+G A  TSE+GSG+RGYSET  S +D LSN  QTKTGI S  AS ER  QLS GSG IG S TD+S+N+NLS+RVQT+TGTASLSASLE YSQLS Y  
Subjt:  SIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSEYGS

Query:  NKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSPRMI
        +KNRE K Y+SQS+RLISEEKI N+E P+K FGRNLSSP IDLFCTLFTD PHAVSRT+KPKRGL HSST NNIR DE   H L+ H+ +P   DS  MI
Subjt:  NKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSPRMI

Query:  QKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLNPRC
        ++GD+N+ +DYS SLNE+T DEGT W D L EKIPH DISDG+H QV G+E  VEDV  T+D    LSH  QV+E + CFQDDETS+L DSEGA++NPRC
Subjt:  QKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLNPRC

Query:  SIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLECYW
        S+ANE + SDDQ NE  TEAL   ET V   IID+TEKISN+L+LHSEL  + NA+FNYMR+ILQL SFIE G  ID+PLN  IFEGE AHFYKKLECYW
Subjt:  SIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLECYW

Query:  EKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYITVEL
        EKVDKDSDHQLLLDLVYETLHN+ E S  CFLKTFS  SQIRPMPLG+YLLE+VREKV+WYL LGPELDQ LDDVV RDL KG++WMNLQSETE I +EL
Subjt:  EKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYITVEL

Query:  EDMILDELLDEILS
        ED+ILDELLDE++S
Subjt:  EDMILDELLDEILS

A0A5D3C4U3 DUF3741 domain-containing protein/DUF4378 domain-containing protein | 0.0e+00 | 76.03
Query:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV
        MFKMEK I RQ SNLQFNKNVPGCFW+IFHT+D+HRWHNV+KMLPYKKHSR+K GPKST NNHH+A+VS QSN+GNNPLMCTAESCP+ RKPGEA +NEV
Subjt:  MFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEV

Query:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-GVK
        I +E+ EEESQK+WKL+SSSKRRLIRTQSIHHIE  YYSPGY+ ENGD  IT RQKTP+KLAASGMRS+SL+AMDNEDY IQ    I+L SFT+KS GVK
Subjt:  ITKELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKS-GVK

Query:  KTLETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPS
        K LE NK  RNVS RSFK D HIQEIFKANRKLFAELL+GAR KNTL T QNKKSSASLAKS SFPAPG A KGYKKL+SL HKQSES+PKQKSNSP PS
Subjt:  KTLETNK-NRNVSARSFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPS

Query:  KQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQ
        + VESESPKNFHED++P DS  T SHNI+QQT PS  G+NRGLR GGWNQLVVKRFNFIKQKIR S+KERK+GN+QKTSKGI TV +SGHELP + EEA+
Subjt:  KQVESESPKNFHEDLTPCDSVGTSSHNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQ

Query:  EESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGII-GYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE
        E    I   TSEN SG RGYSETG  ENDNLSNGVQTKT IASP+ASLERY QLS+GSGII GYSETDNS NDNLSN VQ KTGTASLSASLE YS+LSE
Subjt:  EESIGIAITTSENGSGMRGYSETGNSENDNLSNGVQTKTGIASPSASLERYFQLSNGSGII-GYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSE

Query:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP
         G NKNR+ K Y+SQS RLIS EKI NIE PKK FGRNLS  GIDLFCTLFTD PHAVSRTKKPKRGL HSSTYNNI+ DE  AHLL+ HV KP   DS 
Subjt:  YGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDLFCTLFTD-PHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSP

Query:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN
         +I++GD+NV VDYS SL EV NDEGTAWV E  +KI H DIS+G+H QVSGSEC VEDVRE +DHV +LSHINQV+E E  FQDDETS L DS G +L+
Subjt:  RMIQKGDENV-VDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVEDVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLN

Query:  PRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLE
        P CSI  EL+ SDDQPNE RTEAL   ET VS EIID+ EK S YLHLHS+      ADFNYMRYILQL S I+S H I QPLNSL FE E A+FYKKLE
Subjt:  PRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQLCSFIESGHAIDQPLNSLIFEGEVAHFYKKLE

Query:  CYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYIT
        CYW KVDKDSDHQLLLDLVYETLHN+ E+S    LKTFS   QIRPMPLG+YLLEEV+EKVAWYL LGPELDQCLDDVVGRD+NKGDDWMNLQ ETE+I+
Subjt:  CYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYIT

Query:  VELEDMILDELLDEILSF
        ++LEDMILDELLDE++SF
Subjt:  VELEDMILDELLDEILSF

SwissProt top hits | e value | % identity
F4HSD5 Protein TRM32 | 9.8e-19 | 37.57
Query:  LHLHSELSTVENADFNYMRYILQLCSFIES------GHAIDQPLN-SLIFEGEVAHFYKKLECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTF
        L ++      E+A F Y++ +L++  F+E+       ++ +QPLN SL++E ++           ++ +  +D +LL DLV E +      S I F KTF
Subjt:  LHLHSELSTVENADFNYMRYILQLCSFIES------GHAIDQPLN-SLIFEGEVAHFYKKLECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTF

Query:  SWTSQIRPMPLGRYLLEEVREKVAWYLC-LGPE-LDQCLDDVVGRD-LNKGDDWMNLQSETEYITVELEDMILDELLDEIL
                 P G+  L+EV  +V W L  LG E  D+ LDD+VGRD L K D WMNLQ E+E++T+ELED+I D++LDE+L
Subjt:  SWTSQIRPMPLGRYLLEEVREKVAWYLC-LGPE-LDQCLDDVVGRD-LNKGDDWMNLQSETEYITVELEDMILDELLDEIL

Arabidopsis top hits    e value    % identity    Alignment
AT1G07620.1 GTP-binding protein Obg/CgtA    6.9e-20    37.57
Query:  LHLHSELSTVENADFNYMRYILQLCSFIES------GHAIDQPLN-SLIFEGEVAHFYKKLECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTF
        L ++      E+A F Y++ +L++  F+E+       ++ +QPLN SL++E ++           ++ +  +D +LL DLV E +      S I F KTF
Subjt:  LHLHSELSTVENADFNYMRYILQLCSFIES------GHAIDQPLN-SLIFEGEVAHFYKKLECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTF

Query:  SWTSQIRPMPLGRYLLEEVREKVAWYLC-LGPE-LDQCLDDVVGRD-LNKGDDWMNLQSETEYITVELEDMILDELLDEIL
                 P G+  L+EV  +V W L  LG E  D+ LDD+VGRD L K D WMNLQ E+E++T+ELED+I D++LDE+L
Subjt:  SWTSQIRPMPLGRYLLEEVREKVAWYLC-LGPE-LDQCLDDVVGRD-LNKGDDWMNLQSETEYITVELEDMILDELLDEIL

AT2G45900.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related    2.3e-07    33.93
Query:  SDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRP----MPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYITVELED
        SD +LL D + E L   C            W S ++P     P     +E V+E+V W+L   P     LD +V +DL +  +WM+L+ +   I  E  +
Subjt:  SDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRP----MPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYITVELED

Query:  MILDELLDEILS
        +ILDELL+EI+S
Subjt:  MILDELLDEILS

AT4G00440.1 Protein of unknown function (DUF3741)    2.3e-07    30.19
Query:  DHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYITVELEDMILDE
        DH+LL D + E L  +C     C       T + R     + ++ EV+E V W+L   P L   LD +V +D+ +  +W++++ + + I  E  ++IL+E
Subjt:  DHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYITVELEDMILDE

Query:  LLDEIL
        LL+E++
Subjt:  LLDEIL

AT4G00440.2 Protein of unknown function (DUF3741)    2.3e-07    30.19
Query:  DHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYITVELEDMILDE
        DH+LL D + E L  +C     C       T + R     + ++ EV+E V W+L   P L   LD +V +D+ +  +W++++ + + I  E  ++IL+E
Subjt:  DHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDVVGRDLNKGDDWMNLQSETEYITVELEDMILDE

Query:  LLDEIL
        LL+E++
Subjt:  LLDEIL

AT5G02390.1 Protein of unknown function (DUF3741)    1.9e-22    30.42
Query:  VEDVRETIDHVDNLSHI----NQVVEHEICFQDDETSE--------LLDSEGAMLNPRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISN
        V+ + E  D +DN+S I    +Q  EHE   Q  + SE          D E + L+      +  + S++ PN V T  +  + ++       +TE +S 
Subjt:  VEDVRETIDHVDNLSHI----NQVVEHEICFQDDETSE--------LLDSEGAMLNPRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISN

Query:  YLHLHSELSTVENAD---FNYMRYILQLCSF-----IESGHAIDQPLNSLIFE----GEVAHFYKKLECYW-EKVDKDSDHQLLLDLVYETLHNVCESSL
           L  E+  ++  D   FNY+R IL++  F     +       QPL+ L++E          ++  EC   E+   + +H LL DL+ E L  + E S 
Subjt:  YLHLHSELSTVENAD---FNYMRYILQLCSF-----IESGHAIDQPLNSLIFE----GEVAHFYKKLECYW-EKVDKDSDHQLLLDLVYETLHNVCESSL

Query:  ICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGP-ELDQCLDDVVGRDLNKGDDWMNLQSETEYITVELEDMILDELLDEIL
          + K  S   +I PMP+G  +L+EV  +++ YL   P +  Q  D V+ RDL++ D WM+LQ E+E + +E+ED+I +ELL+E+L
Subjt:  ICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGP-ELDQCLDDVVGRDLNKGDDWMNLQSETEYITVELEDMILDELLDEIL


Sequences
CDS sequence
ATGTGTTCATCTTACGAAGACTGTGCCCTGAGGTCGAGGTTATGGAACCTGGACCAGGCGAAGAGACTCGTTCTGGGACCAACTGGACAAGGAGTTATCAATCTTGGGTG
CTGTGGGTCTCCATGTTTCTGCTGTATCCTTATGAACACAAATCAAAATGTTGTGGTAATCAAAAAGAATAAAGCGATAATGTCTCTCCTTGGTCATAGTCATTTCAAGA
AAAACCTTCGAGTTTCTTGTTTTCTCCTATGTTTAGATGGAACTTGTTGGCTTCTATTTCATCTCCAGGAGCCACCTTCTTCTGAACCTGTTATAACCTGTGAAACTTTA
GAGCTTGGAAGACTTCTCAACATGTTTAAAATGGAAAAGCACATCCACCGCCAGGACTCCAATCTGCAGTTTAACAAGAATGTTCCAGGCTGCTTCTGGAGCATATTCCA
TACTATTGACTACCATCGCTGGCATAATGTTAGAAAGATGCTTCCTTACAAAAAGCATTCAAGAAGCAAGAGAGGTCCAAAATCAACTCTGAACAACCACCACATTGCCG
AAGTGTCAGATCAAAGTAACAATGGAAACAACCCTCTAATGTGTACCGCAGAGAGTTGTCCTCTTCGCAGAAAACCTGGAGAAGCCCGTATAAATGAAGTGATAACTAAA
GAGCTGTTAGAGGAAGAAAGCCAAAAATATTGGAAATTGGATTCCAGTTCAAAACGAAGGTTGATTCGAACACAGTCCATACATCATATAGAGCCTTTGTACTATTCTCC
AGGTTATAATGGTGAAAATGGAGATGGCGGAATCACTCCTCGACAGAAAACTCCAATGAAATTAGCTGCATCTGGAATGAGGAGTGTTTCTCTGAATGCCATGGATAATG
AGGACTACTTTATCCAGGGAAACATTGCTATCCAATTGACATCTTTTACAGAAAAATCTGGAGTAAAGAAAACCTTAGAAACTAACAAGAACAGAAACGTCTCCGCTCGC
TCATTTAAGGAAGACACTCACATCCAAGAGATATTTAAGGCAAATAGAAAACTATTTGCTGAATTATTACGGGGTGCACGTGGTAAGAACACTCTCCTAACCCCGCAAAA
TAAGAAGTCCTCAGCAAGTCTAGCGAAATCAAGGTCCTTTCCTGCTCCTGGTTTAGCAGGAAAAGGATACAAAAAGCTTACCTCACTCCAACACAAGCAGAGCGAGTCCT
TTCCAAAACAAAAATCTAATTCTCCCCAGCCATCAAAGCAGGTTGAATCTGAATCTCCAAAGAATTTTCATGAAGATTTGACACCTTGTGATTCTGTTGGTACTTCAAGC
CATAACATAAGACAACAAACAAACCCTTCTTTTTTGGGCATGAATCGTGGACTAAGGCATGGGGGGTGGAATCAGTTGGTTGTCAAGCGTTTCAATTTTATTAAGCAGAA
AATAAGGCGCTCAATCAAGGAGCGGAAGAAGGGAAATAACCAGAAAACATCTAAAGGAATATCAACTGTGGATGCCTCTGGACATGAACTTCCCGCTTACAGAGAAGAGG
CGCAGGAGGAAAGTATAGGAATTGCCATAACCACAAGTGAAAATGGCTCAGGCATGAGAGGATACAGTGAGACTGGTAATTCTGAGAACGATAATCTCAGTAATGGAGTT
CAAACCAAGACAGGAATTGCTTCACCAAGTGCTTCTCTGGAAAGATATTTTCAACTTTCTAATGGCTCAGGCATTATTGGATACAGTGAGACTGACAATTCTGATAATGA
TAATCTCAGTAACAGAGTTCAAACCAAGACTGGAACTGCTTCATTAAGTGCTTCCCTGGAAGGATATTCTCAACTGTCCGAGTACGGTTCCAATAAAAACAGAGAGGGAA
AGTTTTACTACTCACAAAGCTTAAGGCTGATAAGTGAAGAAAAGATTCTGAATATAGAGAAGCCTAAAAAAGCCTTTGGAAGGAATCTTTCTTCGCCTGGTATTGATCTC
TTTTGTACATTATTTACTGACCCTCATGCTGTTTCTCGCACAAAAAAACCAAAGAGGGGTTTGGCGCATTCGAGTACATATAATAATATTCGAGCAGATGAGATTTCAGC
CCATCTATTAAGCGTACATGTATTTAAACCGTCGAGTAGAGATTCACCAAGAATGATACAAAAAGGTGATGAAAACGTTGTTGATTATTCAGGTAGTTTAAACGAGGTCA
CAAATGATGAGGGGACTGCCTGGGTAGATGAGCTCAATGAGAAAATACCTCACTTCGATATATCAGATGGTAGACACCAACAAGTATCGGGTAGTGAATGTAGAGTTGAA
GATGTCAGGGAGACCATTGATCATGTCGACAATCTTTCACACATCAATCAAGTCGTAGAACATGAAATTTGTTTTCAAGATGATGAAACTTCGGAGCTCTTGGACTCGGA
AGGTGCAATGCTAAATCCTAGGTGCAGTATTGCAAATGAGCTTCAACCTTCTGATGACCAACCTAACGAGGTCAGGACAGAAGCTTTACTAACTTCTGAAACCATTGTCA
GTGATGAGATAATTGATAATACTGAAAAGATTTCTAACTATCTCCATCTGCATTCCGAACTTAGCACAGTCGAAAATGCCGACTTCAACTATATGAGGTATATTCTTCAG
CTTTGTAGCTTTATCGAAAGTGGTCACGCAATAGACCAACCACTTAACTCTTTGATATTTGAGGGAGAGGTGGCTCATTTTTACAAAAAACTTGAATGCTATTGGGAAAA
GGTTGACAAAGATTCTGATCACCAACTTCTGCTTGATTTAGTTTATGAGACATTACATAATGTATGTGAAAGCTCACTCATTTGTTTCCTCAAAACCTTCTCCTGGACGA
GCCAAATCCGTCCAATGCCGCTTGGGCGATATCTTCTTGAGGAGGTTCGAGAAAAAGTTGCCTGGTACCTGTGCTTGGGACCAGAACTAGACCAATGTTTAGATGATGTG
GTGGGCCGAGATTTAAATAAAGGCGATGATTGGATGAACCTTCAATCTGAAACTGAGTACATAACAGTTGAGTTAGAGGATATGATTCTTGATGAGCTTTTAGATGAAAT
ACTAAGTTTTTAG
mRNA sequence
ATTTTCGAGAATGAAGCTAATGTGAGGCAGCACGAAGAATTGCAAGAAAACCCTAAAGATCTTGTTTGATCCATCCTCAAAGTATGGAGGAGCAAACACTTGTTTGGTTG
CCAAGAAGCAAACAAACCACAACCAACCCCAAGAAAGTAAACGAAATTCAGACAAATTTCAGTTGGGTCAACCGGAAAACGATTGGATTTGGAACCCATTTCAATTTTGA
AGACATCGCGAACAAATTGGAAGGTCTCAGGTCAGTCTAGCAGCTCAATATTTCTGTACCTAATTTTGAAGAAGAAAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAG
AAGAAGAAGATGTTTTCCTGTTTTTTCAAATTTCCCATTTAGTTATGATTTCTATAAAATGGGACAGCCTAGTCTTTTACATGCTTGCCTATTATCAAATTTGTTCCTTC
TAACCACTGATTATTAGATTATTGTGAATCCCCAAATTCCAATCACATCATATTTCTACTTTTTATTCGATTGATATTCAAAGGATCAAGCCTTCCCATTCTATGAACCC
TATTAAATTTCTGTGTTCAACAACTTGAGAGAATAGGAACCATGAATTCTGGGTGGAATTGGCTATATCTTCATGTGGTAACAGTTTTTGGAACTTCAATGATTTTGCCA
CCAACTCTTTAATCTCTCTCTGTAAATTATTGAACTGGTTAACTAACCAATGTGTTCATCTTACGAAGACTGTGCCCTGAGGTCGAGGTTATGGAACCTGGACCAGGCGA
AGAGACTCGTTCTGGGACCAACTGGACAAGGAGTTATCAATCTTGGGTGCTGTGGGTCTCCATGTTTCTGCTGTATCCTTATGAACACAAATCAAAATGTTGTGGTAATC
AAAAAGAATAAAGCGATAATGTCTCTCCTTGGTCATAGTCATTTCAAGAAAAACCTTCGAGTTTCTTGTTTTCTCCTATGTTTAGATGGAACTTGTTGGCTTCTATTTCA
TCTCCAGGAGCCACCTTCTTCTGAACCTGTTATAACCTGTGAAACTTTAGAGCTTGGAAGACTTCTCAACATGTTTAAAATGGAAAAGCACATCCACCGCCAGGACTCCA
ATCTGCAGTTTAACAAGAATGTTCCAGGCTGCTTCTGGAGCATATTCCATACTATTGACTACCATCGCTGGCATAATGTTAGAAAGATGCTTCCTTACAAAAAGCATTCA
AGAAGCAAGAGAGGTCCAAAATCAACTCTGAACAACCACCACATTGCCGAAGTGTCAGATCAAAGTAACAATGGAAACAACCCTCTAATGTGTACCGCAGAGAGTTGTCC
TCTTCGCAGAAAACCTGGAGAAGCCCGTATAAATGAAGTGATAACTAAAGAGCTGTTAGAGGAAGAAAGCCAAAAATATTGGAAATTGGATTCCAGTTCAAAACGAAGGT
TGATTCGAACACAGTCCATACATCATATAGAGCCTTTGTACTATTCTCCAGGTTATAATGGTGAAAATGGAGATGGCGGAATCACTCCTCGACAGAAAACTCCAATGAAA
TTAGCTGCATCTGGAATGAGGAGTGTTTCTCTGAATGCCATGGATAATGAGGACTACTTTATCCAGGGAAACATTGCTATCCAATTGACATCTTTTACAGAAAAATCTGG
AGTAAAGAAAACCTTAGAAACTAACAAGAACAGAAACGTCTCCGCTCGCTCATTTAAGGAAGACACTCACATCCAAGAGATATTTAAGGCAAATAGAAAACTATTTGCTG
AATTATTACGGGGTGCACGTGGTAAGAACACTCTCCTAACCCCGCAAAATAAGAAGTCCTCAGCAAGTCTAGCGAAATCAAGGTCCTTTCCTGCTCCTGGTTTAGCAGGA
AAAGGATACAAAAAGCTTACCTCACTCCAACACAAGCAGAGCGAGTCCTTTCCAAAACAAAAATCTAATTCTCCCCAGCCATCAAAGCAGGTTGAATCTGAATCTCCAAA
GAATTTTCATGAAGATTTGACACCTTGTGATTCTGTTGGTACTTCAAGCCATAACATAAGACAACAAACAAACCCTTCTTTTTTGGGCATGAATCGTGGACTAAGGCATG
GGGGGTGGAATCAGTTGGTTGTCAAGCGTTTCAATTTTATTAAGCAGAAAATAAGGCGCTCAATCAAGGAGCGGAAGAAGGGAAATAACCAGAAAACATCTAAAGGAATA
TCAACTGTGGATGCCTCTGGACATGAACTTCCCGCTTACAGAGAAGAGGCGCAGGAGGAAAGTATAGGAATTGCCATAACCACAAGTGAAAATGGCTCAGGCATGAGAGG
ATACAGTGAGACTGGTAATTCTGAGAACGATAATCTCAGTAATGGAGTTCAAACCAAGACAGGAATTGCTTCACCAAGTGCTTCTCTGGAAAGATATTTTCAACTTTCTA
ATGGCTCAGGCATTATTGGATACAGTGAGACTGACAATTCTGATAATGATAATCTCAGTAACAGAGTTCAAACCAAGACTGGAACTGCTTCATTAAGTGCTTCCCTGGAA
GGATATTCTCAACTGTCCGAGTACGGTTCCAATAAAAACAGAGAGGGAAAGTTTTACTACTCACAAAGCTTAAGGCTGATAAGTGAAGAAAAGATTCTGAATATAGAGAA
GCCTAAAAAAGCCTTTGGAAGGAATCTTTCTTCGCCTGGTATTGATCTCTTTTGTACATTATTTACTGACCCTCATGCTGTTTCTCGCACAAAAAAACCAAAGAGGGGTT
TGGCGCATTCGAGTACATATAATAATATTCGAGCAGATGAGATTTCAGCCCATCTATTAAGCGTACATGTATTTAAACCGTCGAGTAGAGATTCACCAAGAATGATACAA
AAAGGTGATGAAAACGTTGTTGATTATTCAGGTAGTTTAAACGAGGTCACAAATGATGAGGGGACTGCCTGGGTAGATGAGCTCAATGAGAAAATACCTCACTTCGATAT
ATCAGATGGTAGACACCAACAAGTATCGGGTAGTGAATGTAGAGTTGAAGATGTCAGGGAGACCATTGATCATGTCGACAATCTTTCACACATCAATCAAGTCGTAGAAC
ATGAAATTTGTTTTCAAGATGATGAAACTTCGGAGCTCTTGGACTCGGAAGGTGCAATGCTAAATCCTAGGTGCAGTATTGCAAATGAGCTTCAACCTTCTGATGACCAA
CCTAACGAGGTCAGGACAGAAGCTTTACTAACTTCTGAAACCATTGTCAGTGATGAGATAATTGATAATACTGAAAAGATTTCTAACTATCTCCATCTGCATTCCGAACT
TAGCACAGTCGAAAATGCCGACTTCAACTATATGAGGTATATTCTTCAGCTTTGTAGCTTTATCGAAAGTGGTCACGCAATAGACCAACCACTTAACTCTTTGATATTTG
AGGGAGAGGTGGCTCATTTTTACAAAAAACTTGAATGCTATTGGGAAAAGGTTGACAAAGATTCTGATCACCAACTTCTGCTTGATTTAGTTTATGAGACATTACATAAT
GTATGTGAAAGCTCACTCATTTGTTTCCTCAAAACCTTCTCCTGGACGAGCCAAATCCGTCCAATGCCGCTTGGGCGATATCTTCTTGAGGAGGTTCGAGAAAAAGTTGC
CTGGTACCTGTGCTTGGGACCAGAACTAGACCAATGTTTAGATGATGTGGTGGGCCGAGATTTAAATAAAGGCGATGATTGGATGAACCTTCAATCTGAAACTGAGTACA
TAACAGTTGAGTTAGAGGATATGATTCTTGATGAGCTTTTAGATGAAATACTAAGTTTTTAG
Protein sequence
MCSSYEDCALRSRLWNLDQAKRLVLGPTGQGVINLGCCGSPCFCCILMNTNQNVVVIKKNKAIMSLLGHSHFKKNLRVSCFLLCLDGTCWLLFHLQEPPSSEPVITCETL
ELGRLLNMFKMEKHIHRQDSNLQFNKNVPGCFWSIFHTIDYHRWHNVRKMLPYKKHSRSKRGPKSTLNNHHIAEVSDQSNNGNNPLMCTAESCPLRRKPGEARINEVITK
ELLEEESQKYWKLDSSSKRRLIRTQSIHHIEPLYYSPGYNGENGDGGITPRQKTPMKLAASGMRSVSLNAMDNEDYFIQGNIAIQLTSFTEKSGVKKTLETNKNRNVSAR
SFKEDTHIQEIFKANRKLFAELLRGARGKNTLLTPQNKKSSASLAKSRSFPAPGLAGKGYKKLTSLQHKQSESFPKQKSNSPQPSKQVESESPKNFHEDLTPCDSVGTSS
HNIRQQTNPSFLGMNRGLRHGGWNQLVVKRFNFIKQKIRRSIKERKKGNNQKTSKGISTVDASGHELPAYREEAQEESIGIAITTSENGSGMRGYSETGNSENDNLSNGV
QTKTGIASPSASLERYFQLSNGSGIIGYSETDNSDNDNLSNRVQTKTGTASLSASLEGYSQLSEYGSNKNREGKFYYSQSLRLISEEKILNIEKPKKAFGRNLSSPGIDL
FCTLFTDPHAVSRTKKPKRGLAHSSTYNNIRADEISAHLLSVHVFKPSSRDSPRMIQKGDENVVDYSGSLNEVTNDEGTAWVDELNEKIPHFDISDGRHQQVSGSECRVE
DVRETIDHVDNLSHINQVVEHEICFQDDETSELLDSEGAMLNPRCSIANELQPSDDQPNEVRTEALLTSETIVSDEIIDNTEKISNYLHLHSELSTVENADFNYMRYILQ
LCSFIESGHAIDQPLNSLIFEGEVAHFYKKLECYWEKVDKDSDHQLLLDLVYETLHNVCESSLICFLKTFSWTSQIRPMPLGRYLLEEVREKVAWYLCLGPELDQCLDDV
VGRDLNKGDDWMNLQSETEYITVELEDMILDELLDEILSF