; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020843 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020843
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationtig00153574:563629..567348
RNA-Seq ExpressionSgr020843
SyntenySgr020843
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458982.1 PREDICTED: uncharacterized protein LOC103498231 [Cucumis melo]4.9e-18780.37Show/hide
Query:  DEKKKASSQTDLHRSVDSE-----DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVA
        +E+KK+  Q D  +S  +E     + EAAELRRFESFLKWIC I+D SN +RASLSC VFFVF IAVPI SHFAL+CSDCDEDH RPFHVVVQLSLSAVA
Subjt:  DEKKKASSQTDLHRSVDSE-----DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVA

Query:  TLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTS
        TLSF CLS WL ++GL RFLF+DKL E+S K+R EY +QL+RSMEL+SFFLLPCF+AEA YK+WWY+SAA +IPYY NNMYISYIT+ TLELCSWLYRTS
Subjt:  TLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTS

Query:  IFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVT
        IFFFVCIL+RLIC LQMIRLEDFA +F  ETEVGTIL+QHLGLRRT TIISHRFRVFMLLSL+LVTASQFISLLM TRS AHVNLSKAGQLALCSISLVT
Subjt:  IFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVT

Query:  GIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARV--TTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGIT
        G+FICLRSAAKITHKAQSITCLAAKWHVSA+INTFD LD +TP  A +    VESNSDDED DEDDLD+ KLMPVFA TISFQKRQALV YLRNNKAGIT
Subjt:  GIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARV--TTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGIT

Query:  VYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        VYGFMVDRTWLKSIFAIELAL LWLLNKTVG+S
Subjt:  VYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

XP_011660297.1 uncharacterized protein LOC101203162 [Cucumis sativus]2.1e-18278.39Show/hide
Query:  MEDEKKKASSQTDLHRSVDSE---DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVA
        ME  +KK+  Q D  +S  +E   + EA ELR+ ESFL+WIC I+D SN +RAS+SC +FFVF IAVPI SHF L+CSDCDEDH RPFHVVVQLSLSAVA
Subjt:  MEDEKKKASSQTDLHRSVDSE---DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVA

Query:  TLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTS
        TLSF CLS WL V+GL RFLF+DKLCE+S K+R EY +QL++SM+L+SFFLLPCF+AEA YK+WWY+SAA +IPYY NNMYISY+T+ TLELCSWLYRTS
Subjt:  TLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTS

Query:  IFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVT
        IFFFVCI +RLIC LQMIRLEDFA  F  ETEVGTIL+QHLGLRRT T+ISHRFRVFMLLSL+LVTASQFISLLM TRS AH NLSK+GQLALCSISLVT
Subjt:  IFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVT

Query:  GIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGK-TPTAARV-TTVESNSDDE--DEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAG
        G+FICLRSAAKITHKAQSITCLAAKWHVSA+INTFD LD + TPTA+ V   VESNSDDE  DEDEDDLD+AKLMPVFA TISFQKRQALV YLRNNKAG
Subjt:  GIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGK-TPTAARV-TTVESNSDDE--DEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAG

Query:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        ITVYGFMVDRTWLKSIFAIELAL LWLLNKTVG+S
Subjt:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

XP_022954540.1 uncharacterized protein LOC111456780 [Cucurbita moschata]3.6e-18277.43Show/hide
Query:  MEDEKKKA-------------SSQTDLHRSVDSE-DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFH
        MED +KK+             S +++L    +SE + ++AELRRFESFLKWIC I+D SNP+ A+LSCF+F  FAIAVPI SHFAL+CSDCDEDH RPFH
Subjt:  MEDEKKKA-------------SSQTDLHRSVDSE-DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFH

Query:  VVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITAST
        VVVQLSLSAVATLSF CLS WL   GL RFLF+DKLCESS K RDEYSKQL+RSMELISFFLLPCF+AEAAYK+WWYVSAAN+IPYYG NMY+SYIT+ T
Subjt:  VVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITAST

Query:  LELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAG
        LEL SWLYRTSIFFFVCIL+RL+CRLQMIRLEDF  VFH+E++VGTILMQHLGLRRTLT ISHRFRVFM LSL+LVTASQFISLLM TRS+A  NLSK G
Subjt:  LELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAG

Query:  QLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARVTTVESNS-DDEDEDEDDLDNAKLMPVFAQTISFQKRQALVI
        QLALCSISLVTG+FICLRSAAKITHKAQSITCLAAKWHVSA INTFD LD +TPTA+ +   E NS DDE++DEDD D+ KLMPVFA TISFQKRQALV 
Subjt:  QLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARVTTVESNS-DDEDEDEDDLDNAKLMPVFAQTISFQKRQALVI

Query:  YLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        YL+NNK GITVYGF+VDRTWLKS+FAIELAL+LWLLNKTVGIS
Subjt:  YLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

XP_022994504.1 uncharacterized protein LOC111490209 [Cucurbita maxima]1.9e-18380.57Show/hide
Query:  SSQTDLHRSVDSE-DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCW
        S +++L    +SE + ++AELRRFESFLKWIC I+D SNP+ A+LSCF+F  FAIAVPI SHFAL+CSDCDEDH RPFHVVVQLSLSAVATLSF CLS W
Subjt:  SSQTDLHRSVDSE-DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCW

Query:  LHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYR
        L  +GL RFLF+DKLCESS K RDEYSKQL+RSMELISFFLLPCF+AEAAYK+WWYVSAA +IPYYG NMY+SYIT+ TLEL SWLYRTSIFFFVCIL+R
Subjt:  LHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYR

Query:  LICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAA
        L+CRLQMIRLEDF  VFH+E++VGTILMQHLGLRRTLTIISHRFRVFM LSL+LVTASQFI LLM TRS+A  NLSK GQLALCSISLVTG+FICLRSAA
Subjt:  LICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAA

Query:  KITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARVTTVESNS-DDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWL
        KI+HKAQSITCLAAKWHVSA INTFD LD +TPT + + T E NS DDED+DEDD D+ KLMPVFA TISFQKRQALV YLRNNKAGITVYGF+VDRTWL
Subjt:  KITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARVTTVESNS-DDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWL

Query:  KSIFAIELALVLWLLNKTVGIS
        KS+FAIELALVLWLLNKTVGIS
Subjt:  KSIFAIELALVLWLLNKTVGIS

XP_038893800.1 uncharacterized protein LOC120082620 [Benincasa hispida]1.2e-18278.29Show/hide
Query:  MEDEKKKASSQTD---LHRSVDSE-DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAV
        ME E+KK+  Q D   L    +SE + EAAELRRF+S LKWIC I D SNP+RASLSC VFFVFAIAVP+ SHFAL+CSDCDEDH RPFHVVVQLSLSAV
Subjt:  MEDEKKKASSQTD---LHRSVDSE-DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAV

Query:  ATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRT
        ATLSF CLS WL  +GL RFLF+DKL E+S +VR EY +QL+RSM L+SFFLLPCF+AEA YK+WWY+SAA +IPYY NN+YISYI + TLELCSWLYRT
Subjt:  ATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRT

Query:  SIFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLV
        SIFFFVCIL+RLIC LQMIRLEDFA +FH+E EVGTILMQHL LRRT TIISHRFR F+LLSL+LVTASQFISLLM TRS AHVNLSKAGQLALCSISLV
Subjt:  SIFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLV

Query:  TGIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGKT-PTAARVTTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGIT
        TG+FICLRSAAKITHKAQSITCLAAKWHVSA+INTFD LD +T PTA+ ++ +  ++ DE+ DEDDLD+AKLMPVFA TISFQKRQALV YLRNNKAGIT
Subjt:  TGIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGKT-PTAARVTTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGIT

Query:  VYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        VYGF VDRTWLKSIFAIELAL LWLLNKTVG+S
Subjt:  VYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

TrEMBL top hitse value%identityAlignment
A0A0A0M3M6 Uncharacterized protein1.0e-18278.39Show/hide
Query:  MEDEKKKASSQTDLHRSVDSE---DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVA
        ME  +KK+  Q D  +S  +E   + EA ELR+ ESFL+WIC I+D SN +RAS+SC +FFVF IAVPI SHF L+CSDCDEDH RPFHVVVQLSLSAVA
Subjt:  MEDEKKKASSQTDLHRSVDSE---DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVA

Query:  TLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTS
        TLSF CLS WL V+GL RFLF+DKLCE+S K+R EY +QL++SM+L+SFFLLPCF+AEA YK+WWY+SAA +IPYY NNMYISY+T+ TLELCSWLYRTS
Subjt:  TLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTS

Query:  IFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVT
        IFFFVCI +RLIC LQMIRLEDFA  F  ETEVGTIL+QHLGLRRT T+ISHRFRVFMLLSL+LVTASQFISLLM TRS AH NLSK+GQLALCSISLVT
Subjt:  IFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVT

Query:  GIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGK-TPTAARV-TTVESNSDDE--DEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAG
        G+FICLRSAAKITHKAQSITCLAAKWHVSA+INTFD LD + TPTA+ V   VESNSDDE  DEDEDDLD+AKLMPVFA TISFQKRQALV YLRNNKAG
Subjt:  GIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGK-TPTAARV-TTVESNSDDE--DEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAG

Query:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        ITVYGFMVDRTWLKSIFAIELAL LWLLNKTVG+S
Subjt:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

A0A1S3C949 uncharacterized protein LOC1034982312.4e-18780.37Show/hide
Query:  DEKKKASSQTDLHRSVDSE-----DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVA
        +E+KK+  Q D  +S  +E     + EAAELRRFESFLKWIC I+D SN +RASLSC VFFVF IAVPI SHFAL+CSDCDEDH RPFHVVVQLSLSAVA
Subjt:  DEKKKASSQTDLHRSVDSE-----DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVA

Query:  TLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTS
        TLSF CLS WL ++GL RFLF+DKL E+S K+R EY +QL+RSMEL+SFFLLPCF+AEA YK+WWY+SAA +IPYY NNMYISYIT+ TLELCSWLYRTS
Subjt:  TLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTS

Query:  IFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVT
        IFFFVCIL+RLIC LQMIRLEDFA +F  ETEVGTIL+QHLGLRRT TIISHRFRVFMLLSL+LVTASQFISLLM TRS AHVNLSKAGQLALCSISLVT
Subjt:  IFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVT

Query:  GIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARV--TTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGIT
        G+FICLRSAAKITHKAQSITCLAAKWHVSA+INTFD LD +TP  A +    VESNSDDED DEDDLD+ KLMPVFA TISFQKRQALV YLRNNKAGIT
Subjt:  GIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARV--TTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGIT

Query:  VYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        VYGFMVDRTWLKSIFAIELAL LWLLNKTVG+S
Subjt:  VYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

A0A6J1GSP8 uncharacterized protein LOC1114567801.7e-18277.43Show/hide
Query:  MEDEKKKA-------------SSQTDLHRSVDSE-DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFH
        MED +KK+             S +++L    +SE + ++AELRRFESFLKWIC I+D SNP+ A+LSCF+F  FAIAVPI SHFAL+CSDCDEDH RPFH
Subjt:  MEDEKKKA-------------SSQTDLHRSVDSE-DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFH

Query:  VVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITAST
        VVVQLSLSAVATLSF CLS WL   GL RFLF+DKLCESS K RDEYSKQL+RSMELISFFLLPCF+AEAAYK+WWYVSAAN+IPYYG NMY+SYIT+ T
Subjt:  VVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITAST

Query:  LELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAG
        LEL SWLYRTSIFFFVCIL+RL+CRLQMIRLEDF  VFH+E++VGTILMQHLGLRRTLT ISHRFRVFM LSL+LVTASQFISLLM TRS+A  NLSK G
Subjt:  LELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAG

Query:  QLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARVTTVESNS-DDEDEDEDDLDNAKLMPVFAQTISFQKRQALVI
        QLALCSISLVTG+FICLRSAAKITHKAQSITCLAAKWHVSA INTFD LD +TPTA+ +   E NS DDE++DEDD D+ KLMPVFA TISFQKRQALV 
Subjt:  QLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARVTTVESNS-DDEDEDEDDLDNAKLMPVFAQTISFQKRQALVI

Query:  YLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        YL+NNK GITVYGF+VDRTWLKS+FAIELAL+LWLLNKTVGIS
Subjt:  YLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

A0A6J1K314 uncharacterized protein LOC1114902099.2e-18480.57Show/hide
Query:  SSQTDLHRSVDSE-DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCW
        S +++L    +SE + ++AELRRFESFLKWIC I+D SNP+ A+LSCF+F  FAIAVPI SHFAL+CSDCDEDH RPFHVVVQLSLSAVATLSF CLS W
Subjt:  SSQTDLHRSVDSE-DGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCW

Query:  LHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYR
        L  +GL RFLF+DKLCESS K RDEYSKQL+RSMELISFFLLPCF+AEAAYK+WWYVSAA +IPYYG NMY+SYIT+ TLEL SWLYRTSIFFFVCIL+R
Subjt:  LHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYR

Query:  LICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAA
        L+CRLQMIRLEDF  VFH+E++VGTILMQHLGLRRTLTIISHRFRVFM LSL+LVTASQFI LLM TRS+A  NLSK GQLALCSISLVTG+FICLRSAA
Subjt:  LICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAA

Query:  KITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARVTTVESNS-DDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWL
        KI+HKAQSITCLAAKWHVSA INTFD LD +TPT + + T E NS DDED+DEDD D+ KLMPVFA TISFQKRQALV YLRNNKAGITVYGF+VDRTWL
Subjt:  KITHKAQSITCLAAKWHVSAIINTFDHLDGKTPTAARVTTVESNS-DDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWL

Query:  KSIFAIELALVLWLLNKTVGIS
        KS+FAIELALVLWLLNKTVGIS
Subjt:  KSIFAIELALVLWLLNKTVGIS

A0A6J1KGZ0 uncharacterized protein LOC1114956693.6e-18077.62Show/hide
Query:  MEDEKKKASSQTDLHRSVDSEDGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLS
        M++  KK+ S +         + EA ELRR ESFLKWIC + DQSNP+RASLSC +FF+FAIAVP+ SHFAL+CSDCDEDH RPFHVVVQLSLSAVA LS
Subjt:  MEDEKKKASSQTDLHRSVDSEDGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLS

Query:  FACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFF
        F  LS WL ++G  RFLF+DKL ++S +V+ EYS+QL+RSMELIS F++PCF+AEAAYK+WWY++AA QIPYY NNMY+SYIT+ TLELCSWLYRTSIFF
Subjt:  FACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFF

Query:  FVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIF
        FVC+L+RLIC LQMIRLEDFA VF +ET+VGTIL+ HLGLRRT TIISHRFR F+LLSL+LVTASQFISLLM T + AHVNLSKAGQLALCSISLVTG+F
Subjt:  FVCILYRLICRLQMIRLEDFALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIF

Query:  ICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGK-TPTAARVTTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGF
        ICLRSAAKITHKAQSITCLAAKWHVSA+++TFD LD K TPTAA   T+E NSDDED DEDDLD+AKLMPVFA TISFQKRQALVIYLRNNKAGITVYGF
Subjt:  ICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHLDGK-TPTAARVTTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGF

Query:  MVDRTWLKSIFAIELALVLWLLNKTVGIS
        MVDRTWLKSIFAIELAL LWLLNKTVGIS
Subjt:  MVDRTWLKSIFAIELALVLWLLNKTVGIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)2.0e-12254.87Show/hide
Query:  ELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESS
        EL  F  +L+W+C  VD S+PW A LS  +F VF + VP +SHF LAC+DCD  HSRP+  VVQLSLS+VAT+SF CL+ ++  YGLRRFLF DKL + S
Subjt:  ELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESS

Query:  RKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQ
          VR  Y+ QL  S+ ++S+F++PCF A +AYK+WWY S  ++IP+ GN + +S   A  +ELCSWLYRT++ F VC+L+RLIC LQ++RL+DFA +F  
Subjt:  RKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQ

Query:  ETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVS
        +++VG+IL +HL +RR L IISHR+R F+L  L+LVT SQF SLL+ T++   VN+ +AG+LALCS++LVT + I LRSA+KITHKAQ++TCLAAKWHV 
Subjt:  ETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVS

Query:  AIINTFDHL------DGKTPT-AAR-----------VTTVESNSDDEDEDEDDLDNAKLMPVFA-QTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLK
        A + +FD          +TPT  AR           VT  ES+SD+  ++EDDLDN  ++PV+A  T+SFQKRQALV Y  NN AGITVYGF +DR  L 
Subjt:  AIINTFDHL------DGKTPT-AAR-----------VTTVESNSDDEDEDEDDLDNAKLMPVFA-QTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLK

Query:  SIFAIELALVLWLLNKTVGIS
        +IF +EL+LVLWLL KT+GIS
Subjt:  SIFAIELALVLWLLNKTVGIS

AT3G20300.1 Protein of unknown function (DUF3537)1.0e-12656.28Show/hide
Query:  ELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESS
        EL  F  +L+W+C  VDQS+PW A LS  +F VF + VP  SHF LACSDCD  HSRP+  VVQLSLS+ A LSF CLS ++  YGLRRFLF DKL + S
Subjt:  ELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESS

Query:  RKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQ
          VR  Y+ QL RS++++S+F+ PCF+A ++YK+WWY S A+QIP+ G N+ +S   A  +ELCSWLYRT++ F VC+L+RLIC LQ++RL+DFA VF  
Subjt:  RKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQ

Query:  ETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVS
        +++VG+IL +HL +RR L IISHR+R F+LLSL+LVT SQF SLL+ T++ A +N+ +AG+LALCS++LVT + I LRSA+KITHKAQ++TCLAAKWHV 
Subjt:  ETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVS

Query:  AIINTFDHLDGKTPTAARVTT-----------VESNSDDEDEDEDDLDNAKLMPVFA-QTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIEL
        A I +F+ +DG+TP      +            ES+S+D  ++EDD DN  L+P +A  TISFQKRQALV Y  NN++GITV+GF +DR+ L +IF IE+
Subjt:  AIINTFDHLDGKTPTAARVTT-----------VESNSDDEDEDEDDLDNAKLMPVFA-QTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIEL

Query:  ALVLWLLNKTVGIS
        +LVLWLL KT+GIS
Subjt:  ALVLWLLNKTVGIS

AT4G03820.1 Protein of unknown function (DUF3537)1.4e-10752.66Show/hide
Query:  DQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSME
        DQSN  +  LS  +FF+ A+ VP++SHF L C+DCD  H RP+  +VQLSLS  A +SF  LS W   YG+RRFLF DKL + S KVR  Y  +++RSM+
Subjt:  DQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSME

Query:  LISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQE-TEVGTILMQHLGLR
        L++ F+LP    +A Y++WWY S  NQIPY   N  +S++ A TL+L SWLYRTS+F   CILY+ IC LQ++RL++FA  F  E  +  +IL +HL +R
Subjt:  LISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQE-TEVGTILMQHLGLR

Query:  RTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHL-DGKTP
        R L I+SHRFR F+LLSL  VTA+QF++LL   R++   N+ + G+LALCS SLV+G+FICL+SA ++THKAQS+T +A KW+V A ++TFD L DG+TP
Subjt:  RTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHL-DGKTP

Query:  TAARVT----------TVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTV
             T           V  +SDD++E E D ++ ++ P+FA+ IS QKRQALV YL NN+AGITVYGF+VD+TWL+ IF+IELAL+LWLL KT+
Subjt:  TAARVT----------TVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTV

AT4G03820.2 Protein of unknown function (DUF3537)1.4e-10752.66Show/hide
Query:  DQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSME
        DQSN  +  LS  +FF+ A+ VP++SHF L C+DCD  H RP+  +VQLSLS  A +SF  LS W   YG+RRFLF DKL + S KVR  Y  +++RSM+
Subjt:  DQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVRDEYSKQLRRSME

Query:  LISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQE-TEVGTILMQHLGLR
        L++ F+LP    +A Y++WWY S  NQIPY   N  +S++ A TL+L SWLYRTS+F   CILY+ IC LQ++RL++FA  F  E  +  +IL +HL +R
Subjt:  LISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQE-TEVGTILMQHLGLR

Query:  RTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHL-DGKTP
        R L I+SHRFR F+LLSL  VTA+QF++LL   R++   N+ + G+LALCS SLV+G+FICL+SA ++THKAQS+T +A KW+V A ++TFD L DG+TP
Subjt:  RTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAIINTFDHL-DGKTP

Query:  TAARVT----------TVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTV
             T           V  +SDD++E E D ++ ++ P+FA+ IS QKRQALV YL NN+AGITVYGF+VD+TWL+ IF+IELAL+LWLL KT+
Subjt:  TAARVT----------TVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTV

AT4G22270.1 Protein of unknown function (DUF3537)2.4e-12356.44Show/hide
Query:  FESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVR
        F S + W     DQSN   A LS  VFF+  + VP++SHF L CSDCD  H RP+ V+VQLSLS  A +SF  LS W   +G+RRFLF+DKL + S KVR
Subjt:  FESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHVYGLRRFLFIDKLCESSRKVR

Query:  DEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQE-TE
         EY  +++RS++ +  F+LP    EA Y++WWY+S  NQIPY  N + +S++ A TL+L SWLYR S+F  VCILY++ C LQ +RL+DFA  F  E T+
Subjt:  DEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDFALVFHQE-TE

Query:  VGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAII
        V + L +H  +RR L I+SHRFR F+LLSL+LVTA+QF++LL  TR++  VN+ + G+LALCS+SLVTG+FICLRSA KITHKAQS+T LAAKW+V A +
Subjt:  VGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAII

Query:  NTFDHLDGKTPTAA--------RVTTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLL
        ++FDHLDG+TPT +        R   +E++ D+E E +DDLDN K+ P++A TIS+QKRQALV YL NNKAGITVYGF+VDR+WL +IF IELAL+LWLL
Subjt:  NTFDHLDGKTPTAA--------RVTTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLL

Query:  NKTV
        NKT+
Subjt:  NKTV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACGAGAAGAAGAAAGCTTCGTCTCAAACTGATTTGCATCGATCAGTCGATTCGGAAGATGGAGAAGCAGCCGAATTGAGAAGATTCGAATCTTTCCTGAAATG
GATTTGCATAATCGTCGATCAGTCGAATCCATGGCGCGCTTCGCTCTCCTGCTTCGTCTTCTTCGTCTTCGCAATCGCCGTCCCTATCGTGTCGCACTTCGCTCTCGCTT
GCTCCGACTGCGACGAAGATCACAGCAGGCCCTTCCACGTCGTCGTGCAGCTCTCTCTCTCCGCCGTTGCTACGCTCTCCTTCGCCTGCCTTTCTTGTTGGCTTCACGTG
TACGGATTGAGGCGCTTTCTGTTCATCGACAAGCTCTGCGAATCAAGTCGGAAGGTTCGGGATGAGTATTCAAAGCAACTACGGAGATCAATGGAGCTCATCTCCTTCTT
CCTCCTCCCCTGTTTCATCGCGGAAGCAGCCTACAAGGTGTGGTGGTACGTCTCAGCAGCCAACCAAATCCCATATTACGGCAATAACATGTACATCAGCTACATCACCG
CCAGCACACTGGAGCTCTGCTCATGGCTCTATCGAACATCCATCTTCTTCTTCGTCTGCATCCTCTACCGTCTGATCTGCCGCCTGCAAATGATCAGACTCGAAGATTTC
GCCCTTGTCTTCCATCAGGAAACCGAGGTGGGCACCATCTTGATGCAGCATCTGGGCCTCAGAAGGACCTTGACCATCATCAGCCATCGGTTCAGAGTGTTCATGTTGCT
GTCTTTGGTTTTGGTCACTGCCAGTCAGTTTATCTCCCTGCTGATGGCTACCAGATCAAATGCTCATGTGAATCTCTCCAAGGCTGGACAACTTGCGCTATGCTCCATCA
GCCTGGTGACTGGCATCTTCATATGCCTCCGCAGTGCTGCAAAGATCACCCACAAGGCACAGTCCATCACTTGCCTTGCTGCAAAGTGGCACGTCTCCGCCATCATAAAC
ACCTTCGACCACCTCGACGGCAAGACGCCGACAGCAGCTCGGGTCACGACCGTCGAATCCAACTCCGATGACGAAGACGAGGACGAGGACGACTTGGACAATGCCAAATT
GATGCCAGTTTTTGCCCAAACAATCTCATTCCAAAAGAGGCAGGCACTGGTGATATATCTGAGGAATAACAAAGCAGGAATTACAGTGTATGGATTCATGGTGGACAGAA
CATGGCTGAAATCCATTTTTGCTATTGAACTTGCACTTGTGCTGTGGCTGCTCAACAAGACTGTTGGTATTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGACGAGAAGAAGAAAGCTTCGTCTCAAACTGATTTGCATCGATCAGTCGATTCGGAAGATGGAGAAGCAGCCGAATTGAGAAGATTCGAATCTTTCCTGAAATG
GATTTGCATAATCGTCGATCAGTCGAATCCATGGCGCGCTTCGCTCTCCTGCTTCGTCTTCTTCGTCTTCGCAATCGCCGTCCCTATCGTGTCGCACTTCGCTCTCGCTT
GCTCCGACTGCGACGAAGATCACAGCAGGCCCTTCCACGTCGTCGTGCAGCTCTCTCTCTCCGCCGTTGCTACGCTCTCCTTCGCCTGCCTTTCTTGTTGGCTTCACGTG
TACGGATTGAGGCGCTTTCTGTTCATCGACAAGCTCTGCGAATCAAGTCGGAAGGTTCGGGATGAGTATTCAAAGCAACTACGGAGATCAATGGAGCTCATCTCCTTCTT
CCTCCTCCCCTGTTTCATCGCGGAAGCAGCCTACAAGGTGTGGTGGTACGTCTCAGCAGCCAACCAAATCCCATATTACGGCAATAACATGTACATCAGCTACATCACCG
CCAGCACACTGGAGCTCTGCTCATGGCTCTATCGAACATCCATCTTCTTCTTCGTCTGCATCCTCTACCGTCTGATCTGCCGCCTGCAAATGATCAGACTCGAAGATTTC
GCCCTTGTCTTCCATCAGGAAACCGAGGTGGGCACCATCTTGATGCAGCATCTGGGCCTCAGAAGGACCTTGACCATCATCAGCCATCGGTTCAGAGTGTTCATGTTGCT
GTCTTTGGTTTTGGTCACTGCCAGTCAGTTTATCTCCCTGCTGATGGCTACCAGATCAAATGCTCATGTGAATCTCTCCAAGGCTGGACAACTTGCGCTATGCTCCATCA
GCCTGGTGACTGGCATCTTCATATGCCTCCGCAGTGCTGCAAAGATCACCCACAAGGCACAGTCCATCACTTGCCTTGCTGCAAAGTGGCACGTCTCCGCCATCATAAAC
ACCTTCGACCACCTCGACGGCAAGACGCCGACAGCAGCTCGGGTCACGACCGTCGAATCCAACTCCGATGACGAAGACGAGGACGAGGACGACTTGGACAATGCCAAATT
GATGCCAGTTTTTGCCCAAACAATCTCATTCCAAAAGAGGCAGGCACTGGTGATATATCTGAGGAATAACAAAGCAGGAATTACAGTGTATGGATTCATGGTGGACAGAA
CATGGCTGAAATCCATTTTTGCTATTGAACTTGCACTTGTGCTGTGGCTGCTCAACAAGACTGTTGGTATTTCTTGA
Protein sequenceShow/hide protein sequence
MEDEKKKASSQTDLHRSVDSEDGEAAELRRFESFLKWICIIVDQSNPWRASLSCFVFFVFAIAVPIVSHFALACSDCDEDHSRPFHVVVQLSLSAVATLSFACLSCWLHV
YGLRRFLFIDKLCESSRKVRDEYSKQLRRSMELISFFLLPCFIAEAAYKVWWYVSAANQIPYYGNNMYISYITASTLELCSWLYRTSIFFFVCILYRLICRLQMIRLEDF
ALVFHQETEVGTILMQHLGLRRTLTIISHRFRVFMLLSLVLVTASQFISLLMATRSNAHVNLSKAGQLALCSISLVTGIFICLRSAAKITHKAQSITCLAAKWHVSAIIN
TFDHLDGKTPTAARVTTVESNSDDEDEDEDDLDNAKLMPVFAQTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS