; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021578 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021578
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationChr05:11770322..11775989
RNA-Seq ExpressionHG10021578
SyntenyHG10021578
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583771.1 hypothetical protein SDJN03_19703, partial [Cucurbita argyrosperma subsp. sororia]4.6e-17177.93Show/hide
Query:  METEEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSF
        M+   KKS   +DSQ  ESESELEA ELRR ESFLKWICI D SNPYRASLSC++FF+FAIAVP+ASHF+LSCSDCDEDHQRPFHVVVQLSLSAVA LSF
Subjt:  METEEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSF

Query:  LCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFF
        L LSLWLRLFG NRFLFLDKLS+ASPRVRAEYSRQLQRS EL+ LF++PCFMAEA YK+WWYI+AAK+IPYYTNNMY+SYITSCTLELCSWLYRTSIFFF
Subjt:  LCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFF

Query:  VCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAKSFEKNAA
        VC+LFRLIC LQMIRLEDFASVF RET+VGTIL+ HLGLRRTFTIISHRFRAF+LLSLIL+TASQFISLLMTT +KAHVNLSKAGQLAL    S      
Subjt:  VCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAKSFEKNAA

Query:  MLHKLGDRLVHMPTHETPLTA-----SLVSTMN-------------ESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFMVD
        +  +   ++ H     T L A     ++VST +             ESNSDDEDGDEDDLDDAKLMPVFA TISFQKRQALVIYLRNNKAGITVYGFMVD
Subjt:  MLHKLGDRLVHMPTHETPLTA-----SLVSTMN-------------ESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFMVD

Query:  RTWLKSIFAIELALFLWLLNKTVGVS
        RTWLKSIFAIELALFLWLLNKTVG+S
Subjt:  RTWLKSIFAIELALFLWLLNKTVGVS

XP_008458982.1 PREDICTED: uncharacterized protein LOC103498231 [Cucumis melo]6.0e-18783.94Show/hide
Query:  MET-EEKKSQPQIDSQK-----SESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSA
        MET EEKKSQPQ+DSQK     SESESELEAAELRRFESFLKWICIMDHSN YRASLSCIVFFVF IAVP+ASHF+LSCSDCDEDHQRPFHVVVQLSLSA
Subjt:  MET-EEKKSQPQIDSQK-----SESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSA

Query:  VATLSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYR
        VATLSFLCLSLWLRLFGLNRFLFLDKLSEASP++RAEY RQLQRSMELM+ FLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYR
Subjt:  VATLSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYR

Query:  TSIFFFVCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV----
        TSIFFFVCILFRLIC LQMIRLEDFAS+FR ETEVGTIL+QHLGLRRTFTIISHRFR FMLLSLIL+TASQFISLLMTTRSKAHVNLSKAGQLAL     
Subjt:  TSIFFFVCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV----

Query:  ----------------KAKSFEKNAAMLH--KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKA
                        KA+S    AA  H   + +    + T ETP TASLV  + ESNSDDEDGDEDDLDD KLMPVFAHTISFQKRQALV YLRNNKA
Subjt:  ----------------KAKSFEKNAAMLH--KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKA

Query:  GITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS
        GITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS
Subjt:  GITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS

XP_011660297.1 uncharacterized protein LOC101203162 [Cucumis sativus]3.6e-17980.23Show/hide
Query:  METEEKKSQPQIDSQKSES---ESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVAT
        MET EKKSQPQIDSQKS+S   ESELEA ELR+ ESFL+WICIMDHSN YRAS+SCI+FFVF IAVP+ASHF LSCSDCDEDHQRPFHVVVQLSLSAVAT
Subjt:  METEEKKSQPQIDSQKSES---ESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVAT

Query:  LSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSI
        LSFLCLSLWLR+FGLNRFLFLDKL EASP++RAEY RQLQ+SM+LM+ FLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISY+TSCTLELCSWLYRTSI
Subjt:  LSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSI

Query:  FFFVCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV-------
        FFFVCI FRLIC LQMIRLEDFAS FR ETEVGTIL+QHLGLRRTFT+ISHRFR FMLLSLIL+TASQFISLLMTTRSKAH NLSK+GQLAL        
Subjt:  FFFVCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV-------

Query:  -------------KAKSFEKNAAMLH--KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDG--DEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAG
                     KA+S    AA  H   + +    + T  TP TAS V  + ESNSDDEDG  DEDDLDDAKLMPVFAHTISFQKRQALV YLRNNKAG
Subjt:  -------------KAKSFEKNAAMLH--KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDG--DEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAG

Query:  ITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS
        ITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS
Subjt:  ITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS

XP_023001582.1 uncharacterized protein LOC111495669 [Cucurbita maxima]1.1e-16976.53Show/hide
Query:  METEEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSF
        M+  +KKS     +   ESESELEA ELRR ESFLKWIC+ D SNPYRASLSC++FF+FAIAVP+ASHF+LSCSDCDEDHQRPFHVVVQLSLSAVA LSF
Subjt:  METEEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSF

Query:  LCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFF
        + LSLWLRLFG NRFLFLDKLS+ASPRV+AEYSRQLQRSMEL++LF++PCFMAEA YK+WWYI+AAK+IPYYTNNMY+SYITSCTLELCSWLYRTSIFFF
Subjt:  LCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFF

Query:  VCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAKSFEKNAA
        VC+LFRLIC LQMIRLEDFASVF RET+VGTIL+ HLGLRRTFTIISHRFRAF+LLSLIL+TASQFISLLMTT +KAHVNLSKAGQLAL    S      
Subjt:  VCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAKSFEKNAA

Query:  MLHKLGDRLVHMPTHETPLTA-----SLVSTMN-------------ESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFMVD
        +  +   ++ H     T L A     ++VST +             E NSDDEDGDEDDLDDAKLMPVFA TISFQKRQALVIYLRNNKAGITVYGFMVD
Subjt:  MLHKLGDRLVHMPTHETPLTA-----SLVSTMN-------------ESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFMVD

Query:  RTWLKSIFAIELALFLWLLNKTVGVS
        RTWLKSIFAIELALFLWLLNKTVG+S
Subjt:  RTWLKSIFAIELALFLWLLNKTVGVS

XP_038893800.1 uncharacterized protein LOC120082620 [Benincasa hispida]5.0e-18182.15Show/hide
Query:  METEEKKSQPQIDSQK----SESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVA
        METEEKKSQPQIDSQK     ESE E EAAELRRF+S LKWICI DHSNPYRASLSCIVFFVFAIAVPVASHF+LSCSDCDEDHQRPFHVVVQLSLSAVA
Subjt:  METEEKKSQPQIDSQK----SESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVA

Query:  TLSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTS
        TLSFLCLSLWLR FGLNRFLFLDKLSEASPRVRAEY RQLQRSM LM+ FLLPCFMAEAGYKIWWYISAAKEIPYYTNN+YISYI SCTLELCSWLYRTS
Subjt:  TLSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTS

Query:  IFFFVCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV------
        IFFFVCILFRLIC LQMIRLEDFAS+F RE EVGTILMQHL LRRTFTIISHRFRAF+LLSLIL+TASQFISLLMTTRSKAHVNLSKAGQLAL       
Subjt:  IFFFVCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV------

Query:  --------------KAKSFEKNAAMLH-----KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNK
                      KA+S    AA  H        D L      ETP TASL+S + ESNS DE+ DEDDLDDAKLMPVFAHTISFQKRQALV YLRNNK
Subjt:  --------------KAKSFEKNAAMLH-----KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNK

Query:  AGITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS
        AGITVYGF VDRTWLKSIFAIELALFLWLLNKTVGVS
Subjt:  AGITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS

TrEMBL top hitse value%identityAlignment
A0A0A0M3M6 Uncharacterized protein1.7e-17980.23Show/hide
Query:  METEEKKSQPQIDSQKSES---ESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVAT
        MET EKKSQPQIDSQKS+S   ESELEA ELR+ ESFL+WICIMDHSN YRAS+SCI+FFVF IAVP+ASHF LSCSDCDEDHQRPFHVVVQLSLSAVAT
Subjt:  METEEKKSQPQIDSQKSES---ESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVAT

Query:  LSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSI
        LSFLCLSLWLR+FGLNRFLFLDKL EASP++RAEY RQLQ+SM+LM+ FLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISY+TSCTLELCSWLYRTSI
Subjt:  LSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSI

Query:  FFFVCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV-------
        FFFVCI FRLIC LQMIRLEDFAS FR ETEVGTIL+QHLGLRRTFT+ISHRFR FMLLSLIL+TASQFISLLMTTRSKAH NLSK+GQLAL        
Subjt:  FFFVCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV-------

Query:  -------------KAKSFEKNAAMLH--KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDG--DEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAG
                     KA+S    AA  H   + +    + T  TP TAS V  + ESNSDDEDG  DEDDLDDAKLMPVFAHTISFQKRQALV YLRNNKAG
Subjt:  -------------KAKSFEKNAAMLH--KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDG--DEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAG

Query:  ITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS
        ITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS
Subjt:  ITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS

A0A1S3C949 uncharacterized protein LOC1034982312.9e-18783.94Show/hide
Query:  MET-EEKKSQPQIDSQK-----SESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSA
        MET EEKKSQPQ+DSQK     SESESELEAAELRRFESFLKWICIMDHSN YRASLSCIVFFVF IAVP+ASHF+LSCSDCDEDHQRPFHVVVQLSLSA
Subjt:  MET-EEKKSQPQIDSQK-----SESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSA

Query:  VATLSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYR
        VATLSFLCLSLWLRLFGLNRFLFLDKLSEASP++RAEY RQLQRSMELM+ FLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYR
Subjt:  VATLSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYR

Query:  TSIFFFVCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV----
        TSIFFFVCILFRLIC LQMIRLEDFAS+FR ETEVGTIL+QHLGLRRTFTIISHRFR FMLLSLIL+TASQFISLLMTTRSKAHVNLSKAGQLAL     
Subjt:  TSIFFFVCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV----

Query:  ----------------KAKSFEKNAAMLH--KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKA
                        KA+S    AA  H   + +    + T ETP TASLV  + ESNSDDEDGDEDDLDD KLMPVFAHTISFQKRQALV YLRNNKA
Subjt:  ----------------KAKSFEKNAAMLH--KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKA

Query:  GITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS
        GITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS
Subjt:  GITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS

A0A6J1EG16 uncharacterized protein LOC1114339907.2e-17077.57Show/hide
Query:  METEEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSF
        M+   KKS   +DSQ  ESESELEA ELRR ESFLKWICI D SNPYRASLSC++FF+FAIAVP+ASHF+LSCSDCDEDHQRPFHVVVQLSLSAVA LSF
Subjt:  METEEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSF

Query:  LCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFF
        L LSLWLRLFG NRFLFLDKLS+ASPRVRAEYSRQLQRS EL+ LF++PCFMAEA YK+WWYI+AAK+IPYYTNNMY+SYITSCTLELCSWLYRTSIFFF
Subjt:  LCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFF

Query:  VCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAKSFEKNAA
        VC+LFRLIC LQMIRLEDFASVF RET+VGTIL+ HLGLRRTFTIISHRFRAF+LLSLIL+TASQFISLLMTT +KAHVNLSKAGQLAL    S      
Subjt:  VCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAKSFEKNAA

Query:  MLHKLGDRLVHMPTHETPLTA-----SLVSTMN-------------ESNSDDE--DGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFM
        +  +   ++ H     T L A     ++VST +             ESNSDDE  DGDEDDLDDAKLMPVFA TISFQKRQALVIYLRNNKAGITVYGFM
Subjt:  MLHKLGDRLVHMPTHETPLTA-----SLVSTMN-------------ESNSDDE--DGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFM

Query:  VDRTWLKSIFAIELALFLWLLNKTVGVS
        VDRTWLKSIFAIELALFLWLLNKTVG+S
Subjt:  VDRTWLKSIFAIELALFLWLLNKTVGVS

A0A6J1K314 uncharacterized protein LOC1114902094.6e-15672.2Show/hide
Query:  SQPQIDSQK------SESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFL
        S  QI+S+K      SESESE ++AELRRFESFLKWICIMDHSNP+ A+LSC +F  FAIAVP+ASHF+LSCSDCDEDH+RPFHVVVQLSLSAVATLSF 
Subjt:  SQPQIDSQK------SESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFL

Query:  CLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFV
        CLS WLR FGL+RFLFLDKL E+S + R EYS+QL+RSMEL++ FLLPCFMAEA YKIWWY+SAA EIPYY  NMY+SYITSCTLEL SWLYRTSIFFFV
Subjt:  CLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFV

Query:  CILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV-----------
        CILFRL+CRLQMIRLEDF SVF RE++VGTILMQHLGLRRT TIISHRFR FM LSLIL+TASQFI LLMTTRS A  NLSK GQLAL            
Subjt:  CILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV-----------

Query:  ---------KAKSFEKNAAMLH-KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFM
                 KA+S    AA  H            +ETP T S+++T   ++ DDED DEDD DD KLMPVFAHTISFQKRQALV YLRNNKAGITVYGF+
Subjt:  ---------KAKSFEKNAAMLH-KLGDRLVHMPTHETPLTASLVSTMNESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFM

Query:  VDRTWLKSIFAIELALFLWLLNKTVGVS
        VDRTWLKS+FAIELAL LWLLNKTVG+S
Subjt:  VDRTWLKSIFAIELALFLWLLNKTVGVS

A0A6J1KGZ0 uncharacterized protein LOC1114956695.5e-17076.53Show/hide
Query:  METEEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSF
        M+  +KKS     +   ESESELEA ELRR ESFLKWIC+ D SNPYRASLSC++FF+FAIAVP+ASHF+LSCSDCDEDHQRPFHVVVQLSLSAVA LSF
Subjt:  METEEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSF

Query:  LCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFF
        + LSLWLRLFG NRFLFLDKLS+ASPRV+AEYSRQLQRSMEL++LF++PCFMAEA YK+WWYI+AAK+IPYYTNNMY+SYITSCTLELCSWLYRTSIFFF
Subjt:  LCLSLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFF

Query:  VCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAKSFEKNAA
        VC+LFRLIC LQMIRLEDFASVF RET+VGTIL+ HLGLRRTFTIISHRFRAF+LLSLIL+TASQFISLLMTT +KAHVNLSKAGQLAL    S      
Subjt:  VCILFRLICRLQMIRLEDFASVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAKSFEKNAA

Query:  MLHKLGDRLVHMPTHETPLTA-----SLVSTMN-------------ESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFMVD
        +  +   ++ H     T L A     ++VST +             E NSDDEDGDEDDLDDAKLMPVFA TISFQKRQALVIYLRNNKAGITVYGFMVD
Subjt:  MLHKLGDRLVHMPTHETPLTA-----SLVSTMN-------------ESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFMVD

Query:  RTWLKSIFAIELALFLWLLNKTVGVS
        RTWLKSIFAIELALFLWLLNKTVG+S
Subjt:  RTWLKSIFAIELALFLWLLNKTVGVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)9.5e-9847.62Show/hide
Query:  ELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCLSLWLRLFGLNRFLFLDKLSEASP
        EL  F  +L+W+C+ DHS+P+ A LS  +F VF + VP  SHF L+C+DCD  H RP+  VVQLSLS+VAT+SFLCL+ ++  +GL RFLF DKL + S 
Subjt:  ELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCLSLWLRLFGLNRFLFLDKLSEASP

Query:  RVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASVFRRE
         VR  Y+ QL  S+ +++ F++PCF A + YKIWWY S    IP +  N  +S   +C +ELCSWLYRT++ F VC+LFRLIC LQ++RL+DFA +F+ +
Subjt:  RVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASVFRRE

Query:  TEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV--------------------KAKSFEKNAAMLHKLG
        ++VG+IL +HL +RR   IISHR+R+F+L  LIL+T SQF SLL+TT++   VN+ +AG+LAL                     KA++    AA  H   
Subjt:  TEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV--------------------KAKSFEKNAAMLHKLG

Query:  -----DRLVHM--PTHETPLTASL----------VSTMNESNSDDEDGDEDDLDDAKLMPVFA-HTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKS
             D+ V     T ETP   +           V T+ ES+SD+   +EDDLD+  ++PV+A  T+SFQKRQALV Y  NN AGITVYGF +DR  L +
Subjt:  -----DRLVHM--PTHETPLTASL----------VSTMNESNSDDEDGDEDDLDDAKLMPVFA-HTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKS

Query:  IFAIELALFLWLLNKTVGVS
        IF +EL+L LWLL KT+G+S
Subjt:  IFAIELALFLWLLNKTVGVS

AT3G20300.1 Protein of unknown function (DUF3537)8.6e-9948.18Show/hide
Query:  ELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCLSLWLRLFGLNRFLFLDKLSEASP
        EL  F  +L+W+C+ D S+P+ A LS  +F VF + VP  SHF L+CSDCD  H RP+  VVQLSLS+ A LSFLCLS ++  +GL RFLF DKL + S 
Subjt:  ELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCLSLWLRLFGLNRFLFLDKLSEASP

Query:  RVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASVFRRE
         VR  Y+ QL RS+++++ F+ PCF+A + YKIWWY S A +IP +  N+ +S   +C +ELCSWLYRT++ F VC+LFRLIC LQ++RL+DFA VF+ +
Subjt:  RVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASVFRRE

Query:  TEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV--------------------KAKSFEKNAAMLHKLG
        ++VG+IL +HL +RR   IISHR+R F+LLSLIL+T SQF SLL+TT++ A +N+ +AG+LAL                     KA++    AA  H   
Subjt:  TEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV--------------------KAKSFEKNAAMLHKLG

Query:  D-RLVHMPTHETPLTASLVS---------TMNESNSDDEDGDEDDLDDAKLMPVFAH-TISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELA
                  ETP      S            ES+S+D   +EDD D+  L+P +A+ TISFQKRQALV Y  NN++GITV+GF +DR+ L +IF IE++
Subjt:  D-RLVHMPTHETPLTASLVS---------TMNESNSDDEDGDEDDLDDAKLMPVFAH-TISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELA

Query:  LFLWLLNKTVGVS
        L LWLL KT+G+S
Subjt:  LFLWLLNKTVGVS

AT4G03820.1 Protein of unknown function (DUF3537)2.3e-8342.4Show/hide
Query:  EEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCL
        EE  +Q  ++S+      +          SF +     D SN  +  LS  +FF+ A+ VP+ SHF L C+DCD  H+RP+  +VQLSLS  A +SF+ L
Subjt:  EEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCL

Query:  SLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCI
        S W + +G+ RFLF DKL + S +VR  Y  ++QRSM+L+ +F+LP    +A Y+IWWY S   +IPY  N   +S++ +CTL+L SWLYRTS+F   CI
Subjt:  SLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCI

Query:  LFRLICRLQMIRLEDFASVFRRE-TEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAK-------SF
        L++ IC LQ++RL++FA  F  E  +  +IL +HL +RR   I+SHRFR F+LLSL  +TA+QF++LL T R+    N+ + G+LAL             
Subjt:  LFRLICRLQMIRLEDFASVFRRE-TEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAK-------SF

Query:  EKNAAMLHKLG------------------DRLVHMPTHETPLTASLVSTMN------ESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNK
        +    M HK                    D L    T + P T      ++      +S+ DDE+G+ DD +D ++ P+FA  IS QKRQALV YL NN+
Subjt:  EKNAAMLHKLG------------------DRLVHMPTHETPLTASLVSTMN------ESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNK

Query:  AGITVYGFMVDRTWLKSIFAIELALFLWLLNKTV
        AGITVYGF+VD+TWL+ IF+IELAL LWLL KT+
Subjt:  AGITVYGFMVDRTWLKSIFAIELALFLWLLNKTV

AT4G03820.2 Protein of unknown function (DUF3537)2.3e-8342.4Show/hide
Query:  EEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCL
        EE  +Q  ++S+      +          SF +     D SN  +  LS  +FF+ A+ VP+ SHF L C+DCD  H+RP+  +VQLSLS  A +SF+ L
Subjt:  EEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCL

Query:  SLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCI
        S W + +G+ RFLF DKL + S +VR  Y  ++QRSM+L+ +F+LP    +A Y+IWWY S   +IPY  N   +S++ +CTL+L SWLYRTS+F   CI
Subjt:  SLWLRLFGLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCI

Query:  LFRLICRLQMIRLEDFASVFRRE-TEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAK-------SF
        L++ IC LQ++RL++FA  F  E  +  +IL +HL +RR   I+SHRFR F+LLSL  +TA+QF++LL T R+    N+ + G+LAL             
Subjt:  LFRLICRLQMIRLEDFASVFRRE-TEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAK-------SF

Query:  EKNAAMLHKLG------------------DRLVHMPTHETPLTASLVSTMN------ESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNK
        +    M HK                    D L    T + P T      ++      +S+ DDE+G+ DD +D ++ P+FA  IS QKRQALV YL NN+
Subjt:  EKNAAMLHKLG------------------DRLVHMPTHETPLTASLVSTMN------ESNSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNK

Query:  AGITVYGFMVDRTWLKSIFAIELALFLWLLNKTV
        AGITVYGF+VD+TWL+ IF+IELAL LWLL KT+
Subjt:  AGITVYGFMVDRTWLKSIFAIELALFLWLLNKTV

AT4G22270.1 Protein of unknown function (DUF3537)5.8e-9549.13Show/hide
Query:  SFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEY
        +F+  +   D SN   A LS  VFF+  + VP+ SHF L CSDCD  H+RP+ V+VQLSLS  A +SF+ LS+W R FG+ RFLFLDKL + S +VR EY
Subjt:  SFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCLSLWLRLFGLNRFLFLDKLSEASPRVRAEY

Query:  SRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASVFRRE-TEVGT
          ++QRS++ + +F+LP    EA Y+IWWYIS   +IPY  N + +S++ +CTL+L SWLYR S+F  VCIL+++ C LQ +RL+DFA  F  E T+V +
Subjt:  SRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASVFRRE-TEVGT

Query:  ILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV--------------------KAKSFEKNAAMLHKLG--DRL
         L +H  +RR   I+SHRFR F+LLSLIL+TA+QF++LL TTR+   VN+ + G+LAL                     KA+S    AA  +     D  
Subjt:  ILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALV--------------------KAKSFEKNAAMLHKLG--DRL

Query:  VHMPTHETPLTASLVSTMNE------SNSDDEDGD-EDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELALFLWLLN
         H+   ETP T S++ +           SDDE+G+ +DDLD+ K+ P++A+TIS+QKRQALV YL NNKAGITVYGF+VDR+WL +IF IELAL LWLLN
Subjt:  VHMPTHETPLTASLVSTMNE------SNSDDEDGD-EDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELALFLWLLN

Query:  KTV
        KT+
Subjt:  KTV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACGGAGGAGAAGAAATCTCAACCTCAAATCGATTCACAGAAATCAGAATCAGAATCAGAATTAGAAGCAGCGGAATTGAGAAGGTTTGAATCATTCTTGAAATG
GATTTGTATAATGGATCACTCGAATCCATATCGCGCTTCGCTTTCCTGCATCGTCTTCTTCGTGTTCGCAATCGCAGTCCCTGTCGCATCGCACTTTTCTCTTTCTTGCT
CCGATTGCGACGAAGATCACCAAAGGCCTTTCCATGTCGTCGTTCAGCTCTCTCTCTCCGCCGTTGCTACGCTCTCTTTCCTCTGTCTCTCTCTCTGGCTTCGTCTCTTT
GGATTGAACCGGTTTCTCTTCCTTGATAAGCTGTCTGAAGCAAGCCCTAGGGTTCGGGCTGAGTATTCCAGGCAATTACAGAGATCAATGGAGCTGATGACCTTGTTTCT
ACTGCCATGTTTCATGGCAGAAGCAGGTTACAAAATTTGGTGGTACATTTCAGCAGCTAAAGAAATCCCATATTACACAAATAACATGTACATCAGCTACATTACTTCTT
GCACATTGGAGCTCTGTTCATGGCTTTACAGAACTTCCATCTTCTTCTTTGTTTGCATTCTGTTCCGCCTCATTTGCCGTCTCCAAATGATCAGACTTGAAGATTTCGCT
TCCGTTTTCCGTCGGGAAACCGAGGTCGGCACCATCTTGATGCAGCATTTGGGCCTTAGAAGAACCTTCACTATCATTAGCCATCGCTTCAGAGCCTTCATGTTGCTGTC
TTTGATTTTGATTACTGCTAGCCAGTTTATCTCTTTATTGATGACTACAAGATCAAAAGCTCATGTTAATCTCTCCAAGGCTGGACAACTTGCGTTGGTAAAAGCTAAGA
GTTTTGAGAAAAATGCAGCTATGCTCCATAAGCTTGGTGACAGGCTTGTTCATATGCCTACGCATGAGACGCCCCTGACAGCATCTCTGGTTTCGACCATGAATGAATCC
AACTCTGATGACGAGGACGGCGATGAAGACGACTTGGATGATGCCAAACTGATGCCAGTTTTTGCCCACACAATCTCATTCCAAAAGAGGCAGGCATTAGTAATATATTT
GAGGAATAACAAAGCAGGAATTACAGTGTATGGATTCATGGTGGACAGAACATGGCTGAAATCCATTTTTGCTATTGAACTTGCTCTTTTCTTGTGGCTGCTCAACAAGA
CTGTTGGTGTTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACGGAGGAGAAGAAATCTCAACCTCAAATCGATTCACAGAAATCAGAATCAGAATCAGAATTAGAAGCAGCGGAATTGAGAAGGTTTGAATCATTCTTGAAATG
GATTTGTATAATGGATCACTCGAATCCATATCGCGCTTCGCTTTCCTGCATCGTCTTCTTCGTGTTCGCAATCGCAGTCCCTGTCGCATCGCACTTTTCTCTTTCTTGCT
CCGATTGCGACGAAGATCACCAAAGGCCTTTCCATGTCGTCGTTCAGCTCTCTCTCTCCGCCGTTGCTACGCTCTCTTTCCTCTGTCTCTCTCTCTGGCTTCGTCTCTTT
GGATTGAACCGGTTTCTCTTCCTTGATAAGCTGTCTGAAGCAAGCCCTAGGGTTCGGGCTGAGTATTCCAGGCAATTACAGAGATCAATGGAGCTGATGACCTTGTTTCT
ACTGCCATGTTTCATGGCAGAAGCAGGTTACAAAATTTGGTGGTACATTTCAGCAGCTAAAGAAATCCCATATTACACAAATAACATGTACATCAGCTACATTACTTCTT
GCACATTGGAGCTCTGTTCATGGCTTTACAGAACTTCCATCTTCTTCTTTGTTTGCATTCTGTTCCGCCTCATTTGCCGTCTCCAAATGATCAGACTTGAAGATTTCGCT
TCCGTTTTCCGTCGGGAAACCGAGGTCGGCACCATCTTGATGCAGCATTTGGGCCTTAGAAGAACCTTCACTATCATTAGCCATCGCTTCAGAGCCTTCATGTTGCTGTC
TTTGATTTTGATTACTGCTAGCCAGTTTATCTCTTTATTGATGACTACAAGATCAAAAGCTCATGTTAATCTCTCCAAGGCTGGACAACTTGCGTTGGTAAAAGCTAAGA
GTTTTGAGAAAAATGCAGCTATGCTCCATAAGCTTGGTGACAGGCTTGTTCATATGCCTACGCATGAGACGCCCCTGACAGCATCTCTGGTTTCGACCATGAATGAATCC
AACTCTGATGACGAGGACGGCGATGAAGACGACTTGGATGATGCCAAACTGATGCCAGTTTTTGCCCACACAATCTCATTCCAAAAGAGGCAGGCATTAGTAATATATTT
GAGGAATAACAAAGCAGGAATTACAGTGTATGGATTCATGGTGGACAGAACATGGCTGAAATCCATTTTTGCTATTGAACTTGCTCTTTTCTTGTGGCTGCTCAACAAGA
CTGTTGGTGTTTCTTGA
Protein sequenceShow/hide protein sequence
METEEKKSQPQIDSQKSESESELEAAELRRFESFLKWICIMDHSNPYRASLSCIVFFVFAIAVPVASHFSLSCSDCDEDHQRPFHVVVQLSLSAVATLSFLCLSLWLRLF
GLNRFLFLDKLSEASPRVRAEYSRQLQRSMELMTLFLLPCFMAEAGYKIWWYISAAKEIPYYTNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFA
SVFRRETEVGTILMQHLGLRRTFTIISHRFRAFMLLSLILITASQFISLLMTTRSKAHVNLSKAGQLALVKAKSFEKNAAMLHKLGDRLVHMPTHETPLTASLVSTMNES
NSDDEDGDEDDLDDAKLMPVFAHTISFQKRQALVIYLRNNKAGITVYGFMVDRTWLKSIFAIELALFLWLLNKTVGVS