; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010037 (gene) of Snake gourd v1 genome

Gene IDTan0010037
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationLG05:81048691..81056365
RNA-Seq ExpressionTan0010037
SyntenyTan0010037
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573486.1 hypothetical protein SDJN03_27373, partial [Cucurbita argyrosperma subsp. sororia]2.1e-19085.06Show/hide
Query:  KSGQSLDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFGFT
        KS   L+SES+S ELRRFESFLKWICI+DHSNP++A+LSCF+F AFAIAVPIASHF LSCSDCDEDH+RPFHVVVQLSLSAVATLSF CLS WLR  G +
Subjt:  KSGQSLDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFGFT

Query:  RFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQI
        RFLFLDKL ++S K R EYSKQLKRSMELIS FLLPCFMAEAAYKIWWYVSAA EIPYYG NMY+SYITSCTLEL SWLYRTSIFFFVCILFRL+CRLQ+
Subjt:  RFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQI

Query:  IRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHKAQ
        IRLEDF SVFHRE++VGTILMQHLGLRRTLT++SHRFRVFM LSLILVTASQFISL MTTRS+A ANLSKTGQLALCSISLVTGLFICLRSAAKITHKAQ
Subjt:  IRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHKAQ

Query:  SITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFAIE
        SITCLAAKWHVSA INTFDDLD ET  ASV+A  E NSDD+++ DEDD DD KLMPVFA TISFQKRQALVTYL+NNK GITVYGF+VDRTWLKSVFAIE
Subjt:  SITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFAIE

Query:  LALVLWLLNKTIGIS
        LAL LWLLNKT+GIS
Subjt:  LALVLWLLNKTIGIS

XP_008458982.1 PREDICTED: uncharacterized protein LOC103498231 [Cucumis melo]1.1e-19184.2Show/hide
Query:  MEDEEKKSGQS-LDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLW
        M+ ++ KS +S  +SE E+ ELRRFESFLKWICI+DHSN Y ASLSC VFF F IAVPIASHF LSCSDCDEDHQRPFHVVVQLSLSAVATLSF CLSLW
Subjt:  MEDEEKKSGQS-LDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLW

Query:  LRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFR
        LRLFG  RFLFLDKLS+AS K+R EY +QL+RSMEL+S FLLPCFMAEA YKIWWY+SAAKEIPYY NNMYISYITSCTLELCSWLYRTSIFFFVCILFR
Subjt:  LRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFR

Query:  LICRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAA
        LIC LQ+IRLEDFAS+F  ETEVGTIL+QHLGLRRT T++SHRFRVFMLLSLILVTASQFISL MTTRS AH NLSK GQLALCSISLVTGLFICLRSAA
Subjt:  LICRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAA

Query:  KITHKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAP--IEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRT
        KITHKAQSITCLAAKWHVSAVINTFDDLD ET   + + P  +E+NSDDE DGDEDDLDD KLMPVFA TISFQKRQALVTYLRNNKAGITVYGFMVDRT
Subjt:  KITHKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAP--IEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRT

Query:  WLKSVFAIELALVLWLLNKTIGIS
        WLKS+FAIELAL LWLLNKT+G+S
Subjt:  WLKSVFAIELALVLWLLNKTIGIS

XP_022954540.1 uncharacterized protein LOC111456780 [Cucurbita moschata]7.9e-19084.29Show/hide
Query:  KKSGQSLDSESESD----ELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLR
        +KS   L+SESES+    ELRRFESFLKWICI+DHSNP++A+LSCF+F AFAIAVPIASHF LSCSDCDEDH+RPFHVVVQLSLSAVATLSF CLS WLR
Subjt:  KKSGQSLDSESESD----ELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLR

Query:  LFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLI
          G +RFLFLDKL ++S K R EYSKQLKRSMELIS FLLPCFMAEAAYKIWWYVSAA EIPYYG NMY+SYITSCTLEL SWLYRTSIFFFVCILFRL+
Subjt:  LFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLI

Query:  CRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKI
        CRLQ+IRLEDF SVFHRE++VGTILMQHLGLRRTLT +SHRFRVFM LSLILVTASQFISL MTTRS+A ANLSKTGQLALCSISLVTGLFICLRSAAKI
Subjt:  CRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKI

Query:  THKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKS
        THKAQSITCLAAKWHVSA INTFDDLD ET  ASV+A  E NSDD+++ DEDD DD KLMPVFA TISFQKRQALVTYL+NNK GITVYGF+VDRTWLKS
Subjt:  THKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKS

Query:  VFAIELALVLWLLNKTIGIS
        VFAIELAL+LWLLNKT+GIS
Subjt:  VFAIELALVLWLLNKTIGIS

XP_022994504.1 uncharacterized protein LOC111490209 [Cucurbita maxima]5.5e-19185.13Show/hide
Query:  EKKSGQSLDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFG
        E KS    +SES+S ELRRFESFLKWICI+DHSNP++A+LSCF+F AFAIAVPIASHF LSCSDCDEDH+RPFHVVVQLSLSAVATLSF CLS WLR FG
Subjt:  EKKSGQSLDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFG

Query:  FTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRL
         +RFLFLDKL ++S K R EYSKQLKRSMELIS FLLPCFMAEAAYKIWWYVSAA EIPYYG NMY+SYITSCTLEL SWLYRTSIFFFVCILFRL+CRL
Subjt:  FTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRL

Query:  QIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHK
        Q+IRLEDF SVFHRE++VGTILMQHLGLRRTLT++SHRFRVFM LSLILVTASQFI L MTTRS+A ANLSKTGQLALCSISLVTGLFICLRSAAKI+HK
Subjt:  QIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHK

Query:  AQSITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFA
        AQSITCLAAKWHVSA INTFDDLD ET   SV+A  E NSDD++D DEDD DD KLMPVFA TISFQKRQALVTYLRNNKAGITVYGF+VDRTWLKSVFA
Subjt:  AQSITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFA

Query:  IELALVLWLLNKTIGIS
        IELALVLWLLNKT+GIS
Subjt:  IELALVLWLLNKTIGIS

XP_038893800.1 uncharacterized protein LOC120082620 [Benincasa hispida]1.3e-18781.34Show/hide
Query:  MEDEEKKSGQSLDS-----------ESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVA
        ME EEKKS   +DS           ESE+ ELRRF+S LKWICI DHSNPY ASLSC VFF FAIAVP+ASHF LSCSDCDEDHQRPFHVVVQLSLSAVA
Subjt:  MEDEEKKSGQSLDS-----------ESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVA

Query:  TLSFACLSLWLRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTS
        TLSF CLSLWLR FG  RFLFLDKLS+AS +VR EY +QL+RSM L+S FLLPCFMAEA YKIWWY+SAAKEIPYY NN+YISYI SCTLELCSWLYRTS
Subjt:  TLSFACLSLWLRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTS

Query:  IFFFVCILFRLICRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVT
        IFFFVCILFRLIC LQ+IRLEDFAS+FHRE EVGTILMQHL LRRT T++SHRFR F+LLSLILVTASQFISL MTTRS AH NLSK GQLALCSISLVT
Subjt:  IFFFVCILFRLICRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVT

Query:  GLFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDAET--TIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGI
        GLFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLD+ET  T + +   +E+NSD+E   DEDDLDDAKLMPVFA TISFQKRQALVTYLRNNKAGI
Subjt:  GLFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDAET--TIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGI

Query:  TVYGFMVDRTWLKSVFAIELALVLWLLNKTIGIS
        TVYGF VDRTWLKS+FAIELAL LWLLNKT+G+S
Subjt:  TVYGFMVDRTWLKSVFAIELALVLWLLNKTIGIS

TrEMBL top hitse value%identityAlignment
A0A0A0M3M6 Uncharacterized protein7.9e-18882.23Show/hide
Query:  DEEKKSGQSLDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRL
        D +K     L+SE E+ ELR+ ESFL+WICI+DHSN Y AS+SC +FF F IAVPIASHF LSCSDCDEDHQRPFHVVVQLSLSAVATLSF CLSLWLR+
Subjt:  DEEKKSGQSLDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRL

Query:  FGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLIC
        FG  RFLFLDKL +AS K+R EY +QL++SM+L+S FLLPCFMAEA YKIWWY+SAAKEIPYY NNMYISY+TSCTLELCSWLYRTSIFFFVCI FRLIC
Subjt:  FGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLIC

Query:  RLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKIT
         LQ+IRLEDFAS F  ETEVGTIL+QHLGLRRT TV+SHRFRVFMLLSLILVTASQFISL MTTRS AHANLSK+GQLALCSISLVTGLFICLRSAAKIT
Subjt:  RLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKIT

Query:  HKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAP--IEANSDDED-DGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWL
        HKAQSITCLAAKWHVSAVINTFD+LD E T  +   P  +E+NSDDED D DEDDLDDAKLMPVFA TISFQKRQALVTYLRNNKAGITVYGFMVDRTWL
Subjt:  HKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAP--IEANSDDED-DGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWL

Query:  KSVFAIELALVLWLLNKTIGIS
        KS+FAIELAL LWLLNKT+G+S
Subjt:  KSVFAIELALVLWLLNKTIGIS

A0A1S3C949 uncharacterized protein LOC1034982315.3e-19284.2Show/hide
Query:  MEDEEKKSGQS-LDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLW
        M+ ++ KS +S  +SE E+ ELRRFESFLKWICI+DHSN Y ASLSC VFF F IAVPIASHF LSCSDCDEDHQRPFHVVVQLSLSAVATLSF CLSLW
Subjt:  MEDEEKKSGQS-LDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLW

Query:  LRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFR
        LRLFG  RFLFLDKLS+AS K+R EY +QL+RSMEL+S FLLPCFMAEA YKIWWY+SAAKEIPYY NNMYISYITSCTLELCSWLYRTSIFFFVCILFR
Subjt:  LRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFR

Query:  LICRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAA
        LIC LQ+IRLEDFAS+F  ETEVGTIL+QHLGLRRT T++SHRFRVFMLLSLILVTASQFISL MTTRS AH NLSK GQLALCSISLVTGLFICLRSAA
Subjt:  LICRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAA

Query:  KITHKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAP--IEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRT
        KITHKAQSITCLAAKWHVSAVINTFDDLD ET   + + P  +E+NSDDE DGDEDDLDD KLMPVFA TISFQKRQALVTYLRNNKAGITVYGFMVDRT
Subjt:  KITHKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAP--IEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRT

Query:  WLKSVFAIELALVLWLLNKTIGIS
        WLKS+FAIELAL LWLLNKT+G+S
Subjt:  WLKSVFAIELALVLWLLNKTIGIS

A0A6J1EG16 uncharacterized protein LOC1114339901.1e-18681.73Show/hide
Query:  MEDEEKKSGQSLDS-ESESD----ELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFAC
        M++  KKS   +DS ESES+    ELRR ESFLKWICI D SNPY ASLSC +FF FAIAVP+ASHF LSCSDCDEDHQRPFHVVVQLSLSAVA LSF  
Subjt:  MEDEEKKSGQSLDS-ESESD----ELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFAC

Query:  LSLWLRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVC
        LSLWLRLFGF RFLFLDKLS+AS +VR EYS+QL+RS ELI LF++PCFMAEAAYK+WWY++AAK+IPYY NNMY+SYITSCTLELCSWLYRTSIFFFVC
Subjt:  LSLWLRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVC

Query:  ILFRLICRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICL
        +LFRLIC LQ+IRLEDFASVFHRET+VGTIL+ HLGLRRT T++SHRFR F+LLSLILVTASQFISL MTT + AH NLSK GQLALCSISLVTGLFICL
Subjt:  ILFRLICRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICL

Query:  RSAAKITHKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDED-DGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMV
        RSAAKITHKAQSITCLAAKWH+SAV++TFDDLD +TT  + +  IE+NSDDED DGDEDDLDDAKLMPVFADTISFQKRQALV YLRNNKAGITVYGFMV
Subjt:  RSAAKITHKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDED-DGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMV

Query:  DRTWLKSVFAIELALVLWLLNKTIGIS
        DRTWLKS+FAIELAL LWLLNKT+GIS
Subjt:  DRTWLKSVFAIELALVLWLLNKTIGIS

A0A6J1GSP8 uncharacterized protein LOC1114567803.8e-19084.29Show/hide
Query:  KKSGQSLDSESESD----ELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLR
        +KS   L+SESES+    ELRRFESFLKWICI+DHSNP++A+LSCF+F AFAIAVPIASHF LSCSDCDEDH+RPFHVVVQLSLSAVATLSF CLS WLR
Subjt:  KKSGQSLDSESESD----ELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLR

Query:  LFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLI
          G +RFLFLDKL ++S K R EYSKQLKRSMELIS FLLPCFMAEAAYKIWWYVSAA EIPYYG NMY+SYITSCTLEL SWLYRTSIFFFVCILFRL+
Subjt:  LFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLI

Query:  CRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKI
        CRLQ+IRLEDF SVFHRE++VGTILMQHLGLRRTLT +SHRFRVFM LSLILVTASQFISL MTTRS+A ANLSKTGQLALCSISLVTGLFICLRSAAKI
Subjt:  CRLQIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKI

Query:  THKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKS
        THKAQSITCLAAKWHVSA INTFDDLD ET  ASV+A  E NSDD+++ DEDD DD KLMPVFA TISFQKRQALVTYL+NNK GITVYGF+VDRTWLKS
Subjt:  THKAQSITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKS

Query:  VFAIELALVLWLLNKTIGIS
        VFAIELAL+LWLLNKT+GIS
Subjt:  VFAIELALVLWLLNKTIGIS

A0A6J1K314 uncharacterized protein LOC1114902092.6e-19185.13Show/hide
Query:  EKKSGQSLDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFG
        E KS    +SES+S ELRRFESFLKWICI+DHSNP++A+LSCF+F AFAIAVPIASHF LSCSDCDEDH+RPFHVVVQLSLSAVATLSF CLS WLR FG
Subjt:  EKKSGQSLDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFG

Query:  FTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRL
         +RFLFLDKL ++S K R EYSKQLKRSMELIS FLLPCFMAEAAYKIWWYVSAA EIPYYG NMY+SYITSCTLEL SWLYRTSIFFFVCILFRL+CRL
Subjt:  FTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRL

Query:  QIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHK
        Q+IRLEDF SVFHRE++VGTILMQHLGLRRTLT++SHRFRVFM LSLILVTASQFI L MTTRS+A ANLSKTGQLALCSISLVTGLFICLRSAAKI+HK
Subjt:  QIIRLEDFASVFHRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHK

Query:  AQSITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFA
        AQSITCLAAKWHVSA INTFDDLD ET   SV+A  E NSDD++D DEDD DD KLMPVFA TISFQKRQALVTYLRNNKAGITVYGF+VDRTWLKSVFA
Subjt:  AQSITCLAAKWHVSAVINTFDDLDAETTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFA

Query:  IELALVLWLLNKTIGIS
        IELALVLWLLNKT+GIS
Subjt:  IELALVLWLLNKTIGIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)9.5e-11752.59Show/hide
Query:  ESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFGFTRFLFLDKLSQ
        + DEL  F  +L+W+C +DHS+P++A LS  +F  F + VP  SHF L+C+DCD  H RP+  VVQLSLS+VAT+SF CL+ ++  +G  RFLF DKL  
Subjt:  ESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFGFTRFLFLDKLSQ

Query:  ASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQIIRLEDFASVF
         S  VR  Y+ QL  S+ ++S F++PCF A +AYKIWWY S    IP+ GN + +S   +C +ELCSWLYRT++ F VC+LFRLIC LQI+RL+DFA +F
Subjt:  ASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQIIRLEDFASVF

Query:  HRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHKAQSITCLAAKWH
          +++VG+IL +HL +RR L ++SHR+R F+L  LILVT SQF SL +TT++    N+ + G+LALCS++LVT L I LRSA+KITHKAQ++TCLAAKWH
Subjt:  HRETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHKAQSITCLAAKWH

Query:  VSAVINTFD------DLDAET------------TIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFA-DTISFQKRQALVTYLRNNKAGITVYGFMVDRT
        V A + +FD      D   ET             +  VV   E++SD+  D +EDDLD+  ++PV+A  T+SFQKRQALV+Y  NN AGITVYGF +DR 
Subjt:  VSAVINTFD------DLDAET------------TIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFA-DTISFQKRQALVTYLRNNKAGITVYGFMVDRT

Query:  WLKSVFAIELALVLWLLNKTIGIS
         L ++F +EL+LVLWLL KTIGIS
Subjt:  WLKSVFAIELALVLWLLNKTIGIS

AT3G20300.1 Protein of unknown function (DUF3537)2.7e-11954.83Show/hide
Query:  DELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFGFTRFLFLDKLSQAS
        DEL  F  +L+W+C +D S+P++A LS  +F  F + VP  SHF L+CSDCD  H RP+  VVQLSLS+ A LSF CLS ++  +G  RFLF DKL   S
Subjt:  DELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFGFTRFLFLDKLSQAS

Query:  LKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQIIRLEDFASVFHR
          VR  Y+ QL RS++++S F+ PCF+A ++YKIWWY S A +IP+ G N+ +S   +C +ELCSWLYRT++ F VC+LFRLIC LQI+RL+DFA VF  
Subjt:  LKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQIIRLEDFASVFHR

Query:  ETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHKAQSITCLAAKWHVS
        +++VG+IL +HL +RR L ++SHR+R F+LLSLILVT SQF SL +TT++ A  N+ + G+LALCS++LVT L I LRSA+KITHKAQ++TCLAAKWHV 
Subjt:  ETEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHKAQSITCLAAKWHVS

Query:  AVINTFDDLDAET------TIASVVAPIE---ANSDDEDDGD-EDDLDDAKLMPVFA-DTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFAIEL
        A I +F+ +D ET             P +     SD ED GD EDD D+  L+P +A  TISFQKRQALV Y  NN++GITV+GF +DR+ L ++F IE+
Subjt:  AVINTFDDLDAET------TIASVVAPIE---ANSDDEDDGD-EDDLDDAKLMPVFA-DTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFAIEL

Query:  ALVLWLLNKTIGIS
        +LVLWLL KTIGIS
Subjt:  ALVLWLLNKTIGIS

AT4G03820.1 Protein of unknown function (DUF3537)3.5e-10349.31Show/hide
Query:  EEKKSGQSLDSESESDELRRFE-------SFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACL
        EE  + Q L+SE+  D +++         SF +     D SN     LS  +FF  A+ VP+ SHF L C+DCD  H+RP+  +VQLSLS  A +SF  L
Subjt:  EEKKSGQSLDSESESDELRRFE-------SFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACL

Query:  SLWLRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCI
        S W + +G  RFLF DKL   S KVR  Y  +++RSM+L+++F+LP    +A Y+IWWY S   +IPY   N  +S++ +CTL+L SWLYRTS+F   CI
Subjt:  SLWLRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCI

Query:  LFRLICRLQIIRLEDFASVFHRE-TEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICL
        L++ IC LQ++RL++FA  F  E  +  +IL +HL +RR L +VSHRFR F+LLSL  VTA+QF++L  T R++   N+ + G+LALCS SLV+GLFICL
Subjt:  LFRLICRLQIIRLEDFASVFHRE-TEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICL

Query:  RSAAKITHKAQSITCLAAKWHVSAVINTFDDL-DAET----------TIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNK
        +SA ++THKAQS+T +A KW+V A ++TFD L D ET           I S    +  +SDD+++G+ DD +D ++ P+FA  IS QKRQALVTYL NN+
Subjt:  RSAAKITHKAQSITCLAAKWHVSAVINTFDDL-DAET----------TIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNK

Query:  AGITVYGFMVDRTWLKSVFAIELALVLWLLNKTI
        AGITVYGF+VD+TWL+ +F+IELAL+LWLL KTI
Subjt:  AGITVYGFMVDRTWLKSVFAIELALVLWLLNKTI

AT4G03820.2 Protein of unknown function (DUF3537)3.5e-10349.31Show/hide
Query:  EEKKSGQSLDSESESDELRRFE-------SFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACL
        EE  + Q L+SE+  D +++         SF +     D SN     LS  +FF  A+ VP+ SHF L C+DCD  H+RP+  +VQLSLS  A +SF  L
Subjt:  EEKKSGQSLDSESESDELRRFE-------SFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACL

Query:  SLWLRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCI
        S W + +G  RFLF DKL   S KVR  Y  +++RSM+L+++F+LP    +A Y+IWWY S   +IPY   N  +S++ +CTL+L SWLYRTS+F   CI
Subjt:  SLWLRLFGFTRFLFLDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCI

Query:  LFRLICRLQIIRLEDFASVFHRE-TEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICL
        L++ IC LQ++RL++FA  F  E  +  +IL +HL +RR L +VSHRFR F+LLSL  VTA+QF++L  T R++   N+ + G+LALCS SLV+GLFICL
Subjt:  LFRLICRLQIIRLEDFASVFHRE-TEVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICL

Query:  RSAAKITHKAQSITCLAAKWHVSAVINTFDDL-DAET----------TIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNK
        +SA ++THKAQS+T +A KW+V A ++TFD L D ET           I S    +  +SDD+++G+ DD +D ++ P+FA  IS QKRQALVTYL NN+
Subjt:  RSAAKITHKAQSITCLAAKWHVSAVINTFDDL-DAET----------TIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNK

Query:  AGITVYGFMVDRTWLKSVFAIELALVLWLLNKTI
        AGITVYGF+VD+TWL+ +F+IELAL+LWLL KTI
Subjt:  AGITVYGFMVDRTWLKSVFAIELALVLWLLNKTI

AT4G22270.1 Protein of unknown function (DUF3537)6.6e-11855.72Show/hide
Query:  SFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFGFTRFLFLDKLSQASLKVRHEY
        +F+  +   D SN  +A LS  VFF   + VP+ SHF L CSDCD  H+RP+ V+VQLSLS  A +SF  LS+W R FG  RFLFLDKL   S KVR EY
Subjt:  SFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFGFTRFLFLDKLSQASLKVRHEY

Query:  SKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQIIRLEDFASVFHRE-TEVGT
          +++RS++ + +F+LP    EA Y+IWWY+S   +IPY  N + +S++ +CTL+L SWLYR S+F  VCIL+++ C LQ +RL+DFA  F  E T+V +
Subjt:  SKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQIIRLEDFASVFHRE-TEVGT

Query:  ILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHKAQSITCLAAKWHVSAVINTF
         L +H  +RR L +VSHRFR F+LLSLILVTA+QF++L  TTR++   N+ + G+LALCS+SLVTG+FICLRSA KITHKAQS+T LAAKW+V A +++F
Subjt:  ILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHKAQSITCLAAKWHVSAVINTF

Query:  DDLDAETTIASVVA--------PIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFAIELALVLWLLNK
        D LD ET   S++          IE + D+E +GD DDLD+ K+ P++A+TIS+QKRQALVTYL NNKAGITVYGF+VDR+WL ++F IELAL+LWLLNK
Subjt:  DDLDAETTIASVVA--------PIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFAIELALVLWLLNK

Query:  TI
        TI
Subjt:  TI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACGAGGAGAAGAAATCTGGACAATCACTCGATTCGGAATCAGAATCAGACGAATTGAGGAGGTTCGAATCATTCCTGAAATGGATTTGCATAATCGATCACTC
GAATCCGTATAGTGCTTCGCTCTCCTGCTTCGTGTTTTTCGCCTTCGCAATCGCCGTCCCCATCGCATCGCACTTCTTTCTCTCTTGCTCCGATTGCGACGAAGATCACC
AAAGGCCTTTTCATGTCGTCGTTCAGCTTTCTCTCTCCGCCGTTGCCACGCTTTCATTCGCTTGCCTCTCTCTTTGGCTCCGCCTCTTCGGATTCACCCGATTTCTCTTC
CTCGATAAGCTCTCTCAAGCCAGTCTCAAAGTTCGCCATGAGTATTCCAAGCAATTAAAGAGATCAATGGAGCTGATCTCCTTGTTCTTGCTGCCATGTTTCATGGCAGA
AGCCGCCTACAAAATCTGGTGGTACGTATCAGCAGCAAAAGAAATACCATACTACGGCAACAACATGTACATCAGCTACATCACCTCCTGCACATTGGAGCTCTGCTCAT
GGCTTTACAGAACTTCAATCTTCTTCTTCGTCTGCATTCTCTTCCGTCTCATCTGCCGCCTGCAAATAATCAGACTTGAAGATTTTGCTTCCGTCTTCCATCGGGAAACC
GAGGTCGGCACCATCTTGATGCAGCATTTGGGCCTCAGAAGAACCTTGACAGTCGTCAGCCATCGCTTCAGAGTTTTCATGTTGCTGTCTTTGATTTTGGTCACTGCCAG
TCAGTTTATCTCTCTCTTCATGACTACTAGATCAAATGCTCATGCTAACCTCTCCAAGACTGGACAACTTGCGCTATGCTCCATCAGCCTGGTGACAGGCTTGTTCATAT
GCCTCCGCAGTGCTGCAAAGATCACTCACAAAGCACAGTCCATCACTTGCCTTGCTGCAAAGTGGCACGTCTCCGCTGTCATAAACACCTTCGACGACCTCGACGCTGAG
ACAACAATAGCATCTGTGGTTGCGCCCATTGAAGCGAACTCCGACGACGAAGATGATGGAGACGAGGACGACTTGGATGATGCCAAACTGATGCCAGTTTTCGCCGACAC
AATCTCATTCCAAAAGAGGCAGGCACTAGTGACATATTTGAGGAATAACAAAGCAGGAATTACAGTGTATGGATTCATGGTGGACAGAACATGGCTTAAATCTGTTTTTG
CCATTGAACTTGCACTTGTGCTGTGGCTGCTCAACAAGACTATTGGTATTTCTTGA
mRNA sequenceShow/hide mRNA sequence
AATTAATATTGTTCTCTTCTGTTTGTTTTTTCCCCCCACGTCTCTCTCTCTCTATTCTTTTTCCTTCCTCAATCCACCGTGTATGAAAATTTTCCGCTCCCCTGTTCCAA
AGCTATGGAAGACGAGGAGAAGAAATCTGGACAATCACTCGATTCGGAATCAGAATCAGACGAATTGAGGAGGTTCGAATCATTCCTGAAATGGATTTGCATAATCGATC
ACTCGAATCCGTATAGTGCTTCGCTCTCCTGCTTCGTGTTTTTCGCCTTCGCAATCGCCGTCCCCATCGCATCGCACTTCTTTCTCTCTTGCTCCGATTGCGACGAAGAT
CACCAAAGGCCTTTTCATGTCGTCGTTCAGCTTTCTCTCTCCGCCGTTGCCACGCTTTCATTCGCTTGCCTCTCTCTTTGGCTCCGCCTCTTCGGATTCACCCGATTTCT
CTTCCTCGATAAGCTCTCTCAAGCCAGTCTCAAAGTTCGCCATGAGTATTCCAAGCAATTAAAGAGATCAATGGAGCTGATCTCCTTGTTCTTGCTGCCATGTTTCATGG
CAGAAGCCGCCTACAAAATCTGGTGGTACGTATCAGCAGCAAAAGAAATACCATACTACGGCAACAACATGTACATCAGCTACATCACCTCCTGCACATTGGAGCTCTGC
TCATGGCTTTACAGAACTTCAATCTTCTTCTTCGTCTGCATTCTCTTCCGTCTCATCTGCCGCCTGCAAATAATCAGACTTGAAGATTTTGCTTCCGTCTTCCATCGGGA
AACCGAGGTCGGCACCATCTTGATGCAGCATTTGGGCCTCAGAAGAACCTTGACAGTCGTCAGCCATCGCTTCAGAGTTTTCATGTTGCTGTCTTTGATTTTGGTCACTG
CCAGTCAGTTTATCTCTCTCTTCATGACTACTAGATCAAATGCTCATGCTAACCTCTCCAAGACTGGACAACTTGCGCTATGCTCCATCAGCCTGGTGACAGGCTTGTTC
ATATGCCTCCGCAGTGCTGCAAAGATCACTCACAAAGCACAGTCCATCACTTGCCTTGCTGCAAAGTGGCACGTCTCCGCTGTCATAAACACCTTCGACGACCTCGACGC
TGAGACAACAATAGCATCTGTGGTTGCGCCCATTGAAGCGAACTCCGACGACGAAGATGATGGAGACGAGGACGACTTGGATGATGCCAAACTGATGCCAGTTTTCGCCG
ACACAATCTCATTCCAAAAGAGGCAGGCACTAGTGACATATTTGAGGAATAACAAAGCAGGAATTACAGTGTATGGATTCATGGTGGACAGAACATGGCTTAAATCTGTT
TTTGCCATTGAACTTGCACTTGTGCTGTGGCTGCTCAACAAGACTATTGGTATTTCTTGAAGGTTTTGTTATCTTATACACTCTAATTTTGATCTTTATATAATATTTGC
ATATACTTGCGGCATTATTTGGGCTGTTTCTGAATGGCTTTTGTAGAGGAAAATTTGTCTTTCTAAAATTGCAGTGTTCTGTATTTATGGACACCAATCAACTGCATTAT
AAACGGAACATATGTTCATCTTTTGAAAGTAAATAGGTCTATTATAAAGAATTCATTCATGAGATGACTCGTGTTTTTCTTTTTTAATTAGAGTTAGAACCATATGTAGG
GTGAGAGTTTTGAGTCTCTCACCGCTTGATCGAAGATATATGTCTTAACTAGTTGAACTATATTCAAGTTGACCGTGAGTTGACTCCTTGAAAATGAGTTGGGC
Protein sequenceShow/hide protein sequence
MEDEEKKSGQSLDSESESDELRRFESFLKWICIIDHSNPYSASLSCFVFFAFAIAVPIASHFFLSCSDCDEDHQRPFHVVVQLSLSAVATLSFACLSLWLRLFGFTRFLF
LDKLSQASLKVRHEYSKQLKRSMELISLFLLPCFMAEAAYKIWWYVSAAKEIPYYGNNMYISYITSCTLELCSWLYRTSIFFFVCILFRLICRLQIIRLEDFASVFHRET
EVGTILMQHLGLRRTLTVVSHRFRVFMLLSLILVTASQFISLFMTTRSNAHANLSKTGQLALCSISLVTGLFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDAE
TTIASVVAPIEANSDDEDDGDEDDLDDAKLMPVFADTISFQKRQALVTYLRNNKAGITVYGFMVDRTWLKSVFAIELALVLWLLNKTIGIS