; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS022695 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS022695
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDUF761 domain-containing protein
Genome locationscaffold41:41528..43276
RNA-Seq ExpressionMS022695
SyntenyMS022695
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34231.1 hypothetical protein [Cucumis melo subsp. melo]7.5e-19969.61Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR
        MASS S P+T+P           TT   +SC HFLCK LFF I LLLLPLFPSEAP+FVN TLLT FWEL HL+FVGIAVSYGLFSRR++QVSV  DE R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVV
        FSNFENP SYLSK+ H  SIFEDV   + SDE K+ +V Y+QP+  S   F   NA SRQ          +RYENS EF DT++VGHA KSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSLRSNLT   E DDVEF  G+ES LSS+SSSK+SE NCER SEFG+NCC NLEEKFDE VIA +SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSE
        E  GKNMMRERGV N+ +RPSHFRP SIDETQFESLKKSRSLHS LSQSSQTSS S S S   TRKHRKMSSL NIS KS HSRQYS+ SLSENSR SSE
Subjt:  EKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSE

Query:  DHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAI
        D LIE EN SECNES++SS  LDRNFA IPKA+SRGKSVRTIRAN  A EEMK+QEM RNQVEHD N+G KFE  GG   YMRED  G+GWP + +PNA 
Subjt:  DHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAI

Query:  NSGRLPPKATLSEIERQIRMEDIESLLADD--AKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLKGGW
         S R P   T S IE Q   EDIES L DD   +DNSE ED S F SSDEEA  ASS+AG SESGA+EVDKKAGEFIAKFREQIQLQRMASV+ RL+GGW
Subjt:  NSGRLPPKATLSEIERQIRMEDIESLLADD--AKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLKGGW

Query:  GSFSSTSSSHFS
        GSFSSTSSS+FS
Subjt:  GSFSSTSSSHFS

KAG6575261.1 hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sororia]8.1e-19367.99Show/hide
Query:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN
        MASS S+P+T+      P      +SCA FLCK +FF   LLLLPLFPSEAPDFV+ TL T FWEL HL+FVGIAVSYGLFS R+ Q++VDE R+S+FEN
Subjt:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN

Query:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSE
        P SYLSK+ +  SIF+DV     SDE KV +V Y+QP   SA+   DLNA+SR          ++RYENSYEF DTDNV HA KSRY+RGGSVVVV ++ 
Subjt:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSE

Query:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNM
        R     SSSG IV+YKPLGLPVRSLRS+LT   E+DDVEF  G+ES LSS+SS KSSE NCE  SEFG+NCC NLEEKFDE  IAS+S FQ REK GK +
Subjt:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNM

Query:  MRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHLIETE
        +RERG GN+ +RPSHFRPPSIDETQFESL+KS SLHS LSQSSQTSS SS  S   TRKH KMSSLSNIS KSLHSRQYSM SLSENSR SSED LIE E
Subjt:  MRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHLIETE

Query:  NPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSGRLPP
        N SECNES++SS   DRNFASIPKA+S+GKSVR IRANA A E+MK+QEM R QV+HD  IG KFEEGG +  YMRED  G+GWP V NPNA N  R  P
Subjt:  NPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSGRLPP

Query:  KATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLK---GGWGSFSST
        K T   I+ Q   E+ ESL+ADD+KD SE EDES+FASSDEEA   SS+AG SESGA EVDKKAGEFIAKFREQIQLQRMASVE RL+   GGWGSFSST
Subjt:  KATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLK---GGWGSFSST

Query:  SSSHFS
        SSS+FS
Subjt:  SSSHFS

KAG7013816.1 hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyrosperma]3.1e-19267.99Show/hide
Query:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN
        MASS S+P+T+      P      +SCA FLCK +FF   LLLLPLFPSEAPDFV+ TL T FWEL HL+FVGIAVSYGLFS R+ Q++VDE R+S+FEN
Subjt:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN

Query:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSE
        P SYLSK+ +  SIF+DV     SDE KV +V Y+QP   SA+   DLNA+SR          ++RYENSYEF DTDNV HA KSRY+RGGSVVVV ++ 
Subjt:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSE

Query:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNM
        R     SSSG IV+YKPLGLPVRSLRS+LT   E+DDVEF  G+ES LSS+SS KSSE NCE  SEFG+NCC NLEEKFDE  IAS+S FQ REK GK +
Subjt:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNM

Query:  MRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHLIETE
        +RERG GN+ +RPSHFRPPSIDETQFESL+KS SLHS LSQSSQTSS SS  S   TRKH KMSSLSNIS KSLHSRQYSM SLSENSR SSED LIE E
Subjt:  MRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHLIETE

Query:  NPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSGRLPP
        N SECNES++SS   DRNFASIPKA+S+GKSVR IRANA A E+MK+QEM R QV+HD  IG KFEEGG +  YMRED  G+GWP V NPNA N  R  P
Subjt:  NPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSGRLPP

Query:  KATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLK---GGWGSFSST
        K T   I+ Q   E+ ESL+ADD+KD SE EDES FASSDEEA   SS+AG SESGA EVDKKAGEFIAKFREQIQLQRMASVE RL+   GGWGSFSST
Subjt:  KATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLK---GGWGSFSST

Query:  SSSHFS
        SSS+FS
Subjt:  SSSHFS

XP_004140631.1 uncharacterized protein LOC101220435 [Cucumis sativus]6.2e-19367.21Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR
        MA S S P+T+P           TT   +SC  F+CK LFF I LLLLPLFPSEAP+FVN T LT FWEL HL+F+GIAVSYGLFSRR++QVSV  DE R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVV
        FSNFENP SYLSK+FH  SIFEDV   + SDE K+ +V Y+QP+  S +    LNA SRQ          +RYENS EF +TDNVGHA KSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSL+S+LT   E DDVEF  G+ES LSS+SSSK+SE NCER SEFG+NCC NLEEKFDE VIAS+SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSE
        EK  KNMMRER V N+ +RPSHFRP SIDETQFESLKKS SLHS LSQSSQTSS SS  S  +TRKHRKMSSL NIS KS HSRQYS+ SLSENSR SSE
Subjt:  EKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSE

Query:  DHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAI
        D LI+ EN SECNES++SS  LDRNFA+ PKA+SRGKSVRT+RA+  A EEMK+QEM RNQVEHD N+  KFE  GG   YMREDE G+GWP + N NA 
Subjt:  DHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAI

Query:  NSGR---LPPKATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLKGG
         S R        T S IE Q   ED ES + DD KDNSE ED+S F SSDEEA  A S+ G SESGAHEVDKKAGEFIAKFREQIQLQRMASV+ RL+GG
Subjt:  NSGR---LPPKATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLKGG

Query:  WGSFSSTSSSHFS
        WGSFSST+SS+FS
Subjt:  WGSFSSTSSSHFS

XP_022157033.1 uncharacterized protein LOC111023860 [Momordica charantia]3.9e-30496.74Show/hide
Query:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
        MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
Subjt:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS

Query:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
        KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHA KSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
Subjt:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK

Query:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSESSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNMMRERGVGNSFVRPSHFRPP
        PLGLPVRSLRSNLTEESETDDVEFGEESGLSS+SSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKS KNMMRERGVGN FVRPSHFRPP
Subjt:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSESSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNMMRERGVGNSFVRPSHFRPP

Query:  SIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHLIETENPSECNESMLSSLHLDRNF
        S DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMM TRKHRKMSSLSNISSKSLHSRQYSMGSLSENSR SSEDHLIET+N SECNESM+SSLHLDRNF
Subjt:  SIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHLIETENPSECNESMLSSLHLDRNF

Query:  ASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSGRLPPKATLSEIERQIRMEDIESL
        ASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDE+GYGWPSVANPN INS RL PK TLS+IERQI+MEDIESL
Subjt:  ASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSGRLPPKATLSEIERQIRMEDIESL

Query:  LADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLKGGWGSFSSTSSSHFS
        LADD+KDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERL+GGWGSFSSTSSSHFS
Subjt:  LADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLKGGWGSFSSTSSSHFS

TrEMBL top hitse value%identityAlignment
A0A0A0K9X1 Uncharacterized protein3.0e-19367.21Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR
        MA S S P+T+P           TT   +SC  F+CK LFF I LLLLPLFPSEAP+FVN T LT FWEL HL+F+GIAVSYGLFSRR++QVSV  DE R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVV
        FSNFENP SYLSK+FH  SIFEDV   + SDE K+ +V Y+QP+  S +    LNA SRQ          +RYENS EF +TDNVGHA KSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSL+S+LT   E DDVEF  G+ES LSS+SSSK+SE NCER SEFG+NCC NLEEKFDE VIAS+SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSE
        EK  KNMMRER V N+ +RPSHFRP SIDETQFESLKKS SLHS LSQSSQTSS SS  S  +TRKHRKMSSL NIS KS HSRQYS+ SLSENSR SSE
Subjt:  EKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSE

Query:  DHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAI
        D LI+ EN SECNES++SS  LDRNFA+ PKA+SRGKSVRT+RA+  A EEMK+QEM RNQVEHD N+  KFE  GG   YMREDE G+GWP + N NA 
Subjt:  DHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAI

Query:  NSGR---LPPKATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLKGG
         S R        T S IE Q   ED ES + DD KDNSE ED+S F SSDEEA  A S+ G SESGAHEVDKKAGEFIAKFREQIQLQRMASV+ RL+GG
Subjt:  NSGR---LPPKATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLKGG

Query:  WGSFSSTSSSHFS
        WGSFSST+SS+FS
Subjt:  WGSFSSTSSSHFS

A0A5D3DMA5 DUF761 domain-containing protein3.6e-19969.61Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR
        MASS S P+T+P           TT   +SC HFLCK LFF I LLLLPLFPSEAP+FVN TLLT FWEL HL+FVGIAVSYGLFSRR++QVSV  DE R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVV
        FSNFENP SYLSK+ H  SIFEDV   + SDE K+ +V Y+QP+  S   F   NA SRQ          +RYENS EF DT++VGHA KSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSLRSNLT   E DDVEF  G+ES LSS+SSSK+SE NCER SEFG+NCC NLEEKFDE VIA +SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSE
        E  GKNMMRERGV N+ +RPSHFRP SIDETQFESLKKSRSLHS LSQSSQTSS S S S   TRKHRKMSSL NIS KS HSRQYS+ SLSENSR SSE
Subjt:  EKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSE

Query:  DHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAI
        D LIE EN SECNES++SS  LDRNFA IPKA+SRGKSVRTIRAN  A EEMK+QEM RNQVEHD N+G KFE  GG   YMRED  G+GWP + +PNA 
Subjt:  DHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAI

Query:  NSGRLPPKATLSEIERQIRMEDIESLLADD--AKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLKGGW
         S R P   T S IE Q   EDIES L DD   +DNSE ED S F SSDEEA  ASS+AG SESGA+EVDKKAGEFIAKFREQIQLQRMASV+ RL+GGW
Subjt:  NSGRLPPKATLSEIERQIRMEDIESLLADD--AKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLKGGW

Query:  GSFSSTSSSHFS
        GSFSSTSSS+FS
Subjt:  GSFSSTSSSHFS

A0A6J1DSC0 uncharacterized protein LOC1110238601.9e-30496.74Show/hide
Query:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
        MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
Subjt:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS

Query:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
        KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHA KSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
Subjt:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK

Query:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSESSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNMMRERGVGNSFVRPSHFRPP
        PLGLPVRSLRSNLTEESETDDVEFGEESGLSS+SSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKS KNMMRERGVGN FVRPSHFRPP
Subjt:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSESSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNMMRERGVGNSFVRPSHFRPP

Query:  SIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHLIETENPSECNESMLSSLHLDRNF
        S DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMM TRKHRKMSSLSNISSKSLHSRQYSMGSLSENSR SSEDHLIET+N SECNESM+SSLHLDRNF
Subjt:  SIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHLIETENPSECNESMLSSLHLDRNF

Query:  ASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSGRLPPKATLSEIERQIRMEDIESL
        ASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDE+GYGWPSVANPN INS RL PK TLS+IERQI+MEDIESL
Subjt:  ASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSGRLPPKATLSEIERQIRMEDIESL

Query:  LADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLKGGWGSFSSTSSSHFS
        LADD+KDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERL+GGWGSFSSTSSSHFS
Subjt:  LADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLKGGWGSFSSTSSSHFS

A0A6J1KUS4 uncharacterized protein LOC1114989001.3e-19167Show/hide
Query:  MASSTSNPYTR----------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFS
        MASS S+P+T+          P  T   +SCA FLCK LFF   LLLLPLFPSEAPDFV+ TL T FWEL HL+ VGIAVSYGLFS R+ Q++VDE R+S
Subjt:  MASSTSNPYTR----------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFS

Query:  NFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVVVV
        +FENP SYLSK+ +  SIF+DV   + SDE K+ +V Y+QP+  SA+   DLNA+SRQ          +RYENSYEF DTDNV HA KSRY+RGGSVVVV
Subjt:  NFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVVVV

Query:  ADSERNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKS
         ++ R     SSSG IV+YKPLGLPVRSL+S+LT   E+DDVEF  G+ES LSS+SS KSSE NCE  SEFG+NCC NLEEKFDE  IAS+S FQ REK 
Subjt:  ADSERNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKS

Query:  GKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHL
        GK ++RERG GN+ +RPSHFRPPSIDETQFESLKKS SLHS LSQSSQTSS SSS S   TRKH KMSSLSNIS KSLHSRQYSM SLSENSR SSED L
Subjt:  GKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHL

Query:  IETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSG
        IE EN SECNES++SS   D NF SIPKA+S+GKS+R I+ANA A E++K+QEM R QV+HD  IG KFEEGG +  Y+RED  G+GWP VANPNA N  
Subjt:  IETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSG

Query:  RLPPKATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLK--GGWGSF
        R P    L   E++   E+ ESL+ADD+KD+SE EDES FASSDEEA  ASS+AG SESGA EVDKKAGEFIAKFREQIQLQRMASVE RL+  GGWGSF
Subjt:  RLPPKATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLK--GGWGSF

Query:  SSTSSSHFS
        SSTSSS+FS
Subjt:  SSTSSSHFS

E5GCN2 Uncharacterized protein3.6e-19969.61Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR
        MASS S P+T+P           TT   +SC HFLCK LFF I LLLLPLFPSEAP+FVN TLLT FWEL HL+FVGIAVSYGLFSRR++QVSV  DE R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVV
        FSNFENP SYLSK+ H  SIFEDV   + SDE K+ +V Y+QP+  S   F   NA SRQ          +RYENS EF DT++VGHA KSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHAYKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSLRSNLT   E DDVEF  G+ES LSS+SSSK+SE NCER SEFG+NCC NLEEKFDE VIA +SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSESSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSE
        E  GKNMMRERGV N+ +RPSHFRP SIDETQFESLKKSRSLHS LSQSSQTSS S S S   TRKHRKMSSL NIS KS HSRQYS+ SLSENSR SSE
Subjt:  EKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSE

Query:  DHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAI
        D LIE EN SECNES++SS  LDRNFA IPKA+SRGKSVRTIRAN  A EEMK+QEM RNQVEHD N+G KFE  GG   YMRED  G+GWP + +PNA 
Subjt:  DHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEMGYGWPSVANPNAI

Query:  NSGRLPPKATLSEIERQIRMEDIESLLADD--AKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLKGGW
         S R P   T S IE Q   EDIES L DD   +DNSE ED S F SSDEEA  ASS+AG SESGA+EVDKKAGEFIAKFREQIQLQRMASV+ RL+GGW
Subjt:  NSGRLPPKATLSEIERQIRMEDIESLLADD--AKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLKGGW

Query:  GSFSSTSSSHFS
        GSFSSTSSS+FS
Subjt:  GSFSSTSSSHFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G60380.1 FUNCTIONS IN: molecular_function unknown2.3e-3636.29Show/hide
Query:  STSNPYTR----------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFE
        ++ NPYT+          P+   +      F CK + F++ LL LPLFPS+APDFV  T+LT FWEL+HLLFVGIAV+YGLFSRR+++ +VD       E
Subjt:  STSNPYTR----------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFE

Query:  NPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAES---RQRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSERNRSSS
        +  SY+S+IF  +S+F++       D+N  + V        SA       +ES        E S EF +T+ V  A+ S+Y +G S VVVA     R + 
Subjt:  NPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAES---RQRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSERNRSSS

Query:  SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEFGEESGLSSESSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNMMRERGVGNS
           G +VH +PLGLP+R LRS+L + +   D  F +        S   + N E  S   +N        FDE + A  SP  W+ +    MM   G+G++
Subjt:  SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEFGEESGLSSESSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNMMRERGVGNS

Query:  FVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSS
        +  PS+F+P S+DET      KS S  ST S SSQTS  S + +        + S   ++S++SL+S    +  + E SR SS
Subjt:  FVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSS

AT3G60380.1 FUNCTIONS IN: molecular_function unknown1.0e-0446.77Show/hide
Query:  SEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLKGG
        +E + E  F   +EE E A      +    +EVD+KAGEFIAKFREQI+LQ++ S E+ +GG
Subjt:  SEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLKGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTTCCACTTCCAACCCTTACACCAGGCCCCGTACTACCGACCGCCGTCACTCCTGCGCCCATTTTCTGTGTAAATTCCTCTTCTTCTCCATTCTCCTCCTCCT
CCTCCCTCTCTTCCCTTCCGAGGCTCCAGATTTCGTCAATCACACCTTGCTCACCAATTTCTGGGAGCTCCTTCACCTCCTCTTCGTCGGCATTGCTGTTTCATACGGCC
TATTTAGCAGAAGGAGTATCCAGGTGAGTGTAGACGAAACTCGCTTCTCCAATTTCGAAAATCCGCCCTCGTATTTGTCTAAGATCTTTCACGCCACTTCGATTTTTGAA
GATGTCCAACTTTTGACTGCTTCCGATGAGAATAAGGTGGACGATGTTTGGTACGTTCAGCCGCATCGCGAATCGGCGACCGATTTTGGGGATTTGAATGCCGAATCTCG
CCAACGAAGGTATGAAAATTCGTATGAATTTGTAGATACTGATAATGTTGGGCATGCTTATAAATCGAGATATTCTCGGGGTGGATCCGTGGTGGTAGTTGCTGATAGTG
AAAGAAATCGTAGTTCTAGTTCTAGTTCTGGTGCCATTGTACATTATAAACCTCTAGGTTTGCCTGTAAGGAGTTTGAGGTCAAATCTTACGGAAGAATCGGAAACCGAT
GATGTTGAATTTGGTGAGGAATCTGGTTTGAGTTCTGAAAGTTCATCCAAGAGCTCTGAGAATTGTGAAAGAAGAAGTGAATTTGGTGAGAATTGTTGTACGAATTTGGA
GGAGAAGTTTGATGAAGCTGTTATTGCATCATTGTCCCCATTTCAATGGCGTGAGAAATCTGGAAAGAATATGATGAGAGAGAGAGGAGTGGGGAATTCTTTTGTTCGCC
CTTCCCATTTTAGGCCTCCCTCCATTGATGAAACTCAATTTGAGTCCCTGAAAAAATCAAGGTCTCTTCATTCTACCCTATCTCAGTCGTCACAAACTAGTTCCTTTTCG
TCTTCATCGTCGATGATGAAGACGAGAAAGCACCGGAAAATGTCGTCGCTCAGTAACATTTCCTCAAAGTCATTGCATTCTCGGCAATACAGTATGGGCTCTCTGTCTGA
AAACAGTAGAGTGAGCTCCGAAGACCATCTGATAGAAACCGAAAATCCATCTGAGTGCAACGAATCCATGTTAAGTTCCCTGCACTTGGACAGGAACTTCGCAAGTATTC
CGAAAGCTGTATCGCGGGGAAAATCCGTTAGAACGATTAGAGCAAATGCAGTTGCTGCAGAGGAAATGAAATCCCAAGAGATGGACAGAAACCAAGTTGAACATGATATC
AATATAGGGAAGAAGTTTGAAGAGGGTGGAGGAGCATTATCATATATGAGAGAAGATGAAATGGGATATGGATGGCCTAGTGTTGCTAACCCGAATGCTATTAATTCGGG
TCGTTTGCCACCAAAGGCGACGTTGTCGGAGATCGAGAGGCAGATCAGGATGGAAGACATCGAGAGTCTGCTGGCAGATGATGCGAAAGATAACTCCGAGATGGAGGATG
AGAGTATTTTTGCAAGTTCAGATGAAGAAGCTGAAGTTGCTTCGAGTGTGGCTGGGGGGTCGGAATCGGGGGCTCACGAGGTCGACAAGAAGGCCGGTGAGTTCATAGCG
AAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGAAAGATTGAAAGGAGGATGGGGATCATTCAGCAGCACAAGCAGCAGCCATTTCAGT
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTTCCACTTCCAACCCTTACACCAGGCCCCGTACTACCGACCGCCGTCACTCCTGCGCCCATTTTCTGTGTAAATTCCTCTTCTTCTCCATTCTCCTCCTCCT
CCTCCCTCTCTTCCCTTCCGAGGCTCCAGATTTCGTCAATCACACCTTGCTCACCAATTTCTGGGAGCTCCTTCACCTCCTCTTCGTCGGCATTGCTGTTTCATACGGCC
TATTTAGCAGAAGGAGTATCCAGGTGAGTGTAGACGAAACTCGCTTCTCCAATTTCGAAAATCCGCCCTCGTATTTGTCTAAGATCTTTCACGCCACTTCGATTTTTGAA
GATGTCCAACTTTTGACTGCTTCCGATGAGAATAAGGTGGACGATGTTTGGTACGTTCAGCCGCATCGCGAATCGGCGACCGATTTTGGGGATTTGAATGCCGAATCTCG
CCAACGAAGGTATGAAAATTCGTATGAATTTGTAGATACTGATAATGTTGGGCATGCTTATAAATCGAGATATTCTCGGGGTGGATCCGTGGTGGTAGTTGCTGATAGTG
AAAGAAATCGTAGTTCTAGTTCTAGTTCTGGTGCCATTGTACATTATAAACCTCTAGGTTTGCCTGTAAGGAGTTTGAGGTCAAATCTTACGGAAGAATCGGAAACCGAT
GATGTTGAATTTGGTGAGGAATCTGGTTTGAGTTCTGAAAGTTCATCCAAGAGCTCTGAGAATTGTGAAAGAAGAAGTGAATTTGGTGAGAATTGTTGTACGAATTTGGA
GGAGAAGTTTGATGAAGCTGTTATTGCATCATTGTCCCCATTTCAATGGCGTGAGAAATCTGGAAAGAATATGATGAGAGAGAGAGGAGTGGGGAATTCTTTTGTTCGCC
CTTCCCATTTTAGGCCTCCCTCCATTGATGAAACTCAATTTGAGTCCCTGAAAAAATCAAGGTCTCTTCATTCTACCCTATCTCAGTCGTCACAAACTAGTTCCTTTTCG
TCTTCATCGTCGATGATGAAGACGAGAAAGCACCGGAAAATGTCGTCGCTCAGTAACATTTCCTCAAAGTCATTGCATTCTCGGCAATACAGTATGGGCTCTCTGTCTGA
AAACAGTAGAGTGAGCTCCGAAGACCATCTGATAGAAACCGAAAATCCATCTGAGTGCAACGAATCCATGTTAAGTTCCCTGCACTTGGACAGGAACTTCGCAAGTATTC
CGAAAGCTGTATCGCGGGGAAAATCCGTTAGAACGATTAGAGCAAATGCAGTTGCTGCAGAGGAAATGAAATCCCAAGAGATGGACAGAAACCAAGTTGAACATGATATC
AATATAGGGAAGAAGTTTGAAGAGGGTGGAGGAGCATTATCATATATGAGAGAAGATGAAATGGGATATGGATGGCCTAGTGTTGCTAACCCGAATGCTATTAATTCGGG
TCGTTTGCCACCAAAGGCGACGTTGTCGGAGATCGAGAGGCAGATCAGGATGGAAGACATCGAGAGTCTGCTGGCAGATGATGCGAAAGATAACTCCGAGATGGAGGATG
AGAGTATTTTTGCAAGTTCAGATGAAGAAGCTGAAGTTGCTTCGAGTGTGGCTGGGGGGTCGGAATCGGGGGCTCACGAGGTCGACAAGAAGGCCGGTGAGTTCATAGCG
AAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGAAAGATTGAAAGGAGGATGGGGATCATTCAGCAGCACAAGCAGCAGCCATTTCAGT
Protein sequenceShow/hide protein sequence
MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLSKIFHATSIFE
DVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHAYKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETD
DVEFGEESGLSSESSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSGKNMMRERGVGNSFVRPSHFRPPSIDETQFESLKKSRSLHSTLSQSSQTSSFS
SSSSMMKTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRVSSEDHLIETENPSECNESMLSSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDI
NIGKKFEEGGGALSYMREDEMGYGWPSVANPNAINSGRLPPKATLSEIERQIRMEDIESLLADDAKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIA
KFREQIQLQRMASVERLKGGWGSFSSTSSSHFS