; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g09210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g09210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDUF761 domain-containing protein
Genome locationchr7:7067107..7068852
RNA-Seq ExpressionMoc07g09210
SyntenyMoc07g09210
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34231.1 hypothetical protein [Cucumis melo subsp. melo]1.2e-19970.59Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR
        MASS S P+T+P           TT   +SC HFLCK LFF I LLLLPLFPSEAP+FVN TLLT FWEL HL+FVGIAVSYGLFSRR++QVSV  DE R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV
        FSNFENP SYLSK+ H  SIFEDV   + SDE K+ +V Y+QP+  S   F   NA SRQ          +RYENS EF DT++VGHACKSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSLRSNLT   E DDVEF  G+ES LSSKSSSK+SE NCER SEFG+NCC NLEEKFDE VIA +SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE
        E   KNMMRERGV N  +RPSHFRP S DETQFESLKKSRSLHS LSQSSQTSS S S S  TTRKHRKMSSL NIS KS HSRQYS+ SLSENSRGSSE
Subjt:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE

Query:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI
        D LIE +NSSECNES+ISS  LDRNFA IPKA+SRGKSVRTIRAN  A EEMK+QEM RNQVEHD N+G KFE  GG   YMRED  G+GWP + +PN  
Subjt:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI

Query:  NSNRLPK-TTLSKIERQIKMEDIESLLADD--SKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW
         SNR PK TT S IE Q   EDIES L DD   +DNSE ED S F SSDEEA  ASS+AG SESGA+EVDKKAGEFIAKFREQIQLQRMASV+ RLRGGW
Subjt:  NSNRLPK-TTLSKIERQIKMEDIESLLADD--SKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW

Query:  GSFSSTSSSHFS
        GSFSSTSSS+FS
Subjt:  GSFSSTSSSHFS

KAG6575261.1 hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sororia]7.8e-19668.93Show/hide
Query:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN
        MASS S+P+T+      P      +SCA FLCK +FF   LLLLPLFPSEAPDFV+ TL T FWEL HL+FVGIAVSYGLFS R+ Q++VDE R+S+FEN
Subjt:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN

Query:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE
        P SYLSK+ +  SIF+DV     SDE KV +V Y+QP   SA+   DLNA+SR          ++RYENSYEF DTDNV HACKSRY+RGGSVVVV ++ 
Subjt:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE

Query:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM
        R     SSSG IV+YKPLGLPVRSLRS+LT   E+DDVEF  G+ES LSSKSS KSSE NCE  SEFG+NCC NLEEKFDE  IAS+S FQ REK  K +
Subjt:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM

Query:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK
        +RERG GN  +RPSHFRPPS DETQFESL+KS SLHS LSQSSQTSS SS  S  TTRKH KMSSLSNIS KSLHSRQYSM SLSENSRGSSED LIE +
Subjt:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK

Query:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK
        NSSECNES++SS   DRNFASIPKA+S+GKSVR IRANA A E+MK+QEM R QV+HD  IG KFEEGG +  YMRED  G+GWP V NPN  N NR PK
Subjt:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK

Query:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLR---GGWGSFSSTS
        TT   I+ Q   E+ ESL+ADDSKD SE EDES+FASSDEEA   SS+AG SESGA EVDKKAGEFIAKFREQIQLQRMASVE RLR   GGWGSFSSTS
Subjt:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLR---GGWGSFSSTS

Query:  SSHFS
        SS+FS
Subjt:  SSHFS

KAG7013816.1 hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-19568.93Show/hide
Query:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN
        MASS S+P+T+      P      +SCA FLCK +FF   LLLLPLFPSEAPDFV+ TL T FWEL HL+FVGIAVSYGLFS R+ Q++VDE R+S+FEN
Subjt:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN

Query:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE
        P SYLSK+ +  SIF+DV     SDE KV +V Y+QP   SA+   DLNA+SR          ++RYENSYEF DTDNV HACKSRY+RGGSVVVV ++ 
Subjt:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE

Query:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM
        R     SSSG IV+YKPLGLPVRSLRS+LT   E+DDVEF  G+ES LSSKSS KSSE NCE  SEFG+NCC NLEEKFDE  IAS+S FQ REK  K +
Subjt:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM

Query:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK
        +RERG GN  +RPSHFRPPS DETQFESL+KS SLHS LSQSSQTSS SS  S  TTRKH KMSSLSNIS KSLHSRQYSM SLSENSRGSSED LIE +
Subjt:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK

Query:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK
        NSSECNES++SS   DRNFASIPKA+S+GKSVR IRANA A E+MK+QEM R QV+HD  IG KFEEGG +  YMRED  G+GWP V NPN  N NR PK
Subjt:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK

Query:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLR---GGWGSFSSTS
        TT   I+ Q   E+ ESL+ADDSKD SE EDES FASSDEEA   SS+AG SESGA EVDKKAGEFIAKFREQIQLQRMASVE RLR   GGWGSFSSTS
Subjt:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLR---GGWGSFSSTS

Query:  SSHFS
        SS+FS
Subjt:  SSHFS

XP_004140631.1 uncharacterized protein LOC101220435 [Cucumis sativus]1.3e-19568.19Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR
        MA S S P+T+P           TT   +SC  F+CK LFF I LLLLPLFPSEAP+FVN T LT FWEL HL+F+GIAVSYGLFSRR++QVSV  DE R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV
        FSNFENP SYLSK+FH  SIFEDV   + SDE K+ +V Y+QP+  S +    LNA SRQ          +RYENS EF +TDNVGHACKSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSL+S+LT   E DDVEF  G+ES LSSKSSSK+SE NCER SEFG+NCC NLEEKFDE VIAS+SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE
        EK EKNMMRER V N  +RPSHFRP S DETQFESLKKS SLHS LSQSSQTSS SS  S   TRKHRKMSSL NIS KS HSRQYS+ SLSENSRGSSE
Subjt:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE

Query:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI
        D LI+ +NSSECNES++SS  LDRNFA+ PKA+SRGKSVRT+RA+  A EEMK+QEM RNQVEHD N+  KFE  GG   YMREDE G+GWP + N N  
Subjt:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI

Query:  NSNRLPK----TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG
         SNR  K    TT S IE Q   ED ES + DD KDNSE ED+S F SSDEEA  A S+ G SESGAHEVDKKAGEFIAKFREQIQLQRMASV+ RLRGG
Subjt:  NSNRLPK----TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG

Query:  WGSFSSTSSSHFS
        WGSFSST+SS+FS
Subjt:  WGSFSSTSSSHFS

XP_022157033.1 uncharacterized protein LOC111023860 [Momordica charantia]0.0e+00100Show/hide
Query:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
        MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
Subjt:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS

Query:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
        KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
Subjt:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK

Query:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP
        PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP
Subjt:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP

Query:  SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA
        SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA
Subjt:  SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA

Query:  SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA
        SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA
Subjt:  SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA

Query:  DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS
        DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS
Subjt:  DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS

TrEMBL top hitse value%identityAlignment
A0A0A0K9X1 Uncharacterized protein6.4e-19668.19Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR
        MA S S P+T+P           TT   +SC  F+CK LFF I LLLLPLFPSEAP+FVN T LT FWEL HL+F+GIAVSYGLFSRR++QVSV  DE R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV
        FSNFENP SYLSK+FH  SIFEDV   + SDE K+ +V Y+QP+  S +    LNA SRQ          +RYENS EF +TDNVGHACKSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSL+S+LT   E DDVEF  G+ES LSSKSSSK+SE NCER SEFG+NCC NLEEKFDE VIAS+SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE
        EK EKNMMRER V N  +RPSHFRP S DETQFESLKKS SLHS LSQSSQTSS SS  S   TRKHRKMSSL NIS KS HSRQYS+ SLSENSRGSSE
Subjt:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE

Query:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI
        D LI+ +NSSECNES++SS  LDRNFA+ PKA+SRGKSVRT+RA+  A EEMK+QEM RNQVEHD N+  KFE  GG   YMREDE G+GWP + N N  
Subjt:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI

Query:  NSNRLPK----TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG
         SNR  K    TT S IE Q   ED ES + DD KDNSE ED+S F SSDEEA  A S+ G SESGAHEVDKKAGEFIAKFREQIQLQRMASV+ RLRGG
Subjt:  NSNRLPK----TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG

Query:  WGSFSSTSSSHFS
        WGSFSST+SS+FS
Subjt:  WGSFSSTSSSHFS

A0A5D3DMA5 DUF761 domain-containing protein5.6e-20070.59Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR
        MASS S P+T+P           TT   +SC HFLCK LFF I LLLLPLFPSEAP+FVN TLLT FWEL HL+FVGIAVSYGLFSRR++QVSV  DE R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV
        FSNFENP SYLSK+ H  SIFEDV   + SDE K+ +V Y+QP+  S   F   NA SRQ          +RYENS EF DT++VGHACKSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSLRSNLT   E DDVEF  G+ES LSSKSSSK+SE NCER SEFG+NCC NLEEKFDE VIA +SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE
        E   KNMMRERGV N  +RPSHFRP S DETQFESLKKSRSLHS LSQSSQTSS S S S  TTRKHRKMSSL NIS KS HSRQYS+ SLSENSRGSSE
Subjt:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE

Query:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI
        D LIE +NSSECNES+ISS  LDRNFA IPKA+SRGKSVRTIRAN  A EEMK+QEM RNQVEHD N+G KFE  GG   YMRED  G+GWP + +PN  
Subjt:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI

Query:  NSNRLPK-TTLSKIERQIKMEDIESLLADD--SKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW
         SNR PK TT S IE Q   EDIES L DD   +DNSE ED S F SSDEEA  ASS+AG SESGA+EVDKKAGEFIAKFREQIQLQRMASV+ RLRGGW
Subjt:  NSNRLPK-TTLSKIERQIKMEDIESLLADD--SKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW

Query:  GSFSSTSSSHFS
        GSFSSTSSS+FS
Subjt:  GSFSSTSSSHFS

A0A6J1DSC0 uncharacterized protein LOC1110238600.0e+00100Show/hide
Query:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
        MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
Subjt:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS

Query:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
        KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
Subjt:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK

Query:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP
        PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP
Subjt:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP

Query:  SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA
        SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA
Subjt:  SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA

Query:  SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA
        SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA
Subjt:  SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA

Query:  DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS
        DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS
Subjt:  DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS

A0A6J1H4M0 uncharacterized protein LOC1114599981.6e-19468.59Show/hide
Query:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN
        MASS S+P+T+      P      +SCA FLCK +FF   LLLLPLFPSEAPDFV+ TL T FWEL HL+FVGIAVSYGLFS R+ Q++VDE R+S+FEN
Subjt:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN

Query:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE
        P SYLSK+ +  SIF+DV     SDE KV +V Y+QP   SA+   DLNA+SR          ++RYENSYEF DTDNV HACKSRY+RGGSVVVV ++ 
Subjt:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE

Query:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM
        R     SSSG IV+YKPLGLPVRSLRS+LT   E+DDVEF  G+ES LSSKSS KSSE NCE  SEFG+NCC NLEEKFDE  IAS+S FQ REK  K +
Subjt:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM

Query:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK
        +RERG GN  +RPSHFRPPS DETQFESLKKS SLHS LSQSSQTSS SS  S  TTRK RKMSSLSNIS KSLHSRQYS  SLSENSRGSSED LIE +
Subjt:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK

Query:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK
        NSSECNES++SS   DRNFASIPKA+S+GKSVR IRANA A E+MK+QEM R QV+HD  IG KFEEGG +  YMRED  G GWP V NPN  N NR PK
Subjt:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK

Query:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLR------GGWGSFS
        TT   I+ Q   E+ ESL+ADDSKD+SE EDES+FASSDEEA   SS+AG SESGA EVDKKAGEFIAKFREQIQLQRMASVE RLR      GGWGSFS
Subjt:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLR------GGWGSFS

Query:  STSSSHFS
        STSSS+FS
Subjt:  STSSSHFS

E5GCN2 Uncharacterized protein5.6e-20070.59Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR
        MASS S P+T+P           TT   +SC HFLCK LFF I LLLLPLFPSEAP+FVN TLLT FWEL HL+FVGIAVSYGLFSRR++QVSV  DE R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSV--DETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV
        FSNFENP SYLSK+ H  SIFEDV   + SDE K+ +V Y+QP+  S   F   NA SRQ          +RYENS EF DT++VGHACKSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSLRSNLT   E DDVEF  G+ES LSSKSSSK+SE NCER SEFG+NCC NLEEKFDE VIA +SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE
        E   KNMMRERGV N  +RPSHFRP S DETQFESLKKSRSLHS LSQSSQTSS S S S  TTRKHRKMSSL NIS KS HSRQYS+ SLSENSRGSSE
Subjt:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE

Query:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI
        D LIE +NSSECNES+ISS  LDRNFA IPKA+SRGKSVRTIRAN  A EEMK+QEM RNQVEHD N+G KFE  GG   YMRED  G+GWP + +PN  
Subjt:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI

Query:  NSNRLPK-TTLSKIERQIKMEDIESLLADD--SKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW
         SNR PK TT S IE Q   EDIES L DD   +DNSE ED S F SSDEEA  ASS+AG SESGA+EVDKKAGEFIAKFREQIQLQRMASV+ RLRGGW
Subjt:  NSNRLPK-TTLSKIERQIKMEDIESLLADD--SKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW

Query:  GSFSSTSSSHFS
        GSFSSTSSS+FS
Subjt:  GSFSSTSSSHFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G60380.1 FUNCTIONS IN: molecular_function unknown2.8e-3436.55Show/hide
Query:  STSNPYTR----------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFE
        ++ NPYT+          P+   +      F CK + F++ LL LPLFPS+APDFV  T+LT FWEL+HLLFVGIAV+YGLFSRR+++ +VD       E
Subjt:  STSNPYTR----------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFE

Query:  NPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAES---RQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSS
        +  SY+S+IF  +S+F++       D+N  + V        SA       +ES        E S EF +T+ V  A  S+Y +G S VVVA     R + 
Subjt:  NPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAES---RQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSS

Query:  SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNY
           G +VH +PLGLP+R LRS+L + +   D  F +        S   + N E  S   +N        FDE + A  SP  W+ + E  MM   G+G+ 
Subjt:  SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNY

Query:  FVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSS
        +  PS+F+P S DET      KS S  ST S SSQTS  S + +        + S   ++S++SL+S    +  + E SR SS
Subjt:  FVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSS

AT3G60380.1 FUNCTIONS IN: molecular_function unknown6.1e-0548.39Show/hide
Query:  SEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGG
        +E + E  F   +EE E A      +    +EVD+KAGEFIAKFREQI+LQ++ S E+ RGG
Subjt:  SEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTTCCACTTCCAACCCTTACACCAGGCCCCGTACTACCGACCGCCGTCACTCCTGCGCCCATTTTCTGTGTAAATTCCTCTTCTTCTCCATTCTCCTCCTCCT
CCTCCCTCTCTTCCCTTCCGAGGCTCCAGATTTCGTCAATCACACCTTGCTCACCAATTTCTGGGAGCTCCTTCACCTCCTCTTCGTCGGCATTGCTGTTTCATACGGCC
TGTTTAGCAGAAGGAGTATCCAGGTGAGTGTAGACGAAACTCGCTTCTCCAATTTCGAAAATCCGCCCTCGTATTTGTCTAAGATCTTTCACGCCACTTCGATTTTTGAA
GATGTCCAACTTTTGACTGCTTCCGATGAGAATAAGGTGGACGATGTTTGGTACGTTCAGCCGCATCGCGAATCGGCGACCGATTTTGGGGATTTGAATGCCGAATCTCG
CCAACGAAGGTATGAAAATTCGTATGAATTTGTAGATACTGATAATGTTGGGCATGCTTGTAAATCGAGATATTCTCGGGGTGGATCCGTGGTGGTAGTTGCTGATAGTG
AAAGAAATCGTAGTTCTAGTTCTAGTTCTGGTGCCATTGTACATTATAAACCTCTAGGTTTGCCTGTAAGGAGTTTGAGGTCAAATCTTACGGAAGAATCGGAAACCGAT
GATGTTGAATTTGGTGAGGAATCTGGTTTGAGTTCTAAAAGTTCATCCAAGAGCTCTGAGAATTGTGAAAGAAGAAGTGAATTTGGTGAGAATTGTTGTACGAATTTGGA
GGAGAAGTTTGATGAAGCTGTTATTGCATCATTGTCCCCATTTCAATGGCGTGAGAAATCTGAAAAGAATATGATGAGAGAGAGAGGAGTGGGGAATTATTTTGTTCGCC
CTTCCCATTTTAGGCCTCCCTCTGATGAAACTCAATTTGAATCCCTGAAAAAATCAAGGTCTCTTCATTCTACCCTATCTCAGTCGTCACAAACTAGTTCCTTCTCGTCT
TCATCGTCGATGATGACGACAAGAAAGCACCGGAAAATGTCGTCGCTCAGCAACATTTCCTCAAAGTCATTGCATTCTCGGCAATACAGTATGGGCTCTCTGTCTGAAAA
CAGTAGAGGGAGCTCCGAAGACCATCTGATAGAAACCAAAAATTCATCCGAGTGCAACGAATCCATGATAAGTTCCCTGCACTTGGACAGGAACTTCGCAAGTATTCCGA
AAGCTGTATCGCGGGGAAAATCCGTTAGAACGATTAGAGCAAATGCAGTTGCTGCAGAGGAAATGAAATCCCAAGAGATGGACAGAAACCAAGTTGAACATGATATCAAT
ATAGGGAAGAAGTTTGAAGAGGGTGGAGGAGCATTATCATATATGAGAGAAGATGAAATAGGATATGGATGGCCTAGTGTTGCTAACCCGAACACTATTAATTCGAATCG
TTTGCCAAAGACGACGTTGTCGAAGATTGAGAGGCAGATCAAGATGGAAGACATCGAGAGTCTGCTGGCAGATGATTCCAAAGATAACTCCGAGATGGAGGATGAGAGTA
TTTTTGCAAGTTCAGATGAAGAAGCTGAAGTTGCTTCGAGTGTGGCTGGCGGGTCGGAATCGGGGGCTCACGAGGTCGACAAGAAGGCCGGTGAGTTCATAGCAAAGTTC
AGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGAAAGATTGAGAGGAGGATGGGGATCATTCAGCAGCACAAGCAGCAGCCATTTCAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTTCCACTTCCAACCCTTACACCAGGCCCCGTACTACCGACCGCCGTCACTCCTGCGCCCATTTTCTGTGTAAATTCCTCTTCTTCTCCATTCTCCTCCTCCT
CCTCCCTCTCTTCCCTTCCGAGGCTCCAGATTTCGTCAATCACACCTTGCTCACCAATTTCTGGGAGCTCCTTCACCTCCTCTTCGTCGGCATTGCTGTTTCATACGGCC
TGTTTAGCAGAAGGAGTATCCAGGTGAGTGTAGACGAAACTCGCTTCTCCAATTTCGAAAATCCGCCCTCGTATTTGTCTAAGATCTTTCACGCCACTTCGATTTTTGAA
GATGTCCAACTTTTGACTGCTTCCGATGAGAATAAGGTGGACGATGTTTGGTACGTTCAGCCGCATCGCGAATCGGCGACCGATTTTGGGGATTTGAATGCCGAATCTCG
CCAACGAAGGTATGAAAATTCGTATGAATTTGTAGATACTGATAATGTTGGGCATGCTTGTAAATCGAGATATTCTCGGGGTGGATCCGTGGTGGTAGTTGCTGATAGTG
AAAGAAATCGTAGTTCTAGTTCTAGTTCTGGTGCCATTGTACATTATAAACCTCTAGGTTTGCCTGTAAGGAGTTTGAGGTCAAATCTTACGGAAGAATCGGAAACCGAT
GATGTTGAATTTGGTGAGGAATCTGGTTTGAGTTCTAAAAGTTCATCCAAGAGCTCTGAGAATTGTGAAAGAAGAAGTGAATTTGGTGAGAATTGTTGTACGAATTTGGA
GGAGAAGTTTGATGAAGCTGTTATTGCATCATTGTCCCCATTTCAATGGCGTGAGAAATCTGAAAAGAATATGATGAGAGAGAGAGGAGTGGGGAATTATTTTGTTCGCC
CTTCCCATTTTAGGCCTCCCTCTGATGAAACTCAATTTGAATCCCTGAAAAAATCAAGGTCTCTTCATTCTACCCTATCTCAGTCGTCACAAACTAGTTCCTTCTCGTCT
TCATCGTCGATGATGACGACAAGAAAGCACCGGAAAATGTCGTCGCTCAGCAACATTTCCTCAAAGTCATTGCATTCTCGGCAATACAGTATGGGCTCTCTGTCTGAAAA
CAGTAGAGGGAGCTCCGAAGACCATCTGATAGAAACCAAAAATTCATCCGAGTGCAACGAATCCATGATAAGTTCCCTGCACTTGGACAGGAACTTCGCAAGTATTCCGA
AAGCTGTATCGCGGGGAAAATCCGTTAGAACGATTAGAGCAAATGCAGTTGCTGCAGAGGAAATGAAATCCCAAGAGATGGACAGAAACCAAGTTGAACATGATATCAAT
ATAGGGAAGAAGTTTGAAGAGGGTGGAGGAGCATTATCATATATGAGAGAAGATGAAATAGGATATGGATGGCCTAGTGTTGCTAACCCGAACACTATTAATTCGAATCG
TTTGCCAAAGACGACGTTGTCGAAGATTGAGAGGCAGATCAAGATGGAAGACATCGAGAGTCTGCTGGCAGATGATTCCAAAGATAACTCCGAGATGGAGGATGAGAGTA
TTTTTGCAAGTTCAGATGAAGAAGCTGAAGTTGCTTCGAGTGTGGCTGGCGGGTCGGAATCGGGGGCTCACGAGGTCGACAAGAAGGCCGGTGAGTTCATAGCAAAGTTC
AGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGAAAGATTGAGAGGAGGATGGGGATCATTCAGCAGCACAAGCAGCAGCCATTTCAGTTGA
Protein sequenceShow/hide protein sequence
MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLSKIFHATSIFE
DVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETD
DVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPPSDETQFESLKKSRSLHSTLSQSSQTSSFSS
SSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDIN
IGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKF
REQIQLQRMASVERLRGGWGSFSSTSSSHFS