; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC07g0194 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC07g0194
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDUF761 domain-containing protein
Genome locationMC07:4662973..4664715
RNA-Seq ExpressionMC07g0194
SyntenyMC07g0194
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34231.1 hypothetical protein [Cucumis melo subsp. melo]8.10e-24870.59Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVD--ETR
        MASS S P+T+P           TT   +SC HFLCK LFF I LLLLPLFPSEAP+FVN TLLT FWEL HL+FVGIAVSYGLFSRR++QVSVD  E R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVD--ETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV
        FSNFENP SYLSK+ H  SIFEDV   + SDE K+ +V Y+QP+  S   F   NA SRQ          +RYENS EF DT++VGHACKSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSLRSNLTE    DDVEF  G+ES LSSKSSSK+SE NCER SEFG+NCC NLEEKFDE VIA +SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE
        E   KNMMRERGV N  +RPSHFRP S DETQFESLKKSRSLHS LSQSSQTSS S S S  TTRKHRKMSSL NIS KS HSRQYS+ SLSENSRGSSE
Subjt:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE

Query:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI
        D LIE +NSSECNES+ISS  LDRNFA IPKA+SRGKSVRTIRAN  A EEMK+QEM RNQVEHD N+G KFE  GG   YMRED  G+GWP + +PN  
Subjt:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI

Query:  NSNRLPKTT-LSKIERQIKMEDIESLLADDS--KDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW
         SNR PKTT  S IE Q   EDIES L DD   +DNSE ED S F SSDEEA  ASS+AG SESGA+EVDKKAGEFIAKFREQIQLQRMASV+ RLRGGW
Subjt:  NSNRLPKTT-LSKIERQIKMEDIESLLADDS--KDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW

Query:  GSFSSTSSSHFS
        GSFSSTSSS+FS
Subjt:  GSFSSTSSSHFS

KAG6575261.1 hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sororia]2.88e-24368.93Show/hide
Query:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN
        MASS S+P+T+      P      +SCA FLCK +FF   LLLLPLFPSEAPDFV+ TL T FWEL HL+FVGIAVSYGLFS R+ Q++VDE R+S+FEN
Subjt:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN

Query:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE
        P SYLSK+ +  SIF+DV     SDE KV +V Y+QP   SA+D   LNA+SR          ++RYENSYEF DTDNV HACKSRY+RGGSVVVV ++ 
Subjt:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE

Query:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSEN-CERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM
        R     SSSG IV+YKPLGLPVRSLRS+LTE   +DDVEF  G+ES LSSKSS KSSEN CE  SEFG+NCC NLEEKFDE  IAS+S FQ REK  K +
Subjt:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSEN-CERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM

Query:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK
        +RERG GN  +RPSHFRPPS DETQFESL+KS SLHS LSQSSQTSS SS  S  TTRKH KMSSLSNIS KSLHSRQYSM SLSENSRGSSED LIE +
Subjt:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK

Query:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK
        NSSECNES++SS   DRNFASIPKA+S+GKSVR IRANA A E+MK+QEM R QV+HD  IG KFEEGG +  YMRED  G+GWP V NPN  N NR PK
Subjt:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK

Query:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG---WGSFSSTS
        TT   I+ Q   E+ ESL+ADDSKD SE EDES+FASSDEEA   SS+AG SESGA EVDKKAGEFIAKFREQIQLQRMASVE RLRGG   WGSFSSTS
Subjt:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG---WGSFSSTS

Query:  SSHFS
        SS+FS
Subjt:  SSHFS

KAG7013816.1 hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyrosperma]1.65e-24268.93Show/hide
Query:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN
        MASS S+P+T+      P      +SCA FLCK +FF   LLLLPLFPSEAPDFV+ TL T FWEL HL+FVGIAVSYGLFS R+ Q++VDE R+S+FEN
Subjt:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN

Query:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE
        P SYLSK+ +  SIF+DV     SDE KV +V Y+QP   SA+D   LNA+SR          ++RYENSYEF DTDNV HACKSRY+RGGSVVVV ++ 
Subjt:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE

Query:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSEN-CERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM
        R     SSSG IV+YKPLGLPVRSLRS+LTE   +DDVEF  G+ES LSSKSS KSSEN CE  SEFG+NCC NLEEKFDE  IAS+S FQ REK  K +
Subjt:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSEN-CERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM

Query:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK
        +RERG GN  +RPSHFRPPS DETQFESL+KS SLHS LSQSSQTSS SS  S  TTRKH KMSSLSNIS KSLHSRQYSM SLSENSRGSSED LIE +
Subjt:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK

Query:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK
        NSSECNES++SS   DRNFASIPKA+S+GKSVR IRANA A E+MK+QEM R QV+HD  IG KFEEGG +  YMRED  G+GWP V NPN  N NR PK
Subjt:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK

Query:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG---WGSFSSTS
        TT   I+ Q   E+ ESL+ADDSKD SE EDES FASSDEEA   SS+AG SESGA EVDKKAGEFIAKFREQIQLQRMASVE RLRGG   WGSFSSTS
Subjt:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG---WGSFSSTS

Query:  SSHFS
        SS+FS
Subjt:  SSHFS

XP_004140631.1 uncharacterized protein LOC101220435 [Cucumis sativus]8.57e-24368.19Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVD--ETR
        MA S S P+T+P           TT   +SC  F+CK LFF I LLLLPLFPSEAP+FVN T LT FWEL HL+F+GIAVSYGLFSRR++QVSVD  E R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVD--ETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV
        FSNFENP SYLSK+FH  SIFEDV   + SDE K+ +V Y+QP+  S +    LNA SRQ          +RYENS EF +TDNVGHACKSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSL+S+LTE    DDVEF  G+ES LSSKSSSK+SE NCER SEFG+NCC NLEEKFDE VIAS+SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE
        EK EKNMMRER V N  +RPSHFRP S DETQFESLKKS SLHS LSQSSQTSS SS  S  T RKHRKMSSL NIS KS HSRQYS+ SLSENSRGSSE
Subjt:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE

Query:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI
        D LI+ +NSSECNES++SS  LDRNFA+ PKA+SRGKSVRT+RA+  A EEMK+QEM RNQVEHD N+  KFE  GG   YMREDE G+GWP + N N  
Subjt:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI

Query:  NSNRLPKTT----LSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG
         SNR  KTT     S IE Q   ED ES + DD KDNSE ED+S F SSDEEA  A S+ G SESGAHEVDKKAGEFIAKFREQIQLQRMASV+ RLRGG
Subjt:  NSNRLPKTT----LSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG

Query:  WGSFSSTSSSHFS
        WGSFSST+SS+FS
Subjt:  WGSFSSTSSSHFS

XP_022157033.1 uncharacterized protein LOC111023860 [Momordica charantia]0.0100Show/hide
Query:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
        MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
Subjt:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS

Query:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
        KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
Subjt:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK

Query:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP
        PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP
Subjt:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP

Query:  SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA
        SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA
Subjt:  SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA

Query:  SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA
        SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA
Subjt:  SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA

Query:  DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS
        DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS
Subjt:  DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS

TrEMBL top hitse value%identityAlignment
A0A0A0K9X1 Uncharacterized protein4.15e-24368.19Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVD--ETR
        MA S S P+T+P           TT   +SC  F+CK LFF I LLLLPLFPSEAP+FVN T LT FWEL HL+F+GIAVSYGLFSRR++QVSVD  E R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVD--ETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV
        FSNFENP SYLSK+FH  SIFEDV   + SDE K+ +V Y+QP+  S +    LNA SRQ          +RYENS EF +TDNVGHACKSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSL+S+LTE    DDVEF  G+ES LSSKSSSK+SE NCER SEFG+NCC NLEEKFDE VIAS+SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE
        EK EKNMMRER V N  +RPSHFRP S DETQFESLKKS SLHS LSQSSQTSS SS  S  T RKHRKMSSL NIS KS HSRQYS+ SLSENSRGSSE
Subjt:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE

Query:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI
        D LI+ +NSSECNES++SS  LDRNFA+ PKA+SRGKSVRT+RA+  A EEMK+QEM RNQVEHD N+  KFE  GG   YMREDE G+GWP + N N  
Subjt:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI

Query:  NSNRLPKTT----LSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG
         SNR  KTT     S IE Q   ED ES + DD KDNSE ED+S F SSDEEA  A S+ G SESGAHEVDKKAGEFIAKFREQIQLQRMASV+ RLRGG
Subjt:  NSNRLPKTT----LSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG

Query:  WGSFSSTSSSHFS
        WGSFSST+SS+FS
Subjt:  WGSFSSTSSSHFS

A0A5D3DMA5 DUF761 domain-containing protein3.92e-24870.59Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVD--ETR
        MASS S P+T+P           TT   +SC HFLCK LFF I LLLLPLFPSEAP+FVN TLLT FWEL HL+FVGIAVSYGLFSRR++QVSVD  E R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVD--ETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV
        FSNFENP SYLSK+ H  SIFEDV   + SDE K+ +V Y+QP+  S   F   NA SRQ          +RYENS EF DT++VGHACKSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSLRSNLTE    DDVEF  G+ES LSSKSSSK+SE NCER SEFG+NCC NLEEKFDE VIA +SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE
        E   KNMMRERGV N  +RPSHFRP S DETQFESLKKSRSLHS LSQSSQTSS S S S  TTRKHRKMSSL NIS KS HSRQYS+ SLSENSRGSSE
Subjt:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE

Query:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI
        D LIE +NSSECNES+ISS  LDRNFA IPKA+SRGKSVRTIRAN  A EEMK+QEM RNQVEHD N+G KFE  GG   YMRED  G+GWP + +PN  
Subjt:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI

Query:  NSNRLPKTT-LSKIERQIKMEDIESLLADDS--KDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW
         SNR PKTT  S IE Q   EDIES L DD   +DNSE ED S F SSDEEA  ASS+AG SESGA+EVDKKAGEFIAKFREQIQLQRMASV+ RLRGGW
Subjt:  NSNRLPKTT-LSKIERQIKMEDIESLLADDS--KDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW

Query:  GSFSSTSSSHFS
        GSFSSTSSS+FS
Subjt:  GSFSSTSSSHFS

A0A6J1DSC0 uncharacterized protein LOC1110238600.0100Show/hide
Query:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
        MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS
Subjt:  MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLS

Query:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
        KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK
Subjt:  KIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYK

Query:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP
        PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP
Subjt:  PLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPP

Query:  SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA
        SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA
Subjt:  SDETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFA

Query:  SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA
        SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA
Subjt:  SIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLA

Query:  DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS
        DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS
Subjt:  DDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGGWGSFSSTSSSHFS

A0A6J1H4M0 uncharacterized protein LOC1114599982.07e-24168.59Show/hide
Query:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN
        MASS S+P+T+      P      +SCA FLCK +FF   LLLLPLFPSEAPDFV+ TL T FWEL HL+FVGIAVSYGLFS R+ Q++VDE R+S+FEN
Subjt:  MASSTSNPYTR------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFEN

Query:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE
        P SYLSK+ +  SIF+DV     SDE KV +V Y+QP   SA+D   LNA+SR          ++RYENSYEF DTDNV HACKSRY+RGGSVVVV ++ 
Subjt:  PPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESR----------QRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSE

Query:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSEN-CERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM
        R     SSSG IV+YKPLGLPVRSLRS+LTE   +DDVEF  G+ES LSSKSS KSSEN CE  SEFG+NCC NLEEKFDE  IAS+S FQ REK  K +
Subjt:  RNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSEN-CERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNM

Query:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK
        +RERG GN  +RPSHFRPPS DETQFESLKKS SLHS LSQSSQTSS SS  S  TTRK RKMSSLSNIS KSLHSRQYS  SLSENSRGSSED LIE +
Subjt:  MRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETK

Query:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK
        NSSECNES++SS   DRNFASIPKA+S+GKSVR IRANA A E+MK+QEM R QV+HD  IG KFEEGG +  YMRED  G GWP V NPN  N NR PK
Subjt:  NSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPK

Query:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG------WGSFS
        TT   I+ Q   E+ ESL+ADDSKD+SE EDES+FASSDEEA   SS+AG SESGA EVDKKAGEFIAKFREQIQLQRMASVE RLRGG      WGSFS
Subjt:  TTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGG------WGSFS

Query:  STSSSHFS
        STSSS+FS
Subjt:  STSSSHFS

E5GCN2 Uncharacterized protein3.92e-24870.59Show/hide
Query:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVD--ETR
        MASS S P+T+P           TT   +SC HFLCK LFF I LLLLPLFPSEAP+FVN TLLT FWEL HL+FVGIAVSYGLFSRR++QVSVD  E R
Subjt:  MASSTSNPYTRPR----------TTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVD--ETR

Query:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV
        FSNFENP SYLSK+ H  SIFEDV   + SDE K+ +V Y+QP+  S   F   NA SRQ          +RYENS EF DT++VGHACKSRY+RGGSVV
Subjt:  FSNFENPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQ----------RRYENSYEFVDTDNVGHACKSRYSRGGSVV

Query:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR
        VVA++ R+ S     SGAIV+YKPLGLPVRSLRSNLTE    DDVEF  G+ES LSSKSSSK+SE NCER SEFG+NCC NLEEKFDE VIA +SPFQ R
Subjt:  VVADSERNRSSS-SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEF--GEESGLSSKSSSKSSE-NCERRSEFGENCCTNLEEKFDEAVIASLSPFQWR

Query:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE
        E   KNMMRERGV N  +RPSHFRP S DETQFESLKKSRSLHS LSQSSQTSS S S S  TTRKHRKMSSL NIS KS HSRQYS+ SLSENSRGSSE
Subjt:  EKSEKNMMRERGVGNYFVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSE

Query:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI
        D LIE +NSSECNES+ISS  LDRNFA IPKA+SRGKSVRTIRAN  A EEMK+QEM RNQVEHD N+G KFE  GG   YMRED  G+GWP + +PN  
Subjt:  DHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDINIGKKFEEGGGALSYMREDEIGYGWPSVANPNTI

Query:  NSNRLPKTT-LSKIERQIKMEDIESLLADDS--KDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW
         SNR PKTT  S IE Q   EDIES L DD   +DNSE ED S F SSDEEA  ASS+AG SESGA+EVDKKAGEFIAKFREQIQLQRMASV+ RLRGGW
Subjt:  NSNRLPKTT-LSKIERQIKMEDIESLLADDS--KDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVE-RLRGGW

Query:  GSFSSTSSSHFS
        GSFSSTSSS+FS
Subjt:  GSFSSTSSSHFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G60380.1 FUNCTIONS IN: molecular_function unknown2.8e-3436.55Show/hide
Query:  STSNPYTR----------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFE
        ++ NPYT+          P+   +      F CK + F++ LL LPLFPS+APDFV  T+LT FWEL+HLLFVGIAV+YGLFSRR+++ +VD       E
Subjt:  STSNPYTR----------PRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFE

Query:  NPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAES---RQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSS
        +  SY+S+IF  +S+F++       D+N  + V        SA       +ES        E S EF +T+ V  A  S+Y +G S VVVA     R + 
Subjt:  NPPSYLSKIFHATSIFEDVQLLTASDENKVDDVWYVQPHRESATDFGDLNAES---RQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSS

Query:  SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNY
           G +VH +PLGLP+R LRS+L + +   D  F +        S   + N E  S   +N        FDE + A  SP  W+ + E  MM   G+G+ 
Subjt:  SSSGAIVHYKPLGLPVRSLRSNLTEESETDDVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNY

Query:  FVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSS
        +  PS+F+P S DET      KS S  ST S SSQTS  S + +        + S   ++S++SL+S    +  + E SR SS
Subjt:  FVRPSHFRPPS-DETQFESLKKSRSLHSTLSQSSQTSSFSSSSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSS

AT3G60380.1 FUNCTIONS IN: molecular_function unknown6.1e-0548.39Show/hide
Query:  SEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGG
        +E + E  F   +EE E A      +    +EVD+KAGEFIAKFREQI+LQ++ S E+ RGG
Subjt:  SEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKFREQIQLQRMASVERLRGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTTCCACTTCCAACCCTTACACCAGGCCCCGTACTACCGACCGCCGTCACTCCTGCGCCCATTTTCTGTGTAAATTCCTCTTCTTCTCCATTCTCCTCCTCCT
CCTCCCTCTCTTCCCTTCCGAGGCTCCAGATTTCGTCAATCACACCTTGCTCACCAATTTCTGGGAGCTCCTTCACCTCCTCTTCGTCGGCATTGCTGTTTCATACGGCC
TGTTTAGCAGAAGGAGTATCCAGGTGAGTGTAGACGAAACTCGCTTCTCCAATTTCGAAAATCCGCCCTCGTATTTGTCTAAGATCTTTCACGCCACTTCGATTTTTGAA
GATGTCCAACTTTTGACTGCTTCCGATGAGAATAAGGTGGACGATGTTTGGTACGTTCAGCCGCATCGCGAATCGGCGACCGATTTTGGGGATTTGAATGCCGAATCTCG
CCAACGAAGGTATGAAAATTCGTATGAATTTGTAGATACTGATAATGTTGGGCATGCTTGTAAATCGAGATATTCTCGGGGTGGATCCGTGGTGGTAGTTGCTGATAGTG
AAAGAAATCGTAGTTCTAGTTCTAGTTCTGGTGCCATTGTACATTATAAACCTCTAGGTTTGCCTGTAAGGAGTTTGAGGTCAAATCTTACGGAAGAATCGGAAACCGAT
GATGTTGAATTTGGTGAGGAATCTGGTTTGAGTTCTAAAAGTTCATCCAAGAGCTCTGAGAATTGTGAAAGAAGAAGTGAATTTGGTGAGAATTGTTGTACGAATTTGGA
GGAGAAGTTTGATGAAGCTGTTATTGCATCATTGTCCCCATTTCAATGGCGTGAGAAATCTGAAAAGAATATGATGAGAGAGAGAGGAGTGGGGAATTATTTTGTTCGCC
CTTCCCATTTTAGGCCTCCCTCTGATGAAACTCAATTTGAATCCCTGAAAAAATCAAGGTCTCTTCATTCTACCCTATCTCAGTCGTCACAAACTAGTTCCTTCTCGTCT
TCATCGTCGATGATGACGACAAGAAAGCACCGGAAAATGTCGTCGCTCAGCAACATTTCCTCAAAGTCATTGCATTCTCGGCAATACAGTATGGGCTCTCTGTCTGAAAA
CAGTAGAGGGAGCTCCGAAGACCATCTGATAGAAACCAAAAATTCATCCGAGTGCAACGAATCCATGATAAGTTCCCTGCACTTGGACAGGAACTTCGCAAGTATTCCGA
AAGCTGTATCGCGGGGAAAATCCGTTAGAACGATTAGAGCAAATGCAGTTGCTGCAGAGGAAATGAAATCCCAAGAGATGGACAGAAACCAAGTTGAACATGATATCAAT
ATAGGGAAGAAGTTTGAAGAGGGTGGAGGAGCATTATCATATATGAGAGAAGATGAAATAGGATATGGATGGCCTAGTGTTGCTAACCCGAACACTATTAATTCGAATCG
TTTGCCAAAGACGACGTTGTCGAAGATTGAGAGGCAGATCAAGATGGAAGACATCGAGAGTCTGCTGGCAGATGATTCCAAAGATAACTCCGAGATGGAGGATGAGAGTA
TTTTTGCAAGTTCAGATGAAGAAGCTGAAGTTGCTTCGAGTGTGGCTGGCGGGTCGGAATCGGGGGCTCACGAGGTCGACAAGAAGGCCGGTGAGTTCATAGCAAAGTTC
AGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGAAAGATTGAGAGGAGGATGGGGATCATTCAGCAGCACAAGCAGCAGCCATTTCAGT
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTTCCACTTCCAACCCTTACACCAGGCCCCGTACTACCGACCGCCGTCACTCCTGCGCCCATTTTCTGTGTAAATTCCTCTTCTTCTCCATTCTCCTCCTCCT
CCTCCCTCTCTTCCCTTCCGAGGCTCCAGATTTCGTCAATCACACCTTGCTCACCAATTTCTGGGAGCTCCTTCACCTCCTCTTCGTCGGCATTGCTGTTTCATACGGCC
TGTTTAGCAGAAGGAGTATCCAGGTGAGTGTAGACGAAACTCGCTTCTCCAATTTCGAAAATCCGCCCTCGTATTTGTCTAAGATCTTTCACGCCACTTCGATTTTTGAA
GATGTCCAACTTTTGACTGCTTCCGATGAGAATAAGGTGGACGATGTTTGGTACGTTCAGCCGCATCGCGAATCGGCGACCGATTTTGGGGATTTGAATGCCGAATCTCG
CCAACGAAGGTATGAAAATTCGTATGAATTTGTAGATACTGATAATGTTGGGCATGCTTGTAAATCGAGATATTCTCGGGGTGGATCCGTGGTGGTAGTTGCTGATAGTG
AAAGAAATCGTAGTTCTAGTTCTAGTTCTGGTGCCATTGTACATTATAAACCTCTAGGTTTGCCTGTAAGGAGTTTGAGGTCAAATCTTACGGAAGAATCGGAAACCGAT
GATGTTGAATTTGGTGAGGAATCTGGTTTGAGTTCTAAAAGTTCATCCAAGAGCTCTGAGAATTGTGAAAGAAGAAGTGAATTTGGTGAGAATTGTTGTACGAATTTGGA
GGAGAAGTTTGATGAAGCTGTTATTGCATCATTGTCCCCATTTCAATGGCGTGAGAAATCTGAAAAGAATATGATGAGAGAGAGAGGAGTGGGGAATTATTTTGTTCGCC
CTTCCCATTTTAGGCCTCCCTCTGATGAAACTCAATTTGAATCCCTGAAAAAATCAAGGTCTCTTCATTCTACCCTATCTCAGTCGTCACAAACTAGTTCCTTCTCGTCT
TCATCGTCGATGATGACGACAAGAAAGCACCGGAAAATGTCGTCGCTCAGCAACATTTCCTCAAAGTCATTGCATTCTCGGCAATACAGTATGGGCTCTCTGTCTGAAAA
CAGTAGAGGGAGCTCCGAAGACCATCTGATAGAAACCAAAAATTCATCCGAGTGCAACGAATCCATGATAAGTTCCCTGCACTTGGACAGGAACTTCGCAAGTATTCCGA
AAGCTGTATCGCGGGGAAAATCCGTTAGAACGATTAGAGCAAATGCAGTTGCTGCAGAGGAAATGAAATCCCAAGAGATGGACAGAAACCAAGTTGAACATGATATCAAT
ATAGGGAAGAAGTTTGAAGAGGGTGGAGGAGCATTATCATATATGAGAGAAGATGAAATAGGATATGGATGGCCTAGTGTTGCTAACCCGAACACTATTAATTCGAATCG
TTTGCCAAAGACGACGTTGTCGAAGATTGAGAGGCAGATCAAGATGGAAGACATCGAGAGTCTGCTGGCAGATGATTCCAAAGATAACTCCGAGATGGAGGATGAGAGTA
TTTTTGCAAGTTCAGATGAAGAAGCTGAAGTTGCTTCGAGTGTGGCTGGCGGGTCGGAATCGGGGGCTCACGAGGTCGACAAGAAGGCCGGTGAGTTCATAGCAAAGTTC
AGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTAGAAAGATTGAGAGGAGGATGGGGATCATTCAGCAGCACAAGCAGCAGCCATTTCAGT
Protein sequenceShow/hide protein sequence
MASSTSNPYTRPRTTDRRHSCAHFLCKFLFFSILLLLLPLFPSEAPDFVNHTLLTNFWELLHLLFVGIAVSYGLFSRRSIQVSVDETRFSNFENPPSYLSKIFHATSIFE
DVQLLTASDENKVDDVWYVQPHRESATDFGDLNAESRQRRYENSYEFVDTDNVGHACKSRYSRGGSVVVVADSERNRSSSSSSGAIVHYKPLGLPVRSLRSNLTEESETD
DVEFGEESGLSSKSSSKSSENCERRSEFGENCCTNLEEKFDEAVIASLSPFQWREKSEKNMMRERGVGNYFVRPSHFRPPSDETQFESLKKSRSLHSTLSQSSQTSSFSS
SSSMMTTRKHRKMSSLSNISSKSLHSRQYSMGSLSENSRGSSEDHLIETKNSSECNESMISSLHLDRNFASIPKAVSRGKSVRTIRANAVAAEEMKSQEMDRNQVEHDIN
IGKKFEEGGGALSYMREDEIGYGWPSVANPNTINSNRLPKTTLSKIERQIKMEDIESLLADDSKDNSEMEDESIFASSDEEAEVASSVAGGSESGAHEVDKKAGEFIAKF
REQIQLQRMASVERLRGGWGSFSSTSSSHFS