; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010057 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010057
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDuplicated homeodomain-like superfamily protein isoform 1
Genome locationscaffold779:270386..276164
RNA-Seq ExpressionMS010057
SyntenyMS010057
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022133837.1 uncharacterized protein LOC111006296 isoform X1 [Momordica charantia]0.0e+0099.42Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
        MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
Subjt:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY

Query:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
        ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
Subjt:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW

Query:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
        TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
Subjt:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST

Query:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV
        HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQ EEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV
Subjt:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV

Query:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD
        DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSD SNMIPPAATLPVVTSENNISKK  LSTSD
Subjt:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD

Query:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP
        SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSV DNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP
Subjt:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP

Query:  ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNP
        ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNP
Subjt:  ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNP

Query:  NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA
        NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQK KMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA
Subjt:  NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA

Query:  DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE
        DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE
Subjt:  DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE

Query:  REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI
        REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI
Subjt:  REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI

Query:  STHVGTKSTDQCKIFFSKARKCLGLDLICTA
        STHVGTKSTDQCKIFFSKARKCLGLDLICTA
Subjt:  STHVGTKSTDQCKIFFSKARKCLGLDLICTA

XP_022133838.1 uncharacterized protein LOC111006296 isoform X2 [Momordica charantia]0.0e+0099.32Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
        MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
Subjt:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY

Query:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
        ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
Subjt:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW

Query:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
        TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
Subjt:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST

Query:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV
        HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQ EEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV
Subjt:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV

Query:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD
        DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSD SNMIPPAATLPVVTSENNISKK  LSTSD
Subjt:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD

Query:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP
        SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSV DNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP
Subjt:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP

Query:  ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNP
        ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSS GNP
Subjt:  ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNP

Query:  NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA
        NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQK KMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA
Subjt:  NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA

Query:  DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE
        DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE
Subjt:  DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE

Query:  REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI
        REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI
Subjt:  REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI

Query:  STHVGTKSTDQCKIFFSKARKCLGLDLICTA
        STHVGTKSTDQCKIFFSKARKCLGLDLICTA
Subjt:  STHVGTKSTDQCKIFFSKARKCLGLDLICTA

XP_023516976.1 uncharacterized protein LOC111780724 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0083.38Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
        MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSD+ GHGYGPSMSF+NK+LENVSSRPS+SHG GKY
Subjt:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY

Query:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
        ARN RE+RSFSQR+WK HSWATSNGSTNN GRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDD NGLGTNQRRDR+YSVSSSGWKPLKW
Subjt:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW

Query:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
        TR GGLSSRTSTSGHSSSTK IDALDSNETKSE VL+N SQNLSPSAD AECA SS+PCDE   +KKPRLGWGEGLAKYEKKKVE+PDGT FT VNAE+T
Subjt:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST

Query:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYE-EPSFNLEKLDNFSIDDLGSPLIQLLQSNDSIS
         SLNSSLI+KGPRGSGF DCTSPATPSSV+CGSSPGGDEKSSGKA+SDNDV+N  GSPGSGFQNQYE  P+ NLEKLD FSI +LGSPL QLLQS DS S
Subjt:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYE-EPSFNLEKLDNFSIDDLGSPLIQLLQSNDSIS

Query:  VDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSF-SFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLST
        VD  ++S TA+SKLL YK EISKVLE TE+EIDLLENEL  LRS SKG F SFPLASSSL  GDK FEEHS+V++MIPP ATLPVVTS NN S  M  ST
Subjt:  VDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSF-SFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLST

Query:  SDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINE-SDDGGDSLFNLILASNKESACRASEALIS
        SDSEEVH DV E       +SDA ESVI  EKLT++GCSV DNVV SVDN++LIK E + LEPISSD+ E +D+GGD++F+LILASNK+SA  ASEALI 
Subjt:  SDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINE-SDDGGDSLFNLILASNKESACRASEALIS

Query:  MLPANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSA
        +LPANE KIDIW TN +CS+NQC M+ER+AK+K+LLRFKERVI LK+RAYQSLWKESLHVPPVR LR KSQKKYQLSLWTNYS YQKNRSSIR R+PS A
Subjt:  MLPANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSA

Query:  GNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHK
        GN NPVSNT IL+HVSKQLSN   KQYR+ LKMP L+LD+KDKMASRF+SNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKL  FGKDFGKIA+FLDHK
Subjt:  GNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHK

Query:  TTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMF
        TTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSST  YL+TTGKKWNPETNAASL++LGAASAMTACAHKYSSSRS GRTA +TTQ DD LSERAKSFH F
Subjt:  TTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMF

Query:  GNEREKVAADVLAGICGSLSSEAMG-SCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKN
        GNEREKVAADVLAGICGSLSSEAMG SCIT NF RRDGSQDLKCKKGATT+LRRRMTAN+ + +DDE+ SDESCGEMDPSYWTDGEKSLFIEAV  YGKN
Subjt:  GNEREKVAADVLAGICGSLSSEAMG-SCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKN

Query:  FSMISTHVGTKSTDQCKIFFSKARKCLGLDLICTA
        FSMISTHVG+KS DQCK+FFSKARKCLGLDLIC+A
Subjt:  FSMISTHVGTKSTDQCKIFFSKARKCLGLDLICTA

XP_038889119.1 uncharacterized protein LOC120079017 isoform X1 [Benincasa hispida]0.0e+0084.4Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
        MPPEPLPWDRKDFFKERKHERSEFLGPV RWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSF+NKMLENVS RPSVSHGDGKY
Subjt:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY

Query:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
        ARNGRE+RSFSQR+WKGHSWA SNGSTNN GRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDK+DDVNGLGTNQRRDREYSVSSSGWKPLKW
Subjt:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW

Query:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
        TR GGLSSRTSTSG SSSTKSIDALDS++ KSETVLKN SQNLSPSADPAECAMS +P DE + RKKPRLGWGEGLAKYEKKKVEVPDGT FTNVNAEST
Subjt:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST

Query:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV
        HSLNSSLIEKGPRGSGFSDCTSPATPSSV+ GSSPGGDEK  GKA+SDNDVSN HGSPGSGFQ QYE  S ++EKLDNFSI +L SPLIQLLQSNDSISV
Subjt:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV

Query:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD
        D      TA+SKLLIYKN+ISKVLETTESEIDLLENELK L+SESKG FSFPLASSSLM GDK FEE +DV+N +   A LP  TS N ISK+M+ STS 
Subjt:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD

Query:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESD-DGGDSLFNLILASNKESACRASEALISML
         EE + DV EK   R GR D  ESVI  EKL IS C+V DN+V SVDNS+ +K E V LEPI +D+ E D +G DS+ +LILASNKESAC+ASEALI ML
Subjt:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESD-DGGDSLFNLILASNKESACRASEALISML

Query:  PANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGN
        PANEHKIDIWSTN +CSQNQCL++ER+AK+KRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIR R+PS AGN
Subjt:  PANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGN

Query:  PNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTT
         NPVS+TAIL+H+S QLS   +KQYRRTLKMPALVLDQKDKM SRF+SNNGLVE+PCAVEKER M+NPWTSEEKDVF+EKL CFGKDFG+IA+FLDHKTT
Subjt:  PNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTT

Query:  ADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGN
        ADC+EFYYKNHKSDCFEKTKKLEFGKKAKSSTS YL+TTGKKWNPETNAASL++LGAASAMTA AHKYSSSRS GRTAYHTTQ DD LSERAK+FH FGN
Subjt:  ADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGN

Query:  EREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSM
        EREKVAADVLAGICGSLSSEAMGSC+TSNF+R D SQDL+CK+GATTVLRR MT N+P+++DDEIFSDESCGEMDPSYWTDGEKSLFI+AV  YGKNFSM
Subjt:  EREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSM

Query:  ISTHVGTKSTDQCKIFFSKARKCLGLDLICTA
        ISTHVG+KSTDQCK+FFSKARKCLGLDLIC+A
Subjt:  ISTHVGTKSTDQCKIFFSKARKCLGLDLICTA

XP_038889121.1 uncharacterized protein LOC120079017 isoform X2 [Benincasa hispida]0.0e+0084.3Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
        MPPEPLPWDRKDFFKERKHERSEFLGPV RWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSF+NKMLENVS RPSVSHGDGKY
Subjt:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY

Query:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
        ARNGRE+RSFSQR+WKGHSWA SNGSTNN GRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDK+DDVNGLGTNQRRDREYSVSSSGWKPLKW
Subjt:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW

Query:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
        TR GGLSSRTSTSG SSSTKSIDALDS++ KSETVLKN SQNLSPSADPAECAMS +P DE + RKKPRLGWGEGLAKYEKKKVEVPDGT FTNVNAEST
Subjt:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST

Query:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV
        HSLNSSLIEKGPRGSGFSDCTSPATPSSV+ GSSPGGDEK  GKA+SDNDVSN HGSPGSGFQ QYE  S ++EKLDNFSI +L SPLIQLLQSNDSISV
Subjt:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV

Query:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD
        D      TA+SKLLIYKN+ISKVLETTESEIDLLENELK L+SESKG FSFPLASSSLM GDK FEE +DV+N +   A LP  TS N ISK+M+ STS 
Subjt:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD

Query:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESD-DGGDSLFNLILASNKESACRASEALISML
         EE + DV EK   R GR D  ESVI  EKL IS C+V DN+V SVDNS+ +K E V LEPI +D+ E D +G DS+ +LILASNKESAC+ASEALI ML
Subjt:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESD-DGGDSLFNLILASNKESACRASEALISML

Query:  PANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGN
        PANEHKIDIWSTN +CSQNQCL++ER+AK+KRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIR R+P S GN
Subjt:  PANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGN

Query:  PNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTT
         NPVS+TAIL+H+S QLS   +KQYRRTLKMPALVLDQKDKM SRF+SNNGLVE+PCAVEKER M+NPWTSEEKDVF+EKL CFGKDFG+IA+FLDHKTT
Subjt:  PNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTT

Query:  ADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGN
        ADC+EFYYKNHKSDCFEKTKKLEFGKKAKSSTS YL+TTGKKWNPETNAASL++LGAASAMTA AHKYSSSRS GRTAYHTTQ DD LSERAK+FH FGN
Subjt:  ADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGN

Query:  EREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSM
        EREKVAADVLAGICGSLSSEAMGSC+TSNF+R D SQDL+CK+GATTVLRR MT N+P+++DDEIFSDESCGEMDPSYWTDGEKSLFI+AV  YGKNFSM
Subjt:  EREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSM

Query:  ISTHVGTKSTDQCKIFFSKARKCLGLDLICTA
        ISTHVG+KSTDQCK+FFSKARKCLGLDLIC+A
Subjt:  ISTHVGTKSTDQCKIFFSKARKCLGLDLICTA

TrEMBL top hitse value%identityAlignment
A0A6J1BXW8 uncharacterized protein LOC111006296 isoform X10.0e+0099.42Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
        MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
Subjt:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY

Query:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
        ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
Subjt:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW

Query:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
        TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
Subjt:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST

Query:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV
        HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQ EEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV
Subjt:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV

Query:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD
        DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSD SNMIPPAATLPVVTSENNISKK  LSTSD
Subjt:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD

Query:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP
        SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSV DNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP
Subjt:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP

Query:  ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNP
        ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNP
Subjt:  ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNP

Query:  NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA
        NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQK KMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA
Subjt:  NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA

Query:  DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE
        DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE
Subjt:  DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE

Query:  REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI
        REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI
Subjt:  REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI

Query:  STHVGTKSTDQCKIFFSKARKCLGLDLICTA
        STHVGTKSTDQCKIFFSKARKCLGLDLICTA
Subjt:  STHVGTKSTDQCKIFFSKARKCLGLDLICTA

A0A6J1C0C8 uncharacterized protein LOC111006296 isoform X20.0e+0099.32Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
        MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
Subjt:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY

Query:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
        ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
Subjt:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW

Query:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
        TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
Subjt:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST

Query:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV
        HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQ EEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV
Subjt:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISV

Query:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD
        DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSD SNMIPPAATLPVVTSENNISKK  LSTSD
Subjt:  DPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSD

Query:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP
        SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSV DNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP
Subjt:  SEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLP

Query:  ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNP
        ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSS GNP
Subjt:  ANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNP

Query:  NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA
        NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQK KMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA
Subjt:  NPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTA

Query:  DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE
        DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE
Subjt:  DCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNE

Query:  REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI
        REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI
Subjt:  REKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMI

Query:  STHVGTKSTDQCKIFFSKARKCLGLDLICTA
        STHVGTKSTDQCKIFFSKARKCLGLDLICTA
Subjt:  STHVGTKSTDQCKIFFSKARKCLGLDLICTA

A0A6J1GQT1 uncharacterized protein LOC111456626 isoform X10.0e+0083.01Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
        MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSD+YGHGYGPSMSF+NK+LENV+SRPS+SHG GKY
Subjt:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY

Query:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
        ARN RE+RSFSQR+WK HSWATSNGSTNN GRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDD NGLGTNQRRDR+YS+SSSGWKPLKW
Subjt:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW

Query:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
        TR GGLSSRTSTSGHSSSTK IDALDSNETKSETVL++ SQNLSPSAD AECA SS+PCDE   +KKPRLGWGEGLAKYEKKKVE+PDGT FT VNAE+T
Subjt:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST

Query:  HSLNSSLIEKGPR-GSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYE-EPSFNLEKLDNFSIDDLGSPLIQLLQSNDSI
         SLNSSLI+KGPR GSGF DCTSPATPSSV+CGSSPGGDEKSSGKA+SDNDV+N  GSPGSGFQNQYE  P+ NLEKLD FSI +LGSPL QLLQS DS 
Subjt:  HSLNSSLIEKGPR-GSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYE-EPSFNLEKLDNFSIDDLGSPLIQLLQSNDSI

Query:  SVDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSF-SFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLS
        SVD  ++S TA+SKLL YK EISKVLE TE+EIDLLENELK LR  SKG F SFPLASSSL  G K FEEH++V++ IPP ATLPVVTS NN S  M  S
Subjt:  SVDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSF-SFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLS

Query:  TSDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINE-SDDGGDSLFNLILASNKESACRASEALI
        TSDSEEVH DV E       +SDA ESVI  EKLT++GCSV DNVV SVDN++L+K E + LEPISSDI E +D+GGD++F+LILASNK+SA  ASEALI
Subjt:  TSDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINE-SDDGGDSLFNLILASNKESACRASEALI

Query:  SMLPANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSS
         +LPANE KIDIWSTN +CSQNQC ++ER+AK+K+L+RFKERVI LK+RAYQSLWKESLHVPPVR LR KSQKKYQLSLWTNYS YQKNRSSIR R+PS 
Subjt:  SMLPANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSS

Query:  AGNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDH
        AGN NPVSNT IL+HVSKQLSN   KQYR+TLKMP L+LD+KDKMASRF+SNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKL  FGKDFGKIA+FLDH
Subjt:  AGNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDH

Query:  KTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHM
        KTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTS YL+TTGKKWNPETNAASL++LG ASAMTACAHKYSSSRS GRTA +TTQ DD LSERAKSFH 
Subjt:  KTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHM

Query:  FGNEREKVAADVLAGICGSLSSEAMG-SCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGK
        FGNEREKVAADVLAGICGSLSSEAMG SCIT NF RRDGSQDLKCKKGATT+LRRRMTAN+ + +DDE+ SDESCGEMDPSYWTDGEKSLFIEAV  YGK
Subjt:  FGNEREKVAADVLAGICGSLSSEAMG-SCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGK

Query:  NFSMISTHVGTKSTDQCKIFFSKARKCLGLDLICTA
        NFSMISTHVG+KS DQCK+FFSKARKCLGLDLIC+A
Subjt:  NFSMISTHVGTKSTDQCKIFFSKARKCLGLDLICTA

A0A6J1JI09 uncharacterized protein LOC111487165 isoform X20.0e+0083Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
        MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSD+YGHGYGPSMSF+NK+LENVSSRPS+SHG GKY
Subjt:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY

Query:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
        ARN RE+RSFSQR+WK H+WATSNGSTN+ GRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDD NGLGTNQRRDR+YSVSSSGWKPLKW
Subjt:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW

Query:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
        TR GGLSSRTSTSGHSSSTK IDALDSNETKSETVL+N SQNLSPSAD AECA SS+PCDE   +KKPRLGWGEGLAKYEKKKVE+PDG+  T VNAE+T
Subjt:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST

Query:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYE-EPSFNLEKLDNFSIDDLGSPLIQLLQSNDSIS
         SLNSSLI+KGPRGSGF DCTSPATPSSV+CGSSPGGDEKSSGKA+SDNDV+N  GSPGSGFQNQYE  P+ NLEKLD FSI +LGSPL QLLQS DS S
Subjt:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYE-EPSFNLEKLDNFSIDDLGSPLIQLLQSNDSIS

Query:  VDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSF-SFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLST
        VD  ++S TA+SKLL YK EISKVLE TE+EIDLLENELK LRS SKG F S PLAS SL  GDK FEEH++V++MIPP ATLPVVTS NN S  M  ST
Subjt:  VDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSF-SFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLST

Query:  SDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINE-SDDGGDSLFNLILASNKESACRASEALIS
        SDSEEVH DV E       +SDA ESVI  EKLT++GCSV DNVV SVDN++LIK E + LEPISSDI E +D+GGD++F+LILASNK+SA  ASEALI 
Subjt:  SDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINE-SDDGGDSLFNLILASNKESACRASEALIS

Query:  MLPANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSA
        +LPANE KIDIWSTN +CSQNQC M+ER+AK+K+LLRFKERVI LK+RAYQSLWKESLHVPPVR LR KSQKKYQLSLWTNYS YQKNRSSIR R+P S 
Subjt:  MLPANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSA

Query:  GNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHK
        GN NPVSNT I +HVSKQLSN   KQYR+TLKMP L+LD+KDKMASRF+SNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKL  FGKDFGKIA+FLDHK
Subjt:  GNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHK

Query:  TTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMF
        TTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSST  YL+TTGKKWNPETNAASL++LGAASAMTACAHKYSSSR  GRTA +TTQ DD LSERAKSFH F
Subjt:  TTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMF

Query:  GNEREKVAADVLAGICGSLSSEAMG-SCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKN
        GNEREKVAADVLAGICGSLSSEAMG SCIT NF RRDGSQDLKCKKGATT+LRRRMTAN+ + +DDE+ SDESCGEMDPSYWTDGEKSLFIEAV  YGKN
Subjt:  GNEREKVAADVLAGICGSLSSEAMG-SCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKN

Query:  FSMISTHVGTKSTDQCKIFFSKARKCLGLDLICTA
        FSMISTHVG+KS DQCK+FF KARKCLGLDLIC+A
Subjt:  FSMISTHVGTKSTDQCKIFFSKARKCLGLDLICTA

A0A6J1JMA4 uncharacterized protein LOC111487165 isoform X10.0e+0083.09Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY
        MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSD+YGHGYGPSMSF+NK+LENVSSRPS+SHG GKY
Subjt:  MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKY

Query:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW
        ARN RE+RSFSQR+WK H+WATSNGSTN+ GRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDD NGLGTNQRRDR+YSVSSSGWKPLKW
Subjt:  ARNGRENRSFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKW

Query:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST
        TR GGLSSRTSTSGHSSSTK IDALDSNETKSETVL+N SQNLSPSAD AECA SS+PCDE   +KKPRLGWGEGLAKYEKKKVE+PDG+  T VNAE+T
Subjt:  TRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAEST

Query:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYE-EPSFNLEKLDNFSIDDLGSPLIQLLQSNDSIS
         SLNSSLI+KGPRGSGF DCTSPATPSSV+CGSSPGGDEKSSGKA+SDNDV+N  GSPGSGFQNQYE  P+ NLEKLD FSI +LGSPL QLLQS DS S
Subjt:  HSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYE-EPSFNLEKLDNFSIDDLGSPLIQLLQSNDSIS

Query:  VDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSF-SFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLST
        VD  ++S TA+SKLL YK EISKVLE TE+EIDLLENELK LRS SKG F S PLAS SL  GDK FEEH++V++MIPP ATLPVVTS NN S  M  ST
Subjt:  VDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSF-SFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLST

Query:  SDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINE-SDDGGDSLFNLILASNKESACRASEALIS
        SDSEEVH DV E       +SDA ESVI  EKLT++GCSV DNVV SVDN++LIK E + LEPISSDI E +D+GGD++F+LILASNK+SA  ASEALI 
Subjt:  SDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISSDINE-SDDGGDSLFNLILASNKESACRASEALIS

Query:  MLPANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSA
        +LPANE KIDIWSTN +CSQNQC M+ER+AK+K+LLRFKERVI LK+RAYQSLWKESLHVPPVR LR KSQKKYQLSLWTNYS YQKNRSSIR R+PS A
Subjt:  MLPANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSA

Query:  GNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHK
        GN NPVSNT I +HVSKQLSN   KQYR+TLKMP L+LD+KDKMASRF+SNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKL  FGKDFGKIA+FLDHK
Subjt:  GNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHK

Query:  TTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMF
        TTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSST  YL+TTGKKWNPETNAASL++LGAASAMTACAHKYSSSR  GRTA +TTQ DD LSERAKSFH F
Subjt:  TTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMF

Query:  GNEREKVAADVLAGICGSLSSEAMG-SCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKN
        GNEREKVAADVLAGICGSLSSEAMG SCIT NF RRDGSQDLKCKKGATT+LRRRMTAN+ + +DDE+ SDESCGEMDPSYWTDGEKSLFIEAV  YGKN
Subjt:  GNEREKVAADVLAGICGSLSSEAMG-SCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKN

Query:  FSMISTHVGTKSTDQCKIFFSKARKCLGLDLICTA
        FSMISTHVG+KS DQCK+FF KARKCLGLDLIC+A
Subjt:  FSMISTHVGTKSTDQCKIFFSKARKCLGLDLICTA

SwissProt top hitse value%identityAlignment
O75376 Nuclear receptor corepressor 12.1e-1320.63Show/hide
Query:  SLFNLILASNKESACRASEALISMLPANEHKI-------DIWSTNVSCSQNQCLMRER----YAKKKRLLRFKERVITLKYRAYQSLWK---ESLHVPPV
        S+  +I   N++ A  A +    + P  E  +        ++  N+  +Q   +MR++    + ++    + +E+ I  +Y      W+   + +   P 
Subjt:  SLFNLILASNKESACRASEALISMLPANEHKI-------DIWSTNVSCSQNQCLMRER----YAKKKRLLRFKERVITLKYRAYQSLWK---ESLHVPPV

Query:  RKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNPNPVSN-----TAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDP
        RK +    ++Y    +      ++ +   +      AG    ++      + I+  +S+Q +N   KQ R+   +P ++ D + +   +F++ NGL+EDP
Subjt:  RKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNPNPVSN-----TAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDP

Query:  CAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLG
          V K+R  +N WT  EK++F +K     K+FG IA++L+ K+  DC+ +YY   K++ ++   +  +GK+   +      +  +K   +          
Subjt:  CAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLG

Query:  AASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDG--SQDLKCKKGATTVLRRRMT
                A K        +      + +D   E  K         E+      A   G  ++ + G        RR G  ++ +  +  A +      T
Subjt:  AASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDG--SQDLKCKKGATTVLRRRMT

Query:  ANIPRFI--DDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLD
           P  +    E  S E    ++ S WT+ E  +  + +  +G+N++ I+  VGTKS  QCK F+   ++   LD
Subjt:  ANIPRFI--DDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLD

Q4KKX4 Nuclear receptor corepressor 19.5e-1423.62Show/hide
Query:  SLFNLILASNKESACRASEALISMLPANEHKI-------DIWSTNVSCSQNQCLMRER----YAKKKRLLRFKERVITLKYRAYQSLWK---ESLHVPPV
        S+  +I   N++ A  A + L  + P  E  +        ++  N+  +Q   +MR++    + ++    + +E+ I  +Y      W+   + +   P 
Subjt:  SLFNLILASNKESACRASEALISMLPANEHKI-------DIWSTNVSCSQNQCLMRER----YAKKKRLLRFKERVITLKYRAYQSLWK---ESLHVPPV

Query:  RKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNPNPVSN-----TAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDP
        RK +    ++Y    +      ++ +   +      AG    ++      + I+  +S+Q +N   KQ R+   +P ++ D + +   +F++ NGL+EDP
Subjt:  RKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNPNPVSN-----TAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDP

Query:  CAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKK
          V K+R  +N WT  EK++F EK     K+FG IA++L+ KT +DC+ +YY   K++ F+   +  + K+
Subjt:  CAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKK

Q60974 Nuclear receptor corepressor 15.2e-1220.81Show/hide
Query:  SLFNLILASNKESACRASEALISMLPANEHKI-------DIWSTNVSCSQNQCLMRER----YAKKKRLLRFKERVITLKYRAYQSLWK---ESLHVPPV
        S+  +I   N++ A  A +    + P  E  +        ++  N+  +Q   +MR++    + ++    + +E+ I  +Y      W+   + +   P 
Subjt:  SLFNLILASNKESACRASEALISMLPANEHKI-------DIWSTNVSCSQNQCLMRER----YAKKKRLLRFKERVITLKYRAYQSLWK---ESLHVPPV

Query:  RKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNPNPVSN-----TAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDP
        RK +    ++Y    +      ++ +   +      AG    ++      + I+  +S+Q +N   KQ R+   +P ++ D + +   +F++ NGL+EDP
Subjt:  RKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNPNPVSN-----TAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDP

Query:  CAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLG
          V K+R  +N WT  EK++F +K     K+FG IA++L+ K+  DC+ +YY   K++ ++   +  +GK+   +      +  +K   +      E   
Subjt:  CAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLG

Query:  AASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTAN
          +       K    +     +  TT+  D     A+       ERE+V         G  ++ + G         R  + +      A           
Subjt:  AASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTAN

Query:  IPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLD
         P     E  S E    ++ S WT+ E  +  + +  +G+N++ I+  VGTKS  QCK F+   ++   LD
Subjt:  IPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLD

Q9WU42 Nuclear receptor corepressor 23.8e-1522.9Show/hide
Query:  SLFNLILASNKESACRASEALISM-----LPANEHKIDIWSTNVSCSQNQCLMRE--RYAKKKRLLRFK-ERVITLKYRAYQSLWK---ESLHVPPVRKL
        SL  +I   N++ A  A   L  +     LP      D    + +   NQ + ++   Y K++   R + E+    +Y      W+   E +   P R+ 
Subjt:  SLFNLILASNKESACRASEALISM-----LPANEHKIDIWSTNVSCSQNQCLMRE--RYAKKKRLLRFK-ERVITLKYRAYQSLWK---ESLHVPPVRKL

Query:  RAKSQKKYQLSLWTNYSGYQKNRSSIRVRI--------PSSAGNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDP
        +    ++Y    +      ++ +  ++ R+         S+A + + VS   I+  +S+Q   +L KQ R+   +P ++ D  D+   +F++ NGL++DP
Subjt:  RAKSQKKYQLSLWTNYSGYQKNRSSIRVRI--------PSSAGNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDP

Query:  CAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLG
          V K+R + N W+ +E+D F EK     K+FG IA+FL+ KT A+C+ +YY   K++ ++   +  + ++ KS          ++              
Subjt:  CAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLG

Query:  AASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMT--
            M   + +    +   + A    +  D             NE+E+++ +      G  + E     + S   +   SQ    +KG  T   R M   
Subjt:  AASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMT--

Query:  ANIPRFIDDEIFSDESCGEM-DPSYWTDGEKSLFIEAVRAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLDLI
        AN       +  S+ +  EM + S WT+ E     + +  +G+N+S I+  VG+K+  QCK F+   +K   LD I
Subjt:  ANIPRFIDDEIFSDESCGEM-DPSYWTDGEKSLFIEAVRAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLDLI

Q9Y618 Nuclear receptor corepressor 21.5e-1422.48Show/hide
Query:  SLFNLILASNKESACRASEALISM-----LPANEHKIDIWSTNVSCSQNQCLMRE--RYAKKKRLLRFK-ERVITLKYRAYQSLWK---ESLHVPPVRKL
        SL  +I   N++ A  A   L  +     LP      D    + +   NQ + ++   Y K++   R + E+    +Y      W+   E +   P R+ 
Subjt:  SLFNLILASNKESACRASEALISM-----LPANEHKIDIWSTNVSCSQNQCLMRE--RYAKKKRLLRFK-ERVITLKYRAYQSLWK---ESLHVPPVRKL

Query:  RAKSQKKYQLSLWTNYSGYQKNRSSIRVRI--------PSSAGNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDP
        +    ++Y    +      ++ +  ++ R+         S+A + + VS   I+  +S+Q   +L KQ R+   +P ++ D  D+   +F++ NGL+ DP
Subjt:  RAKSQKKYQLSLWTNYSGYQKNRSSIRVRI--------PSSAGNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDP

Query:  CAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLG
          V K+R ++N W+ +EK+ F EK     K+FG IA+FL+ KT A+C+ +YY   K++ ++   +  + ++            GK    +      +   
Subjt:  CAVEKERMMINPWTSEEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLG

Query:  AASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMT--
            M   + +    +   + A           E+ +      N++E +  +      G  + E     + S   +   SQ    +KG  T   R M   
Subjt:  AASAMTACAHKYSSSRSSGRTAYHTTQCDDGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMT--

Query:  ANIPRFIDDEIFSDESCGEM-DPSYWTDGEKSLFIEAVRAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLDLI
        AN    I  +  ++ +  E+ + S WT+ E     + +  +G+N+S I+  VG+K+  QCK F+   +K   LD I
Subjt:  ANIPRFIDDEIFSDESCGEM-DPSYWTDGEKSLFIEAVRAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLDLI

Arabidopsis top hitse value%identityAlignment
AT3G52250.1 Duplicated homeodomain-like superfamily protein4.3e-15535.48Show/hide
Query:  MPPEPLPWDRKDFFKERKHERSE--FLGPVPRWRD--SSSHGSREF-SRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSH
        MP +   WDRK+  ++RKH+R E  F  P  RWRD  SS H  REF SR GSGDFRRP  HG+QGG H F +E  HGY  S S S +M +N   RPS S 
Subjt:  MPPEPLPWDRKDFFKERKHERSE--FLGPVPRWRD--SSSHGSREF-SRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSH

Query:  GDGKYARNGRENR-SFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRD-------------------KVKGQH------
        GD +Y RN R++R S SQ+EWK ++W  SNGS+ +  R    +   +RSV +  ++ S +HS  VN  D                   K+K +H      
Subjt:  GDGKYARNGRENR-SFSQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRD-------------------KVKGQH------

Query:  --------------------------------------------------------------------------DKIDDVNGLGTNQRRDREYSVSSSGW
                                                                                  + +  +N +  +++  +E S+ ++G 
Subjt:  --------------------------------------------------------------------------DKIDDVNGLGTNQRRDREYSVSSSGW

Query:  KPLKWTRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEV---PDGTAF
         PL W   G  +S++S   HSSS KS+ A+DS++ K E + K  +   S S D   CA ++   +E + RKK RLGWGEGLAKYEKKKV+V    DGT  
Subjt:  KPLKWTRGGGLSSRTSTSGHSSSTKSIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEV---PDGTAF

Query:  TNVNAESTHSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAA-SDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQL
             E  HSLN ++ +K P  +   D  SP TPSSV C SSPG  +KSS KAA + +DVSNM  SP        E    N+E+LDN S++  G  L +L
Subjt:  TNVNAESTHSLNSSLIEKGPRGSGFSDCTSPATPSSVVCGSSPGGDEKSSGKAA-SDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQL

Query:  LQSNDSISVDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNIS
        L ++DS + D   +  T+M+ LL +K EI K +E TESEIDLLEN+ + L+ E +        SS    GD N  +     ++ P A    V  +     
Subjt:  LQSNDSISVDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKRLRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNIS

Query:  KKMSLSTSDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISS----DINESDDG----GDSLFNLILAS
         +  +  +   +V  DV E + G        +S  T E+         ++++P       +  + +     ++    +++ +DD      D  +  +L++
Subjt:  KKMSLSTSDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSILIKREAVTLEPISS----DINESDDG----GDSLFNLILAS

Query:  NKESACRASEALISMLPANEHKIDIWSTNVSC-SQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGY
        NK+ AC +S     +LP + +  D       C +Q    ++E+ A +  LLR +E+++ L+++A+Q  WK+ L    + K ++KS KK +L       GY
Subjt:  NKESACRASEALISMLPANEHKIDIWSTNVSC-SQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPVRKLRAKSQKKYQLSLWTNYSGY

Query:  QKNRSSIRVRIPSSA-GNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLA
         K   S+R+R  SSA    + V  T ++ ++ K L  + +K +R  LKMPA++LD+K+++ SRF+S+NGL+EDPC VEKER MINPWTSEEK++F+  LA
Subjt:  QKNRSSIRVRIPSSA-GNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTSEEKDVFMEKLA

Query:  CFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLE-FGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHT
          GKDF KIA+ L  KTTADCI++YYKNHKSDCF K KK   +GK+ K    TY++   KKW  E  AASL++LG  S + A A K +S+R         
Subjt:  CFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLE-FGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHT

Query:  TQCD-------DGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQD-------LKCKKGATTVLRRRMTA--NIPRFIDDEI
          C        DG +    S+      +    ADVLA   G LS E + SC+ ++   R+   D       +K  + + T+         N     +D+ 
Subjt:  TQCD-------DGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQD-------LKCKKGATTVLRRRMTA--NIPRFIDDEI

Query:  FSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLDLI
         S+ESCGE  P +WTD E+S FI+    +GKNF+ IS +VGT+S DQCK+FFSK RKCLGL+ I
Subjt:  FSDESCGEMDPSYWTDGEKSLFIEAVRAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLDLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCCTGAACCATTGCCCTGGGATCGGAAGGACTTCTTCAAGGAGAGGAAACACGAGAGGTCAGAGTTTCTCGGACCTGTACCTAGATGGAGAGACTCATCTAGTCA
CGGATCCCGTGAGTTTAGTAGGTGGGGATCAGGCGATTTTCGCAGACCGCCAGGTCATGGTAGGCAGGGTGGTTGGCATGTGTTCTCTGACGAATATGGTCATGGATATG
GGCCTTCCATGTCATTCAGTAACAAGATGCTGGAAAATGTTAGTAGCCGGCCATCTGTTTCACATGGCGATGGGAAGTACGCTAGGAATGGTAGGGAAAACAGATCTTTT
AGTCAAAGAGAATGGAAAGGTCATTCCTGGGCAACAAGTAACGGGTCTACAAACAATTGTGGTAGGCTGCAGCATGATCTAAATTATGATCAGAGGTCAGTTCACGATAT
GCTGATATATCCCTCTCATTCTCATTCTGACTTTGTAAACCCACGAGATAAGGTAAAAGGCCAGCATGATAAGATCGATGATGTCAATGGGTTAGGCACAAACCAGAGAC
GTGATCGGGAGTATTCAGTGAGTTCCTCAGGGTGGAAGCCTCTTAAATGGACACGTGGTGGTGGCTTATCTTCACGCACTTCAACATCAGGTCATTCCAGTAGCACAAAG
AGTATTGATGCTTTAGATTCTAATGAGACAAAGTCTGAGACAGTGTTGAAAAATGAGTCACAAAATCTATCTCCTTCCGCTGATCCTGCTGAGTGTGCAATGTCTTCTAT
GCCATGTGATGAGACAACTGTCAGGAAGAAGCCACGGCTAGGATGGGGCGAGGGGCTTGCCAAGTATGAAAAAAAGAAGGTTGAAGTTCCTGATGGTACTGCCTTTACAA
ATGTTAATGCAGAATCTACCCATTCTCTTAATTCCAGCTTGATTGAAAAAGGCCCTAGAGGTTCAGGGTTTTCGGATTGTACCTCACCTGCGACCCCTTCTTCTGTCGTT
TGTGGTTCCTCTCCAGGTGGGGATGAAAAATCATCCGGAAAGGCAGCTAGTGATAACGATGTCAGTAACATGCATGGTTCACCTGGTTCTGGTTTTCAGAACCAGTACGA
GGAACCATCCTTTAATTTAGAGAAGTTGGATAACTTTTCAATTGATGATTTGGGCTCTCCACTCATTCAACTGCTGCAATCTAATGATTCGATTTCTGTGGATCCTGGTA
TTATGAGTTGCACTGCCATGAGTAAGTTACTTATATATAAAAATGAAATTTCTAAAGTGTTGGAGACGACTGAGTCTGAAATTGATTTACTTGAAAATGAACTGAAGAGG
TTGAGATCTGAAAGTAAGGGCTCCTTTTCTTTTCCTTTAGCATCCAGTTCTTTGATGACTGGGGATAAAAACTTTGAGGAACACAGTGATGTCTCTAATATGATCCCTCC
AGCTGCTACCCTGCCAGTTGTTACTTCCGAGAATAATATTTCAAAAAAGATGTCACTTTCCACAAGTGACTCGGAAGAAGTGCATACGGATGTCAACGAAAAGGCTGTCG
GTAGGTTTGGGAGGTCGGATGCAATGGAATCTGTCATCACCGGAGAGAAGCTAACAATATCTGGTTGCAGTGTTGGAGATAATGTTGTGCCATCTGTTGACAATAGCATT
CTTATTAAGAGGGAAGCTGTCACATTAGAACCCATTTCTAGTGATATAAATGAATCTGATGATGGGGGAGATAGCCTGTTCAATTTAATTCTGGCATCCAATAAAGAATC
TGCATGCAGGGCGTCTGAAGCTTTAATCAGCATGTTGCCTGCTAATGAACATAAGATTGACATTTGGAGCACAAATGTCTCTTGCTCACAGAATCAATGTTTAATGAGAG
AGAGATATGCAAAGAAGAAGAGGTTATTAAGATTTAAGGAGAGAGTAATAACCCTTAAGTATAGAGCCTACCAGTCCTTGTGGAAGGAGAGTTTGCATGTGCCACCTGTA
AGGAAGTTACGTGCAAAATCTCAGAAAAAATATCAGTTGAGTTTGTGGACCAATTACAGTGGTTATCAGAAGAACCGCTCTTCCATTCGAGTCCGTATTCCTTCATCTGC
AGGAAATCCGAACCCCGTTTCTAACACGGCGATTCTTAGGCATGTGAGCAAGCAGCTTTCTAATTCCCTGGTTAAGCAGTACCGGAGGACCTTGAAGATGCCCGCATTAG
TTTTGGACCAGAAGGATAAGATGGCTTCAAGATTCGTCTCTAATAATGGGTTAGTTGAGGACCCGTGCGCTGTTGAGAAGGAGAGGATGATGATCAACCCATGGACCTCC
GAAGAGAAAGATGTTTTCATGGAGAAGTTGGCATGTTTCGGGAAAGATTTTGGGAAAATTGCCACATTTCTTGATCACAAGACGACAGCAGACTGTATTGAGTTCTACTA
CAAGAACCACAAATCTGATTGCTTTGAGAAAACAAAGAAGCTGGAGTTTGGGAAGAAGGCGAAGTCCTCAACCAGTACCTATTTGGTGACAACAGGGAAGAAATGGAATC
CTGAAACAAATGCTGCTTCTCTTGAACTGTTGGGTGCTGCTTCAGCGATGACAGCTTGTGCTCATAAGTATTCAAGCAGCAGGTCGAGCGGAAGAACCGCATATCACACA
ACTCAATGTGATGATGGTCTATCGGAAAGGGCCAAGAGTTTTCATATGTTTGGAAATGAAAGAGAGAAAGTTGCTGCTGATGTGCTAGCTGGTATATGTGGTTCCCTGTC
TTCAGAAGCCATGGGTTCATGTATCACTAGTAATTTCGACCGAAGAGACGGTTCTCAGGATTTGAAGTGCAAAAAAGGTGCGACAACTGTATTAAGACGGCGAATGACAG
CAAATATTCCACGGTTTATTGATGATGAGATTTTTTCAGACGAGAGCTGTGGAGAAATGGATCCTTCTTATTGGACTGATGGGGAGAAGTCGCTTTTCATAGAAGCAGTG
AGAGCTTATGGAAAGAATTTCTCAATGATCTCTACACATGTAGGAACAAAATCCACAGATCAGTGCAAGATCTTCTTCAGCAAAGCACGGAAGTGCCTTGGGTTGGATTT
GATATGTACTGCA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCCTGAACCATTGCCCTGGGATCGGAAGGACTTCTTCAAGGAGAGGAAACACGAGAGGTCAGAGTTTCTCGGACCTGTACCTAGATGGAGAGACTCATCTAGTCA
CGGATCCCGTGAGTTTAGTAGGTGGGGATCAGGCGATTTTCGCAGACCGCCAGGTCATGGTAGGCAGGGTGGTTGGCATGTGTTCTCTGACGAATATGGTCATGGATATG
GGCCTTCCATGTCATTCAGTAACAAGATGCTGGAAAATGTTAGTAGCCGGCCATCTGTTTCACATGGCGATGGGAAGTACGCTAGGAATGGTAGGGAAAACAGATCTTTT
AGTCAAAGAGAATGGAAAGGTCATTCCTGGGCAACAAGTAACGGGTCTACAAACAATTGTGGTAGGCTGCAGCATGATCTAAATTATGATCAGAGGTCAGTTCACGATAT
GCTGATATATCCCTCTCATTCTCATTCTGACTTTGTAAACCCACGAGATAAGGTAAAAGGCCAGCATGATAAGATCGATGATGTCAATGGGTTAGGCACAAACCAGAGAC
GTGATCGGGAGTATTCAGTGAGTTCCTCAGGGTGGAAGCCTCTTAAATGGACACGTGGTGGTGGCTTATCTTCACGCACTTCAACATCAGGTCATTCCAGTAGCACAAAG
AGTATTGATGCTTTAGATTCTAATGAGACAAAGTCTGAGACAGTGTTGAAAAATGAGTCACAAAATCTATCTCCTTCCGCTGATCCTGCTGAGTGTGCAATGTCTTCTAT
GCCATGTGATGAGACAACTGTCAGGAAGAAGCCACGGCTAGGATGGGGCGAGGGGCTTGCCAAGTATGAAAAAAAGAAGGTTGAAGTTCCTGATGGTACTGCCTTTACAA
ATGTTAATGCAGAATCTACCCATTCTCTTAATTCCAGCTTGATTGAAAAAGGCCCTAGAGGTTCAGGGTTTTCGGATTGTACCTCACCTGCGACCCCTTCTTCTGTCGTT
TGTGGTTCCTCTCCAGGTGGGGATGAAAAATCATCCGGAAAGGCAGCTAGTGATAACGATGTCAGTAACATGCATGGTTCACCTGGTTCTGGTTTTCAGAACCAGTACGA
GGAACCATCCTTTAATTTAGAGAAGTTGGATAACTTTTCAATTGATGATTTGGGCTCTCCACTCATTCAACTGCTGCAATCTAATGATTCGATTTCTGTGGATCCTGGTA
TTATGAGTTGCACTGCCATGAGTAAGTTACTTATATATAAAAATGAAATTTCTAAAGTGTTGGAGACGACTGAGTCTGAAATTGATTTACTTGAAAATGAACTGAAGAGG
TTGAGATCTGAAAGTAAGGGCTCCTTTTCTTTTCCTTTAGCATCCAGTTCTTTGATGACTGGGGATAAAAACTTTGAGGAACACAGTGATGTCTCTAATATGATCCCTCC
AGCTGCTACCCTGCCAGTTGTTACTTCCGAGAATAATATTTCAAAAAAGATGTCACTTTCCACAAGTGACTCGGAAGAAGTGCATACGGATGTCAACGAAAAGGCTGTCG
GTAGGTTTGGGAGGTCGGATGCAATGGAATCTGTCATCACCGGAGAGAAGCTAACAATATCTGGTTGCAGTGTTGGAGATAATGTTGTGCCATCTGTTGACAATAGCATT
CTTATTAAGAGGGAAGCTGTCACATTAGAACCCATTTCTAGTGATATAAATGAATCTGATGATGGGGGAGATAGCCTGTTCAATTTAATTCTGGCATCCAATAAAGAATC
TGCATGCAGGGCGTCTGAAGCTTTAATCAGCATGTTGCCTGCTAATGAACATAAGATTGACATTTGGAGCACAAATGTCTCTTGCTCACAGAATCAATGTTTAATGAGAG
AGAGATATGCAAAGAAGAAGAGGTTATTAAGATTTAAGGAGAGAGTAATAACCCTTAAGTATAGAGCCTACCAGTCCTTGTGGAAGGAGAGTTTGCATGTGCCACCTGTA
AGGAAGTTACGTGCAAAATCTCAGAAAAAATATCAGTTGAGTTTGTGGACCAATTACAGTGGTTATCAGAAGAACCGCTCTTCCATTCGAGTCCGTATTCCTTCATCTGC
AGGAAATCCGAACCCCGTTTCTAACACGGCGATTCTTAGGCATGTGAGCAAGCAGCTTTCTAATTCCCTGGTTAAGCAGTACCGGAGGACCTTGAAGATGCCCGCATTAG
TTTTGGACCAGAAGGATAAGATGGCTTCAAGATTCGTCTCTAATAATGGGTTAGTTGAGGACCCGTGCGCTGTTGAGAAGGAGAGGATGATGATCAACCCATGGACCTCC
GAAGAGAAAGATGTTTTCATGGAGAAGTTGGCATGTTTCGGGAAAGATTTTGGGAAAATTGCCACATTTCTTGATCACAAGACGACAGCAGACTGTATTGAGTTCTACTA
CAAGAACCACAAATCTGATTGCTTTGAGAAAACAAAGAAGCTGGAGTTTGGGAAGAAGGCGAAGTCCTCAACCAGTACCTATTTGGTGACAACAGGGAAGAAATGGAATC
CTGAAACAAATGCTGCTTCTCTTGAACTGTTGGGTGCTGCTTCAGCGATGACAGCTTGTGCTCATAAGTATTCAAGCAGCAGGTCGAGCGGAAGAACCGCATATCACACA
ACTCAATGTGATGATGGTCTATCGGAAAGGGCCAAGAGTTTTCATATGTTTGGAAATGAAAGAGAGAAAGTTGCTGCTGATGTGCTAGCTGGTATATGTGGTTCCCTGTC
TTCAGAAGCCATGGGTTCATGTATCACTAGTAATTTCGACCGAAGAGACGGTTCTCAGGATTTGAAGTGCAAAAAAGGTGCGACAACTGTATTAAGACGGCGAATGACAG
CAAATATTCCACGGTTTATTGATGATGAGATTTTTTCAGACGAGAGCTGTGGAGAAATGGATCCTTCTTATTGGACTGATGGGGAGAAGTCGCTTTTCATAGAAGCAGTG
AGAGCTTATGGAAAGAATTTCTCAATGATCTCTACACATGTAGGAACAAAATCCACAGATCAGTGCAAGATCTTCTTCAGCAAAGCACGGAAGTGCCTTGGGTTGGATTT
GATATGTACTGCA
Protein sequenceShow/hide protein sequence
MPPEPLPWDRKDFFKERKHERSEFLGPVPRWRDSSSHGSREFSRWGSGDFRRPPGHGRQGGWHVFSDEYGHGYGPSMSFSNKMLENVSSRPSVSHGDGKYARNGRENRSF
SQREWKGHSWATSNGSTNNCGRLQHDLNYDQRSVHDMLIYPSHSHSDFVNPRDKVKGQHDKIDDVNGLGTNQRRDREYSVSSSGWKPLKWTRGGGLSSRTSTSGHSSSTK
SIDALDSNETKSETVLKNESQNLSPSADPAECAMSSMPCDETTVRKKPRLGWGEGLAKYEKKKVEVPDGTAFTNVNAESTHSLNSSLIEKGPRGSGFSDCTSPATPSSVV
CGSSPGGDEKSSGKAASDNDVSNMHGSPGSGFQNQYEEPSFNLEKLDNFSIDDLGSPLIQLLQSNDSISVDPGIMSCTAMSKLLIYKNEISKVLETTESEIDLLENELKR
LRSESKGSFSFPLASSSLMTGDKNFEEHSDVSNMIPPAATLPVVTSENNISKKMSLSTSDSEEVHTDVNEKAVGRFGRSDAMESVITGEKLTISGCSVGDNVVPSVDNSI
LIKREAVTLEPISSDINESDDGGDSLFNLILASNKESACRASEALISMLPANEHKIDIWSTNVSCSQNQCLMRERYAKKKRLLRFKERVITLKYRAYQSLWKESLHVPPV
RKLRAKSQKKYQLSLWTNYSGYQKNRSSIRVRIPSSAGNPNPVSNTAILRHVSKQLSNSLVKQYRRTLKMPALVLDQKDKMASRFVSNNGLVEDPCAVEKERMMINPWTS
EEKDVFMEKLACFGKDFGKIATFLDHKTTADCIEFYYKNHKSDCFEKTKKLEFGKKAKSSTSTYLVTTGKKWNPETNAASLELLGAASAMTACAHKYSSSRSSGRTAYHT
TQCDDGLSERAKSFHMFGNEREKVAADVLAGICGSLSSEAMGSCITSNFDRRDGSQDLKCKKGATTVLRRRMTANIPRFIDDEIFSDESCGEMDPSYWTDGEKSLFIEAV
RAYGKNFSMISTHVGTKSTDQCKIFFSKARKCLGLDLICTA