; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026965 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026965
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPlant protein of unknown function (DUF639)
Genome locationtig00153047:2662339..2667184
RNA-Seq ExpressionSgr026965
SyntenySgr026965
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006927 - Protein of unknown function DUF639


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589541.1 hypothetical protein SDJN03_14964, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0084.4Show/hide
Query:  IGFGHLFDC------RWGRMQLIWCLIG--------------------KTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANI
        +G   +F C       WG ++L WCLIG                    KT+STLEGLVRDSS KWLLGKRSFFDEE+EEIE+ PSAQRNW+SELSPFAN+
Subjt:  IGFGHLFDC------RWGRMQLIWCLIG--------------------KTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANI

Query:  VVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRI
        VVRRC+KILGVSASELQQSFN EAIDS+KL S+YARN LEYCCFRALAL  Q TGYL DKKFRRLTFDMMIAWEAPA SSQPLL+IDE+ASVGVEAFSRI
Subjt:  VVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRI

Query:  APAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIRKMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALY
        APAVPIISNVI+SENLFE L +S GARLQFSVYDKYLSGLEKVI+KMKNLS+SNLL SERS REEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALY
Subjt:  APAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIRKMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALY

Query:  FEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKA
        FEALRVVS+DKAKRYDLSDDLKQV+KPELTGPWGTRLFDKAVLY+SISLSEPVVIEFPELKGHTRRDFW+ IIREVLYVHRFINKFQIKGIQR EAL+KA
Subjt:  FEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKA

Query:  VLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLANMSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVG
        VLGILRLQAIQDICST SL CESLLMFNLCDQLPGGDLILETLANMS +KE++RTN SSHEKGMYSISALDV SHLGFG+G A +D NENEL+VGEIAVG
Subjt:  VLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLANMSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVG

Query:  KMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVL
        KMTPLERAVKES+NNYEKVVMA+ETVDGAKVDGIDTNLAVMKELMLPVSELG YL+SLALWEDPMKSLAFC++SSYIICRDWL Y  ALLLAFMA+FM++
Subjt:  KMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVL

Query:  TRFFNQ-GRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGNIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSP
        TR FNQ G P+DEVKV+APPAMNAMEQLLAVQN ISQ EQ IQDGNIVLLKLRALLLAIFPQAT+KFAVSLLVIA TLAFLP+KYI L+VFLEAFTRYSP
Subjt:  TRFFNQ-GRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGNIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSP

Query:  PRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
        PRKASTERWTRRV+EWWFSIPAAPVIL+REKEDKK K
Subjt:  PRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK

XP_022134973.1 uncharacterized protein LOC111007092 [Momordica charantia]0.0e+0090.74Show/hide
Query:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR
        ++ KTRST+EGLVRDSS KWLLG+RSFFDEELEEIERSPSAQRNWISELSPFAN+VVRRCSKILGVS SELQQSFNVEAIDSIK+P HYARN LEYC FR
Subjt:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR

Query:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR
        ALALSTQ TGYLADKKFRRLTFDMMIAWEAPA SSQPL+NIDE+ASVGVEAFSRIA AVPIISNVIISENLFEVL +S GARLQFSVYDKYLSGLEKVIR
Subjt:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR

Query:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR
        KMK LSESNLLHSERS R+EKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLY+
Subjt:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR

Query:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN
        S+SLSEPVVIEFPELKGHTRRDFW+ II EV YVHRFI+KFQIKGIQR EAL++AVLGI+RLQAIQDICST SLRCESLLMFNLCDQLPGGD ILETLAN
Subjt:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN

Query:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
        MS +KESDRT+KSS EKGMYSISALD+VS LGFG+GTASNDSNENELLVG+IAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
Subjt:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM

Query:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN
        LPVSELGKYLLSLA WEDPM+SLAFC++SSYII RDWLAY F+LLL FMA+FM+LTR FNQGRP++EVKVIAPP MNAMEQLLAVQN ISQAEQFIQDGN
Subjt:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN

Query:  IVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
        IVLLKLRAL+LAIFPQAT+KFAVSLLVIA TLAFLP+KYIFLVVFLEAFTRYSPPR ASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
Subjt:  IVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK

XP_022921500.1 uncharacterized protein LOC111429749 isoform X1 [Cucurbita moschata]0.0e+0085.99Show/hide
Query:  WGRMQLIWCLIG--------------------KTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASEL
        WG ++L WCLIG                    KT+STLEGLVRDSS KWLLGKRSFFDEE+EEIE+ PSAQRNW+SELSPFAN+VVRRC+KILGVSASEL
Subjt:  WGRMQLIWCLIG--------------------KTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASEL

Query:  QQSFNVEAIDSIKLPSHYARNFLEYCCFRALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENL
        QQSFN EAIDS+KL S+YARN LEYCCFRALAL TQ TGYL DKKFRRLTFDMMIAWEAPA SSQPLL+IDE+ASVGVEAFSRIAPAVPIISNVI+SENL
Subjt:  QQSFNVEAIDSIKLPSHYARNFLEYCCFRALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENL

Query:  FEVLATSAGARLQFSVYDKYLSGLEKVIRKMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYD
        FE L +S  ARLQFSVYDKYLSGLEKVI+KMKNLS+SNLL SERS REEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVS+DKAKRYD
Subjt:  FEVLATSAGARLQFSVYDKYLSGLEKVIRKMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYD

Query:  LSDDLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICST
        LSDDLKQV+KPELTGPWGTRLFDKAVLY+SISLSEPVVIEFPELKGHTRRDFW+ IIREVLYVHRFINKFQIKGIQR EAL+KAVLGILRLQAIQDICST
Subjt:  LSDDLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICST

Query:  TSLRCESLLMFNLCDQLPGGDLILETLANMSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNY
         SL CESLLMFNLCDQLPGGDLILETLANMS +KE++RTN SSHEKGMYSISALDV SHLGFG+GTA +D NENEL+VGEIAVGKMTPLERAVKES+NNY
Subjt:  TSLRCESLLMFNLCDQLPGGDLILETLANMSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNY

Query:  EKVVMAQETVDGAKVDGIDTNLAVMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQ-GRPIDEVKV
        EKVVMA+ETVDGAKVDGIDTNLAVMKELMLPVSELG YL+SLALWEDPMKSLAFC++SSYIICRDWL Y  ALLLAFMA+FM++TR FNQ G P+DEVKV
Subjt:  EKVVMAQETVDGAKVDGIDTNLAVMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQ-GRPIDEVKV

Query:  IAPPAMNAMEQLLAVQNAISQAEQFIQDGNIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREW
        +APPAMNAMEQLLAVQN ISQ EQ IQDGNIVLLKLRALLLAIFPQAT+KFAVSLLVIA TLAFLP+KYI L+VFLEAFTRYSPPRKASTERWTRRV+EW
Subjt:  IAPPAMNAMEQLLAVQNAISQAEQFIQDGNIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREW

Query:  WFSIPAAPVILEREKEDKKRK
        WFSIPAAPVIL+REKEDKK K
Subjt:  WFSIPAAPVILEREKEDKKRK

XP_023516062.1 uncharacterized protein LOC111780036 [Cucurbita pepo subsp. pepo]0.0e+0088.73Show/hide
Query:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR
        ++ KT+STLEGLVRDSSLKWLLGKRSFFDEE+EEIE+ PSAQRNW+SELSPFAN+VVRRC+KILGVSASELQQSFN EAIDS+KL S+YARN LEYCCFR
Subjt:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR

Query:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR
        ALAL TQ TGYL DKKFRRLTFDMMIAWEAPA SSQPLL+IDE+ASVGVEAFSRIAPAVPIISNVI+SENLFE L +S GARLQFSVYDKYLSGLEKVI+
Subjt:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR

Query:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR
        KMKNLS+SNLL SERS REEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVS+DKAKRYDL DDLKQV+KPELTGPWGTRLFDKAVLY+
Subjt:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR

Query:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN
        SISLSEPVV+EFPELKGHTRRDFW+ IIREVLYVHRFINKFQIKGIQR EAL+KAVLGILRLQAIQDICST SL CESLLMFNLCDQLPGGDLILETLAN
Subjt:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN

Query:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
        MS +KE++RTN SSHEKGMYSISALDV SHLGFG+GTA +DSNENEL+VGEIAVGKMTPLERAVKES+NNYEKVVMA+ETVDGAKVDGIDTNLAVMKELM
Subjt:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM

Query:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQ-GRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDG
        LPVSELG YL+SLALWEDPMKSLAFC+VSSYIICRDWL Y  ALLLAFMA+FM++TR FNQ G P+DEVKV+APPAMNAMEQLLAVQN ISQ EQ IQDG
Subjt:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQ-GRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDG

Query:  NIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
        NIVLLKLRALLLAIFPQAT+KFAVSLLVIA TLAFLP+KYI L+VFLEAFTRYSPPRKASTERWTRRV+EWWFSIPAAPVIL+REKEDKK K
Subjt:  NIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK

XP_038899081.1 uncharacterized protein LOC120086478 [Benincasa hispida]0.0e+0089.87Show/hide
Query:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR
        ++ KTRST+EGLVRDSS KWLLGKRSFFDEELEEIERSPSAQRNWISELSPFAN+VVRRC+KILGVSASELQ+SFN+EAIDSIK+PS+YARNFLEYCCFR
Subjt:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR

Query:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR
        ALAL TQ TGYLADKKFRRLTFD+MIAWEAPA SSQPLLNIDEDASVGVEAFSRIAPAVPIISNVI+SENLF VL +SA ARLQFSVYDKYLSGLEK IR
Subjt:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR

Query:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR
        KMKNLSESNLL SERS REEKILE+DGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVS+DKAKRYDLSDDLKQV+KPELTGPWGTRLFDKAVLY+
Subjt:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR

Query:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN
        S SLSEPVVIEFPELKGHTRRDFW+ IIREVLYVHRFINKFQIKGIQR EAL+KAVLGILRLQAIQDI ST SL CESLLMFNLCDQLPGGDLILETLAN
Subjt:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN

Query:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
        MS +KESDRTN+SS  KGMYSISALD+VSHLGFG+GTA +DSNENELLVGEIAVGKMTPLERAVKES+NNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
Subjt:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM

Query:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN
        LPVSELGK+LLSLALWEDP+KSLAF +VSSYII RDWL Y  ALLLAFMA+FM+LTR FNQ   +DEVKV+APPAMNAMEQLLAVQNAISQAEQFIQDGN
Subjt:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN

Query:  IVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
        I LLKLRALLLAIFPQATMKFAV LLV+A TLAFLP+KYI LVVFLE FTRYSPPRK STERWTRRVREWWFSIPAAPVILEREKEDKK+K
Subjt:  IVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK

TrEMBL top hitse value%identityAlignment
A0A1S3BW86 uncharacterized protein LOC1034941290.0e+0089.44Show/hide
Query:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR
        ++ KTRSTLEGLVRDSSLKWLLGKRSFFDEELEE+ERSPSAQRNWISELSPFAN+VVRRC+KILGVSASELQQSFN+EAIDSIK+PS+YARNFLEYCCFR
Subjt:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR

Query:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR
        ALAL TQ TGYLADKKFRRLTFD+MIAWEAPA SSQPLLNIDEDASVGVEAF RIAPAVPIISNVI+SENLF VL +SA +RLQFSVYDKYLS LEKVIR
Subjt:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR

Query:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR
        KMKNLSESNLL SERS R+EKILE+DGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVS+DKAKRYDLSDDLKQV+KPELTGPWGTRLFDKAVLY 
Subjt:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR

Query:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN
        S+SLSEPVVIEFPELKGHTRRDFW+ IIREVLYVHRFINKFQIKGIQR EAL+KAVLGILRLQAIQD+ ST SL CESLLMFNLCDQLPGGDLILETLAN
Subjt:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN

Query:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
        MS +KE DRTN+SS  KGMYSISALD+VSHLGFG+GTA +DSNE+ELLVGEIAVGKMTPLERAV+ES+NNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
Subjt:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM

Query:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN
        LPVSELGK LLSLALWEDP+KSLAFC+VSSYII RDWL Y  ALLLAFMAIFM+LTRFFN   P+DEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN
Subjt:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN

Query:  IVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
        I LLKLRALLLAIFPQATMKFA  LLV+A TLAFLP+KYI L+VFLE FTRYSPPRK STERWTRRVREWWFSIPAAPVILEREKEDKK+K
Subjt:  IVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK

A0A5D3DXE8 DUF639 domain-containing protein0.0e+0089.44Show/hide
Query:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR
        ++ KTRSTLEGLVRDSSLKWLLGKRSFFDEELEE+ERSPSAQRNWISELSPFAN+VVRRC+KILGVSASELQQSFN+EAIDSIK+PS+YARNFLEYCCFR
Subjt:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR

Query:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR
        ALAL TQ TGYLADKKFRRLTFD+MIAWEAPA SSQPLLNIDEDASVGVEAF RIAPAVPIISNVI+SENLF VL +SA +RLQFSVYDKYLS LEKVIR
Subjt:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR

Query:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR
        KMKNLSESNLL SERS R+EKILE+DGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVS+DKAKRYDLSDDLKQV+KPELTGPWGTRLFDKAVLY+
Subjt:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR

Query:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN
        S+SLSEPVVIEFPELKGHTRRDFW+ IIREVLYVHRFINKFQIKGIQR EAL+KAVLGILRLQAIQD+ ST SL CESLLMFNLCDQLPGGDLILETLAN
Subjt:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN

Query:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
        MS +KE DRTN+SS  KGMYSISALD+VSHLGFG+GTA +DSNE+ELLVGEIAVGKMTPLERAV+ES+NNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
Subjt:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM

Query:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN
        LPVSELGK LLSLALWEDP+KSLAFC+VSSYII RDWL Y  ALLLAFMAIFM+LTRFFN   P+DEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN
Subjt:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN

Query:  IVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
        I LLKLRALLLAIFPQATMKFA  LLV+A TLAFLP+KYI L+VFLE FTRYSPPRK STERWTRRVREWWFSIPAAPVILEREKEDKK+K
Subjt:  IVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK

A0A6J1C3H7 uncharacterized protein LOC1110070920.0e+0090.74Show/hide
Query:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR
        ++ KTRST+EGLVRDSS KWLLG+RSFFDEELEEIERSPSAQRNWISELSPFAN+VVRRCSKILGVS SELQQSFNVEAIDSIK+P HYARN LEYC FR
Subjt:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR

Query:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR
        ALALSTQ TGYLADKKFRRLTFDMMIAWEAPA SSQPL+NIDE+ASVGVEAFSRIA AVPIISNVIISENLFEVL +S GARLQFSVYDKYLSGLEKVIR
Subjt:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR

Query:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR
        KMK LSESNLLHSERS R+EKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLY+
Subjt:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR

Query:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN
        S+SLSEPVVIEFPELKGHTRRDFW+ II EV YVHRFI+KFQIKGIQR EAL++AVLGI+RLQAIQDICST SLRCESLLMFNLCDQLPGGD ILETLAN
Subjt:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN

Query:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
        MS +KESDRT+KSS EKGMYSISALD+VS LGFG+GTASNDSNENELLVG+IAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
Subjt:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM

Query:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN
        LPVSELGKYLLSLA WEDPM+SLAFC++SSYII RDWLAY F+LLL FMA+FM+LTR FNQGRP++EVKVIAPP MNAMEQLLAVQN ISQAEQFIQDGN
Subjt:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGN

Query:  IVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
        IVLLKLRAL+LAIFPQAT+KFAVSLLVIA TLAFLP+KYIFLVVFLEAFTRYSPPR ASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
Subjt:  IVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK

A0A6J1E1J8 uncharacterized protein LOC111429749 isoform X20.0e+0088.44Show/hide
Query:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR
        ++ KT+STLEGLVRDSS KWLLGKRSFFDEE+EEIE+ PSAQRNW+SELSPFAN+VVRRC+KILGVSASELQQSFN EAIDS+KL S+YARN LEYCCFR
Subjt:  LIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFR

Query:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR
        ALAL TQ TGYL DKKFRRLTFDMMIAWEAPA SSQPLL+IDE+ASVGVEAFSRIAPAVPIISNVI+SENLFE L +S  ARLQFSVYDKYLSGLEKVI+
Subjt:  ALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIR

Query:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR
        KMKNLS+SNLL SERS REEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVS+DKAKRYDLSDDLKQV+KPELTGPWGTRLFDKAVLY+
Subjt:  KMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYR

Query:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN
        SISLSEPVVIEFPELKGHTRRDFW+ IIREVLYVHRFINKFQIKGIQR EAL+KAVLGILRLQAIQDICST SL CESLLMFNLCDQLPGGDLILETLAN
Subjt:  SISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLAN

Query:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM
        MS +KE++RTN SSHEKGMYSISALDV SHLGFG+GTA +D NENEL+VGEIAVGKMTPLERAVKES+NNYEKVVMA+ETVDGAKVDGIDTNLAVMKELM
Subjt:  MSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELM

Query:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQ-GRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDG
        LPVSELG YL+SLALWEDPMKSLAFC++SSYIICRDWL Y  ALLLAFMA+FM++TR FNQ G P+DEVKV+APPAMNAMEQLLAVQN ISQ EQ IQDG
Subjt:  LPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQ-GRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDG

Query:  NIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
        NIVLLKLRALLLAIFPQAT+KFAVSLLVIA TLAFLP+KYI L+VFLEAFTRYSPPRKASTERWTRRV+EWWFSIPAAPVIL+REKEDKK K
Subjt:  NIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK

A0A6J1E5Y6 uncharacterized protein LOC111429749 isoform X10.0e+0085.99Show/hide
Query:  WGRMQLIWCLIG--------------------KTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASEL
        WG ++L WCLIG                    KT+STLEGLVRDSS KWLLGKRSFFDEE+EEIE+ PSAQRNW+SELSPFAN+VVRRC+KILGVSASEL
Subjt:  WGRMQLIWCLIG--------------------KTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASEL

Query:  QQSFNVEAIDSIKLPSHYARNFLEYCCFRALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENL
        QQSFN EAIDS+KL S+YARN LEYCCFRALAL TQ TGYL DKKFRRLTFDMMIAWEAPA SSQPLL+IDE+ASVGVEAFSRIAPAVPIISNVI+SENL
Subjt:  QQSFNVEAIDSIKLPSHYARNFLEYCCFRALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENL

Query:  FEVLATSAGARLQFSVYDKYLSGLEKVIRKMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYD
        FE L +S  ARLQFSVYDKYLSGLEKVI+KMKNLS+SNLL SERS REEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVS+DKAKRYD
Subjt:  FEVLATSAGARLQFSVYDKYLSGLEKVIRKMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYD

Query:  LSDDLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICST
        LSDDLKQV+KPELTGPWGTRLFDKAVLY+SISLSEPVVIEFPELKGHTRRDFW+ IIREVLYVHRFINKFQIKGIQR EAL+KAVLGILRLQAIQDICST
Subjt:  LSDDLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICST

Query:  TSLRCESLLMFNLCDQLPGGDLILETLANMSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNY
         SL CESLLMFNLCDQLPGGDLILETLANMS +KE++RTN SSHEKGMYSISALDV SHLGFG+GTA +D NENEL+VGEIAVGKMTPLERAVKES+NNY
Subjt:  TSLRCESLLMFNLCDQLPGGDLILETLANMSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNY

Query:  EKVVMAQETVDGAKVDGIDTNLAVMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQ-GRPIDEVKV
        EKVVMA+ETVDGAKVDGIDTNLAVMKELMLPVSELG YL+SLALWEDPMKSLAFC++SSYIICRDWL Y  ALLLAFMA+FM++TR FNQ G P+DEVKV
Subjt:  EKVVMAQETVDGAKVDGIDTNLAVMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQ-GRPIDEVKV

Query:  IAPPAMNAMEQLLAVQNAISQAEQFIQDGNIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREW
        +APPAMNAMEQLLAVQN ISQ EQ IQDGNIVLLKLRALLLAIFPQAT+KFAVSLLVIA TLAFLP+KYI L+VFLEAFTRYSPPRKASTERWTRRV+EW
Subjt:  IAPPAMNAMEQLLAVQNAISQAEQFIQDGNIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREW

Query:  WFSIPAAPVILEREKEDKKRK
        WFSIPAAPVIL+REKEDKK K
Subjt:  WFSIPAAPVILEREKEDKKRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48840.1 Plant protein of unknown function (DUF639)2.7e-26367.34Show/hide
Query:  KTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALA
        +T   LEGLV+DSSLKWLLGK+S FDEE+EEIE SPSA  NWI ELSP AN+V+RRCSKILGV+ SELQ SF  EA +S+K PS + RNFLEYCCFRALA
Subjt:  KTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALA

Query:  LSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVL-ATSAGARLQFSVYDKYLSGLEKVIRKM
        LS  +TG+L+DK FRRLTFDMM+AWE P+ +SQ LL++DED +VG+EAFSRIAPAVPII++VII ENLF +L + S   RLQF VYDKYL GLE+ I+KM
Subjt:  LSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVL-ATSAGARLQFSVYDKYLSGLEKVIRKM

Query:  KNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYRSI
        K+ SES+LL   R S+ EKILE+DGTVTTQPVLEH+GISTWPGRL+LTDH+LYFEA++VVS+D  KRY LSDDLKQVIKPELTGPWGTRLFDKAV Y+SI
Subjt:  KNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYRSI

Query:  SLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLANMS
        SL EPVV+EFPELKGHTRRD+W+ II EVLYVHR+I KF+I  + + EA++KAVLGILR+QAIQ++  T  +R E+LL FNLCDQLPGGD ILETLA MS
Subjt:  SLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLANMS

Query:  ALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELMLP
        + +  DRT K+  E  ++SISA D+VS LG   G A++  + + L+VGE+ VG + PLE+AVK+S+ NYEKVV+AQETV+G KVDGIDTN+AVMKEL+LP
Subjt:  ALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELMLP

Query:  VSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFF-NQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGNI
         +E+G +LLSL  WEDP+KS  FC++S++II R W+ YVFA+   F+A FMVLTR+F N+ + + E+KV+APP MN MEQLLAVQNAISQ EQ IQD NI
Subjt:  VSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFF-NQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGNI

Query:  VLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKR
        VLLK RALLL++FPQA+ KFAV++++ A+ +A +P   + LVVFLE FTRYSPPR+ASTER  RR++EWWFSIPAAPV+LE+ K+D K+
Subjt:  VLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKR

AT1G71240.2 Plant protein of unknown function (DUF639)1.5e-6932.15Show/hide
Query:  EELEEIERSPSAQRNWISE--LSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALAL-STQITGYLADKKFRRLTFDMMI
        E+LEE+       R+   +  LS  A I + + S++ G++  ++Q  F  E I S  L S  AR  +EYCCFR L+  S++    L +  F+RL F  M+
Subjt:  EELEEIERSPSAQRNWISE--LSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALAL-STQITGYLADKKFRRLTFDMMI

Query:  AWEAPAISSQPLLNIDEDAS---------VGVEAFSRIAPAVPIISNVIISENLFEVLATSAGAR-LQFSVYDKYLSGLEKVI--RKMKNLSESNLLHSE
        AW  P    +   N   DAS         +G EAF RIAPA+  +++     NLF+ LAT+   + +   ++  Y+  L K+   RK    ++   L SE
Subjt:  AWEAPAISSQPLLNIDEDAS---------VGVEAFSRIAPAVPIISNVIISENLFEVLATSAGAR-LQFSVYDKYLSGLEKVI--RKMKNLSESNLLHSE

Query:  RSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVVIEFPE
        R       L         PVL+      WPG+L LTD ALYFE + +       R DL+ D K  ++    GP G  LFD AV   S       V+EF +
Subjt:  RSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVVIEFPE

Query:  LKGHTRRDFWMEIIREVLYVHRFINKF----------QIKGIQRG--EALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLA---
        L G  RRD W  II EV+ +H F+ +F          Q+ G ++G  +A+A A   I RLQA+Q + +      + L+ F+   Q+  GD++ +TLA   
Subjt:  LKGHTRRDFWMEIIREVLYVHRFINKF----------QIKGIQRG--EALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLA---

Query:  --NMSALKESD------RTNKSSHEKGMYSISALDVVSHL-------GFGLGTASNDSNENELLVG-----EIAVGKMTPLERAVKESKNNYEKVVMAQE
               K SD      R ++ S+E    ++S LD   +L        +G   + N    + L  G      +AV  +T +ERA +  +  Y+ V   Q 
Subjt:  --NMSALKESD------RTNKSSHEKGMYSISALDVVSHL-------GFGLGTASNDSNENELLVG-----EIAVGKMTPLERAVKESKNNYEKVVMAQE

Query:  TVDGAKVDGIDTNLAVMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMV----LTRFFNQGRPIDEVKVIAPPA
        T+D A + GI +N+ + KEL+LP+S        L  WE+P  +++F   +S II R+ L YV  + L F+A  M+    L R    GR    + +   P+
Subjt:  TVDGAKVDGIDTNLAVMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMV----LTRFFNQGRPIDEVKVIAPPA

Query:  MNAMEQLLAVQNAISQAEQFIQDGNIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIP
         N +++++AV++A+   E ++Q  N+VLLKLR ++L+  PQ T + A+++L IA+ L  +P KY+   V  + FTR    RK   +++   +RE W  +P
Subjt:  MNAMEQLLAVQNAISQAEQFIQDGNIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIP

Query:  AAPVIL
        AAPVI+
Subjt:  AAPVIL

AT2G21720.1 Plant protein of unknown function (DUF639)5.5e-7529.08Show/hide
Query:  ISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALA-LSTQITGYLADKKFRRLTFDMMIAWEAP----------AIS
        +  LS  AN VV+RCS+ L  +  +L + F  +          Y++ F+E+C  +  + +   I   + D  F RLTFDMM+AW+ P          A+ 
Subjt:  ISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALA-LSTQITGYLADKKFRRLTFDMMIAWEAP----------AIS

Query:  SQ-------------------------PLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIRKMKNLSESN
         +                         PLL +D + SVG +AF  +   +P+  ++I     FE L    G +L F  YD ++  + K    MK+L + +
Subjt:  SQ-------------------------PLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIRKMKNLSESN

Query:  LLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVV
               + +E IL V+GT+ +Q V+ H+  ++WPGRL LT++ALYFEA  +++Y+ A + DLS D ++  KP  TGP G  LFDKA++Y S    E +V
Subjt:  LLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVV

Query:  IEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKG-IQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLANMSALKESD
        IEFPE+   TRRD W+ +++E+  +H+F+ KF ++  +Q  E  ++ +LGI+RL A +++   +    ++ L+F+L +++P GD +LE LA +S LK   
Subjt:  IEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKG-IQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLANMSALKESD

Query:  RTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELMLPVSELGK
          N  S    + +++    +  LG  +     D  + +++        +  LE AV +S+   + +  A+ T    + +GI  ++AV+ EL+ P+ ++  
Subjt:  RTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELMLPVSELGK

Query:  YLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTR-FFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGNIVLLKLR
        +   +  WE P ++L    ++   + ++W+    A  L ++   M   R      +  D V V         E +++ Q  + +  Q +Q  N+ +LKLR
Subjt:  YLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTR-FFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGNIVLLKLR

Query:  ALLLAIFPQATMKFAVSLLVIASTLAFLPSK--YIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPV
        +L  +   +        +LV+AS  A +P K   IF +V+    T  S     S ++  RR++EWW SIP  PV
Subjt:  ALLLAIFPQATMKFAVSLLVIASTLAFLPSK--YIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPV

AT3G18350.1 Plant protein of unknown function (DUF639)1.3e-26567.1Show/hide
Query:  KTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALA
        KTR+ LEGLVRD+S KWLLGK+S FDEE+EE+ RSPSA  NWI ELSP AN+VVRRCSKILGVSA+EL+ SF  EA +S+K PS + RNFLEYCCFRAL+
Subjt:  KTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALA

Query:  LSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIRKMK
        LS  +TG+LADKKFRRLTFDMM+ WE PA++SQ LL+++EDA+V +EAFSRIAPAVPII++VII +NLF++L +S G RLQFSVYDKYL GLE+ I+KM+
Subjt:  LSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIRKMK

Query:  NLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYRSIS
          SES+LL   RS R EKILE+DGTVTTQPVLEHVGISTWPGRL+LTDH+LYFEAL+VVSYD  KRY LS+DLKQ+IKPELTGPWGTRLFDKAV Y+SIS
Subjt:  NLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPELTGPWGTRLFDKAVLYRSIS

Query:  LSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLANMSA
        LSEPVV+EFPELKGHTRRD+W+ II+EVLYVHR+INK++I G+ R EAL+KAVLG++R+QA+Q++  T ++R E+LL FNLCDQLPGGDLILETLA MS 
Subjt:  LSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNLCDQLPGGDLILETLANMSA

Query:  LKESDRTNKSSHEKGMYSISALDVVSHLG--FGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELML
         +E  R+NKS  + G    SA D+VS LG  FG  +  +    + L+VGE+ VG + PLERAVKES+  YEKVV+AQET++G K+ GIDTNLAVMKELML
Subjt:  LKESDRTNKSSHEKGMYSISALDVVSHLG--FGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLAVMKELML

Query:  PVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGNI
        P+ E    +LS+  W+DP KS  FC+++++II R WL YVFAL   F AIFMVLTR F++ + + E+KV APP MN MEQLLAVQN IS+ EQ IQD NI
Subjt:  PVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGNI

Query:  VLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK
        VLLK RALL ++FPQA+ KFA++++V A+ +AF+P +Y+  VVF+E FTRYSPPR+ASTER  RR+REWWFSIPAAPV+L  +K +KK+K
Subjt:  VLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK

AT5G23390.1 Plant protein of unknown function (DUF639)1.6e-15444.18Show/hide
Query:  SSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALALSTQITGYLADK
        SSLK L  ++        +   SP A    I +LS  AN VV RCSKIL +   +LQ  F+VE  +S+K    YARNFLE+C F+AL    +   YL+D+
Subjt:  SSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFANIVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALALSTQITGYLADK

Query:  KFRRLTFDMMIAWEAPAISSQP-----------------------------LLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFS
        +FR+L FDMM+AWE P+++S+                               + +DE  SVG EAF+RIAP  P I++ I   NLF+ L +S+G RL + 
Subjt:  KFRRLTFDMMIAWEAPAISSQP-----------------------------LLNIDEDASVGVEAFSRIAPAVPIISNVIISENLFEVLATSAGARLQFS

Query:  VYDKYLSGLEKVIRKMKNL---SESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPE
        VYDKYL  L+K+ +  K+    S +NL    + ++ E +L++DG     PVL+HVGIS WPG+L LT+ ALYF+++     +K  RYDL++D KQVIKPE
Subjt:  VYDKYLSGLEKVIRKMKNL---SESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSDDLKQVIKPE

Query:  LTGPWGTRLFDKAVLYRSISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFN
        LTGP G R+FDKA++Y+SI++ EPV  EF E KG+ RRD+W+ I  E+L V  FI ++  KGIQR E LA+A+LGI R +AI++     S + ++LL+FN
Subjt:  LTGPWGTRLFDKAVLYRSISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFN

Query:  LCDQLPGGDLILETLAN-MSALKESDRTNKSS--HEKGMYSIS--ALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQ
        L + LPGGD++LE L++ +S +  +  ++  S  + K   ++S  +L ++ H G  L T +N   E   +VG+  VG+ +PLE A+K+S  + ++   AQ
Subjt:  LCDQLPGGDLILETLAN-MSALKESDRTNKSS--HEKGMYSIS--ALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQ

Query:  ETVDGAKVDGIDTNLAVMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNA
         TV+  KV+GIDTN+AVMKEL+LP  +LG ++  LA W+DP KS  F ++ SY+I   W+ ++   +L  +AI M+  + FN+G+    V+V APP+ NA
Subjt:  ETVDGAKVDGIDTNLAVMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNA

Query:  MEQLLAVQNAISQAEQFIQDGNIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAP
        +EQLL +Q+AISQ E  IQ  N+ LLK+RA+ LAI PQAT   A+SL+V+A  LA +P KY+  V F+E FTR    RKAS++R  RR+REWWF +PAAP
Subjt:  MEQLLAVQNAISQAEQFIQDGNIVLLKLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAP

Query:  VILEREKEDKKRK
        V L R ++ KK+K
Subjt:  VILEREKEDKKRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGATCGGAACCATCCCACAATATTGACATTCCTTGCGATTGTGGAGACTGATCCAACCGACTCTCTGCTGCTCTGTCAGCACGCGTCATTCTTCAAAATCCGGCG
AATCGGTTTTGGCCACCTCTTCGATTGCCGTTGGGGACGTATGCAATTGATATGGTGCTTAATTGGCAAAACCAGAAGTACGCTCGAGGGGCTTGTCAGAGATAGTTCAT
TGAAATGGCTGTTAGGAAAGCGAAGCTTTTTCGATGAAGAACTTGAAGAAATTGAAAGGTCTCCTTCTGCTCAGAGGAATTGGATAAGTGAGCTCTCTCCATTTGCCAAT
ATAGTTGTTCGGAGATGTTCAAAAATCCTTGGTGTTTCTGCTAGCGAACTTCAACAAAGTTTCAATGTGGAGGCAATTGATTCAATAAAGCTCCCATCACATTATGCGAG
AAACTTTCTGGAGTATTGCTGTTTCAGGGCACTTGCACTGTCTACACAAATTACGGGCTATTTGGCTGATAAAAAATTTCGACGCTTGACATTTGACATGATGATAGCTT
GGGAAGCCCCAGCAATTTCTAGCCAACCTTTACTCAATATTGATGAAGATGCGTCAGTTGGGGTGGAGGCTTTCTCTCGAATTGCTCCAGCAGTTCCCATAATTTCAAAT
GTGATCATAAGTGAAAATCTTTTTGAGGTGCTTGCTACATCGGCGGGTGCTAGACTTCAGTTCTCTGTCTATGACAAGTACCTGAGCGGGCTAGAAAAAGTGATAAGAAA
AATGAAAAACCTATCAGAGTCAAACCTTCTTCATTCTGAAAGATCATCGAGAGAAGAAAAGATTTTAGAAGTGGATGGAACCGTCACTACACAACCAGTTCTTGAACATG
TAGGAATTTCTACATGGCCCGGTAGGTTGGTTCTGACAGATCATGCTCTTTACTTTGAAGCTCTTCGTGTTGTGTCATATGACAAGGCGAAAAGATATGACTTATCTGAT
GATCTGAAACAGGTTATCAAACCTGAATTGACTGGACCATGGGGTACTCGGCTCTTTGATAAGGCGGTTCTGTATAGATCCATTTCATTATCAGAACCAGTCGTGATTGA
GTTTCCTGAGCTTAAAGGCCATACTCGTCGAGATTTTTGGATGGAAATTATACGGGAGGTATTATATGTACATAGATTTATCAACAAGTTCCAGATCAAGGGAATTCAAA
GGGGTGAAGCTCTCGCAAAAGCTGTCCTTGGAATTCTACGTCTGCAAGCCATTCAAGATATATGCTCTACAACATCTCTTAGGTGTGAGTCGCTTCTCATGTTTAATCTT
TGCGATCAGCTCCCTGGTGGGGACTTAATATTGGAAACCCTTGCAAATATGTCAGCCTTAAAGGAGAGTGACCGGACAAACAAATCTAGTCATGAAAAGGGAATGTATTC
AATCTCAGCACTAGACGTGGTTTCTCACTTGGGATTTGGGTTGGGAACAGCTTCAAATGATTCAAATGAGAATGAGCTTCTTGTGGGTGAGATTGCAGTTGGAAAAATGA
CTCCTTTGGAAAGGGCAGTCAAGGAATCAAAAAACAATTATGAAAAGGTGGTGATGGCTCAAGAGACAGTTGATGGAGCAAAAGTAGATGGCATTGATACTAATTTGGCA
GTGATGAAGGAATTAATGTTGCCAGTGAGTGAACTTGGGAAGTACCTTCTTTCTTTGGCATTATGGGAAGATCCAATGAAGTCTTTGGCATTCTGTATGGTCTCCAGTTA
CATTATCTGCAGGGACTGGCTGGCCTATGTCTTTGCTCTGCTACTCGCGTTTATGGCAATTTTCATGGTGCTTACGCGATTTTTCAACCAAGGGAGGCCGATTGACGAAG
TTAAGGTCATAGCCCCTCCGGCAATGAATGCAATGGAGCAGCTTTTAGCCGTTCAGAATGCTATTTCTCAAGCAGAACAATTCATCCAAGACGGAAATATTGTTCTCCTT
AAGCTACGCGCCTTGCTGCTGGCCATTTTCCCTCAGGCCACAATGAAGTTTGCAGTTTCTCTTCTAGTGATAGCCTCAACTCTGGCCTTCCTACCCTCCAAATACATATT
TCTAGTGGTGTTTTTGGAGGCATTTACAAGGTACTCACCCCCAAGAAAGGCAAGCACAGAGAGATGGACAAGGAGAGTGAGGGAATGGTGGTTCAGCATTCCAGCAGCTC
CTGTTATTCTTGAGAGGGAAAAGGAAGACAAGAAGAGGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGATCGGAACCATCCCACAATATTGACATTCCTTGCGATTGTGGAGACTGATCCAACCGACTCTCTGCTGCTCTGTCAGCACGCGTCATTCTTCAAAATCCGGCG
AATCGGTTTTGGCCACCTCTTCGATTGCCGTTGGGGACGTATGCAATTGATATGGTGCTTAATTGGCAAAACCAGAAGTACGCTCGAGGGGCTTGTCAGAGATAGTTCAT
TGAAATGGCTGTTAGGAAAGCGAAGCTTTTTCGATGAAGAACTTGAAGAAATTGAAAGGTCTCCTTCTGCTCAGAGGAATTGGATAAGTGAGCTCTCTCCATTTGCCAAT
ATAGTTGTTCGGAGATGTTCAAAAATCCTTGGTGTTTCTGCTAGCGAACTTCAACAAAGTTTCAATGTGGAGGCAATTGATTCAATAAAGCTCCCATCACATTATGCGAG
AAACTTTCTGGAGTATTGCTGTTTCAGGGCACTTGCACTGTCTACACAAATTACGGGCTATTTGGCTGATAAAAAATTTCGACGCTTGACATTTGACATGATGATAGCTT
GGGAAGCCCCAGCAATTTCTAGCCAACCTTTACTCAATATTGATGAAGATGCGTCAGTTGGGGTGGAGGCTTTCTCTCGAATTGCTCCAGCAGTTCCCATAATTTCAAAT
GTGATCATAAGTGAAAATCTTTTTGAGGTGCTTGCTACATCGGCGGGTGCTAGACTTCAGTTCTCTGTCTATGACAAGTACCTGAGCGGGCTAGAAAAAGTGATAAGAAA
AATGAAAAACCTATCAGAGTCAAACCTTCTTCATTCTGAAAGATCATCGAGAGAAGAAAAGATTTTAGAAGTGGATGGAACCGTCACTACACAACCAGTTCTTGAACATG
TAGGAATTTCTACATGGCCCGGTAGGTTGGTTCTGACAGATCATGCTCTTTACTTTGAAGCTCTTCGTGTTGTGTCATATGACAAGGCGAAAAGATATGACTTATCTGAT
GATCTGAAACAGGTTATCAAACCTGAATTGACTGGACCATGGGGTACTCGGCTCTTTGATAAGGCGGTTCTGTATAGATCCATTTCATTATCAGAACCAGTCGTGATTGA
GTTTCCTGAGCTTAAAGGCCATACTCGTCGAGATTTTTGGATGGAAATTATACGGGAGGTATTATATGTACATAGATTTATCAACAAGTTCCAGATCAAGGGAATTCAAA
GGGGTGAAGCTCTCGCAAAAGCTGTCCTTGGAATTCTACGTCTGCAAGCCATTCAAGATATATGCTCTACAACATCTCTTAGGTGTGAGTCGCTTCTCATGTTTAATCTT
TGCGATCAGCTCCCTGGTGGGGACTTAATATTGGAAACCCTTGCAAATATGTCAGCCTTAAAGGAGAGTGACCGGACAAACAAATCTAGTCATGAAAAGGGAATGTATTC
AATCTCAGCACTAGACGTGGTTTCTCACTTGGGATTTGGGTTGGGAACAGCTTCAAATGATTCAAATGAGAATGAGCTTCTTGTGGGTGAGATTGCAGTTGGAAAAATGA
CTCCTTTGGAAAGGGCAGTCAAGGAATCAAAAAACAATTATGAAAAGGTGGTGATGGCTCAAGAGACAGTTGATGGAGCAAAAGTAGATGGCATTGATACTAATTTGGCA
GTGATGAAGGAATTAATGTTGCCAGTGAGTGAACTTGGGAAGTACCTTCTTTCTTTGGCATTATGGGAAGATCCAATGAAGTCTTTGGCATTCTGTATGGTCTCCAGTTA
CATTATCTGCAGGGACTGGCTGGCCTATGTCTTTGCTCTGCTACTCGCGTTTATGGCAATTTTCATGGTGCTTACGCGATTTTTCAACCAAGGGAGGCCGATTGACGAAG
TTAAGGTCATAGCCCCTCCGGCAATGAATGCAATGGAGCAGCTTTTAGCCGTTCAGAATGCTATTTCTCAAGCAGAACAATTCATCCAAGACGGAAATATTGTTCTCCTT
AAGCTACGCGCCTTGCTGCTGGCCATTTTCCCTCAGGCCACAATGAAGTTTGCAGTTTCTCTTCTAGTGATAGCCTCAACTCTGGCCTTCCTACCCTCCAAATACATATT
TCTAGTGGTGTTTTTGGAGGCATTTACAAGGTACTCACCCCCAAGAAAGGCAAGCACAGAGAGATGGACAAGGAGAGTGAGGGAATGGTGGTTCAGCATTCCAGCAGCTC
CTGTTATTCTTGAGAGGGAAAAGGAAGACAAGAAGAGGAAATGA
Protein sequenceShow/hide protein sequence
MQDRNHPTILTFLAIVETDPTDSLLLCQHASFFKIRRIGFGHLFDCRWGRMQLIWCLIGKTRSTLEGLVRDSSLKWLLGKRSFFDEELEEIERSPSAQRNWISELSPFAN
IVVRRCSKILGVSASELQQSFNVEAIDSIKLPSHYARNFLEYCCFRALALSTQITGYLADKKFRRLTFDMMIAWEAPAISSQPLLNIDEDASVGVEAFSRIAPAVPIISN
VIISENLFEVLATSAGARLQFSVYDKYLSGLEKVIRKMKNLSESNLLHSERSSREEKILEVDGTVTTQPVLEHVGISTWPGRLVLTDHALYFEALRVVSYDKAKRYDLSD
DLKQVIKPELTGPWGTRLFDKAVLYRSISLSEPVVIEFPELKGHTRRDFWMEIIREVLYVHRFINKFQIKGIQRGEALAKAVLGILRLQAIQDICSTTSLRCESLLMFNL
CDQLPGGDLILETLANMSALKESDRTNKSSHEKGMYSISALDVVSHLGFGLGTASNDSNENELLVGEIAVGKMTPLERAVKESKNNYEKVVMAQETVDGAKVDGIDTNLA
VMKELMLPVSELGKYLLSLALWEDPMKSLAFCMVSSYIICRDWLAYVFALLLAFMAIFMVLTRFFNQGRPIDEVKVIAPPAMNAMEQLLAVQNAISQAEQFIQDGNIVLL
KLRALLLAIFPQATMKFAVSLLVIASTLAFLPSKYIFLVVFLEAFTRYSPPRKASTERWTRRVREWWFSIPAAPVILEREKEDKKRK