; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013273 (gene) of Snake gourd v1 genome

Gene IDTan0013273
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBSD domain-containing protein
Genome locationLG06:7943169..7945311
RNA-Seq ExpressionTan0013273
SyntenyTan0013273
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR005607 - BSD domain
IPR035925 - BSD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593139.1 BSD domain-containing protein 1, partial [Cucurbita argyrosperma subsp. sororia]1.2e-19982.62Show/hide
Query:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETV
        MNFFRSVF+DD DPSTT ESE +SPEKSL+KEGEESSDPSSPSPVADS A AGAWSFGGL+KTLSA+SESV+ETYRRDLQEFGSGLKKEIEVAQGSLETV
Subjt:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETV

Query:  GHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENL
        GH FDEFGSSVLKGTAQIIAQGKDAILAID ESDSDNSSNQNLSNQR+S SKPYSRFDAQVRSIQGD+ATYCDEPED+ DYEKWKSQFVL+D++EEIENL
Subjt:  GHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENL

Query:  LEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANST
        LEENGA+DNIHKKVVP VVD+ETFWYRYFYKVHKLKQAE VR NLVKRAIA EEEEDLSWDVDDDDD++ GYDV SKGD       VGETVKEENTANST
Subjt:  LEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANST

Query:  LKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKD
        L                   KKEVP++EL AGSSVGDDKEEEN NK+GGNVEELDG+KGSDQKVH +GGS + N+KDQG KSDEKVA +A+SDHGESSK 
Subjt:  LKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKD

Query:  SDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKAVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
        SDVSIVSTQPSMPEDEDLGWDEIEDLS++EEKK  TQGG +NREEMRKRLST EDDEDLDWGTDTE
Subjt:  SDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKAVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

KAG7025539.1 BSD domain-containing protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-19882.4Show/hide
Query:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETV
        MNFFRSVF+DD DPSTT ESE +SPEKSL+KEGEESSDPSSPSPVADS A AGAWSFGGL+KTLSA+SESV+ETYRRDLQEFGSGLKKEIEVAQGSLETV
Subjt:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETV

Query:  GHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENL
        GH FDEFGSSVLKGTAQIIAQGKDAILAID ESDSD SSNQNLSNQR+S SKPYSRFDAQVRSIQGD+ATYCDEPED+ DYEKWKSQFVL+D++EEIENL
Subjt:  GHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENL

Query:  LEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANST
        LEENGA+DNIHKKVVP+VVD+ETFWYRYFYKVHKLKQAE VR NLVKRAIA EEEEDLSWDVDDDDDN+ GYDV  KGD       VGETVKEENTANST
Subjt:  LEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANST

Query:  LKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKD
        L                   KKEVP++EL AGSSVGDDKEEEN NK+GGNVEELDG+KGSDQKVH +GGS + N+KDQG KSDEKVA +A+SDHGESSK 
Subjt:  LKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKD

Query:  SDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKAVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
        SDVSIVSTQPSMPEDEDLGWDEIEDLS++EEKK  TQGG +NREEMRKRLST EDDEDLDWGTDTE
Subjt:  SDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKAVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

XP_004148869.1 BSD domain-containing protein 1 [Cucumis sativus]6.0e-18378.47Show/hide
Query:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDP---SSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSL
        MNFFRSVFSDDPDPST  ++EP+SP KS  +EGE+SSD    S+P+PV     DAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVA GSL
Subjt:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDP---SSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSL

Query:  ETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEI
        ETVGHAFDEFGSSVLKGTAQIIAQGK+AI AIDQESDSD+S+NQNLSNQRSS+SKPYSRFDAQVRS+QGD+ATYCDEPEDLGDYEKW+SQFVLNDK+EEI
Subjt:  ETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEI

Query:  ENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGY---DVGSKGDVVKKEVN--------
        ENL+EENGAIDNIHKKVVPNVVDNETFW+RYFYKVHKLKQAE+VRANLVKRAIA EEEEDLSWDVDDDDDN +GY   + GSKGD VK +V+        
Subjt:  ENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGY---DVGSKGDVVKKEVN--------

Query:  VGETVK-EENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGD-KGSDQKVHSEGGSGLNNNKDQGSKSDE
         GE V  E +TAN  +KDDL AKEEVGGKES EVVK      ELN GSSVGDD + E       +VEEL+G+ KGSDQKVH EGGSG +NNKD+G K   
Subjt:  VGETVK-EENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGD-KGSDQKVHSEGGSGLNNNKDQGSKSDE

Query:  KVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK-AVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
         +A EAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK  VTQGG  NREE++KRLST EDDEDLDWGTDTE
Subjt:  KVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK-AVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

XP_022960409.1 uncharacterized protein LOC111461144 [Cucurbita moschata]3.2e-18482.64Show/hide
Query:  ESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESD
        +SSDPSSPSPVADS A AGAWSFGGL+KTLSA+SESV+ETYRRDLQEFGSGLKKEIEVAQGSLETVGH FDEFGSSVLKGTAQIIAQGKDAILAID ESD
Subjt:  ESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESD

Query:  SDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHK
        SDNSSNQNLSNQR+S SKPYSRFDAQVRSIQGD+ATYCDEPED+ DYEKWKSQFVL+D++EEIENLLEENGA+DNIHKKVVP+VVD+ETFWYRYFYKVHK
Subjt:  SDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHK

Query:  LKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSS
        LKQAE VR NLVKRAIA EEEEDLSWDVDDDDD++ GYDV SKGD       VGETVKEENTANSTL                   KKEVP++EL AGSS
Subjt:  LKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSS

Query:  VGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKA
        VGDDKEEEN NK+GGNVEELDG+KGSDQKVH +GGS L N+KDQG KSDEKVA +A+SDHGESSK SDVSIVSTQPSMPEDEDLGWDEIEDLS++EEKK 
Subjt:  VGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKA

Query:  VTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
         TQGG +NREEMRKRLST EDDEDLDWGTDTE
Subjt:  VTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

XP_038896504.1 BSD domain-containing protein 1 [Benincasa hispida]1.4e-19581.82Show/hide
Query:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETV
        MNFFRSVFSDDPDPSTT E+EPRSPEKSLLKEGEESSDP   SPV DSS DAGAWSFGGLIKTLSA+SESVIETYRRDLQEFGSGLKKEIEVAQGSLETV
Subjt:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETV

Query:  GHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENL
        GHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSD+SSNQNLSNQRSS+SKPYSRFDAQVRS+QGD+ATYC+EPED+GDYEKWKSQFVLNDK+EEIENL
Subjt:  GHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENL

Query:  LEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDD----NDDGY---DVGSKGDVVKKEVN-------
        +EENG IDN+HKKVVPNVVDNETFW+RYFYKVHKLKQAENVRANLVKRAIA EEEEDLSWDVDDDDD     ++GY   +VGSKGD VK +V+       
Subjt:  LEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDD----NDDGY---DVGSKGDVVKKEVN-------

Query:  -VGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEV-PIRELNAGSSVGDDKEEENLNKSGGNVEELDG-DKGSDQKVHSEGGSGLNNNKDQGSKSD
           E V  E+ A++ LKDDL  KEEVGGKESAEVVKKEV P++ELN GSSVGDD++ E       +VEELDG +KGS+QKV  EGGSG+ NNKDQG K  
Subjt:  -VGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEV-PIRELNAGSSVGDDKEEENLNKSGGNVEELDG-DKGSDQKVHSEGGSGLNNNKDQGSKSD

Query:  EKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK-AVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
          VA EAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK  VTQGG VN+EEMRKRLST EDDEDLDWGTDTE
Subjt:  EKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK-AVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

TrEMBL top hitse value%identityAlignment
A0A0A0K7J0 BSD domain-containing protein2.9e-18378.47Show/hide
Query:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDP---SSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSL
        MNFFRSVFSDDPDPST  ++EP+SP KS  +EGE+SSD    S+P+PV     DAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVA GSL
Subjt:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDP---SSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSL

Query:  ETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEI
        ETVGHAFDEFGSSVLKGTAQIIAQGK+AI AIDQESDSD+S+NQNLSNQRSS+SKPYSRFDAQVRS+QGD+ATYCDEPEDLGDYEKW+SQFVLNDK+EEI
Subjt:  ETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEI

Query:  ENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGY---DVGSKGDVVKKEVN--------
        ENL+EENGAIDNIHKKVVPNVVDNETFW+RYFYKVHKLKQAE+VRANLVKRAIA EEEEDLSWDVDDDDDN +GY   + GSKGD VK +V+        
Subjt:  ENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGY---DVGSKGDVVKKEVN--------

Query:  VGETVK-EENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGD-KGSDQKVHSEGGSGLNNNKDQGSKSDE
         GE V  E +TAN  +KDDL AKEEVGGKES EVVK      ELN GSSVGDD + E       +VEEL+G+ KGSDQKVH EGGSG +NNKD+G K   
Subjt:  VGETVK-EENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGD-KGSDQKVHSEGGSGLNNNKDQGSKSDE

Query:  KVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK-AVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
         +A EAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK  VTQGG  NREE++KRLST EDDEDLDWGTDTE
Subjt:  KVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK-AVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

A0A1S3BRG3 BSD domain-containing protein 1-A7.2e-18277.89Show/hide
Query:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDP---SSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSL
        MNFFRSVFSDDPDPST  ++EP+SP+KS   EGEESSDP   S+P+PV     DAGAWSFGGLIKTLSA+SESVIETYRRDLQEFGSGLKKEIEVA GSL
Subjt:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDP---SSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSL

Query:  ETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEI
        ETVGHAFDEFGSSVLKGTAQIIAQGK+AI AIDQESDSD+SSNQNLSNQRSSSSKPYSRFDAQVRS+QGD+ATYCDEPEDLGDYEKWK Q VLNDK+EEI
Subjt:  ETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEI

Query:  ENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDD--GY---DVGSKGDVVKKEVN------
        ENL+EENGAIDNIHKKVVPNVVDNETFW+RYFYKVHKLKQAE+VRANLVKRAIA EEEEDLSWDVDDDDD DD  GY   + GSKGD VK + +      
Subjt:  ENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDD--GY---DVGSKGDVVKKEVN------

Query:  --VGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGD-KGSDQKVHSEGGSGLNNNKDQGSKSD
            E V  E+   +   D+L AKEEVGGKE  EVVK      ELN GSSVGDD++ E       +VEEL+G+ KGSDQKVH EGGSG +NNKDQG K  
Subjt:  --VGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGD-KGSDQKVHSEGGSGLNNNKDQGSKSD

Query:  EKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK-AVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
          +A EAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK  VTQGG  NREEMRKRLST EDDEDLDWGTDTE
Subjt:  EKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK-AVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

A0A5A7UNZ5 BSD domain-containing protein 1-A2.1e-18177.53Show/hide
Query:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDP---SSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSL
        MNFFRSVFSDDPDPST  ++EP+SP+KS   EGEESSDP   S+P+PV     DAGAWSFGGLIKTLSA+SESVIETYRRDLQEFGSGLKKEIEVA GSL
Subjt:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDP---SSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSL

Query:  ETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEI
        ETVGHAFDEFGSSVLKGTAQIIAQGK+AI AIDQESDSD+SSNQNLSNQRSSSSKPYSRFDAQVRS+QGD+ATYCDEPEDLGDYEKWK Q VLNDK+EEI
Subjt:  ETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEI

Query:  ENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDD---GY---DVGSKGDVVKKEVN-----
        ENL+EENGAIDNIHKKVVPNVVDNETFW+RYFYKVHKLKQAE+VRANLVKRAIA EEEEDLSWDVDDDDD++D   GY   + GSKGD VK + +     
Subjt:  ENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDD---GY---DVGSKGDVVKKEVN-----

Query:  ---VGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGD-KGSDQKVHSEGGSGLNNNKDQGSKS
             E V  E+   +   D+L AKEEVGGKE  EVVK      ELN GSSVGDD++ E       +VEEL+G+ KGSDQKVH EGGSG +NNKDQG K 
Subjt:  ---VGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSSVGDDKEEENLNKSGGNVEELDGD-KGSDQKVHSEGGSGLNNNKDQGSKS

Query:  DEKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK-AVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
           +A EAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK  VTQGG  NREEMRKRLST EDDEDLDWGTDTE
Subjt:  DEKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKK-AVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

A0A6J1HAY6 uncharacterized protein LOC1114611441.6e-18482.64Show/hide
Query:  ESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESD
        +SSDPSSPSPVADS A AGAWSFGGL+KTLSA+SESV+ETYRRDLQEFGSGLKKEIEVAQGSLETVGH FDEFGSSVLKGTAQIIAQGKDAILAID ESD
Subjt:  ESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESD

Query:  SDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHK
        SDNSSNQNLSNQR+S SKPYSRFDAQVRSIQGD+ATYCDEPED+ DYEKWKSQFVL+D++EEIENLLEENGA+DNIHKKVVP+VVD+ETFWYRYFYKVHK
Subjt:  SDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHK

Query:  LKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSS
        LKQAE VR NLVKRAIA EEEEDLSWDVDDDDD++ GYDV SKGD       VGETVKEENTANSTL                   KKEVP++EL AGSS
Subjt:  LKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSS

Query:  VGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKA
        VGDDKEEEN NK+GGNVEELDG+KGSDQKVH +GGS L N+KDQG KSDEKVA +A+SDHGESSK SDVSIVSTQPSMPEDEDLGWDEIEDLS++EEKK 
Subjt:  VGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKA

Query:  VTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
         TQGG +NREEMRKRLST EDDEDLDWGTDTE
Subjt:  VTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

A0A6J1KZM8 uncharacterized protein LOC1114977681.5e-18281.8Show/hide
Query:  ESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESD
        +SSDPSSPSPVADS A AGAWSFGGL+KTLSA+SESV+ETYRRDLQEFGSGLKKEIEVAQGSLETVGH FDEFGSSVLKGTAQIIAQGKDAILAID ESD
Subjt:  ESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESD

Query:  SDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHK
        SDNSSNQNLSNQR+S SKPYSRFDAQVRSIQGD+ATYCDEPED+ DYEKWKSQFVL+D++EEIENLLEENGA+DNIHKKVVP+VVD+ETFWYRYFYKVHK
Subjt:  SDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHK

Query:  LKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSS
        LKQAE VR NLVKRAIA EEEEDLSWDVDDDDD++ GYDV SKGD       VGETVKEENTANSTL                   KK++P++EL AGSS
Subjt:  LKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELNAGSS

Query:  VGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDH--GESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEK
        VGDDKEEEN NK+GGNVEELDG+KGSDQKVH +GGS L N+KDQG KSDEKVA +A+SDH  GESSK SDVSIVSTQPSMPEDEDLGWDEIEDLS++EEK
Subjt:  VGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDH--GESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEK

Query:  KAVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
        K  TQGG +NREEMRKRLST EDDEDLDWGTDTE
Subjt:  KAVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

SwissProt top hitse value%identityAlignment
A2BIJ3 BSD domain-containing protein 11.2e-0524.48Show/hide
Query:  GAWSFGGLIKTLSARSESVIETY---RRDLQEFGS---------------GLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAIL-AIDQES
        G W  G L ++  +  +   E Y   +RDL EF S                +K ++ V +GS ET           V KG   I+    D +    D+  
Subjt:  GAWSFGGLIKTLSARSESVIETY---RRDLQEFGS---------------GLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAIL-AIDQES

Query:  DSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPE-DLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKV
        D D  +   L    + +++ Y    A++ S+Q D ATYC+EP+     ++ W S F L ++  EI  LL  + AI  ++ K+VP  V +  FW RYFYKV
Subjt:  DSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPE-DLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKV

Query:  HKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVK--KEVNVGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEV--PIRE
         +L+Q E  R  L +RA  ++  E L W+ +D++    G    S+ D     +E  V      +   +S+    +++   V    +  V    V  P + 
Subjt:  HKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVK--KEVNVGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEV--PIRE

Query:  LNAGSSVGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSI
         N   SV     +   + +   + +   D G  +    E  +       +    D +V  E  SD G+S+  +     + +     D    W++  DL +
Subjt:  LNAGSSVGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSI

Query:  IEEKKAVTQGGTVNREEMRKRLSTTEDDEDLDW
         EE+  +             ++  TE+ ED DW
Subjt:  IEEKKAVTQGGTVNREEMRKRLSTTEDDEDLDW

Q3SX22 BSD domain-containing protein 17.3e-1428.45Show/hide
Query:  VADSSADAGAWS--FGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIE-VAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQ
        V     D G W        + +  +S   +E  +RDL EF   ++++       +   V       GSS   G  + + +G    L +  ++ + +    
Subjt:  VADSSADAGAWS--FGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIE-VAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQ

Query:  ------NLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGD-YEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHK
               L    S +++PY    A++ S+Q D ATYC+EP+   + ++ W SQF L +K  EI  LL  + +I  ++ K+VP  V +  FW+RYFYKVH+
Subjt:  ------NLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGD-YEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHK

Query:  LKQAENVRANLVKRAIASEEEEDLSWDVDDDD
        L+Q +  R  L +RA  S  EE   W+ ++++
Subjt:  LKQAENVRANLVKRAIASEEEEDLSWDVDDDD

Q5ZIK6 BSD domain-containing protein 11.7e-1528.44Show/hide
Query:  DAGAWS--FGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQ------N
        DAG W        + +  +S   +E  +RDL EF   ++ +      +  +V    D    +   G  + + +G    L +  ++ + +           
Subjt:  DAGAWS--FGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQ------N

Query:  LSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGD-YEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENV
        L    + +++PY    A++ S+Q D ATYC+EP+   +  E W S+F L +K  EI  LL  + +I  ++ K+VP  V +  FW RYFYKVH+L+Q E  
Subjt:  LSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGD-YEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENV

Query:  RANLVKRAIASEEEEDLSWDVDDDD
        R  L +RA  S  +E+  W+ D+++
Subjt:  RANLVKRAIASEEEEDLSWDVDDDD

Q80Y55 BSD domain-containing protein 16.8e-1225.47Show/hide
Query:  KTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQ------NLSNQRSSSSKPYS
        + +  +S   +E  +RDL EF   ++ +      +  +V    ++  +    G  + + +G    L +  ++ + +           L    S +++PY 
Subjt:  KTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQ------NLSNQRSSSSKPYS

Query:  RFDAQVRSIQGDSATYCDEPEDLGD-YEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEE
           A++ S+Q D ATYC+EP+   + ++ W S+F L +K  EI  LL  + +I  ++ K+VP  V +  FW+RYFYKVH+L+Q +  R  L +RA  S  
Subjt:  RFDAQVRSIQGDSATYCDEPEDLGD-YEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEE

Query:  EEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKD-DLSAKEEVGGKESAEVV
        EE   W  +++++  +G     K   + KE     + ++E    S  ++  +    E    ES+E +
Subjt:  EEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKD-DLSAKEEVGGKESAEVV

Q9NW68 BSD domain-containing protein 14.3e-1427.83Show/hide
Query:  KTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQ------NLSNQRSSSSKPYS
        + +  +S   +E  +RDL EF   ++ +      +  +V    ++  +    G  + + +G    L +  ++ + +           L    S +++PY 
Subjt:  KTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESDSDNSSNQ------NLSNQRSSSSKPYS

Query:  RFDAQVRSIQGDSATYCDEPEDLGD-YEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEE
           A++ S+Q D ATYC+EP+   + ++ W SQF L +K  EI  LL  + +I  ++ K+VP  V +  FW+RYFYKVH+L+Q +  R  L +RA  S  
Subjt:  RFDAQVRSIQGDSATYCDEPEDLGD-YEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEE

Query:  EEDLSWDVDDDD
        EE   W+ ++++
Subjt:  EEDLSWDVDDDD

Arabidopsis top hitse value%identityAlignment
AT1G03350.1 BSD domain-containing protein5.2e-10048.33Show/hide
Query:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETV
        MNFF+SVF++D DP  T ESE  SP     K  EE   P    P    S D G WSFGGL+KTL+ RSESVIETYRRDL+EFG+GLKKEIEVAQGSL TV
Subjt:  MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETV

Query:  GHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESD-SDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIEN
        GHA DE G++VLKGTA+IIAQGK+AILA   ESD SDN+S+Q+   + S SSKPYSRFDAQ+R++QGD  TYC+EPED  DY+KW+S F L+ K EE+E 
Subjt:  GHAFDEFGSSVLKGTAQIIAQGKDAILAIDQESD-SDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIEN

Query:  LLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANS
        LLEENG +  ++K+VVP++VD+ETFW+RYFY+V+KLKQAE++RANLVKRAI+ ++EE+LSWD+DD++++ +     +K DV + ++  G         + 
Subjt:  LLEENGAIDNIHKKVVPNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANS

Query:  TLKDDLSAKEEVGGKESAEVVKKEVPIREL-NAGSSVGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNN-------KDQGSKSDEKVAAEAK
        T+KD++ +   V    + + V     + E+ N G     D EE+         +E D ++  ++K   +     ++        K    ++  + + + K
Subjt:  TLKDDLSAKEEVGGKESAEVVKKEVPIREL-NAGSSVGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNN-------KDQGSKSDEKVAAEAK

Query:  SDHGESSKDS---DVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKAVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE
        SD    S+DS   DV+  S+    P +EDLGWDEIED+S I+ K+    GG+ NR E+RKRLS  E+DEDL W  D +
Subjt:  SDHGESSKDS---DVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKAVTQGGTVNREEMRKRLSTTEDDEDLDWGTDTE

AT4G13110.1 BSD domain-containing protein6.7e-4740.34Show/hide
Query:  EESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKE------------------IEVAQGSLETVGHAFDEFGSSVLKGT
        + SSD  SP   + +S+ + +WSFG LIKTLS +SESVI +YRRDL EFGS LKKE                    VA  SLE+VG   D+ G++V K T
Subjt:  EESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKE------------------IEVAQGSLETVGHAFDEFGSSVLKGT

Query:  AQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVV
        A+II++GK+++             N++ +NQ   S KPY RF+  + ++Q D  T+  EP+DL D+E W     L +K  EI  L+  N  +  I++++V
Subjt:  AQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVV

Query:  PNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKDDLSAKEE
        P  VD ETFW RY+YKV+KL+Q E  R  LVKRAI+ EE+EDLSWD+DD+ +  +  DV SK             +  E+     +++D+ + EE
Subjt:  PNVVDNETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKDDLSAKEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTCTTCAGATCGGTGTTTTCCGACGACCCGGATCCCTCGACGACGCCCGAGTCTGAACCACGATCGCCTGAGAAATCCTTGCTGAAAGAAGGAGAAGAATCATC
GGATCCGAGTTCCCCAAGCCCTGTCGCCGATTCCTCCGCGGATGCCGGAGCGTGGAGTTTCGGTGGTCTGATCAAGACTTTGAGCGCCAGATCGGAATCGGTTATCGAGA
CCTACCGGCGCGATCTCCAGGAATTTGGCTCCGGTTTGAAGAAGGAAATTGAGGTCGCTCAGGGATCGCTGGAGACGGTTGGGCACGCCTTCGATGAATTCGGTAGCTCC
GTTTTGAAAGGTACGGCTCAGATTATCGCGCAAGGTAAGGATGCGATCCTCGCTATAGATCAGGAATCTGATTCTGATAATAGTAGTAATCAAAATTTGAGTAATCAAAG
AAGCTCGAGTTCCAAGCCCTACAGTAGGTTTGATGCTCAGGTTCGTTCGATTCAAGGCGATTCGGCCACGTACTGTGATGAACCTGAAGATTTGGGTGATTATGAGAAGT
GGAAATCACAATTTGTGCTGAATGATAAGAATGAGGAAATTGAAAACTTGCTTGAAGAAAATGGAGCAATAGATAATATACACAAAAAAGTCGTTCCTAATGTTGTTGAT
AATGAAACTTTCTGGTATAGGTATTTTTATAAAGTGCATAAGCTTAAGCAAGCTGAGAATGTGAGGGCAAATCTTGTGAAGAGAGCTATTGCTAGTGAAGAAGAGGAGGA
TTTGAGCTGGGATGTTGATGATGATGATGATAATGATGATGGGTACGATGTCGGGTCGAAAGGGGATGTGGTGAAGAAGGAAGTGAATGTCGGTGAGACGGTGAAAGAGG
AGAACACCGCCAATTCGACATTGAAGGATGATTTATCGGCGAAGGAGGAAGTGGGTGGGAAAGAATCTGCTGAAGTTGTTAAGAAGGAAGTTCCTATCAGGGAATTGAAT
GCAGGAAGTTCTGTTGGTGATGATAAGGAGGAGGAGAATTTGAACAAGAGTGGTGGAAATGTTGAGGAATTAGATGGAGATAAAGGGTCTGATCAGAAAGTCCATTCGGA
AGGTGGCTCTGGGCTGAATAACAACAAAGATCAAGGATCGAAATCAGACGAGAAGGTGGCTGCGGAGGCCAAGTCTGATCATGGTGAATCTTCGAAAGACAGTGATGTTT
CAATTGTTTCGACACAGCCTTCGATGCCTGAGGACGAAGATCTTGGGTGGGATGAGATTGAGGATCTGAGCATCATTGAGGAAAAGAAAGCAGTAACTCAGGGTGGGACA
GTAAATCGAGAGGAAATGCGAAAGCGGCTGAGTACTACTGAAGATGATGAAGATTTGGACTGGGGTACGGATACCGAATGA
mRNA sequenceShow/hide mRNA sequence
CCCGAAAGAAATAACTGAACTACGAAATGGGATTGAGATTTCGAACTTTACCGATTATGGTGTGGCCTTGCGTTCTTCCAGATAGAGATTTCCCACCGTCTTGACGAACA
CTGCACTCTTCAAATTCACGCTTCTCTCGATCTTTGCTGTCTCCGATAATCGCAACTAGTTTCCCCCATTTCCCAATTTTGATTAATCTTCGTTTCAAATCCGTCAAATG
AATTTCTTCAGATCGGTGTTTTCCGACGACCCGGATCCCTCGACGACGCCCGAGTCTGAACCACGATCGCCTGAGAAATCCTTGCTGAAAGAAGGAGAAGAATCATCGGA
TCCGAGTTCCCCAAGCCCTGTCGCCGATTCCTCCGCGGATGCCGGAGCGTGGAGTTTCGGTGGTCTGATCAAGACTTTGAGCGCCAGATCGGAATCGGTTATCGAGACCT
ACCGGCGCGATCTCCAGGAATTTGGCTCCGGTTTGAAGAAGGAAATTGAGGTCGCTCAGGGATCGCTGGAGACGGTTGGGCACGCCTTCGATGAATTCGGTAGCTCCGTT
TTGAAAGGTACGGCTCAGATTATCGCGCAAGGTAAGGATGCGATCCTCGCTATAGATCAGGAATCTGATTCTGATAATAGTAGTAATCAAAATTTGAGTAATCAAAGAAG
CTCGAGTTCCAAGCCCTACAGTAGGTTTGATGCTCAGGTTCGTTCGATTCAAGGCGATTCGGCCACGTACTGTGATGAACCTGAAGATTTGGGTGATTATGAGAAGTGGA
AATCACAATTTGTGCTGAATGATAAGAATGAGGAAATTGAAAACTTGCTTGAAGAAAATGGAGCAATAGATAATATACACAAAAAAGTCGTTCCTAATGTTGTTGATAAT
GAAACTTTCTGGTATAGGTATTTTTATAAAGTGCATAAGCTTAAGCAAGCTGAGAATGTGAGGGCAAATCTTGTGAAGAGAGCTATTGCTAGTGAAGAAGAGGAGGATTT
GAGCTGGGATGTTGATGATGATGATGATAATGATGATGGGTACGATGTCGGGTCGAAAGGGGATGTGGTGAAGAAGGAAGTGAATGTCGGTGAGACGGTGAAAGAGGAGA
ACACCGCCAATTCGACATTGAAGGATGATTTATCGGCGAAGGAGGAAGTGGGTGGGAAAGAATCTGCTGAAGTTGTTAAGAAGGAAGTTCCTATCAGGGAATTGAATGCA
GGAAGTTCTGTTGGTGATGATAAGGAGGAGGAGAATTTGAACAAGAGTGGTGGAAATGTTGAGGAATTAGATGGAGATAAAGGGTCTGATCAGAAAGTCCATTCGGAAGG
TGGCTCTGGGCTGAATAACAACAAAGATCAAGGATCGAAATCAGACGAGAAGGTGGCTGCGGAGGCCAAGTCTGATCATGGTGAATCTTCGAAAGACAGTGATGTTTCAA
TTGTTTCGACACAGCCTTCGATGCCTGAGGACGAAGATCTTGGGTGGGATGAGATTGAGGATCTGAGCATCATTGAGGAAAAGAAAGCAGTAACTCAGGGTGGGACAGTA
AATCGAGAGGAAATGCGAAAGCGGCTGAGTACTACTGAAGATGATGAAGATTTGGACTGGGGTACGGATACCGAATGACGAGACTGGTAAAGCTTGAAAGATCAAATCTG
TAGAAGGTTATTAATTGATGTCCTATGTTCTCTTTCATTTTTTTCCAAGTATTATTCCTTATCTGTTAAATGCTATGTGCCGTGATTGTACATCTTTAAACTGTCCTCGA
AGGAATCATGTTCAAATTTGTACTTGATTTCCTAATATCATAAAAGGTGAATTCCTTTCCTTCAACTATTGATCTCTCTAGGCTCCACGTCGTTTTTTTACAGATAGGAC
TTTCTTTTCCTGAAACTCTAGATTTCGAGCCTTTTATCGTTCTTCCTTCTACTTCAATGTGCATATCTACCTAAATTGAAAGGCCTTAGGCAACACTTGTGAATTGCTTC
AAAGAGCTACCTATATTTTAACCTATCCTTTCGGGATTTGAATAATATCTCGGGTGCTATATTTTCCTGTAAATTCTTTATATTTTAGGCGTCGAAATGAAAATAATCAA
GCATGCTAGAAAATTCCAGAGGCTGATTCAGTTTCTTTCAAGTCACTCCCATG
Protein sequenceShow/hide protein sequence
MNFFRSVFSDDPDPSTTPESEPRSPEKSLLKEGEESSDPSSPSPVADSSADAGAWSFGGLIKTLSARSESVIETYRRDLQEFGSGLKKEIEVAQGSLETVGHAFDEFGSS
VLKGTAQIIAQGKDAILAIDQESDSDNSSNQNLSNQRSSSSKPYSRFDAQVRSIQGDSATYCDEPEDLGDYEKWKSQFVLNDKNEEIENLLEENGAIDNIHKKVVPNVVD
NETFWYRYFYKVHKLKQAENVRANLVKRAIASEEEEDLSWDVDDDDDNDDGYDVGSKGDVVKKEVNVGETVKEENTANSTLKDDLSAKEEVGGKESAEVVKKEVPIRELN
AGSSVGDDKEEENLNKSGGNVEELDGDKGSDQKVHSEGGSGLNNNKDQGSKSDEKVAAEAKSDHGESSKDSDVSIVSTQPSMPEDEDLGWDEIEDLSIIEEKKAVTQGGT
VNREEMRKRLSTTEDDEDLDWGTDTE