; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g0600 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g0600
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationMC11:4686503..4692895
RNA-Seq ExpressionMC11g0600
SyntenyMC11g0600
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591699.1 hypothetical protein SDJN03_14045, partial [Cucurbita argyrosperma subsp. sororia]5.88e-26672.66Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL
        MIE KEKQKKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PL +     DSDL
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL

Query:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN
        DFEIE FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGIS NS+S RA+LENPQS CL+Q MYV +PISIQRQP+ +  A  TEVFDVN
Subjt:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN

Query:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN
        GAAGGNAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNN
Subjt:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN

Query:  SKTEY-SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARN
        SK+   SN+NS  +SS SGAEAPVQ+QNQSPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIV ATAVAAGARIVSPSDAASL+KAAQ +N
Subjt:  SKTEY-SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARN

Query:  AIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEE
        AIHIKS C SSI+PPV GNA  H D RP++HYISTG+ A+PGSNYVGGK  +  NNS+K +SP   ++ ST A+L N  S+Q SP TESP K+E+KSSEE
Subjt:  AIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEE

Query:  RKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG
         K+ +P+ TPK + RE+  V    FA++ SD E  S STCIEN N S N +IDEN IKA C +Q E KK ANDV+I G
Subjt:  RKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG

KAG7024581.1 hypothetical protein SDJN02_13399, partial [Cucurbita argyrosperma subsp. argyrosperma]1.47e-26772.46Show/hide
Query:  GSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDD
        GS+MIE KEKQKKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PL +     D
Subjt:  GSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDD

Query:  SDLDFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVF
        SDLDFEIE FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGIS NS+S RA+LENPQS CL+Q MYV +PISIQRQP+ +  A  TEVF
Subjt:  SDLDFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVF

Query:  DVNGAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLP
        DVNGAAG NAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALD P
Subjt:  DVNGAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLP

Query:  VNNSKTEY-SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQ
        VNNSK+   SN+NS  +SS SGAEAPVQ+QNQSPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ
Subjt:  VNNSKTEY-SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQ

Query:  ARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKS
         +NAIHIKS C SSI+PP+ GNA  H D RP++HYISTG+ A+PGSNYVGGK  +  NNS+K +SP   ++ ST A+L N  S+Q SP TESP K+E+KS
Subjt:  ARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKS

Query:  SEERKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG
        SEE K+ +P+ TPK + RE+  V    FA++ SD E  S STCIEN N S N EI+EN IKA C +Q E KK ANDV+IRG
Subjt:  SEERKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG

XP_022137164.1 uncharacterized protein LOC111008703 [Momordica charantia]0.099.18Show/hide
Query:  MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDD
        MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPL     DD
Subjt:  MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDD

Query:  DSDLDFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVF
        DSDLDFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVF
Subjt:  DSDLDFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVF

Query:  DVNGAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLP
        DVNGAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLP
Subjt:  DVNGAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLP

Query:  VNNSKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQA
        VNNSKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQA
Subjt:  VNNSKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQA

Query:  RNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSS
        RNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSS
Subjt:  RNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSS

Query:  EERKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQ
        EERKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQ
Subjt:  EERKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQ

Query:  NEQMNLKAYIRR
        NEQMNLKAYIRR
Subjt:  NEQMNLKAYIRR

XP_023534835.1 uncharacterized protein LOC111796458 isoform X1 [Cucurbita pepo subsp. pepo]7.26e-25971.28Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL
        MIE KEK KKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PL +     DSDL
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL

Query:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN
        DFEIE FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGIS NS+  RA+LENPQS CL+Q MYV  PIS+QRQP+ +  A  TEVFDVN
Subjt:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN

Query:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN
        GAAGGNAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKRR  L+VGAN T TQ SKAQIDA HRALS ALDLPVNN
Subjt:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN

Query:  SKTEY-SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARN
        SK+   SN+NS  +SS SGAEAPVQ+QNQSPQ+  P RP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +N
Subjt:  SKTEY-SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARN

Query:  AIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEE
        AIHIKS C SSI+PPV GNA  H D RP++HYISTG+ A+PGSNYVGGK  +    S+K +SP   ++ ST A+  N  S+Q SP TESP K+E+KSSEE
Subjt:  AIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEE

Query:  RKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG
         K+ +P+ T K++ RE+  V    FA++ SD E  S STCIEN N S N EIDEN IKA C +Q E K  ANDV+IRG
Subjt:  RKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG

XP_023534838.1 uncharacterized protein LOC111796458 isoform X2 [Cucurbita pepo subsp. pepo]1.05e-26071.4Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL
        MIE KEK KKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PL +     DSDL
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL

Query:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN
        DFEIE FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGIS NS+  RA+LENPQS CL+Q MYV  PIS+QRQP+ +  A  TEVFDVN
Subjt:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN

Query:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN
        GAAGGNAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKRR  L+VGAN T TQ SKAQIDA HRALS ALDLPVNN
Subjt:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN

Query:  SKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNA
        SK+  SN+NS  +SS SGAEAPVQ+QNQSPQ+  P RP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NA
Subjt:  SKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNA

Query:  IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEER
        IHIKS C SSI+PPV GNA  H D RP++HYISTG+ A+PGSNYVGGK  +    S+K +SP   ++ ST A+  N  S+Q SP TESP K+E+KSSEE 
Subjt:  IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEER

Query:  KMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG
        K+ +P+ T K++ RE+  V    FA++ SD E  S STCIEN N S N EIDEN IKA C +Q E K  ANDV+IRG
Subjt:  KMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG

TrEMBL top hitse value%identityAlignment
A0A6J1C5S4 uncharacterized protein LOC1110087030.099.18Show/hide
Query:  MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDD
        MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPL     DD
Subjt:  MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDD

Query:  DSDLDFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVF
        DSDLDFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVF
Subjt:  DSDLDFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVF

Query:  DVNGAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLP
        DVNGAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLP
Subjt:  DVNGAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLP

Query:  VNNSKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQA
        VNNSKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQA
Subjt:  VNNSKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQA

Query:  RNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSS
        RNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSS
Subjt:  RNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSS

Query:  EERKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQ
        EERKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQ
Subjt:  EERKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQ

Query:  NEQMNLKAYIRR
        NEQMNLKAYIRR
Subjt:  NEQMNLKAYIRR

A0A6J1FAZ9 uncharacterized protein LOC111443670 isoform X23.55e-24874.56Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL
        MIE KEKQKKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PL +     DSDL
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL

Query:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN
        DFEIE FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGIS NS+S RA+LENPQS CL+Q MYV +PISIQRQP+ +  A  TEVFDVN
Subjt:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN

Query:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN
        GAAG NAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNN
Subjt:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN

Query:  SKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNA
        SK+  SN+NS  +SS SGAEAPVQ+QNQSPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NA
Subjt:  SKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNA

Query:  IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEER
        IHIKS C SSI+PP+ GNA  H D RP++HYISTG+ A+PG+NYVGGK  +  NNS+K +SP   ++ ST A+L N  S+Q SP TESP K+E+KSSEE 
Subjt:  IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEER

Query:  KMPEPVATPKEEARE
        K+ +P+ TPK + RE
Subjt:  KMPEPVATPKEEARE

A0A6J1FGE2 uncharacterized protein LOC111443670 isoform X12.45e-24674.42Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL
        MIE KEKQKKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PL +     DSDL
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL

Query:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN
        DFEIE FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGIS NS+S RA+LENPQS CL+Q MYV +PISIQRQP+ +  A  TEVFDVN
Subjt:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN

Query:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN
        GAAG NAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNN
Subjt:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN

Query:  SKTEY-SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARN
        SK+   SN+NS  +SS SGAEAPVQ+QNQSPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +N
Subjt:  SKTEY-SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARN

Query:  AIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEE
        AIHIKS C SSI+PP+ GNA  H D RP++HYISTG+ A+PG+NYVGGK  +  NNS+K +SP   ++ ST A+L N  S+Q SP TESP K+E+KSSEE
Subjt:  AIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEE

Query:  RKMPEPVATPKEEARE
         K+ +P+ TPK + RE
Subjt:  RKMPEPVATPKEEARE

A0A6J1IGI9 uncharacterized protein LOC111476736 isoform X27.44e-25371.43Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL
        MIE KEKQKKG IS+ED  + +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PL +     DSDL
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL

Query:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN
        DFEIE FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGIS NS+S RA+LENPQS CL+Q MYV +PISI+RQP+ +  A  TEVFDVN
Subjt:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN

Query:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN
        GAAG NAASRKRRKPWSK +D+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVN 
Subjt:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN

Query:  SKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNA
        SK+  SN+NS  +SS SGAEAPVQ+QNQSPQ+  P RP+ V+PLP A K GI+T KN LMMKSTHNSDSIVRATAVAAGARIVSP DAASL+K AQ +NA
Subjt:  SKTEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNA

Query:  IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEER
        IHIKS C SSI+PPV GNA  H D +P++HYISTG+ A+PGSNYVGGK  +  NNS+K ++P   ++ ST A+L N  S+Q SP TESP K+E+KSSEE 
Subjt:  IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEER

Query:  KMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGC
        K+ +P+ T K+++RE+  V    FA+  SD E  S STCIEN N S N EIDEN IKA C
Subjt:  KMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGC

A0A6J1IN48 uncharacterized protein LOC111476736 isoform X15.12e-25171.3Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL
        MIE KEKQKKG IS+ED  + +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PL +     DSDL
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL

Query:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN
        DFEIE FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGIS NS+S RA+LENPQS CL+Q MYV +PISI+RQP+ +  A  TEVFDVN
Subjt:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGIS-NSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVN

Query:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN
        GAAG NAASRKRRKPWSK +D+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVN 
Subjt:  GAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNN

Query:  SKTEY-SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARN
        SK+   SN+NS  +SS SGAEAPVQ+QNQSPQ+  P RP+ V+PLP A K GI+T KN LMMKSTHNSDSIVRATAVAAGARIVSP DAASL+K AQ +N
Subjt:  SKTEY-SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARN

Query:  AIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEE
        AIHIKS C SSI+PPV GNA  H D +P++HYISTG+ A+PGSNYVGGK  +  NNS+K ++P   ++ ST A+L N  S+Q SP TESP K+E+KSSEE
Subjt:  AIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEE

Query:  RKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGC
         K+ +P+ T K+++RE+  V    FA+  SD E  S STCIEN N S N EIDEN IKA C
Subjt:  RKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G09710.1 Homeodomain-like superfamily protein5.3e-6036.69Show/hide
Query:  ISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDLDFEIESFPSVNS
        I +E DI+TLL RY + TIL +L+E++  SE ++DW+ LVK T+TGI+NAREYQ+LWRHL+YRH LL   D    PL     DDDSD++ E+E+ P+V+ 
Subjt:  ISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDLDFEIESFPSVNS

Query:  ESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRK
        E+  EA A VKV+ A+ + SESDI   S VEAPLTI I  +  + S+   E+P S    + M +  P+ +Q+         STE  + NG+AG + A R+
Subjt:  ESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRK

Query:  RRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEYSNINSC
        +RK WS  ED EL AAV++CGEGNWA+I+KGDF+G RTASQLSQRW++IRK RC+ +   +  G Q ++A++   H ALS AL     ++K     + + 
Subjt:  RRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEYSNINSC

Query:  IISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMK----STHNSDSIVRATAVAAGA-----------RIVSP--SDAASLLKA
           + +  EA     +Q  Q  KP    +V+ LP A    +  +K+ ++ K    ST  SD +V A +VAA A           R V P  +DA  + K 
Subjt:  IISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMK----STHNSDSIVRATAVAAGA-----------RIVSP--SDAASLLKA

Query:  AQARNA--IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKR
           ++A  + +     S   P V     + +    +I  ++ GKL    ++    KP ++   S       +L   +  A L  + S+QR  A   P+  
Subjt:  AQARNA--IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKR

Query:  EIKSSEERKMPEPVATPKEEARESEA
         +         +P    K +AR +EA
Subjt:  EIKSSEERKMPEPVATPKEEARESEA

AT1G09710.2 Homeodomain-like superfamily protein2.8e-6149.12Show/hide
Query:  ISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDLDFEIESFPSVNS
        I +E DI+TLL RY + TIL +L+E++  SE ++DW+ LVK T+TGI+NAREYQ+LWRHL+YRH LL   D    PL     DDDSD++ E+E+ P+V+ 
Subjt:  ISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDLDFEIESFPSVNS

Query:  ESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRK
        E+  EA A VKV+ A+ + SESDI   S VEAPLTI I  +  + S+   E+P S    + M +  P+ +Q+         STE  + NG+AG + A R+
Subjt:  ESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRK

Query:  RRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFAL
        +RK WS  ED EL AAV++CGEGNWA+I+KGDF+G RTASQLSQRW++IRK RC+ +   +  G Q ++A++   H ALS AL
Subjt:  RRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFAL

AT1G58220.1 Homeodomain-like superfamily protein4.2e-5738.64Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL
        M++    +K+    SE DI+TLL+RY   TIL LL+E+A  +E +++W++LVK TSTGI++AREYQ+LWRHLAYR +L+       G  A  L DDDSD+
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDL

Query:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNG
        + E+E+ P V+ + + EA A VKV+ A+ +PSESDIP  S VEAPLTI I  S   R   E   S    + M +  P+ + +         + E  + NG
Subjt:  DFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNG

Query:  AAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNS
         A  + A RKRRK WS  ED ELIAAV++ GEG+WA I K +F+G RTASQLSQRW  IR+R    N  +  TG Q ++AQ+ A +RALS A+   + + 
Subjt:  AAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNS

Query:  K-----TEYSNINSCIISSASGAEAPVQMQNQ---SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLK
        K     T   +  +   + A+GA +   +Q Q    PQI   SR     P+  +  P   T+ N     ST  +D +V A +VAA A +   + A ++ K
Subjt:  K-----TEYSNINSCIISSASGAEAPVQMQNQ---SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLK

Query:  AAQARNAIH--------IKSSCASSIKPPVHGNAPIHSDP
            +NA+         +K++   S+  P   ++ ++++P
Subjt:  AAQARNAIH--------IKSSCASSIKPPVHGNAPIHSDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTCTCTAATGATTGAGAGAAAAGAGAAGCAAAAGAAAGGGATAATTAGTAGTGAGGATGATATTTCCACTCTACTCGAAAGATATTCGGTAAGGACGATACTGAC
ATTGCTTCGGGAGGTGGCCCAGGTTTCGGAAGTGAGAATTGATTGGGACAAGTTAGTGAAGAACACGTCAACTGGGATTTCTAATGCTCGAGAATATCAGATGTTATGGC
GGCATTTGGCTTATCGTCACACGTTACTGGAGAATATGGATTGTCTTACTGGTCCACTGGCGTTTGGTTTACAGGATGATGATAGTGATTTAGACTTTGAAATAGAGTCT
TTTCCATCAGTGAACAGCGAGTCCTTAAATGAAGCTGCAGCATTTGTGAAGGTACTGATTGCCAATGCTATACCGAGCGAGTCAGATATTCCTAGTAGTTCTGCAGTTGA
GGCCCCATTGACTATAGGTATATCCAATAGTCAATCATCTAGAGCCAATCTTGAAAATCCTCAATCTGGTTGTTTGTTGCAAGAGATGTATGTTGCAATTCCAATTTCCA
TTCAGAGACAGCCTATTTCTTCTACACCAGCCGTATCAACTGAAGTATTTGATGTGAATGGAGCAGCCGGTGGTAATGCAGCTTCTCGAAAAAGAAGAAAACCCTGGTCA
AAGGCAGAGGATATGGAACTGATTGCTGCTGTTCAGAAGTGTGGTGAAGGGAACTGGGCGAATATCTTGAAAGGAGATTTTAAGGGGAATAGAACTGCTTCACAGCTATC
TCAGAGGTGGTCCATTATTAGGAAGCGACGCTGTAATTTGAATGTGGGAGCTAACGCCACAGGTACTCAGATATCTAAAGCTCAGATTGATGCTACACACCGTGCATTGT
CCTTTGCGCTTGATTTGCCCGTGAATAACTCGAAAACAGAGTATTCAAACATAAACAGTTGCATTATCTCTTCTGCAAGTGGTGCCGAAGCTCCGGTTCAAATGCAGAAT
CAGTCTCCTCAGATTCCCAAGCCTTCAAGGCCAGTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAACCTGGAATCGACACTTCCAAGAATGCATTGATGATGAAGTCTAC
TCACAATTCCGATTCTATAGTTCGAGCGACCGCAGTTGCTGCGGGAGCCCGCATCGTTTCTCCATCAGATGCTGCATCTCTACTGAAGGCTGCACAGGCAAGAAATGCCA
TTCACATAAAGTCCAGTTGTGCCTCTTCGATCAAACCACCTGTGCATGGTAATGCACCAATTCACTCGGATCCACGCCCTAACATACATTACATTTCCACGGGAAAATTG
GCATCTCCAGGTTCGAACTATGTCGGTGGTAAACCGAATGTGGTATGTAATAACTCAGTGAAGGCCATCTCGCCAATTGTTCTGCATCATCGTTCCACTTCTGCTATTTT
GATGAATGTGCAGTCAGACCAAAGAAGCCCAGCAACCGAGTCTCCGTCGAAGCGAGAGATTAAGAGTTCAGAAGAACGCAAAATGCCTGAGCCGGTTGCAACTCCGAAAG
AGGAGGCTCGAGAATCCGAAGCTGTTCGGGGAGGAAACTTTGCTACCGAAAGATCAGATGGGGAATTGAGAAGTCTTTCAACTTGCATTGAGAATCACAATGGTTCAAAT
ACGGAGATAGATGAAAATGGTATTAAAGCAGGATGTTCCGAGCAGGTCGAAACGAAAAAGAACGCAAATGATGTCGAGATTAGGGGTTCACCAGACACTCAAGGAAGCAA
GAATGAATCCGTAGCAGATATTGTGGGCCATAGCACACAAAATGAACAGATGAACTTGAAAGCATATATTAGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
TCCAGTTCCAAATAGCGATCTATTCGGTTTCCACGAGCGCCAATTCTCTGCCAGCGTGTCCACCATCGAGCCCTGCACCCTCATCACCGTCGCCGATAAGTCCGTAATTC
TGCTGCAACTGCAACGTTTTAATTTCCCTCTCCGGCTTCGTCTCATAACAATTTCCGAGAAGCGATTCGAGTCTCGTCGGCGACGGCCCACTGCCGGTGCACCTCTAGGG
TTTAGGGTTTTGGCCGACTTCCGATCATAGTATAATTGGTTACTTTTGCTTCTAAGGCTTCTCTGATTCTTGCTCTTCATATGCGATTTCATCATGGGCTCTCTAATGAT
TGAGAGAAAAGAGAAGCAAAAGAAAGGGATAATTAGTAGTGAGGATGATATTTCCACTCTACTCGAAAGATATTCGGTAAGGACGATACTGACATTGCTTCGGGAGGTGG
CCCAGGTTTCGGAAGTGAGAATTGATTGGGACAAGTTAGTGAAGAACACGTCAACTGGGATTTCTAATGCTCGAGAATATCAGATGTTATGGCGGCATTTGGCTTATCGT
CACACGTTACTGGAGAATATGGATTGTCTTACTGGTCCACTGGCGTTTGGTTTACAGGATGATGATAGTGATTTAGACTTTGAAATAGAGTCTTTTCCATCAGTGAACAG
CGAGTCCTTAAATGAAGCTGCAGCATTTGTGAAGGTACTGATTGCCAATGCTATACCGAGCGAGTCAGATATTCCTAGTAGTTCTGCAGTTGAGGCCCCATTGACTATAG
GTATATCCAATAGTCAATCATCTAGAGCCAATCTTGAAAATCCTCAATCTGGTTGTTTGTTGCAAGAGATGTATGTTGCAATTCCAATTTCCATTCAGAGACAGCCTATT
TCTTCTACACCAGCCGTATCAACTGAAGTATTTGATGTGAATGGAGCAGCCGGTGGTAATGCAGCTTCTCGAAAAAGAAGAAAACCCTGGTCAAAGGCAGAGGATATGGA
ACTGATTGCTGCTGTTCAGAAGTGTGGTGAAGGGAACTGGGCGAATATCTTGAAAGGAGATTTTAAGGGGAATAGAACTGCTTCACAGCTATCTCAGAGGTGGTCCATTA
TTAGGAAGCGACGCTGTAATTTGAATGTGGGAGCTAACGCCACAGGTACTCAGATATCTAAAGCTCAGATTGATGCTACACACCGTGCATTGTCCTTTGCGCTTGATTTG
CCCGTGAATAACTCGAAAACAGAGTATTCAAACATAAACAGTTGCATTATCTCTTCTGCAAGTGGTGCCGAAGCTCCGGTTCAAATGCAGAATCAGTCTCCTCAGATTCC
CAAGCCTTCAAGGCCAGTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAACCTGGAATCGACACTTCCAAGAATGCATTGATGATGAAGTCTACTCACAATTCCGATTCTA
TAGTTCGAGCGACCGCAGTTGCTGCGGGAGCCCGCATCGTTTCTCCATCAGATGCTGCATCTCTACTGAAGGCTGCACAGGCAAGAAATGCCATTCACATAAAGTCCAGT
TGTGCCTCTTCGATCAAACCACCTGTGCATGGTAATGCACCAATTCACTCGGATCCACGCCCTAACATACATTACATTTCCACGGGAAAATTGGCATCTCCAGGTTCGAA
CTATGTCGGTGGTAAACCGAATGTGGTATGTAATAACTCAGTGAAGGCCATCTCGCCAATTGTTCTGCATCATCGTTCCACTTCTGCTATTTTGATGAATGTGCAGTCAG
ACCAAAGAAGCCCAGCAACCGAGTCTCCGTCGAAGCGAGAGATTAAGAGTTCAGAAGAACGCAAAATGCCTGAGCCGGTTGCAACTCCGAAAGAGGAGGCTCGAGAATCC
GAAGCTGTTCGGGGAGGAAACTTTGCTACCGAAAGATCAGATGGGGAATTGAGAAGTCTTTCAACTTGCATTGAGAATCACAATGGTTCAAATACGGAGATAGATGAAAA
TGGTATTAAAGCAGGATGTTCCGAGCAGGTCGAAACGAAAAAGAACGCAAATGATGTCGAGATTAGGGGTTCACCAGACACTCAAGGAAGCAAGAATGAATCCGTAGCAG
ATATTGTGGGCCATAGCACACAAAATGAACAGATGAACTTGAAAGCATATATTAGAAGGTGAATGCATAGACAATTTTTGAAACCATTGTACAAATGATGAATAAACAGT
GAAGGCACAAAATTTTCTGTAGGTGTAAATAAAGAGTGAATCTTGCAGTTGCAGTGATACGGAATTCCATGTTCTCTTACTAGAGAGTGAAAATTAATTTTCCAGTTTTT
TTTCTCAGAATTTTGTGTAAATAAATATCTCAATGGCAAGACCAACATTTGTAAGGTCATCATATTCAACATTTGTATGTAGGTTTCATTGTAAGTCCCTTTTTTTTTTA
AGACTTCATTAAAAGAAAAATTGCAC
Protein sequenceShow/hide protein sequence
MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLAFGLQDDDSDLDFEIES
FPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWS
KAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEYSNINSCIISSASGAEAPVQMQN
QSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKL
ASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSN
TEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQNEQMNLKAYIRR