; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g06920 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g06920
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionHTH myb-type domain-containing protein
Genome locationchr11:4719366..4725047
RNA-Seq ExpressionMoc11g06920
SyntenyMoc11g06920
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591699.1 hypothetical protein SDJN03_14045, partial [Cucurbita argyrosperma subsp. sororia]8.6e-21473.65Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE
        MIE KEKQKKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE

Query:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG
         FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGAAGG
Subjt:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG

Query:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-E
        NAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNNSK+  
Subjt:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-E

Query:  YSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIK
         SN+NS  +SS SGAEAPVQ+QNQSPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIV ATAVAAGARIVSPSDAASL+KAAQ +NAIHIK
Subjt:  YSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIK

Query:  SSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPE
        S C SSI+PPV GNA  H D RP++HYISTG+ A+PGSNYVGGK  +  NNS+K +SP   ++ ST A+L N  S+Q SP TESP K+E+KSSEE K+ +
Subjt:  SSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPE

Query:  PVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG
        P+ TPK + RE+  V    FA++ SD E  S STCIEN N S N +IDEN IKA C +Q E KK ANDV+I G
Subjt:  PVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG

KAG7024581.1 hypothetical protein SDJN02_13399, partial [Cucurbita argyrosperma subsp. argyrosperma]2.7e-21573.44Show/hide
Query:  GSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDF
        GS+MIE KEKQKKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDF
Subjt:  GSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDF

Query:  EIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGA
        EIE FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGA
Subjt:  EIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGA

Query:  AGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK
        AG NAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALD PVNNSK
Subjt:  AGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK

Query:  T-EYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAI
        +   SN+NS  +SS SGAEAPVQ+QNQSPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAI
Subjt:  T-EYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAI

Query:  HIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERK
        HIKS C SSI+PP+ GNA  H D RP++HYISTG+ A+PGSNYVGGK  +  NNS+K +SP   ++ ST A+L N  S+Q SP TESP K+E+KSSEE K
Subjt:  HIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERK

Query:  MPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG
        + +P+ TPK + RE+  V    FA++ SD E  S STCIEN N S N EI+EN IKA C +Q E KK ANDV+IRG
Subjt:  MPEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG

XP_022137164.1 uncharacterized protein LOC111008703 [Momordica charantia]0.0e+00100Show/hide
Query:  MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLD
        MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLD
Subjt:  MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLD

Query:  FEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGA
        FEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGA
Subjt:  FEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGA

Query:  AGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK
        AGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK
Subjt:  AGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK

Query:  TEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIH
        TEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIH
Subjt:  TEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIH

Query:  IKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKM
        IKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKM
Subjt:  IKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKM

Query:  PEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQNEQMN
        PEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQNEQMN
Subjt:  PEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQNEQMN

Query:  LKAYIRR
        LKAYIRR
Subjt:  LKAYIRR

XP_023534835.1 uncharacterized protein LOC111796458 isoform X1 [Cucurbita pepo subsp. pepo]2.4e-20872.25Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE
        MIE KEK KKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE

Query:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG
         FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGI SNS+  RA+LENPQS CL+Q MYV  PIS+QRQP+  TP+ +TEVFDVNGAAGG
Subjt:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG

Query:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-E
        NAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKRR  L+VGAN T TQ SKAQIDA HRALS ALDLPVNNSK+  
Subjt:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-E

Query:  YSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIK
         SN+NS  +SS SGAEAPVQ+QNQSPQ+  P RP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAIHIK
Subjt:  YSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIK

Query:  SSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPE
        S C SSI+PPV GNA  H D RP++HYISTG+ A+PGSNYVGGK  +    S+K +SP   ++ ST A+  N  S+Q SP TESP K+E+KSSEE K+ +
Subjt:  SSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPE

Query:  PVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG
        P+ T K++ RE+  V    FA++ SD E  S STCIEN N S N EIDEN IKA C +Q E K  ANDV+IRG
Subjt:  PVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG

XP_023534838.1 uncharacterized protein LOC111796458 isoform X2 [Cucurbita pepo subsp. pepo]9.9e-21072.38Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE
        MIE KEK KKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE

Query:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG
         FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGI SNS+  RA+LENPQS CL+Q MYV  PIS+QRQP+  TP+ +TEVFDVNGAAGG
Subjt:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG

Query:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEY
        NAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKRR  L+VGAN T TQ SKAQIDA HRALS ALDLPVNNSK+  
Subjt:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEY

Query:  SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKS
        SN+NS  +SS SGAEAPVQ+QNQSPQ+  P RP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAIHIKS
Subjt:  SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKS

Query:  SCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPEP
         C SSI+PPV GNA  H D RP++HYISTG+ A+PGSNYVGGK  +    S+K +SP   ++ ST A+  N  S+Q SP TESP K+E+KSSEE K+ +P
Subjt:  SCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPEP

Query:  VATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG
        + T K++ RE+  V    FA++ SD E  S STCIEN N S N EIDEN IKA C +Q E K  ANDV+IRG
Subjt:  VATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSEQVETKKNANDVEIRG

TrEMBL top hitse value%identityAlignment
A0A6J1C5S4 uncharacterized protein LOC1110087030.0e+00100Show/hide
Query:  MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLD
        MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLD
Subjt:  MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLD

Query:  FEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGA
        FEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGA
Subjt:  FEIESFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGA

Query:  AGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK
        AGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK
Subjt:  AGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK

Query:  TEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIH
        TEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIH
Subjt:  TEYSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIH

Query:  IKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKM
        IKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKM
Subjt:  IKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKM

Query:  PEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQNEQMN
        PEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQNEQMN
Subjt:  PEPVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQNEQMN

Query:  LKAYIRR
        LKAYIRR
Subjt:  LKAYIRR

A0A6J1FAZ9 uncharacterized protein LOC111443670 isoform X23.4e-20075.69Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE
        MIE KEKQKKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE

Query:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG
         FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGAAG 
Subjt:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG

Query:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEY
        NAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNNSK+  
Subjt:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEY

Query:  SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKS
        SN+NS  +SS SGAEAPVQ+QNQSPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAIHIKS
Subjt:  SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKS

Query:  SCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPEP
         C SSI+PP+ GNA  H D RP++HYISTG+ A+PG+NYVGGK  +  NNS+K +SP   ++ ST A+L N  S+Q SP TESP K+E+KSSEE K+ +P
Subjt:  SCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPEP

Query:  VATPKEEARE
        + TPK + RE
Subjt:  VATPKEEARE

A0A6J1FGE2 uncharacterized protein LOC111443670 isoform X18.5e-19975.54Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE
        MIE KEKQKKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE

Query:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG
         FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGAAG 
Subjt:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG

Query:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-E
        NAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNNSK+  
Subjt:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-E

Query:  YSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIK
         SN+NS  +SS SGAEAPVQ+QNQSPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAIHIK
Subjt:  YSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIK

Query:  SSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPE
        S C SSI+PP+ GNA  H D RP++HYISTG+ A+PG+NYVGGK  +  NNS+K +SP   ++ ST A+L N  S+Q SP TESP K+E+KSSEE K+ +
Subjt:  SSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPE

Query:  PVATPKEEARE
        P+ TPK + RE
Subjt:  PVATPKEEARE

A0A6J1IGI9 uncharacterized protein LOC111476736 isoform X28.7e-20472.17Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE
        MIE KEKQKKG IS+ED  + +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE

Query:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG
         FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISI+RQP+  TP+ +TEVFDVNGAAG 
Subjt:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG

Query:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEY
        NAASRKRRKPWSK +D+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVN SK+  
Subjt:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEY

Query:  SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKS
        SN+NS  +SS SGAEAPVQ+QNQSPQ+  P RP+ V+PLP A K GI+T KN LMMKSTHNSDSIVRATAVAAGARIVSP DAASL+K AQ +NAIHIKS
Subjt:  SNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKS

Query:  SCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPEP
         C SSI+PPV GNA  H D +P++HYISTG+ A+PGSNYVGGK  +  NNS+K ++P   ++ ST A+L N  S+Q SP TESP K+E+KSSEE K+ +P
Subjt:  SCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPEP

Query:  VATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSE
        + T K+++RE+  V    FA+  SD E  S STCIEN N S N EIDEN IKA C +
Subjt:  VATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSE

A0A6J1IN48 uncharacterized protein LOC111476736 isoform X12.2e-20272.04Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE
        MIE KEKQKKG IS+ED  + +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE

Query:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG
         FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISI+RQP+  TP+ +TEVFDVNGAAG 
Subjt:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGG

Query:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-E
        NAASRKRRKPWSK +D+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVN SK+  
Subjt:  NAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-E

Query:  YSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIK
         SN+NS  +SS SGAEAPVQ+QNQSPQ+  P RP+ V+PLP A K GI+T KN LMMKSTHNSDSIVRATAVAAGARIVSP DAASL+K AQ +NAIHIK
Subjt:  YSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIK

Query:  SSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPE
        S C SSI+PPV GNA  H D +P++HYISTG+ A+PGSNYVGGK  +  NNS+K ++P   ++ ST A+L N  S+Q SP TESP K+E+KSSEE K+ +
Subjt:  SSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPE

Query:  PVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSE
        P+ T K+++RE+  V    FA+  SD E  S STCIEN N S N EIDEN IKA C +
Subjt:  PVATPKEEARESEAVRGGNFATERSDGELRSLSTCIENHNGS-NTEIDENGIKAGCSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G09710.1 Homeodomain-like superfamily protein7.4e-6237.04Show/hide
Query:  ISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNE
        I +E DI+TLL RY + TIL +L+E++  SE ++DW+ LVK T+TGI+NAREYQ+LWRHL+YRH LL   D    PLDDDSD++ E+E+ P+V+ E+  E
Subjt:  ISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNE

Query:  AAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPW
        A A VKV+ A+ + SESDI   S VEAPLTI I  +  + S+   E+P S    + M +  P+ +Q+         STE  + NG+AG + A R++RK W
Subjt:  AAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPW

Query:  SKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEYSNINSCIISSA
        S  ED EL AAV++CGEGNWA+I+KGDF+G RTASQLSQRW++IRK RC+ +   +  G Q ++A++   H ALS AL     ++K     + +    + 
Subjt:  SKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEYSNINSCIISSA

Query:  SGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMK----STHNSDSIVRATAVAAGA-----------RIVSP--SDAASLLKAAQARN
        +  EA     +Q  Q  KP    +V+ LP A    +  +K+ ++ K    ST  SD +V A +VAA A           R V P  +DA  + K    ++
Subjt:  SGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMK----STHNSDSIVRATAVAAGA-----------RIVSP--SDAASLLKAAQARN

Query:  A--IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSS
        A  + +     S   P V     + +    +I  ++ GKL    ++    KP ++   S       +L   +  A L  + S+QR  A   P+   +   
Subjt:  A--IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSS

Query:  EERKMPEPVATPKEEARESEA
              +P    K +AR +EA
Subjt:  EERKMPEPVATPKEEARESEA

AT1G09710.2 Homeodomain-like superfamily protein3.9e-6350Show/hide
Query:  ISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNE
        I +E DI+TLL RY + TIL +L+E++  SE ++DW+ LVK T+TGI+NAREYQ+LWRHL+YRH LL   D    PLDDDSD++ E+E+ P+V+ E+  E
Subjt:  ISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNE

Query:  AAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPW
        A A VKV+ A+ + SESDI   S VEAPLTI I  +  + S+   E+P S    + M +  P+ +Q+         STE  + NG+AG + A R++RK W
Subjt:  AAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPW

Query:  SKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFAL
        S  ED EL AAV++CGEGNWA+I+KGDF+G RTASQLSQRW++IRK RC+ +   +  G Q ++A++   H ALS AL
Subjt:  SKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFAL

AT1G58220.1 Homeodomain-like superfamily protein8.4e-5838.62Show/hide
Query:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE
        M++    +K+    SE DI+TLL+RY   TIL LL+E+A  +E +++W++LVK TSTGI++AREYQ+LWRHLAYR +L+  +      LDDDSD++ E+E
Subjt:  MIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIE

Query:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGN
        + P V+ + + EA A VKV+ A+ +PSESDIP  S VEAPLTI I  S   R   E   S    + M +  P+ + +         + E  + NG A  +
Subjt:  SFPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGN

Query:  AASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK----
         A RKRRK WS  ED ELIAAV++ GEG+WA I K +F+G RTASQLSQRW  IR+R    N  +  TG Q ++AQ+ A +RALS A+   + + K    
Subjt:  AASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK----

Query:  -TEYSNINSCIISSASGAEAPVQMQNQ---SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQAR
         T   +  +   + A+GA +   +Q Q    PQI   SR     P+  +  P   T+ N     ST  +D +V A +VAA A +   + A ++ K    +
Subjt:  -TEYSNINSCIISSASGAEAPVQMQNQ---SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQAR

Query:  NAIH--------IKSSCASSIKPPVHGNAPIHSDP
        NA+         +K++   S+  P   ++ ++++P
Subjt:  NAIH--------IKSSCASSIKPPVHGNAPIHSDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTCTCTAATGATTGAGAGAAAAGAGAAGCAAAAGAAAGGGATAATTAGTAGTGAGGATGATATTTCCACTCTACTCGAAAGATATTCGGTAAGGACGATA
CTGACATTGCTTCGGGAGGTGGCCCAGGTTTCGGAAGTGAGAATTGATTGGGACAAGTTAGTGAAGAACACGTCAACTGGGATTTCTAATGCTCGAGAATATCAG
ATGTTATGGCGGCATTTGGCTTATCGTCACACGTTACTGGAGAATATGGATTGTCTTACTGGTCCACTGGATGATGATAGTGATTTAGACTTTGAAATAGAGTCT
TTTCCATCAGTGAACAGCGAGTCCTTAAATGAAGCTGCAGCATTTGTGAAGGTACTGATTGCCAATGCTATACCGAGCGAGTCAGATATTCCTAGTAGTTCTGCA
GTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAATCATCTAGAGCCAATCTTGAAAATCCTCAATCTGGTTGTTTGTTGCAAGAGATGTATGTTGCAATT
CCAATTTCCATTCAGAGACAGCCTATTTCTTCTACACCAGCCGTATCAACTGAAGTATTTGATGTGAATGGAGCAGCCGGTGGTAATGCAGCTTCTCGAAAAAGA
AGAAAACCCTGGTCAAAGGCAGAGGATATGGAACTGATTGCTGCTGTTCAGAAGTGTGGTGAAGGGAACTGGGCGAATATCTTGAAAGGAGATTTTAAGGGGAAT
AGAACTGCTTCACAGCTATCTCAGAGGTGGTCCATTATTAGGAAGCGACGCTGTAATTTGAATGTGGGAGCTAACGCCACAGGTACTCAGATATCTAAAGCTCAG
ATTGATGCTACACACCGTGCATTGTCCTTTGCGCTTGATTTGCCCGTGAATAACTCGAAAACAGAGTATTCAAACATAAACAGTTGCATTATCTCTTCTGCAAGT
GGTGCCGAAGCTCCGGTTCAAATGCAGAATCAGTCTCCTCAGATTCCCAAGCCTTCAAGGCCAGTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAACCTGGAATC
GACACTTCCAAGAATGCATTGATGATGAAGTCTACTCACAATTCCGATTCTATAGTTCGAGCGACCGCAGTTGCTGCGGGAGCCCGCATCGTTTCTCCATCAGAT
GCTGCATCTCTACTGAAGGCTGCACAGGCAAGAAATGCCATTCACATAAAGTCCAGTTGTGCCTCTTCGATCAAACCACCTGTGCATGGTAATGCACCAATTCAC
TCGGATCCACGCCCTAACATACATTACATTTCCACGGGAAAATTGGCATCTCCAGGTTCGAACTATGTCGGTGGTAAACCGAATGTGGTATGTAATAACTCAGTG
AAGGCCATCTCGCCAATTGTTCTGCATCATCGTTCCACTTCTGCTATTTTGATGAATGTGCAGTCAGACCAAAGAAGCCCAGCAACCGAGTCTCCGTCGAAGCGA
GAGATTAAGAGTTCAGAAGAACGCAAAATGCCTGAGCCGGTTGCAACTCCGAAAGAGGAGGCTCGAGAATCCGAAGCTGTTCGGGGAGGAAACTTTGCTACCGAA
AGATCAGATGGGGAATTGAGAAGTCTTTCAACTTGCATTGAGAATCACAATGGTTCAAATACGGAGATAGATGAAAATGGTATTAAAGCAGGATGTTCCGAGCAG
GTCGAAACGAAAAAGAACGCAAATGATGTCGAGATTAGGGGTTCACCAGACACTCAAGGAAGCAAGAATGAATCCGTAGCAGATATTGTGGGCCATAGCACACAA
AATGAACAGATGAACTTGAAAGCATATATTAGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTCTCTAATGATTGAGAGAAAAGAGAAGCAAAAGAAAGGGATAATTAGTAGTGAGGATGATATTTCCACTCTACTCGAAAGATATTCGGTAAGGACGATA
CTGACATTGCTTCGGGAGGTGGCCCAGGTTTCGGAAGTGAGAATTGATTGGGACAAGTTAGTGAAGAACACGTCAACTGGGATTTCTAATGCTCGAGAATATCAG
ATGTTATGGCGGCATTTGGCTTATCGTCACACGTTACTGGAGAATATGGATTGTCTTACTGGTCCACTGGATGATGATAGTGATTTAGACTTTGAAATAGAGTCT
TTTCCATCAGTGAACAGCGAGTCCTTAAATGAAGCTGCAGCATTTGTGAAGGTACTGATTGCCAATGCTATACCGAGCGAGTCAGATATTCCTAGTAGTTCTGCA
GTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAATCATCTAGAGCCAATCTTGAAAATCCTCAATCTGGTTGTTTGTTGCAAGAGATGTATGTTGCAATT
CCAATTTCCATTCAGAGACAGCCTATTTCTTCTACACCAGCCGTATCAACTGAAGTATTTGATGTGAATGGAGCAGCCGGTGGTAATGCAGCTTCTCGAAAAAGA
AGAAAACCCTGGTCAAAGGCAGAGGATATGGAACTGATTGCTGCTGTTCAGAAGTGTGGTGAAGGGAACTGGGCGAATATCTTGAAAGGAGATTTTAAGGGGAAT
AGAACTGCTTCACAGCTATCTCAGAGGTGGTCCATTATTAGGAAGCGACGCTGTAATTTGAATGTGGGAGCTAACGCCACAGGTACTCAGATATCTAAAGCTCAG
ATTGATGCTACACACCGTGCATTGTCCTTTGCGCTTGATTTGCCCGTGAATAACTCGAAAACAGAGTATTCAAACATAAACAGTTGCATTATCTCTTCTGCAAGT
GGTGCCGAAGCTCCGGTTCAAATGCAGAATCAGTCTCCTCAGATTCCCAAGCCTTCAAGGCCAGTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAACCTGGAATC
GACACTTCCAAGAATGCATTGATGATGAAGTCTACTCACAATTCCGATTCTATAGTTCGAGCGACCGCAGTTGCTGCGGGAGCCCGCATCGTTTCTCCATCAGAT
GCTGCATCTCTACTGAAGGCTGCACAGGCAAGAAATGCCATTCACATAAAGTCCAGTTGTGCCTCTTCGATCAAACCACCTGTGCATGGTAATGCACCAATTCAC
TCGGATCCACGCCCTAACATACATTACATTTCCACGGGAAAATTGGCATCTCCAGGTTCGAACTATGTCGGTGGTAAACCGAATGTGGTATGTAATAACTCAGTG
AAGGCCATCTCGCCAATTGTTCTGCATCATCGTTCCACTTCTGCTATTTTGATGAATGTGCAGTCAGACCAAAGAAGCCCAGCAACCGAGTCTCCGTCGAAGCGA
GAGATTAAGAGTTCAGAAGAACGCAAAATGCCTGAGCCGGTTGCAACTCCGAAAGAGGAGGCTCGAGAATCCGAAGCTGTTCGGGGAGGAAACTTTGCTACCGAA
AGATCAGATGGGGAATTGAGAAGTCTTTCAACTTGCATTGAGAATCACAATGGTTCAAATACGGAGATAGATGAAAATGGTATTAAAGCAGGATGTTCCGAGCAG
GTCGAAACGAAAAAGAACGCAAATGATGTCGAGATTAGGGGTTCACCAGACACTCAAGGAAGCAAGAATGAATCCGTAGCAGATATTGTGGGCCATAGCACACAA
AATGAACAGATGAACTTGAAAGCATATATTAGAAGGTGA
Protein sequenceShow/hide protein sequence
MGSLMIERKEKQKKGIISSEDDISTLLERYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIES
FPSVNSESLNEAAAFVKVLIANAIPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKR
RKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTEYSNINSCIISSAS
GAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIH
SDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHHRSTSAILMNVQSDQRSPATESPSKREIKSSEERKMPEPVATPKEEARESEAVRGGNFATE
RSDGELRSLSTCIENHNGSNTEIDENGIKAGCSEQVETKKNANDVEIRGSPDTQGSKNESVADIVGHSTQNEQMNLKAYIRR