; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008350 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008350
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationscaffold4:744847..750062
RNA-Seq ExpressionMS008350
SyntenyMS008350
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591699.1 hypothetical protein SDJN03_14045, partial [Cucurbita argyrosperma subsp. sororia]3.1e-19376.17Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE FPSV++ESLNEAAA VKVLIAN 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ
        IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGAAGGNAASRKRRKPWSK ED+EL+AAV+
Subjt:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ

Query:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-ENSNINSCIISSASGAEAPVQMQNQ
        K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNNSK+  NSN+NS  +SS SGAEAPVQ+QNQ
Subjt:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-ENSNINSCIISSASGAEAPVQMQNQ

Query:  SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPN
        SPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIV ATAVAAGARIVSPSDAASL+KAAQ +NAIHIKS C SSI+PPV GNA  H D RP+
Subjt:  SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPN

Query:  IHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAV
        +HYISTG+ A+PGSNYVGGK  +  NNS+K +SP   +N ST A+L N  S+Q SP TESP K+E+KSSEE K+ +P+ TPK + RE+  V
Subjt:  IHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAV

KAG7024581.1 hypothetical protein SDJN02_13399, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-19275.76Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE FPSV++ESLNEAAA VKVLIAN 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ
        IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGAAG NAASRKRRKPWSK ED+EL+AAV+
Subjt:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ

Query:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-ENSNINSCIISSASGAEAPVQMQNQ
        K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALD PVNNSK+  NSN+NS  +SS SGAEAPVQ+QNQ
Subjt:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-ENSNINSCIISSASGAEAPVQMQNQ

Query:  SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPN
        SPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAIHIKS C SSI+PP+ GNA  H D RP+
Subjt:  SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPN

Query:  IHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAV
        +HYISTG+ A+PGSNYVGGK  +  NNS+K +SP   +N ST A+L N  S+Q SP TESP K+E+KSSEE K+ +P+ TPK + RE+  V
Subjt:  IHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAV

XP_022137164.1 uncharacterized protein LOC111008703 [Momordica charantia]8.7e-26599.39Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQK
        IPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQK
Subjt:  IPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQK

Query:  CGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQSP
        CGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTE SNINSCIISSASGAEAPVQMQNQSP
Subjt:  CGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQSP

Query:  QIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIH
        QIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIH
Subjt:  QIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIH

Query:  YISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAVRG
        YISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLH+RSTSAILMNVQSDQRSPATESPSKREIKSSEE KMPEPVATPKEEARESEAVRG
Subjt:  YISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAVRG

XP_022937359.1 uncharacterized protein LOC111443670 isoform X1 [Cucurbita moschata]2.6e-19276.18Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE FPSV++ESLNEAAA VKVLIAN 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ
        IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGAAG NAASRKRRKPWSK ED+EL+AAV+
Subjt:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ

Query:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-ENSNINSCIISSASGAEAPVQMQNQ
        K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNNSK+  NSN+NS  +SS SGAEAPVQ+QNQ
Subjt:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-ENSNINSCIISSASGAEAPVQMQNQ

Query:  SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPN
        SPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAIHIKS C SSI+PP+ GNA  H D RP+
Subjt:  SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPN

Query:  IHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARE
        +HYISTG+ A+PG+NYVGGK  +  NNS+K +SP   +N ST A+L N  S+Q SP TESP K+E+KSSEE K+ +P+ TPK + RE
Subjt:  IHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARE

XP_022937362.1 uncharacterized protein LOC111443670 isoform X2 [Cucurbita moschata]1.1e-19376.34Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE FPSV++ESLNEAAA VKVLIAN 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ
        IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGAAG NAASRKRRKPWSK ED+EL+AAV+
Subjt:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ

Query:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQS
        K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNNSK+ NSN+NS  +SS SGAEAPVQ+QNQS
Subjt:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQS

Query:  PQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNI
        PQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAIHIKS C SSI+PP+ GNA  H D RP++
Subjt:  PQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNI

Query:  HYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARE
        HYISTG+ A+PG+NYVGGK  +  NNS+K +SP   +N ST A+L N  S+Q SP TESP K+E+KSSEE K+ +P+ TPK + RE
Subjt:  HYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARE

TrEMBL top hitse value%identityAlignment
A0A6J1C5S4 uncharacterized protein LOC1110087034.2e-26599.39Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQK
        IPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQK
Subjt:  IPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQK

Query:  CGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQSP
        CGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTE SNINSCIISSASGAEAPVQMQNQSP
Subjt:  CGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQSP

Query:  QIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIH
        QIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIH
Subjt:  QIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIH

Query:  YISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAVRG
        YISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLH+RSTSAILMNVQSDQRSPATESPSKREIKSSEE KMPEPVATPKEEARESEAVRG
Subjt:  YISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAVRG

A0A6J1FAZ9 uncharacterized protein LOC111443670 isoform X25.1e-19476.34Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE FPSV++ESLNEAAA VKVLIAN 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ
        IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGAAG NAASRKRRKPWSK ED+EL+AAV+
Subjt:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ

Query:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQS
        K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNNSK+ NSN+NS  +SS SGAEAPVQ+QNQS
Subjt:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQS

Query:  PQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNI
        PQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAIHIKS C SSI+PP+ GNA  H D RP++
Subjt:  PQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNI

Query:  HYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARE
        HYISTG+ A+PG+NYVGGK  +  NNS+K +SP   +N ST A+L N  S+Q SP TESP K+E+KSSEE K+ +P+ TPK + RE
Subjt:  HYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARE

A0A6J1FGE2 uncharacterized protein LOC111443670 isoform X11.3e-19276.18Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE FPSV++ESLNEAAA VKVLIAN 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ
        IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGAAG NAASRKRRKPWSK ED+EL+AAV+
Subjt:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ

Query:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-ENSNINSCIISSASGAEAPVQMQNQ
        K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNNSK+  NSN+NS  +SS SGAEAPVQ+QNQ
Subjt:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-ENSNINSCIISSASGAEAPVQMQNQ

Query:  SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPN
        SPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAIHIKS C SSI+PP+ GNA  H D RP+
Subjt:  SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPN

Query:  IHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARE
        +HYISTG+ A+PG+NYVGGK  +  NNS+K +SP   +N ST A+L N  S+Q SP TESP K+E+KSSEE K+ +P+ TPK + RE
Subjt:  IHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARE

A0A6J1IGI9 uncharacterized protein LOC111476736 isoform X23.4e-19074.49Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE FPSV++ESLNEAAA VKVLIAN 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ
        IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISI+RQP+  TP+ +TEVFDVNGAAG NAASRKRRKPWSK +D+EL+AAV+
Subjt:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ

Query:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQS
        K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVN SK+ NSN+NS  +SS SGAEAPVQ+QNQS
Subjt:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQS

Query:  PQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNI
        PQ+  P RP+ V+PLP A K GI+T KN LMMKSTHNSDSIVRATAVAAGARIVSP DAASL+K AQ +NAIHIKS C SSI+PPV GNA  H D +P++
Subjt:  PQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNI

Query:  HYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAV
        HYISTG+ A+PGSNYVGGK  +  NNS+K ++P   +N ST A+L N  S+Q SP TESP K+E+KSSEECK+ +P+ T K+++RE+  V
Subjt:  HYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAV

A0A6J1IN48 uncharacterized protein LOC111476736 isoform X18.4e-18974.34Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDFEIE FPSV++ESLNEAAA VKVLIAN 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ
        IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISI+RQP+  TP+ +TEVFDVNGAAG NAASRKRRKPWSK +D+EL+AAV+
Subjt:  IPSESDIPSSSAVEAPLTIGI-SNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQ

Query:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-ENSNINSCIISSASGAEAPVQMQNQ
        K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVN SK+  NSN+NS  +SS SGAEAPVQ+QNQ
Subjt:  KCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKT-ENSNINSCIISSASGAEAPVQMQNQ

Query:  SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPN
        SPQ+  P RP+ V+PLP A K GI+T KN LMMKSTHNSDSIVRATAVAAGARIVSP DAASL+K AQ +NAIHIKS C SSI+PPV GNA  H D +P+
Subjt:  SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPN

Query:  IHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAV
        +HYISTG+ A+PGSNYVGGK  +  NNS+K ++P   +N ST A+L N  S+Q SP TESP K+E+KSSEECK+ +P+ T K+++RE+  V
Subjt:  IHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G09710.1 Homeodomain-like superfamily protein3.6e-5937.5Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RY + TIL +L+E++  SE ++DW+ LVK T+TGI+NAREYQ+LWRHL+YRH LL   D    PLDDDSD++ E+E+ P+V+ E+  EA A VKV+ A+ 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAV
        + SESDI   S VEAPLTI I  +  + S+   E+P S    + M +  P+ +Q+         STE  + NG+AG + A R++RK WS  ED EL AAV
Subjt:  IPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAV

Query:  QKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFAL-DLPVNNSKTEN--SNINSCIISSASGAEAPVQM
        ++CGEGNWA+I+KGDF+G RTASQLSQRW++IRK RC+ +   +  G Q ++A++   H ALS AL + P +N          +SC I+     EA    
Subjt:  QKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFAL-DLPVNNSKTEN--SNINSCIISSASGAEAPVQM

Query:  QNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMK----STHNSDSIVRATAVAAGA-----------RIVSP--SDAASLLKAAQARNA--IHIKSS
         +Q  Q  KP    +V+ LP A    +  +K+ ++ K    ST  SD +V A +VAA A           R V P  +DA  + K    ++A  + +   
Subjt:  QNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMK----STHNSDSIVRATAVAAGA-----------RIVSP--SDAASLLKAAQARNA--IHIKSS

Query:  CASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPV
          S   P V     + +    +I  ++ GKL    ++    KP ++   S       +L   +  A L  + S+QR  A   P+   +     CK   P 
Subjt:  CASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPV

Query:  ATPKEEARESEA
           K +AR +EA
Subjt:  ATPKEEARESEA

AT1G09710.2 Homeodomain-like superfamily protein4.9e-5635.53Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RY + TIL +L+E++  SE ++DW+ LVK T+TGI+NAREYQ+LWRHL+YRH LL   D    PLDDDSD++ E+E+ P+V+ E+  EA A VKV+ A+ 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAV
        + SESDI   S VEAPLTI I  +  + S+   E+P S    + M +  P+ +Q+         STE  + NG+AG + A R++RK WS  ED EL AAV
Subjt:  IPSESDIPSSSAVEAPLTIGISNS--QSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAV

Query:  QKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFAL-------DLPVNNSKTENSNINSCI--ISSASGA
        ++CGEGNWA+I+KGDF+G RTASQLSQRW++IRK RC+ +   +  G Q ++A++   H ALS AL        L +  S   +   NS I  I+  +  
Subjt:  QKCGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFAL-------DLPVNNSKTENSNINSCI--ISSASGA

Query:  EAPVQMQNQ----------------------------SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMK----STHNSDSIVRATAVAAGA-------
          P+   NQ                            S Q  + S+P+ V+ LP A    +  +K+ ++ K    ST  SD +V A +VAA A       
Subjt:  EAPVQMQNQ----------------------------SPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMK----STHNSDSIVRATAVAAGA-------

Query:  ----RIVSP--SDAASLLKAAQARNA--IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSA
            R V P  +DA  + K    ++A  + +     S   P V     + +    +I  ++ GKL    ++    KP ++   S       +L   +  A
Subjt:  ----RIVSP--SDAASLLKAAQARNA--IHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHNRSTSA

Query:  ILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEA
         L  + S+QR  A   P+   +     CK   P    K +AR +EA
Subjt:  ILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEA

AT1G58220.1 Homeodomain-like superfamily protein3.0e-5338.2Show/hide
Query:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA
        RY   TIL LL+E+A  +E +++W++LVK TSTGI++AREYQ+LWRHLAYR +L+  +      LDDDSD++ E+E+ P V+ + + EA A VKV+ A+ 
Subjt:  RYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANA

Query:  IPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQK
        +PSESDIP  S VEAPLTI I  S   R   E   S    + M +  P+ + +         + E  + NG A  + A RKRRK WS  ED ELIAAV++
Subjt:  IPSESDIPSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQK

Query:  CGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK--------TENSNINSCIISSASGAEAP
         GEG+WA I K +F+G RTASQLSQRW  IR+R    N  +  TG Q ++AQ+ A +RALS A+   + + K          +  I     + AS     
Subjt:  CGEGNWANILKGDFKGNRTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSK--------TENSNINSCIISSASGAEAP

Query:  VQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIH--------IKSSCASSIKPP
           Q   PQI   SR     P+  +  P   T+ N     ST  +D +V A +VAA A +   + A ++ K    +NA+         +K++   S+  P
Subjt:  VQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKNALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIH--------IKSSCASSIKPP

Query:  VHGNAPIHSDP
           ++ ++++P
Subjt:  VHGNAPIHSDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTGTGGGGCAGATATTCGGTAAGGACGATACTGACATTGCTTCGGGAGGTGGCCCAGGTTTCGGAAGTGAGAATTGATTGGGACAAGTTAGTGAAGAACACGTCAACTGG
GATTTCTAATGCTCGAGAATATCAGATGTTATGGCGGCATTTGGCTTATCGTCACACGTTACTGGAGAATATGGATTGTCTTACTGGTCCACTGGATGATGATAGTGATT
TAGACTTTGAAATAGAGTCTTTTCCATCAGTGAACAGCGAGTCCTTAAATGAAGCTGCAGCATTTGTGAAGGTACTGATTGCCAATGCTATACCGAGCGAGTCAGATATT
CCTAGTAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAATCGTCTAGAGCCAATCTTGAAAATCCTCAATCTGGTTGTTTGTTGCAAGAGATGTA
TGTTGCAATTCCAATTTCCATTCAGAGACAGCCTATTTCTTCTACACCAGCCGTATCAACTGAAGTATTTGATGTGAATGGAGCAGCCGGTGGTAATGCAGCTTCTCGAA
AAAGAAGAAAACCCTGGTCAAAGGCAGAGGATATGGAACTGATTGCTGCTGTTCAGAAGTGTGGTGAAGGGAACTGGGCGAATATCTTGAAAGGAGATTTTAAGGGGAAT
AGAACTGCTTCACAGCTATCTCAGAGGTGGTCCATTATTAGGAAGCGACGCTGTAATTTGAATGTGGGAGCTAACGCCACAGGTACTCAGATATCTAAAGCTCAGATTGA
TGCTACACACCGTGCATTGTCCTTTGCGCTTGATTTGCCCGTGAATAACTCGAAAACAGAGAATTCAAACATAAACAGTTGCATTATCTCTTCTGCAAGTGGTGCCGAAG
CTCCGGTTCAAATGCAGAATCAGTCTCCTCAGATTCCCAAGCCTTCAAGGCCAGTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAACCTGGAATCGACACTTCCAAGAAT
GCATTGATGATGAAGTCTACTCACAATTCCGATTCTATAGTTCGAGCGACCGCAGTTGCTGCGGGAGCCCGCATCGTTTCTCCATCAGATGCTGCATCTCTACTGAAGGC
TGCACAGGCAAGAAATGCCATTCACATAAAGTCCAGTTGTGCCTCTTCGATCAAACCACCTGTGCATGGTAATGCACCAATTCACTCGGATCCACGCCCTAACATACATT
ACATTTCCACGGGAAAATTGGCATCTCCAGGTTCGAACTATGTCGGTGGTAAACCGAATGTGGTATGTAATAACTCAGTGAAGGCCATCTCGCCAATTGTTCTGCATAAT
CGTTCCACTTCTGCTATTTTGATGAATGTGCAGTCAGACCAAAGAAGCCCAGCAACCGAGTCTCCGTCGAAGCGAGAGATTAAGAGTTCAGAAGAATGCAAAATGCCTGA
GCCGGTTGCAACTCCGAAAGAGGAGGCTCGAGAATCCGAAGCTGTTCGGGGA
mRNA sequenceShow/hide mRNA sequence
TTGTGGGGCAGATATTCGGTAAGGACGATACTGACATTGCTTCGGGAGGTGGCCCAGGTTTCGGAAGTGAGAATTGATTGGGACAAGTTAGTGAAGAACACGTCAACTGG
GATTTCTAATGCTCGAGAATATCAGATGTTATGGCGGCATTTGGCTTATCGTCACACGTTACTGGAGAATATGGATTGTCTTACTGGTCCACTGGATGATGATAGTGATT
TAGACTTTGAAATAGAGTCTTTTCCATCAGTGAACAGCGAGTCCTTAAATGAAGCTGCAGCATTTGTGAAGGTACTGATTGCCAATGCTATACCGAGCGAGTCAGATATT
CCTAGTAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAATCGTCTAGAGCCAATCTTGAAAATCCTCAATCTGGTTGTTTGTTGCAAGAGATGTA
TGTTGCAATTCCAATTTCCATTCAGAGACAGCCTATTTCTTCTACACCAGCCGTATCAACTGAAGTATTTGATGTGAATGGAGCAGCCGGTGGTAATGCAGCTTCTCGAA
AAAGAAGAAAACCCTGGTCAAAGGCAGAGGATATGGAACTGATTGCTGCTGTTCAGAAGTGTGGTGAAGGGAACTGGGCGAATATCTTGAAAGGAGATTTTAAGGGGAAT
AGAACTGCTTCACAGCTATCTCAGAGGTGGTCCATTATTAGGAAGCGACGCTGTAATTTGAATGTGGGAGCTAACGCCACAGGTACTCAGATATCTAAAGCTCAGATTGA
TGCTACACACCGTGCATTGTCCTTTGCGCTTGATTTGCCCGTGAATAACTCGAAAACAGAGAATTCAAACATAAACAGTTGCATTATCTCTTCTGCAAGTGGTGCCGAAG
CTCCGGTTCAAATGCAGAATCAGTCTCCTCAGATTCCCAAGCCTTCAAGGCCAGTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAACCTGGAATCGACACTTCCAAGAAT
GCATTGATGATGAAGTCTACTCACAATTCCGATTCTATAGTTCGAGCGACCGCAGTTGCTGCGGGAGCCCGCATCGTTTCTCCATCAGATGCTGCATCTCTACTGAAGGC
TGCACAGGCAAGAAATGCCATTCACATAAAGTCCAGTTGTGCCTCTTCGATCAAACCACCTGTGCATGGTAATGCACCAATTCACTCGGATCCACGCCCTAACATACATT
ACATTTCCACGGGAAAATTGGCATCTCCAGGTTCGAACTATGTCGGTGGTAAACCGAATGTGGTATGTAATAACTCAGTGAAGGCCATCTCGCCAATTGTTCTGCATAAT
CGTTCCACTTCTGCTATTTTGATGAATGTGCAGTCAGACCAAAGAAGCCCAGCAACCGAGTCTCCGTCGAAGCGAGAGATTAAGAGTTCAGAAGAATGCAAAATGCCTGA
GCCGGTTGCAACTCCGAAAGAGGAGGCTCGAGAATCCGAAGCTGTTCGGGGA
Protein sequenceShow/hide protein sequence
LWGRYSVRTILTLLREVAQVSEVRIDWDKLVKNTSTGISNAREYQMLWRHLAYRHTLLENMDCLTGPLDDDSDLDFEIESFPSVNSESLNEAAAFVKVLIANAIPSESDI
PSSSAVEAPLTIGISNSQSSRANLENPQSGCLLQEMYVAIPISIQRQPISSTPAVSTEVFDVNGAAGGNAASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGN
RTASQLSQRWSIIRKRRCNLNVGANATGTQISKAQIDATHRALSFALDLPVNNSKTENSNINSCIISSASGAEAPVQMQNQSPQIPKPSRPVLVEPLPSAVKPGIDTSKN
ALMMKSTHNSDSIVRATAVAAGARIVSPSDAASLLKAAQARNAIHIKSSCASSIKPPVHGNAPIHSDPRPNIHYISTGKLASPGSNYVGGKPNVVCNNSVKAISPIVLHN
RSTSAILMNVQSDQRSPATESPSKREIKSSEECKMPEPVATPKEEARESEAVRG