; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027924 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027924
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationtig00153056:1423796..1434102
RNA-Seq ExpressionSgr027924
SyntenySgr027924
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591699.1 hypothetical protein SDJN03_14045, partial [Cucurbita argyrosperma subsp. sororia]1.3e-21974.87Show/hide
Query:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP
        MIE KEK+KKGTIS ED S +LE YSVRTI TLLREVA VSEVRIDW+KLVKNTSTGI+N REYQ+LWRHLAYRHTLLEN+D VTD LD DSDLDFE+EP
Subjt:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP

Query:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA
        FPSVS ESLNEAA+CVKVLIANGIPSESD+P+SS VEAPLTIGI SNS+S RA+LENPQSACLMQG+YVT+PISIQRQP+ TPS+TEVFD +GAAGGN A
Subjt:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA

Query:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKS-ASSN
        SRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKGDRTASQLSQRWSIIRKR GNL+VGAN TS QISKAQIDAAHRALS ALDLPVNNSKS A+SN
Subjt:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKS-ASSN

Query:  ISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNC
        ++SS VSS SGAEAPVQ+QNQS Q+ +P+RP+ V+PL SAAKSG+NT+KN+L  KSTHNSDS+V ATAVAAGARI+SPSDAASL+KAAQTKNAIHIKS C
Subjt:  ISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNC

Query:  TSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIS
         SS + PV+G+A  HLDA  SVHYISTG+ AT GSNYVGGK  M  NNS++ VSPK  +N ST A+L NP S+QISP TESP KQEV S EE KIS+PI 
Subjt:  TSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIS

Query:  TQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEI
        T + +  E+  V+ D           ASQ SD E  S STCIEN+N+SLNM+ID+NDIKA C KQ E KK ANDV+I
Subjt:  TQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEI

KAG7024581.1 hypothetical protein SDJN02_13399, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-21974.57Show/hide
Query:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP
        MIE KEK+KKGTIS ED S +LE YSVRTI TLLREVA VSEVRIDW+KLVKNTSTGI+N REYQ+LWRHLAYRHTLLEN+D VTD LD DSDLDFE+EP
Subjt:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP

Query:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA
        FPSVS ESLNEAA+CVKVLIANGIPSESD+P+SS VEAPLTIGI SNS+S RA+LENPQSACLMQG+YVT+PISIQRQP+ TPS+TEVFD +GAAG N A
Subjt:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA

Query:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKS-ASSN
        SRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKGDRTASQLSQRWSIIRKR GNL+VGAN TS QISKAQIDAAHRALS ALD PVNNSKS A+SN
Subjt:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKS-ASSN

Query:  ISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNC
        ++SS VSS SGAEAPVQ+QNQS Q+ +P+RP+ V+PL SAAKSG+NT+KN+L  KSTHNSDS+VRATAVAAGARI+SPSDAASL+KAAQTKNAIHIKS C
Subjt:  ISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNC

Query:  TSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIS
         SS + P++G+A  HLDA  SVHYISTG+ AT GSNYVGGK  M  NNS++ VSPK  +N ST A+L NP S+QISP TESP KQEV S EE KIS+PI 
Subjt:  TSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIS

Query:  TQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEIR
        T + +  E+  V+ D           ASQ SD E  S STCIEN+N+SLNMEI++NDIKA C KQ E KK ANDV+IR
Subjt:  TQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEIR

XP_022137164.1 uncharacterized protein LOC111008703 [Momordica charantia]6.2e-24679.15Show/hide
Query:  MIERKEKRKKGTI-SEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELE
        MIERKEK+KKG I SE+D+STLLE YSVRTILTLLREVAQVSEVRIDW+KLVKNTSTGI+NAREYQMLWRHLAYRHTLLENMDC+T  LDDDSDLDFE+E
Subjt:  MIERKEKRKKGTI-SEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELE

Query:  PFPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNSQSSRANLENPQSACLMQGIYVTIPISIQRQPI-STPS-STEVFD-DGAAGGN
         FPSV+ ESLNEAA+ VKVLIAN IPSESDIP+SSAVEAPLTIGISNSQSSRANLENPQS CL+Q +YV IPISIQRQPI STP+ STEVFD +GAAGGN
Subjt:  PFPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNSQSSRANLENPQSACLMQGIYVTIPISIQRQPI-STPS-STEVFD-DGAAGGN

Query:  VASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASS
         ASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKG+RTASQLSQRWSIIRKRR NL+VGANAT  QISKAQIDA HRALSFALDLPVNNSK+  S
Subjt:  VASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASS

Query:  NISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSN
        NI+S I+SSASGAEAPVQMQNQS QIP P+RPVLVEPL SA K G++TSKN+L  KSTHNSDS+VRATAVAAGARI+SPSDAASLLKAAQ +NAIHIKS+
Subjt:  NISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSN

Query:  CTSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPI
        C SS K PV G+AP+H D   ++HYISTGK+A+ GSNYVGGKPN+VCNNSV+ +SP VLH+RSTSAILMN  SDQ SPATESPSK+E+ S EE K+ EP+
Subjt:  CTSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPI

Query:  STQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEIRESPDIQGSKNESVADNHGNHS
        +T +EE  E EAV G          N A++RSDGE RSLSTCIEN N S N EID+N IKA CS+QVE KKNANDVEIR SPD QGSKNESVAD  G HS
Subjt:  STQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEIRESPDIQGSKNESVADNHGNHS

Query:  TNNEQLNLK
        T NEQ+NLK
Subjt:  TNNEQLNLK

XP_023534835.1 uncharacterized protein LOC111796458 isoform X1 [Cucurbita pepo subsp. pepo]9.4e-21874.22Show/hide
Query:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP
        MIE KEK KKGTIS ED S +LE YSVRTI TLLREVA VSEVRIDW+KLVKNTSTGI+N REYQ+LWRHLAYRHTLLEN+D VTD LD DSDLDFE+EP
Subjt:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP

Query:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA
        FPSVS ESLNEAA+CVKVLIANGIPSESD+P+SS VEAPLTIGI SNS+  RA+LENPQSACLMQG+YVT PIS+QRQP+ TPS+TEVFD +GAAGGN A
Subjt:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA

Query:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKS-ASSN
        SRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKGDRTASQLSQRWSIIRKRRG L VGAN TS Q SKAQIDAAHRALS ALDLPVNNSKS A+SN
Subjt:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKS-ASSN

Query:  ISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNC
        ++SS VSS SGAEAPVQ+QNQS Q+ +P RP+ V+PL SAAKSG+NT+KN+L  KSTHNSDS+VRATAVAAGARI+SPSDAASL+KAAQTKNAIHIKS C
Subjt:  ISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNC

Query:  TSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIS
         SS + PV+G+A  HLDA  SVHYISTG+ AT GSNYVGGK  M    S++ VSPK  +N ST A+  NP S+QISP TESP KQEV S EECKIS+PI 
Subjt:  TSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIS

Query:  TQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEIR
        T +++  E+  V+ D           ASQ SD E  S STCIEN+N+SLNMEID+NDIKA C KQ E K  ANDV+IR
Subjt:  TQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEIR

XP_023534838.1 uncharacterized protein LOC111796458 isoform X2 [Cucurbita pepo subsp. pepo]3.8e-21974.35Show/hide
Query:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP
        MIE KEK KKGTIS ED S +LE YSVRTI TLLREVA VSEVRIDW+KLVKNTSTGI+N REYQ+LWRHLAYRHTLLEN+D VTD LD DSDLDFE+EP
Subjt:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP

Query:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA
        FPSVS ESLNEAA+CVKVLIANGIPSESD+P+SS VEAPLTIGI SNS+  RA+LENPQSACLMQG+YVT PIS+QRQP+ TPS+TEVFD +GAAGGN A
Subjt:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA

Query:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASSNI
        SRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKGDRTASQLSQRWSIIRKRRG L VGAN TS Q SKAQIDAAHRALS ALDLPVNNSKSA+SN+
Subjt:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASSNI

Query:  SSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNCT
        +SS VSS SGAEAPVQ+QNQS Q+ +P RP+ V+PL SAAKSG+NT+KN+L  KSTHNSDS+VRATAVAAGARI+SPSDAASL+KAAQTKNAIHIKS C 
Subjt:  SSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNCT

Query:  SSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIST
        SS + PV+G+A  HLDA  SVHYISTG+ AT GSNYVGGK  M    S++ VSPK  +N ST A+  NP S+QISP TESP KQEV S EECKIS+PI T
Subjt:  SSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIST

Query:  QREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEIR
         +++  E+  V+ D           ASQ SD E  S STCIEN+N+SLNMEID+NDIKA C KQ E K  ANDV+IR
Subjt:  QREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEIR

TrEMBL top hitse value%identityAlignment
A0A6J1C5S4 uncharacterized protein LOC1110087033.0e-24679.15Show/hide
Query:  MIERKEKRKKGTI-SEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELE
        MIERKEK+KKG I SE+D+STLLE YSVRTILTLLREVAQVSEVRIDW+KLVKNTSTGI+NAREYQMLWRHLAYRHTLLENMDC+T  LDDDSDLDFE+E
Subjt:  MIERKEKRKKGTI-SEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELE

Query:  PFPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNSQSSRANLENPQSACLMQGIYVTIPISIQRQPI-STPS-STEVFD-DGAAGGN
         FPSV+ ESLNEAA+ VKVLIAN IPSESDIP+SSAVEAPLTIGISNSQSSRANLENPQS CL+Q +YV IPISIQRQPI STP+ STEVFD +GAAGGN
Subjt:  PFPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNSQSSRANLENPQSACLMQGIYVTIPISIQRQPI-STPS-STEVFD-DGAAGGN

Query:  VASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASS
         ASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKG+RTASQLSQRWSIIRKRR NL+VGANAT  QISKAQIDA HRALSFALDLPVNNSK+  S
Subjt:  VASRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASS

Query:  NISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSN
        NI+S I+SSASGAEAPVQMQNQS QIP P+RPVLVEPL SA K G++TSKN+L  KSTHNSDS+VRATAVAAGARI+SPSDAASLLKAAQ +NAIHIKS+
Subjt:  NISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSN

Query:  CTSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPI
        C SS K PV G+AP+H D   ++HYISTGK+A+ GSNYVGGKPN+VCNNSV+ +SP VLH+RSTSAILMN  SDQ SPATESPSK+E+ S EE K+ EP+
Subjt:  CTSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPI

Query:  STQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEIRESPDIQGSKNESVADNHGNHS
        +T +EE  E EAV G          N A++RSDGE RSLSTCIEN N S N EID+N IKA CS+QVE KKNANDVEIR SPD QGSKNESVAD  G HS
Subjt:  STQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATCSKQVEMKKNANDVEIRESPDIQGSKNESVADNHGNHS

Query:  TNNEQLNLK
        T NEQ+NLK
Subjt:  TNNEQLNLK

A0A6J1FAZ9 uncharacterized protein LOC111443670 isoform X22.4e-20377.32Show/hide
Query:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP
        MIE KEK+KKGTIS ED S +LE YSVRTI TLLREVA VSEVRIDW+KLVKNTSTGI+N REYQ+LWRHLAYRHTLLEN+D VTD LD DSDLDFE+EP
Subjt:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP

Query:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA
        FPSVS ESLNEAA+CVKVLIANGIPSESD+P+SS VEAPLTIGI SNS+S RA+LENPQSACLMQG+YVT+PISIQRQP+ TPS+TEVFD +GAAG N A
Subjt:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA

Query:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASSNI
        SRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKGDRTASQLSQRWSIIRKR GNL+VGAN TS QISKAQIDAAHRALS ALDLPVNNSKSA+SN+
Subjt:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASSNI

Query:  SSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNCT
        +SS VSS SGAEAPVQ+QNQS Q+ +P+RP+ V+PL SAAKSG+NT+KN+L  KSTHNSDS+VRATAVAAGARI+SPSDAASL+KAAQTKNAIHIKS C 
Subjt:  SSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNCT

Query:  SSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIST
        SS + P++G+A  HLDA  SVHYISTG+ AT G+NYVGGK  M  NNS++ VSPK  +N ST A+L NP S+QISP TESP KQEV S EE KIS+PI T
Subjt:  SSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIST

Query:  QREETHE
         + +  E
Subjt:  QREETHE

A0A6J1FGE2 uncharacterized protein LOC111443670 isoform X16.0e-20277.17Show/hide
Query:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP
        MIE KEK+KKGTIS ED S +LE YSVRTI TLLREVA VSEVRIDW+KLVKNTSTGI+N REYQ+LWRHLAYRHTLLEN+D VTD LD DSDLDFE+EP
Subjt:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP

Query:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA
        FPSVS ESLNEAA+CVKVLIANGIPSESD+P+SS VEAPLTIGI SNS+S RA+LENPQSACLMQG+YVT+PISIQRQP+ TPS+TEVFD +GAAG N A
Subjt:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA

Query:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKS-ASSN
        SRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKGDRTASQLSQRWSIIRKR GNL+VGAN TS QISKAQIDAAHRALS ALDLPVNNSKS A+SN
Subjt:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKS-ASSN

Query:  ISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNC
        ++SS VSS SGAEAPVQ+QNQS Q+ +P+RP+ V+PL SAAKSG+NT+KN+L  KSTHNSDS+VRATAVAAGARI+SPSDAASL+KAAQTKNAIHIKS C
Subjt:  ISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNC

Query:  TSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIS
         SS + P++G+A  HLDA  SVHYISTG+ AT G+NYVGGK  M  NNS++ VSPK  +N ST A+L NP S+QISP TESP KQEV S EE KIS+PI 
Subjt:  TSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIS

Query:  TQREETHE
        T + +  E
Subjt:  TQREETHE

A0A6J1IGI9 uncharacterized protein LOC111476736 isoform X27.5e-21373.98Show/hide
Query:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP
        MIE KEK+KKGTIS ED   +LE YSVRTI TLLREVA VSEVRIDW+KLVKNTSTGI+N REYQ+LWRHLAYRHTLLEN+D VTD LD DSDLDFE+EP
Subjt:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP

Query:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA
        FPSVS ESLNEAA+CVKVLIANGIPSESD+P+SS VEAPLTIGI SNS+S RA+LENPQSACLMQG+YVT+PISI+RQP+ TPS+TEVFD +GAAG N A
Subjt:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA

Query:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASSNI
        SRKRRKPWSK +D+EL+AAV+K GEGNWANILK DFKGDRTASQLSQRWSIIRKR GNL+VGAN TS QISKAQIDAAHRALS ALDLPVN SKSA+SN+
Subjt:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASSNI

Query:  SSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNCT
        +SS VSS SGAEAPVQ+QNQS Q+ +P RP+ V+PL  AAKSG+NT KN+L  KSTHNSDS+VRATAVAAGARI+SP DAASL+K AQTKNAIHIKS C 
Subjt:  SSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNCT

Query:  SSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIST
        SS + PV+G+A  HLDA  SVHYISTG+ AT GSNYVGGK  M  NNS++ V+PK  +N ST A+L NP S+QISP TESP KQEV S EECKIS+PI T
Subjt:  SSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIST

Query:  QREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATC
         ++++ E+  V+ D           AS  SD E  S STCIEN+N+SLNMEID+NDIKA C
Subjt:  QREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATC

A0A6J1IN48 uncharacterized protein LOC111476736 isoform X11.9e-21173.84Show/hide
Query:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP
        MIE KEK+KKGTIS ED   +LE YSVRTI TLLREVA VSEVRIDW+KLVKNTSTGI+N REYQ+LWRHLAYRHTLLEN+D VTD LD DSDLDFE+EP
Subjt:  MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEP

Query:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA
        FPSVS ESLNEAA+CVKVLIANGIPSESD+P+SS VEAPLTIGI SNS+S RA+LENPQSACLMQG+YVT+PISI+RQP+ TPS+TEVFD +GAAG N A
Subjt:  FPSVSGESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGI-SNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFD-DGAAGGNVA

Query:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKS-ASSN
        SRKRRKPWSK +D+EL+AAV+K GEGNWANILK DFKGDRTASQLSQRWSIIRKR GNL+VGAN TS QISKAQIDAAHRALS ALDLPVN SKS A+SN
Subjt:  SRKRRKPWSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKS-ASSN

Query:  ISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNC
        ++SS VSS SGAEAPVQ+QNQS Q+ +P RP+ V+PL  AAKSG+NT KN+L  KSTHNSDS+VRATAVAAGARI+SP DAASL+K AQTKNAIHIKS C
Subjt:  ISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRSAAKSGMNTSKNSL--KSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNC

Query:  TSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIS
         SS + PV+G+A  HLDA  SVHYISTG+ AT GSNYVGGK  M  NNS++ V+PK  +N ST A+L NP S+QISP TESP KQEV S EECKIS+PI 
Subjt:  TSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMVCNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPIS

Query:  TQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATC
        T ++++ E+  V+ D           AS  SD E  S STCIEN+N+SLNMEID+NDIKA C
Subjt:  TQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDDNDIKATC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G09710.1 Homeodomain-like superfamily protein6.6e-6043.2Show/hide
Query:  RKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEPFPSVSGE
        R+K  I+E D++TLL  Y + TIL +L+E++  SE ++DW  LVK T+TGI NAREYQ+LWRHL+YRH LL   D     LDDDSD++ ELE  P+VS E
Subjt:  RKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEPFPSVSGE

Query:  SLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNS--QSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFDDGAAGGNVASRKRRKP
        +  EA + VKV+ A+ + SESDI + S VEAPLTI I  +  + S+   E+P S+   +G+ +  P+ +Q+      S+  +  +G+AG ++A R++RK 
Subjt:  SLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNS--QSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFDDGAAGGNVASRKRRKP

Query:  WSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASSNISSSIVSS
        WS  ED EL AAV++CGEGNWA+I+KGDF+G+RTASQLSQRW++IRK R +     +    Q ++A++ A + ALS AL     ++K A   + ++   +
Subjt:  WSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASSNISSSIVSS

Query:  ASGAEAPVQMQNQSSQIPLPTRPVLVEPLRS--AAKSGMNTSKNSLKSTHNSDSMVRATAVAAGARIISPSDAAS
         +  EA     +Q  Q   P    L     S  AAKS +   K +  ST  SD MV A +VAA A +     AAS
Subjt:  ASGAEAPVQMQNQSSQIPLPTRPVLVEPLRS--AAKSGMNTSKNSLKSTHNSDSMVRATAVAAGARIISPSDAAS

AT1G09710.2 Homeodomain-like superfamily protein3.1e-5739.52Show/hide
Query:  RKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEPFPSVSGE
        R+K  I+E D++TLL  Y + TIL +L+E++  SE ++DW  LVK T+TGI NAREYQ+LWRHL+YRH LL   D     LDDDSD++ ELE  P+VS E
Subjt:  RKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEPFPSVSGE

Query:  SLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNS--QSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFDDGAAGGNVASRKRRKP
        +  EA + VKV+ A+ + SESDI + S VEAPLTI I  +  + S+   E+P S+   +G+ +  P+ +Q+      S+  +  +G+AG ++A R++RK 
Subjt:  SLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNS--QSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFDDGAAGGNVASRKRRKP

Query:  WSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALD---------LPVNNSKSASS
        WS  ED EL AAV++CGEGNWA+I+KGDF+G+RTASQLSQRW++IRK R +     +    Q ++A++ A + ALS AL          +  ++ +S  +
Subjt:  WSKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALD---------LPVNNSKSASS

Query:  NISSSIVSSASGAEAPVQMQNQ----------------------------SSQIPLPTRPVLVEPLRS-----AAKSGMNTSKNSLKSTHNSDSMVRATA
        N S  +++  +    P+   NQ                            SSQ    ++P++    R+     AAKS +   K +  ST  SD MV A +
Subjt:  NISSSIVSSASGAEAPVQMQNQ----------------------------SSQIPLPTRPVLVEPLRS-----AAKSGMNTSKNSLKSTHNSDSMVRATA

Query:  VAAGARIISPSDAAS
        VAA A +     AAS
Subjt:  VAAGARIISPSDAAS

AT1G58220.1 Homeodomain-like superfamily protein1.9e-5942.27Show/hide
Query:  KRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEPFPSVSG
        K++K  ISE D++TLL+ Y   TIL LL+E+A  +E +++W +LVK TSTGI +AREYQ+LWRHLAYR +L+  +      LDDDSD++ ELE  P VS 
Subjt:  KRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEPFPSVSG

Query:  ESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFDDGAAGGNVASRKRRKPW
        + + EA + VKV+ A+ +PSESDIP  S VEAPLTI I  S   R   E   S    +G+ +T P+ +       P + E  +      ++A RKRRK W
Subjt:  ESLNEAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFDDGAAGGNVASRKRRKPW

Query:  SKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSA---SSNISSSIV
        S  ED ELIAAV++ GEG+WA I K +F+G+RTASQLSQRW  IR+R    +  +  T  Q ++AQ+ AA+RALS A+   + + K A   +  +SS  +
Subjt:  SKAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSA---SSNISSSIV

Query:  SSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRS--AAKSGMNTSKNSLKSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAI
          A    A      Q  Q P P    L     S   AKS +   K +  ST  +D MV A +VAA A +   + A ++ K    KNA+
Subjt:  SSASGAEAPVQMQNQSSQIPLPTRPVLVEPLRS--AAKSGMNTSKNSLKSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGAGAGAAAAGAGAAGCGAAAGAAAGGGACAATTAGTGAGGAAGACATGTCCACTCTATTGGAAAGTTATTCTGTAAGGACGATACTGACATTGCTTCGGGAGGT
GGCCCAGGTTTCGGAAGTGAGAATCGATTGGGAGAAGTTGGTGAAGAACACGTCAACTGGGATTGCTAATGCTCGAGAATATCAGATGTTATGGCGGCATTTGGCTTATC
GTCACACGTTACTGGAGAATATGGATTGTGTTACTGATTCACTGGATGATGATAGTGATTTAGATTTTGAATTAGAACCTTTTCCATCTGTCAGCGGTGAGTCCTTGAAT
GAAGCTGCATCATGTGTGAAGGTACTGATAGCTAATGGTATACCAAGCGAGTCAGATATTCCAAATAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAG
TCAATCATCTAGAGCCAATCTTGAAAATCCTCAATCTGCTTGTTTGATGCAAGGGATATATGTTACAATTCCAATTTCCATTCAGAGGCAGCCTATTTCTACGCCATCAT
CAACTGAAGTATTTGATGATGGAGCAGCTGGAGGTAATGTAGCTTCTCGAAAAAGAAGAAAACCCTGGTCAAAGGCAGAGGATATGGAATTGATTGCTGCTGTGCAGAAG
TGTGGTGAAGGGAACTGGGCGAATATCTTGAAAGGAGACTTCAAGGGGGATAGAACTGCTTCACAGCTTTCTCAGAGGTGGTCCATTATTAGGAAGCGACGGGGTAATTT
GGATGTGGGAGCTAACGCCACAAGTGCTCAGATATCTAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCCTGTGAATAACTCAAAATCAG
CAAGTTCAAATATTAGCAGCAGCATTGTCTCTTCTGCAAGTGGTGCTGAAGCTCCAGTTCAAATGCAGAATCAGTCTTCCCAGATTCCCTTGCCTACAAGGCCGGTGCTA
GTAGAGCCTTTACGTTCAGCAGCGAAATCTGGAATGAACACTTCCAAGAATTCATTGAAGTCTACTCACAATTCTGATTCTATGGTTAGAGCTACAGCAGTAGCCGCTGG
GGCCCGAATTATTTCTCCATCAGATGCTGCGTCTCTACTGAAGGCTGCACAGACAAAAAATGCTATCCACATAAAGTCCAATTGCACTTCCTCGAACAAATCACCTGTGA
TTGGTAGTGCACCTATGCACTTGGATGCACACCACAGTGTACATTATATTTCCACTGGAAAAATGGCAACTTCAGGCTCGAATTATGTGGGTGGTAAACCTAATATGGTA
TGTAATAACTCAGTGCAGACTGTTTCACCAAAAGTTCTGCATAATCGTTCTACTTCTGCTATTTTGATGAACCCGCTGTCAGACCAAATAAGCCCAGCAACTGAGTCTCC
ATCGAAGCAAGAGGTTAACAGTTTAGAAGAATGCAAAATTTCTGAGCCAATCTCAACCCAAAGAGAGGAGACTCATGAAGATGAAGCTGTTTTAGGAGACACAAATTGCA
ACAAAGCTCAGGAAGAAAACTTAGCTTCACAAAGATCAGATGGGGAATTCAGAAGTCTTTCAACTTGTATTGAGAATCGCAATAGTTCATTGAATATGGAGATAGATGAC
AATGATATAAAAGCAACATGTTCCAAGCAGGTCGAAATGAAAAAGAATGCAAATGATGTTGAGATTAGGGAGTCCCCAGACATACAAGGAAGCAAGAACGAATCTGTAGC
AGATAATCATGGGAACCATAGCACAAACAATGAACAGCTGAACTTGAAAGGAAATAGTAGTAGGAGAGGGGTGAAGCAGGACACCAGTGATCCGGTAGTGACCTTGGCAG
TTGAAGAGAGCCAGATGGAAACAAATAGTCCCTCAGCACTGAAGTATACTCCTCTAGTAAAAGCCTTTCTCTATCTTCTAATCCTTTCAAGAACAGCTGCCTCAAGGATT
GGCCATCACATCTTGTACGAACTCAGTGTTTTCAGGATCCTCATCCAACCTCACGCTTCTTGCTCATGGTCTCAGAACTTTCCATCAGAAGCTCCTTGTCCTCTTCCAGT
TGCTGGACATTTGCTTGGAGCACACCTGAGCTCTATATCTTGTCCATCTTCTCTGGTGATAGCGAACTGTTGGTCTGAGATTTTCAATTCTGGTTCAGTGCTTAAACCGT
CATCCGTTTTGTCCTGTTCCTCTTGCAACTTAGTCAGAGTTTCTTGGCAGGATTTTAGAGCCGCTGTTGCCATTAAAGTTCGAGCTTCATTATCTTCAATAACAGTACCA
ATGCCGAACTCGTCCTGCAAGTTACTGACTCTTTTTTGCATCGATGTGATACTGCTTTCGATATCCCAATATTTCTCGCACTCGCGTTCATACAAGCTCTTCACGAATTC
CATTTCGGTTTGTCGAGCCAAAATTTCTTTCTGTAGCATATCAATTTCTTCTAAGGCCTCGGTTTTGGATTATTGGCAGAAAAAGATTCTCGAGGAAAAAAATCTACTTC
GCAGTCGTCGTCATCGATCGAGTAATGAACTCGTTCGGGAAAGATTGAAGCTATTGTTCGATTTGCGCCTTGAAACTCTTTTGATAGGTGATCATATCTCTCCGCTAAAG
CTCGCTCTCTTTGCAAAAGAATCCCCATCGCCTTCAATGATTCTCATCATGTTATCAACCTTCTCCTCCATGTCTGCAGCACCAAAAGCTTATGCAAATCAGTCCAAAGA
TCATAATCGCACGAGATTCAAAAATTCATTTGAAGAAGATGATATCGTCGTCGATGATAACGATGAAAGAGAAAGCAAAAAAGAAAAAAAGAAACGGCGCCAGCGAGCAA
TGGAGATGAGGAGGACGTTGAGAGATGAGATCGCAGGCAGCAGCAGCCTCCAGTCATGGAGGAGAGGATACAGCGGCTTGTTTATAGGTGTTGGCAATGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGATTGAGAGAAAAGAGAAGCGAAAGAAAGGGACAATTAGTGAGGAAGACATGTCCACTCTATTGGAAAGTTATTCTGTAAGGACGATACTGACATTGCTTCGGGAGGT
GGCCCAGGTTTCGGAAGTGAGAATCGATTGGGAGAAGTTGGTGAAGAACACGTCAACTGGGATTGCTAATGCTCGAGAATATCAGATGTTATGGCGGCATTTGGCTTATC
GTCACACGTTACTGGAGAATATGGATTGTGTTACTGATTCACTGGATGATGATAGTGATTTAGATTTTGAATTAGAACCTTTTCCATCTGTCAGCGGTGAGTCCTTGAAT
GAAGCTGCATCATGTGTGAAGGTACTGATAGCTAATGGTATACCAAGCGAGTCAGATATTCCAAATAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAG
TCAATCATCTAGAGCCAATCTTGAAAATCCTCAATCTGCTTGTTTGATGCAAGGGATATATGTTACAATTCCAATTTCCATTCAGAGGCAGCCTATTTCTACGCCATCAT
CAACTGAAGTATTTGATGATGGAGCAGCTGGAGGTAATGTAGCTTCTCGAAAAAGAAGAAAACCCTGGTCAAAGGCAGAGGATATGGAATTGATTGCTGCTGTGCAGAAG
TGTGGTGAAGGGAACTGGGCGAATATCTTGAAAGGAGACTTCAAGGGGGATAGAACTGCTTCACAGCTTTCTCAGAGGTGGTCCATTATTAGGAAGCGACGGGGTAATTT
GGATGTGGGAGCTAACGCCACAAGTGCTCAGATATCTAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCCTGTGAATAACTCAAAATCAG
CAAGTTCAAATATTAGCAGCAGCATTGTCTCTTCTGCAAGTGGTGCTGAAGCTCCAGTTCAAATGCAGAATCAGTCTTCCCAGATTCCCTTGCCTACAAGGCCGGTGCTA
GTAGAGCCTTTACGTTCAGCAGCGAAATCTGGAATGAACACTTCCAAGAATTCATTGAAGTCTACTCACAATTCTGATTCTATGGTTAGAGCTACAGCAGTAGCCGCTGG
GGCCCGAATTATTTCTCCATCAGATGCTGCGTCTCTACTGAAGGCTGCACAGACAAAAAATGCTATCCACATAAAGTCCAATTGCACTTCCTCGAACAAATCACCTGTGA
TTGGTAGTGCACCTATGCACTTGGATGCACACCACAGTGTACATTATATTTCCACTGGAAAAATGGCAACTTCAGGCTCGAATTATGTGGGTGGTAAACCTAATATGGTA
TGTAATAACTCAGTGCAGACTGTTTCACCAAAAGTTCTGCATAATCGTTCTACTTCTGCTATTTTGATGAACCCGCTGTCAGACCAAATAAGCCCAGCAACTGAGTCTCC
ATCGAAGCAAGAGGTTAACAGTTTAGAAGAATGCAAAATTTCTGAGCCAATCTCAACCCAAAGAGAGGAGACTCATGAAGATGAAGCTGTTTTAGGAGACACAAATTGCA
ACAAAGCTCAGGAAGAAAACTTAGCTTCACAAAGATCAGATGGGGAATTCAGAAGTCTTTCAACTTGTATTGAGAATCGCAATAGTTCATTGAATATGGAGATAGATGAC
AATGATATAAAAGCAACATGTTCCAAGCAGGTCGAAATGAAAAAGAATGCAAATGATGTTGAGATTAGGGAGTCCCCAGACATACAAGGAAGCAAGAACGAATCTGTAGC
AGATAATCATGGGAACCATAGCACAAACAATGAACAGCTGAACTTGAAAGGAAATAGTAGTAGGAGAGGGGTGAAGCAGGACACCAGTGATCCGGTAGTGACCTTGGCAG
TTGAAGAGAGCCAGATGGAAACAAATAGTCCCTCAGCACTGAAGTATACTCCTCTAGTAAAAGCCTTTCTCTATCTTCTAATCCTTTCAAGAACAGCTGCCTCAAGGATT
GGCCATCACATCTTGTACGAACTCAGTGTTTTCAGGATCCTCATCCAACCTCACGCTTCTTGCTCATGGTCTCAGAACTTTCCATCAGAAGCTCCTTGTCCTCTTCCAGT
TGCTGGACATTTGCTTGGAGCACACCTGAGCTCTATATCTTGTCCATCTTCTCTGGTGATAGCGAACTGTTGGTCTGAGATTTTCAATTCTGGTTCAGTGCTTAAACCGT
CATCCGTTTTGTCCTGTTCCTCTTGCAACTTAGTCAGAGTTTCTTGGCAGGATTTTAGAGCCGCTGTTGCCATTAAAGTTCGAGCTTCATTATCTTCAATAACAGTACCA
ATGCCGAACTCGTCCTGCAAGTTACTGACTCTTTTTTGCATCGATGTGATACTGCTTTCGATATCCCAATATTTCTCGCACTCGCGTTCATACAAGCTCTTCACGAATTC
CATTTCGGTTTGTCGAGCCAAAATTTCTTTCTGTAGCATATCAATTTCTTCTAAGGCCTCGGTTTTGGATTATTGGCAGAAAAAGATTCTCGAGGAAAAAAATCTACTTC
GCAGTCGTCGTCATCGATCGAGTAATGAACTCGTTCGGGAAAGATTGAAGCTATTGTTCGATTTGCGCCTTGAAACTCTTTTGATAGGTGATCATATCTCTCCGCTAAAG
CTCGCTCTCTTTGCAAAAGAATCCCCATCGCCTTCAATGATTCTCATCATGTTATCAACCTTCTCCTCCATGTCTGCAGCACCAAAAGCTTATGCAAATCAGTCCAAAGA
TCATAATCGCACGAGATTCAAAAATTCATTTGAAGAAGATGATATCGTCGTCGATGATAACGATGAAAGAGAAAGCAAAAAAGAAAAAAAGAAACGGCGCCAGCGAGCAA
TGGAGATGAGGAGGACGTTGAGAGATGAGATCGCAGGCAGCAGCAGCCTCCAGTCATGGAGGAGAGGATACAGCGGCTTGTTTATAGGTGTTGGCAATGGATGA
Protein sequenceShow/hide protein sequence
MIERKEKRKKGTISEEDMSTLLESYSVRTILTLLREVAQVSEVRIDWEKLVKNTSTGIANAREYQMLWRHLAYRHTLLENMDCVTDSLDDDSDLDFELEPFPSVSGESLN
EAASCVKVLIANGIPSESDIPNSSAVEAPLTIGISNSQSSRANLENPQSACLMQGIYVTIPISIQRQPISTPSSTEVFDDGAAGGNVASRKRRKPWSKAEDMELIAAVQK
CGEGNWANILKGDFKGDRTASQLSQRWSIIRKRRGNLDVGANATSAQISKAQIDAAHRALSFALDLPVNNSKSASSNISSSIVSSASGAEAPVQMQNQSSQIPLPTRPVL
VEPLRSAAKSGMNTSKNSLKSTHNSDSMVRATAVAAGARIISPSDAASLLKAAQTKNAIHIKSNCTSSNKSPVIGSAPMHLDAHHSVHYISTGKMATSGSNYVGGKPNMV
CNNSVQTVSPKVLHNRSTSAILMNPLSDQISPATESPSKQEVNSLEECKISEPISTQREETHEDEAVLGDTNCNKAQEENLASQRSDGEFRSLSTCIENRNSSLNMEIDD
NDIKATCSKQVEMKKNANDVEIRESPDIQGSKNESVADNHGNHSTNNEQLNLKGNSSRRGVKQDTSDPVVTLAVEESQMETNSPSALKYTPLVKAFLYLLILSRTAASRI
GHHILYELSVFRILIQPHASCSWSQNFPSEAPCPLPVAGHLLGAHLSSISCPSSLVIANCWSEIFNSGSVLKPSSVLSCSSCNLVRVSWQDFRAAVAIKVRASLSSITVP
MPNSSCKLLTLFCIDVILLSISQYFSHSRSYKLFTNSISVCRAKISFCSISISSKASVLDYWQKKILEEKNLLRSRRHRSSNELVRERLKLLFDLRLETLLIGDHISPLK
LALFAKESPSPSMILIMLSTFSSMSAAPKAYANQSKDHNRTRFKNSFEEDDIVVDDNDERESKKEKKKRRQRAMEMRRTLRDEIAGSSSLQSWRRGYSGLFIGVGNG