; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G005810 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G005810
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationCmo_Chr09:2844245..2849874
RNA-Seq ExpressionCmoCh09G005810
SyntenyCmoCh09G005810
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591699.1 hypothetical protein SDJN03_14045, partial [Cucurbita argyrosperma subsp. sororia]2.4e-27299.01Show/hide
Query:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
        MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
Subjt:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP

Query:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
        FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAG NAA
Subjt:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS-ANSN
        SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS ANSN
Subjt:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS-ANSN

Query:  MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC
        MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIV ATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC
Subjt:  MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC

Query:  VSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT
        VSSIQPP+LGNASTHLDARPSVHYISTGRTATPG+NYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT
Subjt:  VSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT

Query:  PKGDFRE
        PKGDFRE
Subjt:  PKGDFRE

KAG7024581.1 hypothetical protein SDJN02_13399, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-28299.24Show/hide
Query:  LLLLFRFFSCDFITGSIMIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDS
        LLLLFRFFSCDFITGSIMIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDS
Subjt:  LLLLFRFFSCDFITGSIMIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDS

Query:  VTDPLDYDSDLDFEIEPFPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTP
        VTDPLDYDSDLDFEIEPFPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTP
Subjt:  VTDPLDYDSDLDFEIEPFPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTP

Query:  SATEVFDVNGAAGSNAASRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALS
        SATEVFDVNGAAGSNAASRKRRKPWSK EDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALS
Subjt:  SATEVFDVNGAAGSNAASRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALS

Query:  LALDLPVNNSKS-ANSNMNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAAS
        LALD PVNNSKS ANSNMNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAAS
Subjt:  LALDLPVNNSKS-ANSNMNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAAS

Query:  LMKAAQTKNAIHIKSKCVSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLK
        LMKAAQTKNAIHIKSKCVSSIQPPMLGNASTHLDARPSVHYISTGRTATPG+NYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLK
Subjt:  LMKAAQTKNAIHIKSKCVSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLK

Query:  QEVKSSEEGKISKPIITPKGDFRE
        QEVKSSEEGKISKPIITPKGDFRE
Subjt:  QEVKSSEEGKISKPIITPKGDFRE

XP_022937359.1 uncharacterized protein LOC111443670 isoform X1 [Cucurbita moschata]1.5e-27499.8Show/hide
Query:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
        MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
Subjt:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP

Query:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
        FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS-ANSN
        SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS ANSN
Subjt:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS-ANSN

Query:  MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC
        MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC
Subjt:  MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC

Query:  VSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT
        VSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT
Subjt:  VSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT

Query:  PKGDFRE
        PKGDFRE
Subjt:  PKGDFRE

XP_022937362.1 uncharacterized protein LOC111443670 isoform X2 [Cucurbita moschata]6.0e-276100Show/hide
Query:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
        MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
Subjt:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP

Query:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
        FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNM
        SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNM
Subjt:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNM

Query:  NSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV
        NSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV
Subjt:  NSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV

Query:  SSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITP
        SSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITP
Subjt:  SSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITP

Query:  KGDFRE
        KGDFRE
Subjt:  KGDFRE

XP_023534838.1 uncharacterized protein LOC111796458 isoform X2 [Cucurbita pepo subsp. pepo]1.2e-26396.25Show/hide
Query:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
        MIE+KEK KKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
Subjt:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP

Query:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
        FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSR FRASLENPQSACLMQGMYVT PIS+QRQPLPTPSATEVFDVNGAAG NAA
Subjt:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNM
        SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKR G L+VGANTTSTQ SKAQIDAAHRALSLALDLPVNNSKSANSNM
Subjt:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNM

Query:  NSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV
        NSSTVSSTSGAEAPVQIQNQSPQVLVP RPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV
Subjt:  NSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV

Query:  SSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITP
        SSIQPP+LGNASTHLDARPSVHYISTGRTATPG+NYVGGKSTMAG  SMKYVSPKAPYNCSTAV TNPPSNQISPTTESPLKQEVKSSEE KISKPIIT 
Subjt:  SSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITP

Query:  KGDFRE
        K DFRE
Subjt:  KGDFRE

TrEMBL top hitse value%identityAlignment
A0A6J1C5S4 uncharacterized protein LOC1110087031.6e-20275.63Show/hide
Query:  GSIMIEMKEKQKKGTISNEDS-SAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDF
        GS+MIE KEKQKKG IS+ED  S +LERYSVRTI TLLREVA VSEVRIDWDKLVKNTSTGISN REYQ+LWRHLAYRHTLLEN+D +T PLD DSDLDF
Subjt:  GSIMIEMKEKQKKGTISNEDS-SAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDF

Query:  EIEPFPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPL-PTPS-ATEVFDVNGA
        EIE FPSV++ESLNEAAA VKVLIAN IPSESD+PSSS VEAPLTIGI SNS+S RA+LENPQS CL+Q MYV +PISIQRQP+  TP+ +TEVFDVNGA
Subjt:  EIEPFPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPL-PTPS-ATEVFDVNGA

Query:  AGSNAASRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSK
        AG NAASRKRRKPWSK ED+EL+AAV+K GEGNWANILK DFKG+RTASQLSQRWSIIRKR  NLNVGAN T TQISKAQIDA HRALS ALDLPVNNSK
Subjt:  AGSNAASRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSK

Query:  SANSNMNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIH
        +  SN+NS  +SS SGAEAPVQ+QNQSPQ+  PSRP+ V+PLPSA K GI+T+KN LMMKSTHNSDSIVRATAVAAGARIVSPSDAASL+KAAQ +NAIH
Subjt:  SANSNMNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIH

Query:  IKSKCVSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCST-AVLTNPPSNQISPTTESPLKQEVKSSEEGKI
        IKS C SSI+PP+ GNA  H D RP++HYISTG+ A+PG+NYVGGK  +  NNS+K +SP   ++ ST A+L N  S+Q SP TESP K+E+KSSEE K+
Subjt:  IKSKCVSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCST-AVLTNPPSNQISPTTESPLKQEVKSSEEGKI

Query:  SKPIITPKGDFRE
         +P+ TPK + RE
Subjt:  SKPIITPKGDFRE

A0A6J1FAZ9 uncharacterized protein LOC111443670 isoform X22.9e-276100Show/hide
Query:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
        MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
Subjt:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP

Query:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
        FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNM
        SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNM
Subjt:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNM

Query:  NSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV
        NSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV
Subjt:  NSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV

Query:  SSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITP
        SSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITP
Subjt:  SSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITP

Query:  KGDFRE
        KGDFRE
Subjt:  KGDFRE

A0A6J1FGE2 uncharacterized protein LOC111443670 isoform X17.2e-27599.8Show/hide
Query:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
        MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
Subjt:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP

Query:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
        FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS-ANSN
        SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS ANSN
Subjt:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS-ANSN

Query:  MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC
        MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC
Subjt:  MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC

Query:  VSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT
        VSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT
Subjt:  VSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT

Query:  PKGDFRE
        PKGDFRE
Subjt:  PKGDFRE

A0A6J1IGI9 uncharacterized protein LOC111476736 isoform X22.8e-26396.44Show/hide
Query:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
        MIEMKEKQKKGTISNEDS AVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
Subjt:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP

Query:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
        FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISI+RQPLPTPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNM
        SRKRRKPWSKT+DLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVN SKSANSNM
Subjt:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNM

Query:  NSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV
        NSSTVSSTSGAEAPVQIQNQSPQVLVP RPLQVKPLP AAKSGINT KNTLMMKSTHNSDSIVRATAVAAGARIVSP DAASLMK AQTKNAIHIKSKCV
Subjt:  NSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCV

Query:  SSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITP
        SSIQPP+LGNASTHLDA+PSVHYISTGRTATPG+NYVGGKSTMAGNNSMKYV+PKAPYNCSTAVLTN PSNQISPTTESPLKQEVKSSEE KISKPIIT 
Subjt:  SSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITP

Query:  KGDFRE
        K D RE
Subjt:  KGDFRE

A0A6J1IN48 uncharacterized protein LOC111476736 isoform X16.9e-26296.25Show/hide
Query:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
        MIEMKEKQKKGTISNEDS AVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP
Subjt:  MIEMKEKQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEP

Query:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA
        FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISI+RQPLPTPSATEVFDVNGAAGSNAA
Subjt:  FPSVSNESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAA

Query:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS-ANSN
        SRKRRKPWSKT+DLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVN SKS ANSN
Subjt:  SRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKS-ANSN

Query:  MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC
        MNSSTVSSTSGAEAPVQIQNQSPQVLVP RPLQVKPLP AAKSGINT KNTLMMKSTHNSDSIVRATAVAAGARIVSP DAASLMK AQTKNAIHIKSKC
Subjt:  MNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKC

Query:  VSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT
        VSSIQPP+LGNASTHLDA+PSVHYISTGRTATPG+NYVGGKSTMAGNNSMKYV+PKAPYNCSTAVLTN PSNQISPTTESPLKQEVKSSEE KISKPIIT
Subjt:  VSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKAPYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIIT

Query:  PKGDFRE
         K D RE
Subjt:  PKGDFRE

SwissProt top hitse value%identityAlignment
O35144 Telomeric repeat-binding factor 26.7e-0433.82Show/hide
Query:  EVFDVNGAAGSNAASRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRK
        ++F+V       ++S  R++ W+  E   +   V KYGEGNWA I K+    +RTA  +  RW  ++K
Subjt:  EVFDVNGAAGSNAASRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRK

P70371 Telomeric repeat-binding factor 13.0e-0440Show/hide
Query:  ASRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRK
        + R++R+ W   ED  L   V+KYGEGNWA IL      +RT+  L  RW  +++
Subjt:  ASRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRK

Arabidopsis top hitse value%identityAlignment
AT1G09710.1 Homeodomain-like superfamily protein5.8e-5941.86Show/hide
Query:  QKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEPFPSVSNE
        ++K  I+  D + +L RY + TI  +L+E+++ SE ++DW+ LVK T+TGI+N REYQLLWRHL+YRH LL   D    PLD DSD++ E+E  P+VS+E
Subjt:  QKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEPFPSVSNE

Query:  SLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAASRKRRKP
        +  EA A VKV+ A+ + SESD+   S VEAPLTI I         S E  +S    +GM +  P+ +Q+      ++TE  + NG+AG + A R++RK 
Subjt:  SLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAASRKRRKP

Query:  WSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNMNSSTVSS
        WS  ED EL AAV++ GEGNWA+I+K DF+G+RTASQLSQRW++IRKR  + +   +    Q ++A++ A + ALSLAL     ++K A   M +++  +
Subjt:  WSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNMNSSTVSS

Query:  TSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMK----STHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTK
         +  EA     +Q  Q    S+P+ V+ LP A  S +  AK+ ++ K    ST  SD +V A +VAA A +     AAS  K    K
Subjt:  TSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMK----STHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTK

AT1G09710.2 Homeodomain-like superfamily protein1.3e-5539.39Show/hide
Query:  QKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEPFPSVSNE
        ++K  I+  D + +L RY + TI  +L+E+++ SE ++DW+ LVK T+TGI+N REYQLLWRHL+YRH LL   D    PLD DSD++ E+E  P+VS+E
Subjt:  QKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEPFPSVSNE

Query:  SLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAASRKRRKP
        +  EA A VKV+ A+ + SESD+   S VEAPLTI I         S E  +S    +GM +  P+ +Q+      ++TE  + NG+AG + A R++RK 
Subjt:  SLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAASRKRRKP

Query:  WSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLAL-------DLPVNNSKSANSNM
        WS  ED EL AAV++ GEGNWA+I+K DF+G+RTASQLSQRW++IRKR  + +   +    Q ++A++ A + ALSLAL        L +  S   +   
Subjt:  WSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLAL-------DLPVNNSKSANSNM

Query:  NSSTVSSTSGAEA--PVQIQNQ----------------------------SPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMK----STHNSDSIVRAT
        NSS    T  A    P+   NQ                            S Q    S+P+ V+ LP A  S +  AK+ ++ K    ST  SD +V A 
Subjt:  NSSTVSSTSGAEA--PVQIQNQ----------------------------SPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMK----STHNSDSIVRAT

Query:  AVAAGARIVSPSDAASLMKAAQTK
        +VAA A +     AAS  K    K
Subjt:  AVAAGARIVSPSDAASLMKAAQTK

AT1G58220.1 Homeodomain-like superfamily protein6.8e-6037.38Show/hide
Query:  KQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEPFPSVSN
        K++K  IS  D + +L+RY   TI  LL+E+A+ +E +++W++LVK TSTGI++ REYQLLWRHLAYR +L+  V +    LD DSD++ E+E  P VS 
Subjt:  KQKKGTISNEDSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEPFPSVSN

Query:  ESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAASRKRRK
        + + EA A VKV+ A+ +PSESD+P  S VEAPLTI I  +    R   E   S    +GM +T P+ +       P A E  + NG A S+ A RKRRK
Subjt:  ESLNEAAACVKVLIANGIPSESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAASRKRRK

Query:  PWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSA---NSNMNSS
         WS  ED EL+AAV+++GEG+WA I K +F+G+RTASQLSQRW  IR+R    N  +  T  Q ++AQ+ AA+RALSLA+   + + K A      ++S 
Subjt:  PWSKTEDLELMAAVEKYGEGNWANILKADFKGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSA---NSNMNSS

Query:  TV--SSTSGAEAPVQIQNQ---SPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHI---
        T+  +  +GA +   +Q Q    PQ+   SR     P+   AKS +   K T    ST  +D +V A +VAA A +   + A ++ K    KNA+     
Subjt:  TV--SSTSGAEAPVQIQNQ---SPQVLVPSRPLQVKPLPSAAKSGINTAKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHI---

Query:  ---KSKCVSSIQPPMLGNASTHLDARPSVHYI------STGRTATPGANYVGGKSTMAG--NNSMKYVSPKA-PYNCSTAVLTNP-PSNQISPTTESPLK
             K  S++  P     S+ L+  P    +      S+G  + P    V   ++ A     S    +PK  P   + +V + P PS  IS     P+K
Subjt:  ---KSKCVSSIQPPMLGNASTHLDARPSVHYI------STGRTATPGANYVGGKSTMAG--NNSMKYVSPKA-PYNCSTAVLTNP-PSNQISPTTESPLK

Query:  QEVKSSEEGKISKPIITPK
            ++   + S  I  PK
Subjt:  QEVKSSEEGKISKPIITPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAATGATAGCACAACTGAACTCAAGGAGAAGTCAATTGGACAAGGAGAAGGCAAGCAGATCGCAACTGAAAAACATAATAGTGATGATGAATTCAAATATTGTAC
TTCTTTTGATGGTGTTCTGCGAGCAACGAAGATTGCAGATCAAGAAGCTTTGCGGCTTCCACCACCGAAGCTTCTTCTTCCTCCGGCTTTTGAGGCTTGCGTGGTTGTCT
GTGCATTCCGGCTTCTCTTACTTTTTCGGTTCTTTTCATGCGATTTCATTACGGGCTCTATAATGATTGAGATGAAAGAGAAGCAAAAGAAAGGAACAATCAGTAACGAA
GATAGTTCCGCCGTGTTGGAAAGATATTCAGTAAGGACGATATTTACTTTGCTTCGCGAGGTGGCTCATGTTTCGGAAGTGAGAATTGATTGGGACAAGTTGGTGAAGAA
CACGTCAACTGGGATTTCTAATGTTCGGGAGTATCAGTTGTTATGGCGGCACTTGGCTTATCGTCATACGTTACTGGAAAACGTGGATTCTGTTACTGATCCATTGGATT
ATGACAGTGACTTGGATTTTGAAATCGAACCTTTTCCATCTGTTAGCAATGAGTCCTTGAATGAAGCTGCGGCATGTGTGAAGGTATTGATTGCTAATGGTATACCAAGT
GAGTCAGATGTTCCAAGTAGTTCTGTAGTTGAGGCCCCATTGACTATCGGTATATCATCGAACAGTCGATCATTTAGAGCCAGTCTTGAAAATCCTCAATCTGCTTGTTT
GATGCAAGGAATGTATGTTACCGTTCCTATTTCGATTCAGAGACAGCCTCTTCCAACACCATCAGCAACTGAAGTATTTGACGTGAATGGAGCAGCTGGTAGCAATGCAG
CTTCTCGAAAAAGAAGAAAACCTTGGTCGAAGACGGAGGATTTGGAATTGATGGCTGCTGTGGAGAAGTATGGTGAAGGGAACTGGGCAAATATCTTGAAAGCAGACTTC
AAGGGGGATAGGACTGCGTCACAGCTATCTCAGAGGTGGTCCATTATTCGGAAGCGACATGGTAATTTGAATGTGGGAGCTAACACCACAAGTACTCAGATATCTAAAGC
TCAGATTGATGCTGCACACCGCGCATTGTCCCTTGCCCTCGATTTGCCTGTGAATAACTCAAAATCAGCAAATTCAAACATGAATAGTAGCACTGTCTCTTCTACAAGTG
GTGCTGAAGCTCCGGTTCAAATACAGAATCAGTCTCCACAGGTTCTCGTGCCTTCACGGCCTCTGCAGGTGAAGCCTTTACCTTCAGCAGCGAAATCAGGAATCAACACT
GCCAAGAATACGTTGATGATGAAGTCTACTCACAATTCTGATTCTATAGTTAGAGCAACTGCAGTAGCTGCAGGGGCCCGGATTGTTTCTCCATCCGATGCTGCGTCTCT
AATGAAAGCTGCACAGACAAAAAATGCCATCCACATAAAATCCAAATGTGTTTCTTCAATCCAGCCACCCATGCTTGGTAATGCATCGACGCACTTGGATGCACGGCCCA
GTGTACATTATATTTCTACCGGAAGAACAGCAACTCCAGGCGCAAACTACGTTGGTGGTAAATCTACAATGGCTGGTAATAACTCGATGAAGTATGTCTCCCCAAAAGCT
CCGTATAATTGTTCTACTGCTGTTTTGACAAACCCACCATCAAATCAAATAAGCCCAACAACTGAGTCTCCATTGAAGCAAGAGGTTAAGAGTTCAGAAGAAGGCAAAAT
TTCCAAGCCAATCATTACTCCGAAAGGCGATTTTCGAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAATAATGATAGCACAACTGAACTCAAGGAGAAGTCAATTGGACAAGGAGAAGGCAAGCAGATCGCAACTGAAAAACATAATAGTGATGATGAATTCAAATATTGTAC
TTCTTTTGATGGTGTTCTGCGAGCAACGAAGATTGCAGATCAAGAAGCTTTGCGGCTTCCACCACCGAAGCTTCTTCTTCCTCCGGCTTTTGAGGCTTGCGTGGTTGTCT
GTGCATTCCGGCTTCTCTTACTTTTTCGGTTCTTTTCATGCGATTTCATTACGGGCTCTATAATGATTGAGATGAAAGAGAAGCAAAAGAAAGGAACAATCAGTAACGAA
GATAGTTCCGCCGTGTTGGAAAGATATTCAGTAAGGACGATATTTACTTTGCTTCGCGAGGTGGCTCATGTTTCGGAAGTGAGAATTGATTGGGACAAGTTGGTGAAGAA
CACGTCAACTGGGATTTCTAATGTTCGGGAGTATCAGTTGTTATGGCGGCACTTGGCTTATCGTCATACGTTACTGGAAAACGTGGATTCTGTTACTGATCCATTGGATT
ATGACAGTGACTTGGATTTTGAAATCGAACCTTTTCCATCTGTTAGCAATGAGTCCTTGAATGAAGCTGCGGCATGTGTGAAGGTATTGATTGCTAATGGTATACCAAGT
GAGTCAGATGTTCCAAGTAGTTCTGTAGTTGAGGCCCCATTGACTATCGGTATATCATCGAACAGTCGATCATTTAGAGCCAGTCTTGAAAATCCTCAATCTGCTTGTTT
GATGCAAGGAATGTATGTTACCGTTCCTATTTCGATTCAGAGACAGCCTCTTCCAACACCATCAGCAACTGAAGTATTTGACGTGAATGGAGCAGCTGGTAGCAATGCAG
CTTCTCGAAAAAGAAGAAAACCTTGGTCGAAGACGGAGGATTTGGAATTGATGGCTGCTGTGGAGAAGTATGGTGAAGGGAACTGGGCAAATATCTTGAAAGCAGACTTC
AAGGGGGATAGGACTGCGTCACAGCTATCTCAGAGGTGGTCCATTATTCGGAAGCGACATGGTAATTTGAATGTGGGAGCTAACACCACAAGTACTCAGATATCTAAAGC
TCAGATTGATGCTGCACACCGCGCATTGTCCCTTGCCCTCGATTTGCCTGTGAATAACTCAAAATCAGCAAATTCAAACATGAATAGTAGCACTGTCTCTTCTACAAGTG
GTGCTGAAGCTCCGGTTCAAATACAGAATCAGTCTCCACAGGTTCTCGTGCCTTCACGGCCTCTGCAGGTGAAGCCTTTACCTTCAGCAGCGAAATCAGGAATCAACACT
GCCAAGAATACGTTGATGATGAAGTCTACTCACAATTCTGATTCTATAGTTAGAGCAACTGCAGTAGCTGCAGGGGCCCGGATTGTTTCTCCATCCGATGCTGCGTCTCT
AATGAAAGCTGCACAGACAAAAAATGCCATCCACATAAAATCCAAATGTGTTTCTTCAATCCAGCCACCCATGCTTGGTAATGCATCGACGCACTTGGATGCACGGCCCA
GTGTACATTATATTTCTACCGGAAGAACAGCAACTCCAGGCGCAAACTACGTTGGTGGTAAATCTACAATGGCTGGTAATAACTCGATGAAGTATGTCTCCCCAAAAGCT
CCGTATAATTGTTCTACTGCTGTTTTGACAAACCCACCATCAAATCAAATAAGCCCAACAACTGAGTCTCCATTGAAGCAAGAGGTTAAGAGTTCAGAAGAAGGCAAAAT
TTCCAAGCCAATCATTACTCCGAAAGGCGATTTTCGAGAATAGAACTGTCGTAAGAGATGTCTTTGCTTCACAAATATCAGATTGGGAATCGGGAAGTCGTTCAACTTGC
ATTGAGAATCAAAATACTTCTTTGAATATGGAGATAGATGAAAATGATATTAAAGCAGCATGCCCCAAGCAGGACGAAAATAAAAAGAAGGCAAATGATGTCAAGATTAG
GGGGTGACCAAATGTAGAAGGAAACAAGAACAAATCTATACCATATAATCATGGGAGACATAGCACAAACAACGAACAGCAGAAGTCGACATTGCAGTAGGTAGAAGCAC
AAAAATTGTAGGTACCTTTGTACAAATGATGAACAGCCAAGCTCGACGAGCATGGTATTAGCAGGTGAATGCACAGAAACATCAGTAAGTGTAAGTAGTGGGTTATACAG
CAACAAAGAATTCCAGGTTTTCTTGCTAGAGAATGAAC
Protein sequenceShow/hide protein sequence
MNNDSTTELKEKSIGQGEGKQIATEKHNSDDEFKYCTSFDGVLRATKIADQEALRLPPPKLLLPPAFEACVVVCAFRLLLLFRFFSCDFITGSIMIEMKEKQKKGTISNE
DSSAVLERYSVRTIFTLLREVAHVSEVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENVDSVTDPLDYDSDLDFEIEPFPSVSNESLNEAAACVKVLIANGIPS
ESDVPSSSVVEAPLTIGISSNSRSFRASLENPQSACLMQGMYVTVPISIQRQPLPTPSATEVFDVNGAAGSNAASRKRRKPWSKTEDLELMAAVEKYGEGNWANILKADF
KGDRTASQLSQRWSIIRKRHGNLNVGANTTSTQISKAQIDAAHRALSLALDLPVNNSKSANSNMNSSTVSSTSGAEAPVQIQNQSPQVLVPSRPLQVKPLPSAAKSGINT
AKNTLMMKSTHNSDSIVRATAVAAGARIVSPSDAASLMKAAQTKNAIHIKSKCVSSIQPPMLGNASTHLDARPSVHYISTGRTATPGANYVGGKSTMAGNNSMKYVSPKA
PYNCSTAVLTNPPSNQISPTTESPLKQEVKSSEEGKISKPIITPKGDFRE