; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G009440 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G009440
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationCmo_Chr07:4528108..4533834
RNA-Seq ExpressionCmoCh07G009440
SyntenyCmoCh07G009440
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595236.1 Telomere repeat-binding protein 4, partial [Cucurbita argyrosperma subsp. sororia]1.2e-27795.28Show/hide
Query:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
        MDQEAHFC+KFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
Subjt:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE

Query:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
        RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
Subjt:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG

Query:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
        SICDTILDNRSIHKFNTNESYIEN DLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
Subjt:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS

Query:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
         PKSEWHCGTSVPVQSRSQRRHPKKHVPVS                     GFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
Subjt:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE

Query:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
        YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
Subjt:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF

Query:  NWGRKKDE
        NWGRKKDE
Subjt:  NWGRKKDE

KAG7027258.1 Telomere repeat-binding protein 4 [Cucurbita argyrosperma subsp. argyrosperma]7.1e-27895.28Show/hide
Query:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
        MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTY EVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
Subjt:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE

Query:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
        RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
Subjt:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG

Query:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
        SICDTILDNRSIHKFNTNESYIEN DLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
Subjt:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS

Query:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
        TPKSEWHCGTSVPVQSRSQRRHPKKHVPVS                     GFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
Subjt:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE

Query:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
        YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIE+KQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
Subjt:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF

Query:  NWGRKKDE
        NWGRKKDE
Subjt:  NWGRKKDE

XP_022963224.1 uncharacterized protein LOC111463502 isoform X1 [Cucurbita moschata]5.8e-296100Show/hide
Query:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
        MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
Subjt:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE

Query:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
        RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
Subjt:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG

Query:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
        SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
Subjt:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS

Query:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
        TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
Subjt:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE

Query:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
        YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
Subjt:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF

Query:  NWGRKKDE
        NWGRKKDE
Subjt:  NWGRKKDE

XP_022963226.1 uncharacterized protein LOC111463502 isoform X2 [Cucurbita moschata]2.2e-27995.87Show/hide
Query:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
        MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
Subjt:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE

Query:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
        RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
Subjt:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG

Query:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
        SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
Subjt:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS

Query:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
        TPKSEWHCGTSVPVQSRSQRRHPKKHVPVS                     GFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
Subjt:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE

Query:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
        YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
Subjt:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF

Query:  NWGRKKDE
        NWGRKKDE
Subjt:  NWGRKKDE

XP_023518018.1 uncharacterized protein LOC111781580 isoform X1 [Cucurbita pepo subsp. pepo]6.4e-27995.67Show/hide
Query:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
        MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
Subjt:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE

Query:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
        RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
Subjt:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG

Query:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
        SICDTILDNRSIHKFNTNES+IENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
Subjt:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS

Query:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
        TPKSEWHCGTSVPVQSRSQRRHPKKHVPVS                     GFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
Subjt:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE

Query:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
        YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
Subjt:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF

Query:  NWGRKKDE
        NWGRKKDE
Subjt:  NWGRKKDE

TrEMBL top hitse value%identityAlignment
A0A6J1HFK5 uncharacterized protein LOC111463502 isoform X21.1e-27995.87Show/hide
Query:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
        MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
Subjt:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE

Query:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
        RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
Subjt:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG

Query:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
        SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
Subjt:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS

Query:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
        TPKSEWHCGTSVPVQSRSQRRHPKKHVPVS                     GFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
Subjt:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE

Query:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
        YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
Subjt:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF

Query:  NWGRKKDE
        NWGRKKDE
Subjt:  NWGRKKDE

A0A6J1HH52 uncharacterized protein LOC111463502 isoform X37.2e-244100Show/hide
Query:  MIMEAEYNDTHVERGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAG
        MIMEAEYNDTHVERGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAG
Subjt:  MIMEAEYNDTHVERGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAG

Query:  LRTNSSRVTPHEGSICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTST
        LRTNSSRVTPHEGSICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTST
Subjt:  LRTNSSRVTPHEGSICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTST

Query:  EESSHVSHEVQVSTPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWT
        EESSHVSHEVQVSTPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWT
Subjt:  EESSHVSHEVQVSTPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWT

Query:  LTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATT
        LTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATT
Subjt:  LTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATT

Query:  PAPDLTESNALPFNWGRKKDE
        PAPDLTESNALPFNWGRKKDE
Subjt:  PAPDLTESNALPFNWGRKKDE

A0A6J1HJG4 uncharacterized protein LOC111463502 isoform X12.8e-296100Show/hide
Query:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
        MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
Subjt:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE

Query:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
        RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
Subjt:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG

Query:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
        SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
Subjt:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS

Query:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
        TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
Subjt:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE

Query:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
        YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
Subjt:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF

Query:  NWGRKKDE
        NWGRKKDE
Subjt:  NWGRKKDE

A0A6J1I5L7 uncharacterized protein LOC111471232 isoform X11.0e-27494.49Show/hide
Query:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
        MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGL LGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE
Subjt:  MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVE

Query:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
        R FTHDLCAGTNSS VTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG
Subjt:  RGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEG

Query:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
        SICDTILDNR+IHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS
Subjt:  SICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVS

Query:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
        TPK EWHCGTSVPVQSRSQRRHPKKHVPV                      GFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE
Subjt:  TPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAE

Query:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF
        YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAP LTESNALPF
Subjt:  YGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPF

Query:  NWGRKKDE
        NWGRKKDE
Subjt:  NWGRKKDE

A0A6J1ICD6 uncharacterized protein LOC111471232 isoform X24.1e-22393.59Show/hide
Query:  MIMEAEYNDTHVERGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAG
        MIMEAEYNDTHVER FTHDLCAGTNSS VTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAG
Subjt:  MIMEAEYNDTHVERGFTHDLCAGTNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAG

Query:  LRTNSSRVTPHEGSICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTST
        LRTNSSRVTPHEGSICDTILDNR+IHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTST
Subjt:  LRTNSSRVTPHEGSICDTILDNRSIHKFNTNESYIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTST

Query:  EESSHVSHEVQVSTPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWT
        EESSHVSHEVQVSTPK EWHCGTSVPVQSRSQRRHPKKHVPV                      GFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWT
Subjt:  EESSHVSHEVQVSTPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWT

Query:  LTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATT
        LTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATT
Subjt:  LTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATT

Query:  PAPDLTESNALPFNWGRKKDE
        PAP LTESNALPFNWGRKKDE
Subjt:  PAPDLTESNALPFNWGRKKDE

SwissProt top hitse value%identityAlignment
Q9C7B1 Telomere repeat-binding protein 32.4e-1031.67Show/hide
Query:  RRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKER
        +R+ ++ +++TEV  LV  + E GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV      Y Y  + 
Subjt:  RRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKER

Query:  SPK-SVEATTPAPDLTESNA
          K      +  PD+    A
Subjt:  SPK-SVEATTPAPDLTESNA

Q9FFY9 Telomere repeat-binding protein 41.4e-1036.78Show/hide
Query:  RRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV
        +R+ ++ +++TEV  LV  + E GTGRW  +K   F ++ HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV

Q9LL45 Telomere-binding protein 13.5e-0934Show/hide
Query:  NVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV
        NV  S  +     +R+ ++ +T+ EV  LV+ +   GTGRW  +K   F +  HRT +DL+DKW+ L+  + +  Q R+G          P+P+ LL RV
Subjt:  NVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV

Q9M347 Telomere repeat-binding protein 68.2e-1136.78Show/hide
Query:  RRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV
        +R+ ++ +T++EV  LV  +   GTGRW  +K H F H  HRT +DL+DKW+ L+  + ++ + R+G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 21.6e-0935.63Show/hide
Query:  RRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV
        +R+ ++ +++TEV  LV  + + GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  RRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV

Arabidopsis top hitse value%identityAlignment
AT1G17460.1 TRF-like 31.1e-1528.89Show/hide
Query:  RRLRKPTRRFIEEFSDSKSESNKGRKKTPTKD-KYMKGTSTEESSHVSHEVQVSTPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFL---SEDVYSATEC
        +R+RKPTRR+IEE ++ +          P+KD   ++  S+E    V+  V ++  + +      VP  S  +R  P++++   G     S +V +A E 
Subjt:  RRLRKPTRRFIEEFSDSKSESNKGRKKTPTKD-KYMKGTSTEESSHVSHEVQVSTPKSEWHCGTSVPVQSRSQRRHPKKHVPVSGFL---SEDVYSATEC

Query:  KNVYSILQGFLSEDVYSATECKNVYSSAKRC--KKHDR------------------------------------------RKHQKMWTLTEVMQLVDGIA
         N+ ++    LS DV      K    SA RC  K+ D+                                          RK  + WT++EV +LV+G++
Subjt:  KNVYSILQGFLSEDVYSATECKNVYSSAKRC--KKHDR------------------------------------------RKHQKMWTLTEVMQLVDGIA

Query:  EYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELA
        +YG G+WT IK+  F+   HRT +DL+DKWRNL +AS  N +   G+++   H S  +P  ++ +V ELA
Subjt:  EYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELA

AT1G72650.1 TRF-like 61.6e-2229.78Show/hide
Query:  RRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVSTPKSEWHCGT--SVPVQSRSQRRHPKKHVPV-----SGFLSEDVYSA
        +R+RKPTRR+IEE S++  +    +   P+KD+ +   S   S  VS   +V+  +     G+   VP  S  +R  P++++       S +L ED  SA
Subjt:  RRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVSTPKSEWHCGT--SVPVQSRSQRRHPKKHVPV-----SGFLSEDVYSA

Query:  TE---------------------------CKNVYSI-----LQGFLSEDVYSATECKNVYSSAKRCKKHD-----------RRKHQKMWTLTEVMQLVDG
         E                            +N ++      ++  LSE V    E +++ SS     +++           RRKH + WTL+E+ +LV+G
Subjt:  TE---------------------------CKNVYSI-----LQGFLSEDVYSATECKNVYSSAKRCKKHD-----------RRKHQKMWTLTEVMQLVDG

Query:  IAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELA
        +++YG G+W+ IK+HLF+   +RT +DL+DKWRNLL+ S     +   +   + H S  +P  +L RV ELA
Subjt:  IAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELA

AT1G72650.2 TRF-like 61.6e-2229.78Show/hide
Query:  RRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVSTPKSEWHCGT--SVPVQSRSQRRHPKKHVPV-----SGFLSEDVYSA
        +R+RKPTRR+IEE S++  +    +   P+KD+ +   S   S  VS   +V+  +     G+   VP  S  +R  P++++       S +L ED  SA
Subjt:  RRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVSTPKSEWHCGT--SVPVQSRSQRRHPKKHVPV-----SGFLSEDVYSA

Query:  TE---------------------------CKNVYSI-----LQGFLSEDVYSATECKNVYSSAKRCKKHD-----------RRKHQKMWTLTEVMQLVDG
         E                            +N ++      ++  LSE V    E +++ SS     +++           RRKH + WTL+E+ +LV+G
Subjt:  TE---------------------------CKNVYSI-----LQGFLSEDVYSATECKNVYSSAKRCKKHD-----------RRKHQKMWTLTEVMQLVDG

Query:  IAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELA
        +++YG G+W+ IK+HLF+   +RT +DL+DKWRNLL+ S     +   +   + H S  +P  +L RV ELA
Subjt:  IAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELA

AT2G37025.1 TRF-like 83.6e-3044.93Show/hide
Query:  KNVYSILQGFLSEDVYSATECKNVYSSAKRCK-KHDRRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQN
        K +++ ++   S+D  + +E ++  S  K  + K DRRK+Q++WTL EVM LVDGI+ +G G+WT IK H F ++ HR P+D+RDKWRNLL+AS     N
Subjt:  KNVYSILQGFLSEDVYSATECKNVYSSAKRCK-KHDRRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQN

Query:  RKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSP
            E K+   +R +PK +L RV ELA+++PYP  +SP
Subjt:  RKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSP

AT2G37025.2 TRF-like 83.6e-3044.93Show/hide
Query:  KNVYSILQGFLSEDVYSATECKNVYSSAKRCK-KHDRRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQN
        K +++ ++   S+D  + +E ++  S  K  + K DRRK+Q++WTL EVM LVDGI+ +G G+WT IK H F ++ HR P+D+RDKWRNLL+AS     N
Subjt:  KNVYSILQGFLSEDVYSATECKNVYSSAKRCK-KHDRRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNIQN

Query:  RKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSP
            E K+   +R +PK +L RV ELA+++PYP  +SP
Subjt:  RKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAAGAAGCGCATTTCTGCCGGAAGTTCACAAATATGAAATCTCATTGGGATCTTGATCAACTGCTGGGTGATGCCAATCAAGTAGGGGAATTCCATGCAGCAAA
CAATCTGCCGAATACATATGCAGAAGTTGCTGAAAATTCTTTCAGACAGAATAGGGGATTGCATTTGGGAAACTTAAGTTCAGAGTGCAAGTCTCAGGGATCAAGCAGGA
GTGATGCTGATGCTTTTGGGATATCAGAATTGTCAGCAGCAATGATAATGGAGGCTGAATACAATGATACACATGTTGAGAGGGGTTTCACTCATGATTTGTGTGCTGGT
ACCAATAGTAGTCGTGTAACACCACTTGAAGGAAGCATCTGTGATACAATACTTGATAATAGAAATATCCATAAATTGGTAAACTTAAGTTCAGAGTGCAAGTCTCAAGG
ATCAAGCAGGAGTGATACAGATGCTTTTGGAATATCAGAATTGTCAGCAGCAATGATAATGGAGACTGAATTCAATAATACACATGTTGAGAGGGGTTTCACTCATGATT
TGTGTGCTGGTCTGAGGACCAATAGTAGTCGTGTAACACCACACGAAGGCAGCATCTGCGATACAATACTTGATAATAGAAGTATCCATAAATTTAATACTAATGAAAGC
TATATAGAAAATCGTGATTTATCTGATGAAAATGTGAAGGGTGATATTGTGGCAAGCAAACTCGTCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACACGAAGGTT
CATTGAAGAATTTTCAGATTCTAAGTCTGAAAGTAACAAGGGAAGGAAAAAAACTCCTACAAAAGATAAATACATGAAAGGGACATCTACTGAAGAATCTAGTCATGTTA
GCCATGAGGTACAGGTGTCCACGCCTAAAAGTGAATGGCATTGTGGTACTTCTGTTCCAGTGCAGTCTCGATCTCAAAGAAGACATCCAAAGAAGCATGTACCAGTTTCG
GGATTTCTATCAGAAGATGTATATTCTGCAACAGAGTGTAAAAATGTTTATTCTATATTACAGGGATTTCTATCAGAAGATGTATATTCTGCAACAGAGTGTAAAAATGT
TTATTCGTCTGCTAAAAGATGTAAAAAGCATGATAGGAGGAAGCACCAGAAGATGTGGACCCTTACTGAAGTAATGCAATTAGTTGATGGAATTGCTGAATATGGAACTG
GCCGCTGGACTCATATAAAGAGGCACCTATTTGCACATTCTCCTCATCGCACACCTATAGATCTCAGGGACAAATGGCGAAATCTTCTGAGAGCTAGCTGTGTTAACATA
CAGAACAGAAAAGGGATCGAACGGAAGCAATCACATGCCTCACGTCCCCTTCCAAAGTCCCTGCTCCAACGCGTCTATGAACTGGCCAATATTTATCCATATCCAAAGGA
GCGCAGTCCAAAATCAGTCGAAGCAACTACACCTGCCCCGGATCTTACTGAAAGTAATGCTTTGCCATTCAATTGGGGGCGGAAGAAGGACGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCAAGAAGCGCATTTCTGCCGGAAGTTCACAAATATGAAATCTCATTGGGATCTTGATCAACTGCTGGGTGATGCCAATCAAGTAGGGGAATTCCATGCAGCAAA
CAATCTGCCGAATACATATGCAGAAGTTGCTGAAAATTCTTTCAGACAGAATAGGGGATTGCATTTGGGAAACTTAAGTTCAGAGTGCAAGTCTCAGGGATCAAGCAGGA
GTGATGCTGATGCTTTTGGGATATCAGAATTGTCAGCAGCAATGATAATGGAGGCTGAATACAATGATACACATGTTGAGAGGGGTTTCACTCATGATTTGTGTGCTGGT
ACCAATAGTAGTCGTGTAACACCACTTGAAGGAAGCATCTGTGATACAATACTTGATAATAGAAATATCCATAAATTGGTAAACTTAAGTTCAGAGTGCAAGTCTCAAGG
ATCAAGCAGGAGTGATACAGATGCTTTTGGAATATCAGAATTGTCAGCAGCAATGATAATGGAGACTGAATTCAATAATACACATGTTGAGAGGGGTTTCACTCATGATT
TGTGTGCTGGTCTGAGGACCAATAGTAGTCGTGTAACACCACACGAAGGCAGCATCTGCGATACAATACTTGATAATAGAAGTATCCATAAATTTAATACTAATGAAAGC
TATATAGAAAATCGTGATTTATCTGATGAAAATGTGAAGGGTGATATTGTGGCAAGCAAACTCGTCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACACGAAGGTT
CATTGAAGAATTTTCAGATTCTAAGTCTGAAAGTAACAAGGGAAGGAAAAAAACTCCTACAAAAGATAAATACATGAAAGGGACATCTACTGAAGAATCTAGTCATGTTA
GCCATGAGGTACAGGTGTCCACGCCTAAAAGTGAATGGCATTGTGGTACTTCTGTTCCAGTGCAGTCTCGATCTCAAAGAAGACATCCAAAGAAGCATGTACCAGTTTCG
GGATTTCTATCAGAAGATGTATATTCTGCAACAGAGTGTAAAAATGTTTATTCTATATTACAGGGATTTCTATCAGAAGATGTATATTCTGCAACAGAGTGTAAAAATGT
TTATTCGTCTGCTAAAAGATGTAAAAAGCATGATAGGAGGAAGCACCAGAAGATGTGGACCCTTACTGAAGTAATGCAATTAGTTGATGGAATTGCTGAATATGGAACTG
GCCGCTGGACTCATATAAAGAGGCACCTATTTGCACATTCTCCTCATCGCACACCTATAGATCTCAGGGACAAATGGCGAAATCTTCTGAGAGCTAGCTGTGTTAACATA
CAGAACAGAAAAGGGATCGAACGGAAGCAATCACATGCCTCACGTCCCCTTCCAAAGTCCCTGCTCCAACGCGTCTATGAACTGGCCAATATTTATCCATATCCAAAGGA
GCGCAGTCCAAAATCAGTCGAAGCAACTACACCTGCCCCGGATCTTACTGAAAGTAATGCTTTGCCATTCAATTGGGGGCGGAAGAAGGACGAATGACATCTACTTTGGA
AACACCCGAAAGGCCTTTGCAATGAAGTGGAAGTTCAAATAATACTTGTAATTAGATGTAAAAAGATCTCTGCTTTTGTTTTGATCCTATTGTAACAGTGATATGCTCTT
GAAAATGGGAAAAAATCTTCCATTATGATAGCTGCGGAGCTAATTAACTAAAAATTTATAATATCATCTCTTCTTTTCCCTTGTTTTTTTTAAGAATTATCCATAAACAT
TTCCTAATACC
Protein sequenceShow/hide protein sequence
MDQEAHFCRKFTNMKSHWDLDQLLGDANQVGEFHAANNLPNTYAEVAENSFRQNRGLHLGNLSSECKSQGSSRSDADAFGISELSAAMIMEAEYNDTHVERGFTHDLCAG
TNSSRVTPLEGSICDTILDNRNIHKLVNLSSECKSQGSSRSDTDAFGISELSAAMIMETEFNNTHVERGFTHDLCAGLRTNSSRVTPHEGSICDTILDNRSIHKFNTNES
YIENRDLSDENVKGDIVASKLVSCSRERRLRKPTRRFIEEFSDSKSESNKGRKKTPTKDKYMKGTSTEESSHVSHEVQVSTPKSEWHCGTSVPVQSRSQRRHPKKHVPVS
GFLSEDVYSATECKNVYSILQGFLSEDVYSATECKNVYSSAKRCKKHDRRKHQKMWTLTEVMQLVDGIAEYGTGRWTHIKRHLFAHSPHRTPIDLRDKWRNLLRASCVNI
QNRKGIERKQSHASRPLPKSLLQRVYELANIYPYPKERSPKSVEATTPAPDLTESNALPFNWGRKKDE