; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G067410 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G067410
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationCicolChr04:24183283..24198644
RNA-Seq ExpressionCcUC04G067410
SyntenyCcUC04G067410
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008449224.1 PREDICTED: uncharacterized protein LOC103491166 isoform X1 [Cucumis melo]7.7e-23178.61Show/hide
Query:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW
        MDQEVHFCQKFTNMKSHW KVEGPFLPAPLNDSNEVEDLLVE KS+HVLGNCLRVQDFSCDFGYGIQTNGG                             
Subjt:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW

Query:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE
          L   +K+       F D DQLLDDANEVGEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR DTDAFGISELSATMVME EFNNTPVE
Subjt:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE

Query:  RGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPT
        RGLTHELSPGL TKGR VTPLEGNIC TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE NKGRRK P 
Subjt:  RGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPT

Query:  KDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRC
        KDKYL+V STEES HI HEV+M  PR +S CGTSVPVQ +S+RRHP KHVPV                        SGFLSEDESSATECK+VYSSA+RC
Subjt:  KDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRC

Query:  KKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPYP
        KKYDRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN+KG+E KQ+HASRPLPKSLLQRV+ELANIYPYP
Subjt:  KKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPYP

Query:  KERSPKSVKATTPPMHHVESNSLSFNWGRKKYE
        KER PKSVKA TPPM  +ESNSLSFNWGRKKYE
Subjt:  KERSPKSVKATTPPMHHVESNSLSFNWGRKKYE

XP_008449225.1 PREDICTED: uncharacterized protein LOC103491166 isoform X2 [Cucumis melo]5.9e-23178.61Show/hide
Query:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW
        MDQEVHFCQKFTNMKSHW KVEGPFLPAPLNDSNEVEDLLVE KS+HVLGNCLRVQDFSCDFGYGIQTNGG                             
Subjt:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW

Query:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE
          L   +K+       F D DQLLDDANEVGEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR DTDAFGISELSATMVME EFNNTPVE
Subjt:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE

Query:  RGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPT
        RGLTHELSPGL TKGR VTPLEGNIC TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE NKGRRK P 
Subjt:  RGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPT

Query:  KDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRC
        KDKYL+V STEES HI HEV+M  PR +S CGTSVPVQ +S+RRHP KHVPV                        SGFLSEDESSATECK+VYSSA+RC
Subjt:  KDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRC

Query:  KKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPYP
        KKYDRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN+KG+E KQ+HASRPLPKSLLQRV+ELANIYPYP
Subjt:  KKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPYP

Query:  KERSPKSVKATTPPMHHVESNSLSFNWGRKKYE
        KER PKSVKA TPPM  +ESNSLSFNWGRKKYE
Subjt:  KERSPKSVKATTPPMHHVESNSLSFNWGRKKYE

XP_038881566.1 uncharacterized protein LOC120073047 isoform X1 [Benincasa hispida]9.4e-24583.02Show/hide
Query:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW
        MDQEVHFCQKFTNMKSHW +VEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG            GGL S             
Subjt:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW

Query:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE
              +K+       F DLDQLLDDANEVGEFHATNNL +TYAEVAENSFRQNRGLQLGN SS SKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE
Subjt:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE

Query:  RGLTHELSPGLRTKGRPV--TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKP
        RGLTHELSPGLRTKGR V  TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVAN+LASCSRERRLRKPTRRYIEEFADSKSE+NKGRRKP
Subjt:  RGLTHELSPGLRTKGRPV--TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKP

Query:  PTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAK
        PTKDKYL+VTSTEESNHI HEV+M TPR E HCGTSVPVQSRSQRRHPKKHVPV                        SGFLSEDESSATECK+VYSS K
Subjt:  PTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAK

Query:  RCKKYD-RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIY
        RCKKYD RRHQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV+ELANIY
Subjt:  RCKKYD-RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIY

Query:  PYPKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE
        PYPKERSPKSVKATTPPMH +ESNSLSFNWGRKKYE
Subjt:  PYPKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE

XP_038881567.1 uncharacterized protein LOC120073047 isoform X2 [Benincasa hispida]4.1e-23282.59Show/hide
Query:  KVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALWRHLSQRNKKRIPELSHFLD
        +VEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG            GGL S                   +K+       F D
Subjt:  KVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALWRHLSQRNKKRIPELSHFLD

Query:  LDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVERGLTHELSPGLRTKGRPV-
        LDQLLDDANEVGEFHATNNL +TYAEVAENSFRQNRGLQLGN SS SKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVERGLTHELSPGLRTKGR V 
Subjt:  LDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVERGLTHELSPGLRTKGRPV-

Query:  -TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLRVTSTEESNHIS
         TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVAN+LASCSRERRLRKPTRRYIEEFADSKSE+NKGRRKPPTKDKYL+VTSTEESNHI 
Subjt:  -TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLRVTSTEESNHIS

Query:  HEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRCKKYD-RRHQKMWTLTEV
        HEV+M TPR E HCGTSVPVQSRSQRRHPKKHVPV                        SGFLSEDESSATECK+VYSS KRCKKYD RRHQKMW+LTEV
Subjt:  HEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRCKKYD-RRHQKMWTLTEV

Query:  MRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPYPKERSPKSVKATTPPMH
        MRLVDGIAEYGTGRWT IKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV+ELANIYPYPKERSPKSVKATTPPMH
Subjt:  MRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPYPKERSPKSVKATTPPMH

Query:  HVESNSLSFNWGRKKYE
         +ESNSLSFNWGRKKYE
Subjt:  HVESNSLSFNWGRKKYE

XP_038881569.1 uncharacterized protein LOC120073047 isoform X3 [Benincasa hispida]2.6e-24282.84Show/hide
Query:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW
        MDQEVHFCQKFTNMKSHW +VEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNG            GGL S             
Subjt:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW

Query:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE
              +K+       F DLDQLLDDANEVGEFHATNNL N  AEVAENSFRQNRGLQLGN SS SKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE
Subjt:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE

Query:  RGLTHELSPGLRTKGRPV--TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKP
        RGLTHELSPGLRTKGR V  TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVAN+LASCSRERRLRKPTRRYIEEFADSKSE+NKGRRKP
Subjt:  RGLTHELSPGLRTKGRPV--TPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKP

Query:  PTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAK
        PTKDKYL+VTSTEESNHI HEV+M TPR E HCGTSVPVQSRSQRRHPKKHVPV                        SGFLSEDESSATECK+VYSS K
Subjt:  PTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAK

Query:  RCKKYD-RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIY
        RCKKYD RRHQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV+ELANIY
Subjt:  RCKKYD-RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIY

Query:  PYPKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE
        PYPKERSPKSVKATTPPMH +ESNSLSFNWGRKKYE
Subjt:  PYPKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE

TrEMBL top hitse value%identityAlignment
A0A1S3BKX9 uncharacterized protein LOC103491166 isoform X13.7e-23178.61Show/hide
Query:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW
        MDQEVHFCQKFTNMKSHW KVEGPFLPAPLNDSNEVEDLLVE KS+HVLGNCLRVQDFSCDFGYGIQTNGG                             
Subjt:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW

Query:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE
          L   +K+       F D DQLLDDANEVGEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR DTDAFGISELSATMVME EFNNTPVE
Subjt:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE

Query:  RGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPT
        RGLTHELSPGL TKGR VTPLEGNIC TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE NKGRRK P 
Subjt:  RGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPT

Query:  KDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRC
        KDKYL+V STEES HI HEV+M  PR +S CGTSVPVQ +S+RRHP KHVPV                        SGFLSEDESSATECK+VYSSA+RC
Subjt:  KDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRC

Query:  KKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPYP
        KKYDRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN+KG+E KQ+HASRPLPKSLLQRV+ELANIYPYP
Subjt:  KKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPYP

Query:  KERSPKSVKATTPPMHHVESNSLSFNWGRKKYE
        KER PKSVKA TPPM  +ESNSLSFNWGRKKYE
Subjt:  KERSPKSVKATTPPMHHVESNSLSFNWGRKKYE

A0A1S3BLK0 uncharacterized protein LOC103491166 isoform X22.9e-23178.61Show/hide
Query:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW
        MDQEVHFCQKFTNMKSHW KVEGPFLPAPLNDSNEVEDLLVE KS+HVLGNCLRVQDFSCDFGYGIQTNGG                             
Subjt:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW

Query:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE
          L   +K+       F D DQLLDDANEVGEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR DTDAFGISELSATMVME EFNNTPVE
Subjt:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE

Query:  RGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPT
        RGLTHELSPGL TKGR VTPLEGNIC TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE NKGRRK P 
Subjt:  RGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPT

Query:  KDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRC
        KDKYL+V STEES HI HEV+M  PR +S CGTSVPVQ +S+RRHP KHVPV                        SGFLSEDESSATECK+VYSSA+RC
Subjt:  KDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRC

Query:  KKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPYP
        KKYDRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN+KG+E KQ+HASRPLPKSLLQRV+ELANIYPYP
Subjt:  KKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPYP

Query:  KERSPKSVKATTPPMHHVESNSLSFNWGRKKYE
        KER PKSVKA TPPM  +ESNSLSFNWGRKKYE
Subjt:  KERSPKSVKATTPPMHHVESNSLSFNWGRKKYE

A0A6J1CRG2 uncharacterized protein LOC111013581 isoform X21.7e-21574.81Show/hide
Query:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW
        MDQEVHFCQKFTNMKSHW KV+G FLPAPLN+ NEVE LLVEPKS+HVLG+CLR QDFSCDF YGIQTN             GGL S             
Subjt:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW

Query:  RHLSQRNKKRIPELS-HFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNN-TP
              N K+  E    F DLDQLL D NEV EFHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQG SR+DT+AF ISELSA MV E E NN TP
Subjt:  RHLSQRNKKRIPELS-HFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNN-TP

Query:  VERGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKP
        V+RGLTHEL  GLRTKGR  TPL+G+IC TILDN NIHKF+TNE  +ENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEEFADSKSES+KG+RKP
Subjt:  VERGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKP

Query:  PTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAK
        PTKDKY++VTS EESNHI H+V+M TP  ESHCGTS+PVQSRSQRR PKKHVPV                        SGFLSE+ESSATECK VYSSAK
Subjt:  PTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAK

Query:  RCKKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIY
        RCKK+DRR HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDLRDKWRNLLRASCVNIQNR GIERKQSHASRPLPKSLLQRV+ELANIY
Subjt:  RCKKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIY

Query:  PYPKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE
        PYPKERSPKSVKATT PMH +ESNSLSFNWGRKKY+
Subjt:  PYPKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE

A0A6J1CRQ1 uncharacterized protein LOC111013581 isoform X14.1e-21474.67Show/hide
Query:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW
        MDQEVHFCQKFTNMKSHW KV+G FLPAPLN+ NEVE LLVEPKS+HVLG+CLR QDFSCDF YGIQTN             GGL S             
Subjt:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW

Query:  RHLSQRNKKRIPELS-HFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNN-TP
              N K+  E    F DLDQLL D NEV EFHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQG SR+DT+AF ISELSA MV E E NN TP
Subjt:  RHLSQRNKKRIPELS-HFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNN-TP

Query:  VERGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKP
        V+RGLTHEL  GLRTKGR  TPL+G+IC TILDN NIHKF+TNE  +ENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEEFADSKSES+KG+RKP
Subjt:  VERGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKP

Query:  PTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAK
        PTKDKY++VTS EESNHI H+V+M TP  ESHCGTS+PVQSRSQRR PKKHVPV                        SGFLSE+ESSATECK VYSSAK
Subjt:  PTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAK

Query:  RCKKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR-DKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANI
        RCKK+DRR HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDLR DKWRNLLRASCVNIQNR GIERKQSHASRPLPKSLLQRV+ELANI
Subjt:  RCKKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR-DKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANI

Query:  YPYPKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE
        YPYPKERSPKSVKATT PMH +ESNSLSFNWGRKKY+
Subjt:  YPYPKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE

A0A6J1ECM4 uncharacterized protein LOC111433102 isoform X15.8e-20070.22Show/hide
Query:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW
        MDQEVHFCQKFTNM  HW K+EG FLPAPLN+SNEV+  LVEPKSDH LGNCLRVQDFS DFGY IQTNG                              
Subjt:  MDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALW

Query:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE
                                                   AEV ENSFRQNRGLQLG  SSESKSQG SRSDTDAF ISELSATMVME EFNNTPVE
Subjt:  RHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVE

Query:  RGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPT
        R LT EL  GLRT+G   TP EGNICDTILDN NIHKFNTNENY+EN  +SDENVKGDIVA++LASCSRERRLRKPTRRYIEEFADSKSE+NKGRRKPPT
Subjt:  RGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPT

Query:  KDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRC
        KDKYL+VTSTEESNHI H+V+M TP+ ESHCGTSVPVQSRSQRRHP+KHVPV                        SGFLSEDE SATECK+VYSSAK C
Subjt:  KDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKRC

Query:  KKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPY
        KKYDRR HQKMWTLTEVMRLVDGIAEYGTGRWT IK+HLFASSPHRTPIDLRDKWRNLL+ASCVNIQN KG E KQ HASRPLPKSLLQRV+ELANIYPY
Subjt:  KKYDRR-HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELANIYPY

Query:  PKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE
        PKERSPK V+  TPPM+ +ESNSLSFNWGRKKYE
Subjt:  PKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE

SwissProt top hitse value%identityAlignment
Q6R0E3 Telomere repeat-binding protein 51.4e-0926Show/hide
Query:  PPTKDKYLRVTSTEESN-----HISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLP--ANSVAYLPPSQSSGFLSEDESSATEC
        PP   K L   S E+S+     ++ H +    P    H   S  V+S    +    +   ++    L+P+ P  A ++  +PP ++              
Subjt:  PPTKDKYLRVTSTEESN-----HISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLP--ANSVAYLPPSQSSGFLSEDESSATEC

Query:  KSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV
               KR +   RR ++ +++ EV  LV  +   GTGRW  +K   F ++ HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  KSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV

Q9C7B1 Telomere repeat-binding protein 32.6e-1135.14Show/hide
Query:  LSEDESSATECKSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHAS
        L E E  A     +    KR +   RR ++ +++TEV  LV  + E GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q R+G         
Subjt:  LSEDESSATECKSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHAS

Query:  RPLPKSLLQRV
         P+P+ LL RV
Subjt:  RPLPKSLLQRV

Q9FFY9 Telomere repeat-binding protein 46.8e-1236.45Show/hide
Query:  ESSATECKSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLP
        ES A     V    KR +   RR ++ +++TEV  LV  + E GTGRW  +K   F ++ HRT +DL+DKW+ L+  + ++ Q R+G          P+P
Subjt:  ESSATECKSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLP

Query:  KSLLQRV
        + LL RV
Subjt:  KSLLQRV

Q9M347 Telomere repeat-binding protein 62.8e-1034.21Show/hide
Query:  SEDESSATECKSV----YSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQS
        S D + A   KSV       A + +   RR ++ +T++EV  LV  +   GTGRW  +K H F    HRT +DL+DKW+ L+  + ++ + R+G      
Subjt:  SEDESSATECKSV----YSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQS

Query:  HASRPLPKSLLQRV
            P+P+ LL RV
Subjt:  HASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 24.8e-1037.21Show/hide
Query:  RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV
        RR ++ +++TEV  LV  + + GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  RRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRV

Arabidopsis top hitse value%identityAlignment
AT1G17460.1 TRF-like 31.6e-1628.2Show/hide
Query:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGT--SVPVQSRSQRRHPKKHVPV------------SLLL
        +R+RKPTRRYIEE  + +          P+KD             +S E R+   R  S  G+   VP  S  +R  P++++              +   
Subjt:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGT--SVPVQSRSQRRHPKKHVPV------------SLLL

Query:  SGLLPLLP---ANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR-------------------------CKKYDRRHQKMWTLTEVMRLVDGIAEYGT
         G L L P   +N V  +P  +S+    + ES     K +++   +                              R+  + WT++EV +LV+G+++YG 
Subjt:  SGLLPLLP---ANSVAYLPPSQSSGFLSEDESSATECKSVYSSAKR-------------------------CKKYDRRHQKMWTLTEVMRLVDGIAEYGT

Query:  GRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELA
        G+WT IKK  F+   HRT +DL+DKWRNL +AS  N +   G+++   H S  +P  ++ +V ELA
Subjt:  GRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELA

AT1G72650.1 TRF-like 61.0e-2330.74Show/hide
Query:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGT--SVPVQSRSQRRHPKKHVPVSL-LLSGLL--PLLPA
        +R+RKPTRRYIEE +++  +    +   P+KD+ L   S   S  +S   R+   R  S  G+   VP  S  +R  P++++   L   S  L      A
Subjt:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGT--SVPVQSRSQRRHPKKHVPVSL-LLSGLL--PLLPA

Query:  NSVAYLPPSQSSG-------------------FLSEDESSATECKSVYSSAKRCKKYD----------------------RRHQKMWTLTEVMRLVDGIA
         S   L PSQ S                    F + DE++     S        +  D                      R+H + WTL+E+ +LV+G++
Subjt:  NSVAYLPPSQSSG-------------------FLSEDESSATECKSVYSSAKRCKKYD----------------------RRHQKMWTLTEVMRLVDGIA

Query:  EYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELA
        +YG G+W+ IKKHLF+S  +RT +DL+DKWRNLL+ S     +   +   + H S  +P  +L RV ELA
Subjt:  EYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELA

AT1G72650.2 TRF-like 61.0e-2330.74Show/hide
Query:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGT--SVPVQSRSQRRHPKKHVPVSL-LLSGLL--PLLPA
        +R+RKPTRRYIEE +++  +    +   P+KD+ L   S   S  +S   R+   R  S  G+   VP  S  +R  P++++   L   S  L      A
Subjt:  RRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGT--SVPVQSRSQRRHPKKHVPVSL-LLSGLL--PLLPA

Query:  NSVAYLPPSQSSG-------------------FLSEDESSATECKSVYSSAKRCKKYD----------------------RRHQKMWTLTEVMRLVDGIA
         S   L PSQ S                    F + DE++     S        +  D                      R+H + WTL+E+ +LV+G++
Subjt:  NSVAYLPPSQSSG-------------------FLSEDESSATECKSVYSSAKRCKKYD----------------------RRHQKMWTLTEVMRLVDGIA

Query:  EYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELA
        +YG G+W+ IKKHLF+S  +RT +DL+DKWRNLL+ S     +   +   + H S  +P  +L RV ELA
Subjt:  EYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRPLPKSLLQRVHELA

AT2G37025.1 TRF-like 82.6e-2742.07Show/hide
Query:  YLPPSQSSGFLSEDESSATECKSVYSSAK--RCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN
        Y+  S    F S+D+ + +E +   S  K  R K   R++Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLL+AS     N
Subjt:  YLPPSQSSGFLSEDESSATECKSVYSSAK--RCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN

Query:  RKGIERKQSHASRPLPKSLLQRVHELANIYPYPKERSPKSVKATTPPMHHVESNSLSFNWGRKK
            E K+   +R +PK +L RV ELA+++PYP  +SP  V        H  S S S +  +KK
Subjt:  RKGIERKQSHASRPLPKSLLQRVHELANIYPYPKERSPKSVKATTPPMHHVESNSLSFNWGRKK

AT2G37025.2 TRF-like 82.6e-2742.07Show/hide
Query:  YLPPSQSSGFLSEDESSATECKSVYSSAK--RCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN
        Y+  S    F S+D+ + +E +   S  K  R K   R++Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLL+AS     N
Subjt:  YLPPSQSSGFLSEDESSATECKSVYSSAK--RCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQN

Query:  RKGIERKQSHASRPLPKSLLQRVHELANIYPYPKERSPKSVKATTPPMHHVESNSLSFNWGRKK
            E K+   +R +PK +L RV ELA+++PYP  +SP  V        H  S S S +  +KK
Subjt:  RKGIERKQSHASRPLPKSLLQRVHELANIYPYPKERSPKSVKATTPPMHHVESNSLSFNWGRKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAACGGAATGGGAACGACGCGGCAACCTTATCATCTTAAATTAAACCACAACCGTCTCAACGCCGTCAAAGCGGTAACGTTAGGTTACGTGACCAGGTCAAACAG
TGAAACCGCTAACGAAATCCACGTGTATTGGAACCGGATTTCAAATGCATATAAATTGAGACATGTAGGATTGATTGAGCTTAAAAGTAAACAAATTATGGATCAAGAAG
TGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGCAAAAGTGGAGGGACCTTTTCTTCCTGCACCATTAAATGATTCAAATGAAGTTGAGGATTTACTTGTG
GAGCCTAAAAGCGACCATGTTTTAGGAAATTGCTTGAGAGTTCAAGATTTCTCTTGTGACTTTGGCTATGGAATACAAACAAACGGTGGTGAATTCTTAAGATCCATTCC
TTATGCTTTTATTGGTGGCTTGAAATCCTTTAGAGTGGAAAAGCTAAAACCTGTTGAGGCACTTTGGAGGCATTTGTCTCAGAGGAACAAGAAAAGAATTCCAGAACTCT
CACATTTCCTTGATCTTGATCAACTGCTCGATGATGCCAATGAAGTAGGGGAATTCCATGCAACAAACAATCTGCCAAATACATATGCCGAAGTTGCTGAAAATTCTTTC
AGACAGAATAGGGGATTACAATTGGGAAACTTAAGTTCAGAGAGTAAATCTCAGGGACCAAGCAGGAGTGATACTGATGCTTTTGGAATATCAGAACTGTCAGCAACAAT
GGTAATGGAGGATGAATTCAATAATACACCTGTTGAGAGGGGTTTAACTCATGAGTTGTCCCCTGGTCTGAGGACCAAAGGTAGGCCTGTAACACCACTTGAAGGCAACA
TCTGTGATACGATACTTGATAATAGAAATATCCATAAGTTCAATACTAATGAAAACTATATAGAAAATGGCGATTTATCTGATGAAAATGTGAAGGGTGATATTGTGGCA
AACGAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCAAAGTCTGAAAGTAACAAGGGAAGGAGAAAACC
TCCTACAAAAGATAAATACCTGAGAGTGACGTCCACTGAAGAATCCAATCACATTAGTCATGAGGTACGAATGTTCACGCCTAGATGTGAATCACATTGTGGTACATCTG
TTCCAGTGCAGTCTCGATCTCAAAGAAGACATCCAAAGAAGCATGTACCAGTTTCATTGTTACTTTCTGGTCTTCTGCCACTCCTCCCCGCTAATTCAGTTGCTTATCTT
CCCCCTTCCCAATCCTCTGGATTTCTATCTGAAGATGAATCTTCTGCAACTGAGTGTAAAAGTGTTTATTCATCTGCTAAAAGATGTAAAAAGTATGATAGGAGGCACCA
GAAGATGTGGACCCTTACGGAAGTAATGCGATTAGTTGATGGAATTGCTGAATATGGAACTGGCCGCTGGACTCATATAAAGAAGCACCTATTTGCATCTTCTCCTCATC
GCACACCTATAGACCTCAGGGACAAATGGCGAAATCTTCTGAGAGCTAGCTGTGTTAACATACAGAACAGAAAAGGGATCGAACGGAAGCAGTCACATGCCTCGCGTCCA
CTGCCAAAGTCCCTGCTTCAACGTGTTCATGAACTGGCCAATATATATCCATACCCAAAGGAGCGCAGTCCAAAATCAGTCAAAGCAACTACACCTCCTATGCATCATGT
CGAAAGTAACTCATTGTCATTCAATTGGGGGCGGAAGAAGTATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGAACGGAATGGGAACGACGCGGCAACCTTATCATCTTAAATTAAACCACAACCGTCTCAACGCCGTCAAAGCGGTAACGTTAGGTTACGTGACCAGGTCAAACAG
TGAAACCGCTAACGAAATCCACGTGTATTGGAACCGGATTTCAAATGCATATAAATTGAGACATGTAGGATTGATTGAGCTTAAAAGTAAACAAATTATGGATCAAGAAG
TGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGCAAAAGTGGAGGGACCTTTTCTTCCTGCACCATTAAATGATTCAAATGAAGTTGAGGATTTACTTGTG
GAGCCTAAAAGCGACCATGTTTTAGGAAATTGCTTGAGAGTTCAAGATTTCTCTTGTGACTTTGGCTATGGAATACAAACAAACGGTGGTGAATTCTTAAGATCCATTCC
TTATGCTTTTATTGGTGGCTTGAAATCCTTTAGAGTGGAAAAGCTAAAACCTGTTGAGGCACTTTGGAGGCATTTGTCTCAGAGGAACAAGAAAAGAATTCCAGAACTCT
CACATTTCCTTGATCTTGATCAACTGCTCGATGATGCCAATGAAGTAGGGGAATTCCATGCAACAAACAATCTGCCAAATACATATGCCGAAGTTGCTGAAAATTCTTTC
AGACAGAATAGGGGATTACAATTGGGAAACTTAAGTTCAGAGAGTAAATCTCAGGGACCAAGCAGGAGTGATACTGATGCTTTTGGAATATCAGAACTGTCAGCAACAAT
GGTAATGGAGGATGAATTCAATAATACACCTGTTGAGAGGGGTTTAACTCATGAGTTGTCCCCTGGTCTGAGGACCAAAGGTAGGCCTGTAACACCACTTGAAGGCAACA
TCTGTGATACGATACTTGATAATAGAAATATCCATAAGTTCAATACTAATGAAAACTATATAGAAAATGGCGATTTATCTGATGAAAATGTGAAGGGTGATATTGTGGCA
AACGAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCAGATTCAAAGTCTGAAAGTAACAAGGGAAGGAGAAAACC
TCCTACAAAAGATAAATACCTGAGAGTGACGTCCACTGAAGAATCCAATCACATTAGTCATGAGGTACGAATGTTCACGCCTAGATGTGAATCACATTGTGGTACATCTG
TTCCAGTGCAGTCTCGATCTCAAAGAAGACATCCAAAGAAGCATGTACCAGTTTCATTGTTACTTTCTGGTCTTCTGCCACTCCTCCCCGCTAATTCAGTTGCTTATCTT
CCCCCTTCCCAATCCTCTGGATTTCTATCTGAAGATGAATCTTCTGCAACTGAGTGTAAAAGTGTTTATTCATCTGCTAAAAGATGTAAAAAGTATGATAGGAGGCACCA
GAAGATGTGGACCCTTACGGAAGTAATGCGATTAGTTGATGGAATTGCTGAATATGGAACTGGCCGCTGGACTCATATAAAGAAGCACCTATTTGCATCTTCTCCTCATC
GCACACCTATAGACCTCAGGGACAAATGGCGAAATCTTCTGAGAGCTAGCTGTGTTAACATACAGAACAGAAAAGGGATCGAACGGAAGCAGTCACATGCCTCGCGTCCA
CTGCCAAAGTCCCTGCTTCAACGTGTTCATGAACTGGCCAATATATATCCATACCCAAAGGAGCGCAGTCCAAAATCAGTCAAAGCAACTACACCTCCTATGCATCATGT
CGAAAGTAACTCATTGTCATTCAATTGGGGGCGGAAGAAGTATGAATGACATCAACTTTGGAAGCAGCAGAAATTCCTTTGCTGTGAAGTGGAAGTCTAATGAATACTTA
CAATTAGATGTAAAAAGATCTCTGTTTCTGTTTTCACCCTTTGGTAACGGTGATATGCACTTGAAACTGGGAAGAAAATCTTCCATTATAAAAGCCACGGAGCTAATTAA
CTGAAAATTTATACTAGAGGTTCTCAGATTCTGTTGTAGCGAGGAAAGGAAGTCATGTTCAAGTTGGAACTAATTGCTTGGATGGAGGAGAAATCAGCGTATGCCTTGAT
CGAATATGGATACTAATGTTTGCCTCTACTGAAGAA
Protein sequenceShow/hide protein sequence
MSNGMGTTRQPYHLKLNHNRLNAVKAVTLGYVTRSNSETANEIHVYWNRISNAYKLRHVGLIELKSKQIMDQEVHFCQKFTNMKSHWAKVEGPFLPAPLNDSNEVEDLLV
EPKSDHVLGNCLRVQDFSCDFGYGIQTNGGEFLRSIPYAFIGGLKSFRVEKLKPVEALWRHLSQRNKKRIPELSHFLDLDQLLDDANEVGEFHATNNLPNTYAEVAENSF
RQNRGLQLGNLSSESKSQGPSRSDTDAFGISELSATMVMEDEFNNTPVERGLTHELSPGLRTKGRPVTPLEGNICDTILDNRNIHKFNTNENYIENGDLSDENVKGDIVA
NELASCSRERRLRKPTRRYIEEFADSKSESNKGRRKPPTKDKYLRVTSTEESNHISHEVRMFTPRCESHCGTSVPVQSRSQRRHPKKHVPVSLLLSGLLPLLPANSVAYL
PPSQSSGFLSEDESSATECKSVYSSAKRCKKYDRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLRASCVNIQNRKGIERKQSHASRP
LPKSLLQRVHELANIYPYPKERSPKSVKATTPPMHHVESNSLSFNWGRKKYE